
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0279a.6
(1582 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|XP_474090.1| OSJNBa0033G05.13 [Oryza sativa (japonica cultiv... 372 e-101
gb|AAX92941.1| retrotransposon protein, putative, Ty1-copia sub-... 366 3e-99
gb|AAT85194.1| putative polyprotein [Oryza sativa (japonica cult... 362 6e-98
gb|AAP54315.1| putative polyprotein [Oryza sativa (japonica cult... 360 2e-97
gb|AAX92861.1| retrotransposon protein, putative, Ty1-copia sub-... 356 4e-96
ref|XP_470868.1| Putative retroelement pol polyprotein [Oryza sa... 350 2e-94
ref|XP_469192.1| putative polyprotein [Oryza sativa (japonica cu... 345 9e-93
ref|XP_476137.1| putative polyprotein [Oryza sativa (japonica cu... 336 4e-90
gb|AAP53029.1| putative retrotransposon-related protein [Oryza s... 335 7e-90
ref|XP_475489.1| putative polyprotein [Oryza sativa (japonica cu... 324 2e-86
emb|CAB75481.1| copia-like polyprotein [Arabidopsis thaliana] gi... 312 5e-83
gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsi... 309 4e-82
gb|AAD19773.1| putative retroelement pol polyprotein [Arabidopsi... 308 7e-82
emb|CAA32025.1| unnamed protein product [Nicotiana tabacum] gi|1... 305 6e-81
dbj|BAD34493.1| Gag-Pol [Ipomoea batatas] 304 1e-80
dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsi... 297 2e-78
gb|AAK29467.1| polyprotein-like [Lycopersicon chilense] 293 2e-77
emb|CAB78169.1| putative retrotransposon [Arabidopsis thaliana] ... 291 1e-76
ref|XP_475663.1| putative polyprotein [Oryza sativa (japonica cu... 288 8e-76
gb|AAD23690.1| putative retroelement pol polyprotein [Arabidopsi... 281 2e-73
>ref|XP_474090.1| OSJNBa0033G05.13 [Oryza sativa (japonica cultivar-group)]
gi|38344889|emb|CAD41912.2| OSJNBa0033G05.13 [Oryza
sativa (japonica cultivar-group)]
Length = 1181
Score = 372 bits (956), Expect = e-101
Identities = 202/473 (42%), Positives = 289/473 (60%), Gaps = 27/473 (5%)
Query: 88 MCMSKTLMNKLFAKQRLYSLKMQEGGDLQAHVYAFNNILADLTRLGVTVDDEDKTIILLC 147
+CM+K L +K+ KQ+L+ K+Q+ G + H+ AF I+ADL + V D++D +ILLC
Sbjct: 90 ICMTKDLTSKMHLKQKLFLHKLQDDGSVMDHLSAFKEIVADLESMEVKYDEKDLALILLC 149
Query: 148 SLPGSYDHLVTTLTYGKDSITLDSISSTLLQHAQRRRSV-EEGGGSSGEGLFVKGGQDRG 206
SLP SY + T+ Y +D++TL + L + ++ V EG S EGL V+G Q
Sbjct: 150 SLPSSYANFRDTILYSRDTLTLKEVYDALHAKEKMKKMVPSEGSNSQAEGLVVRGSQQEK 209
Query: 207 RGKGKAVD---SGKKKRSKSKDRKTAECYSCKQIGH-----WK-RDCPNRSGK-----SG 252
K+ D S + RSKS+ R + C CK+ GH WK +D R+GK
Sbjct: 210 NTNNKSRDKSSSSYRGRSKSRGRYKS-CKYCKRDGHDISKCWKLQDKDKRTGKYIPKGKK 268
Query: 253 NSSSAANVVQSDGSCSEEDLLCVSYVKC---TDAWVLDSGCSCHMTPHREWFNSFKSCDF 309
A VV + S +E L V+Y C +D W+LD+ C+ HM P+R+WF +++
Sbjct: 269 EEEGKAAVVTDEKSDAE---LLVAYAGCAQTSDQWILDTACTYHMCPNRDWFATYEVVQG 325
Query: 310 GYVYLGDDKPCIIKGM*QVKIALDDGGVRTLSQVRYVPEVTKNLISLGTLHENGYSFKSE 369
G V +GDD PC + G+ V+I + DG +RTLS VR++P + ++LISL TL GY +
Sbjct: 326 GTVLMGDDTPCEVAGIGTVQIKMFDGCIRTLSDVRHIPNLKRSLISLCTLDRKGYKYSGG 385
Query: 370 ENRDILRVSKGAMTVMRAKRTAGNIYKLLGSTVMGDVASVE---TDDDATKLWHMRLGHL 426
+ IL+V+KG++ VM+A + N+Y L G+T++G+VA+V ++ DAT LWHMRLGH+
Sbjct: 386 DG--ILKVTKGSLVVMKASIKSANLYHLQGTTILGNVATVSDSLSNSDATNLWHMRLGHM 443
Query: 427 SERGMMELYKRNLLKGVRSCTIGLCKYCVLGKQCRVRFKTGQHKTKGILDYVHSDVWGPT 486
SE G+ EL KR LL G + C++C+ GK RV+F T H T+GILDYVHSD+WGP
Sbjct: 444 SEIGLAELSKRGLLDGQSISKLKFCEHCIFGKHKRVKFNTSTHTTEGILDYVHSDLWGPA 503
Query: 487 KEPSVGGFRYFVTFTDDFSRKVWVYFLKYKSEVFAKFKLWKAEVENQTGRKIK 539
++ S GG RY +T DD+SRKVW YFLK+K + F FK WK VE QT RK+K
Sbjct: 504 RKTSFGGARYMMTIVDDYSRKVWPYFLKHKYQAFNVFKEWKTMVERQTERKVK 556
Score = 244 bits (623), Expect = 2e-62
Identities = 119/207 (57%), Positives = 156/207 (74%), Gaps = 3/207 (1%)
Query: 1300 AKKILGMEIHREKGAKKLWLSQNSYVEGVLSRFDMSKANPVSTPLANHFKLSLEQCPKTA 1359
AKKILGMEI RE+ + KL+LSQ Y+E V RF+M A PVSTPLA HF+LS + CP++
Sbjct: 895 AKKILGMEITRERHSDKLYLSQKGYIEKVFRRFNMHDAKPVSTPLAAHFRLSSDLCPQSD 954
Query: 1360 SEIEGMSKISHASAVGCLMYVMVCTRPDLAQAASQVCKFMSKPGKQHWEAVKWILRYLKG 1419
IE MS++ ++SAVG LMY M+C+RPDL+ A S V ++M+ PGK+HW+ V+WI RYL G
Sbjct: 955 YNIEYMSRVPYSSAVGSLMYAMICSRPDLSHALSVVSRYMANPGKEHWKDVQWIFRYLHG 1014
Query: 1420 TADRGIMFSREQGVVPLVVGYVDSDYAGDLDDRRSTTGYVFTLAGGPICWKSSVQSIVAM 1479
T+ + F R + +VGYVDSD+AGDLD RRS TGYVFT+ G + WK+S+Q+ VA+
Sbjct: 1015 TSSACLQFGRSRDG---LVGYVDSDFAGDLDRRRSLTGYVFTIGGCAVSWKASLQATVAL 1071
Query: 1480 STTEAEYMAVAEAVKEALWLTGLVKKL 1506
STTEAEYMA++EA KEA+WL GL +L
Sbjct: 1072 STTEAEYMAISEACKEAIWLRGLYTEL 1098
>gb|AAX92941.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza
sativa (japonica cultivar-group)]
Length = 2340
Score = 366 bits (940), Expect = 3e-99
Identities = 201/473 (42%), Positives = 287/473 (60%), Gaps = 27/473 (5%)
Query: 88 MCMSKTLMNKLFAKQRLYSLKMQEGGDLQAHVYAFNNILADLTRLGVTVDDEDKTIILLC 147
+CM+K L +K+ KQ L+ K+Q+ G + H+ AF I+ADL + V D+ED +ILLC
Sbjct: 297 ICMTKDLTSKMHLKQTLFLHKLQDDGSVMDHLSAFKEIIADLESMEVKYDEEDLGLILLC 356
Query: 148 SLPGSYDHLVTTLTYGKDSITLDSISSTLLQHAQRRRSV-EEGGGSSGEGLFVKGGQDRG 206
SLP SY + T+ Y +D++TL + L + ++ V EG S EGL V G Q
Sbjct: 357 SLPSSYANFRDTILYSRDTLTLKEVYDALHVKEKMKKMVPSEGSNSQAEGLIVWGRQQEK 416
Query: 207 RGKGKAVD---SGKKKRSKSKDRKTAECYSCKQIGH-----WK-RDCPNRSGK-----SG 252
K ++ D S + RSKS+ R + C CK+ GH WK D R+GK
Sbjct: 417 NTKNQSRDKSSSSYRGRSKSRGRYKS-CKYCKRDGHDIFECWKLHDKDKRTGKYVPKGKK 475
Query: 253 NSSSAANVVQSDGSCSEEDLLCVSYVKC---TDAWVLDSGCSCHMTPHREWFNSFKSCDF 309
A VV + S +E L V+Y C +D W+L++ C HM P+R+WF ++++
Sbjct: 476 EEEGKAAVVTDEKSDAE---LLVAYAGCAQTSDQWILNTACIYHMCPNRDWFATYEAVQV 532
Query: 310 GYVYLGDDKPCIIKGM*QVKIALDDGGVRTLSQVRYVPEVTKNLISLGTLHENGYSFKSE 369
G V +GDD PC + G+ V+I + DG +RTLS VR++P + ++LISL TL GY +
Sbjct: 533 GTVLMGDDTPCEVAGIGTVQIKMFDGCIRTLSDVRHIPNLKRSLISLCTLDRKGYKYSGG 592
Query: 370 ENRDILRVSKGAMTVMRAKRTAGNIYKLLGSTVMGDVASVE---TDDDATKLWHMRLGHL 426
+ IL+V+KG++ VM+A + N+Y L G+T++G+VA+V ++ DAT LWHMRLGH+
Sbjct: 593 DG--ILKVTKGSLVVMKADIKSANLYHLRGTTILGNVAAVSDSLSNSDATNLWHMRLGHM 650
Query: 427 SERGMMELYKRNLLKGVRSCTIGLCKYCVLGKQCRVRFKTGQHKTKGILDYVHSDVWGPT 486
+E G+ EL KR LL G + C++C+ GK RV+F T H T+GILDYVHSD+WGP
Sbjct: 651 TEIGLAELSKRGLLDGQSIGKLKFCEHCIFGKHKRVKFNTSTHTTEGILDYVHSDLWGPA 710
Query: 487 KEPSVGGFRYFVTFTDDFSRKVWVYFLKYKSEVFAKFKLWKAEVENQTGRKIK 539
++ S GG RY +T DD+SRKVW YFLK+K + F FK WK VE QT RK+K
Sbjct: 711 RKTSFGGTRYMMTIVDDYSRKVWPYFLKHKYQAFDVFKEWKTMVERQTERKVK 763
Score = 244 bits (622), Expect = 2e-62
Identities = 119/204 (58%), Positives = 157/204 (76%), Gaps = 3/204 (1%)
Query: 1299 AAKKILGMEIHREKGAKKLWLSQNSYVEGVLSRFDMSKANPVSTPLANHFKLSLEQCPKT 1358
AAKKILGMEI R++ + KL+LSQ Y+E VL RF+M A PVSTPLA HF+LS + CP++
Sbjct: 1252 AAKKILGMEITRKRHSFKLYLSQKGYIEKVLRRFNMHDAKPVSTPLAAHFRLSSDLCPQS 1311
Query: 1359 ASEIEGMSKISHASAVGCLMYVMVCTRPDLAQAASQVCKFMSKPGKQHWEAVKWILRYLK 1418
+IE MS++ ++SAVG LMY MVC+RPDL+ A S V ++M+ PGK+HW+AV+WI RYL+
Sbjct: 1312 DYDIEYMSRVPYSSAVGSLMYAMVCSRPDLSHALSVVSRYMANPGKEHWKAVQWIFRYLR 1371
Query: 1419 GTADRGIMFSREQGVVPLVVGYVDSDYAGDLDDRRSTTGYVFTLAGGPICWKSSVQSIVA 1478
GT+ + F R + +VGYVDSD+AGDLD RS GYVFT+ G + WK+S+Q+ VA
Sbjct: 1372 GTSSACLQFGRSRDG---LVGYVDSDFAGDLDRGRSLAGYVFTIGGCAVSWKASLQATVA 1428
Query: 1479 MSTTEAEYMAVAEAVKEALWLTGL 1502
+STTEAEYMA++EA KEA+WL GL
Sbjct: 1429 LSTTEAEYMAISEACKEAIWLRGL 1452
>gb|AAT85194.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
Length = 1241
Score = 362 bits (929), Expect = 6e-98
Identities = 201/472 (42%), Positives = 286/472 (60%), Gaps = 29/472 (6%)
Query: 90 MSKTLMNKLFAKQRLYSLKMQEGGDLQAHVYAFNNILADLTRLGVTVDDEDKTIILLCSL 149
M+K L +K+ KQ+L+ K+Q+ G + H+ AF I+ADL + V D+ED +ILLCSL
Sbjct: 1 MTKDLTSKMHLKQKLFLHKLQDDGSVMDHLSAFKEIVADLESMEVKYDEEDLGLILLCSL 60
Query: 150 PGSYDHLVTTLTYGKDSITLDSISSTLLQHAQRRRSV-EEGGGSSGEGLFVKGGQDRGRG 208
P SY + T+ Y +D++TL + L + ++ V EG S EGL V+G Q
Sbjct: 61 PSSYANFRDTILYSRDTLTLKEVYDALHAKEKMKKMVPSEGSNSQAEGLVVRGRQQEKNT 120
Query: 209 KGKAVDSGK---KKRSKSKDRKTAECYSCKQIGH-----WK-RDCPNRS------GKSGN 253
K+ D + RSKS+ R + C CK+ GH WK +D R+ GK
Sbjct: 121 NNKSRDKSSSIYRGRSKSRGRYKS-CKYCKRDGHDISECWKLQDKDKRTRKYIPKGKKEE 179
Query: 254 SSSAANVVQSDGSCSEEDLLCVSYVKC---TDAWVLDSGCSCHMTPHREWFNSFKSCDFG 310
AA VV + S +E L V+Y C +D W+LD+ C+ HM P+R+WF ++++ G
Sbjct: 180 EGKAA-VVTDEKSDAE---LLVAYAGCAQTSDQWILDTACTYHMCPNRDWFATYEAVQGG 235
Query: 311 YVYLGDDKPCIIKGM*QVKIALDDGGVRTLSQVRYVPEVTKNLISLGTLHENGYSFKSEE 370
V +GDD PC + G+ V+I + DG +RTL VR++P + ++LISL TL GY + +
Sbjct: 236 TVLMGDDTPCEVAGIGTVQIKMFDGCIRTLLDVRHIPNLKRSLISLCTLDRKGYKYSGGD 295
Query: 371 NRDILRVSKGAMTVMRAKRTAGNIYKLLGSTVMGDVASVE---TDDDATKLWHMRLGHLS 427
IL+V+KG++ VM+A N+Y L G+T++G+VA+V ++ DAT LWHMRLGH+S
Sbjct: 296 G--ILKVTKGSLVVMKADIKYANLYHLRGTTILGNVAAVSDSLSNSDATNLWHMRLGHMS 353
Query: 428 ERGMMELYKRNLLKGVRSCTIGLCKYCVLGKQCRVRFKTGQHKTKGILDYVHSDVWGPTK 487
E G+ EL KR LL G + C++C+ GK RV+F T H T+GILDYVHSD+WGP +
Sbjct: 354 EIGLAELSKRGLLDGQSIGKLKFCEHCIFGKHKRVKFNTSTHTTEGILDYVHSDLWGPAR 413
Query: 488 EPSVGGFRYFVTFTDDFSRKVWVYFLKYKSEVFAKFKLWKAEVENQTGRKIK 539
+ S GG RY +T DD+SRKVW YFLK+K + F FK WK VE QT RK+K
Sbjct: 414 KTSFGGARYMMTIVDDYSRKVWPYFLKHKYQAFDVFKEWKTMVERQTERKVK 465
Score = 241 bits (615), Expect = 1e-61
Identities = 120/208 (57%), Positives = 157/208 (74%), Gaps = 3/208 (1%)
Query: 1299 AAKKILGMEIHREKGAKKLWLSQNSYVEGVLSRFDMSKANPVSTPLANHFKLSLEQCPKT 1358
AAKKILGMEI RE+ + KL+LSQ Y+E VL RF+M A VST LA HF+LS + CP++
Sbjct: 954 AAKKILGMEITRERHSGKLYLSQKCYIEKVLHRFNMHDAKLVSTLLAAHFRLSSDLCPQS 1013
Query: 1359 ASEIEGMSKISHASAVGCLMYVMVCTRPDLAQAASQVCKFMSKPGKQHWEAVKWILRYLK 1418
A +IE MS++ ++SAV LMY MVC+RPDL+ A S V ++M+ PGK+HW+AV+WI RYL+
Sbjct: 1014 AYDIEYMSRVPYSSAVSSLMYAMVCSRPDLSHALSVVSRYMANPGKEHWKAVQWIFRYLR 1073
Query: 1419 GTADRGIMFSREQGVVPLVVGYVDSDYAGDLDDRRSTTGYVFTLAGGPICWKSSVQSIVA 1478
GT+ + F R +VGYVDSD+AGDLD RRS TGYVFT+ G + WK+S+Q+ VA
Sbjct: 1074 GTSSACLQFGRSSDG---LVGYVDSDFAGDLDRRRSLTGYVFTVGGCAVSWKASLQATVA 1130
Query: 1479 MSTTEAEYMAVAEAVKEALWLTGLVKKL 1506
+STTEAEYMA++EA KE +WL GL +L
Sbjct: 1131 LSTTEAEYMAISEACKEVIWLRGLYTEL 1158
>gb|AAP54315.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|37535452|ref|NP_922028.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
gi|22094359|gb|AAM91886.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1280
Score = 360 bits (925), Expect = 2e-97
Identities = 199/473 (42%), Positives = 286/473 (60%), Gaps = 27/473 (5%)
Query: 88 MCMSKTLMNKLFAKQRLYSLKMQEGGDLQAHVYAFNNILADLTRLGVTVDDEDKTIILLC 147
+CM+K L +K+ KQ+L+ K+Q+ G + H+ F I+ADL + V D+ED +ILLC
Sbjct: 125 ICMTKDLTSKMHLKQKLFLHKLQDDGSVMDHLSTFKEIVADLESIEVKYDEEDLGLILLC 184
Query: 148 SLPGSYDHLVTTLTYGKDSITLDSISSTLLQHAQRRRSV-EEGGGSSGEGLFVKGGQDRG 206
SLP SY + T+ Y D++ L + L + ++ V EG S EGL V+G Q
Sbjct: 185 SLPSSYANFRDTILYSHDTLILKEVYDALHAKEKMKKMVPSEGSNSQAEGLVVRGRQQEK 244
Query: 207 RGKGKAVD---SGKKKRSKSKDRKTAECYSCKQIGH-----WK-RDCPNRSGK-----SG 252
K ++ D S + RSKS+ R + C CK+ GH WK +D R+GK
Sbjct: 245 NTKNQSRDKSSSSYRGRSKSRGRYKS-CKYCKRDGHDISECWKLQDKDKRTGKYIPKGKK 303
Query: 253 NSSSAANVVQSDGSCSEEDLLCVSYVKC---TDAWVLDSGCSCHMTPHREWFNSFKSCDF 309
A VV + S +E L V+Y C +D W+LD+ + HM P+R+WF ++++
Sbjct: 304 EEEGKAAVVTDEKSDTE---LLVAYAGCAQTSDQWILDTAWTYHMCPNRDWFATYEALQG 360
Query: 310 GYVYLGDDKPCIIKGM*QVKIALDDGGVRTLSQVRYVPEVTKNLISLGTLHENGYSFKSE 369
G V +GDD PC + G+ V+I + DG +RTLS VR++P + ++LISL TL GY +
Sbjct: 361 GTVLMGDDTPCEVAGIGTVQIKMFDGYIRTLSDVRHIPNLKRSLISLCTLDRKGYKYSGG 420
Query: 370 ENRDILRVSKGAMTVMRAKRTAGNIYKLLGSTVMGDVASVE---TDDDATKLWHMRLGHL 426
+ IL+V+KG++ VM+A + N+Y L G+T++G+VA+V ++ DAT LWHMRLGH+
Sbjct: 421 DG--ILKVTKGSLVVMKADIKSANLYHLRGTTILGNVAAVSDSLSNSDATNLWHMRLGHM 478
Query: 427 SERGMMELYKRNLLKGVRSCTIGLCKYCVLGKQCRVRFKTGQHKTKGILDYVHSDVWGPT 486
SE G+ EL KR LL G + C++C+ GK RV+F T H T+GILDYVHSD+WGP
Sbjct: 479 SEIGLAELSKRELLDGQSIGKLKFCEHCIFGKHKRVKFNTSTHTTEGILDYVHSDLWGPA 538
Query: 487 KEPSVGGFRYFVTFTDDFSRKVWVYFLKYKSEVFAKFKLWKAEVENQTGRKIK 539
+ S GG RY +T DD+SRKVW YFLK+K + F FK WK VE QT +K+K
Sbjct: 539 CKTSFGGARYMMTIVDDYSRKVWPYFLKHKYQAFDVFKEWKTMVERQTEKKVK 591
Score = 105 bits (263), Expect = 9e-21
Identities = 67/163 (41%), Positives = 95/163 (58%), Gaps = 24/163 (14%)
Query: 1367 KISHASAVGCLMYV--MVCTRPDLAQAASQVCKFMSKPGKQHWEAVKWILRYLKGTADR- 1423
K+ SA+ L+YV M+ D ++ A + S+ G + A K IL ++ T +R
Sbjct: 1036 KVVDGSAIYLLLYVDDMLIAAKDKSEIAKLKAQLSSEFGMKDLGAAKKILG-MEITRERH 1094
Query: 1424 -GIMFSREQGVVPLV-------------------VGYVDSDYAGDLDDRRSTTGYVFTLA 1463
G ++ ++G + V VGYVDSD+AGDLD RRS TGYVFT+
Sbjct: 1095 SGKLYLSQKGYIKKVLRRFNMHDVKPFGRSRDGFVGYVDSDFAGDLDRRRSLTGYVFTIG 1154
Query: 1464 GGPICWKSSVQSIVAMSTTEAEYMAVAEAVKEALWLTGLVKKL 1506
G + WK+S+Q+ VA+STTEAEYMA++EA KEA+WL GL +L
Sbjct: 1155 GCDVSWKASLQATVALSTTEAEYMAISEACKEAIWLRGLYTEL 1197
Score = 48.9 bits (115), Expect = 0.001
Identities = 24/41 (58%), Positives = 30/41 (72%)
Query: 1299 AAKKILGMEIHREKGAKKLWLSQNSYVEGVLSRFDMSKANP 1339
AAKKILGMEI RE+ + KL+LSQ Y++ VL RF+M P
Sbjct: 1080 AAKKILGMEITRERHSGKLYLSQKGYIKKVLRRFNMHDVKP 1120
>gb|AAX92861.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza
sativa (japonica cultivar-group)]
Length = 1373
Score = 356 bits (913), Expect = 4e-96
Identities = 193/464 (41%), Positives = 265/464 (56%), Gaps = 15/464 (3%)
Query: 88 MCMSKTLMNKLFAKQRLYSLKMQEGGDLQAHVYAFNNILADLTRLGVTVDDEDKTIILLC 147
+CMSK L +K+ K +L++LKM+E + H+ F I+ADL + V DDED ++LLC
Sbjct: 88 ICMSKDLTSKMQMKMKLFTLKMKEEDSVITHMAEFKKIVADLVSMEVKYDDEDLGLLLLC 147
Query: 148 SLPGSYDHLVTTLTYGKDSITLDSISSTLLQHAQRRRSVEEGGGSS--GEGLFVKGGQDR 205
SLP SY + T+ +D +TL + L + + V+ G SS GE L V+G +
Sbjct: 148 SLPNSYANFRDTILLSRDELTLKEVYDALQNKEKMKIMVQNDGSSSSKGEALHVRGRTEN 207
Query: 206 GRGKGKAVDSGKKKRSKSKDRKTAECYSCKQIGHWKRDCPN-----RSGKSGNSSSAANV 260
K D + +SK K C CK H +C R K S A+
Sbjct: 208 RTSNEKNYDRRGRSKSKPPGNKKF-CVYCKLKNHNIDECKKVQAKERKNKKDGKVSVASA 266
Query: 261 VQSDGSCSEEDLLCVSYVKCTDAWVLDSGCSCHMTPHREWFNSFKSCDFG-YVYLGDDKP 319
SD + ++ V D W+LDS CS H+ R WF+S+K G V +GDD P
Sbjct: 267 AASDDDSGDCLVVFAGCVAGHDEWILDSACSFHICTKRNWFSSYKPVQKGDVVRMGDDNP 326
Query: 320 CIIKGM*QVKIALDDGGVRTLSQVRYVPEVTKNLISLGTLHENGYSFKSEENRDILRVSK 379
C I G+ V+I DDG RTL VRY+P +++NLISL TL GY + + +L+VSK
Sbjct: 327 CAIVGIGSVQIKTDDGMTRTLKNVRYIPGMSRNLISLSTLDAEGYKYSGSDG--VLKVSK 384
Query: 380 GAMTVMRAKRTAGNIYKLLGSTVMGD--VASVETDDDATK--LWHMRLGHLSERGMMELY 435
G++ ++ + +Y L G T+ G A+ T+D+ +K LWHMRLGH+S GM EL
Sbjct: 385 GSLVCLKGDVNSAKLYVLRGCTLTGSDSAAAAITNDEPSKTNLWHMRLGHMSHLGMTELM 444
Query: 436 KRNLLKGVRSCTIGLCKYCVLGKQCRVRFKTGQHKTKGILDYVHSDVWGPTKEPSVGGFR 495
KRNLLKG S I C++C+ GK RV+F T H TKG LDYVH+D+WGP+K+PS+GG R
Sbjct: 445 KRNLLKGCTSSKIKFCEHCIFGKHKRVQFNTSVHTTKGTLDYVHADLWGPSKKPSLGGAR 504
Query: 496 YFVTFTDDFSRKVWVYFLKYKSEVFAKFKLWKAEVENQTGRKIK 539
Y +T DD+SRKVW YFLK+K + F FK WK +E QT RK+K
Sbjct: 505 YMLTIIDDYSRKVWPYFLKHKDDTFTAFKNWKVMIERQTERKVK 548
Score = 241 bits (615), Expect = 1e-61
Identities = 121/213 (56%), Positives = 160/213 (74%), Gaps = 6/213 (2%)
Query: 1299 AAKKILGMEIHREKGAKKLWLSQNSYVEGVLSRFDMSKANPVSTPLANHFKLSLEQCPKT 1358
+AKKILGMEI R++ + L+LSQ++Y++ VL RF+M A VSTP+A HFKLS QCP
Sbjct: 1046 SAKKILGMEISRDRKSGLLFLSQHNYIKKVLQRFNMQNAKAVSTPIAPHFKLSAAQCPSI 1105
Query: 1359 ASEIEGMSKISHASAVGCLMYVMVCTRPDLAQAASQVCKFMSKPGKQHWEAVKWILRYLK 1418
+EIE MS++ ++SAVG LMY MVC+RPDL+ A S V ++MS PGK+HW AV+WI RYL+
Sbjct: 1106 DAEIEYMSRVPYSSAVGSLMYAMVCSRPDLSYAMSLVSRYMSNPGKEHWRAVQWIFRYLR 1165
Query: 1419 GTADRGIMFSR-EQGVVPLVVGYVDSDYAGDLDDRRSTTGYVFTLAGGPICWKSSVQSIV 1477
GT + F R ++G ++GYVDSDYA DLD RRS TGYVFT+ + W++++QS+V
Sbjct: 1166 GTTYSCLKFGRTDKG----LIGYVDSDYAADLDRRRSLTGYVFTIGSCAVSWRATLQSVV 1221
Query: 1478 AMSTTEAEYMAVAEAVKEALWLTGLVKKL-GVE 1509
A+STTEAEYMA+ EA KE +WL GL +L GVE
Sbjct: 1222 ALSTTEAEYMAICEACKELIWLKGLYAELSGVE 1254
>ref|XP_470868.1| Putative retroelement pol polyprotein [Oryza sativa]
gi|14029020|gb|AAK52561.1| Putative retroelement pol
polyprotein [Oryza sativa]
Length = 1326
Score = 350 bits (898), Expect = 2e-94
Identities = 189/460 (41%), Positives = 264/460 (57%), Gaps = 10/460 (2%)
Query: 88 MCMSKTLMNKLFAKQRLYSLKMQEGGDLQAHVYAFNNILADLTRLGVTVDDEDKTIILLC 147
+CMSK L +K+ K +L+S K+QE G + H+ F I+ DL + V DDED ++LLC
Sbjct: 92 ICMSKDLTSKMHIKMKLFSHKLQESGSVLNHISVFKEIVVDLVSIEVQFDDEDLGLLLLC 151
Query: 148 SLPGSYDHLVTTLTYGKDSITLDSISSTLLQHAQRRRSVEEGGGSS-GEGLFVKGGQDRG 206
SLP SY + T+ +D +TL + L + + V+ SS GE L V+G ++
Sbjct: 152 SLPSSYANFRDTILLSRDELTLAEVYEALQNREKMKGMVQSDASSSKGEALQVRGRSEQR 211
Query: 207 RGKGKAVDSGKKKRSKSKDRKTAECYSCKQIGHWKRDCPNRSGKSGNSSSA-ANVVQSDG 265
+ + R +SK R C CK+ H+ +C K S A+VV S
Sbjct: 212 TYNDSSDRDKSQSRGRSKSRGKKFCKYCKKKNHFIEECWKLQNKEKRKSDGKASVVTSAE 271
Query: 266 SCSEEDLLCV--SYVKCTDAWVLDSGCSCHMTPHREWFNSFKSCDFG-YVYLGDDKPCII 322
+ D L V V D W+LD+ CS H+ +R+WF+S+KS G V +GDD P I
Sbjct: 272 NSDSGDCLVVFAGCVASHDEWILDTACSFHICINRDWFSSYKSVQNGDVVRMGDDNPREI 331
Query: 323 KGM*QVKIALDDGGVRTLSQVRYVPEVTKNLISLGTLHENGYSFKSEENRDILRVSKGAM 382
G+ V+I DG RTL VR++P + +NLISL TL GY + S +++VSKG++
Sbjct: 332 VGIGSVQIKTHDGMTRTLKDVRHIPGMARNLISLSTLDAEGYKYSSSGG--VVKVSKGSL 389
Query: 383 TVMRAKRTAGNIYKLLGSTVMGDVASVETDDDA---TKLWHMRLGHLSERGMMELYKRNL 439
M + N+Y L GST+ G V + D T LWHMRLGH+SE GM EL KRNL
Sbjct: 390 VYMIGDMNSANLYVLRGSTLHGSVTAAAVSKDEPIKTNLWHMRLGHMSELGMAELMKRNL 449
Query: 440 LKGVRSCTIGLCKYCVLGKQCRVRFKTGQHKTKGILDYVHSDVWGPTKEPSVGGFRYFVT 499
L G + C++CV GK RV+F T H+TKGILDYVH+D+WGP+++ +GG RY +T
Sbjct: 450 LDGCTQGKMKFCEHCVFGKHKRVKFNTSVHRTKGILDYVHTDLWGPSRKAYLGGARYMLT 509
Query: 500 FTDDFSRKVWVYFLKYKSEVFAKFKLWKAEVENQTGRKIK 539
DD+SRKVW YFLK+K + FA FK WK +E QT +++K
Sbjct: 510 IIDDYSRKVWPYFLKHKDDTFAAFKEWKVRIERQTEKEVK 549
Score = 165 bits (418), Expect = 1e-38
Identities = 91/208 (43%), Positives = 122/208 (57%), Gaps = 45/208 (21%)
Query: 1299 AAKKILGMEIHREKGAKKLWLSQNSYVEGVLSRFDMSKANPVSTPLANHFKLSLEQCPKT 1358
AAKKILGM+I R++ + L+LSQ SY++ VL RF+M A PVSTP+A HFKLS QC T
Sbjct: 946 AAKKILGMKITRDRNSGLLFLSQQSYIKKVLQRFNMHDAKPVSTPIAPHFKLSALQCAST 1005
Query: 1359 ASEIEGMSKISHASAVGCLMYVMVCTRPDLAQAASQVCKFMSKPGKQHWEAVKWILRYLK 1418
++E MS++ ++SAVG LMY MVC+RPDL+ A S + ++M+
Sbjct: 1006 DEDVEYMSRVPYSSAVGSLMYSMVCSRPDLSHAMSLISRYMANL---------------- 1049
Query: 1419 GTADRGIMFSREQGVVPLVVGYVDSDYAGDLDDRRSTTGYVFTLAGGPICWKSSVQSIVA 1478
DLD RRS TGYVFT+ + WK+++Q +V
Sbjct: 1050 -----------------------------DLDKRRSLTGYVFTIGSCAVSWKATLQPVVV 1080
Query: 1479 MSTTEAEYMAVAEAVKEALWLTGLVKKL 1506
STTEAEYMA+AEA KE++WL GL +L
Sbjct: 1081 QSTTEAEYMAIAEACKESVWLKGLFAEL 1108
>ref|XP_469192.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|53370655|gb|AAU89150.1| integrase core domain
containing protein [Oryza sativa (japonica
cultivar-group)] gi|40538906|gb|AAR87163.1| putative
polyprotein [Oryza sativa (japonica cultivar-group)]
Length = 1322
Score = 345 bits (884), Expect = 9e-93
Identities = 190/460 (41%), Positives = 267/460 (57%), Gaps = 10/460 (2%)
Query: 88 MCMSKTLMNKLFAKQRLYSLKMQEGGDLQAHVYAFNNILADLTRLGVTVDDEDKTIILLC 147
+CMSK L +K+ K +L+S K+ E G + H+ F I+ADL + V DDED ++LLC
Sbjct: 92 ICMSKDLTSKMHIKMKLFSHKLHESGSVLNHISVFKEIVADLVSMEVQFDDEDLGLLLLC 151
Query: 148 SLPGSYDHLVTTLTYGKDSITLDSISSTLLQHAQRRRSVEEGGGSS-GEGLFVKGGQDRG 206
SLP SY + T+ +D +TL + L + + V+ SS GE L V+G ++
Sbjct: 152 SLPSSYANFRHTILLSRDELTLAEVYEALQNREKMKGMVQSYASSSKGEALQVRGRSEQR 211
Query: 207 RGKGKAVDSGKKKRSKSKDRKTAECYSCKQIGHWKRDCPNRSGKSGNSSSA-ANVVQSDG 265
+ R +SK R C CK+ H+ +C K S A+VV S
Sbjct: 212 TYNDSNDHDKSQSRGRSKSRGKKFCKYCKKKNHFIEECWKLQNKEKRKSDGKASVVTSAE 271
Query: 266 SCSEEDLLCV--SYVKCTDAWVLDSGCSCHMTPHREWFNSFKSC-DFGYVYLGDDKPCII 322
+ D L V YV D W+LD+ CS H+ +R+WF+S+KS + V +GDD P I
Sbjct: 272 NSDSGDCLVVFAGYVASHDEWILDTACSFHICINRDWFSSYKSVQNEDVVRMGDDNPREI 331
Query: 323 KGM*QVKIALDDGGVRTLSQVRYVPEVTKNLISLGTLHENGYSFKSEENRDILRVSKGAM 382
G+ V+I DG RTL VR++P + +NLISL TL GY + +++VSKG++
Sbjct: 332 VGIGSVQIKTHDGMTRTLKDVRHIPGMARNLISLSTLDAEGYKYSGSGG--VVKVSKGSL 389
Query: 383 TVMRAKRTAGNIYKLLGSTVMGDV--ASVETDDDA-TKLWHMRLGHLSERGMMELYKRNL 439
M + N+Y L GST+ G V A+V D+ + T LWHMRLGH+SE GM EL KRNL
Sbjct: 390 VYMIGDMNSANLYVLRGSTLHGSVTAAAVTKDEPSKTNLWHMRLGHMSELGMAELMKRNL 449
Query: 440 LKGVRSCTIGLCKYCVLGKQCRVRFKTGQHKTKGILDYVHSDVWGPTKEPSVGGFRYFVT 499
L G + C++CV GK RV+F T H+TKGILDYVH+D+WGP+++PS+GG RY +T
Sbjct: 450 LDGCTQGNMKFCEHCVFGKHKRVKFNTSVHRTKGILDYVHADLWGPSRKPSLGGARYMLT 509
Query: 500 FTDDFSRKVWVYFLKYKSEVFAKFKLWKAEVENQTGRKIK 539
DD+SRK W YFLK+K + FA FK K +E QT +++K
Sbjct: 510 IIDDYSRKEWPYFLKHKDDTFAAFKERKVMIERQTEKEVK 549
Score = 242 bits (618), Expect = 6e-62
Identities = 120/209 (57%), Positives = 158/209 (75%), Gaps = 5/209 (2%)
Query: 1299 AAKKILGMEIHREKGAKKLWLSQNSYVEGVLSRFDMSKANPVSTPLANHFKLSLEQCPKT 1358
AAKKILGMEI R++ + L+LSQ SY++ VL RF+M A PVSTP+A HFKLS QC T
Sbjct: 1035 AAKKILGMEITRDRNSGLLFLSQQSYIKKVLQRFNMHDAKPVSTPIAPHFKLSALQCAST 1094
Query: 1359 ASEIEGMSKISHASAVGCLMYVMVCTRPDLAQAASQVCKFMSKPGKQHWEAVKWILRYLK 1418
++E MS++ ++SAVG LMY MVC+ PDL+ A S V ++M+ PGK+HW+AV+WI RYL+
Sbjct: 1095 DEDVEYMSRVPYSSAVGSLMYAMVCSWPDLSHAMSLVSRYMANPGKEHWKAVQWIFRYLR 1154
Query: 1419 GTADRGIMFSR-EQGVVPLVVGYVDSDYAGDLDDRRSTTGYVFTLAGGPICWKSSVQSIV 1477
GTAD + F R ++G +VGYVDSD+A DLD RRS TGYVFT+ + WK+++Q +V
Sbjct: 1155 GTADACLKFGRIDKG----LVGYVDSDFAADLDKRRSLTGYVFTIGSCAVSWKATLQPVV 1210
Query: 1478 AMSTTEAEYMAVAEAVKEALWLTGLVKKL 1506
A STTEAEYMA+AEA KE++WL GL +L
Sbjct: 1211 AQSTTEAEYMAIAEACKESVWLKGLFAEL 1239
>ref|XP_476137.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|48475101|gb|AAT44170.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
gi|46576026|gb|AAT01387.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1175
Score = 336 bits (861), Expect = 4e-90
Identities = 184/446 (41%), Positives = 261/446 (58%), Gaps = 12/446 (2%)
Query: 103 RLYSLKMQEGGDLQAHVYAFNNILADLTRLGVTVDDEDKTIILLCSLPGSYDHLVTTLTY 162
+L+S K+QE G + H+ F I+ADL + V DDED ++LLCSLP SY + T+
Sbjct: 2 KLFSHKLQESGSILNHISVFKEIVADLVSMEVQFDDEDLGLLLLCSLPSSYANFRDTILL 61
Query: 163 GKDSITLDSISSTLLQHAQRRRSVEEGGGSS-GEGLFVKGGQDRGRGKGKAVDSGKKKRS 221
+ +TL + L + + V+ SS GE L V+G ++ + R
Sbjct: 62 SRSELTLAEVYEALQNREKMKGMVQSDASSSKGEALQVRGRSEQRTYNDSNDRDKNQSRG 121
Query: 222 KSKDRKTAECYSCKQIGHWKRDCPNRSGKSGNSSSA-ANVVQSDGSCSEEDLLCVSYVKC 280
+SK R C CK+ H+ +C K S A+VV S + D L V +V C
Sbjct: 122 RSKSRGKKFCKYCKKKNHFIEECWKLQNKEKRKSDGKASVVTSADNSDSGDCLVV-FVVC 180
Query: 281 T---DAWVLDSGCSCHMTPHREWFNSFKSCDFG-YVYLGDDKPCIIKGM*QVKIALDDGG 336
D W+LD+ CS H+ +R+WF+S+KS G V +GDD P I G+ V+I DG
Sbjct: 181 VSSHDEWILDTTCSFHICINRDWFSSYKSVQNGDVVRMGDDNPREIVGIGSVQIKTHDGM 240
Query: 337 VRTLSQVRYVPEVTKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYK 396
RTL VR++P + +NLISL TL GY + +++VSKG++ M + N+Y
Sbjct: 241 TRTLKDVRHIPRMARNLISLSTLDAEGYKYSGSGG--VVKVSKGSLVYMIGDMNSANLYV 298
Query: 397 LLGSTVMGDV-ASVETDDDATK--LWHMRLGHLSERGMMELYKRNLLKGVRSCTIGLCKY 453
L GST+ G V A+V + D+ +K +WHMRLGH+SE GM EL KRNLL G + C++
Sbjct: 299 LRGSTLHGYVTAAVVSKDEPSKTNMWHMRLGHMSELGMAELMKRNLLDGCTQGNMKFCEH 358
Query: 454 CVLGKQCRVRFKTGQHKTKGILDYVHSDVWGPTKEPSVGGFRYFVTFTDDFSRKVWVYFL 513
CV GK RV+F T H+TKGILDYVH+D+WGP+++PS+GG RY +T DD+SRKVW YFL
Sbjct: 359 CVFGKHKRVKFNTSVHRTKGILDYVHADLWGPSRKPSLGGARYMLTIIDDYSRKVWPYFL 418
Query: 514 KYKSEVFAKFKLWKAEVENQTGRKIK 539
K+K + FA FK WK ++ QT +++K
Sbjct: 419 KHKDDTFAAFKEWKVMIKRQTEKEVK 444
Score = 227 bits (578), Expect = 3e-57
Identities = 111/208 (53%), Positives = 151/208 (72%), Gaps = 3/208 (1%)
Query: 1299 AAKKILGMEIHREKGAKKLWLSQNSYVEGVLSRFDMSKANPVSTPLANHFKLSLEQCPKT 1358
AAKKILGMEI R++ + L+LSQ SY++ VL RF+M PVST +A HFKLS QC T
Sbjct: 888 AAKKILGMEITRDRNSGWLFLSQQSYIKKVLQRFNMHDTKPVSTHIAPHFKLSALQCAST 947
Query: 1359 ASEIEGMSKISHASAVGCLMYVMVCTRPDLAQAASQVCKFMSKPGKQHWEAVKWILRYLK 1418
++E MS++ ++S VG LMY MVC+R DL+ A S V ++M+ PGK+HW+A++WI RYL+
Sbjct: 948 DEDVEYMSRVPYSSVVGSLMYAMVCSRLDLSHAMSLVSRYMANPGKEHWKAIQWIFRYLR 1007
Query: 1419 GTADRGIMFSREQGVVPLVVGYVDSDYAGDLDDRRSTTGYVFTLAGGPICWKSSVQSIVA 1478
TA+ + F R ++GYVDSD+A DLD RRS TGYVFT+ + WK++++ +VA
Sbjct: 1008 DTANACLKFGRTN---KGLIGYVDSDFAADLDKRRSLTGYVFTIGSCAVSWKATLRHVVA 1064
Query: 1479 MSTTEAEYMAVAEAVKEALWLTGLVKKL 1506
STTEAEYMA+AEA KE++WL GL +L
Sbjct: 1065 QSTTEAEYMAIAEACKESVWLKGLFAEL 1092
>gb|AAP53029.1| putative retrotransposon-related protein [Oryza sativa (japonica
cultivar-group)] gi|37532880|ref|NP_920742.1| putative
retrotransposon-related protein [Oryza sativa (japonica
cultivar-group)] gi|22655747|gb|AAN04164.1| Putative
retrotransposon protein [Oryza sativa (japonica
cultivar-group)] gi|16905223|gb|AAL31093.1| putative
retrotransposon-related protein [Oryza sativa]
Length = 1229
Score = 335 bits (859), Expect = 7e-90
Identities = 187/465 (40%), Positives = 269/465 (57%), Gaps = 20/465 (4%)
Query: 88 MCMSKTLMNKLFAKQRLYSLKMQEGGDLQAHVYAFNNILADLTRLGVTVDDEDKTIILLC 147
+CMSK L +K+ K +L+S K+QE G + H+ F I+ADL + V DDED ++LLC
Sbjct: 89 ICMSKDLTSKMHIKMKLFSHKLQESGSVLNHISVFKEIIADLVSMEVQFDDEDLGLLLLC 148
Query: 148 SLPGSYDHLVTTLTYGKDSITLDSISSTLLQHAQRRRSVEEGGGSSGEGLFVKGGQDRGR 207
SLP Y + T+ +D +TL + L Q+ ++ + + + SS +G K Q RGR
Sbjct: 149 SLPSLYANFRDTILLSRDELTLAEVYEAL-QNREKMKGMVQSDASSSKG---KALQVRGR 204
Query: 208 GKGKAVDSGKKK-----RSKSKDRKTAECYSCKQIGHWKRDCPNRSGKSGNSSSA-ANVV 261
+ + + + R +SK R C CK+ H+ +C K S A+VV
Sbjct: 205 SEQRTYNDSNDRDKSQSRGRSKSRGKKFCKYCKKKNHFIEECWKLQNKEKRKSDGKASVV 264
Query: 262 QSDGSCSEEDLLCVSYVKCT---DAWVLDSGCSCHMTPHREWFNSFKSCDFG-YVYLGDD 317
S + D L V + C D W+LD+ C + +R+WF+S KS G V +GD+
Sbjct: 265 TSAENSDSADCL-VFFAGCVASHDEWILDTACLFLICINRDWFSSHKSVQNGDVVRMGDN 323
Query: 318 KPCIIKGM*QVKIALDDGGVRTLSQVRYVPEVTKNLISLGTLHENGYSFKSEENRDILRV 377
P I G+ V+I DG RTL VR++P + +NLISL TL GY + +++V
Sbjct: 324 NPREIMGIGSVQIKTHDGMTRTLKDVRHIPGMARNLISLSTLDAEGYKYSGSGG--VVKV 381
Query: 378 SKGAMTVMRAKRTAGNIYKLLGSTVMGDV--ASVETDDDA-TKLWHMRLGHLSERGMMEL 434
SKG++ M + N+Y L GST+ G + A+V D+ + T LWHMRLGH+SE GM EL
Sbjct: 382 SKGSLVYMIGDMNSANLYVLRGSTLHGSLTAAAVSKDEPSKTNLWHMRLGHMSELGMAEL 441
Query: 435 YKRNLLKGVRSCTIGLCKYCVLGKQCRVRFKTGQHKTKGILDYVHSDVWGPTKEPSVGGF 494
KRNLL G + C++CV GK RV+F T H+TKGILDYVH+D+WGP+++PS+GG
Sbjct: 442 MKRNLLDGCTQGNMKFCEHCVFGKHKRVKFNTSVHRTKGILDYVHADLWGPSRKPSLGGA 501
Query: 495 RYFVTFTDDFSRKVWVYFLKYKSEVFAKFKLWKAEVENQTGRKIK 539
Y +T DD+SRKVW YFLK+K + FA FK WK +E Q +++K
Sbjct: 502 CYMLTIIDDYSRKVWPYFLKHKDDTFAAFKEWKVMIERQAEKEVK 546
Score = 234 bits (598), Expect = 1e-59
Identities = 118/228 (51%), Positives = 162/228 (70%), Gaps = 5/228 (2%)
Query: 1299 AAKKILGMEIHREKGAKKLWLSQNSYVEGVLSRFDMSKANPVSTPLANHFKLSLEQCPKT 1358
A+KKILGMEI R+ + L+LSQ SY++ VL RF++ A PVSTP+A HFKLS QC T
Sbjct: 981 ASKKILGMEITRDINSGLLFLSQQSYIKKVLQRFNIHDAKPVSTPIAPHFKLSALQCTST 1040
Query: 1359 ASEIEGMSKISHASAVGCLMYVMVCTRPDLAQAASQVCKFMSKPGKQHWEAVKWILRYLK 1418
++E MS++ ++S VG LMY MVC+RP L+ A S V ++M+ PGK+HW+AV+WI RYL+
Sbjct: 1041 DEDVEYMSRVPYSSVVGSLMYAMVCSRPVLSHAMSLVSRYMANPGKEHWKAVQWIFRYLR 1100
Query: 1419 GTADRGIMFSR-EQGVVPLVVGYVDSDYAGDLDDRRSTTGYVFTLAGGPICWKSSVQSIV 1477
GTAD + F R ++G +VGYVDSD+A DLD RRS TGYVFT+ + WK+++Q +V
Sbjct: 1101 GTADACLKFGRTDKG----LVGYVDSDFAADLDKRRSLTGYVFTIGSCAVSWKATLQPVV 1156
Query: 1478 AMSTTEAEYMAVAEAVKEALWLTGLVKKLGVEQGGVQLLSIWRTIRCI 1525
A ST EAEYMA+AEA KE++WL GL +L + L +++ C+
Sbjct: 1157 AQSTAEAEYMAIAEACKESVWLKGLFAELCRVDSYINLFCDSQSVICL 1204
>ref|XP_475489.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|48475213|gb|AAT44282.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1243
Score = 324 bits (830), Expect = 2e-86
Identities = 184/465 (39%), Positives = 270/465 (57%), Gaps = 39/465 (8%)
Query: 88 MCMSKTLMNKLFAKQRLYSLKMQEGGDLQAHVYAFNNILADLTRLGVTVDDEDKTIILLC 147
+CM+K L +K+ KQ+L+ K+Q+ + H+ AF I+ADL + V D++D +ILLC
Sbjct: 90 ICMTKDLTSKMHLKQKLFLHKLQDDESVMDHLSAFKEIVADLESMEVKYDEDDLGLILLC 149
Query: 148 SLPGSYDHLVTTLTYGKDSITLDSISSTLLQHAQRRRSV-EEGGGSSGEGLFVKGGQDRG 206
SLP SY + T+ Y +D++TL + + ++ V EG S EGL V+G Q +
Sbjct: 150 SLPSSYANFRGTILYSRDTLTLKEVYDAFHAKEKMKKMVTSEGSNSQAEGLVVRGRQQKK 209
Query: 207 RGKGKAVD---SGKKKRSKSKDRKTAECYSCKQIGH-----WK-RDCPNRSGKSGNSSSA 257
K ++ D S + R+KS+ R + C CK+ GH WK +D R+GK
Sbjct: 210 NTKNQSRDKSSSSYRGRTKSRGRYKS-CKYCKRDGHDISECWKLQDKDKRTGK------- 261
Query: 258 ANVVQSDGSCSEEDLLCVSYVKCTDAWVLDSGCSCHMTPHREWFNSFKSCDFGYVYLGDD 317
G EE V + +DA +L + C T ++WF ++++ G V +GDD
Sbjct: 262 ---YIPKGKKEEEGKAAVVTDEKSDAELLVAYAGCAQTSDQDWFATYEALQGGTVLMGDD 318
Query: 318 KPCIIKGM*QVKIALDDGGVRTLSQVRYVPEVTKNLISLGTLHENGYSFKSEENRDILRV 377
PC + G+ V+I + DG +RTLS V+++P + ++LISL IL+V
Sbjct: 319 TPCEVAGIGTVQIKMFDGCIRTLSDVQHIPNLKRSLISL---------------YGILKV 363
Query: 378 SKGAMTVMRAKRTAGNIYKLLGSTVMGDVASVE---TDDDATKLWHMRLGHLSERGMMEL 434
+KG++ VM+ + N+Y L G+T++G+VA+V ++ DAT LWHMRLGH+SE G+ EL
Sbjct: 364 TKGSLVVMKVDIKSANLYHLRGTTILGNVAAVFDSLSNSDATNLWHMRLGHMSEIGLAEL 423
Query: 435 YKRNLLKGVRSCTIGLCKYCVLGKQCRVRFKTGQHKTKGILDYVHSDVWGPTKEPSVGGF 494
KR LL G + C++C+ GK RV+F T H T+GILDYVHSD+WGP + S GG
Sbjct: 424 SKRGLLDGQSIRKLKFCEHCIFGKHKRVKFNTSTHTTEGILDYVHSDLWGPAHKTSFGGA 483
Query: 495 RYFVTFTDDFSRKVWVYFLKYKSEVFAKFKLWKAEVENQTGRKIK 539
RY +T DD+SRKVW YFLK+K + F FK WK VE QT RK+K
Sbjct: 484 RYMMTIVDDYSRKVWPYFLKHKYQAFDGFKEWKTMVERQTERKVK 528
Score = 248 bits (632), Expect = 2e-63
Identities = 123/208 (59%), Positives = 158/208 (75%), Gaps = 3/208 (1%)
Query: 1299 AAKKILGMEIHREKGAKKLWLSQNSYVEGVLSRFDMSKANPVSTPLANHFKLSLEQCPKT 1358
AAKKILGMEI RE+ + KL+LSQ Y+E VL RF+M A PVSTPLA HF+LS + CP +
Sbjct: 1017 AAKKILGMEITRERHSGKLYLSQKGYIEKVLRRFNMHDAKPVSTPLAAHFRLSSDLCPLS 1076
Query: 1359 ASEIEGMSKISHASAVGCLMYVMVCTRPDLAQAASQVCKFMSKPGKQHWEAVKWILRYLK 1418
+IE MS++ ++SAVG LMY MVC RPDL+ A S V ++M+ PGK+HW+AV+WI RYL+
Sbjct: 1077 DYDIEYMSRVPYSSAVGSLMYAMVCCRPDLSHALSVVNRYMANPGKEHWKAVQWIFRYLR 1136
Query: 1419 GTADRGIMFSREQGVVPLVVGYVDSDYAGDLDDRRSTTGYVFTLAGGPICWKSSVQSIVA 1478
GT+ + F R + +VGYVDSD+AGDLD RRS TGYVFT+ G + WK+S+Q+ VA
Sbjct: 1137 GTSSACLQFERSRDG---LVGYVDSDFAGDLDRRRSITGYVFTIGGCAVSWKASLQATVA 1193
Query: 1479 MSTTEAEYMAVAEAVKEALWLTGLVKKL 1506
+STTEAEYMA+ EA KEA+WL GL +L
Sbjct: 1194 LSTTEAEYMAIFEACKEAIWLRGLYTEL 1221
>emb|CAB75481.1| copia-like polyprotein [Arabidopsis thaliana]
gi|11278366|pir||T47492 copia-like polyprotein -
Arabidopsis thaliana
Length = 1363
Score = 312 bits (800), Expect = 5e-83
Identities = 177/469 (37%), Positives = 260/469 (54%), Gaps = 26/469 (5%)
Query: 90 MSKTLMNKLFAKQRLYSLKMQEGGDLQAHVYAFNNILADLTRLGVTVDDEDKTIILLCSL 149
MSK L N+++ KQ+LYS KM E ++ ++ F +I+ADL L V V DED+ I+LL SL
Sbjct: 108 MSKALPNRIYLKQKLYSFKMSENLSIEGNIDEFLHIVADLENLNVLVSDEDQAILLLMSL 167
Query: 150 PGSYDHLVTTLTY--GKDSITLDSISSTLLQHAQRRRSVEEGGGSSGEGLFVKGGQDRGR 207
P +D L TL Y GK ++LD +++ + SV++ EGL+VK + R
Sbjct: 168 PKPFDQLKDTLKYSSGKTVLSLDEVAAAIYSRELEFGSVKKSIKGQAEGLYVKDKAEN-R 226
Query: 208 GKGKAVDSGKKKRSKSKDRKTAECYSCKQIGHWKRDCPNRS----------------GKS 251
G+ + D GK KRSKSK ++ C+ C + GH K CPN++ GK
Sbjct: 227 GRSEQKDKGKGKRSKSKSKRG--CWICGEDGHLKSTCPNKNKPQFKNQGSNKGESSGGKG 284
Query: 252 GNSSSAANVVQSDGSCSEEDLLCVSYVKCTDAWVLDSGCSCHMTPHREWFNSFKSCDFGY 311
+ N V+S G E L + D W++D+GC HMT REW F G
Sbjct: 285 NLVEGSVNFVESAGMFVSEALSSTD-IHLEDEWIMDTGCIYHMTHKREWLEDFDEEAGGS 343
Query: 312 VYLGDDKPCIIKGM*QVKIALDDGGVRTLSQVRYVPEVTKNLISLGTLHENGYSFKSEEN 371
V +G+ +KG+ V+I D+G TL VRY+P++ +NL+SLGT + G+ F+SE
Sbjct: 344 VRMGNKSISRVKGVGTVRIVNDNGLTVTLQNVRYIPDMDRNLLSLGTFEKAGHKFESENG 403
Query: 372 RDILRVSKGAMTVMRAKRTAGNIYKLLGSTVMGDVASVETDDDATKLWHMRLGHLSERGM 431
+LR+ G ++ +R +Y L G + +V +D T LWH RL H+S++ M
Sbjct: 404 --MLRIKSGNQVLLEGRR-YDTLYILHGKPATDESLAVARANDDTVLWHRRLCHMSQKNM 460
Query: 432 MELYKRNLLKGVRSCTIGLCKYCVLGKQCRVRFKTGQHKTKGILDYVHSDVWG-PTKEPS 490
L K+ L + + C+ C+ G+ ++ F QH TK L+YVHSD+WG PT S
Sbjct: 461 SLLIKKGFLDKKKVSMLDTCEDCIYGRAKKIGFNLAQHDTKKKLEYVHSDLWGAPTVPMS 520
Query: 491 VGGFRYFVTFTDDFSRKVWVYFLKYKSEVFAKFKLWKAEVENQTGRKIK 539
+G +YF++F DD++RKVWVYFLK K E F KF W + VENQ+G ++K
Sbjct: 521 LGNCQYFISFIDDYTRKVWVYFLKTKDEAFEKFVSWISLVENQSGERVK 569
Score = 219 bits (559), Expect = 4e-55
Identities = 110/217 (50%), Positives = 144/217 (65%), Gaps = 2/217 (0%)
Query: 1299 AAKKILGMEIHREKGAKKLWLSQNSYVEGVLSRFDMSKANPVSTPLANHFKLSLEQCPKT 1358
AAKKILG+EI ++ A LWLSQ SY+ VL F+M ++ P TPL H K+ K
Sbjct: 1074 AAKKILGIEIIIDREAGVLWLSQESYLNKVLKTFNMLESKPALTPLGAHLKMKSATEEKL 1133
Query: 1359 ASEIEGMSKISHASAVGCLMYVMVCTRPDLAQAASQVCKFMSKPGKQHWEAVKWILRYLK 1418
++E E M+ + ++SAVG +MY M+ TRPDLA V +FMS+P K+HW VKW+LRY+K
Sbjct: 1134 STEEEYMNSVPYSSAVGSIMYAMIGTRPDLAYPVGVVSRFMSQPAKEHWLGVKWVLRYIK 1193
Query: 1419 GTADRGIMFSREQGVVPLVVGYVDSDYAGDLDDRRSTTGYVFTLAGGPICWKSSVQSIVA 1478
GT D + + R + GY D+DYA DLD RRS TG VFTL G I WKS +Q +VA
Sbjct: 1194 GTVDTRLCYKRNSDF--SICGYCDADYAADLDKRRSITGLVFTLGGNTISWKSGLQRVVA 1251
Query: 1479 MSTTEAEYMAVAEAVKEALWLTGLVKKLGVEQGGVQL 1515
S+TE EYM++ EAVKEA+WL GL+K G EQ V++
Sbjct: 1252 QSSTECEYMSLTEAVKEAIWLKGLLKDFGYEQKNVEI 1288
>gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301696|pir||F84486 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1356
Score = 309 bits (792), Expect = 4e-82
Identities = 173/468 (36%), Positives = 256/468 (53%), Gaps = 31/468 (6%)
Query: 90 MSKTLMNKLFAKQRLYSLKMQEGGDLQAHVYAFNNILADLTRLGVTVDDEDKTIILLCSL 149
MSK L N+++ KQ+LYS KM E ++ ++ F I+ DL + V + DED+ I+LL +L
Sbjct: 108 MSKALPNRIYPKQKLYSFKMSENLSVEGNIDEFLQIITDLENMNVIISDEDQAILLLTAL 167
Query: 150 PGSYDHLVTTLTY--GKDSITLDSISSTLLQHAQRRRSVEEGGGSSGEGLFVKGGQDRGR 207
P ++D L TL Y GK +TLD +++ + SV++ EGL+VK D+
Sbjct: 168 PKAFDQLKDTLKYSSGKSILTLDEVAAAIYSKELELGSVKKSIKVQAEGLYVK---DKNE 224
Query: 208 GKGKAVDSGKKKRSKSKDRKTAECYSCKQIGHWKRDCPNR---------------SGKSG 252
KGK GK K K K +K C++C + GH++ CPN+ SG G
Sbjct: 225 NKGKGEQKGKGKGKKGKSKKKPGCWTCGEEGHFRSSCPNQNKPQFKQSQVVKGESSGGKG 284
Query: 253 NSSSAANVVQSDGSCSEEDLLCVSYVKCTDAWVLDSGCSCHMTPHREWFNSFKSCDFGYV 312
N + AA S+ S E V D W+LD+GCS HMT REWF+ F G V
Sbjct: 285 NLAEAAGYYVSEALSSTE-------VHLEDEWILDTGCSYHMTYKREWFHEFNEDAGGSV 337
Query: 313 YLGDDKPCIIKGM*QVKIALDDGGVRTLSQVRYVPEVTKNLISLGTLHENGYSFKSEENR 372
+G+ ++G+ +++ DG L+ VRY+P++ +NL+SLGT + GY F+SE+
Sbjct: 338 RMGNKTVSRVRGVGTIRVKNSDGLTIVLTNVRYIPDMDRNLLSLGTFEKAGYKFESEDG- 396
Query: 373 DILRVSKGAMTVMRAKRTAGNIYKLLGSTVMGDVASVETDDDATKLWHMRLGHLSERGMM 432
ILR+ G ++ +R +Y L V + +V D T LWH RL H+S++ M
Sbjct: 397 -ILRIKAGNQVLLTGRR-YDTLYLLNWKPVASESLAVVKRADDTVLWHQRLCHMSQKNME 454
Query: 433 ELYKRNLLKGVRSCTIGLCKYCVLGKQCRVRFKTGQHKTKGILDYVHSDVWGPTKEP-SV 491
L ++ L + ++ +C+ C+ GK R F H TK L+Y+HSD+WG P S+
Sbjct: 455 ILVRKGFLDKKKVSSLDVCEDCIYGKAKRKSFSLAHHDTKEKLEYIHSDLWGAPFVPLSL 514
Query: 492 GGFRYFVTFTDDFSRKVWVYFLKYKSEVFAKFKLWKAEVENQTGRKIK 539
G +YF++ DDF+RKVWVYF+K K E F KF W VENQT R++K
Sbjct: 515 GKCQYFMSIIDDFTRKVWVYFMKTKDEAFEKFVEWVNLVENQTDRRVK 562
Score = 216 bits (551), Expect = 4e-54
Identities = 107/217 (49%), Positives = 142/217 (65%), Gaps = 2/217 (0%)
Query: 1299 AAKKILGMEIHREKGAKKLWLSQNSYVEGVLSRFDMSKANPVSTPLANHFKLSLEQCPKT 1358
AAKKILGMEI R++ LWLSQ Y+ +L ++M++A P TPL HFK K
Sbjct: 1067 AAKKILGMEIIRDRTLGVLWLSQEGYLNKILETYNMAEAKPAMTPLGAHFKFQAATEQKL 1126
Query: 1359 ASEIEGMSKISHASAVGCLMYVMVCTRPDLAQAASQVCKFMSKPGKQHWEAVKWILRYLK 1418
+ + M + ++SAVG +MY M+ TRPDLA + +FMS+P K+HW VKW+LRY+K
Sbjct: 1127 IRDEDFMKSVPYSSAVGSIMYAMLGTRPDLAYPVGIISRFMSQPIKEHWLGVKWVLRYIK 1186
Query: 1419 GTADRGIMFSREQGVVPLVVGYVDSDYAGDLDDRRSTTGYVFTLAGGPICWKSSVQSIVA 1478
GT + + + +VGY D+DYA DLD RRS TG VFTL G I WKS +Q +VA
Sbjct: 1187 GTLKTRLCYKKSSSF--SIVGYCDADYAADLDKRRSITGLVFTLGGNTISWKSGLQRVVA 1244
Query: 1479 MSTTEAEYMAVAEAVKEALWLTGLVKKLGVEQGGVQL 1515
STTE+EYM++ EAVKEA+WL GL+K G EQ V++
Sbjct: 1245 QSTTESEYMSLTEAVKEAIWLKGLLKDFGYEQKSVEI 1281
>gb|AAD19773.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301697|pir||B84512 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1335
Score = 308 bits (790), Expect = 7e-82
Identities = 176/466 (37%), Positives = 264/466 (55%), Gaps = 26/466 (5%)
Query: 90 MSKTLMNKLFAKQRLYSLKMQEGGDLQAHVYAFNNILADLTRLGVTVDDEDKTIILLCSL 149
M ++L ++++ + Y+ KMQE + ++ F I+ADL L + V DE + I+LL SL
Sbjct: 83 MIRSLPHRVYTQLSFYTFKMQENKKIDENIDDFLKIVADLNHLQIDVTDEVQAILLLSSL 142
Query: 150 PGSYDHLVTTLTYG--KDSITLDSISSTLLQHAQRRRSVEEGGGSSGEGLFVKGGQDRGR 207
P YD LV T+ Y ++ + LD + ++ + R + + EG F +G D G+
Sbjct: 143 PARYDGLVETMKYSNSREKLRLDDV---MVAARDKERELSQNNRPVVEGHFARGRPD-GK 198
Query: 208 GKGKAVDSGKKKRSKSKDRKTAECYSCKQIGHWKRDC-----PNRSGKSGNSSSAANVVQ 262
+ + RSKS D K C+ C + GH+K+ C N+S + G+ + +++ +
Sbjct: 199 NNNQGNKGKNRSRSKSADGKRV-CWICGKEGHFKKQCYKWIERNKSKQQGSDNGESSLAK 257
Query: 263 SDGS--------CSEEDLLCVSYVKCTDAWVLDSGCSCHMTPHREWFNSFKSCDFGYVYL 314
S + ++E L+ + + WVLD+GCS HMTP ++WF FK GYV +
Sbjct: 258 STEAFNPAMVLLATDETLVVTDSI--ANEWVLDTGCSFHMTPRKDWFKDFKELSSGYVKM 315
Query: 315 GDDKPCIIKGM*QVKIALDDGGVRTLSQVRYVPEVTKNLISLGTLHENGYSFKSEENRDI 374
G+D +KG+ +KI DG L+ VRY+P +T+NLISLGTL + G FKS++ I
Sbjct: 316 GNDTYSPVKGIGSIKIRNSDGSQVILTDVRYMPNMTRNLISLGTLEDRGCWFKSQDG--I 373
Query: 375 LRVSKGAMTVMRAKRTAGNIYKLLGSTVMGDVASVETDDDATKLWHMRLGHLSERGMMEL 434
L++ KG T+++ ++ +Y L G T G+ S D T LWH RLGH+S++GM L
Sbjct: 374 LKIVKGCSTILKGQK-RDTLYILDGVTEEGESHSSAEVKDETALWHSRLGHMSQKGMEIL 432
Query: 435 YKRNLLKGVRSCTIGLCKYCVLGKQCRVRFKTGQHKTKGILDYVHSDVWGPTKEP-SVGG 493
K+ L+ + C+ CV GKQ RV F QH TK L YVHSD+WG P S+G
Sbjct: 433 VKKGCLRREVIKELEFCEDCVYGKQHRVSFAPAQHVTKEKLAYVHSDLWGSPHNPASLGN 492
Query: 494 FRYFVTFTDDFSRKVWVYFLKYKSEVFAKFKLWKAEVENQTGRKIK 539
+YF++F DD+SRKVW+YFL+ K E F KF WK VENQ+ RK+K
Sbjct: 493 SQYFISFVDDYSRKVWIYFLRKKDEAFEKFVEWKKMVENQSDRKVK 538
Score = 218 bits (556), Expect = 1e-54
Identities = 111/218 (50%), Positives = 145/218 (65%), Gaps = 2/218 (0%)
Query: 1300 AKKILGMEIHREKGAKKLWLSQNSYVEGVLSRFDMSKANPVSTPLANHFKLSLEQCPKTA 1359
AKKILGMEI R++ A L LSQ YV+ VL F M A PVSTPL HFKL +
Sbjct: 1047 AKKILGMEISRDRDAGLLTLSQEGYVKKVLRSFQMDNAKPVSTPLGIHFKLKAATDKEYE 1106
Query: 1360 SEIEGMSKISHASAVGCLMYVMVCTRPDLAQAASQVCKFMSKPGKQHWEAVKWILRYLKG 1419
+ E M + +A+ +G +MY M+ TRPDLA + + +FMSKP K HW+AVKW+LRY++G
Sbjct: 1107 EQFERMKIVPYANTIGSIMYSMIGTRPDLAYSLGVISRFMSKPLKDHWQAVKWVLRYMRG 1166
Query: 1420 TADRGIMFSREQGVVPLVVGYVDSDYAGDLDDRRSTTGYVFTLAGGPICWKSSVQSIVAM 1479
T + + F +++ L+ GY DSDY + D RRS TGYVFT+ G I WKS +Q +VA+
Sbjct: 1167 TEKKKLCFRKQEDF--LLRGYCDSDYGSNFDTRRSITGYVFTVGGNTISWKSKLQKVVAI 1224
Query: 1480 STTEAEYMAVAEAVKEALWLTGLVKKLGVEQGGVQLLS 1517
S+TEAEYMA+ EAVKEALWL G +LG Q V++ S
Sbjct: 1225 SSTEAEYMALTEAVKEALWLKGFAAELGHSQDYVEVHS 1262
>emb|CAA32025.1| unnamed protein product [Nicotiana tabacum]
gi|130582|sp|P10978|POLX_TOBAC Retrovirus-related Pol
polyprotein from transposon TNT 1-94 [Contains: Protease
; Reverse transcriptase ; Endonuclease]
Length = 1328
Score = 305 bits (782), Expect = 6e-81
Identities = 174/461 (37%), Positives = 266/461 (56%), Gaps = 19/461 (4%)
Query: 90 MSKTLMNKLFAKQRLYSLKMQEGGDLQAHVYAFNNILADLTRLGVTVDDEDKTIILLCSL 149
MSKTL NKL+ K++LY+L M EG + +H+ FN ++ L LGV +++EDK I+LL SL
Sbjct: 93 MSKTLTNKLYLKKQLYALHMSEGTNFLSHLNVFNGLITQLANLGVKIEEEDKAILLLNSL 152
Query: 150 PGSYDHLVTTLTYGKDSITLDSISSTLLQHAQRRRSVEEGGGSSGEGLFVKG-GQDRGRG 208
P SYD+L TT+ +GK +I L ++S LL + + R+ E + G+ L +G G+ R
Sbjct: 153 PSSYDNLATTILHGKTTIELKDVTSALLLNEKMRKKPE----NQGQALITEGRGRSYQRS 208
Query: 209 KGKAVDSGKKKRSKSKDR-KTAECYSCKQIGHWKRDCPN-------RSGKSGNSSSAANV 260
SG + +SK++ + + CY+C Q GH+KRDCPN SG+ + ++AA V
Sbjct: 209 SNNYGRSGARGKSKNRSKSRVRNCYNCNQPGHFKRDCPNPRKGKGETSGQKNDDNTAAMV 268
Query: 261 VQSDGSCS--EEDLLCVSYVKCTDAWVLDSGCSCHMTPHREWFNSFKSCDFGYVYLGDDK 318
+D E+ C+ WV+D+ S H TP R+ F + + DFG V +G+
Sbjct: 269 QNNDNVVLFINEEEECMHLSGPESEWVVDTAASHHATPVRDLFCRYVAGDFGTVKMGNTS 328
Query: 319 PCIIKGM*QVKIALDDGGVRTLSQVRYVPEVTKNLISLGTLHENGYSFKSEENRDILRVS 378
I G+ + I + G L VR+VP++ NLIS L +GY + R++
Sbjct: 329 YSKIAGIGDICIKTNVGCTLVLKDVRHVPDLRMNLISGIALDRDGYESYFANQK--WRLT 386
Query: 379 KGAMTVMRAKRTAGNIYKLLGSTVMGDVASVETDDDATKLWHMRLGHLSERGMMELYKRN 438
KG++ + + G +Y+ G++ + + D+ + LWH R+GH+SE+G+ L K++
Sbjct: 387 KGSLVIAKGV-ARGTLYRTNAEICQGELNAAQ-DEISVDLWHKRMGHMSEKGLQILAKKS 444
Query: 439 LLKGVRSCTIGLCKYCVLGKQCRVRFKTGQHKTKGILDYVHSDVWGPTKEPSVGGFRYFV 498
L+ + T+ C YC+ GKQ RV F+T + ILD V+SDV GP + S+GG +YFV
Sbjct: 445 LISYAKGTTVKPCDYCLFGKQHRVSFQTSSERKLNILDLVYSDVCGPMEIESMGGNKYFV 504
Query: 499 TFTDDFSRKVWVYFLKYKSEVFAKFKLWKAEVENQTGRKIK 539
TF DD SRK+WVY LK K +VF F+ + A VE +TGRK+K
Sbjct: 505 TFIDDASRKLWVYILKTKDQVFQVFQKFHALVERETGRKLK 545
Score = 244 bits (624), Expect = 1e-62
Identities = 116/211 (54%), Positives = 153/211 (71%), Gaps = 3/211 (1%)
Query: 1300 AKKILGMEIHREKGAKKLWLSQNSYVEGVLSRFDMSKANPVSTPLANHFKLSLEQCPKTA 1359
A++ILGM+I RE+ ++KLWLSQ Y+E VL RF+M A PVSTPLA H KLS + CP T
Sbjct: 1041 AQQILGMKIVRERTSRKLWLSQEKYIERVLERFNMKNAKPVSTPLAGHLKLSKKMCPTTV 1100
Query: 1360 SEIEGMSKISHASAVGCLMYVMVCTRPDLAQAASQVCKFMSKPGKQHWEAVKWILRYLKG 1419
E M+K+ ++SAVG LMY MVCTRPD+A A V +F+ PGK+HWEAVKWILRYL+G
Sbjct: 1101 EEKGNMAKVPYSSAVGSLMYAMVCTRPDIAHAVGVVSRFLENPGKEHWEAVKWILRYLRG 1160
Query: 1420 TADRGIMFSREQGVVPLVVGYVDSDYAGDLDDRRSTTGYVFTLAGGPICWKSSVQSIVAM 1479
T + F G P++ GY D+D AGD+D+R+S+TGY+FT +GG I W+S +Q VA+
Sbjct: 1161 TTGDCLCFG---GSDPILKGYTDADMAGDIDNRKSSTGYLFTFSGGAISWQSKLQKCVAL 1217
Query: 1480 STTEAEYMAVAEAVKEALWLTGLVKKLGVEQ 1510
STTEAEY+A E KE +WL +++LG+ Q
Sbjct: 1218 STTEAEYIAATETGKEMIWLKRFLQELGLHQ 1248
>dbj|BAD34493.1| Gag-Pol [Ipomoea batatas]
Length = 1298
Score = 304 bits (779), Expect = 1e-80
Identities = 167/458 (36%), Positives = 269/458 (58%), Gaps = 17/458 (3%)
Query: 91 SKTLMNKLFAKQRLYSLKMQEGGDLQAHVYAFNNILADLTRLGVTVDDEDKTIILLCSLP 150
+K+L NK+F K++LY+L+M E + H+ N + + LT L ++ +++ +LL SLP
Sbjct: 90 AKSLHNKIFLKRKLYTLRMSESTSVTEHLNTLNTLFSQLTSLSCKIEPQERAELLLQSLP 149
Query: 151 GSYDHLVTTLTYG--KDSITLDSISSTLLQHAQRRRSVEEGGGS--SGEGLFVKGGQDRG 206
SYD L+ LT D + D +++ +L+ RR++ E+ + E L V G+
Sbjct: 150 DSYDQLIINLTNNILTDYLVFDDVAAAVLEEESRRKNKEDRQVNLQQAEALTVMRGRSTE 209
Query: 207 RGKGKAVDSGKKKRSKSKDRKTAECYSCKQIGHWKRDCPNRSGKSGNSSSAANVVQSDGS 266
RG+ S + RSKS +K CY+C + GH K+DC N + S + A+ DGS
Sbjct: 210 RGQ-----SSGRGRSKSS-KKNLTCYNCGKKGHLKKDCWNLAQNSNPQGNVAST-SDDGS 262
Query: 267 --CSEEDLLCVSYVKCTDAWVLDSGCSCHMTPHREWFNSFKSCDFGYVYLGDDKPCIIKG 324
C E + + D W++DSG + HMT +EWF+ ++ G VY DD I G
Sbjct: 263 ALCCEASIAREGRKRFADIWLIDSGATYHMTSRKEWFHHYEPISGGSVYSCDDHALEIIG 322
Query: 325 M*QVKIALDDGGVRTLSQVRYVPEVTKNLISLGTLHENGYSFKSEENRDILRVSKGAMTV 384
+ +K+ + DG V+T+ VR+V + KNL+S G L + ++++ ++++ +GA+ V
Sbjct: 323 IGTIKLKMYDGTVQTVQDVRHVKGLKKNLLSYGILDNSATQIETQKG--VMKIFQGALVV 380
Query: 385 MRAKRTAGNIYKLLGSTVMGDVASVET-DDDATKLWHMRLGHLSERGMMELYKRNLLKGV 443
M+ ++ A N+Y L G T+ ASV D+T LWH +LGH+S++GM L ++ L+ G+
Sbjct: 381 MKGEKIAANLYMLKGETLQEAEASVAACSPDSTLLWHQKLGHMSDQGMKILVEQKLIPGL 440
Query: 444 RSCTIGLCKYCVLGKQCRVRFKTGQHKTKGILDYVHSDVWGPTKEPSVGGFRYFVTFTDD 503
++ LC++C+ KQ R++F T + K +L+ VHSDVW PS+GG +YFV+F DD
Sbjct: 441 TKVSLPLCEHCITSKQHRLKFSTSNSRGKVVLELVHSDVW-QAPVPSLGGAKYFVSFIDD 499
Query: 504 FSRKVWVYFLKYKSEVFAKFKLWKAEVENQTGRKIKYY 541
+SR+ WVY +K KS+VFA FK +KA VE +G+KIK +
Sbjct: 500 YSRRCWVYPIKKKSDVFATFKAFKARVELDSGKKIKCF 537
Score = 226 bits (577), Expect = 4e-57
Identities = 110/216 (50%), Positives = 152/216 (69%), Gaps = 3/216 (1%)
Query: 1300 AKKILGMEIHREKGAKKLWLSQNSYVEGVLSRFDMSKANPVSTPLANHFKLSLEQCPKTA 1359
A KILGM+IHR++G +K+WLSQ +Y++ +LSRF M +STPL + K+S P
Sbjct: 1011 ANKILGMQIHRDRGNRKIWLSQKNYLKKILSRFSMQDCKSISTPLPINLKVSSSMSPSNE 1070
Query: 1360 SEIEGMSKISHASAVGCLMYVMVCTRPDLAQAASQVCKFMSKPGKQHWEAVKWILRYLKG 1419
MS++ +ASAVG LM+ M+CTRPD+AQA V ++M+ PG++HW VK ILRY+KG
Sbjct: 1071 EGRMEMSRVPYASAVGSLMFAMICTRPDIAQAVGVVSRYMANPGREHWNCVKRILRYIKG 1130
Query: 1420 TADRGIMFSREQGVVPLVVGYVDSDYAGDLDDRRSTTGYVFTLAGGPICWKSSVQSIVAM 1479
T+D + + G ++ GYVDSDYAGDLD +STTGYVF +AGG + W S +Q++VA
Sbjct: 1131 TSDVALCYG---GSDFIINGYVDSDYAGDLDKSKSTTGYVFKVAGGAVSWVSKLQAVVAT 1187
Query: 1480 STTEAEYMAVAEAVKEALWLTGLVKKLGVEQGGVQL 1515
STTEAEY+A +A KEA+WL L+++LG +Q V L
Sbjct: 1188 STTEAEYVAATQASKEAIWLKMLLEELGHKQEFVSL 1223
>dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsis thaliana]
Length = 1342
Score = 297 bits (760), Expect = 2e-78
Identities = 172/465 (36%), Positives = 261/465 (55%), Gaps = 43/465 (9%)
Query: 90 MSKTLMNKLFAKQRLYSLKMQEGGDLQAHVYAFNNILADLTRLGVTVDDEDKTIILLCSL 149
M+K+L N+++ KQRLY KM E ++ +V F +++DL + V V DED+ I+LL SL
Sbjct: 113 MAKSLPNRIYLKQRLYGYKMSENMTMEENVNDFFKLISDLENVKVVVPDEDQAIVLLMSL 172
Query: 150 PGSYDHLVTTLTYGKDSITLDSISSTLLQHAQRRRSVEEGGG-----SSGEGLFVKGGQD 204
P +D L TL Y K ++ L+ I+S + R + +E G ++ +GLFV QD
Sbjct: 173 PRQFDQLKETLKYCKTTLHLEEITSAI-----RSKILELGASGKLLKNNSDGLFV---QD 224
Query: 205 RGRGKGKAVDSGKKK-RSKSKDRKTAECYSCKQIGHWKRDC-----PNRSGKSGNSSSAA 258
RGR + + K K RSKSK C+ C + GH+K+ C N+ G + A+
Sbjct: 225 RGRSETRGKGPNKNKSRSKSKGAGKT-CWICGKEGHFKKQCYVWKERNKQGSTSERGEAS 283
Query: 259 NVVQ--SDGSCSEEDLLCVSYVKCT-DAWVLDSGCSCHMTPHREWFNSFKSCDFGYVYLG 315
V +D + + + + T D W+LD+GCS HMT ++W FK G V +G
Sbjct: 284 TVTARVTDAAALVVSRALLGFAEVTPDTWILDTGCSFHMTCRKDWIIDFKETASGKVRMG 343
Query: 316 DDKPCIIKGM*QVKIALDDGGVRTLSQVRYVPEVTKNLISLGTLHENGYSFKSEENRDIL 375
+D +KG+ V+I +DG L+ VRY+PE++KNLISLGTL + G F+S+ + IL
Sbjct: 344 NDTYSEVKGIGDVRIKNEDGSTILLTDVRYIPEMSKNLISLGTLEDKGCWFESK--KGIL 401
Query: 376 RVSKGAMTVMRAKRTAGNIYKLLGSTVMGDVASVETDDDATKLWHMRLGHLSERGMMELY 435
+ K +TV+ K+ + +Y L G+T+ G+ ++ + D T LWH RLGH+ +G+ L
Sbjct: 402 TIFKNDLTVLTGKKES-TLYFLQGTTLAGEANVIDKEKDETSLWHSRLGHIGAKGLQVLV 460
Query: 436 KRNLLKGVRSCTIGLCKYCVLGKQCRVRFKTGQHKTKGILDYVHSDVWGPTKEP-SVGGF 494
+ L K + F +H TK LDYVHSD+WG T P S+G
Sbjct: 461 SKG----------------HLDKNIMISFGAAKHVTKDKLDYVHSDLWGSTNVPFSIGKC 504
Query: 495 RYFVTFTDDFSRKVWVYFLKYKSEVFAKFKLWKAEVENQTGRKIK 539
+YF+TF DDF+R+ W+YF++ K E F+KF WK ++ENQ +K+K
Sbjct: 505 QYFITFIDDFTRRTWIYFIRTKDEAFSKFVEWKTQIENQQDKKLK 549
Score = 216 bits (550), Expect = 5e-54
Identities = 110/217 (50%), Positives = 144/217 (65%), Gaps = 2/217 (0%)
Query: 1300 AKKILGMEIHREKGAKKLWLSQNSYVEGVLSRFDMSKANPVSTPLANHFKLSLEQCPKTA 1359
A+KILGMEI R + L LSQ+ YV GVL F M ++ TPL HFKL A
Sbjct: 1054 ARKILGMEITRNREQGILDLSQSEYVAGVLRAFGMDQSKVSQTPLGAHFKLRAANEKTLA 1113
Query: 1360 SEIEGMSKISHASAVGCLMYVMVCTRPDLAQAASQVCKFMSKPGKQHWEAVKWILRYLKG 1419
+ E M + + +A+G +MY M+ +RPDLA V +FMSKP K+HW+AVKW++RY+KG
Sbjct: 1114 RDAEYMKLVPYPNAIGSIMYSMIGSRPDLAYPVGVVSRFMSKPSKEHWQAVKWVMRYMKG 1173
Query: 1420 TADRGIMFSREQGVVPLVVGYVDSDYAGDLDDRRSTTGYVFTLAGGPICWKSSVQSIVAM 1479
T D + F ++ + GY DSDYA DLD RRS TG+VFT G I WKS +Q +VA+
Sbjct: 1174 TQDTCLRFKKDDKFE--IRGYCDSDYATDLDRRRSITGFVFTAGGNTISWKSGLQRVVAL 1231
Query: 1480 STTEAEYMAVAEAVKEALWLTGLVKKLGVEQGGVQLL 1516
STTEAEYMA+AEAVKEA+WL GL ++G EQ V+++
Sbjct: 1232 STTEAEYMALAEAVKEAIWLRGLAAEMGFEQDAVEVM 1268
>gb|AAK29467.1| polyprotein-like [Lycopersicon chilense]
Length = 1328
Score = 293 bits (751), Expect = 2e-77
Identities = 177/467 (37%), Positives = 257/467 (54%), Gaps = 31/467 (6%)
Query: 90 MSKTLMNKLFAKQRLYSLKMQEGGDLQAHVYAFNNILADLTRLGVTVDDEDKTIILLCSL 149
MSKTL NKL+ K++LY+L M EG + +H+ N ++ L LGV +++EDK I+LL SL
Sbjct: 94 MSKTLTNKLYLKKQLYTLHMDEGTNFLSHLNVLNGLITQLANLGVKIEEEDKRIVLLNSL 153
Query: 150 PGSYDHLVTTLTYGKDSITLDSISSTLLQHAQRRRSVEEGG--------GSSGEGLFVKG 201
P SYD L TT+ +GKDSI L ++S LL + + R+ E G G S +
Sbjct: 154 PSSYDTLSTTILHGKDSIQLKDVTSALLLNEKMRKKPENHGQVFITESRGRSYQRSSSNY 213
Query: 202 GQDRGRGKGKAVDSGKKKRSKSKDRKTAECYSCKQIGHWKRDCPN-------RSGKSGNS 254
G+ RGK K RSKSK R CY+C Q GH+KRDCPN SG+ +
Sbjct: 214 GRSGARGKSKV-------RSKSKARN---CYNCDQPGHFKRDCPNPKRGKGESSGQKNDD 263
Query: 255 SSAANVVQSDGSCS--EEDLLCVSYVKCTDAWVLDSGCSCHMTPHREWFNSFKSCDFGYV 312
++AA V +D E+ C+ WV+D+ S H TP R+ F + + D+G V
Sbjct: 264 NTAAMVQNNDDVVLLINEEEECMHLAGTESEWVVDTAASYHATPVRDLFCRYVAGDYGNV 323
Query: 313 YLGDDKPCIIKGM*QVKIALDDGGVRTLSQVRYVPEVTKNLISLGTLHENGYSFKSEENR 372
+G+ I G+ + + G L VR+VP++ NLIS L ++GY +
Sbjct: 324 KMGNTSYSKIAGIGDICFKTNVGCTLVLKDVRHVPDLRMNLISGIALDQDGYENYFANQK 383
Query: 373 DILRVSKGAMTVMRAKRTAGNIYKLLGSTVMGDVASVETDDDATKLWHMRLGHLSERGMM 432
R++KGA+ + + G +Y+ G++ + ++ A LWH R+GH SE+G+
Sbjct: 384 --WRLTKGALVIAKGV-ARGTLYRTNAEICQGELNAAHEENSAD-LWHKRMGHTSEKGLQ 439
Query: 433 ELYKRNLLKGVRSCTIGLCKYCVLGKQCRVRFKTGQHKTKGILDYVHSDVWGPTKEPSVG 492
L K++L+ + TI C Y + GKQ RV F+T + ILD V+SDV GP + S+G
Sbjct: 440 ILSKKSLISFTKGTTIKPCNYWLFGKQHRVSFQTSSERKSNILDLVYSDVCGPMEIESMG 499
Query: 493 GFRYFVTFTDDFSRKVWVYFLKYKSEVFAKFKLWKAEVENQTGRKIK 539
G +YFVTF DD SRK+WVY + K +VF F+ + A VE +TGRK K
Sbjct: 500 GNKYFVTFIDDASRKLWVYIFRAKDQVFQVFQKFHALVERETGRKRK 546
Score = 191 bits (484), Expect = 2e-46
Identities = 104/212 (49%), Positives = 140/212 (65%), Gaps = 6/212 (2%)
Query: 1300 AKKILGMEIHREKGAKKLWLSQNSYVEGVLSRFDMSKANPVSTPLANHFKLSLEQCP-KT 1358
AK+ILGM+I RE+ KKL LS Y+E VL RF+M A P+STPL ++ KL+ + P K
Sbjct: 1042 AKQILGMKIAREE-QKKLGLSHEKYIERVLERFNMKSAKPISTPLVSYLKLTKQMFPTKK 1100
Query: 1359 ASEIEGMSKISHASAVGCLMYVMVCTRPDLAQAASQVCKFMSKPGKQHWEAVKWILRYLK 1418
E M+K+ ++SAVG MY MVCTRP++ A V +F+ PGK+H EAVKWILRYL+
Sbjct: 1101 KGEKGDMAKVPYSSAVGSFMYAMVCTRPNIV-AVCVVSRFLEIPGKEHLEAVKWILRYLR 1159
Query: 1419 GTADRGIMFSREQGVVPLVVGYVDSDYAGDLDDRRSTTGYVFTLAGGPICWKSSVQSIVA 1478
T F +G P+ GY + D GDLD+R+STT Y+FT +GG I W+S +Q VA
Sbjct: 1160 RTTRDYFCF---EGSDPISKGYTNVDMEGDLDNRKSTTCYLFTFSGGDISWQSKLQKYVA 1216
Query: 1479 MSTTEAEYMAVAEAVKEALWLTGLVKKLGVEQ 1510
+STTEA+Y+A E KE LWL +++ G+ Q
Sbjct: 1217 LSTTEAKYIAGTEVCKEMLWLKRFLQEHGLHQ 1248
>emb|CAB78169.1| putative retrotransposon [Arabidopsis thaliana]
gi|4539406|emb|CAB40039.1| putative retrotransposon
[Arabidopsis thaliana] gi|7444416|pir||T04181
hypothetical protein F7L13.40 - Arabidopsis thaliana
Length = 1230
Score = 291 bits (745), Expect = 1e-76
Identities = 172/468 (36%), Positives = 248/468 (52%), Gaps = 45/468 (9%)
Query: 90 MSKTLMNKLFAKQRLYSLKMQEGGDLQAHVYAFNNILADLTRLGVTVDDEDKTIILLCSL 149
MSK L N+++ KQ+LYS KMQE ++ ++ F ++ADL V V DED+ I+LL SL
Sbjct: 106 MSKALPNRIYLKQKLYSYKMQENLSVEGNIDEFLRLIADLENTNVLVSDEDQAILLLMSL 165
Query: 150 PGSYDHLVTTLTYGKDSITLD------SISSTLLQHAQRRRSVEEGGGSSGEGLFVKGGQ 203
P +D L TL YG TL +I S L+ ++S+ EGL+VK
Sbjct: 166 PKQFDQLKDTLKYGSGRTTLSVDEVVAAIYSKELELGSNKKSIR----GQAEGLYVKDKP 221
Query: 204 DRGRGKGKAVDSGKKKRSKSKDRKTAECYSCKQIGHWKRDCPNRS-----GKSGNSSSAA 258
+ RG + + G K RS+S+ + C+ C + GH+K CPN+ GK S S
Sbjct: 222 ET-RGMSEQKEKGNKGRSRSRSKGWKGCWICGEEGHFKTSCPNKGKQQNKGKDQASGSKG 280
Query: 259 NVVQSDGSCSE------EDLLCVSYVKCTDAWVLDSGCSCHMTPHREWFNSFKSCDFGYV 312
G+ SE + L + V + WV+D+GC+ HMT +EWF G V
Sbjct: 281 EAATIKGNTSEGSGYYVSEALHSTDVNLGNEWVMDTGCNYHMTHKKEWFEELSEDAGGTV 340
Query: 313 YLGDDKPCIIKGM*QVKIALDDGGVRTLSQVRYVPEVTKNLISLGTLHENGYSFKSEENR 372
+G+ + V+Y+P++ +NL+S+GTL E+GYSF+S+
Sbjct: 341 RMGNKSTSKFR-------------------VKYIPDMDRNLLSMGTLEEHGYSFESKNG- 380
Query: 373 DILRVSKGAMTVMRAKRTAGNIYKLLGSTVMGDVASVETDDDATKLWHMRLGHLSERGMM 432
+L V +G T++ R +Y L G + +VE +D T LWH RLGH+S++ M
Sbjct: 381 -VLVVKEGTRTLLIGSRHE-KLYLLQGKPEVSHSMTVERRNDDTVLWHRRLGHISQKNMD 438
Query: 433 ELYKRNLLKGVRSCTIGLCKYCVLGKQCRVRFKTGQHKTKGILDYVHSDVWGPTKEP-SV 491
L K+ L G + + LC+ C+ GK R+ F H T+ L+YVHSD+WG P S+
Sbjct: 439 ILVKKGYLDGKKVSKLELCEDCIYGKARRLSFVVATHNTEDKLNYVHSDLWGAPSVPLSL 498
Query: 492 GGFRYFVTFTDDFSRKVWVYFLKYKSEVFAKFKLWKAEVENQTGRKIK 539
G +YF++F D +SRK WVYFLK+K E F F W VENQTGRKIK
Sbjct: 499 GKCQYFISFIDVYSRKTWVYFLKHKDEAFGTFAEWSVMVENQTGRKIK 546
Score = 147 bits (370), Expect = 4e-33
Identities = 85/217 (39%), Positives = 122/217 (56%), Gaps = 28/217 (12%)
Query: 1299 AAKKILGMEIHREKGAKKLWLSQNSYVEGVLSRFDMSKANPVSTPLANHFKLSLEQCPKT 1358
AAK+ILGMEI R++ L LSQ Y+ VL +++ + V TPL H K+ +
Sbjct: 967 AAKRILGMEISRDRVKGTLTLSQEDYLSKVLETYNVDQCKFVVTPLGAHLKMHAATEQQL 1026
Query: 1359 ASEIEGMSKISHASAVGCLMYVMVCTRPDLAQAASQVCKFMSKPGKQHWEAVKWILRYLK 1418
S+ E M + +++AVG +MY M+ TRPDLA + +FMSKP
Sbjct: 1027 LSDEEYMKSVPYSNAVGSIMYSMIDTRPDLAYCVGIISRFMSKP---------------- 1070
Query: 1419 GTADRGIMFSREQGVVPLVVGYVDSDYAGDLDDRRSTTGYVFTLAGGPICWKSSVQSIVA 1478
+G + GY DSDYA +L++RRS +G VFTL G I +S +Q +V
Sbjct: 1071 ------------KGADLTLRGYCDSDYAANLENRRSISGMVFTLGGSTINLRSCLQKVVV 1118
Query: 1479 MSTTEAEYMAVAEAVKEALWLTGLVKKLGVEQGGVQL 1515
MS+T+A YM++ EAVKEA+WL GL++ G EQ V++
Sbjct: 1119 MSSTKAGYMSLTEAVKEAIWLKGLLQDFGYEQKTVEI 1155
>ref|XP_475663.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|48475188|gb|AAT44257.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1211
Score = 288 bits (738), Expect = 8e-76
Identities = 166/418 (39%), Positives = 246/418 (58%), Gaps = 27/418 (6%)
Query: 88 MCMSKTLMNKLFAKQRLYSLKMQEGGDLQAHVYAFNNILADLTRLGVTVDDEDKTIILLC 147
+CM+K L +K+ KQ+L+ K+Q+ G + H+ AF I+ADL + V D+ED +ILLC
Sbjct: 90 ICMTKDLTSKMHLKQKLFLHKLQDDGSVMDHLSAFKKIVADLESMEVKYDEEDLCLILLC 149
Query: 148 SLPGSYDHLVTTLTYGKDSITLDSISSTLLQHAQRRRSV-EEGGGSSGEGLFVKGGQDRG 206
SLP SY + T+ Y D++TL + L + ++ V EG S EGL V+G Q
Sbjct: 150 SLPSSYANFRDTILYSCDTLTLKEVYDALHAKEKIKKMVPSEGSNSQAEGLVVRGRQQEK 209
Query: 207 RGKGKAVD---SGKKKRSKSKDRKTAECYSCKQIGH-----WK-RDCPNRSGK-----SG 252
K+ D S + RSKS+ R + C K+ GH WK +D R+GK
Sbjct: 210 NTNSKSRDKSSSSYRGRSKSRGRYKS-CKYYKRDGHDISECWKLQDKDKRTGKYVPKGKK 268
Query: 253 NSSSAANVVQSDGSCSEEDLLCVSYVKC---TDAWVLDSGCSCHMTPHREWFNSFKSCDF 309
A VV + S +E L V+Y C +D W+LD+ C+ HM +R+WF ++++
Sbjct: 269 EEEGKAAVVTDEKSDAE---LLVAYAGCAQTSDQWILDTACTYHMCLNRDWFATYEAVQG 325
Query: 310 GYVYLGDDKPCIIKGM*QVKIALDDGGVRTLSQVRYVPEVTKNLISLGTLHENGYSFKSE 369
G V +GDD PC + G+ V+I + DG +RTLS VR++P + ++LISL TL Y +
Sbjct: 326 GTVLMGDDTPCEVAGVETVQIKMFDGCIRTLSDVRHIPNLKRSLISLCTLDRKVYKYSGG 385
Query: 370 ENRDILRVSKGAMTVMRAKRTAGNIYKLLGSTVMGDVASVET---DDDATKLWHMRLGHL 426
+ IL+V+KG++ VM+A + N+Y + G+T++G++A+V + DAT LWHMRLGH+
Sbjct: 386 DG--ILKVTKGSLVVMKADIKSANLYHVRGTTILGNIAAVSDSLYNSDATNLWHMRLGHM 443
Query: 427 SERGMMELYKRNLLKGVRSCTIGLCKYCVLGKQCRVRFKTGQHKTKGILDYVHSDVWG 484
SE G+ EL KR LL G + C++C+ GK RV+F T H T+ ILDYVHSD+WG
Sbjct: 444 SEIGLAELSKRGLLDGQSIGKLKFCEHCIFGKHKRVKFNTSTHTTESILDYVHSDLWG 501
Score = 241 bits (615), Expect = 1e-61
Identities = 118/208 (56%), Positives = 156/208 (74%), Gaps = 3/208 (1%)
Query: 1299 AAKKILGMEIHREKGAKKLWLSQNSYVEGVLSRFDMSKANPVSTPLANHFKLSLEQCPKT 1358
AAKKILGMEI RE+ + KL+LSQ Y+E VL RF+M A PVSTPLA HF+LS + CP++
Sbjct: 924 AAKKILGMEITRERHSGKLYLSQKGYIEKVLRRFNMHDAKPVSTPLAAHFRLSSDLCPQS 983
Query: 1359 ASEIEGMSKISHASAVGCLMYVMVCTRPDLAQAASQVCKFMSKPGKQHWEAVKWILRYLK 1418
+IE MS++ + S VG LMY MVC+R DL+ A S V ++M+ PGK+HW+ V+WI +YL+
Sbjct: 984 DYDIEYMSRVPYLSVVGSLMYAMVCSRLDLSHALSVVSRYMANPGKEHWKVVQWIFKYLR 1043
Query: 1419 GTADRGIMFSREQGVVPLVVGYVDSDYAGDLDDRRSTTGYVFTLAGGPICWKSSVQSIVA 1478
GT+ + F R + +VGYVDSD+AGDLD RRS TGYVFT+ G + WK+S+Q+ VA
Sbjct: 1044 GTSSACLQFGRSRDG---LVGYVDSDFAGDLDRRRSLTGYVFTIGGCAVSWKASLQATVA 1100
Query: 1479 MSTTEAEYMAVAEAVKEALWLTGLVKKL 1506
+STTE EYMA++EA KEA+WL GL +L
Sbjct: 1101 LSTTEVEYMAISEACKEAIWLRGLYTEL 1128
>gb|AAD23690.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301702|pir||E84601 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1333
Score = 281 bits (718), Expect = 2e-73
Identities = 175/466 (37%), Positives = 253/466 (53%), Gaps = 33/466 (7%)
Query: 82 LG*IGIMCMSKTLMNKLFAKQRLYSLKMQEGGDLQAHVYAFNNILADLTRLGVTVDDEDK 141
LG + + MSK L N+++ KQ+LYS KM E ++ ++ F I+ADL V V DED+
Sbjct: 100 LGVLDKLYMSKALPNRIYQKQKLYSFKMSENLSIEGNIDEFLRIIADLENTNVLVSDEDQ 159
Query: 142 TIILLCSLPGSYDHLVTTLTYGKDSITLD------SISSTLLQHAQRRRSVEEGGGSSGE 195
I+LL SLP +D L TL YG +TL +I S L+ ++S++ E
Sbjct: 160 AILLLMSLPKPFDQLRDTLKYGLGRVTLSLDEVVAAIYSKELELGSNKKSIK----GQAE 215
Query: 196 GLFVKGGQD-RGRGKGKAVDSGKKKRSKSKDRKTAECYSCKQIGHWKRDCPNRSGKSGNS 254
GLFVK + RGR + + ++ KK S+SK R C+ C G+S N
Sbjct: 216 GLFVKEKTETRGRTEQRGNNNNNKK-SRSKSRSKKGCWIC--------------GESSNG 260
Query: 255 SSAANVVQSDGSCSEEDLLCVSYVKCTDAWVLDSGCSCHMTPHREWFNSFKSCDFGYVYL 314
SS N +++G E L + D WV+D+GCS HMT REWF G V +
Sbjct: 261 SS--NYSEANGLYVSEALSSTD-IHLEDEWVMDTGCSYHMTYKREWFEDLNEDAGGSVRM 317
Query: 315 GDDKPCIIKGM*QVKIALDDGGVRTLSQVRYVPEVTKNLISLGTLHENGYSFKSEENRDI 374
G+ ++G+ +++ + G V L+ VRY+PE+ +NL+SLGT ++GYSFK E
Sbjct: 318 GNKTVSKVRGIGTIRVKNEAGMVVRLTNVRYIPEMDRNLLSLGTFEKSGYSFKLENG--T 375
Query: 375 LRVSKGAMTVMRAKRTAGNIYKLLGSTVMGDVASVETDDDATKLWHMRLGHLSERGMMEL 434
L + G ++ +R +Y L V + SV D T LWH RLGH+S++ M L
Sbjct: 376 LSIIAGDSVLLTVRR-CYTLYLLQWRPVTEESLSVVKRQDDTILWHRRLGHMSQKNMDLL 434
Query: 435 YKRNLLKGVRSCTIGLCKYCVLGKQCRVRFKTGQHKTKGILDYVHSDVWGPTKEP-SVGG 493
K+ LL + + C+ C+ GK R+ F QH T+ L+YVHSD+WG P S+G
Sbjct: 435 LKKGLLDKKKVSKLETCEDCIYGKAKRIGFNLAQHDTREKLEYVHSDLWGAPSVPFSLGK 494
Query: 494 FRYFVTFTDDFSRKVWVYFLKYKSEVFAKFKLWKAEVENQTGRKIK 539
+YF++F DD++RKV +YFLK K E F KF W VENQT ++IK
Sbjct: 495 CQYFISFIDDYTRKVRIYFLKTKDEAFDKFVEWANLVENQTDKRIK 540
Score = 207 bits (526), Expect = 3e-51
Identities = 102/217 (47%), Positives = 141/217 (64%), Gaps = 2/217 (0%)
Query: 1299 AAKKILGMEIHREKGAKKLWLSQNSYVEGVLSRFDMSKANPVSTPLANHFKLSLEQCPKT 1358
AAK+ILGMEI R + LWLSQN Y+ +L ++M+++ V TPL H K+ K
Sbjct: 1044 AAKRILGMEIIRNREENTLWLSQNGYLNKILETYNMAESKHVVTPLGAHLKMRAATVEKQ 1103
Query: 1359 ASEIEGMSKISHASAVGCLMYVMVCTRPDLAQAASQVCKFMSKPGKQHWEAVKWILRYLK 1418
+ + M I ++SAVG +MY M+ TRPDLA + ++MS+P ++HW VKW+LRY+K
Sbjct: 1104 EQDEDYMKSIPYSSAVGSIMYAMIGTRPDLAYPVGIISRYMSQPAREHWLGVKWVLRYIK 1163
Query: 1419 GTADRGIMFSREQGVVPLVVGYVDSDYAGDLDDRRSTTGYVFTLAGGPICWKSSVQSIVA 1478
G+ + + R VVGY D+D+A D RRS TG VFTL G I WKS Q +VA
Sbjct: 1164 GSLGTKLQYKRSSDF--KVVGYCDADHAACKDRRRSITGLVFTLGGSTISWKSGQQRVVA 1221
Query: 1479 MSTTEAEYMAVAEAVKEALWLTGLVKKLGVEQGGVQL 1515
+STTEAEYM++ EAVKEA+W+ GL+K+ G EQ V++
Sbjct: 1222 LSTTEAEYMSLTEAVKEAVWMKGLLKEFGYEQKSVEI 1258
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.341 0.149 0.500
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,433,258,540
Number of Sequences: 2540612
Number of extensions: 96411017
Number of successful extensions: 318015
Number of sequences better than 10.0: 1203
Number of HSP's better than 10.0 without gapping: 965
Number of HSP's successfully gapped in prelim test: 241
Number of HSP's that attempted gapping in prelim test: 312631
Number of HSP's gapped (non-prelim): 2844
length of query: 1582
length of database: 863,360,394
effective HSP length: 142
effective length of query: 1440
effective length of database: 502,593,490
effective search space: 723734625600
effective search space used: 723734625600
T: 11
A: 40
X1: 15 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (22.0 bits)
S2: 82 (36.2 bits)
Lotus: description of TM0279a.6