BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0192.15
(1070 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAP46257.1| putative polyprotein [Oryza sativa (japonica cult... 621 e-176
emb|CAB75469.1| copia-type reverse transcriptase-like protein [A... 541 e-152
gb|AAD50001.1| Hypothetical protein [Arabidopsis thaliana] gi|25... 538 e-151
gb|AAG60117.1| copia-type polyprotein, putative [Arabidopsis tha... 538 e-151
emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana] gi... 536 e-150
gb|AAG50765.1| copia-type polyprotein, putative [Arabidopsis tha... 529 e-148
gb|AAT38758.1| putative gag-pol polyprotein [Solanum demissum] 518 e-145
gb|AAP51797.1| putative copia-type polyprotein [Oryza sativa (ja... 498 e-139
gb|AAD17409.1| putative retroelement pol polyprotein [Arabidopsi... 496 e-138
dbj|BAB01972.1| copia-like retrotransposable element [Arabidopsi... 485 e-135
gb|AAT38797.1| putative polyprotein [Solanum demissum] 481 e-134
gb|AAF16534.1| T26F17.17 [Arabidopsis thaliana] 480 e-133
gb|AAG51247.1| copia-type polyprotein, putative; 28768-32772 [Ar... 460 e-127
dbj|BAB11200.1| copia-type polyprotein [Arabidopsis thaliana] gi... 460 e-127
gb|AAF25964.2| F6N18.1 [Arabidopsis thaliana] 460 e-127
emb|CAB75932.1| putative protein [Arabidopsis thaliana] gi|11278... 456 e-126
gb|AAD32906.1| putative retroelement pol polyprotein [Arabidopsi... 436 e-120
gb|AAT38786.1| putative gag-pol polyprotein [Solanum demissum] 432 e-119
ref|XP_470746.1| putative gag-pol polyprotein [Oryza sativa] gi|... 419 e-115
ref|XP_474043.1| OSJNBb0034I13.10 [Oryza sativa (japonica cultiv... 417 e-114
>gb|AAP46257.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|50919599|ref|XP_470160.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1335
Score = 621 bits (1602), Expect = e-176
Identities = 313/666 (46%), Positives = 429/666 (63%), Gaps = 36/666 (5%)
Query: 3 RRRESSRNFSKNSQDKNPPCSIC*RLGHAEADCRYRDKPQCNYCKKFGHMEKYCYSKNRH 62
++ + K + N C IC + H C K CN CK+ GH+ KYC ++ +
Sbjct: 240 QKEDGQERREKGTSSSNLWCDICQKSSHTTDMCW--KKMTCNKCKRKGHIAKYCRTREIN 297
Query: 63 QANLAEEQEQDQYLLYATQDSARETGGSWYLDSGCSNHMAKDESIFKNIDDSVKVKVRLG 122
+AN ++E+E+ + ++++ + E W +DSGC+NHMA D ++F+ +D S K+ +G
Sbjct: 298 RANFSQEKEKSEEMVFSCHTAQEEKDDVWVIDSGCTNHMAADPNLFREMDSSYHAKIHMG 357
Query: 123 NGAVVESKGKGTVMVET*KGTRLISDVLLVPNLKENLLSIGQMIERGYTLHFEGDVCRIY 182
NG++ +S+GKGTV V+T G + I DVLLVP+LK+NLLSIGQ++E GY ++FE C+I
Sbjct: 358 NGSIAQSEGKGTVAVQTADGPKFIKDVLLVPDLKQNLLSIGQLLEHGYAVYFEDFSCKIL 417
Query: 183 DKHDKRVEIAQVKMQKSNISFPLNFKYVANIAMKAQVDDSWLWHRRFGHFNTQVLKLLYQ 242
D+ + R+ +A++ M+K N +F L + +A++++VD S LWH+R GH N + LKLL
Sbjct: 418 DRKNNRL-VAKINMEK-NRNFLLRMNHTTQMALRSEVDISDLWHKRMGHLNYRALKLLRT 475
Query: 243 KNMMRDLPSLKESNEACERCLLGMQRRVSFSTCKEWRAKDVLELIHTDVCGPMRTSSHDN 302
K M++ LP + ++ CE C+ G Q R SF WRA LEL+H D+ G + T S
Sbjct: 476 KGMVQGLPFITLKSDPCEGCVFGKQIRASFPHSGAWRASAPLELVHADIVGKVPTISEGG 535
Query: 303 NRYFILFTDDFSRMTWVYFLKAKSEVFRVFKKFKALVEKQSGKHIKVLRSDRGKEYTSRE 362
N YFI F DD++RM WVYFLK KS +FKKFKA+VE QS + IKVLRSD+G+EY S+E
Sbjct: 536 NWYFITFIDDYTRMIWVYFLKEKSAALEIFKKFKAMVENQSNRKIKVLRSDQGREYISKE 595
Query: 363 FDKFCEDEGIERQLTVAYIPQQNGVSERKKRTIMEMARSMLKEKGMHNTFWWAEAVYTAV 422
F+K+CE+ GI RQLT Y QQNGV+ERK RTI +MA SML++KGM +F WAEAV TAV
Sbjct: 596 FEKYCENAGIRRQLTAGYSAQQNGVAERKNRTINDMANSMLQDKGMPKSF-WAEAVNTAV 654
Query: 423 YILNRCPTNAVQNKTPIEAWCGKKPSTKHLRVFGSICYTHVPDVKRHKLEDKNVRGIFLG 482
YILNR PT AV N+TP EAW GKKP H+RVFG ICY VP KR K ++K+ R IF+G
Sbjct: 655 YILNRSPTKAVTNRTPFEAWYGKKPVIGHMRVFGCICYAQVPAQKRVKFDNKSDRCIFVG 714
Query: 483 YSTKSKGYRVYNLQTKKLTISRDVEVDGNASWNWDEEKVEKNILIPT---------QRPQ 533
Y+ KGYR+YNL+ KK+ ISRD D +A+WNW + L+PT
Sbjct: 715 YADGIKGYRLYNLEKKKIIISRDAIFDESATWNWKSPEASSTPLLPTTTITLGQPHMHGT 774
Query: 534 EEVEEEAENPGEPTSPPQQQEQQQDLS---------PESTPRRVRFLVDV---------- 574
EVE+ +P +P+SP D S PES PRRVR +V++
Sbjct: 775 HEVEDHTPSP-QPSSPMSSSSASSDSSPSSEEQISTPESAPRRVRSMVELLESTSQQRGS 833
Query: 575 --YETCNLAILEPESFEAASKQEVWVKAMEEEIKMIEKNNTWELVDCPHGKDIIGVKWVY 632
+E CN +++EP+SF+ A K + W+KAME+EI MIEKNNTWELVD P +++IGVKWVY
Sbjct: 834 EQHEFCNYSVVEPQSFQEAEKHDNWIKAMEDEIHMIEKNNTWELVDRPRDREVIGVKWVY 893
Query: 633 KTKLKP 638
KTKL P
Sbjct: 894 KTKLNP 899
>emb|CAB75469.1| copia-type reverse transcriptase-like protein [Arabidopsis
thaliana] gi|11278363|pir||T49313 copia-type reverse
transcriptase-like protein - Arabidopsis thaliana
Length = 1272
Score = 541 bits (1394), Expect = e-152
Identities = 294/628 (46%), Positives = 391/628 (61%), Gaps = 28/628 (4%)
Query: 29 GHAEADCRY-RDKPQCNYCKKFGHMEKYCYS----KNRHQANLAEE--QEQDQYLLYATQ 81
GH ++ RY + +C C KFGH C + K + +AN EE QE+D L+ + +
Sbjct: 268 GHPKS--RYDKSSVKCYNCGKFGHYASECKAPSNKKFKEKANYVEEKIQEEDMLLMASYK 325
Query: 82 DSARETGGSWYLDSGCSNHMAKDESIFKNIDDSVKVKVRLGNGAVVESKGKGTVMVET*K 141
+E WYLDSG SNHM +S+F +D+SV+ V LG+ + +E KGKG +++
Sbjct: 326 KDEQEENHKWYLDSGASNHMCGRKSMFAELDESVRGNVALGDESKMEVKGKGNILIRLKN 385
Query: 142 GT-RLISDVLLVPNLKENLLSIGQMIERGYTLHFEGDVCRIYDKHDKRVEIAQVKMQKSN 200
G + IS+V +P++K N+LS+GQ++E+GY + + + I DK I +V M K+
Sbjct: 386 GDHQFISNVYYIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDKESNL--ITKVPMSKNR 443
Query: 201 ISFPLNFKY-VANIAMKAQVDDSWLWHRRFGHFNTQVLKLLYQKNMMRDLPSLKESNEAC 259
+ F LN + +A ++SWLWH RFGH N L+LL +K M+R LP + N+ C
Sbjct: 444 M-FVLNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLPCINHPNQVC 502
Query: 260 ERCLLGMQRRVSFSTCKEWRAKDVLELIHTDVCGPMRTSSHDNNRYFILFTDDFSRMTWV 319
E CLLG Q ++SF RA+ LELIHTDVCGP++ S + YF+LF DDFSR TWV
Sbjct: 503 EGCLLGNQFKMSFPKESSSRAQKPLELIHTDVCGPIKPKSLGKSNYFLLFIDDFSRKTWV 562
Query: 320 YFLKAKSEVFRVFKKFKALVEKQSGKHIKVLRSDRGKEYTSREFDKFCEDEGIERQLTVA 379
YFLK KSEVF +FKKFKA VEK+SG IK +RSD G E+TS+EF K+CED GI RQLTV
Sbjct: 563 YFLKEKSEVFEIFKKFKAHVEKESGLVIKTMRSDSGGEFTSKEFLKYCEDNGIRRQLTVP 622
Query: 380 YIPQQNGVSERKKRTIMEMARSMLKEKGMHNTFWWAEAVYTAVYILNRCPTNAVQNKTPI 439
PQQNGV+ERK RTI+EMARSMLK K + WAEAV AVY+LNR PT +V KTP
Sbjct: 623 RSPQQNGVAERKNRTILEMARSMLKSKRLPKEL-WAEAVACAVYLLNRSPTKSVSGKTPQ 681
Query: 440 EAWCGKKPSTKHLRVFGSICYTHVPDVKRHKLEDKNVRGIFLGYSTKSKGYRVYNLQTKK 499
EAW G+KP HLRVFGSI + HVPD KR+KL+DK+ + IF+GY SKGY++YN TKK
Sbjct: 682 EAWSGRKPGVSHLRVFGSIAHAHVPDEKRNKLDDKSEKYIFIGYDNNSKGYKLYNPDTKK 741
Query: 500 LTISRDVEVDGNASWNWDEEKVEKNILIPTQRPQEEVEEEAENPGEPTSPPQQ--QEQQQ 557
ISR++ D W+W+ + + N + + E E EPT+PP Q +
Sbjct: 742 TIISRNIVFDEEGEWDWNSNEEDYNFFPHFEEDKPEPTREEPPSEEPTTPPTSPTSSQIE 801
Query: 558 DLSPESTPRRVRFLVDVYET----------CNLAILEPESFEAASKQEVWVKAMEEEIKM 607
+ S E TP R R + ++YE C A EP F+ A +++ W AM+EEIK
Sbjct: 802 ESSSERTP-RFRSIQELYEVTENQENLTLFCLFAECEPMDFQEAIEKKTWRNAMDEEIKS 860
Query: 608 IEKNNTWELVDCPHGKDIIGVKWVYKTK 635
I+KN+TWEL P+G IGVKWVYK K
Sbjct: 861 IQKNDTWELTSLPNGHKAIGVKWVYKAK 888
>gb|AAD50001.1| Hypothetical protein [Arabidopsis thaliana] gi|25301681|pir||F86246
hypothetical protein [imported] - Arabidopsis thaliana
Length = 1352
Score = 538 bits (1387), Expect = e-151
Identities = 293/628 (46%), Positives = 389/628 (61%), Gaps = 28/628 (4%)
Query: 29 GHAEADCRY-RDKPQCNYCKKFGHMEKYCYS----KNRHQANLAEE--QEQDQYLLYATQ 81
GH ++ RY + +C C KFGH C + K +AN EE QE+D L+ + +
Sbjct: 268 GHPKS--RYDKSSVKCYNCGKFGHYASECKAPSNKKFEEKANYVEEKIQEEDMLLMASYK 325
Query: 82 DSARETGGSWYLDSGCSNHMAKDESIFKNIDDSVKVKVRLGNGAVVESKGKGTVMVET*K 141
++ WYLDSG SNHM +S+F +D+SV+ V LG+ + +E KGKG +++
Sbjct: 326 KDEQKENHKWYLDSGASNHMCGRKSMFAELDESVRGNVALGDESKMEVKGKGNILIRLKN 385
Query: 142 GT-RLISDVLLVPNLKENLLSIGQMIERGYTLHFEGDVCRIYDKHDKRVEIAQVKMQKSN 200
G + IS+V +P++K N+LS+GQ++E+GY + + + I D+ I +V M K+
Sbjct: 386 GDHQFISNVYYIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDQESNL--ITKVPMSKNR 443
Query: 201 ISFPLNFKY-VANIAMKAQVDDSWLWHRRFGHFNTQVLKLLYQKNMMRDLPSLKESNEAC 259
+ F LN + +A ++SWLWH RFGH N L+LL +K M+R LP + N+ C
Sbjct: 444 M-FVLNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLPCINHPNQVC 502
Query: 260 ERCLLGMQRRVSFSTCKEWRAKDVLELIHTDVCGPMRTSSHDNNRYFILFTDDFSRMTWV 319
E CLLG Q ++SF RA+ LELIHTDVCGP++ S + YF+LF DDFSR TWV
Sbjct: 503 EGCLLGKQFKMSFPKESSSRAQKPLELIHTDVCGPIKPKSLGKSNYFLLFIDDFSRKTWV 562
Query: 320 YFLKAKSEVFRVFKKFKALVEKQSGKHIKVLRSDRGKEYTSREFDKFCEDEGIERQLTVA 379
YFLK KSEVF +FKKFKA VEK+SG IK +RSDRG E+TS+EF K+CED GI RQLTV
Sbjct: 563 YFLKEKSEVFEIFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCEDNGIRRQLTVP 622
Query: 380 YIPQQNGVSERKKRTIMEMARSMLKEKGMHNTFWWAEAVYTAVYILNRCPTNAVQNKTPI 439
PQQNGV ERK RTI+EMARSMLK K + WAEAV AVY+LNR PT +V KTP
Sbjct: 623 RSPQQNGVVERKNRTILEMARSMLKSKRLPKEL-WAEAVACAVYLLNRSPTKSVSGKTPQ 681
Query: 440 EAWCGKKPSTKHLRVFGSICYTHVPDVKRHKLEDKNVRGIFLGYSTKSKGYRVYNLQTKK 499
EAW G+KP HLRVFGSI + HVPD KR KL+DK+ + IF+GY SKGY++YN TKK
Sbjct: 682 EAWSGRKPGVSHLRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKGYKLYNPDTKK 741
Query: 500 LTISRDVEVDGNASWNWDEEKVEKNILIPTQRPQEEVEEEAENPGEPTSPPQQ--QEQQQ 557
ISR++ D W+W+ + + N + + E E EPT+PP Q +
Sbjct: 742 TIISRNIVFDEEGEWDWNSNEEDYNFFPHFEEDEPEPTREEPPSEEPTTPPTSPTSSQIE 801
Query: 558 DLSPESTPRRVRFLVDVYET----------CNLAILEPESFEAASKQEVWVKAMEEEIKM 607
+ S E TP R R + ++YE C A EP F+ A +++ W AM+EEIK
Sbjct: 802 ESSSERTP-RFRSIQELYEVTENQENLTLFCLFAECEPMDFQKAIEKKTWRNAMDEEIKS 860
Query: 608 IEKNNTWELVDCPHGKDIIGVKWVYKTK 635
I+KN+TWEL P+G IGVKWVYK K
Sbjct: 861 IQKNDTWELTSLPNGHKAIGVKWVYKAK 888
>gb|AAG60117.1| copia-type polyprotein, putative [Arabidopsis thaliana]
Length = 1352
Score = 538 bits (1386), Expect = e-151
Identities = 293/628 (46%), Positives = 389/628 (61%), Gaps = 28/628 (4%)
Query: 29 GHAEADCRY-RDKPQCNYCKKFGHMEKYCYS----KNRHQANLAEE--QEQDQYLLYATQ 81
GH ++ RY + +C C KFGH C + K +AN EE QE+D L+ + +
Sbjct: 268 GHPKS--RYDKSSVKCYNCGKFGHYASECKAPSNKKFEEKANYVEEKIQEEDMLLMASYK 325
Query: 82 DSARETGGSWYLDSGCSNHMAKDESIFKNIDDSVKVKVRLGNGAVVESKGKGTVMVET*K 141
+E WYLDSG SNHM +S+F +D+SV+ V LG+ + +E KGKG +++
Sbjct: 326 KDEQEENHKWYLDSGASNHMCGRKSMFAELDESVRGNVALGDESKMEVKGKGNILIRLKN 385
Query: 142 GT-RLISDVLLVPNLKENLLSIGQMIERGYTLHFEGDVCRIYDKHDKRVEIAQVKMQKSN 200
G + IS+V +P++K N+LS+GQ++E+GY + + + I D+ I +V M K+
Sbjct: 386 GDHQFISNVYYIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDQESNL--ITKVPMSKNR 443
Query: 201 ISFPLNFKY-VANIAMKAQVDDSWLWHRRFGHFNTQVLKLLYQKNMMRDLPSLKESNEAC 259
+ F LN + +A ++SWLWH RFGH N L+LL +K M+R LP + N+ C
Sbjct: 444 M-FVLNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLPCINHPNQVC 502
Query: 260 ERCLLGMQRRVSFSTCKEWRAKDVLELIHTDVCGPMRTSSHDNNRYFILFTDDFSRMTWV 319
E CLLG Q ++SF RA+ LELIHTDVCGP++ S + YF+LF DDFSR TWV
Sbjct: 503 EGCLLGKQFKMSFPKESSSRAQKSLELIHTDVCGPIKPKSLGKSNYFLLFIDDFSRKTWV 562
Query: 320 YFLKAKSEVFRVFKKFKALVEKQSGKHIKVLRSDRGKEYTSREFDKFCEDEGIERQLTVA 379
YFLK KSEVF +FKKFKA VEK+SG IK +RSDRG E+TS+EF K+CED GI RQLTV
Sbjct: 563 YFLKEKSEVFEIFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCEDNGIRRQLTVP 622
Query: 380 YIPQQNGVSERKKRTIMEMARSMLKEKGMHNTFWWAEAVYTAVYILNRCPTNAVQNKTPI 439
PQQNGV+ERK RTI+EMARSMLK K + WAEAV AVY+LNR PT +V KTP
Sbjct: 623 RSPQQNGVAERKNRTILEMARSMLKSKRLPKEL-WAEAVACAVYLLNRSPTKSVSGKTPQ 681
Query: 440 EAWCGKKPSTKHLRVFGSICYTHVPDVKRHKLEDKNVRGIFLGYSTKSKGYRVYNLQTKK 499
EAW G+K HLRVFGSI + HVPD KR KL+DK+ + IF+GY SKGY++YN TKK
Sbjct: 682 EAWSGRKSGVSHLRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKGYKLYNPDTKK 741
Query: 500 LTISRDVEVDGNASWNWDEEKVEKNILIPTQRPQEEVEEEAENPGEPTSPPQQ--QEQQQ 557
ISR++ D W+W+ + + N + + E E EPT+PP Q +
Sbjct: 742 TIISRNIVFDEEGEWDWNSNEEDYNFFPHFEEDEPEPTREEPPSEEPTTPPTSPTSSQIE 801
Query: 558 DLSPESTPRRVRFLVDVYET----------CNLAILEPESFEAASKQEVWVKAMEEEIKM 607
+ S E TP R R + ++YE C A EP F+ A +++ W AM+EEIK
Sbjct: 802 ESSSERTP-RFRSIQELYEVTENQENLTLFCLFAECEPMDFQEAIEKKTWRNAMDEEIKS 860
Query: 608 IEKNNTWELVDCPHGKDIIGVKWVYKTK 635
I+KN+TWEL P+G IGVKWVYK K
Sbjct: 861 IQKNDTWELTSLPNGHKTIGVKWVYKAK 888
>emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana]
gi|11278364|pir||T47925 copia-type polyprotein -
Arabidopsis thaliana
Length = 1352
Score = 536 bits (1382), Expect = e-150
Identities = 292/628 (46%), Positives = 389/628 (61%), Gaps = 28/628 (4%)
Query: 29 GHAEADCRY-RDKPQCNYCKKFGHMEKYCYS----KNRHQANLAEE--QEQDQYLLYATQ 81
GH ++ RY + +C C KFGH C + K +A+ EE QE+D L+ + +
Sbjct: 268 GHPKS--RYDKSSVKCYNCGKFGHYASECKAPSNKKFEEKAHYVEEKIQEEDMLLMASYK 325
Query: 82 DSARETGGSWYLDSGCSNHMAKDESIFKNIDDSVKVKVRLGNGAVVESKGKGTVMVET*K 141
++ WYLDSG SNHM +S+F +D+SV+ V LG+ + +E KGKG +++
Sbjct: 326 KDEQKENHKWYLDSGASNHMCGRKSMFAELDESVRGNVALGDESKMEVKGKGNILIRLKN 385
Query: 142 GT-RLISDVLLVPNLKENLLSIGQMIERGYTLHFEGDVCRIYDKHDKRVEIAQVKMQKSN 200
G + IS+V +P++K N+LS+GQ++E+GY + + + I D+ I +V M K+
Sbjct: 386 GDHQFISNVYYIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDQESNL--ITKVPMSKNR 443
Query: 201 ISFPLNFKY-VANIAMKAQVDDSWLWHRRFGHFNTQVLKLLYQKNMMRDLPSLKESNEAC 259
+ F LN + +A ++SWLWH RFGH N L+LL +K M+R LP + N+ C
Sbjct: 444 M-FVLNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLPCINHPNQVC 502
Query: 260 ERCLLGMQRRVSFSTCKEWRAKDVLELIHTDVCGPMRTSSHDNNRYFILFTDDFSRMTWV 319
E CLLG Q ++SF RA+ LELIHTDVCGP++ S + YF+LF DDFSR TWV
Sbjct: 503 EGCLLGKQFKMSFPKESSSRAQKPLELIHTDVCGPIKPKSLGKSNYFLLFIDDFSRKTWV 562
Query: 320 YFLKAKSEVFRVFKKFKALVEKQSGKHIKVLRSDRGKEYTSREFDKFCEDEGIERQLTVA 379
YFLK KSEVF +FKKFKA VEK+SG IK +RSDRG E+TS+EF K+CED GI RQLTV
Sbjct: 563 YFLKEKSEVFEIFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCEDNGIRRQLTVP 622
Query: 380 YIPQQNGVSERKKRTIMEMARSMLKEKGMHNTFWWAEAVYTAVYILNRCPTNAVQNKTPI 439
PQQNGV ERK RTI+EMARSMLK K + WAEAV AVY+LNR PT +V KTP
Sbjct: 623 RSPQQNGVVERKNRTILEMARSMLKSKRLPKEL-WAEAVACAVYLLNRSPTKSVSGKTPQ 681
Query: 440 EAWCGKKPSTKHLRVFGSICYTHVPDVKRHKLEDKNVRGIFLGYSTKSKGYRVYNLQTKK 499
EAW G+KP HLRVFGSI + HVPD KR KL+DK+ + IF+GY SKGY++YN TKK
Sbjct: 682 EAWSGRKPGVSHLRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKGYKLYNPDTKK 741
Query: 500 LTISRDVEVDGNASWNWDEEKVEKNILIPTQRPQEEVEEEAENPGEPTSPPQQ--QEQQQ 557
ISR++ D W+W+ + + N + + E E EPT+PP Q +
Sbjct: 742 TIISRNIVFDEEGEWDWNSNEEDYNFFPHFEEDEPEPTREEPPSEEPTTPPTSPTSSQIE 801
Query: 558 DLSPESTPRRVRFLVDVYET----------CNLAILEPESFEAASKQEVWVKAMEEEIKM 607
+ S E TP R R + ++YE C A EP F+ A +++ W AM+EEIK
Sbjct: 802 ESSSERTP-RFRSIQELYEVTENQENLTLFCLFAECEPMDFQKAIEKKTWRNAMDEEIKS 860
Query: 608 IEKNNTWELVDCPHGKDIIGVKWVYKTK 635
I+KN+TWEL P+G IGVKWVYK K
Sbjct: 861 IQKNDTWELTSLPNGHKAIGVKWVYKAK 888
>gb|AAG50765.1| copia-type polyprotein, putative [Arabidopsis thaliana]
gi|12321254|gb|AAG50698.1| copia-type polyprotein,
putative [Arabidopsis thaliana] gi|25301687|pir||F96614
probable copia-type polyprotein T18I24.5 [imported] -
Arabidopsis thaliana
Length = 1320
Score = 529 bits (1363), Expect = e-148
Identities = 288/616 (46%), Positives = 381/616 (61%), Gaps = 36/616 (5%)
Query: 29 GHAEADCRY-RDKPQCNYCKKFGHMEKYCYS----KNRHQANLAEE--QEQDQYLLYATQ 81
GH ++ RY + +C C KFGH C + K +AN EE QE+D L+ + +
Sbjct: 268 GHPKS--RYDKSSVKCYNCGKFGHYASECKAPSNKKFEEKANYVEEKIQEEDMLLMASYK 325
Query: 82 DSARETGGSWYLDSGCSNHMAKDESIFKNIDDSVKVKVRLGNGAVVESKGKGTVMVET*K 141
+E WYLDSG SNHM +S+F +D+SV+ V LG+ + +E KGKG +++
Sbjct: 326 KDEQEENHKWYLDSGASNHMCGRKSMFAELDESVRGNVALGDESKMEVKGKGNILIRLKN 385
Query: 142 GT-RLISDVLLVPNLKENLLSIGQMIERGYTLHFEGDVCRIYDKHDKRVEIAQVKMQKSN 200
G + IS+V +P++K N+LS+GQ++E+GY + + + I D+ I +V M K+
Sbjct: 386 GDHQFISNVYYIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDQESNL--ITKVPMSKNR 443
Query: 201 ISFPLNFKY-VANIAMKAQVDDSWLWHRRFGHFNTQVLKLLYQKNMMRDLPSLKESNEAC 259
+ F LN + +A ++SWLWH RFGH N L+LL +K M+R LP + N+ C
Sbjct: 444 M-FVLNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLPCINHPNQVC 502
Query: 260 ERCLLGMQRRVSFSTCKEWRAKDVLELIHTDVCGPMRTSSHDNNRYFILFTDDFSRMTWV 319
E CLLG Q ++SF RA+ LELIHTDVCGP++ S + YF+LF DDFSR TWV
Sbjct: 503 EGCLLGKQFKMSFPKESSSRAQKPLELIHTDVCGPIKPKSLGKSNYFLLFIDDFSRKTWV 562
Query: 320 YFLKAKSEVFRVFKKFKALVEKQSGKHIKVLRSDRGKEYTSREFDKFCEDEGIERQLTVA 379
YFLK KSEVF +FKKFKA VEK+SG IK +RSDRG E+TS+EF K+CED GI RQLTV
Sbjct: 563 YFLKEKSEVFEIFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCEDNGIRRQLTVP 622
Query: 380 YIPQQNGVSERKKRTIMEMARSMLKEKGMHNTFWWAEAVYTAVYILNRCPTNAVQNKTPI 439
PQQNGV+ERK RTI+EMARSMLK K + WAEAV AVY+LNR PT +V KTP
Sbjct: 623 RSPQQNGVAERKNRTILEMARSMLKSKRLPKEL-WAEAVACAVYLLNRSPTKSVSGKTPQ 681
Query: 440 EAWCGKKPSTKHLRVFGSICYTHVPDVKRHKLEDKNVRGIFLGYSTKSKGYRVYNLQTKK 499
EAW G+KP HLRVFGSI + HVPD KR KL+DK+ + IF+GY SKGY++YN TKK
Sbjct: 682 EAWSGRKPGVSHLRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKGYKLYNPDTKK 741
Query: 500 LTISRDVEVDGNASWNWDEEKVEKNILIPTQRPQEEVEEEAENPGEPTSPPQQQEQQQDL 559
ISR++ D W+W+ + + N + + E E EPT+PP
Sbjct: 742 TIISRNIVFDEEGEWDWNSNEEDYNFFPHFEEDKPEPTREEPPSEEPTTPP--------T 793
Query: 560 SPESTPRRVRFLVDVYETCNLAILEPESFEAASKQEVWVKAMEEEIKMIEKNNTWELVDC 619
SP S+ + E C EP F+ A +++ W AM+EEIK I+KN+TWEL
Sbjct: 794 SPTSS--------QIEEKC-----EPMDFQEAIEKKTWRNAMDEEIKSIQKNDTWELTSL 840
Query: 620 PHGKDIIGVKWVYKTK 635
P+G IGVKWVYK K
Sbjct: 841 PNGHKAIGVKWVYKAK 856
>gb|AAT38758.1| putative gag-pol polyprotein [Solanum demissum]
Length = 1333
Score = 518 bits (1334), Expect = e-145
Identities = 282/629 (44%), Positives = 391/629 (61%), Gaps = 41/629 (6%)
Query: 36 RYRDKPQCNYCKKFGHMEKYCYSKNRHQ---ANLAEEQEQDQYLLYATQDSARETGGSWY 92
+Y+ QC YCKKFGH E C++K + + AN + E++ L A+ W+
Sbjct: 253 QYKSNIQCRYCKKFGHKEVDCWTKQKDEQKDANFTQNVEEESKLFMASSQITESANAVWF 312
Query: 93 LDSGCSNHMAKDESIFKNIDDSVKVKVRLGNGAVVESKGKGTVMVET*KGT-RLISDVLL 151
+DSGCSNHM+ +S+F+++D+S K +VRLG+ V +GKGTV ++T +G + + DV
Sbjct: 313 IDSGCSNHMSSSKSLFRDLDESQKSEVRLGDDKQVHIEGKGTVEIKTVQGNVKFLYDVQY 372
Query: 152 VPNLKENLLSIGQMIERGYTLHFEGDVCRIYDKHDKRVEIAQVKMQKSNISFPLNFKYVA 211
VP L NLLS+GQ++ GY++ F + C I DK R IA+V M ++ + FPL+ V
Sbjct: 373 VPTLAHNLLSVGQLMTSGYSVVFYDNACDIKDKESGRT-IARVPMTQNKM-FPLDISNVG 430
Query: 212 NIAMKA-QVDDSWLWHRRFGHFNTQVLKLLYQKNMMRDLPSLKESNEACERCLLGMQRRV 270
N A+ + +++ LWH R+GH N LKLL QK+M+ LP++KE + CE C+ G Q R
Sbjct: 431 NSALVVKEKNETNLWHLRYGHLNVNWLKLLVQKDMVIGLPNIKEL-DLCEGCIYGKQTRK 489
Query: 271 SFSTCKEWRAKDVLELIHTDVCGPMRTSSHDNNRYFILFTDDFSRMTWVYFLKAKSEVFR 330
SF K WRA LEL+H D+CGPM+ S +RYF++FTDD+SR +WVYFLK KSE F
Sbjct: 490 SFPVGKSWRATTCLELVHADLCGPMKMESLGGSRYFLMFTDDYSRFSWVYFLKFKSETFE 549
Query: 331 VFKKFKALVEKQSGKHIKVLRSDRGKEYTSREFDKFCEDEGIERQLTVAYIPQQNGVSER 390
FKKFKA VE QSG IK LR+DRG E+ S +F+ FCE+ GI R+LT Y P+QNGV+ER
Sbjct: 550 TFKKFKAFVENQSGNKIKSLRTDRGGEFLSNDFNLFCEENGIRRELTAPYTPEQNGVAER 609
Query: 391 KKRTIMEMARSMLKEKGMHNTFWWAEAVYTAVYILNRCPTNAVQNKTPIEAWCGKKPSTK 450
K RT++EMARS LK KG+ + ++W EAV T VY LN PT V N TP+EAW GKKP
Sbjct: 610 KNRTVVEMARSSLKAKGLPD-YFWGEAVATVVYFLNISPTKDVWNTTPLEAWNGKKPRVS 668
Query: 451 HLRVFGSICYTHVPDVKRHKLEDKNVRGIFLGYSTKSKGYRVYNLQTKKLTISRDVEVDG 510
HLR+FG I Y V KL++K+ + IF+GYS +SK YR+YN + K+ ISR+V +
Sbjct: 669 HLRIFGCIAYALVN--FHSKLDEKSTKCIFVGYSLQSKAYRLYNPISGKVIISRNVVFNE 726
Query: 511 NASWNWDEEKVEKNI-LIPTQRPQEEVEEEAENPG-EPTSPPQQQEQQQDLSPEST---- 564
+ SWN++ + NI L+PT EE A + G P S P ++P +T
Sbjct: 727 DVSWNFNSGNMMSNIQLLPTD------EESAVDFGNSPNSSPVSSSVSSPIAPSTTVAPD 780
Query: 565 -------PRR---------VRFLVDVYETCNLAIL--EPESFEAASKQEVWVKAMEEEIK 606
P R ++ V +C A+L +P +E A +Q W AM EEI+
Sbjct: 781 ESSVEPIPLRRSTREKKPNPKYSNTVNTSCQFALLVSDPICYEEAVEQSEWKNAMIEEIQ 840
Query: 607 MIEKNNTWELVDCPHGKDIIGVKWVYKTK 635
IE+N+TWELVD P GK++IG+KWV++TK
Sbjct: 841 AIERNSTWELVDAPEGKNVIGLKWVFRTK 869
>gb|AAP51797.1| putative copia-type polyprotein [Oryza sativa (japonica
cultivar-group)] gi|37530416|ref|NP_919510.1| putative
copia-type polyprotein [Oryza sativa (japonica
cultivar-group)] gi|18542917|gb|AAL75752.1| Putative
copia-type polyprotein [Oryza sativa]
Length = 1350
Score = 498 bits (1283), Expect = e-139
Identities = 260/600 (43%), Positives = 360/600 (59%), Gaps = 75/600 (12%)
Query: 68 EEQEQDQYLLYATQDSARETGGSWYLDSGCSNHMAKDESIFKNIDDSVKVKVRLGNGAVV 127
+++++ + ++++ + E W +DSGC+NHMA D ++F+ +D S K+ +GNG++
Sbjct: 269 KKRKKSEEMVFSCHTAQEEKDDVWVIDSGCTNHMAADPNLFREMDSSYHAKIHMGNGSIA 328
Query: 128 ESKGKGTVMVET*KGTRLISDVLLVPNLKENLLSIGQMIERGYTLHFEGDVCRIYDKHDK 187
+S+GK C+I D+ +
Sbjct: 329 QSEGKDFS-------------------------------------------CKILDRKNN 345
Query: 188 RVEIAQVKMQKSNISFPLNFKYVANIAMKAQVDDSWLWHRRFGHFNTQVLKLLYQKNMMR 247
R+ +A++ M+K N +F L + +A+++++D S LWH+R GH N + LKLL K M++
Sbjct: 346 RL-VAKINMEK-NRNFLLRMNHPTQMALRSEIDISDLWHKRMGHLNYRALKLLRTKGMVQ 403
Query: 248 DLPSLKESNEACERCLLGMQRRVSFSTCKEWRAKDVLELIHTDVCGPMRTSSHDNNRYFI 307
LP + ++ CE C+ G Q R SF WRA LEL+HTD+ G + T S N YFI
Sbjct: 404 GLPFITLKSDPCEGCVFGKQIRASFPHSGAWRASAPLELVHTDIVGKVPTISEGGNWYFI 463
Query: 308 LFTDDFSRMTWVYFLKAKSEVFRVFKKFKALVEKQSGKHIKVLRSDRGKEYTSREFDKFC 367
F DD++RM WVYFLK KS +FKKFKA+VE QS + IKVLRSD+G EY S+EF+K+C
Sbjct: 464 TFIDDYTRMIWVYFLKEKSAALEIFKKFKAMVENQSNRKIKVLRSDQGGEYISKEFEKYC 523
Query: 368 EDEGIERQLTVAYIPQQNGVSERKKRTIMEMARSMLKEKGMHNTFWWAEAVYTAVYILNR 427
E+ GI RQLT Y QQNGV+ERK RTI +MA SML++KGM +F WAEAV TA+YILNR
Sbjct: 524 ENAGIRRQLTAGYSAQQNGVAERKNRTINDMANSMLQDKGMPKSF-WAEAVNTAIYILNR 582
Query: 428 CPTNAVQNKTPIEAWCGKKPSTKHLRVFGSICYTHVPDVKRHKLEDKNVRGIFLGYSTKS 487
PT AV N+TP EAW GKKP H+RVFG ICY VP KR K ++K+ IF+GY+
Sbjct: 583 SPTKAVPNRTPFEAWYGKKPVIGHMRVFGCICYAQVPAQKRVKFDNKSDWCIFVGYADGI 642
Query: 488 KGYRVYNLQTKKLTISRDVEVDGNASWNWDEEKVEKNILIPT---------QRPQEEVEE 538
KGYR+YNL+ KK+ ISRDV D +A+WNW + L+PT VE+
Sbjct: 643 KGYRLYNLEKKKIIISRDVIFDESATWNWKSPEASSTPLLPTTTITLGQPHMHGTHGVED 702
Query: 539 EAENP--------GEPTSPPQQQEQQQDLSPESTPRRVRFLVDV------------YETC 578
+P +S ++Q +PES PRRVR +V++ +E C
Sbjct: 703 HTSSPQSSSPMSSSSASSDSSPSSEEQISTPESAPRRVRSMVELLESTSQQRGSEQHEFC 762
Query: 579 NLAILEPESFEAASKQEVWVKAMEEEIKMIEKNNTWELVDCPHGKDIIGVKWVYKTKLKP 638
N +++EP+SF+ A K + W+KAME+EI MIEKNNTWELVD P + +IGVKWVYKTKL P
Sbjct: 763 NYSVVEPQSFQEAEKHDNWIKAMEDEIHMIEKNNTWELVDRPRDRKVIGVKWVYKTKLNP 822
>gb|AAD17409.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301699|pir||F84531 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1347
Score = 496 bits (1276), Expect = e-138
Identities = 268/671 (39%), Positives = 390/671 (57%), Gaps = 62/671 (9%)
Query: 30 HAEADCRYRDKP-----------QCNYCKKFGHMEKYCYSKNRHQAN--LAEEQEQDQYL 76
H E +CR + K +C C K GH C SKN+ +A+ L EE + ++
Sbjct: 250 HTEEECREKPKNDDHGKNKRSNIKCYKCGKIGHYANECRSKNKERAHVTLEEEDVNEDHM 309
Query: 77 LYATQDSARET--GGSWYLDSGCSNHMAKDESIFKNIDDSVKVKVRLGNGAVVESKGKGT 134
L++ + T W +DSGC+NHM K+E F NI+ S+KV +R+ NG +V + GKG
Sbjct: 310 LFSASEEESTTLREDVWLVDSGCTNHMTKEERYFSNINKSIKVPIRVRNGDIVMTAGKGD 369
Query: 135 VMVET*KGTRLISDVLLVPNLKENLLSIGQMIERGYTLHFEGDVCRIYDKHDKRVEIAQV 194
+ V T G R+I +V LVP L++NLLS+ Q+I GY + F+ C I D + K + +
Sbjct: 370 ITVMTRHGKRIIKNVFLVPGLEKNLLSVPQIISSGYWVRFQDKRCIIQDANGKEI----M 425
Query: 195 KMQKSNISFPLNFKYVANIAMKAQVDDSWLWHRRFGHFNTQVLKLLYQKNMMRDLPSLKE 254
++ ++ SF + V AM A V WH+R GH + + L+ + K ++ LP K
Sbjct: 426 NIEMTDKSFKIKLSSVEEEAMTANVQTEETWHKRLGHVSNKRLQQMQDKELVNGLPRFKV 485
Query: 255 SNEACERCLLGMQRRVSFSTCKEWRAKDVLELIHTDVCGPMRTSSHDNNRYFILFTDDFS 314
+ E C+ C LG Q R SF + + ++ LE++HTDVCGPM+ S D +RY++LF DD++
Sbjct: 486 TKETCKACNLGKQSRKSFPKESQTKTREKLEIVHTDVCGPMQHQSIDGSRYYVLFLDDYT 545
Query: 315 RMTWVYFLKAKSEVFRVFKKFKALVEKQSGKHIKVLRSDRGKEYTSREFDKFCEDEGIER 374
M WVYFLK KSE F FKKFKALVEKQS IK L R + FCEDEGI R
Sbjct: 546 HMCWVYFLKQKSETFATFKKFKALVEKQSNCSIKTL----------RPMEVFCEDEGINR 595
Query: 375 QLTVAYIPQQNGVSERKKRTIMEMARSMLKEKGMHNTFWWAEAVYTAVYILNRCPTNAVQ 434
Q+T+ Y PQQNG +ERK R+++EMARSML E+ + WAEAVYT+ Y+ NR P+ A++
Sbjct: 596 QVTLPYSPQQNGAAERKNRSLVEMARSMLVEQDLPLKL-WAEAVYTSAYLQNRLPSKAIE 654
Query: 435 NK-TPIEAWCGKKPSTKHLRVFGSICYTHVPDVKRHKLEDKNVRGIFLGYSTKSKGYRVY 493
+ TP+E WCG KP+ HLR+FGSICY H+PD KR KL+ K GI +GYS ++KGYRV+
Sbjct: 655 DDVTPMEKWCGHKPNVSHLRIFGSICYVHIPDQKRRKLDAKAKCGILIGYSNQTKGYRVF 714
Query: 494 NLQTKKLTISRDVEVDGNASWNWD-EEKVEKNIL-----IPTQRPQEE--------VEEE 539
L+ +K+ +SRDV + W+WD +E+V+K + I R Q+E +++
Sbjct: 715 LLEDEKVEVSRDVVFQEDKKWDWDKQEEVKKTFVMSINDIQESRDQQETSSHDLSQIDDH 774
Query: 540 AENPGEPTSPP--QQQEQQQDLSPESTPRRVRFLVDV---------------YETCNLAI 582
A N TS Q Q++ +P++ + + ++ E C +A
Sbjct: 775 ANNGEGETSSHVLSQVNDQEERETSESPKKYKSMKEILEKAPRMENDEAAQGIEACLVAN 834
Query: 583 LEPESFEAASKQEVWVKAMEEEIKMIEKNNTWELVDCPHGKDIIGVKWVYKTKLKP*WHY 642
EP++++ A + W +AM EEIK+IEKN TW+LVD P K++I VKW+YK K ++
Sbjct: 835 EEPQTYDEARGDKEWEEAMNEEIKVIEKNRTWKLVDKPEKKNVISVKWIYKIKTDASGNH 894
Query: 643 TKAQGEASSEG 653
K + + G
Sbjct: 895 VKHKARLVARG 905
>dbj|BAB01972.1| copia-like retrotransposable element [Arabidopsis thaliana]
Length = 1499
Score = 485 bits (1248), Expect = e-135
Identities = 280/695 (40%), Positives = 398/695 (56%), Gaps = 64/695 (9%)
Query: 4 RRESSRNFSKNSQDKNPPCSIC*RLGHAEADCRYRDKPQ------------CNYCKKFGH 51
R E+ +N ++ + N C +C R H E DC +R K + C C K GH
Sbjct: 219 RGENKQNKIRHGKT-NMWCGVCKRNNHNEVDC-FRKKSESISQRGGSYERRCYVCDKQGH 276
Query: 52 MEKYCYSKNRHQANLAEEQEQDQ------YLLYATQDSARETGG--SWYLDSGCSNHMAK 103
+ + C + +A+L+ E+ +D+ L A ++ T G +W +DSGC+NHM+K
Sbjct: 277 IARDCKLRKGERAHLSIEESEDEKEDECHMLFSAVEEKEISTIGEETWLVDSGCTNHMSK 336
Query: 104 DESIFKNIDDSVKVKVRLGNGAVVESKGKGTVMVET*KGTRLISDVLLVPNLKENLLSIG 163
D F +D S K+ +R+GNG V S+GKG + V T KG +I DVL VP L NLLS+
Sbjct: 337 DVRHFIALDRSKKIIIRIGNGGKVVSEGKGDIRVSTNKGDHVIKDVLYVPELARNLLSVS 396
Query: 164 QMIERGYTLHFEGDVCRIYDKHDKRVEIAQVKMQKSNISFPLNFKYVAN---IAMKAQVD 220
QMI GY + FE + C I D ++ I +KM+ SFP+ +K +A + + +
Sbjct: 397 QMISNGYRVIFEDNKCVIQDLKGRK--ILDIKMKDR--SFPIIWKKSREETYMAFEEKEE 452
Query: 221 DSWLWHRRFGHFNTQVLKLLYQKNMMRDLPSLKESNEACERCLLGMQRRVSFSTCKEWRA 280
+ LWH+RFGH N ++ + ++ LP + C C +G Q R SF +
Sbjct: 453 QTDLWHKRFGHVNYDKIETMQTLKIVEKLPKFEVIKGICAACEMGKQSRRSFPKKSQSNT 512
Query: 281 KDVLELIHTDVCGPMRTSSHDNNRYFILFTDDFSRMTWVYFLKAKSEVFRVFKKFKALVE 340
LELIH+DVCGPM+T S + +RYF+ F DDFSRMTWVYFLK KSEV FK FK VE
Sbjct: 513 NKTLELIHSDVCGPMQTESINGSRYFLTFIDDFSRMTWVYFLKNKSEVITKFKIFKPYVE 572
Query: 341 KQSGKHIKVLRSDRGKEYTSREFDKFCEDEGIERQLTVAYIPQQNGVSERKKRTIMEMAR 400
QS IK LR+D G E+ SREF K C++ GI ++T Y PQQNGV+ER+ RT++EMAR
Sbjct: 573 NQSESRIKRLRTDGGGEFLSREFIKLCQESGIHHEITTPYSPQQNGVAERRNRTLVEMAR 632
Query: 401 SMLKEKGMHNTFWWAEAVYTAVYILNRCPTNAVQ-NKTPIEAWCGKKPSTKHLRVFGSIC 459
SM++EK + N F WAEA+ T+ Y+ NR P+ +++ TP+E W GKKPS HL+VFG +C
Sbjct: 633 SMIEEKKLSNKF-WAEAIATSTYLQNRLPSKSLEKGVTPMEIWSGKKPSVDHLKVFGCVC 691
Query: 460 YTHVPDVKRHKLEDKNVRGIFLGYSTKSKGYRVYNLQTKKLTISRDVEVDGNASWNWDEE 519
Y H+PD KR KL+ K +GIF+GYS +SKGYRV+ L +K+ +S+DV D +W+ DE+
Sbjct: 692 YIHIPDEKRRKLDTKAKQGIFVGYSNESKGYRVFLLNEEKIEVSKDVTFDEKKTWSHDEK 751
Query: 520 KVEKNILIPTQRPQEE------------VEEEAENPGEPTSPPQQ---QEQQQDLSPEST 564
K IL + +E A N +S Q +E ++ + P
Sbjct: 752 GERKAILSLVKINSQEQGGGNDLNAHIDQVSNAFNQLHISSRGVQNSHEEGEESVGPRGF 811
Query: 565 PRRVRFLVD----------VYETCNLAILEPESFEAASKQEVWVKAMEEEIKMIEKNNTW 614
R + L+D ++E C + EP++ E A K E W++AM EE++MIEKN TW
Sbjct: 812 -RSINNLMDQTNEVEGEALIHEMCLMMAEEPQALEEAMKDEKWIEAMREELRMIEKNKTW 870
Query: 615 ELVDCPHGKDIIGVKWVYKTKLKP*WHYTKAQGEA 649
E+V P K++I VKW+++ K T A GEA
Sbjct: 871 EVVARPKDKNVISVKWIFRLK-------TDASGEA 898
>gb|AAT38797.1| putative polyprotein [Solanum demissum]
Length = 1758
Score = 481 bits (1238), Expect = e-134
Identities = 259/618 (41%), Positives = 363/618 (57%), Gaps = 65/618 (10%)
Query: 10 NFSKNSQDKNPPCSIC*RLGHAEADCRYRDKPQCNYCKKFGHMEKYCYSKNRHQANL--- 66
N S + + K PPC C R H E C +R C CK+ GH+ K C S+ +L
Sbjct: 214 NNSGDVKKKFPPCKYCKRTTHLEKYCWWRVDAICGNCKQTGHISKVCKSRANASGSLQAQ 273
Query: 67 ---AEEQEQDQYLLYATQDSARETGGSWYLDSGCSNHMAKDESIFKNIDDSVKVKVRLGN 123
A + +DQ L + S E+ SW LDSGC++H+ D +FK +DD+ K KV++GN
Sbjct: 274 VADAADAHEDQ-LFAVSYFSINESSDSWILDSGCTHHLCNDAEMFKFLDDTYKSKVKVGN 332
Query: 124 GAVVESKGKGTVMVET*KGTRLISDVLLVPNLKENLLSIGQMIERGYTLHFEGDVCRIYD 183
G VE KG+GT+ + G + I D+L P++ +NLLS+GQM+E Y+LHF+ C + D
Sbjct: 333 GEAVEVKGRGTMSISIISGIKTIPDILYTPDMSQNLLSVGQMLENNYSLHFKNHECVVSD 392
Query: 184 KHDKRVEIAQVKMQKSNISFPLNFKYVANIAMKAQVDDSW-LWHRRFGHFNTQVLKLLYQ 242
VE+ VKM SNI F ++++ + A + LWH+RFGHFN + + + +
Sbjct: 393 PSG--VELFYVKM--SNIMFSVDWEKITEQAYTITLQTCTNLWHKRFGHFNLRSIAEMKK 448
Query: 243 KNMMRDLPSLKESNEACERCLLGMQRRVSFSTCKEWRAKDVLELIHTDVCGPMRTSSHDN 302
K ++ ++P + + CE C G Q ++ F + WRA L+LIHTDVCGP++T S
Sbjct: 449 KELVENMPEFLSNAQVCETCQQGKQTKLPFQANQVWRANQKLQLIHTDVCGPIKTDSLSG 508
Query: 303 NRYFILFTDDFSRMTWVYFLKAKSEVFRVFKKFKALVEKQSGKHIKVLRSDRGKEYTSRE 362
N+YF+LF DD++RM WVYF++ KSEVF VFK+FKALVE Q IK LRSD G E+TS +
Sbjct: 509 NKYFLLFIDDYTRMCWVYFIRLKSEVFDVFKQFKALVENQCNLRIKALRSDNGGEHTSFQ 568
Query: 363 FDKFCEDEGIERQLTVAYIPQQNGVSERKKRTIMEMARSMLKEKGMHNTFWWAEAVYTAV 422
F +FC IE QLT+ Y PQQNGVSERK RT+MEMAR +L E+ + N F AEA+ T+V
Sbjct: 569 FVEFCNSTCIECQLTLPYTPQQNGVSERKNRTVMEMARCLLLERKIPNQF-LAEAINTSV 627
Query: 423 YILNRCPTNAVQNKTPIEAWCGKKPSTKHLRVFGSICYTHVPDVKRHKLEDKNVRGIFLG 482
Y+LNR PT A+Q+ TP EAWCG KPS HLR+FG CY VP KR KL++K +GIF+G
Sbjct: 628 YLLNRLPTKALQDMTPYEAWCGNKPSVHHLRIFGCKCYYRVPKTKRTKLDNKAHKGIFMG 687
Query: 483 YSTKSKGYRVYNLQTKKLTISRDVEVDGNASWNWDEEKVEKNILIPTQRPQEEVEEEAEN 542
YS+ SKGY+++ L+++KL +SR+V+ D A W+W +K + L ++PQ +E
Sbjct: 688 YSS-SKGYKIFCLRSEKLILSREVKFDEAAGWDWKNQKTSYSDLFSKEQPQLSEDE---- 742
Query: 543 PGEPTSPPQQQEQQQDLSPESTPRRVRFLVDVYETCNLAILEPESFEAASKQEVWVKAME 602
LVD W +AM+
Sbjct: 743 ----------------------------LVD-------------------DVPAWRRAMQ 755
Query: 603 EEIKMIEKNNTWELVDCP 620
+E+ +I+KN TW+LVD P
Sbjct: 756 DELDVIKKNGTWQLVDRP 773
Score = 50.4 bits (119), Expect = 3e-04
Identities = 23/56 (41%), Positives = 30/56 (53%)
Query: 582 ILEPESFEAASKQEVWVKAMEEEIKMIEKNNTWELVDCPHGKDIIGVKWVYKTKLK 637
I P +F ASK W AM+ E + KN+TWELV ++I KW+YK K
Sbjct: 1307 IFTPSTFNQASKHIEWQNAMQAEFDALRKNHTWELVPPDPSNNVIACKWLYKIMRK 1362
>gb|AAF16534.1| T26F17.17 [Arabidopsis thaliana]
Length = 1291
Score = 480 bits (1235), Expect = e-133
Identities = 266/593 (44%), Positives = 357/593 (59%), Gaps = 44/593 (7%)
Query: 59 KNRHQANLAEE--QEQDQYLLYATQDSARETGGSWYLDSGCSNHMAKDESIFKNIDDSVK 116
K +AN EE QE+D L+ + + +E WYLDSG SNHM +S+F +D+SV+
Sbjct: 263 KFEEKANYVEEKIQEEDMLLMASYKKDEQEENHKWYLDSGASNHMCGRKSMFAELDESVR 322
Query: 117 VKVRLGNGAVVESKGKGTVMVET*KGT-RLISDVLLVPNLKENLLSIGQMIERGYTLHFE 175
V LG+ + +E KGKG +++ G + IS+V +P++K N+LS+GQ++E+GY + +
Sbjct: 323 GNVALGDESKMEVKGKGNILIRLKNGDHQFISNVYYIPSMKTNILSLGQLLEKGYDIRLK 382
Query: 176 GDVCRIYDKHDKRVEIAQVKMQKSNISFPLNFKY-VANIAMKAQVDDSWLWHRRFGHFNT 234
+ I D+ I +V M K+ + F LN + +A ++SWLWH RFGH N
Sbjct: 383 DNNLSIRDQESNL--ITKVPMSKNRM-FVLNIRNDIAQCLKMCYKEESWLWHLRFGHLNF 439
Query: 235 QVLKLLYQKNMMRDLPSLKESNEACERCLLGMQRRVSFSTCKEWRAKDVLELIHTDVCGP 294
L+LL +K M+R LP + N+ CE CLLG Q ++SF RA+ LELIHTDVCGP
Sbjct: 440 GGLELLSRKEMVRGLPCINHPNQVCEGCLLGKQFKMSFPKESSSRAQKPLELIHTDVCGP 499
Query: 295 MRTSSHDNNRYFILFTDDFSRMTWVYFLKAKSEVFRVFKKFKALVEKQSGKHIKVLRSDR 354
++ S + KSEVF++FKKFKA VEK+SG IK +RSDR
Sbjct: 500 IKPKSLE-----------------------KSEVFKIFKKFKAHVEKESGLVIKTMRSDR 536
Query: 355 GKEYTSREFDKFCEDEGIERQLTVAYIPQQNGVSERKKRTIMEMARSMLKEKGMHNTFWW 414
G E+TS+EF K+CED GI RQLTV PQQNGV+ERK RTI+EMARSMLK K + W
Sbjct: 537 GGEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVAERKNRTILEMARSMLKSKRLPKELW- 595
Query: 415 AEAVYTAVYILNRCPTNAVQNKTPIEAWCGKKPSTKHLRVFGSICYTHVPDVKRHKLEDK 474
AEAV AVY+LNR PT +V KTP EAW G+KP HLRVFGSI + HVPD KR KL+DK
Sbjct: 596 AEAVACAVYLLNRSPTKSVSGKTPQEAWSGRKPGVSHLRVFGSIAHAHVPDEKRSKLDDK 655
Query: 475 NVRGIFLGYSTKSKGYRVYNLQTKKLTISRDVEVDGNASWNWDEEKVEKNILIPTQRPQE 534
+ + IF+GY SKGY++YN TKK ISR++ D W+W+ + + N + +
Sbjct: 656 SEKYIFIGYDNNSKGYKLYNPDTKKTIISRNIVFDEEGEWDWNSNEEDYNFFPHFEEDEP 715
Query: 535 EVEEEAENPGEPTSPPQQ--QEQQQDLSPESTPRRVRFLVDVYET----------CNLAI 582
E E EPT+ P Q ++ S E TP R R + ++YE C A
Sbjct: 716 EPTREEPPSEEPTTRPTSLTSSQIEESSSERTP-RFRSIQELYEVTENQENLTLFCLFAE 774
Query: 583 LEPESFEAASKQEVWVKAMEEEIKMIEKNNTWELVDCPHGKDIIGVKWVYKTK 635
EP F+ A +++ W AM+EEIK I+KN+TWEL P+G IGVKWVYK K
Sbjct: 775 CEPMDFQEAIEKKTWRNAMDEEIKSIQKNDTWELTSLPNGHKAIGVKWVYKAK 827
>gb|AAG51247.1| copia-type polyprotein, putative; 28768-32772 [Arabidopsis
thaliana] gi|25301683|pir||E86451 probable copia-type
polyprotein, 28768-32772 [imported] - Arabidopsis
thaliana
Length = 1334
Score = 460 bits (1184), Expect = e-127
Identities = 259/640 (40%), Positives = 359/640 (55%), Gaps = 51/640 (7%)
Query: 38 RDKPQCNYCKKFGHMEKYCYSKNRHQANLAEEQEQDQYLLYATQDSARETGGSWYLDSGC 97
RD +C C K GH + C S + +AN E E+D L+ + E W+LDSGC
Sbjct: 246 RDTVECFKCHKMGHYKAECPSWEK-EANYVE-MEEDLLLMAHVEQIGDEEKQIWFLDSGC 303
Query: 98 SNHMAKDESIFKNIDDSVKVKVRLGNGAVVESKGKGTVMVET*KGTRLISDVLLVPNLKE 157
SNHM F +D K VRLG+ + +GKG + +E ++ISDV VP LK
Sbjct: 304 SNHMCGTREWFLELDSGFKQNVRLGDDRRMAVEGKGKLRLEVDGRIQVISDVYFVPGLKN 363
Query: 158 NLLSIGQMIERGYTLHFEGDVCRIYDKHDKRVEIAQVKMQKSNISFPLNFKYVANIAMKA 217
NL S+GQ+ ++G EGDVC ++ K +KR+ + M K+ + F A +
Sbjct: 364 NLFSVGQLQQKGLRFIIEGDVCEVWHKTEKRM-VMHSTMTKNRM-----FVVFAAVKKSK 417
Query: 218 QVDDSW----------LWHRRFGHFNTQVLKLLYQKNMMRDLPS--LKESNEACERCLLG 265
+ +++ +WH+RFGH N Q L+ L +K M++ LP L E C+ CL G
Sbjct: 418 ETEETRCLQVIGKANNMWHKRFGHLNHQGLRSLAEKEMVKGLPKFDLGEEEAVCDICLKG 477
Query: 266 MQRRVSFSTCKEWRAKDVLELIHTDVCGPMRTSSHDNNRYFILFTDDFSRMTWVYFLKAK 325
Q R S W++ VL+L+HTD+CGP+ +S RY + F DDFSR W Y L K
Sbjct: 478 KQIRESIPKESAWKSTQVLQLVHTDICGPINPASTSGKRYILNFIDDFSRKCWTYLLSEK 537
Query: 326 SEVFRVFKKFKALVEKQSGKHIKVLRSDRGKEYTSREFDKFCEDEGIERQLTVAYIPQQN 385
SE F+ FK+FKA VE++SGK + LRSDRG EY SREFD++C++ GI+RQLT AY PQQN
Sbjct: 538 SETFQFFKEFKAEVERESGKKLVCLRSDRGGEYNSREFDEYCKEFGIKRQLTAAYTPQQN 597
Query: 386 GVSERKKRTIMEMARSMLKEKGMHNTFWWAEAVYTAVYILNRCPTNAVQNKTPIEAWCGK 445
GV+ERK R++M M R ML E + F W EAV AVYILNR P+ A+ + TP E W
Sbjct: 598 GVAERKNRSVMNMTRCMLMEMSVPRKF-WPEAVQYAVYILNRSPSKALNDITPEEKWSSW 656
Query: 446 KPSTKHLRVFGSICYTHVPDVKRHKLEDKNVRGIFLGYSTKSKGYRVYNLQTKKLTISRD 505
KPS +HLR+FGS+ Y VP KR KL++K+++ + G S +SK YR+Y+ T K+ ISRD
Sbjct: 657 KPSVEHLRIFGSLAYALVPYQKRIKLDEKSIKCVMFGVSKESKAYRLYDPATGKILISRD 716
Query: 506 VEVDGNASWNWDEEKVEKNILIPT-----------------QRPQEEVEEEAENPGEPT- 547
V+ D W W+++ +E+ ++ Q+ QEE EEE E E
Sbjct: 717 VQFDEERGWEWEDKSLEEELVWDNSDHEPAGEEGPEINHNGQQDQEETEEEEETVAETVH 776
Query: 548 -------SPPQQQEQQQDLSPESTPRRVRFLVDVYETCNLAIL-----EPESFEAASKQE 595
+ +Q QQ + R L+ E + L +P FE A++ E
Sbjct: 777 QNLPAVGTGGVRQRQQPVWMKDYVVGNARVLITQDEEDEVLALFIGPDDPVCFEEAAQLE 836
Query: 596 VWVKAMEEEIKMIEKNNTWELVDCPHGKDIIGVKWVYKTK 635
VW KAME EI IE+NNTWELV+ P +IG+KW++KTK
Sbjct: 837 VWRKAMEAEITSIEENNTWELVELPEEAKVIGLKWIFKTK 876
>dbj|BAB11200.1| copia-type polyprotein [Arabidopsis thaliana]
gi|13872710|emb|CAC37622.1| polyprotein [Arabidopsis
thaliana]
Length = 1334
Score = 460 bits (1184), Expect = e-127
Identities = 259/640 (40%), Positives = 359/640 (55%), Gaps = 51/640 (7%)
Query: 38 RDKPQCNYCKKFGHMEKYCYSKNRHQANLAEEQEQDQYLLYATQDSARETGGSWYLDSGC 97
RD +C C K GH + C S + +AN E E+D L+ + E W+LDSGC
Sbjct: 246 RDTVECFKCHKMGHYKAECPSWEK-EANYVE-MEEDLLLMAHVEQIGDEEKQIWFLDSGC 303
Query: 98 SNHMAKDESIFKNIDDSVKVKVRLGNGAVVESKGKGTVMVET*KGTRLISDVLLVPNLKE 157
SNHM F +D K VRLG+ + +GKG + +E ++ISDV VP LK
Sbjct: 304 SNHMCGTREWFLELDSGFKQNVRLGDDRRMAVEGKGKLRLEVDGRIQVISDVYFVPGLKN 363
Query: 158 NLLSIGQMIERGYTLHFEGDVCRIYDKHDKRVEIAQVKMQKSNISFPLNFKYVANIAMKA 217
NL S+GQ+ ++G EGDVC ++ K +KR+ + M K+ + F A +
Sbjct: 364 NLFSVGQLQQKGLRFIIEGDVCEVWHKTEKRM-VMHSTMTKNRM-----FVVFAAVKKSK 417
Query: 218 QVDDSW----------LWHRRFGHFNTQVLKLLYQKNMMRDLPS--LKESNEACERCLLG 265
+ +++ +WH+RFGH N Q L+ L +K M++ LP L E C+ CL G
Sbjct: 418 ETEETRCLQVIGKANNMWHKRFGHLNHQGLRSLAEKEMVKGLPKFDLGEEEAVCDICLKG 477
Query: 266 MQRRVSFSTCKEWRAKDVLELIHTDVCGPMRTSSHDNNRYFILFTDDFSRMTWVYFLKAK 325
Q R S W++ VL+L+HTD+CGP+ +S RY + F DDFSR W Y L K
Sbjct: 478 KQIRESIPKESAWKSTQVLQLVHTDICGPINPASTSGKRYILNFIDDFSRKCWTYLLSEK 537
Query: 326 SEVFRVFKKFKALVEKQSGKHIKVLRSDRGKEYTSREFDKFCEDEGIERQLTVAYIPQQN 385
SE F+ FK+FKA VE++SGK + LRSDRG EY SREFD++C++ GI+RQLT AY PQQN
Sbjct: 538 SETFQFFKEFKAEVERESGKKLVCLRSDRGGEYNSREFDEYCKEFGIKRQLTAAYTPQQN 597
Query: 386 GVSERKKRTIMEMARSMLKEKGMHNTFWWAEAVYTAVYILNRCPTNAVQNKTPIEAWCGK 445
GV+ERK R++M M R ML E + F W EAV AVYILNR P+ A+ + TP E W
Sbjct: 598 GVAERKNRSVMNMTRCMLMEMSVPRKF-WPEAVQYAVYILNRSPSKALNDITPEEKWSSW 656
Query: 446 KPSTKHLRVFGSICYTHVPDVKRHKLEDKNVRGIFLGYSTKSKGYRVYNLQTKKLTISRD 505
KPS +HLR+FGS+ Y VP KR KL++K+++ + G S +SK YR+Y+ T K+ ISRD
Sbjct: 657 KPSVEHLRIFGSLAYALVPYQKRIKLDEKSIKCVMFGVSKESKAYRLYDPATGKILISRD 716
Query: 506 VEVDGNASWNWDEEKVEKNILIPT-----------------QRPQEEVEEEAENPGEPT- 547
V+ D W W+++ +E+ ++ Q+ QEE EEE E E
Sbjct: 717 VQFDEERGWEWEDKSLEEELVWDNSDHEPAGEEGPEINHNGQQDQEETEEEEETVAETVH 776
Query: 548 -------SPPQQQEQQQDLSPESTPRRVRFLVDVYETCNLAIL-----EPESFEAASKQE 595
+ +Q QQ + R L+ E + L +P FE A++ E
Sbjct: 777 QNLPAVGTGGVRQRQQPVWMKDYVVGNARVLITQDEEDEVLALFIGPGDPVCFEEAAQLE 836
Query: 596 VWVKAMEEEIKMIEKNNTWELVDCPHGKDIIGVKWVYKTK 635
VW KAME EI IE+NNTWELV+ P +IG+KW++KTK
Sbjct: 837 VWRKAMEAEITSIEENNTWELVELPEEAKVIGLKWIFKTK 876
>gb|AAF25964.2| F6N18.1 [Arabidopsis thaliana]
Length = 1207
Score = 460 bits (1184), Expect = e-127
Identities = 259/640 (40%), Positives = 359/640 (55%), Gaps = 51/640 (7%)
Query: 38 RDKPQCNYCKKFGHMEKYCYSKNRHQANLAEEQEQDQYLLYATQDSARETGGSWYLDSGC 97
RD +C C K GH + C S + +AN E E+D L+ + E W+LDSGC
Sbjct: 151 RDTVECFKCHKMGHYKAECPSWEK-EANYVE-MEEDLLLMAHVEQIGDEEKQIWFLDSGC 208
Query: 98 SNHMAKDESIFKNIDDSVKVKVRLGNGAVVESKGKGTVMVET*KGTRLISDVLLVPNLKE 157
SNHM F +D K VRLG+ + +GKG + +E ++ISDV VP LK
Sbjct: 209 SNHMCGTREWFLELDSGFKQNVRLGDDRRMAVEGKGKLRLEVDGRIQVISDVYFVPGLKN 268
Query: 158 NLLSIGQMIERGYTLHFEGDVCRIYDKHDKRVEIAQVKMQKSNISFPLNFKYVANIAMKA 217
NL S+GQ+ ++G EGDVC ++ K +KR+ + M K+ + F A +
Sbjct: 269 NLFSVGQLQQKGLRFIIEGDVCEVWHKTEKRM-VMHSTMTKNRM-----FVVFAAVKKSK 322
Query: 218 QVDDSW----------LWHRRFGHFNTQVLKLLYQKNMMRDLPS--LKESNEACERCLLG 265
+ +++ +WH+RFGH N Q L+ L +K M++ LP L E C+ CL G
Sbjct: 323 ETEETRCLQVIGKANNMWHKRFGHLNHQGLRSLAEKEMVKGLPKFDLGEEEAVCDICLKG 382
Query: 266 MQRRVSFSTCKEWRAKDVLELIHTDVCGPMRTSSHDNNRYFILFTDDFSRMTWVYFLKAK 325
Q R S W++ VL+L+HTD+CGP+ +S RY + F DDFSR W Y L K
Sbjct: 383 KQIRESIPKESAWKSTQVLQLVHTDICGPINPASTSGKRYILNFIDDFSRKCWTYLLSEK 442
Query: 326 SEVFRVFKKFKALVEKQSGKHIKVLRSDRGKEYTSREFDKFCEDEGIERQLTVAYIPQQN 385
SE F+ FK+FKA VE++SGK + LRSDRG EY SREFD++C++ GI+RQLT AY PQQN
Sbjct: 443 SETFQFFKEFKAEVERESGKKLVCLRSDRGGEYNSREFDEYCKEFGIKRQLTAAYTPQQN 502
Query: 386 GVSERKKRTIMEMARSMLKEKGMHNTFWWAEAVYTAVYILNRCPTNAVQNKTPIEAWCGK 445
GV+ERK R++M M R ML E + F W EAV AVYILNR P+ A+ + TP E W
Sbjct: 503 GVAERKNRSVMNMTRCMLMEMSVPRKF-WPEAVQYAVYILNRSPSKALNDITPEEKWSSW 561
Query: 446 KPSTKHLRVFGSICYTHVPDVKRHKLEDKNVRGIFLGYSTKSKGYRVYNLQTKKLTISRD 505
KPS +HLR+FGS+ Y VP KR KL++K+++ + G S +SK YR+Y+ T K+ ISRD
Sbjct: 562 KPSVEHLRIFGSLAYALVPYQKRIKLDEKSIKCVMFGVSKESKAYRLYDPATGKILISRD 621
Query: 506 VEVDGNASWNWDEEKVEKNILIPT-----------------QRPQEEVEEEAENPGEPT- 547
V+ D W W+++ +E+ ++ Q+ QEE EEE E E
Sbjct: 622 VQFDEERGWEWEDKSLEEELVWDNSDHEPAGEEGPEINHNGQQDQEETEEEEETVAETVH 681
Query: 548 -------SPPQQQEQQQDLSPESTPRRVRFLVDVYETCNLAIL-----EPESFEAASKQE 595
+ +Q QQ + R L+ E + L +P FE A++ E
Sbjct: 682 QNLPAVGTGGVRQRQQPVWMKDYVVGNARVLITQDEEDEVLALFIGPDDPVCFEEAAQLE 741
Query: 596 VWVKAMEEEIKMIEKNNTWELVDCPHGKDIIGVKWVYKTK 635
VW KAME EI IE+NNTWELV+ P +IG+KW++KTK
Sbjct: 742 VWRKAMEAEITSIEENNTWELVELPEEAKVIGLKWIFKTK 781
>emb|CAB75932.1| putative protein [Arabidopsis thaliana] gi|11278365|pir||T47841
hypothetical protein T2O9.150 - Arabidopsis thaliana
Length = 1339
Score = 456 bits (1173), Expect = e-126
Identities = 263/633 (41%), Positives = 354/633 (55%), Gaps = 43/633 (6%)
Query: 42 QCNYCKKFGHMEKYC--YSKNRHQANLAEEQEQDQYLLYATQDSARETGGSWYLDSGCSN 99
+C C GH + C + KN + A L EE+E+ + Y Q+ A W+LDSGCSN
Sbjct: 251 ECYKCHNLGHFQYECPEWEKNANYAEL-EEEEELLLMAYVEQNQANRDE-VWFLDSGCSN 308
Query: 100 HMAKDESIFKNIDDSVKVKVRLGNGAVVESKGKGTVMVET*KGTRLISDVLLVPNLKENL 159
HM + F +++ V+LGN + GKG+V V+ T++I +V VP L+ NL
Sbjct: 309 HMTGSKEWFSELEEGFNRTVKLGNDTRMSVVGKGSVKVKVNGVTQVIPEVYYVPELRNNL 368
Query: 160 LSIGQMIERGYTLHFEGDVCRIYDKHDKRVEIAQVKMQKSNISFPLNFKYVAN---IAMK 216
LS+GQ+ ERG + C++Y H + I + M + + F L K N + +
Sbjct: 369 LSLGQLQERGLAILIRDGTCKVY--HPSKGAIMETNMSGNRMFFLLASKPQKNSLCLQTE 426
Query: 217 AQVD-DSWLWHRRFGHFNTQVLKLLYQKNMMRDLPSLKESNEACERCLLGMQRRVSFSTC 275
+D ++ LWH RFGH N + LKLL K M+ LP LK + E C CL G Q R S S
Sbjct: 427 EVMDKENHLWHCRFGHLNQEGLKLLAHKKMVIGLPILKATKEICAICLTGKQHRESMSKK 486
Query: 276 KEWRAKDVLELIHTDVCGPMRTSSHDNNRYFILFTDDFSRMTWVYFLKAKSEVFRVFKKF 335
W++ L+L+H+D+CGP+ SH RY + F DDF+R TWVYFL KSE F FK F
Sbjct: 487 TSWKSSTQLQLVHSDICGPITPISHSGKRYILSFIDDFTRKTWVYFLHEKSEAFATFKIF 546
Query: 336 KALVEKQSGKHIKVLRSDRGKEYTSREFDKFCEDEGIERQLTVAYIPQQNGVSERKKRTI 395
KA VEK+ G + LR+DRG E+TS EF +FC GI RQLT A+ PQQNGV+ERK RTI
Sbjct: 547 KASVEKEIGAFLTCLRTDRGGEFTSNEFGEFCRSHGISRQLTAAFTPQQNGVAERKNRTI 606
Query: 396 MEMARSMLKEKGMHNTFWWAEAVYTAVYILNRCPTNAVQNKTPIEAWCGKKPSTKHLRVF 455
M RSML E+ + FW +EA +V+I NR PT AV+ TP EAW G+KP ++ RVF
Sbjct: 607 MNAVRSMLSERQVPKMFW-SEATKWSVHIQNRSPTAAVEGMTPEEAWSGRKPVVEYFRVF 665
Query: 456 GSICYTHVPDVKRHKLEDKNVRGIFLGYSTKSKGYRVYNLQTKKLTISRDVEVDGNASWN 515
G I Y H+PD KR KL+DK+ + +FLG S +SK +R+Y+ KK+ IS+DV D + SW+
Sbjct: 666 GCIGYVHIPDQKRSKLDDKSKKCVFLGVSEESKAWRLYDPVMKKIVISKDVVFDEDKSWD 725
Query: 516 WDEEKVE-KNILIPTQRPQEEVEEEAENPGEPTSPPQQQEQQQ--------DLSPESTP- 565
WD+ VE K + + +E E P SP SP +P
Sbjct: 726 WDQADVEAKEVTLECGDEDDEKNSEVVEPIAVASPNHVGSDNNVSSSPILAPSSPAPSPV 785
Query: 566 -------RRVRFLVDVYETC-------NLAIL--------EPESFEAASKQEVWVKAMEE 603
RR + YET NL+++ +P F+ A K ++W +AME
Sbjct: 786 AAKVTRERRPPGWMADYETGEGEEIEENLSVMLLMMMTEADPIQFDDAVKDKIWREAMEH 845
Query: 604 EIKMIEKNNTWELVDCPHGKDIIGVKWVYKTKL 636
EI+ I KNNTWEL P G IGVKWVYKTKL
Sbjct: 846 EIESIVKNNTWELTTLPKGFTPIGVKWVYKTKL 878
>gb|AAD32906.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25411795|pir||G84552 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 822
Score = 436 bits (1121), Expect = e-120
Identities = 235/500 (47%), Positives = 315/500 (63%), Gaps = 20/500 (4%)
Query: 69 EQEQDQYLLYATQDSARETGGS-WYLDSGCSNHMAKDESIFKNIDDSVKVKVRLGNGAVV 127
E+ + + A Q+ +GG+ W +DSGC+NHM +E +F I+ KV +R+GNGAV+
Sbjct: 7 EEVKKNLVFIARQEIKEPSGGNTWLIDSGCTNHMTPNEKLFTKINRDFKVPIRVGNGAVM 66
Query: 128 ESKGKGTVMVET*KGTRLISDVLLVPNLKENLLSIGQMIERGYTLHFEGDVCRIYDKHDK 187
S+GKG + V T K R I DVLLVP L +NLLS+ QMI GY + + + C I+D K
Sbjct: 67 MSEGKGDIEVMTRKDKRGIRDVLLVPKLGKNLLSVPQMIINGYQVTLKNNYCTIHDSARK 126
Query: 188 RVEIAQVKMQKSNISFPLNFKYVANIAMKAQVDDSWLWHRRFGHFNTQVLKLLYQKNMMR 247
+ I +V+M N SF L + AM A+ + + LWH+R GH LK+L K M+
Sbjct: 127 K--IGEVEMV--NKSFHLRWLSNEETAMVAKDEATELWHKRLGHTGHSNLKILQSKEMVT 182
Query: 248 DLPSLKESNEACERCLLGMQRRVSFSTCKEWRAKDVLELIHTDVCGPMRTSSHDNNRYFI 307
LP CE C+L R F E RAK LELIH+DVCGPM+ SS + +RY +
Sbjct: 183 GLPKFNVEEGKCESCILSKHSRDPFPKESETRAKHKLELIHSDVCGPMQNSSINGSRYIL 242
Query: 308 LFTDDFSRMTWVYFLKAKSEVFRVFKKFKALVEKQSGKHIKVLRSDRGKEYTSREFDKFC 367
F DD +RM WVYFLKAKSEVF+ FKKFK LVE + IK LR DRG EY S+EF +F
Sbjct: 243 TFIDDATRMVWVYFLKAKSEVFQTFKKFKNLVENNANCRIKKLRIDRGTEYLSKEFSEFL 302
Query: 368 EDEGIERQLTVAYIPQQNGVSERKKRTIMEMARSMLKEKGMHNTFWWAEAVYTAVYILNR 427
E GIERQLT AY PQQN VSER+ R+++EMAR+M+K K + WAEAV+ A Y NR
Sbjct: 303 EGNGIERQLTAAYSPQQNEVSERRNRSLVEMARAMIKAKDLPLKL-WAEAVHVAAYAQNR 361
Query: 428 CPTNAVQNKTPIEAWCGKKPSTKHLRVFGSICYTHVPDVKRHKLEDKNVRGIFLGYSTKS 487
PT ++NKTP+EAW KPS H++VFGSICY H+PD KR K +DK+ R IF+GYS+++
Sbjct: 362 TPTRTLKNKTPLEAWSDSKPSVSHMKVFGSICYVHIPDEKRRKWDDKSKRAIFVGYSSQT 421
Query: 488 KGYRVYNLQTKKLTISRDVEVDGNASWNWDEEKVEKNILIPTQRPQEEVEEEAENPGEPT 547
KGYRVY L+ K+ ISRDV D ++ W+W++++V K+ ++ E E+ G+
Sbjct: 422 KGYRVYLLKENKIDISRDVIFDEDSKWDWEKKEVIKHY---------DMSREPEDRGD-- 470
Query: 548 SPPQQQEQQQDLSPESTPRR 567
QQ ++Q E+ RR
Sbjct: 471 ---QQADEQNSRDNEARGRR 487
>gb|AAT38786.1| putative gag-pol polyprotein [Solanum demissum]
Length = 1133
Score = 432 bits (1110), Expect = e-119
Identities = 217/478 (45%), Positives = 306/478 (63%), Gaps = 12/478 (2%)
Query: 4 RRESSRNFSKNSQDKNPPCSIC*RLGHAEADCRYRDKPQCNYCKKFGHMEKYCYSKNRH- 62
++E S N + + K PPC C + H + C YR QC CK+FGH++K C
Sbjct: 216 KKEQSNNRGRYRRSKYPPCPYCKKTNHTDKFCWYRPGVQCKLCKQFGHVDKVCKINQNQP 275
Query: 63 -QANLAEEQEQDQYLLYATQ-----DSARETGGSWYLDSGCSNHMAKDESIFKNIDDSVK 116
QA + E E + L+A + A + W +DSGC++HM + +IF+ +D
Sbjct: 276 AQAQVTENVEDPEEKLFAASIAGECNVAAQDQDVWLVDSGCTHHMTANLNIFEWLDKKYF 335
Query: 117 VKVRLGNGAVVESKGKGTVMVET*KGTRLISDVLLVPNLKENLLSIGQMIERGYTLHFEG 176
KVRLG+G +V++ GKG V V+T G ++IS+VL VP + ++LLS+GQ++++ Y L F+
Sbjct: 336 SKVRLGDGRLVDAAGKGAVAVQTPSGMKIISNVLFVPEISQSLLSVGQLLDKNYALLFKD 395
Query: 177 DVCRIYDKHDKRVEIAQVKMQKSNISFPLNFKYVANIAMKAQVDDSWLWHRRFGHFNTQV 236
C I D +++ VKM +N SFPL++K+ A + D++++WH+R G N +
Sbjct: 396 KTCEIMDPTG--IKLLSVKM--NNRSFPLDWKHTDIGAYVSIQDETYVWHKRLGQINFKS 451
Query: 237 LKLLYQKNMMRDLPSLKESNEACERCLLGMQRRVSFSTCKEWRAKDVLELIHTDVCGPMR 296
LKL+ K+++ D+PS+ E++ C C +G + F + WRA + L+LIHTDVCGPM
Sbjct: 452 LKLMQNKDLVADMPSINETSNVCGVCQIGKLSQSPFPINQAWRATEKLQLIHTDVCGPMS 511
Query: 297 TSSHDNNRYFILFTDDFSRMTWVYFLKAKSEVFRVFKKFKALVEKQSGKHIKVLRSDRGK 356
T S++ ++YF+LF +D +R WVYFLK KSEVF F++FKA VE Q G IK+LRSD G
Sbjct: 512 TPSYNGSKYFLLFINDLTRFCWVYFLKHKSEVFVAFQRFKATVENQCGSLIKILRSDNGT 571
Query: 357 EYTSREFDKFCEDEGIERQLTVAYIPQQNGVSERKKRTIMEMARSMLKEKGMHNTFWWAE 416
E+TS +F F + GI QLTV Y PQQNGVSERK R+IM MAR +L EKG+ WAE
Sbjct: 572 EFTSNQFKDFLQKAGIHHQLTVTYTPQQNGVSERKNRSIMNMARCLLFEKGLPKVL-WAE 630
Query: 417 AVYTAVYILNRCPTNAVQNKTPIEAWCGKKPSTKHLRVFGSICYTHVPDVKRHKLEDK 474
AV TAVY+ NR PT AV+ KTP EAW G KPS HL+VFG ICY+++PDVKR KL+ +
Sbjct: 631 AVNTAVYLQNRLPTRAVEGKTPYEAWIGTKPSVSHLKVFGCICYSYIPDVKRDKLDQR 688
>ref|XP_470746.1| putative gag-pol polyprotein [Oryza sativa]
gi|18071369|gb|AAL58228.1| putative gag-pol polyprotein
[Oryza sativa]
Length = 1167
Score = 419 bits (1076), Expect = e-115
Identities = 217/423 (51%), Positives = 275/423 (64%), Gaps = 32/423 (7%)
Query: 237 LKLLYQKNMMRDLPSLKESNEACERCLLGMQRRVSFSTCKEWRAKDVLELIHTDVCGPMR 296
LKLL K M++ LP + ++ CE C+ G Q R SF + WRA LEL+HTD+ G +
Sbjct: 344 LKLLRTKGMVQGLPFITLKSDPCEGCVFGKQIRASFPHSRAWRASAPLELVHTDIVGKVP 403
Query: 297 TSSHDNNRYFILFTDDFSRMTWVYFLKAKSEVFRVFKKFKALVEKQSGKHIKVLRSDRGK 356
T S N YFI F DD++RM WVYFLK KS +FKKFKA+VE QS + IKVLRSD+G
Sbjct: 404 TISEGGNWYFITFIDDYTRMIWVYFLKEKSAALEIFKKFKAMVENQSNRKIKVLRSDQGG 463
Query: 357 EYTSREFDKFCEDEGIERQLTVAYIPQQNGVSERKKRTIMEMARSMLKEKGMHNTFWWAE 416
EY S+EF+K+CE+ GI RQLT Y QQNGV+ERK RTI +MA SML++KGM +F WAE
Sbjct: 464 EYISKEFEKYCENAGIRRQLTAGYSAQQNGVAERKNRTINDMANSMLQDKGMPKSF-WAE 522
Query: 417 AVYTAVYILNRCPTNAVQNKTPIEAWCGKKPSTKHLRVFGSICYTHVPDVKRHKLEDKNV 476
AV TA+YILNR PT AV N+TP EAW GKKP H+RVFG ICY VP KR K ++K+
Sbjct: 523 AVNTAIYILNRSPTKAVPNRTPFEAWYGKKPVIGHMRVFGCICYAQVPAQKRVKFDNKSD 582
Query: 477 RGIFLGYSTKSKGYRVYNLQTKKLTISRDVEVDGNASWNWDEEKVEKNILIPT------- 529
R IF+GY+ KGYR+YNL+ KK+ ISRDV D +A+WNW K L+PT
Sbjct: 583 RCIFVGYADGIKGYRLYNLEKKKIIISRDVIFDESATWNWKSPKASSTPLLPTTTITLGQ 642
Query: 530 --QRPQEEVEEEAENPGEPTSPPQQQEQQQDLS---------PESTPRRVRFLVDV---- 574
EVE+ +P +P+SP D S PES PRRVR +V++
Sbjct: 643 PHMHGTHEVEDHTPSP-QPSSPMSSSSASSDSSPSSEEQISTPESAPRRVRSMVELLEST 701
Query: 575 --------YETCNLAILEPESFEAASKQEVWVKAMEEEIKMIEKNNTWELVDCPHGKDII 626
+E CN +++EP+SF+ A K + W+KAME+EI MIEKNNTWELVD P +++I
Sbjct: 702 SQQRGSEQHEFCNYSVVEPQSFQEAEKHDNWIKAMEDEIHMIEKNNTWELVDRPRDREVI 761
Query: 627 GVK 629
GVK
Sbjct: 762 GVK 764
Score = 103 bits (258), Expect = 2e-20
Identities = 50/120 (41%), Positives = 74/120 (61%), Gaps = 14/120 (11%)
Query: 50 GHMEKYCYSKNRHQANLAEEQEQDQYLLYATQDSARETGGSWYLDSGCSNHMAKDESIFK 109
G+ +K +S+ + E+E E W +DSGC+NHMA D ++F+
Sbjct: 237 GYFQKNGFSRQKEDGQERREKE--------------EKDDVWVIDSGCTNHMAADPNLFR 282
Query: 110 NIDDSVKVKVRLGNGAVVESKGKGTVMVET*KGTRLISDVLLVPNLKENLLSIGQMIERG 169
+D S K+ +GNG++ +S+GKGTV V+T G + I DVLLVP+LK+NLLSIGQ++E G
Sbjct: 283 EMDSSYHAKIHMGNGSIAQSEGKGTVAVQTADGPKFIKDVLLVPDLKQNLLSIGQLLEHG 342
>ref|XP_474043.1| OSJNBb0034I13.10 [Oryza sativa (japonica cultivar-group)]
gi|21741247|emb|CAD41731.1| OSJNBb0034I13.10 [Oryza
sativa (japonica cultivar-group)]
Length = 1425
Score = 417 bits (1072), Expect = e-114
Identities = 241/675 (35%), Positives = 349/675 (51%), Gaps = 84/675 (12%)
Query: 42 QCNYCKKFGHMEKYCYSKNRH---QANLAEEQEQDQYLLYA------------------- 79
+C C +FGH + C R +ANL + E++ LL A
Sbjct: 299 KCFNCDEFGHYARQCRKPRRQRRGEANLVQAAEEEPTLLMAHVVGVSLAGEATLGRTPSG 358
Query: 80 --------------TQDSARETGGSWYLDSGCSNHMAKDESIFKNIDDSVKVKVRLGNGA 125
E G W+LD+G +NHM S F +D V V+ G+G+
Sbjct: 359 QEVHLTEKKVILDHEDGGEEEVTGDWFLDTGATNHMTGVRSAFAELDTGVVGTVKFGDGS 418
Query: 126 VVESKGKGTVMVET*KGT-RLISDVLLVPNLKENLLSIGQMIERGYTLHFEGDVCRIYDK 184
V+E +G+GTV+ G R + V +P L++N++S+G++ RGY H G VC + D
Sbjct: 419 VIEIQGRGTVVFRCKNGDHRSLDAVYYIPKLRKNIISVGRLDARGYDAHIWGGVCTLRDP 478
Query: 185 HDKRVEIAQVKMQKSNISFPLNFKYVANIAMKAQVDDS-WLWHRRFGHFNTQVLKLLYQK 243
+ + +A+VK + N + L + M A D+ W WH RFGH N Q L+ L Q
Sbjct: 479 NG--LLLAKVK-RDINYLYILKLHIANPVCMAASGGDTAWRWHARFGHLNFQSLRRLAQG 535
Query: 244 NMMRDLPSLKESNEACERCLLGMQRRVSFSTCKEWRAKDVLELIHTDVCGPMRTSSHDNN 303
NM+R LP++ +++ C+ CL G QRR+ F ++RA++ LEL+H D+CGP+ ++
Sbjct: 536 NMVRGLPTIDHTDQLCDGCLAGKQRRLPFPEEAKFRAQEALELVHGDLCGPITPATPGGR 595
Query: 304 RYFILFTDDFSRMTWVYFLKAKSEVFRVFKKFKALVEKQSGKHIKVLRSDRGKEYTSREF 363
+YF+L DD SR W+ L K E K+F+A VE +SG+ ++ LR+DRG E+TS EF
Sbjct: 596 KYFLLLVDDMSRHMWIRLLSGKHEAATAIKQFQAGVELESGRKLRALRTDRGGEFTSVEF 655
Query: 364 DKFCEDEGIERQLTVAYIPQQNGVSERKKRTIMEMARSMLKEKGMHNTFWWAEAVYTAVY 423
+C D G+ R+LT Y PQQN V ER+ +T++ ARSMLK G+ F W EAV AVY
Sbjct: 656 MDYCTDRGMRRELTAPYSPQQNRVVERRNQTVVAAARSMLKAAGLPARF-WGEAVVAAVY 714
Query: 424 ILNRCPTNAVQNKTPIEAWCGKKPSTKHLRVFGSICYTHVPDVKRHKLEDKNVRGIFLGY 483
+LNR PT A+ TP EAW G++PS +HLRVFG + Y KL+D+ R +F+GY
Sbjct: 715 VLNRSPTKALDGVTPYEAWHGRRPSVEHLRVFGCVGYVKTVKPNLRKLDDRGTRMVFIGY 774
Query: 484 STKSKGYRVYNLQTKKLTISRDVEVDGNASWNW--------DEEKVEKNILIPTQRPQEE 535
SK YR+Y+ +++ +SRDV D A+W W +EE+ + + P
Sbjct: 775 EQGSKAYRMYDPVAQRVCVSRDVVFDETATWAWRDPEDAATEEEEFTVDFFVSPVAP--S 832
Query: 536 VEEEAENPGEP-------------TSPPQQQEQQQDLSPEST-------PRRVRFLVDVY 575
V + E G P +SPP+ + P S P R R + D+
Sbjct: 833 VADAGEQTGTPVQAGVSPVSTGVLSSPPRAPNGEFCTPPTSVTPETDGGPVRYRRVQDIL 892
Query: 576 ET------------CNLAILEPESFEAASKQEVWVKAMEEEIKMIEKNNTWELVDCPHGK 623
T C +A EP SF A K E W +AM EE++ +E+N TW L + P G
Sbjct: 893 STTEPVLDFDYSDQCLIATEEPTSFVEAEKHECWRRAMVEELRSVEENQTWSLAELPAGH 952
Query: 624 DIIGVKWVYKTKLKP 638
IG+KWVYK K P
Sbjct: 953 KAIGLKWVYKLKKDP 967
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.331 0.140 0.448
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,691,407,706
Number of Sequences: 2540612
Number of extensions: 68611813
Number of successful extensions: 280895
Number of sequences better than 10.0: 2591
Number of HSP's better than 10.0 without gapping: 1578
Number of HSP's successfully gapped in prelim test: 1015
Number of HSP's that attempted gapping in prelim test: 274276
Number of HSP's gapped (non-prelim): 4629
length of query: 1070
length of database: 863,360,394
effective HSP length: 139
effective length of query: 931
effective length of database: 510,215,326
effective search space: 475010468506
effective search space used: 475010468506
T: 11
A: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.9 bits)
S2: 81 (35.8 bits)
Lotus: description of TM0192.15