
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC144727.8 + phase: 0 /pseudo
(770 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAU89779.1| gag-pol polyprotein-like [Solanum tuberosum] 801 0.0
emb|CAC95126.1| gag-pol polyprotein [Populus deltoides] 615 e-174
gb|AAP53905.1| putative pol polyprotein [Oryza sativa (japonica ... 610 e-173
gb|AAT40550.1| putative receptor kinase [Solanum demissum] 577 e-163
gb|AAU89728.1| putative retroelement pol polyprotein-like [Solan... 563 e-159
gb|AAB61111.1| Strong similarity to Zea mays retrotransposon Hop... 552 e-155
pir||F86470 probable retroelement polyprotein [imported] - Arabi... 551 e-155
gb|AAG50751.1| polyprotein, putative [Arabidopsis thaliana] gi|2... 551 e-155
gb|AAD26943.1| putative retroelement pol polyprotein [Arabidopsi... 544 e-153
emb|CAB10526.1| retrotransposon like protein [Arabidopsis thalia... 538 e-151
pir||G86301 probable retroelement polyprotein [imported] - Arabi... 538 e-151
gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 proteas... 524 e-147
gb|AAP51971.1| putative copia-type polyprotein [Oryza sativa (ja... 509 e-142
gb|AAT38758.1| putative gag-pol polyprotein [Solanum demissum] 498 e-139
gb|AAG50765.1| copia-type polyprotein, putative [Arabidopsis tha... 483 e-134
gb|AAG60117.1| copia-type polyprotein, putative [Arabidopsis tha... 482 e-134
gb|AAP46257.1| putative polyprotein [Oryza sativa (japonica cult... 480 e-134
gb|AAD50001.1| Hypothetical protein [Arabidopsis thaliana] gi|25... 479 e-133
emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana] gi... 479 e-133
gb|AAP54850.1| putative gag-pol polyprotein [Oryza sativa (japon... 478 e-133
>gb|AAU89779.1| gag-pol polyprotein-like [Solanum tuberosum]
Length = 1212
Score = 801 bits (2068), Expect = 0.0
Identities = 402/669 (60%), Positives = 506/669 (75%), Gaps = 45/669 (6%)
Query: 1 SMFKKFLTYVETQFQSSVKIFRSDSSGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAE 60
SMFK FL Y+ETQF + +K+ RSDS GEYMS+EF+++L KGI+SQ SCP TPQQNG+AE
Sbjct: 551 SMFKTFLAYIETQFSTCIKLLRSDSGGEYMSYEFKKFLLDKGIVSQHSCPYTPQQNGVAE 610
Query: 61 RKNRHLLDVTRSLLF*AFIPPRFWVEALATTVFLINRLPSIVIEFDSPFFHLFKFQLDYS 120
RKNRHLLDVTR+LL + +P ++WVEAL+T V+LINRLPS V+ +SP+F L+ +YS
Sbjct: 611 RKNRHLLDVTRTLLIESSVPSKYWVEALSTAVYLINRLPSKVLNLESPYFRLYHQNPNYS 670
Query: 121 DLHTFGCVCFVHLPPFEKHKLGVQSVQCAFMGYSNSHKGFVCYDVSNHSFRVSRNVTFFY 180
D HTFGCVCFVHLPP + +KL VQS +CAFMGYS S KGF+CYD +H FR+SRNV FF
Sbjct: 671 DFHTFGCVCFVHLPPSQCNKLSVQSTKCAFMGYSTSQKGFICYDPCSHKFRISRNVVFFE 730
Query: 181 NQFMLHSISPDINDIT-ILPNFSIMPQSIERYKPEFTYVRKCIKQIPTAP-PDTEPPPDP 238
NQ+ +I D++ ++ +LP F + S +R+KP F Y R+ PT P P+T+PPP+
Sbjct: 731 NQYFFPTIV-DLSSVSPLLPTFEDLSSSFKRFKPGFVYERRR----PTLPYPNTDPPPET 785
Query: 239 EP------------VEP-RRSSRTSRAPDRFSPDRYGSKHTSLTASLSSISIPTCYSQVV 285
P +EP RRS+R SR P+ + +++LS+IS+P+CYSQ
Sbjct: 786 APQLESENSSRSGPLEPTRRSTRVSRTPNWYG----------FSSTLSNISVPSCYSQAS 835
Query: 286 KDVRWIKAMNEEIQALQENLTWDIVSCPPDIKPMGCKWVYSVKLNSDGSLNRYNARLVTL 345
K W KAM EE+ AL+EN TWDIVSCP +++P+GCKWVYS+KL+SDG+L+RY ARLV L
Sbjct: 836 KHECWQKAMEEELLALKENDTWDIVSCPSNVRPIGCKWVYSIKLHSDGTLDRYKARLVVL 895
Query: 346 RNKQEYGIDYDETFAPVAKMTTV--------------HQMDVKNIFLHGDLAEDIYMTPP 391
N+QEYG+DY+ETFAPVAKMTTV +Q DVKN FLHGDL EDIYM PP
Sbjct: 896 GNRQEYGVDYEETFAPVAKMTTVRTIIAIAASQNWSLYQKDVKNAFLHGDLKEDIYMKPP 955
Query: 392 QGLFSS-SKGVCKLKRSLYGLKQAPRAWYEKFCSSLLGFCFSHSQYNSSLFIHRTSTGIV 450
LFSS + VCKLKRSLYGLKQAPRAW++KF S+LL F F S+Y+SSLF+ +TST V
Sbjct: 956 PDLFSSPTSDVCKLKRSLYGLKQAPRAWFDKFRSTLLQFSFELSKYDSSLFLRKTSTSCV 1015
Query: 451 LLLLYVDDMVITGSDNAFIQRLKEQLHASFHMKDLGNLHYFLGLEVHSTSKGIFLHQHKY 510
LLL+YVDD++ITG+D++ I L++QL SFHMKDLG L YFLGLEVH+ + G+FL+QHKY
Sbjct: 1016 LLLVYVDDIIITGTDSSLITCLQQQLKDSFHMKDLGTLTYFLGLEVHNVASGVFLNQHKY 1075
Query: 511 ATYLISMAGLQSTNPVDTPLEVNVKYHRDDGDILPDPLLYRQLVGSLN*LTITRPDISFV 570
LIS+AGLQ ++ VDTPLE+NVKY R++GD+LPDP ++RQLVGSLN LTITRPDISF
Sbjct: 1076 TQDLISLAGLQVSSSVDTPLEMNVKYRREEGDLLPDPTIFRQLVGSLNYLTITRPDISFA 1135
Query: 571 VQQVSQFMHSPRHLHLAAVRRIIRYLKGSSHCGLFFSIGNYPKLSAYSDANWAGCPDTRH 630
VQQVSQFM +PRHLHL AV IIRYL G+S GLFF G+ +L+A+SD++WAGCPDTR
Sbjct: 1136 VQQVSQFMQAPRHLHLVAVCHIIRYLLGTSTRGLFFPSGSPIRLNAFSDSDWAGCPDTRR 1195
Query: 631 SVTSWCMFL 639
SV+ WCMFL
Sbjct: 1196 SVSGWCMFL 1204
>emb|CAC95126.1| gag-pol polyprotein [Populus deltoides]
Length = 1382
Score = 615 bits (1585), Expect = e-174
Identities = 350/793 (44%), Positives = 456/793 (57%), Gaps = 33/793 (4%)
Query: 2 MFKKFLTYVETQFQSSVKIFRSDSSGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAER 61
++ F ++TQ + +K FR D GEY S++F + L G + Q SC +TP+QNG+AER
Sbjct: 584 IYAAFRALIKTQHSAVIKCFRCDLGGEYTSNKFCQMLALDGTIHQTSCTDTPEQNGVAER 643
Query: 62 KNRHLLDVTRSLLF*AFIPPRFWVEALATTVFLINRLPSIVIEFDSPFFHLFKFQLDYSD 121
K+RH+++ RSLL AF+ FW EA+ T V LIN +PS SPF L+ DYS
Sbjct: 644 KHRHIVETARSLLLSAFVLSEFWGEAVLTAVSLINTIPSSHSSGLSPFEKLYGHVPDYSS 703
Query: 122 LHTFGCVCFVHLPPFEKHKLGVQSVQCAFMGYSNSHKGFVCYDVSNHSFRVSRNVTFFYN 181
FGC FV P E++KL +S C F+GY KG+ C+D VS +V F
Sbjct: 704 FRVFGCTYFVLHPHVERNKLSSRSAICVFLGYGEGKKGYRCFDPITQKLYVSHHVVFL-E 762
Query: 182 QFMLHSISPDINDITILPNFSIMPQSIERYKPEFTYVRKCIKQ----------------- 224
SI + +T I P S + YVR
Sbjct: 763 HIPFFSIPSTTHSLTKSDLIHIDPFSEDSGNDTSPYVRSICTHNSAGTGTLLSGTPEASF 822
Query: 225 IPTAPPDTEPPPDPEPVEPRRSSRTSRAPDRFSPDRYGSKHTSLTASLSSISIPTCYSQV 284
TAP + DP P + R ++++ PD F+ Y S TS A + + P+ Y +
Sbjct: 823 SSTAPQASSEIVDPPPRQSIRIRKSTKLPD-FAYSCYSSSFTSFLAYIHCLFEPSSYKEA 881
Query: 285 VKDVRWIKAMNEEIQALQENLTWDIVSCPPDIKPMGCKWVYSVKLNSDGSLNRYNARLVT 344
+ D +AM+EE+ AL + TWD+V PP +GC+WVY +K NSDGS+ RY ARLV
Sbjct: 882 ILDPLGQQAMDEELSALHKTDTWDLVPLPPGKSVVGCRWVYKIKTNSDGSIERYKARLVA 941
Query: 345 LRNKQEYGIDYDETFAPVAKMTTVH--------------QMDVKNIFLHGDLAEDIYMTP 390
Q+YG+DY+ETFAP+AKMTT+ Q+DVKN FL+GDL E++YM P
Sbjct: 942 KGYSQQYGMDYEETFAPIAKMTTIRTLIAVASIRQWHISQLDVKNAFLNGDLQEEVYMAP 1001
Query: 391 PQGLFSSSKGVCKLKRSLYGLKQAPRAWYEKFCSSLLGFCFSHSQYNSSLFIHRTSTGIV 450
P G+ S VCKLK++LYGLKQAPRAW+EKF + F S ++S+LFI T G +
Sbjct: 1002 PPGISHDSGYVCKLKKALYGLKQAPRAWFEKFSIVISSLGFVSSSHDSALFIKCTDAGRI 1061
Query: 451 LLLLYVDDMVITGSDNAFIQRLKEQLHASFHMKDLGNLHYFLGLEVHSTSKGIFLHQHKY 510
+L LYVDDM+ITG D I LK +L F MKDLG L YFLG+EV + +G L Q KY
Sbjct: 1062 ILSLYVDDMIITGDDIDGISVLKTELARRFEMKDLGYLRYFLGIEVAYSPRGYLLSQSKY 1121
Query: 511 ATYLISMAGLQSTNPVDTPLEVNVKYHRDDGDILPDPLLYRQLVGSLN*LTITRPDISFV 570
++ A L VDTP+EVN +Y DG L DP LYR +VGSL LTIT PDI++
Sbjct: 1122 VANILERARLTDNKTVDTPIEVNARYSSSDGLPLIDPTLYRTIVGSLVYLTITHPDIAYA 1181
Query: 571 VQQVSQFMHSPRHLHLAAVRRIIRYLKGSSHCGLFFSIGNYPKLSAYSDANWAGCPDTRH 630
V VSQF+ SP +H AAV RI+RYL+G+ L S + +L AYSDA+ P R
Sbjct: 1182 VHVVSQFVASPTTIHWAAVLRILRYLRGTVFQSLLLSSTSSLELRAYSDADHGSDPTDRK 1241
Query: 631 SVTSWCMFLRSSLISWKSKKQARVSKSSTESEYRAMYAACSEIIWLRGFLAELGFP*TEP 690
SVT +C+FL SLISWKSKKQ+ VS+SSTE+EY AM + EI+W R LA++G +
Sbjct: 1242 SVTGFCIFLGDSLISWKSKKQSIVSQSSTEAEYCAMASTTKEIVWSRWLLADMGISFSHL 1301
Query: 691 TSLYADNTSAI*FVANPVFHERTKLIEVDCHSIRDAYDE*LISLPHVSTQLQIADILTKA 750
T +Y DN S+I N VFHERTK IE+DCH R I+LP V + LQIAD TKA
Sbjct: 1302 TPMYCDNQSSIQIAHNSVFHERTKHIEIDCHLTRHHLKHGTIALPFVPSSLQIADFFTKA 1361
Query: 751 VPRPRHQFLVGKL 763
R FLVGKL
Sbjct: 1362 HSISRFCFLVGKL 1374
>gb|AAP53905.1| putative pol polyprotein [Oryza sativa (japonica cultivar-group)]
gi|37534632|ref|NP_921618.1| putative pol polyprotein
[Oryza sativa (japonica cultivar-group)]
Length = 1688
Score = 610 bits (1572), Expect = e-173
Identities = 351/838 (41%), Positives = 478/838 (56%), Gaps = 77/838 (9%)
Query: 1 SMFKKFLTYVETQFQSSVKIFRSDSSGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAE 60
S+++ F + TQF S+++IFRSDS GEYMS+ F+E+L +G L Q SCP QNG+AE
Sbjct: 427 SIYQSFAQMIHTQFSSAIRIFRSDSGGEYMSNAFREFLVSQGTLPQLSCPGAHAQNGVAE 486
Query: 61 RKNRHLLDVTRSLLF*AFIPPRFWVEALATTVFLINRLPSIVIEFDSPFFHLFKFQLDYS 120
RK+RH+++ R+LL +F+P FW EA++T V+LIN PS ++ SP LF Y
Sbjct: 487 RKHRHIIETARTLLIASFVPAHFWAEAISTAVYLINMQPSSSLQGRSPGEVLFGSPPRYD 546
Query: 121 DLHTFGCVCFVHLPPFEKHKLGVQSVQCAFMGYSNSHKGFVCYDVSNHSFRVSRNVTFFY 180
L FGC C+V L P E+ KL QSV+C F+GYS HKG+ CYD S R+SR+VTF
Sbjct: 547 HLRVFGCTCYVLLAPRERTKLTAQSVECVFLGYSLEHKGYRCYDPSARRIRISRDVTFDE 606
Query: 181 NQFMLHSI-----SPDINDITIL-----PNFSIMPQSIERYKPEFTYVRKCIKQIPTAPP 230
N+ +S SP+ N I+ L P+ +P S P + + + PT P
Sbjct: 607 NKPFFYSSTNQPSSPE-NSISFLYLPPIPSPESLPSS--PITPSPSPIPPSVPS-PTYVP 662
Query: 231 DTEPPPDPEPVEPRRS------------------------SRTSRAPDRFSPDRYGSKHT 266
P P P PV P S SR + P+ P + +
Sbjct: 663 PPPPSPSPSPVSPPPSHIPASSSPPHVPSTITLDTFPFHYSRRPKIPNESQPSQPTLEDP 722
Query: 267 SLTASLSS-------------------------ISIPTCYSQVVKDVRWIKAMNEEIQAL 301
+ + SS + P+ Y + + W AM+EE+ AL
Sbjct: 723 TCSVDDSSPAPRYNLRARDALRAPNRDDFVVGVVFEPSTYQEAIVLPHWKLAMSEELAAL 782
Query: 302 QENLTWDIVSCPPDIKPMGCKWVYSVKLNSDGSLNRYNARLVTLRNKQEYGIDYDETFAP 361
+ TWD+V P P+ CKWVY VK SDG + RY ARLV +Q +G DYDETFAP
Sbjct: 783 ERTNTWDVVPLPSHAVPITCKWVYKVKTKSDGQVERYKARLVARGFQQAHGRDYDETFAP 842
Query: 362 VAKMTTVH--------------QMDVKNIFLHGDLAEDIYMTPPQGLFSSSKGVCKLKRS 407
VA MTTV QMDVKN FLHGDL E++YM PP G+ + V +L+R+
Sbjct: 843 VAHMTTVRTLIAVAATRSWTISQMDVKNAFLHGDLHEEVYMHPPPGVEAPPGHVFRLRRA 902
Query: 408 LYGLKQAPRAWYEKFCSSLLGFCFSHSQYNSSLFIHRTSTGIVLLLLYVDDMVITGSDNA 467
LYGLKQAPRAW+ +F S +L FS S ++ +LFIH +S G LLLLYVDDM+ITG D
Sbjct: 903 LYGLKQAPRAWFARFSSVVLAAGFSPSDHDPALFIHTSSRGRTLLLLYVDDMLITGDDLE 962
Query: 468 FIQRLKEQLHASFHMKDLGNLHYFLGLEVHSTSKGIFLHQHKYATYLISMAGLQSTNPVD 527
+I +K +L F M DLG L YFLG+EV ST G +L QH+Y L++ +GL +
Sbjct: 963 YIAFVKGKLSEQFMMSDLGPLSYFLGIEVTSTVDGYYLSQHRYIEDLLAQSGLTDSRTTT 1022
Query: 528 TPLEVNVKYHRDDGDILPDPLLYRQLVGSLN*LTITRPDISFVVQQVSQFMHSPRHLHLA 587
TP+E++V+ DG L DP YR LVGSL LT+TRPDI++ V +SQF+ +P +H
Sbjct: 1023 TPMELHVRLRSTDGTPLDDPSRYRHLVGSLVYLTVTRPDIAYAVHILSQFVSAPISVHYG 1082
Query: 588 AVRRIIRYLKGSSHCGLFFSIGNYPKLSAYSDANWAGCPDTRHSVTSWCMFLRSSLISWK 647
+ R++RYL+G++ LF++ + +L A+SD+ WA P R SVT +C+FL +SL++WK
Sbjct: 1083 HLLRVLRYLRGTTTQCLFYAASSPLQLRAFSDSTWASDPIDRRSVTGYCIFLGTSLLTWK 1142
Query: 648 SKKQARVSKSSTESEYRAMYAACSEIIWLRGFLAELGFP*TEPTSLYADNTSAI*FVANP 707
SKKQ VS+SSTE+E RA+ SEI+WLR LA+ G PT L DNT AI +P
Sbjct: 1143 SKKQTAVSRSSTEAELRALATTTSEIVWLRWLLADFGVSCDVPTPLLCDNTGAIQIANDP 1202
Query: 708 VFHERTKLIEVDCHSIRDAYDE*LISLPHVSTQLQIADILTKAVPRPRHQFLVGKLMI 765
+ HE TK I VD R + I+L +V ++LQ+AD TKA R H+ + KL +
Sbjct: 1203 IKHELTKHIGVDASFTRSHCQQSTIALHYVPSELQVADFFTKAQTREHHRLHLLKLNV 1260
>gb|AAT40550.1| putative receptor kinase [Solanum demissum]
Length = 1358
Score = 577 bits (1487), Expect = e-163
Identities = 323/789 (40%), Positives = 458/789 (57%), Gaps = 43/789 (5%)
Query: 1 SMFKKFLTYVETQFQSSVKIFRSDSSGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAE 60
S+FK F ++ QF S++ FRSD++ EY+S +F+E++ H+GI+ Q +CP TPQQNG+AE
Sbjct: 586 SIFKSFFAEIQNQFGVSIRTFRSDNALEYLSSQFREFMTHQGIIHQTTCPYTPQQNGVAE 645
Query: 61 RKNRHLLDVTRSLLF*AFIPPRFWVEALATTVFLINRLPSIVIEFDSPFFHLFKFQLDYS 120
RKNRHL++ R+LL + +P RFW +A+ T+ +LINR+PS I+ P LF Y
Sbjct: 646 RKNRHLIETARTLLLESNVPLRFWGDAVLTSCYLINRMPSSSIQNQVPHSILFPQSHLYP 705
Query: 121 -DLHTFGCVCFVHLPPFEKHKLGVQSVQCAFMGYSNSHKGFVCYDVSNHSFRVSRNVTFF 179
FG CFVH K KL ++++C F+GYS KG+ CY H + +S +VTFF
Sbjct: 706 IPPRVFGSTCFVHNLAPGKDKLAPRALKCVFLGYSRVQKGYRCYSHDLHRYLMSADVTFF 765
Query: 180 YNQ-FMLHSISPDINDITILPNFSIMPQSIERYKPEFTYVRKCIKQIPTAPPDTEPPPDP 238
+Q + S PD++ + +P +P +E + T P P
Sbjct: 766 ESQPYYTSSNHPDVSMVLPIPQVLPVPTFVE-----------------STVTSTSPVVVP 808
Query: 239 EPVEPRRSSRTSRAPDRFSPDRYGSKHTSLTASLSSISIPTCYS--QVVKDVRWIKAMNE 296
+ R R + PD D + + TA L S P + + W +AM +
Sbjct: 809 PLLTYHRRPRPTLVPD----DSCHAPDPAPTADLPPPSQPLALQKGEALSHSGWRQAMVD 864
Query: 297 EIQALQENLTWDIVSCPPDIKPMGCKWVYSVKLNSDGSLNRYNARLVTLRNKQEYGIDYD 356
E+ AL ++ TW++VS P +GC+WVY+VK+ DG ++R ARLV Q +G+DY
Sbjct: 865 EMSALHKSGTWELVSLPAGKSTVGCRWVYAVKIGPDGQVDRLKARLVAKGYTQIFGLDYS 924
Query: 357 ETFAPVAKMTTV--------------HQMDVKNIFLHGDLAEDIYMTPPQGLFS---SSK 399
+TFAPVAK+ +V HQ+D+KN FLHGDL E++YM P G + SS
Sbjct: 925 DTFAPVAKIASVRLFLSMAAVRHWPLHQLDIKNAFLHGDLEEEVYMEQPPGFVAQGESSS 984
Query: 400 GVCKLKRSLYGLKQAPRAWYEKFCSSLLGFCFSHSQYNSSLFI-HRTSTGIVLLLLYVDD 458
VC+L+RSLYGLKQ+PRAW+ KF + + F + S + S+F H + + L++YVDD
Sbjct: 985 LVCRLRRSLYGLKQSPRAWFGKFSTVIQEFGMTRSGADHSVFYRHSAPSRCIYLVVYVDD 1044
Query: 459 MVITGSDNAFIQRLKEQLHASFHMKDLGNLHYFLGLEVHSTSKGIFLHQHKYATYLISMA 518
+VITG+D I LK+ L F KDLG L YFLG+EV + GI + Q KYA ++
Sbjct: 1045 IVITGNDQDGITDLKQHLFKHFQTKDLGRLKYFLGIEVAQSRSGIVISQRKYALDILEET 1104
Query: 519 GLQSTNPVDTPLEVNVKYHRDDGDILPDPLLYRQLVGSLN*LTITRPDISFVVQQVSQFM 578
G+ PVDTP++ NVK G+ L +P YR+LVG LN LT+TRPDISF V VSQFM
Sbjct: 1105 GMMGCRPVDTPMDPNVKLLPGQGEPLSNPERYRRLVGKLNYLTVTRPDISFPVSVVSQFM 1164
Query: 579 HSPRHLHLAAVRRIIRYLKGSSHCGLFFSIGNYPKLSAYSDANWAGCPDTRHSVTSWCMF 638
SP H AV RI+RY+K + GL F + + Y+DA+WAG P R S + +C+
Sbjct: 1165 TSPCDSHWEAVVRILRYIKSAPGKGLLFEDQGHEHIIGYTDADWAGSPSDRRSTSGYCVL 1224
Query: 639 LRSSLISWKSKKQARVSKSSTESEYRAMYAACSEIIWLRGFLAELGFP*TEPTSLYADNT 698
+ +L+SWKSKKQ V++SS ESEYRAM A E++W++ L EL F + L DN
Sbjct: 1225 VGGNLVSWKSKKQNVVARSSAESEYRAMATATCELVWIKQLLGELKFGKVDKMELVCDNQ 1284
Query: 699 SAI*FVANPVFHERTKLIEVDCHSIRDAYDE*LISLPHVSTQLQIADILTKAVPRPRHQF 758
+A+ +NPVFHERTK IE+DCH +R+ I V + Q+ADI TK++ PR +
Sbjct: 1285 AALHIASNPVFHERTKHIEIDCHFVREKILSGDIVTKFVKSNDQLADIFTKSLTCPRINY 1344
Query: 759 LVGKLMIFD 767
+ KL +D
Sbjct: 1345 ICNKLGTYD 1353
>gb|AAU89728.1| putative retroelement pol polyprotein-like [Solanum tuberosum]
Length = 1476
Score = 563 bits (1452), Expect = e-159
Identities = 322/818 (39%), Positives = 459/818 (55%), Gaps = 75/818 (9%)
Query: 1 SMFKKFLTYVETQFQSSVKIFRSDSSGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAE 60
++ + F+ ++TQF +KIFRSD+ E+ + + + GI+ Q SCP+TPQQNG+ E
Sbjct: 652 TVLQNFILMIDTQFGQKIKIFRSDNGTEFFNAQCDGLFKSHGIVHQSSCPHTPQQNGVVE 711
Query: 61 RKNRHLLDVTRSLLF*AFIPPRFWVEALATTVFLINRLPSIVIEFDSPFFHLFKFQLDYS 120
R+++H+L+ R+L F +P RFW E + + V +INR+PS V+ SPF ++K D S
Sbjct: 712 RRHKHILETARALRFQGHLPIRFWGECVLSAVHIINRIPSSVLHNKSPFELMYKRSPDLS 771
Query: 121 DLHTFGCVCFVHLPPFEKHKLGVQSVQCAFMGYSNSHKGFVCYDVSNHSFRVSRNVTFFY 180
+ GC+C H + + +++ KG+ YD+ + F VSR++ F
Sbjct: 772 YMRVIGCLC---------HATNLVN--------TSTQKGYKLYDLEHQHFFVSRDMVF-- 812
Query: 181 NQFMLHSISPDINDITILPNFSIMPQSIERYKPEFTYVRKCI----KQIPTAPPDTEPPP 236
N+ + SP + D P F P + + V+ I + IP A P +
Sbjct: 813 NEAVFPFQSPALADPHDTPVFLASPPC-SSHTEDADAVQPAIITSEEIIPVASPPSAVSD 871
Query: 237 DP--EPVEPRRSSRTSRAP----------------------DRFSPDRYGSKHTSLTASL 272
D P E RRS RT + P D S + AS
Sbjct: 872 DHLHPPPERRRSYRTGKPPIWQKDFITTSTSRSNHCLYPISDNIDYSCLSSTYQCYIASS 931
Query: 273 SSISIPTCYSQVVKDVRWIKAMNEEIQALQENLTWDIVSCPPDIKPMGCKWVYSVKLNSD 332
S + P Y Q D RW+ AM EEIQAL++N TW++VS P K +GCKWVY +K +
Sbjct: 932 SVETEPQFYYQAANDCRWVHAMKEEIQALEDNKTWEVVSLPKGKKAIGCKWVYKIKYKAS 991
Query: 333 GSLNRYNARLVTLRNKQEYGIDYDETFAPVAKMTT--------------VHQMDVKNIFL 378
G + R+ ARLV Q+ G+DY ETF+PV KM T + QMDV N FL
Sbjct: 992 GEIERFKARLVAKGYNQKEGLDYQETFSPVVKMVTLRTVLTLAVSKGWDIQQMDVYNAFL 1051
Query: 379 HGDLAEDIYMTPPQGLFSSSKG---VCKLKRSLYGLKQAPRAWYEKFCSSLLGFCFSHSQ 435
GDL E++YM PQG G VC+L +SLYGLKQA R W K ++LL F S
Sbjct: 1052 QGDLIEEVYMQLPQGFQYDKTGDPKVCRLLKSLYGLKQASRQWNVKLTTALLAAGFQQSH 1111
Query: 436 YNSSLFIHRTSTGIVLLLLYVDDMVITGSDNAFIQRLKEQLHASFHMKDLGNLHYFLGLE 495
+ SL + RT+ GIV++L+YVDD++ITGS I K+ L A+F +KDLG L YFLG+E
Sbjct: 1112 LDYSLMLKRTADGIVIVLIYVDDLLITGSSLQLIDDAKQVLKANFKIKDLGTLRYFLGME 1171
Query: 496 VHSTSKGIFLHQHKYATYLISMAGLQSTNPVDTPLEVNVKYHRDDGDI----------LP 545
+ G+ +HQ KYA LIS GL + P TP+E+++K + D+ L
Sbjct: 1172 FARNASGMLMHQRKYALELISDLGLGGSKPSVTPVELHLKLTTREFDLHVGSSGADSLLA 1231
Query: 546 DPLLYRQLVGSLN*LTITRPDISFVVQQVSQFMHSPRHLHLAAVRRIIRYLKGSSHCGLF 605
DP Y++LVG L LTITRPDISF VQ +SQFMH+P+ H+ A R+++Y+K + GL+
Sbjct: 1232 DPTEYQRLVGRLLYLTITRPDISFAVQHLSQFMHAPKVSHMEAAIRVVKYVKQAPGLGLY 1291
Query: 606 FSIGNYPKLSAYSDANWAGCPDTRHSVTSWCMFLRSSLISWKSKKQARVSKSSTESEYRA 665
++ L AY DA+W C +TR S+T + + S+L+SWKSKKQ +S+SS E+EYR+
Sbjct: 1292 MAVQTADTLQAYCDADWGSCINTRKSITGYMIQFGSALLSWKSKKQPTISRSSAEAEYRS 1351
Query: 666 MYAACSEIIWLRGFLAELGFP*TEPTSLYADNTSAI*FVANPVFHERTKLIEVDCHSIRD 725
+ + +E++WL G EL P + P SLY D+ +AI ANPVFHERTK I++DCH IR+
Sbjct: 1352 LASTVAELVWLTGLFKELDMPLSLPVSLYCDSKAAIQIAANPVFHERTKHIDIDCHFIRE 1411
Query: 726 AYDE*LISLPHVSTQLQIADILTKAVPRPRHQFLVGKL 763
L+ + ++ TQ Q ADILTK + +H +LV KL
Sbjct: 1412 KVQAGLVMIHYLPTQEQPADILTKGLSSAQHSYLVSKL 1449
>gb|AAB61111.1| Strong similarity to Zea mays retrotransposon Hopscotch polyprotein
(gb|U12626). [Arabidopsis thaliana]
gi|25301690|pir||G96722 hypothetical protein F20P5.25
[imported] - Arabidopsis thaliana
Length = 1315
Score = 552 bits (1423), Expect = e-155
Identities = 310/800 (38%), Positives = 457/800 (56%), Gaps = 49/800 (6%)
Query: 6 FLTYVETQFQSSVKIFRSDSSGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAERKNRH 65
F+T VE QF++++K RSD++ E F ++ KGI+ SCP TPQQN + ERK++H
Sbjct: 514 FVTMVENQFETTIKGVRSDNAPEL---NFTQFYHSKGIVPYHSCPETPQQNSVVERKHQH 570
Query: 66 LLDVTRSLLF*AFIPPRFWVEALATTVFLINRLPSIVIEFDSPFFHLFKFQLDYSDLHTF 125
+L+V RSL F + IP +W + + T V+LINRLP+ ++E PF L K Y + F
Sbjct: 571 ILNVARSLFFQSHIPISYWGDCILTAVYLINRLPAPILEDKCPFEVLTKTVPTYDHIKVF 630
Query: 126 GCVCFVHLPPFEKHKLGVQSVQCAFMGYSNSHKGFVCYDVSNHSFRVSRNVTFFYNQFML 185
GC+C+ P ++HK ++ CAF+GY + KG+ D+ HS VSR+V F F
Sbjct: 631 GCLCYASTSPKDRHKFSPRAKACAFIGYPSGFKGYKLLDLETHSIIVSRHVVFHEELFPF 690
Query: 186 HSISPDINDITILPNFSIMP----QSIERYKPEFTYVRKCIKQIPTAPPDTEPPPDPEPV 241
+ P+ + P QS + P + ++ +P+A P P
Sbjct: 691 LGSDLSQEEQNFFPDLNPTPPMQRQSSDHVNPSDS--SSSVEILPSANPTNNVPEPSVQT 748
Query: 242 EPRRSSRTSRAPDRF----------------SPDRYGSKHTSLTASLSSISIPTCYSQVV 285
R++ + + D + S DR + + A L P+ Y++
Sbjct: 749 SHRKAKKPAYLQDYYCHSVVSSTPHEIRKFLSYDRINDPYLTFLACLDKTKEPSNYTEAE 808
Query: 286 KDVRWIKAMNEEIQALQENLTWDIVSCPPDIKPMGCKWVYSVKLNSDGSLNRYNARLVTL 345
K W AM E L+ TW++ S P D + +GC+W++ +K NSDGS+ RY ARLV
Sbjct: 809 KLQVWRDAMGAEFDFLEGTHTWEVCSLPADKRCIGCRWIFKIKYNSDGSVERYKARLVAQ 868
Query: 346 RNKQEYGIDYDETFAPVAKMTTVH--------------QMDVKNIFLHGDLAEDIYMTPP 391
Q+ GIDY+ETF+PVAK+ +V Q+D+ N FL+GDL E+IYM P
Sbjct: 869 GYTQKEGIDYNETFSPVAKLNSVKLLLGVAARFKLSLTQLDISNAFLNGDLDEEIYMRLP 928
Query: 392 QGLFSSSKG-------VCKLKRSLYGLKQAPRAWYEKFCSSLLGFCFSHSQYNSSLFIHR 444
QG ++S +G VC+LK+SLYGLKQA R WY KF S+LLG F S + + F+ +
Sbjct: 929 QG-YASRQGDSLPPNAVCRLKKSLYGLKQASRQWYLKFSSTLLGLGFIQSYCDHTCFL-K 986
Query: 445 TSTGIVL-LLLYVDDMVITGSDNAFIQRLKEQLHASFHMKDLGNLHYFLGLEVHSTSKGI 503
S GI L +L+Y+DD++I +++A + LK Q+ + F ++DLG L YFLGLE+ + KGI
Sbjct: 987 ISDGIFLCVLVYIDDIIIASNNDAAVDILKSQMKSFFKLRDLGELKYFLGLEIVRSDKGI 1046
Query: 504 FLHQHKYATYLISMAGLQSTNPVDTPLEVNVKYHRDDGDILPDPLLYRQLVGSLN*LTIT 563
+ Q KYA L+ G P P++ ++ + D G + YR+L+G L L IT
Sbjct: 1047 HISQRKYALDLLDETGQLGCKPSSIPMDPSMVFAHDSGGDFVEVGPYRRLIGRLMYLNIT 1106
Query: 564 RPDISFVVQQVSQFMHSPRHLHLAAVRRIIRYLKGSSHCGLFFSIGNYPKLSAYSDANWA 623
RPDI+F V +++QF +PR HL AV +I++Y+KG+ GLF+S + +L Y++A++
Sbjct: 1107 RPDITFAVNKLAQFSMAPRKAHLQAVYKILQYIKGTIGQGLFYSATSELQLKVYANADYN 1166
Query: 624 GCPDTRHSVTSWCMFLRSSLISWKSKKQARVSKSSTESEYRAMYAACSEIIWLRGFLAEL 683
C D+R S + +CMFL SLI WKS+KQ VSKSS E+EYR++ A E++WL FL EL
Sbjct: 1167 SCRDSRRSTSGYCMFLGDSLICWKSRKQDVVSKSSAEAEYRSLSVATDELVWLTNFLKEL 1226
Query: 684 GFP*TEPTSLYADNTSAI*FVANPVFHERTKLIEVDCHSIRDAYDE*LISLPHVSTQLQI 743
P ++PT L+ DN +AI N VFHERTK IE DCHS+R+ + L L H++T+LQI
Sbjct: 1227 QVPLSKPTLLFCDNEAAIHIANNHVFHERTKHIESDCHSVRERLLKGLFELYHINTELQI 1286
Query: 744 ADILTKAVPRPRHQFLVGKL 763
AD TK + L+ K+
Sbjct: 1287 ADPFTKPLYPSHFHRLISKM 1306
>pir||F86470 probable retroelement polyprotein [imported] - Arabidopsis thaliana
gi|9989049|gb|AAG10812.1| Putative retroelement
polyprotein [Arabidopsis thaliana]
Length = 1404
Score = 551 bits (1421), Expect = e-155
Identities = 323/844 (38%), Positives = 461/844 (54%), Gaps = 79/844 (9%)
Query: 3 FKKFLTYVETQFQSSVKIFRSDSSGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAERK 62
F F TYV QF + +K+FR+D+ GEY S +F+++L +GI+ Q SCP TPQQNG+AERK
Sbjct: 557 FTNFETYVTNQFNAKIKVFRTDNGGEYTSQKFRDHLAKRGIIHQTSCPYTPQQNGVAERK 616
Query: 63 NRHLLDVTRSLLF*AFIPPRFWVEALATTVFLINRLPSIVIEFDSPFFHLFKFQLDYSDL 122
NRHL++V RS++F +P RFW +A+ T +LINR P+ V+ SPF L + L
Sbjct: 617 NRHLMEVARSMMFHTSVPKRFWGDAVLTACYLINRTPTKVLSDLSPFEVLNNTKPFIDHL 676
Query: 123 HTFGCVCFVHLPPFEKHKLGVQSVQCAFMGYSNSHKGFVCYDVSNHSFRVSRNVTFFYNQ 182
FGCVCFV +P ++ KL +S +C F+GYS + KG+ C+D + + +SR+V F NQ
Sbjct: 677 RVFGCVCFVLIPGEQRSKLDAKSTKCMFLGYSTTQKGYKCFDPTKNRTFISRDVKFLENQ 736
Query: 183 ------------FMLHSISPDINDIT-ILPNFSIMPQSIERYKPEFTYVRKCIKQ----- 224
+ HS S + + +L + S +++PE T ++ + Q
Sbjct: 737 DYNNKKDWENLKDLTHSTSDRVETLKFLLDHLGNDSTSTTQHQPEMTQDQEDLNQENEEV 796
Query: 225 ----------IPTAPPDTEPPPD-------------------PEPVEPRRSSRTSRAPDR 255
+ PP+T+ + P P RRS+R R +
Sbjct: 797 SLQHQENLTHVQEDPPNTQEHSEHVQEIQDDSSEDEEPTQVLPPPPPLRRSTRIRRKKEF 856
Query: 256 FSPDRYGS-------------KHTSLTASLSSISIPTCYSQVVKDVRWIKAMNEEIQALQ 302
F+ + H + + +S IP Y + ++ W A+ +EI A++
Sbjct: 857 FNSNAVAHPFQATCSLALVPLDHQAFLSKISEHWIPQTYEEAMEVKEWRDAIADEINAMK 916
Query: 303 ENLTWDIVSCPPDIKPMGCKWVYSVKLNSDGSLNRYNARLVTLRNKQEYGIDYDETFAPV 362
N TWD P K + +WV+++K S+G + RY RLV Q YG DY ETFAPV
Sbjct: 917 RNHTWDEDDLPKGKKTVSSRWVFTIKYKSNGDIERYKTRLVARGFTQTYGSDYMETFAPV 976
Query: 363 AKMTTVH--------------QMDVKNIFLHGDLAEDIYMTPPQGLFSS--SKGVCKLKR 406
AK+ TV QMDVKN FL G+L +D+YMTPP GL + V +L++
Sbjct: 977 AKLHTVRVVLALATNLSWGLWQMDVKNAFLQGELEDDVYMTPPPGLEDTIPCDKVLRLRK 1036
Query: 407 SLYGLKQAPRAWYEKFCSSLLGFCFSHSQYNSSLFIHRTSTGIVLLLLYVDDMVITGSDN 466
++YGLKQ+PRAWY K +L F S+ + +LF ++ GIV++L+YVDD++ITG +
Sbjct: 1037 AIYGLKQSPRAWYHKLSRTLKDHGFKKSESDHTLFTLQSPQGIVVVLIYVDDLIITGDNK 1096
Query: 467 AFIQRLKEQLHASFHMKDLGNLHYFLGLEVHSTSKGIFLHQHKYATYLISMAGLQSTNPV 526
I K L + F +KDLG L YFLG+EV ++ G+FL Q KY L++ G P
Sbjct: 1097 DGIDSTKTFLKSCFDIKDLGELKYFLGIEVCRSNAGLFLSQRKYTLDLLNETGFMDAKPA 1156
Query: 527 DTPLEVNVKYHR---DDGDILPDPLLYRQLVGSLN*LTITRPDISFVVQQVSQFMHSPRH 583
TPLE K +R + + D LYR+LVG L LT TRPDI F V QVSQ M P
Sbjct: 1157 RTPLEDGYKVNRKGEKEDEKFGDAPLYRKLVGKLIYLTNTRPDICFAVNQVSQHMKVPMV 1216
Query: 584 LHLAAVRRIIRYLKGSSHCGLFFSIGNYPKLSAYSDANWAGCPDTRHSVTSWCMFLRSSL 643
H V RI+RYLKGSS G++ + ++ Y DA++AG R S T +C F+ +L
Sbjct: 1217 YHWNMVERILRYLKGSSGQGIWMGKNSSTEIVGYCDADYAGDRGDRRSKTGYCTFIGGNL 1276
Query: 644 ISWKSKKQARVSKSSTESEYRAMYAACSEIIWLRGFLAELGFP*TEPTSLYADNTSAI*F 703
+WK+KKQ VS SS ESEYRAM +E+ WL+ L +LG P +++ DN +AI
Sbjct: 1277 ATWKTKKQKVVSCSSAESEYRAMRKLTNELTWLKALLKDLGIEQHMPITMHCDNKAAIYI 1336
Query: 704 VANPVFHERTKLIEVDCHSIRDAYDE*LISLPHVSTQLQIADILTKAVPRPRHQFLVGKL 763
+N VFHERTK IEVDCH +R+ E + + ++ Q+ADI TKA F+ GKL
Sbjct: 1337 ASNSVFHERTKHIEVDCHKVREKIIEGVTLPCYTRSEDQLADIFTKAASLKVCNFIHGKL 1396
Query: 764 MIFD 767
+ D
Sbjct: 1397 GLVD 1400
>gb|AAG50751.1| polyprotein, putative [Arabidopsis thaliana] gi|25301686|pir||F96610
probable polyprotein T8L23.26 [imported] - Arabidopsis
thaliana
Length = 1468
Score = 551 bits (1419), Expect = e-155
Identities = 322/846 (38%), Positives = 474/846 (55%), Gaps = 84/846 (9%)
Query: 4 KKFLTYVETQFQSSVKIFRSDSSGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAERKN 63
K F+ VE QF + +KI RSD+ E++ +EY HKGI + SC TP QNG ERK+
Sbjct: 620 KDFIALVERQFDTEIKIVRSDNGTEFLC--MREYFLHKGIAHETSCVGTPHQNGRVERKH 677
Query: 64 RHLLDVTRSLLF*AFIPPRFWVEALATTVFLINRLPSIVIEFDSPFFHLFKFQLDYSDLH 123
RH+L++ R+L F +++P +FW E + + +LINR PS++++ SP+ L+K YS L
Sbjct: 678 RHILNIARALRFQSYLPIQFWGECILSAAYLINRTPSMLLQGKSPYEMLYKTAPKYSHLR 737
Query: 124 TFGCVCFVHLPPFEKHKLGVQSVQCAFMGYSNSHKGFVCYDVSNHSFRVSRNVTFFYNQF 183
FG +C+ H + K +S +C F+GY + KG+ +D+ F VSR+V F +F
Sbjct: 738 VFGSLCYAHNQNHKGDKFAARSRRCVFVGYPHGQKGWRLFDLEEQKFFVSRDVIFQETEF 797
Query: 184 --------------MLHSISP---------------DINDITILPNFS---IMPQ-SIER 210
++ + P +I + T+ PN + I+P+ + E
Sbjct: 798 PYSKMSCNEEDERVLVDCVGPPFIEEAIGPRTIIGRNIGEATVGPNVATGPIIPEINQES 857
Query: 211 YKP-EFTYVRKCIKQIPTAPPDTEPPP----DPEPVEPRRSSRTSRAP------------ 253
P EF + + ++ T P P P++ RRSSR ++ P
Sbjct: 858 SSPSEFVSLSSLDPFLASSTVQTADLPLSSTTPAPIQLRRSSRQTQKPMKLKNFVTNTVS 917
Query: 254 -DRFSPD----------------RYGSKHTSLTASLSSISIPTCYSQVVKDVRWIKAMNE 296
+ SP+ R+ S H + A++++ PT Y++ + D W +AM+
Sbjct: 918 VESISPEASSSSLYPIEKYVDCHRFTSSHKAFLAAVTAGMEPTTYNEAMVDKAWREAMSA 977
Query: 297 EIQALQENLTWDIVSCPPDIKPMGCKWVYSVKLNSDGSLNRYNARLVTLRNKQEYGIDYD 356
EI++L+ N T+ IV+ PP + +G KWVY +K SDG++ RY ARLV L N Q+ G+DYD
Sbjct: 978 EIESLRVNQTFSIVNLPPGKRALGNKWVYKIKYRSDGAIERYKARLVVLGNCQKEGVDYD 1037
Query: 357 ETFAPVAKMTTV--------------HQMDVKNIFLHGDLAEDIYMTPPQGLFSSSKG-V 401
ETFAPVAKM+TV HQMDV N FLHGDL E++YM PQG V
Sbjct: 1038 ETFAPVAKMSTVRLFLGVAAARDWHVHQMDVHNAFLHGDLKEEVYMKLPQGFQCDDPSKV 1097
Query: 402 CKLKRSLYGLKQAPRAWYEKFCSSLLGFCFSHSQYNSSLFIHRTSTGIVLLLLYVDDMVI 461
C+L +SLYGLKQAPR W+ K S+L + F+ S + SLF + V +L+YVDD++I
Sbjct: 1098 CRLHKSLYGLKQAPRCWFSKLSSALKQYGFTQSLSDYSLFSYNNDGIFVHVLVYVDDLII 1157
Query: 462 TGSDNAFIQRLKEQLHASFHMKDLGNLHYFLGLEVHSTSKGIFLHQHKYATYLISMAGLQ 521
+GS + + K L + FHMKDLG L YFLG+EV ++G +L Q KY +IS GL
Sbjct: 1158 SGSCPDAVAQFKSYLESCFHMKDLGLLKYFLGIEVSRNAQGFYLSQRKYVLDIISEMGLL 1217
Query: 522 STNPVDTPLEVNVKYHRDDGDILPDPLLYRQLVGSLN*LTITRPDISFVVQQVSQFMHSP 581
P PLE N K +L D YR+LVG L L +TRP++S+ V ++QFM +P
Sbjct: 1218 GARPSAFPLEQNHKLSLSTSPLLSDSSRYRRLVGRLIYLVVTRPELSYSVHTLAQFMQNP 1277
Query: 582 RHLHLAAVRRIIRYLKGSSHCGLFFSIGNYPKLSAYSDANWAGCPDTRHSVTSWCMFLRS 641
R H A R++RYLK + G+ S + +++ + D+++A CP TR S+T + + L
Sbjct: 1278 RQDHWNAAIRVVRYLKSNPGQGILLSSTSTLQINGWCDSDYAACPLTRRSLTGYFVQLGD 1337
Query: 642 SLISWKSKKQARVSKSSTESEYRAMYAACSEIIWLRGFLAELGFP*TEPTSLYADNTSAI 701
+ ISWK+KKQ VS+SS E+EYRAM E++WL+ L +LG + +++D+ SAI
Sbjct: 1338 TPISWKTKKQPTVSRSSAEAEYRAMAFLTQELMWLKRVLYDLGVSHVQAMRIFSDSKSAI 1397
Query: 702 *FVANPVFHERTKLIEVDCHSIRDAYDE*LISLPHVSTQLQIADILTKAVPRPRHQFLVG 761
NPV HERTK +EVDCH IRDA + +I+ V + Q+ADILTKA+ ++ +
Sbjct: 1398 ALSVNPVQHERTKHVEVDCHFIRDAILDGIIATSFVPSHKQLADILTKALGEKEVRYFLR 1457
Query: 762 KLMIFD 767
KL I D
Sbjct: 1458 KLGILD 1463
>gb|AAD26943.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301694|pir||E84535 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1454
Score = 544 bits (1401), Expect = e-153
Identities = 310/805 (38%), Positives = 462/805 (56%), Gaps = 54/805 (6%)
Query: 1 SMFKKFLTYVETQFQSSVKIFRSDSSGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAE 60
++F F+ VE Q++ VK RSD++ E +F + KGI+S SCP TP+QN + E
Sbjct: 660 TVFPAFIQQVENQYKVKVKAVRSDNAPEL---KFTSFYAEKGIVSFHSCPETPEQNSVVE 716
Query: 61 RKNRHLLDVTRSLLF*AFIPPRFWVEALATTVFLINRLPSIVIEFDSPFFHLFKFQLDYS 120
RK++H+L+V R+L+F + +P W + + T VFLINR PS ++ +P+ L Y
Sbjct: 717 RKHQHILNVARALMFQSQVPLSLWGDCVLTAVFLINRTPSQLLMNKTPYEILTGTAPVYE 776
Query: 121 DLHTFGCVCFVHLPPFEKHKLGVQSVQCAFMGYSNSHKGFVCYDVSNHSFRVSRNVTFFY 180
L TFGC+C+ P ++HK +S C F+GY + +KG+ D+ +++ +SRNV F
Sbjct: 777 QLRTFGCLCYSSTSPKQRHKFQPRSRACLFLGYPSGYKGYKLMDLESNTVFISRNVQFHE 836
Query: 181 NQF-----------------MLHSISPDINDITILPNFSIMPQSIERYKPEFTYVRKCIK 223
F M+ S I+D T P S +P I P+ + R ++
Sbjct: 837 EVFPLAKNPGSESSLKLFTPMVPVSSGIISDTTHSP--SSLPSQISDLPPQISSQR--VR 892
Query: 224 QIPTAPPDTEPPPDPEPVEPRRSSRTSRAPDRFSPDRYGSKHTSLTASLSSISIPTCYSQ 283
+ P D +S S + H +++ I IPT Y++
Sbjct: 893 KPPAHLNDYH-------CNTMQSDHKYPISSTISYSKISPSHMCYINNITKIPIPTNYAE 945
Query: 284 VVKDVRWIKAMNEEIQALQENLTWDIVSCPPDIKPMGCKWVYSVKLNSDGSLNRYNARLV 343
W +A++ EI A+++ TW+I + P K +GCKWV+++K +DG+L RY ARLV
Sbjct: 946 AQDTKEWCEAVDAEIGAMEKTNTWEITTLPKGKKAVGCKWVFTLKFLADGNLERYKARLV 1005
Query: 344 TLRNKQEYGIDYDETFAPVAKMTTVH--------------QMDVKNIFLHGDLAEDIYMT 389
Q+ G+DY +TF+PVAKMTT+ Q+DV N FL+G+L E+I+M
Sbjct: 1006 AKGYTQKEGLDYTDTFSPVAKMTTIKLLLKVSASKKWFLKQLDVSNAFLNGELEEEIFMK 1065
Query: 390 PPQGLFSSSKG-------VCKLKRSLYGLKQAPRAWYEKFCSSLLGFCFSHSQYNSSLFI 442
P+G ++ KG V +LKRS+YGLKQA R W++KF SSLL F + + +LF+
Sbjct: 1066 IPEG-YAERKGIVLPSNVVLRLKRSIYGLKQASRQWFKKFSSSLLSLGFKKTHGDHTLFL 1124
Query: 443 HRTSTGIVLLLLYVDDMVITGSDNAFIQRLKEQLHASFHMKDLGNLHYFLGLEVHSTSKG 502
V++L+YVDD+VI + A +L E+L F ++DLG+L YFLGLEV T+ G
Sbjct: 1125 KMYDGEFVIVLVYVDDIVIASTSEAAAAQLTEELDQRFKLRDLGDLKYFLGLEVARTTAG 1184
Query: 503 IFLHQHKYATYLISMAGLQSTNPVDTPLEVNVKYHRDDGDILPDPLLYRQLVGSLN*LTI 562
I + Q KYA L+ G+ + PV P+ N+K +DDGD++ D YR++VG L LTI
Sbjct: 1185 ISICQRKYALELLQSTGMLACKPVSVPMIPNLKMRKDDGDLIEDIEQYRRIVGKLMYLTI 1244
Query: 563 TRPDISFVVQQVSQFMHSPRHLHLAAVRRIIRYLKGSSHCGLFFSIGNYPKLSAYSDANW 622
TRPDI+F V ++ QF +PR HL A R+++Y+KG+ GLF+S + L ++D++W
Sbjct: 1245 TRPDITFAVNKLCQFSSAPRTTHLTAAYRVLQYIKGTVGQGLFYSASSDLTLKGFADSDW 1304
Query: 623 AGCPDTRHSVTSWCMFLRSSLISWKSKKQARVSKSSTESEYRAMYAACSEIIWLRGFLAE 682
A C D+R S TS+ MF+ SLISW+SKKQ VS+SS E+EYRA+ A E++WL L
Sbjct: 1305 ASCQDSRRSTTSFTMFVGDSLISWRSKKQHTVSRSSAEAEYRALALATCEMVWLFTLLVS 1364
Query: 683 LGFP*TEPTSLYADNTSAI*FVANPVFHERTKLIEVDCHSIRDAYDE*LISLPHVSTQLQ 742
L P LY+D+T+AI NPVFHERTK I++DCH++R+ D + L HV T+ Q
Sbjct: 1365 LQASPPVPI-LYSDSTAAIYIATNPVFHERTKHIKLDCHTVRERLDNGELKLLHVRTEDQ 1423
Query: 743 IADILTKAVPRPRHQFLVGKLMIFD 767
+ADILTK + + + L K+ I +
Sbjct: 1424 VADILTKPLFPYQFEHLKSKMSILN 1448
>emb|CAB10526.1| retrotransposon like protein [Arabidopsis thaliana]
gi|7268497|emb|CAB78748.1| retrotransposon like protein
[Arabidopsis thaliana] gi|7444421|pir||A71444 probable
LTR retrotransposon - Arabidopsis thaliana
Length = 1433
Score = 538 bits (1387), Expect = e-151
Identities = 302/790 (38%), Positives = 448/790 (56%), Gaps = 42/790 (5%)
Query: 2 MFKKFLTYVETQFQSSVKIFRSDSSGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAER 61
+F F+ V TQ+Q+ +K RSD++ E +F + GI++ SCP TP+QN + ER
Sbjct: 647 VFPAFINMVHTQYQTKLKSVRSDNAHEL---KFTDLFAAHGIVAYHSCPETPEQNSVVER 703
Query: 62 KNRHLLDVTRSLLF*AFIPPRFWVEALATTVFLINRLPSIVIEFDSPFFHLFKFQLDYSD 121
K++H+L+V R+LLF + IP FW + + T VFLINRLP+ V+ SP+ L Y
Sbjct: 704 KHQHILNVARALLFQSNIPLEFWGDCVLTAVFLINRLPTPVLNNKSPYEKLKNIPPAYES 763
Query: 122 LHTFGCVCFVHLPPFEKHKLGVQSVQCAFMGYSNSHKGFVCYDVSNHSFRVSRNVTFFYN 181
L TFGC+C+ P ++HK ++ C F+GY +KG+ D+ H+ +SR+V F +
Sbjct: 764 LKTFGCLCYSSTSPKQRHKFEPRARACVFLGYPLGYKGYKLLDIETHAVSISRHVIFHED 823
Query: 182 --QFMLHSISPDINDITILPNFSIMPQSIERYKPEFTYVRKCIKQIPTAP-PDTEPPPDP 238
F+ +I DI D L F P + E T + I T P D
Sbjct: 824 IFPFISSTIKDDIKDFFPLLQF---PARTDDLPLEQTSI------IDTHPHQDVSSSKAL 874
Query: 239 EPVEPRRSSRTSRAPDRFSPDRYGSKHTS-----LTASLSSISIPTCYSQVVKDVRWIKA 293
P +P S+ + P + D + +T+ ++++ IP YS+ W A
Sbjct: 875 VPFDPL--SKRQKKPPKHLQDFHCYNNTTEPFHAFINNITNAVIPQRYSEAKDFKAWCDA 932
Query: 294 MNEEIQALQENLTWDIVSCPPDIKPMGCKWVYSVKLNSDGSLNRYNARLVTLRNKQEYGI 353
M EEI A+ TW +VS PP+ K +GCKWV+++K N+DGS+ RY ARLV QE G+
Sbjct: 933 MKEEIGAMVRTNTWSVVSLPPNKKAIGCKWVFTIKHNADGSIERYKARLVAKGYTQEEGL 992
Query: 354 DYDETFAPVAKMTTV--------------HQMDVKNIFLHGDLAEDIYMTPPQGLFS--- 396
DY+ETF+PVAK+T+V HQ+D+ N FL+GDL E+IYM P G
Sbjct: 993 DYEETFSPVAKLTSVRMMLLLAAKMKWSVHQLDISNAFLNGDLDEEIYMKIPPGYADLVG 1052
Query: 397 ---SSKGVCKLKRSLYGLKQAPRAWYEKFCSSLLGFCFSHSQYNSSLFIHRTSTGIVLLL 453
+C+L +S+YGLKQA R WY K ++L G F S + +LFI + ++ +L
Sbjct: 1053 EALPPHAICRLHKSIYGLKQASRQWYLKLSNTLKGMGFQKSNADHTLFIKYANGVLMGVL 1112
Query: 454 LYVDDMVITGSDNAFIQRLKEQLHASFHMKDLGNLHYFLGLEVHSTSKGIFLHQHKYATY 513
+YVDD++I + + + + +L + F ++DLG YFLG+E+ + KGI + Q KY
Sbjct: 1113 VYVDDIMIVSNSDDAVAQFTAELKSYFKLRDLGAAKYFLGIEIARSEKGISICQRKYILE 1172
Query: 514 LISMAGLQSTNPVDTPLEVNVKYHRDDGDILPDPLLYRQLVGSLN*LTITRPDISFVVQQ 573
L+S G + P PL+ +VK +++DG L D YR+LVG L L ITRPDI++ V
Sbjct: 1173 LLSTTGFLGSKPSSIPLDPSVKLNKEDGVPLTDSTSYRKLVGKLMYLQITRPDIAYAVNT 1232
Query: 574 VSQFMHSPRHLHLAAVRRIIRYLKGSSHCGLFFSIGNYPKLSAYSDANWAGCPDTRHSVT 633
+ QF H+P +HL+AV +++RYLKG+ GLF+S + L Y+D+++ C D+R V
Sbjct: 1233 LCQFSHAPTSVHLSAVHKVLRYLKGTVGQGLFYSADDKFDLRGYTDSDFGSCTDSRRCVA 1292
Query: 634 SWCMFLRSSLISWKSKKQARVSKSSTESEYRAMYAACSEIIWLRGFLAELGFP*TEPTSL 693
++CMF+ L+SWKSKKQ VS S+ E+E+RAM E+IWL + P P L
Sbjct: 1293 AYCMFIGDYLVSWKSKKQDTVSMSTAEAEFRAMSQGTKEMIWLSRLFDDFKVPFIPPAYL 1352
Query: 694 YADNTSAI*FVANPVFHERTKLIEVDCHSIRDAYDE*LISLPHVSTQLQIADILTKAVPR 753
Y DNT+A+ V N VFHERTK +E+DC+ R+A + + V T Q+AD LTKA+
Sbjct: 1353 YCDNTAALHIVNNSVFHERTKFVELDCYKTREAVESGFLKTMFVETGEQVADPLTKAIHP 1412
Query: 754 PRHQFLVGKL 763
+ L+GK+
Sbjct: 1413 AQFHKLIGKM 1422
>pir||G86301 probable retroelement polyprotein [imported] - Arabidopsis thaliana
gi|9989054|gb|AAG10817.1| Putative retroelement
polyprotein [Arabidopsis thaliana]
Length = 1413
Score = 538 bits (1387), Expect = e-151
Identities = 303/776 (39%), Positives = 446/776 (57%), Gaps = 30/776 (3%)
Query: 1 SMFKKFLTYVETQFQSSVKIFRSDSSGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAE 60
++F F+T VETQ+ + VK RSD++ E +F+E + KGI++ SCP TP+QN + E
Sbjct: 636 TIFPDFITMVETQYDTKVKAVRSDNAPEL---KFEELYRRKGIVAYHSCPETPEQNSVVE 692
Query: 61 RKNRHLLDVTRSLLF*AFIPPRFWVEALATTVFLINRLPSIVIEFDSPFFHLFKFQLDYS 120
RK++H+L+V R+LLF + IP +W + + T VF+INR PS VI + F L K DY+
Sbjct: 693 RKHQHILNVARALLFQSQIPLSYWGDCILTAVFIINRTPSPVISNKTLFEMLTKKVPDYT 752
Query: 121 DLHTFGCVCFVHLPPFEKHKLGVQSVQCAFMGYSNSHKGFVCYDVSNHSFRVSRNVTFFY 180
L +FGC+C+ P ++HK ++ CAF+GY + +KG+ D+ +H+ +SRNV F+
Sbjct: 753 HLKSFGCLCYASTSPKQRHKFEDRARTCAFLGYPSGYKGYKLLDLESHTIFISRNVVFYE 812
Query: 181 NQFMLHSISPDINDITILPNFSIMPQSIERYKPEFTYVRKCIKQIPTAPPDTEPPPDPEP 240
+ F + + + ++ + ++ +P ++ P
Sbjct: 813 DLFPFKTKPAENEESSVFFPHIYVDRNDSHPSQPLPVQETSASNVPAEKQNSRVSRPPAY 872
Query: 241 VEPRRSSRTSRAPDR-----FSPDRYGSKHTSLTASLSSISIPTCYSQVVKDVRWIKAMN 295
++ + + + D S + +++ I P Y+Q + W AM
Sbjct: 873 LKDYHCNSVTSSTDHPISEVLSYSSLSDPYMIFINAVNKIPEPHTYAQARQIKEWCDAMG 932
Query: 296 EEIQALQENLTWDIVSCPPDIKPMGCKWVYSVKLNSDGSLNRYNARLVTLRNKQEYGIDY 355
EI AL++N TW + S P K +GCKWVY +KLN+DGSL RY ARLV Q G+DY
Sbjct: 933 MEITALEDNGTWVVCSLPVGKKAVGCKWVYKIKLNADGSLERYKARLVAKGYTQTEGLDY 992
Query: 356 DETFAPVAKMTTV--------------HQMDVKNIFLHGDLAEDIYMTPPQGLFSSSKG- 400
+TF+PVAK+TTV Q+D+ N FL+G L E+IYMT P G +S +G
Sbjct: 993 VDTFSPVAKLTTVKLLIAVAAAKGWSLSQLDISNAFLNGSLDEEIYMTLPPG-YSPRQGD 1051
Query: 401 ------VCKLKRSLYGLKQAPRAWYEKFCSSLLGFCFSHSQYNSSLFIHRTSTGIVLLLL 454
VC+LK+SLYGLKQA R WY KF SL F+ S + +LF ++ + +L+
Sbjct: 1052 SFPPNAVCRLKKSLYGLKQASRQWYLKFSESLKALGFTQSSGDHTLFTRKSKNSYMAVLV 1111
Query: 455 YVDDMVITGSDNAFIQRLKEQLHASFHMKDLGNLHYFLGLEVHSTSKGIFLHQHKYATYL 514
YVDD++I S + + L++ L S ++DLG L YFLGLE+ + GI + Q KY L
Sbjct: 1112 YVDDIIIASSCDRETELLRDALQRSSKLRDLGTLRYFLGLEIARNTDGISICQRKYTLEL 1171
Query: 515 ISMAGLQSTNPVDTPLEVNVKYHRDDGDILPDPLLYRQLVGSLN*LTITRPDISFVVQQV 574
++ GL P+E N K ++DG+++ D YR+LVG L LT TRPDI++ V ++
Sbjct: 1172 LAETGLLGCKSSSVPMEPNQKLSQEDGELIDDAEHYRKLVGKLMYLTFTRPDITYAVHRL 1231
Query: 575 SQFMHSPRHLHLAAVRRIIRYLKGSSHCGLFFSIGNYPKLSAYSDANWAGCPDTRHSVTS 634
QF +PR HL AV +II YLKG+ GLF+S KLS ++D++++ C D+R T
Sbjct: 1232 CQFTSAPRVPHLKAVYKIIYYLKGTVGQGLFYSANVDLKLSGFADSDFSSCSDSRKLTTG 1291
Query: 635 WCMFLRSSLISWKSKKQARVSKSSTESEYRAMYAACSEIIWLRGFLAELGFP*TEPTSLY 694
+CMFL +SL++WKSKKQ +S SS E+EY+AM A E++WLR L +L +E + LY
Sbjct: 1292 YCMFLGTSLVAWKSKKQEVISMSSAEAEYKAMSMAVREMMWLRFLLEDLWIDVSEASVLY 1351
Query: 695 ADNTSAI*FVANPVFHERTKLIEVDCHSIRDAYDE*LISLPHVSTQLQIADILTKA 750
DNT+AI NPVFHERTK IE D H IR+ LI HV T+ Q+ADI K+
Sbjct: 1352 CDNTAAIHIANNPVFHERTKHIERDYHHIREKIILGLIRTLHVRTENQLADIPYKS 1407
>gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 protease homolog from
Arabidopsis thaliana BAC gb|AF080119 and is a member of
the reverse transcriptase family PF|00078
gi|25301706|pir||C86438 hypothetical protein F28K20.17 -
Arabidopsis thaliana
Length = 1415
Score = 524 bits (1350), Expect = e-147
Identities = 306/809 (37%), Positives = 446/809 (54%), Gaps = 53/809 (6%)
Query: 1 SMFKKFLTYVETQFQSSVKIFRSDSSGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAE 60
S+F F VE Q + +K+F+SD GE++S++ + +L GI + SCP TPQQNGLAE
Sbjct: 558 SVFISFQKLVENQLNTKIKVFQSDGGGEFVSNKLKTHLSEHGIHHRISCPYTPQQNGLAE 617
Query: 61 RKNRHLLDVTRSLLF*AFIPPRFWVEALATTVFLINRLPSIVIEFDSPFFHLFKFQLDYS 120
RK+RHL+++ S+LF + P +FWVE+ T ++INRLPS V++ SP+ LF + DYS
Sbjct: 618 RKHRHLVELGLSMLFHSHTPQKFWVESFFTANYIINRLPSSVLKNLSPYEALFGEKPDYS 677
Query: 121 DLHTFGCVCFVHLPPFEKHKLGVQSVQCAFMGYSNSHKGFVCYDVSNHSFRVSRNVTFFY 180
L FG C+ L P ++K +S+QC F+GY++ +KG+ C+ +SRNV F
Sbjct: 678 SLRVFGSACYPCLRPLAQNKFDPRSLQCVFLGYNSQYKGYRCFYPPTGKVYISRNVIFNE 737
Query: 181 NQFMLHSISPDINDITILPNFSI-MPQSIERYKPEFTYVRKCIKQIPTAPPD-------- 231
++ +++P +S + Q+ + K V Q+ + P D
Sbjct: 738 SELPFKE-----KYQSLVPQYSTPLLQAWQHNKISEISVPAAPVQLFSKPIDLNTYAGSQ 792
Query: 232 -TEPPPDPEPVEPRRSSRTSRAP----DRFSPDRYGSKHTSLTASLSSISIP-TCYSQV- 284
TE DPEP S P + ++ + H T S + I P T Y+ +
Sbjct: 793 VTEQLTDPEPTSNNEGSDEEVNPVAEEIAANQEQVINSHAMTTRSKAGIQKPNTRYALIT 852
Query: 285 --------------VKDVRWIKAMNEEIQALQENLTWDIVSCPPDIKPMGCKWVYSVKLN 330
+K W +A++EEI + TW +V D+ + KWV+ KL+
Sbjct: 853 SRMNTAEPKTLASAMKHPGWNEAVHEEINRVHMLHTWSLVPPTDDMNILSSKWVFKTKLH 912
Query: 331 SDGSLNRYNARLVTLRNKQEYGIDYDETFAPVAKMTTVH--------------QMDVKNI 376
DGS+++ ARLV QE G+DY ETF+PV + T+ Q+DV N
Sbjct: 913 PDGSIDKLKARLVAKGFDQEEGVDYLETFSPVVRTATIRLVLDVSTSKGWPIKQLDVSNA 972
Query: 377 FLHGDLAEDIYMTPPQGLFSSSK--GVCKLKRSLYGLKQAPRAWYEKFCSSLLGFCFSHS 434
FLHG+L E ++M P G K VC+L +++YGLKQAPRAW++ F + LL + F S
Sbjct: 973 FLHGELQEPVFMYQPSGFIDPQKPTHVCRLTKAIYGLKQAPRAWFDTFSNFLLDYGFVCS 1032
Query: 435 QYNSSLFIHRTSTGIVLLLLYVDDMVITGSDNAFIQRLKEQLHASFHMKDLGNLHYFLGL 494
+ + SLF+ I+ LLLYVDD+++TGSD + ++ L + L F MKDLG YFLG+
Sbjct: 1033 KSDPSLFVCHQDGKILYLLLYVDDILLTGSDQSLLEDLLQALKNRFSMKDLGPPRYFLGI 1092
Query: 495 EVHSTSKGIFLHQHKYATYLISMAGLQSTNPVDTPLEVNVKYHRDDGDILPDPLLYRQLV 554
++ + G+FLHQ YAT ++ AG+ NP+ TPL + + ++ +P +R L
Sbjct: 1093 QIEDYANGLFLHQTAYATDILQQAGMSDCNPMPTPLPQQL--DNLNSELFAEPTYFRSLA 1150
Query: 555 GSLN*LTITRPDISFVVQQVSQFMHSPRHLHLAAVRRIIRYLKGSSHCGLFFSIGNYPKL 614
G L LTITRPDI F V + Q MHSP ++RI+RY+KG+ GL + L
Sbjct: 1151 GKLQYLTITRPDIQFAVNFICQRMHSPTTSDFGLLKRILRYIKGTIGMGLPIKRNSTLTL 1210
Query: 615 SAYSDANWAGCPDTRHSVTSWCMFLRSSLISWKSKKQARVSKSSTESEYRAMYAACSEII 674
SAYSD++ AGC +TR S T +C+ L S+LISW +K+Q VS SSTE+EYRA+ A EI
Sbjct: 1211 SAYSDSDHAGCKNTRRSTTGFCILLGSNLISWSAKRQPTVSNSSTEAEYRALTYAAREIT 1270
Query: 675 WLRGFLAELGFP*TEPTSLYADNTSAI*FVANPVFHERTKLIEVDCHSIRDAYDE*LISL 734
W+ L +LG P PT +Y DN SA+ ANP H R+K + D H IR+ LI
Sbjct: 1271 WISFLLRDLGIPQYLPTQVYCDNLSAVYLSANPALHNRSKHFDTDYHYIREQVALGLIET 1330
Query: 735 PHVSTQLQIADILTKAVPRPRHQFLVGKL 763
H+S Q+AD+ TK++PR L KL
Sbjct: 1331 QHISATFQLADVFTKSLPRRAFVDLRSKL 1359
>gb|AAP51971.1| putative copia-type polyprotein [Oryza sativa (japonica
cultivar-group)] gi|37530764|ref|NP_919684.1| putative
copia-type polyprotein [Oryza sativa (japonica
cultivar-group)] gi|20042923|gb|AAM08751.1| Putative
copia-type polyprotein [Oryza sativa (japonica
cultivar-group)]
Length = 1803
Score = 509 bits (1310), Expect = e-142
Identities = 295/793 (37%), Positives = 419/793 (52%), Gaps = 54/793 (6%)
Query: 4 KKFLTYVETQFQSSVKIFRSDSSGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAERKN 63
+ F Y TQF V ++D+ EY S+ + L G + + SCP + QQNG AER
Sbjct: 603 RSFFAYAHTQFGLPVLALQTDNGKEYDSYALRSLLSLHGAVLRLSCPYSSQQNGKAERIL 662
Query: 64 RHLLDVTRSLLF*AFIPPRFWVEALATTVFLINRLPSIVIEFDSPFFHLFKFQLDYSDLH 123
R + D R++L + P FW EAL T + LINR P P+ L Y L
Sbjct: 663 RTINDCVRTMLVHSAAPLSFWAEALQTAMHLINRRPCRATGSLKPYQLLLGAPPTYDHLR 722
Query: 124 TFGCVCFVHLPPFEKHKLGVQSVQCAFMGYSNSHKGFVCYDVSNHSFRVSRNVTFFYNQF 183
FGC+C+ + HKL +S+ C F+GY H+G+ CYD+ + SR+VTF + F
Sbjct: 723 VFGCLCYPNTIATAPHKLSPRSLACVFIGYPADHRGYRCYDMVSRRVFTSRHVTFVEDVF 782
Query: 184 MLHSIS---------PDINDITILPNFSIMPQSIERYKPEFTYVRKCIKQIPTAPP---- 230
PD D TI+ ++P + T V +PP
Sbjct: 783 PFRDAPSPRPSAPPPPDHGDDTIV----LLPAPAQHV---VTPVGTAPAHDAASPPSPAS 835
Query: 231 ----------DTEPPPDPEP-----VEPRRSSRTSRAPDRFSPDRYGSKHTSLTASLSSI 275
D PPP PE P R + T+RA S + ++TA+ +
Sbjct: 836 STPSSAAPAHDVAPPPSPETSSPASASPPRHAMTTRARAGISKP---NPRYAMTATSTLS 892
Query: 276 SIPTCYSQVVKDVRWIKAMNEEIQALQENLTWDIVSCPPDIKPMGCKWVYSVKLNSDGSL 335
P+ ++D W AM E AL N TW +V PP + + KWV+ KL++DGSL
Sbjct: 893 PTPSSVRVALRDPNWRAAMQAEFDALLANRTWTLVPRPPGARIITGKWVFKTKLHADGSL 952
Query: 336 NRYNARLVTLRNKQEYGIDYDETFAPVAKMTTV--------------HQMDVKNIFLHGD 381
++Y AR V Q G+D+ ETF+PV K T+ HQ+DV N FLHG
Sbjct: 953 DKYKARWVVRGFNQRPGVDFGETFSPVVKPATIRTVLTLISSKQWPAHQLDVSNAFLHGH 1012
Query: 382 LAEDIYMTPPQGLFSSSK--GVCKLKRSLYGLKQAPRAWYEKFCSSLLGFCFSHSQYNSS 439
L E + P G +++ VC L RSLYGL+QAPRAW+++F F S+ + S
Sbjct: 1013 LQERVLCQQPTGFEDAARPADVCLLSRSLYGLRQAPRAWFKRFADHATSLGFVQSRADPS 1072
Query: 440 LFIHRTSTGIVLLLLYVDDMVITGSDNAFIQRLKEQLHASFHMKDLGNLHYFLGLEVHST 499
LF+ R + LLLYVDDM+++ S ++ +QR+ ++L A F +KD+G L YFLG+EV T
Sbjct: 1073 LFVLRRGSDTAYLLLYVDDMILSASSSSLLQRIIDRLQAEFKVKDMGPLKYFLGIEVQRT 1132
Query: 500 SKGIFLHQHKYATYLISMAGLQSTNPVDTPLEVNVKYHRDDGDILPDPLLYRQLVGSLN* 559
+ G L Q KYAT ++ AG+ + V TP + K D+G + D YR + G+L
Sbjct: 1133 ADGFVLSQSKYATDVLERAGMANCKAVATPADAKPKLSSDEGPLFQDSSWYRSIAGALQY 1192
Query: 560 LTITRPDISFVVQQVSQFMHSPRHLHLAAVRRIIRYLKGSSHCGLFFSIGNYPKLSAYSD 619
LT+TRPDI++ VQQV MH+PR H+ ++RI+RY+KG++ GL P L+A+SD
Sbjct: 1193 LTLTRPDIAYAVQQVCLHMHAPREAHVTLLKRILRYIKGTAAFGLHLRASTSPTLTAFSD 1252
Query: 620 ANWAGCPDTRHSVTSWCMFLRSSLISWKSKKQARVSKSSTESEYRAMYAACSEIIWLRGF 679
A+WAGCPDTR S + +C+FL SLISW SK+Q VS+SS E+EYR + A +E WLR
Sbjct: 1253 ADWAGCPDTRRSTSGFCIFLGDSLISWSSKRQTTVSRSSAEAEYRGVANAVAECTWLRQL 1312
Query: 680 LAELGFP*TEPTSLYADNTSAI*FVANPVFHERTKLIEVDCHSIRDAYDE*LISLPHVST 739
L EL + T Y DN S++ NPV H+RTK IE+D H +R+ + + + +
Sbjct: 1313 LGELHCRVPQATIAYCDNISSVYMSKNPVHHKRTKHIELDIHFVREKVALGELRVLPIPS 1372
Query: 740 QLQIADILTKAVP 752
Q AD+ TK +P
Sbjct: 1373 AHQFADVFTKGLP 1385
>gb|AAT38758.1| putative gag-pol polyprotein [Solanum demissum]
Length = 1333
Score = 498 bits (1283), Expect = e-139
Identities = 292/789 (37%), Positives = 435/789 (55%), Gaps = 38/789 (4%)
Query: 3 FKKFLTYVETQFQSSVKIFRSDSSGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAERK 62
FKKF +VE Q + +K R+D GE++S++F + + GI + + P TP+QNG+AERK
Sbjct: 551 FKKFKAFVENQSGNKIKSLRTDRGGEFLSNDFNLFCEENGIRRELTAPYTPEQNGVAERK 610
Query: 63 NRHLLDVTRSLLF*AFIPPRFWVEALATTVFLINRLPSIVIEFDSPFFHLFKFQLDYSDL 122
NR ++++ RS L +P FW EA+AT V+ +N P+ + +P + S L
Sbjct: 611 NRTVVEMARSSLKAKGLPDYFWGEAVATVVYFLNISPTKDVWNTTPLEAWNGKKPRVSHL 670
Query: 123 HTFGCVCFVHLPPFEKHKLGVQSVQCAFMGYSNSHKGFVCYDVSNHSFRVSRNVTFFYNQ 182
FGC+ + L F KL +S +C F+GYS K + Y+ + +SRNV F
Sbjct: 671 RIFGCIAYA-LVNFHS-KLDEKSTKCIFVGYSLQSKAYRLYNPISGKVIISRNVVFN--- 725
Query: 183 FMLHSISPDINDITILPNFSIMPQSIERY-----KPEFTYVRKCIKQIPTAPPDTEPPPD 237
+S + N ++ N ++P E P + V + P AP T P +
Sbjct: 726 ---EDVSWNFNSGNMMSNIQLLPTDEESAVDFGNSPNSSPVSSSVSS-PIAPSTTVAPDE 781
Query: 238 P--EPVEPRRSSRTSRAPDRFSPDRYGSKHTSLTASLSSISIPTCYSQVVKDVRWIKAMN 295
EP+ RRS+R + ++S + +TS +L +S P CY + V+ W AM
Sbjct: 782 SSVEPIPLRRSTREKKPNPKYS----NTVNTSCQFALL-VSDPICYEEAVEQSEWKNAMI 836
Query: 296 EEIQALQENLTWDIVSCPPDIKPMGCKWVYSVKLNSDGSLNRYNARLVTLRNKQEYGIDY 355
EEIQA++ N TW++V P +G KWV+ K N+DGS+ ++ ARLV Q+ G+D+
Sbjct: 837 EEIQAIERNSTWELVDAPEGKNVIGLKWVFRTKYNADGSIQKHKARLVAKGYSQQQGVDF 896
Query: 356 DETFAPVAKMTTV--------------HQMDVKNIFLHGDLAEDIYMTPPQGLF--SSSK 399
DETF+PVA+ TV +Q DVK+ FL+GDL E++Y++ PQG +
Sbjct: 897 DETFSPVARFETVRVVLALAAQLHLPVYQFDVKSAFLNGDLEEEVYVSQPQGFMITGNEN 956
Query: 400 GVCKLKRSLYGLKQAPRAWYEKFCSSLLGFCFSHSQYNSSLFIHRTSTGIVLLL-LYVDD 458
V KL+++LYGLKQAPRAWY K S G F S +L++ + T LL+ LYVDD
Sbjct: 957 KVYKLRKALYGLKQAPRAWYSKIDSFFQGSGFRRSDNEPTLYLKKQGTDEFLLVCLYVDD 1016
Query: 459 MVITGSDNAFIQRLKEQLHASFHMKDLGNLHYFLGLEVHSTSKGIFLHQHKYATYLISMA 518
M+ GS + + K + +F M DLG L YFLGLEV GIF+ Q KYA L+
Sbjct: 1017 MIYIGSSKSLVNDFKSNMMRNFEMSDLGLLKYFLGLEVIQDKDGIFISQKKYAEDLLKKF 1076
Query: 519 GLQSTNPVDTPLEVNVKYHRDDGDILPDPLLYRQLVGSLN*LTITRPDISFVVQQVSQFM 578
+ + TP+ +N K R DG +P L+R LVG LN LT TRPDI+F V VS+F+
Sbjct: 1077 QMMNCEVATTPMNINEKLQRADGTEKANPKLFRSLVGGLNYLTHTRPDIAFSVSVVSRFL 1136
Query: 579 HSPRHLHLAAVRRIIRYLKGSSHCGLFFSIGNYPKLSAYSDANWAGCPDTRHSVTSWCMF 638
SP H A +R++RY+ G++ G+++S +L ++D+++AGC D R S + C
Sbjct: 1137 QSPTKQHFGAAKRVLRYVAGTTDFGIWYSKAPNFRLVGFTDSDYAGCLDDRKSTSGSCFS 1196
Query: 639 LRSSLISWKSKKQARVSKSSTESEYRAMYAACSEIIWLRGFLAELGFP*TEPTSLYADNT 698
S +++W SKKQ V+ S++E+EY A A + +WLR L + + E T +++D+
Sbjct: 1197 FGSGVVTWSSKKQETVALSTSEAEYTAASLAARQALWLRKLLEDFSYEQKESTEIFSDSK 1256
Query: 699 SAI*FVANPVFHERTKLIEVDCHSIRDAYDE*LISLPHVSTQLQIADILTKAVPRPRHQF 758
SAI NP FH RTK I+V H IR + I L ST Q ADI TK++P+ +H++
Sbjct: 1257 SAIAMAKNPSFHGRTKHIDVQYHFIRTLVADGRIVLKFCSTNEQAADIFTKSLPQAKHEY 1316
Query: 759 LVGKLMIFD 767
+L + D
Sbjct: 1317 FRLQLGVCD 1325
>gb|AAG50765.1| copia-type polyprotein, putative [Arabidopsis thaliana]
gi|12321254|gb|AAG50698.1| copia-type polyprotein,
putative [Arabidopsis thaliana] gi|25301687|pir||F96614
probable copia-type polyprotein T18I24.5 [imported] -
Arabidopsis thaliana
Length = 1320
Score = 483 bits (1243), Expect = e-134
Identities = 280/769 (36%), Positives = 408/769 (52%), Gaps = 62/769 (8%)
Query: 2 MFKKFLTYVETQFQSSVKIFRSDSSGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAER 61
+FKKF +VE + +K RSD GE+ S EF +Y + GI Q + P +PQQNG+AER
Sbjct: 574 IFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVAER 633
Query: 62 KNRHLLDVTRSLLF*AFIPPRFWVEALATTVFLINRLPSIVIEFDSPFFHLFKFQLDYSD 121
KNR +L++ RS+L +P W EA+A V+L+NR P+ + +P + S
Sbjct: 634 KNRTILEMARSMLKSKRLPKELWAEAVACAVYLLNRSPTKSVSGKTPQEAWSGRKPGVSH 693
Query: 122 LHTFGCVCFVHLPPFEKHKLGVQSVQCAFMGYSNSHKGFVCYDVSNHSFRVSRNVTFFYN 181
L FG + H+P ++ KL +S + F+GY N+ KG+ Y+ +SRN+ F
Sbjct: 694 LRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKGYKLYNPDTKKTIISRNIVFDEE 753
Query: 182 -QFMLHSISPDINDITILPNFSIMPQSIERYKPEFTYVRKCIKQIPTAPPDTEPPPDPEP 240
++ +S D N P+F E KPE P E PP EP
Sbjct: 754 GEWDWNSNEEDYN---FFPHF-------EEDKPE---------------PTREEPPSEEP 788
Query: 241 VEPRRSSRTSRAPDRFSPDRYGSKHTSLTASLSSISIPTCYSQVVKDVRWIKAMNEEIQA 300
P S +S+ ++ P + + ++ W AM+EEI++
Sbjct: 789 TTPPTSPTSSQIEEKCEP--------------------MDFQEAIEKKTWRNAMDEEIKS 828
Query: 301 LQENLTWDIVSCPPDIKPMGCKWVYSVKLNSDGSLNRYNARLVTLRNKQEYGIDYDETFA 360
+Q+N TW++ S P K +G KWVY K NS G + RY ARLV Q GIDYDE FA
Sbjct: 829 IQKNDTWELTSLPNGHKAIGVKWVYKAKKNSKGEVERYKARLVAKGYSQRAGIDYDEVFA 888
Query: 361 PVAKMTTV--------------HQMDVKNIFLHGDLAEDIYMTPPQGLFSSSKG--VCKL 404
PVA++ TV HQMDVK+ FL+GDL E++Y+ PQG + V +L
Sbjct: 889 PVARLETVRLIISLAAQNKWKIHQMDVKSAFLNGDLEEEVYIEQPQGYIVKGEEDKVLRL 948
Query: 405 KRSLYGLKQAPRAWYEKFCSSLLGFCFSHSQYNSSLFIHRTSTGIVLLLLYVDDMVITGS 464
K++LYGLKQAPRAW + F Y +L+I I++ LYVDD++ TG+
Sbjct: 949 KKALYGLKQAPRAWNTRIDKYFKEKDFIKCPYEHALYIKIQKEDILIACLYVDDLIFTGN 1008
Query: 465 DNAFIQRLKEQLHASFHMKDLGNLHYFLGLEVHSTSKGIFLHQHKYATYLISMAGLQSTN 524
+ + + K+++ F M D+G + Y+LG+EV GIF+ Q YA ++ + +N
Sbjct: 1009 NPSMFEEFKKEMTKEFEMTDIGLMSYYLGIEVKQEDNGIFITQEGYAKEVLKKFKMDDSN 1068
Query: 525 PVDTPLEVNVKYHRDDGDILPDPLLYRQLVGSLN*LTITRPDISFVVQQVSQFMHSPRHL 584
PV TP+E +K + + DP ++ LVGSL LT TRPDI + V VS++M P
Sbjct: 1069 PVCTPMECGIKLSKKEEGEGVDPTTFKSLVGSLRYLTCTRPDILYAVGVVSRYMEHPTTT 1128
Query: 585 HLAAVRRIIRYLKGSSHCGLFFSIGNYPKLSAYSDANWAGCPDTRHSVTSWCMFLRSSLI 644
H A +RI+RY+KG+ + GL +S + KL YSD++W G D R S + + ++ +
Sbjct: 1129 HFKAAKRILRYIKGTVNFGLHYSTTSDYKLVGYSDSDWGGDVDDRKSTSGFVFYIGDTAF 1188
Query: 645 SWKSKKQARVSKSSTESEYRAMYAACSEIIWLRGFLAELGFP*TEPTSLYADNTSAI*FV 704
+W SKKQ V+ S+ E+EY A + IWLR L EL P EPT ++ DN SAI
Sbjct: 1189 TWMSKKQPIVTLSTCEAEYVAATSCVCHAIWLRNLLKELSLPQEEPTKIFVDNKSAIALA 1248
Query: 705 ANPVFHERTKLIEVDCHSIRDAYDE*LISLPHVSTQLQIADILTKAVPR 753
NPVFH+R+K I+ H IR+ + + L +V T Q+ADI TK + R
Sbjct: 1249 KNPVFHDRSKHIDTRYHYIRECVSKKDVQLEYVKTHDQVADIFTKPLKR 1297
>gb|AAG60117.1| copia-type polyprotein, putative [Arabidopsis thaliana]
Length = 1352
Score = 482 bits (1240), Expect = e-134
Identities = 284/772 (36%), Positives = 414/772 (52%), Gaps = 36/772 (4%)
Query: 2 MFKKFLTYVETQFQSSVKIFRSDSSGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAER 61
+FKKF +VE + +K RSD GE+ S EF +Y + GI Q + P +PQQNG+AER
Sbjct: 574 IFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVAER 633
Query: 62 KNRHLLDVTRSLLF*AFIPPRFWVEALATTVFLINRLPSIVIEFDSPFFHLFKFQLDYSD 121
KNR +L++ RS+L +P W EA+A V+L+NR P+ + +P + S
Sbjct: 634 KNRTILEMARSMLKSKRLPKELWAEAVACAVYLLNRSPTKSVSGKTPQEAWSGRKSGVSH 693
Query: 122 LHTFGCVCFVHLPPFEKHKLGVQSVQCAFMGYSNSHKGFVCYDVSNHSFRVSRNVTFFYN 181
L FG + H+P ++ KL +S + F+GY N+ KG+ Y+ +SRN+ F
Sbjct: 694 LRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKGYKLYNPDTKKTIISRNIVFDEE 753
Query: 182 -QFMLHSISPDINDITILPNFSIMPQSIERYKPEFTYVRKCIKQIPTAPPDTEPPPDP-- 238
++ +S D N P+F E +PE T ++ P + T PP P
Sbjct: 754 GEWDWNSNEEDYN---FFPHF-------EEDEPEPT------REEPPSEEPTTPPTSPTS 797
Query: 239 EPVEPRRSSRTSRAPDRFSPDRYGSKHTSLTA-SLSSISIPTCYSQVVKDVRWIKAMNEE 297
+E S RT R +LT L + P + + ++ W AM+EE
Sbjct: 798 SQIEESSSERTPRFRSIQELYEVTENQENLTLFCLFAECEPMDFQEAIEKKTWRNAMDEE 857
Query: 298 IQALQENLTWDIVSCPPDIKPMGCKWVYSVKLNSDGSLNRYNARLVTLRNKQEYGIDYDE 357
I+++Q+N TW++ S P K +G KWVY K NS G + RY ARLV Q GIDYDE
Sbjct: 858 IKSIQKNDTWELTSLPNGHKTIGVKWVYKAKKNSKGEVERYKARLVAKGYIQRAGIDYDE 917
Query: 358 TFAPVAKMTTV--------------HQMDVKNIFLHGDLAEDIYMTPPQGLFSSSKG--V 401
FAPVA++ TV HQMDVK+ FL+GDL E++Y+ PQG + V
Sbjct: 918 VFAPVARLETVRLIISLAAQNKWKIHQMDVKSAFLNGDLEEEVYIEQPQGYIVKGEEDKV 977
Query: 402 CKLKRSLYGLKQAPRAWYEKFCSSLLGFCFSHSQYNSSLFIHRTSTGIVLLLLYVDDMVI 461
+LK++LYGLKQAPRAW + F Y +L+I I++ LYVDD++
Sbjct: 978 LRLKKALYGLKQAPRAWNTRIDKYFKEKDFIKCPYEHALYIKIQKEDILIACLYVDDLIF 1037
Query: 462 TGSDNAFIQRLKEQLHASFHMKDLGNLHYFLGLEVHSTSKGIFLHQHKYATYLISMAGLQ 521
TG++ + + K+++ F M D+G + Y+LG+EV GIF+ Q YA ++ +
Sbjct: 1038 TGNNPSMFEEFKKEMTKEFEMTDIGLMSYYLGIEVKQEDNGIFITQEGYAKEVLKKFKMD 1097
Query: 522 STNPVDTPLEVNVKYHRDDGDILPDPLLYRQLVGSLN*LTITRPDISFVVQQVSQFMHSP 581
+NPV TP+E +K + + DP ++ LVGSL LT TRPDI + V VS++M P
Sbjct: 1098 DSNPVCTPMECGIKLSKKEEGEGVDPTTFKSLVGSLRYLTCTRPDILYAVGVVSRYMEHP 1157
Query: 582 RHLHLAAVRRIIRYLKGSSHCGLFFSIGNYPKLSAYSDANWAGCPDTRHSVTSWCMFLRS 641
H A +RI+RY+KG+ + GL +S + KL YSD++W G D R S + + ++
Sbjct: 1158 TTTHFKAAKRILRYIKGTVNFGLHYSTTSDYKLVGYSDSDWGGDVDDRKSTSGFVFYIGD 1217
Query: 642 SLISWKSKKQARVSKSSTESEYRAMYAACSEIIWLRGFLAELGFP*TEPTSLYADNTSAI 701
+ +W SKKQ V S+ E+EY A + IWLR L EL P EPT ++ DN SAI
Sbjct: 1218 TAFTWMSKKQPIVVLSTCEAEYVAATSCVCHAIWLRNLLKELSLPQEEPTKIFVDNKSAI 1277
Query: 702 *FVANPVFHERTKLIEVDCHSIRDAYDE*LISLPHVSTQLQIADILTKAVPR 753
NPVFH+R+K I+ H IR+ + + L +V T Q+ADI TK + R
Sbjct: 1278 ALAKNPVFHDRSKHIDTRYHYIRECVSKKDVQLEYVKTHDQVADIFTKPLKR 1329
>gb|AAP46257.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|50919599|ref|XP_470160.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1335
Score = 480 bits (1236), Expect = e-134
Identities = 279/782 (35%), Positives = 424/782 (53%), Gaps = 51/782 (6%)
Query: 2 MFKKFLTYVETQFQSSVKIFRSDSSGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAER 61
+FKKF VE Q +K+ RSD EY+S EF++Y ++ GI Q + + QQNG+AER
Sbjct: 564 IFKKFKAMVENQSNRKIKVLRSDQGREYISKEFEKYCENAGIRRQLTAGYSAQQNGVAER 623
Query: 62 KNRHLLDVTRSLLF*AFIPPRFWVEALATTVFLINRLPSIVIEFDSPFFHLFKFQLDYSD 121
KNR + D+ S+L +P FW EA+ T V+++NR P+ + +PF + +
Sbjct: 624 KNRTINDMANSMLQDKGMPKSFWAEAVNTAVYILNRSPTKAVTNRTPFEAWYGKKPVIGH 683
Query: 122 LHTFGCVCFVHLPPFEKHKLGVQSVQCAFMGYSNSHKGFVCYDVSNHSFRVSRNVTFFYN 181
+ FGC+C+ +P ++ K +S +C F+GY++ KG+ Y++ +SR+ F
Sbjct: 684 MRVFGCICYAQVPAQKRVKFDNKSDRCIFVGYADGIKGYRLYNLEKKKIIISRDA-IFDE 742
Query: 182 QFMLHSISPDINDITILPNFSIM--------PQSIERYKPE---FTYVRKCIKQIPTAPP 230
+ SP+ + +LP +I +E + P + + ++P
Sbjct: 743 SATWNWKSPEASSTPLLPTTTITLGQPHMHGTHEVEDHTPSPQPSSPMSSSSASSDSSPS 802
Query: 231 DTEPPPDPEPVEPRRSSRTSRAPDRFSPDRYGSKHTSLTASLSSISIPTCYSQVVKDVRW 290
E PE PRR + S R +H S+ P + + K W
Sbjct: 803 SEEQISTPESA-PRRVRSMVELLESTSQQRGSEQHEFCNYSVVE---PQSFQEAEKHDNW 858
Query: 291 IKAMNEEIQALQENLTWDIVSCPPDIKPMGCKWVYSVKLNSDGSLNRYNARLVTLRNKQE 350
IKAM +EI +++N TW++V P D + +G KWVY KLN DGS+ +Y ARLV KQ+
Sbjct: 859 IKAMEDEIHMIEKNNTWELVDRPRDREVIGVKWVYKTKLNPDGSVQKYKARLVAKGFKQK 918
Query: 351 YGIDYDETFAPVAKMTTV--------------HQMDVKNIFLHGDLAEDIYMTPPQGLFS 396
GIDY ET+APVA++ T+ +Q+DVK+ FL+G L E+IY+ P+G FS
Sbjct: 919 PGIDYYETYAPVARLETIRTIIALAAQKRWKIYQLDVKSAFLNGYLDEEIYVEQPEG-FS 977
Query: 397 SSKG---VCKLKRSLYGLKQAPRAWYEKFCSSLLGFCFSHSQYNSSLFIHRTSTGIVLLL 453
G V +LK++LYGLKQAPRAWY + + F+ S +L++++T T I+++
Sbjct: 978 VQGGENKVFRLKKALYGLKQAPRAWYSQIDKYFIQKGFAKSISEPTLYVNKTGTDILIVS 1037
Query: 454 LYVDDMVITGSDNAFIQRLKEQLHASFHMKDLGNLHYFLGLEVHSTSKGIFLHQHKYATY 513
LYVDD++ TG+ +Q K+ + ++ M DLG LHYFLG+EVH + +GIF+ Q KYA
Sbjct: 1038 LYVDDLIYTGNSEKMMQDFKKDMMHTYEMSDLGLLHYFLGMEVHQSDEGIFISQRKYAEN 1097
Query: 514 LISMAGLQSTNPVDTPLEVNVKYHRDDGDILPDPLLYRQLVGSLN*LTITRPDISFVVQQ 573
++ + + V TPL N K DG DP +YR LVGSL LT TRPDI F
Sbjct: 1098 ILKKFKMDNCKSVTTPLLPNEKQKARDGADKADPTIYRSLVGSLLYLTATRPDIMFAASL 1157
Query: 574 VSQFMHSPRHLHLAAVRRIIRYLKGSSHCGLFFSIGNYPKLSAYSDANWAGCPDTRHSVT 633
+S++M SP L+ A +R++RY+KG++ G+++ KL Y+D++WAGC D S +
Sbjct: 1158 LSRYMSSPSQLNFTAAKRVLRYIKGTADYGIWYKPVKESKLIGYTDSDWAGCLDDMKSTS 1217
Query: 634 SWCMFLRSSLISWKSKKQARVSKSSTESEYRAMYAACSEIIWLRGFLAELGFP*TEPTSL 693
+ S S E+EY A A S+++WLR + +LG +PT++
Sbjct: 1218 GYAF-----------------SLGSAEAEYVAASKAVSQVVWLRRIMEDLGEKQYQPTTI 1260
Query: 694 YADNTSAI*FVANPVFHERTKLIEVDCHSIRDAYDE*LISLPHVSTQLQIADILTKAVPR 753
Y D+ SAI NPV H+RTK I + H IR+A D + L T Q+ADI TKA+ +
Sbjct: 1261 YCDSKSAIAISENPVSHDRTKHIAIKYHYIREAVDRQEVKLEFCRTDEQLADIFTKALSK 1320
Query: 754 PR 755
+
Sbjct: 1321 EK 1322
>gb|AAD50001.1| Hypothetical protein [Arabidopsis thaliana] gi|25301681|pir||F86246
hypothetical protein [imported] - Arabidopsis thaliana
Length = 1352
Score = 479 bits (1234), Expect = e-133
Identities = 282/772 (36%), Positives = 412/772 (52%), Gaps = 36/772 (4%)
Query: 2 MFKKFLTYVETQFQSSVKIFRSDSSGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAER 61
+FKKF +VE + +K RSD GE+ S EF +Y + GI Q + P +PQQNG+ ER
Sbjct: 574 IFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVVER 633
Query: 62 KNRHLLDVTRSLLF*AFIPPRFWVEALATTVFLINRLPSIVIEFDSPFFHLFKFQLDYSD 121
KNR +L++ RS+L +P W EA+A V+L+NR P+ + +P + S
Sbjct: 634 KNRTILEMARSMLKSKRLPKELWAEAVACAVYLLNRSPTKSVSGKTPQEAWSGRKPGVSH 693
Query: 122 LHTFGCVCFVHLPPFEKHKLGVQSVQCAFMGYSNSHKGFVCYDVSNHSFRVSRNVTFFYN 181
L FG + H+P ++ KL +S + F+GY N+ KG+ Y+ +SRN+ F
Sbjct: 694 LRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKGYKLYNPDTKKTIISRNIVFDEE 753
Query: 182 -QFMLHSISPDINDITILPNFSIMPQSIERYKPEFTYVRKCIKQIPTAPPDTEPPPDP-- 238
++ +S D N P+F E +PE T ++ P + T PP P
Sbjct: 754 GEWDWNSNEEDYN---FFPHF-------EEDEPEPT------REEPPSEEPTTPPTSPTS 797
Query: 239 EPVEPRRSSRTSRAPDRFSPDRYGSKHTSLTA-SLSSISIPTCYSQVVKDVRWIKAMNEE 297
+E S RT R +LT L + P + + ++ W AM+EE
Sbjct: 798 SQIEESSSERTPRFRSIQELYEVTENQENLTLFCLFAECEPMDFQKAIEKKTWRNAMDEE 857
Query: 298 IQALQENLTWDIVSCPPDIKPMGCKWVYSVKLNSDGSLNRYNARLVTLRNKQEYGIDYDE 357
I+++Q+N TW++ S P K +G KWVY K NS G + RY ARLV Q GIDYDE
Sbjct: 858 IKSIQKNDTWELTSLPNGHKAIGVKWVYKAKKNSKGEVERYKARLVAKGYSQRVGIDYDE 917
Query: 358 TFAPVAKMTTV--------------HQMDVKNIFLHGDLAEDIYMTPPQGLFSSSKG--V 401
FAPVA++ TV HQMDVK+ FL+GDL E++Y+ PQG + V
Sbjct: 918 VFAPVARLETVRLIISLAAQNKWKIHQMDVKSAFLNGDLEEEVYIEQPQGYIVKGEEDKV 977
Query: 402 CKLKRSLYGLKQAPRAWYEKFCSSLLGFCFSHSQYNSSLFIHRTSTGIVLLLLYVDDMVI 461
+LK+ LYGLKQAPRAW + F Y +L+I I++ LYVDD++
Sbjct: 978 LRLKKVLYGLKQAPRAWNTRIDKYFKEKDFIKCPYEHALYIKIQKEDILIACLYVDDLIF 1037
Query: 462 TGSDNAFIQRLKEQLHASFHMKDLGNLHYFLGLEVHSTSKGIFLHQHKYATYLISMAGLQ 521
TG++ + + K+++ F M D+G + Y+LG+EV GIF+ Q YA ++ +
Sbjct: 1038 TGNNPSIFEEFKKEMTKEFEMTDIGLMSYYLGIEVKQEDNGIFITQEGYAKEVLKKFKMD 1097
Query: 522 STNPVDTPLEVNVKYHRDDGDILPDPLLYRQLVGSLN*LTITRPDISFVVQQVSQFMHSP 581
+NPV TP+E +K + + DP ++ LVGSL LT TRPDI + V VS++M P
Sbjct: 1098 DSNPVCTPMECGIKLSKKEEGEGVDPTTFKSLVGSLRYLTCTRPDILYAVGVVSRYMEHP 1157
Query: 582 RHLHLAAVRRIIRYLKGSSHCGLFFSIGNYPKLSAYSDANWAGCPDTRHSVTSWCMFLRS 641
H A +RI+RY+KG+ + GL +S + KL YSD++W G D R S + + ++
Sbjct: 1158 TTTHFKAAKRILRYIKGTVNFGLHYSTTSDYKLVGYSDSDWGGDVDDRKSTSGFVFYIGD 1217
Query: 642 SLISWKSKKQARVSKSSTESEYRAMYAACSEIIWLRGFLAELGFP*TEPTSLYADNTSAI 701
+ +W SKKQ V+ S+ E+EY A + IWLR L EL P EPT ++ DN SAI
Sbjct: 1218 TAFTWMSKKQPIVTLSTCEAEYVAATSCVCHAIWLRNLLKELSLPQEEPTKIFVDNKSAI 1277
Query: 702 *FVANPVFHERTKLIEVDCHSIRDAYDE*LISLPHVSTQLQIADILTKAVPR 753
NPVFH+R+K I+ H IR+ + + L +V T Q+AD TK + R
Sbjct: 1278 ALAKNPVFHDRSKHIDTRYHYIRECVSKKDVQLEYVKTHDQVADFFTKPLKR 1329
>emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana] gi|11278364|pir||T47925
copia-type polyprotein - Arabidopsis thaliana
Length = 1352
Score = 479 bits (1234), Expect = e-133
Identities = 282/772 (36%), Positives = 412/772 (52%), Gaps = 36/772 (4%)
Query: 2 MFKKFLTYVETQFQSSVKIFRSDSSGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAER 61
+FKKF +VE + +K RSD GE+ S EF +Y + GI Q + P +PQQNG+ ER
Sbjct: 574 IFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVVER 633
Query: 62 KNRHLLDVTRSLLF*AFIPPRFWVEALATTVFLINRLPSIVIEFDSPFFHLFKFQLDYSD 121
KNR +L++ RS+L +P W EA+A V+L+NR P+ + +P + S
Sbjct: 634 KNRTILEMARSMLKSKRLPKELWAEAVACAVYLLNRSPTKSVSGKTPQEAWSGRKPGVSH 693
Query: 122 LHTFGCVCFVHLPPFEKHKLGVQSVQCAFMGYSNSHKGFVCYDVSNHSFRVSRNVTFFYN 181
L FG + H+P ++ KL +S + F+GY N+ KG+ Y+ +SRN+ F
Sbjct: 694 LRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKGYKLYNPDTKKTIISRNIVFDEE 753
Query: 182 -QFMLHSISPDINDITILPNFSIMPQSIERYKPEFTYVRKCIKQIPTAPPDTEPPPDP-- 238
++ +S D N P+F E +PE T ++ P + T PP P
Sbjct: 754 GEWDWNSNEEDYN---FFPHF-------EEDEPEPT------REEPPSEEPTTPPTSPTS 797
Query: 239 EPVEPRRSSRTSRAPDRFSPDRYGSKHTSLTA-SLSSISIPTCYSQVVKDVRWIKAMNEE 297
+E S RT R +LT L + P + + ++ W AM+EE
Sbjct: 798 SQIEESSSERTPRFRSIQELYEVTENQENLTLFCLFAECEPMDFQKAIEKKTWRNAMDEE 857
Query: 298 IQALQENLTWDIVSCPPDIKPMGCKWVYSVKLNSDGSLNRYNARLVTLRNKQEYGIDYDE 357
I+++Q+N TW++ S P K +G KWVY K NS G + RY ARLV Q GIDYDE
Sbjct: 858 IKSIQKNDTWELTSLPNGHKAIGVKWVYKAKKNSKGEVERYKARLVAKGYSQRVGIDYDE 917
Query: 358 TFAPVAKMTTV--------------HQMDVKNIFLHGDLAEDIYMTPPQGLFSSSKG--V 401
FAPVA++ TV HQMDVK+ FL+GDL E++Y+ PQG + V
Sbjct: 918 VFAPVARLETVRLIISLAAQNKWKIHQMDVKSAFLNGDLEEEVYIEQPQGYIVKGEEDKV 977
Query: 402 CKLKRSLYGLKQAPRAWYEKFCSSLLGFCFSHSQYNSSLFIHRTSTGIVLLLLYVDDMVI 461
+LK+ LYGLKQAPRAW + F Y +L+I I++ LYVDD++
Sbjct: 978 LRLKKVLYGLKQAPRAWNTRIDKYFKEKDFIKCPYEHALYIKIQKEDILIACLYVDDLIF 1037
Query: 462 TGSDNAFIQRLKEQLHASFHMKDLGNLHYFLGLEVHSTSKGIFLHQHKYATYLISMAGLQ 521
TG++ + + K+++ F M D+G + Y+LG+EV GIF+ Q YA ++ +
Sbjct: 1038 TGNNPSIFEEFKKEMTKEFEMTDIGLMSYYLGIEVKQEDNGIFITQEGYAKEVLKKFKID 1097
Query: 522 STNPVDTPLEVNVKYHRDDGDILPDPLLYRQLVGSLN*LTITRPDISFVVQQVSQFMHSP 581
+NPV TP+E +K + + DP ++ LVGSL LT TRPDI + V VS++M P
Sbjct: 1098 DSNPVCTPMECGIKLSKKEEGEGVDPTTFKSLVGSLRYLTCTRPDILYAVGVVSRYMEHP 1157
Query: 582 RHLHLAAVRRIIRYLKGSSHCGLFFSIGNYPKLSAYSDANWAGCPDTRHSVTSWCMFLRS 641
H A +RI+RY+KG+ + GL +S + KL YSD++W G D R S + + ++
Sbjct: 1158 TTTHFKAAKRILRYIKGTVNFGLHYSTTSDYKLVGYSDSDWGGDVDDRKSTSGFVFYIGD 1217
Query: 642 SLISWKSKKQARVSKSSTESEYRAMYAACSEIIWLRGFLAELGFP*TEPTSLYADNTSAI 701
+ +W SKKQ V+ S+ E+EY A + IWLR L EL P EPT ++ DN SAI
Sbjct: 1218 TAFTWMSKKQPIVTLSTCEAEYVAATSCVCHAIWLRNLLKELSLPQEEPTKIFVDNKSAI 1277
Query: 702 *FVANPVFHERTKLIEVDCHSIRDAYDE*LISLPHVSTQLQIADILTKAVPR 753
NPVFH+R+K I+ H IR+ + + L +V T Q+AD TK + R
Sbjct: 1278 ALAKNPVFHDRSKHIDTRYHYIRECVSKKDVQLEYVKTHDQVADFFTKPLKR 1329
>gb|AAP54850.1| putative gag-pol polyprotein [Oryza sativa (japonica cultivar-group)]
gi|37536522|ref|NP_922563.1| putative gag-pol polyprotein
[Oryza sativa (japonica cultivar-group)]
gi|13310887|gb|AAG13591.2| putative gag/pol polyprotein
[Oryza sativa]
Length = 1417
Score = 478 bits (1229), Expect = e-133
Identities = 287/793 (36%), Positives = 404/793 (50%), Gaps = 86/793 (10%)
Query: 4 KKFLTYVETQFQSSVKIFRSDSSGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAERKN 63
+ F Y TQF V ++D+ EY S+ + L G + + SCP + QQNG AER
Sbjct: 603 RSFFAYAHTQFGLPVLALQTDNGKEYDSYALRSLLSLHGAVLRLSCPYSSQQNGKAERIL 662
Query: 64 RHLLDVTRSLLF*AFIPPRFWVEALATTVFLINRLPSIVIEFDSPFFHLFKFQLDYSDLH 123
R + D R++L + P FW EAL T LINR P +P+ L Y L
Sbjct: 663 RTINDYVRTMLVHSAAPLSFWAEALQTATHLINRRPCRATGSLTPYQLLLGAPPTYDHLR 722
Query: 124 TFGCVCFVHLPPFEKHKLGVQSVQCAFMGYSNSHKGFVCYDVSNHSFRVSRNVTFFYNQF 183
FGC+C+ + HKL +S+ C F+GY H+G+ CYD+ + SR+VTF + F
Sbjct: 723 VFGCLCYPNTIATAPHKLSPRSLACVFIGYPADHRGYRCYDMVSRRVFTSRHVTFVEDVF 782
Query: 184 MLHSIS---------PDINDITILPNFSIMPQSIERYKPEFTYVRKCIKQIPTAPP---- 230
PD D TI+ ++P + T V +PP
Sbjct: 783 PFRDAPSPRPSAPPPPDHGDDTIV----LLPAPAQHV---VTPVGTAPAHDAASPPSPAS 835
Query: 231 ----------DTEPPPDPEP-----VEPRRSSRTSRAPDRFSPDRYGSKHTSLTASLSSI 275
D PPP PE P R + T+RA S + ++TA+ +
Sbjct: 836 STPSSAAPAHDVAPPPSPETSSPASASPPRHAMTTRARAGISKP---NPRYAMTATSTLS 892
Query: 276 SIPTCYSQVVKDVRWIKAMNEEIQALQENLTWDIVSCPPDIKPMGCKWVYSVKLNSDGSL 335
P+ ++D W AM E AL N TW +V PP + + KWV+ KL++DGSL
Sbjct: 893 PTPSSVRAALRDPNWRAAMQAEFDALLANRTWTLVPRPPGARIITGKWVFKTKLHADGSL 952
Query: 336 NRYNARLVTLRNKQEYGIDYDETFAPVAKMTTV--------------HQMDVKNIFLHGD 381
++Y AR V Q G+D+ ETF+PV K T+ HQ+DV N FLHG
Sbjct: 953 DKYKARWVVRGFNQRPGVDFGETFSPVVKPATIRTVLTLISSKQWPAHQLDVSNAFLHGH 1012
Query: 382 LAEDIYMTPPQGLFSSSK--GVCKLKRSLYGLKQAPRAWYEKFCSSLLGFCFSHSQYNSS 439
L E + P G +++ VC L RSLYGL+QAPRAW+++F F S+ + S
Sbjct: 1013 LQERVLCQQPTGFEDAARPADVCLLSRSLYGLRQAPRAWFKRFADHATSLGFVQSRADPS 1072
Query: 440 LFIHRTSTGIVLLLLYVDDMVITGSDNAFIQRLKEQLHASFHMKDLGNLHYFLGLEVHST 499
LF+ R + LLLYVDDM+++ S ++ +QR+ ++L A F +KD+G L YFLG+EV T
Sbjct: 1073 LFVLRRGSDTAYLLLYVDDMILSASSSSLLQRIIDRLQAEFKVKDMGPLKYFLGIEVQRT 1132
Query: 500 SKGIFLHQHKYATYLISMAGLQSTNPVDTPLEVNVKYHRDDGDILPDPLLYRQLVGSLN* 559
+ G L Q KYAT D YR + G+L
Sbjct: 1133 ADGFVLSQSKYAT--------------------------------DDSSWYRSIAGALQY 1160
Query: 560 LTITRPDISFVVQQVSQFMHSPRHLHLAAVRRIIRYLKGSSHCGLFFSIGNYPKLSAYSD 619
LT+TRPDI++ VQQV MH+PR H+ ++RI+RY+KG++ GL P L+A+SD
Sbjct: 1161 LTLTRPDIAYAVQQVCLHMHAPREAHVTLLKRILRYIKGTAAFGLHLRASTSPTLTAFSD 1220
Query: 620 ANWAGCPDTRHSVTSWCMFLRSSLISWKSKKQARVSKSSTESEYRAMYAACSEIIWLRGF 679
A+WAGCPDTR S + +C+FL SLISW SK+Q VS+SS E+EYR + A +E WLR
Sbjct: 1221 ADWAGCPDTRRSTSGFCIFLGDSLISWSSKRQTTVSRSSAEAEYRGVANAVAECTWLRQL 1280
Query: 680 LAELGFP*TEPTSLYADNTSAI*FVANPVFHERTKLIEVDCHSIRDAYDE*LISLPHVST 739
L EL + T Y DN S++ NPV H+RTK IE+D H +R+ + + + +
Sbjct: 1281 LGELHCRVPQATIAYCDNISSVYMSKNPVHHKRTKHIELDIHFVREKVALGELRVLPIPS 1340
Query: 740 QLQIADILTKAVP 752
Q AD+ TK +P
Sbjct: 1341 AHQFADVFTKGLP 1353
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.326 0.139 0.433
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,316,112,675
Number of Sequences: 2540612
Number of extensions: 55843734
Number of successful extensions: 252363
Number of sequences better than 10.0: 1795
Number of HSP's better than 10.0 without gapping: 1695
Number of HSP's successfully gapped in prelim test: 100
Number of HSP's that attempted gapping in prelim test: 245361
Number of HSP's gapped (non-prelim): 2958
length of query: 770
length of database: 863,360,394
effective HSP length: 136
effective length of query: 634
effective length of database: 517,837,162
effective search space: 328308760708
effective search space used: 328308760708
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 79 (35.0 bits)
Medicago: description of AC144727.8