
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0016.5
(733 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAU89779.1| gag-pol polyprotein-like [Solanum tuberosum] 650 0.0
emb|CAC95126.1| gag-pol polyprotein [Populus deltoides] 316 2e-84
pir||F86470 probable retroelement polyprotein [imported] - Arabi... 233 2e-59
gb|AAD26943.1| putative retroelement pol polyprotein [Arabidopsi... 204 1e-50
emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana] 201 9e-50
gb|AAD25646.1| putative retroelement pol polyprotein [Arabidopsi... 197 8e-49
gb|AAD15534.1| putative retroelement pol polyprotein [Arabidopsi... 197 1e-48
emb|CAA72989.1| unnamed protein product [Brassica oleracea] gi|7... 196 2e-48
gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 proteas... 195 5e-48
gb|AAB61111.1| Strong similarity to Zea mays retrotransposon Hop... 195 5e-48
gb|AAC35532.1| contains similarity to proteases [Arabidopsis tha... 194 7e-48
gb|AAP53905.1| putative pol polyprotein [Oryza sativa (japonica ... 189 4e-46
gb|AAF02855.1| Similar to retrotransposon proteins [Arabidopsis ... 187 1e-45
gb|AAC67200.1| putative retroelement pol polyprotein [Arabidopsi... 182 3e-44
gb|AAK51235.1| polyprotein [Arabidopsis thaliana] 182 3e-44
emb|CAB81170.1| retrotransposon like protein [Arabidopsis thalia... 181 8e-44
ref|XP_475401.1| putative polyprotein [Oryza sativa (japonica cu... 177 8e-43
dbj|BAB01972.1| copia-like retrotransposable element [Arabidopsi... 177 1e-42
gb|AAP53070.1| putative retroelement [Oryza sativa (japonica cul... 172 5e-41
emb|CAB81478.1| putative protein [Arabidopsis thaliana] gi|49720... 170 1e-40
>gb|AAU89779.1| gag-pol polyprotein-like [Solanum tuberosum]
Length = 1212
Score = 650 bits (1677), Expect = 0.0
Identities = 331/628 (52%), Positives = 440/628 (69%), Gaps = 31/628 (4%)
Query: 1 MSTEKYEVLLVRLNGKNYSAWAFEFQIFVKGKSLWGHVDGSTSALDKEKQETEYADWEVK 60
M++ +E VR GKNYS+W F+FQ+FV GK LWG++DGS A T+ +W++K
Sbjct: 1 MNSHHFESFSVRFTGKNYSSWEFQFQLFVTGKELWGYIDGSDPA---PTDATKLGEWKIK 57
Query: 61 DNQVLAWIIRFVDPNIVLNLRPFSTAAKMWAYLKKIYNQNNTARRFQLEHDIAIFQQEIL 120
D +V+ WI+ +DP IVLNLRP+ T MW YL+K+YNQ+N+ARRFQLE++IA + Q L
Sbjct: 58 DARVMTWILGSIDPLIVLNLRPYKTVKAMWDYLQKVYNQDNSARRFQLEYEIANYSQGGL 117
Query: 121 SISEFYSQFMNLWADYTDIVYGSVSTEGLTSVQTVHETTK*DQFLMKLRSDFEGIRTNLM 180
+ +++S F NLWA++TDIVY + TE L+ +Q VHE +K DQFLMKLRSDFE IR+NLM
Sbjct: 118 FVQDYFSGFQNLWAEFTDIVYAKIPTESLSVIQAVHEQSKRDQFLMKLRSDFESIRSNLM 177
Query: 181 NRATVPS*DACLNELLREERRLLTLATMEQHKSASLPVAYVVQEKPRGRDLSAVQCFCCK 240
NR PS D C ELLREE+RL+T + K + VA+ Q K +GRD+S QC+ CK
Sbjct: 178 NRDPSPSLDVCFRELLREEQRLVTQNVFK--KENDVTVAFAAQGKGKGRDMSRTQCYSCK 235
Query: 241 GFGHYASNCPRKSCNYCKKDGHVIKECPIRPPKKNATTFTTSVHSPIAPSFVDIANVQHN 300
+GH ASNC +K NYCK+ GH+IKECP+RP + F ++ S D +++
Sbjct: 236 EYGHIASNCSKKFYNYCKQQGHIIKECPMRPQNRRINAFQARING----STDDNSSLG-- 289
Query: 301 APTPVQALTPEIVEHMIILAFSALGISGKHSPNSSPWYFDYGASNHMANNAEALTNITQN 360
Q LTPE+V+ MI+ AFSALG+ G + S+ W D GASNHM N+ L N+ +
Sbjct: 290 -----QVLTPEMVQQMIVSAFSALGLQG-NDVTSNFWIVDSGASNHMTNSTSILKNVRKY 343
Query: 361 FGNLKIQVANGNHLPITAIGDISTSLNDVYVSPGLTSNLIFVGQLVDNDCRVAFSKSGCL 420
G +IQ+ANG++LPIT +GDI+ + +V+VSP L+++LI VGQLVDN+C V FS++GCL
Sbjct: 344 QGPSQIQIANGSNLPITKVGDITPTFKNVFVSPKLSTSLISVGQLVDNNCDVNFSRNGCL 403
Query: 421 VQDQHSGKLIARGPKVGRLFPLCFSMSPCSSLPFVSCHSATVVNFELWHKRLGHPNSNVL 480
VQDQ SG +IA+GPKVGRLFP+ FS+ P S S S T E+WHKRLGHPNS VL
Sbjct: 404 VQDQVSGTIIAKGPKVGRLFPIHFSIPPVLSFACTSTASKT----EVWHKRLGHPNSVVL 459
Query: 481 YELLQSGVLGNKETPSLSTIKFDCTYCKLGKSKILPFPNHQSNISQTFDMIHSDLWGITP 540
+ SG+LGNK S+++I DC+ CKLGKSK LPFPN S ++ FD+IHSD+WGI+P
Sbjct: 460 SHISNSGLLGNKNKFSVASI--DCSTCKLGKSKTLPFPNFGSRATKCFDVIHSDVWGISP 517
Query: 541 VISHANYKYFVTFIDDFSRFTWVYFLRSKDEVFSAFKFFHAYVETQFSSKIKILRSDNGG 600
+ISHA++KYF+TFIDD+SRFTWVYFLRSK EVFS FK F AY+ETQFS+ IK+LRSD+GG
Sbjct: 518 IISHAHFKYFMTFIDDYSRFTWVYFLRSKSEVFSMFKTFLAYIETQFSTCIKLLRSDSGG 577
Query: 601 GKVHI*FDSRFFENKWYFISK-VMSFHS 627
+ +E K + + K ++S HS
Sbjct: 578 -------EYMSYEFKKFLLDKGIVSQHS 598
>emb|CAC95126.1| gag-pol polyprotein [Populus deltoides]
Length = 1382
Score = 316 bits (810), Expect = 2e-84
Identities = 217/628 (34%), Positives = 333/628 (52%), Gaps = 47/628 (7%)
Query: 1 MSTEKYEVLL---VRLNGKNYSAWAFEFQIFVKGKSLWGHVDGSTSALDKEKQETEYAD- 56
M+TE+ + L VRL+GKNYS W++ + F+KGK +WG+V G T + K +E +
Sbjct: 1 MATERDDSLQSVSVRLDGKNYSYWSYVMRNFLKGKKMWGYVSG-TYVVPKNTEEGDTVSI 59
Query: 57 --WEVKDNQVLAWIIRFVDPNIVLNLRPFSTAAKMWAYLKKIYNQNNTARRFQLEHDIAI 114
WE + +++ WI +V+ +I L + TA ++W +L++++ Q+N A+++QLE+DI
Sbjct: 60 DTWEANNAKIITWINNYVEHSIGTQLAKYETAKEVWDHLQRLFTQSNFAKQYQLENDIRA 119
Query: 115 FQQEILSISEFYSQFMNLWADYTDIVYGSVSTEGLTSVQTVHETTK*DQFLMKLRSDFEG 174
Q+ +SI EFYS +LW + SV + + E + QFL LRSDFEG
Sbjct: 120 LHQKNMSIQEFYSAMTDLWDQLA--LTESVELKACGAYIERREQQRLVQFLTALRSDFEG 177
Query: 175 IRTNLMNRATVPS*DACLNELLREERRLLTLATMEQHKSASLPVAYVVQEKP----RGRD 230
+R ++++R+ +PS D+ ++ELL EE RL + + + SAS P V KP + +
Sbjct: 178 LRGSILHRSPLPSVDSVVSELLAEEIRLQSYSE-KGILSASNPSVLAVPSKPFSNHQNKP 236
Query: 231 LSAV---QCFCCKGFGHYASNCPR-KSCNYCKKDGHVIKECPIRPPK--KNATTFTTSVH 284
+ V +C CK GH+ + CP+ + N K G + R P+ K T +V
Sbjct: 237 YTRVGFDECSFCKQKGHWKAQCPKLRQQNQAWKSGSQSQSNAHRSPQGYKPPHHNTAAVA 296
Query: 285 SPIAPSFVDIANVQHNAPTPVQALTPEIVEHMII--LAFSALGISGKHSPNSSPWYFDYG 342
SP + N +L P+ + I L S+ GIS S W D G
Sbjct: 297 SP---GSITDPNTLAEQFQKFLSLQPQAMSASSIGQLPHSSSGIS------HSEWVLDSG 347
Query: 343 ASNHMANNAEALTNITQNFGNLKIQVANGNHLPITAIGDIST---SLNDVYVSPGLTSNL 399
AS+HM+ ++ + T+++ ++ + A+G +P+ +G + T SL +VY+ P L NL
Sbjct: 348 ASHHMSPDSSSFTSVSP-LSSIPVMTADGTPMPLAGVGSVVTLHLSLPNVYLIPKLKLNL 406
Query: 400 IFVGQLVDN-DCRVAFSKSGCLVQDQHSGKLIARGPKVGRLFPLCFSMSPCS------SL 452
+GQ+ D+ D V FS S C VQD S KLI G + L+ L P L
Sbjct: 407 ASIGQICDSGDYLVMFSGSFCCVQDLQSQKLIGTGRRENGLYILDELKVPVVVAATTVDL 466
Query: 453 PFVSCHSATVVNFELWHKRLGHPNSNVLYELLQSGVLGNKETPSLSTIKFDCTYCKLGKS 512
F S + +F LWH RLGH +S+ L L +G LGN +T +S DC+ CKL K
Sbjct: 467 SFFRL-SLSSSSFYLWHSRLGHVSSSRLRFLASTGALGNLKTCDIS----DCSGCKLAKF 521
Query: 513 KILPFPNHQSNISQTFDMIHSDLWGITPVISHANYKYFVTFIDDFSRFTWVYFLRSKDEV 572
LPF S S FD+IHSD+WG +PV + +Y+V+FIDD +R+ WVY ++ + E
Sbjct: 522 SALPFNRSTSVSSSPFDLIHSDVWGPSPVSTKGGSRYYVSFIDDHTRYCWVYLMKHRSEF 581
Query: 573 FSAFKFFHAYVETQFSSKIKILRSDNGG 600
F + F A ++TQ S+ IK R D GG
Sbjct: 582 FEIYAAFRALIKTQHSAVIKCFRCDLGG 609
>pir||F86470 probable retroelement polyprotein [imported] - Arabidopsis thaliana
gi|9989049|gb|AAG10812.1| Putative retroelement
polyprotein [Arabidopsis thaliana]
Length = 1404
Score = 233 bits (594), Expect = 2e-59
Identities = 179/618 (28%), Positives = 285/618 (45%), Gaps = 55/618 (8%)
Query: 1 MSTEKYEVLLVRLNGKNYSAWAFEFQIFVKGKSLWGHVDGSTSAL-DKEKQETEYAD--- 56
M T + + V L G NY W+ + + G+ LW HV S + DKE++ETE
Sbjct: 1 METSQKVITTVILQGGNYLTWSRTTKTVLCGRGLWSHVISSQAPKEDKEEEETETISPEE 60
Query: 57 --WEVKDNQVLAWIIRFVDPNIVLNLRPFSTAAKMWAYLKKIY-NQNNTARRFQLEHDIA 113
W +D VLA + ++ +I+ TA ++W LK +Y N++N R F+++ I
Sbjct: 61 EKWFQEDQAVLALLQNSLETSILEGYSYCETAKELWDTLKNVYGNESNLTRVFEVKKAIN 120
Query: 114 IFQQEILSISEFYSQFMNLWADYTDIVYGSVSTEGLTSVQTVHETTK*DQ---FLMKLRS 170
QE L ++ + +F +LW++ + G++ + +HE + D+ L+ L
Sbjct: 121 ELSQEDLEFTKHFGKFRSLWSELKSLRPGTLDP------KILHERREQDKVFGLLLTLNP 174
Query: 171 DFEGIRTNLMNRATVPS*DACLNELLREERRLLTLATMEQHKSASLPVAYVVQEKPRGRD 230
+ + +L+ +PS D +++ +E+ + +A+ + + D
Sbjct: 175 GYNDLIKHLLRSEKLPSLDEVCSKIQKEQGSTGLFGGKSELITANKGEVVANKGVYKNED 234
Query: 231 LSAVQCFCCKGFGHYASNC-----PRKSCNYCKKDGHVIKECPIRPPKKNATTFTTSVHS 285
+ C CK GH C K + H +E + ++ TS
Sbjct: 235 RKLLTCDHCKKKGHTKDKCWLLHPHLKPAKFKDSRAHFSQETHEEQSQAGSSKGETST-- 292
Query: 286 PIAPSFVDIANVQHNAPTPVQALTPEIVEHMIILAFSALGISGKHSPNSSPWYFDYGASN 345
SF D + ++AL IV + GI+ +S D GAS+
Sbjct: 293 ----SFGDYVR-----KSDLEALIKSIV------SLKESGITFSSQTSSGSIVIDSGASH 337
Query: 346 HMANNAEALTNITQNFGNLKIQVANGNHLPITAIGDISTSLND--VYVSPGLTSNLIFVG 403
HM +N+ L NI G+ + +ANG+ +PI IG++ D + P TSNL+ V
Sbjct: 338 HMISNSNLLDNIEPALGH--VIIANGDKVPIEGIGNLKLFNKDSKAFFMPKFTSNLLSVK 395
Query: 404 QLV-DNDCRVAFSKSGCLVQDQHSGKLIARGPKVGRLFPLCFSMSPCSSLPFVSCHSATV 462
+ D +C F + QD +GK+I G G L+ L +SP SS F S +
Sbjct: 396 RTTRDLNCYAIFGPNDVYFQDIETGKVIGEGGSKGELYVL-EDLSPNSSSCFSSKSHLGI 454
Query: 463 VNFELWHKRLGHPNSNVLYELLQSGVLGNKETPSLSTIKFDCTYCKLGKSKILPFPNHQS 522
LWH RLGHP++ L +L P++S C C LGK FP +
Sbjct: 455 SFNTLWHARLGHPHTRALKLML----------PNISFDHTSCEACILGKHCKSVFPKSLT 504
Query: 523 NISQTFDMIHSDLWGITPVISHANYKYFVTFIDDFSRFTWVYFLRSKDEVFSAFKFFHAY 582
+ FD++HSD+W +P +S N KYFVTFI++ S++TW+ L SKD VF AF F Y
Sbjct: 505 IYEKCFDLVHSDVW-TSPCVSRDNNKYFVTFINEKSKYTWITLLPSKDRVFEAFTNFETY 563
Query: 583 VETQFSSKIKILRSDNGG 600
V QF++KIK+ R+DNGG
Sbjct: 564 VTNQFNAKIKVFRTDNGG 581
>gb|AAD26943.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301694|pir||E84535 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1454
Score = 204 bits (518), Expect = 1e-50
Identities = 166/665 (24%), Positives = 289/665 (42%), Gaps = 87/665 (13%)
Query: 12 RLNGKNYSAWAFEFQIFVKGKSLWGHVDGSTSALDKEKQETEYADWEVKDNQVLAWIIRF 71
RL+ NY W+ I + K+ G +DG+ S + + + W ++ V +W++
Sbjct: 78 RLDETNYGDWSVAMLISLDAKNKTGFIDGTLSR--PLESDLNFRLWSRCNSMVKSWLLNS 135
Query: 72 VDPNIVLNLRPFSTAAKMWAYLKKIYNQNNTARRFQLEHDIAIFQQEILSISEFYSQFMN 131
V P I ++ + A+ +W L +N N R + L +I F+Q LS+SE+Y++
Sbjct: 136 VSPQIYRSILRMNDASDIWRDLNSRFNVTNLPRTYNLTQEIQDFRQGTLSLSEYYTRLKT 195
Query: 132 LW--ADYTDIVYGSVSTEGLTSVQTVHETTK*DQFLMKLRSDFEGIRTNLMNRATVPS*D 189
LW D T+ + + +Q E K +FL L + +R ++ + +PS
Sbjct: 196 LWDQLDSTEALDEPCTCGKAMRLQQKAEQAKIVKFLAGLNESYAIVRRQIIAKKALPS-- 253
Query: 190 ACLNELLREERRLLTLATMEQHKS--ASLPVAYVVQEKPRGRDLSAVQCFCCKGFGHYAS 247
L E +L +Q S + P A+ V E + + C+ G
Sbjct: 254 ------LGEVYHILDQDNSQQSFSNVVAPPAAFQVSEITQSPSMDPTVCYVQNG-----P 302
Query: 248 NCPRKSCNYCKKDGHVIKECPIR---PP---KKNATTFTTSVHSPIAPSFVDIANVQHNA 301
N R C++ + GH+ + C + PP K P+A + + + V +
Sbjct: 303 NKGRPICSFYNRVGHIAERCYKKHGFPPGFTPKGKAGEKLQKPKPLAANVAESSEVNTSL 362
Query: 302 PTPVQALTPEIVEHMIIL-------------------------------AFSALGIS--G 328
+ V L+ E ++ I + +S +GI
Sbjct: 363 ESMVGNLSKEQLQQFIAMFSSQLQNTPPSTYATASTSQSDNLGICFSPSTYSFIGILTVA 422
Query: 329 KHSPNSSPWYFDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITAIGDISTS--- 385
+H+ +S+ W D GA++H++++ +++ + + + + G + I+ +G + +
Sbjct: 423 RHTLSSATWVIDSGATHHVSHDRSLFSSLDTSVLSA-VNLPTGPTVKISGVGTLKLNDDI 481
Query: 386 -LNDVYVSPGLTSNLIFVGQLVDN-DCRVAFSKSGCLVQDQHSGKLIARGPKVGRLFPLC 443
L +V P NLI + L D+ RV F K+ C +QD G+++ +G +V L+ L
Sbjct: 482 LLKNVLFIPEFRLNLISISSLTDDIGSRVIFDKNSCEIQDLIKGRMLGQGRRVANLYLL- 540
Query: 444 FSMSPCSSLPFVSCHSATVVNFELWHKRLGHPNSNVLYELLQS-GVLGNKETPSLSTIKF 502
+ S VV+ +WH+RLGH + L + S G +K S
Sbjct: 541 -------DVGDQSISVNAVVDISMWHRRLGHASLQRLDAISDSLGTTRHKNKGSDF---- 589
Query: 503 DCTYCKLGKSKILPFPNHQSNISQTFDMIHSDLWGITPVISHANYKYFVTFIDDFSRFTW 562
C C L K + L FP + FD++H D+WG V + YKYF+T +DD SR TW
Sbjct: 590 -CHVCHLAKQRKLSFPTSNKVCKEIFDLLHIDVWGPFSVETVEGYKYFLTIVDDHSRATW 648
Query: 563 VYFLRSKDEVFSAFKFFHAYVETQFSSKIKILRSDNGGGKVHI*FDSRFFENKWYFISKV 622
+Y L++K EV + F F VE Q+ K+K +RSDN + +F +Y +
Sbjct: 649 MYLLKTKSEVLTVFPAFIQQVENQYKVKVKAVRSDNAP-------ELKF--TSFYAEKGI 699
Query: 623 MSFHS 627
+SFHS
Sbjct: 700 VSFHS 704
>emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana]
Length = 1466
Score = 201 bits (510), Expect = 9e-50
Identities = 166/633 (26%), Positives = 277/633 (43%), Gaps = 108/633 (17%)
Query: 11 VRLNGKNYSAWAFEFQIFVKGKSLWGHVDGSTSA-----------LDKEKQETEYADWEV 59
++LN NY W +F+ + + L G V+G + + E +Y DW
Sbjct: 19 LKLNDSNYLLWKTQFESLLSSQKLIGFVNGVVTPPAQTRLVVNDDVTSEVPNPQYEDWFC 78
Query: 60 KDNQVLAWIIRFVDPNIVLNLRPFSTAAKMWAYLKKIYNQNNTARRFQLEHDIAIFQQEI 119
D V +W+ + ++ ++ +T+ ++W L + +N+++ AR F L ++ + ++
Sbjct: 79 TDQLVRSWLFGTLSEEVLGHVHNLTTSRQIWISLAENFNKSSIAREFSLRRNLQLLTKKD 138
Query: 120 LSISEFYSQFMNLWADYTDIVYGSVSTEGLTSVQTVHETTK*DQFLMKLRSDFEGIRTNL 179
S+S + F I+ S+S+ G + V E+ K FL L +++ I T +
Sbjct: 139 KSLSVYCRDFK--------IICDSLSSIG----KPVEESMKIFGFLNGLGREYDPITTVI 186
Query: 180 ---MNRATVPS*DACLNELLREERRLLT----------LATMEQHKSASLPVAYVVQEKP 226
+++ P+ + ++E+ + +L + LA + ++ P Y +
Sbjct: 187 QSSLSKLPAPTFNDVISEVQGFDSKLQSYDDTVSVNPHLAFNTERSNSGAP-QYNSNSRG 245
Query: 227 RGRDLS----AVQCFCCKGFGHYASNCP----RKSCNYCKKDGHVIKECPIRPPKKNATT 278
RGR +GF + S P R C C + GH +C R
Sbjct: 246 RGRSGQNRGRGGYSTRGRGFSQHQSASPSSGQRPVCQICGRIGHTAIKCYNRFDN----- 300
Query: 279 FTTSVHSPIAPSFVDIANVQHNAPTPVQALTPEIVEHMIILAFSALGISGKHSPNSSPWY 338
N Q PT AFSAL +S + WY
Sbjct: 301 -----------------NYQSEVPTQ---------------AFSALRVSDE---TGKEWY 325
Query: 339 FDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITAIGDISTS-------LNDVYV 391
D A+ H+ + L N T GN + V +G +LPIT +G + S LN+V V
Sbjct: 326 PDSAATAHITASTSGLQNATTYEGNDAVLVGDGTYLPITHVGSTTISSSKGTIPLNEVLV 385
Query: 392 SPGLTSNLIFVGQLVDN-DCRVAFSKSGCLVQDQHSGKLIARGPKVGRLFPLCFSMSPCS 450
P + +L+ V +L D+ C V F + + D + K++++GP+ L+ L
Sbjct: 386 CPAIQKSLLSVSKLCDDYPCGVYFDANKVCIIDLTTQKVVSKGPRNNGLYML-------E 438
Query: 451 SLPFVSCHS--ATVVNFELWHKRLGHPNSNVLYELL-QSGVLGNKETPSLSTIKFDCTYC 507
+ FV+ +S + E WH RLGH NS +L +LL + + NK S C C
Sbjct: 439 NSEFVALYSNRQCAASMETWHHRLGHSNSKILQQLLTRKEIQVNKSRTSPV-----CEPC 493
Query: 508 KLGKSKILPFPNHQSNISQTFDMIHSDLWGITPVISHANYKYFVTFIDDFSRFTWVYFLR 567
++GKS L F + + D +H DLWG +PV+S+ +KY+ F+DDFSRF+W + LR
Sbjct: 494 QMGKSTRLQFFSSDFRALKPLDRVHCDLWGPSPVVSNQGFKYYAVFVDDFSRFSWFFPLR 553
Query: 568 SKDEVFSAFKFFHAYVETQFSSKIKILRSDNGG 600
K + S F + VE Q +KIK +SD GG
Sbjct: 554 MKSKFISVFIAYQKLVENQLGTKIKEFQSDGGG 586
>gb|AAD25646.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301701|pir||E84589 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1461
Score = 197 bits (502), Expect = 8e-49
Identities = 166/671 (24%), Positives = 280/671 (40%), Gaps = 92/671 (13%)
Query: 12 RLNGKNYSAWAFEFQIFVKGKSLWGHVDGSTSALDKEKQETEYADWEVKDNQVLAWIIRF 71
RL+ Y W+ +I + K+ G VDGS + + + W ++ V +W++
Sbjct: 82 RLDETTYGDWSVAMRISLDAKNKLGFVDGSLPR--PLESDPNFRLWSRCNSMVKSWLLNS 139
Query: 72 VDPNIVLNLRPFSTAAKMWAYLKKIYNQNNTARRFQLEHDIAIFQQEILSISEFYSQFMN 131
V P I ++ + A +W L +N N R + L +I +Q +S+SE+Y+
Sbjct: 140 VSPQIYRSILRLNDATDIWRDLFDRFNLTNLPRTYNLTQEIQDLRQGTMSLSEYYTLLKT 199
Query: 132 LW--ADYTDIVYGSVSTEGLTSVQTVHETTK*DQFLMKLRSDFEGIRTNLMNRATVPS*D 189
LW D T+ + + + E K +FL L + +R ++ + +PS
Sbjct: 200 LWDQLDSTEALDDPCTCGKAVRLYQKAEKAKIMKFLAGLNESYAIVRRQIIAKKALPS-- 257
Query: 190 ACLNELLREERRLLTLATMEQ--HKSASLPVAYVVQEKPRGRDLSAVQCFCCKGFGHYAS 247
L E +L ++ + P A+ V E S + G
Sbjct: 258 ------LAEVYHILDQDNSQKGFFNVVAPPAAFQVSEVSHSPITSPEIMYVQSG-----P 306
Query: 248 NCPRKSCNYCKKDGHVIKEC----------------PIRPPKKNATTFTTSVH------- 284
N R +C++C + GH+ + C +PPK A ++
Sbjct: 307 NKGRPTCSFCNRVGHIAERCYKKHGFPPGFTPKGKSSDKPPKPQAVAAQVTLSPDKMTGQ 366
Query: 285 ---------------------SPIAPSFVD--IANVQHNAPTPVQALTPEIVEHMIILAF 321
S + P V A+ QH A + I+ F
Sbjct: 367 LETLAGNFSPDQIQNLIALFSSQLQPQIVSPQTASSQHEASSSQSVAPSGILFSPSTYCF 426
Query: 322 SALGISGKHSPNSSPWYFDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITAIGD 381
+ +S +S W D GA++H++++ + + + + + + G ++ I+ +G
Sbjct: 427 IGILAVSHNSLSSDTWVIDSGATHHVSHDRKLFQTLDTSIVSF-VNLPTGPNVRISGVGT 485
Query: 382 ISTS----LNDVYVSPGLTSNLIFVGQLV-DNDCRVAFSKSGCLVQDQHSGKLIARGPKV 436
+ + L +V P NLI + L D RV F S C +QD G + G ++
Sbjct: 486 VLINKDIILQNVLFIPEFRLNLISISSLTTDLGTRVIFDPSCCQIQDLTKGLTLGEGKRI 545
Query: 437 GRLFPLCFSMSPCSSLPFVSCHSATVVNFELWHKRLGHPNSNVLYELLQSGVLGNKETPS 496
G L+ L + SP S+ VV+ +WHKRLGHP+ + L L S VLG +
Sbjct: 546 GNLYVLD-TQSPAISVN-------AVVDVSVWHKRLGHPSFSRLDSL--SEVLGTTRHKN 595
Query: 497 LSTIKFDCTYCKLGKSKILPFPNHQSNISQTFDMIHSDLWGITPVISHANYKYFVTFIDD 556
+ C C L K K L FP+ + + TF+++H D+WG V + YKYF+T +DD
Sbjct: 596 KKSAY--CHVCHLAKQKKLSFPSANNICNSTFELLHIDVWGPFSVETVEGYKYFLTIVDD 653
Query: 557 FSRFTWVYFLRSKDEVFSAFKFFHAYVETQFSSKIKILRSDNGGGKVHI*FDSRFFENKW 616
SR TW+Y L+SK +V + F F VE Q+ +++K +RSDN ++
Sbjct: 654 HSRATWIYLLKSKSDVLTVFPAFIDLVENQYDTRVKSVRSDNA---------KELAFTEF 704
Query: 617 YFISKVMSFHS 627
Y ++SFHS
Sbjct: 705 YKAKGIVSFHS 715
>gb|AAD15534.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25411300|pir||F84485 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1664
Score = 197 bits (501), Expect = 1e-48
Identities = 170/629 (27%), Positives = 278/629 (44%), Gaps = 93/629 (14%)
Query: 1 MSTEKYEVLLVRLNGKNYSAWAFEFQIFVKGKSLWGHV---DGSTSALDKEKQETEYAD- 56
M K + V L G NY WA + + + LW H+ + + A +E E +
Sbjct: 1 MENTKALFVPVTLKGVNYLLWARTTKTTLCSRGLWAHILTSEAPSEATIREGMEIVHVGE 60
Query: 57 --WEVKDNQVLAWIIRFVDPNIVLNLRPFSTAAKMWAYLKKIY-NQNNTARRFQLEHDIA 113
W +D VLA + ++ +++ TA ++W L ++ NQ+N +R F+++ I
Sbjct: 61 EKWFQEDQSVLALLQNSLEASLLEAYSYCETAKELWETLFNVFGNQSNLSRVFEVKKAIN 120
Query: 114 IFQQEILSISEFYSQFMNLWADYTDIVYGSVSTEGLTSVQTVHETTK*DQFLMKLRSDFE 173
Q + ++ + +F +LWA+ + ++ + L + E K L+ L S +
Sbjct: 121 DLSQGDMEFTQHFGKFRSLWAELEMLRPNTLDPKVLIERR---EQDKVFGLLLTLSSTYN 177
Query: 174 GIRTNLMNRATVPS*DACLNELLREERRLLTLATMEQHKSASLPVAYVVQEKPRGRDLSA 233
+ +L+ +P+ + +++ +E+
Sbjct: 178 DLIKHLLRADKLPNLEEVCSQIQKEQAN-------------------------------- 205
Query: 234 VQCFCCKGFGHYASNCPRKSCNYCKKDGHVIKEC-----PIRP----PKKNATT---FTT 281
+G Y +N C +CK+ GH ++C +RP P+ N T F T
Sbjct: 206 ------RGNYKYDNNKKALWCEHCKRSGHTKEKCWTLHPHLRPGRREPRANQVTGENFGT 259
Query: 282 SVHSPIAPSFVDIANVQHNAPTPVQALTPEIVEHMIILAF--SALGISGK--HSPNS-SP 336
S +N A + ++V + A + SGK H+ +S P
Sbjct: 260 QEQS-------GTSNQHLGGNGAAMAASSDLVRRSDLKALIKALKESSGKSYHALSSLKP 312
Query: 337 WYFDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITAIGDIST--SLNDVYVSPG 394
D GAS+HM ++++ ++NI GN + +ANG+ +P+ +GD+ + + P
Sbjct: 313 LIIDSGASHHMISDSKLISNIEPALGN--VVIANGDRIPVKGVGDLDLFDKSSKAFYMPT 370
Query: 395 LTSNLIFVGQLV-DNDCRVAFSKSGCLVQDQHSGKLIARGPKVGRLFPLCFSMSPCSSLP 453
TSNL+ V + D +C F + QD + +++ +G L+ L S+P
Sbjct: 371 FTSNLLSVKKATTDLNCYAIFGPNEVHFQDIETSRVLGQGVTKDGLYVL---EDTKPSVP 427
Query: 454 FVSCHSATV--VNFELWHKRLGHPNSNVLYELLQSGVLGNKETPSLSTIKFDCTYCKLGK 511
S S+ + N E WH RLGHP+S L LL PS S +C C LGK
Sbjct: 428 LSSHFSSILGNANSESWHARLGHPHSRALKLLL----------PSTSFKNDECEACILGK 477
Query: 512 SKILPFPNHQSNISQTFDMIHSDLWGITPVISHANYKYFVTFIDDFSRFTWVYFLRSKDE 571
FP + + FD+IHSD+W +P +S N+KYFVTFID+ S+FTW L SKD
Sbjct: 478 HCKSVFPKSSTIYEKCFDLIHSDVW-TSPCLSRENHKYFVTFIDEKSKFTWFTLLPSKDR 536
Query: 572 VFSAFKFFHAYVETQFSSKIKILRSDNGG 600
V AF F YV + +KIKILRSDN G
Sbjct: 537 VLEAFTNFQTYVTNHYDAKIKILRSDNRG 565
>emb|CAA72989.1| unnamed protein product [Brassica oleracea] gi|7488558|pir||T14517
hypothetical protein 1 - wild cabbage transposon Melmoth
Length = 1131
Score = 196 bits (498), Expect = 2e-48
Identities = 163/645 (25%), Positives = 277/645 (42%), Gaps = 79/645 (12%)
Query: 7 EVLLVRLNGKNYSAWAFEFQIFVKGKSLWGHVDGSTSALDKEKQETEYADWEVKDNQVLA 66
+++ ++L+G NY W +I + K+ G VDG+ + D + + W ++ V +
Sbjct: 26 QLISLKLDGSNYDDWNAAMKIALDAKNKIGFVDGTLTRPDTS--DPTFRLWSRCNSMVKS 83
Query: 67 WIIRFVDPNIVLNLRPFSTAAKMWAYLKKIYNQNNTARRFQLEHDIAIFQQEILSISEFY 126
W++ V P I ++ + AA +W L ++ N R F L +I +Q +S+S++Y
Sbjct: 84 WLLNSVSPQIYRSILRLNDAADIWRDLHGRFHMTNLPRTFNLTQEIQDLKQGSMSLSDYY 143
Query: 127 SQFMNLWADY-------TDIVYGSVSTEGLTSVQTVHETTK*DQFLMKLRSDFEGIRTNL 179
+ LW + T V G+ +Q + K +FL L + IR +
Sbjct: 144 TTLKTLWDNLESVDEPDTPCVCGNAE-----KLQKKVDRAKIVKFLAGLNDSYAIIRRQI 198
Query: 180 MNRATVPS*DACLNELLREERR-----LLTLATMEQHKSASLPVA-----YVVQEKPRGR 229
+ + +PS N L +++ + +T A ++ P+A YV +GR
Sbjct: 199 IMKKVLPSLVEVYNILDQDDSQKGFSTAITPAAFNVSENVPPPMAEAGICYVQTGPNKGR 258
Query: 230 DLSAVQCFCCKGFGHYASNCPRK---------------SCNYCKKDGHVIKECPIRPPKK 274
+ C C GH A C +K S + +K V + PP
Sbjct: 259 PI----CSFCNRVGHIAERCYKKHGFPPGFVSKYKSQSSGDRLQKPKQVAAQVSFSPPNS 314
Query: 275 NATTFTT----------------SVHSPIAPSFVDIANVQHNAPTPVQALTPEIVEHMII 318
+ T ++ S P+ +N ++ P+ I +
Sbjct: 315 GQSPMTMDHLVGNHSKEQLQQFIALFSSQLPNVTMGSNEASSSKQPMD--NSGISFNPTT 372
Query: 319 LAFSALGISGKHSPNSSPWYFDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITA 378
L F L +H+ + W D GA++H+ ++ T+I + + + NG + I+
Sbjct: 373 LVFIGLLTVSRHTLANETWIIDSGATHHVCHDRSMYTSIDITTTS-NVNLPNGMIVKISG 431
Query: 379 IGDISTS----LNDVYVSPGLTSNLIFVGQLV-DNDCRVAFSKSGCLVQDQHSGKLIARG 433
+G + + L++V P NL+ + L D +V F S C +QD G I +G
Sbjct: 432 VGIVQLNEHITLHNVLYIPEFRLNLLSISSLTSDIGSQVIFDVSSCAIQDPTKGWTIGQG 491
Query: 434 PKVGRLFPLCFSMSPCSSLPFVSCHSATVVNFELWHKRLGHPNSNVLYELLQSGVLGNKE 493
+V L+ L SP VV+ LWHKRLGHP+ L ++ S LG +
Sbjct: 492 RRVANLYVLDVKSSPMKI--------NAVVDISLWHKRLGHPSYTRLDKI--SEALGTTK 541
Query: 494 TPSLSTIKFDCTYCKLGKSKILPFPNHQSNISQTFDMIHSDLWGITPVISHANYKYFVTF 553
+ C C L K K L + + + +F ++H D+WG V + YKYF+T
Sbjct: 542 HKNKGDAH--CHVCHLAKQKKLSYSSQNHICTASFQLLHVDVWGPFSVETLEGYKYFLTI 599
Query: 554 IDDFSRFTWVYFLRSKDEVFSAFKFFHAYVETQFSSKIKILRSDN 598
+DD SR TW+Y L+SK +V F F +ETQ+++KIK +R DN
Sbjct: 600 VDDHSRATWIYLLQSKSDVLHIFPTFVNQIETQYNTKIKSVRRDN 644
>gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 protease homolog from
Arabidopsis thaliana BAC gb|AF080119 and is a member of
the reverse transcriptase family PF|00078
gi|25301706|pir||C86438 hypothetical protein F28K20.17 -
Arabidopsis thaliana
Length = 1415
Score = 195 bits (495), Expect = 5e-48
Identities = 160/638 (25%), Positives = 274/638 (42%), Gaps = 120/638 (18%)
Query: 11 VRLNGKNYSAWAFEFQIFVKGKSLWGHVDGSTSA-----------LDKEKQETEYADWEV 59
++L NY W +F+ + + L G V+G+ +A + E+ Y W
Sbjct: 19 LKLTDSNYLLWKTQFESLLSSQKLIGFVNGAVNAPSQSRLVVNGEVTSEEPNPLYESWFC 78
Query: 60 KDNQVLAWIIRFVDPNIVLNLRPFSTAAKMWAYLKKIYNQNNTARRFQLEHDIAIFQQEI 119
D V +W+ + ++ ++ ST+ ++W L + +N+++ AR F L ++ + ++
Sbjct: 79 TDQLVRSWLFGTLSEEVLGHVHNLSTSRQIWVSLAENFNKSSVAREFSLRQNLQLLSKKE 138
Query: 120 LSISEFYSQFMNLWADYTDIVYGSVSTEGLTSV-QTVHETTK*DQFLMKLRSDFEGIRTN 178
S + +F + + L+S+ + V E+ K FL L D++ I T
Sbjct: 139 KPFSVYCREFKTI-------------CDALSSIGKPVDESMKIFGFLNGLGRDYDPITTV 185
Query: 179 L---MNRATVPS*DACLNELLREERRLLTL---ATMEQH------KSASLPVAYVVQEKP 226
+ +++ P+ + ++E+ + +L + A++ H +S S Y +K
Sbjct: 186 IQSSLSKLPTPTFNDVVSEVQGFDSKLQSYEEAASVTPHLAFNIERSESGSPQYNPNQKG 245
Query: 227 RGRDLSAVQCFCCKGFGHYAS--------------NCPRKSCNYCKKDGHVIKECPIRPP 272
RGR KG G Y++ + PR C C + GH +C R
Sbjct: 246 RGRSGQN------KGRGGYSTRGRGFSQHQSSPQVSGPRPVCQICGRTGHTALKCYNR-- 297
Query: 273 KKNATTFTTSVHSPIAPSFVDIANVQHNAPTPVQALTPEIVEHMIILAFSALGISGKHSP 332
+N +QA FS L +S
Sbjct: 298 ------------------------FDNNYQAEIQA-------------FSTLRVS---DD 317
Query: 333 NSSPWYFDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITAIGDISTS------- 385
W+ D A+ H+ ++ L + T+ G+ + V +G +LPIT G +
Sbjct: 318 TGKEWHPDSAATAHVTSSTNGLQSATEYEGDDAVLVGDGTYLPITHTGSTTIKSSNGKIP 377
Query: 386 LNDVYVSPGLTSNLIFVGQLVDN-DCRVAFSKSGCLVQDQHSGKLIARGPKVGRLFPLCF 444
LN+V V P + +L+ V +L D+ C V F + + D + K++ GP+ L+ L
Sbjct: 378 LNEVLVVPNIQKSLLSVSKLCDDYPCGVYFDANKVCIIDLQTQKVVTTGPRRNGLYVL-- 435
Query: 445 SMSPCSSLPFVSCHS--ATVVNFELWHKRLGHPNSNVLYELLQSGVLGNKETPSLSTIKF 502
+ FV+ +S E+WH RLGH NS L L S + ++ +
Sbjct: 436 -----ENQEFVALYSNRQCAATEEVWHHRLGHANSKALQHLQNSKAIQINKSRTSPV--- 487
Query: 503 DCTYCKLGKSKILPFPNHQSNISQTFDMIHSDLWGITPVISHANYKYFVTFIDDFSRFTW 562
C C++GKS LPF S + D IH DLWG +PV+S+ KY+ F+DD+SR++W
Sbjct: 488 -CEPCQMGKSSRLPFLISDSRVLHPLDRIHCDLWGPSPVVSNQGLKYYAIFVDDYSRYSW 546
Query: 563 VYFLRSKDEVFSAFKFFHAYVETQFSSKIKILRSDNGG 600
Y L +K E S F F VE Q ++KIK+ +SD GG
Sbjct: 547 FYPLHNKSEFLSVFISFQKLVENQLNTKIKVFQSDGGG 584
>gb|AAB61111.1| Strong similarity to Zea mays retrotransposon Hopscotch polyprotein
(gb|U12626). [Arabidopsis thaliana]
gi|25301690|pir||G96722 hypothetical protein F20P5.25
[imported] - Arabidopsis thaliana
Length = 1315
Score = 195 bits (495), Expect = 5e-48
Identities = 154/579 (26%), Positives = 266/579 (45%), Gaps = 59/579 (10%)
Query: 29 VKGKSLWGHVDGSTSALDKEKQETEYAD-WEVKDNQVLAWIIRFVDPNIVLNLRPFSTAA 87
++ K+ G VDGS + K + Y W ++ V +W++ V I ++ F TAA
Sbjct: 5 IEAKNKLGFVDGS---IPKPDDDDPYCKIWRRCNSMVKSWLLNSVSKEIYTSILYFPTAA 61
Query: 88 KMWAYLKKIYNQNNTARRFQLEHDIAIFQQEILSISEFYSQFMNLWADYTDIVYGSVSTE 147
+W L +++++ R ++L I +Q L +S ++++ LW + T + + E
Sbjct: 62 AIWKDLYTRFHKSSLPRLYKLRQQIHSLRQGNLDLSSYHTRTQTLWEELTSLQAVPRTVE 121
Query: 148 GLTSVQTVHETTK*DQFLMKLRSDFEGIRTNLMNRATVPS*DACLNELLREE-RRLLTLA 206
L + ET + FLM L ++ +R+ ++ + T+PS N + ++E +R ++
Sbjct: 122 DLLIER---ETNRVIDFLMGLNDCYDTVRSQILMKKTLPSLSEVFNMIDQDETQRSARIS 178
Query: 207 TMEQHKSASLPVAYVVQEKPRGRDLSAVQCFCCKGFGHYASNCPRKSCNYCKKDGHVIKE 266
T S+ PV+ + D + R C+YC + GHV
Sbjct: 179 TTPGMTSSVFPVSNQSSQSALNGDTYQKK--------------ERPVCSYCSRPGHVEDT 224
Query: 267 CPIRPPKKNATTFTTSVHSPIAPSF-----VDIANVQHNAPTPVQALTPEIVEHMIILAF 321
C K T S + PS + V +N LT ++ ++
Sbjct: 225 CY---KKHGYPTSFKSKQKFVKPSISANAAIGSEEVVNNTSVSTGDLTTSQIQQLVSF-- 279
Query: 322 SALGISGKHSPNSSPWYFDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITAIGD 381
+S K P S+P + + + ++++ + + + G++ + G HL
Sbjct: 280 ----LSSKLQPPSTPVQPEVHSIS-VSSDPSSSSTVCPISGSVHL----GRHL------- 323
Query: 382 ISTSLNDVYVSPGLTSNLIFVGQLVDN-DCRVAFSKSGCLVQDQHSGKLIARGPKVGRLF 440
LNDV P NL+ V L + CR+ F ++ C++QD ++ G +V L+
Sbjct: 324 ---ILNDVLFIPQFKFNLLSVSSLTKSMGCRIWFDETSCVLQDATRELMVGMGKQVANLY 380
Query: 441 PLCF-SMSPCSSLPFVSCHSATVVNFELWHKRLGHPNSNVLYELLQSGVLGNKETPSLST 499
+ S+S + ++ A+V + +LWHKRLGHP+ L + S +L + +
Sbjct: 381 IVDLDSLSHPGTDSSITV--ASVTSHDLWHKRLGHPSVQKLQPM--SSLLSFPKQKN--N 434
Query: 500 IKFDCTYCKLGKSKILPFPNHQSNISQTFDMIHSDLWGITPVISHANYKYFVTFIDDFSR 559
F C C + K K LPF +H + S+ FD+IH D WG V +H Y+YF+T +DD+SR
Sbjct: 435 TDFHCRVCHISKQKHLPFVSHNNKSSRPFDLIHIDTWGPFSVQTHDGYRYFLTIVDDYSR 494
Query: 560 FTWVYFLRSKDEVFSAFKFFHAYVETQFSSKIKILRSDN 598
TWVY LR+K +V + F VE QF + IK +RSDN
Sbjct: 495 ATWVYLLRNKSDVLTVIPTFVTMVENQFETTIKGVRSDN 533
>gb|AAC35532.1| contains similarity to proteases [Arabidopsis thaliana]
gi|7444456|pir||T01908 hypothetical protein T12H20.12 -
Arabidopsis thaliana
Length = 1392
Score = 194 bits (494), Expect = 7e-48
Identities = 159/637 (24%), Positives = 272/637 (41%), Gaps = 105/637 (16%)
Query: 7 EVLLVRLNGKNYSAWAFEFQIFVKGKSLWGHVDGSTSA-----------LDKEKQETEYA 55
+V+ ++L NY W +F+ ++ L G V G+T + E+ E+
Sbjct: 14 QVVTLKLTPTNYLLWKTQFESYLSSHLLLGFVTGATPRPASTIIVTKDDIQSEEANQEFL 73
Query: 56 DWEVKDNQVLAWIIRFVDPNIVLNLRPFSTAAKMWAYLKKIYNQNNTARRFQLEHDIAIF 115
W D V AWI + + + ++A ++W L + +N+ +T R++ L+ +
Sbjct: 74 KWTRIDQLVKAWIFGSLSEEALKVVIGLNSAQEVWLGLARRFNRFSTTRKYDLQKRLGTC 133
Query: 116 QQEILSISEFYSQFMNLWADYTDIVYGSVSTEGLTSVQTVHETTK*DQFLMKLRSDFEGI 175
+ ++ + S+ N+ I + E + V L L ++E I
Sbjct: 134 SKAGKTMDAYLSEVKNICDQLDSIGFPVTEQEKIFGV------------LNGLGKEYESI 181
Query: 176 RTNLMNRATV---PS*DACLNELLREERRLLTLATMEQ---HKSASLPVAYVVQEKPRGR 229
T + + V P D + +L + +L T + H + +Y + R
Sbjct: 182 ATVIEHSLDVYPGPCFDDVVYKLTTFDDKLSTYTANSEVTPHLAFYTDKSYSSRGNNNSR 241
Query: 230 -----DLSAVQCFCCKGFGHY----------ASNCPRKSCNYCKKDGHVIKECPIRPPKK 274
+ + +G G + + N + +C C+K GH +C
Sbjct: 242 GGRYGNFRGRGSYSSRGRGFHQQFGSGSNNGSGNGSKPTCQICRKYGHSAFKC------- 294
Query: 275 NATTFTTSVHSPIAPSFVDIANVQHNAPTPVQALTPEIVEHMIILAFSALGISGKHSPNS 334
T P D+ N AF+A+ +S ++ +S
Sbjct: 295 -----YTRFEENYLPE--DLPN-----------------------AFAAMRVSDQNQASS 324
Query: 335 SPWYFDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITAIGDISTS-------LN 387
W D A+ H+ N + L N G+ + V NG+ LPIT IG I + L
Sbjct: 325 HEWLPDSAATAHITNTTDGLQNSQTYSGDDSVIVGNGDFLPITHIGTIPLNISQGTLPLE 384
Query: 388 DVYVSPGLTSNLIFVGQLVDN-DCRVAFSKSGCLVQDQHSGKLIARGPKVGRLFPLCFSM 446
DV V PG+T +L+ V +L D+ C F +++D+ + +L+ +G K L+ L
Sbjct: 385 DVLVCPGITKSLLSVSKLTDDYPCSFTFDSDSVVIKDKRTQQLLTQGNKHKGLYVL---- 440
Query: 447 SPCSSLPFVSCHSATVVNF--ELWHKRLGHPNSNVLYELLQS-GVLGNKETPSLSTIKFD 503
+PF + +S + E+WH+RLGHPN VL L+++ ++ NK + ++
Sbjct: 441 ---KDVPFQTYYSTRQQSSDDEVWHQRLGHPNKEVLQHLIKTKAIVVNKTSSNM------ 491
Query: 504 CTYCKLGKSKILPFPNHQSNISQTFDMIHSDLWGITPVISHANYKYFVTFIDDFSRFTWV 563
C C++GK LPF + S+ + IH DLWG PV S ++Y+V FID++SRFTW
Sbjct: 492 CEACQMGKVCRLPFVASEFVSSRPLERIHCDLWGPAPVTSAQGFQYYVIFIDNYSRFTWF 551
Query: 564 YFLRSKDEVFSAFKFFHAYVETQFSSKIKILRSDNGG 600
Y L+ K + FS F F VE Q+ KI + + D GG
Sbjct: 552 YPLKLKSDFFSVFVLFQQLVENQYQHKIAMFQCDGGG 588
>gb|AAP53905.1| putative pol polyprotein [Oryza sativa (japonica cultivar-group)]
gi|37534632|ref|NP_921618.1| putative pol polyprotein
[Oryza sativa (japonica cultivar-group)]
Length = 1688
Score = 189 bits (479), Expect = 4e-46
Identities = 107/277 (38%), Positives = 154/277 (54%), Gaps = 14/277 (5%)
Query: 334 SSPWYFDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITAIGDIST---SLNDVY 390
S PW D GAS HM+ + LT+ + ANG +T G IS+ ++ +V
Sbjct: 181 SQPWILDSGASFHMSFDDSWLTSCRLVKNGATVHTANGTLCKVTHQGSISSPQFTVPNVS 240
Query: 391 VSPGLTSNLIFVGQLVDNDCRVAFSKSGCLVQDQHSGKLIARGPKVGR---LFPLCFSMS 447
+ P L+ NLI VGQL D +C V F + C VQD+H+G +I G + R L+ L
Sbjct: 241 LVPKLSMNLISVGQLTDTNCFVGFDDTSCFVQDRHTGAVIGTGHRQKRSCGLYILDSLSL 300
Query: 448 PCSSLPFVSCHS----ATVVNFELWHKRLGHPNSNVLYELLQSGVLGNKETPSLSTIKFD 503
P SS S +S +F WH RLGH + L L+ GVLG+ + F
Sbjct: 301 PSSSTNTPSVYSPMCSTACKSFPQWHHRLGHLCGSRLATLINQGVLGSVPVDTT----FV 356
Query: 504 CTYCKLGKSKILPFPNHQSNISQTFDMIHSDLWGITPVISHANYKYFVTFIDDFSRFTWV 563
C CKLGK LP+P+ S S+ FD++HSD+WG +P S + Y+V F+DD+SR+TW+
Sbjct: 357 CKGCKLGKQVQLPYPSSTSRSSRPFDLVHSDVWGKSPFPSKGGHNYYVIFVDDYSRYTWI 416
Query: 564 YFLRSKDEVFSAFKFFHAYVETQFSSKIKILRSDNGG 600
YF++ + ++ S ++ F + TQFSS I+I RSD+GG
Sbjct: 417 YFMKHRSQLISIYQSFAQMIHTQFSSAIRIFRSDSGG 453
>gb|AAF02855.1| Similar to retrotransposon proteins [Arabidopsis thaliana]
gi|25301689|pir||C96578 hypothetical protein T18A20.5
[imported] - Arabidopsis thaliana
Length = 1522
Score = 187 bits (475), Expect = 1e-45
Identities = 170/633 (26%), Positives = 261/633 (40%), Gaps = 104/633 (16%)
Query: 11 VRLNGKNYSAWAFEFQIFVKGKSLWGHVDGSTSA-----------LDKEKQETEYADWEV 59
V LN +NY W +F+ F+ G+ L G V GS SA + E+ E+ W
Sbjct: 17 VTLNQQNYILWKSQFESFLSGQGLLGFVTGSISAPAQTRSVTHNNVTSEEPNPEFYTWHQ 76
Query: 60 KDNQVLAWIIRFVDPNIVLNLRPFSTAAKMWAYLKKIYNQNNTARRFQLEHDIAIFQQEI 119
D V +W++ +I+ + T+ ++W L +N+ +++R F+L+ + +++
Sbjct: 77 TDQVVKSWLLGSFAEDILSVVVNCFTSHQVWLTLANHFNRVSSSRLFELQRRLQTLEKKD 136
Query: 120 LSISEFYSQFMNLWADYTDIVYGSVSTEGLTSVQT-VHETTK*DQFLMKLRSDFEGIRTN 178
++ F ++ + L SV + V E K L L ++E I+T
Sbjct: 137 NTMEVFLKDLKHI-------------CDQLASVGSPVPEKMKIFSALNGLGREYEPIKTT 183
Query: 179 LMNRATVP---S*DACLNELLREERRL---LTLATMEQHKSASLPVA----YVVQEKPRG 228
+ N S D ++L + RL +T T+ H + ++ + Y + +G
Sbjct: 184 IENSVDSNPSLSLDEVASKLRGYDDRLQSYVTEPTISPHVAFNVTHSDSGYYHNNNRGKG 243
Query: 229 RDLSAV--QCFCCKGFGHYASNCPRKS---------CNYCKKDGHVIKECPIRPPKKNAT 277
R S F +G G + P C C K GH +C R
Sbjct: 244 RSNSGSGKSSFSTRGRGFHQQISPTSGSQAGNSGLVCQICGKAGHHALKCWHR------- 296
Query: 278 TFTTSVHSPIAPSFVDIANVQHNAPTPVQALTPEIVEHMIILAFSALGISGKHSPNSSPW 337
F S P +A + + I+ + W
Sbjct: 297 -FDNSYQHEDLP-----------------------------MALATMRITDVTDHHGHEW 326
Query: 338 YFDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITAIGDISTS-------LNDVY 390
D AS H+ NN L G+ I VA+GN LPIT G S + L +V
Sbjct: 327 IPDSAASAHVTNNRHVLQQSQPYHGSDSIMVADGNFLPITHTGSGSIASSSGKIPLKEVL 386
Query: 391 VSPGLTSNLIFVGQLV-DNDCRVAFSKSGCLVQDQHSGKLIARGPKVGRLFPLCFSMSPC 449
V P + +L+ V +L D C V F + D+ + KL+ G L+ L
Sbjct: 387 VCPDIVKSLLSVSKLTSDYPCSVEFDADSVRINDKATKKLLVMGRNRDGLYSL-----EE 441
Query: 450 SSLPFVSCHSATVVNFELWHKRLGHPNSNVLYELLQSG--VLGNKETPSLSTIKFDCTYC 507
L + + E+WH+RLGH N+ VL++L S ++ NK +K C C
Sbjct: 442 PKLQVLYSTRQNSASSEVWHRRLGHANAEVLHQLASSKSIIIINK------VVKTVCEAC 495
Query: 508 KLGKSKILPFPNHQSNISQTFDMIHSDLWGITPVISHANYKYFVTFIDDFSRFTWVYFLR 567
LGKS LPF N S+ + IH DLWG +P S ++Y+V FID +SRFTW Y L+
Sbjct: 496 HLGKSTRLPFMLSTFNASRPLERIHCDLWGPSPTSSVQGFRYYVVFIDHYSRFTWFYPLK 555
Query: 568 SKDEVFSAFKFFHAYVETQFSSKIKILRSDNGG 600
K + FS F F VE Q KIKI + D GG
Sbjct: 556 LKSDFFSTFVMFQKLVENQLGHKIKIFQCDGGG 588
>gb|AAC67200.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301693|pir||F84480 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1402
Score = 182 bits (462), Expect = 3e-44
Identities = 161/641 (25%), Positives = 265/641 (41%), Gaps = 114/641 (17%)
Query: 11 VRLNGKNYSAWAFEFQIFVKGKSLWGHV----------------DGSTSALDKEKQETEY 54
V L KNY W +F+ F+ G+ L G V DGSTSA EY
Sbjct: 17 VTLTAKNYILWKSQFESFLDGQGLLGFVTGSIPAPSQTSVVSDIDGSTSA----SPNPEY 72
Query: 55 ADWEVKDNQVLAWIIRFVDPNIVLNLRPFSTAAKMWAYLKKIYNQNNTARRFQLEHDIAI 114
W D V +W++ +I+ + +T+ ++W + +N+ +++R F+L+ +
Sbjct: 73 YTWFKTDRVVKSWLLGSFLEDILSVVVNCNTSHEVWISVANHFNRVSSSRLFELQRRLQN 132
Query: 115 FQQEILSISEFYSQFMNLWADYTDIVYGSVSTEGLTSVQTVHETTK*DQFLMKLRSDFEG 174
+ S+ E+ + + GS TE + K L L ++E
Sbjct: 133 VSKRDKSMDEYLKDLKTICDQLASV--GSPVTEKM----------KIFAALNGLGREYEP 180
Query: 175 IRT---NLMNRATVPS*DACLNELLREERRL---LTLATMEQHKSASLPVA--------Y 220
I+T N M+ PS + + +L + RL L + H + ++ + +
Sbjct: 181 IKTTIENSMDALPGPSLEDVIPKLTGYDDRLQGYLEETAVSPHVAFNITTSDDSNASGYF 240
Query: 221 VVQEKPRGRDLSAVQCFCCKGFGHYASNCPRKS------------CNYCKKDGHVIKECP 268
+ +G+ F +G G + S C C K GH +C
Sbjct: 241 NAYNRGKGKSNRGRNSFSTRGRGFHQQISSTNSSSGSQSGGTSVVCQICGKMGHPALKCW 300
Query: 269 IRPPKKNATTFTTSVHSPIAPSFVDIANVQHNAPTPVQALTPEIVEHMIILAFSALGISG 328
R F S P A +A+ I+
Sbjct: 301 HR--------FNNSYQYEELPR-----------------------------ALAAMRITD 323
Query: 329 KHSPNSSPWYFDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITAIGDISTS--- 385
+ + W D A+ H+ N+ +L G+ + VA+GN LPIT G + +
Sbjct: 324 ITDQHGNEWLPDSAATAHVTNSPRSLQQSQPYHGSDAVMVADGNFLPITHTGSTNLASSS 383
Query: 386 ----LNDVYVSPGLTSNLIFVGQLV-DNDCRVAFSKSGCLVQDQHSGKLIARGPKVGRLF 440
L DV V P +T +L+ V +L D C V F G + D+ + KL+ G L+
Sbjct: 384 GNVPLTDVLVCPSITKSLLSVSKLTQDYPCTVEFDSDGVRINDKATKKLLIMGSTCDGLY 443
Query: 441 PLCFSMSPCSSLPFVSCHSATVVNFELWHKRLGHPNSNVLYELLQSGVLG-NKETPSLST 499
L F S + + E+WH+RLGHP+ VL +L+++ + NK + SL
Sbjct: 444 CL---KDDSQFKAFFSTRQQSASD-EVWHRRLGHPHPQVLQQLVKTNSISINKTSKSL-- 497
Query: 500 IKFDCTYCKLGKSKILPFPNHQSNISQTFDMIHSDLWGITPVISHANYKYFVTFIDDFSR 559
C C+LGKS LPF + ++ + +H DLWG +P+ S ++Y+ FID +SR
Sbjct: 498 ----CEACQLGKSTRLPFVSSSFTSNRPLERVHCDLWGPSPITSVQGFRYYAVFIDHYSR 553
Query: 560 FTWVYFLRSKDEVFSAFKFFHAYVETQFSSKIKILRSDNGG 600
F+W+Y L+ K + ++ F FH VE Q + KI + + D GG
Sbjct: 554 FSWIYPLKLKSDFYNIFVAFHKLVENQLNHKISVFQCDGGG 594
>gb|AAK51235.1| polyprotein [Arabidopsis thaliana]
Length = 1453
Score = 182 bits (462), Expect = 3e-44
Identities = 155/636 (24%), Positives = 277/636 (43%), Gaps = 113/636 (17%)
Query: 11 VRLNGKNYSAWAFEFQIFVKGKSLWGHVDGS-----------TSALDKEKQETEYADWEV 59
++LN NY W +F+ + L G V+G T + +Y W
Sbjct: 19 LKLNDSNYLLWKTQFESLLSCHKLIGFVNGGITPPPRTLNVVTGDTSVDVANPQYESWFC 78
Query: 60 KDNQVLAWIIRFVDPNIVLNLRPFSTAAKMWAYLKKIYNQNNTARRFQLEHDIAIFQQEI 119
D + +W+ + ++ + T+ +W L + +N+++ AR F L + + ++
Sbjct: 79 TDQLIRSWLFGTLSEEVLGYVHNLQTSRDIWISLAENFNKSSVAREFTLRRTLQLLSKKD 138
Query: 120 LSISEFYSQFMNLWADYTDIVYGSVSTEGLTSV-QTVHETTK*DQFLMKLRSDFEGIRTN 178
++S + +F+ + + L+S+ + V E+ K FL L +++ I T
Sbjct: 139 KTLSAYCREFIAV-------------CDALSSIGKPVDESMKIFGFLNGLGREYDPITTV 185
Query: 179 LMNRATVPS*DACLNELLREERRLLTLATMEQHKSASLPVAYVVQEKP------------ 226
+ + + S + + + + L + E+ +A+ +A+ Q
Sbjct: 186 IQSSLSKISPPTFRDVISEVKGFDVKLQSYEESVTANPHMAFNTQRSEYTDNYTSGNRGK 245
Query: 227 --------RGRDLSAVQCFCCKGFGHYASNC----PRKSCNYCKKDGHVIKECPIRPPKK 274
RGR + + +GF + +N R C C + GH +C R
Sbjct: 246 GRGGYGQNRGRSGYSTRG---RGFSQHQTNSNNTGERPVCQICGRTGHTALKCYNR---- 298
Query: 275 NATTFTTSVHSPIAPSFVDIANVQHNAPTPVQALTPEIVEHMIILAFSALGISGKHSPNS 334
F + S VD A AFS+L +S +
Sbjct: 299 ----FDHNYQS------VDTAQ-----------------------AFSSLRVSDS---SG 322
Query: 335 SPWYFDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITAIGDISTS-------LN 387
W D A+ H+ ++ L + G+ + V +G +LPIT +G + S LN
Sbjct: 323 KEWVPDSAATAHVTSSTNNLQAASPYNGSDTVLVGDGAYLPITHVGSTTISSDSGTLPLN 382
Query: 388 DVYVSPGLTSNLIFVGQLVDN-DCRVAFSKSGCLVQDQHSGKLIARGPKVGRLFPLCFSM 446
+V V P + +L+ V +L D+ C V F + + D ++ K++++GP+ L+ L
Sbjct: 383 EVLVCPDIQKSLLSVSKLCDDYPCGVYFDANKVCIIDINTQKVVSKGPRSNGLYVL---- 438
Query: 447 SPCSSLPFVSCHS--ATVVNFELWHKRLGHPNSNVLYELLQSGVLGNKETPSLSTIKFDC 504
+ FV+ +S + E+WH RLGH NS +L +L S + ++ +S + C
Sbjct: 439 ---ENQEFVAFYSNRQCAASEEIWHHRLGHSNSRILQQLKSSKEISFNKS-RMSPV---C 491
Query: 505 TYCKLGKSKILPFPNHQSNISQTFDMIHSDLWGITPVISHANYKYFVTFIDDFSRFTWVY 564
C++GKS L F + S IH DLWG +PV+S +KY+V F+DD+SR++W Y
Sbjct: 492 EPCQMGKSSKLQFFSSNSRELDLLGRIHCDLWGPSPVVSKQGFKYYVVFVDDYSRYSWFY 551
Query: 565 FLRSKDEVFSAFKFFHAYVETQFSSKIKILRSDNGG 600
L++K + F+ F F VE QF++KIK+ +SD GG
Sbjct: 552 PLKAKSDFFAVFVAFQNLVENQFNTKIKVFQSDGGG 587
>emb|CAB81170.1| retrotransposon like protein [Arabidopsis thaliana]
gi|4539447|emb|CAB40035.1| retrotransposon like protein
[Arabidopsis thaliana] gi|7444419|pir||T04204
hypothetical protein T4F9.150 - Arabidopsis thaliana
Length = 1515
Score = 181 bits (459), Expect = 8e-44
Identities = 148/588 (25%), Positives = 252/588 (42%), Gaps = 94/588 (15%)
Query: 45 LDKEKQETEYADWEVKDNQVLAWIIRFVDPNIVLNLRPFSTAAKMWAYLKKIYNQNNTAR 104
+ E+ E+ W D V AWI + + + ++A ++W L + +N+ +T R
Sbjct: 60 IQSEEANQEFLKWTRIDQLVKAWIFGSLSEEALKVVIGLNSAQEVWLGLARRFNRFSTTR 119
Query: 105 RFQLEHDIAIFQQEILSISEFYSQFMNLWADYTDIVYGSVSTEGLTSVQTVHETTK*DQF 164
++ L+ + + ++ + S+ N+ I + E + V
Sbjct: 120 KYDLQKRLGTCSKAGKTMDAYLSEVKNICDQLDSIGFPVTEQEKIFGV------------ 167
Query: 165 LMKLRSDFEGIRTNLMNRATV---PS*DACLNELLREERRLLTLATMEQ---HKSASLPV 218
L L ++E I T + + V P D + +L + +L T + H +
Sbjct: 168 LNGLGKEYESIATVIEHSLDVYPGPCFDDVVYKLTTFDDKLSTYTANSEVTPHLAFYTDK 227
Query: 219 AYVVQEKPRGR-----DLSAVQCFCCKGFGHY----------ASNCPRKSCNYCKKDGHV 263
+Y + R + + +G G + + N + +C C+K GH
Sbjct: 228 SYSSRGNNNSRGGRYGNFRGRGSYSSRGRGFHQQFGSGSNNGSGNGSKPTCQICRKYGHS 287
Query: 264 IKECPIRPPKKNATTFTTSVHSPIAPSFVDIANVQHNAPTPVQALTPEIVEHMIILAFSA 323
+C T P D+ N AF+A
Sbjct: 288 AFKC------------YTRFEENYLPE--DLPN-----------------------AFAA 310
Query: 324 LGISGKHSPNSSPWYFDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITAIGDIS 383
+ +S ++ +S W D A+ H+ N + L N G+ + V NG+ LPIT IG I
Sbjct: 311 MRVSDQNQASSHEWLPDSAATAHITNTTDGLQNSQTYSGDDSVIVGNGDFLPITHIGTIP 370
Query: 384 TS-------LNDVYVSPGLTSNLIFVGQLVDN-DCRVAFSKSGCLVQDQHSGKLIARGPK 435
+ L DV V PG+T +L+ V +L D+ C F +++D+ + +L+ +G K
Sbjct: 371 LNISQGTLPLEDVLVCPGITKSLLSVSKLTDDYPCSFTFDSDSVVIKDKRTQQLLTQGNK 430
Query: 436 VGRLFPLCFSMSPCSSLPFVSCHSATVVNF--ELWHKRLGHPNSNVLYELLQS-GVLGNK 492
L+ L +PF + +S + E+WH+RLGHPN VL L+++ ++ NK
Sbjct: 431 HKGLYVL-------KDVPFQTYYSTRQQSSDDEVWHQRLGHPNKEVLQHLIKTKAIVVNK 483
Query: 493 ETPSLSTIKFDCTYCKLGKSKILPFPNHQSNISQTFDMIHSDLWGITPVISHANYKYFVT 552
+ ++ C C++GK LPF + S+ + IH DLWG PV S ++Y+V
Sbjct: 484 TSSNM------CEACQMGKVCRLPFVASEFVSSRPLERIHCDLWGPAPVTSAQGFQYYVI 537
Query: 553 FIDDFSRFTWVYFLRSKDEVFSAFKFFHAYVETQFSSKIKILRSDNGG 600
FID++SRFTW Y L+ K + FS F F VE Q+ KI + + D GG
Sbjct: 538 FIDNYSRFTWFYPLKLKSDFFSVFVLFQQLVENQYQHKIAMFQCDGGG 585
>ref|XP_475401.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|49328070|gb|AAT58770.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1419
Score = 177 bits (450), Expect = 8e-43
Identities = 106/296 (35%), Positives = 155/296 (51%), Gaps = 25/296 (8%)
Query: 325 GISGKHSPNSSPWYFDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITAIGDIST 384
G S + S W D GAS HM + L + + + +Q A+G LP++ G + T
Sbjct: 476 GCSLSANVTDSCWILDSGASFHMTPDISQLQSCSLTKAS-SVQTADGTILPVSLQGTLQT 534
Query: 385 ---SLNDVYVSPGLTSNLIFVGQLVDNDCRVAFSKSGCLVQDQHSGKLIARGPKVGR--- 438
++ DV+ P L+ LI VGQL D C V F ++ C V D+ +G L+ G ++
Sbjct: 535 KEYTIPDVFYVPNLSMKLISVGQLTDMKCHVVFDEAACYVLDRATGNLVGAGHRLNGPRG 594
Query: 439 ---LFPLCFSMSPCSSLPFVSCHSATVVN-----------FELWHKRLGHPNSNVLYELL 484
L L S S P S + ++ + F WH RLGH + L L+
Sbjct: 595 LYVLDHLHLPTSTSSGFPGNSASATSITSNSSVYSSLSASFPQWHHRLGHLCGSRLSTLV 654
Query: 485 QSGVLGNKETPSLSTIKFDCTYCKLGKSKILPFPNHQSNISQTFDMIHSDLWGITPVISH 544
Q GVLGN S+ T F C CKLGK LP+ + S + F ++HSD+WG P S
Sbjct: 655 QQGVLGNV---SIET-DFVCKGCKLGKQVQLPYRSSMSRSTSPFALVHSDVWGPAPFHSK 710
Query: 545 ANYKYFVTFIDDFSRFTWVYFLRSKDEVFSAFKFFHAYVETQFSSKIKILRSDNGG 600
++Y+V F+DDFSR+TW+YF++ + E++ +K F + V TQFS+ IK RSD+GG
Sbjct: 711 GGHRYYVIFVDDFSRYTWIYFMKHRSELYQVYKSFASMVHTQFSTSIKNFRSDSGG 766
>dbj|BAB01972.1| copia-like retrotransposable element [Arabidopsis thaliana]
Length = 1499
Score = 177 bits (449), Expect = 1e-42
Identities = 173/640 (27%), Positives = 271/640 (42%), Gaps = 92/640 (14%)
Query: 1 MSTEKYEVLLVRLNGKNYSAWAFEFQIFVKGKSLWGHVD-GSTSALDKEKQET---EYAD 56
M T +V+ + NG++Y W + +K + LW ++ G TS E E D
Sbjct: 1 METTMQQVIPI-FNGESYGFWKIKMITILKTRKLWDVIENGVTSNSSPETSPALTRERDD 59
Query: 57 WEVKDNQVLAWIIRFVDPNIVLNLRPFSTAAKMWAYLKKIYNQNNTARRFQLEHDIAIFQ 116
+KD L + V +I + P S+A + W L+ + ++ + L+ ++
Sbjct: 60 QVMKDMMALQILQSAVSDSIFPRIAPASSATEAWNALEMEFQGSSQVKMINLQTLRREYE 119
Query: 117 ----QEILSISEFYSQFMNLWADYTDIVYGSVSTEGLTSVQTVHETTK*DQFLMKLRSDF 172
+E +I++F ++ +NL V E + Q V + L+ + F
Sbjct: 120 NLKMEEGETINDFTTKLINLSNQLR------VHGEEKSDYQVVQK------ILISVPQQF 167
Query: 173 E---GIRTNLMNRATVPS*DACLNELLREERRL------LTLATMEQHKSASLPVAYVVQ 223
+ G+ + +T+ S + L ERRL + K S Q
Sbjct: 168 DSIVGVLEQTKDLSTL-SVTELIGTLKAHERRLNLREDRINEGAFNGEKLGSR--GENKQ 224
Query: 224 EKPRGRDLSAVQCFCCKGFGHYASNCPRKS--------------CNYCKKDGHVIKECPI 269
K R + + C CK H +C RK C C K GH+ ++C +
Sbjct: 225 NKIR-HGKTNMWCGVCKRNNHNEVDCFRKKSESISQRGGSYERRCYVCDKQGHIARDCKL 283
Query: 270 RPPKKNATTFTTSVHSPIAPSFVDIANVQHNAPTPVQALTPEIVEHMIILAFSALGISGK 329
R ++ H I S + + H + FSA+
Sbjct: 284 RKGER--------AHLSIEESEDEKEDECH-------------------MLFSAVEEKEI 316
Query: 330 HSPNSSPWYFDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITAIGDISTSLN-- 387
+ W D G +NHM+ + + ++ + I++ NG + GDI S N
Sbjct: 317 STIGEETWLVDSGCTNHMSKDVRHFIALDRS-KKIIIRIGNGGKVVSEGKGDIRVSTNKG 375
Query: 388 -----DVYVSPGLTSNLIFVGQLVDNDCRVAFSKSGCLVQDQHSGKLIARGPKVGRLFPL 442
DV P L NL+ V Q++ N RV F + C++QD K++ K R FP+
Sbjct: 376 DHVIKDVLYVPELARNLLSVSQMISNGYRVIFEDNKCVIQDLKGRKILDIKMK-DRSFPI 434
Query: 443 CFSMSPCSS-LPFVSCHSATVVNFELWHKRLGHPNSNVLYELLQSGVLGNKETPSLSTIK 501
+ S + + F T +LWHKR GH N + + E +Q+ + K P IK
Sbjct: 435 IWKKSREETYMAFEEKEEQT----DLWHKRFGHVNYDKI-ETMQTLKIVEK-LPKFEVIK 488
Query: 502 FDCTYCKLGKSKILPFPNH-QSNISQTFDMIHSDLWGITPVISHANYKYFVTFIDDFSRF 560
C C++GK FP QSN ++T ++IHSD+ G S +YF+TFIDDFSR
Sbjct: 489 GICAACEMGKQSRRSFPKKSQSNTNKTLELIHSDVCGPMQTESINGSRYFLTFIDDFSRM 548
Query: 561 TWVYFLRSKDEVFSAFKFFHAYVETQFSSKIKILRSDNGG 600
TWVYFL++K EV + FK F YVE Q S+IK LR+D GG
Sbjct: 549 TWVYFLKNKSEVITKFKIFKPYVENQSESRIKRLRTDGGG 588
>gb|AAP53070.1| putative retroelement [Oryza sativa (japonica cultivar-group)]
gi|37532962|ref|NP_920783.1| putative retroelement
[Oryza sativa (japonica cultivar-group)]
gi|21671985|gb|AAM74347.1| Putative retroelement [Oryza
sativa (japonica cultivar-group)]
Length = 1250
Score = 172 bits (435), Expect = 5e-41
Identities = 167/660 (25%), Positives = 265/660 (39%), Gaps = 94/660 (14%)
Query: 9 LLVRLNGKNYSAWAFEFQIFVKGKSLWGHVDGSTSALDKEKQETEYADWEVKDNQVLAWI 68
+ + L N++ W F L H+DG+ + + W D V W+
Sbjct: 39 ITLELKHPNFNKWKTFFTSMCGKFGLLPHIDGTAPPRPDD------STWAQADCCVQGWL 92
Query: 69 IRFVDPNIV-LNLRPFSTAAKMWAYLKKIYNQNNTARRFQLEHDIAIFQQEILSISEFYS 127
V I+ + + TA +W + ++ N R L H+ Q + I++ Y
Sbjct: 93 FGSVSDAILDVVMETDQTARDLWLAIDDLFQANKEPRTIYLSHEFHSMTQGDMPIAD-YC 151
Query: 128 QFMNLWADYTDIVYGSVSTEGLTSVQTVHETTK*DQFLMKLRSDFEGIRTNLMNRATVPS 187
Q + AD V V+ L L L S F N+ + +PS
Sbjct: 152 QKVKTAADALRDVGHPVTESQLVL-----------NLLSGLNSRFSSTADNIASAPVLPS 200
Query: 188 *DACLNELLREERRL-----------LTLATMEQHKSASLPVAYVVQEKPRGRDLSAVQC 236
+ N LL +E R+ + +A + S A + G + S
Sbjct: 201 FASAHNTLLLKELRIANAHKVQAETTMVVAASSANACTSGTCASSSSSQSHGDNNSNGG- 259
Query: 237 FCCKGFGHYASNC--------PRKSCNYCKKDGHVIKE--CPIRPPKKNA---------T 277
G+G++ +N PR + + + +++ P RP T
Sbjct: 260 ----GYGNFGNNFQQQQHQAGPRTTGPWVCFNPWAVQQQQSPWRPSNSAGLLGPYPQAHT 315
Query: 278 TFTTSVHSPIAPSFVDIANVQHNAPTPVQALTPEIVEHMIILAFSALGISGKHSPNSSPW 337
TF SP P P+Q P + +I A + L + + SPW
Sbjct: 316 TFAGPYVSPPMPGL-----------PPMQQSQPNWDQAGLIAALNQLSVQ-----SPSPW 359
Query: 338 YFDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITA-------IGDISTSLNDVY 390
D GA++HM++ L N I V NG+ +P+ IG L ++
Sbjct: 360 VLDTGATSHMSSTDGILDTRLPNSYTF-ITVGNGHTIPVICHGTSFLPIGTTKFDLKNIL 418
Query: 391 VSPGLTSNLIFVGQLV-DNDCRVAFSKSGCLVQDQHSGKLIARGPKVGRLFPLCFSMSPC 449
V+P L NL+ + Q DN+C + F + G V+ + ++I R G L+ L +
Sbjct: 419 VAPSLVRNLLSIRQFTRDNNCSIEFDEFGFSVKGLRTRRVILRCNSRGDLYTLPIAA--- 475
Query: 450 SSLPFVSCHSATVVNFELWHKRLGHPNSNVLYELLQSGVLGNKETPSLSTIKFDCTYCKL 509
P ++ HS + LWH+RLGHP+S + L + +L P C CKL
Sbjct: 476 ---PAIAAHSFLAQSSTLWHRRLGHPSSAAIQTLHKLAIL-----PCTKIDHSLCHACKL 527
Query: 510 GKSKILPFPNHQSNISQTFDMIHSDLWGITPVISHANYKYFVTFIDDFSRFTWVYFLRSK 569
GK LPF QS+ S F+++H D+W +PV+S + +KY++ +DDFS F W + LR K
Sbjct: 528 GKHTRLPFSRSQSSTSSPFELVHCDVW-TSPVLSTSGFKYYLVVLDDFSHFCWTFPLRHK 586
Query: 570 DEVFSAFKFFHAYVETQFSSKIKILRSDNGGGKVHI*FDSRFFENKWYFISKVMSFHSPT 629
+V F AYV TQF S IK ++DN K + D + FIS+ + F T
Sbjct: 587 SDVHQHIVEFVAYVTTQFGSSIKCFQADNASHKGYRCLD---ISTRRVFISRHVVFDEQT 643
>emb|CAB81478.1| putative protein [Arabidopsis thaliana] gi|4972079|emb|CAB43904.1|
putative protein [Arabidopsis thaliana]
gi|7444467|pir||T08945 hypothetical protein F25O24.20 -
Arabidopsis thaliana
Length = 1415
Score = 170 bits (431), Expect = 1e-40
Identities = 104/292 (35%), Positives = 160/292 (54%), Gaps = 27/292 (9%)
Query: 320 AFSALGISGKHSPNSSPWYFDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITAI 379
AF+A+ +S + S+PW D GA++H+ N+ L + G + V N + LPIT I
Sbjct: 279 AFAAMRVSDQ---KSNPWVTDSGATSHITNSTSQLQSAQPYSGEDSVIVGNSDFLPITHI 335
Query: 380 GD-ISTS------LNDVYVSPGLTSNLIFVGQLV-DNDCRVAFSKSGCLVQDQHSGKLIA 431
G + TS L DV V P +T +L+ V +L D C + F G +V+D+ + +L+
Sbjct: 336 GSAVLTSNQGNLPLRDVLVCPNITKSLLSVSKLTSDYPCVIEFDSDGVIVKDKLTKQLLT 395
Query: 432 RGPKVGRLFPLCFSMSPCSSLPFVSCHSAT--VVNFELWHKRLGHPNSNVLYELLQS-GV 488
+G + L+ L + F++C+S+ + E+WH RLGHPN +VL +LL++ +
Sbjct: 396 KGTRHNDLYLL-------ENPKFMACYSSRQQATSDEVWHMRLGHPNQDVLQQLLRNKAI 448
Query: 489 LGNKETPSLSTIKFDCTYCKLGKSKILPFPNHQSNISQTFDMIHSDLWGITPVISHANYK 548
+ +K + SL C C++GK LPF + S+ + +H DLWG PV+S ++
Sbjct: 449 VISKTSHSL------CDACQMGKICKLPFASSDFVSSRLLERVHCDLWGPAPVVSSQGFR 502
Query: 549 YFVTFIDDFSRFTWVYFLRSKDEVFSAFKFFHAYVETQFSSKIKILRSDNGG 600
Y+V FID++SRFTW Y LR K + FS F F VE Q KI + D GG
Sbjct: 503 YYVIFIDNYSRFTWFYPLRLKSDFFSVFLTFQKMVENQCQQKIASFQCDGGG 554
Score = 45.4 bits (106), Expect = 0.006
Identities = 26/126 (20%), Positives = 60/126 (46%), Gaps = 11/126 (8%)
Query: 11 VRLNGKNYSAWAFEFQIFVKGKSLWGHVDGS------TSALDKEKQETE-----YADWEV 59
++L+ NY W +F+ ++ + L G V G+ T ++ Q TE + W
Sbjct: 19 LKLSTANYLLWKIQFETWLNNQRLLGFVTGANPCPNATRSIRNGDQVTEATNPDFLTWVQ 78
Query: 60 KDNQVLAWIIRFVDPNIVLNLRPFSTAAKMWAYLKKIYNQNNTARRFQLEHDIAIFQQEI 119
D +++ W++ + + + ++ T+ ++W L K YN+ + +R+ L+ + +
Sbjct: 79 NDQKIMGWLLGSLSEDALRSVYGLHTSREVWFSLAKKYNRVSASRKSDLQRRLNPVSKNE 138
Query: 120 LSISEF 125
S+ E+
Sbjct: 139 KSMLEY 144
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.331 0.141 0.457
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,217,653,626
Number of Sequences: 2540612
Number of extensions: 50430803
Number of successful extensions: 148222
Number of sequences better than 10.0: 3855
Number of HSP's better than 10.0 without gapping: 691
Number of HSP's successfully gapped in prelim test: 3164
Number of HSP's that attempted gapping in prelim test: 138229
Number of HSP's gapped (non-prelim): 6227
length of query: 733
length of database: 863,360,394
effective HSP length: 136
effective length of query: 597
effective length of database: 517,837,162
effective search space: 309148785714
effective search space used: 309148785714
T: 11
A: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.9 bits)
S2: 79 (35.0 bits)
Lotus: description of TM0016.5