Lotus
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= TM0016.5
         (733 letters)

Database: nr 
           2,540,612 sequences; 863,360,394 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAU89779.1| gag-pol polyprotein-like [Solanum tuberosum]           650  0.0
emb|CAC95126.1| gag-pol polyprotein [Populus deltoides]               316  2e-84
pir||F86470 probable retroelement polyprotein [imported] - Arabi...   233  2e-59
gb|AAD26943.1| putative retroelement pol polyprotein [Arabidopsi...   204  1e-50
emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana]         201  9e-50
gb|AAD25646.1| putative retroelement pol polyprotein [Arabidopsi...   197  8e-49
gb|AAD15534.1| putative retroelement pol polyprotein [Arabidopsi...   197  1e-48
emb|CAA72989.1| unnamed protein product [Brassica oleracea] gi|7...   196  2e-48
gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 proteas...   195  5e-48
gb|AAB61111.1| Strong similarity to Zea mays retrotransposon Hop...   195  5e-48
gb|AAC35532.1| contains similarity to proteases [Arabidopsis tha...   194  7e-48
gb|AAP53905.1| putative pol polyprotein [Oryza sativa (japonica ...   189  4e-46
gb|AAF02855.1| Similar to retrotransposon proteins [Arabidopsis ...   187  1e-45
gb|AAC67200.1| putative retroelement pol polyprotein [Arabidopsi...   182  3e-44
gb|AAK51235.1| polyprotein [Arabidopsis thaliana]                     182  3e-44
emb|CAB81170.1| retrotransposon like protein [Arabidopsis thalia...   181  8e-44
ref|XP_475401.1| putative polyprotein [Oryza sativa (japonica cu...   177  8e-43
dbj|BAB01972.1| copia-like retrotransposable element [Arabidopsi...   177  1e-42
gb|AAP53070.1| putative retroelement [Oryza sativa (japonica cul...   172  5e-41
emb|CAB81478.1| putative protein [Arabidopsis thaliana] gi|49720...   170  1e-40

>gb|AAU89779.1| gag-pol polyprotein-like [Solanum tuberosum]
          Length = 1212

 Score =  650 bits (1677), Expect = 0.0
 Identities = 331/628 (52%), Positives = 440/628 (69%), Gaps = 31/628 (4%)

Query: 1   MSTEKYEVLLVRLNGKNYSAWAFEFQIFVKGKSLWGHVDGSTSALDKEKQETEYADWEVK 60
           M++  +E   VR  GKNYS+W F+FQ+FV GK LWG++DGS  A       T+  +W++K
Sbjct: 1   MNSHHFESFSVRFTGKNYSSWEFQFQLFVTGKELWGYIDGSDPA---PTDATKLGEWKIK 57

Query: 61  DNQVLAWIIRFVDPNIVLNLRPFSTAAKMWAYLKKIYNQNNTARRFQLEHDIAIFQQEIL 120
           D +V+ WI+  +DP IVLNLRP+ T   MW YL+K+YNQ+N+ARRFQLE++IA + Q  L
Sbjct: 58  DARVMTWILGSIDPLIVLNLRPYKTVKAMWDYLQKVYNQDNSARRFQLEYEIANYSQGGL 117

Query: 121 SISEFYSQFMNLWADYTDIVYGSVSTEGLTSVQTVHETTK*DQFLMKLRSDFEGIRTNLM 180
            + +++S F NLWA++TDIVY  + TE L+ +Q VHE +K DQFLMKLRSDFE IR+NLM
Sbjct: 118 FVQDYFSGFQNLWAEFTDIVYAKIPTESLSVIQAVHEQSKRDQFLMKLRSDFESIRSNLM 177

Query: 181 NRATVPS*DACLNELLREERRLLTLATMEQHKSASLPVAYVVQEKPRGRDLSAVQCFCCK 240
           NR   PS D C  ELLREE+RL+T    +  K   + VA+  Q K +GRD+S  QC+ CK
Sbjct: 178 NRDPSPSLDVCFRELLREEQRLVTQNVFK--KENDVTVAFAAQGKGKGRDMSRTQCYSCK 235

Query: 241 GFGHYASNCPRKSCNYCKKDGHVIKECPIRPPKKNATTFTTSVHSPIAPSFVDIANVQHN 300
            +GH ASNC +K  NYCK+ GH+IKECP+RP  +    F   ++     S  D +++   
Sbjct: 236 EYGHIASNCSKKFYNYCKQQGHIIKECPMRPQNRRINAFQARING----STDDNSSLG-- 289

Query: 301 APTPVQALTPEIVEHMIILAFSALGISGKHSPNSSPWYFDYGASNHMANNAEALTNITQN 360
                Q LTPE+V+ MI+ AFSALG+ G +   S+ W  D GASNHM N+   L N+ + 
Sbjct: 290 -----QVLTPEMVQQMIVSAFSALGLQG-NDVTSNFWIVDSGASNHMTNSTSILKNVRKY 343

Query: 361 FGNLKIQVANGNHLPITAIGDISTSLNDVYVSPGLTSNLIFVGQLVDNDCRVAFSKSGCL 420
            G  +IQ+ANG++LPIT +GDI+ +  +V+VSP L+++LI VGQLVDN+C V FS++GCL
Sbjct: 344 QGPSQIQIANGSNLPITKVGDITPTFKNVFVSPKLSTSLISVGQLVDNNCDVNFSRNGCL 403

Query: 421 VQDQHSGKLIARGPKVGRLFPLCFSMSPCSSLPFVSCHSATVVNFELWHKRLGHPNSNVL 480
           VQDQ SG +IA+GPKVGRLFP+ FS+ P  S    S  S T    E+WHKRLGHPNS VL
Sbjct: 404 VQDQVSGTIIAKGPKVGRLFPIHFSIPPVLSFACTSTASKT----EVWHKRLGHPNSVVL 459

Query: 481 YELLQSGVLGNKETPSLSTIKFDCTYCKLGKSKILPFPNHQSNISQTFDMIHSDLWGITP 540
             +  SG+LGNK   S+++I  DC+ CKLGKSK LPFPN  S  ++ FD+IHSD+WGI+P
Sbjct: 460 SHISNSGLLGNKNKFSVASI--DCSTCKLGKSKTLPFPNFGSRATKCFDVIHSDVWGISP 517

Query: 541 VISHANYKYFVTFIDDFSRFTWVYFLRSKDEVFSAFKFFHAYVETQFSSKIKILRSDNGG 600
           +ISHA++KYF+TFIDD+SRFTWVYFLRSK EVFS FK F AY+ETQFS+ IK+LRSD+GG
Sbjct: 518 IISHAHFKYFMTFIDDYSRFTWVYFLRSKSEVFSMFKTFLAYIETQFSTCIKLLRSDSGG 577

Query: 601 GKVHI*FDSRFFENKWYFISK-VMSFHS 627
                  +   +E K + + K ++S HS
Sbjct: 578 -------EYMSYEFKKFLLDKGIVSQHS 598


>emb|CAC95126.1| gag-pol polyprotein [Populus deltoides]
          Length = 1382

 Score =  316 bits (810), Expect = 2e-84
 Identities = 217/628 (34%), Positives = 333/628 (52%), Gaps = 47/628 (7%)

Query: 1   MSTEKYEVLL---VRLNGKNYSAWAFEFQIFVKGKSLWGHVDGSTSALDKEKQETEYAD- 56
           M+TE+ + L    VRL+GKNYS W++  + F+KGK +WG+V G T  + K  +E +    
Sbjct: 1   MATERDDSLQSVSVRLDGKNYSYWSYVMRNFLKGKKMWGYVSG-TYVVPKNTEEGDTVSI 59

Query: 57  --WEVKDNQVLAWIIRFVDPNIVLNLRPFSTAAKMWAYLKKIYNQNNTARRFQLEHDIAI 114
             WE  + +++ WI  +V+ +I   L  + TA ++W +L++++ Q+N A+++QLE+DI  
Sbjct: 60  DTWEANNAKIITWINNYVEHSIGTQLAKYETAKEVWDHLQRLFTQSNFAKQYQLENDIRA 119

Query: 115 FQQEILSISEFYSQFMNLWADYTDIVYGSVSTEGLTSVQTVHETTK*DQFLMKLRSDFEG 174
             Q+ +SI EFYS   +LW      +  SV  +   +     E  +  QFL  LRSDFEG
Sbjct: 120 LHQKNMSIQEFYSAMTDLWDQLA--LTESVELKACGAYIERREQQRLVQFLTALRSDFEG 177

Query: 175 IRTNLMNRATVPS*DACLNELLREERRLLTLATMEQHKSASLPVAYVVQEKP----RGRD 230
           +R ++++R+ +PS D+ ++ELL EE RL + +  +   SAS P    V  KP    + + 
Sbjct: 178 LRGSILHRSPLPSVDSVVSELLAEEIRLQSYSE-KGILSASNPSVLAVPSKPFSNHQNKP 236

Query: 231 LSAV---QCFCCKGFGHYASNCPR-KSCNYCKKDGHVIKECPIRPPK--KNATTFTTSVH 284
            + V   +C  CK  GH+ + CP+ +  N   K G   +    R P+  K     T +V 
Sbjct: 237 YTRVGFDECSFCKQKGHWKAQCPKLRQQNQAWKSGSQSQSNAHRSPQGYKPPHHNTAAVA 296

Query: 285 SPIAPSFVDIANVQHNAPTPVQALTPEIVEHMII--LAFSALGISGKHSPNSSPWYFDYG 342
           SP     +   N          +L P+ +    I  L  S+ GIS       S W  D G
Sbjct: 297 SP---GSITDPNTLAEQFQKFLSLQPQAMSASSIGQLPHSSSGIS------HSEWVLDSG 347

Query: 343 ASNHMANNAEALTNITQNFGNLKIQVANGNHLPITAIGDIST---SLNDVYVSPGLTSNL 399
           AS+HM+ ++ + T+++    ++ +  A+G  +P+  +G + T   SL +VY+ P L  NL
Sbjct: 348 ASHHMSPDSSSFTSVSP-LSSIPVMTADGTPMPLAGVGSVVTLHLSLPNVYLIPKLKLNL 406

Query: 400 IFVGQLVDN-DCRVAFSKSGCLVQDQHSGKLIARGPKVGRLFPLCFSMSPCS------SL 452
             +GQ+ D+ D  V FS S C VQD  S KLI  G +   L+ L     P         L
Sbjct: 407 ASIGQICDSGDYLVMFSGSFCCVQDLQSQKLIGTGRRENGLYILDELKVPVVVAATTVDL 466

Query: 453 PFVSCHSATVVNFELWHKRLGHPNSNVLYELLQSGVLGNKETPSLSTIKFDCTYCKLGKS 512
            F    S +  +F LWH RLGH +S+ L  L  +G LGN +T  +S    DC+ CKL K 
Sbjct: 467 SFFRL-SLSSSSFYLWHSRLGHVSSSRLRFLASTGALGNLKTCDIS----DCSGCKLAKF 521

Query: 513 KILPFPNHQSNISQTFDMIHSDLWGITPVISHANYKYFVTFIDDFSRFTWVYFLRSKDEV 572
             LPF    S  S  FD+IHSD+WG +PV +    +Y+V+FIDD +R+ WVY ++ + E 
Sbjct: 522 SALPFNRSTSVSSSPFDLIHSDVWGPSPVSTKGGSRYYVSFIDDHTRYCWVYLMKHRSEF 581

Query: 573 FSAFKFFHAYVETQFSSKIKILRSDNGG 600
           F  +  F A ++TQ S+ IK  R D GG
Sbjct: 582 FEIYAAFRALIKTQHSAVIKCFRCDLGG 609


>pir||F86470 probable retroelement polyprotein [imported] - Arabidopsis thaliana
           gi|9989049|gb|AAG10812.1| Putative retroelement
           polyprotein [Arabidopsis thaliana]
          Length = 1404

 Score =  233 bits (594), Expect = 2e-59
 Identities = 179/618 (28%), Positives = 285/618 (45%), Gaps = 55/618 (8%)

Query: 1   MSTEKYEVLLVRLNGKNYSAWAFEFQIFVKGKSLWGHVDGSTSAL-DKEKQETEYAD--- 56
           M T +  +  V L G NY  W+   +  + G+ LW HV  S +   DKE++ETE      
Sbjct: 1   METSQKVITTVILQGGNYLTWSRTTKTVLCGRGLWSHVISSQAPKEDKEEEETETISPEE 60

Query: 57  --WEVKDNQVLAWIIRFVDPNIVLNLRPFSTAAKMWAYLKKIY-NQNNTARRFQLEHDIA 113
             W  +D  VLA +   ++ +I+       TA ++W  LK +Y N++N  R F+++  I 
Sbjct: 61  EKWFQEDQAVLALLQNSLETSILEGYSYCETAKELWDTLKNVYGNESNLTRVFEVKKAIN 120

Query: 114 IFQQEILSISEFYSQFMNLWADYTDIVYGSVSTEGLTSVQTVHETTK*DQ---FLMKLRS 170
              QE L  ++ + +F +LW++   +  G++        + +HE  + D+    L+ L  
Sbjct: 121 ELSQEDLEFTKHFGKFRSLWSELKSLRPGTLDP------KILHERREQDKVFGLLLTLNP 174

Query: 171 DFEGIRTNLMNRATVPS*DACLNELLREERRLLTLATMEQHKSASLPVAYVVQEKPRGRD 230
            +  +  +L+    +PS D   +++ +E+          +  +A+       +   +  D
Sbjct: 175 GYNDLIKHLLRSEKLPSLDEVCSKIQKEQGSTGLFGGKSELITANKGEVVANKGVYKNED 234

Query: 231 LSAVQCFCCKGFGHYASNC-----PRKSCNYCKKDGHVIKECPIRPPKKNATTFTTSVHS 285
              + C  CK  GH    C       K   +     H  +E      +  ++   TS   
Sbjct: 235 RKLLTCDHCKKKGHTKDKCWLLHPHLKPAKFKDSRAHFSQETHEEQSQAGSSKGETST-- 292

Query: 286 PIAPSFVDIANVQHNAPTPVQALTPEIVEHMIILAFSALGISGKHSPNSSPWYFDYGASN 345
               SF D         + ++AL   IV      +    GI+     +S     D GAS+
Sbjct: 293 ----SFGDYVR-----KSDLEALIKSIV------SLKESGITFSSQTSSGSIVIDSGASH 337

Query: 346 HMANNAEALTNITQNFGNLKIQVANGNHLPITAIGDISTSLND--VYVSPGLTSNLIFVG 403
           HM +N+  L NI    G+  + +ANG+ +PI  IG++     D   +  P  TSNL+ V 
Sbjct: 338 HMISNSNLLDNIEPALGH--VIIANGDKVPIEGIGNLKLFNKDSKAFFMPKFTSNLLSVK 395

Query: 404 QLV-DNDCRVAFSKSGCLVQDQHSGKLIARGPKVGRLFPLCFSMSPCSSLPFVSCHSATV 462
           +   D +C   F  +    QD  +GK+I  G   G L+ L   +SP SS  F S     +
Sbjct: 396 RTTRDLNCYAIFGPNDVYFQDIETGKVIGEGGSKGELYVL-EDLSPNSSSCFSSKSHLGI 454

Query: 463 VNFELWHKRLGHPNSNVLYELLQSGVLGNKETPSLSTIKFDCTYCKLGKSKILPFPNHQS 522
               LWH RLGHP++  L  +L          P++S     C  C LGK     FP   +
Sbjct: 455 SFNTLWHARLGHPHTRALKLML----------PNISFDHTSCEACILGKHCKSVFPKSLT 504

Query: 523 NISQTFDMIHSDLWGITPVISHANYKYFVTFIDDFSRFTWVYFLRSKDEVFSAFKFFHAY 582
              + FD++HSD+W  +P +S  N KYFVTFI++ S++TW+  L SKD VF AF  F  Y
Sbjct: 505 IYEKCFDLVHSDVW-TSPCVSRDNNKYFVTFINEKSKYTWITLLPSKDRVFEAFTNFETY 563

Query: 583 VETQFSSKIKILRSDNGG 600
           V  QF++KIK+ R+DNGG
Sbjct: 564 VTNQFNAKIKVFRTDNGG 581


>gb|AAD26943.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
           gi|25301694|pir||E84535 probable retroelement pol
           polyprotein [imported] - Arabidopsis thaliana
          Length = 1454

 Score =  204 bits (518), Expect = 1e-50
 Identities = 166/665 (24%), Positives = 289/665 (42%), Gaps = 87/665 (13%)

Query: 12  RLNGKNYSAWAFEFQIFVKGKSLWGHVDGSTSALDKEKQETEYADWEVKDNQVLAWIIRF 71
           RL+  NY  W+    I +  K+  G +DG+ S     + +  +  W   ++ V +W++  
Sbjct: 78  RLDETNYGDWSVAMLISLDAKNKTGFIDGTLSR--PLESDLNFRLWSRCNSMVKSWLLNS 135

Query: 72  VDPNIVLNLRPFSTAAKMWAYLKKIYNQNNTARRFQLEHDIAIFQQEILSISEFYSQFMN 131
           V P I  ++   + A+ +W  L   +N  N  R + L  +I  F+Q  LS+SE+Y++   
Sbjct: 136 VSPQIYRSILRMNDASDIWRDLNSRFNVTNLPRTYNLTQEIQDFRQGTLSLSEYYTRLKT 195

Query: 132 LW--ADYTDIVYGSVSTEGLTSVQTVHETTK*DQFLMKLRSDFEGIRTNLMNRATVPS*D 189
           LW   D T+ +    +      +Q   E  K  +FL  L   +  +R  ++ +  +PS  
Sbjct: 196 LWDQLDSTEALDEPCTCGKAMRLQQKAEQAKIVKFLAGLNESYAIVRRQIIAKKALPS-- 253

Query: 190 ACLNELLREERRLLTLATMEQHKS--ASLPVAYVVQEKPRGRDLSAVQCFCCKGFGHYAS 247
                 L E   +L     +Q  S   + P A+ V E  +   +    C+   G      
Sbjct: 254 ------LGEVYHILDQDNSQQSFSNVVAPPAAFQVSEITQSPSMDPTVCYVQNG-----P 302

Query: 248 NCPRKSCNYCKKDGHVIKECPIR---PP---KKNATTFTTSVHSPIAPSFVDIANVQHNA 301
           N  R  C++  + GH+ + C  +   PP    K           P+A +  + + V  + 
Sbjct: 303 NKGRPICSFYNRVGHIAERCYKKHGFPPGFTPKGKAGEKLQKPKPLAANVAESSEVNTSL 362

Query: 302 PTPVQALTPEIVEHMIIL-------------------------------AFSALGIS--G 328
            + V  L+ E ++  I +                                +S +GI    
Sbjct: 363 ESMVGNLSKEQLQQFIAMFSSQLQNTPPSTYATASTSQSDNLGICFSPSTYSFIGILTVA 422

Query: 329 KHSPNSSPWYFDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITAIGDISTS--- 385
           +H+ +S+ W  D GA++H++++    +++  +  +  + +  G  + I+ +G +  +   
Sbjct: 423 RHTLSSATWVIDSGATHHVSHDRSLFSSLDTSVLSA-VNLPTGPTVKISGVGTLKLNDDI 481

Query: 386 -LNDVYVSPGLTSNLIFVGQLVDN-DCRVAFSKSGCLVQDQHSGKLIARGPKVGRLFPLC 443
            L +V   P    NLI +  L D+   RV F K+ C +QD   G+++ +G +V  L+ L 
Sbjct: 482 LLKNVLFIPEFRLNLISISSLTDDIGSRVIFDKNSCEIQDLIKGRMLGQGRRVANLYLL- 540

Query: 444 FSMSPCSSLPFVSCHSATVVNFELWHKRLGHPNSNVLYELLQS-GVLGNKETPSLSTIKF 502
                   +   S     VV+  +WH+RLGH +   L  +  S G   +K   S      
Sbjct: 541 -------DVGDQSISVNAVVDISMWHRRLGHASLQRLDAISDSLGTTRHKNKGSDF---- 589

Query: 503 DCTYCKLGKSKILPFPNHQSNISQTFDMIHSDLWGITPVISHANYKYFVTFIDDFSRFTW 562
            C  C L K + L FP       + FD++H D+WG   V +   YKYF+T +DD SR TW
Sbjct: 590 -CHVCHLAKQRKLSFPTSNKVCKEIFDLLHIDVWGPFSVETVEGYKYFLTIVDDHSRATW 648

Query: 563 VYFLRSKDEVFSAFKFFHAYVETQFSSKIKILRSDNGGGKVHI*FDSRFFENKWYFISKV 622
           +Y L++K EV + F  F   VE Q+  K+K +RSDN         + +F    +Y    +
Sbjct: 649 MYLLKTKSEVLTVFPAFIQQVENQYKVKVKAVRSDNAP-------ELKF--TSFYAEKGI 699

Query: 623 MSFHS 627
           +SFHS
Sbjct: 700 VSFHS 704


>emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana]
          Length = 1466

 Score =  201 bits (510), Expect = 9e-50
 Identities = 166/633 (26%), Positives = 277/633 (43%), Gaps = 108/633 (17%)

Query: 11  VRLNGKNYSAWAFEFQIFVKGKSLWGHVDGSTSA-----------LDKEKQETEYADWEV 59
           ++LN  NY  W  +F+  +  + L G V+G  +            +  E    +Y DW  
Sbjct: 19  LKLNDSNYLLWKTQFESLLSSQKLIGFVNGVVTPPAQTRLVVNDDVTSEVPNPQYEDWFC 78

Query: 60  KDNQVLAWIIRFVDPNIVLNLRPFSTAAKMWAYLKKIYNQNNTARRFQLEHDIAIFQQEI 119
            D  V +W+   +   ++ ++   +T+ ++W  L + +N+++ AR F L  ++ +  ++ 
Sbjct: 79  TDQLVRSWLFGTLSEEVLGHVHNLTTSRQIWISLAENFNKSSIAREFSLRRNLQLLTKKD 138

Query: 120 LSISEFYSQFMNLWADYTDIVYGSVSTEGLTSVQTVHETTK*DQFLMKLRSDFEGIRTNL 179
            S+S +   F         I+  S+S+ G    + V E+ K   FL  L  +++ I T +
Sbjct: 139 KSLSVYCRDFK--------IICDSLSSIG----KPVEESMKIFGFLNGLGREYDPITTVI 186

Query: 180 ---MNRATVPS*DACLNELLREERRLLT----------LATMEQHKSASLPVAYVVQEKP 226
              +++   P+ +  ++E+   + +L +          LA   +  ++  P  Y    + 
Sbjct: 187 QSSLSKLPAPTFNDVISEVQGFDSKLQSYDDTVSVNPHLAFNTERSNSGAP-QYNSNSRG 245

Query: 227 RGRDLS----AVQCFCCKGFGHYASNCP----RKSCNYCKKDGHVIKECPIRPPKKNATT 278
           RGR              +GF  + S  P    R  C  C + GH   +C  R        
Sbjct: 246 RGRSGQNRGRGGYSTRGRGFSQHQSASPSSGQRPVCQICGRIGHTAIKCYNRFDN----- 300

Query: 279 FTTSVHSPIAPSFVDIANVQHNAPTPVQALTPEIVEHMIILAFSALGISGKHSPNSSPWY 338
                            N Q   PT                AFSAL +S +       WY
Sbjct: 301 -----------------NYQSEVPTQ---------------AFSALRVSDE---TGKEWY 325

Query: 339 FDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITAIGDISTS-------LNDVYV 391
            D  A+ H+  +   L N T   GN  + V +G +LPIT +G  + S       LN+V V
Sbjct: 326 PDSAATAHITASTSGLQNATTYEGNDAVLVGDGTYLPITHVGSTTISSSKGTIPLNEVLV 385

Query: 392 SPGLTSNLIFVGQLVDN-DCRVAFSKSGCLVQDQHSGKLIARGPKVGRLFPLCFSMSPCS 450
            P +  +L+ V +L D+  C V F  +   + D  + K++++GP+   L+ L        
Sbjct: 386 CPAIQKSLLSVSKLCDDYPCGVYFDANKVCIIDLTTQKVVSKGPRNNGLYML-------E 438

Query: 451 SLPFVSCHS--ATVVNFELWHKRLGHPNSNVLYELL-QSGVLGNKETPSLSTIKFDCTYC 507
           +  FV+ +S      + E WH RLGH NS +L +LL +  +  NK   S       C  C
Sbjct: 439 NSEFVALYSNRQCAASMETWHHRLGHSNSKILQQLLTRKEIQVNKSRTSPV-----CEPC 493

Query: 508 KLGKSKILPFPNHQSNISQTFDMIHSDLWGITPVISHANYKYFVTFIDDFSRFTWVYFLR 567
           ++GKS  L F +      +  D +H DLWG +PV+S+  +KY+  F+DDFSRF+W + LR
Sbjct: 494 QMGKSTRLQFFSSDFRALKPLDRVHCDLWGPSPVVSNQGFKYYAVFVDDFSRFSWFFPLR 553

Query: 568 SKDEVFSAFKFFHAYVETQFSSKIKILRSDNGG 600
            K +  S F  +   VE Q  +KIK  +SD GG
Sbjct: 554 MKSKFISVFIAYQKLVENQLGTKIKEFQSDGGG 586


>gb|AAD25646.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
           gi|25301701|pir||E84589 probable retroelement pol
           polyprotein [imported] - Arabidopsis thaliana
          Length = 1461

 Score =  197 bits (502), Expect = 8e-49
 Identities = 166/671 (24%), Positives = 280/671 (40%), Gaps = 92/671 (13%)

Query: 12  RLNGKNYSAWAFEFQIFVKGKSLWGHVDGSTSALDKEKQETEYADWEVKDNQVLAWIIRF 71
           RL+   Y  W+   +I +  K+  G VDGS       + +  +  W   ++ V +W++  
Sbjct: 82  RLDETTYGDWSVAMRISLDAKNKLGFVDGSLPR--PLESDPNFRLWSRCNSMVKSWLLNS 139

Query: 72  VDPNIVLNLRPFSTAAKMWAYLKKIYNQNNTARRFQLEHDIAIFQQEILSISEFYSQFMN 131
           V P I  ++   + A  +W  L   +N  N  R + L  +I   +Q  +S+SE+Y+    
Sbjct: 140 VSPQIYRSILRLNDATDIWRDLFDRFNLTNLPRTYNLTQEIQDLRQGTMSLSEYYTLLKT 199

Query: 132 LW--ADYTDIVYGSVSTEGLTSVQTVHETTK*DQFLMKLRSDFEGIRTNLMNRATVPS*D 189
           LW   D T+ +    +      +    E  K  +FL  L   +  +R  ++ +  +PS  
Sbjct: 200 LWDQLDSTEALDDPCTCGKAVRLYQKAEKAKIMKFLAGLNESYAIVRRQIIAKKALPS-- 257

Query: 190 ACLNELLREERRLLTLATMEQ--HKSASLPVAYVVQEKPRGRDLSAVQCFCCKGFGHYAS 247
                 L E   +L     ++      + P A+ V E       S    +   G      
Sbjct: 258 ------LAEVYHILDQDNSQKGFFNVVAPPAAFQVSEVSHSPITSPEIMYVQSG-----P 306

Query: 248 NCPRKSCNYCKKDGHVIKEC----------------PIRPPKKNATTFTTSVH------- 284
           N  R +C++C + GH+ + C                  +PPK  A     ++        
Sbjct: 307 NKGRPTCSFCNRVGHIAERCYKKHGFPPGFTPKGKSSDKPPKPQAVAAQVTLSPDKMTGQ 366

Query: 285 ---------------------SPIAPSFVD--IANVQHNAPTPVQALTPEIVEHMIILAF 321
                                S + P  V    A+ QH A +        I+       F
Sbjct: 367 LETLAGNFSPDQIQNLIALFSSQLQPQIVSPQTASSQHEASSSQSVAPSGILFSPSTYCF 426

Query: 322 SALGISGKHSPNSSPWYFDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITAIGD 381
             +     +S +S  W  D GA++H++++ +    +  +  +  + +  G ++ I+ +G 
Sbjct: 427 IGILAVSHNSLSSDTWVIDSGATHHVSHDRKLFQTLDTSIVSF-VNLPTGPNVRISGVGT 485

Query: 382 ISTS----LNDVYVSPGLTSNLIFVGQLV-DNDCRVAFSKSGCLVQDQHSGKLIARGPKV 436
           +  +    L +V   P    NLI +  L  D   RV F  S C +QD   G  +  G ++
Sbjct: 486 VLINKDIILQNVLFIPEFRLNLISISSLTTDLGTRVIFDPSCCQIQDLTKGLTLGEGKRI 545

Query: 437 GRLFPLCFSMSPCSSLPFVSCHSATVVNFELWHKRLGHPNSNVLYELLQSGVLGNKETPS 496
           G L+ L  + SP  S+         VV+  +WHKRLGHP+ + L  L  S VLG     +
Sbjct: 546 GNLYVLD-TQSPAISVN-------AVVDVSVWHKRLGHPSFSRLDSL--SEVLGTTRHKN 595

Query: 497 LSTIKFDCTYCKLGKSKILPFPNHQSNISQTFDMIHSDLWGITPVISHANYKYFVTFIDD 556
             +    C  C L K K L FP+  +  + TF+++H D+WG   V +   YKYF+T +DD
Sbjct: 596 KKSAY--CHVCHLAKQKKLSFPSANNICNSTFELLHIDVWGPFSVETVEGYKYFLTIVDD 653

Query: 557 FSRFTWVYFLRSKDEVFSAFKFFHAYVETQFSSKIKILRSDNGGGKVHI*FDSRFFENKW 616
            SR TW+Y L+SK +V + F  F   VE Q+ +++K +RSDN                ++
Sbjct: 654 HSRATWIYLLKSKSDVLTVFPAFIDLVENQYDTRVKSVRSDNA---------KELAFTEF 704

Query: 617 YFISKVMSFHS 627
           Y    ++SFHS
Sbjct: 705 YKAKGIVSFHS 715


>gb|AAD15534.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
           gi|25411300|pir||F84485 probable retroelement pol
           polyprotein [imported] - Arabidopsis thaliana
          Length = 1664

 Score =  197 bits (501), Expect = 1e-48
 Identities = 170/629 (27%), Positives = 278/629 (44%), Gaps = 93/629 (14%)

Query: 1   MSTEKYEVLLVRLNGKNYSAWAFEFQIFVKGKSLWGHV---DGSTSALDKEKQETEYAD- 56
           M   K   + V L G NY  WA   +  +  + LW H+   +  + A  +E  E  +   
Sbjct: 1   MENTKALFVPVTLKGVNYLLWARTTKTTLCSRGLWAHILTSEAPSEATIREGMEIVHVGE 60

Query: 57  --WEVKDNQVLAWIIRFVDPNIVLNLRPFSTAAKMWAYLKKIY-NQNNTARRFQLEHDIA 113
             W  +D  VLA +   ++ +++       TA ++W  L  ++ NQ+N +R F+++  I 
Sbjct: 61  EKWFQEDQSVLALLQNSLEASLLEAYSYCETAKELWETLFNVFGNQSNLSRVFEVKKAIN 120

Query: 114 IFQQEILSISEFYSQFMNLWADYTDIVYGSVSTEGLTSVQTVHETTK*DQFLMKLRSDFE 173
              Q  +  ++ + +F +LWA+   +   ++  + L   +   E  K    L+ L S + 
Sbjct: 121 DLSQGDMEFTQHFGKFRSLWAELEMLRPNTLDPKVLIERR---EQDKVFGLLLTLSSTYN 177

Query: 174 GIRTNLMNRATVPS*DACLNELLREERRLLTLATMEQHKSASLPVAYVVQEKPRGRDLSA 233
            +  +L+    +P+ +   +++ +E+                                  
Sbjct: 178 DLIKHLLRADKLPNLEEVCSQIQKEQAN-------------------------------- 205

Query: 234 VQCFCCKGFGHYASNCPRKSCNYCKKDGHVIKEC-----PIRP----PKKNATT---FTT 281
                 +G   Y +N     C +CK+ GH  ++C      +RP    P+ N  T   F T
Sbjct: 206 ------RGNYKYDNNKKALWCEHCKRSGHTKEKCWTLHPHLRPGRREPRANQVTGENFGT 259

Query: 282 SVHSPIAPSFVDIANVQHNAPTPVQALTPEIVEHMIILAF--SALGISGK--HSPNS-SP 336
              S         +N          A + ++V    + A   +    SGK  H+ +S  P
Sbjct: 260 QEQS-------GTSNQHLGGNGAAMAASSDLVRRSDLKALIKALKESSGKSYHALSSLKP 312

Query: 337 WYFDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITAIGDIST--SLNDVYVSPG 394
              D GAS+HM ++++ ++NI    GN  + +ANG+ +P+  +GD+      +  +  P 
Sbjct: 313 LIIDSGASHHMISDSKLISNIEPALGN--VVIANGDRIPVKGVGDLDLFDKSSKAFYMPT 370

Query: 395 LTSNLIFVGQLV-DNDCRVAFSKSGCLVQDQHSGKLIARGPKVGRLFPLCFSMSPCSSLP 453
            TSNL+ V +   D +C   F  +    QD  + +++ +G     L+ L        S+P
Sbjct: 371 FTSNLLSVKKATTDLNCYAIFGPNEVHFQDIETSRVLGQGVTKDGLYVL---EDTKPSVP 427

Query: 454 FVSCHSATV--VNFELWHKRLGHPNSNVLYELLQSGVLGNKETPSLSTIKFDCTYCKLGK 511
             S  S+ +   N E WH RLGHP+S  L  LL          PS S    +C  C LGK
Sbjct: 428 LSSHFSSILGNANSESWHARLGHPHSRALKLLL----------PSTSFKNDECEACILGK 477

Query: 512 SKILPFPNHQSNISQTFDMIHSDLWGITPVISHANYKYFVTFIDDFSRFTWVYFLRSKDE 571
                FP   +   + FD+IHSD+W  +P +S  N+KYFVTFID+ S+FTW   L SKD 
Sbjct: 478 HCKSVFPKSSTIYEKCFDLIHSDVW-TSPCLSRENHKYFVTFIDEKSKFTWFTLLPSKDR 536

Query: 572 VFSAFKFFHAYVETQFSSKIKILRSDNGG 600
           V  AF  F  YV   + +KIKILRSDN G
Sbjct: 537 VLEAFTNFQTYVTNHYDAKIKILRSDNRG 565


>emb|CAA72989.1| unnamed protein product [Brassica oleracea] gi|7488558|pir||T14517
           hypothetical protein 1 - wild cabbage transposon Melmoth
          Length = 1131

 Score =  196 bits (498), Expect = 2e-48
 Identities = 163/645 (25%), Positives = 277/645 (42%), Gaps = 79/645 (12%)

Query: 7   EVLLVRLNGKNYSAWAFEFQIFVKGKSLWGHVDGSTSALDKEKQETEYADWEVKDNQVLA 66
           +++ ++L+G NY  W    +I +  K+  G VDG+ +  D    +  +  W   ++ V +
Sbjct: 26  QLISLKLDGSNYDDWNAAMKIALDAKNKIGFVDGTLTRPDTS--DPTFRLWSRCNSMVKS 83

Query: 67  WIIRFVDPNIVLNLRPFSTAAKMWAYLKKIYNQNNTARRFQLEHDIAIFQQEILSISEFY 126
           W++  V P I  ++   + AA +W  L   ++  N  R F L  +I   +Q  +S+S++Y
Sbjct: 84  WLLNSVSPQIYRSILRLNDAADIWRDLHGRFHMTNLPRTFNLTQEIQDLKQGSMSLSDYY 143

Query: 127 SQFMNLWADY-------TDIVYGSVSTEGLTSVQTVHETTK*DQFLMKLRSDFEGIRTNL 179
           +    LW +        T  V G+        +Q   +  K  +FL  L   +  IR  +
Sbjct: 144 TTLKTLWDNLESVDEPDTPCVCGNAE-----KLQKKVDRAKIVKFLAGLNDSYAIIRRQI 198

Query: 180 MNRATVPS*DACLNELLREERR-----LLTLATMEQHKSASLPVA-----YVVQEKPRGR 229
           + +  +PS     N L +++ +      +T A     ++   P+A     YV     +GR
Sbjct: 199 IMKKVLPSLVEVYNILDQDDSQKGFSTAITPAAFNVSENVPPPMAEAGICYVQTGPNKGR 258

Query: 230 DLSAVQCFCCKGFGHYASNCPRK---------------SCNYCKKDGHVIKECPIRPPKK 274
            +    C  C   GH A  C +K               S +  +K   V  +    PP  
Sbjct: 259 PI----CSFCNRVGHIAERCYKKHGFPPGFVSKYKSQSSGDRLQKPKQVAAQVSFSPPNS 314

Query: 275 NATTFTT----------------SVHSPIAPSFVDIANVQHNAPTPVQALTPEIVEHMII 318
             +  T                 ++ S   P+    +N   ++  P+      I  +   
Sbjct: 315 GQSPMTMDHLVGNHSKEQLQQFIALFSSQLPNVTMGSNEASSSKQPMD--NSGISFNPTT 372

Query: 319 LAFSALGISGKHSPNSSPWYFDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITA 378
           L F  L    +H+  +  W  D GA++H+ ++    T+I     +  + + NG  + I+ 
Sbjct: 373 LVFIGLLTVSRHTLANETWIIDSGATHHVCHDRSMYTSIDITTTS-NVNLPNGMIVKISG 431

Query: 379 IGDISTS----LNDVYVSPGLTSNLIFVGQLV-DNDCRVAFSKSGCLVQDQHSGKLIARG 433
           +G +  +    L++V   P    NL+ +  L  D   +V F  S C +QD   G  I +G
Sbjct: 432 VGIVQLNEHITLHNVLYIPEFRLNLLSISSLTSDIGSQVIFDVSSCAIQDPTKGWTIGQG 491

Query: 434 PKVGRLFPLCFSMSPCSSLPFVSCHSATVVNFELWHKRLGHPNSNVLYELLQSGVLGNKE 493
            +V  L+ L    SP             VV+  LWHKRLGHP+   L ++  S  LG  +
Sbjct: 492 RRVANLYVLDVKSSPMKI--------NAVVDISLWHKRLGHPSYTRLDKI--SEALGTTK 541

Query: 494 TPSLSTIKFDCTYCKLGKSKILPFPNHQSNISQTFDMIHSDLWGITPVISHANYKYFVTF 553
             +       C  C L K K L + +     + +F ++H D+WG   V +   YKYF+T 
Sbjct: 542 HKNKGDAH--CHVCHLAKQKKLSYSSQNHICTASFQLLHVDVWGPFSVETLEGYKYFLTI 599

Query: 554 IDDFSRFTWVYFLRSKDEVFSAFKFFHAYVETQFSSKIKILRSDN 598
           +DD SR TW+Y L+SK +V   F  F   +ETQ+++KIK +R DN
Sbjct: 600 VDDHSRATWIYLLQSKSDVLHIFPTFVNQIETQYNTKIKSVRRDN 644


>gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 protease homolog from
           Arabidopsis thaliana BAC gb|AF080119 and is a member of
           the reverse transcriptase family PF|00078
           gi|25301706|pir||C86438 hypothetical protein F28K20.17 -
           Arabidopsis thaliana
          Length = 1415

 Score =  195 bits (495), Expect = 5e-48
 Identities = 160/638 (25%), Positives = 274/638 (42%), Gaps = 120/638 (18%)

Query: 11  VRLNGKNYSAWAFEFQIFVKGKSLWGHVDGSTSA-----------LDKEKQETEYADWEV 59
           ++L   NY  W  +F+  +  + L G V+G+ +A           +  E+    Y  W  
Sbjct: 19  LKLTDSNYLLWKTQFESLLSSQKLIGFVNGAVNAPSQSRLVVNGEVTSEEPNPLYESWFC 78

Query: 60  KDNQVLAWIIRFVDPNIVLNLRPFSTAAKMWAYLKKIYNQNNTARRFQLEHDIAIFQQEI 119
            D  V +W+   +   ++ ++   ST+ ++W  L + +N+++ AR F L  ++ +  ++ 
Sbjct: 79  TDQLVRSWLFGTLSEEVLGHVHNLSTSRQIWVSLAENFNKSSVAREFSLRQNLQLLSKKE 138

Query: 120 LSISEFYSQFMNLWADYTDIVYGSVSTEGLTSV-QTVHETTK*DQFLMKLRSDFEGIRTN 178
              S +  +F  +              + L+S+ + V E+ K   FL  L  D++ I T 
Sbjct: 139 KPFSVYCREFKTI-------------CDALSSIGKPVDESMKIFGFLNGLGRDYDPITTV 185

Query: 179 L---MNRATVPS*DACLNELLREERRLLTL---ATMEQH------KSASLPVAYVVQEKP 226
           +   +++   P+ +  ++E+   + +L +    A++  H      +S S    Y   +K 
Sbjct: 186 IQSSLSKLPTPTFNDVVSEVQGFDSKLQSYEEAASVTPHLAFNIERSESGSPQYNPNQKG 245

Query: 227 RGRDLSAVQCFCCKGFGHYAS--------------NCPRKSCNYCKKDGHVIKECPIRPP 272
           RGR          KG G Y++              + PR  C  C + GH   +C  R  
Sbjct: 246 RGRSGQN------KGRGGYSTRGRGFSQHQSSPQVSGPRPVCQICGRTGHTALKCYNR-- 297

Query: 273 KKNATTFTTSVHSPIAPSFVDIANVQHNAPTPVQALTPEIVEHMIILAFSALGISGKHSP 332
                                     +N    +QA             FS L +S     
Sbjct: 298 ------------------------FDNNYQAEIQA-------------FSTLRVS---DD 317

Query: 333 NSSPWYFDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITAIGDISTS------- 385
               W+ D  A+ H+ ++   L + T+  G+  + V +G +LPIT  G  +         
Sbjct: 318 TGKEWHPDSAATAHVTSSTNGLQSATEYEGDDAVLVGDGTYLPITHTGSTTIKSSNGKIP 377

Query: 386 LNDVYVSPGLTSNLIFVGQLVDN-DCRVAFSKSGCLVQDQHSGKLIARGPKVGRLFPLCF 444
           LN+V V P +  +L+ V +L D+  C V F  +   + D  + K++  GP+   L+ L  
Sbjct: 378 LNEVLVVPNIQKSLLSVSKLCDDYPCGVYFDANKVCIIDLQTQKVVTTGPRRNGLYVL-- 435

Query: 445 SMSPCSSLPFVSCHS--ATVVNFELWHKRLGHPNSNVLYELLQSGVLGNKETPSLSTIKF 502
                 +  FV+ +S        E+WH RLGH NS  L  L  S  +   ++ +      
Sbjct: 436 -----ENQEFVALYSNRQCAATEEVWHHRLGHANSKALQHLQNSKAIQINKSRTSPV--- 487

Query: 503 DCTYCKLGKSKILPFPNHQSNISQTFDMIHSDLWGITPVISHANYKYFVTFIDDFSRFTW 562
            C  C++GKS  LPF    S +    D IH DLWG +PV+S+   KY+  F+DD+SR++W
Sbjct: 488 -CEPCQMGKSSRLPFLISDSRVLHPLDRIHCDLWGPSPVVSNQGLKYYAIFVDDYSRYSW 546

Query: 563 VYFLRSKDEVFSAFKFFHAYVETQFSSKIKILRSDNGG 600
            Y L +K E  S F  F   VE Q ++KIK+ +SD GG
Sbjct: 547 FYPLHNKSEFLSVFISFQKLVENQLNTKIKVFQSDGGG 584


>gb|AAB61111.1| Strong similarity to Zea mays retrotransposon Hopscotch polyprotein
           (gb|U12626). [Arabidopsis thaliana]
           gi|25301690|pir||G96722 hypothetical protein F20P5.25
           [imported] - Arabidopsis thaliana
          Length = 1315

 Score =  195 bits (495), Expect = 5e-48
 Identities = 154/579 (26%), Positives = 266/579 (45%), Gaps = 59/579 (10%)

Query: 29  VKGKSLWGHVDGSTSALDKEKQETEYAD-WEVKDNQVLAWIIRFVDPNIVLNLRPFSTAA 87
           ++ K+  G VDGS   + K   +  Y   W   ++ V +W++  V   I  ++  F TAA
Sbjct: 5   IEAKNKLGFVDGS---IPKPDDDDPYCKIWRRCNSMVKSWLLNSVSKEIYTSILYFPTAA 61

Query: 88  KMWAYLKKIYNQNNTARRFQLEHDIAIFQQEILSISEFYSQFMNLWADYTDIVYGSVSTE 147
            +W  L   +++++  R ++L   I   +Q  L +S ++++   LW + T +     + E
Sbjct: 62  AIWKDLYTRFHKSSLPRLYKLRQQIHSLRQGNLDLSSYHTRTQTLWEELTSLQAVPRTVE 121

Query: 148 GLTSVQTVHETTK*DQFLMKLRSDFEGIRTNLMNRATVPS*DACLNELLREE-RRLLTLA 206
            L   +   ET +   FLM L   ++ +R+ ++ + T+PS     N + ++E +R   ++
Sbjct: 122 DLLIER---ETNRVIDFLMGLNDCYDTVRSQILMKKTLPSLSEVFNMIDQDETQRSARIS 178

Query: 207 TMEQHKSASLPVAYVVQEKPRGRDLSAVQCFCCKGFGHYASNCPRKSCNYCKKDGHVIKE 266
           T     S+  PV+    +     D    +               R  C+YC + GHV   
Sbjct: 179 TTPGMTSSVFPVSNQSSQSALNGDTYQKK--------------ERPVCSYCSRPGHVEDT 224

Query: 267 CPIRPPKKNATTFTTSVHSPIAPSF-----VDIANVQHNAPTPVQALTPEIVEHMIILAF 321
           C     K    T   S    + PS      +    V +N       LT   ++ ++    
Sbjct: 225 CY---KKHGYPTSFKSKQKFVKPSISANAAIGSEEVVNNTSVSTGDLTTSQIQQLVSF-- 279

Query: 322 SALGISGKHSPNSSPWYFDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITAIGD 381
               +S K  P S+P   +  + + ++++  + + +    G++ +    G HL       
Sbjct: 280 ----LSSKLQPPSTPVQPEVHSIS-VSSDPSSSSTVCPISGSVHL----GRHL------- 323

Query: 382 ISTSLNDVYVSPGLTSNLIFVGQLVDN-DCRVAFSKSGCLVQDQHSGKLIARGPKVGRLF 440
               LNDV   P    NL+ V  L  +  CR+ F ++ C++QD     ++  G +V  L+
Sbjct: 324 ---ILNDVLFIPQFKFNLLSVSSLTKSMGCRIWFDETSCVLQDATRELMVGMGKQVANLY 380

Query: 441 PLCF-SMSPCSSLPFVSCHSATVVNFELWHKRLGHPNSNVLYELLQSGVLGNKETPSLST 499
            +   S+S   +   ++   A+V + +LWHKRLGHP+   L  +  S +L   +  +   
Sbjct: 381 IVDLDSLSHPGTDSSITV--ASVTSHDLWHKRLGHPSVQKLQPM--SSLLSFPKQKN--N 434

Query: 500 IKFDCTYCKLGKSKILPFPNHQSNISQTFDMIHSDLWGITPVISHANYKYFVTFIDDFSR 559
             F C  C + K K LPF +H +  S+ FD+IH D WG   V +H  Y+YF+T +DD+SR
Sbjct: 435 TDFHCRVCHISKQKHLPFVSHNNKSSRPFDLIHIDTWGPFSVQTHDGYRYFLTIVDDYSR 494

Query: 560 FTWVYFLRSKDEVFSAFKFFHAYVETQFSSKIKILRSDN 598
            TWVY LR+K +V +    F   VE QF + IK +RSDN
Sbjct: 495 ATWVYLLRNKSDVLTVIPTFVTMVENQFETTIKGVRSDN 533


>gb|AAC35532.1| contains similarity to proteases [Arabidopsis thaliana]
           gi|7444456|pir||T01908 hypothetical protein T12H20.12 -
           Arabidopsis thaliana
          Length = 1392

 Score =  194 bits (494), Expect = 7e-48
 Identities = 159/637 (24%), Positives = 272/637 (41%), Gaps = 105/637 (16%)

Query: 7   EVLLVRLNGKNYSAWAFEFQIFVKGKSLWGHVDGSTSA-----------LDKEKQETEYA 55
           +V+ ++L   NY  W  +F+ ++    L G V G+T             +  E+   E+ 
Sbjct: 14  QVVTLKLTPTNYLLWKTQFESYLSSHLLLGFVTGATPRPASTIIVTKDDIQSEEANQEFL 73

Query: 56  DWEVKDNQVLAWIIRFVDPNIVLNLRPFSTAAKMWAYLKKIYNQNNTARRFQLEHDIAIF 115
            W   D  V AWI   +    +  +   ++A ++W  L + +N+ +T R++ L+  +   
Sbjct: 74  KWTRIDQLVKAWIFGSLSEEALKVVIGLNSAQEVWLGLARRFNRFSTTRKYDLQKRLGTC 133

Query: 116 QQEILSISEFYSQFMNLWADYTDIVYGSVSTEGLTSVQTVHETTK*DQFLMKLRSDFEGI 175
            +   ++  + S+  N+      I +     E +  V            L  L  ++E I
Sbjct: 134 SKAGKTMDAYLSEVKNICDQLDSIGFPVTEQEKIFGV------------LNGLGKEYESI 181

Query: 176 RTNLMNRATV---PS*DACLNELLREERRLLTLATMEQ---HKSASLPVAYVVQEKPRGR 229
            T + +   V   P  D  + +L   + +L T     +   H +     +Y  +     R
Sbjct: 182 ATVIEHSLDVYPGPCFDDVVYKLTTFDDKLSTYTANSEVTPHLAFYTDKSYSSRGNNNSR 241

Query: 230 -----DLSAVQCFCCKGFGHY----------ASNCPRKSCNYCKKDGHVIKECPIRPPKK 274
                +      +  +G G +          + N  + +C  C+K GH   +C       
Sbjct: 242 GGRYGNFRGRGSYSSRGRGFHQQFGSGSNNGSGNGSKPTCQICRKYGHSAFKC------- 294

Query: 275 NATTFTTSVHSPIAPSFVDIANVQHNAPTPVQALTPEIVEHMIILAFSALGISGKHSPNS 334
                 T       P   D+ N                       AF+A+ +S ++  +S
Sbjct: 295 -----YTRFEENYLPE--DLPN-----------------------AFAAMRVSDQNQASS 324

Query: 335 SPWYFDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITAIGDISTS-------LN 387
             W  D  A+ H+ N  + L N     G+  + V NG+ LPIT IG I  +       L 
Sbjct: 325 HEWLPDSAATAHITNTTDGLQNSQTYSGDDSVIVGNGDFLPITHIGTIPLNISQGTLPLE 384

Query: 388 DVYVSPGLTSNLIFVGQLVDN-DCRVAFSKSGCLVQDQHSGKLIARGPKVGRLFPLCFSM 446
           DV V PG+T +L+ V +L D+  C   F     +++D+ + +L+ +G K   L+ L    
Sbjct: 385 DVLVCPGITKSLLSVSKLTDDYPCSFTFDSDSVVIKDKRTQQLLTQGNKHKGLYVL---- 440

Query: 447 SPCSSLPFVSCHSATVVNF--ELWHKRLGHPNSNVLYELLQS-GVLGNKETPSLSTIKFD 503
                +PF + +S    +   E+WH+RLGHPN  VL  L+++  ++ NK + ++      
Sbjct: 441 ---KDVPFQTYYSTRQQSSDDEVWHQRLGHPNKEVLQHLIKTKAIVVNKTSSNM------ 491

Query: 504 CTYCKLGKSKILPFPNHQSNISQTFDMIHSDLWGITPVISHANYKYFVTFIDDFSRFTWV 563
           C  C++GK   LPF   +   S+  + IH DLWG  PV S   ++Y+V FID++SRFTW 
Sbjct: 492 CEACQMGKVCRLPFVASEFVSSRPLERIHCDLWGPAPVTSAQGFQYYVIFIDNYSRFTWF 551

Query: 564 YFLRSKDEVFSAFKFFHAYVETQFSSKIKILRSDNGG 600
           Y L+ K + FS F  F   VE Q+  KI + + D GG
Sbjct: 552 YPLKLKSDFFSVFVLFQQLVENQYQHKIAMFQCDGGG 588


>gb|AAP53905.1| putative pol polyprotein [Oryza sativa (japonica cultivar-group)]
           gi|37534632|ref|NP_921618.1| putative pol polyprotein
           [Oryza sativa (japonica cultivar-group)]
          Length = 1688

 Score =  189 bits (479), Expect = 4e-46
 Identities = 107/277 (38%), Positives = 154/277 (54%), Gaps = 14/277 (5%)

Query: 334 SSPWYFDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITAIGDIST---SLNDVY 390
           S PW  D GAS HM+ +   LT+         +  ANG    +T  G IS+   ++ +V 
Sbjct: 181 SQPWILDSGASFHMSFDDSWLTSCRLVKNGATVHTANGTLCKVTHQGSISSPQFTVPNVS 240

Query: 391 VSPGLTSNLIFVGQLVDNDCRVAFSKSGCLVQDQHSGKLIARGPKVGR---LFPLCFSMS 447
           + P L+ NLI VGQL D +C V F  + C VQD+H+G +I  G +  R   L+ L     
Sbjct: 241 LVPKLSMNLISVGQLTDTNCFVGFDDTSCFVQDRHTGAVIGTGHRQKRSCGLYILDSLSL 300

Query: 448 PCSSLPFVSCHS----ATVVNFELWHKRLGHPNSNVLYELLQSGVLGNKETPSLSTIKFD 503
           P SS    S +S        +F  WH RLGH   + L  L+  GVLG+    +     F 
Sbjct: 301 PSSSTNTPSVYSPMCSTACKSFPQWHHRLGHLCGSRLATLINQGVLGSVPVDTT----FV 356

Query: 504 CTYCKLGKSKILPFPNHQSNISQTFDMIHSDLWGITPVISHANYKYFVTFIDDFSRFTWV 563
           C  CKLGK   LP+P+  S  S+ FD++HSD+WG +P  S   + Y+V F+DD+SR+TW+
Sbjct: 357 CKGCKLGKQVQLPYPSSTSRSSRPFDLVHSDVWGKSPFPSKGGHNYYVIFVDDYSRYTWI 416

Query: 564 YFLRSKDEVFSAFKFFHAYVETQFSSKIKILRSDNGG 600
           YF++ + ++ S ++ F   + TQFSS I+I RSD+GG
Sbjct: 417 YFMKHRSQLISIYQSFAQMIHTQFSSAIRIFRSDSGG 453


>gb|AAF02855.1| Similar to retrotransposon proteins [Arabidopsis thaliana]
           gi|25301689|pir||C96578 hypothetical protein T18A20.5
           [imported] - Arabidopsis thaliana
          Length = 1522

 Score =  187 bits (475), Expect = 1e-45
 Identities = 170/633 (26%), Positives = 261/633 (40%), Gaps = 104/633 (16%)

Query: 11  VRLNGKNYSAWAFEFQIFVKGKSLWGHVDGSTSA-----------LDKEKQETEYADWEV 59
           V LN +NY  W  +F+ F+ G+ L G V GS SA           +  E+   E+  W  
Sbjct: 17  VTLNQQNYILWKSQFESFLSGQGLLGFVTGSISAPAQTRSVTHNNVTSEEPNPEFYTWHQ 76

Query: 60  KDNQVLAWIIRFVDPNIVLNLRPFSTAAKMWAYLKKIYNQNNTARRFQLEHDIAIFQQEI 119
            D  V +W++     +I+  +    T+ ++W  L   +N+ +++R F+L+  +   +++ 
Sbjct: 77  TDQVVKSWLLGSFAEDILSVVVNCFTSHQVWLTLANHFNRVSSSRLFELQRRLQTLEKKD 136

Query: 120 LSISEFYSQFMNLWADYTDIVYGSVSTEGLTSVQT-VHETTK*DQFLMKLRSDFEGIRTN 178
            ++  F     ++              + L SV + V E  K    L  L  ++E I+T 
Sbjct: 137 NTMEVFLKDLKHI-------------CDQLASVGSPVPEKMKIFSALNGLGREYEPIKTT 183

Query: 179 LMNRATVP---S*DACLNELLREERRL---LTLATMEQHKSASLPVA----YVVQEKPRG 228
           + N        S D   ++L   + RL   +T  T+  H + ++  +    Y    + +G
Sbjct: 184 IENSVDSNPSLSLDEVASKLRGYDDRLQSYVTEPTISPHVAFNVTHSDSGYYHNNNRGKG 243

Query: 229 RDLSAV--QCFCCKGFGHYASNCPRKS---------CNYCKKDGHVIKECPIRPPKKNAT 277
           R  S      F  +G G +    P            C  C K GH   +C  R       
Sbjct: 244 RSNSGSGKSSFSTRGRGFHQQISPTSGSQAGNSGLVCQICGKAGHHALKCWHR------- 296

Query: 278 TFTTSVHSPIAPSFVDIANVQHNAPTPVQALTPEIVEHMIILAFSALGISGKHSPNSSPW 337
            F  S      P                             +A + + I+     +   W
Sbjct: 297 -FDNSYQHEDLP-----------------------------MALATMRITDVTDHHGHEW 326

Query: 338 YFDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITAIGDISTS-------LNDVY 390
             D  AS H+ NN   L       G+  I VA+GN LPIT  G  S +       L +V 
Sbjct: 327 IPDSAASAHVTNNRHVLQQSQPYHGSDSIMVADGNFLPITHTGSGSIASSSGKIPLKEVL 386

Query: 391 VSPGLTSNLIFVGQLV-DNDCRVAFSKSGCLVQDQHSGKLIARGPKVGRLFPLCFSMSPC 449
           V P +  +L+ V +L  D  C V F      + D+ + KL+  G     L+ L       
Sbjct: 387 VCPDIVKSLLSVSKLTSDYPCSVEFDADSVRINDKATKKLLVMGRNRDGLYSL-----EE 441

Query: 450 SSLPFVSCHSATVVNFELWHKRLGHPNSNVLYELLQSG--VLGNKETPSLSTIKFDCTYC 507
             L  +        + E+WH+RLGH N+ VL++L  S   ++ NK       +K  C  C
Sbjct: 442 PKLQVLYSTRQNSASSEVWHRRLGHANAEVLHQLASSKSIIIINK------VVKTVCEAC 495

Query: 508 KLGKSKILPFPNHQSNISQTFDMIHSDLWGITPVISHANYKYFVTFIDDFSRFTWVYFLR 567
            LGKS  LPF     N S+  + IH DLWG +P  S   ++Y+V FID +SRFTW Y L+
Sbjct: 496 HLGKSTRLPFMLSTFNASRPLERIHCDLWGPSPTSSVQGFRYYVVFIDHYSRFTWFYPLK 555

Query: 568 SKDEVFSAFKFFHAYVETQFSSKIKILRSDNGG 600
            K + FS F  F   VE Q   KIKI + D GG
Sbjct: 556 LKSDFFSTFVMFQKLVENQLGHKIKIFQCDGGG 588


>gb|AAC67200.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
           gi|25301693|pir||F84480 probable retroelement pol
           polyprotein [imported] - Arabidopsis thaliana
          Length = 1402

 Score =  182 bits (462), Expect = 3e-44
 Identities = 161/641 (25%), Positives = 265/641 (41%), Gaps = 114/641 (17%)

Query: 11  VRLNGKNYSAWAFEFQIFVKGKSLWGHV----------------DGSTSALDKEKQETEY 54
           V L  KNY  W  +F+ F+ G+ L G V                DGSTSA        EY
Sbjct: 17  VTLTAKNYILWKSQFESFLDGQGLLGFVTGSIPAPSQTSVVSDIDGSTSA----SPNPEY 72

Query: 55  ADWEVKDNQVLAWIIRFVDPNIVLNLRPFSTAAKMWAYLKKIYNQNNTARRFQLEHDIAI 114
             W   D  V +W++     +I+  +   +T+ ++W  +   +N+ +++R F+L+  +  
Sbjct: 73  YTWFKTDRVVKSWLLGSFLEDILSVVVNCNTSHEVWISVANHFNRVSSSRLFELQRRLQN 132

Query: 115 FQQEILSISEFYSQFMNLWADYTDIVYGSVSTEGLTSVQTVHETTK*DQFLMKLRSDFEG 174
             +   S+ E+      +      +  GS  TE +          K    L  L  ++E 
Sbjct: 133 VSKRDKSMDEYLKDLKTICDQLASV--GSPVTEKM----------KIFAALNGLGREYEP 180

Query: 175 IRT---NLMNRATVPS*DACLNELLREERRL---LTLATMEQHKSASLPVA--------Y 220
           I+T   N M+    PS +  + +L   + RL   L    +  H + ++  +        +
Sbjct: 181 IKTTIENSMDALPGPSLEDVIPKLTGYDDRLQGYLEETAVSPHVAFNITTSDDSNASGYF 240

Query: 221 VVQEKPRGRDLSAVQCFCCKGFGHYASNCPRKS------------CNYCKKDGHVIKECP 268
               + +G+       F  +G G +       S            C  C K GH   +C 
Sbjct: 241 NAYNRGKGKSNRGRNSFSTRGRGFHQQISSTNSSSGSQSGGTSVVCQICGKMGHPALKCW 300

Query: 269 IRPPKKNATTFTTSVHSPIAPSFVDIANVQHNAPTPVQALTPEIVEHMIILAFSALGISG 328
            R        F  S      P                              A +A+ I+ 
Sbjct: 301 HR--------FNNSYQYEELPR-----------------------------ALAAMRITD 323

Query: 329 KHSPNSSPWYFDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITAIGDISTS--- 385
               + + W  D  A+ H+ N+  +L       G+  + VA+GN LPIT  G  + +   
Sbjct: 324 ITDQHGNEWLPDSAATAHVTNSPRSLQQSQPYHGSDAVMVADGNFLPITHTGSTNLASSS 383

Query: 386 ----LNDVYVSPGLTSNLIFVGQLV-DNDCRVAFSKSGCLVQDQHSGKLIARGPKVGRLF 440
               L DV V P +T +L+ V +L  D  C V F   G  + D+ + KL+  G     L+
Sbjct: 384 GNVPLTDVLVCPSITKSLLSVSKLTQDYPCTVEFDSDGVRINDKATKKLLIMGSTCDGLY 443

Query: 441 PLCFSMSPCSSLPFVSCHSATVVNFELWHKRLGHPNSNVLYELLQSGVLG-NKETPSLST 499
            L           F S    +  + E+WH+RLGHP+  VL +L+++  +  NK + SL  
Sbjct: 444 CL---KDDSQFKAFFSTRQQSASD-EVWHRRLGHPHPQVLQQLVKTNSISINKTSKSL-- 497

Query: 500 IKFDCTYCKLGKSKILPFPNHQSNISQTFDMIHSDLWGITPVISHANYKYFVTFIDDFSR 559
               C  C+LGKS  LPF +     ++  + +H DLWG +P+ S   ++Y+  FID +SR
Sbjct: 498 ----CEACQLGKSTRLPFVSSSFTSNRPLERVHCDLWGPSPITSVQGFRYYAVFIDHYSR 553

Query: 560 FTWVYFLRSKDEVFSAFKFFHAYVETQFSSKIKILRSDNGG 600
           F+W+Y L+ K + ++ F  FH  VE Q + KI + + D GG
Sbjct: 554 FSWIYPLKLKSDFYNIFVAFHKLVENQLNHKISVFQCDGGG 594


>gb|AAK51235.1| polyprotein [Arabidopsis thaliana]
          Length = 1453

 Score =  182 bits (462), Expect = 3e-44
 Identities = 155/636 (24%), Positives = 277/636 (43%), Gaps = 113/636 (17%)

Query: 11  VRLNGKNYSAWAFEFQIFVKGKSLWGHVDGS-----------TSALDKEKQETEYADWEV 59
           ++LN  NY  W  +F+  +    L G V+G            T     +    +Y  W  
Sbjct: 19  LKLNDSNYLLWKTQFESLLSCHKLIGFVNGGITPPPRTLNVVTGDTSVDVANPQYESWFC 78

Query: 60  KDNQVLAWIIRFVDPNIVLNLRPFSTAAKMWAYLKKIYNQNNTARRFQLEHDIAIFQQEI 119
            D  + +W+   +   ++  +    T+  +W  L + +N+++ AR F L   + +  ++ 
Sbjct: 79  TDQLIRSWLFGTLSEEVLGYVHNLQTSRDIWISLAENFNKSSVAREFTLRRTLQLLSKKD 138

Query: 120 LSISEFYSQFMNLWADYTDIVYGSVSTEGLTSV-QTVHETTK*DQFLMKLRSDFEGIRTN 178
            ++S +  +F+ +              + L+S+ + V E+ K   FL  L  +++ I T 
Sbjct: 139 KTLSAYCREFIAV-------------CDALSSIGKPVDESMKIFGFLNGLGREYDPITTV 185

Query: 179 LMNRATVPS*DACLNELLREERRLLTLATMEQHKSASLPVAYVVQEKP------------ 226
           + +  +  S     + +   +   + L + E+  +A+  +A+  Q               
Sbjct: 186 IQSSLSKISPPTFRDVISEVKGFDVKLQSYEESVTANPHMAFNTQRSEYTDNYTSGNRGK 245

Query: 227 --------RGRDLSAVQCFCCKGFGHYASNC----PRKSCNYCKKDGHVIKECPIRPPKK 274
                   RGR   + +    +GF  + +N      R  C  C + GH   +C  R    
Sbjct: 246 GRGGYGQNRGRSGYSTRG---RGFSQHQTNSNNTGERPVCQICGRTGHTALKCYNR---- 298

Query: 275 NATTFTTSVHSPIAPSFVDIANVQHNAPTPVQALTPEIVEHMIILAFSALGISGKHSPNS 334
               F  +  S      VD A                        AFS+L +S     + 
Sbjct: 299 ----FDHNYQS------VDTAQ-----------------------AFSSLRVSDS---SG 322

Query: 335 SPWYFDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITAIGDISTS-------LN 387
             W  D  A+ H+ ++   L   +   G+  + V +G +LPIT +G  + S       LN
Sbjct: 323 KEWVPDSAATAHVTSSTNNLQAASPYNGSDTVLVGDGAYLPITHVGSTTISSDSGTLPLN 382

Query: 388 DVYVSPGLTSNLIFVGQLVDN-DCRVAFSKSGCLVQDQHSGKLIARGPKVGRLFPLCFSM 446
           +V V P +  +L+ V +L D+  C V F  +   + D ++ K++++GP+   L+ L    
Sbjct: 383 EVLVCPDIQKSLLSVSKLCDDYPCGVYFDANKVCIIDINTQKVVSKGPRSNGLYVL---- 438

Query: 447 SPCSSLPFVSCHS--ATVVNFELWHKRLGHPNSNVLYELLQSGVLGNKETPSLSTIKFDC 504
               +  FV+ +S      + E+WH RLGH NS +L +L  S  +   ++  +S +   C
Sbjct: 439 ---ENQEFVAFYSNRQCAASEEIWHHRLGHSNSRILQQLKSSKEISFNKS-RMSPV---C 491

Query: 505 TYCKLGKSKILPFPNHQSNISQTFDMIHSDLWGITPVISHANYKYFVTFIDDFSRFTWVY 564
             C++GKS  L F +  S        IH DLWG +PV+S   +KY+V F+DD+SR++W Y
Sbjct: 492 EPCQMGKSSKLQFFSSNSRELDLLGRIHCDLWGPSPVVSKQGFKYYVVFVDDYSRYSWFY 551

Query: 565 FLRSKDEVFSAFKFFHAYVETQFSSKIKILRSDNGG 600
            L++K + F+ F  F   VE QF++KIK+ +SD GG
Sbjct: 552 PLKAKSDFFAVFVAFQNLVENQFNTKIKVFQSDGGG 587


>emb|CAB81170.1| retrotransposon like protein [Arabidopsis thaliana]
           gi|4539447|emb|CAB40035.1| retrotransposon like protein
           [Arabidopsis thaliana] gi|7444419|pir||T04204
           hypothetical protein T4F9.150 - Arabidopsis thaliana
          Length = 1515

 Score =  181 bits (459), Expect = 8e-44
 Identities = 148/588 (25%), Positives = 252/588 (42%), Gaps = 94/588 (15%)

Query: 45  LDKEKQETEYADWEVKDNQVLAWIIRFVDPNIVLNLRPFSTAAKMWAYLKKIYNQNNTAR 104
           +  E+   E+  W   D  V AWI   +    +  +   ++A ++W  L + +N+ +T R
Sbjct: 60  IQSEEANQEFLKWTRIDQLVKAWIFGSLSEEALKVVIGLNSAQEVWLGLARRFNRFSTTR 119

Query: 105 RFQLEHDIAIFQQEILSISEFYSQFMNLWADYTDIVYGSVSTEGLTSVQTVHETTK*DQF 164
           ++ L+  +    +   ++  + S+  N+      I +     E +  V            
Sbjct: 120 KYDLQKRLGTCSKAGKTMDAYLSEVKNICDQLDSIGFPVTEQEKIFGV------------ 167

Query: 165 LMKLRSDFEGIRTNLMNRATV---PS*DACLNELLREERRLLTLATMEQ---HKSASLPV 218
           L  L  ++E I T + +   V   P  D  + +L   + +L T     +   H +     
Sbjct: 168 LNGLGKEYESIATVIEHSLDVYPGPCFDDVVYKLTTFDDKLSTYTANSEVTPHLAFYTDK 227

Query: 219 AYVVQEKPRGR-----DLSAVQCFCCKGFGHY----------ASNCPRKSCNYCKKDGHV 263
           +Y  +     R     +      +  +G G +          + N  + +C  C+K GH 
Sbjct: 228 SYSSRGNNNSRGGRYGNFRGRGSYSSRGRGFHQQFGSGSNNGSGNGSKPTCQICRKYGHS 287

Query: 264 IKECPIRPPKKNATTFTTSVHSPIAPSFVDIANVQHNAPTPVQALTPEIVEHMIILAFSA 323
             +C             T       P   D+ N                       AF+A
Sbjct: 288 AFKC------------YTRFEENYLPE--DLPN-----------------------AFAA 310

Query: 324 LGISGKHSPNSSPWYFDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITAIGDIS 383
           + +S ++  +S  W  D  A+ H+ N  + L N     G+  + V NG+ LPIT IG I 
Sbjct: 311 MRVSDQNQASSHEWLPDSAATAHITNTTDGLQNSQTYSGDDSVIVGNGDFLPITHIGTIP 370

Query: 384 TS-------LNDVYVSPGLTSNLIFVGQLVDN-DCRVAFSKSGCLVQDQHSGKLIARGPK 435
            +       L DV V PG+T +L+ V +L D+  C   F     +++D+ + +L+ +G K
Sbjct: 371 LNISQGTLPLEDVLVCPGITKSLLSVSKLTDDYPCSFTFDSDSVVIKDKRTQQLLTQGNK 430

Query: 436 VGRLFPLCFSMSPCSSLPFVSCHSATVVNF--ELWHKRLGHPNSNVLYELLQS-GVLGNK 492
              L+ L         +PF + +S    +   E+WH+RLGHPN  VL  L+++  ++ NK
Sbjct: 431 HKGLYVL-------KDVPFQTYYSTRQQSSDDEVWHQRLGHPNKEVLQHLIKTKAIVVNK 483

Query: 493 ETPSLSTIKFDCTYCKLGKSKILPFPNHQSNISQTFDMIHSDLWGITPVISHANYKYFVT 552
            + ++      C  C++GK   LPF   +   S+  + IH DLWG  PV S   ++Y+V 
Sbjct: 484 TSSNM------CEACQMGKVCRLPFVASEFVSSRPLERIHCDLWGPAPVTSAQGFQYYVI 537

Query: 553 FIDDFSRFTWVYFLRSKDEVFSAFKFFHAYVETQFSSKIKILRSDNGG 600
           FID++SRFTW Y L+ K + FS F  F   VE Q+  KI + + D GG
Sbjct: 538 FIDNYSRFTWFYPLKLKSDFFSVFVLFQQLVENQYQHKIAMFQCDGGG 585


>ref|XP_475401.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
           gi|49328070|gb|AAT58770.1| putative polyprotein [Oryza
           sativa (japonica cultivar-group)]
          Length = 1419

 Score =  177 bits (450), Expect = 8e-43
 Identities = 106/296 (35%), Positives = 155/296 (51%), Gaps = 25/296 (8%)

Query: 325 GISGKHSPNSSPWYFDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITAIGDIST 384
           G S   +   S W  D GAS HM  +   L + +    +  +Q A+G  LP++  G + T
Sbjct: 476 GCSLSANVTDSCWILDSGASFHMTPDISQLQSCSLTKAS-SVQTADGTILPVSLQGTLQT 534

Query: 385 ---SLNDVYVSPGLTSNLIFVGQLVDNDCRVAFSKSGCLVQDQHSGKLIARGPKVGR--- 438
              ++ DV+  P L+  LI VGQL D  C V F ++ C V D+ +G L+  G ++     
Sbjct: 535 KEYTIPDVFYVPNLSMKLISVGQLTDMKCHVVFDEAACYVLDRATGNLVGAGHRLNGPRG 594

Query: 439 ---LFPLCFSMSPCSSLPFVSCHSATVVN-----------FELWHKRLGHPNSNVLYELL 484
              L  L    S  S  P  S  + ++ +           F  WH RLGH   + L  L+
Sbjct: 595 LYVLDHLHLPTSTSSGFPGNSASATSITSNSSVYSSLSASFPQWHHRLGHLCGSRLSTLV 654

Query: 485 QSGVLGNKETPSLSTIKFDCTYCKLGKSKILPFPNHQSNISQTFDMIHSDLWGITPVISH 544
           Q GVLGN    S+ T  F C  CKLGK   LP+ +  S  +  F ++HSD+WG  P  S 
Sbjct: 655 QQGVLGNV---SIET-DFVCKGCKLGKQVQLPYRSSMSRSTSPFALVHSDVWGPAPFHSK 710

Query: 545 ANYKYFVTFIDDFSRFTWVYFLRSKDEVFSAFKFFHAYVETQFSSKIKILRSDNGG 600
             ++Y+V F+DDFSR+TW+YF++ + E++  +K F + V TQFS+ IK  RSD+GG
Sbjct: 711 GGHRYYVIFVDDFSRYTWIYFMKHRSELYQVYKSFASMVHTQFSTSIKNFRSDSGG 766


>dbj|BAB01972.1| copia-like retrotransposable element [Arabidopsis thaliana]
          Length = 1499

 Score =  177 bits (449), Expect = 1e-42
 Identities = 173/640 (27%), Positives = 271/640 (42%), Gaps = 92/640 (14%)

Query: 1   MSTEKYEVLLVRLNGKNYSAWAFEFQIFVKGKSLWGHVD-GSTSALDKEKQET---EYAD 56
           M T   +V+ +  NG++Y  W  +    +K + LW  ++ G TS    E       E  D
Sbjct: 1   METTMQQVIPI-FNGESYGFWKIKMITILKTRKLWDVIENGVTSNSSPETSPALTRERDD 59

Query: 57  WEVKDNQVLAWIIRFVDPNIVLNLRPFSTAAKMWAYLKKIYNQNNTARRFQLEHDIAIFQ 116
             +KD   L  +   V  +I   + P S+A + W  L+  +  ++  +   L+     ++
Sbjct: 60  QVMKDMMALQILQSAVSDSIFPRIAPASSATEAWNALEMEFQGSSQVKMINLQTLRREYE 119

Query: 117 ----QEILSISEFYSQFMNLWADYTDIVYGSVSTEGLTSVQTVHETTK*DQFLMKLRSDF 172
               +E  +I++F ++ +NL           V  E  +  Q V +       L+ +   F
Sbjct: 120 NLKMEEGETINDFTTKLINLSNQLR------VHGEEKSDYQVVQK------ILISVPQQF 167

Query: 173 E---GIRTNLMNRATVPS*DACLNELLREERRL------LTLATMEQHKSASLPVAYVVQ 223
           +   G+     + +T+ S    +  L   ERRL      +        K  S       Q
Sbjct: 168 DSIVGVLEQTKDLSTL-SVTELIGTLKAHERRLNLREDRINEGAFNGEKLGSR--GENKQ 224

Query: 224 EKPRGRDLSAVQCFCCKGFGHYASNCPRKS--------------CNYCKKDGHVIKECPI 269
            K R    + + C  CK   H   +C RK               C  C K GH+ ++C +
Sbjct: 225 NKIR-HGKTNMWCGVCKRNNHNEVDCFRKKSESISQRGGSYERRCYVCDKQGHIARDCKL 283

Query: 270 RPPKKNATTFTTSVHSPIAPSFVDIANVQHNAPTPVQALTPEIVEHMIILAFSALGISGK 329
           R  ++         H  I  S  +  +  H                   + FSA+     
Sbjct: 284 RKGER--------AHLSIEESEDEKEDECH-------------------MLFSAVEEKEI 316

Query: 330 HSPNSSPWYFDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITAIGDISTSLN-- 387
            +     W  D G +NHM+ +      + ++   + I++ NG  +     GDI  S N  
Sbjct: 317 STIGEETWLVDSGCTNHMSKDVRHFIALDRS-KKIIIRIGNGGKVVSEGKGDIRVSTNKG 375

Query: 388 -----DVYVSPGLTSNLIFVGQLVDNDCRVAFSKSGCLVQDQHSGKLIARGPKVGRLFPL 442
                DV   P L  NL+ V Q++ N  RV F  + C++QD    K++    K  R FP+
Sbjct: 376 DHVIKDVLYVPELARNLLSVSQMISNGYRVIFEDNKCVIQDLKGRKILDIKMK-DRSFPI 434

Query: 443 CFSMSPCSS-LPFVSCHSATVVNFELWHKRLGHPNSNVLYELLQSGVLGNKETPSLSTIK 501
            +  S   + + F      T    +LWHKR GH N + + E +Q+  +  K  P    IK
Sbjct: 435 IWKKSREETYMAFEEKEEQT----DLWHKRFGHVNYDKI-ETMQTLKIVEK-LPKFEVIK 488

Query: 502 FDCTYCKLGKSKILPFPNH-QSNISQTFDMIHSDLWGITPVISHANYKYFVTFIDDFSRF 560
             C  C++GK     FP   QSN ++T ++IHSD+ G     S    +YF+TFIDDFSR 
Sbjct: 489 GICAACEMGKQSRRSFPKKSQSNTNKTLELIHSDVCGPMQTESINGSRYFLTFIDDFSRM 548

Query: 561 TWVYFLRSKDEVFSAFKFFHAYVETQFSSKIKILRSDNGG 600
           TWVYFL++K EV + FK F  YVE Q  S+IK LR+D GG
Sbjct: 549 TWVYFLKNKSEVITKFKIFKPYVENQSESRIKRLRTDGGG 588


>gb|AAP53070.1| putative retroelement [Oryza sativa (japonica cultivar-group)]
           gi|37532962|ref|NP_920783.1| putative retroelement
           [Oryza sativa (japonica cultivar-group)]
           gi|21671985|gb|AAM74347.1| Putative retroelement [Oryza
           sativa (japonica cultivar-group)]
          Length = 1250

 Score =  172 bits (435), Expect = 5e-41
 Identities = 167/660 (25%), Positives = 265/660 (39%), Gaps = 94/660 (14%)

Query: 9   LLVRLNGKNYSAWAFEFQIFVKGKSLWGHVDGSTSALDKEKQETEYADWEVKDNQVLAWI 68
           + + L   N++ W   F        L  H+DG+      +      + W   D  V  W+
Sbjct: 39  ITLELKHPNFNKWKTFFTSMCGKFGLLPHIDGTAPPRPDD------STWAQADCCVQGWL 92

Query: 69  IRFVDPNIV-LNLRPFSTAAKMWAYLKKIYNQNNTARRFQLEHDIAIFQQEILSISEFYS 127
              V   I+ + +    TA  +W  +  ++  N   R   L H+     Q  + I++ Y 
Sbjct: 93  FGSVSDAILDVVMETDQTARDLWLAIDDLFQANKEPRTIYLSHEFHSMTQGDMPIAD-YC 151

Query: 128 QFMNLWADYTDIVYGSVSTEGLTSVQTVHETTK*DQFLMKLRSDFEGIRTNLMNRATVPS 187
           Q +   AD    V   V+   L               L  L S F     N+ +   +PS
Sbjct: 152 QKVKTAADALRDVGHPVTESQLVL-----------NLLSGLNSRFSSTADNIASAPVLPS 200

Query: 188 *DACLNELLREERRL-----------LTLATMEQHKSASLPVAYVVQEKPRGRDLSAVQC 236
             +  N LL +E R+           + +A    +   S   A     +  G + S    
Sbjct: 201 FASAHNTLLLKELRIANAHKVQAETTMVVAASSANACTSGTCASSSSSQSHGDNNSNGG- 259

Query: 237 FCCKGFGHYASNC--------PRKSCNYCKKDGHVIKE--CPIRPPKKNA---------T 277
               G+G++ +N         PR +  +   +   +++   P RP              T
Sbjct: 260 ----GYGNFGNNFQQQQHQAGPRTTGPWVCFNPWAVQQQQSPWRPSNSAGLLGPYPQAHT 315

Query: 278 TFTTSVHSPIAPSFVDIANVQHNAPTPVQALTPEIVEHMIILAFSALGISGKHSPNSSPW 337
           TF     SP  P              P+Q   P   +  +I A + L +      + SPW
Sbjct: 316 TFAGPYVSPPMPGL-----------PPMQQSQPNWDQAGLIAALNQLSVQ-----SPSPW 359

Query: 338 YFDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITA-------IGDISTSLNDVY 390
             D GA++HM++    L     N     I V NG+ +P+         IG     L ++ 
Sbjct: 360 VLDTGATSHMSSTDGILDTRLPNSYTF-ITVGNGHTIPVICHGTSFLPIGTTKFDLKNIL 418

Query: 391 VSPGLTSNLIFVGQLV-DNDCRVAFSKSGCLVQDQHSGKLIARGPKVGRLFPLCFSMSPC 449
           V+P L  NL+ + Q   DN+C + F + G  V+   + ++I R    G L+ L  +    
Sbjct: 419 VAPSLVRNLLSIRQFTRDNNCSIEFDEFGFSVKGLRTRRVILRCNSRGDLYTLPIAA--- 475

Query: 450 SSLPFVSCHSATVVNFELWHKRLGHPNSNVLYELLQSGVLGNKETPSLSTIKFDCTYCKL 509
              P ++ HS    +  LWH+RLGHP+S  +  L +  +L     P        C  CKL
Sbjct: 476 ---PAIAAHSFLAQSSTLWHRRLGHPSSAAIQTLHKLAIL-----PCTKIDHSLCHACKL 527

Query: 510 GKSKILPFPNHQSNISQTFDMIHSDLWGITPVISHANYKYFVTFIDDFSRFTWVYFLRSK 569
           GK   LPF   QS+ S  F+++H D+W  +PV+S + +KY++  +DDFS F W + LR K
Sbjct: 528 GKHTRLPFSRSQSSTSSPFELVHCDVW-TSPVLSTSGFKYYLVVLDDFSHFCWTFPLRHK 586

Query: 570 DEVFSAFKFFHAYVETQFSSKIKILRSDNGGGKVHI*FDSRFFENKWYFISKVMSFHSPT 629
            +V      F AYV TQF S IK  ++DN   K +   D      +  FIS+ + F   T
Sbjct: 587 SDVHQHIVEFVAYVTTQFGSSIKCFQADNASHKGYRCLD---ISTRRVFISRHVVFDEQT 643


>emb|CAB81478.1| putative protein [Arabidopsis thaliana] gi|4972079|emb|CAB43904.1|
           putative protein [Arabidopsis thaliana]
           gi|7444467|pir||T08945 hypothetical protein F25O24.20 -
           Arabidopsis thaliana
          Length = 1415

 Score =  170 bits (431), Expect = 1e-40
 Identities = 104/292 (35%), Positives = 160/292 (54%), Gaps = 27/292 (9%)

Query: 320 AFSALGISGKHSPNSSPWYFDYGASNHMANNAEALTNITQNFGNLKIQVANGNHLPITAI 379
           AF+A+ +S +    S+PW  D GA++H+ N+   L +     G   + V N + LPIT I
Sbjct: 279 AFAAMRVSDQ---KSNPWVTDSGATSHITNSTSQLQSAQPYSGEDSVIVGNSDFLPITHI 335

Query: 380 GD-ISTS------LNDVYVSPGLTSNLIFVGQLV-DNDCRVAFSKSGCLVQDQHSGKLIA 431
           G  + TS      L DV V P +T +L+ V +L  D  C + F   G +V+D+ + +L+ 
Sbjct: 336 GSAVLTSNQGNLPLRDVLVCPNITKSLLSVSKLTSDYPCVIEFDSDGVIVKDKLTKQLLT 395

Query: 432 RGPKVGRLFPLCFSMSPCSSLPFVSCHSAT--VVNFELWHKRLGHPNSNVLYELLQS-GV 488
           +G +   L+ L        +  F++C+S+     + E+WH RLGHPN +VL +LL++  +
Sbjct: 396 KGTRHNDLYLL-------ENPKFMACYSSRQQATSDEVWHMRLGHPNQDVLQQLLRNKAI 448

Query: 489 LGNKETPSLSTIKFDCTYCKLGKSKILPFPNHQSNISQTFDMIHSDLWGITPVISHANYK 548
           + +K + SL      C  C++GK   LPF +     S+  + +H DLWG  PV+S   ++
Sbjct: 449 VISKTSHSL------CDACQMGKICKLPFASSDFVSSRLLERVHCDLWGPAPVVSSQGFR 502

Query: 549 YFVTFIDDFSRFTWVYFLRSKDEVFSAFKFFHAYVETQFSSKIKILRSDNGG 600
           Y+V FID++SRFTW Y LR K + FS F  F   VE Q   KI   + D GG
Sbjct: 503 YYVIFIDNYSRFTWFYPLRLKSDFFSVFLTFQKMVENQCQQKIASFQCDGGG 554



 Score = 45.4 bits (106), Expect = 0.006
 Identities = 26/126 (20%), Positives = 60/126 (46%), Gaps = 11/126 (8%)

Query: 11  VRLNGKNYSAWAFEFQIFVKGKSLWGHVDGS------TSALDKEKQETE-----YADWEV 59
           ++L+  NY  W  +F+ ++  + L G V G+      T ++    Q TE     +  W  
Sbjct: 19  LKLSTANYLLWKIQFETWLNNQRLLGFVTGANPCPNATRSIRNGDQVTEATNPDFLTWVQ 78

Query: 60  KDNQVLAWIIRFVDPNIVLNLRPFSTAAKMWAYLKKIYNQNNTARRFQLEHDIAIFQQEI 119
            D +++ W++  +  + + ++    T+ ++W  L K YN+ + +R+  L+  +    +  
Sbjct: 79  NDQKIMGWLLGSLSEDALRSVYGLHTSREVWFSLAKKYNRVSASRKSDLQRRLNPVSKNE 138

Query: 120 LSISEF 125
            S+ E+
Sbjct: 139 KSMLEY 144


  Database: nr
    Posted date:  Jul 5, 2005 12:34 AM
  Number of letters in database: 863,360,394
  Number of sequences in database:  2,540,612
  
Lambda     K      H
   0.331    0.141    0.457 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,217,653,626
Number of Sequences: 2540612
Number of extensions: 50430803
Number of successful extensions: 148222
Number of sequences better than 10.0: 3855
Number of HSP's better than 10.0 without gapping: 691
Number of HSP's successfully gapped in prelim test: 3164
Number of HSP's that attempted gapping in prelim test: 138229
Number of HSP's gapped (non-prelim): 6227
length of query: 733
length of database: 863,360,394
effective HSP length: 136
effective length of query: 597
effective length of database: 517,837,162
effective search space: 309148785714
effective search space used: 309148785714
T: 11
A: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.9 bits)
S2: 79 (35.0 bits)


Lotus: description of TM0016.5