Lotus
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= TM0331c.2
         (1291 letters)

Database: sprot 
           164,201 sequences; 59,974,054 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran...   206  4e-52
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III    202  5e-51
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran...   191  1e-47
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran...   189  3e-47
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran...   188  9e-47
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei...   175  6e-43
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei...   172  7e-42
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei...   172  7e-42
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran...   149  4e-35
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro...   132  5e-30
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro...   132  6e-30
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro...   130  2e-29
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro...   129  5e-29
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro...   127  2e-28
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot...   122  5e-27
YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II     114  2e-24
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot...   112  9e-24
POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;...    95  1e-18
POL_SOCMV (P15629) Enzymatic polyprotein [Contains: Aspartic pro...    87  4e-16
RRPO_OENBE (P31843) RNA-directed DNA polymerase homolog (Reverse...    86  9e-16

>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
            transposon 17.6 [Contains: Protease (EC 3.4.23.-);
            Reverse transcriptase (EC 2.7.7.49); Endonuclease]
          Length = 1058

 Score =  206 bits (523), Expect = 4e-52
 Identities = 134/385 (34%), Positives = 201/385 (51%), Gaps = 10/385 (2%)

Query: 816  VQQEVDKLLAAEFIREVKYPTWLANVVIVKKANG----KWRMCVDYTDLNKACPKDSYPL 871
            V+ ++  +L    IR    P      V+ KK +     K+R+ +DY  LN+    D +P+
Sbjct: 223  VESQIQDMLNQGIIRTSNSPYNSPIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDRHPI 282

Query: 872  PSIDSLVDGASGNELLSLMDAYSGNHQIRMHPADEDKTAFMTARANYCYRTMPFGLKNAG 931
            P++D ++         + +D   G HQI M P    KTAF T   +Y Y  MPFGLKNA 
Sbjct: 283  PNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKNAP 342

Query: 932  ATYQRLMDRVFAGQVGRNMEVYVDDMIVKSVRGLDHHQDLEEAFGEIRKHNMRLNPEKCY 991
            AT+QR M+ +    + ++  VY+DD+IV S    +H Q L   F ++ K N++L  +KC 
Sbjct: 343  ATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDKCE 402

Query: 992  FGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEVQRLTGRIAALSRFLPKSGDR 1051
            F  Q   FLG ++T  GI+ NPEK +AIQ+   P+  KE++   G      +F+P   D 
Sbjct: 403  FLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTGYYRKFIPNFADI 462

Query: 1052 SFPFFKCLRKNVAFEWT-AECEEAFVRLKELLSSPPILSKPIQGHPLHLYFAVSDGALSS 1110
            + P  KCL+KN+  + T  E + AF +LK L+S  PIL  P       L    SD AL +
Sbjct: 463  AKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLTTDASDVALGA 522

Query: 1111 VMLQEIDGEHRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRLRPYFQSFPVKVRTD-L 1169
            V+ Q  DG H + Y +S TL   E+ Y  IEK  LA++   +  R Y      ++ +D  
Sbjct: 523  VLSQ--DG-HPLSY-ISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGRHFEISSDHQ 578

Query: 1170 PLRQVLQKPDLSGRLVAWSVELSEY 1194
            PL  + +  D + +L  W V+LSE+
Sbjct: 579  PLSWLYRMKDPNSKLTRWRVKLSEF 603


>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
          Length = 2186

 Score =  202 bits (514), Expect = 5e-51
 Identities = 122/390 (31%), Positives = 204/390 (52%), Gaps = 9/390 (2%)

Query: 816  VQQEVDKLLAAEFIREVKYPTWLANVVIVKKANGKWRMCVDYTDLNKACPKDSYPLPSID 875
            +++ + K+L  + IRE K P W + VV+VKK +G  RMC+DY  +NK    +++PLP+I+
Sbjct: 958  IRKMIQKMLNQKVIRESKSP-WSSPVVLVKKKDGSIRMCIDYRKVNKVVKNNAHPLPNIE 1016

Query: 876  SLVDGASGNELLSLMDAYSGNHQIRMHPADEDKTAFMTARANYCYRTMPFGLKNAGATYQ 935
            + +   +G +L ++ D  +G  QI +    ++ TAF      + +  +PFGL  + A +Q
Sbjct: 1017 ATLQSLAGKKLYTVFDMIAGFWQIPLDEKSKEITAFAIGSELFEWNVLPFGLVISPALFQ 1076

Query: 936  RLMDRVFAGQVGRNMEVYVDDMIVKSVRGLDHHQDLEEAFGEIRKHNMRLNPEKCYFGVQ 995
              M+ +    +G    VYVDD+++ S     H QD++EA   IRK  M+L   KC+   +
Sbjct: 1077 GTMEEIIGDLLGVCAFVYVDDLLIASKDMEQHLQDVKEALTRIRKSGMKLRASKCHIAKK 1136

Query: 996  GGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEVQRLTGRIAALSRFLPKSGDRSFPF 1055
              ++LG  +T  G+E    K   ++Q   P+NVKE+Q   G +    +F+      +   
Sbjct: 1137 EVEYLGHKVTLDGVETQEVKTDKMKQFSRPTNVKELQSFLGLVGYYRKFILNFAQIASSL 1196

Query: 1056 FKCLRKNVAFEWTAECEEAFVRLKELLSSPPILSKP-----IQG-HPLHLYFAVSDGALS 1109
               +   VA+ W  E E AF  LK+L+   P+L++P     ++G  P  +Y   S   + 
Sbjct: 1197 TSLISAKVAWIWEKEQEIAFQELKKLVCQTPVLAQPDVEAALKGDRPFMIYTDASRKGIG 1256

Query: 1110 SVMLQE-IDGEHRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRLRPYFQSFPVKVRTD 1168
            +V+ QE  DG+   + F S  L  AE RY   +  ALA++   RR +       + V TD
Sbjct: 1257 AVLAQEGPDGQQHPIAFASKALSPAETRYHITDLEALAMMFALRRFKTIIYGTAITVFTD 1316

Query: 1169 -LPLRQVLQKPDLSGRLVAWSVELSEYSLQ 1197
              PL  +L+   L+ RL  WS+E+ E+ ++
Sbjct: 1317 HKPLISLLKGSPLADRLWRWSIEILEFDVK 1346


>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
            transposon opus [Contains: Protease (EC 3.4.23.-);
            Reverse transcriptase (EC 2.7.7.49); Endonuclease]
          Length = 1003

 Score =  191 bits (484), Expect = 1e-47
 Identities = 124/399 (31%), Positives = 203/399 (50%), Gaps = 17/399 (4%)

Query: 816  VQQEVDKLLAAEFIREVKYPTWLANVVIVKKA--NGK--WRMCVDYTDLNKACPKDSYPL 871
            V++++D+LL    IR    P      ++ KK   NG+  +RM VD+  LN     D+YP+
Sbjct: 139  VERQIDELLQDGIIRPSNSPYNSPIWIVPKKPKPNGEKQYRMVVDFKRLNTVTIPDTYPI 198

Query: 872  PSIDSLVDGASGNELLSLMDAYSGNHQIRMHPADEDKTAFMTARANYCYRTMPFGLKNAG 931
            P I++ +      +  + +D  SG HQI M  +D  KTAF T    Y +  +PFGLKNA 
Sbjct: 199  PDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPFGLKNAP 258

Query: 932  ATYQRLMDRVFAGQVGRNMEVYVDDMIVKSVRGLDHHQDLEEAFGEIRKHNMRLNPEKCY 991
            A +QR++D +    +G+   VY+DD+IV S     H ++L      + K N+++N EK +
Sbjct: 259  AIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNLEKSH 318

Query: 992  FGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEVQRLTGRIAALSRFLPKSGDR 1051
            F     +FLG+++T+ GI+ +P+K +AI +M  P++VKE++R  G  +   +F+      
Sbjct: 319  FLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYRKFIQDYAKV 378

Query: 1052 SFPFFKCLR-----------KNVAFEWTAECEEAFVRLKELLSSPPILSKPIQGHPLHLY 1100
            + P     R             V         ++F  LK +L S  IL+ P    P HL 
Sbjct: 379  AKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKPFHLT 438

Query: 1101 FAVSDGALSSVMLQEIDGEHRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRLRPY-FQ 1159
               S+ A+ +V+ Q+  G  R + ++S +L   E  Y  IEK  LA++ +   LR Y + 
Sbjct: 439  TDASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLDNLRAYLYG 498

Query: 1160 SFPVKVRTD-LPLRQVLQKPDLSGRLVAWSVELSEYSLQ 1197
            +  +KV TD  PL   L   + + +L  W   + EY+ +
Sbjct: 499  AGTIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCE 537


>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
            transposon 297 [Contains: Protease (EC 3.4.23.-); Reverse
            transcriptase (EC 2.7.7.49); Endonuclease]
          Length = 1059

 Score =  189 bits (481), Expect = 3e-47
 Identities = 124/390 (31%), Positives = 190/390 (47%), Gaps = 10/390 (2%)

Query: 816  VQQEVDKLLAAEFIREVKYPTWLANVVIVKKANG----KWRMCVDYTDLNKACPKDSYPL 871
            V+ +V ++L    IRE   P      V+ KK +     K+R+ +DY  LN+    D YP+
Sbjct: 222  VENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVVIDYRKLNEITIPDRYPI 281

Query: 872  PSIDSLVDGASGNELLSLMDAYSGNHQIRMHPADEDKTAFMTARANYCYRTMPFGLKNAG 931
            P++D ++      +  + +D   G HQI M      KTAF T   +Y Y  MPFGL+NA 
Sbjct: 282  PNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTKSGHYEYLRMPFGLRNAP 341

Query: 932  ATYQRLMDRVFAGQVGRNMEVYVDDMIVKSVRGLDHHQDLEEAFGEIRKHNMRLNPEKCY 991
            AT+QR M+ +    + ++  VY+DD+I+ S    +H   ++  F ++   N++L  +KC 
Sbjct: 342  ATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLVFTKLADANLKLQLDKCE 401

Query: 992  FGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEVQRLTGRIAALSRFLPKSGDR 1051
            F  +   FLG ++T  GI+ NP K KAI     P+  KE++   G      +F+P   D 
Sbjct: 402  FLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAFLGLTGYYRKFIPNYADI 461

Query: 1052 SFPFFKCLRKNVAFE-WTAECEEAFVRLKELLSSPPILSKPIQGHPLHLYFAVSDGALSS 1110
            + P   CL+K    +    E  EAF +LK L+   PIL  P       L    S+ AL +
Sbjct: 462  AKPMTSCLKKRTKIDTQKLEYIEAFEKLKALIIRDPILQLPDFEKKFVLTTDASNLALGA 521

Query: 1111 VMLQEIDGEHRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRLRPYFQSFPVKVRTD-L 1169
            V+ Q        + F+S TL   E+ Y  IEK  LA++   +  R Y       + +D  
Sbjct: 522  VLSQ----NGHPISFISRTLNDHELNYSAIEKELLAIVWATKTFRHYLLGRQFLIASDHQ 577

Query: 1170 PLRQVLQKPDLSGRLVAWSVELSEYSLQYD 1199
            PLR +    +   +L  W V LSEY  + D
Sbjct: 578  PLRWLHNLKEPGAKLERWRVRLSEYQFKID 607


>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
            transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
            transcriptase (EC 2.7.7.49); Endonuclease]
          Length = 1237

 Score =  188 bits (477), Expect = 9e-47
 Identities = 129/444 (29%), Positives = 207/444 (46%), Gaps = 11/444 (2%)

Query: 759  ETRLTKLLGENLDLFAWSCKDMPGIDPNFICHRLALNPSLKPVS*LRRRLGGDKGKAVQQ 818
            +++L  +  E +D+FA   +  P    N    +L L    +PV     R    + + +Q 
Sbjct: 276  KSQLENICSEYIDIFALESE--PITVNNLYKQQLRLKDD-EPVYTKNYRSPHSQVEEIQA 332

Query: 819  EVDKLLAAEFIREVKYPTWLANVVIVKKANG------KWRMCVDYTDLNKACPKDSYPLP 872
            +V KL+  + + E     + + +++V K +       KWR+ +DY  +NK    D +PLP
Sbjct: 333  QVQKLIKDKIV-EPSVSQYNSPLLLVPKKSSPNSDKKKWRLVIDYRQINKKLLADKFPLP 391

Query: 873  SIDSLVDGASGNELLSLMDAYSGNHQIRMHPADEDKTAFMTARANYCYRTMPFGLKNAGA 932
             ID ++D     +  S +D  SG HQI +     D T+F T+  +Y +  +PFGLK A  
Sbjct: 392  RIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTSNGSYRFTRLPFGLKIAPN 451

Query: 933  TYQRLMDRVFAGQVGRNMEVYVDDMIVKSVRGLDHHQDLEEAFGEIRKHNMRLNPEKCYF 992
            ++QR+M   F+G       +Y+DD+IV         ++L E FG+ R++N++L+PEKC F
Sbjct: 452  SFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEVFGKCREYNLKLHPEKCSF 511

Query: 993  GVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEVQRLTGRIAALSRFLPKSGDRS 1052
             +    FLG   T +GI  + +K   IQ    P +    +R         RF+    D S
Sbjct: 512  FMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRFVAFCNYYRRFIKNFADYS 571

Query: 1053 FPFFKCLRKNVAFEWTAECEEAFVRLKELLSSPPILSKPIQGHPLHLYFAVSDGALSSVM 1112
                +  +KNV FEWT EC++AF+ LK  L +P +L  P       +    S  A  +V+
Sbjct: 572  RHITRLCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSKEFCITTDASKQACGAVL 631

Query: 1113 LQEIDGEHRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRLRPYFQSFPVKVRTD-LPL 1171
             Q  +G    V + S      E      E+   A+       RPY       V+TD  PL
Sbjct: 632  TQNHNGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYIYGKHFTVKTDHRPL 691

Query: 1172 RQVLQKPDLSGRLVAWSVELSEYS 1195
              +    + S +L    +EL EY+
Sbjct: 692  TYLFSMVNPSSKLTRIRLELEEYN 715


>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
            type 1
          Length = 1333

 Score =  175 bits (444), Expect = 6e-43
 Identities = 116/406 (28%), Positives = 202/406 (49%), Gaps = 8/406 (1%)

Query: 812  KGKAVQQEVDKLLAAEFIREVKYPTWLANVVIVKKANGKWRMCVDYTDLNKACPKDSYPL 871
            K +A+  E+++ L +  IRE K       V+ V K  G  RM VDY  LNK    + YPL
Sbjct: 424  KMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPL 482

Query: 872  PSIDSLVDGASGNELLSLMDAYSGNHQIRMHPADEDKTAFMTARANYCYRTMPFGLKNAG 931
            P I+ L+    G+ + + +D  S  H IR+   DE K AF   R  + Y  MP+G+  A 
Sbjct: 483  PLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAP 542

Query: 932  ATYQRLMDRVFAGQVGRNMEVYVDDMIVKSVRGLDHHQDLEEAFGEIRKHNMRLNPEKCY 991
            A +Q  ++ +       ++  Y+DD+++ S    +H + +++   +++  N+ +N  KC 
Sbjct: 543  AHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCE 602

Query: 992  FGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEVQRLTGRIAALSRFLPKSGDR 1051
            F     KF+G+ I+ +G     E    + Q K P N KE+++  G +  L +F+PK+   
Sbjct: 603  FHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQL 662

Query: 1052 SFPFFKCLRKNVAFEWTAECEEAFVRLKELLSSPPILSKPIQGHPLHLYFAVSDGALSSV 1111
            + P    L+K+V ++WT    +A   +K+ L SPP+L        + L    SD A+ +V
Sbjct: 663  THPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAV 722

Query: 1112 MLQEIDGE-HRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRLRPYFQSF--PVKVRTD 1168
            + Q+ D + +  V + S  +  A++ Y   +K  LA++ + +  R Y +S   P K+ TD
Sbjct: 723  LSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTD 782

Query: 1169 ---LPLRQVLQKPDLSGRLVAWSVELSEYSLQYDKRGAVGAQSLAD 1211
               L  R   +    + RL  W + L +++ + + R    A  +AD
Sbjct: 783  HRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPG-SANHIAD 827


>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
            type 3
          Length = 1333

 Score =  172 bits (435), Expect = 7e-42
 Identities = 115/406 (28%), Positives = 202/406 (49%), Gaps = 8/406 (1%)

Query: 812  KGKAVQQEVDKLLAAEFIREVKYPTWLANVVIVKKANGKWRMCVDYTDLNKACPKDSYPL 871
            K +A+  E+++ L +  IRE K       V+ V K  G  RM VDY  LNK    + YPL
Sbjct: 424  KMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPL 482

Query: 872  PSIDSLVDGASGNELLSLMDAYSGNHQIRMHPADEDKTAFMTARANYCYRTMPFGLKNAG 931
            P I+ L+    G+ + + +D  S  H IR+   DE K AF   R  + Y  MP+G+  A 
Sbjct: 483  PLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAP 542

Query: 932  ATYQRLMDRVFAGQVGRNMEVYVDDMIVKSVRGLDHHQDLEEAFGEIRKHNMRLNPEKCY 991
            A +Q  ++ +       ++  Y+D++++ S    +H + +++   +++  N+ +N  KC 
Sbjct: 543  AHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCE 602

Query: 992  FGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEVQRLTGRIAALSRFLPKSGDR 1051
            F     KF+G+ I+ +G     E    + Q K P N KE+++  G +  L +F+PK+   
Sbjct: 603  FHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQL 662

Query: 1052 SFPFFKCLRKNVAFEWTAECEEAFVRLKELLSSPPILSKPIQGHPLHLYFAVSDGALSSV 1111
            + P    L+K+V ++WT    +A   +K+ L SPP+L        + L    SD A+ +V
Sbjct: 663  THPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAV 722

Query: 1112 MLQEIDGE-HRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRLRPYFQSF--PVKVRTD 1168
            + Q+ D + +  V + S  +  A++ Y   +K  LA++ + +  R Y +S   P K+ TD
Sbjct: 723  LSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTD 782

Query: 1169 ---LPLRQVLQKPDLSGRLVAWSVELSEYSLQYDKRGAVGAQSLAD 1211
               L  R   +    + RL  W + L +++ + + R    A  +AD
Sbjct: 783  HRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPG-SANHIAD 827


>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
            type 2
          Length = 1333

 Score =  172 bits (435), Expect = 7e-42
 Identities = 115/406 (28%), Positives = 202/406 (49%), Gaps = 8/406 (1%)

Query: 812  KGKAVQQEVDKLLAAEFIREVKYPTWLANVVIVKKANGKWRMCVDYTDLNKACPKDSYPL 871
            K +A+  E+++ L +  IRE K       V+ V K  G  RM VDY  LNK    + YPL
Sbjct: 424  KMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPL 482

Query: 872  PSIDSLVDGASGNELLSLMDAYSGNHQIRMHPADEDKTAFMTARANYCYRTMPFGLKNAG 931
            P I+ L+    G+ + + +D  S  H IR+   DE K AF   R  + Y  MP+G+  A 
Sbjct: 483  PLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAP 542

Query: 932  ATYQRLMDRVFAGQVGRNMEVYVDDMIVKSVRGLDHHQDLEEAFGEIRKHNMRLNPEKCY 991
            A +Q  ++ +       ++  Y+D++++ S    +H + +++   +++  N+ +N  KC 
Sbjct: 543  AHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCE 602

Query: 992  FGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEVQRLTGRIAALSRFLPKSGDR 1051
            F     KF+G+ I+ +G     E    + Q K P N KE+++  G +  L +F+PK+   
Sbjct: 603  FHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQL 662

Query: 1052 SFPFFKCLRKNVAFEWTAECEEAFVRLKELLSSPPILSKPIQGHPLHLYFAVSDGALSSV 1111
            + P    L+K+V ++WT    +A   +K+ L SPP+L        + L    SD A+ +V
Sbjct: 663  THPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAV 722

Query: 1112 MLQEIDGE-HRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRLRPYFQSF--PVKVRTD 1168
            + Q+ D + +  V + S  +  A++ Y   +K  LA++ + +  R Y +S   P K+ TD
Sbjct: 723  LSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTD 782

Query: 1169 ---LPLRQVLQKPDLSGRLVAWSVELSEYSLQYDKRGAVGAQSLAD 1211
               L  R   +    + RL  W + L +++ + + R    A  +AD
Sbjct: 783  HRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPG-SANHIAD 827


>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
            transposon gypsy [Contains: Reverse transcriptase (EC
            2.7.7.49); Endonuclease]
          Length = 1035

 Score =  149 bits (377), Expect = 4e-35
 Identities = 110/399 (27%), Positives = 188/399 (46%), Gaps = 23/399 (5%)

Query: 816  VQQEVDKLLAAEFIREVKYPTWLANVVIVKKA-----NGKWRMCVDYTDLNKACPKDSYP 870
            V  EV +LL    IR  + P      V+ KK      N   R+ +D+  LN+    D YP
Sbjct: 197  VNNEVKQLLKDGIIRPSRSPYNSPTWVVDKKGTDAFGNPNKRLVIDFRKLNEKTIPDRYP 256

Query: 871  LPSIDSLVDGASGNELLSLMDAYSGNHQIRMHPADEDKTAFMTARANYCYRTMPFGLKNA 930
            +PSI  ++      +  + +D  SG HQI +   D +KT+F      Y +  +PFGL+NA
Sbjct: 257  MPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFSVNGGKYEFCRLPFGLRNA 316

Query: 931  GATYQRLMDRVFAGQVGRNMEVYVDDMIVKSVRGLDHHQDLEEAFGEIRKHNMRLNPEKC 990
             + +QR +D V   Q+G+   VYVDD+I+ S    DH + ++     +   NMR++ EK 
Sbjct: 317  SSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHIDTVLKCLIDANMRVSQEKT 376

Query: 991  YFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEVQRLTGRIAALSRFLPKSGD 1050
             F  +  ++LGF+++  G + +PEK KAIQ+   P  V +V+   G  +    F+     
Sbjct: 377  RFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRSFLGLASYYRVFIKDFAA 436

Query: 1051 RSFPFFKCLR-----------KNVAFEWTAECEEAFVRLKELLSSPPILSK-PIQGHPLH 1098
             + P    L+           K +  E+      AF RL+ +L+S  ++ K P    P  
Sbjct: 437  IARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNILASEDVILKYPDFKKPFD 496

Query: 1099 LYFAVSDGALSSVMLQEIDGEHRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRLRPY- 1157
            L    S   + +V+ Q    E R +  +S TL+  E  Y   E+  LA++    +L+ + 
Sbjct: 497  LTTDASASGIGAVLSQ----EGRPITMISRTLKQPEQNYATNERELLAIVWALGKLQNFL 552

Query: 1158 FQSFPVKVRTD-LPLRQVLQKPDLSGRLVAWSVELSEYS 1195
            + S  + + TD  PL   +   + + ++  W   + +++
Sbjct: 553  YGSREINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQHN 591


>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 679

 Score =  132 bits (333), Expect = 5e-30
 Identities = 125/484 (25%), Positives = 209/484 (42%), Gaps = 28/484 (5%)

Query: 734  PIEETKALKFGDRTLKIGTRLTEEQETRLTKLLGENLDLFAWSCKDMPGIDPN----FIC 789
            P+EE   L  G R  +    +T+++  ++ +LL +        C + P +DPN    ++ 
Sbjct: 184  PLEEIAILSEGRRLSEEKLFITQQRMQKIEELLEK-------VCSENP-LDPNKTKQWMK 235

Query: 790  HRLALNPSLKPVS*LRRRLGGDKGKAVQQEVDKLLAAEFIREVKYPTWLANVVI---VKK 846
              + L+   K +     +      +   +++ +LL  + I+  K P      ++    +K
Sbjct: 236  ASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEK 295

Query: 847  ANGKWRMCVDYTDLNKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGNHQIRMHPADE 906
              GK RM V+Y  +NKA   D+Y LP+ D L+    G ++ S  D  SG  Q+ +     
Sbjct: 296  RRGKKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESR 355

Query: 907  DKTAFMTARANYCYRTMPFGLKNAGATYQRLMDRVFAGQVGRNM-EVYVDDMIVKSVRGL 965
              TAF   + +Y +  +PFGLK A + +QR MD  F  +V R    VYVDD++V S    
Sbjct: 356  PLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF--RVFRKFCCVYVDDILVFSNNEE 413

Query: 966  DHHQDLEEAFGEIRKHNMRLNPEKCYFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSP 1025
            DH   +     +  +H + L+ +K     +   FLG  I     +      + I +    
Sbjct: 414  DHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDT 473

Query: 1026 -SNVKEVQRLTGRIAALSRFLPKSGDRSFPFFKCLRKNVAFEWTAECEEAFVRLKELLSS 1084
              + K++QR  G +   S ++PK      P    L++NV + WT E      ++K+ L  
Sbjct: 474  LEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWRWTKEDTLYMQKVKKNLQG 533

Query: 1085 PPILSKPIQGHPLHLYFAVSD----GALSSVMLQEIDGEHRIVYFVSHTLQGAEVRYQKI 1140
             P L  P+    L +    SD    G L ++ + E      I  + S + + AE  Y   
Sbjct: 534  FPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKNYHSN 593

Query: 1141 EKAALAVLVTARRLRPYFQSFPVKVRTD----LPLRQVLQKPDLS-GRLVAWSVELSEYS 1195
            +K  LAV+ T ++   Y       +RTD         +  K D   GR + W   LS YS
Sbjct: 594  DKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYS 653

Query: 1196 LQYD 1199
               +
Sbjct: 654  FDVE 657


>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 679

 Score =  132 bits (332), Expect = 6e-30
 Identities = 125/484 (25%), Positives = 210/484 (42%), Gaps = 28/484 (5%)

Query: 734  PIEETKALKFGDRTLKIGTRLTEEQETRLTKLLGENLDLFAWSCKDMPGIDPN----FIC 789
            P+EE   L  G R  +    +T+++  ++ +LL +        C + P +DPN    ++ 
Sbjct: 184  PLEEIAILSEGRRLSEEKLFITQQRMQKIEELLEK-------VCSENP-LDPNKTKQWMK 235

Query: 790  HRLALNPSLKPVS*LRRRLGGDKGKAVQQEVDKLLAAEFIREVKYPTWLANVVI---VKK 846
              + L+   K +     +      +   +++ +LL  + I+  K P      ++    +K
Sbjct: 236  ASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEK 295

Query: 847  ANGKWRMCVDYTDLNKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGNHQIRMHPADE 906
              GK RM V+Y  +NKA   D+Y LP+ D L+    G ++ S  D  SG  Q+ +     
Sbjct: 296  RRGKKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESR 355

Query: 907  DKTAFMTARANYCYRTMPFGLKNAGATYQRLMDRVFAGQVGRNM-EVYVDDMIVKSVRGL 965
              TAF   + +Y +  +PFGLK A + +QR MD  F  +V R    VYVDD++V S    
Sbjct: 356  PLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF--RVFRKFCCVYVDDILVFSNNEE 413

Query: 966  DHHQDLEEAFGEIRKHNMRLNPEKCYFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSP 1025
            DH   +     +  +H + L+ +K     +   FLG  I     +      + I +    
Sbjct: 414  DHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDT 473

Query: 1026 -SNVKEVQRLTGRIAALSRFLPKSGDRSFPFFKCLRKNVAFEWTAECEEAFVRLKELLSS 1084
              + K++QR  G +   S ++PK      P    L++NV ++WT E      ++K+ L  
Sbjct: 474  LEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQG 533

Query: 1085 PPILSKPIQGHPLHLYFAVSD----GALSSVMLQEIDGEHRIVYFVSHTLQGAEVRYQKI 1140
             P L  P+    L +    SD    G L ++ + E      I  + S + + AE  Y   
Sbjct: 534  FPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNYHSN 593

Query: 1141 EKAALAVLVTARRLRPYFQSFPVKVRTD----LPLRQVLQKPDLS-GRLVAWSVELSEYS 1195
            +K  LAV+ T ++   Y       +RTD         +  K D   GR + W   LS YS
Sbjct: 594  DKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYS 653

Query: 1196 LQYD 1199
               +
Sbjct: 654  FDVE 657


>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 679

 Score =  130 bits (328), Expect = 2e-29
 Identities = 123/470 (26%), Positives = 204/470 (43%), Gaps = 21/470 (4%)

Query: 748  LKIGTRLTEEQETRLTKLLGENLDLFAWSCKDMPGIDPN----FICHRLALNPSLKPVS* 803
            L  G RL+EE+     + + +  +L    C + P +DPN    ++   + L+   K +  
Sbjct: 191  LSEGRRLSEEKLFITQQRMQKIEELLEKVCSENP-LDPNKTKQWMKASIKLSDPSKAIKV 249

Query: 804  LRRRLGGDKGKAVQQEVDKLLAAEFIREVKYPTWLANVVI---VKKANGKWRMCVDYTDL 860
               +      +   +++ +LL  + I+  K P      ++    +K  GK RM V+Y  +
Sbjct: 250  KPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAM 309

Query: 861  NKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGNHQIRMHPADEDKTAFMTARANYCY 920
            NKA   D+Y LP+ D L+    G ++ S  D  SG  Q+ +       TAF   + +Y +
Sbjct: 310  NKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEW 369

Query: 921  RTMPFGLKNAGATYQRLMDRVFAGQVGRNM-EVYVDDMIVKSVRGLDHHQDLEEAFGEIR 979
              +PFGLK A + +QR MD  F  +V R    VYVDD++V S    DH   +     +  
Sbjct: 370  NVVPFGLKQAPSIFQRHMDEAF--RVFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCN 427

Query: 980  KHNMRLNPEKCYFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSP-SNVKEVQRLTGRI 1038
            +H + L+ +K     +   FLG  I     +      + I +      + K++QR  G +
Sbjct: 428  QHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGIL 487

Query: 1039 AALSRFLPKSGDRSFPFFKCLRKNVAFEWTAECEEAFVRLKELLSSPPILSKPIQGHPLH 1098
               S ++PK      P    L++NV ++WT E      ++K+ L   P L  P+    L 
Sbjct: 488  TYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLI 547

Query: 1099 LYFAVSD----GALSSVMLQEIDGEHRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRL 1154
            +    SD    G L ++ + E      I  + S + + AE  Y   +K  LAV+ T ++ 
Sbjct: 548  IETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKF 607

Query: 1155 RPYFQSFPVKVRTD----LPLRQVLQKPDLS-GRLVAWSVELSEYSLQYD 1199
              Y       +RTD         +  K D   GR + W   LS YS   +
Sbjct: 608  SIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVE 657


>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 674

 Score =  129 bits (324), Expect = 5e-29
 Identities = 127/499 (25%), Positives = 212/499 (42%), Gaps = 26/499 (5%)

Query: 719  ENLDPRGEGRVNRPTPIEETKALKFGDRTLKIGTRLTEEQETRLTKLLGENLDLFAWSCK 778
            E++  R + +   P  I   K        L  G RL+EE+     + + +  +L    C 
Sbjct: 162  ESMKKRSKTQQPEPVNISTNKIA-----ILSEGRRLSEEKLFITQQRMQKIEELLEKVCS 216

Query: 779  DMPGIDPN----FICHRLALNPSLKPVS*LRRRLGGDKGKAVQQEVDKLLAAEFIREVKY 834
            + P +DPN    ++   + L+   K +     +      +   +++ +LL  + I+  K 
Sbjct: 217  ENP-LDPNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKS 275

Query: 835  PTWLANVVI---VKKANGKWRMCVDYTDLNKACPKDSYPLPSIDSLVDGASGNELLSLMD 891
            P      ++    +K  GK RM V+Y  +NKA   D+Y  P+ D L+    G ++ S  D
Sbjct: 276  PHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNPPNKDELLTLIRGKKIFSSFD 335

Query: 892  AYSGNHQIRMHPADEDKTAFMTARANYCYRTMPFGLKNAGATYQRLMDRVFAGQVGRNM- 950
              SG  Q+ +       TAF   + +Y +  +PFGLK A + +QR MD  F  +V R   
Sbjct: 336  CKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF--RVFRKFC 393

Query: 951  EVYVDDMIVKSVRGLDHHQDLEEAFGEIRKHNMRLNPEKCYFGVQGGKFLGFMITSRGIE 1010
             VYVDD++V S    DH   +     +  +H + L+ +K     +   FLG  I     +
Sbjct: 394  CVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHK 453

Query: 1011 INPEKCKAIQQMKSP-SNVKEVQRLTGRIAALSRFLPKSGDRSFPFFKCLRKNVAFEWTA 1069
                  + I +      + K++QR  G +   S ++PK      P    L++NV ++WT 
Sbjct: 454  PQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTK 513

Query: 1070 ECEEAFVRLKELLSSPPILSKPIQGHPLHLYFAVSD----GALSSVMLQEIDGEHRIVYF 1125
            E      ++K+ L   P L  P+    L +    SD    G L ++ + E      I  +
Sbjct: 514  EDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRY 573

Query: 1126 VSHTLQGAEVRYQKIEKAALAVLVTARRLRPYFQSFPVKVRTD----LPLRQVLQKPDLS 1181
             S + + AE  Y   +K  LAV+ T ++   Y       +RTD         +  K D  
Sbjct: 574  ASGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSK 633

Query: 1182 -GRLVAWSVELSEYSLQYD 1199
             GR + W   LS YS   +
Sbjct: 634  LGRNIRWQAWLSHYSFDVE 652


>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 680

 Score =  127 bits (319), Expect = 2e-28
 Identities = 121/470 (25%), Positives = 202/470 (42%), Gaps = 21/470 (4%)

Query: 748  LKIGTRLTEEQETRLTKLLGENLDLFAWSCKDMPGIDPN----FICHRLALNPSLKPVS* 803
            L  G RL+EE+     + + +  +L    C + P +DPN    ++   + L+   K +  
Sbjct: 192  LSEGRRLSEEKLFITQQRMQKTEELLEKVCSENP-LDPNKTKQWMKASIKLSDPSKAIKV 250

Query: 804  LRRRLGGDKGKAVQQEVDKLLAAEFIREVKYPTWLANVVIVKKAN---GKWRMCVDYTDL 860
               +      +   +++ +LL  + I+  K P      ++  +A    G  RM V+Y  +
Sbjct: 251  KPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAENGRGNKRMVVNYKAM 310

Query: 861  NKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGNHQIRMHPADEDKTAFMTARANYCY 920
            NKA   D+Y LP+ D L+    G ++ S  D  SG  Q+ +       TAF   + +Y +
Sbjct: 311  NKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEW 370

Query: 921  RTMPFGLKNAGATYQRLMDRVFAGQVGRNM-EVYVDDMIVKSVRGLDHHQDLEEAFGEIR 979
              +PFGLK A + +QR MD  F  +V R    VYVDD++V S    DH   +     +  
Sbjct: 371  NVVPFGLKQAPSIFQRHMDEAF--RVFRKFCCVYVDDIVVFSNNEEDHLLHVAMILQKCN 428

Query: 980  KHNMRLNPEKCYFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSP-SNVKEVQRLTGRI 1038
            +H + L+ +K     +   FLG  I     +      + I +      + K++QR  G +
Sbjct: 429  QHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGIL 488

Query: 1039 AALSRFLPKSGDRSFPFFKCLRKNVAFEWTAECEEAFVRLKELLSSPPILSKPIQGHPLH 1098
               S ++P       P    L++NV ++WT E      ++K+ L   P L  P+    L 
Sbjct: 489  TYASDYIPNLAQMRQPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLI 548

Query: 1099 LYFAVSD----GALSSVMLQEIDGEHRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRL 1154
            +    SD    G L ++ + E      I  + S + + AE  Y   +K  LAV+ T ++ 
Sbjct: 549  IETDASDDYWGGMLKAIKINEGTNTELICRYRSGSFKAAERNYHSNDKETLAVINTIKKF 608

Query: 1155 RPYFQSFPVKVRTD----LPLRQVLQKPDLS-GRLVAWSVELSEYSLQYD 1199
              Y       +RTD         +  K D   GR + W   LS YS   +
Sbjct: 609  SIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVE 658


>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 666

 Score =  122 bits (307), Expect = 5e-27
 Identities = 108/377 (28%), Positives = 168/377 (43%), Gaps = 15/377 (3%)

Query: 845  KKANGKWRMCVDYTDLNKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGNHQIRMHPA 904
            ++  GK RM V+Y  +N+A   DS+ LP++  L+    G  + S  D  SG  Q+ +   
Sbjct: 287  ERRRGKKRMVVNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKSGFWQVVLDEE 346

Query: 905  DEDKTAFMTARANYCYRTMPFGLKNAGATYQRLMDRVFAGQVGRNMEVYVDDMIVKSVRG 964
             +  TAF   + ++ ++ +PFGLK A + +QR M     G   +   VYVDD+IV S   
Sbjct: 347  SQKLTAFTCPQGHFQWKVVPFGLKQAPSIFQRHMQTALNG-ADKFCMVYVDDIIVFSNSE 405

Query: 965  LDHHQDLEEAFGEIRKHNMRLNPEKCYFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKS 1024
            LDH+  +      + K+ + L+ +K     +   FLG  I  +G    P+        K 
Sbjct: 406  LDHYNHVYAVLKIVEKYGIILSKKKANLFKEKINFLGLEI-DKGTHC-PQNHILENIHKF 463

Query: 1025 PSNV---KEVQRLTGRIAALSRFLPKSGDRSFPFFKCLRKNVAFEWTAECEEAFVRLKEL 1081
            P  +   K +QR  G +     ++PK  +   P    L+K+V + WT    +   ++K+ 
Sbjct: 464  PDRLEDKKHLQRFLGVLTYAETYIPKLAEIRKPLQVKLKKDVTWNWTQSDSDYVKKIKKN 523

Query: 1082 LSSPPILSKPIQGHPLHLYFAVSDGALSSVM-LQEIDGEHRIVYFVSHTLQGAEVRYQKI 1140
            L S P L  P     L +    SD     V+  + +DG   I  + S + + AE  Y   
Sbjct: 524  LGSFPKLYLPKPEDHLIIETDASDSFWGGVLKARALDGVELICRYSSGSFKQAEKNYHSN 583

Query: 1141 EKAALAVLVTARRLRPYFQSFPVKVRTDLP-----LRQVLQKPDLSGRLVAWSVELSEYS 1195
            +K  LAV     +   Y       VRTD       LR  L+     GRLV W    S+Y 
Sbjct: 584  DKELLAVKQVITKFSAYLTPVRFTVRTDNKNFTYFLRINLKGDSKQGRLVRWQNWFSKY- 642

Query: 1196 LQYDKRGAVGAQS-LAD 1211
             Q+D     G ++ LAD
Sbjct: 643  -QFDVEHLEGVKNVLAD 658


>YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II
          Length = 1268

 Score =  114 bits (285), Expect = 2e-24
 Identities = 73/237 (30%), Positives = 125/237 (51%), Gaps = 11/237 (4%)

Query: 814  KAVQQEVDKLLAAEFIREVKYPTWLANVVIVKK-ANGKWRMCVDY--TDLNKACPKDSYP 870
            +AV+ E+++L     I  + Y  W A +V++KK   GK R+C D+  + LN A   + +P
Sbjct: 454  EAVETELNRLQEMGVIVPITYAKWAAPIVVIKKKGTGKIRVCADFKCSGLNAALKDEFHP 513

Query: 871  LPSIDSLVDGASGN--ELLSLMDAYSGNHQIRMHPADEDKTAFMTARANYCYRTMPFGLK 928
            LP+ + +     G     + L DAY    Q+ +    +      T R  + Y  M FGLK
Sbjct: 514  LPTSEDIFSRLKGTVYSQIDLKDAYL---QVELDEEAQKLAVINTHRGIFKYLRMTFGLK 570

Query: 929  NAGATYQRLMDRVFAGQVGRNMEVYVDDMIVKSVRGLDHHQDLEEAFGEIRKHNMRLNPE 988
             A A++Q++MD++ +G  G  + VY DD+I+ +    +H + L E F   +++  R++ E
Sbjct: 571  PAPASFQKIMDKMVSGLTG--VAVYWDDIIISASSIEEHEKILRELFERFKEYGFRVSAE 628

Query: 989  KCYFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEVQRLTGRIAALSRFL 1045
            KC F  +   FLGF +   G   + +K +AI+ MK+P++ K++    G    LSR +
Sbjct: 629  KCAFAQKQVTFLGF-VDEHGRRPDSKKTEAIRSMKAPTDQKQLASFLGAADWLSRMM 684


>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 659

 Score =  112 bits (279), Expect = 9e-24
 Identities = 132/545 (24%), Positives = 228/545 (41%), Gaps = 46/545 (8%)

Query: 702  KKSALVGH--RCYEIEASDENLDPRGEGRVNRPTPIEET--KALKFGDRTLKIGTRLTEE 757
            K+S ++G   + Y+          + + +VNRP PI  T  + L   +    +   L E 
Sbjct: 127  KQSVIIGKITKAYQYGVKGFLESMKKKSKVNRPEPINITSNQHLFLEEGGNHVDEMLYEI 186

Query: 758  Q-------ETRLTKLLGEN-LD---LFAWSCKDMPGIDPNFICHRLALNPSLKPVS*LRR 806
            Q       E  L ++  EN +D      W    +  IDP  +         +KP+S    
Sbjct: 187  QISKFSAIEEMLERVSSENPIDPEKSKQWMTATIELIDPKTVV-------KVKPMS---- 235

Query: 807  RLGGDKGKAVQQEVDKLLAAEFIREVKYPTWLANVVIVK----KANGKWRMCVDYTDLNK 862
                   +   +++ +LL  + I+  K  T ++   +V+    +  GK RM V+Y  +NK
Sbjct: 236  -YSPSDREEFDRQIKELLELKVIKPSK-STHMSPAFLVENEAERRRGKKRMVVNYKAMNK 293

Query: 863  ACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGNHQIRMHPADEDKTAFMTARANYCYRT 922
            A   D++ LP+ D L+    G ++ S  D  SG  Q+ +    +  TAF   + +Y +  
Sbjct: 294  ATKGDAHNLPNKDELLTLVRGKKIYSSFDCKSGLWQVLLDKESQLLTAFTCPQGHYQWNV 353

Query: 923  MPFGLKNAGATYQRLMDRVFAGQVGRNMEVYVDDMIVKSVRG-LDHHQDLEEAFGEIRKH 981
            +PFGLK A + + +      + Q  +   VYVDD++V S  G  +H+  +        K 
Sbjct: 354  VPFGLKQAPSIFPKTYANSHSNQYSKYCCVYVDDILVFSNTGRKEHYIHVLNILRRCEKL 413

Query: 982  NMRLNPEKCYFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNV---KEVQRLTGRI 1038
             + L+ +K     +   FLG  I  +G    P+        K P  +   K++QR  G +
Sbjct: 414  GIILSKKKAQLFKEKINFLGLEI-DQGTHC-PQNHILEHIHKFPDRIEDKKQLQRFLGIL 471

Query: 1039 AALSRFLPKSGDRSFPFFKCLRKNVAFEWTAECEEAFVRLKELLSSPPILSKPIQGHPLH 1098
               S ++PK      P    L+++  + W     +   ++K+ L S P L  P     L 
Sbjct: 472  TYASDYIPKLASIRKPLQSKLKEDSTWTWNDTDSQYMAKIKKNLKSFPKLYHPEPNDKLV 531

Query: 1099 LYFAVSDGALSSVMLQEIDGEHRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRLRPYF 1158
            +    S+     ++    +    I  + S + + AE  Y   EK  LAV+   ++   Y 
Sbjct: 532  IETDASEEFWGGILKAIHNSHEYICRYASGSFKAAERNYHSNEKELLAVIRVIKKFSIYL 591

Query: 1159 QSFPVKVRTDLP-----LRQVLQKPDLSGRLVAWSVELSEYSLQYDKRGAVGAQSL-ADF 1212
                  +RTD       +   L+     GRLV W + LS+Y   +D     G +++ ADF
Sbjct: 592  TPSRFLIRTDNKNFTHFVNINLKGDRKQGRLVRWQMWLSQY--DFDVEHIAGTKNVFADF 649

Query: 1213 VVELT 1217
            + E T
Sbjct: 650  LQENT 654


>POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;
            Protease (EC 3.4.23.-); Reverse transcriptase (EC
            2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
          Length = 1886

 Score = 94.7 bits (234), Expect = 1e-18
 Identities = 69/235 (29%), Positives = 111/235 (46%), Gaps = 11/235 (4%)

Query: 789  CHRLALNPSLKPVS*LRRRLGGDKGKAVQQEVDKLLAAEFIR--EVKYPTWLANV----- 841
            C    +NP +K +    + +     +A+ ++++ LL  + IR  E K+ +    V     
Sbjct: 1391 CKLNIINPDIKIMGRPIKHVTPGDEEAMTRQINLLLQMKVIRPSESKHRSTAFIVRSGTE 1450

Query: 842  ---VIVKKANGKWRMCVDYTDLNKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGNHQ 898
               +  K+  GK RM  +Y  LN+    D Y LP I++++     +++ S  D  SG  Q
Sbjct: 1451 IDPITGKEKKGKERMVFNYKLLNENTESDQYSLPGINTIISKVGRSKIYSKFDLKSGFWQ 1510

Query: 899  IRMHPADEDKTAFMTARANYCYRTMPFGLKNAGATYQRLMDRVFAGQVGRNMEVYVDDMI 958
            + M       TAF+     Y +  MPFGLKNA A +QR MD VF G   + + VY+DD++
Sbjct: 1511 VAMEEESVPWTAFLAGNKLYEWLVMPFGLKNAPAIFQRKMDNVFKG-TEKFIAVYIDDIL 1569

Query: 959  VKSVRGLDHHQDLEEAFGEIRKHNMRLNPEKCYFGVQGGKFLGFMITSRGIEINP 1013
            V S     H Q L       +++ + L+P K   G     FLG  +    I++ P
Sbjct: 1570 VFSETAEQHSQHLYTMLQLCKENGLILSPTKMKIGTPEIDFLGASLGCTKIKLQP 1624


>POL_SOCMV (P15629) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 692

 Score = 86.7 bits (213), Expect = 4e-16
 Identities = 68/230 (29%), Positives = 115/230 (49%), Gaps = 10/230 (4%)

Query: 817  QQEVDKLLAAEFIREVKYPTWLANVVIVKKAN----GKWRMCVDYTDLNKACPKDSYPLP 872
            ++E + LL    IRE + P   A    V+  N    GK RM ++Y  +N+A   DSY LP
Sbjct: 220  KEECEDLLKKGLIRESQSPH-SAPAFYVENHNEIKRGKRRMVINYKKMNEATIGDSYKLP 278

Query: 873  SIDSLVDGASGNELLSLMDAYSGNHQIRMHPADEDKTAF-MTARANYCYRTMPFGLKNAG 931
              D +++   G+   S +DA SG +Q+R+H   +  TAF    + +Y +  + FGLK A 
Sbjct: 279  RKDFILEKIKGSLWFSSLDAKSGYYQLRLHENTKPLTAFSCPPQKHYEWNVLSFGLKQAP 338

Query: 932  ATYQRLMDRVFAGQVGRNMEVYVDDMIVKSVRGLDHH-QDLEEAFGEIRKHNMRLNPEKC 990
            + YQR MD+   G +      Y+DD+++ +    + H  D+      I++  + ++ +K 
Sbjct: 339  SIYQRFMDQSLKG-LEHICLAYIDDILIFTKGSKEQHVNDVRIVLQRIKEKGIIISKKKS 397

Query: 991  YFGVQGGKFLGFMITSRG-IEINPEKCKAIQQMKSP-SNVKEVQRLTGRI 1038
                Q  ++LG  I   G I+++P   + I Q      + K++QR  G I
Sbjct: 398  KLIQQEIEYLGLKIQGNGEIDLSPHTQEKILQFPDELEDRKQIQRFLGCI 447


>RRPO_OENBE (P31843) RNA-directed DNA polymerase homolog (Reverse
           transcriptase homolog)
          Length = 142

 Score = 85.5 bits (210), Expect = 9e-16
 Identities = 43/121 (35%), Positives = 68/121 (55%)

Query: 852 RMCVDYTDLNKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGNHQIRMHPADEDKTAF 911
           RMC+DY  L K   K+ YP+P +D L D  +     + +D  SG  Q+R+   DE KT  
Sbjct: 7   RMCIDYRALTKVTIKNKYPIPRVDDLFDRLAQATWFTKLDLRSGYWQVRIAKGDEPKTTC 66

Query: 912 MTARANYCYRTMPFGLKNAGATYQRLMDRVFAGQVGRNMEVYVDDMIVKSVRGLDHHQDL 971
           +T   ++ +R MPFGL NA AT+  LM+ V    +   + VY+DD++V ++     H+ +
Sbjct: 67  VTRYGSFEFRVMPFGLTNALATFCNLMNNVLYEYLDHFVVVYLDDLVVYTIYSNSLHEHI 126

Query: 972 E 972
           +
Sbjct: 127 K 127


  Database: sprot
    Posted date:  Nov 25, 2004 10:54 AM
  Number of letters in database: 59,974,054
  Number of sequences in database:  164,201
  
Lambda     K      H
   0.318    0.135    0.397 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 156,485,843
Number of Sequences: 164201
Number of extensions: 7202799
Number of successful extensions: 49959
Number of sequences better than 10.0: 703
Number of HSP's better than 10.0 without gapping: 276
Number of HSP's successfully gapped in prelim test: 460
Number of HSP's that attempted gapping in prelim test: 37798
Number of HSP's gapped (non-prelim): 5047
length of query: 1291
length of database: 59,974,054
effective HSP length: 122
effective length of query: 1169
effective length of database: 39,941,532
effective search space: 46691650908
effective search space used: 46691650908
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 72 (32.3 bits)


Lotus: description of TM0331c.2