Lotus
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= TM0101.14
         (1733 letters)

Database: sprot 
           164,201 sequences; 59,974,054 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro...   332  4e-90
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro...   332  7e-90
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro...   330  2e-89
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro...   327  2e-88
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro...   326  3e-88
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot...   322  4e-87
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot...   317  1e-85
POL_SOCMV (P15629) Enzymatic polyprotein [Contains: Aspartic pro...   254  1e-66
POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;...   238  1e-61
POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat pr...   228  9e-59
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei...   226  6e-58
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei...   223  5e-57
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei...   220  2e-56
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran...   218  9e-56
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran...   215  1e-54
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III    185  1e-45
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran...   179  6e-44
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran...   176  7e-43
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran...   164  3e-39
POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23...   102  1e-20

>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 679

 Score =  332 bits (852), Expect = 4e-90
 Identities = 223/653 (34%), Positives = 337/653 (51%), Gaps = 55/653 (8%)

Query: 1120 LETPALFDTGANSSCISEGLIPTRYFEKTTEKLSA--AEGSKLIIK-------------- 1163
            +E     DTGA+    S+ +IP  ++      +    A+GS + I               
Sbjct: 38   IELHCFVDTGASLCIASKFVIPEEHWVNAERPIMVKIADGSSITISKVCKDIDLIIAREI 97

Query: 1164 YKIPSA---------IIKNDSLEIETPFLLVRNLTHKVIIGTPFIKKLFPYNTDEKGITV 1214
            +KIP+          II N+  ++  PF+     T +VI       K +P +  +    V
Sbjct: 98   FKIPTVYQQESGIDFIIGNNFCQLYEPFI---QFTDRVIFTK---NKSYPVHIAKLTRAV 151

Query: 1215 QHLGKPILFKFSK-------PPFSKTLNIISYKEKQINFLKE--EISHKSIEVQLQQPSV 1265
            +   +  L    K        P + + N I    K+I  L E   +S + + +  Q+   
Sbjct: 152  RVGTEGFLESMKKRSKTQQPEPVNISTNKIENPLKEIAILSEGRRLSEEKLFITQQRMQ- 210

Query: 1266 KTRIGNILENIQSSICSDLPNAFWERKSHMVELPYEKDFSDKQIPTKARPIQMNEELLQF 1325
              +I  +LE + S    D PN   +     ++L      SD     K +P++ +    + 
Sbjct: 211  --KIEELLEKVCSENPLD-PNKTKQWMKASIKL------SDPSKAIKVKPMKYSPMDREE 261

Query: 1326 CQKEINDLLQKKLIRRSKSPWSCATFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIP 1385
              K+I +LL  K+I+ SKSP     F VN +AE  RG  R+V+NYK +N+A     Y +P
Sbjct: 262  FDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATIGDAYNLP 321

Query: 1386 NKKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPS 1445
            NK +LL  +   KIFS FD KSGFWQ+ L ++ R  TAFT P G YEWNV+PFGLK APS
Sbjct: 322  NKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPS 381

Query: 1446 EFQRIMNEIFNPYSNFTIVYIDDVLIFSQSIDQHFKHLNTFISVIKKNGLAVSKTKVSLF 1505
             FQR M+E F  +  F  VY+DD+L+FS + + H  H+   +    ++G+ +SK K  LF
Sbjct: 382  IFQRHMDEAFRVFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLF 441

Query: 1506 QTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTII 1565
            + KI FLG  I +GT  P    +E  +KFPD + DK QLQRFLG L Y +D+ P+L+ I 
Sbjct: 442  KKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIR 501

Query: 1566 KLLHDRLKKDPP-PWSDVHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGIL 1624
            K L  +LK++ P  W+   T  ++++K  ++  P L+ P P+   I+ETDASD  +GG+L
Sbjct: 502  KPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGML 561

Query: 1625 K----QKIFDNEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQYDLINQTFLVRVD 1680
            K     +  + E I  + S  +  A++NY +  KE LA++ +I KF   L    FL+R D
Sbjct: 562  KAIKINEGTNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTD 621

Query: 1681 CKSAKDILQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTREY 1733
                K  +  + K  +      RWQA LS + F++E+IKG+ N   DFL+RE+
Sbjct: 622  NTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSREF 674


>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 679

 Score =  332 bits (850), Expect = 7e-90
 Identities = 220/647 (34%), Positives = 335/647 (51%), Gaps = 43/647 (6%)

Query: 1120 LETPALFDTGANSSCISEGLIPTRYFEKTTEKLSA--AEGSKLIIK-------------- 1163
            +E     DTGA+    S+ +IP  ++      +    A+GS + I               
Sbjct: 38   IELHCFVDTGASLCIASKFVIPEEHWVNAERPIMVKIADGSSITISKVCKDIDLIIAGEI 97

Query: 1164 YKIPSA---------IIKNDSLEIETPFLLVRNLTHKVIIGTPFIKKLFPYNTDEKGITV 1214
            +KIP+          II N+  ++  PF+     T +VI       K +P +  +    V
Sbjct: 98   FKIPTVYQQESGIDFIIGNNFCQLYEPFI---QFTDRVIFTK---NKSYPVHITKLTRAV 151

Query: 1215 QHLGKPILFKFSKPPFSKTLNIISYKEKQINFLKEEISHKSIEVQLQQPSV---KTRIGN 1271
            +   +  L    K   ++    ++    +I    EEI+  S   +L +  +   + R+  
Sbjct: 152  RVGIEGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSEGRRLSEEKLFITQQRMQK 211

Query: 1272 ILENIQSSICSDLPNAFWERKSHMVELPYEKDFSDKQIPTKARPIQMNEELLQFCQKEIN 1331
            I E +   +CS+ P    + K  M         SD     K +P++ +    +   K+I 
Sbjct: 212  I-EELLEKVCSENPLDPNKTKQWMKA---SIKLSDPSKAIKVKPMKYSPMDREEFDKQIK 267

Query: 1332 DLLQKKLIRRSKSPWSCATFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLL 1391
            +LL  K+I+ SKSP     F VN +AE  RG  R+V+NYK +N+A     Y +PNK +LL
Sbjct: 268  ELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELL 327

Query: 1392 ARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIM 1451
              +   KIFS FD KSGFWQ+ L ++ R  TAFT P G YEWNV+PFGLK APS FQR M
Sbjct: 328  TLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHM 387

Query: 1452 NEIFNPYSNFTIVYIDDVLIFSQSIDQHFKHLNTFISVIKKNGLAVSKTKVSLFQTKIRF 1511
            +E F  +  F  VY+DD+L+FS + + H  H+   +    ++G+ +SK K  LF+ KI F
Sbjct: 388  DEAFRVFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINF 447

Query: 1512 LGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKLLHDR 1571
            LG  I +GT  P    +E  +KFPD + DK QLQRFLG L Y +D+ P+L+ I K L  +
Sbjct: 448  LGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAK 507

Query: 1572 LKKDPP-PWSDVHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGILK----Q 1626
            LK++ P  W+   T  ++++K  ++  P L+ P P+   I+ETDASD  +GG+LK     
Sbjct: 508  LKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKIN 567

Query: 1627 KIFDNEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQYDLINQTFLVRVDCKSAKD 1686
            +  + E I  + S  +  A++NY +  KE LA++ +I KF   L    FL+R D    K 
Sbjct: 568  EGTNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKS 627

Query: 1687 ILQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTREY 1733
             +  + K  +      RWQA LS + F++E+IKG+ N   DFL+RE+
Sbjct: 628  FVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSREF 674


>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 679

 Score =  330 bits (847), Expect = 2e-89
 Identities = 219/647 (33%), Positives = 335/647 (50%), Gaps = 43/647 (6%)

Query: 1120 LETPALFDTGANSSCISEGLIPTRYFEKTTEKLSA--AEGSKLIIK-------------- 1163
            +E     DTGA+    S+ +IP  ++      +    A+GS + I               
Sbjct: 38   IELHCFVDTGASLCIASKFVIPEEHWVNAERPIMVKIADGSSITISKVCKDIDLIIAGEI 97

Query: 1164 YKIPSA---------IIKNDSLEIETPFLLVRNLTHKVIIGTPFIKKLFPYNTDEKGITV 1214
            ++IP+          II N+  ++  PF+     T +VI       K +P +  +    V
Sbjct: 98   FRIPTVYQQESGIDFIIGNNFCQLYEPFI---QFTDRVIFTK---NKSYPVHIAKLTRAV 151

Query: 1215 QHLGKPILFKFSKPPFSKTLNIISYKEKQINFLKEEISHKSIEVQLQQPSV---KTRIGN 1271
            +   +  L    K   ++    ++    +I    EEI+  S   +L +  +   + R+  
Sbjct: 152  RVGTEGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSEGRRLSEEKLFITQQRMQK 211

Query: 1272 ILENIQSSICSDLPNAFWERKSHMVELPYEKDFSDKQIPTKARPIQMNEELLQFCQKEIN 1331
            I E +   +CS+ P    + K  M         SD     K +P++ +    +   K+I 
Sbjct: 212  I-EELLEKVCSENPLDPNKTKQWMKA---SIKLSDPSKAIKVKPMKYSPMDREEFDKQIK 267

Query: 1332 DLLQKKLIRRSKSPWSCATFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLL 1391
            +LL  K+I+ SKSP     F VN +AE  RG  R+V+NYK +N+A     Y +PNK +LL
Sbjct: 268  ELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNLPNKDELL 327

Query: 1392 ARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIM 1451
              +   KIFS FD KSGFWQ+ L ++ R  TAFT P G YEWNV+PFGLK APS FQR M
Sbjct: 328  TLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHM 387

Query: 1452 NEIFNPYSNFTIVYIDDVLIFSQSIDQHFKHLNTFISVIKKNGLAVSKTKVSLFQTKIRF 1511
            +E F  +  F  VY+DD+L+FS + + H  H+   +    ++G+ +SK K  LF+ KI F
Sbjct: 388  DEAFRVFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINF 447

Query: 1512 LGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKLLHDR 1571
            LG  I +GT  P    +E  +KFPD + DK QLQRFLG L Y +D+ P+L+ I K L  +
Sbjct: 448  LGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAK 507

Query: 1572 LKKDPP-PWSDVHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGILK----Q 1626
            LK++ P  W+   T  ++++K  ++  P L+ P P+   I+ETDASD  +GG+LK     
Sbjct: 508  LKENVPWRWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKIN 567

Query: 1627 KIFDNEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQYDLINQTFLVRVDCKSAKD 1686
            +  + E I  + S  +  A++NY +  KE LA++ +I KF   L    FL+R D    K 
Sbjct: 568  EGTNTELICRYASGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKS 627

Query: 1687 ILQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTREY 1733
             +  + K  +      RWQA LS + F++E+IKG+ N   DFL+RE+
Sbjct: 628  FVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSREF 674


>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 674

 Score =  327 bits (837), Expect = 2e-88
 Identities = 219/647 (33%), Positives = 335/647 (50%), Gaps = 50/647 (7%)

Query: 1120 LETPALFDTGANSSCISEGLIPTRYFEKTTEKLSA--AEGSKLIIK-------------- 1163
            +E     DTGA+    S+ +IP  ++      +    A+GS + I               
Sbjct: 40   IELHCFVDTGASLCIASKFVIPEEHWINAERPIMVKIADGSSITINKVCRDIDLIIAGEI 99

Query: 1164 YKIPSA---------IIKNDSLEIETPFLLVRNLTHKVIIGTPFIK-KLFPYNTDEKGIT 1213
            + IP+          II N+  ++  PF+     T +VI    F K + +P +  +    
Sbjct: 100  FHIPTVYQQESGIDFIIGNNFCQLYEPFI---QFTDRVI----FTKDRTYPVHIAKLTRA 152

Query: 1214 VQHLGKPILFKFSKPPFSKTLNIISYKEKQINFLKE--EISHKSIEVQLQQPSVKTRIGN 1271
            V+   +  L    K   ++    ++    +I  L E   +S + + +  Q+     +I  
Sbjct: 153  VRVGTEGFLESMKKRSKTQQPEPVNISTNKIAILSEGRRLSEEKLFITQQRMQ---KIEE 209

Query: 1272 ILENIQSSICSDLPNAFWERKSHMVELPYEKDFSDKQIPTKARPIQMNEELLQFCQKEIN 1331
            +LE + S    D PN   +     ++L      SD     K +P++ +    +   K+I 
Sbjct: 210  LLEKVCSENPLD-PNKTKQWMKASIKL------SDPSKAIKVKPMKYSPMDREEFDKQIK 262

Query: 1332 DLLQKKLIRRSKSPWSCATFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLL 1391
            +LL  K+I+ SKSP     F VN +AE  RG  R+V+NYK +N+A     Y  PNK +LL
Sbjct: 263  ELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNPPNKDELL 322

Query: 1392 ARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIM 1451
              +   KIFS FD KSGFWQ+ L ++ R  TAFT P G YEWNV+PFGLK APS FQR M
Sbjct: 323  TLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHM 382

Query: 1452 NEIFNPYSNFTIVYIDDVLIFSQSIDQHFKHLNTFISVIKKNGLAVSKTKVSLFQTKIRF 1511
            +E F  +  F  VY+DD+L+FS + + H  H+   +    ++G+ +SK K  LF+ KI F
Sbjct: 383  DEAFRVFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINF 442

Query: 1512 LGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKLLHDR 1571
            LG  I +GT  P    +E  +KFPD + DK QLQRFLG L Y +D+ P+L+ I K L  +
Sbjct: 443  LGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAK 502

Query: 1572 LKKDPP-PWSDVHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGILK----Q 1626
            LK++ P  W+   T  ++++K  ++  P L+ P P+   I+ETDASD  +GG+LK     
Sbjct: 503  LKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKIN 562

Query: 1627 KIFDNEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQYDLINQTFLVRVDCKSAKD 1686
            +  + E I  + S  +  A++NY +  KE LA++ +I KF   L    FL+R D    K 
Sbjct: 563  EGTNTELICRYASGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKS 622

Query: 1687 ILQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTREY 1733
             +  + K  +      RWQA LS + F++E+IKG+ N   DFL+RE+
Sbjct: 623  FVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSREF 669


>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 680

 Score =  326 bits (836), Expect = 3e-88
 Identities = 220/653 (33%), Positives = 334/653 (50%), Gaps = 55/653 (8%)

Query: 1120 LETPALFDTGANSSCISEGLIPTRYFEKTTEKLSA--AEGSKLIIK-------------- 1163
            +E     DTGA+    S+ +IP  ++      +    A+GS + I               
Sbjct: 39   IELHCFVDTGASLCIASKFVIPEEHWVNAERPIMVKIADGSSITISKVCKDIDLIIVGVI 98

Query: 1164 YKIPSA---------IIKNDSLEIETPFLLVRNLTHKVIIGTPFIKKLFPYNTDEKGITV 1214
            +KIP+          II N+  ++  PF+     T +VI       K +P +  +    V
Sbjct: 99   FKIPTVYQQESGIDFIIGNNFCQLYEPFI---QFTDRVIFTK---NKSYPVHIAKLTRAV 152

Query: 1215 QHLGKPILFKFSK-------PPFSKTLNIISYKEKQINFLKE--EISHKSIEVQLQQPSV 1265
            +   +  L    K        P + + N I    ++I  L E   +S + + +  QQ   
Sbjct: 153  RVGTEGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSEGRRLSEEKLFIT-QQRMQ 211

Query: 1266 KTRIGNILENIQSSICSDLPNAFWERKSHMVELPYEKDFSDKQIPTKARPIQMNEELLQF 1325
            KT      E +   +CS+ P    + K  M         SD     K +P++ +    + 
Sbjct: 212  KT------EELLEKVCSENPLDPNKTKQWMKA---SIKLSDPSKAIKVKPMKYSPMDREE 262

Query: 1326 CQKEINDLLQKKLIRRSKSPWSCATFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIP 1385
              K+I +LL  K+I+ SKSP     F VN +AE  RG  R+V+NYK +N+A     Y +P
Sbjct: 263  FDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAENGRGNKRMVVNYKAMNKATVGDAYNLP 322

Query: 1386 NKKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPS 1445
            NK +LL  +   KIFS FD KSGFWQ+ L ++ R  TAFT P G YEWNV+PFGLK APS
Sbjct: 323  NKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPS 382

Query: 1446 EFQRIMNEIFNPYSNFTIVYIDDVLIFSQSIDQHFKHLNTFISVIKKNGLAVSKTKVSLF 1505
             FQR M+E F  +  F  VY+DD+++FS + + H  H+   +    ++G+ +SK K  LF
Sbjct: 383  IFQRHMDEAFRVFRKFCCVYVDDIVVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLF 442

Query: 1506 QTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTII 1565
            + KI FLG  I +GT  P    +E  +KFPD + DK QLQRFLG L Y +D+ P L+ + 
Sbjct: 443  KKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPNLAQMR 502

Query: 1566 KLLHDRLKKDPP-PWSDVHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGIL 1624
            + L  +LK++ P  W+   T  ++++K  ++  P L+ P P+   I+ETDASD  +GG+L
Sbjct: 503  QPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGML 562

Query: 1625 K----QKIFDNEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQYDLINQTFLVRVD 1680
            K     +  + E I  + S  +  A++NY +  KE LA++ +I KF   L    FL+R D
Sbjct: 563  KAIKINEGTNTELICRYRSGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTD 622

Query: 1681 CKSAKDILQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTREY 1733
                K  +  + K  +      RWQA LS + F++E+IKG+ N   DFL+RE+
Sbjct: 623  NTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSREF 675


>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 666

 Score =  322 bits (826), Expect = 4e-87
 Identities = 206/615 (33%), Positives = 318/615 (51%), Gaps = 12/615 (1%)

Query: 1127 DTGANSSCISEGLIPTRYFEKTTEKLSAAEGSKLIIKYK--IPSAIIKNDSLEIETPFLL 1184
            DTGA+    S  +IP   +E + + +     ++ +IK      +  +K      E P + 
Sbjct: 54   DTGASLCIASRYIIPEELWENSPKDIQVKIANQELIKITKVCKNLKVKFAGKSFEIPTVY 113

Query: 1185 VRNLTHKVIIGTPFIKKLFPYNTDEKGITVQHLGKPILFKFSKPPFSKTLNIISYKEKQI 1244
             +      +IG  F +   P+   E  I      + +L K     FS + N    +  + 
Sbjct: 114  QQETGIDFLIGNNFCRLYNPFIQWEDRIAFHLKNEMVLIKKVTKAFSVS-NPSFLENMKK 172

Query: 1245 NFLKEEISHKSIEVQLQQPSVK----TRIGNILENIQSSICSDLPNAFWERKSHMVELPY 1300
            +   E+I   +I   +  P  +    T     +E +   +CS+ P    + K  M     
Sbjct: 173  DSKTEQIPGTNISKNIINPEERYFLITEKYQKIEQLLDKVCSENPIDPIKSKQWMKA--- 229

Query: 1301 EKDFSDKQIPTKARPIQMNEELLQFCQKEINDLLQKKLIRRSKSPWSCATFYVNKQAEIE 1360
                 D     + +P+  + +  +   K+I +LL   LI  SKS      F V  +AE  
Sbjct: 230  SIKLIDPLKVIRVKPMSYSPQDREGFAKQIKELLDLGLIIPSKSQHMSPAFLVENEAERR 289

Query: 1361 RGTPRLVINYKPLNQALCWIRYPIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRY 1420
            RG  R+V+NYK +NQA     + +PN ++LL  L    IFS FD KSGFWQ+ L E+ + 
Sbjct: 290  RGKKRMVVNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKSGFWQVVLDEESQK 349

Query: 1421 KTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIFNPYSNFTIVYIDDVLIFSQSIDQHF 1480
             TAFT P G ++W V+PFGLK APS FQR M    N    F +VY+DD+++FS S   H+
Sbjct: 350  LTAFTCPQGHFQWKVVPFGLKQAPSIFQRHMQTALNGADKFCMVYVDDIIVFSNSELDHY 409

Query: 1481 KHLNTFISVIKKNGLAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIID 1540
             H+   + +++K G+ +SK K +LF+ KI FLG  I +GT  P N  +E   KFPD++ D
Sbjct: 410  NHVYAVLKIVEKYGIILSKKKANLFKEKINFLGLEIDKGTHCPQNHILENIHKFPDRLED 469

Query: 1541 KTQLQRFLGCLNYVADFCPQLSTIIKLLHDRLKKDPP-PWSDVHTNVVKQIKLRIKNLPC 1599
            K  LQRFLG L Y   + P+L+ I K L  +LKKD    W+   ++ VK+IK  + + P 
Sbjct: 470  KKHLQRFLGVLTYAETYIPKLAEIRKPLQVKLKKDVTWNWTQSDSDYVKKIKKNLGSFPK 529

Query: 1600 LYLPNPQAFKIVETDASDIGFGGILKQKIFDNEQIIA-FTSKHWNPAQQNYSTVKKEVLA 1658
            LYLP P+   I+ETDASD  +GG+LK +  D  ++I  ++S  +  A++NY +  KE+LA
Sbjct: 530  LYLPKPEDHLIIETDASDSFWGGVLKARALDGVELICRYSSGSFKQAEKNYHSNDKELLA 589

Query: 1659 IVLSISKFQYDLINQTFLVRVDCKSAKDILQKDVKNLASKHIFARWQAILSVFDFEIEYI 1718
            +   I+KF   L    F VR D K+    L+ ++K  + +    RWQ   S + F++E++
Sbjct: 590  VKQVITKFSAYLTPVRFTVRTDNKNFTYFLRINLKGDSKQGRLVRWQNWFSKYQFDVEHL 649

Query: 1719 KGSTNSLPDFLTREY 1733
            +G  N L D LTR++
Sbjct: 650  EGVKNVLADCLTRDF 664


>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 659

 Score =  317 bits (813), Expect = 1e-85
 Identities = 211/634 (33%), Positives = 330/634 (51%), Gaps = 34/634 (5%)

Query: 1120 LETPALFDTGANSSCISEGLIPTRYFEKTTEKLSAA-EGSKLIIKYKIPSAI-IKNDSLE 1177
            L+     DTG++    S+ +IP  Y++   + L+      K+I   K+ S + I+     
Sbjct: 27   LDLHCYVDTGSSLCMASKYVIPEEYWQTAEKPLNIKIANGKIIQLTKVCSKLPIRLGGER 86

Query: 1178 IETPFLLVRNLTHKVIIGTPFIKKLFPY---------NTDEKGITVQHLGKPILF----- 1223
               P L  +     +++G  F +   P+         + +++ + +  + K   +     
Sbjct: 87   FLIPTLFQQESGIDLLLGNNFCQLYSPFIQYTDRIYFHLNKQSVIIGKITKAYQYGVKGF 146

Query: 1224 -----KFSKPPFSKTLNIISYKEKQINFLKEEISHKSIEVQLQQPSVKTRIGNILENIQS 1278
                 K SK    + +NI S    Q  FL+E  +H    +   Q S  + I  +LE + S
Sbjct: 147  LESMKKKSKVNRPEPINITS---NQHLFLEEGGNHVDEMLYEIQISKFSAIEEMLERVSS 203

Query: 1279 SICSDLPNAFWERKSHMVELPYEKDFSDKQIPTKARPIQMNEELLQFCQKEINDLLQKKL 1338
                D P    +  +  +EL       D +   K +P+  +    +   ++I +LL+ K+
Sbjct: 204  ENPID-PEKSKQWMTATIEL------IDPKTVVKVKPMSYSPSDREEFDRQIKELLELKV 256

Query: 1339 IRRSKSPWSCATFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARLHDAK 1398
            I+ SKS      F V  +AE  RG  R+V+NYK +N+A     + +PNK +LL  +   K
Sbjct: 257  IKPSKSTHMSPAFLVENEAERRRGKKRMVVNYKAMNKATKGDAHNLPNKDELLTLVRGKK 316

Query: 1399 IFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQR-IMNEIFNP 1457
            I+S FD KSG WQ+ L ++ +  TAFT P G Y+WNV+PFGLK APS F +   N   N 
Sbjct: 317  IYSSFDCKSGLWQVLLDKESQLLTAFTCPQGHYQWNVVPFGLKQAPSIFPKTYANSHSNQ 376

Query: 1458 YSNFTIVYIDDVLIFSQS-IDQHFKHLNTFISVIKKNGLAVSKTKVSLFQTKIRFLGHNI 1516
            YS +  VY+DD+L+FS +   +H+ H+   +   +K G+ +SK K  LF+ KI FLG  I
Sbjct: 377  YSKYCCVYVDDILVFSNTGRKEHYIHVLNILRRCEKLGIILSKKKAQLFKEKINFLGLEI 436

Query: 1517 HQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKLLHDRLKKDP 1576
             QGT  P N  +E   KFPD+I DK QLQRFLG L Y +D+ P+L++I K L  +LK+D 
Sbjct: 437  DQGTHCPQNHILEHIHKFPDRIEDKKQLQRFLGILTYASDYIPKLASIRKPLQSKLKEDS 496

Query: 1577 P-PWSDVHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGILKQKIFDNEQII 1635
               W+D  +  + +IK  +K+ P LY P P    ++ETDAS+  +GGILK     +E I 
Sbjct: 497  TWTWNDTDSQYMAKIKKNLKSFPKLYHPEPNDKLVIETDASEEFWGGILKAIHNSHEYIC 556

Query: 1636 AFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQYDLINQTFLVRVDCKSAKDILQKDVKNL 1695
             + S  +  A++NY + +KE+LA++  I KF   L    FL+R D K+    +  ++K  
Sbjct: 557  RYASGSFKAAERNYHSNEKELLAVIRVIKKFSIYLTPSRFLIRTDNKNFTHFVNINLKGD 616

Query: 1696 ASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFL 1729
              +    RWQ  LS +DF++E+I G+ N   DFL
Sbjct: 617  RKQGRLVRWQMWLSQYDFDVEHIAGTKNVFADFL 650


>POL_SOCMV (P15629) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 692

 Score =  254 bits (649), Expect = 1e-66
 Identities = 208/694 (29%), Positives = 329/694 (46%), Gaps = 100/694 (14%)

Query: 1111 IKIIIG--DFILETPALFDTGANSSCISEGLIPTRY-FEKTTEKLSAAEGSKLIIKYKIP 1167
            IK+ IG  +F+    A  DTGA + C  +  I   +   K  +++  A+ SK  I+  I 
Sbjct: 22   IKVSIGKRNFL----AYIDTGA-TLCFGKRKISNNWEILKQPKEIIIADKSKHYIREAIS 76

Query: 1168 SAIIKNDSLEIETPFLLVRNLTHKVIIGTPFIKKLFPYNTDEKGITVQ--HLGKPILFKF 1225
            +  +K ++ E   P + + +    +IIG  F+K   P+    + I ++  +L  P   + 
Sbjct: 77   NVFLKIENKEFLIPIIYLHDSGLDLIIGNNFLKLYQPFIQRLETIELRWKNLNNPKESQM 136

Query: 1226 SKPPFSKTLNIISYKEKQINF-LKEEISHKSIEVQLQQPSVKTRIGNILENIQSSICSDL 1284
                      ++    ++I+  L++ +  K+IE QL++                 +CS+ 
Sbjct: 137  ISTKILTKNEVLKLSFEKIHICLEKYLFFKTIEEQLEE-----------------VCSEH 179

Query: 1285 PNAFWERKSHM-VEL----PYEKDFSDKQIPTKARPIQMNEELLQFCQKEINDLLQKKLI 1339
            P    + K+ + +E+    P ++     +IP   R +Q  +E       E  DLL+K LI
Sbjct: 180  PLDETKNKNGLLIEIRLKDPLQEINVTNRIPYTIRDVQEFKE-------ECEDLLKKGLI 232

Query: 1340 RRSKSPWSCATFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARLHDAKI 1399
            R S+SP S   FYV    EI+RG  R+VINYK +N+A     Y +P K  +L ++  +  
Sbjct: 233  RESQSPHSAPAFYVENHNEIKRGKRRMVINYKKMNEATIGDSYKLPRKDFILEKIKGSLW 292

Query: 1400 FSKFDMKSGFWQIQLQEKDRYKTAFTV-PFGQYEWNVMPFGLKNAPSEFQRIMNEIFNPY 1458
            FS  D KSG++Q++L E  +  TAF+  P   YEWNV+ FGLK APS +QR M++     
Sbjct: 293  FSSLDAKSGYYQLRLHENTKPLTAFSCPPQKHYEWNVLSFGLKQAPSIYQRFMDQSLKGL 352

Query: 1459 SNFTIVYIDDVLIFSQ-SIDQHFKHLNTFISVIKKNGLAVSKTKVSLFQTKIRFLGHNIH 1517
             +  + YIDD+LIF++ S +QH   +   +  IK+ G+ +SK K  L Q +I +LG  I 
Sbjct: 353  EHICLAYIDDILIFTKGSKEQHVNDVRIVLQRIKEKGIIISKKKSKLIQQEIEYLGLKIQ 412

Query: 1518 -QGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVAD--FCPQLSTIIKLLHDRLK- 1573
              G I       E   +FPD++ D+ Q+QRFLGC+NY+A+  F   L+   K L  ++  
Sbjct: 413  GNGEIDLSPHTQEKILQFPDELEDRKQIQRFLGCINYIANEGFFKNLALERKHLQKKISV 472

Query: 1574 KDPPPWSDVHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGIL------KQK 1627
            K+P  W  + T +V+ IK +I++LP LY  + Q F IVETDAS   + G L      KQK
Sbjct: 473  KNPWKWDTIDTKMVQSIKGKIQSLPKLYNASIQDFLIVETDASQHSWSGCLRALPKGKQK 532

Query: 1628 I-----------------------------------------------FDNEQIIA-FTS 1639
            I                                                +NE ++  + S
Sbjct: 533  IGLDEFGIPTADLCTGSSSASSDNSPAEIDKCHSASKQDTHVASKIKKLENELLLCKYVS 592

Query: 1640 KHWNPAQQNYSTVKKEVLAIVLSISKFQYDLINQTFLVRVDCKSAKDILQKDVKNLASKH 1699
              +   +  Y   + EVLA V  + K++ DL+   FL+R D K      + ++K      
Sbjct: 593  GTFTDTETRYPIAELEVLAGVKVLEKWRIDLLQTRFLLRTDSKYFAGFCRYNIKTDYRNG 652

Query: 1700 IFARWQAILSVFDFEIEYIKGSTNSLPDFLTREY 1733
               RWQ  L  +   +E IK   N   D LTRE+
Sbjct: 653  RLIRWQLRLQAYQPYVELIKSENNPFADTLTREW 686


>POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;
            Protease (EC 3.4.23.-); Reverse transcriptase (EC
            2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
          Length = 1886

 Score =  238 bits (607), Expect = 1e-61
 Identities = 211/775 (27%), Positives = 370/775 (47%), Gaps = 117/775 (15%)

Query: 1023 DTFRRLEKSTIKPVTIQDL-QSEVHTLQAEVKSLKQ------IQISQQLILDK------- 1068
            +  +  EK++    TIQ+  ++E++ ++ E++  K+       Q+ + +I+ +       
Sbjct: 1114 EALKHSEKASRVFSTIQESDEAELNLIKEELRQFKEETRMAIAQLKEAIIVQEEDTIEER 1173

Query: 1069 ----LTEENSEE--SSSSSSTPNSASNNNVGDFLEIINNVIIQKFYINIKIIIGDFILET 1122
                L E+++E   S+++ +  N   N  VG     I    ++ +YIN            
Sbjct: 1174 CAMILEEKHTENIYSATAKAEYNGLYNVKVG-----IKPDNMEPYYIN------------ 1216

Query: 1123 PALFDTGANSSCISEGLIPTRYFE--KTTEKLSAAEGSKLIIKY-KIPSAIIKNDSLEIE 1179
             A+ DTGA +  I    IP  Y+E  K T    +  G     +  K    +I      + 
Sbjct: 1217 -AIVDTGATACLIQISAIPENYYEDAKVTVNFRSVLGIGTSTQMIKAGRILIGEQYFRMP 1275

Query: 1180 TPFLLVRNLTH--KVIIGTPFIKKLFPYNTDEKGITVQHLGKPIL--FKFSKPPFSKTLN 1235
              +++   L+   ++IIG  FI+ L      E G+ ++   K I+  +K      +    
Sbjct: 1276 VTYVMNMGLSPGIQMIIGCSFIRSL------EGGLRIE---KDIITFYKLVTSIETSRTT 1326

Query: 1236 IISYKEKQINFLKEEISHKSIEVQ----LQQPSVKTRIGNILENIQSSICSDLPNAFWER 1291
             ++   +++   ++E  + +  V+    L Q   +     + E  +     + P  FW+ 
Sbjct: 1327 QVANSIEELELSEDEYLNIAASVETPSFLDQEFARKNKDLLKEMKEMKYIGENPMEFWKN 1386

Query: 1292 KSHMVELPYEKDFSDKQIPTKARPIQM----NEELLQFCQKEINDLLQKKLIRRSKSPWS 1347
                 +L    +  +  I    RPI+     +EE +    ++IN LLQ K+IR S+S   
Sbjct: 1387 NKIKCKL----NIINPDIKIMGRPIKHVTPGDEEAMT---RQINLLLQMKVIRPSESKHR 1439

Query: 1348 CATFYVNKQAEIE-------RGTPRLVINYKPLNQALCWIRYPIPNKKDLLARLHDAKIF 1400
               F V    EI+       +G  R+V NYK LN+     +Y +P    +++++  +KI+
Sbjct: 1440 STAFIVRSGTEIDPITGKEKKGKERMVFNYKLLNENTESDQYSLPGINTIISKVGRSKIY 1499

Query: 1401 SKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIFNPYSN 1460
            SKFD+KSGFWQ+ ++E+    TAF      YEW VMPFGLKNAP+ FQR M+ +F     
Sbjct: 1500 SKFDLKSGFWQVAMEEESVPWTAFLAGNKLYEWLVMPFGLKNAPAIFQRKMDNVFKGTEK 1559

Query: 1461 FTIVYIDDVLIFSQSIDQHFKHLNTFISVIKKNGLAVSKTKVSLFQTKIRFLGHNIHQGT 1520
            F  VYIDD+L+FS++ +QH +HL T + + K+NGL +S TK+ +   +I FLG ++    
Sbjct: 1560 FIAVYIDDILVFSETAEQHSQHLYTMLQLCKENGLILSPTKMKIGTPEIDFLGASLGCTK 1619

Query: 1521 I----IPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKLLHDRL---- 1572
            I      I++  +F+D   +++     ++ +LG L+Y  ++   +  +++ L  ++    
Sbjct: 1620 IKLQPHIISKICDFSD---EKLATPEGMRSWLGILSYARNYIQDIGKLVQPLRQKMAPTG 1676

Query: 1573 --KKDPPPWSDVHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGILKQKIF- 1629
              + +P  W      +V+QIK ++KNLP L LP   +F I+ETD    G+G + K K+  
Sbjct: 1677 DKRMNPETW-----KMVRQIKEKVKNLPDLQLPPKDSFIIIETDGCMTGWGAVCKWKMSK 1731

Query: 1630 ----DNEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQ-YDLINQTFLVRVDCKSA 1684
                  E+I A+ S  +NP +   ST+  E+ A +  + KF+ Y L  +  ++R DC++ 
Sbjct: 1732 HDPRSTERICAYASGSFNPIK---STIDAEIQAAIHGLDKFKIYYLDKKELIIRSDCEAI 1788

Query: 1685 KDILQKDVKNLASKHIFARWQAILSVFDF--------EIEYIKGSTNSLPDFLTR 1731
                 K  +N  S+    RW   L+  DF          E+I G  N L D L+R
Sbjct: 1789 IKFYNKTNENKPSR---VRW---LTFSDFLTGLGITVTFEHIDGKHNGLADALSR 1837



 Score = 37.0 bits (84), Expect = 0.48
 Identities = 52/336 (15%), Positives = 123/336 (36%), Gaps = 50/336 (14%)

Query: 726  LSNLKCKSLGDFRWYKDTFLTRVYTR----EDSQQAFWKEKFLAGLPKSFGDKVREKLRS 781
            L  L C +    R Y   +LT          +++     E+    +P + G++V +  + 
Sbjct: 735  LKQLVCPNYQSIRRYLMDYLTLAAETGLMWSETEGPAISEELFTKMPAAIGERVAQAYKI 794

Query: 782  QNPGGEIPYQTLSYGQLIAIIQRVALKICQDDKIQQQLTKEKSQNRRDLGTFCEQFGIQG 841
             +P   +   +  Y  +  + ++     C++    + L             FC  F I+G
Sbjct: 795  MDPTSAVNLPSRVYFTINYLTEQ-----CKEASYMRSLKALD---------FCRDFPIEG 840

Query: 842  CPKKPKPRKHDPPPKQQWRRNSSRNHDHRKPKPRSKPHSTQAAKNPPENRPSQGKNVTCY 901
               +   +K     K    + + + HD+     ++K                  +   CY
Sbjct: 841  YYGRSGEKKKYTARKAT--KYTGKAHDNHIRVTKAKYQ----------------RKCKCY 882

Query: 902  NCGKPGHISRYCRLKRRISELHLEPEIEDKINNLLIQTSDEEESASSDSEVSEDLNQIQN 961
             CG+ GH +  CR K      H + +    + +L ++ ++E  SA    E  +++  +  
Sbjct: 883  ICGQEGHYANQCRNK------HKDQQRVAILQSLDLKENEEVVSADDKEEEDDEIFSVLG 936

Query: 962  DDDPQSSSSINVLTNEQDLLFRAINSIPDPDEKKIYLERLKFTLEDKPPKNPITTNKFNL 1021
            ++D Q  + + +  ++   + +  +   D   + +          + P    +       
Sbjct: 937  EEDYQEETIMVLEEDDIQQIIKEFSKFGDLSRRNVG--------PNFPGPAEVQMGVLKP 988

Query: 1022 RDTFRRLEKSTIKPVTIQDLQSEVHTLQAEVKSLKQ 1057
            + ++RR  ++T++ +      + + T Q   +S KQ
Sbjct: 989  KSSWRRPIQATLEEINCHHNWTAISTGQLACRSCKQ 1024


>POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat
            protein; Protease (EC 3.4.23.-); Reverse transcriptase
            (EC 2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
          Length = 1675

 Score =  228 bits (582), Expect = 9e-59
 Identities = 235/906 (25%), Positives = 407/906 (43%), Gaps = 98/906 (10%)

Query: 881  TQAAKNPPEN---RPSQGKNVTCYNCGKPGHISRYCRLKRRISELHLEPEIEDKINNLLI 937
            T   KN  +N   RPS  K   CY C    H++  C   RR +         ++    LI
Sbjct: 752  TNYNKNRRKNYVRRPSIKKKCRCYICQDENHLANRC--PRRYT---------NQARASLI 800

Query: 938  QTSDEE--ESASSDSEVSEDLNQIQNDDDPQSSSSINVLTNE----QDLLFRAINSIPDP 991
               DE+    AS D ++   L  I+ D+    SS  +  T E    +D +    +   D 
Sbjct: 801  DGLDEDIVSIASDDEDIENFLEIIELDEFIAHSSQEHEHTWEIGGKKDKVCEICSYFTDY 860

Query: 992  DEKKIYLERLKFTLEDKPPKNPITTNKFNLRDTFRRLEKSTIKPVTIQDLQSEVHTLQAE 1051
            + K +  +    T E +  K    +++  L  T   ++K T +   I DL+  V  L+  
Sbjct: 861  N-KTVSCK----TCETQYCKT--CSDQLALEVT--EVKKPTKEETMIDDLKLNVKNLEFR 911

Query: 1052 VKSLKQIQISQQLILDKLTEENSEESSSSSSTPNS--ASNNNVGDFLEIINNVIIQKFYI 1109
            V  L+  ++  Q + DK         S  +  P +  A   N  ++++   N      Y+
Sbjct: 912  VTILEH-KVEMQNLQDKFETMQIRNKSEITEIPTTSLAMRANESNYIKTSINKTAG-CYV 969

Query: 1110 NIKIIIGDFILETPALFDTGANSSCISEGLIPTRYFEKTTEKLS--AAEGSKLIIKYKIP 1167
              KI   +      AL D+G+  + I   LIP  +   T  ++   A + SK  +  ++ 
Sbjct: 970  ETKISFNNENRIITALIDSGSTHNIICPTLIPASWINNTHREIIMFAVDNSKYNLNQELI 1029

Query: 1168 SAIIKNDSLEIETPFLLVRNLTHKVIIGTPFIKKLFPYN--TDEKG--------ITVQ-- 1215
              I K    E++  F +   L    +   P    +  +   T+E G        IT+Q  
Sbjct: 1030 DDI-KLQFQEVDETFGIKYKLGQTYVAPKPTKTFIIGHRFLTNENGSVTIHKDYITIQKT 1088

Query: 1216 ------------------HLGKPILFKFSKPPFSKTLNIISYKEKQINFLKEEISHKSIE 1257
                              H G+P LF      ++K  ++ SY+ + I   K EI ++S+ 
Sbjct: 1089 TGIYPTARHELKSEFARKHGGRPPLFSNIPETYNKIPHLHSYQPQPILGYKNEIGNQSLI 1148

Query: 1258 VQLQQPSVKTRIGNILENIQSSICSDLPNAFWERKSHMVELPYEKDFSDKQIPTKARPIQ 1317
              +++      IG+ +   +++   D      +       +PY    +DK++        
Sbjct: 1149 TMVKELEALGFIGDDITKNRTTWVCDFKIINPDINITCATIPYTP--ADKEV-------- 1198

Query: 1318 MNEELLQFCQKEINDLLQKKLIRRSKSPWS--CATFYVNKQAEIERGTPRLVINYKPLNQ 1375
                     +K+I +LL  KLI+++        A F V   +E     PR+V NYK LN 
Sbjct: 1199 --------FEKQIKELLDNKLIKKADPTCRHRTAAFIVRNHSEEVAQKPRIVYNYKRLND 1250

Query: 1376 ALCWIRYPIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNV 1435
             +    + IP+K  ++  +  A IFSKFD+K+GF  ++L++  +  T FT   G Y WNV
Sbjct: 1251 NMHTDPFNIPHKISMINLIQKANIFSKFDLKAGFHHMKLKDDFKDWTTFTCSEGLYTWNV 1310

Query: 1436 MPFGLKNAPSEFQRIMNEIFNPYSNFTIVYIDDVLIFSQSIDQHFKHLNTFISVIKKNGL 1495
             PFG+ NAP  FQR M E F     F ++YIDD+LI S +  +H +HL  F + +K+ G 
Sbjct: 1311 CPFGIANAPCAFQRFMQESFGDL-KFALLYIDDILIASNNEKEHIEHLKIFFNRVKEVGC 1369

Query: 1496 AVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQ-LQRFLGCLNYV 1554
             +SK K  +F  ++ +LG  I +G I      ++   KF    ++  + LQ +LG LNY 
Sbjct: 1370 VLSKKKSKMFLKEVEYLGVEIKEGKISLQPHIVDKIKKFDKNKLNTLKGLQAYLGLLNYA 1429

Query: 1555 ADFCPQLSTIIKLLHDRLKKDPPP-WSDVHTNVVKQIKLRIKNLPCLYLPNPQAFKIVET 1613
              +   LS ++  L+ +  K+    ++    N++ +I+  +  +  L  P    + I+ET
Sbjct: 1430 RGYIKDLSKLVGPLYKKTGKNGQRIFNKEDWNIIFKIEREVSKIKPLERPKETDYIIIET 1489

Query: 1614 DASDIGFGGIL-----KQKIFDNEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQY 1668
            DAS+ G+G +L     K    D E+I  + S ++   ++ ++++  E+ AI  +++KFQ 
Sbjct: 1490 DASEEGWGAVLVCKPDKYSGKDTEKIAGYASGNFG-EKKTWTSLDYEIEAINEALNKFQI 1548

Query: 1669 DLINQTFLVRVDCKS-AKDILQKDVKNLA-SKHIFARWQAILSVFDFEIEYIKGSTNSLP 1726
              +++ F +R DC++  K I  +D K  + ++ I  R   +   +    E+IKG+ N LP
Sbjct: 1549 -YLDKDFTIRTDCEAIVKGIKTEDYKKRSKTRWIKLRDNLLKDGYKPTFEHIKGNKNFLP 1607

Query: 1727 DFLTRE 1732
            +FL+RE
Sbjct: 1608 NFLSRE 1613


>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
            type 1
          Length = 1333

 Score =  226 bits (575), Expect = 6e-58
 Identities = 185/691 (26%), Positives = 333/691 (47%), Gaps = 70/691 (10%)

Query: 1053 KSLKQIQI-SQQLILDKLTEENSEESSSSSSTPN-SASNNNVGDFLEIINNVIIQKFYIN 1110
            K+ K+ Q+ ++ L  + L+++N+  +S +    N S    +   FL    N   +++ + 
Sbjct: 199  KTHKRFQLQNKNLGKESLSKKNNTTNSRNLRKTNVSRIEYSSNKFL----NHTRKRYEMV 254

Query: 1111 IKIIIGDFILETPALFDTGANSSCISEGLI-----PTRYFEKTTEKLSAAEGSKLIIKYK 1165
            ++  + DF    P L DTGA ++ I+E  +     PTR + K+             I  K
Sbjct: 255  LQAELPDFKCSIPCLIDTGAQANIITEETVRAHKLPTRPWSKSVIYGGVYPNK---INRK 311

Query: 1166 IPSAIIKNDSLEIETPFLLVRNLTHKVIIGTPFIKKLFPYNTDEKGITVQHLGKPILFKF 1225
                 I  + + I+T FL+V+  +H   I       L+  N +                 
Sbjct: 312  TIKLNISLNGISIKTEFLVVKKFSHPAAIS---FTTLYDNNIE----------------- 351

Query: 1226 SKPPFSKTLNIISYKEKQINFLKEEISHKSIEVQLQQPSVKTRIGNILENIQSSICSDLP 1285
                 S + + +S   K  N +KE      I  + +  + +T    + + I+        
Sbjct: 352  ----ISSSKHTLSQMNKVSNIVKEP-ELPDIYKEFKDITAETNTEKLPKPIKG------- 399

Query: 1286 NAFWERKSHMVELPYEKDFSDKQIPTKARPIQMNEELLQFCQKEINDLLQKKLIRRSKSP 1345
                      +E   E    + ++P +  P+   +  +Q    EIN  L+  +IR SK+ 
Sbjct: 400  ----------LEFEVELTQENYRLPIRNYPLPPGK--MQAMNDEINQGLKSGIIRESKAI 447

Query: 1346 WSCATFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARLHDAKIFSKFDM 1405
             +C   +V K+     GT R+V++YKPLN+ +    YP+P  + LLA++  + IF+K D+
Sbjct: 448  NACPVMFVPKK----EGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDL 503

Query: 1406 KSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIFNPYSNFTIV- 1464
            KS +  I++++ D +K AF  P G +E+ VMP+G+  AP+ FQ  +N I        +V 
Sbjct: 504  KSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESHVVC 563

Query: 1465 YIDDVLIFSQSIDQHFKHLNTFISVIKKNGLAVSKTKVSLFQTKIRFLGHNIHQGTIIPI 1524
            Y+DD+LI S+S  +H KH+   +  +K   L +++ K    Q++++F+G++I +    P 
Sbjct: 564  YMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPC 623

Query: 1525 NRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKLLHDRLKKDPP-PWSDVH 1583
               I+   ++  Q  ++ +L++FLG +NY+  F P+ S +   L++ LKKD    W+   
Sbjct: 624  QENIDKVLQW-KQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQ 682

Query: 1584 TNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGILKQKIFDNEQI-IAFTSKHW 1642
            T  ++ IK  + + P L   +     ++ETDASD+  G +L QK  D++   + + S   
Sbjct: 683  TQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKM 742

Query: 1643 NPAQQNYSTVKKEVLAIVLSISKFQYDLIN--QTFLVRVDCKSAKDILQKDVKNLASKHI 1700
            + AQ NYS   KE+LAI+ S+  +++ L +  + F +  D ++    +  + +       
Sbjct: 743  SKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESE--PENKR 800

Query: 1701 FARWQAILSVFDFEIEYIKGSTNSLPDFLTR 1731
             ARWQ  L  F+FEI Y  GS N + D L+R
Sbjct: 801  LARWQLFLQDFNFEINYRPGSANHIADALSR 831


>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
            type 2
          Length = 1333

 Score =  223 bits (567), Expect = 5e-57
 Identities = 181/673 (26%), Positives = 323/673 (47%), Gaps = 71/673 (10%)

Query: 1075 EESSSSSSTPNSAS--NNNVGDFLEIINNVII----QKFYINIKIIIGDFILETPALFDT 1128
            E  S  ++T NS +    NV   +E  +N  +    +++ + ++  + DF    P L DT
Sbjct: 214  ESLSKKNNTTNSRNLRKTNVSR-IEYSSNKFLNHTRKRYEMVLQAELPDFKCSIPCLIDT 272

Query: 1129 GANSSCISEGLI-----PTRYFEKTTEKLSAAEGSKLIIKYKIPSAIIKNDSLEIETPFL 1183
            GA ++ I+E  +     PTR + K+             I  K     I  + + I+T FL
Sbjct: 273  GAQANIITEETVRAHKLPTRPWSKSVIYGGVYPNK---INRKTIKLNISLNGISIKTEFL 329

Query: 1184 LVRNLTHKVIIGTPFIKKLFPYNTDEKGITVQHLGKPILFKFSKPPFSKTLNIISYKEKQ 1243
            +V+  +H   I       L+  N +                      S + + +S   K 
Sbjct: 330  VVKKFSHPAAIS---FTTLYDNNIE---------------------ISSSKHTLSQMNKV 365

Query: 1244 INFLKEEISHKSIEVQLQQPSVKTRIGNILENIQSSICSDLPNAFWERKSHMVELPYEKD 1303
             N +KE      I  + +  + +T    + + I+                  +E   E  
Sbjct: 366  SNIVKEP-ELPDIYKEFKDITAETNTEKLPKPIKG-----------------LEFEVELT 407

Query: 1304 FSDKQIPTKARPIQMNEELLQFCQKEINDLLQKKLIRRSKSPWSCATFYVNKQAEIERGT 1363
              + ++P +  P+   +  +Q    EIN  L+  +IR SK+  +C   +V K+     GT
Sbjct: 408  QENYRLPIRNYPLPPGK--MQAMNDEINQGLKSGIIRESKAINACPVMFVPKK----EGT 461

Query: 1364 PRLVINYKPLNQALCWIRYPIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTA 1423
             R+V++YKPLN+ +    YP+P  + LLA++  + IF+K D+KS +  I++++ D +K A
Sbjct: 462  LRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLA 521

Query: 1424 FTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIFNPYSNFTIV-YIDDVLIFSQSIDQHFKH 1482
            F  P G +E+ VMP+G+  AP+ FQ  +N I        +V Y+D++LI S+S  +H KH
Sbjct: 522  FRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKH 581

Query: 1483 LNTFISVIKKNGLAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKT 1542
            +   +  +K   L +++ K    Q++++F+G++I +    P    I+   ++  Q  ++ 
Sbjct: 582  VKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQW-KQPKNRK 640

Query: 1543 QLQRFLGCLNYVADFCPQLSTIIKLLHDRLKKDPP-PWSDVHTNVVKQIKLRIKNLPCLY 1601
            +L++FLG +NY+  F P+ S +   L++ LKKD    W+   T  ++ IK  + + P L 
Sbjct: 641  ELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLR 700

Query: 1602 LPNPQAFKIVETDASDIGFGGILKQKIFDNEQI-IAFTSKHWNPAQQNYSTVKKEVLAIV 1660
              +     ++ETDASD+  G +L QK  D++   + + S   + AQ NYS   KE+LAI+
Sbjct: 701  HFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAII 760

Query: 1661 LSISKFQYDLIN--QTFLVRVDCKSAKDILQKDVKNLASKHIFARWQAILSVFDFEIEYI 1718
             S+  +++ L +  + F +  D ++    +  + +        ARWQ  L  F+FEI Y 
Sbjct: 761  KSLKHWRHYLESTIEPFKILTDHRNLIGRITNESE--PENKRLARWQLFLQDFNFEINYR 818

Query: 1719 KGSTNSLPDFLTR 1731
             GS N + D L+R
Sbjct: 819  PGSANHIADALSR 831


>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
            type 3
          Length = 1333

 Score =  220 bits (561), Expect = 2e-56
 Identities = 171/637 (26%), Positives = 307/637 (47%), Gaps = 64/637 (10%)

Query: 1105 QKFYINIKIIIGDFILETPALFDTGANSSCISEGLI-----PTRYFEKTTEKLSAAEGSK 1159
            +++ + ++  + DF    P L DTG  ++ I+E  +     PTR + K+           
Sbjct: 249  KRYEMVLQAELPDFKCSIPCLIDTGTQANIITEETVRAHKLPTRPWSKSVIYGGVYPNK- 307

Query: 1160 LIIKYKIPSAIIKNDSLEIETPFLLVRNLTHKVIIGTPFIKKLFPYNTDEKGITVQHLGK 1219
              I  K     I  + + I+T FL+V+  +H   I       L+  N +           
Sbjct: 308  --INRKTIKLNISLNGISIKTEFLVVKKFSHPAAIS---FTTLYDNNIE----------- 351

Query: 1220 PILFKFSKPPFSKTLNIISYKEKQINFLKEEISHKSIEVQLQQPSVKTRIGNILENIQSS 1279
                       S + + +S   K  N +KE      I  + +  + +T    + + I+  
Sbjct: 352  ----------ISSSKHTLSQMNKVSNIVKEP-ELPDIYKEFKDITAETNTEKLPKPIKG- 399

Query: 1280 ICSDLPNAFWERKSHMVELPYEKDFSDKQIPTKARPIQMNEELLQFCQKEINDLLQKKLI 1339
                            +E   E    + ++P +  P+   +  +Q    EIN  L+  +I
Sbjct: 400  ----------------LEFEVELTQENYRLPIRNYPLPPGK--MQAMNDEINQGLKSGII 441

Query: 1340 RRSKSPWSCATFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARLHDAKI 1399
            R SK+  +C   +V K+     GT R+V++YKPLN+ +    YP+P  + LLA++  + I
Sbjct: 442  RESKAINACPVMFVPKK----EGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTI 497

Query: 1400 FSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIFNPYS 1459
            F+K D+KS +  I++++ D +K AF  P G +E+ VMP+G+  AP+ FQ  +N I     
Sbjct: 498  FTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVK 557

Query: 1460 NFTIV-YIDDVLIFSQSIDQHFKHLNTFISVIKKNGLAVSKTKVSLFQTKIRFLGHNIHQ 1518
               +V Y+D++LI S+S  +H KH+   +  +K   L +++ K    Q++++F+G++I +
Sbjct: 558  ESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISE 617

Query: 1519 GTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKLLHDRLKKDPP- 1577
                P    I+   ++  Q  ++ +L++FLG +NY+  F P+ S +   L++ LKKD   
Sbjct: 618  KGFTPCQENIDKVLQW-KQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRW 676

Query: 1578 PWSDVHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGILKQKIFDNEQI-IA 1636
             W+   T  ++ IK  + + P L   +     ++ETDASD+  G +L QK  D++   + 
Sbjct: 677  KWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVG 736

Query: 1637 FTSKHWNPAQQNYSTVKKEVLAIVLSISKFQYDLIN--QTFLVRVDCKSAKDILQKDVKN 1694
            + S   + AQ NYS   KE+LAI+ S+  +++ L +  + F +  D ++    +  + + 
Sbjct: 737  YYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESE- 795

Query: 1695 LASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTR 1731
                   ARWQ  L  F+FEI Y  GS N + D L+R
Sbjct: 796  -PENKRLARWQLFLQDFNFEINYRPGSANHIADALSR 831


>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
            transposon 17.6 [Contains: Protease (EC 3.4.23.-);
            Reverse transcriptase (EC 2.7.7.49); Endonuclease]
          Length = 1058

 Score =  218 bits (556), Expect = 9e-56
 Identities = 138/412 (33%), Positives = 221/412 (53%), Gaps = 13/412 (3%)

Query: 1324 QFCQKEINDLLQKKLIRRSKSPWSCATFYVNKQAEIE-RGTPRLVINYKPLNQALCWIRY 1382
            Q  + +I D+L + +IR S SP++   + V K+ +   +   R+VI+Y+ LN+     R+
Sbjct: 221  QEVESQIQDMLNQGIIRTSNSPYNSPIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDRH 280

Query: 1383 PIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKN 1442
            PIPN  ++L +L     F+  D+  GF QI++  +   KTAF+   G YE+  MPFGLKN
Sbjct: 281  PIPNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKN 340

Query: 1443 APSEFQRIMNEIFNPYSN-FTIVYIDDVLIFSQSIDQHFKHLNTFISVIKKNGLAVSKTK 1501
            AP+ FQR MN+I  P  N   +VY+DD+++FS S+D+H + L      + K  L +   K
Sbjct: 341  APATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDK 400

Query: 1502 VSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQL 1561
                + +  FLGH +    I P    IE   K+P     K +++ FLG   Y   F P  
Sbjct: 401  CEFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPK-EIKAFLGLTGYYRKFIPNF 459

Query: 1562 STIIKLLHDRLKKDP--PPWSDVHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIG 1619
            + I K +   LKK+      +  + +  K++K  I   P L +P+      + TDASD+ 
Sbjct: 460  ADIAKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLTTDASDVA 519

Query: 1620 FGGILKQKIFDNEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQYDLINQTFLVRV 1679
             G +L Q    +   +++ S+  N  + NYST++KE+LAIV +   F++ L+ + F +  
Sbjct: 520  LGAVLSQ----DGHPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGRHFEISS 575

Query: 1680 DCKSAKDILQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTR 1731
            D +    + +  +K+  SK    RW+  LS FDF+I+YIKG  N + D L+R
Sbjct: 576  DHQPLSWLYR--MKDPNSK--LTRWRVKLSEFDFDIKYIKGKENCVADALSR 623


>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
            transposon 297 [Contains: Protease (EC 3.4.23.-); Reverse
            transcriptase (EC 2.7.7.49); Endonuclease]
          Length = 1059

 Score =  215 bits (547), Expect = 1e-54
 Identities = 138/427 (32%), Positives = 226/427 (52%), Gaps = 15/427 (3%)

Query: 1309 IPTKARPIQMNEELLQFCQKEINDLLQKKLIRRSKSPWSCATFYVNKQAEIERGTP-RLV 1367
            I +K  P+    E+    + ++ ++L + LIR S SP++  T+ V K+ +       R+V
Sbjct: 207  IYSKQYPLAQTHEIE--VENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVV 264

Query: 1368 INYKPLNQALCWIRYPIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVP 1427
            I+Y+ LN+     RYPIPN  ++L +L   + F+  D+  GF QI++ E+   KTAF+  
Sbjct: 265  IDYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTK 324

Query: 1428 FGQYEWNVMPFGLKNAPSEFQRIMNEIFNPYSN-FTIVYIDDVLIFSQSIDQHFKHLNTF 1486
             G YE+  MPFGL+NAP+ FQR MN I  P  N   +VY+DD++IFS S+ +H   +   
Sbjct: 325  SGHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLV 384

Query: 1487 ISVIKKNGLAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQR 1546
             + +    L +   K    + +  FLGH +    I P    ++    +P    DK +++ 
Sbjct: 385  FTKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDK-EIRA 443

Query: 1547 FLGCLNYVADFCPQLSTIIKLLHDRLKKDPPPWSD--VHTNVVKQIKLRIKNLPCLYLPN 1604
            FLG   Y   F P  + I K +   LKK     +    +    +++K  I   P L LP+
Sbjct: 444  FLGLTGYYRKFIPNYADIAKPMTSCLKKRTKIDTQKLEYIEAFEKLKALIIRDPILQLPD 503

Query: 1605 PQAFKIVETDASDIGFGGILKQKIFDNEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSIS 1664
             +   ++ TDAS++  G +L Q    N   I+F S+  N  + NYS ++KE+LAIV +  
Sbjct: 504  FEKKFVLTTDASNLALGAVLSQ----NGHPISFISRTLNDHELNYSAIEKELLAIVWATK 559

Query: 1665 KFQYDLINQTFLVRVDCKSAKDILQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNS 1724
             F++ L+ + FL+  D +  + +   ++K   +K    RW+  LS + F+I+YIKG  NS
Sbjct: 560  TFRHYLLGRQFLIASDHQPLRWL--HNLKEPGAK--LERWRVRLSEYQFKIDYIKGKENS 615

Query: 1725 LPDFLTR 1731
            + D L+R
Sbjct: 616  VADALSR 622


>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
          Length = 2186

 Score =  185 bits (469), Expect = 1e-45
 Identities = 134/434 (30%), Positives = 217/434 (49%), Gaps = 24/434 (5%)

Query: 1309 IPTKARPIQMNEELLQFCQKEINDLLQKKLIRRSKSPWSCATFYVNKQAEIERGTPRLVI 1368
            I  K RPI +   L    +K I  +L +K+IR SKSPWS     V K+     G+ R+ I
Sbjct: 943  IRQKPRPIPL--ALKPEIRKMIQKMLNQKVIRESKSPWSSPVVLVKKKD----GSIRMCI 996

Query: 1369 NYKPLNQALCWIRYPIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPF 1428
            +Y+ +N+ +    +P+PN +  L  L   K+++ FDM +GFWQI L EK +  TAF +  
Sbjct: 997  DYRKVNKVVKNNAHPLPNIEATLQSLAGKKLYTVFDMIAGFWQIPLDEKSKEITAFAIGS 1056

Query: 1429 GQYEWNVMPFGLKNAPSEFQRIMNEIFNPYSNF-TIVYIDDVLIFSQSIDQHFKHLNTFI 1487
              +EWNV+PFGL  +P+ FQ  M EI          VY+DD+LI S+ ++QH + +   +
Sbjct: 1057 ELFEWNVLPFGLVISPALFQGTMEEIIGDLLGVCAFVYVDDLLIASKDMEQHLQDVKEAL 1116

Query: 1488 SVIKKNGLAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFP--DQIIDKTQLQ 1545
            + I+K+G+ +  +K  + + ++ +LGH +   T+  +      TDK     +  +  +LQ
Sbjct: 1117 TRIRKSGMKLRASKCHIAKKEVEYLGHKV---TLDGVETQEVKTDKMKQFSRPTNVKELQ 1173

Query: 1546 RFLGCLNYVADFCPQLSTIIKLLHDRLK-KDPPPWSDVHTNVVKQIKLRIKNLPCLYLPN 1604
             FLG + Y   F    + I   L   +  K    W        +++K  +   P L  P+
Sbjct: 1174 SFLGLVGYYRKFILNFAQIASSLTSLISAKVAWIWEKEQEIAFQELKKLVCQTPVLAQPD 1233

Query: 1605 PQAFK------IVETDASDIGFGGILKQKIFDNEQ-IIAFTSKHWNPAQQNYSTVKKEVL 1657
             +A        ++ TDAS  G G +L Q+  D +Q  IAF SK  +PA+  Y     E L
Sbjct: 1234 VEAALKGDRPFMIYTDASRKGIGAVLAQEGPDGQQHPIAFASKALSPAETRYHITDLEAL 1293

Query: 1658 AIVLSISKFQYDLINQTFLVRVDCKSAKDILQKDVKNLASKHIFARWQAILSVFDFEIEY 1717
            A++ ++ +F+  +      V  D K    +L+     LA +    RW   +  FD +I Y
Sbjct: 1294 AMMFALRRFKTIIYGTAITVFTDHKPLISLLKG--SPLADR--LWRWSIEILEFDVKIVY 1349

Query: 1718 IKGSTNSLPDFLTR 1731
            + G  N++ D L+R
Sbjct: 1350 LAGKANAVADALSR 1363


>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
            transposon opus [Contains: Protease (EC 3.4.23.-);
            Reverse transcriptase (EC 2.7.7.49); Endonuclease]
          Length = 1003

 Score =  179 bits (454), Expect = 6e-44
 Identities = 140/479 (29%), Positives = 236/479 (49%), Gaps = 30/479 (6%)

Query: 1274 ENIQSSICSDLPNAFWERKSHM-VELPYEKDF-SDKQIPTKAR----PIQMNEELLQFCQ 1327
            + I +S+  + P  F    S M VE   + +  ++ Q P  A+    P+ M  E+    +
Sbjct: 85   QEILNSLLGEFPRIFEPPLSGMSVETAVKAEIRTNTQDPIYAKSYPYPVNMRGEV----E 140

Query: 1328 KEINDLLQKKLIRRSKSPWSCATFYVNKQAEIERGTP-RLVINYKPLNQALCWIRYPIPN 1386
            ++I++LLQ  +IR S SP++   + V K+ +       R+V+++K LN       YPIP+
Sbjct: 141  RQIDELLQDGIIRPSNSPYNSPIWIVPKKPKPNGEKQYRMVVDFKRLNTVTIPDTYPIPD 200

Query: 1387 KKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSE 1446
                LA L +AK F+  D+ SGF QI ++E D  KTAF+   G+YE+  +PFGLKNAP+ 
Sbjct: 201  INATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPFGLKNAPAI 260

Query: 1447 FQRIMNEIFNPY-SNFTIVYIDDVLIFSQSIDQHFKHLNTFISVIKKNGLAVSKTKVSLF 1505
            FQR++++I   +      VYIDD+++FS+  D H+K+L   ++ + K  L V+  K    
Sbjct: 261  FQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNLEKSHFL 320

Query: 1506 QTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTII 1565
             T++ FLG+ +    I    + +    + P     K +L+RFLG  +Y   F    + + 
Sbjct: 321  DTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVK-ELKRFLGMTSYYRKFIQDYAKVA 379

Query: 1566 KLL------------HDRLKKDPPPWSDVHTNVVKQIKLRIKNLPCLYLPNPQAFKIVET 1613
            K L              +  K P    +        +K  + +   L  P       + T
Sbjct: 380  KPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKPFHLTT 439

Query: 1614 DASDIGFGGILKQKIFDNEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQYDLINQ 1673
            DAS+   G +L Q     ++ IA+ S+  N  ++NY+T++KE+LAI+ S+   +  L   
Sbjct: 440  DASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLDNLRAYLYGA 499

Query: 1674 -TFLVRVDCKSAKDILQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTR 1731
             T  V  D +     L    +N  +K    RW+A +  ++ E+ Y  G +N + D L+R
Sbjct: 500  GTIKVYTDHQPLTFALGN--RNFNAK--LKRWKARIEEYNCELIYKPGKSNVVADALSR 554


>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
            transposon gypsy [Contains: Reverse transcriptase (EC
            2.7.7.49); Endonuclease]
          Length = 1035

 Score =  176 bits (445), Expect = 7e-43
 Identities = 139/531 (26%), Positives = 260/531 (48%), Gaps = 45/531 (8%)

Query: 1234 LNIISYKEKQINFLKEEISHKSIEVQLQQ---PSVK-TRIGNILENIQSSICSDLPNAFW 1289
            L++++    ++N  ++ + ++ I  +L     PSV  T + +I+  +  S+  +  +   
Sbjct: 94   LDLLTQAGVKLNLAEDSLEYQGIAEKLHYFSCPSVNFTDVNDIV--VPDSVKKEFKDTII 151

Query: 1290 ERKSHMVE----LPYE-------KDFSDKQIPTKARPIQMNEELLQFCQKEINDLLQKKL 1338
             RK         LP+        +   ++ + ++A P  M   +  F   E+  LL+  +
Sbjct: 152  RRKKAFSTTNEALPFNTAVTATIRTVDNEPVYSRAYPTLMG--VSDFVNNEVKQLLKDGI 209

Query: 1339 IRRSKSPWSCATFYVNKQAEIERGTP--RLVINYKPLNQALCWIRYPIPNKKDLLARLHD 1396
            IR S+SP++  T+ V+K+     G P  RLVI+++ LN+     RYP+P+   +LA L  
Sbjct: 210  IRPSRSPYNSPTWVVDKKGTDAFGNPNKRLVIDFRKLNEKTIPDRYPMPSIPMILANLGK 269

Query: 1397 AKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIF- 1455
            AK F+  D+KSG+ QI L E DR KT+F+V  G+YE+  +PFGL+NA S FQR ++++  
Sbjct: 270  AKFFTTLDLKSGYHQIYLAEHDREKTSFSVNGGKYEFCRLPFGLRNASSIFQRALDDVLR 329

Query: 1456 NPYSNFTIVYIDDVLIFSQSIDQHFKHLNTFISVIKKNGLAVSKTKVSLFQTKIRFLGHN 1515
                    VY+DDV+IFS++   H +H++T +  +    + VS+ K   F+  + +LG  
Sbjct: 330  EQIGKICYVYVDDVIIFSENESDHVRHIDTVLKCLIDANMRVSQEKTRFFKESVEYLGFI 389

Query: 1516 IHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKLLHDRL--- 1572
            + +         ++   ++P+      +++ FLG  +Y   F    + I + + D L   
Sbjct: 390  VSKDGTKSDPEKVKAIQEYPEPDC-VYKVRSFLGLASYYRVFIKDFAAIARPITDILKGE 448

Query: 1573 ---------KKDPPPWSDVHTNVVKQIK--LRIKNLPCLYLPNPQAFKIVETDASDIGFG 1621
                     KK P  +++   N  ++++  L  +++   Y    + F +  TDAS  G G
Sbjct: 449  NGSVSKHMSKKIPVEFNETQRNAFQRLRNILASEDVILKYPDFKKPFDLT-TDASASGIG 507

Query: 1622 GILKQKIFDNEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQYDLINQTFLVRVDC 1681
             +L Q+     + I   S+     +QNY+T ++E+LAIV ++ K Q  L        ++ 
Sbjct: 508  AVLSQE----GRPITMISRTLKQPEQNYATNERELLAIVWALGKLQNFLYGSR---EINI 560

Query: 1682 KSAKDILQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTRE 1732
             +    L   V +  +     RW++ +   + ++ Y  G  N + D L+R+
Sbjct: 561  FTDHQPLTFAVADRNTNAKIKRWKSYIDQHNAKVFYKPGKENFVADALSRQ 611


>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
            transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
            transcriptase (EC 2.7.7.49); Endonuclease]
          Length = 1237

 Score =  164 bits (414), Expect = 3e-39
 Identities = 132/497 (26%), Positives = 235/497 (46%), Gaps = 22/497 (4%)

Query: 1245 NFLKEEISHKSIEVQLQQPSVKTRIGNILENIQSSICSDLPNAF-WERKSHMVELPYEKD 1303
            N ++    H++  V  Q   +K     + ++   +ICS+  + F  E +   V   Y++ 
Sbjct: 250  NVVQANSEHRNKTVLSQ---LKKNFPELFKSQLENICSEYIDIFALESEPITVNNLYKQQ 306

Query: 1304 F---SDKQIPTK--ARPIQMNEELLQFCQKEINDLLQKKLIRRSKSPWSCATFYVNKQAE 1358
                 D+ + TK    P    EE+    Q ++  L++ K++  S S ++     V K++ 
Sbjct: 307  LRLKDDEPVYTKNYRSPHSQVEEI----QAQVQKLIKDKIVEPSVSQYNSPLLLVPKKSS 362

Query: 1359 --IERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQE 1416
               ++   RLVI+Y+ +N+ L   ++P+P   D+L +L  AK FS  D+ SGF QI+L E
Sbjct: 363  PNSDKKKWRLVIDYRQINKKLLADKFPLPRIDDILDQLGRAKYFSCLDLMSGFHQIELDE 422

Query: 1417 KDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIFNPYS-NFTIVYIDDVLIFSQS 1475
              R  T+F+   G Y +  +PFGLK AP+ FQR+M   F+    +   +Y+DD+++   S
Sbjct: 423  GSRDITSFSTSNGSYRFTRLPFGLKIAPNSFQRMMTIAFSGIEPSQAFLYMDDLIVIGCS 482

Query: 1476 IDQHFKHLNTFISVIKKNGLAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFP 1535
                 K+L       ++  L +   K S F  ++ FLGH      I+P ++  +    +P
Sbjct: 483  EKHMLKNLTEVFGKCREYNLKLHPEKCSFFMHEVTFLGHKCTDKGILPDDKKYDVIQNYP 542

Query: 1536 DQIIDKTQLQRFLGCLNYVADFCPQLSTIIKLLHDRLKKDPP-PWSDVHTNVVKQIKLRI 1594
                D    +RF+   NY   F    +   + +    KK+ P  W+D        +K ++
Sbjct: 543  VP-HDADSARRFVAFCNYYRRFIKNFADYSRHITRLCKKNVPFEWTDECQKAFIHLKSQL 601

Query: 1595 KNLPCLYLPNPQAFKIVETDASDIGFGGILKQKIFDNEQIIAFTSKHWNPAQQNYSTVKK 1654
             N   L  P+      + TDAS    G +L Q    ++  +A+ S+ +   + N ST ++
Sbjct: 602  INPTLLQYPDFSKEFCITTDASKQACGAVLTQNHNGHQLPVAYASRAFTKGESNKSTTEQ 661

Query: 1655 EVLAIVLSISKFQYDLINQTFLVRVDCKSAKDILQKDVKNLASKHIFARWQAILSVFDFE 1714
            E+ AI  +I  F+  +  + F V+ D +    +    + N +SK    R +  L  ++F 
Sbjct: 662  ELAAIHWAIIHFRPYIYGKHFTVKTDHRPLTYLF--SMVNPSSK--LTRIRLELEEYNFT 717

Query: 1715 IEYIKGSTNSLPDFLTR 1731
            +EY+KG  N + D L+R
Sbjct: 718  VEYLKGKDNHVADALSR 734


>POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
            Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
            3.1.26.4) (RT); Integrase (IN)]
          Length = 1161

 Score =  102 bits (253), Expect = 1e-20
 Identities = 87/377 (23%), Positives = 167/377 (44%), Gaps = 25/377 (6%)

Query: 1310 PTKARPIQMNEELLQFCQKEINDLLQKKLIRRSKSPWSCATFYVNKQAEIERGTPRLVIN 1369
            P K  PI  N +     Q  I+DLL++ ++ +  S  +   + V K      G  R+V++
Sbjct: 176  PQKQYPI--NPKAKPSIQIVIDDLLKQGVLIQQNSTMNTPVYPVPKPD----GKWRMVLD 229

Query: 1370 YKPLNQALCWIRYPIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFG 1429
            Y+ +N+ +  I     +   +L+ ++  K  +  D+ +GFW   +  +  + TAFT    
Sbjct: 230  YREVNKTIPLIAAQNQHSAGILSSIYRGKYKTTLDLTNGFWAHPITPESYWLTAFTWQGK 289

Query: 1430 QYEWNVMPFGLKNAPSEFQRIMNEIFNPYSNFTIVYIDDVLIFSQSIDQHFKHLNTFISV 1489
            QY W  +P G  N+P+ F   + ++     N    Y+DD+ I      +H + L    S+
Sbjct: 290  QYCWTRLPQGFLNSPALFTADVVDLLKEIPNVQ-AYVDDIYISHDDPQEHLEQLEKIFSI 348

Query: 1490 IKKNGLAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQII------DKTQ 1543
            +   G  VS  K  + Q ++ FLG NI              TD F  +++      D  Q
Sbjct: 349  LLNAGYVVSLKKSEIAQREVEFLGFNI-------TKEGRGLTDTFKQKLLNITPPKDLKQ 401

Query: 1544 LQRFLGCLNYVADFCPQLSTIIKLLHDRLKKDPP---PWSDVHTNVVKQIKLRIKNLPCL 1600
            LQ  LG LN+  +F P  S ++K L+  +         W++ ++N ++ I   +     L
Sbjct: 402  LQSILGLLNFARNFIPNYSELVKPLYTIVANANGKFISWTEDNSNQLQHIISVLNQADNL 461

Query: 1601 YLPNPQAFKIVETDASDIGFGGILKQKIFDNEQIIAFTSKHWNPAQQNYSTVKKEVLAIV 1660
               NP+   I++ ++S     G ++     +++ I + +  ++ A+  ++  +K +  + 
Sbjct: 462  EERNPETRLIIKVNSSP--SAGYIRYYNEGSKRPIMYVNYIFSKAEAKFTQTEKLLTTMH 519

Query: 1661 LSISKFQYDLINQTFLV 1677
              + K     + Q  LV
Sbjct: 520  KGLIKAMDLAMGQEILV 536


  Database: sprot
    Posted date:  Nov 25, 2004 10:54 AM
  Number of letters in database: 59,974,054
  Number of sequences in database:  164,201
  
Lambda     K      H
   0.339    0.149    0.490 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 190,646,649
Number of Sequences: 164201
Number of extensions: 8027356
Number of successful extensions: 41685
Number of sequences better than 10.0: 304
Number of HSP's better than 10.0 without gapping: 90
Number of HSP's successfully gapped in prelim test: 216
Number of HSP's that attempted gapping in prelim test: 40848
Number of HSP's gapped (non-prelim): 789
length of query: 1733
length of database: 59,974,054
effective HSP length: 124
effective length of query: 1609
effective length of database: 39,613,130
effective search space: 63737526170
effective search space used: 63737526170
T: 11
A: 40
X1: 15 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.8 bits)
S2: 73 (32.7 bits)


Lotus: description of TM0101.14