Lotus
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= TM0173b.7
         (1526 letters)

Database: sprot 
           164,201 sequences; 59,974,054 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro...   336  3e-91
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro...   335  4e-91
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro...   334  1e-90
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro...   331  8e-90
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro...   328  7e-89
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot...   325  4e-88
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot...   320  1e-86
POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;...   246  3e-64
POL_SOCMV (P15629) Enzymatic polyprotein [Contains: Aspartic pro...   246  5e-64
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei...   233  2e-60
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei...   231  1e-59
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei...   227  2e-58
POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat pr...   224  2e-57
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran...   219  5e-56
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran...   213  3e-54
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III    185  9e-46
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran...   184  2e-45
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran...   180  3e-44
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran...   162  5e-39
POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23...   103  4e-21
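
The summary table above has one hit per line: a Swiss-Prot entry name, an accession in parentheses, a description that may be truncated with "...", then the bit score and E-value. A minimal sketch of reading such a line in Python follows. The names `Hit` and `parse_hit_line` are hypothetical helpers introduced for illustration and are not part of BLAST; the layout is assumed to match the lines shown in this report.

```python
import re
from collections import namedtuple

# Hypothetical record type for one row of the hit summary table.
Hit = namedtuple("Hit", "entry accession description bits evalue")

def parse_hit_line(line):
    """Split one summary line into its fields.

    Assumes the layout seen in this report:
    ENTRY (ACCESSION) description...   <bits>  <E-value>
    """
    # The last two whitespace-separated tokens are the score and the E-value.
    head, bits, evalue = line.rstrip().rsplit(None, 2)
    m = re.match(r"(\S+)\s+\((\w+)\)\s+(.*)", head)
    if m is None:
        raise ValueError("not a hit summary line: %r" % line)
    # Some BLAST versions print very small E-values as e.g. "e-100".
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return Hit(m.group(1), m.group(2), m.group(3).rstrip(),
               float(bits), float(evalue))

# Example line copied from the table above.
sample = ("POL_CAMVE (Q02964) Enzymatic polyprotein "
          "[Contains: Aspartic pro...   336  3e-91")
hit = parse_hit_line(sample)
```

The description field is kept exactly as printed, so a trailing "..." marks text that the report truncated; it is not reconstructed here.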

>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 679

 Score =  336 bits (861), Expect = 3e-91
 Identities = 224/645 (34%), Positives = 338/645 (51%), Gaps = 41/645 (6%)

Query: 914  LETPALFDTGADSSCISEGLIPTRYFEKTTEKLSG--AEGSKLIIKY--KIPSAIIKNDS 969
            +E     DTGA     S+ +IP  ++      +    A+GS + I    K    II  + 
Sbjct: 38   IELHCFVDTGASLCIASKFVIPEEHWVNAERPIMVKIADGSSITISKVCKDIDLIIAREI 97

Query: 970  LEIETSFLLVRNLTHKVIIETPFIKKLFPY-NTDEKGITVQHHGQPI-VFKFSKPPFVKT 1027
             +I T +     +    II   F +   P+    ++ I  ++   P+ + K ++   V T
Sbjct: 98   FKIPTVYQQESGIDF--IIGNNFCQLYEPFIQFTDRVIFTKNKSYPVHIAKLTRAVRVGT 155

Query: 1028 LNIISYKEKQINFLKEE---ISYKNIEVQLQQPSVKS------------------RIENI 1066
               +   +K+    + E   IS   IE  L++ ++ S                  +IE +
Sbjct: 156  EGFLESMKKRSKTQQPEPVNISTNKIENPLKEIAILSEGRRLSEEKLFITQQRMQKIEEL 215

Query: 1067 LENIQSSICFDLPNAFWERKSHMVELPYEKDFSDKQIPTKARPIQMNEELLQFFQREIND 1126
            LE + S    D PN   +     ++L      SD     K +P++ +    + F ++I +
Sbjct: 216  LEKVCSENPLD-PNKTKQWMKASIKL------SDPSKAIKVKPMKYSPMDREEFDKQIKE 268

Query: 1127 LLQKKLIRRSKSPWSCAAFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLA 1186
            LL  K+I+ SKSP    AF VN +AE  RG  R+V+NYK +N+A     Y +PNK +LL 
Sbjct: 269  LLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLT 328

Query: 1187 RLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMN 1246
             +   KIFS FD KSGFWQ+ L ++ R  TAFT P G YEWNV+PFGLK APS FQR M+
Sbjct: 329  LIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMD 388

Query: 1247 EIFNP*SKFAIVYIDDVLIFSQSIDQHFKHLNTFISIIKKNGMAVSKTKVSLFQTKIRFL 1306
            E F    KF  VY+DD+L+FS + + H  H+   +    ++G+ +SK K  LF+ KI FL
Sbjct: 389  EAFRVFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFL 448

Query: 1307 GHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRL 1366
            G  I +GT  P    +E  +KFPD + DK QLQRFLG L Y +D+ P+L+ I KPL  +L
Sbjct: 449  GLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKL 508

Query: 1367 KKDPP-PWSDIHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGILKQKIIDK 1425
            K++ P  W+   T  ++++K  ++  P L+ P P+   I+ETDASD  +GG+LK   I++
Sbjct: 509  KENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINE 568

Query: 1426 ----EQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISNFQSDLINQKFLVRVDCKSAKDI 1481
                E I  + S  +  A++NY +  KE LA++ +I  F   L    FL+R D    K  
Sbjct: 569  GTNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSF 628

Query: 1482 LQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTRE 1526
            +  + K  +      RWQA LS + F++E+IKG+ N   DFL+RE
Sbjct: 629  VNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSRE 673
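
The bit score and Expect value reported for this hit (336 bits, 3e-91) are related by the standard approximation E = m * n * 2^(-S'), where S' is the bit score, m the query length and n the database length. The sketch below plugs in the raw lengths printed in the report header (1526 letters, 59,974,054 total letters). This is an assumption made for illustration: BLAST itself uses edge-corrected "effective" lengths and an unrounded bit score, so the result is expected to agree with the reported value only to within a small factor. The function name `expect_from_bits` is hypothetical.

```python
def expect_from_bits(bit_score, query_len, db_len):
    """Approximate E-value from a bit score: E = m * n * 2**(-S').

    Uses raw lengths, not BLAST's effective lengths, so it tends to
    overestimate the E-value printed in the report.
    """
    return query_len * db_len * 2.0 ** (-bit_score)

# Values taken from this report: top hit score, query and database sizes.
approx_e = expect_from_bits(336, 1526, 59974054)
```

With these inputs the estimate falls in the same order of magnitude as the printed Expect value, which is the level of agreement to expect from uncorrected lengths.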


>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 679

 Score =  335 bits (860), Expect = 4e-91
 Identities = 224/645 (34%), Positives = 337/645 (51%), Gaps = 41/645 (6%)

Query: 914  LETPALFDTGADSSCISEGLIPTRYFEKTTEKLSG--AEGSKLIIKY--KIPSAIIKNDS 969
            +E     DTGA     S+ +IP  ++      +    A+GS + I    K    II  + 
Sbjct: 38   IELHCFVDTGASLCIASKFVIPEEHWVNAERPIMVKIADGSSITISKVCKDIDLIIAGEI 97

Query: 970  LEIETSFLLVRNLTHKVIIETPFIKKLFPY-NTDEKGITVQHHGQPI-VFKFSKPPFVKT 1027
              I T +     +    II   F +   P+    ++ I  ++   P+ + K ++   V T
Sbjct: 98   FRIPTVYQQESGIDF--IIGNNFCQLYEPFIQFTDRVIFTKNKSYPVHIAKLTRAVRVGT 155

Query: 1028 LNIISYKEKQINFLKEE---ISYKNIEVQLQQPSVKS------------------RIENI 1066
               +   +K+    + E   IS   IE  L++ ++ S                  +IE +
Sbjct: 156  EGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSEGRRLSEEKLFITQQRMQKIEEL 215

Query: 1067 LENIQSSICFDLPNAFWERKSHMVELPYEKDFSDKQIPTKARPIQMNEELLQFFQREIND 1126
            LE + S    D PN   +     ++L      SD     K +P++ +    + F ++I +
Sbjct: 216  LEKVCSENPLD-PNKTKQWMKASIKL------SDPSKAIKVKPMKYSPMDREEFDKQIKE 268

Query: 1127 LLQKKLIRRSKSPWSCAAFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLA 1186
            LL  K+I+ SKSP    AF VN +AE  RG  R+V+NYK +N+A     Y +PNK +LL 
Sbjct: 269  LLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNLPNKDELLT 328

Query: 1187 RLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMN 1246
             +   KIFS FD KSGFWQ+ L ++ R  TAFT P G YEWNV+PFGLK APS FQR M+
Sbjct: 329  LIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMD 388

Query: 1247 EIFNP*SKFAIVYIDDVLIFSQSIDQHFKHLNTFISIIKKNGMAVSKTKVSLFQTKIRFL 1306
            E F    KF  VY+DD+L+FS + + H  H+   +    ++G+ +SK K  LF+ KI FL
Sbjct: 389  EAFRVFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFL 448

Query: 1307 GHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRL 1366
            G  I +GT  P    +E  +KFPD + DK QLQRFLG L Y +D+ P+L+ I KPL  +L
Sbjct: 449  GLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKL 508

Query: 1367 KKDPP-PWSDIHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGILKQKIIDK 1425
            K++ P  W+   T  ++++K  ++  P L+ P P+   I+ETDASD  +GG+LK   I++
Sbjct: 509  KENVPWRWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINE 568

Query: 1426 ----EQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISNFQSDLINQKFLVRVDCKSAKDI 1481
                E I  + S  +  A++NY +  KE LA++ +I  F   L    FL+R D    K  
Sbjct: 569  GTNTELICRYASGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSF 628

Query: 1482 LQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTRE 1526
            +  + K  +      RWQA LS + F++E+IKG+ N   DFL+RE
Sbjct: 629  VNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSRE 673
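
Each alignment begins with a statistics line giving Identities, Positives and Gaps as counts over the alignment length. A minimal sketch of extracting those counts is shown below. `parse_stats` is a hypothetical helper, and the pattern assumes the "Name = a/b (p%)" layout used in this report. The printed percentages are deliberately ignored, since their rounding depends on the BLAST version.

```python
import re

# Matches e.g. "Identities = 224/645" and captures the name and both counts.
STAT_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+)")

def parse_stats(line):
    """Return {name: (count, alignment_length)} for one statistics line."""
    return {name.lower(): (int(count), int(length))
            for name, count, length in STAT_RE.findall(line)}

# Statistics line copied from the first alignment in this report.
stats = parse_stats(
    " Identities = 224/645 (34%), Positives = 338/645 (51%), Gaps = 41/645 (6%)"
)
identity_fraction = stats["identities"][0] / stats["identities"][1]
```

Keeping the raw counts rather than the printed percentages lets downstream code choose its own rounding.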


>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 679

 Score =  334 bits (856), Expect = 1e-90
 Identities = 223/645 (34%), Positives = 337/645 (51%), Gaps = 41/645 (6%)

Query: 914  LETPALFDTGADSSCISEGLIPTRYFEKTTEKLSG--AEGSKLIIKY--KIPSAIIKNDS 969
            +E     DTGA     S+ +IP  ++      +    A+GS + I    K    II  + 
Sbjct: 38   IELHCFVDTGASLCIASKFVIPEEHWVNAERPIMVKIADGSSITISKVCKDIDLIIAGEI 97

Query: 970  LEIETSFLLVRNLTHKVIIETPFIKKLFPY-NTDEKGITVQHHGQPI-VFKFSKPPFVKT 1027
             +I T +     +    II   F +   P+    ++ I  ++   P+ + K ++   V  
Sbjct: 98   FKIPTVYQQESGIDF--IIGNNFCQLYEPFIQFTDRVIFTKNKSYPVHITKLTRAVRVGI 155

Query: 1028 LNIISYKEKQINFLKEE---ISYKNIEVQLQQPSVKS------------------RIENI 1066
               +   +K+    + E   IS   IE  L++ ++ S                  +IE +
Sbjct: 156  EGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSEGRRLSEEKLFITQQRMQKIEEL 215

Query: 1067 LENIQSSICFDLPNAFWERKSHMVELPYEKDFSDKQIPTKARPIQMNEELLQFFQREIND 1126
            LE + S    D PN   +     ++L      SD     K +P++ +    + F ++I +
Sbjct: 216  LEKVCSENPLD-PNKTKQWMKASIKL------SDPSKAIKVKPMKYSPMDREEFDKQIKE 268

Query: 1127 LLQKKLIRRSKSPWSCAAFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLA 1186
            LL  K+I+ SKSP    AF VN +AE  RG  R+V+NYK +N+A     Y +PNK +LL 
Sbjct: 269  LLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLT 328

Query: 1187 RLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMN 1246
             +   KIFS FD KSGFWQ+ L ++ R  TAFT P G YEWNV+PFGLK APS FQR M+
Sbjct: 329  LIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMD 388

Query: 1247 EIFNP*SKFAIVYIDDVLIFSQSIDQHFKHLNTFISIIKKNGMAVSKTKVSLFQTKIRFL 1306
            E F    KF  VY+DD+L+FS + + H  H+   +    ++G+ +SK K  LF+ KI FL
Sbjct: 389  EAFRVFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFL 448

Query: 1307 GHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRL 1366
            G  I +GT  P    +E  +KFPD + DK QLQRFLG L Y +D+ P+L+ I KPL  +L
Sbjct: 449  GLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKL 508

Query: 1367 KKDPP-PWSDIHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGILKQKIIDK 1425
            K++ P  W+   T  ++++K  ++  P L+ P P+   I+ETDASD  +GG+LK   I++
Sbjct: 509  KENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINE 568

Query: 1426 ----EQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISNFQSDLINQKFLVRVDCKSAKDI 1481
                E I  + S  +  A++NY +  KE LA++ +I  F   L    FL+R D    K  
Sbjct: 569  GTNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSF 628

Query: 1482 LQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTRE 1526
            +  + K  +      RWQA LS + F++E+IKG+ N   DFL+RE
Sbjct: 629  VNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSRE 673


>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 674

 Score =  331 bits (849), Expect = 8e-90
 Identities = 190/470 (40%), Positives = 273/470 (57%), Gaps = 12/470 (2%)

Query: 1062 RIENILENIQSSICFDLPNAFWERKSHMVELPYEKDFSDKQIPTKARPIQMNEELLQFFQ 1121
            +IE +LE + S    D PN   +     ++L      SD     K +P++ +    + F 
Sbjct: 206  KIEELLEKVCSENPLD-PNKTKQWMKASIKL------SDPSKAIKVKPMKYSPMDREEFD 258

Query: 1122 REINDLLQKKLIRRSKSPWSCAAFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNK 1181
            ++I +LL  K+I+ SKSP    AF VN +AE  RG  R+V+NYK +N+A     Y  PNK
Sbjct: 259  KQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNPPNK 318

Query: 1182 KDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEF 1241
             +LL  +   KIFS FD KSGFWQ+ L ++ R  TAFT P G YEWNV+PFGLK APS F
Sbjct: 319  DELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIF 378

Query: 1242 QRIMNEIFNP*SKFAIVYIDDVLIFSQSIDQHFKHLNTFISIIKKNGMAVSKTKVSLFQT 1301
            QR M+E F    KF  VY+DD+L+FS + + H  H+   +    ++G+ +SK K  LF+ 
Sbjct: 379  QRHMDEAFRVFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKK 438

Query: 1302 KIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKP 1361
            KI FLG  I +GT  P    +E  +KFPD + DK QLQRFLG L Y +D+ P+L+ I KP
Sbjct: 439  KINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKP 498

Query: 1362 LHDRLKKDPP-PWSDIHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGILKQ 1420
            L  +LK++ P  W+   T  ++++K  ++  P L+ P P+   I+ETDASD  +GG+LK 
Sbjct: 499  LQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKA 558

Query: 1421 KIIDK----EQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISNFQSDLINQKFLVRVDCK 1476
              I++    E I  + S  +  A++NY +  KE LA++ +I  F   L    FL+R D  
Sbjct: 559  IKINEGTNTELICRYASGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNT 618

Query: 1477 SAKDILQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTRE 1526
              K  +  + K  +      RWQA LS + F++E+IKG+ N   DFL+RE
Sbjct: 619  HFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSRE 668


>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 680

 Score =  328 bits (841), Expect = 7e-89
 Identities = 219/656 (33%), Positives = 338/656 (51%), Gaps = 63/656 (9%)

Query: 914  LETPALFDTGADSSCISEGLIPTRYFEKTTEKLSG--AEGSKLIIK-------------- 957
            +E     DTGA     S+ +IP  ++      +    A+GS + I               
Sbjct: 39   IELHCFVDTGASLCIASKFVIPEEHWVNAERPIMVKIADGSSITISKVCKDIDLIIVGVI 98

Query: 958  YKIPSAIIKNDSLEIETSFLLVRNLTHKVIIETPFIKKLFPYNTDEKGITVQHHGQPI-V 1016
            +KIP+   +   ++    F++  N      +  PFI+        ++ I  ++   P+ +
Sbjct: 99   FKIPTVYQQESGID----FIIGNNFCQ---LYEPFIQ------FTDRVIFTKNKSYPVHI 145

Query: 1017 FKFSKPPFVKTLNIISYKEKQINFLKEE---ISYKNIEVQLQQPSVKS------------ 1061
             K ++   V T   +   +K+    + E   IS   IE  L++ ++ S            
Sbjct: 146  AKLTRAVRVGTEGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSEGRRLSEEKLFI 205

Query: 1062 ------RIENILENIQSSICFDLPNAFWERKSHMVELPYEKDFSDKQIPTKARPIQMNEE 1115
                  + E +LE + S    D PN   +     ++L      SD     K +P++ +  
Sbjct: 206  TQQRMQKTEELLEKVCSENPLD-PNKTKQWMKASIKL------SDPSKAIKVKPMKYSPM 258

Query: 1116 LLQFFQREINDLLQKKLIRRSKSPWSCAAFYVNKQAEIERGTPRLVINYKPLNQALCWIR 1175
              + F ++I +LL  K+I+ SKSP    AF VN +AE  RG  R+V+NYK +N+A     
Sbjct: 259  DREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAENGRGNKRMVVNYKAMNKATVGDA 318

Query: 1176 YPIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLK 1235
            Y +PNK +LL  +   KIFS FD KSGFWQ+ L ++ R  TAFT P G YEWNV+PFGLK
Sbjct: 319  YNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLK 378

Query: 1236 NAPSEFQRIMNEIFNP*SKFAIVYIDDVLIFSQSIDQHFKHLNTFISIIKKNGMAVSKTK 1295
             APS FQR M+E F    KF  VY+DD+++FS + + H  H+   +    ++G+ +SK K
Sbjct: 379  QAPSIFQRHMDEAFRVFRKFCCVYVDDIVVFSNNEEDHLLHVAMILQKCNQHGIILSKKK 438

Query: 1296 VSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQL 1355
              LF+ KI FLG  I +GT  P    +E  +KFPD + DK QLQRFLG L Y +D+ P L
Sbjct: 439  AQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPNL 498

Query: 1356 STIIKPLHDRLKKDPP-PWSDIHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGF 1414
            + + +PL  +LK++ P  W+   T  ++++K  ++  P L+ P P+   I+ETDASD  +
Sbjct: 499  AQMRQPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYW 558

Query: 1415 GGILKQKIIDK----EQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISNFQSDLINQKFL 1470
            GG+LK   I++    E I  + S  +  A++NY +  KE LA++ +I  F   L    FL
Sbjct: 559  GGMLKAIKINEGTNTELICRYRSGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFL 618

Query: 1471 VRVDCKSAKDILQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTRE 1526
            +R D    K  +  + K  +      RWQA LS + F++E+IKG+ N   DFL+RE
Sbjct: 619  IRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSRE 674


>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 666

 Score =  325 bits (834), Expect = 4e-88
 Identities = 215/627 (34%), Positives = 328/627 (52%), Gaps = 38/627 (6%)

Query: 921  DTGADSSCISEGLIPTRYFEKTTE----KLSGAEGSKLIIKYKIPSAIIKNDSLEIETSF 976
            DTGA     S  +IP   +E + +    K++  E  K+    K         S EI T +
Sbjct: 54   DTGASLCIASRYIIPEELWENSPKDIQVKIANQELIKITKVCKNLKVKFAGKSFEIPTVY 113

Query: 977  LLVRNLTHKVIIETPFIKKLFPYNTDEKGITVQHHGQPIV-------FKFSKPPFVKTLN 1029
                 +    +I   F +   P+   E  I      + ++       F  S P F++ + 
Sbjct: 114  QQETGIDF--LIGNNFCRLYNPFIQWEDRIAFHLKNEMVLIKKVTKAFSVSNPSFLENMK 171

Query: 1030 IISYKEKQI-------NFLKEEISYKNIEVQLQQPSVKSRIENILENIQSSICFD-LPNA 1081
              S K +QI       N +  E  Y  I  + Q      +IE +L+ + S    D + + 
Sbjct: 172  KDS-KTEQIPGTNISKNIINPEERYFLITEKYQ------KIEQLLDKVCSENPIDPIKSK 224

Query: 1082 FWERKSHMVELPYEKDFSDKQIPTKARPIQMNEELLQFFQREINDLLQKKLIRRSKSPWS 1141
             W + S  +  P +          + +P+  + +  + F ++I +LL   LI  SKS   
Sbjct: 225  QWMKASIKLIDPLKV--------IRVKPMSYSPQDREGFAKQIKELLDLGLIIPSKSQHM 276

Query: 1142 CAAFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARLHDAKIFSKFDMKS 1201
              AF V  +AE  RG  R+V+NYK +NQA     + +PN ++LL  L    IFS FD KS
Sbjct: 277  SPAFLVENEAERRRGKKRMVVNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKS 336

Query: 1202 GFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIFNP*SKFAIVYID 1261
            GFWQ+ L E+ +  TAFT P G ++W V+PFGLK APS FQR M    N   KF +VY+D
Sbjct: 337  GFWQVVLDEESQKLTAFTCPQGHFQWKVVPFGLKQAPSIFQRHMQTALNGADKFCMVYVD 396

Query: 1262 DVLIFSQSIDQHFKHLNTFISIIKKNGMAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRA 1321
            D+++FS S   H+ H+   + I++K G+ +SK K +LF+ KI FLG  I +GT  P N  
Sbjct: 397  DIIVFSNSELDHYNHVYAVLKIVEKYGIILSKKKANLFKEKINFLGLEIDKGTHCPQNHI 456

Query: 1322 IEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRLKKDPP-PWSDIHTNV 1380
            +E   KFPD++ DK  LQRFLG L Y   + P+L+ I KPL  +LKKD    W+   ++ 
Sbjct: 457  LENIHKFPDRLEDKKHLQRFLGVLTYAETYIPKLAEIRKPLQVKLKKDVTWNWTQSDSDY 516

Query: 1381 VKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGILKQKIIDKEQIIA-FTSKHWNPA 1439
            VK+IK  + + P LYLP P+   I+ETDASD  +GG+LK + +D  ++I  ++S  +  A
Sbjct: 517  VKKIKKNLGSFPKLYLPKPEDHLIIETDASDSFWGGVLKARALDGVELICRYSSGSFKQA 576

Query: 1440 QQNYSTVKKEVLAIVLSISNFQSDLINQKFLVRVDCKSAKDILQKDVKNLASKHIFARWQ 1499
            ++NY +  KE+LA+   I+ F + L   +F VR D K+    L+ ++K  + +    RWQ
Sbjct: 577  EKNYHSNDKELLAVKQVITKFSAYLTPVRFTVRTDNKNFTYFLRINLKGDSKQGRLVRWQ 636

Query: 1500 AILSVFDFEIEYIKGSTNSLPDFLTRE 1526
               S + F++E+++G  N L D LTR+
Sbjct: 637  NWFSKYQFDVEHLEGVKNVLADCLTRD 663


>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 659

 Score =  320 bits (821), Expect = 1e-86
 Identities = 190/496 (38%), Positives = 281/496 (56%), Gaps = 10/496 (2%)

Query: 1031 ISYKEKQINFLKEEISYKNIEVQLQQPSVKSRIENILENIQSSICFDLPNAFWERKSHMV 1090
            I+    Q  FL+E  ++ +  +   Q S  S IE +LE + S    D P    +  +  +
Sbjct: 162  INITSNQHLFLEEGGNHVDEMLYEIQISKFSAIEEMLERVSSENPID-PEKSKQWMTATI 220

Query: 1091 ELPYEKDFSDKQIPTKARPIQMNEELLQFFQREINDLLQKKLIRRSKSPWSCAAFYVNKQ 1150
            EL       D +   K +P+  +    + F R+I +LL+ K+I+ SKS     AF V  +
Sbjct: 221  EL------IDPKTVVKVKPMSYSPSDREEFDRQIKELLELKVIKPSKSTHMSPAFLVENE 274

Query: 1151 AEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQE 1210
            AE  RG  R+V+NYK +N+A     + +PNK +LL  +   KI+S FD KSG WQ+ L +
Sbjct: 275  AERRRGKKRMVVNYKAMNKATKGDAHNLPNKDELLTLVRGKKIYSSFDCKSGLWQVLLDK 334

Query: 1211 KDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQR-IMNEIFNP*SKFAIVYIDDVLIFSQS 1269
            + +  TAFT P G Y+WNV+PFGLK APS F +   N   N  SK+  VY+DD+L+FS +
Sbjct: 335  ESQLLTAFTCPQGHYQWNVVPFGLKQAPSIFPKTYANSHSNQYSKYCCVYVDDILVFSNT 394

Query: 1270 -IDQHFKHLNTFISIIKKNGMAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKF 1328
               +H+ H+   +   +K G+ +SK K  LF+ KI FLG  I QGT  P N  +E   KF
Sbjct: 395  GRKEHYIHVLNILRRCEKLGIILSKKKAQLFKEKINFLGLEIDQGTHCPQNHILEHIHKF 454

Query: 1329 PDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRLKKDPP-PWSDIHTNVVKQIKLR 1387
            PD+I DK QLQRFLG L Y +D+ P+L++I KPL  +LK+D    W+D  +  + +IK  
Sbjct: 455  PDRIEDKKQLQRFLGILTYASDYIPKLASIRKPLQSKLKEDSTWTWNDTDSQYMAKIKKN 514

Query: 1388 IKNLPCLYLPNPQAFKIVETDASDIGFGGILKQKIIDKEQIIAFTSKHWNPAQQNYSTVK 1447
            +K+ P LY P P    ++ETDAS+  +GGILK      E I  + S  +  A++NY + +
Sbjct: 515  LKSFPKLYHPEPNDKLVIETDASEEFWGGILKAIHNSHEYICRYASGSFKAAERNYHSNE 574

Query: 1448 KEVLAIVLSISNFQSDLINQKFLVRVDCKSAKDILQKDVKNLASKHIFARWQAILSVFDF 1507
            KE+LA++  I  F   L   +FL+R D K+    +  ++K    +    RWQ  LS +DF
Sbjct: 575  KELLAVIRVIKKFSIYLTPSRFLIRTDNKNFTHFVNINLKGDRKQGRLVRWQMWLSQYDF 634

Query: 1508 EIEYIKGSTNSLPDFL 1523
            ++E+I G+ N   DFL
Sbjct: 635  DVEHIAGTKNVFADFL 650


>POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;
            Protease (EC 3.4.23.-); Reverse transcriptase (EC
            2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
          Length = 1886

 Score =  246 bits (629), Expect = 3e-64
 Identities = 220/783 (28%), Positives = 372/783 (47%), Gaps = 129/783 (16%)

Query: 813  DTFKRLEKSTIKPVTIQDL-QSEVHILKAEVKSLKQ------IQTSQQLILEK------- 858
            +  K  EK++    TIQ+  ++E++++K E++  K+       Q  + +I+++       
Sbjct: 1114 EALKHSEKASRVFSTIQESDEAELNLIKEELRQFKEETRMAIAQLKEAIIVQEEDTIEER 1173

Query: 859  --LTEENSNGGSSSSSSSTSTSNRAPNNNVGDFLEIINYVTIQKFYINITIIIGDFILET 916
              +  E  +  +  S+++ +  N   N  VG     I    ++ +YIN            
Sbjct: 1174 CAMILEEKHTENIYSATAKAEYNGLYNVKVG-----IKPDNMEPYYIN------------ 1216

Query: 917  PALFDTGADSSCISEGLIPTRYFE--KTTEKLSGAEG---SKLIIK----------YKIP 961
             A+ DTGA +  I    IP  Y+E  K T       G   S  +IK          +++P
Sbjct: 1217 -AIVDTGATACLIQISAIPENYYEDAKVTVNFRSVLGIGTSTQMIKAGRILIGEQYFRMP 1275

Query: 962  SAIIKNDSLEIETSFLL----VRNLTHKVIIETPFIKKLFPYNTDEKGITVQHHGQPIVF 1017
               + N  L      ++    +R+L   + IE   I       + E   T Q        
Sbjct: 1276 VTYVMNMGLSPGIQMIIGCSFIRSLEGGLRIEKDIITFYKLVTSIETSRTTQVANSIEEL 1335

Query: 1018 KFSKPPFVKTLNIISYKEKQINFLKEEISYKNIEVQLQQPSVKSRIENILENIQSSICFD 1077
            + S+  +   LNI +  E   +FL +E + KN ++  +   +K   EN +E         
Sbjct: 1336 ELSEDEY---LNIAASVETP-SFLDQEFARKNKDLLKEMKEMKYIGENPME--------- 1382

Query: 1078 LPNAFWERKSHMVELPYEKDFSDKQIPTKARPIQM----NEELLQFFQREINDLLQKKLI 1133
                FW+      +L    +  +  I    RPI+     +EE +    R+IN LLQ K+I
Sbjct: 1383 ----FWKNNKIKCKL----NIINPDIKIMGRPIKHVTPGDEEAMT---RQINLLLQMKVI 1431

Query: 1134 RRSKSPWSCAAFYVNKQAEIE-------RGTPRLVINYKPLNQALCWIRYPIPNKKDLLA 1186
            R S+S     AF V    EI+       +G  R+V NYK LN+     +Y +P    +++
Sbjct: 1432 RPSESKHRSTAFIVRSGTEIDPITGKEKKGKERMVFNYKLLNENTESDQYSLPGINTIIS 1491

Query: 1187 RLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMN 1246
            ++  +KI+SKFD+KSGFWQ+ ++E+    TAF      YEW VMPFGLKNAP+ FQR M+
Sbjct: 1492 KVGRSKIYSKFDLKSGFWQVAMEEESVPWTAFLAGNKLYEWLVMPFGLKNAPAIFQRKMD 1551

Query: 1247 EIFNP*SKFAIVYIDDVLIFSQSIDQHFKHLNTFISIIKKNGMAVSKTKVSLFQTKIRFL 1306
             +F    KF  VYIDD+L+FS++ +QH +HL T + + K+NG+ +S TK+ +   +I FL
Sbjct: 1552 NVFKGTEKFIAVYIDDILVFSETAEQHSQHLYTMLQLCKENGLILSPTKMKIGTPEIDFL 1611

Query: 1307 GHNIHQGTI----IPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPL 1362
            G ++    I      I++  +F+D   +++     ++ +LG L+Y  ++   +  +++PL
Sbjct: 1612 GASLGCTKIKLQPHIISKICDFSD---EKLATPEGMRSWLGILSYARNYIQDIGKLVQPL 1668

Query: 1363 HDRL------KKDPPPWSDIHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGG 1416
              ++      + +P  W      +V+QIK ++KNLP L LP   +F I+ETD    G+G 
Sbjct: 1669 RQKMAPTGDKRMNPETW-----KMVRQIKEKVKNLPDLQLPPKDSFIIIETDGCMTGWGA 1723

Query: 1417 ILKQKII-----DKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISNFQSDLINQK-FL 1470
            + K K+        E+I A+ S  +NP +   ST+  E+ A +  +  F+   +++K  +
Sbjct: 1724 VCKWKMSKHDPRSTERICAYASGSFNPIK---STIDAEIQAAIHGLDKFKIYYLDKKELI 1780

Query: 1471 VRVDCKSAKDILQKDVKNLASKHIFARWQAILSVFDF--------EIEYIKGSTNSLPDF 1522
            +R DC++      K  +N  S+    RW   L+  DF          E+I G  N L D 
Sbjct: 1781 IRSDCEAIIKFYNKTNENKPSR---VRW---LTFSDFLTGLGITVTFEHIDGKHNGLADA 1834

Query: 1523 LTR 1525
            L+R
Sbjct: 1835 LSR 1837



 Score = 37.4 bits (85), Expect = 0.32
 Identities = 47/249 (18%), Positives = 94/249 (36%), Gaps = 42/249 (16%)

Query: 518 LSNLKCKSLGDFRWYKDTFLTRVYTR----EDSQHAFWKEKFLAGLPKSFGDKVREKLRS 573
           L  L C +    R Y   +LT          +++     E+    +P + G++V +  + 
Sbjct: 735 LKQLVCPNYQSIRRYLMDYLTLAAETGLMWSETEGPAISEELFTKMPAAIGERVAQAYKI 794

Query: 574 QNPGGEIPYQTLSYGQLIAIIQRVALKICQDDKIQQQLTKEKSQNRRDLGTFCEQFGIQG 633
            +P   +   +  Y  +  + ++     C++    + L             FC  F I+G
Sbjct: 795 MDPTSAVNLPSRVYFTINYLTEQ-----CKEASYMRSLKALD---------FCRDFPIEG 840

Query: 634 CPKKPKPRKQDPPPKQQWRRKSSQNHDHRKPKPRSKPHSTQAAKTPPENRPQGKDVTCYN 693
              +   +K+         RK++        K   K H      T  + + + K   CY 
Sbjct: 841 YYGRSGEKKK------YTARKAT--------KYTGKAHDNHIRVTKAKYQRKCK---CYI 883

Query: 694 CGKPGHISRYCRLKRRISE-LHLEPEIEDKINNLLIQTSDEEE------SVPSDSEVSED 746
           CG+ GH +  CR K +  + + +   ++ K N  ++   D+EE      SV  + +  E+
Sbjct: 884 CGQEGHYANQCRNKHKDQQRVAILQSLDLKENEEVVSADDKEEEDDEIFSVLGEEDYQEE 943

Query: 747 LNQIQNDDD 755
              +  +DD
Sbjct: 944 TIMVLEEDD 952


>POL_SOCMV (P15629) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 692

 Score =  246 bits (627), Expect = 5e-64
 Identities = 203/685 (29%), Positives = 311/685 (44%), Gaps = 108/685 (15%)

Query: 918  ALFDTGADSSCISEGLIPTRY-FEKTTEKLSGAEGSKLIIKYKIPSAIIKNDSLEIETSF 976
            A  DTGA + C  +  I   +   K  +++  A+ SK  I+  I +  +K ++ E     
Sbjct: 33   AYIDTGA-TLCFGKRKISNNWEILKQPKEIIIADKSKHYIREAISNVFLKIENKEFLIPI 91

Query: 977  LLVRNLTHKVIIETPFIKKLFPYNTDEKGITVQHHGQPIVFKFSKPPFVKTLNIISYKEK 1036
            + + +    +II   F+K                            PF++ L  I  + K
Sbjct: 92   IYLHDSGLDLIIGNNFLKLY-------------------------QPFIQRLETIELRWK 126

Query: 1037 QINFLKE---------------EISYKNIEVQLQQPSVKSRIENILENIQSSICFDLPNA 1081
             +N  KE               ++S++ I + L++      IE  LE + S    D    
Sbjct: 127  NLNNPKESQMISTKILTKNEVLKLSFEKIHICLEKYLFFKTIEEQLEEVCSEHPLDETK- 185

Query: 1082 FWERKSHMVELPYEKDFSDKQIPTKARPIQMNEELLQFFQREINDLLQKKLIRRSKSPWS 1141
               +   ++E+  +    +  + T   P  + +  +Q F+ E  DLL+K LIR S+SP S
Sbjct: 186  --NKNGLLIEIRLKDPLQEINV-TNRIPYTIRD--VQEFKEECEDLLKKGLIRESQSPHS 240

Query: 1142 CAAFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARLHDAKIFSKFDMKS 1201
              AFYV    EI+RG  R+VINYK +N+A     Y +P K  +L ++  +  FS  D KS
Sbjct: 241  APAFYVENHNEIKRGKRRMVINYKKMNEATIGDSYKLPRKDFILEKIKGSLWFSSLDAKS 300

Query: 1202 GFWQIQLQEKDRYKTAFTV-PFGQYEWNVMPFGLKNAPSEFQRIMNEIFNP*SKFAIVYI 1260
            G++Q++L E  +  TAF+  P   YEWNV+ FGLK APS +QR M++         + YI
Sbjct: 301  GYYQLRLHENTKPLTAFSCPPQKHYEWNVLSFGLKQAPSIYQRFMDQSLKGLEHICLAYI 360

Query: 1261 DDVLIFSQ-SIDQHFKHLNTFISIIKKNGMAVSKTKVSLFQTKIRFLGHNIH-QGTIIPI 1318
            DD+LIF++ S +QH   +   +  IK+ G+ +SK K  L Q +I +LG  I   G I   
Sbjct: 361  DDILIFTKGSKEQHVNDVRIVLQRIKEKGIIISKKKSKLIQQEIEYLGLKIQGNGEIDLS 420

Query: 1319 NRAIEFTDKFPDQIIDKTQLQRFLGCLNYVAD--FCPQLSTIIKPLHDRLK-KDPPPWSD 1375
                E   +FPD++ D+ Q+QRFLGC+NY+A+  F   L+   K L  ++  K+P  W  
Sbjct: 421  PHTQEKILQFPDELEDRKQIQRFLGCINYIANEGFFKNLALERKHLQKKISVKNPWKWDT 480

Query: 1376 IHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGIL------KQKI------- 1422
            I T +V+ IK +I++LP LY  + Q F IVETDAS   + G L      KQKI       
Sbjct: 481  IDTKMVQSIKGKIQSLPKLYNASIQDFLIVETDASQHSWSGCLRALPKGKQKIGLDEFGI 540

Query: 1423 --------------------IDKEQ---------------------IIAFTSKHWNPAQQ 1441
                                IDK                       +  + S  +   + 
Sbjct: 541  PTADLCTGSSSASSDNSPAEIDKCHSASKQDTHVASKIKKLENELLLCKYVSGTFTDTET 600

Query: 1442 NYSTVKKEVLAIVLSISNFQSDLINQKFLVRVDCKSAKDILQKDVKNLASKHIFARWQAI 1501
             Y   + EVLA V  +  ++ DL+  +FL+R D K      + ++K         RWQ  
Sbjct: 601  RYPIAELEVLAGVKVLEKWRIDLLQTRFLLRTDSKYFAGFCRYNIKTDYRNGRLIRWQLR 660

Query: 1502 LSVFDFEIEYIKGSTNSLPDFLTRE 1526
            L  +   +E IK   N   D LTRE
Sbjct: 661  LQAYQPYVELIKSENNPFADTLTRE 685


>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
            type 1
          Length = 1333

 Score =  233 bits (595), Expect = 2e-60
 Identities = 201/740 (27%), Positives = 359/740 (48%), Gaps = 77/740 (10%)

Query: 807  NKFNLKDTFKRLEKSTIKPVTIQDLQSEVHILKAEVKSLKQIQTSQQLILEKLTE----- 861
            +K N+ D   RL    + P +++ L+ E +  K+E+  +      ++      T      
Sbjct: 148  SKANVDDFHTRLFILWMLPYSLRKLK-ERNYWKSEISEIYDFLEDKRTASYGKTHKRFQL 206

Query: 862  ENSNGGSSSSSSSTSTSN----RAPNNNVGDFL--EIINYVTIQKFYINITIIIGDFILE 915
            +N N G  S S   +T+N    R  N +  ++   + +N+ T +++ + +   + DF   
Sbjct: 207  QNKNLGKESLSKKNNTTNSRNLRKTNVSRIEYSSNKFLNH-TRKRYEMVLQAELPDFKCS 265

Query: 916  TPALFDTGADSSCISEGLI-----PTRYFEKTTEKLSGAEGSKLIIKYKIPSAIIKNDSL 970
             P L DTGA ++ I+E  +     PTR + K+     G   +K  I  K     I  + +
Sbjct: 266  IPCLIDTGAQANIITEETVRAHKLPTRPWSKSVI-YGGVYPNK--INRKTIKLNISLNGI 322

Query: 971  EIETSFLLVRNLTHKVIIETPFIKKLFPYNTDEKGITVQHHGQPIVFKFSKPPFVKTLNI 1030
             I+T FL+V+  +H   I       L+  N +   I+   H    + K S        NI
Sbjct: 323  SIKTEFLVVKKFSHPAAIS---FTTLYDNNIE---ISSSKHTLSQMNKVS--------NI 368

Query: 1031 ISYKEKQINFLKEEISYKNIEVQLQQPSVKSRIENILENIQSSICFDLPNAFWERKSHMV 1090
            +  KE ++  + +E  +K+I  +     +   I+ +                        
Sbjct: 369  V--KEPELPDIYKE--FKDITAETNTEKLPKPIKGL------------------------ 400

Query: 1091 ELPYEKDFSDKQIPTKARPIQMNEELLQFFQREINDLLQKKLIRRSKSPWSCAAFYVNKQ 1150
            E   E    + ++P +  P+   +  +Q    EIN  L+  +IR SK+  +C   +V K+
Sbjct: 401  EFEVELTQENYRLPIRNYPLPPGK--MQAMNDEINQGLKSGIIRESKAINACPVMFVPKK 458

Query: 1151 AEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQE 1210
                 GT R+V++YKPLN+ +    YP+P  + LLA++  + IF+K D+KS +  I++++
Sbjct: 459  ----EGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRK 514

Query: 1211 KDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIFNP*SKFAIV-YIDDVLIFSQS 1269
             D +K AF  P G +E+ VMP+G+  AP+ FQ  +N I     +  +V Y+DD+LI S+S
Sbjct: 515  GDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKS 574

Query: 1270 IDQHFKHLNTFISIIKKNGMAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFP 1329
              +H KH+   +  +K   + +++ K    Q++++F+G++I +    P    I+   ++ 
Sbjct: 575  ESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQW- 633

Query: 1330 DQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRLKKDPP-PWSDIHTNVVKQIKLRI 1388
             Q  ++ +L++FLG +NY+  F P+ S +  PL++ LKKD    W+   T  ++ IK  +
Sbjct: 634  KQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCL 693

Query: 1389 KNLPCLYLPNPQAFKIVETDASDIGFGGILKQK-IIDKEQIIAFTSKHWNPAQQNYSTVK 1447
             + P L   +     ++ETDASD+  G +L QK   DK   + + S   + AQ NYS   
Sbjct: 694  VSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSD 753

Query: 1448 KEVLAIVLSISNFQSDLIN--QKFLVRVDCKSAKDILQKDVKNLASKHIFARWQAILSVF 1505
            KE+LAI+ S+ +++  L +  + F +  D ++    +  + +        ARWQ  L  F
Sbjct: 754  KEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESE--PENKRLARWQLFLQDF 811

Query: 1506 DFEIEYIKGSTNSLPDFLTR 1525
            +FEI Y  GS N + D L+R
Sbjct: 812  NFEINYRPGSANHIADALSR 831


>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
            type 2
          Length = 1333

 Score =  231 bits (589), Expect = 1e-59
 Identities = 199/740 (26%), Positives = 360/740 (47%), Gaps = 77/740 (10%)

Query: 807  NKFNLKDTFKRLEKSTIKPVTIQDLQSEVHILKAEVKSLKQIQTSQQLIL-----EKLTE 861
            +K N+ D   RL    + P +++ L+ E +  K+E+  +      ++        ++   
Sbjct: 148  SKANVDDFHTRLFILWMLPYSLRKLK-ERNYWKSEISEIYDFLEDKRTASYGKTHKRFQP 206

Query: 862  ENSNGGSSSSSSSTSTSN----RAPNNNVGDFL--EIINYVTIQKFYINITIIIGDFILE 915
            +N N G  S S   +T+N    R  N +  ++   + +N+ T +++ + +   + DF   
Sbjct: 207  QNKNLGKESLSKKNNTTNSRNLRKTNVSRIEYSSNKFLNH-TRKRYEMVLQAELPDFKCS 265

Query: 916  TPALFDTGADSSCISEGLI-----PTRYFEKTTEKLSGAEGSKLIIKYKIPSAIIKNDSL 970
             P L DTGA ++ I+E  +     PTR + K+     G   +K  I  K     I  + +
Sbjct: 266  IPCLIDTGAQANIITEETVRAHKLPTRPWSKSVI-YGGVYPNK--INRKTIKLNISLNGI 322

Query: 971  EIETSFLLVRNLTHKVIIETPFIKKLFPYNTDEKGITVQHHGQPIVFKFSKPPFVKTLNI 1030
             I+T FL+V+  +H   I       L+  N +   I+   H    + K S        NI
Sbjct: 323  SIKTEFLVVKKFSHPAAIS---FTTLYDNNIE---ISSSKHTLSQMNKVS--------NI 368

Query: 1031 ISYKEKQINFLKEEISYKNIEVQLQQPSVKSRIENILENIQSSICFDLPNAFWERKSHMV 1090
            +  KE ++  + +E  +K+I  +     +   I+ +                        
Sbjct: 369  V--KEPELPDIYKE--FKDITAETNTEKLPKPIKGL------------------------ 400

Query: 1091 ELPYEKDFSDKQIPTKARPIQMNEELLQFFQREINDLLQKKLIRRSKSPWSCAAFYVNKQ 1150
            E   E    + ++P +  P+   +  +Q    EIN  L+  +IR SK+  +C   +V K+
Sbjct: 401  EFEVELTQENYRLPIRNYPLPPGK--MQAMNDEINQGLKSGIIRESKAINACPVMFVPKK 458

Query: 1151 AEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQE 1210
                 GT R+V++YKPLN+ +    YP+P  + LLA++  + IF+K D+KS +  I++++
Sbjct: 459  ----EGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRK 514

Query: 1211 KDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIFNP*SKFAIV-YIDDVLIFSQS 1269
             D +K AF  P G +E+ VMP+G+  AP+ FQ  +N I     +  +V Y+D++LI S+S
Sbjct: 515  GDEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKS 574

Query: 1270 IDQHFKHLNTFISIIKKNGMAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFP 1329
              +H KH+   +  +K   + +++ K    Q++++F+G++I +    P    I+   ++ 
Sbjct: 575  ESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQW- 633

Query: 1330 DQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRLKKDPP-PWSDIHTNVVKQIKLRI 1388
             Q  ++ +L++FLG +NY+  F P+ S +  PL++ LKKD    W+   T  ++ IK  +
Sbjct: 634  KQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCL 693

Query: 1389 KNLPCLYLPNPQAFKIVETDASDIGFGGILKQK-IIDKEQIIAFTSKHWNPAQQNYSTVK 1447
             + P L   +     ++ETDASD+  G +L QK   DK   + + S   + AQ NYS   
Sbjct: 694  VSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSD 753

Query: 1448 KEVLAIVLSISNFQSDLIN--QKFLVRVDCKSAKDILQKDVKNLASKHIFARWQAILSVF 1505
            KE+LAI+ S+ +++  L +  + F +  D ++    +  + +        ARWQ  L  F
Sbjct: 754  KEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESE--PENKRLARWQLFLQDF 811

Query: 1506 DFEIEYIKGSTNSLPDFLTR 1525
            +FEI Y  GS N + D L+R
Sbjct: 812  NFEINYRPGSANHIADALSR 831


>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
            type 3
          Length = 1333

 Score =  227 bits (578), Expect = 2e-58
 Identities = 187/693 (26%), Positives = 340/693 (48%), Gaps = 70/693 (10%)

Query: 843  KSLKQIQTSQQLILEKLTEENSNGGSSSSSSSTSTSNRAPNNNVGDFLEIINYVTIQKFY 902
            K+ K+ Q   + + ++   + +N  +S +   T+ S    ++N     + +N+ T +++ 
Sbjct: 199  KTHKRFQPQNKNLGKEFLPKKNNTTNSRNLRKTNISRIEYSSN-----KFLNH-TRKRYE 252

Query: 903  INITIIIGDFILETPALFDTGADSSCISEGLI-----PTRYFEKTTEKLSGAEGSKLIIK 957
            + +   + DF    P L DTG  ++ I+E  +     PTR + K+     G   +K  I 
Sbjct: 253  MVLQAELPDFKCSIPCLIDTGTQANIITEETVRAHKLPTRPWSKSVI-YGGVYPNK--IN 309

Query: 958  YKIPSAIIKNDSLEIETSFLLVRNLTHKVIIETPFIKKLFPYNTDEKGITVQHHGQPIVF 1017
             K     I  + + I+T FL+V+  +H   I       L+  N +   I+   H    + 
Sbjct: 310  RKTIKLNISLNGISIKTEFLVVKKFSHPAAIS---FTTLYDNNIE---ISSSKHTLSQMN 363

Query: 1018 KFSKPPFVKTLNIISYKEKQINFLKEEISYKNIEVQLQQPSVKSRIENILENIQSSICFD 1077
            K S        NI+  KE ++  + +E  +K+I  +     +   I+ +           
Sbjct: 364  KVS--------NIV--KEPELPDIYKE--FKDITAETNTEKLPKPIKGL----------- 400

Query: 1078 LPNAFWERKSHMVELPYEKDFSDKQIPTKARPIQMNEELLQFFQREINDLLQKKLIRRSK 1137
                         E   E    + ++P +  P+   +  +Q    EIN  L+  +IR SK
Sbjct: 401  -------------EFEVELTQENYRLPIRNYPLPPGK--MQAMNDEINQGLKSGIIRESK 445

Query: 1138 SPWSCAAFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARLHDAKIFSKF 1197
            +  +C   +V K+     GT R+V++YKPLN+ +    YP+P  + LLA++  + IF+K 
Sbjct: 446  AINACPVMFVPKK----EGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKL 501

Query: 1198 DMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIFNP*SKFAI 1257
            D+KS +  I++++ D +K AF  P G +E+ VMP+G+  AP+ FQ  +N I     +  +
Sbjct: 502  DLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHV 561

Query: 1258 V-YIDDVLIFSQSIDQHFKHLNTFISIIKKNGMAVSKTKVSLFQTKIRFLGHNIHQGTII 1316
            V Y+D++LI S+S  +H KH+   +  +K   + +++ K    Q++++F+G++I +    
Sbjct: 562  VCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFT 621

Query: 1317 PINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRLKKDPP-PWSD 1375
            P    I+   ++  Q  ++ +L++FLG +NY+  F P+ S +  PL++ LKKD    W+ 
Sbjct: 622  PCQENIDKVLQW-KQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTP 680

Query: 1376 IHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGILKQK-IIDKEQIIAFTSK 1434
              T  ++ IK  + + P L   +     ++ETDASD+  G +L QK   DK   + + S 
Sbjct: 681  TQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSA 740

Query: 1435 HWNPAQQNYSTVKKEVLAIVLSISNFQSDLIN--QKFLVRVDCKSAKDILQKDVKNLASK 1492
              + AQ NYS   KE+LAI+ S+ +++  L +  + F +  D ++    +  + +     
Sbjct: 741  KMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESE--PEN 798

Query: 1493 HIFARWQAILSVFDFEIEYIKGSTNSLPDFLTR 1525
               ARWQ  L  F+FEI Y  GS N + D L+R
Sbjct: 799  KRLARWQLFLQDFNFEINYRPGSANHIADALSR 831


>POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat
            protein; Protease (EC 3.4.23.-); Reverse transcriptase
            (EC 2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
          Length = 1675

 Score =  224 bits (570), Expect = 2e-57
 Identities = 226/885 (25%), Positives = 404/885 (45%), Gaps = 86/885 (9%)

Query: 687  KDVTCYNCGKPGHISRYCRLKRRISELHLEPEIEDKINNLLIQTSDEEESVPSDSEVSED 746
            K   CY C    H++  C   RR +       + D ++  ++  + ++E + +  E+ E 
Sbjct: 770  KKCRCYICQDENHLANRC--PRRYTN-QARASLIDGLDEDIVSIASDDEDIENFLEIIE- 825

Query: 747  LNQIQNDDDQSSSSINVLTNEQDLIFRAIDSIPDPDEKKVYLERLKLTLEDRPPKSPITT 806
            L++      Q       +  ++D +        D + K V  +    T E +  K+   +
Sbjct: 826  LDEFIAHSSQEHEHTWEIGGKKDKVCEICSYFTDYN-KTVSCK----TCETQYCKT--CS 878

Query: 807  NKFNLKDTFKRLEKSTIKPVTIQDLQSEVHILKAEVKSLKQIQTSQQLILEKLTEENSNG 866
            ++  L+ T   ++K T +   I DL+  V  L+  V  L+     Q L  +  T +  N 
Sbjct: 879  DQLALEVT--EVKKPTKEETMIDDLKLNVKNLEFRVTILEHKVEMQNLQDKFETMQIRNK 936

Query: 867  GSSSSSSSTSTSNRAPNNNVGDFLEIINYVTIQKFYINITIIIGDFILETPALFDTGADS 926
               +   +TS + RA  +N       IN       Y+   I   +      AL D+G+  
Sbjct: 937  SEITEIPTTSLAMRANESNY--IKTSINKTA--GCYVETKISFNNENRIITALIDSGSTH 992

Query: 927  SCISEGLIPTRYFEKTTEKLS--GAEGSKLIIKYKIPSAIIKNDSLEIETSFLLVRNLTH 984
            + I   LIP  +   T  ++     + SK  +  ++   I K    E++ +F +   L  
Sbjct: 993  NIICPTLIPASWINNTHREIIMFAVDNSKYNLNQELIDDI-KLQFQEVDETFGIKYKLGQ 1051

Query: 985  KVIIETPFIKKLFPYN--TDEKG--------ITVQ--------------------HHGQP 1014
              +   P    +  +   T+E G        IT+Q                    H G+P
Sbjct: 1052 TYVAPKPTKTFIIGHRFLTNENGSVTIHKDYITIQKTTGIYPTARHELKSEFARKHGGRP 1111

Query: 1015 IVFKFSKPPFVKTLNIISYKEKQINFLKEEISYKNIEVQLQQPSVKSRI-ENILENIQSS 1073
             +F      + K  ++ SY+ + I   K EI  +++   +++      I ++I +N  + 
Sbjct: 1112 PLFSNIPETYNKIPHLHSYQPQPILGYKNEIGNQSLITMVKELEALGFIGDDITKNRTTW 1171

Query: 1074 IC-FDLPNAFWERKSHMVELPYEKDFSDKQIPTKARPIQMNEELLQFFQREINDLLQKKL 1132
            +C F + N   +       +PY    +DK++                F+++I +LL  KL
Sbjct: 1172 VCDFKIINP--DINITCATIPYTP--ADKEV----------------FEKQIKELLDNKL 1211

Query: 1133 IRRSKSPWS--CAAFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARLHD 1190
            I+++        AAF V   +E     PR+V NYK LN  +    + IP+K  ++  +  
Sbjct: 1212 IKKADPTCRHRTAAFIVRNHSEEVAQKPRIVYNYKRLNDNMHTDPFNIPHKISMINLIQK 1271

Query: 1191 AKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIFN 1250
            A IFSKFD+K+GF  ++L++  +  T FT   G Y WNV PFG+ NAP  FQR M E F 
Sbjct: 1272 ANIFSKFDLKAGFHHMKLKDDFKDWTTFTCSEGLYTWNVCPFGIANAPCAFQRFMQESFG 1331

Query: 1251 P*SKFAIVYIDDVLIFSQSIDQHFKHLNTFISIIKKNGMAVSKTKVSLFQTKIRFLGHNI 1310
               KFA++YIDD+LI S +  +H +HL  F + +K+ G  +SK K  +F  ++ +LG  I
Sbjct: 1332 D-LKFALLYIDDILIASNNEKEHIEHLKIFFNRVKEVGCVLSKKKSKMFLKEVEYLGVEI 1390

Query: 1311 HQGTIIPINRAIEFTDKFPDQIIDKTQ-LQRFLGCLNYVADFCPQLSTIIKPLHDRLKKD 1369
             +G I      ++   KF    ++  + LQ +LG LNY   +   LS ++ PL+ +  K+
Sbjct: 1391 KEGKISLQPHIVDKIKKFDKNKLNTLKGLQAYLGLLNYARGYIKDLSKLVGPLYKKTGKN 1450

Query: 1370 PPP-WSDIHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGIL-----KQKII 1423
                ++    N++ +I+  +  +  L  P    + I+ETDAS+ G+G +L     K    
Sbjct: 1451 GQRIFNKEDWNIIFKIEREVSKIKPLERPKETDYIIIETDASEEGWGAVLVCKPDKYSGK 1510

Query: 1424 DKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISNFQSDLINQKFLVRVDCKS-AKDIL 1482
            D E+I  + S ++   ++ ++++  E+ AI  +++ FQ   +++ F +R DC++  K I 
Sbjct: 1511 DTEKIAGYASGNFG-EKKTWTSLDYEIEAINEALNKFQI-YLDKDFTIRTDCEAIVKGIK 1568

Query: 1483 QKDVKNLA-SKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTRE 1526
             +D K  + ++ I  R   +   +    E+IKG+ N LP+FL+RE
Sbjct: 1569 TEDYKKRSKTRWIKLRDNLLKDGYKPTFEHIKGNKNFLPNFLSRE 1613


>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
            transposon 17.6 [Contains: Protease (EC 3.4.23.-);
            Reverse transcriptase (EC 2.7.7.49); Endonuclease]
          Length = 1058

 Score =  219 bits (558), Expect = 5e-56
 Identities = 138/412 (33%), Positives = 221/412 (53%), Gaps = 13/412 (3%)

Query: 1118 QFFQREINDLLQKKLIRRSKSPWSCAAFYVNKQAEIE-RGTPRLVINYKPLNQALCWIRY 1176
            Q  + +I D+L + +IR S SP++   + V K+ +   +   R+VI+Y+ LN+     R+
Sbjct: 221  QEVESQIQDMLNQGIIRTSNSPYNSPIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDRH 280

Query: 1177 PIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKN 1236
            PIPN  ++L +L     F+  D+  GF QI++  +   KTAF+   G YE+  MPFGLKN
Sbjct: 281  PIPNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKN 340

Query: 1237 APSEFQRIMNEIFNP-*SKFAIVYIDDVLIFSQSIDQHFKHLNTFISIIKKNGMAVSKTK 1295
            AP+ FQR MN+I  P  +K  +VY+DD+++FS S+D+H + L      + K  + +   K
Sbjct: 341  APATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDK 400

Query: 1296 VSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQL 1355
                + +  FLGH +    I P    IE   K+P     K +++ FLG   Y   F P  
Sbjct: 401  CEFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPK-EIKAFLGLTGYYRKFIPNF 459

Query: 1356 STIIKPLHDRLKKDP--PPWSDIHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIG 1413
            + I KP+   LKK+      +  + +  K++K  I   P L +P+      + TDASD+ 
Sbjct: 460  ADIAKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLTTDASDVA 519

Query: 1414 FGGILKQKIIDKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISNFQSDLINQKFLVRV 1473
             G +L Q        +++ S+  N  + NYST++KE+LAIV +   F+  L+ + F +  
Sbjct: 520  LGAVLSQ----DGHPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGRHFEISS 575

Query: 1474 DCKSAKDILQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTR 1525
            D +    + +  +K+  SK    RW+  LS FDF+I+YIKG  N + D L+R
Sbjct: 576  DHQPLSWLYR--MKDPNSK--LTRWRVKLSEFDFDIKYIKGKENCVADALSR 623


>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
            transposon 297 [Contains: Protease (EC 3.4.23.-); Reverse
            transcriptase (EC 2.7.7.49); Endonuclease]
          Length = 1059

 Score =  213 bits (542), Expect = 3e-54
 Identities = 136/427 (31%), Positives = 226/427 (52%), Gaps = 15/427 (3%)

Query: 1103 IPTKARPIQMNEELLQFFQREINDLLQKKLIRRSKSPWSCAAFYVNKQAEIERGTP-RLV 1161
            I +K  P+    E+    + ++ ++L + LIR S SP++   + V K+ +       R+V
Sbjct: 207  IYSKQYPLAQTHEIE--VENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVV 264

Query: 1162 INYKPLNQALCWIRYPIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVP 1221
            I+Y+ LN+     RYPIPN  ++L +L   + F+  D+  GF QI++ E+   KTAF+  
Sbjct: 265  IDYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTK 324

Query: 1222 FGQYEWNVMPFGLKNAPSEFQRIMNEIFNP-*SKFAIVYIDDVLIFSQSIDQHFKHLNTF 1280
             G YE+  MPFGL+NAP+ FQR MN I  P  +K  +VY+DD++IFS S+ +H   +   
Sbjct: 325  SGHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLV 384

Query: 1281 ISIIKKNGMAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQR 1340
             + +    + +   K    + +  FLGH +    I P    ++    +P    DK +++ 
Sbjct: 385  FTKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDK-EIRA 443

Query: 1341 FLGCLNYVADFCPQLSTIIKPLHDRLKKDPPPWSD--IHTNVVKQIKLRIKNLPCLYLPN 1398
            FLG   Y   F P  + I KP+   LKK     +    +    +++K  I   P L LP+
Sbjct: 444  FLGLTGYYRKFIPNYADIAKPMTSCLKKRTKIDTQKLEYIEAFEKLKALIIRDPILQLPD 503

Query: 1399 PQAFKIVETDASDIGFGGILKQKIIDKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSIS 1458
             +   ++ TDAS++  G +L Q        I+F S+  N  + NYS ++KE+LAIV +  
Sbjct: 504  FEKKFVLTTDASNLALGAVLSQ----NGHPISFISRTLNDHELNYSAIEKELLAIVWATK 559

Query: 1459 NFQSDLINQKFLVRVDCKSAKDILQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNS 1518
             F+  L+ ++FL+  D +  + +   ++K   +K    RW+  LS + F+I+YIKG  NS
Sbjct: 560  TFRHYLLGRQFLIASDHQPLRWL--HNLKEPGAK--LERWRVRLSEYQFKIDYIKGKENS 615

Query: 1519 LPDFLTR 1525
            + D L+R
Sbjct: 616  VADALSR 622


>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
          Length = 2186

 Score =  185 bits (469), Expect = 9e-46
 Identities = 135/434 (31%), Positives = 218/434 (50%), Gaps = 24/434 (5%)

Query: 1103 IPTKARPIQMNEELLQFFQREINDLLQKKLIRRSKSPWSCAAFYVNKQAEIERGTPRLVI 1162
            I  K RPI +   L    ++ I  +L +K+IR SKSPWS     V K+     G+ R+ I
Sbjct: 943  IRQKPRPIPL--ALKPEIRKMIQKMLNQKVIRESKSPWSSPVVLVKKKD----GSIRMCI 996

Query: 1163 NYKPLNQALCWIRYPIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPF 1222
            +Y+ +N+ +    +P+PN +  L  L   K+++ FDM +GFWQI L EK +  TAF +  
Sbjct: 997  DYRKVNKVVKNNAHPLPNIEATLQSLAGKKLYTVFDMIAGFWQIPLDEKSKEITAFAIGS 1056

Query: 1223 GQYEWNVMPFGLKNAPSEFQRIMNEIFNP-*SKFAIVYIDDVLIFSQSIDQHFKHLNTFI 1281
              +EWNV+PFGL  +P+ FQ  M EI        A VY+DD+LI S+ ++QH + +   +
Sbjct: 1057 ELFEWNVLPFGLVISPALFQGTMEEIIGDLLGVCAFVYVDDLLIASKDMEQHLQDVKEAL 1116

Query: 1282 SIIKKNGMAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFP--DQIIDKTQLQ 1339
            + I+K+GM +  +K  + + ++ +LGH +   T+  +      TDK     +  +  +LQ
Sbjct: 1117 TRIRKSGMKLRASKCHIAKKEVEYLGHKV---TLDGVETQEVKTDKMKQFSRPTNVKELQ 1173

Query: 1340 RFLGCLNYVADFCPQLSTIIKPLHDRLK-KDPPPWSDIHTNVVKQIKLRIKNLPCLYLPN 1398
             FLG + Y   F    + I   L   +  K    W        +++K  +   P L  P+
Sbjct: 1174 SFLGLVGYYRKFILNFAQIASSLTSLISAKVAWIWEKEQEIAFQELKKLVCQTPVLAQPD 1233

Query: 1399 PQAFK------IVETDASDIGFGGILKQKIIDKEQ-IIAFTSKHWNPAQQNYSTVKKEVL 1451
             +A        ++ TDAS  G G +L Q+  D +Q  IAF SK  +PA+  Y     E L
Sbjct: 1234 VEAALKGDRPFMIYTDASRKGIGAVLAQEGPDGQQHPIAFASKALSPAETRYHITDLEAL 1293

Query: 1452 AIVLSISNFQSDLINQKFLVRVDCKSAKDILQKDVKNLASKHIFARWQAILSVFDFEIEY 1511
            A++ ++  F++ +      V  D K    +L+     LA +    RW   +  FD +I Y
Sbjct: 1294 AMMFALRRFKTIIYGTAITVFTDHKPLISLLKG--SPLADR--LWRWSIEILEFDVKIVY 1349

Query: 1512 IKGSTNSLPDFLTR 1525
            + G  N++ D L+R
Sbjct: 1350 LAGKANAVADALSR 1363


>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
            transposon opus [Contains: Protease (EC 3.4.23.-);
            Reverse transcriptase (EC 2.7.7.49); Endonuclease]
          Length = 1003

 Score =  184 bits (467), Expect = 2e-45
 Identities = 140/478 (29%), Positives = 239/478 (49%), Gaps = 28/478 (5%)

Query: 1068 ENIQSSICFDLPNAFWERKSHM-VELPYEKDF-SDKQIPTKAR----PIQMNEELLQFFQ 1121
            + I +S+  + P  F    S M VE   + +  ++ Q P  A+    P+ M  E+    +
Sbjct: 85   QEILNSLLGEFPRIFEPPLSGMSVETAVKAEIRTNTQDPIYAKSYPYPVNMRGEV----E 140

Query: 1122 REINDLLQKKLIRRSKSPWSCAAFYVNKQAEIERGTP-RLVINYKPLNQALCWIRYPIPN 1180
            R+I++LLQ  +IR S SP++   + V K+ +       R+V+++K LN       YPIP+
Sbjct: 141  RQIDELLQDGIIRPSNSPYNSPIWIVPKKPKPNGEKQYRMVVDFKRLNTVTIPDTYPIPD 200

Query: 1181 KKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSE 1240
                LA L +AK F+  D+ SGF QI ++E D  KTAF+   G+YE+  +PFGLKNAP+ 
Sbjct: 201  INATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPFGLKNAPAI 260

Query: 1241 FQRIMNEIFNP*-SKFAIVYIDDVLIFSQSIDQHFKHLNTFISIIKKNGMAVSKTKVSLF 1299
            FQR++++I      K   VYIDD+++FS+  D H+K+L   ++ + K  + V+  K    
Sbjct: 261  FQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNLEKSHFL 320

Query: 1300 QTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTII 1359
             T++ FLG+ +    I    + +    + P     K +L+RFLG  +Y   F    + + 
Sbjct: 321  DTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVK-ELKRFLGMTSYYRKFIQDYAKVA 379

Query: 1360 KPLHDRLK------------KDPPPWSDIHTNVVKQIKLRIKNLPCLYLPNPQAFKIVET 1407
            KPL +  +            K P    +        +K  + +   L  P       + T
Sbjct: 380  KPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKPFHLTT 439

Query: 1408 DASDIGFGGILKQKIIDKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISNFQSDLINQ 1467
            DAS+   G +L Q    +++ IA+ S+  N  ++NY+T++KE+LAI+ S+ N ++ L   
Sbjct: 440  DASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLDNLRAYLYGA 499

Query: 1468 KFLVRVDCKSAKDILQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTR 1525
               ++V             +N  +K    RW+A +  ++ E+ Y  G +N + D L+R
Sbjct: 500  G-TIKVYTDHQPLTFALGNRNFNAK--LKRWKARIEEYNCELIYKPGKSNVVADALSR 554


>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
            transposon gypsy [Contains: Reverse transcriptase (EC
            2.7.7.49); Endonuclease]
          Length = 1035

 Score =  180 bits (456), Expect = 3e-44
 Identities = 151/589 (25%), Positives = 280/589 (46%), Gaps = 58/589 (9%)

Query: 979  VRNLTHKVIIETPFIKKLFPYNTDEKGITVQHHGQPIVFKFSKPPFVKT----------L 1028
            V+ L + + + +PF       +T+     ++H     VFK   P F+            L
Sbjct: 40   VKELKNVMPVASPFSVSSIHGSTE-----IKHKCLMKVFKHISPFFLLDSLNAFDAIIGL 94

Query: 1029 NIISYKEKQINFLKEEISYKNIEVQLQQ---PSVKSRIENILENIQSSICFDLPNAFWER 1085
            ++++    ++N  ++ + Y+ I  +L     PSV     N +  +  S+  +  +    R
Sbjct: 95   DLLTQAGVKLNLAEDSLEYQGIAEKLHYFSCPSVNFTDVNDIV-VPDSVKKEFKDTIIRR 153

Query: 1086 KSHMVE----LPYE-------KDFSDKQIPTKARPIQMNEELLQFFQREINDLLQKKLIR 1134
            K         LP+        +   ++ + ++A P  M   +  F   E+  LL+  +IR
Sbjct: 154  KKAFSTTNEALPFNTAVTATIRTVDNEPVYSRAYPTLMG--VSDFVNNEVKQLLKDGIIR 211

Query: 1135 RSKSPWSCAAFYVNKQAEIERGTP--RLVINYKPLNQALCWIRYPIPNKKDLLARLHDAK 1192
             S+SP++   + V+K+     G P  RLVI+++ LN+     RYP+P+   +LA L  AK
Sbjct: 212  PSRSPYNSPTWVVDKKGTDAFGNPNKRLVIDFRKLNEKTIPDRYPMPSIPMILANLGKAK 271

Query: 1193 IFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIF-NP 1251
             F+  D+KSG+ QI L E DR KT+F+V  G+YE+  +PFGL+NA S FQR ++++    
Sbjct: 272  FFTTLDLKSGYHQIYLAEHDREKTSFSVNGGKYEFCRLPFGLRNASSIFQRALDDVLREQ 331

Query: 1252 *SKFAIVYIDDVLIFSQSIDQHFKHLNTFISIIKKNGMAVSKTKVSLFQTKIRFLGHNIH 1311
              K   VY+DDV+IFS++   H +H++T +  +    M VS+ K   F+  + +LG  + 
Sbjct: 332  IGKICYVYVDDVIIFSENESDHVRHIDTVLKCLIDANMRVSQEKTRFFKESVEYLGFIVS 391

Query: 1312 QGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRL----- 1366
            +         ++   ++P+      +++ FLG  +Y   F    + I +P+ D L     
Sbjct: 392  KDGTKSDPEKVKAIQEYPEPDC-VYKVRSFLGLASYYRVFIKDFAAIARPITDILKGENG 450

Query: 1367 -------KKDPPPWSDIHTNVVKQIK--LRIKNLPCLYLPNPQAFKIVETDASDIGFGGI 1417
                   KK P  +++   N  ++++  L  +++   Y    + F +  TDAS  G G +
Sbjct: 451  SVSKHMSKKIPVEFNETQRNAFQRLRNILASEDVILKYPDFKKPFDLT-TDASASGIGAV 509

Query: 1418 LKQKIIDKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISNFQSDLINQKFLVRVDCKS 1477
            L Q    + + I   S+     +QNY+T ++E+LAIV ++   Q+ L   +    ++  +
Sbjct: 510  LSQ----EGRPITMISRTLKQPEQNYATNERELLAIVWALGKLQNFLYGSR---EINIFT 562

Query: 1478 AKDILQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTRE 1526
                L   V +  +     RW++ +   + ++ Y  G  N + D L+R+
Sbjct: 563  DHQPLTFAVADRNTNAKIKRWKSYIDQHNAKVFYKPGKENFVADALSRQ 611


>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
            transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
            transcriptase (EC 2.7.7.49); Endonuclease]
          Length = 1237

 Score =  162 bits (411), Expect = 5e-39
 Identities = 146/586 (24%), Positives = 261/586 (43%), Gaps = 32/586 (5%)

Query: 959  KIPSAIIKNDSLEIETSFLLVRN-LTHKVIIETPFIKKLFPYNTDEKGITVQHHGQPIVF 1017
            K P  I    S  I T+ L  R+ +  ++I+ +     L P    + GI V +       
Sbjct: 162  KFPIYIPIAYSSGINTTLLPARSQVVRRLIVSSKDDNILIPNQEIQTGIYVAN-----TI 216

Query: 1018 KFSKPPFVKTLNIISYKE------------KQINFLKEEISYKNIEVQLQQPSVKSRIEN 1065
              S   FV+ LN     +               N ++    ++N  V  Q   +K     
Sbjct: 217  ATSSNTFVRILNTTDSDQLVNMDTLKYEPLSNYNVVQANSEHRNKTVLSQ---LKKNFPE 273

Query: 1066 ILENIQSSICFDLPNAF-WERKSHMVELPYEKDFSDKQI-PTKARPIQMNEELLQFFQRE 1123
            + ++   +IC +  + F  E +   V   Y++    K   P   +  +     ++  Q +
Sbjct: 274  LFKSQLENICSEYIDIFALESEPITVNNLYKQQLRLKDDEPVYTKNYRSPHSQVEEIQAQ 333

Query: 1124 INDLLQKKLIRRSKSPWSCAAFYVNKQAE--IERGTPRLVINYKPLNQALCWIRYPIPNK 1181
            +  L++ K++  S S ++     V K++    ++   RLVI+Y+ +N+ L   ++P+P  
Sbjct: 334  VQKLIKDKIVEPSVSQYNSPLLLVPKKSSPNSDKKKWRLVIDYRQINKKLLADKFPLPRI 393

Query: 1182 KDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEF 1241
             D+L +L  AK FS  D+ SGF QI+L E  R  T+F+   G Y +  +PFGLK AP+ F
Sbjct: 394  DDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTSNGSYRFTRLPFGLKIAPNSF 453

Query: 1242 QRIMNEIFNP*S-KFAIVYIDDVLIFSQSIDQHFKHLNTFISIIKKNGMAVSKTKVSLFQ 1300
            QR+M   F+      A +Y+DD+++   S     K+L       ++  + +   K S F 
Sbjct: 454  QRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEVFGKCREYNLKLHPEKCSFFM 513

Query: 1301 TKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIK 1360
             ++ FLGH      I+P ++  +    +P    D    +RF+   NY   F    +   +
Sbjct: 514  HEVTFLGHKCTDKGILPDDKKYDVIQNYPVP-HDADSARRFVAFCNYYRRFIKNFADYSR 572

Query: 1361 PLHDRLKKDPP-PWSDIHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGILK 1419
             +    KK+ P  W+D        +K ++ N   L  P+      + TDAS    G +L 
Sbjct: 573  HITRLCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSKEFCITTDASKQACGAVLT 632

Query: 1420 QKIIDKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISNFQSDLINQKFLVRVDCKSAK 1479
            Q     +  +A+ S+ +   + N ST ++E+ AI  +I +F+  +  + F V+ D +   
Sbjct: 633  QNHNGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYIYGKHFTVKTDHRPLT 692

Query: 1480 DILQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTR 1525
             +    + N +SK    R +  L  ++F +EY+KG  N + D L+R
Sbjct: 693  YLF--SMVNPSSK--LTRIRLELEEYNFTVEYLKGKDNHVADALSR 734


>POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
            Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
            3.1.26.4) (RT); Integrase (IN)]
          Length = 1161

 Score =  103 bits (257), Expect = 4e-21
 Identities = 83/320 (25%), Positives = 144/320 (44%), Gaps = 31/320 (9%)

Query: 1104 PTKARPIQMNEELLQFFQREINDLLQKKLIRRSKSPWSCAAFYVNKQAEIERGTPRLVIN 1163
            P K  PI  N +     Q  I+DLL++ ++ +  S  +   + V K      G  R+V++
Sbjct: 176  PQKQYPI--NPKAKPSIQIVIDDLLKQGVLIQQNSTMNTPVYPVPKPD----GKWRMVLD 229

Query: 1164 YKPLNQALCWIRYPIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFG 1223
            Y+ +N+ +  I     +   +L+ ++  K  +  D+ +GFW   +  +  + TAFT    
Sbjct: 230  YREVNKTIPLIAAQNQHSAGILSSIYRGKYKTTLDLTNGFWAHPITPESYWLTAFTWQGK 289

Query: 1224 QYEWNVMPFGLKNAPSEFQR----IMNEIFNP*SKFAIVYIDDVLIFSQSIDQHFKHLNT 1279
            QY W  +P G  N+P+ F      ++ EI N        Y+DD+ I      +H + L  
Sbjct: 290  QYCWTRLPQGFLNSPALFTADVVDLLKEIPN-----VQAYVDDIYISHDDPQEHLEQLEK 344

Query: 1280 FISIIKKNGMAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQII------ 1333
              SI+   G  VS  K  + Q ++ FLG NI              TD F  +++      
Sbjct: 345  IFSILLNAGYVVSLKKSEIAQREVEFLGFNI-------TKEGRGLTDTFKQKLLNITPPK 397

Query: 1334 DKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRLKKDPP---PWSDIHTNVVKQIKLRIKN 1390
            D  QLQ  LG LN+  +F P  S ++KPL+  +         W++ ++N ++ I   +  
Sbjct: 398  DLKQLQSILGLLNFARNFIPNYSELVKPLYTIVANANGKFISWTEDNSNQLQHIISVLNQ 457

Query: 1391 LPCLYLPNPQAFKIVETDAS 1410
               L   NP+   I++ ++S
Sbjct: 458  ADNLEERNPETRLIIKVNSS 477


  Database: sprot
    Posted date:  Nov 25, 2004 10:54 AM
  Number of letters in database: 59,974,054
  Number of sequences in database:  164,201
  
Lambda     K      H
   0.316    0.134    0.389 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 183,035,463
Number of Sequences: 164201
Number of extensions: 8287028
Number of successful extensions: 34081
Number of sequences better than 10.0: 342
Number of HSP's better than 10.0 without gapping: 96
Number of HSP's successfully gapped in prelim test: 251
Number of HSP's that attempted gapping in prelim test: 33042
Number of HSP's gapped (non-prelim): 951
length of query: 1526
length of database: 59,974,054
effective HSP length: 123
effective length of query: 1403
effective length of database: 39,777,331
effective search space: 55807595393
effective search space used: 55807595393
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 73 (32.7 bits)

