Lotus
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= TM0019a.7
         (1703 letters)

Database: sprot 
           164,201 sequences; 59,974,054 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro...   295  6e-79
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro...   293  2e-78
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro...   293  4e-78
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot...   291  8e-78
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot...   290  2e-77
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro...   288  1e-76
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro...   287  2e-76
POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;...   235  7e-61
POL_SOCMV (P15629) Enzymatic polyprotein [Contains: Aspartic pro...   227  2e-58
POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat pr...   210  2e-53
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei...   201  1e-50
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei...   199  4e-50
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei...   197  3e-49
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran...   191  1e-47
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran...   189  7e-47
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran...   172  7e-42
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran...   170  4e-41
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III    168  1e-40
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran...   138  1e-31
POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23...    97  4e-19

>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 679

 Score =  295 bits (756), Expect = 6e-79
 Identities = 205/600 (34%), Positives = 311/600 (51%), Gaps = 55/600 (9%)

Query: 1131 LEIPALFDTGADSSCISEGLIPTRYFEKTTEKLSA--AEGSRLKIK-------------- 1174
            +E+    DTGA     S+ +IP  ++      +    A+GS + I               
Sbjct: 38   IELHCFVDTGASLCIASKFVIPEEHWVNAERPIMVKIADGSSITISKVCKDIDLIIAREI 97

Query: 1175 YKIPSAIIKNGSLEIETPFLLVRNLSQKIIIGTPFIK----------KLFPYNTDENGIT 1224
            +KIP+   +   ++    F++  N  Q   +  PFI+          K +P +  +    
Sbjct: 98   FKIPTVYQQESGID----FIIGNNFCQ---LYEPFIQFTDRVIFTKNKSYPVHIAKLTRA 150

Query: 1225 VQHLGQPIL------FKFSEP-PIDKTLNVISYKEKQINFLKEEISYRTIEEQLQQPSVK 1277
            V+   +  L       K  +P P++ + N I    K+I  L E    R  EE+L     +
Sbjct: 151  VRVGTEGFLESMKKRSKTQQPEPVNISTNKIENPLKEIAILSE--GRRLSEEKLFITQQR 208

Query: 1278 -SRIEDILENIQSSICSDLPNAFWERKSHMVELPYEKDFSDKQISTKARPIQMNEELLHF 1336
              +IE++LE + S    D PN   +     ++L      SD   + K +P++ +      
Sbjct: 209  MQKIEELLEKVCSENPLD-PNKTKQWMKASIKL------SDPSKAIKVKPMKYSPMDREE 261

Query: 1337 CQKEINDLLEKKLIRRSKSPWSCAAFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIP 1396
              K+I +LL+ K+I+ SKSP    AF VN +AE  RG  R+V+NYK +N+A     Y +P
Sbjct: 262  FDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATIGDAYNLP 321

Query: 1397 NKKDLLARQHDAKVFSKFDMKSGFWQIQLQEKDKYKTAFTVPFGQYEWNVMPFGLKNAPS 1456
            NK +LL      K+FS FD KSGFWQ+ L ++ +  TAFT P G YEWNV+PFGLK APS
Sbjct: 322  NKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPS 381

Query: 1457 EFQRIMNEIFNPYSKLTIVYIDDVLIFSQTLDQHFKHLNTFISVIKRNGLAVSKTKVSLF 1516
             FQR M+E F  + K   VY+DD+L+FS   + H  H+   +    ++G+ +SK K  LF
Sbjct: 382  IFQRHMDEAFRVFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLF 441

Query: 1517 QTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTII 1576
            + KI FLG  I +GT  P    +E  +KFPD + DK QLQRFLG L Y +D+ P+L+ I 
Sbjct: 442  KKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIR 501

Query: 1577 KPLHDRLKKDPP-P*SDIHTNVVKQIKLRVKNLPCLYLPNPQAFKIIETDASDIGFGGIL 1635
            KPL  +LK++ P   +   T  ++++K  ++  P L+ P P+   IIETDASD  +GG+L
Sbjct: 502  KPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGML 561

Query: 1636 K----QKVFDKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQSDLINQKFLVCVD 1691
            K     +  + E I  + S  +  A++NY +  KE LA++ +I KF   L    FL+  D
Sbjct: 562  KAIKINEGTNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTD 621


>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 679

 Score =  293 bits (751), Expect = 2e-78
 Identities = 204/600 (34%), Positives = 311/600 (51%), Gaps = 55/600 (9%)

Query: 1131 LEIPALFDTGADSSCISEGLIPTRYFEKTTEKLSA--AEGSRLKIK-------------- 1174
            +E+    DTGA     S+ +IP  ++      +    A+GS + I               
Sbjct: 38   IELHCFVDTGASLCIASKFVIPEEHWVNAERPIMVKIADGSSITISKVCKDIDLIIAGEI 97

Query: 1175 YKIPSAIIKNGSLEIETPFLLVRNLSQKIIIGTPFIK----------KLFPYNTDENGIT 1224
            +KIP+   +   ++    F++  N  Q   +  PFI+          K +P +  +    
Sbjct: 98   FKIPTVYQQESGID----FIIGNNFCQ---LYEPFIQFTDRVIFTKNKSYPVHITKLTRA 150

Query: 1225 VQHLGQPIL------FKFSEP-PIDKTLNVISYKEKQINFLKEEISYRTIEEQLQQPSVK 1277
            V+   +  L       K  +P P++ + N I    ++I  L E    R  EE+L     +
Sbjct: 151  VRVGIEGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSE--GRRLSEEKLFITQQR 208

Query: 1278 -SRIEDILENIQSSICSDLPNAFWERKSHMVELPYEKDFSDKQISTKARPIQMNEELLHF 1336
              +IE++LE + S    D PN   +     ++L      SD   + K +P++ +      
Sbjct: 209  MQKIEELLEKVCSENPLD-PNKTKQWMKASIKL------SDPSKAIKVKPMKYSPMDREE 261

Query: 1337 CQKEINDLLEKKLIRRSKSPWSCAAFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIP 1396
              K+I +LL+ K+I+ SKSP    AF VN +AE  RG  R+V+NYK +N+A     Y +P
Sbjct: 262  FDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATIGDAYNLP 321

Query: 1397 NKKDLLARQHDAKVFSKFDMKSGFWQIQLQEKDKYKTAFTVPFGQYEWNVMPFGLKNAPS 1456
            NK +LL      K+FS FD KSGFWQ+ L ++ +  TAFT P G YEWNV+PFGLK APS
Sbjct: 322  NKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPS 381

Query: 1457 EFQRIMNEIFNPYSKLTIVYIDDVLIFSQTLDQHFKHLNTFISVIKRNGLAVSKTKVSLF 1516
             FQR M+E F  + K   VY+DD+L+FS   + H  H+   +    ++G+ +SK K  LF
Sbjct: 382  IFQRHMDEAFRVFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLF 441

Query: 1517 QTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTII 1576
            + KI FLG  I +GT  P    +E  +KFPD + DK QLQRFLG L Y +D+ P+L+ I 
Sbjct: 442  KKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIR 501

Query: 1577 KPLHDRLKKDPP-P*SDIHTNVVKQIKLRVKNLPCLYLPNPQAFKIIETDASDIGFGGIL 1635
            KPL  +LK++ P   +   T  ++++K  ++  P L+ P P+   IIETDASD  +GG+L
Sbjct: 502  KPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGML 561

Query: 1636 K----QKVFDKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQSDLINQKFLVCVD 1691
            K     +  + E I  + S  +  A++NY +  KE LA++ +I KF   L    FL+  D
Sbjct: 562  KAIKINEGTNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTD 621


>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 679

 Score =  293 bits (749), Expect = 4e-78
 Identities = 203/598 (33%), Positives = 307/598 (50%), Gaps = 51/598 (8%)

Query: 1131 LEIPALFDTGADSSCISEGLIPTRYFEKTTEKLSA--AEGSRLKIKYKIPSAIIKNGSLE 1188
            +E+    DTGA     S+ +IP  ++      +    A+GS + I     S + K+  L 
Sbjct: 38   IELHCFVDTGASLCIASKFVIPEEHWVNAERPIMVKIADGSSITI-----SKVCKDIDLI 92

Query: 1189 IETPFLLVRNLSQK-----IIIGTPFIKKLFPY-NTDENGITVQHLGQPILF-------- 1234
            I      +  + Q+      IIG  F +   P+    +  I  ++   P+          
Sbjct: 93   IAGEIFRIPTVYQQESGIDFIIGNNFCQLYEPFIQFTDRVIFTKNKSYPVHIAKLTRAVR 152

Query: 1235 --------------KFSEP-PIDKTLNVISYKEKQINFLKEEISYRTIEEQLQQPSVK-S 1278
                          K  +P P++ + N I    ++I  L E    R  EE+L     +  
Sbjct: 153  VGTEGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSE--GRRLSEEKLFITQQRMQ 210

Query: 1279 RIEDILENIQSSICSDLPNAFWERKSHMVELPYEKDFSDKQISTKARPIQMNEELLHFCQ 1338
            +IE++LE + S    D PN   +     ++L      SD   + K +P++ +        
Sbjct: 211  KIEELLEKVCSENPLD-PNKTKQWMKASIKL------SDPSKAIKVKPMKYSPMDREEFD 263

Query: 1339 KEINDLLEKKLIRRSKSPWSCAAFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNK 1398
            K+I +LL+ K+I+ SKSP    AF VN +AE  RG  R+V+NYK +N+A     Y +PNK
Sbjct: 264  KQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNLPNK 323

Query: 1399 KDLLARQHDAKVFSKFDMKSGFWQIQLQEKDKYKTAFTVPFGQYEWNVMPFGLKNAPSEF 1458
             +LL      K+FS FD KSGFWQ+ L ++ +  TAFT P G YEWNV+PFGLK APS F
Sbjct: 324  DELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIF 383

Query: 1459 QRIMNEIFNPYSKLTIVYIDDVLIFSQTLDQHFKHLNTFISVIKRNGLAVSKTKVSLFQT 1518
            QR M+E F  + K   VY+DD+L+FS   + H  H+   +    ++G+ +SK K  LF+ 
Sbjct: 384  QRHMDEAFRVFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKK 443

Query: 1519 KIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKP 1578
            KI FLG  I +GT  P    +E  +KFPD + DK QLQRFLG L Y +D+ P+L+ I KP
Sbjct: 444  KINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKP 503

Query: 1579 LHDRLKKDPP-P*SDIHTNVVKQIKLRVKNLPCLYLPNPQAFKIIETDASDIGFGGILK- 1636
            L  +LK++ P   +   T  ++++K  ++  P L+ P P+   IIETDASD  +GG+LK 
Sbjct: 504  LQAKLKENVPWRWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKA 563

Query: 1637 ---QKVFDKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQSDLINQKFLVCVD 1691
                +  + E I  + S  +  A++NY +  KE LA++ +I KF   L    FL+  D
Sbjct: 564  IKINEGTNTELICRYASGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTD 621


>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 666

 Score =  291 bits (746), Expect = 8e-78
 Identities = 193/575 (33%), Positives = 292/575 (50%), Gaps = 32/575 (5%)

Query: 1138 DTGADSSCISEGLIPTRYFEKTTEKLSA--AEGSRLKIKYKIPSAIIKNGSLEIETPFLL 1195
            DTGA     S  +IP   +E + + +    A    +KI     +  +K      E P + 
Sbjct: 54   DTGASLCIASRYIIPEELWENSPKDIQVKIANQELIKITKVCKNLKVKFAGKSFEIPTVY 113

Query: 1196 VRNLSQKIIIGTPFIKKLFPYNTDENGITVQHLGQPIL-------FKFSEPPIDKTLNVI 1248
             +      +IG  F +   P+   E+ I      + +L       F  S P   + +   
Sbjct: 114  QQETGIDFLIGNNFCRLYNPFIQWEDRIAFHLKNEMVLIKKVTKAFSVSNPSFLENMKKD 173

Query: 1249 SYKEKQI-------NFLKEEISYRTIEEQLQQPSVKSRIEDILENIQSSICSDLPNAFWE 1301
            S K +QI       N +  E  Y  I E+ Q+          +E +   +CS+ P    +
Sbjct: 174  S-KTEQIPGTNISKNIINPEERYFLITEKYQK----------IEQLLDKVCSENPIDPIK 222

Query: 1302 RKSHMVELPYEKDFSDKQISTKARPIQMNEELLHFCQKEINDLLEKKLIRRSKSPWSCAA 1361
             K  M          D     + +P+  + +      K+I +LL+  LI  SKS     A
Sbjct: 223  SKQWMKA---SIKLIDPLKVIRVKPMSYSPQDREGFAKQIKELLDLGLIIPSKSQHMSPA 279

Query: 1362 FYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARQHDAKVFSKFDMKSGFW 1421
            F V  +AE  RG  R+V+NYK +NQA     + +PN ++LL       +FS FD KSGFW
Sbjct: 280  FLVENEAERRRGKKRMVVNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKSGFW 339

Query: 1422 QIQLQEKDKYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIFNPYSKLTIVYIDDVL 1481
            Q+ L E+ +  TAFT P G ++W V+PFGLK APS FQR M    N   K  +VY+DD++
Sbjct: 340  QVVLDEESQKLTAFTCPQGHFQWKVVPFGLKQAPSIFQRHMQTALNGADKFCMVYVDDII 399

Query: 1482 IFSQTLDQHFKHLNTFISVIKRNGLAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEF 1541
            +FS +   H+ H+   + ++++ G+ +SK K +LF+ KI FLG  I +GT  P N  +E 
Sbjct: 400  VFSNSELDHYNHVYAVLKIVEKYGIILSKKKANLFKEKINFLGLEIDKGTHCPQNHILEN 459

Query: 1542 TDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRLKKDPP-P*SDIHTNVVKQ 1600
              KFPD++ DK  LQRFLG L Y   + P+L+ I KPL  +LKKD     +   ++ VK+
Sbjct: 460  IHKFPDRLEDKKHLQRFLGVLTYAETYIPKLAEIRKPLQVKLKKDVTWNWTQSDSDYVKK 519

Query: 1601 IKLRVKNLPCLYLPNPQAFKIIETDASDIGFGGILKQKVFDKEQIIA-FTSKHWNPAQQN 1659
            IK  + + P LYLP P+   IIETDASD  +GG+LK +  D  ++I  ++S  +  A++N
Sbjct: 520  IKKNLGSFPKLYLPKPEDHLIIETDASDSFWGGVLKARALDGVELICRYSSGSFKQAEKN 579

Query: 1660 YSTVKKEVLAIVLSISKFQSDLINQKFLVCVDCKS 1694
            Y +  KE+LA+   I+KF + L   +F V  D K+
Sbjct: 580  YHSNDKELLAVKQVITKFSAYLTPVRFTVRTDNKN 614


>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 659

 Score =  290 bits (743), Expect = 2e-77
 Identities = 199/591 (33%), Positives = 305/591 (50%), Gaps = 40/591 (6%)

Query: 1131 LEIPALFDTGADSSCISEGLIPTRYFEKTTEK---LSAAEGSRLKIKYKIPSAIIKNGSL 1187
            L++    DTG+     S+ +IP  Y++ T EK   +  A G  +++        I+ G  
Sbjct: 27   LDLHCYVDTGSSLCMASKYVIPEEYWQ-TAEKPLNIKIANGKIIQLTKVCSKLPIRLGGE 85

Query: 1188 EIETPFLLVRNLSQKIIIGTPFIKKLFPYNTDENGITVQHLGQPILF------------- 1234
                P L  +     +++G  F +   P+    + I      Q ++              
Sbjct: 86   RFLIPTLFQQESGIDLLLGNNFCQLYSPFIQYTDRIYFHLNKQSVIIGKITKAYQYGVKG 145

Query: 1235 ------KFSEPPIDKTLNVISYKEKQINFLKEEISYRTIEEQLQQPSVK--SRIEDILEN 1286
                  K S+    + +N+ S    Q  FL+E  ++  ++E L +  +   S IE++LE 
Sbjct: 146  FLESMKKKSKVNRPEPINITS---NQHLFLEEGGNH--VDEMLYEIQISKFSAIEEMLER 200

Query: 1287 IQSSICSDLPNAFWERKSHMVELPYEKDFSDKQISTKARPIQMNEELLHFCQKEINDLLE 1346
            + S    D P    +  +  +EL       D +   K +P+  +        ++I +LLE
Sbjct: 201  VSSENPID-PEKSKQWMTATIEL------IDPKTVVKVKPMSYSPSDREEFDRQIKELLE 253

Query: 1347 KKLIRRSKSPWSCAAFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARQH 1406
             K+I+ SKS     AF V  +AE  RG  R+V+NYK +N+A     + +PNK +LL    
Sbjct: 254  LKVIKPSKSTHMSPAFLVENEAERRRGKKRMVVNYKAMNKATKGDAHNLPNKDELLTLVR 313

Query: 1407 DAKVFSKFDMKSGFWQIQLQEKDKYKTAFTVPFGQYEWNVMPFGLKNAPSEFQR-IMNEI 1465
              K++S FD KSG WQ+ L ++ +  TAFT P G Y+WNV+PFGLK APS F +   N  
Sbjct: 314  GKKIYSSFDCKSGLWQVLLDKESQLLTAFTCPQGHYQWNVVPFGLKQAPSIFPKTYANSH 373

Query: 1466 FNPYSKLTIVYIDDVLIFSQT-LDQHFKHLNTFISVIKRNGLAVSKTKVSLFQTKIRFLG 1524
             N YSK   VY+DD+L+FS T   +H+ H+   +   ++ G+ +SK K  LF+ KI FLG
Sbjct: 374  SNQYSKYCCVYVDDILVFSNTGRKEHYIHVLNILRRCEKLGIILSKKKAQLFKEKINFLG 433

Query: 1525 HNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRLK 1584
              I QGT  P N  +E   KFPD+I DK QLQRFLG L Y +D+ P+L++I KPL  +LK
Sbjct: 434  LEIDQGTHCPQNHILEHIHKFPDRIEDKKQLQRFLGILTYASDYIPKLASIRKPLQSKLK 493

Query: 1585 KDPP-P*SDIHTNVVKQIKLRVKNLPCLYLPNPQAFKIIETDASDIGFGGILKQKVFDKE 1643
            +D     +D  +  + +IK  +K+ P LY P P    +IETDAS+  +GGILK      E
Sbjct: 494  EDSTWTWNDTDSQYMAKIKKNLKSFPKLYHPEPNDKLVIETDASEEFWGGILKAIHNSHE 553

Query: 1644 QIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQSDLINQKFLVCVDCKS 1694
             I  + S  +  A++NY + +KE+LA++  I KF   L   +FL+  D K+
Sbjct: 554  YICRYASGSFKAAERNYHSNEKELLAVIRVIKKFSIYLTPSRFLIRTDNKN 604


>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 680

 Score =  288 bits (736), Expect = 1e-76
 Identities = 203/602 (33%), Positives = 311/602 (50%), Gaps = 59/602 (9%)

Query: 1131 LEIPALFDTGADSSCISEGLIPTRYFEKTTEKLSA--AEGSRLKIK-------------- 1174
            +E+    DTGA     S+ +IP  ++      +    A+GS + I               
Sbjct: 39   IELHCFVDTGASLCIASKFVIPEEHWVNAERPIMVKIADGSSITISKVCKDIDLIIVGVI 98

Query: 1175 YKIPSAIIKNGSLEIETPFLLVRNLSQKIIIGTPFIK----------KLFPYNTDENGIT 1224
            +KIP+   +   ++    F++  N  Q   +  PFI+          K +P +  +    
Sbjct: 99   FKIPTVYQQESGID----FIIGNNFCQ---LYEPFIQFTDRVIFTKNKSYPVHIAKLTRA 151

Query: 1225 VQHLGQPIL------FKFSEP-PIDKTLNVISYKEKQINFLKEEISYRTIEEQL---QQP 1274
            V+   +  L       K  +P P++ + N I    ++I  L E    R  EE+L   QQ 
Sbjct: 152  VRVGTEGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSE--GRRLSEEKLFITQQR 209

Query: 1275 SVKSRIEDILENIQSSICSDLPNAFWERKSHMVELPYEKDFSDKQISTKARPIQMNEELL 1334
              K+  E++LE + S    D PN   +     ++L      SD   + K +P++ +    
Sbjct: 210  MQKT--EELLEKVCSENPLD-PNKTKQWMKASIKL------SDPSKAIKVKPMKYSPMDR 260

Query: 1335 HFCQKEINDLLEKKLIRRSKSPWSCAAFYVNKQAEIERGTPRLVINYKPLNQALCWIRYP 1394
                K+I +LL+ K+I+ SKSP    AF VN +AE  RG  R+V+NYK +N+A     Y 
Sbjct: 261  EEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAENGRGNKRMVVNYKAMNKATVGDAYN 320

Query: 1395 IPNKKDLLARQHDAKVFSKFDMKSGFWQIQLQEKDKYKTAFTVPFGQYEWNVMPFGLKNA 1454
            +PNK +LL      K+FS FD KSGFWQ+ L ++ +  TAFT P G YEWNV+PFGLK A
Sbjct: 321  LPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQA 380

Query: 1455 PSEFQRIMNEIFNPYSKLTIVYIDDVLIFSQTLDQHFKHLNTFISVIKRNGLAVSKTKVS 1514
            PS FQR M+E F  + K   VY+DD+++FS   + H  H+   +    ++G+ +SK K  
Sbjct: 381  PSIFQRHMDEAFRVFRKFCCVYVDDIVVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQ 440

Query: 1515 LFQTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLST 1574
            LF+ KI FLG  I +GT  P    +E  +KFPD + DK QLQRFLG L Y +D+ P L+ 
Sbjct: 441  LFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPNLAQ 500

Query: 1575 IIKPLHDRLKKDPP-P*SDIHTNVVKQIKLRVKNLPCLYLPNPQAFKIIETDASDIGFGG 1633
            + +PL  +LK++ P   +   T  ++++K  ++  P L+ P P+   IIETDASD  +GG
Sbjct: 501  MRQPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGG 560

Query: 1634 ILK----QKVFDKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQSDLINQKFLVC 1689
            +LK     +  + E I  + S  +  A++NY +  KE LA++ +I KF   L    FL+ 
Sbjct: 561  MLKAIKINEGTNTELICRYRSGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIR 620

Query: 1690 VD 1691
             D
Sbjct: 621  TD 622


>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 674

 Score =  287 bits (735), Expect = 2e-76
 Identities = 167/418 (39%), Positives = 242/418 (56%), Gaps = 12/418 (2%)

Query: 1279 RIEDILENIQSSICSDLPNAFWERKSHMVELPYEKDFSDKQISTKARPIQMNEELLHFCQ 1338
            +IE++LE + S    D PN   +     ++L      SD   + K +P++ +        
Sbjct: 206  KIEELLEKVCSENPLD-PNKTKQWMKASIKL------SDPSKAIKVKPMKYSPMDREEFD 258

Query: 1339 KEINDLLEKKLIRRSKSPWSCAAFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNK 1398
            K+I +LL+ K+I+ SKSP    AF VN +AE  RG  R+V+NYK +N+A     Y  PNK
Sbjct: 259  KQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNPPNK 318

Query: 1399 KDLLARQHDAKVFSKFDMKSGFWQIQLQEKDKYKTAFTVPFGQYEWNVMPFGLKNAPSEF 1458
             +LL      K+FS FD KSGFWQ+ L ++ +  TAFT P G YEWNV+PFGLK APS F
Sbjct: 319  DELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIF 378

Query: 1459 QRIMNEIFNPYSKLTIVYIDDVLIFSQTLDQHFKHLNTFISVIKRNGLAVSKTKVSLFQT 1518
            QR M+E F  + K   VY+DD+L+FS   + H  H+   +    ++G+ +SK K  LF+ 
Sbjct: 379  QRHMDEAFRVFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKK 438

Query: 1519 KIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKP 1578
            KI FLG  I +GT  P    +E  +KFPD + DK QLQRFLG L Y +D+ P+L+ I KP
Sbjct: 439  KINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKP 498

Query: 1579 LHDRLKKDPP-P*SDIHTNVVKQIKLRVKNLPCLYLPNPQAFKIIETDASDIGFGGILK- 1636
            L  +LK++ P   +   T  ++++K  ++  P L+ P P+   IIETDASD  +GG+LK 
Sbjct: 499  LQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKA 558

Query: 1637 ---QKVFDKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQSDLINQKFLVCVD 1691
                +  + E I  + S  +  A++NY +  KE LA++ +I KF   L    FL+  D
Sbjct: 559  IKINEGTNTELICRYASGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTD 616


>POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;
            Protease (EC 3.4.23.-); Reverse transcriptase (EC
            2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
          Length = 1886

 Score =  235 bits (600), Expect = 7e-61
 Identities = 207/714 (28%), Positives = 345/714 (47%), Gaps = 80/714 (11%)

Query: 1035 DTFKRLEKSTVKPVTIQDL-HFEINSLKTEVKSLKQ-----IQKSQQLILEKLTKNYEED 1088
            +  K  EK++    TIQ+    E+N +K E++  K+     I + ++ I+ +     EE 
Sbjct: 1114 EALKHSEKASRVFSTIQESDEAELNLIKEELRQFKEETRMAIAQLKEAIIVQEEDTIEER 1173

Query: 1089 DSSIPDSNPAPNDNCEDFLENINQVTIQKFF---IHVKILIGDFVLE---IPALFDTGAD 1142
             + I +         E   ENI   T +  +    +VK+ I    +E   I A+ DTGA 
Sbjct: 1174 CAMILE---------EKHTENIYSATAKAEYNGLYNVKVGIKPDNMEPYYINAIVDTGAT 1224

Query: 1143 SSCISEGLIPTRYFE--KTTEKLSAAEGSRLKIKYKIPSAIIKNGSLEIETPFLLVRNLS 1200
            +  I    IP  Y+E  K T    +  G     +  I +  I  G      P   V N+ 
Sbjct: 1225 ACLIQISAIPENYYEDAKVTVNFRSVLGIGTSTQM-IKAGRILIGEQYFRMPVTYVMNMG 1283

Query: 1201 Q----KIIIGTPFIKKLFPYNTDENGITVQHLGQPILFKFSEPPIDKTLNVI-SYKEKQI 1255
                 ++IIG  FI+ L      E G+ ++          +     +T  V  S +E ++
Sbjct: 1284 LSPGIQMIIGCSFIRSL------EGGLRIEKDIITFYKLVTSIETSRTTQVANSIEELEL 1337

Query: 1256 NFLKEEISYRTIEEQLQQPSVKS-----RIEDILENIQS-SICSDLPNAFWERKSHMVEL 1309
            +    E  Y  I   ++ PS        + +D+L+ ++      + P  FW+      +L
Sbjct: 1338 S----EDEYLNIAASVETPSFLDQEFARKNKDLLKEMKEMKYIGENPMEFWKNNKIKCKL 1393

Query: 1310 PYEKDFSDKQISTKARPIQM----NEELLHFCQKEINDLLEKKLIRRSKSPWSCAAFYVN 1365
                +  +  I    RPI+     +EE +    ++IN LL+ K+IR S+S     AF V 
Sbjct: 1394 ----NIINPDIKIMGRPIKHVTPGDEEAM---TRQINLLLQMKVIRPSESKHRSTAFIVR 1446

Query: 1366 KQAEIE-------RGTPRLVINYKPLNQALCWIRYPIPNKKDLLARQHDAKVFSKFDMKS 1418
               EI+       +G  R+V NYK LN+     +Y +P    ++++   +K++SKFD+KS
Sbjct: 1447 SGTEIDPITGKEKKGKERMVFNYKLLNENTESDQYSLPGINTIISKVGRSKIYSKFDLKS 1506

Query: 1419 GFWQIQLQEKDKYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIFNPYSKLTIVYID 1478
            GFWQ+ ++E+    TAF      YEW VMPFGLKNAP+ FQR M+ +F    K   VYID
Sbjct: 1507 GFWQVAMEEESVPWTAFLAGNKLYEWLVMPFGLKNAPAIFQRKMDNVFKGTEKFIAVYID 1566

Query: 1479 DVLIFSQTLDQHFKHLNTFISVIKRNGLAVSKTKVSLFQTKIRFLGHNIHQGTI----IP 1534
            D+L+FS+T +QH +HL T + + K NGL +S TK+ +   +I FLG ++    I      
Sbjct: 1567 DILVFSETAEQHSQHLYTMLQLCKENGLILSPTKMKIGTPEIDFLGASLGCTKIKLQPHI 1626

Query: 1535 INRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRLKKDPPP*SDIH 1594
            I++  +F+D   +++     ++ +LG L+Y  ++   +  +++PL  ++        +  
Sbjct: 1627 ISKICDFSD---EKLATPEGMRSWLGILSYARNYIQDIGKLVQPLRQKMAPTGDKRMNPE 1683

Query: 1595 T-NVVKQIKLRVKNLPCLYLPNPQAFKIIETDASDIGFGGILKQKVF-----DKEQIIAF 1648
            T  +V+QIK +VKNLP L LP   +F IIETD    G+G + K K+        E+I A+
Sbjct: 1684 TWKMVRQIKEKVKNLPDLQLPPKDSFIIIETDGCMTGWGAVCKWKMSKHDPRSTERICAY 1743

Query: 1649 TSKHWNPAQQNYSTVKKEVLAIVLSISKFQSDLINQKFLVC-VDCKSAKEILQK 1701
             S  +NP +   ST+  E+ A +  + KF+   +++K L+   DC++  +   K
Sbjct: 1744 ASGSFNPIK---STIDAEIQAAIHGLDKFKIYYLDKKELIIRSDCEAIIKFYNK 1794



 Score = 33.9 bits (76), Expect = 3.9
 Identities = 43/251 (17%), Positives = 90/251 (35%), Gaps = 43/251 (17%)

Query: 735 LSNLKCKSLGDFRWYKDTFLTRVYTR----EDSQQAFWKEKFLAGLPKSFGDKVREKLRS 790
           L  L C +    R Y   +LT          +++     E+    +P + G++V +  + 
Sbjct: 735 LKQLVCPNYQSIRRYLMDYLTLAAETGLMWSETEGPAISEELFTKMPAAIGERVAQAYKI 794

Query: 791 QNPGGEIPYHTLSYGQLIAIIQRVALKICQDDKIQQQLTKEKSQNRRDLGTFCEQFGIQG 850
            +P   +   +  Y  +  + ++     C++    + L             FC  F I+G
Sbjct: 795 MDPTSAVNLPSRVYFTINYLTEQ-----CKEASYMRSLKALD---------FCRDFPIEG 840

Query: 851 CPKKPKPRKQDPPPKQQWRKRSSRNDHRKPKPRSKPQSSQIPKNPPETRPSQGKDVTCYN 910
              +   +K+    K       + ++H +                  T+    +   CY 
Sbjct: 841 YYGRSGEKKKYTARKATKYTGKAHDNHIRV-----------------TKAKYQRKCKCYI 883

Query: 911 CGKPGHISRYCRLKRRISE-LHLEPEIEDKINNLLIQTSDEEESNPSDSEV-------SE 962
           CG+ GH +  CR K +  + + +   ++ K N  ++   D+EE +     V        E
Sbjct: 884 CGQEGHYANQCRNKHKDQQRVAILQSLDLKENEEVVSADDKEEEDDEIFSVLGEEDYQEE 943

Query: 963 DLNQIQNDDSQ 973
            +  ++ DD Q
Sbjct: 944 TIMVLEEDDIQ 954


>POL_SOCMV (P15629) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 692

 Score =  227 bits (578), Expect = 2e-58
 Identities = 173/537 (32%), Positives = 276/537 (51%), Gaps = 34/537 (6%)

Query: 1119 FIHVKILIGDFVLEIPALFDTGADSSCISEGLIPTRY-FEKTTEKLSAAEGSRLKIKYKI 1177
            FI V I   +F+    A  DTGA + C  +  I   +   K  +++  A+ S+  I+  I
Sbjct: 21   FIKVSIGKRNFL----AYIDTGA-TLCFGKRKISNNWEILKQPKEIIIADKSKHYIREAI 75

Query: 1178 PSAIIKNGSLEIETPFLLVRNLSQKIIIGTPFIKKLFPYNTDENGITVQ--HLGQPILFK 1235
             +  +K  + E   P + + +    +IIG  F+K   P+      I ++  +L  P   +
Sbjct: 76   SNVFLKIENKEFLIPIIYLHDSGLDLIIGNNFLKLYQPFIQRLETIELRWKNLNNPKESQ 135

Query: 1236 FSEPPIDKTLNVISYKEKQINF-LKEEISYRTIEEQLQQPSVKSRIEDILENIQSSICSD 1294
                 I     V+    ++I+  L++ + ++TIEEQL++                 +CS+
Sbjct: 136  MISTKILTKNEVLKLSFEKIHICLEKYLFFKTIEEQLEE-----------------VCSE 178

Query: 1295 LPNAFWERKSHMVELPYEKDFSDKQISTKARPIQMNEELLHFCQKEINDLLEKKLIRRSK 1354
             P    + K+ ++     KD   +   T   P  + +  +   ++E  DLL+K LIR S+
Sbjct: 179  HPLDETKNKNGLLIEIRLKDPLQEINVTNRIPYTIRD--VQEFKEECEDLLKKGLIRESQ 236

Query: 1355 SPWSCAAFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARQHDAKVFSKF 1414
            SP S  AFYV    EI+RG  R+VINYK +N+A     Y +P K  +L +   +  FS  
Sbjct: 237  SPHSAPAFYVENHNEIKRGKRRMVINYKKMNEATIGDSYKLPRKDFILEKIKGSLWFSSL 296

Query: 1415 DMKSGFWQIQLQEKDKYKTAFTV-PFGQYEWNVMPFGLKNAPSEFQRIMNEIFNPYSKLT 1473
            D KSG++Q++L E  K  TAF+  P   YEWNV+ FGLK APS +QR M++       + 
Sbjct: 297  DAKSGYYQLRLHENTKPLTAFSCPPQKHYEWNVLSFGLKQAPSIYQRFMDQSLKGLEHIC 356

Query: 1474 IVYIDDVLIFSQ-TLDQHFKHLNTFISVIKRNGLAVSKTKVSLFQTKIRFLGHNIH-QGT 1531
            + YIDD+LIF++ + +QH   +   +  IK  G+ +SK K  L Q +I +LG  I   G 
Sbjct: 357  LAYIDDILIFTKGSKEQHVNDVRIVLQRIKEKGIIISKKKSKLIQQEIEYLGLKIQGNGE 416

Query: 1532 IIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVAD--FCPQLSTIIKPLHDRLK-KDPP 1588
            I       E   +FPD++ D+ Q+QRFLGC+NY+A+  F   L+   K L  ++  K+P 
Sbjct: 417  IDLSPHTQEKILQFPDELEDRKQIQRFLGCINYIANEGFFKNLALERKHLQKKISVKNPW 476

Query: 1589 P*SDIHTNVVKQIKLRVKNLPCLYLPNPQAFKIIETDASDIGFGGILKQKVFDKEQI 1645
                I T +V+ IK ++++LP LY  + Q F I+ETDAS   + G L+     K++I
Sbjct: 477  KWDTIDTKMVQSIKGKIQSLPKLYNASIQDFLIVETDASQHSWSGCLRALPKGKQKI 533


>POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat
            protein; Protease (EC 3.4.23.-); Reverse transcriptase
            (EC 2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
          Length = 1675

 Score =  210 bits (535), Expect = 2e-53
 Identities = 213/844 (25%), Positives = 374/844 (44%), Gaps = 94/844 (11%)

Query: 899  RPSQGKDVTCYNCGKPGHISRYCRLKRRISELHLEPEIEDKINNLLIQTSDEEESNPSDS 958
            RPS  K   CY C    H++  C   RR +       I+    +++   SD+E+      
Sbjct: 765  RPSIKKKCRCYICQDENHLANRC--PRRYTNQARASLIDGLDEDIVSIASDDED------ 816

Query: 959  EVSEDLNQIQNDDSQSSSSVNTLSINTLTNEQDLLFRAINSIPDPEEK---KIYLERLRS 1015
             +   L  I+ D+  + SS        +  ++D +    +   D  +    K    +   
Sbjct: 817  -IENFLEIIELDEFIAHSSQEHEHTWEIGGKKDKVCEICSYFTDYNKTVSCKTCETQYCK 875

Query: 1016 TLEDRPPKSPITINKFNLRDTFKRLEKSTVKPVTIQDLHFEINSLKTEVKSLKQIQKSQQ 1075
            T  D+            L      ++K T +   I DL   + +L+  V  L+   + Q 
Sbjct: 876  TCSDQ------------LALEVTEVKKPTKEETMIDDLKLNVKNLEFRVTILEHKVEMQN 923

Query: 1076 LI--LEKLTKNYEEDDSSIPDSNPAPNDNCEDFLE-NINQVTIQKFFIHVKILIGDFVLE 1132
            L    E +    + + + IP ++ A   N  ++++ +IN+      ++  KI   +    
Sbjct: 924  LQDKFETMQIRNKSEITEIPTTSLAMRANESNYIKTSINKTA--GCYVETKISFNNENRI 981

Query: 1133 IPALFDTGADSSCISEGLIPTRYFEKTTEKLS--AAEGSRLKIKYKIPSAIIKNGSLEIE 1190
            I AL D+G+  + I   LIP  +   T  ++   A + S+  +  ++   I K    E++
Sbjct: 982  ITALIDSGSTHNIICPTLIPASWINNTHREIIMFAVDNSKYNLNQELIDDI-KLQFQEVD 1040

Query: 1191 TPFLLVRNLSQKIIIGTPFIKKLFPYN--TDENG--------ITVQ-------------- 1226
              F +   L Q  +   P    +  +   T+ENG        IT+Q              
Sbjct: 1041 ETFGIKYKLGQTYVAPKPTKTFIIGHRFLTNENGSVTIHKDYITIQKTTGIYPTARHELK 1100

Query: 1227 ------HLGQPILFKFSEPPIDKTLNVISYKEKQINFLKEEISYRTIEEQLQQPSVKSRI 1280
                  H G+P LF       +K  ++ SY+ + I   K EI  +++   +++      I
Sbjct: 1101 SEFARKHGGRPPLFSNIPETYNKIPHLHSYQPQPILGYKNEIGNQSLITMVKELEALGFI 1160

Query: 1281 -EDILENIQSSICSDLPNAFWERKSHMVELPYEKDFSDKQISTKARPIQMNEELLHFCQK 1339
             +DI +N  + +C D      +       +PY    +DK++                 +K
Sbjct: 1161 GDDITKNRTTWVC-DFKIINPDINITCATIPYTP--ADKEVF----------------EK 1201

Query: 1340 EINDLLEKKLIRRSKSPWS--CAAFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPN 1397
            +I +LL+ KLI+++        AAF V   +E     PR+V NYK LN  +    + IP+
Sbjct: 1202 QIKELLDNKLIKKADPTCRHRTAAFIVRNHSEEVAQKPRIVYNYKRLNDNMHTDPFNIPH 1261

Query: 1398 KKDLLARQHDAKVFSKFDMKSGFWQIQLQEKDKYKTAFTVPFGQYEWNVMPFGLKNAPSE 1457
            K  ++     A +FSKFD+K+GF  ++L++  K  T FT   G Y WNV PFG+ NAP  
Sbjct: 1262 KISMINLIQKANIFSKFDLKAGFHHMKLKDDFKDWTTFTCSEGLYTWNVCPFGIANAPCA 1321

Query: 1458 FQRIMNEIFNPYSKLTIVYIDDVLIFSQTLDQHFKHLNTFISVIKRNGLAVSKTKVSLFQ 1517
            FQR M E F    K  ++YIDD+LI S    +H +HL  F + +K  G  +SK K  +F 
Sbjct: 1322 FQRFMQESFGDL-KFALLYIDDILIASNNEKEHIEHLKIFFNRVKEVGCVLSKKKSKMFL 1380

Query: 1518 TKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQ-LQRFLGCLNYVADFCPQLSTII 1576
             ++ +LG  I +G I      ++   KF    ++  + LQ +LG LNY   +   LS ++
Sbjct: 1381 KEVEYLGVEIKEGKISLQPHIVDKIKKFDKNKLNTLKGLQAYLGLLNYARGYIKDLSKLV 1440

Query: 1577 KPLHDRLKKDPPP*SDIHT-NVVKQIKLRVKNLPCLYLPNPQAFKIIETDASDIGFGGIL 1635
             PL+ +  K+     +    N++ +I+  V  +  L  P    + IIETDAS+ G+G +L
Sbjct: 1441 GPLYKKTGKNGQRIFNKEDWNIIFKIEREVSKIKPLERPKETDYIIIETDASEEGWGAVL 1500

Query: 1636 -----KQKVFDKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQSDLINQKFLVCV 1690
                 K    D E+I  + S ++   ++ ++++  E+ AI  +++KFQ   +++ F +  
Sbjct: 1501 VCKPDKYSGKDTEKIAGYASGNFG-EKKTWTSLDYEIEAINEALNKFQI-YLDKDFTIRT 1558

Query: 1691 DCKS 1694
            DC++
Sbjct: 1559 DCEA 1562



 Score = 43.9 bits (102), Expect = 0.004
 Identities = 25/107 (23%), Positives = 53/107 (49%), Gaps = 2/107 (1%)

Query: 6   YLHIGSVQVGLKPLTRKSLDIAVLLCLRDVRHNQFHDSLLGTVETSLSNG-PIFFNCFPD 64
           Y HIG + +G+K L R+ +   V++   D    +  ++ +G++E  ++ G  +F++C PD
Sbjct: 112 YYHIGMMAIGVKGLHRRKIGTKVMIMFYDDSFGKAREASIGSIEMDMNAGCGVFYSC-PD 170

Query: 65  LTVSLEDKNILDVLFLNIKLHGLDMKEDSIPISLIYRVQYKVMNSIK 111
               ++D + L +    +     + K  S+ I  I R+   + +  K
Sbjct: 171 FAKYIKDLSHLKIGIQTLGYENYEGKNLSVAIKTIGRLTTNIQSKYK 217


>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
            type 1
          Length = 1333

 Score =  201 bits (512), Expect = 1e-50
 Identities = 173/674 (25%), Positives = 313/674 (45%), Gaps = 76/674 (11%)

Query: 1028 INKFNLRDTFKRLEKSTVKPVTIQDLHFEINSLKTEVKSLKQ-IQKSQQLILEKLTKNYE 1086
            ++K N+ D   RL    + P +++ L  E N  K+E+  +   ++  +     K  K ++
Sbjct: 147  MSKANVDDFHTRLFILWMLPYSLRKLK-ERNYWKSEISEIYDFLEDKRTASYGKTHKRFQ 205

Query: 1087 EDDSSIPDSNPAPNDNCEDFLE----NINQV--TIQKFFIHVK--------ILIGDFVLE 1132
              + ++   + +  +N  +       N++++  +  KF  H +          + DF   
Sbjct: 206  LQNKNLGKESLSKKNNTTNSRNLRKTNVSRIEYSSNKFLNHTRKRYEMVLQAELPDFKCS 265

Query: 1133 IPALFDTGADSSCISEGLI-----PTRYFEKTTEKLSAAEGSRLKIKYKIPSAIIKNGSL 1187
            IP L DTGA ++ I+E  +     PTR + K+            KI  K     I    +
Sbjct: 266  IPCLIDTGAQANIITEETVRAHKLPTRPWSKSVIYGGVYPN---KINRKTIKLNISLNGI 322

Query: 1188 EIETPFLLVRNLSQKIIIGTPFIKKLFPYNTDENGITVQHLGQPILFKFSEPPIDKTLNV 1247
             I+T FL+V+  S    I       L+  N +                     I  + + 
Sbjct: 323  SIKTEFLVVKKFSHPAAIS---FTTLYDNNIE---------------------ISSSKHT 358

Query: 1248 ISYKEKQINFLKEEISYRTIEEQLQQPSVKSRIEDILENIQSSICSDLPNAFWERKSHMV 1307
            +S   K  N +KE           + P +    +DI     +     LP         + 
Sbjct: 359  LSQMNKVSNIVKEP----------ELPDIYKEFKDITAETNTE---KLPKP-------IK 398

Query: 1308 ELPYEKDFSDKQISTKARPIQMNEELLHFCQKEINDLLEKKLIRRSKSPWSCAAFYVNKQ 1367
             L +E + + +      R   +    +     EIN  L+  +IR SK+  +C   +V K+
Sbjct: 399  GLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEINQGLKSGIIRESKAINACPVMFVPKK 458

Query: 1368 AEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARQHDAKVFSKFDMKSGFWQIQLQE 1427
                 GT R+V++YKPLN+ +    YP+P  + LLA+   + +F+K D+KS +  I++++
Sbjct: 459  ----EGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRK 514

Query: 1428 KDKYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIFNPYSKLTIV-YIDDVLIFSQT 1486
             D++K AF  P G +E+ VMP+G+  AP+ FQ  +N I     +  +V Y+DD+LI S++
Sbjct: 515  GDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKS 574

Query: 1487 LDQHFKHLNTFISVIKRNGLAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFP 1546
              +H KH+   +  +K   L +++ K    Q++++F+G++I +    P    I+   ++ 
Sbjct: 575  ESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQW- 633

Query: 1547 DQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRLKKDPP-P*SDIHTNVVKQIKLRV 1605
             Q  ++ +L++FLG +NY+  F P+ S +  PL++ LKKD     +   T  ++ IK  +
Sbjct: 634  KQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCL 693

Query: 1606 KNLPCLYLPNPQAFKIIETDASDIGFGGILKQK-VFDKEQIIAFTSKHWNPAQQNYSTVK 1664
             + P L   +     ++ETDASD+  G +L QK   DK   + + S   + AQ NYS   
Sbjct: 694  VSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSD 753

Query: 1665 KEVLAIVLSISKFQ 1678
            KE+LAI+ S+  ++
Sbjct: 754  KEMLAIIKSLKHWR 767


>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
            type 2
          Length = 1333

 Score =  199 bits (507), Expect = 4e-50
 Identities = 172/674 (25%), Positives = 313/674 (45%), Gaps = 76/674 (11%)

Query: 1028 INKFNLRDTFKRLEKSTVKPVTIQDLHFEINSLKTEVKSLKQ-IQKSQQLILEKLTKNYE 1086
            ++K N+ D   RL    + P +++ L  E N  K+E+  +   ++  +     K  K ++
Sbjct: 147  MSKANVDDFHTRLFILWMLPYSLRKLK-ERNYWKSEISEIYDFLEDKRTASYGKTHKRFQ 205

Query: 1087 EDDSSIPDSNPAPNDNCEDFLE----NINQV--TIQKFFIHVK--------ILIGDFVLE 1132
              + ++   + +  +N  +       N++++  +  KF  H +          + DF   
Sbjct: 206  PQNKNLGKESLSKKNNTTNSRNLRKTNVSRIEYSSNKFLNHTRKRYEMVLQAELPDFKCS 265

Query: 1133 IPALFDTGADSSCISEGLI-----PTRYFEKTTEKLSAAEGSRLKIKYKIPSAIIKNGSL 1187
            IP L DTGA ++ I+E  +     PTR + K+            KI  K     I    +
Sbjct: 266  IPCLIDTGAQANIITEETVRAHKLPTRPWSKSVIYGGVYPN---KINRKTIKLNISLNGI 322

Query: 1188 EIETPFLLVRNLSQKIIIGTPFIKKLFPYNTDENGITVQHLGQPILFKFSEPPIDKTLNV 1247
             I+T FL+V+  S    I       L+  N +                     I  + + 
Sbjct: 323  SIKTEFLVVKKFSHPAAIS---FTTLYDNNIE---------------------ISSSKHT 358

Query: 1248 ISYKEKQINFLKEEISYRTIEEQLQQPSVKSRIEDILENIQSSICSDLPNAFWERKSHMV 1307
            +S   K  N +KE           + P +    +DI     +     LP         + 
Sbjct: 359  LSQMNKVSNIVKEP----------ELPDIYKEFKDITAETNTE---KLPKP-------IK 398

Query: 1308 ELPYEKDFSDKQISTKARPIQMNEELLHFCQKEINDLLEKKLIRRSKSPWSCAAFYVNKQ 1367
             L +E + + +      R   +    +     EIN  L+  +IR SK+  +C   +V K+
Sbjct: 399  GLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEINQGLKSGIIRESKAINACPVMFVPKK 458

Query: 1368 AEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARQHDAKVFSKFDMKSGFWQIQLQE 1427
                 GT R+V++YKPLN+ +    YP+P  + LLA+   + +F+K D+KS +  I++++
Sbjct: 459  ----EGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRK 514

Query: 1428 KDKYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIFNPYSKLTIV-YIDDVLIFSQT 1486
             D++K AF  P G +E+ VMP+G+  AP+ FQ  +N I     +  +V Y+D++LI S++
Sbjct: 515  GDEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKS 574

Query: 1487 LDQHFKHLNTFISVIKRNGLAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFP 1546
              +H KH+   +  +K   L +++ K    Q++++F+G++I +    P    I+   ++ 
Sbjct: 575  ESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQW- 633

Query: 1547 DQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRLKKDPP-P*SDIHTNVVKQIKLRV 1605
             Q  ++ +L++FLG +NY+  F P+ S +  PL++ LKKD     +   T  ++ IK  +
Sbjct: 634  KQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCL 693

Query: 1606 KNLPCLYLPNPQAFKIIETDASDIGFGGILKQK-VFDKEQIIAFTSKHWNPAQQNYSTVK 1664
             + P L   +     ++ETDASD+  G +L QK   DK   + + S   + AQ NYS   
Sbjct: 694  VSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSD 753

Query: 1665 KEVLAIVLSISKFQ 1678
            KE+LAI+ S+  ++
Sbjct: 754  KEMLAIIKSLKHWR 767


>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
            type 3
          Length = 1333

 Score =  197 bits (500), Expect = 3e-49
 Identities = 173/675 (25%), Positives = 307/675 (44%), Gaps = 78/675 (11%)

Query: 1028 INKFNLRDTFKRLEKSTVKPVTIQDLHFEINSLKTEVKSLKQ-IQKSQQLILEKLTKNYE 1086
            ++K N+ D   RL    + P +++ L  E N  K+E+  +   ++  +     K  K ++
Sbjct: 147  MSKANVDDFHTRLFILWMLPYSLRKLK-ERNYWKSEISEIYDFLEDKRTASYGKTHKRFQ 205

Query: 1087 EDDSSIPDSNPAPNDNCEDFLENINQVTIQ-------KFFIHVK--------ILIGDFVL 1131
              + ++      P  N      N+ +  I        KF  H +          + DF  
Sbjct: 206  PQNKNL-GKEFLPKKNNTTNSRNLRKTNISRIEYSSNKFLNHTRKRYEMVLQAELPDFKC 264

Query: 1132 EIPALFDTGADSSCISEGLI-----PTRYFEKTTEKLSAAEGSRLKIKYKIPSAIIKNGS 1186
             IP L DTG  ++ I+E  +     PTR + K+            KI  K     I    
Sbjct: 265  SIPCLIDTGTQANIITEETVRAHKLPTRPWSKSVIYGGVYPN---KINRKTIKLNISLNG 321

Query: 1187 LEIETPFLLVRNLSQKIIIGTPFIKKLFPYNTDENGITVQHLGQPILFKFSEPPIDKTLN 1246
            + I+T FL+V+  S    I       L+  N +                     I  + +
Sbjct: 322  ISIKTEFLVVKKFSHPAAIS---FTTLYDNNIE---------------------ISSSKH 357

Query: 1247 VISYKEKQINFLKEEISYRTIEEQLQQPSVKSRIEDILENIQSSICSDLPNAFWERKSHM 1306
             +S   K  N +KE           + P +    +DI     +     LP         +
Sbjct: 358  TLSQMNKVSNIVKEP----------ELPDIYKEFKDITAETNTE---KLPKP-------I 397

Query: 1307 VELPYEKDFSDKQISTKARPIQMNEELLHFCQKEINDLLEKKLIRRSKSPWSCAAFYVNK 1366
              L +E + + +      R   +    +     EIN  L+  +IR SK+  +C   +V K
Sbjct: 398  KGLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEINQGLKSGIIRESKAINACPVMFVPK 457

Query: 1367 QAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARQHDAKVFSKFDMKSGFWQIQLQ 1426
            +     GT R+V++YKPLN+ +    YP+P  + LLA+   + +F+K D+KS +  I+++
Sbjct: 458  K----EGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVR 513

Query: 1427 EKDKYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIFNPYSKLTIV-YIDDVLIFSQ 1485
            + D++K AF  P G +E+ VMP+G+  AP+ FQ  +N I     +  +V Y+D++LI S+
Sbjct: 514  KGDEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSK 573

Query: 1486 TLDQHFKHLNTFISVIKRNGLAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKF 1545
            +  +H KH+   +  +K   L +++ K    Q++++F+G++I +    P    I+   ++
Sbjct: 574  SESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQW 633

Query: 1546 PDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRLKKDPP-P*SDIHTNVVKQIKLR 1604
              Q  ++ +L++FLG +NY+  F P+ S +  PL++ LKKD     +   T  ++ IK  
Sbjct: 634  -KQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQC 692

Query: 1605 VKNLPCLYLPNPQAFKIIETDASDIGFGGILKQK-VFDKEQIIAFTSKHWNPAQQNYSTV 1663
            + + P L   +     ++ETDASD+  G +L QK   DK   + + S   + AQ NYS  
Sbjct: 693  LVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVS 752

Query: 1664 KKEVLAIVLSISKFQ 1678
             KE+LAI+ S+  ++
Sbjct: 753  DKEMLAIIKSLKHWR 767


>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
            transposon 17.6 [Contains: Protease (EC 3.4.23.-);
            Reverse transcriptase (EC 2.7.7.49); Endonuclease]
          Length = 1058

 Score =  191 bits (485), Expect = 1e-47
 Identities = 116/358 (32%), Positives = 190/358 (52%), Gaps = 9/358 (2%)

Query: 1338 QKEINDLLEKKLIRRSKSPWSCAAFYVNKQAEIE-RGTPRLVINYKPLNQALCWIRYPIP 1396
            + +I D+L + +IR S SP++   + V K+ +   +   R+VI+Y+ LN+     R+PIP
Sbjct: 224  ESQIQDMLNQGIIRTSNSPYNSPIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDRHPIP 283

Query: 1397 NKKDLLARQHDAKVFSKFDMKSGFWQIQLQEKDKYKTAFTVPFGQYEWNVMPFGLKNAPS 1456
            N  ++L +      F+  D+  GF QI++  +   KTAF+   G YE+  MPFGLKNAP+
Sbjct: 284  NMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKNAPA 343

Query: 1457 EFQRIMNEIFNP-YSKLTIVYIDDVLIFSQTLDQHFKHLNTFISVIKRNGLAVSKTKVSL 1515
             FQR MN+I  P  +K  +VY+DD+++FS +LD+H + L      + +  L +   K   
Sbjct: 344  TFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDKCEF 403

Query: 1516 FQTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTI 1575
             + +  FLGH +    I P    IE   K+P     K +++ FLG   Y   F P  + I
Sbjct: 404  LKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPK-EIKAFLGLTGYYRKFIPNFADI 462

Query: 1576 IKPLHDRLKKDP--PP*SDIHTNVVKQIKLRVKNLPCLYLPNPQAFKIIETDASDIGFGG 1633
             KP+   LKK+      +  + +  K++K  +   P L +P+      + TDASD+  G 
Sbjct: 463  AKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLTTDASDVALGA 522

Query: 1634 ILKQKVFDKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQSDLINQKFLVCVD 1691
            +L Q        +++ S+  N  + NYST++KE+LAIV +   F+  L+ + F +  D
Sbjct: 523  VLSQ----DGHPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGRHFEISSD 576


>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
            transposon 297 [Contains: Protease (EC 3.4.23.-); Reverse
            transcriptase (EC 2.7.7.49); Endonuclease]
          Length = 1059

 Score =  189 bits (479), Expect = 7e-47
 Identities = 118/376 (31%), Positives = 196/376 (51%), Gaps = 11/376 (2%)

Query: 1320 ISTKARPIQMNEELLHFCQKEINDLLEKKLIRRSKSPWSCAAFYVNKQAEIERGTP-RLV 1378
            I +K  P+    E+    + ++ ++L + LIR S SP++   + V K+ +       R+V
Sbjct: 207  IYSKQYPLAQTHEIE--VENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVV 264

Query: 1379 INYKPLNQALCWIRYPIPNKKDLLARQHDAKVFSKFDMKSGFWQIQLQEKDKYKTAFTVP 1438
            I+Y+ LN+     RYPIPN  ++L +    + F+  D+  GF QI++ E+   KTAF+  
Sbjct: 265  IDYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTK 324

Query: 1439 FGQYEWNVMPFGLKNAPSEFQRIMNEIFNP-YSKLTIVYIDDVLIFSQTLDQHFKHLNTF 1497
             G YE+  MPFGL+NAP+ FQR MN I  P  +K  +VY+DD++IFS +L +H   +   
Sbjct: 325  SGHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLV 384

Query: 1498 ISVIKRNGLAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQR 1557
             + +    L +   K    + +  FLGH +    I P    ++    +P    DK +++ 
Sbjct: 385  FTKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDK-EIRA 443

Query: 1558 FLGCLNYVADFCPQLSTIIKPLHDRLKKDPPP*SD--IHTNVVKQIKLRVKNLPCLYLPN 1615
            FLG   Y   F P  + I KP+   LKK     +    +    +++K  +   P L LP+
Sbjct: 444  FLGLTGYYRKFIPNYADIAKPMTSCLKKRTKIDTQKLEYIEAFEKLKALIIRDPILQLPD 503

Query: 1616 PQAFKIIETDASDIGFGGILKQKVFDKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSIS 1675
             +   ++ TDAS++  G +L Q        I+F S+  N  + NYS ++KE+LAIV +  
Sbjct: 504  FEKKFVLTTDASNLALGAVLSQ----NGHPISFISRTLNDHELNYSAIEKELLAIVWATK 559

Query: 1676 KFQSDLINQKFLVCVD 1691
             F+  L+ ++FL+  D
Sbjct: 560  TFRHYLLGRQFLIASD 575


>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
            transposon gypsy [Contains: Reverse transcriptase (EC
            2.7.7.49); Endonuclease]
          Length = 1035

 Score =  172 bits (436), Expect = 7e-42
 Identities = 130/473 (27%), Positives = 238/473 (49%), Gaps = 42/473 (8%)

Query: 1245 LNVISYKEKQINFLKEEISYRTIEEQLQQ---PSVK-SRIEDILENIQSSICSDLPNAFW 1300
            L++++    ++N  ++ + Y+ I E+L     PSV  + + DI+  +  S+  +  +   
Sbjct: 94   LDLLTQAGVKLNLAEDSLEYQGIAEKLHYFSCPSVNFTDVNDIV--VPDSVKKEFKDTII 151

Query: 1301 ERKSHMVE----LPYE-------KDFSDKQISTKARPIQMNEELLHFCQKEINDLLEKKL 1349
             RK         LP+        +   ++ + ++A P  M   +  F   E+  LL+  +
Sbjct: 152  RRKKAFSTTNEALPFNTAVTATIRTVDNEPVYSRAYPTLMG--VSDFVNNEVKQLLKDGI 209

Query: 1350 IRRSKSPWSCAAFYVNKQAEIERGTP--RLVINYKPLNQALCWIRYPIPNKKDLLARQHD 1407
            IR S+SP++   + V+K+     G P  RLVI+++ LN+     RYP+P+   +LA    
Sbjct: 210  IRPSRSPYNSPTWVVDKKGTDAFGNPNKRLVIDFRKLNEKTIPDRYPMPSIPMILANLGK 269

Query: 1408 AKVFSKFDMKSGFWQIQLQEKDKYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIF- 1466
            AK F+  D+KSG+ QI L E D+ KT+F+V  G+YE+  +PFGL+NA S FQR ++++  
Sbjct: 270  AKFFTTLDLKSGYHQIYLAEHDREKTSFSVNGGKYEFCRLPFGLRNASSIFQRALDDVLR 329

Query: 1467 NPYSKLTIVYIDDVLIFSQTLDQHFKHLNTFISVIKRNGLAVSKTKVSLFQTKIRFLGHN 1526
                K+  VY+DDV+IFS+    H +H++T +  +    + VS+ K   F+  + +LG  
Sbjct: 330  EQIGKICYVYVDDVIIFSENESDHVRHIDTVLKCLIDANMRVSQEKTRFFKESVEYLGFI 389

Query: 1527 IHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRL--- 1583
            + +         ++   ++P+      +++ FLG  +Y   F    + I +P+ D L   
Sbjct: 390  VSKDGTKSDPEKVKAIQEYPEPDC-VYKVRSFLGLASYYRVFIKDFAAIARPITDILKGE 448

Query: 1584 ---------KKDPPP*SDIHTNVVKQIK--LRVKNLPCLYLPNPQAFKIIETDASDIGFG 1632
                     KK P   ++   N  ++++  L  +++   Y    + F  + TDAS  G G
Sbjct: 449  NGSVSKHMSKKIPVEFNETQRNAFQRLRNILASEDVILKYPDFKKPFD-LTTDASASGIG 507

Query: 1633 GILKQKVFDKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQSDLINQK 1685
             +L Q    + + I   S+     +QNY+T ++E+LAIV ++ K Q+ L   +
Sbjct: 508  AVLSQ----EGRPITMISRTLKQPEQNYATNERELLAIVWALGKLQNFLYGSR 556


>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
            transposon opus [Contains: Protease (EC 3.4.23.-);
            Reverse transcriptase (EC 2.7.7.49); Endonuclease]
          Length = 1003

 Score =  170 bits (430), Expect = 4e-41
 Identities = 123/426 (28%), Positives = 215/426 (49%), Gaps = 21/426 (4%)

Query: 1274 PSVKSRIEDILENIQSSICSDLPNAFWERKSHM-VELPYEKDF---SDKQISTKARPIQM 1329
            P + +   D  + I +S+  + P  F    S M VE   + +    +   I  K+ P  +
Sbjct: 74   PLLAAEHPDGTQEILNSLLGEFPRIFEPPLSGMSVETAVKAEIRTNTQDPIYAKSYPYPV 133

Query: 1330 NEELLHFCQKEINDLLEKKLIRRSKSPWSCAAFYVNKQAEIERGTP-RLVINYKPLNQAL 1388
            N  +    +++I++LL+  +IR S SP++   + V K+ +       R+V+++K LN   
Sbjct: 134  N--MRGEVERQIDELLQDGIIRPSNSPYNSPIWIVPKKPKPNGEKQYRMVVDFKRLNTVT 191

Query: 1389 CWIRYPIPNKKDLLARQHDAKVFSKFDMKSGFWQIQLQEKDKYKTAFTVPFGQYEWNVMP 1448
                YPIP+    LA   +AK F+  D+ SGF QI ++E D  KTAF+   G+YE+  +P
Sbjct: 192  IPDTYPIPDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLP 251

Query: 1449 FGLKNAPSEFQRIMNEIFNPY-SKLTIVYIDDVLIFSQTLDQHFKHLNTFISVIKRNGLA 1507
            FGLKNAP+ FQR++++I   +  K+  VYIDD+++FS+  D H+K+L   ++ + +  L 
Sbjct: 252  FGLKNAPAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQ 311

Query: 1508 VSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVAD 1567
            V+  K     T++ FLG+ +    I    + +    + P     K +L+RFLG  +Y   
Sbjct: 312  VNLEKSHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVK-ELKRFLGMTSYYRK 370

Query: 1568 FCPQLSTIIKPLHDRLK------------KDPPP*SDIHTNVVKQIKLRVKNLPCLYLPN 1615
            F    + + KPL +  +            K P    +        +K  + +   L  P 
Sbjct: 371  FIQDYAKVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPC 430

Query: 1616 PQAFKIIETDASDIGFGGILKQKVFDKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSIS 1675
                  + TDAS+   G +L Q    +++ IA+ S+  N  ++NY+T++KE+LAI+ S+ 
Sbjct: 431  FTKPFHLTTDASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLD 490

Query: 1676 KFQSDL 1681
              ++ L
Sbjct: 491  NLRAYL 496


>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
          Length = 2186

 Score =  168 bits (425), Expect = 1e-40
 Identities = 146/499 (29%), Positives = 236/499 (47%), Gaps = 37/499 (7%)

Query: 1217 NTDENGITVQH-LGQPILFKFSEPPIDKTLNVISYKEKQINFLKEEISYRTIEEQLQQPS 1275
            N  E  ITV+  L  P LF   E   + +  V+   E    F        TI E L++ +
Sbjct: 849  NKAEQDITVEEVLNDPTLFSEIETDTN-SCEVVKTAETYERFT-------TICEHLKREN 900

Query: 1276 VKSR-IEDILENIQS--SICSDLPNAFWERKSHMVELPYEKDFSDKQISTKARPIQMNEE 1332
               R I D++E  Q   +I  D        ++   E   E     + I  K RPI +   
Sbjct: 901  GDDRKIWDVIEQFQDVFAISDDELG-----RNSGTECVIELKEGAEPIRQKPRPIPL--A 953

Query: 1333 LLHFCQKEINDLLEKKLIRRSKSPWSCAAFYVNKQAEIERGTPRLVINYKPLNQALCWIR 1392
            L    +K I  +L +K+IR SKSPWS     V K+     G+ R+ I+Y+ +N+ +    
Sbjct: 954  LKPEIRKMIQKMLNQKVIRESKSPWSSPVVLVKKKD----GSIRMCIDYRKVNKVVKNNA 1009

Query: 1393 YPIPNKKDLLARQHDAKVFSKFDMKSGFWQIQLQEKDKYKTAFTVPFGQYEWNVMPFGLK 1452
            +P+PN +  L      K+++ FDM +GFWQI L EK K  TAF +    +EWNV+PFGL 
Sbjct: 1010 HPLPNIEATLQSLAGKKLYTVFDMIAGFWQIPLDEKSKEITAFAIGSELFEWNVLPFGLV 1069

Query: 1453 NAPSEFQRIMNEIFNP-YSKLTIVYIDDVLIFSQTLDQHFKHLNTFISVIKRNGLAVSKT 1511
             +P+ FQ  M EI          VY+DD+LI S+ ++QH + +   ++ I+++G+ +  +
Sbjct: 1070 ISPALFQGTMEEIIGDLLGVCAFVYVDDLLIASKDMEQHLQDVKEALTRIRKSGMKLRAS 1129

Query: 1512 KVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFP--DQIIDKTQLQRFLGCLNYVADFC 1569
            K  + + ++ +LGH +   T+  +      TDK     +  +  +LQ FLG + Y   F 
Sbjct: 1130 KCHIAKKEVEYLGHKV---TLDGVETQEVKTDKMKQFSRPTNVKELQSFLGLVGYYRKFI 1186

Query: 1570 PQLSTIIKPLHDRLKKDPPP*SDIHTNVV-KQIKLRVKNLPCLYLPNPQAFK------II 1622
               + I   L   +        +    +  +++K  V   P L  P+ +A        +I
Sbjct: 1187 LNFAQIASSLTSLISAKVAWIWEKEQEIAFQELKKLVCQTPVLAQPDVEAALKGDRPFMI 1246

Query: 1623 ETDASDIGFGGILKQKVFDKEQ-IIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQSDL 1681
             TDAS  G G +L Q+  D +Q  IAF SK  +PA+  Y     E LA++ ++ +F++ +
Sbjct: 1247 YTDASRKGIGAVLAQEGPDGQQHPIAFASKALSPAETRYHITDLEALAMMFALRRFKTII 1306

Query: 1682 INQKFLVCVDCKSAKEILQ 1700
                  V  D K    +L+
Sbjct: 1307 YGTAITVFTDHKPLISLLK 1325


>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
            transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
            transcriptase (EC 2.7.7.49); Endonuclease]
          Length = 1237

 Score =  138 bits (348), Expect = 1e-31
 Identities = 111/426 (26%), Positives = 197/426 (46%), Gaps = 15/426 (3%)

Query: 1276 VKSRIEDILENIQSSICSDLPNAF-WERKSHMVELPYEKDF---SDKQISTK--ARPIQM 1329
            +K    ++ ++   +ICS+  + F  E +   V   Y++      D+ + TK    P   
Sbjct: 267  LKKNFPELFKSQLENICSEYIDIFALESEPITVNNLYKQQLRLKDDEPVYTKNYRSPHSQ 326

Query: 1330 NEELLHFCQKEINDLLEKKLIRRSKSPWSCAAFYVNKQAE--IERGTPRLVINYKPLNQA 1387
             EE+    Q ++  L++ K++  S S ++     V K++    ++   RLVI+Y+ +N+ 
Sbjct: 327  VEEI----QAQVQKLIKDKIVEPSVSQYNSPLLLVPKKSSPNSDKKKWRLVIDYRQINKK 382

Query: 1388 LCWIRYPIPNKKDLLARQHDAKVFSKFDMKSGFWQIQLQEKDKYKTAFTVPFGQYEWNVM 1447
            L   ++P+P   D+L +   AK FS  D+ SGF QI+L E  +  T+F+   G Y +  +
Sbjct: 383  LLADKFPLPRIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTSNGSYRFTRL 442

Query: 1448 PFGLKNAPSEFQRIMNEIFNPYS-KLTIVYIDDVLIFSQTLDQHFKHLNTFISVIKRNGL 1506
            PFGLK AP+ FQR+M   F+        +Y+DD+++   +     K+L       +   L
Sbjct: 443  PFGLKIAPNSFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEVFGKCREYNL 502

Query: 1507 AVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVA 1566
             +   K S F  ++ FLGH      I+P ++  +    +P    D    +RF+   NY  
Sbjct: 503  KLHPEKCSFFMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVP-HDADSARRFVAFCNYYR 561

Query: 1567 DFCPQLSTIIKPLHDRLKKDPP-P*SDIHTNVVKQIKLRVKNLPCLYLPNPQAFKIIETD 1625
             F    +   + +    KK+ P   +D        +K ++ N   L  P+      I TD
Sbjct: 562  RFIKNFADYSRHITRLCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSKEFCITTD 621

Query: 1626 ASDIGFGGILKQKVFDKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQSDLINQK 1685
            AS    G +L Q     +  +A+ S+ +   + N ST ++E+ AI  +I  F+  +  + 
Sbjct: 622  ASKQACGAVLTQNHNGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYIYGKH 681

Query: 1686 FLVCVD 1691
            F V  D
Sbjct: 682  FTVKTD 687


>POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
            Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
            3.1.26.4) (RT); Integrase (IN)]
          Length = 1161

 Score = 97.1 bits (240), Expect = 4e-19
 Identities = 78/322 (24%), Positives = 142/322 (43%), Gaps = 26/322 (8%)

Query: 1268 EEQLQQPSVKSRIEDILENIQSSICSDLPNAFWERKSHMVELPYEKDFSDKQISTKARPI 1327
            E  LQQ ++    +++L+ +         +A W+   + V     K  +    +   RP 
Sbjct: 123  ERLLQQTALPKEQKELLQKLFLKY-----DALWQHWENQVGHRRIKPHNIATGTLAPRPQ 177

Query: 1328 Q---MNEELLHFCQKEINDLLEKKLIRRSKSPWSCAAFYVNKQAEIERGTPRLVINYKPL 1384
            +   +N +     Q  I+DLL++ ++ +  S  +   + V K      G  R+V++Y+ +
Sbjct: 178  KQYPINPKAKPSIQIVIDDLLKQGVLIQQNSTMNTPVYPVPKPD----GKWRMVLDYREV 233

Query: 1385 NQALCWIRYPIPNKKDLLARQHDAKVFSKFDMKSGFWQIQLQEKDKYKTAFTVPFGQYEW 1444
            N+ +  I     +   +L+  +  K  +  D+ +GFW   +  +  + TAFT    QY W
Sbjct: 234  NKTIPLIAAQNQHSAGILSSIYRGKYKTTLDLTNGFWAHPITPESYWLTAFTWQGKQYCW 293

Query: 1445 NVMPFGLKNAPSEFQRIMNEIFNPYSKLTIVYIDDVLIFSQTLDQHFKHLNTFISVIKRN 1504
              +P G  N+P+ F   + ++      +   Y+DD+ I      +H + L    S++   
Sbjct: 294  TRLPQGFLNSPALFTADVVDLLKEIPNVQ-AYVDDIYISHDDPQEHLEQLEKIFSILLNA 352

Query: 1505 GLAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQII------DKTQLQRF 1558
            G  VS  K  + Q ++ FLG NI              TD F  +++      D  QLQ  
Sbjct: 353  GYVVSLKKSEIAQREVEFLGFNI-------TKEGRGLTDTFKQKLLNITPPKDLKQLQSI 405

Query: 1559 LGCLNYVADFCPQLSTIIKPLH 1580
            LG LN+  +F P  S ++KPL+
Sbjct: 406  LGLLNFARNFIPNYSELVKPLY 427


  Database: sprot
    Posted date:  Nov 25, 2004 10:54 AM
  Number of letters in database: 59,974,054
  Number of sequences in database:  164,201
  
Lambda     K      H
   0.316    0.134    0.389 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 206,823,710
Number of Sequences: 164201
Number of extensions: 9477616
Number of successful extensions: 42479
Number of sequences better than 10.0: 431
Number of HSP's better than 10.0 without gapping: 142
Number of HSP's successfully gapped in prelim test: 298
Number of HSP's that attempted gapping in prelim test: 39796
Number of HSP's gapped (non-prelim): 2213
length of query: 1703
length of database: 59,974,054
effective HSP length: 124
effective length of query: 1579
effective length of database: 39,613,130
effective search space: 62549132270
effective search space used: 62549132270
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 73 (32.7 bits)


Lotus: description of TM0019a.7