Lotus
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= TM0114.12
         (1414 letters)

Database: sprot 
           164,201 sequences; 59,974,054 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III    202  4e-51
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran...   202  4e-51
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran...   201  2e-50
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran...   182  7e-45
POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse transcript...   175  9e-43
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran...   172  4e-42
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei...   161  1e-38
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei...   158  1e-37
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei...   158  1e-37
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran...   144  2e-33
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro...   130  3e-29
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro...   130  3e-29
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro...   130  3e-29
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro...   127  2e-28
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot...   127  2e-28
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro...   126  4e-28
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot...   121  1e-26
YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II     118  1e-25
POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23...   104  2e-21
POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC 3.4.23...   102  1e-20

>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
          Length = 2186

 Score =  202 bits (515), Expect = 4e-51
 Identities = 126/414 (30%), Positives = 210/414 (50%), Gaps = 9/414 (2%)

Query: 409  LAIRPGATPVIQPMRRMSEEKHKAVQLETEKLIKARFIREVQYPTWLANVVMVKKANGKW 468
            + ++ GA P+ Q  R +       ++   +K++  + IRE + P W + VV+VKK +G  
Sbjct: 934  IELKEGAEPIRQKPRPIPLALKPEIRKMIQKMLNQKVIRESKSP-WSSPVVLVKKKDGSI 992

Query: 469  RMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTAF 528
            RMC DY  +NKV   +++PLPN++  +   +G +L ++ D  +G+ QI +    +E TAF
Sbjct: 993  RMCIDYRKVNKVVKNNAHPLPNIEATLQSLAGKKLYTVFDMIAGFWQIPLDEKSKEITAF 1052

Query: 529  MTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDL 588
                  + +  +PFGL  + A +Q  M++I    +G    VYVDD+++ S     H  D+
Sbjct: 1053 AIGSELFEWNVLPFGLVISPALFQGTMEEIIGDLLGVCAFVYVDDLLIASKDMEQHLQDV 1112

Query: 589  KEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEV 648
            KEA  ++R   MKL   KC    +  ++LG  +T  G+E    K   + +   PT+VKE+
Sbjct: 1113 KEALTRIRKSGMKLRASKCHIAKKEVEYLGHKVTLDGVETQEVKTDKMKQFSRPTNVKEL 1172

Query: 649  QRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEECEQAFTKLKETLATLPVLSKP 708
            Q   G +    +F+      A+   + +     + W +E E AF +LK+ +   PVL++P
Sbjct: 1173 QSFLGLVGYYRKFILNFAQIASSLTSLISAKVAWIWEKEQEIAFQELKKLVCQTPVLAQP 1232

Query: 709  TPGV------PLVLYLAVTDKAVSTVLLQE-EGKKQKVIYFVSHTLQGAELRYQKIEKAA 761
                      P ++Y   + K +  VL QE    +Q  I F S  L  AE RY   +  A
Sbjct: 1233 DVEAALKGDRPFMIYTDASRKGIGAVLAQEGPDGQQHPIAFASKALSPAETRYHITDLEA 1292

Query: 762  LAILKTARRLRPYFQSFQVKVKTD-VPLRQVLQKPDLSGRLVSWSVELSEYDIQ 814
            LA++   RR +       + V TD  PL  +L+   L+ RL  WS+E+ E+D++
Sbjct: 1293 LAMMFALRRFKTIIYGTAITVFTDHKPLISLLKGSPLADRLWRWSIEILEFDVK 1346



 Score =  110 bits (274), Expect = 4e-23
 Identities = 144/606 (23%), Positives = 244/606 (39%), Gaps = 64/606 (10%)

Query: 815  YEPRGQVTVQSLIDFVAELT----PTEGEKTQGEWVLSVDGSSNNTGSGAGITIESPDKM 870
            +E   ++  Q L   V +      P      +G+    +   ++  G GA +  E PD  
Sbjct: 1208 WEKEQEIAFQELKKLVCQTPVLAQPDVEAALKGDRPFMIYTDASRKGIGAVLAQEGPDGQ 1267

Query: 871  IIEQSLKFEFKA-SNNQSEYEAL-IAGLRLAIELGVQKLFIKG-------DSQLVVKQVK 921
              +  + F  KA S  ++ Y    +  L +   L   K  I G       D + ++  +K
Sbjct: 1268 --QHPIAFASKALSPAETRYHITDLEALAMMFALRRFKTIIYGTAITVFTDHKPLISLLK 1325

Query: 922  GEYQVKDPQLSKYLEVVRRLMMEVKE--IKIEHVPRGQNERADVLAKLA---------ST 970
            G         S   + + R  +E+ E  +KI ++    N  AD L++            T
Sbjct: 1326 G---------SPLADRLWRWSIEILEFDVKIVYLAGKANAVADALSRGGCPPNELEEEQT 1376

Query: 971  GRLGNYQTVIQETLPRPSIDLVEIK--LKVVKSVNEGELPWMESIKTFLENPPKEDDLNT 1028
              L +    IQ  LP    D+++    L+ +K  +EG   W E I   LE    +     
Sbjct: 1377 KELTSIVNAIQTELP----DILDSSCWLERLKGEDEG---WKEVIAA-LEGGKTKGTFKI 1428

Query: 1029 RTKRREAS--FYTLVDGELYRRGIMSPMLKCVDTKDALGIMAEVHEGVCSSHIGGRSLAV 1086
                 E S  +Y +V G L    I       V  K    ++ E+HEG+ + H G + +  
Sbjct: 1429 VGIESEISLEYYKIVGGVLKNTEIEEQSRSVVPEKIRTPLLKELHEGMLAGHFGIKKMW- 1487

Query: 1087 KVIRAGFYWPTMKKDCLEYVKKCEKCQVFSDLHKAPPEELTTMMAPWPFAMWGTDILGPF 1146
            +++   FYWP M+      V+ C KC   +D H      LT     +P  +   D++   
Sbjct: 1488 RMVHRKFYWPQMRVCVENCVRTCAKCLCAND-HSKLTSSLTPYRMTFPLEIVACDLMD-V 1545

Query: 1147 PVAKAQMKYIIVAVDYFTKWIEAEAVATITAAKVRNFLWQRIVCRFG-VPMALVMDNGTQ 1205
             ++    +YI+  +D FTK+  A  +    A  V     +R     G +P+ L+ D G +
Sbjct: 1546 GLSVQGNRYILTIIDLFTKYGTAVPIPDKKAETVLKAFVERWAIGEGRIPLKLLTDQGKE 1605

Query: 1206 FTSSVTREFCAEMGIEMRFASVEHPQTNGQAESANKVILKGLKKKLDEAKGLWAEELPGV 1265
            F + +  +F   + IE       + + NG  E  NK I+  +KKK       W +++   
Sbjct: 1606 FVNGLFAQFTHMLKIEHITTKGYNSRANGAVERFNKTIMHIMKKKTAVPME-WDDQVVYA 1664

Query: 1266 LWAYNTTEQSSTKETPYRLTYGTDAMLSVEIENQSWRVARFNENDNGENLIANLIMLPEE 1325
            ++AYN     +T ETP  L +G D M  +E+  +      + + D  ++L+   ++  ++
Sbjct: 1665 VYAYNNCVHENTGETPMFLMHGRDVMGPLEMSGEDAVGINYADMDEYKHLLTQELLKVQK 1724

Query: 1326 QREAHIRNEAGKVKVARKFSTKVVPRKMRV---GDLVL*KNTIPDKH-----NKLSPNWG 1377
              + H   E    K    F  K   +K R    G  VL +  IP +       KL   W 
Sbjct: 1725 IAKEHAMREQESYK--SLFDQKYASKKHRFPQPGSRVLLE--IPSEKLGAQCPKLVNKWS 1780

Query: 1378 GPYRII 1383
            GPYR+I
Sbjct: 1781 GPYRVI 1786


>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
           transposon 17.6 [Contains: Protease (EC 3.4.23.-);
           Reverse transcriptase (EC 2.7.7.49); Endonuclease]
          Length = 1058

 Score =  202 bits (515), Expect = 4e-51
 Identities = 124/363 (34%), Positives = 182/363 (49%), Gaps = 6/363 (1%)

Query: 452 PTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYS 511
           P W+           K+R+  DY  LN++   D +P+PN+D+++         + +D   
Sbjct: 246 PIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDRHPIPNMDEILGKLGRCNYFTTIDLAK 305

Query: 512 GYNQIMMHPSDEESTAFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYV 571
           G++QI M P     TAF T   +Y Y  MPFGLKNA AT+QR M+ I    + ++  VY+
Sbjct: 306 GFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKNAPATFQRCMNDILRPLLNKHCLVYL 365

Query: 572 DDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPD 631
           DD+IV S    +H   L   F++L    +KL  +KC F  Q   FLG +LT  GI+ NP+
Sbjct: 366 DDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDKCEFLKQETTFLGHVLTPDGIKPNPE 425

Query: 632 KGRAILEMKSPTSVKEVQRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTE-ECEQ 690
           K  AI +   PT  KE++   G      +F+P   D A P   CLKKN K   T  E + 
Sbjct: 426 KIEAIQKYPIPTKPKEIKAFLGLTGYYRKFIPNFADIAKPMTKCLKKNMKIDTTNPEYDS 485

Query: 691 AFTKLKETLATLPVLSKPTPGVPLVLYLAVTDKAVSTVLLQEEGKKQKVIYFVSHTLQGA 750
           AF KLK  ++  P+L  P       L    +D A+  VL Q+       + ++S TL   
Sbjct: 486 AFKKLKYLISEDPILKVPDFTKKFTLTTDASDVALGAVLSQD----GHPLSYISRTLNEH 541

Query: 751 ELRYQKIEKAALAILKTARRLRPYFQSFQVKVKTD-VPLRQVLQKPDLSGRLVSWSVELS 809
           E+ Y  IEK  LAI+   +  R Y      ++ +D  PL  + +  D + +L  W V+LS
Sbjct: 542 EINYSTIEKELLAIVWATKTFRHYLLGRHFEISSDHQPLSWLYRMKDPNSKLTRWRVKLS 601

Query: 810 EYD 812
           E+D
Sbjct: 602 EFD 604


>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
           transposon 297 [Contains: Protease (EC 3.4.23.-);
           Reverse transcriptase (EC 2.7.7.49); Endonuclease]
          Length = 1059

 Score =  201 bits (510), Expect = 2e-50
 Identities = 128/408 (31%), Positives = 201/408 (48%), Gaps = 12/408 (2%)

Query: 416 TPVIQPMRRMSEEKHKAVQLETEKLIKARFIRE----VQYPTWLANVVMVKKANGKWRMC 471
           +P+      +++     V+ + ++++    IRE       PTW+           K+R+ 
Sbjct: 205 SPIYSKQYPLAQTHEIEVENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVV 264

Query: 472 TDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTAFMTN 531
            DY  LN++   D YP+PN+D+++      +  + +D   G++QI M       TAF T 
Sbjct: 265 IDYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTK 324

Query: 532 QANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDLKEA 591
             +Y Y  MPFGL+NA AT+QR M+ I    + ++  VY+DD+I+ S   ++H   ++  
Sbjct: 325 SGHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLV 384

Query: 592 FDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRL 651
           F +L    +KL  +KC F  +   FLG ++T  GI+ NP K +AI+    PT  KE++  
Sbjct: 385 FTKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAF 444

Query: 652 TGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEECE--QAFTKLKETLATLPVLSKPT 709
            G      +F+P   D A P  +CLKK +K   T++ E  +AF KLK  +   P+L  P 
Sbjct: 445 LGLTGYYRKFIPNYADIAKPMTSCLKKRTKID-TQKLEYIEAFEKLKALIIRDPILQLPD 503

Query: 710 PGVPLVLYLAVTDKAVSTVLLQEEGKKQKVIYFVSHTLQGAELRYQKIEKAALAILKTAR 769
                VL    ++ A+  VL Q        I F+S TL   EL Y  IEK  LAI+   +
Sbjct: 504 FEKKFVLTTDASNLALGAVLSQ----NGHPISFISRTLNDHELNYSAIEKELLAIVWATK 559

Query: 770 RLRPYFQSFQVKVKTD-VPLRQVLQKPDLSGRLVSWSVELSEYDIQYE 816
             R Y    Q  + +D  PLR +    +   +L  W V LSEY  + +
Sbjct: 560 TFRHYLLGRQFLIASDHQPLRWLHNLKEPGAKLERWRVRLSEYQFKID 607


>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
           transposon opus [Contains: Protease (EC 3.4.23.-);
           Reverse transcriptase (EC 2.7.7.49); Endonuclease]
          Length = 1003

 Score =  182 bits (461), Expect = 7e-45
 Identities = 128/406 (31%), Positives = 205/406 (49%), Gaps = 23/406 (5%)

Query: 433 VQLETEKLIKARFIRE----VQYPTWLANVVMVKKANGK--WRMCTDYTSLNKVCPKDSY 486
           V+ + ++L++   IR        P W+  V    K NG+  +RM  D+  LN V   D+Y
Sbjct: 139 VERQIDELLQDGIIRPSNSPYNSPIWI--VPKKPKPNGEKQYRMVVDFKRLNTVTIPDTY 196

Query: 487 PLPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTAFMTNQANYCYKTMPFGLKN 546
           P+P+++  +      +  + +D  SG++QI M  SD   TAF T    Y +  +PFGLKN
Sbjct: 197 PIPDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPFGLKN 256

Query: 547 AGATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEK 606
           A A +QR++D I  + +G+   VY+DD+IV S     H  +L+     L    +++N EK
Sbjct: 257 APAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNLEK 316

Query: 607 CSFGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRLTGRMAALSRFLPMAG 666
             F     +FLG+++T+ GI+ +P K RAI EM  PTSVKE++R  G  +   +F+    
Sbjct: 317 SHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYRKFIQDYA 376

Query: 667 DKAAPFFTCLK---------KNSKFQWT--EECEQAFTKLKETLATLPVLSKPTPGVPLV 715
             A P     +         ++SK   T  E   Q+F  LK  L +  +L+ P    P  
Sbjct: 377 KVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKPFH 436

Query: 716 LYLAVTDKAVSTVLLQEEGKKQKVIYFVSHTLQGAELRYQKIEKAALAILKTARRLRPY- 774
           L    ++ A+  VL Q++  + + I ++S +L   E  Y  IEK  LAI+ +   LR Y 
Sbjct: 437 LTTDASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLDNLRAYL 496

Query: 775 FQSFQVKVKTD-VPLRQVLQKPDLSGRLVSWSVELSEYDIQ--YEP 817
           + +  +KV TD  PL   L   + + +L  W   + EY+ +  Y+P
Sbjct: 497 YGAGTIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELIYKP 542



 Score = 44.3 bits (103), Expect = 0.002
 Identities = 41/207 (19%), Positives = 83/207 (39%), Gaps = 10/207 (4%)

Query: 1077 SHIGGRSLAVKVIRAGFYWPTMKKDCLEYVKKCEKCQVFS-DLHKAPPEELTTMMAPWPF 1135
            +H G   + ++++   +Y+P M          C+ C+++  + H   P    T +  +P 
Sbjct: 704  AHRGPTEIRLQLLEK-YYFPRMSSTIRLQTSSCQCCKLYKYERHPNKPNLQPTPIPNYPC 762

Query: 1136 AMWGTDILGPFPVAKAQMKYIIVAVDYFTKWIEAEAVATITAAKVRNFLWQRIVCRFGVP 1195
             +   DI         + +  +  +D F+K+ +   + +  +  +R  L + +   F  P
Sbjct: 763  EILHIDIFA------LEKRLYLSCIDKFSKFAKLFHLQSKASVHLRETLVEALHY-FTAP 815

Query: 1196 MALVMDNGTQFTSSVTREFCAEMGIEMRFASVEHPQTNGQAESANKVILKGLKKKLDEAK 1255
              LV DN           +   + I++ +A  +  + NGQ E  +   L+  +   DE  
Sbjct: 816  KVLVSDNERGLLCPTVLNYLRSLDIDLYYAPTQKSEVNGQVERFHSTFLEIYRCLKDELP 875

Query: 1256 GLWAEELPGV-LWAYNTTEQSSTKETP 1281
                 EL  + +  YNT+  S T   P
Sbjct: 876  TFKPVELVHIAVDRYNTSVHSVTNRKP 902


>POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse
            transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
            (RT); Integrase (IN)]
          Length = 886

 Score =  175 bits (443), Expect = 9e-43
 Identities = 201/881 (22%), Positives = 362/881 (40%), Gaps = 105/881 (11%)

Query: 458  VVMVKKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYSGYNQIM 517
            V  V K +G+WRM  DY  +NK  P  +    +   ++      +  + +D  +G+    
Sbjct: 5    VYPVPKPDGRWRMVLDYREVNKTIPLTAAQNQHSAGILATIVRQKYKTTLDLANGF---W 61

Query: 518  MHPSDEES---TAFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYVDDM 574
             HP   ES   TAF      YC+  +P G  N+ A +    D +   +   N++VYVDD+
Sbjct: 62   AHPITPESYWLTAFTWQGKQYCWTRLPQGFLNSPALFTA--DVVDLLKEIPNVQVYVDDI 119

Query: 575  IVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPDKGR 634
             +      +H   L++ F  L      ++ +K   G +  +FLGF +T  G  +      
Sbjct: 120  YLSHDDPKEHVQQLEKVFQILLQAGYVVSLKKSEIGQKTVEFLGFNITKEGRGLTDTFKT 179

Query: 635  AILEMKSPTSVKEVQRLTGRMAALSRFLPMAGDKAAPFFTCLK--KNSKFQWTEECEQAF 692
             +L +  P  +K++Q + G +     F+P   +   P +  +   K    +W+EE  +  
Sbjct: 180  KLLNITPPKDLKQLQSILGLLNFARNFIPNFAELVQPLYNLIASAKGKYIEWSEENTKQL 239

Query: 693  TKLKETLATLPVLSKPTPGVPLVLYLAVTDKAVSTVLLQEEGKKQKVIYFVSHTLQGAEL 752
              + E L T   L +  P   LV+ +  +  A       E GKK   I ++++    AEL
Sbjct: 240  NMVIEALNTASNLEERLPEQRLVIKVNTSPSAGYVRYYNETGKKP--IMYLNYVFSKAEL 297

Query: 753  RYQKIEKAALAILKTARRLRPYFQSFQVKVKTDVPLRQVLQKPDLSGRL------VSWSV 806
            ++  +EK    + K   +        ++ V + +     +QK  L  R       ++W  
Sbjct: 298  KFSMLEKLLTTMHKALIKAMDLAMGQEILVYSPIVSMTKIQKTPLPERKALPIRWITWMT 357

Query: 807  ELSEYDIQYE-PRGQVTVQSLIDFVAELTPTEGEKTQGEWVLSVDGS---------SNNT 856
             L +  IQ+   +    ++ + D            +Q E V   DGS         SNN 
Sbjct: 358  YLEDPRIQFHYDKTLPELKHIPDVYTSSQSPVKHPSQYEGVFYTDGSAIKSPDPTKSNNA 417

Query: 857  GSGAGITIESPDKMIIEQSLKFEFKASNNQSEYEALIAGLRLAIELGVQ---KLFIKGDS 913
            G G       P+  ++    ++     N+ ++  A IA +  A +  ++    + +  DS
Sbjct: 418  GMGIVHATYKPEYQVLN---QWSIPLGNHTAQM-AEIAAVEFACKKALKIPGPVLVITDS 473

Query: 914  QLVVKQVKGEYQV----------KDP--QLSKYLEVVRRLMMEVKEIKIEHVPRGQNERA 961
              V +    E             K P   +SK+  +   L M+  +I I+H  +G + + 
Sbjct: 474  FYVAESANKELPYWKSNGFVNNKKKPLKHISKWKSIAECLSMK-PDITIQH-EKGISLQI 531

Query: 962  DVLA--------KLASTGRLGNYQTVIQETLPRPSIDLVEIKLKVVKSVNEGELPWMESI 1013
             V          KLA+ G       V+     +P++D        +  + +G        
Sbjct: 532  PVFILKGNALADKLATQGSY-----VVNCNTKKPNLDAE------LDQLLQGH------- 573

Query: 1014 KTFLENPPKEDDLNTRTKRREASFYTLVDGELYRRGIMSPM-LKCVDTK-DALGIMAEVH 1071
              +++  PK+              Y L DG++    +  P  +K +  + D   I+ + H
Sbjct: 574  --YIKGYPKQYT------------YFLEDGKVK---VSRPEGVKIIPPQSDRQKIVLQAH 616

Query: 1072 EGVCSSHIGGRSLAVKVIRAGFYWPTMKKDCLEYVKKCEKCQVFSDLHKAPPEELTTMMA 1131
                 +H G  +  +K+    ++WP M+KD ++ + +C++C + +  +KA    L     
Sbjct: 617  N---LAHTGREATLLKIANL-YWWPNMRKDVVKQLGRCQQCLITNASNKASGPILRPDRP 672

Query: 1132 PWPFAMWGTDILGPFPVAKAQMKYIIVAVDYFT--KWIEAEAVATITAAKVRNFLWQRIV 1189
              PF  +  D +GP P ++  + Y++V VD  T   W+     A  T+A V++     ++
Sbjct: 673  QKPFDKFFIDYIGPLPPSQGYL-YVLVVVDGMTGFTWLYPTK-APSTSATVKSL---NVL 727

Query: 1190 CRFGVPMALVMDNGTQFTSSVTREFCAEMGIEMRFASVEHPQTNGQAESANKVILKGLKK 1249
                +P  +  D G  FTSS   E+  E GI + F++  HPQ+  + E  N  I + L K
Sbjct: 728  TSIAIPKVIHSDQGAAFTSSTFAEWAKERGIHLEFSTPYHPQSGSKVERKNSDIKRLLTK 787

Query: 1250 KLDEAKGLWAEELPGVLWAYNTTEQSSTKETPYRLTYGTDA 1290
             L      W + LP V  A N T     K TP++L +G D+
Sbjct: 788  LLVGRPTKWYDLLPVVQLALNNTYSPVLKYTPHQLLFGIDS 828


>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
           transposon 412 [Contains: Protease (EC 3.4.23.-);
           Reverse transcriptase (EC 2.7.7.49); Endonuclease]
          Length = 1237

 Score =  172 bits (437), Expect = 4e-42
 Identities = 119/406 (29%), Positives = 176/406 (43%), Gaps = 6/406 (1%)

Query: 417 PVIQPMRRMSEEKHKAVQLETEKLIKARFIREV--QYPTWLANVVMVKKANG---KWRMC 471
           PV     R    + + +Q + +KLIK + +     QY + L  V      N    KWR+ 
Sbjct: 314 PVYTKNYRSPHSQVEEIQAQVQKLIKDKIVEPSVSQYNSPLLLVPKKSSPNSDKKKWRLV 373

Query: 472 TDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTAFMTN 531
            DY  +NK    D +PLP +D ++D     +  S +D  SG++QI +     + T+F T+
Sbjct: 374 IDYRQINKKLLADKFPLPRIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTS 433

Query: 532 QANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDLKEA 591
             +Y +  +PFGLK A  ++QR+M   FS        +Y+DD+IV          +L E 
Sbjct: 434 NGSYRFTRLPFGLKIAPNSFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEV 493

Query: 592 FDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRL 651
           F + R Y +KL+PEKCSF +    FLG   T +GI  +  K   I     P      +R 
Sbjct: 494 FGKCREYNLKLHPEKCSFFMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRF 553

Query: 652 TGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEECEQAFTKLKETLATLPVLSKPTPG 711
                   RF+    D +       KKN  F+WT+EC++AF  LK  L    +L  P   
Sbjct: 554 VAFCNYYRRFIKNFADYSRHITRLCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFS 613

Query: 712 VPLVLYLAVTDKAVSTVLLQEEGKKQKVIYFVSHTLQGAELRYQKIEKAALAILKTARRL 771
               +    + +A   VL Q     Q  + + S      E      E+   AI       
Sbjct: 614 KEFCITTDASKQACGAVLTQNHNGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHF 673

Query: 772 RPYFQSFQVKVKTD-VPLRQVLQKPDLSGRLVSWSVELSEYDIQYE 816
           RPY       VKTD  PL  +    + S +L    +EL EY+   E
Sbjct: 674 RPYIYGKHFTVKTDHRPLTYLFSMVNPSSKLTRIRLELEEYNFTVE 719



 Score =  110 bits (276), Expect = 2e-23
 Identities = 78/338 (23%), Positives = 148/338 (43%), Gaps = 8/338 (2%)

Query: 1050 IMSPMLKCVDTKDALGIMAEVHEGVCSSHIGGRSLAVKVIRAGFYWPTMKKDCLEYVKKC 1109
            +++P+ +  + K+   I++ +H+        G +  +  ++  +YW  M K   EYV+KC
Sbjct: 880  LLNPVTQINNEKEKEAILSTLHDDPIQGGHTGITKTLAKVKRHYYWKNMSKYIKEYVRKC 939

Query: 1110 EKCQVFSDLHKAPPEELTTMMAPWPFAMWGTDILGPFPVAKAQMKYIIVAVDYFTKWIEA 1169
            +KCQ              T      F     D +GP P ++   +Y +  +   TK++ A
Sbjct: 940  QKCQKAKTTKHTKTPMTITETPEHAFDRVVVDTIGPLPKSENGNEYAVTLICDLTKYLVA 999

Query: 1170 EAVATITAAKVRNFLWQRIVCRFGVPMALVMDNGTQFTSSVTREFCAEMGIEMRFASVEH 1229
              +A  +A  V   +++  + ++G     + D GT++ +S+  + C  + I+   ++  H
Sbjct: 1000 IPIANKSAKTVAKAIFESFILKYGPMKTFITDMGTEYKNSIITDLCKYLKIKNITSTAHH 1059

Query: 1230 PQTNGQAESANKVILKGLKKKLDEAKGLWAEELPGVLWAYNTTEQSSTKETPYRLTYGTD 1289
             QT G  E +++ + + ++  +   K  W   L   ++ +NTT+       PY L +G  
Sbjct: 1060 HQTVGVVERSHRTLNEYIRSYISTDKTDWDVWLQYFVYCFNTTQSMVHNYCPYELVFGRT 1119

Query: 1290 AMLSVEIENQSWRVARFNENDNGENLIANLIMLPEEQREAHIRNEAGKVKVARKFSTKVV 1349
            + L             +N +D  +    +   L      A    EA K K    +  KV 
Sbjct: 1120 SNLPKHFNKLHSIEPIYNIDDYAKE---SKYRLEVAYARARKLLEAHKEKNKENYDLKVK 1176

Query: 1350 PRKMRVGDLVL*KNTIPDKHNKLSPNWGGPYRI--IGD 1385
              ++ VGD VL +N +    +KL   + GPY+I  IGD
Sbjct: 1177 DIELEVGDKVLLRNEV---GHKLDFKYTGPYKIESIGD 1211


>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
           type 1
          Length = 1333

 Score =  161 bits (408), Expect = 1e-38
 Identities = 109/397 (27%), Positives = 192/397 (47%), Gaps = 9/397 (2%)

Query: 429 KHKAVQLETEKLIKARFIREVQYPTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPL 488
           K +A+  E  + +K+  IRE +       V+ V K  G  RM  DY  LNK    + YPL
Sbjct: 424 KMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPL 482

Query: 489 PNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTAFMTNQANYCYKTMPFGLKNAG 548
           P +++L+    G+ + + +D  S Y+ I +   DE   AF   +  + Y  MP+G+  A 
Sbjct: 483 PLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAP 542

Query: 549 ATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCS 608
           A +Q  ++ I  +    ++  Y+DD+++ S   S+H   +K+   +L+   + +N  KC 
Sbjct: 543 AHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCE 602

Query: 609 FGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRLTGRMAALSRFLPMAGDK 668
           F     KF+G+ ++ +G     +    +L+ K P + KE+++  G +  L +F+P     
Sbjct: 603 FHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQL 662

Query: 669 AAPFFTCLKKNSKFQWTEECEQAFTKLKETLATLPVLSKPTPGVPLVLYLAVTDKAVSTV 728
             P    LKK+ +++WT    QA   +K+ L + PVL        ++L    +D AV  V
Sbjct: 663 THPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAV 722

Query: 729 LLQE-EGKKQKVIYFVSHTLQGAELRYQKIEKAALAILKTARRLRPYFQSF--QVKVKTD 785
           L Q+ +  K   + + S  +  A+L Y   +K  LAI+K+ +  R Y +S     K+ TD
Sbjct: 723 LSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTD 782

Query: 786 ---VPLRQVLQKPDLSGRLVSWSVELSE--YDIQYEP 817
              +  R   +    + RL  W + L +  ++I Y P
Sbjct: 783 HRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRP 819



 Score =  124 bits (311), Expect = 2e-27
 Identities = 132/556 (23%), Positives = 236/556 (41%), Gaps = 61/556 (10%)

Query: 877  KFEFKASNNQSEYEALIAGL---RLAIELGVQKLFIKGDSQLVVKQVKGEYQVKDPQLSK 933
            K +   S +  E  A+I  L   R  +E  ++   I  D + ++ ++  E + ++ +L++
Sbjct: 744  KAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLAR 803

Query: 934  YLEVVRRLMMEVKEIKIEHVPRGQNERADVLAKLASTGRLGNYQTVIQETLPRPSIDLVE 993
            +     +L ++    +I + P   N  AD L++            ++ ET P P  D  +
Sbjct: 804  W-----QLFLQDFNFEINYRPGSANHIADALSR------------IVDETEPIPK-DSED 845

Query: 994  IKLKVVKSVNEGELPWMESIKTFLENPPKEDDLNTRTKRREASFYTLVDGELYRRGIMSP 1053
              +  V  ++  +    + +  +  +    + LN   KR E +   L DG L        
Sbjct: 846  NSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQ-LKDGLLINS--KDQ 902

Query: 1054 MLKCVDTKDALGIMAEVHEGVCSSHIGGRSLAVKVIRAGFYWPTMKKDCLEYVKKCEKCQ 1113
            +L   DT+    I+ + HE     H  G  L   +I   F W  ++K   EYV+ C  CQ
Sbjct: 903  ILLPNDTQLTRTIIKKYHEEGKLIH-PGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQ 961

Query: 1114 V--------FSDLHKAPPEELTTMMAPWPFAMWGTDILGPFPVAKAQMKYIIVAVDYFTK 1165
            +        +  L   PP E        P+     D +   P +      + V VD F+K
Sbjct: 962  INKSRNHKPYGPLQPIPPSER-------PWESLSMDFITALPESSGY-NALFVVVDRFSK 1013

Query: 1166 W-IEAEAVATITAAKVRNFLWQRIVCRFGVPMALVMDNGTQFTSSVTREFCAEMGIEMRF 1224
              I      +ITA +      QR++  FG P  ++ DN   FTS   ++F  +    M+F
Sbjct: 1014 MAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKF 1073

Query: 1225 ASVEHPQTNGQAESANKVILKGLKKKLDEAKGLWAEELPGVLWAYNTTEQSSTKETPYRL 1284
            +    PQT+GQ E  N+ + K L+         W + +  V  +YN    S+T+ TP+ +
Sbjct: 1074 SLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEI 1133

Query: 1285 TYGTDAMLS-VEIENQSWRVARFNENDNGENLIANLIMLPEEQREAHIRNEAGKVKVARK 1343
             +     LS +E+ + S +      ++N +  I     + E     H+      +K+ + 
Sbjct: 1134 VHRYSPALSPLELPSFSDKT-----DENSQETIQVFQTVKE-----HL--NTNNIKMKKY 1181

Query: 1344 FSTKVVP-RKMRVGDLVL*KNT---IPDKHNKLSPNWGGPYRIIGDVGGEAYKLEQLSGQ 1399
            F  K+    + + GDLV+ K T      K NKL+P++ GP+ ++   G   Y+L+     
Sbjct: 1182 FDMKIQEIEEFQPGDLVMVKRTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSI 1241

Query: 1400 K--VPRTWNASHLKQY 1413
            K     T++ SHL++Y
Sbjct: 1242 KHMFSSTFHVSHLEKY 1257


>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
           type 3
          Length = 1333

 Score =  158 bits (399), Expect = 1e-37
 Identities = 108/397 (27%), Positives = 192/397 (48%), Gaps = 9/397 (2%)

Query: 429 KHKAVQLETEKLIKARFIREVQYPTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPL 488
           K +A+  E  + +K+  IRE +       V+ V K  G  RM  DY  LNK    + YPL
Sbjct: 424 KMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPL 482

Query: 489 PNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTAFMTNQANYCYKTMPFGLKNAG 548
           P +++L+    G+ + + +D  S Y+ I +   DE   AF   +  + Y  MP+G+  A 
Sbjct: 483 PLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAP 542

Query: 549 ATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCS 608
           A +Q  ++ I  +    ++  Y+D++++ S   S+H   +K+   +L+   + +N  KC 
Sbjct: 543 AHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCE 602

Query: 609 FGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRLTGRMAALSRFLPMAGDK 668
           F     KF+G+ ++ +G     +    +L+ K P + KE+++  G +  L +F+P     
Sbjct: 603 FHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQL 662

Query: 669 AAPFFTCLKKNSKFQWTEECEQAFTKLKETLATLPVLSKPTPGVPLVLYLAVTDKAVSTV 728
             P    LKK+ +++WT    QA   +K+ L + PVL        ++L    +D AV  V
Sbjct: 663 THPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAV 722

Query: 729 LLQE-EGKKQKVIYFVSHTLQGAELRYQKIEKAALAILKTARRLRPYFQSF--QVKVKTD 785
           L Q+ +  K   + + S  +  A+L Y   +K  LAI+K+ +  R Y +S     K+ TD
Sbjct: 723 LSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTD 782

Query: 786 ---VPLRQVLQKPDLSGRLVSWSVELSE--YDIQYEP 817
              +  R   +    + RL  W + L +  ++I Y P
Sbjct: 783 HRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRP 819



 Score =  124 bits (311), Expect = 2e-27
 Identities = 132/556 (23%), Positives = 236/556 (41%), Gaps = 61/556 (10%)

Query: 877  KFEFKASNNQSEYEALIAGL---RLAIELGVQKLFIKGDSQLVVKQVKGEYQVKDPQLSK 933
            K +   S +  E  A+I  L   R  +E  ++   I  D + ++ ++  E + ++ +L++
Sbjct: 744  KAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLAR 803

Query: 934  YLEVVRRLMMEVKEIKIEHVPRGQNERADVLAKLASTGRLGNYQTVIQETLPRPSIDLVE 993
            +     +L ++    +I + P   N  AD L++            ++ ET P P  D  +
Sbjct: 804  W-----QLFLQDFNFEINYRPGSANHIADALSR------------IVDETEPIPK-DSED 845

Query: 994  IKLKVVKSVNEGELPWMESIKTFLENPPKEDDLNTRTKRREASFYTLVDGELYRRGIMSP 1053
              +  V  ++  +    + +  +  +    + LN   KR E +   L DG L        
Sbjct: 846  NSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQ-LKDGLLINS--KDQ 902

Query: 1054 MLKCVDTKDALGIMAEVHEGVCSSHIGGRSLAVKVIRAGFYWPTMKKDCLEYVKKCEKCQ 1113
            +L   DT+    I+ + HE     H  G  L   +I   F W  ++K   EYV+ C  CQ
Sbjct: 903  ILLPNDTQLTRTIIKKYHEEGKLIH-PGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQ 961

Query: 1114 V--------FSDLHKAPPEELTTMMAPWPFAMWGTDILGPFPVAKAQMKYIIVAVDYFTK 1165
            +        +  L   PP E        P+     D +   P +      + V VD F+K
Sbjct: 962  INKSRNHKPYGPLQPIPPSER-------PWESLSMDFITALPESSGY-NALFVVVDRFSK 1013

Query: 1166 W-IEAEAVATITAAKVRNFLWQRIVCRFGVPMALVMDNGTQFTSSVTREFCAEMGIEMRF 1224
              I      +ITA +      QR++  FG P  ++ DN   FTS   ++F  +    M+F
Sbjct: 1014 MAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKF 1073

Query: 1225 ASVEHPQTNGQAESANKVILKGLKKKLDEAKGLWAEELPGVLWAYNTTEQSSTKETPYRL 1284
            +    PQT+GQ E  N+ + K L+         W + +  V  +YN    S+T+ TP+ +
Sbjct: 1074 SLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEI 1133

Query: 1285 TYGTDAMLS-VEIENQSWRVARFNENDNGENLIANLIMLPEEQREAHIRNEAGKVKVARK 1343
             +     LS +E+ + S +      ++N +  I     + E     H+      +K+ + 
Sbjct: 1134 VHRYSPALSPLELPSFSDKT-----DENSQETIQVFQTVKE-----HL--NTNNIKMKKY 1181

Query: 1344 FSTKVVP-RKMRVGDLVL*KNT---IPDKHNKLSPNWGGPYRIIGDVGGEAYKLEQLSGQ 1399
            F  K+    + + GDLV+ K T      K NKL+P++ GP+ ++   G   Y+L+     
Sbjct: 1182 FDMKIQEIEEFQPGDLVMVKRTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSI 1241

Query: 1400 K--VPRTWNASHLKQY 1413
            K     T++ SHL++Y
Sbjct: 1242 KHMFSSTFHVSHLEKY 1257


>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
           type 2
          Length = 1333

 Score =  158 bits (399), Expect = 1e-37
 Identities = 108/397 (27%), Positives = 192/397 (48%), Gaps = 9/397 (2%)

Query: 429 KHKAVQLETEKLIKARFIREVQYPTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPL 488
           K +A+  E  + +K+  IRE +       V+ V K  G  RM  DY  LNK    + YPL
Sbjct: 424 KMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPL 482

Query: 489 PNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTAFMTNQANYCYKTMPFGLKNAG 548
           P +++L+    G+ + + +D  S Y+ I +   DE   AF   +  + Y  MP+G+  A 
Sbjct: 483 PLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAP 542

Query: 549 ATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCS 608
           A +Q  ++ I  +    ++  Y+D++++ S   S+H   +K+   +L+   + +N  KC 
Sbjct: 543 AHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCE 602

Query: 609 FGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRLTGRMAALSRFLPMAGDK 668
           F     KF+G+ ++ +G     +    +L+ K P + KE+++  G +  L +F+P     
Sbjct: 603 FHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQL 662

Query: 669 AAPFFTCLKKNSKFQWTEECEQAFTKLKETLATLPVLSKPTPGVPLVLYLAVTDKAVSTV 728
             P    LKK+ +++WT    QA   +K+ L + PVL        ++L    +D AV  V
Sbjct: 663 THPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAV 722

Query: 729 LLQE-EGKKQKVIYFVSHTLQGAELRYQKIEKAALAILKTARRLRPYFQSF--QVKVKTD 785
           L Q+ +  K   + + S  +  A+L Y   +K  LAI+K+ +  R Y +S     K+ TD
Sbjct: 723 LSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTD 782

Query: 786 ---VPLRQVLQKPDLSGRLVSWSVELSE--YDIQYEP 817
              +  R   +    + RL  W + L +  ++I Y P
Sbjct: 783 HRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRP 819



 Score =  124 bits (311), Expect = 2e-27
 Identities = 132/556 (23%), Positives = 236/556 (41%), Gaps = 61/556 (10%)

Query: 877  KFEFKASNNQSEYEALIAGL---RLAIELGVQKLFIKGDSQLVVKQVKGEYQVKDPQLSK 933
            K +   S +  E  A+I  L   R  +E  ++   I  D + ++ ++  E + ++ +L++
Sbjct: 744  KAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLAR 803

Query: 934  YLEVVRRLMMEVKEIKIEHVPRGQNERADVLAKLASTGRLGNYQTVIQETLPRPSIDLVE 993
            +     +L ++    +I + P   N  AD L++            ++ ET P P  D  +
Sbjct: 804  W-----QLFLQDFNFEINYRPGSANHIADALSR------------IVDETEPIPK-DSED 845

Query: 994  IKLKVVKSVNEGELPWMESIKTFLENPPKEDDLNTRTKRREASFYTLVDGELYRRGIMSP 1053
              +  V  ++  +    + +  +  +    + LN   KR E +   L DG L        
Sbjct: 846  NSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQ-LKDGLLINS--KDQ 902

Query: 1054 MLKCVDTKDALGIMAEVHEGVCSSHIGGRSLAVKVIRAGFYWPTMKKDCLEYVKKCEKCQ 1113
            +L   DT+    I+ + HE     H  G  L   +I   F W  ++K   EYV+ C  CQ
Sbjct: 903  ILLPNDTQLTRTIIKKYHEEGKLIH-PGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQ 961

Query: 1114 V--------FSDLHKAPPEELTTMMAPWPFAMWGTDILGPFPVAKAQMKYIIVAVDYFTK 1165
            +        +  L   PP E        P+     D +   P +      + V VD F+K
Sbjct: 962  INKSRNHKPYGPLQPIPPSER-------PWESLSMDFITALPESSGY-NALFVVVDRFSK 1013

Query: 1166 W-IEAEAVATITAAKVRNFLWQRIVCRFGVPMALVMDNGTQFTSSVTREFCAEMGIEMRF 1224
              I      +ITA +      QR++  FG P  ++ DN   FTS   ++F  +    M+F
Sbjct: 1014 MAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKF 1073

Query: 1225 ASVEHPQTNGQAESANKVILKGLKKKLDEAKGLWAEELPGVLWAYNTTEQSSTKETPYRL 1284
            +    PQT+GQ E  N+ + K L+         W + +  V  +YN    S+T+ TP+ +
Sbjct: 1074 SLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEI 1133

Query: 1285 TYGTDAMLS-VEIENQSWRVARFNENDNGENLIANLIMLPEEQREAHIRNEAGKVKVARK 1343
             +     LS +E+ + S +      ++N +  I     + E     H+      +K+ + 
Sbjct: 1134 VHRYSPALSPLELPSFSDKT-----DENSQETIQVFQTVKE-----HL--NTNNIKMKKY 1181

Query: 1344 FSTKVVP-RKMRVGDLVL*KNT---IPDKHNKLSPNWGGPYRIIGDVGGEAYKLEQLSGQ 1399
            F  K+    + + GDLV+ K T      K NKL+P++ GP+ ++   G   Y+L+     
Sbjct: 1182 FDMKIQEIEEFQPGDLVMVKRTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSI 1241

Query: 1400 K--VPRTWNASHLKQY 1413
            K     T++ SHL++Y
Sbjct: 1242 KHMFSSTFHVSHLEKY 1257


>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
           transposon gypsy [Contains: Reverse transcriptase (EC
           2.7.7.49); Endonuclease]
          Length = 1035

 Score =  144 bits (363), Expect = 2e-33
 Identities = 110/406 (27%), Positives = 190/406 (46%), Gaps = 25/406 (6%)

Query: 433 VQLETEKLIKARFIREVQYPTWLANVVMVKKA-----NGKWRMCTDYTSLNKVCPKDSYP 487
           V  E ++L+K   IR  + P      V+ KK      N   R+  D+  LN+    D YP
Sbjct: 197 VNNEVKQLLKDGIIRPSRSPYNSPTWVVDKKGTDAFGNPNKRLVIDFRKLNEKTIPDRYP 256

Query: 488 LPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTAFMTNQANYCYKTMPFGLKNA 547
           +P++  ++      +  + +D  SGY+QI +   D E T+F  N   Y +  +PFGL+NA
Sbjct: 257 MPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFSVNGGKYEFCRLPFGLRNA 316

Query: 548 GATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKC 607
            + +QR +D +  +Q+G+   VYVDD+I+ S   SDH   +      L    M+++ EK 
Sbjct: 317 SSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHIDTVLKCLIDANMRVSQEKT 376

Query: 608 SFGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRLTGRMAALSRFLPMAGD 667
            F  +  ++LGF+++  G + +P+K +AI E   P  V +V+   G  +    F+     
Sbjct: 377 RFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRSFLGLASYYRVFIKDFAA 436

Query: 668 KAAPFFTCLK-----------KNSKFQWTEECEQAFTKLKETLATLPVLSK-PTPGVPLV 715
            A P    LK           K    ++ E    AF +L+  LA+  V+ K P    P  
Sbjct: 437 IARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNILASEDVILKYPDFKKPFD 496

Query: 716 LYLAVTDKAVSTVLLQEEGKKQKVIYFVSHTLQGAELRYQKIEKAALAILKTARRLRPY- 774
           L    +   +  VL QE     + I  +S TL+  E  Y   E+  LAI+    +L+ + 
Sbjct: 497 LTTDASASGIGAVLSQE----GRPITMISRTLKQPEQNYATNERELLAIVWALGKLQNFL 552

Query: 775 FQSFQVKVKTD-VPLRQVLQKPDLSGRLVSWSVELSEYD--IQYEP 817
           + S ++ + TD  PL   +   + + ++  W   + +++  + Y+P
Sbjct: 553 YGSREINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQHNAKVFYKP 598



 Score = 48.1 bits (113), Expect = 2e-04
 Identities = 70/294 (23%), Positives = 112/294 (37%), Gaps = 26/294 (8%)

Query: 1086 VKVIRAGFYWPTMKKDCLEYVKKCEKC-QVFSDLHKAPPEELTTMMAPWPFAMWGTDILG 1144
            +K +   +Y+P M     E V  C  C Q   D H    E   T +  +   M   DI  
Sbjct: 757  IKQVLRDYYFPKMGSLAKEVVANCRVCTQAKYDRHPKKQELGETPIPSYTGEMVHIDIF- 815

Query: 1145 PFPVAKAQMKYIIVAVDYFTKWIEAEAVATITAAKVRNFLWQRIVCRFGVPMALVMDNGT 1204
                     K  +  +D F+K+   + V + T   +   L Q I+  F     +  DN  
Sbjct: 816  -----STDRKLFLTCIDKFSKYAIVQPVVSRTIVDITAPLLQ-IINLFPNIKTVYCDNEP 869

Query: 1205 QFTS-SVTREFCAEMGIEMRFASVEHPQTNGQAESANKVILKGLK-KKLDEAKGLWAEEL 1262
             F S +VT       GI++  A   H  +NGQ E  +  + +  +  KLD+      E +
Sbjct: 870  AFNSETVTSMLKNSFGIDIVNAPPLHSSSNGQVERFHSTLAEIARCLKLDKKTNDTVELI 929

Query: 1263 PGVLWAYNTTEQSSTKETPYRLTYGTDAMLSVEIENQSWRVARFNENDNGENLIANLIML 1322
                  YN T  S T+E P  + +       +EI+    R+ +  ++  G N        
Sbjct: 930  LRATIEYNKTVHSVTRERPIEVVHPGAHERCLEIKA---RLVKAQQDSIGRN-------N 979

Query: 1323 PEEQREAHIRNEAGKVKVARKFSTKVVP----RKMR--VGDLVL*KNTIPDKHN 1370
            P  Q       E   VK  ++   K+ P    +K++  +G  VL K  +  K N
Sbjct: 980  PSRQNRVFEVGERVFVKNNKRLGNKLTPLCTEQKVQADLGTSVLIKGRVVHKDN 1033


>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic
           protease (EC 3.4.23.-); Endonuclease; Reverse
           transcriptase (EC 2.7.7.49)]
          Length = 679

 Score =  130 bits (327), Expect = 3e-29
 Identities = 100/378 (26%), Positives = 169/378 (44%), Gaps = 18/378 (4%)

Query: 452 PTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYS 511
           P +L N    +K  GK RM  +Y ++NK    D+Y LPN D+L+    G ++ S  D  S
Sbjct: 285 PAFLVNNE-AEKRRGKKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKS 343

Query: 512 GYNQIMMHPSDEESTAFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYV 571
           G+ Q+++       TAF   Q +Y +  +PFGLK A + +QR MD+ F +   +   VYV
Sbjct: 344 GFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF-RVFRKFCCVYV 402

Query: 572 DDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPD 631
           DD++V S    DH   +     +   + + L+ +K     +   FLG  +       +  
Sbjct: 403 DDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDE---GTHKP 459

Query: 632 KGRAILEM-KSPTSV---KEVQRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEE 687
           +G  +  + K P ++   K++QR  G +   S ++P       P    LK+N  ++WT+E
Sbjct: 460 QGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWRWTKE 519

Query: 688 CEQAFTKLKETLATLPVLSKPTPGVPLVLYLAVTDK----AVSTVLLQEEGKKQKVIYFV 743
                 K+K+ L   P L  P P   L++    +D      +  + + E    + +  + 
Sbjct: 520 DTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYA 579

Query: 744 SHTLQGAELRYQKIEKAALAILKTARRLRPYFQSFQVKVKTD----VPLRQVLQKPDLS- 798
           S + + AE  Y   +K  LA++ T ++   Y       ++TD         +  K D   
Sbjct: 580 SGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKL 639

Query: 799 GRLVSWSVELSEYDIQYE 816
           GR + W   LS Y    E
Sbjct: 640 GRNIRWQAWLSHYSFDVE 657


>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic
           protease (EC 3.4.23.-); Endonuclease; Reverse
           transcriptase (EC 2.7.7.49)]
          Length = 679

 Score =  130 bits (326), Expect = 3e-29
 Identities = 100/378 (26%), Positives = 169/378 (44%), Gaps = 18/378 (4%)

Query: 452 PTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYS 511
           P +L N    +K  GK RM  +Y ++NK    D+Y LPN D+L+    G ++ S  D  S
Sbjct: 285 PAFLVNNE-AEKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKS 343

Query: 512 GYNQIMMHPSDEESTAFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYV 571
           G+ Q+++       TAF   Q +Y +  +PFGLK A + +QR MD+ F +   +   VYV
Sbjct: 344 GFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF-RVFRKFCCVYV 402

Query: 572 DDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPD 631
           DD++V S    DH   +     +   + + L+ +K     +   FLG  +       +  
Sbjct: 403 DDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDE---GTHKP 459

Query: 632 KGRAILEM-KSPTSV---KEVQRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEE 687
           +G  +  + K P ++   K++QR  G +   S ++P       P    LK+N  ++WT+E
Sbjct: 460 QGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKE 519

Query: 688 CEQAFTKLKETLATLPVLSKPTPGVPLVLYLAVTDK----AVSTVLLQEEGKKQKVIYFV 743
                 K+K+ L   P L  P P   L++    +D      +  + + E    + +  + 
Sbjct: 520 DTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYA 579

Query: 744 SHTLQGAELRYQKIEKAALAILKTARRLRPYFQSFQVKVKTD----VPLRQVLQKPDLS- 798
           S + + AE  Y   +K  LA++ T ++   Y       ++TD         +  K D   
Sbjct: 580 SGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKL 639

Query: 799 GRLVSWSVELSEYDIQYE 816
           GR + W   LS Y    E
Sbjct: 640 GRNIRWQAWLSHYSFDVE 657


>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic
           protease (EC 3.4.23.-); Endonuclease; Reverse
           transcriptase (EC 2.7.7.49)]
          Length = 679

 Score =  130 bits (326), Expect = 3e-29
 Identities = 100/378 (26%), Positives = 169/378 (44%), Gaps = 18/378 (4%)

Query: 452 PTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYS 511
           P +L N    +K  GK RM  +Y ++NK    D+Y LPN D+L+    G ++ S  D  S
Sbjct: 285 PAFLVNNE-AEKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKS 343

Query: 512 GYNQIMMHPSDEESTAFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYV 571
           G+ Q+++       TAF   Q +Y +  +PFGLK A + +QR MD+ F +   +   VYV
Sbjct: 344 GFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF-RVFRKFCCVYV 402

Query: 572 DDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPD 631
           DD++V S    DH   +     +   + + L+ +K     +   FLG  +       +  
Sbjct: 403 DDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDE---GTHKP 459

Query: 632 KGRAILEM-KSPTSV---KEVQRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEE 687
           +G  +  + K P ++   K++QR  G +   S ++P       P    LK+N  ++WT+E
Sbjct: 460 QGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKE 519

Query: 688 CEQAFTKLKETLATLPVLSKPTPGVPLVLYLAVTDK----AVSTVLLQEEGKKQKVIYFV 743
                 K+K+ L   P L  P P   L++    +D      +  + + E    + +  + 
Sbjct: 520 DTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYA 579

Query: 744 SHTLQGAELRYQKIEKAALAILKTARRLRPYFQSFQVKVKTD----VPLRQVLQKPDLS- 798
           S + + AE  Y   +K  LA++ T ++   Y       ++TD         +  K D   
Sbjct: 580 SGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKL 639

Query: 799 GRLVSWSVELSEYDIQYE 816
           GR + W   LS Y    E
Sbjct: 640 GRNIRWQAWLSHYSFDVE 657


>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic
           protease (EC 3.4.23.-); Endonuclease; Reverse
           transcriptase (EC 2.7.7.49)]
          Length = 674

 Score =  127 bits (320), Expect = 2e-28
 Identities = 99/378 (26%), Positives = 168/378 (44%), Gaps = 18/378 (4%)

Query: 452 PTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYS 511
           P +L N    +K  GK RM  +Y ++NK    D+Y  PN D+L+    G ++ S  D  S
Sbjct: 280 PAFLVNNE-AEKRRGKKRMVVNYKAMNKATVGDAYNPPNKDELLTLIRGKKIFSSFDCKS 338

Query: 512 GYNQIMMHPSDEESTAFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYV 571
           G+ Q+++       TAF   Q +Y +  +PFGLK A + +QR MD+ F +   +   VYV
Sbjct: 339 GFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF-RVFRKFCCVYV 397

Query: 572 DDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPD 631
           DD++V S    DH   +     +   + + L+ +K     +   FLG  +       +  
Sbjct: 398 DDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDE---GTHKP 454

Query: 632 KGRAILEM-KSPTSV---KEVQRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEE 687
           +G  +  + K P ++   K++QR  G +   S ++P       P    LK+N  ++WT+E
Sbjct: 455 QGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKE 514

Query: 688 CEQAFTKLKETLATLPVLSKPTPGVPLVLYLAVTDK----AVSTVLLQEEGKKQKVIYFV 743
                 K+K+ L   P L  P P   L++    +D      +  + + E    + +  + 
Sbjct: 515 DTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYA 574

Query: 744 SHTLQGAELRYQKIEKAALAILKTARRLRPYFQSFQVKVKTD----VPLRQVLQKPDLS- 798
           S + + AE  Y   +K  LA++ T ++   Y       ++TD         +  K D   
Sbjct: 575 SGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKL 634

Query: 799 GRLVSWSVELSEYDIQYE 816
           GR + W   LS Y    E
Sbjct: 635 GRNIRWQAWLSHYSFDVE 652


>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic
           protease (EC 3.4.23.-); Endonuclease; Reverse
           transcriptase (EC 2.7.7.49)]
          Length = 659

 Score =  127 bits (319), Expect = 2e-28
 Identities = 111/460 (24%), Positives = 201/460 (43%), Gaps = 33/460 (7%)

Query: 392 WTINDVPGIDPKVITHKLAIRPGATPVIQPMRRMSEEKHKAVQLETEKLIKARFIREVQY 451
           W    +  IDPK +             ++PM     ++ +    + ++L++ + I+  + 
Sbjct: 215 WMTATIELIDPKTVVK-----------VKPMSYSPSDREE-FDRQIKELLELKVIKPSK- 261

Query: 452 PTWLANVVMVK----KANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLM 507
            T ++   +V+    +  GK RM  +Y ++NK    D++ LPN D+L+    G ++ S  
Sbjct: 262 STHMSPAFLVENEAERRRGKKRMVVNYKAMNKATKGDAHNLPNKDELLTLVRGKKIYSSF 321

Query: 508 DAYSGYNQIMMHPSDEESTAFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNM 567
           D  SG  Q+++    +  TAF   Q +Y +  +PFGLK A + + +      S Q  +  
Sbjct: 322 DCKSGLWQVLLDKESQLLTAFTCPQGHYQWNVVPFGLKQAPSIFPKTYANSHSNQYSKYC 381

Query: 568 EVYVDDMIV-KSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGI 626
            VYVDD++V  +    +H   +     +     + L+ +K     +   FLG  +  +G 
Sbjct: 382 CVYVDDILVFSNTGRKEHYIHVLNILRRCEKLGIILSKKKAQLFKEKINFLGLEI-DQGT 440

Query: 627 EVNPDKGRAILE--MKSPTSV---KEVQRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSK 681
               +    ILE   K P  +   K++QR  G +   S ++P       P  + LK++S 
Sbjct: 441 HCPQNH---ILEHIHKFPDRIEDKKQLQRFLGILTYASDYIPKLASIRKPLQSKLKEDST 497

Query: 682 FQWTEECEQAFTKLKETLATLPVLSKPTPGVPLVLYLAVTDKAVSTVLLQEEGKKQKVIY 741
           + W +   Q   K+K+ L + P L  P P   LV+    +++    +L       + +  
Sbjct: 498 WTWNDTDSQYMAKIKKNLKSFPKLYHPEPNDKLVIETDASEEFWGGILKAIHNSHEYICR 557

Query: 742 FVSHTLQGAELRYQKIEKAALAILKTARRLRPYFQSFQVKVKTDVP-----LRQVLQKPD 796
           + S + + AE  Y   EK  LA+++  ++   Y    +  ++TD       +   L+   
Sbjct: 558 YASGSFKAAERNYHSNEKELLAVIRVIKKFSIYLTPSRFLIRTDNKNFTHFVNINLKGDR 617

Query: 797 LSGRLVSWSVELSEYDIQYEPRGQVTVQSLIDFVAELTPT 836
             GRLV W + LS+YD   E     T     DF+ E T T
Sbjct: 618 KQGRLVRWQMWLSQYDFDVEHIAG-TKNVFADFLQENTLT 656


>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic
           protease (EC 3.4.23.-); Endonuclease; Reverse
           transcriptase (EC 2.7.7.49)]
          Length = 680

 Score =  126 bits (317), Expect = 4e-28
 Identities = 98/378 (25%), Positives = 167/378 (43%), Gaps = 18/378 (4%)

Query: 452 PTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYS 511
           P +L N    +   G  RM  +Y ++NK    D+Y LPN D+L+    G ++ S  D  S
Sbjct: 286 PAFLVNNE-AENGRGNKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKS 344

Query: 512 GYNQIMMHPSDEESTAFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYV 571
           G+ Q+++       TAF   Q +Y +  +PFGLK A + +QR MD+ F +   +   VYV
Sbjct: 345 GFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF-RVFRKFCCVYV 403

Query: 572 DDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPD 631
           DD++V S    DH   +     +   + + L+ +K     +   FLG  +       +  
Sbjct: 404 DDIVVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDE---GTHKP 460

Query: 632 KGRAILEM-KSPTSV---KEVQRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEE 687
           +G  +  + K P ++   K++QR  G +   S ++P       P    LK+N  ++WT+E
Sbjct: 461 QGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPNLAQMRQPLQAKLKENVPWKWTKE 520

Query: 688 CEQAFTKLKETLATLPVLSKPTPGVPLVLYLAVTDK----AVSTVLLQEEGKKQKVIYFV 743
                 K+K+ L   P L  P P   L++    +D      +  + + E    + +  + 
Sbjct: 521 DTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYR 580

Query: 744 SHTLQGAELRYQKIEKAALAILKTARRLRPYFQSFQVKVKTD----VPLRQVLQKPDLS- 798
           S + + AE  Y   +K  LA++ T ++   Y       ++TD         +  K D   
Sbjct: 581 SGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKL 640

Query: 799 GRLVSWSVELSEYDIQYE 816
           GR + W   LS Y    E
Sbjct: 641 GRNIRWQAWLSHYSFDVE 658


>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic
           protease (EC 3.4.23.-); Endonuclease; Reverse
           transcriptase (EC 2.7.7.49)]
          Length = 666

 Score =  121 bits (304), Expect = 1e-26
 Identities = 98/371 (26%), Positives = 161/371 (42%), Gaps = 26/371 (7%)

Query: 462 KKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPS 521
           ++  GK RM  +Y ++N+    DS+ LPN+ +L+    G  + S  D  SG+ Q+++   
Sbjct: 287 ERRRGKKRMVVNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKSGFWQVVLDEE 346

Query: 522 DEESTAFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARA 581
            ++ TAF   Q ++ +K +PFGLK A + +QR M    +    +   VYVDD+IV S   
Sbjct: 347 SQKLTAFTCPQGHFQWKVVPFGLKQAPSIFQRHMQTALN-GADKFCMVYVDDIIVFSNSE 405

Query: 582 SDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTS----------RGIEVNPD 631
            DH   +      +  Y + L+ +K +   +   FLG  +              I   PD
Sbjct: 406 LDHYNHVYAVLKIVEKYGIILSKKKANLFKEKINFLGLEIDKGTHCPQNHILENIHKFPD 465

Query: 632 KGRAILEMKSPTSVKEVQRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEECEQA 691
           +    LE K     K +QR  G +     ++P   +   P    LKK+  + WT+     
Sbjct: 466 R----LEDK-----KHLQRFLGVLTYAETYIPKLAEIRKPLQVKLKKDVTWNWTQSDSDY 516

Query: 692 FTKLKETLATLPVLSKPTPGVPLVLYLAVTDKAVSTVL-LQEEGKKQKVIYFVSHTLQGA 750
             K+K+ L + P L  P P   L++    +D     VL  +     + +  + S + + A
Sbjct: 517 VKKIKKNLGSFPKLYLPKPEDHLIIETDASDSFWGGVLKARALDGVELICRYSSGSFKQA 576

Query: 751 ELRYQKIEKAALAILKTARRLRPYFQSFQVKVKTDVP-----LRQVLQKPDLSGRLVSWS 805
           E  Y   +K  LA+ +   +   Y    +  V+TD       LR  L+     GRLV W 
Sbjct: 577 EKNYHSNDKELLAVKQVITKFSAYLTPVRFTVRTDNKNFTYFLRINLKGDSKQGRLVRWQ 636

Query: 806 VELSEYDIQYE 816
              S+Y    E
Sbjct: 637 NWFSKYQFDVE 647


>YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II
          Length = 1268

 Score =  118 bits (296), Expect = 1e-25
 Identities = 76/251 (30%), Positives = 128/251 (50%), Gaps = 7/251 (2%)

Query: 415 ATPVIQPMRRMSEEKHKAVQLETEKLIKARFIREVQYPTWLANVVMVKK-ANGKWRMCTD 473
           A PV +  R +     +AV+ E  +L +   I  + Y  W A +V++KK   GK R+C D
Sbjct: 438 AVPVFKRARPVPYGSLEAVETELNRLQEMGVIVPITYAKWAAPIVVIKKKGTGKIRVCAD 497

Query: 474 Y--TSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTAFMTN 531
           +  + LN     + +PLP  + +     G  + S +D    Y Q+ +    ++     T+
Sbjct: 498 FKCSGLNAALKDEFHPLPTSEDIFSRLKGT-VYSQIDLKDAYLQVELDEEAQKLAVINTH 556

Query: 532 QANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDLKEA 591
           +  + Y  M FGLK A A++Q++MDK+ S   G  + VY DD+I+ ++   +H   L+E 
Sbjct: 557 RGIFKYLRMTFGLKPAPASFQKIMDKMVSGLTG--VAVYWDDIIISASSIEEHEKILREL 614

Query: 592 FDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRL 651
           F++ + Y  +++ EKC+F  +   FLGF +   G   +  K  AI  MK+PT  K++   
Sbjct: 615 FERFKEYGFRVSAEKCAFAQKQVTFLGF-VDEHGRRPDSKKTEAIRSMKAPTDQKQLASF 673

Query: 652 TGRMAALSRFL 662
            G    LSR +
Sbjct: 674 LGAADWLSRMM 684



 Score = 77.0 bits (188), Expect = 3e-13
 Identities = 64/228 (28%), Positives = 102/228 (44%), Gaps = 24/228 (10%)

Query: 1066 IMAEVHEGVCSSHIGGRSLAVKVIRAGFYWPTMKKDCLEYVKKCEKCQVFSDLHKAPPEE 1125
            ++ ++HEG    H G   +  K  R+  +W  +  D    V+ C  CQ  S + +  P  
Sbjct: 786  VLKQLHEG----HPGIVQMKQKA-RSFVFWRGLDSDIENMVRHCNNCQENSKMPRVVP-- 838

Query: 1126 LTTMMAPWPF--AMWGT---DILGPFPVAKAQMKYIIVAVDYFTKWIEAEAVATITAAKV 1180
                + PWP   A W     D  GP         Y++V VD  TK+ E +   +I+A   
Sbjct: 839  ----LNPWPVPEAPWKRIHIDFAGPLNGC-----YLLVVVDAKTKYAEVKLTRSISAVTT 889

Query: 1181 RNFLWQRIVCRFGVPMALVMDNGTQFTSSVTREFCAEMGIEMRFASVEHPQTNGQAESAN 1240
             + L + I    G P  ++ DNGTQ TS +  + C   GIE + ++V +P++NG AE   
Sbjct: 890  IDLL-EEIFSIHGYPETIISDNGTQLTSHLFAQMCQSHGIEHKTSAVYYPRSNGAAERFV 948

Query: 1241 KVILKGLKKKLDEAKGLWAEELPGVLWAYNTTEQSSTK-ETPYRLTYG 1287
              + +G+ K   E   +  + L   L +Y  T  S+    TP    +G
Sbjct: 949  DTLKRGIAKIKGEG-SVNQQILNKFLISYRNTPHSALNGSTPAECHFG 995


>POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
            Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
            3.1.26.4) (RT); Integrase (IN)]
          Length = 1161

 Score =  104 bits (259), Expect = 2e-21
 Identities = 61/213 (28%), Positives = 106/213 (49%), Gaps = 9/213 (4%)

Query: 1081 GRSLAVKVIRAGFYWPTMKKDCLEYVKKCEKCQVFSDLHKAPPEELTTMMAPWPFAMWGT 1140
            GR      + + ++WP ++KD ++ +++C++C V +  +   P  L  +    PF  +  
Sbjct: 830  GRDATFLKVSSKYWWPNLRKDVVKSIRQCKQCLVTNATNLTSPPILRPVKPLKPFDKFYI 889

Query: 1141 DILGPFPVAKAQMKYIIVAVDYFTKWI---EAEAVATITAAKVRNFLWQRIVCRFGVPMA 1197
            D +GP P +   + +++V VD  T ++     +A +T    K  N L         +P  
Sbjct: 890  DYIGPLPPSNGYL-HVLVVVDSMTGFVWLYPTKAPSTSATVKALNMLTS-----IAIPKV 943

Query: 1198 LVMDNGTQFTSSVTREFCAEMGIEMRFASVEHPQTNGQAESANKVILKGLKKKLDEAKGL 1257
            L  D G  FTSS   ++  E GI++ F++  HPQ++G+ E  N  I + L K L      
Sbjct: 944  LHSDQGAAFTSSTFADWAKEKGIQLEFSTPYHPQSSGKVERKNSDIKRLLTKLLIGRPAK 1003

Query: 1258 WAEELPGVLWAYNTTEQSSTKETPYRLTYGTDA 1290
            W + LP V  A N +   S+K TP++L +G D+
Sbjct: 1004 WYDLLPVVQLALNNSYSPSSKYTPHQLLFGVDS 1036



 Score =  100 bits (250), Expect = 2e-20
 Identities = 107/492 (21%), Positives = 198/492 (39%), Gaps = 35/492 (7%)

Query: 404 VITHKLAIRPGATPVIQPMRRMSEEKHKAVQLETEKLIKARFIREVQYPTWLANVVMVKK 463
           + T  LA RP     I P  + S      +Q+  + L+K   + + Q  T    V  V K
Sbjct: 167 IATGTLAPRPQKQYPINPKAKPS------IQIVIDDLLKQGVLIQ-QNSTMNTPVYPVPK 219

Query: 464 ANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDE 523
            +GKWRM  DY  +NK  P  +    +   ++      +  + +D  +G+     HP   
Sbjct: 220 PDGKWRMVLDYREVNKTIPLIAAQNQHSAGILSSIYRGKYKTTLDLTNGF---WAHPITP 276

Query: 524 ES---TAFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYVDDMIVKSAR 580
           ES   TAF      YC+  +P G  N+ A +    D +   +   N++ YVDD+ +    
Sbjct: 277 ESYWLTAFTWQGKQYCWTRLPQGFLNSPALFTA--DVVDLLKEIPNVQAYVDDIYISHDD 334

Query: 581 ASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPDKGRAILEMK 640
             +H   L++ F  L      ++ +K     +  +FLGF +T  G  +     + +L + 
Sbjct: 335 PQEHLEQLEKIFSILLNAGYVVSLKKSEIAQREVEFLGFNITKEGRGLTDTFKQKLLNIT 394

Query: 641 SPTSVKEVQRLTGRMAALSRFLPMAGDKAAPFFTCL-KKNSKF-QWTEECEQAFTKLKET 698
            P  +K++Q + G +     F+P   +   P +T +   N KF  WTE+       +   
Sbjct: 395 PPKDLKQLQSILGLLNFARNFIPNYSELVKPLYTIVANANGKFISWTEDNSNQLQHIISV 454

Query: 699 LATLPVLSKPTPGVPLVLYLAVTDKAVSTVLLQEEGKKQKVIYFVSHTLQGAELRYQKIE 758
           L     L +  P   L++ +  +  A   +    EG K+ ++Y V++    AE ++ + E
Sbjct: 455 LNQADNLEERNPETRLIIKVNSSPSA-GYIRYYNEGSKRPIMY-VNYIFSKAEAKFTQTE 512

Query: 759 KAALAILKTARRLRPYFQSFQVKVKTDVPLRQVLQKPDLSG------RLVSWSVELSEYD 812
           K    + K   +        ++ V + +     +Q+  L        R ++W   L +  
Sbjct: 513 KLLTTMHKGLIKAMDLAMGQEILVYSPIVSMTKIQRTPLPERKALPVRWITWMTYLEDPR 572

Query: 813 IQYE-PRGQVTVQSLIDFVAELTPTEGEKTQGEWVLSVDGS---------SNNTGSGAGI 862
           IQ+   +    +Q + +   ++       ++   V   DGS         S++ G G   
Sbjct: 573 IQFHYDKSLPELQQIPNVTEDVIAKTKHPSEFAMVFYTDGSAIKHPDVNKSHSAGMGIAQ 632

Query: 863 TIESPDKMIIEQ 874
               P+  I+ Q
Sbjct: 633 VQFIPEYKIVHQ 644


>POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC
           3.4.23.-); Reverse transcriptase/ribonuclease H (EC
           2.7.7.49) (EC 3.1.26.4) (RT); Integrase (IN)]
          Length = 1165

 Score =  102 bits (253), Expect = 1e-20
 Identities = 97/467 (20%), Positives = 188/467 (39%), Gaps = 19/467 (4%)

Query: 387 LDLF--AWTINDVPGIDPKVITHKLAIRPGATPVIQPMRRMSEEKHKAVQLETEKLIKAR 444
           L LF   W      G+  +V    + +R GA+PV      MS+E  + ++   +K +   
Sbjct: 143 LQLFPTVWAERAGMGLANQVPPVVVELRSGASPVAVRQYPMSKEAREGIRPHIQKFLDLG 202

Query: 445 FIREVQYPTWLANVVMVKK-ANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNEL 503
            +   + P W   ++ VKK     +R   D   +NK        +PN   L+     +  
Sbjct: 203 VLVPCRSP-WNTPLLPVKKPGTNDYRPVQDLREINKRVQDIHPTVPNPYNLLSSLPPSYT 261

Query: 504 -LSLMDAYSGYNQIMMHPSDEESTAF------MTNQANYCYKTMPFGLKNAGATYQRLMD 556
             S++D    +  + +HP+ +   AF        N     +  +P G KN+   +   + 
Sbjct: 262 WYSVLDLKDAFFCLRLHPNSQPLFAFEWKDPEKGNTGQLTWTRLPQGFKNSPTLFDEALH 321

Query: 557 KIFSKQVGRNMEV----YVDDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQ 612
           +  +     N +V    YVDD++V +    D     ++   +L     +++ +K     +
Sbjct: 322 RDLAPFRALNPQVVLLQYVDDLLVAAPTYEDCKKGTQKLLQELSKLGYRVSAKKAQLCQR 381

Query: 613 GGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRLTGRMAALSRFLPMAGDKAAPF 672
              +LG++L      + P +   ++++  PT+ ++V+   G       ++P     AAP 
Sbjct: 382 EVTYLGYLLKEGKRWLTPARKATVMKIPVPTTPRQVREFLGTAGFCRLWIPGFASLAAPL 441

Query: 673 FTCLKKNSKFQWTEECEQAFTKLKETLATLPVLSKPTPGVPLVLYLAVTDKAVSTVLLQE 732
           +   K++  F WTEE +QAF  +K+ L + P L+ P    P  LY+         VL Q 
Sbjct: 442 YPLTKESIPFIWTEEHQQAFDHIKKALLSAPALALPDLTKPFTLYIDERAGVARGVLTQT 501

Query: 733 EGKKQKVIYFVSHTLQGAELRYQKIEKAALAILKTARRLRPYFQSFQVKVKTDVPLRQVL 792
            G  ++ + ++S  L      +    KA  A+    +          V V     L  ++
Sbjct: 502 LGPWRRPVAYLSKKLDPVASGWPTCLKAVAAVALLLKDADKLTLGQNVTVIASHSLESIV 561

Query: 793 QKPD----LSGRLVSWSVELSEYDIQYEPRGQVTVQSLIDFVAELTP 835
           ++P      + R+  +   L    + + P   +   +L+   +E TP
Sbjct: 562 RQPPDRWMTNARMTHYQSLLLNERVSFAPPAVLNPATLLPVESEATP 608



 Score = 94.7 bits (234), Expect = 2e-18
 Identities = 84/314 (26%), Positives = 131/314 (40%), Gaps = 30/314 (9%)

Query: 1077 SHIGGRSLAVKVIRAGFYWPTMKKDCLEYVKKCEKCQVFSDLHKAPPEELTTMMAPWPFA 1136
            +H+G   L   V R     P ++    E   +C+ C + + +     E         P  
Sbjct: 820  THLGPEKLLQLVNRTSLLIPNLQSAVREVTSQCQACAMTNAV-TTYRETGKRQRGDRPGV 878

Query: 1137 MWGTDILGPFPVAKAQMKYIIVAVDYFTKWIEAEAVATITAAKVRNFLWQRIVCRFGVPM 1196
             W  D     P  +   KY++V +D F+ W+EA    T TA  V   + + I+ RFG+P 
Sbjct: 879  YWEVDFTEIKP-GRYGNKYLLVFIDTFSGWVEAFPTKTETALIVCKKILEEILPRFGIPK 937

Query: 1197 ALVMDNGTQFTSSVTREFCAEMGIEMRFASVEHPQTNGQAESANKVILKGLKKKLDEAKG 1256
             L  DNG  F + V++    ++GI  +      PQ++GQ E  N+ I + L K   E  G
Sbjct: 938  VLGSDNGPAFVAQVSQGLATQLGINWKLHCAYRPQSSGQVERMNRTIKETLTKLALETGG 997

Query: 1257 L-WAEELP-GVLWAYNTTEQSSTKETPYRLTYGTDAMLSVEIENQSWRVARFNENDNGEN 1314
              W   LP  +L A NT  +     TPY + YG    +                 ++GE 
Sbjct: 998  KDWVTLLPLALLRARNTPGRFGL--TPYEILYGGPPPIL----------------ESGET 1039

Query: 1315 LIANLIMLP----EEQREAHIRNEA-GKVKVARKFSTKVVPRKMRVGDLVL*KNTIPDKH 1369
            L  +   LP      +    +R +   ++K   K  T  +P   +VGD VL +   P   
Sbjct: 1040 LGPDDRFLPVLFTHLKALEIVRTQIWDQIKEVYKPGTVTIPHPFQVGDQVLVRRHRP--- 1096

Query: 1370 NKLSPNWGGPYRII 1383
            + L P W GPY ++
Sbjct: 1097 SSLEPRWKGPYLVL 1110


  Database: sprot
    Posted date:  Nov 25, 2004 10:54 AM
  Number of letters in database: 59,974,054
  Number of sequences in database:  164,201
  
Lambda     K      H
   0.318    0.135    0.399 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 166,959,019
Number of Sequences: 164201
Number of extensions: 7280459
Number of successful extensions: 18322
Number of sequences better than 10.0: 156
Number of HSP's better than 10.0 without gapping: 95
Number of HSP's successfully gapped in prelim test: 61
Number of HSP's that attempted gapping in prelim test: 17917
Number of HSP's gapped (non-prelim): 317
length of query: 1414
length of database: 59,974,054
effective HSP length: 123
effective length of query: 1291
effective length of database: 39,777,331
effective search space: 51352534321
effective search space used: 51352534321
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 72 (32.3 bits)


Lotus: description of TM0114.12