Lotus
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= TM0046b.4
         (1706 letters)

Database: sprot 
           164,201 sequences; 59,974,054 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran...   202  5e-51
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III    202  8e-51
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran...   201  2e-50
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran...   181  2e-44
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran...   174  1e-42
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei...   162  1e-38
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei...   158  1e-37
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei...   158  1e-37
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran...   145  9e-34
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro...   130  4e-29
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro...   129  5e-29
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro...   129  5e-29
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro...   127  3e-28
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro...   126  6e-28
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot...   125  8e-28
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot...   120  3e-26
YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II     118  1e-25
POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse transcript...   106  6e-22
POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC 3.4.23...   101  2e-20
POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23...   100  3e-20

>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
            transposon 17.6 [Contains: Protease (EC 3.4.23.-);
            Reverse transcriptase (EC 2.7.7.49); Endonuclease]
          Length = 1058

 Score =  202 bits (515), Expect = 5e-51
 Identities = 124/363 (34%), Positives = 182/363 (49%), Gaps = 6/363 (1%)

Query: 745  PTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYS 804
            P W+           K+R+  DY  LN++   D +P+PN+D+++         + +D   
Sbjct: 246  PIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDRHPIPNMDEILGKLGRCNYFTTIDLAK 305

Query: 805  GYNQIMMHPSDEESTTFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYV 864
            G++QI M P     T F T   +Y Y  MPFGLKNA AT+QR M+ I    + ++  VY+
Sbjct: 306  GFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKNAPATFQRCMNDILRPLLNKHCLVYL 365

Query: 865  DDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPD 924
            DD+IV S    +H   L   F++L    +KL  +KC F  Q   FLG +LT  GI+ NP+
Sbjct: 366  DDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDKCEFLKQETTFLGHVLTPDGIKPNPE 425

Query: 925  KGRAILEMKSPTSVKEVQRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTE-ECEQ 983
            K  AI +   PT  KE++   G      +F+P   D A P   CLKKN K   T  E + 
Sbjct: 426  KIEAIQKYPIPTKPKEIKAFLGLTGYYRKFIPNFADIAKPMTKCLKKNMKIDTTNPEYDS 485

Query: 984  AFTKLKETLATLPVLSKPTPSVPLVLYLAVTDKAVSTVLLQEEGKKQKVIYFVSHTLQGA 1043
            AF KLK  ++  P+L  P  +    L    +D A+  VL Q+       + ++S TL   
Sbjct: 486  AFKKLKYLISEDPILKVPDFTKKFTLTTDASDVALGAVLSQD----GHPLSYISRTLNEH 541

Query: 1044 ELRYQKIEKAALAILKTARRLRPYFQSFQVKIKTD-VPLRQVLQKPDLSGRLVSWSVELS 1102
            E+ Y  IEK  LAI+   +  R Y      +I +D  PL  + +  D + +L  W V+LS
Sbjct: 542  EINYSTIEKELLAIVWATKTFRHYLLGRHFEISSDHQPLSWLYRMKDPNSKLTRWRVKLS 601

Query: 1103 EYD 1105
            E+D
Sbjct: 602  EFD 604


>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
          Length = 2186

 Score =  202 bits (513), Expect = 8e-51
 Identities = 124/414 (29%), Positives = 209/414 (49%), Gaps = 9/414 (2%)

Query: 702  LAIRPGATPVIQPRRRMSEEKNKAVQLETEKLIKARFIREVQYPTWLANVVMVKKANGKW 761
            + ++ GA P+ Q  R +       ++   +K++  + IRE + P W + VV+VKK +G  
Sbjct: 934  IELKEGAEPIRQKPRPIPLALKPEIRKMIQKMLNQKVIRESKSP-WSSPVVLVKKKDGSI 992

Query: 762  RMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTTF 821
            RMC DY  +NKV   +++PLPN++  +   +G +L ++ D  +G+ QI +    +E T F
Sbjct: 993  RMCIDYRKVNKVVKNNAHPLPNIEATLQSLAGKKLYTVFDMIAGFWQIPLDEKSKEITAF 1052

Query: 822  MTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDL 881
                  + +  +PFGL  + A +Q  M++I    +G    VYVDD+++ S     H  D+
Sbjct: 1053 AIGSELFEWNVLPFGLVISPALFQGTMEEIIGDLLGVCAFVYVDDLLIASKDMEQHLQDV 1112

Query: 882  KEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEV 941
            KEA  ++R   MKL   KC    +  ++LG  +T  G+E    K   + +   PT+VKE+
Sbjct: 1113 KEALTRIRKSGMKLRASKCHIAKKEVEYLGHKVTLDGVETQEVKTDKMKQFSRPTNVKEL 1172

Query: 942  QRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEECEQAFTKLKETLATLPVLSKP 1001
            Q   G +    +F+      A+   + +     + W +E E AF +LK+ +   PVL++P
Sbjct: 1173 QSFLGLVGYYRKFILNFAQIASSLTSLISAKVAWIWEKEQEIAFQELKKLVCQTPVLAQP 1232

Query: 1002 TPSV------PLVLYLAVTDKAVSTVLLQE-EGKKQKVIYFVSHTLQGAELRYQKIEKAA 1054
                      P ++Y   + K +  VL QE    +Q  I F S  L  AE RY   +  A
Sbjct: 1233 DVEAALKGDRPFMIYTDASRKGIGAVLAQEGPDGQQHPIAFASKALSPAETRYHITDLEA 1292

Query: 1055 LAILKTARRLRPYFQSFQVKIKTD-VPLRQVLQKPDLSGRLVSWSVELSEYDIQ 1107
            LA++   RR +       + + TD  PL  +L+   L+ RL  WS+E+ E+D++
Sbjct: 1293 LAMMFALRRFKTIIYGTAITVFTDHKPLISLLKGSPLADRLWRWSIEILEFDVK 1346


>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
            transposon 297 [Contains: Protease (EC 3.4.23.-); Reverse
            transcriptase (EC 2.7.7.49); Endonuclease]
          Length = 1059

 Score =  201 bits (510), Expect = 2e-50
 Identities = 128/408 (31%), Positives = 201/408 (48%), Gaps = 12/408 (2%)

Query: 709  TPVIQPRRRMSEEKNKAVQLETEKLIKARFIRE----VQYPTWLANVVMVKKANGKWRMC 764
            +P+   +  +++     V+ + ++++    IRE       PTW+           K+R+ 
Sbjct: 205  SPIYSKQYPLAQTHEIEVENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVV 264

Query: 765  TDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTTFMTN 824
             DY  LN++   D YP+PN+D+++      +  + +D   G++QI M       T F T 
Sbjct: 265  IDYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTK 324

Query: 825  QANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDLKEA 884
              +Y Y  MPFGL+NA AT+QR M+ I    + ++  VY+DD+I+ S   ++H   ++  
Sbjct: 325  SGHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLV 384

Query: 885  FDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRL 944
            F +L    +KL  +KC F  +   FLG ++T  GI+ NP K +AI+    PT  KE++  
Sbjct: 385  FTKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAF 444

Query: 945  TGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEECE--QAFTKLKETLATLPVLSKPT 1002
             G      +F+P   D A P  +CLKK +K   T++ E  +AF KLK  +   P+L  P 
Sbjct: 445  LGLTGYYRKFIPNYADIAKPMTSCLKKRTKID-TQKLEYIEAFEKLKALIIRDPILQLPD 503

Query: 1003 PSVPLVLYLAVTDKAVSTVLLQEEGKKQKVIYFVSHTLQGAELRYQKIEKAALAILKTAR 1062
                 VL    ++ A+  VL Q        I F+S TL   EL Y  IEK  LAI+   +
Sbjct: 504  FEKKFVLTTDASNLALGAVLSQ----NGHPISFISRTLNDHELNYSAIEKELLAIVWATK 559

Query: 1063 RLRPYFQSFQVKIKTD-VPLRQVLQKPDLSGRLVSWSVELSEYDIQYE 1109
              R Y    Q  I +D  PLR +    +   +L  W V LSEY  + +
Sbjct: 560  TFRHYLLGRQFLIASDHQPLRWLHNLKEPGAKLERWRVRLSEYQFKID 607


>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
            transposon opus [Contains: Protease (EC 3.4.23.-);
            Reverse transcriptase (EC 2.7.7.49); Endonuclease]
          Length = 1003

 Score =  181 bits (459), Expect = 2e-44
 Identities = 126/406 (31%), Positives = 205/406 (50%), Gaps = 23/406 (5%)

Query: 726  VQLETEKLIKARFIRE----VQYPTWLANVVMVKKANGK--WRMCTDYTSLNKVCPKDSY 779
            V+ + ++L++   IR        P W+  V    K NG+  +RM  D+  LN V   D+Y
Sbjct: 139  VERQIDELLQDGIIRPSNSPYNSPIWI--VPKKPKPNGEKQYRMVVDFKRLNTVTIPDTY 196

Query: 780  PLPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTTFMTNQANYCYKTMPFGLKN 839
            P+P+++  +      +  + +D  SG++QI M  SD   T F T    Y +  +PFGLKN
Sbjct: 197  PIPDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPFGLKN 256

Query: 840  AGATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEK 899
            A A +QR++D I  + +G+   VY+DD+IV S     H  +L+     L    +++N EK
Sbjct: 257  APAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNLEK 316

Query: 900  CSFGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRLTGRMAALSRFLPMAG 959
              F     +FLG+++T+ GI+ +P K RAI EM  PTSVKE++R  G  +   +F+    
Sbjct: 317  SHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYRKFIQDYA 376

Query: 960  DKAAPFFTCLK---------KNSKFQWT--EECEQAFTKLKETLATLPVLSKPTPSVPLV 1008
              A P     +         ++SK   T  E   Q+F  LK  L +  +L+ P  + P  
Sbjct: 377  KVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKPFH 436

Query: 1009 LYLAVTDKAVSTVLLQEEGKKQKVIYFVSHTLQGAELRYQKIEKAALAILKTARRLRPY- 1067
            L    ++ A+  VL Q++  + + I ++S +L   E  Y  IEK  LAI+ +   LR Y 
Sbjct: 437  LTTDASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLDNLRAYL 496

Query: 1068 FQSFQVKIKTD-VPLRQVLQKPDLSGRLVSWSVELSEYDIQ--YEP 1110
            + +  +K+ TD  PL   L   + + +L  W   + EY+ +  Y+P
Sbjct: 497  YGAGTIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELIYKP 542


>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
            transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
            transcriptase (EC 2.7.7.49); Endonuclease]
          Length = 1237

 Score =  174 bits (442), Expect = 1e-42
 Identities = 119/406 (29%), Positives = 177/406 (43%), Gaps = 6/406 (1%)

Query: 710  PVIQPRRRMSEEKNKAVQLETEKLIKARFIREV--QYPTWLANVVMVKKANG---KWRMC 764
            PV     R    + + +Q + +KLIK + +     QY + L  V      N    KWR+ 
Sbjct: 314  PVYTKNYRSPHSQVEEIQAQVQKLIKDKIVEPSVSQYNSPLLLVPKKSSPNSDKKKWRLV 373

Query: 765  TDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTTFMTN 824
             DY  +NK    D +PLP +D ++D     +  S +D  SG++QI +     + T+F T+
Sbjct: 374  IDYRQINKKLLADKFPLPRIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTS 433

Query: 825  QANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDLKEA 884
              +Y +  +PFGLK A  ++QR+M   FS        +Y+DD+IV          +L E 
Sbjct: 434  NGSYRFTRLPFGLKIAPNSFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEV 493

Query: 885  FDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRL 944
            F + R Y +KL+PEKCSF +    FLG   T +GI  +  K   I     P      +R 
Sbjct: 494  FGKCREYNLKLHPEKCSFFMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRF 553

Query: 945  TGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEECEQAFTKLKETLATLPVLSKPTPS 1004
                    RF+    D +       KKN  F+WT+EC++AF  LK  L    +L  P  S
Sbjct: 554  VAFCNYYRRFIKNFADYSRHITRLCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFS 613

Query: 1005 VPLVLYLAVTDKAVSTVLLQEEGKKQKVIYFVSHTLQGAELRYQKIEKAALAILKTARRL 1064
                +    + +A   VL Q     Q  + + S      E      E+   AI       
Sbjct: 614  KEFCITTDASKQACGAVLTQNHNGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHF 673

Query: 1065 RPYFQSFQVKIKTD-VPLRQVLQKPDLSGRLVSWSVELSEYDIQYE 1109
            RPY       +KTD  PL  +    + S +L    +EL EY+   E
Sbjct: 674  RPYIYGKHFTVKTDHRPLTYLFSMVNPSSKLTRIRLELEEYNFTVE 719


>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
            type 1
          Length = 1333

 Score =  162 bits (409), Expect = 1e-38
 Identities = 110/397 (27%), Positives = 192/397 (47%), Gaps = 9/397 (2%)

Query: 722  KNKAVQLETEKLIKARFIREVQYPTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPL 781
            K +A+  E  + +K+  IRE +       V+ V K  G  RM  DY  LNK    + YPL
Sbjct: 424  KMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPL 482

Query: 782  PNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTTFMTNQANYCYKTMPFGLKNAG 841
            P +++L+    G+ + + +D  S Y+ I +   DE    F   +  + Y  MP+G+  A 
Sbjct: 483  PLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAP 542

Query: 842  ATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCS 901
            A +Q  ++ I  +    ++  Y+DD+++ S   S+H   +K+   +L+   + +N  KC 
Sbjct: 543  AHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCE 602

Query: 902  FGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRLTGRMAALSRFLPMAGDK 961
            F     KF+G+ ++ +G     +    +L+ K P + KE+++  G +  L +F+P     
Sbjct: 603  FHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQL 662

Query: 962  AAPFFTCLKKNSKFQWTEECEQAFTKLKETLATLPVLSKPTPSVPLVLYLAVTDKAVSTV 1021
              P    LKK+ +++WT    QA   +K+ L + PVL     S  ++L    +D AV  V
Sbjct: 663  THPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAV 722

Query: 1022 LLQE-EGKKQKVIYFVSHTLQGAELRYQKIEKAALAILKTARRLRPYFQSF--QVKIKTD 1078
            L Q+ +  K   + + S  +  A+L Y   +K  LAI+K+ +  R Y +S     KI TD
Sbjct: 723  LSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTD 782

Query: 1079 ---VPLRQVLQKPDLSGRLVSWSVELSE--YDIQYEP 1110
               +  R   +    + RL  W + L +  ++I Y P
Sbjct: 783  HRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRP 819


>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
            type 3
          Length = 1333

 Score =  158 bits (400), Expect = 1e-37
 Identities = 109/397 (27%), Positives = 192/397 (47%), Gaps = 9/397 (2%)

Query: 722  KNKAVQLETEKLIKARFIREVQYPTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPL 781
            K +A+  E  + +K+  IRE +       V+ V K  G  RM  DY  LNK    + YPL
Sbjct: 424  KMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPL 482

Query: 782  PNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTTFMTNQANYCYKTMPFGLKNAG 841
            P +++L+    G+ + + +D  S Y+ I +   DE    F   +  + Y  MP+G+  A 
Sbjct: 483  PLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAP 542

Query: 842  ATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCS 901
            A +Q  ++ I  +    ++  Y+D++++ S   S+H   +K+   +L+   + +N  KC 
Sbjct: 543  AHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCE 602

Query: 902  FGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRLTGRMAALSRFLPMAGDK 961
            F     KF+G+ ++ +G     +    +L+ K P + KE+++  G +  L +F+P     
Sbjct: 603  FHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQL 662

Query: 962  AAPFFTCLKKNSKFQWTEECEQAFTKLKETLATLPVLSKPTPSVPLVLYLAVTDKAVSTV 1021
              P    LKK+ +++WT    QA   +K+ L + PVL     S  ++L    +D AV  V
Sbjct: 663  THPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAV 722

Query: 1022 LLQE-EGKKQKVIYFVSHTLQGAELRYQKIEKAALAILKTARRLRPYFQSF--QVKIKTD 1078
            L Q+ +  K   + + S  +  A+L Y   +K  LAI+K+ +  R Y +S     KI TD
Sbjct: 723  LSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTD 782

Query: 1079 ---VPLRQVLQKPDLSGRLVSWSVELSE--YDIQYEP 1110
               +  R   +    + RL  W + L +  ++I Y P
Sbjct: 783  HRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRP 819


>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
            type 2
          Length = 1333

 Score =  158 bits (400), Expect = 1e-37
 Identities = 109/397 (27%), Positives = 192/397 (47%), Gaps = 9/397 (2%)

Query: 722  KNKAVQLETEKLIKARFIREVQYPTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPL 781
            K +A+  E  + +K+  IRE +       V+ V K  G  RM  DY  LNK    + YPL
Sbjct: 424  KMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPL 482

Query: 782  PNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTTFMTNQANYCYKTMPFGLKNAG 841
            P +++L+    G+ + + +D  S Y+ I +   DE    F   +  + Y  MP+G+  A 
Sbjct: 483  PLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAP 542

Query: 842  ATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCS 901
            A +Q  ++ I  +    ++  Y+D++++ S   S+H   +K+   +L+   + +N  KC 
Sbjct: 543  AHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCE 602

Query: 902  FGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRLTGRMAALSRFLPMAGDK 961
            F     KF+G+ ++ +G     +    +L+ K P + KE+++  G +  L +F+P     
Sbjct: 603  FHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQL 662

Query: 962  AAPFFTCLKKNSKFQWTEECEQAFTKLKETLATLPVLSKPTPSVPLVLYLAVTDKAVSTV 1021
              P    LKK+ +++WT    QA   +K+ L + PVL     S  ++L    +D AV  V
Sbjct: 663  THPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAV 722

Query: 1022 LLQE-EGKKQKVIYFVSHTLQGAELRYQKIEKAALAILKTARRLRPYFQSF--QVKIKTD 1078
            L Q+ +  K   + + S  +  A+L Y   +K  LAI+K+ +  R Y +S     KI TD
Sbjct: 723  LSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTD 782

Query: 1079 ---VPLRQVLQKPDLSGRLVSWSVELSE--YDIQYEP 1110
               +  R   +    + RL  W + L +  ++I Y P
Sbjct: 783  HRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRP 819


>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
            transposon gypsy [Contains: Reverse transcriptase (EC
            2.7.7.49); Endonuclease]
          Length = 1035

 Score =  145 bits (366), Expect = 9e-34
 Identities = 111/406 (27%), Positives = 190/406 (46%), Gaps = 25/406 (6%)

Query: 726  VQLETEKLIKARFIREVQYPTWLANVVMVKKA-----NGKWRMCTDYTSLNKVCPKDSYP 780
            V  E ++L+K   IR  + P      V+ KK      N   R+  D+  LN+    D YP
Sbjct: 197  VNNEVKQLLKDGIIRPSRSPYNSPTWVVDKKGTDAFGNPNKRLVIDFRKLNEKTIPDRYP 256

Query: 781  LPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTTFMTNQANYCYKTMPFGLKNA 840
            +P++  ++      +  + +D  SGY+QI +   D E T+F  N   Y +  +PFGL+NA
Sbjct: 257  MPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFSVNGGKYEFCRLPFGLRNA 316

Query: 841  GATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKC 900
             + +QR +D +  +Q+G+   VYVDD+I+ S   SDH   +      L    M+++ EK 
Sbjct: 317  SSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHIDTVLKCLIDANMRVSQEKT 376

Query: 901  SFGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRLTGRMAALSRFLPMAGD 960
             F  +  ++LGF+++  G + +P+K +AI E   P  V +V+   G  +    F+     
Sbjct: 377  RFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRSFLGLASYYRVFIKDFAA 436

Query: 961  KAAPFFTCLK-----------KNSKFQWTEECEQAFTKLKETLATLPVLSK-PTPSVPLV 1008
             A P    LK           K    ++ E    AF +L+  LA+  V+ K P    P  
Sbjct: 437  IARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNILASEDVILKYPDFKKPFD 496

Query: 1009 LYLAVTDKAVSTVLLQEEGKKQKVIYFVSHTLQGAELRYQKIEKAALAILKTARRLRPY- 1067
            L    +   +  VL QE     + I  +S TL+  E  Y   E+  LAI+    +L+ + 
Sbjct: 497  LTTDASASGIGAVLSQE----GRPITMISRTLKQPEQNYATNERELLAIVWALGKLQNFL 552

Query: 1068 FQSFQVKIKTD-VPLRQVLQKPDLSGRLVSWSVELSEYD--IQYEP 1110
            + S ++ I TD  PL   +   + + ++  W   + +++  + Y+P
Sbjct: 553  YGSREINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQHNAKVFYKP 598


>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 679

 Score =  130 bits (326), Expect = 4e-29
 Identities = 100/378 (26%), Positives = 168/378 (43%), Gaps = 18/378 (4%)

Query: 745  PTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYS 804
            P +L N    +K  GK RM  +Y ++NK    D+Y LPN D+L+    G ++ S  D  S
Sbjct: 285  PAFLVNNE-AEKRRGKKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKS 343

Query: 805  GYNQIMMHPSDEESTTFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYV 864
            G+ Q+++       T F   Q +Y +  +PFGLK A + +QR MD+ F +   +   VYV
Sbjct: 344  GFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF-RVFRKFCCVYV 402

Query: 865  DDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPD 924
            DD++V S    DH   +     +   + + L+ +K     +   FLG  +       +  
Sbjct: 403  DDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDE---GTHKP 459

Query: 925  KGRAILEM-KSPTSV---KEVQRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEE 980
            +G  +  + K P ++   K++QR  G +   S ++P       P    LK+N  ++WT+E
Sbjct: 460  QGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWRWTKE 519

Query: 981  CEQAFTKLKETLATLPVLSKPTPSVPLVLYLAVTDK----AVSTVLLQEEGKKQKVIYFV 1036
                  K+K+ L   P L  P P   L++    +D      +  + + E    + +  + 
Sbjct: 520  DTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYA 579

Query: 1037 SHTLQGAELRYQKIEKAALAILKTARRLRPYFQSFQVKIKTD----VPLRQVLQKPDLS- 1091
            S + + AE  Y   +K  LA++ T ++   Y       I+TD         +  K D   
Sbjct: 580  SGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKL 639

Query: 1092 GRLVSWSVELSEYDIQYE 1109
            GR + W   LS Y    E
Sbjct: 640  GRNIRWQAWLSHYSFDVE 657


>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 679

 Score =  129 bits (325), Expect = 5e-29
 Identities = 100/378 (26%), Positives = 168/378 (43%), Gaps = 18/378 (4%)

Query: 745  PTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYS 804
            P +L N    +K  GK RM  +Y ++NK    D+Y LPN D+L+    G ++ S  D  S
Sbjct: 285  PAFLVNNE-AEKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKS 343

Query: 805  GYNQIMMHPSDEESTTFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYV 864
            G+ Q+++       T F   Q +Y +  +PFGLK A + +QR MD+ F +   +   VYV
Sbjct: 344  GFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF-RVFRKFCCVYV 402

Query: 865  DDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPD 924
            DD++V S    DH   +     +   + + L+ +K     +   FLG  +       +  
Sbjct: 403  DDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDE---GTHKP 459

Query: 925  KGRAILEM-KSPTSV---KEVQRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEE 980
            +G  +  + K P ++   K++QR  G +   S ++P       P    LK+N  ++WT+E
Sbjct: 460  QGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKE 519

Query: 981  CEQAFTKLKETLATLPVLSKPTPSVPLVLYLAVTDK----AVSTVLLQEEGKKQKVIYFV 1036
                  K+K+ L   P L  P P   L++    +D      +  + + E    + +  + 
Sbjct: 520  DTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYA 579

Query: 1037 SHTLQGAELRYQKIEKAALAILKTARRLRPYFQSFQVKIKTD----VPLRQVLQKPDLS- 1091
            S + + AE  Y   +K  LA++ T ++   Y       I+TD         +  K D   
Sbjct: 580  SGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKL 639

Query: 1092 GRLVSWSVELSEYDIQYE 1109
            GR + W   LS Y    E
Sbjct: 640  GRNIRWQAWLSHYSFDVE 657


>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 679

 Score =  129 bits (325), Expect = 5e-29
 Identities = 100/378 (26%), Positives = 168/378 (43%), Gaps = 18/378 (4%)

Query: 745  PTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYS 804
            P +L N    +K  GK RM  +Y ++NK    D+Y LPN D+L+    G ++ S  D  S
Sbjct: 285  PAFLVNNE-AEKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKS 343

Query: 805  GYNQIMMHPSDEESTTFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYV 864
            G+ Q+++       T F   Q +Y +  +PFGLK A + +QR MD+ F +   +   VYV
Sbjct: 344  GFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF-RVFRKFCCVYV 402

Query: 865  DDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPD 924
            DD++V S    DH   +     +   + + L+ +K     +   FLG  +       +  
Sbjct: 403  DDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDE---GTHKP 459

Query: 925  KGRAILEM-KSPTSV---KEVQRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEE 980
            +G  +  + K P ++   K++QR  G +   S ++P       P    LK+N  ++WT+E
Sbjct: 460  QGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKE 519

Query: 981  CEQAFTKLKETLATLPVLSKPTPSVPLVLYLAVTDK----AVSTVLLQEEGKKQKVIYFV 1036
                  K+K+ L   P L  P P   L++    +D      +  + + E    + +  + 
Sbjct: 520  DTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYA 579

Query: 1037 SHTLQGAELRYQKIEKAALAILKTARRLRPYFQSFQVKIKTD----VPLRQVLQKPDLS- 1091
            S + + AE  Y   +K  LA++ T ++   Y       I+TD         +  K D   
Sbjct: 580  SGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKL 639

Query: 1092 GRLVSWSVELSEYDIQYE 1109
            GR + W   LS Y    E
Sbjct: 640  GRNIRWQAWLSHYSFDVE 657


>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 674

 Score =  127 bits (319), Expect = 3e-28
 Identities = 99/378 (26%), Positives = 167/378 (43%), Gaps = 18/378 (4%)

Query: 745  PTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYS 804
            P +L N    +K  GK RM  +Y ++NK    D+Y  PN D+L+    G ++ S  D  S
Sbjct: 280  PAFLVNNE-AEKRRGKKRMVVNYKAMNKATVGDAYNPPNKDELLTLIRGKKIFSSFDCKS 338

Query: 805  GYNQIMMHPSDEESTTFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYV 864
            G+ Q+++       T F   Q +Y +  +PFGLK A + +QR MD+ F +   +   VYV
Sbjct: 339  GFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF-RVFRKFCCVYV 397

Query: 865  DDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPD 924
            DD++V S    DH   +     +   + + L+ +K     +   FLG  +       +  
Sbjct: 398  DDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDE---GTHKP 454

Query: 925  KGRAILEM-KSPTSV---KEVQRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEE 980
            +G  +  + K P ++   K++QR  G +   S ++P       P    LK+N  ++WT+E
Sbjct: 455  QGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKE 514

Query: 981  CEQAFTKLKETLATLPVLSKPTPSVPLVLYLAVTDK----AVSTVLLQEEGKKQKVIYFV 1036
                  K+K+ L   P L  P P   L++    +D      +  + + E    + +  + 
Sbjct: 515  DTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYA 574

Query: 1037 SHTLQGAELRYQKIEKAALAILKTARRLRPYFQSFQVKIKTD----VPLRQVLQKPDLS- 1091
            S + + AE  Y   +K  LA++ T ++   Y       I+TD         +  K D   
Sbjct: 575  SGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKL 634

Query: 1092 GRLVSWSVELSEYDIQYE 1109
            GR + W   LS Y    E
Sbjct: 635  GRNIRWQAWLSHYSFDVE 652


>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 680

 Score =  126 bits (316), Expect = 6e-28
 Identities = 98/378 (25%), Positives = 166/378 (42%), Gaps = 18/378 (4%)

Query: 745  PTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYS 804
            P +L N    +   G  RM  +Y ++NK    D+Y LPN D+L+    G ++ S  D  S
Sbjct: 286  PAFLVNNE-AENGRGNKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKS 344

Query: 805  GYNQIMMHPSDEESTTFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYV 864
            G+ Q+++       T F   Q +Y +  +PFGLK A + +QR MD+ F +   +   VYV
Sbjct: 345  GFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF-RVFRKFCCVYV 403

Query: 865  DDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPD 924
            DD++V S    DH   +     +   + + L+ +K     +   FLG  +       +  
Sbjct: 404  DDIVVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDE---GTHKP 460

Query: 925  KGRAILEM-KSPTSV---KEVQRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEE 980
            +G  +  + K P ++   K++QR  G +   S ++P       P    LK+N  ++WT+E
Sbjct: 461  QGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPNLAQMRQPLQAKLKENVPWKWTKE 520

Query: 981  CEQAFTKLKETLATLPVLSKPTPSVPLVLYLAVTDK----AVSTVLLQEEGKKQKVIYFV 1036
                  K+K+ L   P L  P P   L++    +D      +  + + E    + +  + 
Sbjct: 521  DTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYR 580

Query: 1037 SHTLQGAELRYQKIEKAALAILKTARRLRPYFQSFQVKIKTD----VPLRQVLQKPDLS- 1091
            S + + AE  Y   +K  LA++ T ++   Y       I+TD         +  K D   
Sbjct: 581  SGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKL 640

Query: 1092 GRLVSWSVELSEYDIQYE 1109
            GR + W   LS Y    E
Sbjct: 641  GRNIRWQAWLSHYSFDVE 658


>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 659

 Score =  125 bits (315), Expect = 8e-28
 Identities = 100/386 (25%), Positives = 172/386 (43%), Gaps = 16/386 (4%)

Query: 755  KKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPS 814
            ++  GK RM  +Y ++NK    D++ LPN D+L+    G ++ S  D  SG  Q+++   
Sbjct: 276  ERRRGKKRMVVNYKAMNKATKGDAHNLPNKDELLTLVRGKKIYSSFDCKSGLWQVLLDKE 335

Query: 815  DEESTTFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYVDDMIV-KSAR 873
             +  T F   Q +Y +  +PFGLK A + + +      S Q  +   VYVDD++V  +  
Sbjct: 336  SQLLTAFTCPQGHYQWNVVPFGLKQAPSIFPKTYANSHSNQYSKYCCVYVDDILVFSNTG 395

Query: 874  ASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPDKGRAILE-- 931
              +H   +     +     + L+ +K     +   FLG  +  +G     +    ILE  
Sbjct: 396  RKEHYIHVLNILRRCEKLGIILSKKKAQLFKEKINFLGLEI-DQGTHCPQNH---ILEHI 451

Query: 932  MKSPTSV---KEVQRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEECEQAFTKL 988
             K P  +   K++QR  G +   S ++P       P  + LK++S + W +   Q   K+
Sbjct: 452  HKFPDRIEDKKQLQRFLGILTYASDYIPKLASIRKPLQSKLKEDSTWTWNDTDSQYMAKI 511

Query: 989  KETLATLPVLSKPTPSVPLVLYLAVTDKAVSTVLLQEEGKKQKVIYFVSHTLQGAELRYQ 1048
            K+ L + P L  P P+  LV+    +++    +L       + +  + S + + AE  Y 
Sbjct: 512  KKNLKSFPKLYHPEPNDKLVIETDASEEFWGGILKAIHNSHEYICRYASGSFKAAERNYH 571

Query: 1049 KIEKAALAILKTARRLRPYFQSFQVKIKTDVP-----LRQVLQKPDLSGRLVSWSVELSE 1103
              EK  LA+++  ++   Y    +  I+TD       +   L+     GRLV W + LS+
Sbjct: 572  SNEKELLAVIRVIKKFSIYLTPSRFLIRTDNKNFTHFVNINLKGDRKQGRLVRWQMWLSQ 631

Query: 1104 YDIQYEPRGQVTVQSLIDFVAELTPT 1129
            YD   E     T     DF+ E T T
Sbjct: 632  YDFDVEHIAG-TKNVFADFLQENTLT 656


>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic protease
            (EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
            2.7.7.49)]
          Length = 666

 Score =  120 bits (301), Expect = 3e-26
 Identities = 96/371 (25%), Positives = 160/371 (42%), Gaps = 26/371 (7%)

Query: 755  KKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPS 814
            ++  GK RM  +Y ++N+    DS+ LPN+ +L+    G  + S  D  SG+ Q+++   
Sbjct: 287  ERRRGKKRMVVNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKSGFWQVVLDEE 346

Query: 815  DEESTTFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARA 874
             ++ T F   Q ++ +K +PFGLK A + +QR M    +    +   VYVDD+IV S   
Sbjct: 347  SQKLTAFTCPQGHFQWKVVPFGLKQAPSIFQRHMQTALN-GADKFCMVYVDDIIVFSNSE 405

Query: 875  SDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTS----------RGIEVNPD 924
             DH   +      +  Y + L+ +K +   +   FLG  +              I   PD
Sbjct: 406  LDHYNHVYAVLKIVEKYGIILSKKKANLFKEKINFLGLEIDKGTHCPQNHILENIHKFPD 465

Query: 925  KGRAILEMKSPTSVKEVQRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEECEQA 984
            +    LE K     K +QR  G +     ++P   +   P    LKK+  + WT+     
Sbjct: 466  R----LEDK-----KHLQRFLGVLTYAETYIPKLAEIRKPLQVKLKKDVTWNWTQSDSDY 516

Query: 985  FTKLKETLATLPVLSKPTPSVPLVLYLAVTDKAVSTVL-LQEEGKKQKVIYFVSHTLQGA 1043
              K+K+ L + P L  P P   L++    +D     VL  +     + +  + S + + A
Sbjct: 517  VKKIKKNLGSFPKLYLPKPEDHLIIETDASDSFWGGVLKARALDGVELICRYSSGSFKQA 576

Query: 1044 ELRYQKIEKAALAILKTARRLRPYFQSFQVKIKTDVP-----LRQVLQKPDLSGRLVSWS 1098
            E  Y   +K  LA+ +   +   Y    +  ++TD       LR  L+     GRLV W 
Sbjct: 577  EKNYHSNDKELLAVKQVITKFSAYLTPVRFTVRTDNKNFTYFLRINLKGDSKQGRLVRWQ 636

Query: 1099 VELSEYDIQYE 1109
               S+Y    E
Sbjct: 637  NWFSKYQFDVE 647


>YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II
          Length = 1268

 Score =  118 bits (296), Expect = 1e-25
 Identities = 76/251 (30%), Positives = 128/251 (50%), Gaps = 7/251 (2%)

Query: 708 ATPVIQPRRRMSEEKNKAVQLETEKLIKARFIREVQYPTWLANVVMVKK-ANGKWRMCTD 766
           A PV +  R +     +AV+ E  +L +   I  + Y  W A +V++KK   GK R+C D
Sbjct: 438 AVPVFKRARPVPYGSLEAVETELNRLQEMGVIVPITYAKWAAPIVVIKKKGTGKIRVCAD 497

Query: 767 Y--TSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTTFMTN 824
           +  + LN     + +PLP  + +     G  + S +D    Y Q+ +    ++     T+
Sbjct: 498 FKCSGLNAALKDEFHPLPTSEDIFSRLKGT-VYSQIDLKDAYLQVELDEEAQKLAVINTH 556

Query: 825 QANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDLKEA 884
           +  + Y  M FGLK A A++Q++MDK+ S   G  + VY DD+I+ ++   +H   L+E 
Sbjct: 557 RGIFKYLRMTFGLKPAPASFQKIMDKMVSGLTG--VAVYWDDIIISASSIEEHEKILREL 614

Query: 885 FDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRL 944
           F++ + Y  +++ EKC+F  +   FLGF +   G   +  K  AI  MK+PT  K++   
Sbjct: 615 FERFKEYGFRVSAEKCAFAQKQVTFLGF-VDEHGRRPDSKKTEAIRSMKAPTDQKQLASF 673

Query: 945 TGRMAALSRFL 955
            G    LSR +
Sbjct: 674 LGAADWLSRMM 684


>POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse
            transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
            (RT); Integrase (IN)]
          Length = 886

 Score =  106 bits (264), Expect = 6e-22
 Identities = 98/438 (22%), Positives = 176/438 (39%), Gaps = 28/438 (6%)

Query: 751  VVMVKKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYSGYNQIM 810
            V  V K +G+WRM  DY  +NK  P  +    +   ++      +  + +D  +G+    
Sbjct: 5    VYPVPKPDGRWRMVLDYREVNKTIPLTAAQNQHSAGILATIVRQKYKTTLDLANGF---W 61

Query: 811  MHPSDEES---TTFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYVDDM 867
             HP   ES   T F      YC+  +P G  N+ A +    D +   +   N++VYVDD+
Sbjct: 62   AHPITPESYWLTAFTWQGKQYCWTRLPQGFLNSPALFTA--DVVDLLKEIPNVQVYVDDI 119

Query: 868  IVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPDKGR 927
             +      +H   L++ F  L      ++ +K   G +  +FLGF +T  G  +      
Sbjct: 120  YLSHDDPKEHVQQLEKVFQILLQAGYVVSLKKSEIGQKTVEFLGFNITKEGRGLTDTFKT 179

Query: 928  AILEMKSPTSVKEVQRLTGRMAALSRFLPMAGDKAAPFFTCL--KKNSKFQWTEECEQAF 985
             +L +  P  +K++Q + G +     F+P   +   P +  +   K    +W+EE  +  
Sbjct: 180  KLLNITPPKDLKQLQSILGLLNFARNFIPNFAELVQPLYNLIASAKGKYIEWSEENTKQL 239

Query: 986  TKLKETLATLPVLSKPTPSVPLVLYLAVTDKAVSTVLLQEEGKKQKVIYFVSHTLQGAEL 1045
              + E L T   L +  P   LV+ +  +  A       E GKK   I ++++    AEL
Sbjct: 240  NMVIEALNTASNLEERLPEQRLVIKVNTSPSAGYVRYYNETGKKP--IMYLNYVFSKAEL 297

Query: 1046 RYQKIEKAALAILKTARRLRPYFQSFQVKIKTDVPLRQVLQKPDLSG------RLVSWSV 1099
            ++  +EK    + K   +        ++ + + +     +QK  L        R ++W  
Sbjct: 298  KFSMLEKLLTTMHKALIKAMDLAMGQEILVYSPIVSMTKIQKTPLPERKALPIRWITWMT 357

Query: 1100 ELSEYDIQYE-PRGQVTVQSLIDFVAELTPTEGEKTQGEWVLSVDGS---------SNNT 1149
             L +  IQ+   +    ++ + D            +Q E V   DGS         SNN 
Sbjct: 358  YLEDPRIQFHYDKTLPELKHIPDVYTSSQSPVKHPSQYEGVFYTDGSAIKSPDPTKSNNA 417

Query: 1150 GSGAGITIESPDKMIIEQ 1167
            G G       P+  ++ Q
Sbjct: 418  GMGIVHATYKPEYQVLNQ 435


>POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
            Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
            3.1.26.4) (RT); Integrase (IN)]
          Length = 1165

 Score =  101 bits (252), Expect = 2e-20
 Identities = 95/467 (20%), Positives = 189/467 (40%), Gaps = 19/467 (4%)

Query: 680  LDLF--AWTINDVPGIDPKVITHKLAIRPGATPVIQPRRRMSEEKNKAVQLETEKLIKAR 737
            L LF   W      G+  +V    + +R GA+PV   +  MS+E  + ++   +K +   
Sbjct: 143  LQLFPTVWAERAGMGLANQVPPVVVELRSGASPVAVRQYPMSKEAREGIRPHIQKFLDLG 202

Query: 738  FIREVQYPTWLANVVMVKK-ANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNEL 796
             +   + P W   ++ VKK     +R   D   +NK        +PN   L+     +  
Sbjct: 203  VLVPCRSP-WNTPLLPVKKPGTNDYRPVQDLREINKRVQDIHPTVPNPYNLLSSLPPSYT 261

Query: 797  -LSLMDAYSGYNQIMMHPSDEESTTF------MTNQANYCYKTMPFGLKNAGATYQRLMD 849
              S++D    +  + +HP+ +    F        N     +  +P G KN+   +   + 
Sbjct: 262  WYSVLDLKDAFFCLRLHPNSQPLFAFEWKDPEKGNTGQLTWTRLPQGFKNSPTLFDEALH 321

Query: 850  KIFSKQVGRNMEV----YVDDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQ 905
            +  +     N +V    YVDD++V +    D     ++   +L     +++ +K     +
Sbjct: 322  RDLAPFRALNPQVVLLQYVDDLLVAAPTYEDCKKGTQKLLQELSKLGYRVSAKKAQLCQR 381

Query: 906  GGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRLTGRMAALSRFLPMAGDKAAPF 965
               +LG++L      + P +   ++++  PT+ ++V+   G       ++P     AAP 
Sbjct: 382  EVTYLGYLLKEGKRWLTPARKATVMKIPVPTTPRQVREFLGTAGFCRLWIPGFASLAAPL 441

Query: 966  FTCLKKNSKFQWTEECEQAFTKLKETLATLPVLSKPTPSVPLVLYLAVTDKAVSTVLLQE 1025
            +   K++  F WTEE +QAF  +K+ L + P L+ P  + P  LY+         VL Q 
Sbjct: 442  YPLTKESIPFIWTEEHQQAFDHIKKALLSAPALALPDLTKPFTLYIDERAGVARGVLTQT 501

Query: 1026 EGKKQKVIYFVSHTLQGAELRYQKIEKAALAILKTARRLRPYFQSFQVKIKTDVPLRQVL 1085
             G  ++ + ++S  L      +    KA  A+    +          V +     L  ++
Sbjct: 502  LGPWRRPVAYLSKKLDPVASGWPTCLKAVAAVALLLKDADKLTLGQNVTVIASHSLESIV 561

Query: 1086 QKPD----LSGRLVSWSVELSEYDIQYEPRGQVTVQSLIDFVAELTP 1128
            ++P      + R+  +   L    + + P   +   +L+   +E TP
Sbjct: 562  RQPPDRWMTNARMTHYQSLLLNERVSFAPPAVLNPATLLPVESEATP 608


>POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
            Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
            3.1.26.4) (RT); Integrase (IN)]
          Length = 1161

 Score =  100 bits (250), Expect = 3e-20
 Identities = 105/492 (21%), Positives = 198/492 (39%), Gaps = 35/492 (7%)

Query: 697  VITHKLAIRPGATPVIQPRRRMSEEKNKAVQLETEKLIKARFIREVQYPTWLANVVMVKK 756
            + T  LA RP     I P+ + S      +Q+  + L+K   + + Q  T    V  V K
Sbjct: 167  IATGTLAPRPQKQYPINPKAKPS------IQIVIDDLLKQGVLIQ-QNSTMNTPVYPVPK 219

Query: 757  ANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDE 816
             +GKWRM  DY  +NK  P  +    +   ++      +  + +D  +G+     HP   
Sbjct: 220  PDGKWRMVLDYREVNKTIPLIAAQNQHSAGILSSIYRGKYKTTLDLTNGF---WAHPITP 276

Query: 817  ES---TTFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYVDDMIVKSAR 873
            ES   T F      YC+  +P G  N+ A +    D +   +   N++ YVDD+ +    
Sbjct: 277  ESYWLTAFTWQGKQYCWTRLPQGFLNSPALFTA--DVVDLLKEIPNVQAYVDDIYISHDD 334

Query: 874  ASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPDKGRAILEMK 933
              +H   L++ F  L      ++ +K     +  +FLGF +T  G  +     + +L + 
Sbjct: 335  PQEHLEQLEKIFSILLNAGYVVSLKKSEIAQREVEFLGFNITKEGRGLTDTFKQKLLNIT 394

Query: 934  SPTSVKEVQRLTGRMAALSRFLPMAGDKAAPFFTCL-KKNSKF-QWTEECEQAFTKLKET 991
             P  +K++Q + G +     F+P   +   P +T +   N KF  WTE+       +   
Sbjct: 395  PPKDLKQLQSILGLLNFARNFIPNYSELVKPLYTIVANANGKFISWTEDNSNQLQHIISV 454

Query: 992  LATLPVLSKPTPSVPLVLYLAVTDKAVSTVLLQEEGKKQKVIYFVSHTLQGAELRYQKIE 1051
            L     L +  P   L++ +  +  A   +    EG K+ ++Y V++    AE ++ + E
Sbjct: 455  LNQADNLEERNPETRLIIKVNSSPSA-GYIRYYNEGSKRPIMY-VNYIFSKAEAKFTQTE 512

Query: 1052 KAALAILKTARRLRPYFQSFQVKIKTDVPLRQVLQKPDLSG------RLVSWSVELSEYD 1105
            K    + K   +        ++ + + +     +Q+  L        R ++W   L +  
Sbjct: 513  KLLTTMHKGLIKAMDLAMGQEILVYSPIVSMTKIQRTPLPERKALPVRWITWMTYLEDPR 572

Query: 1106 IQYE-PRGQVTVQSLIDFVAELTPTEGEKTQGEWVLSVDGS---------SNNTGSGAGI 1155
            IQ+   +    +Q + +   ++       ++   V   DGS         S++ G G   
Sbjct: 573  IQFHYDKSLPELQQIPNVTEDVIAKTKHPSEFAMVFYTDGSAIKHPDVNKSHSAGMGIAQ 632

Query: 1156 TIESPDKMIIEQ 1167
                P+  I+ Q
Sbjct: 633  VQFIPEYKIVHQ 644


  Database: sprot
    Posted date:  Nov 25, 2004 10:54 AM
  Number of letters in database: 59,974,054
  Number of sequences in database:  164,201
  
Lambda     K      H
   0.324    0.139    0.418 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 196,268,927
Number of Sequences: 164201
Number of extensions: 8668786
Number of successful extensions: 31774
Number of sequences better than 10.0: 168
Number of HSP's better than 10.0 without gapping: 59
Number of HSP's successfully gapped in prelim test: 112
Number of HSP's that attempted gapping in prelim test: 31468
Number of HSP's gapped (non-prelim): 325
length of query: 1706
length of database: 59,974,054
effective HSP length: 124
effective length of query: 1582
effective length of database: 39,613,130
effective search space: 62667971660
effective search space used: 62667971660
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 73 (32.7 bits)


Lotus: description of TM0046b.4