
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0039.4
(1435 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 516 e-145
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 513 e-144
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 513 e-144
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 377 e-103
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 350 2e-95
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 342 6e-93
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 340 1e-92
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 305 6e-82
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 285 8e-76
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 218 7e-56
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 218 1e-55
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 217 2e-55
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 215 6e-55
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot... 210 2e-53
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 209 3e-53
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot... 195 9e-49
POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;... 179 6e-44
POL_FENV1 (P31792) Pol polyprotein [Contains: Reverse transcript... 174 2e-42
POL_BAEVM (P10272) Pol polyprotein [Contains: Protease (EC 3.4.2... 161 1e-38
POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC 3.4.23... 161 1e-38
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 516 bits (1328), Expect = e-145
Identities = 310/916 (33%), Positives = 493/916 (52%), Gaps = 38/916 (4%)
Query: 467 KRPAMEDIPVVREFPEVFPEDMTE-LP-PEREVEFAIDVIPGTTPISAAPYRISPLELAE 524
K P + DI +EF ++ E TE LP P + +EF +++ + Y + P ++
Sbjct: 370 KEPELPDI--YKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRNYPLPPGKMQA 427
Query: 525 LQKQVEELLSKGFIRPSVSPWGAPVLLVKKKDGSMRLCVDYRQLNKVTIKNRYPLPRIDD 584
+ ++ + L G IR S + PV+ V KK+G++R+ VDY+ LNK N YPLP I+
Sbjct: 428 MNDEINQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQ 487
Query: 585 LMDQLKGARVFSKIDLRSGYHQIRVKSDDVQKTAFRTRYGHYEYLVMPFGVTNAPAIFMD 644
L+ +++G+ +F+K+DL+S YH IRV+ D K AFR G +EYLVMP+G++ APA F
Sbjct: 488 LLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQY 547
Query: 645 YMNRIFHPYLDKFVIVFIDDILIYSKSKEEHVEHMQVVLKVLKDRKLYAKLSKCEFWLEQ 704
++N I + V+ ++DDILI+SKS+ EHV+H++ VL+ LK+ L +KCEF Q
Sbjct: 548 FINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQ 607
Query: 705 VQFLGHVVSEDGIAVDPAKVEAVNSWKVPETVTGVRSFLGLAGYYRRFIEGFSKIATPLT 764
V+F+G+ +SE G ++ V WK P+ +R FLG Y R+FI S++ PL
Sbjct: 608 VKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLN 667
Query: 765 QLTKKDHPFVWTEKCE*SFQTLKERLTKAPVLTLPDPSKDYDVYCDASKSGLGCVLMQER 824
L KKD + WT + + +K+ L PVL D SK + DAS +G VL Q+
Sbjct: 668 NLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKH 727
Query: 825 K-----VIAYASQQLRPHEQNYPTHDMELAAVVFALKIWRHYLYGV--KFTIYSDHQSL- 876
+ Y S ++ + NY D E+ A++ +LK WRHYL F I +DH++L
Sbjct: 728 DDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLI 787
Query: 877 -KYLFDQKTLNMRQRRWVEFLEDYDFKLQYHPGKANVVADALSRKSLHAARLMIEETELI 935
+ + + N R RW FL+D++F++ Y PG AN +ADALSR +++ETE I
Sbjct: 788 GRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSR--------IVDETEPI 839
Query: 936 EKFRDMNLIMETLPQGTRLGTLTLTNEFIEEVKKEQARDENLQKEAHGRDSMSRPDFLKG 995
K + N I + +++T++F +V E D L + D +
Sbjct: 840 PKDSEDNSI-------NFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLK 892
Query: 996 PDGLWRYQGRLCVPEGGELRQKILEEGHKSDFSIHPGTTKMYQDLKKMFWWPGMKKDIMK 1055
L + ++ +P +L + I+++ H+ IHPG + + + F W G++K I +
Sbjct: 893 DGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQE 952
Query: 1056 KVTSCLTCQKVKGEHQKPSGSLQPLSIPEWKWEGISMDFVSGLPRTTTGHDAIWVIVDRL 1115
V +C TCQ K + KP G LQP+ E WE +SMDF++ LP ++G++A++V+VDR
Sbjct: 953 YVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPE-SSGYNALFVVVDRF 1011
Query: 1116 TKSAHFIAVNMTFPSEKLARIYVKEIVRLHGVPANIVSDRDPRFVSKFWGSLHEALGTRL 1175
+K A + + +E+ AR++ + ++ G P I++D D F S+ W +
Sbjct: 1012 SKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVM 1071
Query: 1176 SLSSAYHPQSDGQSERTIQTLEDMLRACVLDYKGSWEDFLPLAEFSYNNSYHSSLGMAPF 1235
S Y PQ+DGQ+ERT QT+E +LR + +W D + L + SYNN+ HS+ M PF
Sbjct: 1072 KFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPF 1131
Query: 1236 EALYG-RRCKTPLCWLSGEDKITLGPELLQEMTEKVRSIREKLRIAQDRQKSYYDKRHKP 1294
E ++ +PL S DK E QE + ++++E L + K Y+D + +
Sbjct: 1132 EIVHRYSPALSPLELPSFSDKT---DENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQE 1188
Query: 1295 L-EFQEGDHVFLRVTPITGVGRSIHSKKLTPKYLGPYQILDRIGAVAYRIALPPSLSNL- 1352
+ EFQ GD V ++ T G S KL P + GP+ +L + G Y + LP S+ ++
Sbjct: 1189 IEEFQPGDLVMVKRTK---TGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMF 1245
Query: 1353 HDVFHISQLRKYLPDS 1368
FH+S L KY +S
Sbjct: 1246 SSTFHVSHLEKYRHNS 1261
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 513 bits (1321), Expect = e-144
Identities = 309/916 (33%), Positives = 493/916 (53%), Gaps = 38/916 (4%)
Query: 467 KRPAMEDIPVVREFPEVFPEDMTE-LP-PEREVEFAIDVIPGTTPISAAPYRISPLELAE 524
K P + DI +EF ++ E TE LP P + +EF +++ + Y + P ++
Sbjct: 370 KEPELPDI--YKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRNYPLPPGKMQA 427
Query: 525 LQKQVEELLSKGFIRPSVSPWGAPVLLVKKKDGSMRLCVDYRQLNKVTIKNRYPLPRIDD 584
+ ++ + L G IR S + PV+ V KK+G++R+ VDY+ LNK N YPLP I+
Sbjct: 428 MNDEINQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQ 487
Query: 585 LMDQLKGARVFSKIDLRSGYHQIRVKSDDVQKTAFRTRYGHYEYLVMPFGVTNAPAIFMD 644
L+ +++G+ +F+K+DL+S YH IRV+ D K AFR G +EYLVMP+G++ APA F
Sbjct: 488 LLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAPAHFQY 547
Query: 645 YMNRIFHPYLDKFVIVFIDDILIYSKSKEEHVEHMQVVLKVLKDRKLYAKLSKCEFWLEQ 704
++N I + V+ ++D+ILI+SKS+ EHV+H++ VL+ LK+ L +KCEF Q
Sbjct: 548 FINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQ 607
Query: 705 VQFLGHVVSEDGIAVDPAKVEAVNSWKVPETVTGVRSFLGLAGYYRRFIEGFSKIATPLT 764
V+F+G+ +SE G ++ V WK P+ +R FLG Y R+FI S++ PL
Sbjct: 608 VKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLN 667
Query: 765 QLTKKDHPFVWTEKCE*SFQTLKERLTKAPVLTLPDPSKDYDVYCDASKSGLGCVLMQER 824
L KKD + WT + + +K+ L PVL D SK + DAS +G VL Q+
Sbjct: 668 NLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKH 727
Query: 825 K-----VIAYASQQLRPHEQNYPTHDMELAAVVFALKIWRHYLYGV--KFTIYSDHQSL- 876
+ Y S ++ + NY D E+ A++ +LK WRHYL F I +DH++L
Sbjct: 728 DDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLI 787
Query: 877 -KYLFDQKTLNMRQRRWVEFLEDYDFKLQYHPGKANVVADALSRKSLHAARLMIEETELI 935
+ + + N R RW FL+D++F++ Y PG AN +ADALSR +++ETE I
Sbjct: 788 GRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSR--------IVDETEPI 839
Query: 936 EKFRDMNLIMETLPQGTRLGTLTLTNEFIEEVKKEQARDENLQKEAHGRDSMSRPDFLKG 995
K + N I + +++T++F +V E D L + D +
Sbjct: 840 PKDSEDNSI-------NFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLK 892
Query: 996 PDGLWRYQGRLCVPEGGELRQKILEEGHKSDFSIHPGTTKMYQDLKKMFWWPGMKKDIMK 1055
L + ++ +P +L + I+++ H+ IHPG + + + F W G++K I +
Sbjct: 893 DGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQE 952
Query: 1056 KVTSCLTCQKVKGEHQKPSGSLQPLSIPEWKWEGISMDFVSGLPRTTTGHDAIWVIVDRL 1115
V +C TCQ K + KP G LQP+ E WE +SMDF++ LP ++G++A++V+VDR
Sbjct: 953 YVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPE-SSGYNALFVVVDRF 1011
Query: 1116 TKSAHFIAVNMTFPSEKLARIYVKEIVRLHGVPANIVSDRDPRFVSKFWGSLHEALGTRL 1175
+K A + + +E+ AR++ + ++ G P I++D D F S+ W +
Sbjct: 1012 SKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVM 1071
Query: 1176 SLSSAYHPQSDGQSERTIQTLEDMLRACVLDYKGSWEDFLPLAEFSYNNSYHSSLGMAPF 1235
S Y PQ+DGQ+ERT QT+E +LR + +W D + L + SYNN+ HS+ M PF
Sbjct: 1072 KFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPF 1131
Query: 1236 EALYG-RRCKTPLCWLSGEDKITLGPELLQEMTEKVRSIREKLRIAQDRQKSYYDKRHKP 1294
E ++ +PL S DK E QE + ++++E L + K Y+D + +
Sbjct: 1132 EIVHRYSPALSPLELPSFSDKT---DENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQE 1188
Query: 1295 L-EFQEGDHVFLRVTPITGVGRSIHSKKLTPKYLGPYQILDRIGAVAYRIALPPSLSNL- 1352
+ EFQ GD V ++ T G S KL P + GP+ +L + G Y + LP S+ ++
Sbjct: 1189 IEEFQPGDLVMVKRTK---TGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMF 1245
Query: 1353 HDVFHISQLRKYLPDS 1368
FH+S L KY +S
Sbjct: 1246 SSTFHVSHLEKYRHNS 1261
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 513 bits (1321), Expect = e-144
Identities = 309/916 (33%), Positives = 493/916 (53%), Gaps = 38/916 (4%)
Query: 467 KRPAMEDIPVVREFPEVFPEDMTE-LP-PEREVEFAIDVIPGTTPISAAPYRISPLELAE 524
K P + DI +EF ++ E TE LP P + +EF +++ + Y + P ++
Sbjct: 370 KEPELPDI--YKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRNYPLPPGKMQA 427
Query: 525 LQKQVEELLSKGFIRPSVSPWGAPVLLVKKKDGSMRLCVDYRQLNKVTIKNRYPLPRIDD 584
+ ++ + L G IR S + PV+ V KK+G++R+ VDY+ LNK N YPLP I+
Sbjct: 428 MNDEINQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQ 487
Query: 585 LMDQLKGARVFSKIDLRSGYHQIRVKSDDVQKTAFRTRYGHYEYLVMPFGVTNAPAIFMD 644
L+ +++G+ +F+K+DL+S YH IRV+ D K AFR G +EYLVMP+G++ APA F
Sbjct: 488 LLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAPAHFQY 547
Query: 645 YMNRIFHPYLDKFVIVFIDDILIYSKSKEEHVEHMQVVLKVLKDRKLYAKLSKCEFWLEQ 704
++N I + V+ ++D+ILI+SKS+ EHV+H++ VL+ LK+ L +KCEF Q
Sbjct: 548 FINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQ 607
Query: 705 VQFLGHVVSEDGIAVDPAKVEAVNSWKVPETVTGVRSFLGLAGYYRRFIEGFSKIATPLT 764
V+F+G+ +SE G ++ V WK P+ +R FLG Y R+FI S++ PL
Sbjct: 608 VKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLN 667
Query: 765 QLTKKDHPFVWTEKCE*SFQTLKERLTKAPVLTLPDPSKDYDVYCDASKSGLGCVLMQER 824
L KKD + WT + + +K+ L PVL D SK + DAS +G VL Q+
Sbjct: 668 NLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKH 727
Query: 825 K-----VIAYASQQLRPHEQNYPTHDMELAAVVFALKIWRHYLYGV--KFTIYSDHQSL- 876
+ Y S ++ + NY D E+ A++ +LK WRHYL F I +DH++L
Sbjct: 728 DDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLI 787
Query: 877 -KYLFDQKTLNMRQRRWVEFLEDYDFKLQYHPGKANVVADALSRKSLHAARLMIEETELI 935
+ + + N R RW FL+D++F++ Y PG AN +ADALSR +++ETE I
Sbjct: 788 GRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSR--------IVDETEPI 839
Query: 936 EKFRDMNLIMETLPQGTRLGTLTLTNEFIEEVKKEQARDENLQKEAHGRDSMSRPDFLKG 995
K + N I + +++T++F +V E D L + D +
Sbjct: 840 PKDSEDNSI-------NFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLK 892
Query: 996 PDGLWRYQGRLCVPEGGELRQKILEEGHKSDFSIHPGTTKMYQDLKKMFWWPGMKKDIMK 1055
L + ++ +P +L + I+++ H+ IHPG + + + F W G++K I +
Sbjct: 893 DGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQE 952
Query: 1056 KVTSCLTCQKVKGEHQKPSGSLQPLSIPEWKWEGISMDFVSGLPRTTTGHDAIWVIVDRL 1115
V +C TCQ K + KP G LQP+ E WE +SMDF++ LP ++G++A++V+VDR
Sbjct: 953 YVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPE-SSGYNALFVVVDRF 1011
Query: 1116 TKSAHFIAVNMTFPSEKLARIYVKEIVRLHGVPANIVSDRDPRFVSKFWGSLHEALGTRL 1175
+K A + + +E+ AR++ + ++ G P I++D D F S+ W +
Sbjct: 1012 SKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVM 1071
Query: 1176 SLSSAYHPQSDGQSERTIQTLEDMLRACVLDYKGSWEDFLPLAEFSYNNSYHSSLGMAPF 1235
S Y PQ+DGQ+ERT QT+E +LR + +W D + L + SYNN+ HS+ M PF
Sbjct: 1072 KFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPF 1131
Query: 1236 EALYG-RRCKTPLCWLSGEDKITLGPELLQEMTEKVRSIREKLRIAQDRQKSYYDKRHKP 1294
E ++ +PL S DK E QE + ++++E L + K Y+D + +
Sbjct: 1132 EIVHRYSPALSPLELPSFSDKT---DENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQE 1188
Query: 1295 L-EFQEGDHVFLRVTPITGVGRSIHSKKLTPKYLGPYQILDRIGAVAYRIALPPSLSNL- 1352
+ EFQ GD V ++ T G S KL P + GP+ +L + G Y + LP S+ ++
Sbjct: 1189 IEEFQPGDLVMVKRTK---TGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMF 1245
Query: 1353 HDVFHISQLRKYLPDS 1368
FH+S L KY +S
Sbjct: 1246 SSTFHVSHLEKYRHNS 1261
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 377 bits (967), Expect = e-103
Identities = 267/896 (29%), Positives = 448/896 (49%), Gaps = 56/896 (6%)
Query: 476 VVREFPEVFPEDMTELPPEREVEFAIDVIPGTTPISAAPYRISPLELAELQKQVEELLSK 535
V+ +F +VF EL E I++ G PI P I E++K ++++L++
Sbjct: 909 VIEQFQDVFAISDDELGRNSGTECVIELKEGAEPIRQKPRPIPLALKPEIRKMIQKMLNQ 968
Query: 536 GFIRPSVSPWGAPVLLVKKKDGSMRLCVDYRQLNKVTIKNRYPLPRIDDLMDQLKGARVF 595
IR S SPW +PV+LVKKKDGS+R+C+DYR++NKV N +PLP I+ + L G +++
Sbjct: 969 KVIRESKSPWSSPVVLVKKKDGSIRMCIDYRKVNKVVKNNAHPLPNIEATLQSLAGKKLY 1028
Query: 596 SKIDLRSGYHQIRVKSDDVQKTAFRTRYGHYEYLVMPFGVTNAPAIFMDYMNRIFHPYLD 655
+ D+ +G+ QI + + TAF +E+ V+PFG+ +PA+F M I L
Sbjct: 1029 TVFDMIAGFWQIPLDEKSKEITAFAIGSELFEWNVLPFGLVISPALFQGTMEEIIGDLLG 1088
Query: 656 KFVIVFIDDILIYSKSKEEHVEHMQVVLKVLKDRKLYAKLSKCEFWLEQVQFLGHVVSED 715
V++DD+LI SK E+H++ ++ L ++ + + SKC ++V++LGH V+ D
Sbjct: 1089 VCAFVYVDDLLIASKDMEQHLQDVKEALTRIRKSGMKLRASKCHIAKKEVEYLGHKVTLD 1148
Query: 716 GIAVDPAKVEAVNSWKVPETVTGVRSFLGLAGYYRRFIEGFSKIATPLTQLTKKDHPFVW 775
G+ K + + + P V ++SFLGL GYYR+FI F++IA+ LT L ++W
Sbjct: 1149 GVETQEVKTDKMKQFSRPTNVKELQSFLGLVGYYRKFILNFAQIASSLTSLISAKVAWIW 1208
Query: 776 TEKCE*SFQTLKERLTKAPVLTLPD------PSKDYDVYCDASKSGLGCVLMQE-----R 824
++ E +FQ LK+ + + PVL PD + + +Y DAS+ G+G VL QE +
Sbjct: 1209 EKEQEIAFQELKKLVCQTPVLAQPDVEAALKGDRPFMIYTDASRKGIGAVLAQEGPDGQQ 1268
Query: 825 KVIAYASQQLRPHEQNYPTHDMELAAVVFALKIWRHYLYGVKFTIYSDHQSLKYLFDQKT 884
IA+AS+ L P E Y D+E A++FAL+ ++ +YG T+++DH+ L L
Sbjct: 1269 HPIAFASKALSPAETRYHITDLEALAMMFALRRFKTIIYGTAITVFTDHKPLISLLKGSP 1328
Query: 885 LNMRQRRWVEFLEDYDFKLQYHPGKANVVADALSRKSLHAARLMIEETELIEKFRDMNLI 944
L R RW + ++D K+ Y GKAN VADALSR L E+T+ + +N I
Sbjct: 1329 LADRLWRWSIEILEFDVKIVYLAGKANAVADALSRGGCPPNELEEEQTKELTSI--VNAI 1386
Query: 945 METLP----QGTRLGTLTLTNEFIEEV--KKEQARDENLQKEAHGRDSMSRPDFLKGPDG 998
LP L L +E +EV E + + K G +S ++ K G
Sbjct: 1387 QTELPDILDSSCWLERLKGEDEGWKEVIAALEGGKTKGTFKIV-GIESEISLEYYKIVGG 1445
Query: 999 LWR-----YQGRLCVPEGGELRQKILEEGHKSDFSIHPGTTKMYQDLKKMFWWPGMKKDI 1053
+ + Q R VPE ++R +L+E H+ + H G KM++ + + F+WP M+ +
Sbjct: 1446 VLKNTEIEEQSRSVVPE--KIRTPLLKELHEGMLAGHFGIKKMWRMVHRKFYWPQMRVCV 1503
Query: 1054 MKKVTSCLTCQKVKGEHQKPSGSLQPLSIPEWKWEGISMDFVSGLPRTTTGHDAIWVIVD 1113
V +C C +H K + SL P + + E ++ D + + + G+ I I+D
Sbjct: 1504 ENCVRTCAKC-LCANDHSKLTSSLTPYRM-TFPLEIVACDLMD-VGLSVQGNRYILTIID 1560
Query: 1114 RLTKSAHFIAVNMTFPSEKLARIYVKEIVRLHG-----VPANIVSDRDPRFVSKFWGSLH 1168
TK + + +K A +K V +P +++D+ FV+ +
Sbjct: 1561 LFTKYGTAVPI-----PDKKAETVLKAFVERWAIGEGRIPLKLLTDQGKEFVNGLFAQFT 1615
Query: 1169 EALGTRLSLSSAYHPQSDGQSERTIQTLEDMLRACVLDYKGSWEDFLPLAEFSYNNSYHS 1228
L + Y+ +++G ER +T+ +++ W+D + A ++YNN H
Sbjct: 1616 HMLKIEHITTKGYNSRANGAVERFNKTIMHIMKKKTA-VPMEWDDQVVYAVYAYNNCVHE 1674
Query: 1229 SLGMAPFEALYGRRCKTPLCWLSGEDKITLGPE--------LLQEMTEKVRSIREKLRIA 1280
+ G P ++GR PL +SGED + + L QE+ + + +E
Sbjct: 1675 NTGETPMFLMHGRDVMGPL-EMSGEDAVGINYADMDEYKHLLTQELLKVQKIAKEHAMRE 1733
Query: 1281 QDRQKSYYDKRHKPLEF---QEGDHVFLRVTPITGVGRSIHSKKLTPKYLGPYQIL 1333
Q+ KS +D+++ + Q G V L + P +G KL K+ GPY+++
Sbjct: 1734 QESYKSLFDQKYASKKHRFPQPGSRVLLEI-PSEKLG--AQCPKLVNKWSGPYRVI 1786
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 350 bits (898), Expect = 2e-95
Identities = 255/776 (32%), Positives = 388/776 (49%), Gaps = 70/776 (9%)
Query: 508 TPISAAPYRISPLELAELQKQVEELLSKGFIRPSVSPWGAPVLLVKKKDGSM-----RLC 562
+PI + Y ++ E++ QV+E+L++G IR S SP+ +P +V KK + R+
Sbjct: 205 SPIYSKQYPLAQTHEIEVENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVV 264
Query: 563 VDYRQLNKVTIKNRYPLPRIDDLMDQLKGARVFSKIDLRSGYHQIRVKSDDVQKTAFRTR 622
+DYR+LN++TI +RYP+P +D+++ +L + F+ IDL G+HQI + + + KTAF T+
Sbjct: 265 IDYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTK 324
Query: 623 YGHYEYLVMPFGVTNAPAIFMDYMNRIFHPYLDKFVIVFIDDILIYSKSKEEHVEHMQVV 682
GHYEYL MPFG+ NAPA F MN I P L+K +V++DDI+I+S S EH+ +Q+V
Sbjct: 325 SGHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLV 384
Query: 683 LKVLKDRKLYAKLSKCEFWLEQVQFLGHVVSEDGIAVDPAKVEAVNSWKVPETVTGVRSF 742
L D L +L KCEF ++ FLGH+V+ DGI +P KV+A+ S+ +P +R+F
Sbjct: 385 FTKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAF 444
Query: 743 LGLAGYYRRFIEGFSKIATPLTQLTKKDHPFVWTEKCE--*SFQTLKERLTKAPVLTLPD 800
LGL GYYR+FI ++ IA P+T KK + T+K E +F+ LK + + P+L LPD
Sbjct: 445 LGLTGYYRKFIPNYADIAKPMTSCLKK-RTKIDTQKLEYIEAFEKLKALIIRDPILQLPD 503
Query: 801 PSKDYDVYCDASKSGLGCVLMQERKVIAYASQQLRPHEQNYPTHDMELAAVVFALKIWRH 860
K + + DAS LG VL Q I++ S+ L HE NY + EL A+V+A K +RH
Sbjct: 504 FEKKFVLTTDASNLALGAVLSQNGHPISFISRTLNDHELNYSAIEKELLAIVWATKTFRH 563
Query: 861 YLYGVKFTIYSDHQSLKYLFDQKTLNMRQRRWVEFLEDYDFKLQYHPGKANVVADALSRK 920
YL G +F I SDHQ L++L + K + RW L +Y FK+ Y GK N VADALSR
Sbjct: 564 YLLGRQFLIASDHQPLRWLHNLKEPGAKLERWRVRLSEYQFKIDYIKGKENSVADALSRI 623
Query: 921 SL----------HAAR----LMIEETEL-IEKFR--------DMNLIMETLPQGTRLGTL 957
+ H+A +I TE I F+ D N + + G + T+
Sbjct: 624 KIEENHHSEATQHSAEEDNSNLIHLTEKPINYFKKQIIFIKSDKNKVEHSKIFGNSITTI 683
Query: 958 TLTNEFIEEVKK---------------EQARDENLQKEAHGRDSMSRPDFLKGPDGLWRY 1002
+E+ K+ E D + + AH + + K L+
Sbjct: 684 QYDVMTLEKAKQILLDHFIHRNITIYIESDVDFEIVQRAH--IEIVNTTYTKVIRSLFLL 741
Query: 1003 QGRLCVPEGGELRQKILEEGHKSDFSIHPGTTKMYQDLKKMFWWPGMKKDIMKKVTSCLT 1062
+ V E ++ IL+ K +HPG KM + K+ ++P + I + C
Sbjct: 742 KN---VGSYAEFKEIILQSHEK---LLHPGIQKMTKLFKENHFFPNSQLLIQNIINECNI 795
Query: 1063 CQKVKGEHQKPSGSLQPLSIPEWKWEGISMDFVSGLPRTTTGHDAIWVIVDRLTKSAHFI 1122
C K EH+ L+ PE E +D S + G I +D +K A
Sbjct: 796 CNLAKTEHRNTKMPLKITPNPEHCREKFVVDIYS-----SEGKHYI-SCIDIYSKFATLE 849
Query: 1123 AVNMTFPSEKLARIYVKEIVRLHGVPANIVSDRDPRFVSKFWGSLHEALGTRLSLSSAYH 1182
+ E R + I G P + +DRD F S E L L++A +
Sbjct: 850 QIKTKDWIE--CRNALMRIFNQLGKPKLLKADRDGAFSSLALKRWLEEEEVELQLNTAKN 907
Query: 1183 PQSDGQSERTIQTLEDMLRACVLDYKGSWEDFLPLAE---FSYNNSY-HSSLGMAP 1234
+D ER +T+ + +R +++ E L E ++YN H + G P
Sbjct: 908 GVAD--VERLHKTINEKIR--IINSSDDEEVKLSKIETILYTYNQKIKHDTTGQRP 959
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 342 bits (876), Expect = 6e-93
Identities = 175/420 (41%), Positives = 270/420 (63%), Gaps = 13/420 (3%)
Query: 524 ELQKQVEELLSKGFIRPSVSPWGAPVLLV-KKKDGS----MRLCVDYRQLNKVTIKNRYP 578
E++ Q++++L++G IR S SP+ +P+ +V KK+D S R+ +DYR+LN++T+ +R+P
Sbjct: 222 EVESQIQDMLNQGIIRTSNSPYNSPIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDRHP 281
Query: 579 LPRIDDLMDQLKGARVFSKIDLRSGYHQIRVKSDDVQKTAFRTRYGHYEYLVMPFGVTNA 638
+P +D+++ +L F+ IDL G+HQI + + V KTAF T++GHYEYL MPFG+ NA
Sbjct: 282 IPNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKNA 341
Query: 639 PAIFMDYMNRIFHPYLDKFVIVFIDDILIYSKSKEEHVEHMQVVLKVLKDRKLYAKLSKC 698
PA F MN I P L+K +V++DDI+++S S +EH++ + +V + L L +L KC
Sbjct: 342 PATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDKC 401
Query: 699 EFWLEQVQFLGHVVSEDGIAVDPAKVEAVNSWKVPETVTGVRSFLGLAGYYRRFIEGFSK 758
EF ++ FLGHV++ DGI +P K+EA+ + +P +++FLGL GYYR+FI F+
Sbjct: 402 EFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTGYYRKFIPNFAD 461
Query: 759 IATPLTQLTKKDHPFVWTE-KCE*SFQTLKERLTKAPVLTLPDPSKDYDVYCDASKSGLG 817
IA P+T+ KK+ T + + +F+ LK +++ P+L +PD +K + + DAS LG
Sbjct: 462 IAKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLTTDASDVALG 521
Query: 818 CVLMQERKVIAYASQQLRPHEQNYPTHDMELAAVVFALKIWRHYLYGVKFTIYSDHQSLK 877
VL Q+ ++Y S+ L HE NY T + EL A+V+A K +RHYL G F I SDHQ L
Sbjct: 522 AVLSQDGHPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGRHFEISSDHQPLS 581
Query: 878 YLFDQKTLNMRQRRWVEFLEDYDFKLQYHPGKANVVADALSRKSLHAARLMIEETELIEK 937
+L+ K N + RW L ++DF ++Y GK N VADALS R+ +EET L E+
Sbjct: 582 WLYRMKDPNSKLTRWRVKLSEFDFDIKYIKGKENCVADALS-------RIKLEETYLSEQ 634
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 340 bits (873), Expect = 1e-92
Identities = 258/898 (28%), Positives = 426/898 (46%), Gaps = 100/898 (11%)
Query: 479 EFPEVFPEDMTELPPEREVEFAIDVIPGTTPISAAPYRISPLELAELQKQVEELLSKGFI 538
EFP +F ++ + E V+ I PI A Y E+++Q++ELL G I
Sbjct: 94 EFPRIFEPPLSGMSVETAVKAEIRTNT-QDPIYAKSYPYPVNMRGEVERQIDELLQDGII 152
Query: 539 RPSVSPWGAPVLLVKKK-----DGSMRLCVDYRQLNKVTIKNRYPLPRIDDLMDQLKGAR 593
RPS SP+ +P+ +V KK + R+ VD+++LN VTI + YP+P I+ + L A+
Sbjct: 153 RPSNSPYNSPIWIVPKKPKPNGEKQYRMVVDFKRLNTVTIPDTYPIPDINATLASLGNAK 212
Query: 594 VFSKIDLRSGYHQIRVKSDDVQKTAFRTRYGHYEYLVMPFGVTNAPAIFMDYMNRIFHPY 653
F+ +DL SG+HQI +K D+ KTAF T G YE+L +PFG+ NAPAIF ++ I +
Sbjct: 213 YFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPFGLKNAPAIFQRMIDDILREH 272
Query: 654 LDKFVIVFIDDILIYSKSKEEHVEHMQVVLKVLKDRKLYAKLSKCEFWLEQVQFLGHVVS 713
+ K V+IDDI+++S+ + H +++++VL L L L K F QV+FLG++V+
Sbjct: 273 IGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNLEKSHFLDTQVEFLGYIVT 332
Query: 714 EDGIAVDPAKVEAVNSWKVPETVTGVRSFLGLAGYYRRFIEGFSKIATPLTQLTK----- 768
DGI DP KV A++ P +V ++ FLG+ YYR+FI+ ++K+A PLT LT+
Sbjct: 333 ADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYRKFIQDYAKVAKPLTNLTRGLYAN 392
Query: 769 ------KDHPFVWTEKCE*SFQTLKERLTKAPVLTLPDPSKDYDVYCDASKSGLGCVLMQ 822
P E SF LK L + +L P +K + + DAS +G VL Q
Sbjct: 393 IKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKPFHLTTDASNWAIGAVLSQ 452
Query: 823 E----RKVIAYASQQLRPHEQNYPTHDMELAAVVFALKIWRHYLYGV-KFTIYSDHQSLK 877
+ + IAY S+ L E+NY T + E+ A++++L R YLYG +Y+DHQ L
Sbjct: 453 DDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLDNLRAYLYGAGTIKVYTDHQPLT 512
Query: 878 YLFDQKTLNMRQRRWVEFLEDYDFKLQYHPGKANVVADALSRKSLHAARLMIE-ETELIE 936
+ + N + +RW +E+Y+ +L Y PGK+NVVADALSR +L + + +
Sbjct: 513 FALGNRNFNAKLKRWKARIEEYNCELIYKPGKSNVVADALSRIPPQLNQLSTDLDANPED 572
Query: 937 KFRDMNLIMETLPQGTRL------GTLTLTNEFIEEVKKEQ----------------ARD 974
+ + L +RL N+ I + + + +D
Sbjct: 573 DMQSLATAHSALHDSSRLIPHVESPINVFKNQLIFDTTRSKYLCEHPFPGYTRHLIPLKD 632
Query: 975 ENLQKEAHGRDSMSRPDFLKG---PDG-LWRYQGRLCVP-----------------EGGE 1013
+L + S RP + G P+ L R+Q +C+ G E
Sbjct: 633 GSLADLTNSLQSCLRPVIINGVKIPEAHLQRFQS-ICLANFLLYKIRITQRLVADVSGAE 691
Query: 1014 LRQKILEEGHKSDFSIHPGTTKMYQDLKKMFWWPGMKKDIMKKVTSCLTCQKVKGEHQKP 1073
+I+E+ H+ H G T++ L + +++P M I + +SC C+ K E
Sbjct: 692 EICEIIEKEHR---RAHRGPTEIRLQLLEKYYFPRMSSTIRLQTSSCQCCKLYKYERHPN 748
Query: 1074 SGSLQPLSIPEWKWEGISMDFVSGLPRTTTGHDAIWVIVDRLTKSAHFIAVNMTFPSEKL 1133
+LQP IP + E + +D + R +D+ +K A F +
Sbjct: 749 KPNLQPTPIPNYPCEILHIDIFALEKRLYLS------CIDKFSKFAKL------FHLQSK 796
Query: 1134 ARIYVKE--IVRLH--GVPANIVSDRDPRFVSKFWGSLHEALGTRLSLSSAYHPQSDGQS 1189
A ++++E + LH P +VSD + + + +L L + + +GQ
Sbjct: 797 ASVHLRETLVEALHYFTAPKVLVSDNERGLLCPTVLNYLRSLDIDLYYAPTQKSEVNGQV 856
Query: 1190 ERTIQTLEDMLRACVLDYKGSWE--DFLPLAEFSYNNSYHSSLGMAPFEALYGRRCKTPL 1247
ER T ++ R C+ D +++ + + +A YN S HS P + + R +
Sbjct: 857 ERFHSTFLEIYR-CLKDELPTFKPVELVHIAVDRYNTSVHSVTNRKPADVFFDRSSRVNY 915
Query: 1248 CWLSGEDKITLGPELLQEMTEKVRSIREKLRIAQDRQKSYYDKRHKPLEFQEGDHVFL 1305
L+ + TL E ++ + E +I + ++ R +P + GD VF+
Sbjct: 916 QGLTDFRRQTL---------EDIKGLIEYKQIRGNMARN--KNRDEPKSYGPGDEVFV 962
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 305 bits (781), Expect = 6e-82
Identities = 165/419 (39%), Positives = 253/419 (60%), Gaps = 19/419 (4%)
Query: 525 LQKQVEELLSKGFIRPSVSPWGAPVLLVKKK------DGSMRLCVDYRQLNKVTIKNRYP 578
+ +V++LL G IRPS SP+ +P +V KK + + RL +D+R+LN+ TI +RYP
Sbjct: 197 VNNEVKQLLKDGIIRPSRSPYNSPTWVVDKKGTDAFGNPNKRLVIDFRKLNEKTIPDRYP 256
Query: 579 LPRIDDLMDQLKGARVFSKIDLRSGYHQIRVKSDDVQKTAFRTRYGHYEYLVMPFGVTNA 638
+P I ++ L A+ F+ +DL+SGYHQI + D +KT+F G YE+ +PFG+ NA
Sbjct: 257 MPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFSVNGGKYEFCRLPFGLRNA 316
Query: 639 PAIFMDYMNRIFHPYLDKFVIVFIDDILIYSKSKEEHVEHMQVVLKVLKDRKLYAKLSKC 698
+IF ++ + + K V++DD++I+S+++ +HV H+ VLK L D + K
Sbjct: 317 SSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHIDTVLKCLIDANMRVSQEKT 376
Query: 699 EFWLEQVQFLGHVVSEDGIAVDPAKVEAVNSWKVPETVTGVRSFLGLAGYYRRFIEGFSK 758
F+ E V++LG +VS+DG DP KV+A+ + P+ V VRSFLGLA YYR FI+ F+
Sbjct: 377 RFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRSFLGLASYYRVFIKDFAA 436
Query: 759 IATPLTQLTK-----------KDHPFVWTEKCE*SFQTLKERLTKAPV-LTLPDPSKDYD 806
IA P+T + K K P + E +FQ L+ L V L PD K +D
Sbjct: 437 IARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNILASEDVILKYPDFKKPFD 496
Query: 807 VYCDASKSGLGCVLMQERKVIAYASQQLRPHEQNYPTHDMELAAVVFALKIWRHYLYGVK 866
+ DAS SG+G VL QE + I S+ L+ EQNY T++ EL A+V+AL +++LYG +
Sbjct: 497 LTTDASASGIGAVLSQEGRPITMISRTLKQPEQNYATNERELLAIVWALGKLQNFLYGSR 556
Query: 867 -FTIYSDHQSLKYLFDQKTLNMRQRRWVEFLEDYDFKLQYHPGKANVVADALSRKSLHA 924
I++DHQ L + + N + +RW +++ ++ K+ Y PGK N VADALSR++L+A
Sbjct: 557 EINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQHNAKVFYKPGKENFVADALSRQNLNA 615
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 285 bits (728), Expect = 8e-76
Identities = 162/466 (34%), Positives = 251/466 (53%), Gaps = 24/466 (5%)
Query: 478 REFPEVFPEDMTELPPEREVEFAIDVIPGTT--------------PISAAPYRISPLELA 523
+ FPE+F + + E FA++ P T P+ YR ++
Sbjct: 269 KNFPELFKSQLENICSEYIDIFALESEPITVNNLYKQQLRLKDDEPVYTKNYRSPHSQVE 328
Query: 524 ELQKQVEELLSKGFIRPSVSPWGAPVLLVKKKDG------SMRLCVDYRQLNKVTIKNRY 577
E+Q QV++L+ + PSVS + +P+LLV KK RL +DYRQ+NK + +++
Sbjct: 329 EIQAQVQKLIKDKIVEPSVSQYNSPLLLVPKKSSPNSDKKKWRLVIDYRQINKKLLADKF 388
Query: 578 PLPRIDDLMDQLKGARVFSKIDLRSGYHQIRVKSDDVQKTAFRTRYGHYEYLVMPFGVTN 637
PLPRIDD++DQL A+ FS +DL SG+HQI + T+F T G Y + +PFG+
Sbjct: 389 PLPRIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTSNGSYRFTRLPFGLKI 448
Query: 638 APAIFMDYMNRIFHPYLDKFVIVFIDDILIYSKSKEEHVEHMQVVLKVLKDRKLYAKLSK 697
AP F M F +++DD+++ S++ ++++ V ++ L K
Sbjct: 449 APNSFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEVFGKCREYNLKLHPEK 508
Query: 698 CEFWLEQVQFLGHVVSEDGIAVDPAKVEAVNSWKVPETVTGVRSFLGLAGYYRRFIEGFS 757
C F++ +V FLGH ++ GI D K + + ++ VP R F+ YYRRFI+ F+
Sbjct: 509 CSFFMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRFVAFCNYYRRFIKNFA 568
Query: 758 KIATPLTQLTKKDHPFVWTEKCE*SFQTLKERLTKAPVLTLPDPSKDYDVYCDASKSGLG 817
+ +T+L KK+ PF WT++C+ +F LK +L +L PD SK++ + DASK G
Sbjct: 569 DYSRHITRLCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSKEFCITTDASKQACG 628
Query: 818 CVLMQERK----VIAYASQQLRPHEQNYPTHDMELAAVVFALKIWRHYLYGVKFTIYSDH 873
VL Q +AYAS+ E N T + ELAA+ +A+ +R Y+YG FT+ +DH
Sbjct: 629 AVLTQNHNGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYIYGKHFTVKTDH 688
Query: 874 QSLKYLFDQKTLNMRQRRWVEFLEDYDFKLQYHPGKANVVADALSR 919
+ L YLF + + R LE+Y+F ++Y GK N VADALSR
Sbjct: 689 RPLTYLFSMVNPSSKLTRIRLELEEYNFTVEYLKGKDNHVADALSR 734
Score = 97.4 bits (241), Expect = 2e-19
Identities = 78/311 (25%), Positives = 143/311 (45%), Gaps = 21/311 (6%)
Query: 1030 HPGTTKMYQDLKKMFWWPGMKKDIMKKVTSCLTCQKVKG-EHQKPSGSLQPLSIPEWKWE 1088
H G TK +K+ ++W M K I + V C CQK K +H K ++ PE ++
Sbjct: 909 HTGITKTLAKVKRHYYWKNMSKYIKEYVRKCQKCQKAKTTKHTKTPMTIT--ETPEHAFD 966
Query: 1089 GISMDFVSGLPRTTTGHDAIWVIVDRLTKSAHFIAVNMTFPSEK-LARIYVKEIVRLHGV 1147
+ +D + LP++ G++ ++ LTK + +A+ + S K +A+ + + +G
Sbjct: 967 RVVVDTIGPLPKSENGNEYAVTLICDLTK--YLVAIPIANKSAKTVAKAIFESFILKYGP 1024
Query: 1148 PANIVSDRDPRFVSKFWGSLHEALGTRLSLSSAYHPQSDGQSERTIQTLEDMLRACVLDY 1207
++D + + L + L + S+A+H Q+ G ER+ +TL + +R+ +
Sbjct: 1025 MKTFITDMGTEYKNSIITDLCKYLKIKNITSTAHHHQTVGVVERSHRTLNEYIRSYISTD 1084
Query: 1208 KGSWEDFLPLAEFSYNNSYHSSLGMAPFEALYGRRCKTPLCW--LSGEDKITLGPELLQE 1265
K W+ +L + +N + P+E ++GR P + L + I + +E
Sbjct: 1085 KTDWDVWLQYFVYCFNTTQSMVHNYCPYELVFGRTSNLPKHFNKLHSIEPIYNIDDYAKE 1144
Query: 1266 MTEKVR----SIREKLRIAQDRQKSYYDKRHKPLEFQEGDHVFLRVTPITGVGRSIHSKK 1321
++ R+ L +++ K YD + K +E + GD V LR VG K
Sbjct: 1145 SKYRLEVAYARARKLLEAHKEKNKENYDLKVKDIELEVGDKVLLR----NEVGH-----K 1195
Query: 1322 LTPKYLGPYQI 1332
L KY GPY+I
Sbjct: 1196 LDFKYTGPYKI 1206
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 218 bits (556), Expect = 7e-56
Identities = 178/662 (26%), Positives = 302/662 (44%), Gaps = 72/662 (10%)
Query: 323 NTSEDFIQGTCFLCDISLVVLY---DSGATHSFISHERAKSLKLVITQLPYDLVVTTPTK 379
N + +I+G + + L+ D+GA+ S V + P ++V
Sbjct: 20 NPNSIYIKGRLYFKGYKKIELHCFVDTGASLCIASKFVIPEEHWVNAERP--IMVKIADG 77
Query: 380 ESAVTSSVCKKCPLVIEDREYITNLVCLPLEGLDIILGMNW------------------- 420
S S VCK L+I + V G+D I+G N+
Sbjct: 78 SSITISKVCKDIDLIIAGEIFKIPTVYQQESGIDFIIGNNFCQLYEPFIQFTDRVIFTKN 137
Query: 421 ----LSINNVLLDCRLRVPIFLQKYKEKHTASLPEK--------EPSAYLILFSSEGTKR 468
+ I + R+ + FL+ K++ PE E I SEG +R
Sbjct: 138 KSYPVHITKLTRAVRVGIEGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSEG-RR 196
Query: 469 PAMEDIPVVREFPEVFPEDMTELPPEREVE---------FAIDVIPGTTPISAAPYRISP 519
+ E + + ++ + E + ++ E ++ +I + + I P + SP
Sbjct: 197 LSEEKLFITQQRMQKIEELLEKVCSENPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYSP 256
Query: 520 LELAELQKQVEELLSKGFIRPSVSPWGAPVLLV----KKKDGSMRLCVDYRQLNKVTIKN 575
++ E KQ++ELL I+PS SP AP LV +K+ G R+ V+Y+ +NK TI +
Sbjct: 257 MDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATIGD 316
Query: 576 RYPLPRIDDLMDQLKGARVFSKIDLRSGYHQIRVKSDDVQKTAFRTRYGHYEYLVMPFGV 635
Y LP D+L+ ++G ++FS D +SG+ Q+ + + TAF GHYE+ V+PFG+
Sbjct: 317 AYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGL 376
Query: 636 TNAPAIFMDYMNRIFHPYLDKFVIVFIDDILIYSKSKEEHVEHMQVVLKVLKDRKLYAKL 695
AP+IF +M+ F + KF V++DDIL++S ++E+H+ H+ ++L+ +
Sbjct: 377 KQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSK 435
Query: 696 SKCEFWLEQVQFLGHVVSEDGIAVDPAKVEAVNSWKVPETVTG---VRSFLGLAGYYRRF 752
K + + +++ FLG + E +E +N K P+T+ ++ FLG+ Y +
Sbjct: 436 KKAQLFKKKINFLGLEIDEGTHKPQGHILEHIN--KFPDTLEDKKQLQRFLGILTYASDY 493
Query: 753 IEGFSKIATPLTQLTKKDHPFVWTEKCE*SFQTLKERLTKAPVLTLPDPSKDYDVYCDAS 812
I ++I PL K++ P+ WT++ Q +K+ L P L P P + + DAS
Sbjct: 494 IPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDAS 553
Query: 813 KSGLGCVL--------MQERKVIAYASQQLRPHEQNYPTHDMELAAVVFALKIWRHYLYG 864
G +L + YAS + E+NY ++D E AV+ +K + YL
Sbjct: 554 DDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKFSIYLTP 613
Query: 865 VKFTIYSDHQ------SLKYLFDQKTLNMRQRRWVEFLEDYDFKLQYHPGKANVVADALS 918
V F I +D+ +L Y D K R RW +L Y F +++ G N AD LS
Sbjct: 614 VHFLIRTDNTHFKSFVNLNYKGDSKL--GRNIRWQAWLSHYSFDVEHIKGTDNHFADFLS 671
Query: 919 RK 920
R+
Sbjct: 672 RE 673
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 218 bits (554), Expect = 1e-55
Identities = 179/662 (27%), Positives = 302/662 (45%), Gaps = 72/662 (10%)
Query: 323 NTSEDFIQGTCFLCDISLVVLY---DSGATHSFISHERAKSLKLVITQLPYDLVVTTPTK 379
N + +I+G + + L+ D+GA+ S V + P ++V
Sbjct: 20 NPNSIYIKGRLYFKGYKKIELHCFVDTGASLCIASKFVIPEEHWVNAERP--IMVKIADG 77
Query: 380 ESAVTSSVCKKCPLVIEDREYITNLVCLPLEGLDIILGMNWLSINNVLLDCRLRV----- 434
S S VCK L+I + V G+D I+G N+ + + RV
Sbjct: 78 SSITISKVCKDIDLIIAREIFKIPTVYQQESGIDFIIGNNFCQLYEPFIQFTDRVIFTKN 137
Query: 435 ---PI---------------FLQKYKEKHTASLPEK--------EPSAYLILFSSEGTKR 468
P+ FL+ K++ PE E I SEG +R
Sbjct: 138 KSYPVHIAKLTRAVRVGTEGFLESMKKRSKTQQPEPVNISTNKIENPLKEIAILSEG-RR 196
Query: 469 PAMEDIPVVREFPEVFPEDMTELPPEREVE---------FAIDVIPGTTPISAAPYRISP 519
+ E + + ++ + E + ++ E ++ +I + + I P + SP
Sbjct: 197 LSEEKLFITQQRMQKIEELLEKVCSENPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYSP 256
Query: 520 LELAELQKQVEELLSKGFIRPSVSPWGAPVLLV----KKKDGSMRLCVDYRQLNKVTIKN 575
++ E KQ++ELL I+PS SP AP LV +K+ G R+ V+Y+ +NK TI +
Sbjct: 257 MDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATIGD 316
Query: 576 RYPLPRIDDLMDQLKGARVFSKIDLRSGYHQIRVKSDDVQKTAFRTRYGHYEYLVMPFGV 635
Y LP D+L+ ++G ++FS D +SG+ Q+ + + TAF GHYE+ V+PFG+
Sbjct: 317 AYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGL 376
Query: 636 TNAPAIFMDYMNRIFHPYLDKFVIVFIDDILIYSKSKEEHVEHMQVVLKVLKDRKLYAKL 695
AP+IF +M+ F + KF V++DDIL++S ++E+H+ H+ ++L+ +
Sbjct: 377 KQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSK 435
Query: 696 SKCEFWLEQVQFLGHVVSEDGIAVDPAKVEAVNSWKVPETVTG---VRSFLGLAGYYRRF 752
K + + +++ FLG + E +E +N K P+T+ ++ FLG+ Y +
Sbjct: 436 KKAQLFKKKINFLGLEIDEGTHKPQGHILEHIN--KFPDTLEDKKQLQRFLGILTYASDY 493
Query: 753 IEGFSKIATPLTQLTKKDHPFVWTEKCE*SFQTLKERLTKAPVLTLPDPSKDYDVYCDAS 812
I ++I PL K++ P+ WT++ Q +K+ L P L P P + + DAS
Sbjct: 494 IPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDAS 553
Query: 813 KSGLGCVL--------MQERKVIAYASQQLRPHEQNYPTHDMELAAVVFALKIWRHYLYG 864
G +L + YAS + E+NY ++D E AV+ +K + YL
Sbjct: 554 DDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKFSIYLTP 613
Query: 865 VKFTIYSDHQ------SLKYLFDQKTLNMRQRRWVEFLEDYDFKLQYHPGKANVVADALS 918
V F I +D+ +L Y D K R RW +L Y F +++ G N AD LS
Sbjct: 614 VHFLIRTDNTHFKSFVNLNYKGDSKL--GRNIRWQAWLSHYSFDVEHIKGTDNHFADFLS 671
Query: 919 RK 920
R+
Sbjct: 672 RE 673
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 217 bits (553), Expect = 2e-55
Identities = 178/662 (26%), Positives = 302/662 (44%), Gaps = 72/662 (10%)
Query: 323 NTSEDFIQGTCFLCDISLVVLY---DSGATHSFISHERAKSLKLVITQLPYDLVVTTPTK 379
N + +I+G + + L+ D+GA+ S V + P ++V
Sbjct: 20 NPNSIYIKGRLYFKGYKKIELHCFVDTGASLCIASKFVIPEEHWVNAERP--IMVKIADG 77
Query: 380 ESAVTSSVCKKCPLVIEDREYITNLVCLPLEGLDIILGMNWLSINNVLLDCRLRV----- 434
S S VCK L+I + V G+D I+G N+ + + RV
Sbjct: 78 SSITISKVCKDIDLIIAGEIFRIPTVYQQESGIDFIIGNNFCQLYEPFIQFTDRVIFTKN 137
Query: 435 ---PI---------------FLQKYKEKHTASLPEK--------EPSAYLILFSSEGTKR 468
P+ FL+ K++ PE E I SEG +R
Sbjct: 138 KSYPVHIAKLTRAVRVGTEGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSEG-RR 196
Query: 469 PAMEDIPVVREFPEVFPEDMTELPPEREVE---------FAIDVIPGTTPISAAPYRISP 519
+ E + + ++ + E + ++ E ++ +I + + I P + SP
Sbjct: 197 LSEEKLFITQQRMQKIEELLEKVCSENPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYSP 256
Query: 520 LELAELQKQVEELLSKGFIRPSVSPWGAPVLLV----KKKDGSMRLCVDYRQLNKVTIKN 575
++ E KQ++ELL I+PS SP AP LV +K+ G R+ V+Y+ +NK T+ +
Sbjct: 257 MDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGD 316
Query: 576 RYPLPRIDDLMDQLKGARVFSKIDLRSGYHQIRVKSDDVQKTAFRTRYGHYEYLVMPFGV 635
Y LP D+L+ ++G ++FS D +SG+ Q+ + + TAF GHYE+ V+PFG+
Sbjct: 317 AYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGL 376
Query: 636 TNAPAIFMDYMNRIFHPYLDKFVIVFIDDILIYSKSKEEHVEHMQVVLKVLKDRKLYAKL 695
AP+IF +M+ F + KF V++DDIL++S ++E+H+ H+ ++L+ +
Sbjct: 377 KQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSK 435
Query: 696 SKCEFWLEQVQFLGHVVSEDGIAVDPAKVEAVNSWKVPETVTG---VRSFLGLAGYYRRF 752
K + + +++ FLG + E +E +N K P+T+ ++ FLG+ Y +
Sbjct: 436 KKAQLFKKKINFLGLEIDEGTHKPQGHILEHIN--KFPDTLEDKKQLQRFLGILTYASDY 493
Query: 753 IEGFSKIATPLTQLTKKDHPFVWTEKCE*SFQTLKERLTKAPVLTLPDPSKDYDVYCDAS 812
I ++I PL K++ P+ WT++ Q +K+ L P L P P + + DAS
Sbjct: 494 IPKLAQIRKPLQAKLKENVPWRWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDAS 553
Query: 813 KSGLGCVL--------MQERKVIAYASQQLRPHEQNYPTHDMELAAVVFALKIWRHYLYG 864
G +L + YAS + E+NY ++D E AV+ +K + YL
Sbjct: 554 DDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTP 613
Query: 865 VKFTIYSDHQ------SLKYLFDQKTLNMRQRRWVEFLEDYDFKLQYHPGKANVVADALS 918
V F I +D+ +L Y D K R RW +L Y F +++ G N AD LS
Sbjct: 614 VHFLIRTDNTHFKSFVNLNYKGDSKL--GRNIRWQAWLSHYSFDVEHIKGTDNHFADFLS 671
Query: 919 RK 920
R+
Sbjct: 672 RE 673
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 674
Score = 215 bits (548), Expect = 6e-55
Identities = 165/603 (27%), Positives = 282/603 (46%), Gaps = 60/603 (9%)
Query: 372 LVVTTPTKESAVTSSVCKKCPLVIEDREYITNLVCLPLEGLDIILGMNWLSINNVLLDCR 431
++V S + VC+ L+I + V G+D I+G N+ + +
Sbjct: 72 IMVKIADGSSITINKVCRDIDLIIAGEIFHIPTVYQQESGIDFIIGNNFCQLYEPFIQFT 131
Query: 432 LRV--------PI---------------FLQKYKEKHTASLPEK-EPSAYLILFSSEGTK 467
RV P+ FL+ K++ PE S I SEG +
Sbjct: 132 DRVIFTKDRTYPVHIAKLTRAVRVGTEGFLESMKKRSKTQQPEPVNISTNKIAILSEG-R 190
Query: 468 RPAMEDIPVVREFPEVFPEDMTELPPEREVE---------FAIDVIPGTTPISAAPYRIS 518
R + E + + ++ + E + ++ E ++ +I + + I P + S
Sbjct: 191 RLSEEKLFITQQRMQKIEELLEKVCSENPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYS 250
Query: 519 PLELAELQKQVEELLSKGFIRPSVSPWGAPVLLV----KKKDGSMRLCVDYRQLNKVTIK 574
P++ E KQ++ELL I+PS SP AP LV +K+ G R+ V+Y+ +NK T+
Sbjct: 251 PMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVG 310
Query: 575 NRYPLPRIDDLMDQLKGARVFSKIDLRSGYHQIRVKSDDVQKTAFRTRYGHYEYLVMPFG 634
+ Y P D+L+ ++G ++FS D +SG+ Q+ + + TAF GHYE+ V+PFG
Sbjct: 311 DAYNPPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFG 370
Query: 635 VTNAPAIFMDYMNRIFHPYLDKFVIVFIDDILIYSKSKEEHVEHMQVVLKVLKDRKLYAK 694
+ AP+IF +M+ F + KF V++DDIL++S ++E+H+ H+ ++L+ +
Sbjct: 371 LKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILS 429
Query: 695 LSKCEFWLEQVQFLGHVVSEDGIAVDPAKVEAVNSWKVPETVTG---VRSFLGLAGYYRR 751
K + + +++ FLG + E +E +N K P+T+ ++ FLG+ Y
Sbjct: 430 KKKAQLFKKKINFLGLEIDEGTHKPQGHILEHIN--KFPDTLEDKKQLQRFLGILTYASD 487
Query: 752 FIEGFSKIATPLTQLTKKDHPFVWTEKCE*SFQTLKERLTKAPVLTLPDPSKDYDVYCDA 811
+I ++I PL K++ P+ WT++ Q +K+ L P L P P + + DA
Sbjct: 488 YIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDA 547
Query: 812 SKSGLGCVL--------MQERKVIAYASQQLRPHEQNYPTHDMELAAVVFALKIWRHYLY 863
S G +L + YAS + E+NY ++D E AV+ +K + YL
Sbjct: 548 SDDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKNYHSNDKETLAVINTIKKFSIYLT 607
Query: 864 GVKFTIYSDHQ------SLKYLFDQKTLNMRQRRWVEFLEDYDFKLQYHPGKANVVADAL 917
V F I +D+ +L Y D K R RW +L Y F +++ G N AD L
Sbjct: 608 PVHFLIRTDNTHFKSFVNLNYKGDSKL--GRNIRWQAWLSHYSFDVEHIKGTDNHFADFL 665
Query: 918 SRK 920
SR+
Sbjct: 666 SRE 668
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 666
Score = 210 bits (535), Expect = 2e-53
Identities = 161/597 (26%), Positives = 269/597 (44%), Gaps = 56/597 (9%)
Query: 369 PYDLVVTTPTKESAVTSSVCKKCPLVIEDREYITNLVCLPLEGLDIILGMNWLSINNVLL 428
P D+ V +E + VCK + + + V G+D ++G N+ + N +
Sbjct: 76 PKDIQVKIANQELIKITKVCKNLKVKFAGKSFEIPTVYQQETGIDFLIGNNFCRLYNPFI 135
Query: 429 DCRLRVPIFLQKYK---EKHTASLPEKEPSAYLILFSSEGTKRPAMEDIPVVREFPEVF- 484
R+ L+ +K T + PS F K E IP +
Sbjct: 136 QWEDRIAFHLKNEMVLIKKVTKAFSVSNPS-----FLENMKKDSKTEQIPGTNISKNIIN 190
Query: 485 PEDMTELPPER--EVEFAIDVIPGTTPIS----------------------AAPYRISPL 520
PE+ L E+ ++E +D + PI P SP
Sbjct: 191 PEERYFLITEKYQKIEQLLDKVCSENPIDPIKSKQWMKASIKLIDPLKVIRVKPMSYSPQ 250
Query: 521 ELAELQKQVEELLSKGFIRPSVSPWGAPVLLVK----KKDGSMRLCVDYRQLNKVTIKNR 576
+ KQ++ELL G I PS S +P LV+ ++ G R+ V+Y+ +N+ TI +
Sbjct: 251 DREGFAKQIKELLDLGLIIPSKSQHMSPAFLVENEAERRRGKKRMVVNYKAINQATIGDS 310
Query: 577 YPLPRIDDLMDQLKGARVFSKIDLRSGYHQIRVKSDDVQKTAFRTRYGHYEYLVMPFGVT 636
+ LP + +L+ L+G +FS D +SG+ Q+ + + + TAF GH+++ V+PFG+
Sbjct: 311 HNLPNMQELLTLLRGKSIFSSFDCKSGFWQVVLDEESQKLTAFTCPQGHFQWKVVPFGLK 370
Query: 637 NAPAIFMDYMNRIFHPYLDKFVIVFIDDILIYSKSKEEHVEHMQVVLKVLKDRKLYAKLS 696
AP+IF +M + DKF +V++DDI+++S S+ +H H+ VLK+++ +
Sbjct: 371 QAPSIFQRHMQTALNG-ADKFCMVYVDDIIVFSNSELDHYNHVYAVLKIVEKYGIILSKK 429
Query: 697 KCEFWLEQVQFLGHVVSEDGIAVDPAKVEAVNSWKVPETVTG---VRSFLGLAGYYRRFI 753
K + E++ FLG + + P N K P+ + ++ FLG+ Y +I
Sbjct: 430 KANLFKEKINFLGLEIDKGTHC--PQNHILENIHKFPDRLEDKKHLQRFLGVLTYAETYI 487
Query: 754 EGFSKIATPLTQLTKKDHPFVWTEKCE*SFQTLKERLTKAPVLTLPDPSKDYDVYCDASK 813
++I PL KKD + WT+ + +K+ L P L LP P + DAS
Sbjct: 488 PKLAEIRKPLQVKLKKDVTWNWTQSDSDYVKKIKKNLGSFPKLYLPKPEDHLIIETDASD 547
Query: 814 SGLGCVLMQE-----RKVIAYASQQLRPHEQNYPTHDMELAAVVFALKIWRHYLYGVKFT 868
S G VL + Y+S + E+NY ++D EL AV + + YL V+FT
Sbjct: 548 SFWGGVLKARALDGVELICRYSSGSFKQAEKNYHSNDKELLAVKQVITKFSAYLTPVRFT 607
Query: 869 IYSDHQSLKYLF------DQKTLNMRQRRWVEFLEDYDFKLQYHPGKANVVADALSR 919
+ +D+++ Y D K R RW + Y F +++ G NV+AD L+R
Sbjct: 608 VRTDNKNFTYFLRINLKGDSK--QGRLVRWQNWFSKYQFDVEHLEGVKNVLADCLTR 662
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 680
Score = 209 bits (533), Expect = 3e-53
Identities = 174/662 (26%), Positives = 300/662 (45%), Gaps = 72/662 (10%)
Query: 323 NTSEDFIQGTCFLCDISLVVLY---DSGATHSFISHERAKSLKLVITQLPYDLVVTTPTK 379
N + +I+G + + L+ D+GA+ S V + P ++V
Sbjct: 21 NPNSIYIKGRLYFKGYKKIELHCFVDTGASLCIASKFVIPEEHWVNAERP--IMVKIADG 78
Query: 380 ESAVTSSVCKKCPLVIEDREYITNLVCLPLEGLDIILGMNWLSINNVLLDCRLRV----- 434
S S VCK L+I + V G+D I+G N+ + + RV
Sbjct: 79 SSITISKVCKDIDLIIVGVIFKIPTVYQQESGIDFIIGNNFCQLYEPFIQFTDRVIFTKN 138
Query: 435 ---PI---------------FLQKYKEKHTASLPEK--------EPSAYLILFSSEGTKR 468
P+ FL+ K++ PE E I SEG +R
Sbjct: 139 KSYPVHIAKLTRAVRVGTEGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSEG-RR 197
Query: 469 PAMEDIPVVREFPEVFPEDMTELPPEREVE---------FAIDVIPGTTPISAAPYRISP 519
+ E + + ++ + E + ++ E ++ +I + + I P + SP
Sbjct: 198 LSEEKLFITQQRMQKTEELLEKVCSENPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYSP 257
Query: 520 LELAELQKQVEELLSKGFIRPSVSPWGAPVLLVKKKD----GSMRLCVDYRQLNKVTIKN 575
++ E KQ++ELL I+PS SP AP LV + G+ R+ V+Y+ +NK T+ +
Sbjct: 258 MDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAENGRGNKRMVVNYKAMNKATVGD 317
Query: 576 RYPLPRIDDLMDQLKGARVFSKIDLRSGYHQIRVKSDDVQKTAFRTRYGHYEYLVMPFGV 635
Y LP D+L+ ++G ++FS D +SG+ Q+ + + TAF GHYE+ V+PFG+
Sbjct: 318 AYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGL 377
Query: 636 TNAPAIFMDYMNRIFHPYLDKFVIVFIDDILIYSKSKEEHVEHMQVVLKVLKDRKLYAKL 695
AP+IF +M+ F + KF V++DDI+++S ++E+H+ H+ ++L+ +
Sbjct: 378 KQAPSIFQRHMDEAFRVF-RKFCCVYVDDIVVFSNNEEDHLLHVAMILQKCNQHGIILSK 436
Query: 696 SKCEFWLEQVQFLGHVVSEDGIAVDPAKVEAVNSWKVPETVTG---VRSFLGLAGYYRRF 752
K + + +++ FLG + E +E +N K P+T+ ++ FLG+ Y +
Sbjct: 437 KKAQLFKKKINFLGLEIDEGTHKPQGHILEHIN--KFPDTLEDKKQLQRFLGILTYASDY 494
Query: 753 IEGFSKIATPLTQLTKKDHPFVWTEKCE*SFQTLKERLTKAPVLTLPDPSKDYDVYCDAS 812
I +++ PL K++ P+ WT++ Q +K+ L P L P P + + DAS
Sbjct: 495 IPNLAQMRQPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDAS 554
Query: 813 KSGLGCVL--------MQERKVIAYASQQLRPHEQNYPTHDMELAAVVFALKIWRHYLYG 864
G +L + Y S + E+NY ++D E AV+ +K + YL
Sbjct: 555 DDYWGGMLKAIKINEGTNTELICRYRSGSFKAAERNYHSNDKETLAVINTIKKFSIYLTP 614
Query: 865 VKFTIYSDHQ------SLKYLFDQKTLNMRQRRWVEFLEDYDFKLQYHPGKANVVADALS 918
V F I +D+ +L Y D K R RW +L Y F +++ G N AD LS
Sbjct: 615 VHFLIRTDNTHFKSFVNLNYKGDSKL--GRNIRWQAWLSHYSFDVEHIKGTDNHFADFLS 672
Query: 919 RK 920
R+
Sbjct: 673 RE 674
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 659
Score = 195 bits (495), Expect = 9e-49
Identities = 149/587 (25%), Positives = 274/587 (46%), Gaps = 52/587 (8%)
Query: 385 SSVCKKCPLVIEDREYITNLVCLPLEGLDIILGMNWLSINNVLLDCRLRVPI-------- 436
+ VC K P+ + ++ + G+D++LG N+ + + + R+
Sbjct: 72 TKVCSKLPIRLGGERFLIPTLFQQESGIDLLLGNNFCQLYSPFIQYTDRIYFHLNKQSVI 131
Query: 437 --------------FLQKYKEKHTASLPEK-EPSAYLILFSSEGTKR--PAMEDIPVVR- 478
FL+ K+K + PE ++ LF EG + +I + +
Sbjct: 132 IGKITKAYQYGVKGFLESMKKKSKVNRPEPINITSNQHLFLEEGGNHVDEMLYEIQISKF 191
Query: 479 -EFPEVFPEDMTELP--PEREVEF---AIDVIPGTTPISAAPYRISPLELAELQKQVEEL 532
E+ +E P PE+ ++ I++I T + P SP + E +Q++EL
Sbjct: 192 SAIEEMLERVSSENPIDPEKSKQWMTATIELIDPKTVVKVKPMSYSPSDREEFDRQIKEL 251
Query: 533 LSKGFIRPSVSPWGAPVLLVK----KKDGSMRLCVDYRQLNKVTIKNRYPLPRIDDLMDQ 588
L I+PS S +P LV+ ++ G R+ V+Y+ +NK T + + LP D+L+
Sbjct: 252 LELKVIKPSKSTHMSPAFLVENEAERRRGKKRMVVNYKAMNKATKGDAHNLPNKDELLTL 311
Query: 589 LKGARVFSKIDLRSGYHQIRVKSDDVQKTAFRTRYGHYEYLVMPFGVTNAPAIF-MDYMN 647
++G +++S D +SG Q+ + + TAF GHY++ V+PFG+ AP+IF Y N
Sbjct: 312 VRGKKIYSSFDCKSGLWQVLLDKESQLLTAFTCPQGHYQWNVVPFGLKQAPSIFPKTYAN 371
Query: 648 RIFHPYLDKFVIVFIDDILIYSKS-KEEHVEHMQVVLKVLKDRKLYAKLSKCEFWLEQVQ 706
+ Y K+ V++DDIL++S + ++EH H+ +L+ + + K + + E++
Sbjct: 372 SHSNQY-SKYCCVYVDDILVFSNTGRKEHYIHVLNILRRCEKLGIILSKKKAQLFKEKIN 430
Query: 707 FLGHVVSEDGIAVDPAKVEAVNSWKVPETVTG---VRSFLGLAGYYRRFIEGFSKIATPL 763
FLG + + +E ++ K P+ + ++ FLG+ Y +I + I PL
Sbjct: 431 FLGLEIDQGTHCPQNHILEHIH--KFPDRIEDKKQLQRFLGILTYASDYIPKLASIRKPL 488
Query: 764 TQLTKKDHPFVWTEKCE*SFQTLKERLTKAPVLTLPDPSKDYDVYCDASKSGLGCVLM-- 821
K+D + W + +K+ L P L P+P+ + DAS+ G +L
Sbjct: 489 QSKLKEDSTWTWNDTDSQYMAKIKKNLKSFPKLYHPEPNDKLVIETDASEEFWGGILKAI 548
Query: 822 --QERKVIAYASQQLRPHEQNYPTHDMELAAVVFALKIWRHYLYGVKFTIYSDHQSLKYL 879
+ YAS + E+NY +++ EL AV+ +K + YL +F I +D+++ +
Sbjct: 549 HNSHEYICRYASGSFKAAERNYHSNEKELLAVIRVIKKFSIYLTPSRFLIRTDNKNFTHF 608
Query: 880 FDQKTLNMRQR----RWVEFLEDYDFKLQYHPGKANVVADALSRKSL 922
+ R++ RW +L YDF +++ G NV AD L +L
Sbjct: 609 VNINLKGDRKQGRLVRWQMWLSQYDFDVEHIAGTKNVFADFLQENTL 655
>POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;
Protease (EC 3.4.23.-); Reverse transcriptase (EC
2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1886
Score = 179 bits (453), Expect = 6e-44
Identities = 131/469 (27%), Positives = 229/469 (47%), Gaps = 44/469 (9%)
Query: 486 EDMTELPPEREVEFAIDVIPGTTPISAAPYR-ISPLELAELQKQVEELLSKGFIRPSVSP 544
E+ E +++ +++I I P + ++P + + +Q+ LL IRPS S
Sbjct: 1378 ENPMEFWKNNKIKCKLNIINPDIKIMGRPIKHVTPGDEEAMTRQINLLLQMKVIRPSESK 1437
Query: 545 WGAPVLLV-----------KKKDGSMRLCVDYRQLNKVTIKNRYPLPRIDDLMDQLKGAR 593
+ +V K+K G R+ +Y+ LN+ T ++Y LP I+ ++ ++ ++
Sbjct: 1438 HRSTAFIVRSGTEIDPITGKEKKGKERMVFNYKLLNENTESDQYSLPGINTIISKVGRSK 1497
Query: 594 VFSKIDLRSGYHQIRVKSDDVQKTAFRTRYGHYEYLVMPFGVTNAPAIFMDYMNRIFHPY 653
++SK DL+SG+ Q+ ++ + V TAF YE+LVMPFG+ NAPAIF M+ +F
Sbjct: 1498 IYSKFDLKSGFWQVAMEEESVPWTAFLAGNKLYEWLVMPFGLKNAPAIFQRKMDNVFKG- 1556
Query: 654 LDKFVIVFIDDILIYSKSKEEHVEHMQVVLKVLKDRKLYAKLSKCEFWLEQVQFLGHVVS 713
+KF+ V+IDDIL++S++ E+H +H+ +L++ K+ L +K + ++ FLG +
Sbjct: 1557 TEKFIAVYIDDILVFSETAEQHSQHLYTMLQLCKENGLILSPTKMKIGTPEIDFLGASLG 1616
Query: 714 EDGIAVDPAKVEAVNSWKVPETVT--GVRSFLGLAGYYRRFIEGFSKIATPLTQL----- 766
I + P + + + + T G+RS+LG+ Y R +I+ K+ PL Q
Sbjct: 1617 CTKIKLQPHIISKICDFSDEKLATPEGMRSWLGILSYARNYIQDIGKLVQPLRQKMAPTG 1676
Query: 767 TKKDHPFVWTEKCE*SFQTLKERLTKAPVLTLPDPSKDYDVYCDASKSGLGCVL------ 820
K+ +P W + +KE++ P L LP + D +G G V
Sbjct: 1677 DKRMNPETWK-----MVRQIKEKVKNLPDLQLPPKDSFIIIETDGCMTGWGAVCKWKMSK 1731
Query: 821 ---MQERKVIAYASQQLRPHEQNYPTHDMELAAVVFAL-KIWRHYLYGVKFTIYSDHQSL 876
++ AYAS P + T D E+ A + L K +YL + I SD +++
Sbjct: 1732 HDPRSTERICAYASGSFNPIKS---TIDAEIQAAIHGLDKFKIYYLDKKELIIRSDCEAI 1788
Query: 877 KYLFDQKTLNMRQR-RWVEFLE-----DYDFKLQYHPGKANVVADALSR 919
+++ N R RW+ F + ++ GK N +ADALSR
Sbjct: 1789 IKFYNKTNENKPSRVRWLTFSDFLTGLGITVTFEHIDGKHNGLADALSR 1837
>POL_FENV1 (P31792) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)] (Fragment)
Length = 1046
Score = 174 bits (440), Expect = 2e-42
Identities = 132/481 (27%), Positives = 228/481 (46%), Gaps = 28/481 (5%)
Query: 477 VREFPEVFPEDMTELPPEREVEFAIDVIPGTTPISAAPYRISPLELAELQKQVEELLSKG 536
+++FP+ + E + +V ID+ P P+S Y +S +Q + L G
Sbjct: 1 LQDFPQAWAETGGLGRAKCQVPIIIDLKPTAMPVSIRQYPMSKEAHMGIQPHITRFLELG 60
Query: 537 FIRPSVSPWGAPVLLVKKKDG-SMRLCVDYRQLNKVTIKNRYPLPRIDDLMDQLKGARV- 594
+RP SPW P+L VKK R D R++NK T+ +P +L+ L R
Sbjct: 61 VLRPCRSPWNTPLLPVKKPGTRDYRPVQDLREVNKRTMDIHPTVPNPYNLLSTLSPDRTW 120
Query: 595 FSKIDLRSGYHQIRVKSDDVQKTAFRTR------YGHYEYLVMPFGVTNAPAIFMDYMNR 648
++ +DL+ + + + + AF R G + +P G N+P +F + ++R
Sbjct: 121 YTVLDLKDAFFCLPLAPQSQELFAFEWRDPERGISGQLTWTRLPQGFKNSPTLFDEALHR 180
Query: 649 IF------HPYLDKFVIVFIDDILIYSKSKEEHVEHMQVVLKVLKDRKLYAKLSKCEFWL 702
HP + ++ ++DD+L+ + +KE + + +L+ L D+ A K +
Sbjct: 181 DLTDFRTQHPEVT--LLQYVDDLLLAAPTKEACIRGTKHLLRELGDKGYRASAKKAQICQ 238
Query: 703 EQVQFLGHVVSEDGIAVDPAKVEAVNSWKVPETVTGVRSFLGLAGYYRRFIEGFSKIATP 762
+V +LG+++SE + P ++E V P+ VR FLG AG+ R +I GF+++A P
Sbjct: 239 TKVTYLGYILSEGKRWLTPGRIETVAHIPPPQNPREVREFLGTAGFCRLWIPGFAELAAP 298
Query: 763 LTQLTKKDHPFVWTEKCE*SFQTLKERLTKAPVLTLPDPSKDYDVYCDASKSGLGCVLMQ 822
L LTK+ PF W EK + +F+ LKE L AP L LPD SK + ++ D + VL Q
Sbjct: 299 LYALTKESAPFTWQEKHQSAFEALKEALLSAPALGLPDTSKPFTLFIDEKQGIAKGVLTQ 358
Query: 823 E----RKVIAYASQQLRPHEQNYPTHDMELAAVVFALKIWRHYLYGVKFTIYSDH---QS 875
+ ++ +AY S++L P +P +AA +K G T+ + H
Sbjct: 359 KLGPWKRPVAYLSKKLDPVAAGWPPCLRIMAATAMLVKDSAKLTLGQPLTVITPHALEAI 418
Query: 876 LKYLFDQKTLNMRQRRWVEFLEDYDFKLQYHP----GKANVVADALSRKSLHAARLMIEE 931
++ D+ N R + L D D ++Q+ P A ++ ++S H R ++ E
Sbjct: 419 VRQTPDRWITNARLTHYQALLLDTD-RIQFGPPVTLNPATLLPAPEDQQSAHDCRQVLAE 477
Query: 932 T 932
T
Sbjct: 478 T 478
Score = 74.7 bits (182), Expect = 2e-12
Identities = 103/390 (26%), Positives = 158/390 (40%), Gaps = 47/390 (12%)
Query: 956 TLTLTNEFIEE--VKKEQARDENLQKEAHGRDSMSRPDFLKGPDGLWRYQGRLCVPEGGE 1013
TLTLT + E + A Q+EA ++ D W +G++ +P
Sbjct: 640 TLTLTTKLEETNLTTNKYAYTPEDQEEAKAIGAILNQDTKD-----WEKEGKIVLP---- 690
Query: 1014 LRQKILEEGHKSDFSIHPGTTKMYQDLKKMFWWPGMKKDIMKKVTS-CLTCQKVK-GEHQ 1071
R++ L + H K+ ++K + ++++VTS C CQ+V G +
Sbjct: 691 -RKEALAMIQQMHAWTHLSNQKLKLLIEKTDFLIPKAGTLIEQVTSACKVCQQVNAGATR 749
Query: 1072 KPSGSLQPLSIPEWKWEGISMDFVSGLPRTTTGHDAIWVIVDRLTKSAHFIAVNMTFPSE 1131
P G + P WE +DF P G+ + V VD + +
Sbjct: 750 VPEGKRTRGNRPGVYWE---IDFTEVKPHYA-GYKYLLVFVDTFSGWVEAYPTRQE-TAH 804
Query: 1132 KLARIYVKEIVRLHGVPANIVSDRDPRFVSKFWGSLHEALGTRLSLSSAYHPQSDGQSER 1191
+A+ ++EI G+P I SD P FVS+ L LG L AY PQS GQ ER
Sbjct: 805 MVAKKILEEIFPRFGLPKVIGSDNGPAFVSQVSQGLARTLGINWKLHCAYRPQSSGQVER 864
Query: 1192 TIQTLEDMLRACVLDY-KGSWEDFLPLAEFSYNNSYHSSLGMAPFEALYGRRCKTPLCWL 1250
+T+++ L L+ W L LA N+ + G+ P+E LYG PL L
Sbjct: 865 MNRTIKETLTKLTLETGLKDWRRLLSLALLRARNT-PNRFGLTPYEILYGG--PPPLSTL 921
Query: 1251 -----SGEDKITLGPEL--LQEMTEKVRSIREKLRIAQDRQKSYYDKRHKPLEFQEGDHV 1303
+ K L L LQ + ++ + +L Q SY FQ GD V
Sbjct: 922 LNSFSPSDPKTDLQARLKGLQAVQAQIWTPLAELYRPGHPQTSY--------PFQVGDSV 973
Query: 1304 FLRVTPITGVGRSIHSKKLTPKYLGPYQIL 1333
++R S+ L P++ GPY +L
Sbjct: 974 YVRWH---------RSQGLEPRWKGPYIVL 994
>POL_BAEVM (P10272) Pol polyprotein [Contains: Protease (EC
3.4.23.-); Reverse transcriptase/ribonuclease H (EC
2.7.7.49) (EC 3.1.26.4) (RT); Integrase (IN)]
Length = 1189
Score = 161 bits (408), Expect = 1e-38
Identities = 129/493 (26%), Positives = 230/493 (46%), Gaps = 36/493 (7%)
Query: 473 DIPVVREFPEVFPEDMTELPPER--------EVEFAIDVIPGTTPISAAPYRISPLELAE 524
DIPV P+V+ +D + E + ID+ P P+S Y +S
Sbjct: 132 DIPVTTSLPDVWLQDFPQAWAETGGLGRAKCQAPIIIDLKPTAVPVSIKQYPMSLEAHMG 191
Query: 525 LQKQVEELLSKGFIRPSVSPWGAPVLLVKKKDGS-MRLCVDYRQLNKVTIKNRYPLPRID 583
+++ + + L G +RP SPW P+L VKK R D R++NK T+ +P
Sbjct: 192 IRQHIIKFLELGVLRPCRSPWNTPLLPVKKPGTQDYRPVQDLREINKRTVDIHPTVPNPY 251
Query: 584 DLMDQLK-GARVFSKIDLRSGYHQIRVKSDDVQKTAFRTR------YGHYEYLVMPFGVT 636
+L+ LK ++ +DL+ + + + + AF + G + +P G
Sbjct: 252 NLLSTLKPDYSWYTVLDLKDAFFCLPLAPQSQELFAFEWKDPERGISGQLTWTRLPQGFK 311
Query: 637 NAPAIFMDYMNRIF------HPYLDKFVIVFIDDILIYSKSKEEHVEHMQVVLKVLKDRK 690
N+P +F + ++R HP + ++ ++DD+L+ + +K+ + + +L+ L ++
Sbjct: 312 NSPTLFDEALHRDLTDFRTQHPEVT--LLQYVDDLLLAAPTKKACTQGTRHLLQELGEKG 369
Query: 691 LYAKLSKCEFWLEQVQFLGHVVSEDGIAVDPAKVEAVNSWKVPETVTGVRSFLGLAGYYR 750
A K + +V +LG+++SE + P ++E V P VR FLG AG+ R
Sbjct: 370 YRASAKKAQICQTKVTYLGYILSEGKRWLTPGRIETVARIPPPRNPREVREFLGTAGFCR 429
Query: 751 RFIEGFSKIATPLTQLTKKDHPFVWTEKCE*SFQTLKERLTKAPVLTLPDPSKDYDVYCD 810
+I GF+++A PL LTK+ PF W + + +F+ LK+ L AP L LPD SK + ++ D
Sbjct: 430 LWIPGFAELAAPLYALTKESTPFTWQTEHQLAFEALKKALLSAPALGLPDTSKPFTLFLD 489
Query: 811 ASKSGLGCVLMQE----RKVIAYASQQLRPHEQNYPTHDMELAAVVFALKIWRHYLYGVK 866
+ VL Q+ ++ +AY S++L P +P +AA +K G
Sbjct: 490 ERQGIAKGVLTQKLGPWKRPVAYLSKKLDPVAAGWPPCLRIMAATAMLVKDSAKLTLGQP 549
Query: 867 FTIYSDH---QSLKYLFDQKTLNMRQRRWVEFLEDYDFKLQYHP----GKANVVADALSR 919
T+ + H ++ D+ N R + L D D ++Q+ P A ++ ++
Sbjct: 550 LTVITPHTLEAIVRQPPDRWITNARLTHYQALLLDTD-RVQFGPPVTLNPATLLPVPENQ 608
Query: 920 KSLHAARLMIEET 932
S H R ++ ET
Sbjct: 609 PSPHDCRQVLAET 621
Score = 79.7 bits (195), Expect = 5e-14
Identities = 90/344 (26%), Positives = 151/344 (43%), Gaps = 40/344 (11%)
Query: 1000 WRYQGRLCVPEGGELRQKILEEGHKSDFSIHPGTTKMYQDLKKMFWWPGMKKDIMKKVTS 1059
W +G++ +P+ L ++++ H H G K+ ++K + ++++VTS
Sbjct: 824 WEKEGKIVLPQKEALA--MIQQMHAWT---HLGNRKLKLLIEKTDFLIPRASTLIEQVTS 878
Query: 1060 -CLTCQKVK-GEHQKPSGSLQPLSIPEWKWEGISMDFVSGLPRTTTGHDAIWVIVDRLTK 1117
C CQ+V G + P+G + P WE +DF P G+ + V VD +
Sbjct: 879 ACKVCQQVNAGATRVPAGKRTRGNRPGVYWE---IDFTEVKPHYA-GYKYLLVFVDTFSG 934
Query: 1118 SAHFIAVNMTFPSEK-----LARIYVKEIVRLHGVPANIVSDRDPRFVSKFWGSLHEALG 1172
FP+ + +A+ ++EI G+P I SD P FVS+ L LG
Sbjct: 935 WVE------AFPTRQETAHIVAKKILEEIFPRFGLPKVIGSDNGPAFVSQVSQGLARILG 988
Query: 1173 TRLSLSSAYHPQSDGQSERTIQTLEDMLRACVLDY-KGSWEDFLPLAEFSYNNSYHSSLG 1231
L AY PQS GQ ER +T+++ L L+ W L LA N+ + G
Sbjct: 989 INWKLHCAYRPQSSGQVERMNRTIKETLTKLTLETGLKDWRRLLSLALLRARNT-PNRFG 1047
Query: 1232 MAPFEALYGRRCKTPLCWLSGEDKITLGPELLQEMTEKVRSIREKL--RIAQDRQKSYYD 1289
+ P+E LYG PL L + LQ + +++++ ++ +A+ + +
Sbjct: 1048 LTPYEILYGG--PPPLSTLLNSFSPSNSKTDLQARLKGLQAVQAQIWAPLAELYRPGHSQ 1105
Query: 1290 KRHKPLEFQEGDHVFLRVTPITGVGRSIHSKKLTPKYLGPYQIL 1333
H FQ GD V++ R S+ L P++ GPY +L
Sbjct: 1106 TSH---PFQVGDSVYV---------RRHRSQGLEPRWKGPYIVL 1137
>POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC
3.4.23.-); Reverse transcriptase/ribonuclease H (EC
2.7.7.49) (EC 3.1.26.4) (RT); Integrase (IN)]
Length = 1165
Score = 161 bits (407), Expect = 1e-38
Identities = 144/560 (25%), Positives = 246/560 (43%), Gaps = 58/560 (10%)
Query: 343 LYDSGATHSFISHERAK--SLKLVITQLPYDLVVTTPTKE------SAVTSS--VCKKCP 392
L D+GA HS ++ K S + V+ V TK VT S V +CP
Sbjct: 25 LVDTGAEHSVLTQPMGKVGSRRTVVEGATGSKVYPWTTKRLLKIGHKQVTHSFLVIPECP 84
Query: 393 LVIEDREYITNL---VCLPLEGLDIILGMNWLSINNVLLDCRLRVPIFLQKYKEKHTASL 449
+ R+ +T L + EG + G + + + L++ H +
Sbjct: 85 APLLGRDLLTKLKAQIQFSAEGPQVTWGER----------PTMCLVLNLEEEYRLHEKPV 134
Query: 450 PEKEPSAYLILFSSEGTKRPAMEDIPVVREFPEVFPEDMTELPPEREVEFAIDVIPGTTP 509
P ++L LF + +R M + + P V +++ G +P
Sbjct: 135 PSSIDPSWLQLFPTVWAERAGMG---LANQVPPV----------------VVELRSGASP 175
Query: 510 ISAAPYRISPLELAELQKQVEELLSKGFIRPSVSPWGAPVLLVKKKD-GSMRLCVDYRQL 568
++ Y +S ++ +++ L G + P SPW P+L VKK R D R++
Sbjct: 176 VAVRQYPMSKEAREGIRPHIQKFLDLGVLVPCRSPWNTPLLPVKKPGTNDYRPVQDLREI 235
Query: 569 NKVTIKNRYPLPRIDDLMDQLKGARV-FSKIDLRSGYHQIRVKSDDVQKTAFRTR----- 622
NK +P +L+ L + +S +DL+ + +R+ + AF +
Sbjct: 236 NKRVQDIHPTVPNPYNLLSSLPPSYTWYSVLDLKDAFFCLRLHPNSQPLFAFEWKDPEKG 295
Query: 623 -YGHYEYLVMPFGVTNAPAIFMDYMNRIFHPY--LDKFVIV--FIDDILIYSKSKEEHVE 677
G + +P G N+P +F + ++R P+ L+ V++ ++DD+L+ + + E+ +
Sbjct: 296 NTGQLTWTRLPQGFKNSPTLFDEALHRDLAPFRALNPQVVLLQYVDDLLVAAPTYEDCKK 355
Query: 678 HMQVVLKVLKDRKLYAKLSKCEFWLEQVQFLGHVVSEDGIAVDPAKVEAVNSWKVPETVT 737
Q +L+ L K + +V +LG+++ E + PA+ V VP T
Sbjct: 356 GTQKLLQELSKLGYRVSAKKAQLCQREVTYLGYLLKEGKRWLTPARKATVMKIPVPTTPR 415
Query: 738 GVRSFLGLAGYYRRFIEGFSKIATPLTQLTKKDHPFVWTEKCE*SFQTLKERLTKAPVLT 797
VR FLG AG+ R +I GF+ +A PL LTK+ PF+WTE+ + +F +K+ L AP L
Sbjct: 416 QVREFLGTAGFCRLWIPGFASLAAPLYPLTKESIPFIWTEEHQQAFDHIKKALLSAPALA 475
Query: 798 LPDPSKDYDVYCDASKSGLGCVLMQE----RKVIAYASQQLRPHEQNYPTHDMELAAVVF 853
LPD +K + +Y D VL Q R+ +AY S++L P +PT +AAV
Sbjct: 476 LPDLTKPFTLYIDERAGVARGVLTQTLGPWRRPVAYLSKKLDPVASGWPTCLKAVAAVAL 535
Query: 854 ALKIWRHYLYGVKFTIYSDH 873
LK G T+ + H
Sbjct: 536 LLKDADKLTLGQNVTVIASH 555
Score = 72.4 bits (176), Expect = 8e-12
Identities = 84/316 (26%), Positives = 127/316 (39%), Gaps = 38/316 (12%)
Query: 1030 HPGTTKMYQDLKKM-FWWPGMKKDIMKKVTSCLTCQKVKG-EHQKPSGSLQPLSIPEWKW 1087
H G K+ Q + + P ++ + + + C C + +G Q P W
Sbjct: 821 HLGPEKLLQLVNRTSLLIPNLQSAVREVTSQCQACAMTNAVTTYRETGKRQRGDRPGVYW 880
Query: 1088 EGISMDFVSGLPRTTTGHDAIWVIVDRLTKSAHFIAVNMTFPSEKLARIYV-----KEIV 1142
E +DF P G+ + V +D + FP++ + V +EI+
Sbjct: 881 E---VDFTEIKPGRY-GNKYLLVFIDTFSGWVE------AFPTKTETALIVCKKILEEIL 930
Query: 1143 RLHGVPANIVSDRDPRFVSKFWGSLHEALGTRLSLSSAYHPQSDGQSERTIQTLEDMLRA 1202
G+P + SD P FV++ L LG L AY PQS GQ ER +T+++ L
Sbjct: 931 PRFGIPKVLGSDNGPAFVAQVSQGLATQLGINWKLHCAYRPQSSGQVERMNRTIKETLTK 990
Query: 1203 CVLDYKG-SWEDFLPLAEFSYNNSYHSSLGMAPFEALYGRRCKTPLCWLSGEDKITLGPE 1261
L+ G W LPLA N+ G+ P+E LYG P SGE TLGP+
Sbjct: 991 LALETGGKDWVTLLPLALLRARNT-PGRFGLTPYEILYG---GPPPILESGE---TLGPD 1043
Query: 1262 --LLQEMTEKVRSIREKLRIAQDRQKSYYDKRHK--PLEFQEGDHVFLRVTPITGVGRSI 1317
L + ++++ D+ K Y P FQ GD V + R
Sbjct: 1044 DRFLPVLFTHLKALEIVRTQIWDQIKEVYKPGTVTIPHPFQVGDQVLV---------RRH 1094
Query: 1318 HSKKLTPKYLGPYQIL 1333
L P++ GPY +L
Sbjct: 1095 RPSSLEPRWKGPYLVL 1110
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.319 0.136 0.411
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 176,232,756
Number of Sequences: 164201
Number of extensions: 7867807
Number of successful extensions: 20611
Number of sequences better than 10.0: 145
Number of HSP's better than 10.0 without gapping: 100
Number of HSP's successfully gapped in prelim test: 45
Number of HSP's that attempted gapping in prelim test: 20110
Number of HSP's gapped (non-prelim): 290
length of query: 1435
length of database: 59,974,054
effective HSP length: 123
effective length of query: 1312
effective length of database: 39,777,331
effective search space: 52187858272
effective search space used: 52187858272
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 72 (32.3 bits)
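
Note on the statistics above: the bit scores and Expect values in the alignment headers can be approximately recovered from the raw scores (the numbers in parentheses) using the gapped Lambda and K and the effective search space listed in this footer. The sketch below applies the standard Karlin-Altschul relations. The report itself does not state these formulas, so treating them as the exact computation performed by this BLAST version is an assumption. The function names are illustrative only.

```python
import math

# Values copied from the "Gapped" statistics and the footer of this report.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
EFFECTIVE_SEARCH_SPACE = 52187858272

def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)

def expect_value(raw_score, space=EFFECTIVE_SEARCH_SPACE,
                 lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Expected number of chance HSPs scoring at least raw_score: K*N*exp(-lambda*S)."""
    return k * space * math.exp(-lam * raw_score)

if __name__ == "__main__":
    # Raw scores taken from alignment headers in this report.
    for raw in (554, 548, 182):
        print(f"raw={raw}  bits={bit_score(raw):.1f}  E={expect_value(raw):.1e}")
```

For the raw score 554 this gives roughly 218 bits and an Expect value near 1e-55, in line with the corresponding alignment header. Small differences from the printed values are expected because the report rounds or truncates what it displays.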