
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0102a.7
(1496 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 505 e-142
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 503 e-141
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 503 e-141
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 352 6e-96
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 352 6e-96
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 346 2e-94
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 325 6e-88
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 323 3e-87
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 275 7e-73
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 202 4e-51
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 202 4e-51
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 201 1e-50
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 199 6e-50
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 196 3e-49
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot... 194 2e-48
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot... 185 9e-46
POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;... 168 9e-41
POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC 3.4.23... 159 7e-38
POL_BAEVM (P10272) Pol polyprotein [Contains: Protease (EC 3.4.2... 156 5e-37
POL_FENV1 (P31792) Pol polyprotein [Contains: Reverse transcript... 152 5e-36
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 505 bits (1301), Expect = e-142
Identities = 297/903 (32%), Positives = 484/903 (52%), Gaps = 36/903 (3%)
Query: 534 VVQEFEDVFPE-DVPGIP-PVRDMEFTIDIVPGTGPISIAPYRMAPAELTELKSQLEDLT 591
+ +EF+D+ E + +P P++ +EF +++ + I Y + P ++ + ++
Sbjct: 377 IYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEINQGL 436
Query: 592 KKGFIRPSVSPWGAPVLLVKKKDGRSRLCVDYRQLNKVTIKNCYPLPRIDDLMDQLKGAA 651
K G IR S + PV+ V KK+G R+ VDY+ LNK N YPLP I+ L+ +++G+
Sbjct: 437 KSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGST 496
Query: 652 IFSKIDLRSGYHQIRVKDEDIQKTAFRTRYGHYEYLVMPFGVTNAPAVFMDYMNRIFHPF 711
IF+K+DL+S YH IRV+ D K AFR G +EYLVMP+G++ APA F ++N I
Sbjct: 497 IFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEA 556
Query: 712 LDRFVVVFIDDILIYSRNLEEHEEHLRQVLQVLREKVLYANAAKCEFWLEEVKFLGHVIS 771
+ VV ++DDILI+S++ EH +H++ VLQ L+ L N AKCEF +VKF+G+ IS
Sbjct: 557 KESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHIS 616
Query: 772 KEGIAVDPSKVETVLAWDRPKTVTDIRSFIGLAGYYRRFIEGFAKIAGPLTKLTRKNQPF 831
++G ++ VL W +PK ++R F+G Y R+FI +++ PL L +K+ +
Sbjct: 617 EKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRW 676
Query: 832 AWTEDCEQSFQDMKERLTTAPVLTLPQEEEPYEVYCDASYQGLGCVLMQHRK-----AVA 886
WT Q+ +++K+ L + PVL + + DAS +G VL Q V
Sbjct: 677 KWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVG 736
Query: 887 YSSRQLKIHERNYPTHDLELAAVVFALKIWRHYLYGS--TFTVFSDHKSL--KYLFDQKD 942
Y S ++ + NY D E+ A++ +LK WRHYL + F + +DH++L + + +
Sbjct: 737 YYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEP 796
Query: 943 LNMRQRRWMEFIKDYDFTLLYHPGKANVVADALSRQTIHVSSLMIKELELI-ETFRDLSL 1001
N R RW F++D++F + Y PG AN +ADALSR ++ E E I + D S+
Sbjct: 797 ENKRLARWQLFLQDFNFEINYRPGSANHIADALSR--------IVDETEPIPKDSEDNSI 848
Query: 1002 GMQVTPGKLSFGMVTITSDFLNEIKVKQLLDEELIEKRNLIILGKAPDFEVGTDNILRCK 1061
++IT DF N++ + D +L+ N + ++ ++ K
Sbjct: 849 NF--------VNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSK 900
Query: 1062 GRVCVPLDMELRRMILDEGHKSRLSIHPGSTKMYQDLKLNFWWPGMKKNVAEYVAACLTC 1121
++ +P D +L R I+ + H+ IHPG + + F W G++K + EYV C TC
Sbjct: 901 DQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTC 960
Query: 1122 QKAKIEHQKPAGMLQSLDVPEWKWDSISMDFVVALPATRKRFDSIWVIVDRLTKSAHFIP 1181
Q K + KP G LQ + E W+S+SMDF+ ALP + +++++V+VDR +K A +P
Sbjct: 961 QINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAILVP 1019
Query: 1182 VKTTFNVEALAKVYVAEIVRLHGVPSSIVSDRDPKFTSHFWKALHEALGTKLRLSSAYHP 1241
+ E A+++ ++ G P I++D D FTS WK ++ S Y P
Sbjct: 1020 CTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRP 1079
Query: 1242 QTDGQTERTIQSLEDLLRACVLDSQESWDELLPLIEFTYNNSFHASIGMAPYEALYGRRC 1301
QTDGQTERT Q++E LLR +W + + L++ +YNN+ H++ M P+E ++ R
Sbjct: 1080 QTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVH--RY 1137
Query: 1302 RTPLCWFQDGEHLLVGPELVQQTTEKVKQIQEKMRTSQSRQKSYADTRRREL-EFEAGDH 1360
L + E Q+T + + ++E + T+ + K Y D + +E+ EF+ GD
Sbjct: 1138 SPALSPLELPSFSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDL 1197
Query: 1361 VFLRVTPTTGVGRAIKSRKLTPKFIGPYQIIERVGKVAYRIALPPFLSKI-HDVLHVSQL 1419
V ++ T T G KS KL P F GP+ ++++ G Y + LP + + HVS L
Sbjct: 1198 VMVKRTKT---GFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHL 1254
Query: 1420 RKY 1422
KY
Sbjct: 1255 EKY 1257
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 503 bits (1294), Expect = e-141
Identities = 296/903 (32%), Positives = 484/903 (52%), Gaps = 36/903 (3%)
Query: 534 VVQEFEDVFPE-DVPGIP-PVRDMEFTIDIVPGTGPISIAPYRMAPAELTELKSQLEDLT 591
+ +EF+D+ E + +P P++ +EF +++ + I Y + P ++ + ++
Sbjct: 377 IYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEINQGL 436
Query: 592 KKGFIRPSVSPWGAPVLLVKKKDGRSRLCVDYRQLNKVTIKNCYPLPRIDDLMDQLKGAA 651
K G IR S + PV+ V KK+G R+ VDY+ LNK N YPLP I+ L+ +++G+
Sbjct: 437 KSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGST 496
Query: 652 IFSKIDLRSGYHQIRVKDEDIQKTAFRTRYGHYEYLVMPFGVTNAPAVFMDYMNRIFHPF 711
IF+K+DL+S YH IRV+ D K AFR G +EYLVMP+G++ APA F ++N I
Sbjct: 497 IFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEV 556
Query: 712 LDRFVVVFIDDILIYSRNLEEHEEHLRQVLQVLREKVLYANAAKCEFWLEEVKFLGHVIS 771
+ VV ++D+ILI+S++ EH +H++ VLQ L+ L N AKCEF +VKF+G+ IS
Sbjct: 557 KESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHIS 616
Query: 772 KEGIAVDPSKVETVLAWDRPKTVTDIRSFIGLAGYYRRFIEGFAKIAGPLTKLTRKNQPF 831
++G ++ VL W +PK ++R F+G Y R+FI +++ PL L +K+ +
Sbjct: 617 EKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRW 676
Query: 832 AWTEDCEQSFQDMKERLTTAPVLTLPQEEEPYEVYCDASYQGLGCVLMQHRK-----AVA 886
WT Q+ +++K+ L + PVL + + DAS +G VL Q V
Sbjct: 677 KWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVG 736
Query: 887 YSSRQLKIHERNYPTHDLELAAVVFALKIWRHYLYGS--TFTVFSDHKSL--KYLFDQKD 942
Y S ++ + NY D E+ A++ +LK WRHYL + F + +DH++L + + +
Sbjct: 737 YYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEP 796
Query: 943 LNMRQRRWMEFIKDYDFTLLYHPGKANVVADALSRQTIHVSSLMIKELELI-ETFRDLSL 1001
N R RW F++D++F + Y PG AN +ADALSR ++ E E I + D S+
Sbjct: 797 ENKRLARWQLFLQDFNFEINYRPGSANHIADALSR--------IVDETEPIPKDSEDNSI 848
Query: 1002 GMQVTPGKLSFGMVTITSDFLNEIKVKQLLDEELIEKRNLIILGKAPDFEVGTDNILRCK 1061
++IT DF N++ + D +L+ N + ++ ++ K
Sbjct: 849 NF--------VNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSK 900
Query: 1062 GRVCVPLDMELRRMILDEGHKSRLSIHPGSTKMYQDLKLNFWWPGMKKNVAEYVAACLTC 1121
++ +P D +L R I+ + H+ IHPG + + F W G++K + EYV C TC
Sbjct: 901 DQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTC 960
Query: 1122 QKAKIEHQKPAGMLQSLDVPEWKWDSISMDFVVALPATRKRFDSIWVIVDRLTKSAHFIP 1181
Q K + KP G LQ + E W+S+SMDF+ ALP + +++++V+VDR +K A +P
Sbjct: 961 QINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAILVP 1019
Query: 1182 VKTTFNVEALAKVYVAEIVRLHGVPSSIVSDRDPKFTSHFWKALHEALGTKLRLSSAYHP 1241
+ E A+++ ++ G P I++D D FTS WK ++ S Y P
Sbjct: 1020 CTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRP 1079
Query: 1242 QTDGQTERTIQSLEDLLRACVLDSQESWDELLPLIEFTYNNSFHASIGMAPYEALYGRRC 1301
QTDGQTERT Q++E LLR +W + + L++ +YNN+ H++ M P+E ++ R
Sbjct: 1080 QTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVH--RY 1137
Query: 1302 RTPLCWFQDGEHLLVGPELVQQTTEKVKQIQEKMRTSQSRQKSYADTRRREL-EFEAGDH 1360
L + E Q+T + + ++E + T+ + K Y D + +E+ EF+ GD
Sbjct: 1138 SPALSPLELPSFSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDL 1197
Query: 1361 VFLRVTPTTGVGRAIKSRKLTPKFIGPYQIIERVGKVAYRIALPPFLSKI-HDVLHVSQL 1419
V ++ T T G KS KL P F GP+ ++++ G Y + LP + + HVS L
Sbjct: 1198 VMVKRTKT---GFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHL 1254
Query: 1420 RKY 1422
KY
Sbjct: 1255 EKY 1257
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 503 bits (1294), Expect = e-141
Identities = 296/903 (32%), Positives = 484/903 (52%), Gaps = 36/903 (3%)
Query: 534 VVQEFEDVFPE-DVPGIP-PVRDMEFTIDIVPGTGPISIAPYRMAPAELTELKSQLEDLT 591
+ +EF+D+ E + +P P++ +EF +++ + I Y + P ++ + ++
Sbjct: 377 IYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEINQGL 436
Query: 592 KKGFIRPSVSPWGAPVLLVKKKDGRSRLCVDYRQLNKVTIKNCYPLPRIDDLMDQLKGAA 651
K G IR S + PV+ V KK+G R+ VDY+ LNK N YPLP I+ L+ +++G+
Sbjct: 437 KSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGST 496
Query: 652 IFSKIDLRSGYHQIRVKDEDIQKTAFRTRYGHYEYLVMPFGVTNAPAVFMDYMNRIFHPF 711
IF+K+DL+S YH IRV+ D K AFR G +EYLVMP+G++ APA F ++N I
Sbjct: 497 IFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEV 556
Query: 712 LDRFVVVFIDDILIYSRNLEEHEEHLRQVLQVLREKVLYANAAKCEFWLEEVKFLGHVIS 771
+ VV ++D+ILI+S++ EH +H++ VLQ L+ L N AKCEF +VKF+G+ IS
Sbjct: 557 KESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHIS 616
Query: 772 KEGIAVDPSKVETVLAWDRPKTVTDIRSFIGLAGYYRRFIEGFAKIAGPLTKLTRKNQPF 831
++G ++ VL W +PK ++R F+G Y R+FI +++ PL L +K+ +
Sbjct: 617 EKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRW 676
Query: 832 AWTEDCEQSFQDMKERLTTAPVLTLPQEEEPYEVYCDASYQGLGCVLMQHRK-----AVA 886
WT Q+ +++K+ L + PVL + + DAS +G VL Q V
Sbjct: 677 KWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVG 736
Query: 887 YSSRQLKIHERNYPTHDLELAAVVFALKIWRHYLYGS--TFTVFSDHKSL--KYLFDQKD 942
Y S ++ + NY D E+ A++ +LK WRHYL + F + +DH++L + + +
Sbjct: 737 YYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEP 796
Query: 943 LNMRQRRWMEFIKDYDFTLLYHPGKANVVADALSRQTIHVSSLMIKELELI-ETFRDLSL 1001
N R RW F++D++F + Y PG AN +ADALSR ++ E E I + D S+
Sbjct: 797 ENKRLARWQLFLQDFNFEINYRPGSANHIADALSR--------IVDETEPIPKDSEDNSI 848
Query: 1002 GMQVTPGKLSFGMVTITSDFLNEIKVKQLLDEELIEKRNLIILGKAPDFEVGTDNILRCK 1061
++IT DF N++ + D +L+ N + ++ ++ K
Sbjct: 849 NF--------VNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSK 900
Query: 1062 GRVCVPLDMELRRMILDEGHKSRLSIHPGSTKMYQDLKLNFWWPGMKKNVAEYVAACLTC 1121
++ +P D +L R I+ + H+ IHPG + + F W G++K + EYV C TC
Sbjct: 901 DQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTC 960
Query: 1122 QKAKIEHQKPAGMLQSLDVPEWKWDSISMDFVVALPATRKRFDSIWVIVDRLTKSAHFIP 1181
Q K + KP G LQ + E W+S+SMDF+ ALP + +++++V+VDR +K A +P
Sbjct: 961 QINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAILVP 1019
Query: 1182 VKTTFNVEALAKVYVAEIVRLHGVPSSIVSDRDPKFTSHFWKALHEALGTKLRLSSAYHP 1241
+ E A+++ ++ G P I++D D FTS WK ++ S Y P
Sbjct: 1020 CTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRP 1079
Query: 1242 QTDGQTERTIQSLEDLLRACVLDSQESWDELLPLIEFTYNNSFHASIGMAPYEALYGRRC 1301
QTDGQTERT Q++E LLR +W + + L++ +YNN+ H++ M P+E ++ R
Sbjct: 1080 QTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVH--RY 1137
Query: 1302 RTPLCWFQDGEHLLVGPELVQQTTEKVKQIQEKMRTSQSRQKSYADTRRREL-EFEAGDH 1360
L + E Q+T + + ++E + T+ + K Y D + +E+ EF+ GD
Sbjct: 1138 SPALSPLELPSFSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDL 1197
Query: 1361 VFLRVTPTTGVGRAIKSRKLTPKFIGPYQIIERVGKVAYRIALPPFLSKI-HDVLHVSQL 1419
V ++ T T G KS KL P F GP+ ++++ G Y + LP + + HVS L
Sbjct: 1198 VMVKRTKT---GFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHL 1254
Query: 1420 RKY 1422
KY
Sbjct: 1255 EKY 1257
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 352 bits (902), Expect = 6e-96
Identities = 261/924 (28%), Positives = 443/924 (47%), Gaps = 65/924 (7%)
Query: 534 VVQEFEDVFPEDVPGIPPVRDMEFTIDIVPGTGPISIAPYRMAPAELTELKSQLEDLTKK 593
V+++F+DVF + E I++ G PI P + A E++ ++ + +
Sbjct: 909 VIEQFQDVFAISDDELGRNSGTECVIELKEGAEPIRQKPRPIPLALKPEIRKMIQKMLNQ 968
Query: 594 GFIRPSVSPWGAPVLLVKKKDGRSRLCVDYRQLNKVTIKNCYPLPRIDDLMDQLKGAAIF 653
IR S SPW +PV+LVKKKDG R+C+DYR++NKV N +PLP I+ + L G ++
Sbjct: 969 KVIRESKSPWSSPVVLVKKKDGSIRMCIDYRKVNKVVKNNAHPLPNIEATLQSLAGKKLY 1028
Query: 654 SKIDLRSGYHQIRVKDEDIQKTAFRTRYGHYEYLVMPFGVTNAPAVFMDYMNRIFHPFLD 713
+ D+ +G+ QI + ++ + TAF +E+ V+PFG+ +PA+F M I L
Sbjct: 1029 TVFDMIAGFWQIPLDEKSKEITAFAIGSELFEWNVLPFGLVISPALFQGTMEEIIGDLLG 1088
Query: 714 RFVVVFIDDILIYSRNLEEHEEHLRQVLQVLREKVLYANAAKCEFWLEEVKFLGHVISKE 773
V++DD+LI S+++E+H + +++ L +R+ + A+KC +EV++LGH ++ +
Sbjct: 1089 VCAFVYVDDLLIASKDMEQHLQDVKEALTRIRKSGMKLRASKCHIAKKEVEYLGHKVTLD 1148
Query: 774 GIAVDPSKVETVLAWDRPKTVTDIRSFIGLAGYYRRFIEGFAKIAGPLTKLTRKNQPFAW 833
G+ K + + + RP V +++SF+GL GYYR+FI FA+IA LT L + W
Sbjct: 1149 GVETQEVKTDKMKQFSRPTNVKELQSFLGLVGYYRKFILNFAQIASSLTSLISAKVAWIW 1208
Query: 834 TEDCEQSFQDMKERLTTAPVLTLPQEE------EPYEVYCDASYQGLGCVLMQ-----HR 882
++ E +FQ++K+ + PVL P E P+ +Y DAS +G+G VL Q +
Sbjct: 1209 EKEQEIAFQELKKLVCQTPVLAQPDVEAALKGDRPFMIYTDASRKGIGAVLAQEGPDGQQ 1268
Query: 883 KAVAYSSRQLKIHERNYPTHDLELAAVVFALKIWRHYLYGSTFTVFSDHKSLKYLFDQKD 942
+A++S+ L E Y DLE A++FAL+ ++ +YG+ TVF+DHK L L
Sbjct: 1269 HPIAFASKALSPAETRYHITDLEALAMMFALRRFKTIIYGTAITVFTDHKPLISLLKGSP 1328
Query: 943 LNMRQRRWMEFIKDYDFTLLYHPGKANVVADALSR-----------QTIHVSSLMIKELE 991
L R RW I ++D ++Y GKAN VADALSR QT ++S++
Sbjct: 1329 LADRLWRWSIEILEFDVKIVYLAGKANAVADALSRGGCPPNELEEEQTKELTSIVNAIQT 1388
Query: 992 LIETFRDLSLGMQVTPGKLSFGMVTITSDFLNEIKVKQL-----LDEELIEKRNLIILGK 1046
+ D S ++ G+ I + L K K ++ E+ + I+ G
Sbjct: 1389 ELPDILDSSCWLERLKGEDEGWKEVIAA--LEGGKTKGTFKIVGIESEISLEYYKIVGGV 1446
Query: 1047 APDFEVGTDNILRCKGRVCVPLDMELRRMILDEGHKSRLSIHPGSTKMYQDLKLNFWWPG 1106
+ E+ + R VP ++R +L E H+ L+ H G KM++ + F+WP
Sbjct: 1447 LKNTEIEE------QSRSVVP--EKIRTPLLKELHEGMLAGHFGIKKMWRMVHRKFYWPQ 1498
Query: 1107 MKKNVAEYVAACLTCQKAKIEHQKPAGMLQSLDVPEWKWDSISMDFV-VALPATRKRFDS 1165
M+ V V C C A +H K L + + + ++ D + V L R+
Sbjct: 1499 MRVCVENCVRTCAKCLCAN-DHSKLTSSLTPYRM-TFPLEIVACDLMDVGLSVQGNRY-- 1554
Query: 1166 IWVIVDRLTKSAHFIPVKTTFNVEALAKVYVAEIVRLHG-VPSSIVSDRDPKFTSHFWKA 1224
I I+D TK +P+ E + K +V G +P +++D+ +F + +
Sbjct: 1555 ILTIIDLFTKYGTAVPIPDK-KAETVLKAFVERWAIGEGRIPLKLLTDQGKEFVNGLFAQ 1613
Query: 1225 LHEALGTKLRLSSAYHPQTDGQTERTIQSLEDLLRACVLDSQESWDELLPLIEFTYNNSF 1284
L + + Y+ + +G ER +++ +++ E WD+ + + YNN
Sbjct: 1614 FTHMLKIEHITTKGYNSRANGAVERFNKTIMHIMKKKTAVPME-WDDQVVYAVYAYNNCV 1672
Query: 1285 HASIGMAPYEALYGRRCRTPL------------CWFQDGEHLLVGPELVQQTTEKVKQIQ 1332
H + G P ++GR PL + +HLL L Q K ++
Sbjct: 1673 HENTGETPMFLMHGRDVMGPLEMSGEDAVGINYADMDEYKHLLTQELLKVQKIAKEHAMR 1732
Query: 1333 EKMRTSQSRQKSYADTRRRELEFEAGDHVFLRVTPTTGVGRAIKSRKLTPKFIGPYQIIE 1392
E+ + YA + R + G V L + P+ +G + KL K+ GPY++I
Sbjct: 1733 EQESYKSLFDQKYASKKHRFP--QPGSRVLLEI-PSEKLG--AQCPKLVNKWSGPYRVIS 1787
Query: 1393 RVGKVAYRIALPPFLSKIHDVLHV 1416
A + P L K +L +
Sbjct: 1788 CSENSA---EITPVLGKRKHILQI 1808
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 352 bits (902), Expect = 6e-96
Identities = 245/770 (31%), Positives = 392/770 (50%), Gaps = 60/770 (7%)
Query: 567 PISIAPYRMAPAELTELKSQLEDLTKKGFIRPSVSPWGAPVLLVKKKDGRS-----RLCV 621
PI Y +A E+++Q++++ +G IR S SP+ +P +V KK S R+ +
Sbjct: 206 PIYSKQYPLAQTHEIEVENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVVI 265
Query: 622 DYRQLNKVTIKNCYPLPRIDDLMDQLKGAAIFSKIDLRSGYHQIRVKDEDIQKTAFRTRY 681
DYR+LN++TI + YP+P +D+++ +L F+ IDL G+HQI + +E I KTAF T+
Sbjct: 266 DYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTKS 325
Query: 682 GHYEYLVMPFGVTNAPAVFMDYMNRIFHPFLDRFVVVFIDDILIYSRNLEEHEEHLRQVL 741
GHYEYL MPFG+ NAPA F MN I P L++ +V++DDI+I+S +L EH ++ V
Sbjct: 326 GHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLVF 385
Query: 742 QVLREKVLYANAAKCEFWLEEVKFLGHVISKEGIAVDPSKVETVLAWDRPKTVTDIRSFI 801
L + L KCEF +E FLGH+++ +GI +P KV+ ++++ P +IR+F+
Sbjct: 386 TKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAFL 445
Query: 802 GLAGYYRRFIEGFAKIAGPLTKLTRKNQPFAWTEDCE--QSFQDMKERLTTAPVLTLPQE 859
GL GYYR+FI +A IA P+T +K T+ E ++F+ +K + P+L LP
Sbjct: 446 GLTGYYRKFIPNYADIAKPMTSCLKKRTKID-TQKLEYIEAFEKLKALIIRDPILQLPDF 504
Query: 860 EEPYEVYCDASYQGLGCVLMQHRKAVAYSSRQLKIHERNYPTHDLELAAVVFALKIWRHY 919
E+ + + DAS LG VL Q+ +++ SR L HE NY + EL A+V+A K +RHY
Sbjct: 505 EKKFVLTTDASNLALGAVLSQNGHPISFISRTLNDHELNYSAIEKELLAIVWATKTFRHY 564
Query: 920 LYGSTFTVFSDHKSLKYLFDQKDLNMRQRRWMEFIKDYDFTLLYHPGKANVVADALSRQT 979
L G F + SDH+ L++L + K+ + RW + +Y F + Y GK N VADALSR
Sbjct: 565 LLGRQFLIASDHQPLRWLHNLKEPGAKLERWRVRLSEYQFKIDYIKGKENSVADALSRIK 624
Query: 980 IHV---------------SSLMIKELELIETFRDLSLGMQVTPGKLS----FG--MVTIT 1018
I S+L+ + I F+ + ++ K+ FG + TI
Sbjct: 625 IEENHHSEATQHSAEEDNSNLIHLTEKPINYFKKQIIFIKSDKNKVEHSKIFGNSITTIQ 684
Query: 1019 SDFLNEIKVKQLLDEELIEKRNLIILGKAPDFEV----GTDNILRCKGRVCVPLDM---- 1070
D + K KQ+L + I + I + DFE+ + + +V L +
Sbjct: 685 YDVMTLEKAKQILLDHFIHRNITIYIESDVDFEIVQRAHIEIVNTTYTKVIRSLFLLKNV 744
Query: 1071 ----ELRRMILDEGHKSRLSIHPGSTKMYQDLKLNFWWPGMKKNVAEYVAACLTCQKAKI 1126
E + +IL K +HPG KM + K N ++P + + + C C AK
Sbjct: 745 GSYAEFKEIILQSHEKL---LHPGIQKMTKLFKENHFFPNSQLLIQNIINECNICNLAKT 801
Query: 1127 EHQKPAGMLQSLDVPEWKWDSISMDFVVALPATRKRFDSIWVIVDRLTKSAHFIPVKTTF 1186
EH+ L+ PE + +D + K + S +D +K A +KT
Sbjct: 802 EHRNTKMPLKITPNPEHCREKFVVDI---YSSEGKHYIS---CIDIYSKFATLEQIKTKD 855
Query: 1187 NVEALAKVYVAEIVRLHGVPSSIVSDRDPKFTSHFWKALHEALGTKLRLSSAYHPQTDGQ 1246
+E + + I G P + +DRD F+S K E +L+L++A + D
Sbjct: 856 WIE--CRNALMRIFNQLGKPKLLKADRDGAFSSLALKRWLEEEEVELQLNTAKNGVAD-- 911
Query: 1247 TERTIQSLEDLLRACVLDSQESWDELLPLIE---FTYNNSF-HASIGMAP 1292
ER +++ + +R +++S + + L IE +TYN H + G P
Sbjct: 912 VERLHKTINEKIR--IINSSDDEEVKLSKIETILYTYNQKIKHDTTGQRP 959
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 346 bits (888), Expect = 2e-94
Identities = 238/760 (31%), Positives = 379/760 (49%), Gaps = 53/760 (6%)
Query: 573 YRMAPAELTELKSQLEDLTKKGFIRPSVSPWGAPVLLVKKKDGRS-----RLCVDYRQLN 627
Y A E++SQ++D+ +G IR S SP+ +P+ +V KK S R+ +DYR+LN
Sbjct: 213 YSYPQAYEQEVESQIQDMLNQGIIRTSNSPYNSPIWVVPKKQDASGKQKFRIVIDYRKLN 272
Query: 628 KVTIKNCYPLPRIDDLMDQLKGAAIFSKIDLRSGYHQIRVKDEDIQKTAFRTRYGHYEYL 687
++T+ + +P+P +D+++ +L F+ IDL G+HQI + E + KTAF T++GHYEYL
Sbjct: 273 EITVGDRHPIPNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYEYL 332
Query: 688 VMPFGVTNAPAVFMDYMNRIFHPFLDRFVVVFIDDILIYSRNLEEHEEHLRQVLQVLREK 747
MPFG+ NAPA F MN I P L++ +V++DDI+++S +L+EH + L V + L +
Sbjct: 333 RMPFGLKNAPATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKA 392
Query: 748 VLYANAAKCEFWLEEVKFLGHVISKEGIAVDPSKVETVLAWDRPKTVTDIRSFIGLAGYY 807
L KCEF +E FLGHV++ +GI +P K+E + + P +I++F+GL GYY
Sbjct: 393 NLKLQLDKCEFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTGYY 452
Query: 808 RRFIEGFAKIAGPLTKLTRKNQPFAWTE-DCEQSFQDMKERLTTAPVLTLPQEEEPYEVY 866
R+FI FA IA P+TK +KN T + + +F+ +K ++ P+L +P + + +
Sbjct: 453 RKFIPNFADIAKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLT 512
Query: 867 CDASYQGLGCVLMQHRKAVAYSSRQLKIHERNYPTHDLELAAVVFALKIWRHYLYGSTFT 926
DAS LG VL Q ++Y SR L HE NY T + EL A+V+A K +RHYL G F
Sbjct: 513 TDASDVALGAVLSQDGHPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGRHFE 572
Query: 927 VFSDHKSLKYLFDQKDLNMRQRRWMEFIKDYDFTLLYHPGKANVVADALSR--------- 977
+ SDH+ L +L+ KD N + RW + ++DF + Y GK N VADALSR
Sbjct: 573 ISSDHQPLSWLYRMKDPNSKLTRWRVKLSEFDFDIKYIKGKENCVADALSRIKLEETYLS 632
Query: 978 -QTIHVSSLMIKELELIETFRDLSLGMQVTPGK----------LSFGMVTITSDFLNEIK 1026
QT H + +L I + QV K + I D + K
Sbjct: 633 EQTQHSAEEDNSDLIFITERPLNTFNRQVIFSKGPPDIKVTKYFKKHITQIFYDIMTREK 692
Query: 1027 VKQLLDEELIEKRNLIILGKAPDFEV-----------GTDNILRCKGRVC-VPLDMELRR 1074
+Q L + K++ + + DFEV ILR + + E +
Sbjct: 693 AEQYLIDHFCGKKSALYIESDADFEVIQAAHKLAINTKYTKILRSTILLKNITTYAEFKE 752
Query: 1075 MILDEGHKSRLSIHPGSTKMYQDLKLNFWWPGMKKNVAEYVAACLTCQKAKIEHQKPAGM 1134
+IL K +HPG K + +++P + + + C C AK EH+
Sbjct: 753 LILTAHEKL---LHPGIQKTTKLFGETYYFPNSQLLIQNIINECSICNLAKTEHRNTDMP 809
Query: 1135 LQSLDVPEWKWDSISMDFVVALPATRKRFDSIWVIVDRLTKSAHFIPVKTTFNVEALAKV 1194
++ PE + +D + K + S +D +K A +KT +E K
Sbjct: 810 TKTTPKPEHCREKFMIDI---YSSEGKHYVS---CIDIYSKFATLEEIKTKDWIE--CKN 861
Query: 1195 YVAEIVRLHGVPSSIVSDRDPKFTSHFWKALHEALGTKLRLSSAYHPQTDGQTERTIQSL 1254
+ I G P + +DRD F+S K E+ +L+L++ D ER +++
Sbjct: 862 ALMRIFNQLGKPKLLKADRDGAFSSLALKRWLESEEVELQLNTTKTGVAD--IERLHKTI 919
Query: 1255 EDLLRAC-VLDSQESWDELLPLIEFTYNN-SFHASIGMAP 1292
+ +R D +E+ + + YN+ + H + G P
Sbjct: 920 NEKIRIIKTSDDEETKLSKMETVLNIYNHKTKHDTTGQTP 959
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 325 bits (833), Expect = 6e-88
Identities = 275/975 (28%), Positives = 447/975 (45%), Gaps = 113/975 (11%)
Query: 462 LKGLDVIIGMDWLSHHHVLLDCANKVVIFPDAGLAEFLNSYFSKLSLRKGALSSLMSTTV 521
L D IIG D L ++D N +I L A +S+ +
Sbjct: 28 LHSFDGIIGDDTLKDLKAIVDRKNNCLIITPGIKIPLL------------ARASINVNPL 75
Query: 522 MEAKE-NGIQGI--AVVQEFEDVFPEDVPGIPPVRDMEFTIDIVPGTGPISIAPYRMAPA 578
+ A+ +G Q I +++ EF +F + G+ ++ I PI Y
Sbjct: 76 LAAEHPDGTQEILNSLLGEFPRIFEPPLSGMSVETAVKAEIR-TNTQDPIYAKSYPYPVN 134
Query: 579 ELTELKSQLEDLTKKGFIRPSVSPWGAPVLLVKKK-----DGRSRLCVDYRQLNKVTIKN 633
E++ Q+++L + G IRPS SP+ +P+ +V KK + + R+ VD+++LN VTI +
Sbjct: 135 MRGEVERQIDELLQDGIIRPSNSPYNSPIWIVPKKPKPNGEKQYRMVVDFKRLNTVTIPD 194
Query: 634 CYPLPRIDDLMDQLKGAAIFSKIDLRSGYHQIRVKDEDIQKTAFRTRYGHYEYLVMPFGV 693
YP+P I+ + L A F+ +DL SG+HQI +K+ DI KTAF T G YE+L +PFG+
Sbjct: 195 TYPIPDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPFGL 254
Query: 694 TNAPAVFMDYMNRIFHPFLDRFVVVFIDDILIYSRNLEEHEEHLRQVLQVLREKVLYANA 753
NAPA+F ++ I + + V+IDDI+++S + + H ++LR VL L + L N
Sbjct: 255 KNAPAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNL 314
Query: 754 AKCEFWLEEVKFLGHVISKEGIAVDPSKVETVLAWDRPKTVTDIRSFIGLAGYYRRFIEG 813
K F +V+FLG++++ +GI DP KV + P +V +++ F+G+ YYR+FI+
Sbjct: 315 EKSHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYRKFIQD 374
Query: 814 FAKIAGPLTKLTR-----------KNQPFAWTEDCEQSFQDMKERLTTAPVLTLPQEEEP 862
+AK+A PLT LTR P E QSF D+K L ++ +L P +P
Sbjct: 375 YAKVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKP 434
Query: 863 YEVYCDASYQGLGCVLMQ----HRKAVAYSSRQLKIHERNYPTHDLELAAVVFALKIWRH 918
+ + DAS +G VL Q + +AY SR L E NY T + E+ A++++L R
Sbjct: 435 FHLTTDASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLDNLRA 494
Query: 919 YLYGS-TFTVFSDHKSLKYLFDQKDLNMRQRRWMEFIKDYDFTLLYHPGKANVVADALSR 977
YLYG+ T V++DH+ L + ++ N + +RW I++Y+ L+Y PGK+NVVADALSR
Sbjct: 495 YLYGAGTIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELIYKPGKSNVVADALSR 554
Query: 978 ---------------------------QTIHVSSLMIKELEL-IETFRDLSLGMQVTPGK 1009
+H SS +I +E I F++ L T K
Sbjct: 555 IPPQLNQLSTDLDANPEDDMQSLATAHSALHDSSRLIPHVESPINVFKN-QLIFDTTRSK 613
Query: 1010 L----SFGMVTITSDFLNEIKVKQLLDEELIEKRNLIILG-KAPDF------EVGTDNIL 1058
F T L + + L + R +II G K P+ + N L
Sbjct: 614 YLCEHPFPGYTRHLIPLKDGSLADLTNSLQSCLRPVIINGVKIPEAHLQRFQSICLANFL 673
Query: 1059 RCKGRVCVPL--DMELRRMILDEGHKSRLSIHPGSTKMYQDLKLNFWWPGMKKNVAEYVA 1116
K R+ L D+ I + K H G T++ L +++P M + +
Sbjct: 674 LYKIRITQRLVADVSGAEEICEIIEKEHRRAHRGPTEIRLQLLEKYYFPRMSSTIRLQTS 733
Query: 1117 ACLTCQKAKIEHQKPAGMLQSLDVPEWKWDSISMDFVVALPATRKRFDSIWVIVDRLTKS 1176
+C C+ K E LQ +P + + + +D A KR +D+ +K
Sbjct: 734 SCQCCKLYKYERHPNKPNLQPTPIPNYPCEILHIDIF----ALEKRL--YLSCIDKFSKF 787
Query: 1177 AHFIPVKTTFNVEALAKVYVAE--IVRLH--GVPSSIVSDRDPKFTSHFWKALHEALGTK 1232
A F++++ A V++ E + LH P +VSD + +L
Sbjct: 788 AKL------FHLQSKASVHLRETLVEALHYFTAPKVLVSDNERGLLCPTVLNYLRSLDID 841
Query: 1233 LRLSSAYHPQTDGQTERTIQSLEDLLRACVLDSQESWD--ELLPLIEFTYNNSFHASIGM 1290
L + + +GQ ER + ++ R C+ D ++ EL+ + YN S H+
Sbjct: 842 LYYAPTQKSEVNGQVERFHSTFLEIYR-CLKDELPTFKPVELVHIAVDRYNTSVHSVTNR 900
Query: 1291 APYEALYGRRCRTPLCWFQDGEHLLVGPELVQQTTEKVKQIQE--KMRTSQSRQKSYADT 1348
P + + R R D +QT E +K + E ++R + +R K+
Sbjct: 901 KPADVFFDRSSRVNYQGLTD---------FRRQTLEDIKGLIEYKQIRGNMARNKN---- 947
Query: 1349 RRRELEFEAGDHVFL 1363
R + GD VF+
Sbjct: 948 RDEPKSYGPGDEVFV 962
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 323 bits (827), Expect = 3e-87
Identities = 278/1051 (26%), Positives = 474/1051 (44%), Gaps = 118/1051 (11%)
Query: 388 IAGNSLIALFDSGATHSFIDIACATRLKLEVSKLPFDLTVSTPASKSLVTNTACLECPWM 447
+AG +L L D+ A ++I V +L + V++P S S + + ++ +
Sbjct: 19 LAGRTLKMLIDTDAAKNYIR---------PVKELKNVMPVASPFSVSSIHGSTEIKHKCL 69
Query: 448 YLDKKFIANLICLP-LKGLDVIIGMDWLSHHHVLLDCANKVVIFPDAGLAEFLNSYFSKL 506
K I+ L L D IIG+D L+ V L+ A + + G+AE L+ YFS
Sbjct: 70 MKVFKHISPFFLLDSLNAFDAIIGLDLLTQAGVKLNLAEDSLEYQ--GIAEKLH-YFSCP 126
Query: 507 SLRKGALSSLM--STTVMEAKENGIQGIAVVQEFEDVFPEDVPGIPPVRDMEFTIDIVPG 564
S+ ++ ++ + E K+ I+ + P + +R T+D
Sbjct: 127 SVNFTDVNDIVVPDSVKKEFKDTIIRRKKAFSTTNEALPFNTAVTATIR----TVD---- 178
Query: 565 TGPISIAPYRMAPAELTELKSQLEDLTKKGFIRPSVSPWGAPVLLVKKK------DGRSR 618
P+ Y + ++++ L K G IRPS SP+ +P +V KK + R
Sbjct: 179 NEPVYSRAYPTLMGVSDFVNNEVKQLLKDGIIRPSRSPYNSPTWVVDKKGTDAFGNPNKR 238
Query: 619 LCVDYRQLNKVTIKNCYPLPRIDDLMDQLKGAAIFSKIDLRSGYHQIRVKDEDIQKTAFR 678
L +D+R+LN+ TI + YP+P I ++ L A F+ +DL+SGYHQI + + D +KT+F
Sbjct: 239 LVIDFRKLNEKTIPDRYPMPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFS 298
Query: 679 TRYGHYEYLVMPFGVTNAPAVFMDYMNRIFHPFLDRFVVVFIDDILIYSRNLEEHEEHLR 738
G YE+ +PFG+ NA ++F ++ + + + V++DD++I+S N +H H+
Sbjct: 299 VNGGKYEFCRLPFGLRNASSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHID 358
Query: 739 QVLQVLREKVLYANAAKCEFWLEEVKFLGHVISKEGIAVDPSKVETVLAWDRPKTVTDIR 798
VL+ L + + + K F+ E V++LG ++SK+G DP KV+ + + P V +R
Sbjct: 359 TVLKCLIDANMRVSQEKTRFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVR 418
Query: 799 SFIGLAGYYRRFIEGFAKIAGPLTKLTR-----------KNQPFAWTEDCEQSFQDMKER 847
SF+GLA YYR FI+ FA IA P+T + + K P + E +FQ ++
Sbjct: 419 SFLGLASYYRVFIKDFAAIARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNI 478
Query: 848 LTTAPV-LTLPQEEEPYEVYCDASYQGLGCVLMQHRKAVAYSSRQLKIHERNYPTHDLEL 906
L + V L P ++P+++ DAS G+G VL Q + + SR LK E+NY T++ EL
Sbjct: 479 LASEDVILKYPDFKKPFDLTTDASASGIGAVLSQEGRPITMISRTLKQPEQNYATNEREL 538
Query: 907 AAVVFALKIWRHYLYGS-TFTVFSDHKSLKYLFDQKDLNMRQRRWMEFIKDYDFTLLYHP 965
A+V+AL +++LYGS +F+DH+ L + ++ N + +RW +I ++ + Y P
Sbjct: 539 LAIVWALGKLQNFLYGSREINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQHNAKVFYKP 598
Query: 966 GKANVVADALSRQTIHV--------SSLMIKELELIET----------FRD-LSLGMQVT 1006
GK N VADALSRQ ++ ++ + EL L T FR+ + L
Sbjct: 599 GKENFVADALSRQNLNALQNEPQSDAATIHSELSLTYTVETTDKPLNCFRNQIILEAARF 658
Query: 1007 PGKLSFGMVTITSDFLNEIKVKQLLDEELIEKRNLIILG----KAPDFEVGTDNIL---- 1058
P K + + S L K L + L E N ++ P +++
Sbjct: 659 PLKRNLVLFRSKSRHLISFTDKSWLLKTLKEVVNPDVVNAIHCDLPTLASFQHDLIAHFP 718
Query: 1059 -----RCKGRVCVPLDMELRRMILDEGHKSRLSIHPGSTKMYQDLKLNFWWPGMKKNVAE 1113
CK V D + I+ H H + + + + ++++P M E
Sbjct: 719 ATQFRHCKNVVLDITDKNEQIEIVTAEHN---RAHRAAQENIKQVLRDYYFPKMGSLAKE 775
Query: 1114 YVAACLTCQKAKIEHQKPAGMLQSLDVPEWKWDSISMDFVVALPATRKRFDSIWVIVDRL 1173
VA C C +AK + L +P + + + +D RK F +D+
Sbjct: 776 VVANCRVCTQAKYDRHPKKQELGETPIPSYTGEMVHIDI---FSTDRKLF---LTCIDKF 829
Query: 1174 TKSAHFIPVKTTFNVEALAKVYVAEIVRLHGVPSSIVSDRDPKFTSH-FWKALHEALGTK 1232
+K A PV + V+ A + +I+ L ++ D +P F S L + G
Sbjct: 830 SKYAIVQPVVSRTIVDITAP--LLQIINLFPNIKTVYCDNEPAFNSETVTSMLKNSFGID 887
Query: 1233 LRLSSAYHPQTDGQTERTIQSLEDLLRACVLDSQ-ESWDELLPLIEFTYNNSFHASIGMA 1291
+ + H ++GQ ER +L ++ R LD + EL+ YN + H+
Sbjct: 888 IVNAPPLHSSSNGQVERFHSTLAEIARCLKLDKKTNDTVELILRATIEYNKTVHSVTRER 947
Query: 1292 PYEALYGRRCRTPLCWFQDGEHLLVGPELVQQTTEKVKQIQEKMRTSQSRQKSYADTRRR 1351
P E ++ G H E+ +I+ ++ +Q + R+
Sbjct: 948 PIEVVH------------PGAH------------ERCLEIKARLVKAQQDSIGRNNPSRQ 983
Query: 1352 ELEFEAGDHVFLRVTPTTGVGRAIKSRKLTP 1382
FE G+ VF++ G KLTP
Sbjct: 984 NRVFEVGERVFVKNNKRLG-------NKLTP 1007
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 275 bits (703), Expect = 7e-73
Identities = 155/424 (36%), Positives = 238/424 (55%), Gaps = 10/424 (2%)
Query: 567 PISIAPYRMAPAELTELKSQLEDLTKKGFIRPSVSPWGAPVLLVKKKDGRS------RLC 620
P+ YR +++ E+++Q++ L K + PSVS + +P+LLV KK + RL
Sbjct: 314 PVYTKNYRSPHSQVEEIQAQVQKLIKDKIVEPSVSQYNSPLLLVPKKSSPNSDKKKWRLV 373
Query: 621 VDYRQLNKVTIKNCYPLPRIDDLMDQLKGAAIFSKIDLRSGYHQIRVKDEDIQKTAFRTR 680
+DYRQ+NK + + +PLPRIDD++DQL A FS +DL SG+HQI + + T+F T
Sbjct: 374 IDYRQINKKLLADKFPLPRIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTS 433
Query: 681 YGHYEYLVMPFGVTNAPAVFMDYMNRIFHPFLDRFVVVFIDDILIYSRNLEEHEEHLRQV 740
G Y + +PFG+ AP F M F +++DD+++ + + ++L +V
Sbjct: 434 NGSYRFTRLPFGLKIAPNSFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEV 493
Query: 741 LQVLREKVLYANAAKCEFWLEEVKFLGHVISKEGIAVDPSKVETVLAWDRPKTVTDIRSF 800
RE L + KC F++ EV FLGH + +GI D K + + + P R F
Sbjct: 494 FGKCREYNLKLHPEKCSFFMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRF 553
Query: 801 IGLAGYYRRFIEGFAKIAGPLTKLTRKNQPFAWTEDCEQSFQDMKERLTTAPVLTLPQEE 860
+ YYRRFI+ FA + +T+L +KN PF WT++C+++F +K +L +L P
Sbjct: 554 VAFCNYYRRFIKNFADYSRHITRLCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFS 613
Query: 861 EPYEVYCDASYQGLGCVLMQ----HRKAVAYSSRQLKIHERNYPTHDLELAAVVFALKIW 916
+ + + DAS Q G VL Q H+ VAY+SR E N T + ELAA+ +A+ +
Sbjct: 614 KEFCITTDASKQACGAVLTQNHNGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHF 673
Query: 917 RHYLYGSTFTVFSDHKSLKYLFDQKDLNMRQRRWMEFIKDYDFTLLYHPGKANVVADALS 976
R Y+YG FTV +DH+ L YLF + + + R +++Y+FT+ Y GK N VADALS
Sbjct: 674 RPYIYGKHFTVKTDHRPLTYLFSMVNPSSKLTRIRLELEEYNFTVEYLKGKDNHVADALS 733
Query: 977 RQTI 980
R TI
Sbjct: 734 RITI 737
Score = 102 bits (254), Expect = 8e-21
Identities = 80/319 (25%), Positives = 140/319 (43%), Gaps = 28/319 (8%)
Query: 1088 HPGSTKMYQDLKLNFWWPGMKKNVAEYVAACLTCQKAKIEHQKPAGMLQSLDVPEWKWDS 1147
H G TK +K +++W M K + EYV C CQKAK M + + PE +D
Sbjct: 909 HTGITKTLAKVKRHYYWKNMSKYIKEYVRKCQKCQKAKTTKHTKTPMTIT-ETPEHAFDR 967
Query: 1148 ISMDFVVALPATRKRFDSIWVIVDRLTKSAHFIPVKTTFNVEALAKVYVAEIVRLHGVPS 1207
+ +D + LP + + ++ LTK IP+ + + +AK + +G
Sbjct: 968 VVVDTIGPLPKSENGNEYAVTLICDLTKYLVAIPIANK-SAKTVAKAIFESFILKYGPMK 1026
Query: 1208 SIVSDRDPKFTSHFWKALHEALGTKLRLSSAYHPQTDGQTERTIQSLEDLLRACVLDSQE 1267
+ ++D ++ + L + L K S+A+H QT G ER+ ++L + +R+ + +
Sbjct: 1027 TFITDMGTEYKNSIITDLCKYLKIKNITSTAHHHQTVGVVERSHRTLNEYIRSYISTDKT 1086
Query: 1268 SWDELLPLIEFTYNNSFHASIGMAPYEALYGRRCRTPLCWFQDGEHLLVGPELVQQTTEK 1327
WD L + +N + PYE ++GR P + + L E + +
Sbjct: 1087 DWDVWLQYFVYCFNTTQSMVHNYCPYELVFGRTSNLPKHF-----NKLHSIEPIYNIDDY 1141
Query: 1328 VKQIQEKMRTSQSR-----------QKSYADTRRRELEFEAGDHVFLRVTPTTGVGRAIK 1376
K+ + ++ + +R K D + +++E E GD V LR VG
Sbjct: 1142 AKESKYRLEVAYARARKLLEAHKEKNKENYDLKVKDIELEVGDKVLLR----NEVG---- 1193
Query: 1377 SRKLTPKFIGPYQIIERVG 1395
KL K+ GPY+ IE +G
Sbjct: 1194 -HKLDFKYTGPYK-IESIG 1210
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 202 bits (515), Expect = 4e-51
Identities = 138/451 (30%), Positives = 224/451 (49%), Gaps = 26/451 (5%)
Query: 555 MEFTIDIVPGTGPISIAPYRMAPAELTELKSQLEDLTKKGFIRPSVSPWGAPVLLV---- 610
M+ +I + + I + P + +P + E Q+++L I+PS SP AP LV
Sbjct: 234 MKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEA 293
Query: 611 KKKDGRSRLCVDYRQLNKVTIKNCYPLPRIDDLMDQLKGAAIFSKIDLRSGYHQIRVKDE 670
+K+ G+ R+ V+Y+ +NK TI + Y LP D+L+ ++G IFS D +SG+ Q+ + E
Sbjct: 294 EKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQE 353
Query: 671 DIQKTAFRTRYGHYEYLVMPFGVTNAPAVFMDYMNRIFHPFLDRFVVVFIDDILIYSRNL 730
TAF GHYE+ V+PFG+ AP++F +M+ F F +F V++DDIL++S N
Sbjct: 354 SRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNE 412
Query: 731 EEHEEHLRQVLQVLREKVLYANAAKCEFWLEEVKFLGHVISKEGIAVDPSKVETVLAWDR 790
E+H H+ +LQ + + + K + + +++ FLG I + +E + +
Sbjct: 413 EDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKF-- 470
Query: 791 PKTVTD---IRSFIGLAGYYRRFIEGFAKIAGPLTKLTRKNQPFAWTEDCEQSFQDMKER 847
P T+ D ++ F+G+ Y +I A+I PL ++N P+ WT++ Q +K+
Sbjct: 471 PDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKN 530
Query: 848 LTTAPVLTLPQEEEPYEVYCDASYQGLGCVL--------MQHRKAVAYSSRQLKIHERNY 899
L P L P EE + DAS G +L Y+S K ERNY
Sbjct: 531 LQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNY 590
Query: 900 PTHDLELAAVVFALKIWR------HYLYGSTFTVFSDHKSLKYLFDQKDLNMRQRRWMEF 953
++D E AV+ +K + H+L + T F +L Y D K R RW +
Sbjct: 591 HSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSK--LGRNIRWQAW 648
Query: 954 IKDYDFTLLYHPGKANVVADALSRQTIHVSS 984
+ Y F + + G N AD LSR+ V+S
Sbjct: 649 LSHYSFDVEHIKGTDNHFADFLSREFNKVNS 679
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 202 bits (515), Expect = 4e-51
Identities = 138/451 (30%), Positives = 224/451 (49%), Gaps = 26/451 (5%)
Query: 555 MEFTIDIVPGTGPISIAPYRMAPAELTELKSQLEDLTKKGFIRPSVSPWGAPVLLV---- 610
M+ +I + + I + P + +P + E Q+++L I+PS SP AP LV
Sbjct: 234 MKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEA 293
Query: 611 KKKDGRSRLCVDYRQLNKVTIKNCYPLPRIDDLMDQLKGAAIFSKIDLRSGYHQIRVKDE 670
+K+ G+ R+ V+Y+ +NK TI + Y LP D+L+ ++G IFS D +SG+ Q+ + E
Sbjct: 294 EKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQE 353
Query: 671 DIQKTAFRTRYGHYEYLVMPFGVTNAPAVFMDYMNRIFHPFLDRFVVVFIDDILIYSRNL 730
TAF GHYE+ V+PFG+ AP++F +M+ F F +F V++DDIL++S N
Sbjct: 354 SRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNE 412
Query: 731 EEHEEHLRQVLQVLREKVLYANAAKCEFWLEEVKFLGHVISKEGIAVDPSKVETVLAWDR 790
E+H H+ +LQ + + + K + + +++ FLG I + +E + +
Sbjct: 413 EDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKF-- 470
Query: 791 PKTVTD---IRSFIGLAGYYRRFIEGFAKIAGPLTKLTRKNQPFAWTEDCEQSFQDMKER 847
P T+ D ++ F+G+ Y +I A+I PL ++N P+ WT++ Q +K+
Sbjct: 471 PDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKN 530
Query: 848 LTTAPVLTLPQEEEPYEVYCDASYQGLGCVL--------MQHRKAVAYSSRQLKIHERNY 899
L P L P EE + DAS G +L Y+S K ERNY
Sbjct: 531 LQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNY 590
Query: 900 PTHDLELAAVVFALKIWR------HYLYGSTFTVFSDHKSLKYLFDQKDLNMRQRRWMEF 953
++D E AV+ +K + H+L + T F +L Y D K R RW +
Sbjct: 591 HSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSK--LGRNIRWQAW 648
Query: 954 IKDYDFTLLYHPGKANVVADALSRQTIHVSS 984
+ Y F + + G N AD LSR+ V+S
Sbjct: 649 LSHYSFDVEHIKGTDNHFADFLSREFNKVNS 679
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 201 bits (511), Expect = 1e-50
Identities = 136/451 (30%), Positives = 224/451 (49%), Gaps = 26/451 (5%)
Query: 555 MEFTIDIVPGTGPISIAPYRMAPAELTELKSQLEDLTKKGFIRPSVSPWGAPVLLV---- 610
M+ +I + + I + P + +P + E Q+++L I+PS SP AP LV
Sbjct: 234 MKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEA 293
Query: 611 KKKDGRSRLCVDYRQLNKVTIKNCYPLPRIDDLMDQLKGAAIFSKIDLRSGYHQIRVKDE 670
+K+ G+ R+ V+Y+ +NK T+ + Y LP D+L+ ++G IFS D +SG+ Q+ + E
Sbjct: 294 EKRRGKKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQE 353
Query: 671 DIQKTAFRTRYGHYEYLVMPFGVTNAPAVFMDYMNRIFHPFLDRFVVVFIDDILIYSRNL 730
TAF GHYE+ V+PFG+ AP++F +M+ F F +F V++DDIL++S N
Sbjct: 354 SRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNE 412
Query: 731 EEHEEHLRQVLQVLREKVLYANAAKCEFWLEEVKFLGHVISKEGIAVDPSKVETVLAWDR 790
E+H H+ +LQ + + + K + + +++ FLG I + +E + +
Sbjct: 413 EDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKF-- 470
Query: 791 PKTVTD---IRSFIGLAGYYRRFIEGFAKIAGPLTKLTRKNQPFAWTEDCEQSFQDMKER 847
P T+ D ++ F+G+ Y +I A+I PL ++N P+ WT++ Q +K+
Sbjct: 471 PDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWRWTKEDTLYMQKVKKN 530
Query: 848 LTTAPVLTLPQEEEPYEVYCDASYQGLGCVL--------MQHRKAVAYSSRQLKIHERNY 899
L P L P EE + DAS G +L Y+S K E+NY
Sbjct: 531 LQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKNY 590
Query: 900 PTHDLELAAVVFALKIWR------HYLYGSTFTVFSDHKSLKYLFDQKDLNMRQRRWMEF 953
++D E AV+ +K + H+L + T F +L Y D K R RW +
Sbjct: 591 HSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSK--LGRNIRWQAW 648
Query: 954 IKDYDFTLLYHPGKANVVADALSRQTIHVSS 984
+ Y F + + G N AD LSR+ V+S
Sbjct: 649 LSHYSFDVEHIKGTDNHFADFLSREFNKVNS 679
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 674
Score = 199 bits (505), Expect = 6e-50
Identities = 135/451 (29%), Positives = 223/451 (48%), Gaps = 26/451 (5%)
Query: 555 MEFTIDIVPGTGPISIAPYRMAPAELTELKSQLEDLTKKGFIRPSVSPWGAPVLLV---- 610
M+ +I + + I + P + +P + E Q+++L I+PS SP AP LV
Sbjct: 229 MKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEA 288
Query: 611 KKKDGRSRLCVDYRQLNKVTIKNCYPLPRIDDLMDQLKGAAIFSKIDLRSGYHQIRVKDE 670
+K+ G+ R+ V+Y+ +NK T+ + Y P D+L+ ++G IFS D +SG+ Q+ + E
Sbjct: 289 EKRRGKKRMVVNYKAMNKATVGDAYNPPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQE 348
Query: 671 DIQKTAFRTRYGHYEYLVMPFGVTNAPAVFMDYMNRIFHPFLDRFVVVFIDDILIYSRNL 730
TAF GHYE+ V+PFG+ AP++F +M+ F F +F V++DDIL++S N
Sbjct: 349 SRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNE 407
Query: 731 EEHEEHLRQVLQVLREKVLYANAAKCEFWLEEVKFLGHVISKEGIAVDPSKVETVLAWDR 790
E+H H+ +LQ + + + K + + +++ FLG I + +E + +
Sbjct: 408 EDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKF-- 465
Query: 791 PKTVTD---IRSFIGLAGYYRRFIEGFAKIAGPLTKLTRKNQPFAWTEDCEQSFQDMKER 847
P T+ D ++ F+G+ Y +I A+I PL ++N P+ WT++ Q +K+
Sbjct: 466 PDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKN 525
Query: 848 LTTAPVLTLPQEEEPYEVYCDASYQGLGCVL--------MQHRKAVAYSSRQLKIHERNY 899
L P L P EE + DAS G +L Y+S K E+NY
Sbjct: 526 LQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKNY 585
Query: 900 PTHDLELAAVVFALKIWR------HYLYGSTFTVFSDHKSLKYLFDQKDLNMRQRRWMEF 953
++D E AV+ +K + H+L + T F +L Y D K R RW +
Sbjct: 586 HSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSK--LGRNIRWQAW 643
Query: 954 IKDYDFTLLYHPGKANVVADALSRQTIHVSS 984
+ Y F + + G N AD LSR+ V+S
Sbjct: 644 LSHYSFDVEHIKGTDNHFADFLSREFNRVNS 674
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 680
Score = 196 bits (499), Expect = 3e-49
Identities = 134/451 (29%), Positives = 220/451 (48%), Gaps = 26/451 (5%)
Query: 555 MEFTIDIVPGTGPISIAPYRMAPAELTELKSQLEDLTKKGFIRPSVSPWGAPVLLVKKKD 614
M+ +I + + I + P + +P + E Q+++L I+PS SP AP LV +
Sbjct: 235 MKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEA 294
Query: 615 ----GRSRLCVDYRQLNKVTIKNCYPLPRIDDLMDQLKGAAIFSKIDLRSGYHQIRVKDE 670
G R+ V+Y+ +NK T+ + Y LP D+L+ ++G IFS D +SG+ Q+ + E
Sbjct: 295 ENGRGNKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQE 354
Query: 671 DIQKTAFRTRYGHYEYLVMPFGVTNAPAVFMDYMNRIFHPFLDRFVVVFIDDILIYSRNL 730
TAF GHYE+ V+PFG+ AP++F +M+ F F +F V++DDI+++S N
Sbjct: 355 SRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDIVVFSNNE 413
Query: 731 EEHEEHLRQVLQVLREKVLYANAAKCEFWLEEVKFLGHVISKEGIAVDPSKVETVLAWDR 790
E+H H+ +LQ + + + K + + +++ FLG I + +E + +
Sbjct: 414 EDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKF-- 471
Query: 791 PKTVTD---IRSFIGLAGYYRRFIEGFAKIAGPLTKLTRKNQPFAWTEDCEQSFQDMKER 847
P T+ D ++ F+G+ Y +I A++ PL ++N P+ WT++ Q +K+
Sbjct: 472 PDTLEDKKQLQRFLGILTYASDYIPNLAQMRQPLQAKLKENVPWKWTKEDTLYMQKVKKN 531
Query: 848 LTTAPVLTLPQEEEPYEVYCDASYQGLGCVL--------MQHRKAVAYSSRQLKIHERNY 899
L P L P EE + DAS G +L Y S K ERNY
Sbjct: 532 LQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYRSGSFKAAERNY 591
Query: 900 PTHDLELAAVVFALKIWR------HYLYGSTFTVFSDHKSLKYLFDQKDLNMRQRRWMEF 953
++D E AV+ +K + H+L + T F +L Y D K R RW +
Sbjct: 592 HSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSK--LGRNIRWQAW 649
Query: 954 IKDYDFTLLYHPGKANVVADALSRQTIHVSS 984
+ Y F + + G N AD LSR+ V+S
Sbjct: 650 LSHYSFDVEHIKGTDNHFADFLSREFNKVNS 680
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 666
Score = 194 bits (492), Expect = 2e-48
Identities = 129/448 (28%), Positives = 226/448 (49%), Gaps = 22/448 (4%)
Query: 549 IPPVRD---MEFTIDIVPGTGPISIAPYRMAPAELTELKSQLEDLTKKGFIRPSVSPWGA 605
I P++ M+ +I ++ I + P +P + Q+++L G I PS S +
Sbjct: 218 IDPIKSKQWMKASIKLIDPLKVIRVKPMSYSPQDREGFAKQIKELLDLGLIIPSKSQHMS 277
Query: 606 PVLLVK----KKDGRSRLCVDYRQLNKVTIKNCYPLPRIDDLMDQLKGAAIFSKIDLRSG 661
P LV+ ++ G+ R+ V+Y+ +N+ TI + + LP + +L+ L+G +IFS D +SG
Sbjct: 278 PAFLVENEAERRRGKKRMVVNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKSG 337
Query: 662 YHQIRVKDEDIQKTAFRTRYGHYEYLVMPFGVTNAPAVFMDYMNRIFHPFLDRFVVVFID 721
+ Q+ + +E + TAF GH+++ V+PFG+ AP++F +M + D+F +V++D
Sbjct: 338 FWQVVLDEESQKLTAFTCPQGHFQWKVVPFGLKQAPSIFQRHMQTALNG-ADKFCMVYVD 396
Query: 722 DILIYSRNLEEHEEHLRQVLQVLREKVLYANAAKCEFWLEEVKFLGHVISKEGIAVDPSK 781
DI+++S + +H H+ VL+++ + + + K + E++ FLG I K
Sbjct: 397 DIIVFSNSELDHYNHVYAVLKIVEKYGIILSKKKANLFKEKINFLGLEIDKGTHCPQNHI 456
Query: 782 VETVLAW-DRPKTVTDIRSFIGLAGYYRRFIEGFAKIAGPLTKLTRKNQPFAWTEDCEQS 840
+E + + DR + ++ F+G+ Y +I A+I PL +K+ + WT+
Sbjct: 457 LENIHKFPDRLEDKKHLQRFLGVLTYAETYIPKLAEIRKPLQVKLKKDVTWNWTQSDSDY 516
Query: 841 FQDMKERLTTAPVLTLPQEEEPYEVYCDASYQGLGCVLMQH-----RKAVAYSSRQLKIH 895
+ +K+ L + P L LP+ E+ + DAS G VL YSS K
Sbjct: 517 VKKIKKNLGSFPKLYLPKPEDHLIIETDASDSFWGGVLKARALDGVELICRYSSGSFKQA 576
Query: 896 ERNYPTHDLELAAVVFALKIWRHYLYGSTFTVFSDHKSLKYLF------DQKDLNMRQRR 949
E+NY ++D EL AV + + YL FTV +D+K+ Y D K R R
Sbjct: 577 EKNYHSNDKELLAVKQVITKFSAYLTPVRFTVRTDNKNFTYFLRINLKGDSK--QGRLVR 634
Query: 950 WMEFIKDYDFTLLYHPGKANVVADALSR 977
W + Y F + + G NV+AD L+R
Sbjct: 635 WQNWFSKYQFDVEHLEGVKNVLADCLTR 662
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 659
Score = 185 bits (469), Expect = 9e-46
Identities = 140/599 (23%), Positives = 274/599 (45%), Gaps = 45/599 (7%)
Query: 425 LTVSTPASKSLVTNTACLECPWMYLDKKFIANLICLPLKGLDVIIGMDWLSHHHVLLDCA 484
L + K + C + P ++F+ + G+D+++G ++ + +
Sbjct: 59 LNIKIANGKIIQLTKVCSKLPIRLGGERFLIPTLFQQESGIDLLLGNNFCQLYSPFIQYT 118
Query: 485 NKVVIFPDA--------------GLAEFLNSYFSKLSLRKGALSSLMSTTVMEAKENG-- 528
+++ + G+ FL S K + + ++ S + +E G
Sbjct: 119 DRIYFHLNKQSVIIGKITKAYQYGVKGFLESMKKKSKVNRPEPINITSNQHLFLEEGGNH 178
Query: 529 ---------IQGIAVVQEFEDVFPEDVPGIPPVRDMEF---TIDIVPGTGPISIAPYRMA 576
I + ++E + + P I P + ++ TI+++ + + P +
Sbjct: 179 VDEMLYEIQISKFSAIEEMLERVSSENP-IDPEKSKQWMTATIELIDPKTVVKVKPMSYS 237
Query: 577 PAELTELKSQLEDLTKKGFIRPSVSPWGAPVLLVK----KKDGRSRLCVDYRQLNKVTIK 632
P++ E Q+++L + I+PS S +P LV+ ++ G+ R+ V+Y+ +NK T
Sbjct: 238 PSDREEFDRQIKELLELKVIKPSKSTHMSPAFLVENEAERRRGKKRMVVNYKAMNKATKG 297
Query: 633 NCYPLPRIDDLMDQLKGAAIFSKIDLRSGYHQIRVKDEDIQKTAFRTRYGHYEYLVMPFG 692
+ + LP D+L+ ++G I+S D +SG Q+ + E TAF GHY++ V+PFG
Sbjct: 298 DAHNLPNKDELLTLVRGKKIYSSFDCKSGLWQVLLDKESQLLTAFTCPQGHYQWNVVPFG 357
Query: 693 VTNAPAVF-MDYMNRIFHPFLDRFVVVFIDDILIYSR-NLEEHEEHLRQVLQVLREKVLY 750
+ AP++F Y N + + ++ V++DDIL++S +EH H+ +L+ + +
Sbjct: 358 LKQAPSIFPKTYANSHSNQY-SKYCCVYVDDILVFSNTGRKEHYIHVLNILRRCEKLGII 416
Query: 751 ANAAKCEFWLEEVKFLGHVISKEGIAVDPSKVETVLAW-DRPKTVTDIRSFIGLAGYYRR 809
+ K + + E++ FLG I + +E + + DR + ++ F+G+ Y
Sbjct: 417 LSKKKAQLFKEKINFLGLEIDQGTHCPQNHILEHIHKFPDRIEDKKQLQRFLGILTYASD 476
Query: 810 FIEGFAKIAGPLTKLTRKNQPFAWTEDCEQSFQDMKERLTTAPVLTLPQEEEPYEVYCDA 869
+I A I PL +++ + W + Q +K+ L + P L P+ + + DA
Sbjct: 477 YIPKLASIRKPLQSKLKEDSTWTWNDTDSQYMAKIKKNLKSFPKLYHPEPNDKLVIETDA 536
Query: 870 SYQGLGCVLM----QHRKAVAYSSRQLKIHERNYPTHDLELAAVVFALKIWRHYLYGSTF 925
S + G +L H Y+S K ERNY +++ EL AV+ +K + YL S F
Sbjct: 537 SEEFWGGILKAIHNSHEYICRYASGSFKAAERNYHSNEKELLAVIRVIKKFSIYLTPSRF 596
Query: 926 TVFSDHKSLKYLFDQKDLNMRQR----RWMEFIKDYDFTLLYHPGKANVVADALSRQTI 980
+ +D+K+ + + R++ RW ++ YDF + + G NV AD L T+
Sbjct: 597 LIRTDNKNFTHFVNINLKGDRKQGRLVRWQMWLSQYDFDVEHIAGTKNVFADFLQENTL 655
>POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;
Protease (EC 3.4.23.-); Reverse transcriptase (EC
2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1886
Score = 168 bits (426), Expect = 9e-41
Identities = 121/432 (28%), Positives = 212/432 (49%), Gaps = 33/432 (7%)
Query: 575 MAPAELTELKSQLEDLTKKGFIRPSVSPWGAPVLLV-----------KKKDGRSRLCVDY 623
+ P + + Q+ L + IRPS S + +V K+K G+ R+ +Y
Sbjct: 1410 VTPGDEEAMTRQINLLLQMKVIRPSESKHRSTAFIVRSGTEIDPITGKEKKGKERMVFNY 1469
Query: 624 RQLNKVTIKNCYPLPRIDDLMDQLKGAAIFSKIDLRSGYHQIRVKDEDIQKTAFRTRYGH 683
+ LN+ T + Y LP I+ ++ ++ + I+SK DL+SG+ Q+ +++E + TAF
Sbjct: 1470 KLLNENTESDQYSLPGINTIISKVGRSKIYSKFDLKSGFWQVAMEEESVPWTAFLAGNKL 1529
Query: 684 YEYLVMPFGVTNAPAVFMDYMNRIFHPFLDRFVVVFIDDILIYSRNLEEHEEHLRQVLQV 743
YE+LVMPFG+ NAPA+F M+ +F ++F+ V+IDDIL++S E+H +HL +LQ+
Sbjct: 1530 YEWLVMPFGLKNAPAIFQRKMDNVFKG-TEKFIAVYIDDILVFSETAEQHSQHLYTMLQL 1588
Query: 744 LREKVLYANAAKCEFWLEEVKFLGHVISKEGIAVDPSKVETVLAWDRPKTVT--DIRSFI 801
+E L + K + E+ FLG + I + P + + + K T +RS++
Sbjct: 1589 CKENGLILSPTKMKIGTPEIDFLGASLGCTKIKLQPHIISKICDFSDEKLATPEGMRSWL 1648
Query: 802 GLAGYYRRFIEGFAKIAGPLTKLTRKNQPFAWTEDCEQSFQDMKERLTTAPVLTLPQEEE 861
G+ Y R +I+ K+ PL + + + + +KE++ P L LP ++
Sbjct: 1649 GILSYARNYIQDIGKLVQPLRQKMAPTGDKRMNPETWKMVRQIKEKVKNLPDLQLPPKDS 1708
Query: 862 PYEVYCDASYQGLGCV----LMQH-----RKAVAYSSRQLKIHERNYPTHDLELAAVVFA 912
+ D G G V + +H + AY+S + T D E+ A +
Sbjct: 1709 FIIIETDGCMTGWGAVCKWKMSKHDPRSTERICAYASGSFNPIK---STIDAEIQAAIHG 1765
Query: 913 L-KIWRHYLYGSTFTVFSDHKSLKYLFDQKDLNMRQR-RWM---EFIKDYDFTLLYH--P 965
L K +YL + SD +++ +++ + N R RW+ +F+ T+ +
Sbjct: 1766 LDKFKIYYLDKKELIIRSDCEAIIKFYNKTNENKPSRVRWLTFSDFLTGLGITVTFEHID 1825
Query: 966 GKANVVADALSR 977
GK N +ADALSR
Sbjct: 1826 GKHNGLADALSR 1837
>POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC
3.4.23.-); Reverse transcriptase/ribonuclease H (EC
2.7.7.49) (EC 3.1.26.4) (RT); Integrase (IN)]
Length = 1165
Score = 159 bits (401), Expect = 7e-38
Identities = 114/400 (28%), Positives = 200/400 (49%), Gaps = 23/400 (5%)
Query: 549 IPPVRDMEFTIDIVPGTGPISIAPYRMAPAELTELKSQLEDLTKKGFIRPSVSPWGAPVL 608
+PPV +++ G P+++ Y M+ ++ ++ G + P SPW P+L
Sbjct: 162 VPPV-----VVELRSGASPVAVRQYPMSKEAREGIRPHIQKFLDLGVLVPCRSPWNTPLL 216
Query: 609 LVKKKDGRS-RLCVDYRQLNKVTIKNCYP-LPRIDDLMDQLKGAAI-FSKIDLRSGYHQI 665
VKK R D R++NK +++ +P +P +L+ L + +S +DL+ + +
Sbjct: 217 PVKKPGTNDYRPVQDLREINK-RVQDIHPTVPNPYNLLSSLPPSYTWYSVLDLKDAFFCL 275
Query: 666 RVKDEDIQKTAFRTR------YGHYEYLVMPFGVTNAPAVFMDYMNRIFHPF--LDRFVV 717
R+ AF + G + +P G N+P +F + ++R PF L+ VV
Sbjct: 276 RLHPNSQPLFAFEWKDPEKGNTGQLTWTRLPQGFKNSPTLFDEALHRDLAPFRALNPQVV 335
Query: 718 V--FIDDILIYSRNLEEHEEHLRQVLQVLREKVLYANAAKCEFWLEEVKFLGHVISKEGI 775
+ ++DD+L+ + E+ ++ +++LQ L + +A K + EV +LG+++ +
Sbjct: 336 LLQYVDDLLVAAPTYEDCKKGTQKLLQELSKLGYRVSAKKAQLCQREVTYLGYLLKEGKR 395
Query: 776 AVDPSKVETVLAWDRPKTVTDIRSFIGLAGYYRRFIEGFAKIAGPLTKLTRKNQPFAWTE 835
+ P++ TV+ P T +R F+G AG+ R +I GFA +A PL LT+++ PF WTE
Sbjct: 396 WLTPARKATVMKIPVPTTPRQVREFLGTAGFCRLWIPGFASLAAPLYPLTKESIPFIWTE 455
Query: 836 DCEQSFQDMKERLTTAPVLTLPQEEEPYEVYCDASYQGLGCVLMQ----HRKAVAYSSRQ 891
+ +Q+F +K+ L +AP L LP +P+ +Y D VL Q R+ VAY S++
Sbjct: 456 EHQQAFDHIKKALLSAPALALPDLTKPFTLYIDERAGVARGVLTQTLGPWRRPVAYLSKK 515
Query: 892 LKIHERNYPTHDLELAAVVFALKIWRHYLYGSTFTVFSDH 931
L +PT +AAV LK G TV + H
Sbjct: 516 LDPVASGWPTCLKAVAAVALLLKDADKLTLGQNVTVIASH 555
Score = 76.6 bits (187), Expect = 5e-13
Identities = 81/311 (26%), Positives = 127/311 (40%), Gaps = 28/311 (9%)
Query: 1088 HPGSTKMYQDL-KLNFWWPGMKKNVAEYVAACLTCQKAK-IEHQKPAGMLQSLDVPEWKW 1145
H G K+ Q + + + P ++ V E + C C + + G Q D P W
Sbjct: 821 HLGPEKLLQLVNRTSLLIPNLQSAVREVTSQCQACAMTNAVTTYRETGKRQRGDRPGVYW 880
Query: 1146 DSISMDFVVALPATRKRFDSIWVIVDRLTKSAHFIPVKTTFNVEALAKVYVAEIVRLHGV 1205
+ +DF P R + V +D + P KT + K+ + EI+ G+
Sbjct: 881 E---VDFTEIKPG-RYGNKYLLVFIDTFSGWVEAFPTKTETALIVCKKI-LEEILPRFGI 935
Query: 1206 PSSIVSDRDPKFTSHFWKALHEALGTKLRLSSAYHPQTDGQTERTIQSLEDLLRACVLDS 1265
P + SD P F + + L LG +L AY PQ+ GQ ER +++++ L L++
Sbjct: 936 PKVLGSDNGPAFVAQVSQGLATQLGINWKLHCAYRPQSSGQVERMNRTIKETLTKLALET 995
Query: 1266 -QESWDELLPLIEFTYNNSFHASIGMAPYEALYGRRCRTPLCWFQDGEHLLVGPE----L 1320
+ W LLPL N+ G+ PYE LYG P + GE L GP+
Sbjct: 996 GGKDWVTLLPLALLRARNT-PGRFGLTPYEILYG----GPPPILESGETL--GPDDRFLP 1048
Query: 1321 VQQTTEKVKQIQEKMRTSQSRQKSYADTRRRELEFEAGDHVFLRVTPTTGVGRAIKSRKL 1380
V T K +I Q ++ T F+ GD V + R + L
Sbjct: 1049 VLFTHLKALEIVRTQIWDQIKEVYKPGTVTIPHPFQVGDQVLV---------RRHRPSSL 1099
Query: 1381 TPKFIGPYQII 1391
P++ GPY ++
Sbjct: 1100 EPRWKGPYLVL 1110
>POL_BAEVM (P10272) Pol polyprotein [Contains: Protease (EC
3.4.23.-); Reverse transcriptase/ribonuclease H (EC
2.7.7.49) (EC 3.1.26.4) (RT); Integrase (IN)]
Length = 1189
Score = 156 bits (394), Expect = 5e-37
Identities = 143/574 (24%), Positives = 256/574 (43%), Gaps = 54/574 (9%)
Query: 383 RSTCEIAGNSLIALFDSGATHSFIDIACATRLKLEVSKLPFDLTVSTPASKSLVTNTACL 442
R T + G+ L D+GA HS + T+ +S + +T TN +
Sbjct: 12 RLTLSVGGHPTTFLVDTGAQHSVL-----TKANGPLSSRTSWVQGATGRKMHKWTNRRTV 66
Query: 443 ECPWMYLDKKFIANLIC-LPLKGLDVIIGMDWLSHHHVLLDCANKVVIFPDAGLAEFLNS 501
+ F+ C PL G D++ + H F +AG A+ L+
Sbjct: 67 NLGQGMVTHSFLVVPECPYPLLGRDLLTKLGAQIH-------------FSEAG-AQVLD- 111
Query: 502 YFSKLSLRKGALSSLMSTTVMEAKENGIQGIAVVQEFEDVFPEDVP-------GIPPVR- 553
R G +++ ++ + E+ + I V DV+ +D P G+ +
Sbjct: 112 -------RDGQPIQILTVSLQD--EHRLFDIPVTTSLPDVWLQDFPQAWAETGGLGRAKC 162
Query: 554 DMEFTIDIVPGTGPISIAPYRMAPAELTELKSQLEDLTKKGFIRPSVSPWGAPVLLVKKK 613
ID+ P P+SI Y M+ ++ + + G +RP SPW P+L VKK
Sbjct: 163 QAPIIIDLKPTAVPVSIKQYPMSLEAHMGIRQHIIKFLELGVLRPCRSPWNTPLLPVKKP 222
Query: 614 DGRS-RLCVDYRQLNKVTIKNCYPLPRIDDLMDQLK-GAAIFSKIDLRSGYHQIRVKDED 671
+ R D R++NK T+ +P +L+ LK + ++ +DL+ + + + +
Sbjct: 223 GTQDYRPVQDLREINKRTVDIHPTVPNPYNLLSTLKPDYSWYTVLDLKDAFFCLPLAPQS 282
Query: 672 IQKTAFRTR------YGHYEYLVMPFGVTNAPAVFMDYMNRIFHPFLDRFVVV----FID 721
+ AF + G + +P G N+P +F + ++R F + V ++D
Sbjct: 283 QELFAFEWKDPERGISGQLTWTRLPQGFKNSPTLFDEALHRDLTDFRTQHPEVTLLQYVD 342
Query: 722 DILIYSRNLEEHEEHLRQVLQVLREKVLYANAAKCEFWLEEVKFLGHVISKEGIAVDPSK 781
D+L+ + + + R +LQ L EK A+A K + +V +LG+++S+ + P +
Sbjct: 343 DLLLAAPTKKACTQGTRHLLQELGEKGYRASAKKAQICQTKVTYLGYILSEGKRWLTPGR 402
Query: 782 VETVLAWDRPKTVTDIRSFIGLAGYYRRFIEGFAKIAGPLTKLTRKNQPFAWTEDCEQSF 841
+ETV P+ ++R F+G AG+ R +I GFA++A PL LT+++ PF W + + +F
Sbjct: 403 IETVARIPPPRNPREVREFLGTAGFCRLWIPGFAELAAPLYALTKESTPFTWQTEHQLAF 462
Query: 842 QDMKERLTTAPVLTLPQEEEPYEVYCDASYQGLGCVLMQH----RKAVAYSSRQLKIHER 897
+ +K+ L +AP L LP +P+ ++ D VL Q ++ VAY S++L
Sbjct: 463 EALKKALLSAPALGLPDTSKPFTLFLDERQGIAKGVLTQKLGPWKRPVAYLSKKLDPVAA 522
Query: 898 NYPTHDLELAAVVFALKIWRHYLYGSTFTVFSDH 931
+P +AA +K G TV + H
Sbjct: 523 GWPPCLRIMAATAMLVKDSAKLTLGQPLTVITPH 556
Score = 70.9 bits (172), Expect = 3e-11
Identities = 73/297 (24%), Positives = 125/297 (41%), Gaps = 24/297 (8%)
Query: 1099 KLNFWWPGMKKNVAEYVAACLTCQKAKIEHQK-PAGMLQSLDVPEWKWDSISMDFVVALP 1157
K +F P + + +AC CQ+ + PAG + P W+ +DF P
Sbjct: 861 KTDFLIPRASTLIEQVTSACKVCQQVNAGATRVPAGKRTRGNRPGVYWE---IDFTEVKP 917
Query: 1158 ATRKRFDSIWVIVDRLTKSAHFIPVKTTFNVEALAKVYVAEIVRLHGVPSSIVSDRDPKF 1217
+ + V VD + P + +AK + EI G+P I SD P F
Sbjct: 918 HYAG-YKYLLVFVDTFSGWVEAFPTRQE-TAHIVAKKILEEIFPRFGLPKVIGSDNGPAF 975
Query: 1218 TSHFWKALHEALGTKLRLSSAYHPQTDGQTERTIQSLEDLLRACVLDS-QESWDELLPLI 1276
S + L LG +L AY PQ+ GQ ER +++++ L L++ + W LL L
Sbjct: 976 VSQVSQGLARILGINWKLHCAYRPQSSGQVERMNRTIKETLTKLTLETGLKDWRRLLSLA 1035
Query: 1277 EFTYNNSFHASIGMAPYEALYGRRCRTPLCWFQDGEHLLVGPELVQQTTEKVKQIQEKM- 1335
N+ + G+ PYE LYG PL + +Q + ++ +Q ++
Sbjct: 1036 LLRARNTPN-RFGLTPYEILYGG--PPPLSTLLNSFSPSNSKTDLQARLKGLQAVQAQIW 1092
Query: 1336 -RTSQSRQKSYADTRRRELEFEAGDHVFLRVTPTTGVGRAIKSRKLTPKFIGPYQII 1391
++ + ++ T F+ GD V++ R +S+ L P++ GPY ++
Sbjct: 1093 APLAELYRPGHSQTSH---PFQVGDSVYV---------RRHRSQGLEPRWKGPYIVL 1137
>POL_FENV1 (P31792) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)] (Fragment)
Length = 1046
Score = 152 bits (385), Expect = 5e-36
Identities = 108/389 (27%), Positives = 188/389 (47%), Gaps = 16/389 (4%)
Query: 559 IDIVPGTGPISIAPYRMAPAELTELKSQLEDLTKKGFIRPSVSPWGAPVLLVKKKDGRS- 617
ID+ P P+SI Y M+ ++ + + G +RP SPW P+L VKK R
Sbjct: 25 IDLKPTAMPVSIRQYPMSKEAHMGIQPHITRFLELGVLRPCRSPWNTPLLPVKKPGTRDY 84
Query: 618 RLCVDYRQLNKVTIKNCYPLPRIDDLMDQLK-GAAIFSKIDLRSGYHQIRVKDEDIQKTA 676
R D R++NK T+ +P +L+ L ++ +DL+ + + + + + A
Sbjct: 85 RPVQDLREVNKRTMDIHPTVPNPYNLLSTLSPDRTWYTVLDLKDAFFCLPLAPQSQELFA 144
Query: 677 FRTR------YGHYEYLVMPFGVTNAPAVFMDYMNRIFHPFLDRFVVV----FIDDILIY 726
F R G + +P G N+P +F + ++R F + V ++DD+L+
Sbjct: 145 FEWRDPERGISGQLTWTRLPQGFKNSPTLFDEALHRDLTDFRTQHPEVTLLQYVDDLLLA 204
Query: 727 SRNLEEHEEHLRQVLQVLREKVLYANAAKCEFWLEEVKFLGHVISKEGIAVDPSKVETVL 786
+ E + +L+ L +K A+A K + +V +LG+++S+ + P ++ETV
Sbjct: 205 APTKEACIRGTKHLLRELGDKGYRASAKKAQICQTKVTYLGYILSEGKRWLTPGRIETVA 264
Query: 787 AWDRPKTVTDIRSFIGLAGYYRRFIEGFAKIAGPLTKLTRKNQPFAWTEDCEQSFQDMKE 846
P+ ++R F+G AG+ R +I GFA++A PL LT+++ PF W E + +F+ +KE
Sbjct: 265 HIPPPQNPREVREFLGTAGFCRLWIPGFAELAAPLYALTKESAPFTWQEKHQSAFEALKE 324
Query: 847 RLTTAPVLTLPQEEEPYEVYCDASYQGLGCVLMQH----RKAVAYSSRQLKIHERNYPTH 902
L +AP L LP +P+ ++ D VL Q ++ VAY S++L +P
Sbjct: 325 ALLSAPALGLPDTSKPFTLFIDEKQGIAKGVLTQKLGPWKRPVAYLSKKLDPVAAGWPPC 384
Query: 903 DLELAAVVFALKIWRHYLYGSTFTVFSDH 931
+AA +K G TV + H
Sbjct: 385 LRIMAATAMLVKDSAKLTLGQPLTVITPH 413
Score = 68.2 bits (165), Expect = 2e-10
Identities = 73/297 (24%), Positives = 124/297 (41%), Gaps = 24/297 (8%)
Query: 1099 KLNFWWPGMKKNVAEYVAACLTCQKAKIEHQK-PAGMLQSLDVPEWKWDSISMDFVVALP 1157
K +F P + + +AC CQ+ + P G + P W+ +DF P
Sbjct: 718 KTDFLIPKAGTLIEQVTSACKVCQQVNAGATRVPEGKRTRGNRPGVYWE---IDFTEVKP 774
Query: 1158 ATRKRFDSIWVIVDRLTKSAHFIPVKTTFNVEALAKVYVAEIVRLHGVPSSIVSDRDPKF 1217
+ + V VD + P + +AK + EI G+P I SD P F
Sbjct: 775 HYAG-YKYLLVFVDTFSGWVEAYPTRQE-TAHMVAKKILEEIFPRFGLPKVIGSDNGPAF 832
Query: 1218 TSHFWKALHEALGTKLRLSSAYHPQTDGQTERTIQSLEDLLRACVLDS-QESWDELLPLI 1276
S + L LG +L AY PQ+ GQ ER +++++ L L++ + W LL L
Sbjct: 833 VSQVSQGLARTLGINWKLHCAYRPQSSGQVERMNRTIKETLTKLTLETGLKDWRRLLSLA 892
Query: 1277 EFTYNNSFHASIGMAPYEALYGRRCRTPLCWFQDGEHLLVGPELVQQTTEKVKQIQEKMR 1336
N+ + G+ PYE LYG PL + +Q + ++ +Q ++
Sbjct: 893 LLRARNTPN-RFGLTPYEILYGG--PPPLSTLLNSFSPSDPKTDLQARLKGLQAVQAQIW 949
Query: 1337 T--SQSRQKSYADTRRRELEFEAGDHVFLRVTPTTGVGRAIKSRKLTPKFIGPYQII 1391
T ++ + + T F+ GD V++R +S+ L P++ GPY ++
Sbjct: 950 TPLAELYRPGHPQT---SYPFQVGDSVYVRWH---------RSQGLEPRWKGPYIVL 994
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.321 0.137 0.412
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 175,800,990
Number of Sequences: 164201
Number of extensions: 7712611
Number of successful extensions: 21376
Number of sequences better than 10.0: 254
Number of HSP's better than 10.0 without gapping: 157
Number of HSP's successfully gapped in prelim test: 97
Number of HSP's that attempted gapping in prelim test: 20488
Number of HSP's gapped (non-prelim): 611
length of query: 1496
length of database: 59,974,054
effective HSP length: 123
effective length of query: 1373
effective length of database: 39,777,331
effective search space: 54614275463
effective search space used: 54614275463
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 73 (32.7 bits)
Lotus: description of TM0102a.7