
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC137666.4 + phase: 0
(1451 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 479 e-134
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 476 e-133
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 476 e-133
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 347 2e-94
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 325 6e-88
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 315 4e-85
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 291 7e-78
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 276 3e-73
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 265 9e-70
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 211 1e-53
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 211 1e-53
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 207 1e-52
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 206 3e-52
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 204 1e-51
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot... 201 1e-50
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot... 185 7e-46
POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;... 174 2e-42
POL_SOCMV (P15629) Enzymatic polyprotein [Contains: Aspartic pro... 160 2e-38
POL_BAEVM (P10272) Pol polyprotein [Contains: Protease (EC 3.4.2... 152 5e-36
POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat pr... 150 3e-35
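
The hit summary above has one line per database entry: entry name, accession in parentheses, a truncated description, the bit score, and the E value. This BLAST version prints very small E values without a leading mantissa (for example `e-134`, meaning 1e-134). The following is a minimal sketch of how such lines could be read into records. The names `HIT_LINE` and `parse_hit_line` are hypothetical helpers introduced here for illustration and are not part of BLAST. The sketch assumes whitespace-collapsed lines exactly as they appear in this report.

```python
import re

# Hypothetical pattern for one summary line, e.g.
# "RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 479 e-134"
# Assumption: the last two whitespace-separated fields are bit score and E value.
HIT_LINE = re.compile(
    r"^(?P<entry>\S+)\s+\((?P<accession>\w+)\)\s+(?P<description>.*?)\s+"
    r"(?P<bits>\d+(?:\.\d+)?)\s+"
    r"(?P<evalue>\d+(?:\.\d+)?(?:e[-+]?\d+)?|e[-+]?\d+)$"
)


def parse_hit_line(line):
    """Return a dict for one summary line, or None if the line does not match."""
    match = HIT_LINE.match(line.strip())
    if match is None:
        return None
    raw_evalue = match.group("evalue")
    # "e-134" has no mantissa; read it as 1e-134.
    if raw_evalue.startswith("e"):
        evalue = float("1" + raw_evalue)
    else:
        evalue = float(raw_evalue)
    return {
        "entry": match.group("entry"),
        "accession": match.group("accession"),
        "description": match.group("description"),
        "bits": float(match.group("bits")),
        "evalue": evalue,
    }


if __name__ == "__main__":
    sample = [
        "RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 479 e-134",
        "YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 347 2e-94",
    ]
    for text in sample:
        print(parse_hit_line(text))
```

Descriptions that themselves contain numbers (such as "155 kDa") are handled because the pattern anchors the score and E value at the end of the line. Lines that are not hit entries return `None` and can be skipped.
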
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 479 bits (1233), Expect = e-134
Identities = 285/884 (32%), Positives = 463/884 (52%), Gaps = 34/884 (3%)
Query: 507 PEREVEFSIDLVPGAKLVSMAPYHMSASELAELKKQLEDLLDKKFVRPSVSPWGAPVLLV 566
P + +EF ++L + + Y + ++ + ++ L +R S + PV+ V
Sbjct: 396 PIKGLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEINQGLKSGIIRESKAINACPVMFV 455
Query: 567 KKKDGSMRLCIDYRQLNKVTIKNRYPLPRIDDLMDQLVGAKVFSKIDLRSGYHQIKVKDE 626
KK+G++R+ +DY+ LNK N YPLP I+ L+ ++ G+ +F+K+DL+S YH I+V+
Sbjct: 456 PKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKG 515
Query: 627 DMQKTAFRTRYGHYEYKVMPFGVTNAPGVFMEYMNRIFHAFLDKFVVVFIDDILIYSKIE 686
D K AFR G +EY VMP+G++ AP F ++N I + VV ++DDILI+SK E
Sbjct: 516 DEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKSE 575
Query: 687 EEHAKHLKIVLQILKERKLYAKLSKCEFWLSEVSFLGHVTSGNGIAVDPSKVDAVSQWET 746
EH KH+K VLQ LK L +KCEF S+V F+G+ S G +D V QW+
Sbjct: 576 SEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQ 635
Query: 747 PKSVTEIRSFLGLAGYYRRFIEGFSKLALPLTQLTYKGKSFVWDAQCESSFNELKQRLTT 806
PK+ E+R FLG Y R+FI S+L PL L K + W + +KQ L +
Sbjct: 636 PKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVS 695
Query: 807 APILILPKPEEPFVVYCDASKFGLGGVLMQ---DGKV--VAYASRQLRVHEKNSPTHDLE 861
P+L + ++ DAS +G VL Q D K V Y S ++ + N D E
Sbjct: 696 PPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKE 755
Query: 862 LAAVVFVLKIWRHYLYGS--RFEVFSDHKSL--KYLFDQKELNMRQRRWLELLKDYDFCL 917
+ A++ LK WRHYL + F++ +DH++L + + + N R RW L+D++F +
Sbjct: 756 MLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEI 815
Query: 918 NYHPDKAKVVADALSRKTLHMSALMVKEFELLEQFRDLSLVCELSSQSVQ-LGMLKINSD 976
NY P A +ADALSR +V E E + + + S+ + + I D
Sbjct: 816 NYRPGSANHIADALSR--------IVDETEPIPK--------DSEDNSINFVNQISITDD 859
Query: 977 FLGSIREAQQVDVKFVDLMVVSNQAEESDFKVDEQGVLRFRGRICIPDNEELKKLILEEG 1036
F + D K ++L+ ++ E + ++ + ++ + +I +P++ +L + I+++
Sbjct: 860 FKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILLPNDTQLTRTIIKKY 919
Query: 1037 HKSNLSIHLGATKMYQDLKKLFWWSGLKKDVARFVYACLTCQKSKVEHQRPAGLLTPLDV 1096
H+ IH G + + + F W G++K + +V C TCQ +K + +P G L P+
Sbjct: 920 HEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPP 979
Query: 1097 PEWKWDSISMDFVSSLPNTSRGHDSIWVVVDRLTKSAHFIPINISYPVAQLAEIYIQNIV 1156
E W+S+SMDF+++LP +S G+++++VVVDR +K A +P S Q A ++ Q ++
Sbjct: 980 SERPWESLSMDFITALPESS-GYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVI 1038
Query: 1157 KLHGVPSSIVSDRDPRFTSRFWRSLQDALGSKLKLSSAYHPQTDGQSERTIQSLEDLLRV 1216
G P I++D D FTS+ W+ +K S Y PQTDGQ+ERT Q++E LLR
Sbjct: 1039 AYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRC 1098
Query: 1217 CVLEQGGAWDSHLPLIEFTYNNSYHSSIGMAPFEALYGRKCKTPLCWFESGESVVLGPEL 1276
W H+ L++ +YNN+ HS+ M PFE ++ + L E E
Sbjct: 1099 VCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVH--RYSPALSPLELPSFSDKTDEN 1156
Query: 1277 VHETTEKVKMIREKMKASQSRQKSYHDKRRKDL-EFQEGGHVFLRVTPMTGVGRALKSRK 1335
ET + + ++E + + + K Y D + +++ EFQ G V ++ T G KS K
Sbjct: 1157 SQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK---RTKTGFLHKSNK 1213
Query: 1336 LTPKFIGPYQISERVGTVAYRVGLPPHLSNL-HDVFHVSQLRKY 1378
L P F GP+ + ++ G Y + LP + ++ FHVS L KY
Sbjct: 1214 LAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEKY 1257
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 476 bits (1226), Expect = e-133
Identities = 284/884 (32%), Positives = 463/884 (52%), Gaps = 34/884 (3%)
Query: 507 PEREVEFSIDLVPGAKLVSMAPYHMSASELAELKKQLEDLLDKKFVRPSVSPWGAPVLLV 566
P + +EF ++L + + Y + ++ + ++ L +R S + PV+ V
Sbjct: 396 PIKGLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEINQGLKSGIIRESKAINACPVMFV 455
Query: 567 KKKDGSMRLCIDYRQLNKVTIKNRYPLPRIDDLMDQLVGAKVFSKIDLRSGYHQIKVKDE 626
KK+G++R+ +DY+ LNK N YPLP I+ L+ ++ G+ +F+K+DL+S YH I+V+
Sbjct: 456 PKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKG 515
Query: 627 DMQKTAFRTRYGHYEYKVMPFGVTNAPGVFMEYMNRIFHAFLDKFVVVFIDDILIYSKIE 686
D K AFR G +EY VMP+G++ AP F ++N I + VV ++D+ILI+SK E
Sbjct: 516 DEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKSE 575
Query: 687 EEHAKHLKIVLQILKERKLYAKLSKCEFWLSEVSFLGHVTSGNGIAVDPSKVDAVSQWET 746
EH KH+K VLQ LK L +KCEF S+V F+G+ S G +D V QW+
Sbjct: 576 SEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQ 635
Query: 747 PKSVTEIRSFLGLAGYYRRFIEGFSKLALPLTQLTYKGKSFVWDAQCESSFNELKQRLTT 806
PK+ E+R FLG Y R+FI S+L PL L K + W + +KQ L +
Sbjct: 636 PKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVS 695
Query: 807 APILILPKPEEPFVVYCDASKFGLGGVLMQ---DGKV--VAYASRQLRVHEKNSPTHDLE 861
P+L + ++ DAS +G VL Q D K V Y S ++ + N D E
Sbjct: 696 PPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKE 755
Query: 862 LAAVVFVLKIWRHYLYGS--RFEVFSDHKSL--KYLFDQKELNMRQRRWLELLKDYDFCL 917
+ A++ LK WRHYL + F++ +DH++L + + + N R RW L+D++F +
Sbjct: 756 MLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEI 815
Query: 918 NYHPDKAKVVADALSRKTLHMSALMVKEFELLEQFRDLSLVCELSSQSVQ-LGMLKINSD 976
NY P A +ADALSR +V E E + + + S+ + + I D
Sbjct: 816 NYRPGSANHIADALSR--------IVDETEPIPK--------DSEDNSINFVNQISITDD 859
Query: 977 FLGSIREAQQVDVKFVDLMVVSNQAEESDFKVDEQGVLRFRGRICIPDNEELKKLILEEG 1036
F + D K ++L+ ++ E + ++ + ++ + +I +P++ +L + I+++
Sbjct: 860 FKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILLPNDTQLTRTIIKKY 919
Query: 1037 HKSNLSIHLGATKMYQDLKKLFWWSGLKKDVARFVYACLTCQKSKVEHQRPAGLLTPLDV 1096
H+ IH G + + + F W G++K + +V C TCQ +K + +P G L P+
Sbjct: 920 HEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPP 979
Query: 1097 PEWKWDSISMDFVSSLPNTSRGHDSIWVVVDRLTKSAHFIPINISYPVAQLAEIYIQNIV 1156
E W+S+SMDF+++LP +S G+++++VVVDR +K A +P S Q A ++ Q ++
Sbjct: 980 SERPWESLSMDFITALPESS-GYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVI 1038
Query: 1157 KLHGVPSSIVSDRDPRFTSRFWRSLQDALGSKLKLSSAYHPQTDGQSERTIQSLEDLLRV 1216
G P I++D D FTS+ W+ +K S Y PQTDGQ+ERT Q++E LLR
Sbjct: 1039 AYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRC 1098
Query: 1217 CVLEQGGAWDSHLPLIEFTYNNSYHSSIGMAPFEALYGRKCKTPLCWFESGESVVLGPEL 1276
W H+ L++ +YNN+ HS+ M PFE ++ + L E E
Sbjct: 1099 VCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVH--RYSPALSPLELPSFSDKTDEN 1156
Query: 1277 VHETTEKVKMIREKMKASQSRQKSYHDKRRKDL-EFQEGGHVFLRVTPMTGVGRALKSRK 1335
ET + + ++E + + + K Y D + +++ EFQ G V ++ T G KS K
Sbjct: 1157 SQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK---RTKTGFLHKSNK 1213
Query: 1336 LTPKFIGPYQISERVGTVAYRVGLPPHLSNL-HDVFHVSQLRKY 1378
L P F GP+ + ++ G Y + LP + ++ FHVS L KY
Sbjct: 1214 LAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEKY 1257
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 476 bits (1226), Expect = e-133
Identities = 284/884 (32%), Positives = 463/884 (52%), Gaps = 34/884 (3%)
Query: 507 PEREVEFSIDLVPGAKLVSMAPYHMSASELAELKKQLEDLLDKKFVRPSVSPWGAPVLLV 566
P + +EF ++L + + Y + ++ + ++ L +R S + PV+ V
Sbjct: 396 PIKGLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEINQGLKSGIIRESKAINACPVMFV 455
Query: 567 KKKDGSMRLCIDYRQLNKVTIKNRYPLPRIDDLMDQLVGAKVFSKIDLRSGYHQIKVKDE 626
KK+G++R+ +DY+ LNK N YPLP I+ L+ ++ G+ +F+K+DL+S YH I+V+
Sbjct: 456 PKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKG 515
Query: 627 DMQKTAFRTRYGHYEYKVMPFGVTNAPGVFMEYMNRIFHAFLDKFVVVFIDDILIYSKIE 686
D K AFR G +EY VMP+G++ AP F ++N I + VV ++D+ILI+SK E
Sbjct: 516 DEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKSE 575
Query: 687 EEHAKHLKIVLQILKERKLYAKLSKCEFWLSEVSFLGHVTSGNGIAVDPSKVDAVSQWET 746
EH KH+K VLQ LK L +KCEF S+V F+G+ S G +D V QW+
Sbjct: 576 SEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQ 635
Query: 747 PKSVTEIRSFLGLAGYYRRFIEGFSKLALPLTQLTYKGKSFVWDAQCESSFNELKQRLTT 806
PK+ E+R FLG Y R+FI S+L PL L K + W + +KQ L +
Sbjct: 636 PKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVS 695
Query: 807 APILILPKPEEPFVVYCDASKFGLGGVLMQ---DGKV--VAYASRQLRVHEKNSPTHDLE 861
P+L + ++ DAS +G VL Q D K V Y S ++ + N D E
Sbjct: 696 PPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKE 755
Query: 862 LAAVVFVLKIWRHYLYGS--RFEVFSDHKSL--KYLFDQKELNMRQRRWLELLKDYDFCL 917
+ A++ LK WRHYL + F++ +DH++L + + + N R RW L+D++F +
Sbjct: 756 MLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEI 815
Query: 918 NYHPDKAKVVADALSRKTLHMSALMVKEFELLEQFRDLSLVCELSSQSVQ-LGMLKINSD 976
NY P A +ADALSR +V E E + + + S+ + + I D
Sbjct: 816 NYRPGSANHIADALSR--------IVDETEPIPK--------DSEDNSINFVNQISITDD 859
Query: 977 FLGSIREAQQVDVKFVDLMVVSNQAEESDFKVDEQGVLRFRGRICIPDNEELKKLILEEG 1036
F + D K ++L+ ++ E + ++ + ++ + +I +P++ +L + I+++
Sbjct: 860 FKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILLPNDTQLTRTIIKKY 919
Query: 1037 HKSNLSIHLGATKMYQDLKKLFWWSGLKKDVARFVYACLTCQKSKVEHQRPAGLLTPLDV 1096
H+ IH G + + + F W G++K + +V C TCQ +K + +P G L P+
Sbjct: 920 HEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPP 979
Query: 1097 PEWKWDSISMDFVSSLPNTSRGHDSIWVVVDRLTKSAHFIPINISYPVAQLAEIYIQNIV 1156
E W+S+SMDF+++LP +S G+++++VVVDR +K A +P S Q A ++ Q ++
Sbjct: 980 SERPWESLSMDFITALPESS-GYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVI 1038
Query: 1157 KLHGVPSSIVSDRDPRFTSRFWRSLQDALGSKLKLSSAYHPQTDGQSERTIQSLEDLLRV 1216
G P I++D D FTS+ W+ +K S Y PQTDGQ+ERT Q++E LLR
Sbjct: 1039 AYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRC 1098
Query: 1217 CVLEQGGAWDSHLPLIEFTYNNSYHSSIGMAPFEALYGRKCKTPLCWFESGESVVLGPEL 1276
W H+ L++ +YNN+ HS+ M PFE ++ + L E E
Sbjct: 1099 VCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVH--RYSPALSPLELPSFSDKTDEN 1156
Query: 1277 VHETTEKVKMIREKMKASQSRQKSYHDKRRKDL-EFQEGGHVFLRVTPMTGVGRALKSRK 1335
ET + + ++E + + + K Y D + +++ EFQ G V ++ T G KS K
Sbjct: 1157 SQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK---RTKTGFLHKSNK 1213
Query: 1336 LTPKFIGPYQISERVGTVAYRVGLPPHLSNL-HDVFHVSQLRKY 1378
L P F GP+ + ++ G Y + LP + ++ FHVS L KY
Sbjct: 1214 LAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEKY 1257
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 347 bits (889), Expect = 2e-94
Identities = 253/907 (27%), Positives = 439/907 (47%), Gaps = 43/907 (4%)
Query: 490 VVNEFHEVFPDEIPDVPPEREVEFSIDLVPGAKLVSMAPYHMSASELAELKKQLEDLLDK 549
V+ +F +VF ++ E I+L GA+ + P + + E++K ++ +L++
Sbjct: 909 VIEQFQDVFAISDDELGRNSGTECVIELKEGAEPIRQKPRPIPLALKPEIRKMIQKMLNQ 968
Query: 550 KFVRPSVSPWGAPVLLVKKKDGSMRLCIDYRQLNKVTIKNRYPLPRIDDLMDQLVGAKVF 609
K +R S SPW +PV+LVKKKDGS+R+CIDYR++NKV N +PLP I+ + L G K++
Sbjct: 969 KVIRESKSPWSSPVVLVKKKDGSIRMCIDYRKVNKVVKNNAHPLPNIEATLQSLAGKKLY 1028
Query: 610 SKIDLRSGYHQIKVKDEDMQKTAFRTRYGHYEYKVMPFGVTNAPGVFMEYMNRIFHAFLD 669
+ D+ +G+ QI + ++ + TAF +E+ V+PFG+ +P +F M I L
Sbjct: 1029 TVFDMIAGFWQIPLDEKSKEITAFAIGSELFEWNVLPFGLVISPALFQGTMEEIIGDLLG 1088
Query: 670 KFVVVFIDDILIYSKIEEEHAKHLKIVLQILKERKLYAKLSKCEFWLSEVSFLGHVTSGN 729
V++DD+LI SK E+H + +K L +++ + + SKC EV +LGH + +
Sbjct: 1089 VCAFVYVDDLLIASKDMEQHLQDVKEALTRIRKSGMKLRASKCHIAKKEVEYLGHKVTLD 1148
Query: 730 GIAVDPSKVDAVSQWETPKSVTEIRSFLGLAGYYRRFIEGFSKLALPLTQLTYKGKSFVW 789
G+ K D + Q+ P +V E++SFLGL GYYR+FI F+++A LT L +++W
Sbjct: 1149 GVETQEVKTDKMKQFSRPTNVKELQSFLGLVGYYRKFILNFAQIASSLTSLISAKVAWIW 1208
Query: 790 DAQCESSFNELKQRLTTAPILILP------KPEEPFVVYCDASKFGLGGVLMQDG----- 838
+ + E +F ELK+ + P+L P K + PF++Y DAS+ G+G VL Q+G
Sbjct: 1209 EKEQEIAFQELKKLVCQTPVLAQPDVEAALKGDRPFMIYTDASRKGIGAVLAQEGPDGQQ 1268
Query: 839 KVVAYASRQLRVHEKNSPTHDLELAAVVFVLKIWRHYLYGSRFEVFSDHKSLKYLFDQKE 898
+A+AS+ L E DLE A++F L+ ++ +YG+ VF+DHK L L
Sbjct: 1269 HPIAFASKALSPAETRYHITDLEALAMMFALRRFKTIIYGTAITVFTDHKPLISLLKGSP 1328
Query: 899 LNMRQRRWLELLKDYDFCLNYHPDKAKVVADALSRKTLHMSALMVKEFELLEQFRDL--S 956
L R RW + ++D + Y KA VADALSR + L ++ + L + +
Sbjct: 1329 LADRLWRWSIEILEFDVKIVYLAGKANAVADALSRGGCPPNELEEEQTKELTSIVNAIQT 1388
Query: 957 LVCELSSQSVQLGMLKINSDFLGSIREAQQVDVKFVDLMVVSNQAEES-DFKVDEQGVLR 1015
+ ++ S L LK + + A + +V ++E S ++ GVL+
Sbjct: 1389 ELPDILDSSCWLERLKGEDEGWKEVIAALEGGKTKGTFKIVGIESEISLEYYKIVGGVLK 1448
Query: 1016 -----FRGRICIPDNEELKKLILEEGHKSNLSIHLGATKMYQDLKKLFWWSGLKKDVARF 1070
+ R +P E+++ +L+E H+ L+ H G KM++ + + F+W ++ V
Sbjct: 1449 NTEIEEQSRSVVP--EKIRTPLLKELHEGMLAGHFGIKKMWRMVHRKFYWPQMRVCVENC 1506
Query: 1071 VYACLTCQKSKVEHQRPAGLLTPLDVPEWKWDSISMDFVSSLPNTSRGHDSIWVVVDRLT 1130
V C C + +H + LTP + + + ++ D + + + +G+ I ++D T
Sbjct: 1507 VRTCAKCLCAN-DHSKLTSSLTPYRM-TFPLEIVACDLM-DVGLSVQGNRYILTIIDLFT 1563
Query: 1131 KSAHFIPINISYPVAQLAEIYIQNIVKLHGVPSSIVSDRDPRFTSRFWRSLQDALGSKLK 1190
K +PI L + + +P +++D+ F + + L +
Sbjct: 1564 KYGTAVPIPDKKAETVLKAFVERWAIGEGRIPLKLLTDQGKEFVNGLFAQFTHMLKIEHI 1623
Query: 1191 LSSAYHPQTDGQSERTIQSLEDLLRVCVLEQGGAWDSHLPLIEFTYNNSYHSSIGMAPFE 1250
+ Y+ + +G ER +++ +++ WD + + YNN H + G P
Sbjct: 1624 TTKGYNSRANGAVERFNKTIMHIMKKKTAVP-MEWDDQVVYAVYAYNNCVHENTGETPMF 1682
Query: 1251 ALYGRKCKTPLCWFESGESVV---------LGPELVHETTEKVKMIREKMKASQSRQKSY 1301
++GR PL SGE V L E + K+ +E Q KS
Sbjct: 1683 LMHGRDVMGPL--EMSGEDAVGINYADMDEYKHLLTQELLKVQKIAKEHAMREQESYKSL 1740
Query: 1302 HDKR--RKDLEFQEGGHVFLRVTPMTGVGRALKSRKLTPKFIGPYQI---SERVGTVAYR 1356
D++ K F + G L P +G + KL K+ GPY++ SE +
Sbjct: 1741 FDQKYASKKHRFPQPGSRVLLEIPSEKLG--AQCPKLVNKWSGPYRVISCSENSAEITPV 1798
Query: 1357 VGLPPHL 1363
+G H+
Sbjct: 1799 LGKRKHI 1805
Score = 38.1 bits (87), Expect = 0.18
Identities = 29/77 (37%), Positives = 35/77 (44%), Gaps = 10/77 (12%)
Query: 243 PQKGKNAPVDVVCYKCGVKGHKSNACTQDEK-----KCFRCGQKGHVLAEC-KRGDIVCF 296
PQK +N P D C C +G C++ K KC C Q G +A C K + CF
Sbjct: 535 PQKHQN-PSDR-CSDCQQRGWHMFWCSKKSKDNASQKCDECQQSGWHMASCFKLKNRACF 592
Query: 297 SCGEEGHNGAQCTQPKK 313
C E GH C PKK
Sbjct: 593 RCNEMGHIAWNC--PKK 607
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 325 bits (833), Expect = 6e-88
Identities = 230/776 (29%), Positives = 385/776 (48%), Gaps = 75/776 (9%)
Query: 490 VVNEFHEVFPDEIPDVPPEREVEFSIDLVPGAKLVSMAPYHMSASELAELKKQLEDLLDK 549
++ ++H++ E + + + +I+ L S Y + + E++ Q++D+L++
Sbjct: 176 LLQKYHDIQYHEGDKLTFTNQTKHTINTKHNLPLYSKYSYPQAYEQ--EVESQIQDMLNQ 233
Query: 550 KFVRPSVSPWGAPVLLV-KKKDGS----MRLCIDYRQLNKVTIKNRYPLPRIDDLMDQLV 604
+R S SP+ +P+ +V KK+D S R+ IDYR+LN++T+ +R+P+P +D+++ +L
Sbjct: 234 GIIRTSNSPYNSPIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDRHPIPNMDEILGKLG 293
Query: 605 GAKVFSKIDLRSGYHQIKVKDEDMQKTAFRTRYGHYEYKVMPFGVTNAPGVFMEYMNRIF 664
F+ IDL G+HQI++ E + KTAF T++GHYEY MPFG+ NAP F MN I
Sbjct: 294 RCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKNAPATFQRCMNDIL 353
Query: 665 HAFLDKFVVVFIDDILIYSKIEEEHAKHLKIVLQILKERKLYAKLSKCEFWLSEVSFLGH 724
L+K +V++DDI+++S +EH + L +V + L + L +L KCEF E +FLGH
Sbjct: 354 RPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDKCEFLKQETTFLGH 413
Query: 725 VTSGNGIAVDPSKVDAVSQWETPKSVTEIRSFLGLAGYYRRFIEGFSKLALPLTQLTYKG 784
V + +GI +P K++A+ ++ P EI++FLGL GYYR+FI F+ +A P+T+ K
Sbjct: 414 VLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTGYYRKFIPNFADIAKPMTKCLKKN 473
Query: 785 -KSFVWDAQCESSFNELKQRLTTAPILILPKPEEPFVVYCDASKFGLGGVLMQDGKVVAY 843
K + + +S+F +LK ++ PIL +P + F + DAS LG VL QDG ++Y
Sbjct: 474 MKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLTTDASDVALGAVLSQDGHPLSY 533
Query: 844 ASRQLRVHEKNSPTHDLELAAVVFVLKIWRHYLYGSRFEVFSDHKSLKYLFDQKELNMRQ 903
SR L HE N T + EL A+V+ K +RHYL G FE+ SDH+ L +L+ K+ N +
Sbjct: 534 ISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGRHFEISSDHQPLSWLYRMKDPNSKL 593
Query: 904 RRWLELLKDYDFCLNYHPDKAKVVADALSRKTLHMSALMVK-EFELLEQFRDLSLVCE-- 960
RW L ++DF + Y K VADALSR L + L + + E DL + E
Sbjct: 594 TRWRVKLSEFDFDIKYIKGKENCVADALSRIKLEETYLSEQTQHSAEEDNSDLIFITERP 653
Query: 961 LSSQSVQLGMLKINSDFLGSIREAQQVDVKFVDLMVVSNQAE-----------------E 1003
L++ + Q+ K D + + + F D+M + +
Sbjct: 654 LNTFNRQVIFSKGPPDIKVTKYFKKHITQIFYDIMTREKAEQYLIDHFCGKKSALYIESD 713
Query: 1004 SDFKVDEQG-----------VLRFRGRI-CIPDNEELKKLILEEGHKSNLSIHLGATKMY 1051
+DF+V + +LR + I E K+LIL K +H G K
Sbjct: 714 ADFEVIQAAHKLAINTKYTKILRSTILLKNITTYAEFKELILTAHEK---LLHPGIQKTT 770
Query: 1052 QDLKKLFWWSGLKKDVARFVYACLTCQKSKVEHQRPAGLLTPLDVPEWKWDSISMDFVSS 1111
+ + +++ + + + C C +K EH+ PE + +D SS
Sbjct: 771 KLFGETYYFPNSQLLIQNIINECSICNLAKTEHRNTDMPTKTTPKPEHCREKFMIDIYSS 830
Query: 1112 LPNTSRGHDSIWVVVDRLTKSAHFIP-INISYPVAQLAEI----------YIQNIVKLHG 1160
+ H++ I+I A L EI + I G
Sbjct: 831 -------------------EGKHYVSCIDIYSKFATLEEIKTKDWIECKNALMRIFNQLG 871
Query: 1161 VPSSIVSDRDPRFTSRFWRSLQDALGSKLKLSSAYHPQTDGQSERTIQSLEDLLRV 1216
P + +DRD F+S + ++ +L+L++ D ER +++ + +R+
Sbjct: 872 KPKLLKADRDGAFSSLALKRWLESEEVELQLNTTKTGVAD--IERLHKTINEKIRI 925
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 315 bits (808), Expect = 4e-85
Identities = 242/880 (27%), Positives = 417/880 (46%), Gaps = 84/880 (9%)
Query: 427 LEYNRVCINCFNKTVHFSSAEEESGAQFL---TTKQLKQLERDGI--LMFSLMASLSLEN 481
+ Y + F++T ++E E T + + +++ I L FS L
Sbjct: 106 INYKNDTVTLFDQTYKLITSESERNQNLYIQRTPESIASSDQESIKKLDFSQFRLDHLNQ 165
Query: 482 QVVIDRLPVVNEFHEVFPDEIPDVPPEREVEFSIDLVPGAKLVSMAPYHMSASELAELKK 541
+ ++N+F + E + ++ ++ + + S Y ++ + E++
Sbjct: 166 EETFKLKGLLNKFRNLEYKEGEKLTFTNTIKHVLNTTHNSPIYSKQ-YPLAQTHEIEVEN 224
Query: 542 QLEDLLDKKFVRPSVSPWGAPVLLVKKKDGSM-----RLCIDYRQLNKVTIKNRYPLPRI 596
Q++++L++ +R S SP+ +P +V KK + R+ IDYR+LN++TI +RYP+P +
Sbjct: 225 QVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVVIDYRKLNEITIPDRYPIPNM 284
Query: 597 DDLMDQLVGAKVFSKIDLRSGYHQIKVKDEDMQKTAFRTRYGHYEYKVMPFGVTNAPGVF 656
D+++ +L + F+ IDL G+HQI++ +E + KTAF T+ GHYEY MPFG+ NAP F
Sbjct: 285 DEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTKSGHYEYLRMPFGLRNAPATF 344
Query: 657 MEYMNRIFHAFLDKFVVVFIDDILIYSKIEEEHAKHLKIVLQILKERKLYAKLSKCEFWL 716
MN I L+K +V++DDI+I+S EH +++V L + L +L KCEF
Sbjct: 345 QRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLVFTKLADANLKLQLDKCEFLK 404
Query: 717 SEVSFLGHVTSGNGIAVDPSKVDAVSQWETPKSVTEIRSFLGLAGYYRRFIEGFSKLALP 776
E +FLGH+ + +GI +P KV A+ + P EIR+FLGL GYYR+FI ++ +A P
Sbjct: 405 KEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAFLGLTGYYRKFIPNYADIAKP 464
Query: 777 LTQ-LTYKGKSFVWDAQCESSFNELKQRLTTAPILILPKPEEPFVVYCDASKFGLGGVLM 835
+T L + K + +F +LK + PIL LP E+ FV+ DAS LG VL
Sbjct: 465 MTSCLKKRTKIDTQKLEYIEAFEKLKALIIRDPILQLPDFEKKFVLTTDASNLALGAVLS 524
Query: 836 QDGKVVAYASRQLRVHEKNSPTHDLELAAVVFVLKIWRHYLYGSRFEVFSDHKSLKYLFD 895
Q+G +++ SR L HE N + EL A+V+ K +RHYL G +F + SDH+ L++L +
Sbjct: 525 QNGHPISFISRTLNDHELNYSAIEKELLAIVWATKTFRHYLLGRQFLIASDHQPLRWLHN 584
Query: 896 QKELNMRQRRWLELLKDYDFCLNYHPDKAKVVADALSR----------KTLHM-----SA 940
KE + RW L +Y F ++Y K VADALSR T H S
Sbjct: 585 LKEPGAKLERWRVRLSEYQFKIDYIKGKENSVADALSRIKIEENHHSEATQHSAEEDNSN 644
Query: 941 LMVKEFELLEQFRDLSLVCELSSQSVQLGMLKINS------DFLGSIREAQQVDV-KFVD 993
L+ + + F+ + + V+ + NS D + ++ +A+Q+ + F+
Sbjct: 645 LIHLTEKPINYFKKQIIFIKSDKNKVEHSKIFGNSITTIQYDVM-TLEKAKQILLDHFIH 703
Query: 994 LMVVSNQAEESDFKVDEQGVLRFRGRIC------------IPDNEELKKLILEEGHKSNL 1041
+ + DF++ ++ + + E K++IL+ K
Sbjct: 704 RNITIYIESDVDFEIVQRAHIEIVNTTYTKVIRSLFLLKNVGSYAEFKEIILQSHEK--- 760
Query: 1042 SIHLGATKMYQDLKKLFWWSGLKKDVARFVYACLTCQKSKVEHQRPAGLLTPLDVPEWKW 1101
+H G KM + K+ ++ + + + C C +K EH+ L PE
Sbjct: 761 LLHPGIQKMTKLFKENHFFPNSQLLIQNIINECNICNLAKTEHRNTKMPLKITPNPEHCR 820
Query: 1102 DSISMDFVSSLPNTSRGHDSIWVVVDRLTKSAHFIP-INISYPVAQLAEIYIQNIVKLH- 1159
+ +D SS + H+I I+I A L +I ++ ++
Sbjct: 821 EKFVVDIYSS-------------------EGKHYISCIDIYSKFATLEQIKTKDWIECRN 861
Query: 1160 ---------GVPSSIVSDRDPRFTSRFWRSLQDALGSKLKLSSAYHPQTDGQSERTIQSL 1210
G P + +DRD F+S + + +L+L++A + D ER +++
Sbjct: 862 ALMRIFNQLGKPKLLKADRDGAFSSLALKRWLEEEEVELQLNTAKNGVAD--VERLHKTI 919
Query: 1211 EDLLRVC-VLEQGGAWDSHLPLIEFTYNNSY-HSSIGMAP 1248
+ +R+ + S + I +TYN H + G P
Sbjct: 920 NEKIRIINSSDDEEVKLSKIETILYTYNQKIKHDTTGQRP 959
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 291 bits (746), Expect = 7e-78
Identities = 222/847 (26%), Positives = 391/847 (45%), Gaps = 103/847 (12%)
Query: 490 VVNEFHEVFPDEIPDVPPEREVEFSIDLVPGAKLVSMAPYHMSASELAELKKQLEDLLDK 549
++ EF +F + + E V+ I + + + Y + E+++Q+++LL
Sbjct: 91 LLGEFPRIFEPPLSGMSVETAVKAEIRTNTQDPIYAKS-YPYPVNMRGEVERQIDELLQD 149
Query: 550 KFVRPSVSPWGAPVLLVKKK-----DGSMRLCIDYRQLNKVTIKNRYPLPRIDDLMDQLV 604
+RPS SP+ +P+ +V KK + R+ +D+++LN VTI + YP+P I+ + L
Sbjct: 150 GIIRPSNSPYNSPIWIVPKKPKPNGEKQYRMVVDFKRLNTVTIPDTYPIPDINATLASLG 209
Query: 605 GAKVFSKIDLRSGYHQIKVKDEDMQKTAFRTRYGHYEYKVMPFGVTNAPGVFMEYMNRIF 664
AK F+ +DL SG+HQI +K+ D+ KTAF T G YE+ +PFG+ NAP +F ++ I
Sbjct: 210 NAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPFGLKNAPAIFQRMIDDIL 269
Query: 665 HAFLDKFVVVFIDDILIYSKIEEEHAKHLKIVLQILKERKLYAKLSKCEFWLSEVSFLGH 724
+ K V+IDDI+++S+ + H K+L++VL L + L L K F ++V FLG+
Sbjct: 270 REHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNLEKSHFLDTQVEFLGY 329
Query: 725 VTSGNGIAVDPSKVDAVSQWETPKSVTEIRSFLGLAGYYRRFIEGFSKLALPLTQLTYKG 784
+ + +GI DP KV A+S+ P SV E++ FLG+ YYR+FI+ ++K+A PLT LT
Sbjct: 330 IVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYRKFIQDYAKVAKPLTNLTRGL 389
Query: 785 KSFVWDAQCE-----------SSFNELKQRLTTAPILILPKPEEPFVVYCDASKFGLGGV 833
+ + +Q SFN+LK L ++ IL P +PF + DAS + +G V
Sbjct: 390 YANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKPFHLTTDASNWAIGAV 449
Query: 834 LMQD----GKVVAYASRQLRVHEKNSPTHDLELAAVVFVLKIWRHYLYGS-RFEVFSDHK 888
L QD + +AY SR L E+N T + E+ A+++ L R YLYG+ +V++DH+
Sbjct: 450 LSQDDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLDNLRAYLYGAGTIKVYTDHQ 509
Query: 889 SLKYLFDQKELNMRQRRWLELLKDYDFCLNYHPDKAKVVADALSR--------------- 933
L + + N + +RW +++Y+ L Y P K+ VVADALSR
Sbjct: 510 PLTFALGNRNFNAKLKRWKARIEEYNCELIYKPGKSNVVADALSRIPPQLNQLSTDLDAN 569
Query: 934 ------------KTLHMSALMVKEFE---------LLEQFRDLSLVCELSSQSVQLGMLK 972
LH S+ ++ E L+ +CE ++
Sbjct: 570 PEDDMQSLATAHSALHDSSRLIPHVESPINVFKNQLIFDTTRSKYLCEHPFPGYTRHLIP 629
Query: 973 INSDFLGSIREAQQVDVKFVDLMVVSNQAEESDFKVDEQGVLRFRGRICIPD-------- 1024
+ L + + Q ++ V + + K+ E + RF+ IC+ +
Sbjct: 630 LKDGSLADLTNSLQSCLRPVII---------NGVKIPEAHLQRFQS-ICLANFLLYKIRI 679
Query: 1025 ----------NEELKKLILEEGHKSNLSIHLGATKMYQDLKKLFWWSGLKKDVARFVYAC 1074
EE+ ++I +E ++ H G T++ L + +++ + + +C
Sbjct: 680 TQRLVADVSGAEEICEIIEKEHRRA----HRGPTEIRLQLLEKYYFPRMSSTIRLQTSSC 735
Query: 1075 LTCQKSKVEHQRPAGLLTPLDVPEWKWDSISMDFVSSLPNTSRGHDSIWVVVDRLTKSAH 1134
C+ K E L P +P + + + +D + +D+ +K A
Sbjct: 736 QCCKLYKYERHPNKPNLQPTPIPNYPCEILHIDIFALEKRLYLS------CIDKFSKFAK 789
Query: 1135 FIPINISYPVAQLAEIYIQNIVKLHGVPSSIVSDRDPRFTSRFWRSLQDALGSKLKLSSA 1194
+ S L E ++ + P +VSD + + +L L +
Sbjct: 790 LFHLQ-SKASVHLRETLVE-ALHYFTAPKVLVSDNERGLLCPTVLNYLRSLDIDLYYAPT 847
Query: 1195 YHPQTDGQSERTIQSLEDLLRVCVLEQGGAWDSHLPLIEFT---YNNSYHSSIGMAPFEA 1251
+ +GQ ER + ++ R C+ ++ + + L+ YN S HS P +
Sbjct: 848 QKSEVNGQVERFHSTFLEIYR-CLKDELPTF-KPVELVHIAVDRYNTSVHSVTNRKPADV 905
Query: 1252 LYGRKCK 1258
+ R +
Sbjct: 906 FFDRSSR 912
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 276 bits (706), Expect = 3e-73
Identities = 215/806 (26%), Positives = 381/806 (46%), Gaps = 73/806 (9%)
Query: 505 VPPEREVEFSIDLVPGAKLVSMA-PYHMSASELAELKKQLEDLLDKKFVRPSVSPWGAPV 563
+P V +I V + S A P M S+ + +++ LL +RPS SP+ +P
Sbjct: 164 LPFNTAVTATIRTVDNEPVYSRAYPTLMGVSDF--VNNEVKQLLKDGIIRPSRSPYNSPT 221
Query: 564 LLVKKK------DGSMRLCIDYRQLNKVTIKNRYPLPRIDDLMDQLVGAKVFSKIDLRSG 617
+V KK + + RL ID+R+LN+ TI +RYP+P I ++ L AK F+ +DL+SG
Sbjct: 222 WVVDKKGTDAFGNPNKRLVIDFRKLNEKTIPDRYPMPSIPMILANLGKAKFFTTLDLKSG 281
Query: 618 YHQIKVKDEDMQKTAFRTRYGHYEYKVMPFGVTNAPGVFMEYMNRIFHAFLDKFVVVFID 677
YHQI + + D +KT+F G YE+ +PFG+ NA +F ++ + + K V++D
Sbjct: 282 YHQIYLAEHDREKTSFSVNGGKYEFCRLPFGLRNASSIFQRALDDVLREQIGKICYVYVD 341
Query: 678 DILIYSKIEEEHAKHLKIVLQILKERKLYAKLSKCEFWLSEVSFLGHVTSGNGIAVDPSK 737
D++I+S+ E +H +H+ VL+ L + + K F+ V +LG + S +G DP K
Sbjct: 342 DVIIFSENESDHVRHIDTVLKCLIDANMRVSQEKTRFFKESVEYLGFIVSKDGTKSDPEK 401
Query: 738 VDAVSQWETPKSVTEIRSFLGLAGYYRRFIEGFSKLALPLTQLTYKGKS----------- 786
V A+ ++ P V ++RSFLGLA YYR FI+ F+ +A P+T + KG++
Sbjct: 402 VKAIQEYPEPDCVYKVRSFLGLASYYRVFIKDFAAIARPITDI-LKGENGSVSKHMSKKI 460
Query: 787 -FVWDAQCESSFNELKQRLTTAPILI-LPKPEEPFVVYCDASKFGLGGVLMQDGKVVAYA 844
++ ++F L+ L + +++ P ++PF + DAS G+G VL Q+G+ +
Sbjct: 461 PVEFNETQRNAFQRLRNILASEDVILKYPDFKKPFDLTTDASASGIGAVLSQEGRPITMI 520
Query: 845 SRQLRVHEKNSPTHDLELAAVVFVLKIWRHYLYGSR-FEVFSDHKSLKYLFDQKELNMRQ 903
SR L+ E+N T++ EL A+V+ L +++LYGSR +F+DH+ L + + N +
Sbjct: 521 SRTLKQPEQNYATNERELLAIVWALGKLQNFLYGSREINIFTDHQPLTFAVADRNTNAKI 580
Query: 904 RRWLELLKDYDFCLNYHPDKAKVVADALSRKTLHM--------SALMVKEFEL------- 948
+RW + ++ + Y P K VADALSR+ L+ +A + E L
Sbjct: 581 KRWKSYIDQHNAKVFYKPGKENFVADALSRQNLNALQNEPQSDAATIHSELSLTYTVETT 640
Query: 949 ---LEQFRDLSLVCELS------------SQSVQLGMLKINSDFLGSIREAQQVDVKFVD 993
L FR+ ++ E + S+S L S L +++E DV
Sbjct: 641 DKPLNCFRN-QIILEAARFPLKRNLVLFRSKSRHLISFTDKSWLLKTLKEVVNPDVVNAI 699
Query: 994 LMVVSNQAEESDFKVDEQGVLRFRG----RICIPDNEELKKLILEEGHKSNLSIHLGATK 1049
+ A + +FR + I D E +++ E +++ H A +
Sbjct: 700 HCDLPTLASFQHDLIAHFPATQFRHCKNVVLDITDKNEQIEIVTAEHNRA----HRAAQE 755
Query: 1050 MYQDLKKLFWWSGLKKDVARFVYACLTCQKSKVEHQRPAGLLTPLDVPEWKWDSISMDFV 1109
+ + + +++ + V C C ++K + L +P + + + +D
Sbjct: 756 NIKQVLRDYYFPKMGSLAKEVVANCRVCTQAKYDRHPKKQELGETPIPSYTGEMVHIDIF 815
Query: 1110 SSLPNTSRGHDSIWVVVDRLTKSAHFIPINISYPVAQLAEIYIQNIVKLHGVPSSIVSDR 1169
S+ +D+ +K A P+ +S + + +Q I+ L ++ D
Sbjct: 816 ST------DRKLFLTCIDKFSKYAIVQPV-VSRTIVDITAPLLQ-IINLFPNIKTVYCDN 867
Query: 1170 DPRFTSRFWRS-LQDALGSKLKLSSAYHPQTDGQSERTIQSLEDLLRVCVLEQGGAWDSH 1228
+P F S S L+++ G + + H ++GQ ER +L ++ R L++
Sbjct: 868 EPAFNSETVTSMLKNSFGIDIVNAPPLHSSSNGQVERFHSTLAEIARCLKLDKKTNDTVE 927
Query: 1229 LPL-IEFTYNNSYHSSIGMAPFEALY 1253
L L YN + HS P E ++
Sbjct: 928 LILRATIEYNKTVHSVTRERPIEVVH 953
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 265 bits (676), Expect = 9e-70
Identities = 152/418 (36%), Positives = 222/418 (52%), Gaps = 10/418 (2%)
Query: 529 YHMSASELAELKKQLEDLLDKKFVRPSVSPWGAPVLLVKKKDG------SMRLCIDYRQL 582
Y S++ E++ Q++ L+ K V PSVS + +P+LLV KK RL IDYRQ+
Sbjct: 320 YRSPHSQVEEIQAQVQKLIKDKIVEPSVSQYNSPLLLVPKKSSPNSDKKKWRLVIDYRQI 379
Query: 583 NKVTIKNRYPLPRIDDLMDQLVGAKVFSKIDLRSGYHQIKVKDEDMQKTAFRTRYGHYEY 642
NK + +++PLPRIDD++DQL AK FS +DL SG+HQI++ + T+F T G Y +
Sbjct: 380 NKKLLADKFPLPRIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTSNGSYRF 439
Query: 643 KVMPFGVTNAPGVFMEYMNRIFHAFLDKFVVVFIDDILIYSKIEEEHAKHLKIVLQILKE 702
+PFG+ AP F M F +++DD+++ E+ K+L V +E
Sbjct: 440 TRLPFGLKIAPNSFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEVFGKCRE 499
Query: 703 RKLYAKLSKCEFWLSEVSFLGHVTSGNGIAVDPSKVDAVSQWETPKSVTEIRSFLGLAGY 762
L KC F++ EV+FLGH + GI D K D + + P R F+ Y
Sbjct: 500 YNLKLHPEKCSFFMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRFVAFCNY 559
Query: 763 YRRFIEGFSKLALPLTQLTYKGKSFVWDAQCESSFNELKQRLTTAPILILPKPEEPFVVY 822
YRRFI+ F+ + +T+L K F W +C+ +F LK +L +L P + F +
Sbjct: 560 YRRFIKNFADYSRHITRLCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSKEFCIT 619
Query: 823 CDASKFGLGGVLMQDGK----VVAYASRQLRVHEKNSPTHDLELAAVVFVLKIWRHYLYG 878
DASK G VL Q+ VAYASR E N T + ELAA+ + + +R Y+YG
Sbjct: 620 TDASKQACGAVLTQNHNGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYIYG 679
Query: 879 SRFEVFSDHKSLKYLFDQKELNMRQRRWLELLKDYDFCLNYHPDKAKVVADALSRKTL 936
F V +DH+ L YLF + + R L++Y+F + Y K VADALSR T+
Sbjct: 680 KHFTVKTDHRPLTYLFSMVNPSSKLTRIRLELEEYNFTVEYLKGKDNHVADALSRITI 737
Score = 104 bits (260), Expect = 2e-21
Identities = 84/336 (25%), Positives = 155/336 (46%), Gaps = 21/336 (6%)
Query: 1024 DNEELKKLILEEGHKSNLSI-HLGATKMYQDLKKLFWWSGLKKDVARFVYACLTCQKSKV 1082
+NE+ K+ IL H + H G TK +K+ ++W + K + +V C CQK+K
Sbjct: 888 NNEKEKEAILSTLHDDPIQGGHTGITKTLAKVKRHYYWKNMSKYIKEYVRKCQKCQKAKT 947
Query: 1083 -EHQRPAGLLTPLDVPEWKWDSISMDFVSSLPNTSRGHDSIWVVVDRLTKSAHFIPINIS 1141
+H + +T + PE +D + +D + LP + G++ ++ LTK IPI +
Sbjct: 948 TKHTKTPMTIT--ETPEHAFDRVVVDTIGPLPKSENGNEYAVTLICDLTKYLVAIPI-AN 1004
Query: 1142 YPVAQLAEIYIQNIVKLHGVPSSIVSDRDPRFTSRFWRSLQDALGSKLKLSSAYHPQTDG 1201
+A+ ++ + +G + ++D + + L L K S+A+H QT G
Sbjct: 1005 KSAKTVAKAIFESFILKYGPMKTFITDMGTEYKNSIITDLCKYLKIKNITSTAHHHQTVG 1064
Query: 1202 QSERTIQSLEDLLRVCVLEQGGAWDSHLPLIEFTYNNSYHSSIGMAPFEALYGRKCKTPL 1261
ER+ ++L + +R + WD L + +N + P+E ++GR P
Sbjct: 1065 VVERSHRTLNEYIRSYISTDKTDWDVWLQYFVYCFNTTQSMVHNYCPYELVFGRTSNLPK 1124
Query: 1262 CW--FESGESVVLGPELVHETTEKVKM----IREKMKASQSRQKSYHDKRRKDLEFQEGG 1315
+ S E + + E+ ++++ R+ ++A + + K +D + KD+E + G
Sbjct: 1125 HFNKLHSIEPIYNIDDYAKESKYRLEVAYARARKLLEAHKEKNKENYDLKVKDIELEVGD 1184
Query: 1316 HVFLRVTPMTGVGRALKSRKLTPKFIGPYQISERVG 1351
V LR VG KL K+ GPY+I E +G
Sbjct: 1185 KVLLR----NEVG-----HKLDFKYTGPYKI-ESIG 1210
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 211 bits (537), Expect = 1e-53
Identities = 181/668 (27%), Positives = 301/668 (44%), Gaps = 70/668 (10%)
Query: 326 NQTTNEDRHIRGTCFFNSTPLIAI---IDTGATHC----FIVLECAYKLGLIVSDMKGEM 378
N T +I+G +F I + +DTGA+ C F++ E + + + +
Sbjct: 17 NVTNPNSIYIKGRLYFKGYKKIELHCFVDTGASLCIASKFVIPEEHWV------NAERPI 70
Query: 379 VVETPAKGSVTTSLVCLRCPLSMFGRDFEVDLVCLPLLGMDVIFGMNW-------LEYNR 431
+V+ S+T S VC L + G F + V G+D I G N+ +++
Sbjct: 71 MVKIADGSSITISKVCKDIDLIIAGEIFRIPTVYQQESGIDFIIGNNFCQLYEPFIQFTD 130
Query: 432 VCINCFNKT--VHF---------------------SSAEEESGAQFLTTKQLKQLERDGI 468
I NK+ VH S ++ T K LE I
Sbjct: 131 RVIFTKNKSYPVHIAKLTRAVRVGTEGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAI 190
Query: 469 LMFSLMASLSLENQVVID-RLPVVNEFHEVFPDEIPDVPPERE--VEFSIDLVPGAKLVS 525
L S LS E + R+ + E E E P P + + ++ SI L +K +
Sbjct: 191 L--SEGRRLSEEKLFITQQRMQKIEELLEKVCSENPLDPNKTKQWMKASIKLSDPSKAIK 248
Query: 526 MAPYHMSASELAELKKQLEDLLDKKFVRPSVSPWGAPVLLV----KKKDGSMRLCIDYRQ 581
+ P S + E KQ+++LLD K ++PS SP AP LV +K+ G R+ ++Y+
Sbjct: 249 VKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKA 308
Query: 582 LNKVTIKNRYPLPRIDDLMDQLVGAKVFSKIDLRSGYHQIKVKDEDMQKTAFRTRYGHYE 641
+NK T+ + Y LP D+L+ + G K+FS D +SG+ Q+ + E TAF GHYE
Sbjct: 309 MNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYE 368
Query: 642 YKVMPFGVTNAPGVFMEYMNRIFHAFLDKFVVVFIDDILIYSKIEEEHAKHLKIVLQILK 701
+ V+PFG+ AP +F +M+ F F KF V++DDIL++S EE+H H+ ++LQ
Sbjct: 369 WNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHLLHVAMILQKCN 427
Query: 702 ERKLYAKLSKCEFWLSEVSFLGHVTSGNGIAVDPSKVDAVSQW-ETPKSVTEIRSFLGLA 760
+ + K + + +++FLG ++ ++++ +T + +++ FLG+
Sbjct: 428 QHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGIL 487
Query: 761 GYYRRFIEGFSKLALPLTQLTYKGKSFVWDAQCESSFNELKQRLTTAPILILPKPEEPFV 820
Y +I +++ PL + + W + ++K+ L P L P PEE +
Sbjct: 488 TYASDYIPKLAQIRKPLQAKLKENVPWRWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLI 547
Query: 821 VYCDASKFGLGGVL----MQDGK----VVAYASRQLRVHEKNSPTHDLELAAVVFVLKIW 872
+ DAS GG+L + +G + YAS + EKN ++D E AV+ +K +
Sbjct: 548 IETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKNYHSNDKETLAVINTIKKF 607
Query: 873 RHYLYGSRFEVFSDHK------SLKYLFDQKELNMRQRRWLELLKDYDFCLNYHPDKAKV 926
YL F + +D+ +L Y D K R RW L Y F + +
Sbjct: 608 SIYLTPVHFLIRTDNTHFKSFVNLNYKGDSK--LGRNIRWQAWLSHYSFDVEHIKGTDNH 665
Query: 927 VADALSRK 934
AD LSR+
Sbjct: 666 FADFLSRE 673
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 211 bits (537), Expect = 1e-53
Identities = 181/668 (27%), Positives = 302/668 (45%), Gaps = 70/668 (10%)
Query: 326 NQTTNEDRHIRGTCFFNSTPLIAI---IDTGATHC----FIVLECAYKLGLIVSDMKGEM 378
N T +I+G +F I + +DTGA+ C F++ E + + + +
Sbjct: 17 NVTNPNSIYIKGRLYFKGYKKIELHCFVDTGASLCIASKFVIPEEHWV------NAERPI 70
Query: 379 VVETPAKGSVTTSLVCLRCPLSMFGRDFEVDLVCLPLLGMDVIFGMNW-------LEYNR 431
+V+ S+T S VC L + G F++ V G+D I G N+ +++
Sbjct: 71 MVKIADGSSITISKVCKDIDLIIAGEIFKIPTVYQQESGIDFIIGNNFCQLYEPFIQFTD 130
Query: 432 VCINCFNKT--VHF---------------------SSAEEESGAQFLTTKQLKQLERDGI 468
I NK+ VH S ++ T K LE I
Sbjct: 131 RVIFTKNKSYPVHITKLTRAVRVGIEGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAI 190
Query: 469 LMFSLMASLSLENQVVID-RLPVVNEFHEVFPDEIPDVPPERE--VEFSIDLVPGAKLVS 525
L S LS E + R+ + E E E P P + + ++ SI L +K +
Sbjct: 191 L--SEGRRLSEEKLFITQQRMQKIEELLEKVCSENPLDPNKTKQWMKASIKLSDPSKAIK 248
Query: 526 MAPYHMSASELAELKKQLEDLLDKKFVRPSVSPWGAPVLLV----KKKDGSMRLCIDYRQ 581
+ P S + E KQ+++LLD K ++PS SP AP LV +K+ G R+ ++Y+
Sbjct: 249 VKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKA 308
Query: 582 LNKVTIKNRYPLPRIDDLMDQLVGAKVFSKIDLRSGYHQIKVKDEDMQKTAFRTRYGHYE 641
+NK TI + Y LP D+L+ + G K+FS D +SG+ Q+ + E TAF GHYE
Sbjct: 309 MNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYE 368
Query: 642 YKVMPFGVTNAPGVFMEYMNRIFHAFLDKFVVVFIDDILIYSKIEEEHAKHLKIVLQILK 701
+ V+PFG+ AP +F +M+ F F KF V++DDIL++S EE+H H+ ++LQ
Sbjct: 369 WNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHLLHVAMILQKCN 427
Query: 702 ERKLYAKLSKCEFWLSEVSFLGHVTSGNGIAVDPSKVDAVSQW-ETPKSVTEIRSFLGLA 760
+ + K + + +++FLG ++ ++++ +T + +++ FLG+
Sbjct: 428 QHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGIL 487
Query: 761 GYYRRFIEGFSKLALPLTQLTYKGKSFVWDAQCESSFNELKQRLTTAPILILPKPEEPFV 820
Y +I +++ PL + + W + ++K+ L P L P PEE +
Sbjct: 488 TYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLI 547
Query: 821 VYCDASKFGLGGVL----MQDGK----VVAYASRQLRVHEKNSPTHDLELAAVVFVLKIW 872
+ DAS GG+L + +G + YAS + E+N ++D E AV+ +K +
Sbjct: 548 IETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKF 607
Query: 873 RHYLYGSRFEVFSDHK------SLKYLFDQKELNMRQRRWLELLKDYDFCLNYHPDKAKV 926
YL F + +D+ +L Y D K R RW L Y F + +
Sbjct: 608 SIYLTPVHFLIRTDNTHFKSFVNLNYKGDSK--LGRNIRWQAWLSHYSFDVEHIKGTDNH 665
Query: 927 VADALSRK 934
AD LSR+
Sbjct: 666 FADFLSRE 673
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 674
Score = 207 bits (528), Expect = 1e-52
Identities = 175/659 (26%), Positives = 300/659 (44%), Gaps = 59/659 (8%)
Query: 326 NQTTNEDRHIRGTCFFNSTPLIAI---IDTGATHC----FIVLECAYKLGLIVSDMKGEM 378
N T +I+G +F I + +DTGA+ C F++ E + + + +
Sbjct: 19 NITNPNSIYIKGRLYFKGYKKIELHCFVDTGASLCIASKFVIPEEHW------INAERPI 72
Query: 379 VVETPAKGSVTTSLVCLRCPLSMFGRDFEVDLVCLPLLGMDVIFGMNWLEYNRVCINCFN 438
+V+ S+T + VC L + G F + V G+D I G N+ + I +
Sbjct: 73 MVKIADGSSITINKVCRDIDLIIAGEIFHIPTVYQQESGIDFIIGNNFCQLYEPFIQFTD 132
Query: 439 KT---------VHFSSAEE----------ESGAQFLTTKQLK--QLERDGILMFSLMASL 477
+ VH + ES + T+Q + + + I + S L
Sbjct: 133 RVIFTKDRTYPVHIAKLTRAVRVGTEGFLESMKKRSKTQQPEPVNISTNKIAILSEGRRL 192
Query: 478 SLENQVVID-RLPVVNEFHEVFPDEIPDVPPERE--VEFSIDLVPGAKLVSMAPYHMSAS 534
S E + R+ + E E E P P + + ++ SI L +K + + P S
Sbjct: 193 SEEKLFITQQRMQKIEELLEKVCSENPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYSPM 252
Query: 535 ELAELKKQLEDLLDKKFVRPSVSPWGAPVLLV----KKKDGSMRLCIDYRQLNKVTIKNR 590
+ E KQ+++LLD K ++PS SP AP LV +K+ G R+ ++Y+ +NK T+ +
Sbjct: 253 DREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDA 312
Query: 591 YPLPRIDDLMDQLVGAKVFSKIDLRSGYHQIKVKDEDMQKTAFRTRYGHYEYKVMPFGVT 650
Y P D+L+ + G K+FS D +SG+ Q+ + E TAF GHYE+ V+PFG+
Sbjct: 313 YNPPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLK 372
Query: 651 NAPGVFMEYMNRIFHAFLDKFVVVFIDDILIYSKIEEEHAKHLKIVLQILKERKLYAKLS 710
AP +F +M+ F F KF V++DDIL++S EE+H H+ ++LQ + +
Sbjct: 373 QAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKK 431
Query: 711 KCEFWLSEVSFLGHVTSGNGIAVDPSKVDAVSQW-ETPKSVTEIRSFLGLAGYYRRFIEG 769
K + + +++FLG ++ ++++ +T + +++ FLG+ Y +I
Sbjct: 432 KAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPK 491
Query: 770 FSKLALPLTQLTYKGKSFVWDAQCESSFNELKQRLTTAPILILPKPEEPFVVYCDASKFG 829
+++ PL + + W + ++K+ L P L P PEE ++ DAS
Sbjct: 492 LAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDY 551
Query: 830 LGGVL----MQDGK----VVAYASRQLRVHEKNSPTHDLELAAVVFVLKIWRHYLYGSRF 881
GG+L + +G + YAS + EKN ++D E AV+ +K + YL F
Sbjct: 552 WGGMLKAIKINEGTNTELICRYASGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHF 611
Query: 882 EVFSDHK------SLKYLFDQKELNMRQRRWLELLKDYDFCLNYHPDKAKVVADALSRK 934
+ +D+ +L Y D K R RW L Y F + + AD LSR+
Sbjct: 612 LIRTDNTHFKSFVNLNYKGDSK--LGRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSRE 668
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 206 bits (525), Expect = 3e-52
Identities = 179/668 (26%), Positives = 301/668 (44%), Gaps = 70/668 (10%)
Query: 326 NQTTNEDRHIRGTCFFNSTPLIAI---IDTGATHC----FIVLECAYKLGLIVSDMKGEM 378
N T +I+G +F I + +DTGA+ C F++ E + + + +
Sbjct: 17 NVTNPNSIYIKGRLYFKGYKKIELHCFVDTGASLCIASKFVIPEEHWV------NAERPI 70
Query: 379 VVETPAKGSVTTSLVCLRCPLSMFGRDFEVDLVCLPLLGMDVIFGMNW-------LEYNR 431
+V+ S+T S VC L + F++ V G+D I G N+ +++
Sbjct: 71 MVKIADGSSITISKVCKDIDLIIAREIFKIPTVYQQESGIDFIIGNNFCQLYEPFIQFTD 130
Query: 432 VCINCFNKT--VHF---------------------SSAEEESGAQFLTTKQLKQLERDGI 468
I NK+ VH S ++ T K L+ I
Sbjct: 131 RVIFTKNKSYPVHIAKLTRAVRVGTEGFLESMKKRSKTQQPEPVNISTNKIENPLKEIAI 190
Query: 469 LMFSLMASLSLENQVVID-RLPVVNEFHEVFPDEIPDVPPERE--VEFSIDLVPGAKLVS 525
L S LS E + R+ + E E E P P + + ++ SI L +K +
Sbjct: 191 L--SEGRRLSEEKLFITQQRMQKIEELLEKVCSENPLDPNKTKQWMKASIKLSDPSKAIK 248
Query: 526 MAPYHMSASELAELKKQLEDLLDKKFVRPSVSPWGAPVLLV----KKKDGSMRLCIDYRQ 581
+ P S + E KQ+++LLD K ++PS SP AP LV +K+ G R+ ++Y+
Sbjct: 249 VKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKA 308
Query: 582 LNKVTIKNRYPLPRIDDLMDQLVGAKVFSKIDLRSGYHQIKVKDEDMQKTAFRTRYGHYE 641
+NK TI + Y LP D+L+ + G K+FS D +SG+ Q+ + E TAF GHYE
Sbjct: 309 MNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYE 368
Query: 642 YKVMPFGVTNAPGVFMEYMNRIFHAFLDKFVVVFIDDILIYSKIEEEHAKHLKIVLQILK 701
+ V+PFG+ AP +F +M+ F F KF V++DDIL++S EE+H H+ ++LQ
Sbjct: 369 WNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHLLHVAMILQKCN 427
Query: 702 ERKLYAKLSKCEFWLSEVSFLGHVTSGNGIAVDPSKVDAVSQW-ETPKSVTEIRSFLGLA 760
+ + K + + +++FLG ++ ++++ +T + +++ FLG+
Sbjct: 428 QHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGIL 487
Query: 761 GYYRRFIEGFSKLALPLTQLTYKGKSFVWDAQCESSFNELKQRLTTAPILILPKPEEPFV 820
Y +I +++ PL + + W + ++K+ L P L P PEE +
Sbjct: 488 TYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLI 547
Query: 821 VYCDASKFGLGGVL----MQDGK----VVAYASRQLRVHEKNSPTHDLELAAVVFVLKIW 872
+ DAS GG+L + +G + YAS + E+N ++D E AV+ +K +
Sbjct: 548 IETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKF 607
Query: 873 RHYLYGSRFEVFSDHK------SLKYLFDQKELNMRQRRWLELLKDYDFCLNYHPDKAKV 926
YL F + +D+ +L Y D K R RW L Y F + +
Sbjct: 608 SIYLTPVHFLIRTDNTHFKSFVNLNYKGDSK--LGRNIRWQAWLSHYSFDVEHIKGTDNH 665
Query: 927 VADALSRK 934
AD LSR+
Sbjct: 666 FADFLSRE 673
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 680
Score = 204 bits (519), Expect = 1e-51
Identities = 177/668 (26%), Positives = 299/668 (44%), Gaps = 70/668 (10%)
Query: 326 NQTTNEDRHIRGTCFFNSTPLIAI---IDTGATHC----FIVLECAYKLGLIVSDMKGEM 378
N T +I+G +F I + +DTGA+ C F++ E + + + +
Sbjct: 18 NVTNPNSIYIKGRLYFKGYKKIELHCFVDTGASLCIASKFVIPEEHWV------NAERPI 71
Query: 379 VVETPAKGSVTTSLVCLRCPLSMFGRDFEVDLVCLPLLGMDVIFGMNW-------LEYNR 431
+V+ S+T S VC L + G F++ V G+D I G N+ +++
Sbjct: 72 MVKIADGSSITISKVCKDIDLIIVGVIFKIPTVYQQESGIDFIIGNNFCQLYEPFIQFTD 131
Query: 432 VCINCFNKT--VHF---------------------SSAEEESGAQFLTTKQLKQLERDGI 468
I NK+ VH S ++ T K LE I
Sbjct: 132 RVIFTKNKSYPVHIAKLTRAVRVGTEGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAI 191
Query: 469 LMFSLMASLSLENQVVID-RLPVVNEFHEVFPDEIPDVPPERE--VEFSIDLVPGAKLVS 525
L S LS E + R+ E E E P P + + ++ SI L +K +
Sbjct: 192 L--SEGRRLSEEKLFITQQRMQKTEELLEKVCSENPLDPNKTKQWMKASIKLSDPSKAIK 249
Query: 526 MAPYHMSASELAELKKQLEDLLDKKFVRPSVSPWGAPVLLVKKKD----GSMRLCIDYRQ 581
+ P S + E KQ+++LLD K ++PS SP AP LV + G+ R+ ++Y+
Sbjct: 250 VKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAENGRGNKRMVVNYKA 309
Query: 582 LNKVTIKNRYPLPRIDDLMDQLVGAKVFSKIDLRSGYHQIKVKDEDMQKTAFRTRYGHYE 641
+NK T+ + Y LP D+L+ + G K+FS D +SG+ Q+ + E TAF GHYE
Sbjct: 310 MNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYE 369
Query: 642 YKVMPFGVTNAPGVFMEYMNRIFHAFLDKFVVVFIDDILIYSKIEEEHAKHLKIVLQILK 701
+ V+PFG+ AP +F +M+ F F KF V++DDI+++S EE+H H+ ++LQ
Sbjct: 370 WNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDIVVFSNNEEDHLLHVAMILQKCN 428
Query: 702 ERKLYAKLSKCEFWLSEVSFLGHVTSGNGIAVDPSKVDAVSQW-ETPKSVTEIRSFLGLA 760
+ + K + + +++FLG ++ ++++ +T + +++ FLG+
Sbjct: 429 QHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGIL 488
Query: 761 GYYRRFIEGFSKLALPLTQLTYKGKSFVWDAQCESSFNELKQRLTTAPILILPKPEEPFV 820
Y +I +++ PL + + W + ++K+ L P L P PEE +
Sbjct: 489 TYASDYIPNLAQMRQPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLI 548
Query: 821 VYCDASKFGLGGVL----MQDGK----VVAYASRQLRVHEKNSPTHDLELAAVVFVLKIW 872
+ DAS GG+L + +G + Y S + E+N ++D E AV+ +K +
Sbjct: 549 IETDASDDYWGGMLKAIKINEGTNTELICRYRSGSFKAAERNYHSNDKETLAVINTIKKF 608
Query: 873 RHYLYGSRFEVFSDHK------SLKYLFDQKELNMRQRRWLELLKDYDFCLNYHPDKAKV 926
YL F + +D+ +L Y D K R RW L Y F + +
Sbjct: 609 SIYLTPVHFLIRTDNTHFKSFVNLNYKGDSK--LGRNIRWQAWLSHYSFDVEHIKGTDNH 666
Query: 927 VADALSRK 934
AD LSR+
Sbjct: 667 FADFLSRE 674
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 666
Score = 201 bits (511), Expect = 1e-50
Identities = 165/646 (25%), Positives = 297/646 (45%), Gaps = 47/646 (7%)
Query: 326 NQTTNEDRHIRGTCFFN---STPLIAIIDTGATHC----FIVLECAYKLGLIVSDMKGEM 378
N T +I G F S + +DTGA+ C +I+ E ++ + ++
Sbjct: 26 NVTNPNSIYIEGKLSFEGYKSFNIHCYVDTGASLCIASRYIIPEELWE------NSPKDI 79
Query: 379 VVETPAKGSVTTSLVCLRCPLSMFGRDFEVDLVCLPLLGMDVIFGMNWLE-YN------- 430
V+ + + + VC + G+ FE+ V G+D + G N+ YN
Sbjct: 80 QVKIANQELIKITKVCKNLKVKFAGKSFEIPTVYQQETGIDFLIGNNFCRLYNPFIQWED 139
Query: 431 RVCINCFNKTV---HFSSAEEESGAQFLTT--KQLKQLERDGILMFSLMASLSLENQVVI 485
R+ + N+ V + A S FL K K + G + + + ++
Sbjct: 140 RIAFHLKNEMVLIKKVTKAFSVSNPSFLENMKKDSKTEQIPGTNISKNIINPEERYFLIT 199
Query: 486 DRLPVVNEFHEVFPDEIPDVPPERE--VEFSIDLVPGAKLVSMAPYHMSASELAELKKQL 543
++ + + + E P P + + ++ SI L+ K++ + P S + KQ+
Sbjct: 200 EKYQKIEQLLDKVCSENPIDPIKSKQWMKASIKLIDPLKVIRVKPMSYSPQDREGFAKQI 259
Query: 544 EDLLDKKFVRPSVSPWGAPVLLVK----KKDGSMRLCIDYRQLNKVTIKNRYPLPRIDDL 599
++LLD + PS S +P LV+ ++ G R+ ++Y+ +N+ TI + + LP + +L
Sbjct: 260 KELLDLGLIIPSKSQHMSPAFLVENEAERRRGKKRMVVNYKAINQATIGDSHNLPNMQEL 319
Query: 600 MDQLVGAKVFSKIDLRSGYHQIKVKDEDMQKTAFRTRYGHYEYKVMPFGVTNAPGVFMEY 659
+ L G +FS D +SG+ Q+ + +E + TAF GH+++KV+PFG+ AP +F +
Sbjct: 320 LTLLRGKSIFSSFDCKSGFWQVVLDEESQKLTAFTCPQGHFQWKVVPFGLKQAPSIFQRH 379
Query: 660 MNRIFHAFLDKFVVVFIDDILIYSKIEEEHAKHLKIVLQILKERKLYAKLSKCEFWLSEV 719
M + DKF +V++DDI+++S E +H H+ VL+I+++ + K + ++
Sbjct: 380 MQTALNG-ADKFCMVYVDDIIVFSNSELDHYNHVYAVLKIVEKYGIILSKKKANLFKEKI 438
Query: 720 SFLG-HVTSGNGIAVDPSKVDAVSQWETPKSVTEIRSFLGLAGYYRRFIEGFSKLALPLT 778
+FLG + G + + + + ++ FLG+ Y +I +++ PL
Sbjct: 439 NFLGLEIDKGTHCPQNHILENIHKFPDRLEDKKHLQRFLGVLTYAETYIPKLAEIRKPLQ 498
Query: 779 QLTYKGKSFVWDAQCESSFNELKQRLTTAPILILPKPEEPFVVYCDASKFGLGGVLMQ-- 836
K ++ W ++K+ L + P L LPKPE+ ++ DAS GGVL
Sbjct: 499 VKLKKDVTWNWTQSDSDYVKKIKKNLGSFPKLYLPKPEDHLIIETDASDSFWGGVLKARA 558
Query: 837 -DG--KVVAYASRQLRVHEKNSPTHDLELAAVVFVLKIWRHYLYGSRFEVFSDHKSLKYL 893
DG + Y+S + EKN ++D EL AV V+ + YL RF V +D+K+ Y
Sbjct: 559 LDGVELICRYSSGSFKQAEKNYHSNDKELLAVKQVITKFSAYLTPVRFTVRTDNKNFTYF 618
Query: 894 F------DQKELNMRQRRWLELLKDYDFCLNYHPDKAKVVADALSR 933
D K+ R RW Y F + + V+AD L+R
Sbjct: 619 LRINLKGDSKQ--GRLVRWQNWFSKYQFDVEHLEGVKNVLADCLTR 662
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 659
Score = 185 bits (470), Expect = 7e-46
Identities = 154/661 (23%), Positives = 303/661 (45%), Gaps = 60/661 (9%)
Query: 326 NQTTNEDRHIRGTCFF----NSTPLIAIIDTGATHC----FIVLECAYKLG---LIVSDM 374
N+T +++G F + L +DTG++ C +++ E ++ L +
Sbjct: 5 NRTNPNSIYVKGILKFPGYQTNLDLHCYVDTGSSLCMASKYVIPEEYWQTAEKPLNIKIA 64
Query: 375 KGEMVVETPAKGSVTTSLVCLRCPLSMFGRDFEVDLVCLPLLGMDVIFGMNWLEY----- 429
G+++ T VC + P+ + G F + + G+D++ G N+ +
Sbjct: 65 NGKIIQLTK---------VCSKLPIRLGGERFLIPTLFQQESGIDLLLGNNFCQLYSPFI 115
Query: 430 ---NRVCINCFNKTV---HFSSAEEESGAQFLTT-KQLKQLERDGILMFSLMASLSLEN- 481
+R+ + ++V + A + FL + K+ ++ R + + L LE
Sbjct: 116 QYTDRIYFHLNKQSVIIGKITKAYQYGVKGFLESMKKKSKVNRPEPINITSNQHLFLEEG 175
Query: 482 ---------QVVIDRLPVVNEFHEVFPDEIPDVPPEREVEF---SIDLVPGAKLVSMAPY 529
++ I + + E E E P + PE+ ++ +I+L+ +V + P
Sbjct: 176 GNHVDEMLYEIQISKFSAIEEMLERVSSENP-IDPEKSKQWMTATIELIDPKTVVKVKPM 234
Query: 530 HMSASELAELKKQLEDLLDKKFVRPSVSPWGAPVLLVK----KKDGSMRLCIDYRQLNKV 585
S S+ E +Q+++LL+ K ++PS S +P LV+ ++ G R+ ++Y+ +NK
Sbjct: 235 SYSPSDREEFDRQIKELLELKVIKPSKSTHMSPAFLVENEAERRRGKKRMVVNYKAMNKA 294
Query: 586 TIKNRYPLPRIDDLMDQLVGAKVFSKIDLRSGYHQIKVKDEDMQKTAFRTRYGHYEYKVM 645
T + + LP D+L+ + G K++S D +SG Q+ + E TAF GHY++ V+
Sbjct: 295 TKGDAHNLPNKDELLTLVRGKKIYSSFDCKSGLWQVLLDKESQLLTAFTCPQGHYQWNVV 354
Query: 646 PFGVTNAPGVFMEYMNRIFHAFLDKFVVVFIDDILIYSKI-EEEHAKHLKIVLQILKERK 704
PFG+ AP +F + K+ V++DDIL++S +EH H+ +L+ ++
Sbjct: 355 PFGLKQAPSIFPKTYANSHSNQYSKYCCVYVDDILVFSNTGRKEHYIHVLNILRRCEKLG 414
Query: 705 LYAKLSKCEFWLSEVSFLGHVTSGNGIAVDPSKVDAVSQW-ETPKSVTEIRSFLGLAGYY 763
+ K + + +++FLG ++ + ++ + + +++ FLG+ Y
Sbjct: 415 IILSKKKAQLFKEKINFLGLEIDQGTHCPQNHILEHIHKFPDRIEDKKQLQRFLGILTYA 474
Query: 764 RRFIEGFSKLALPLTQLTYKGKSFVWDAQCESSFNELKQRLTTAPILILPKPEEPFVVYC 823
+I + + PL + ++ W+ ++K+ L + P L P+P + V+
Sbjct: 475 SDYIPKLASIRKPLQSKLKEDSTWTWNDTDSQYMAKIKKNLKSFPKLYHPEPNDKLVIET 534
Query: 824 DASKFGLGGVLM----QDGKVVAYASRQLRVHEKNSPTHDLELAAVVFVLKIWRHYLYGS 879
DAS+ GG+L + YAS + E+N +++ EL AV+ V+K + YL S
Sbjct: 535 DASEEFWGGILKAIHNSHEYICRYASGSFKAAERNYHSNEKELLAVIRVIKKFSIYLTPS 594
Query: 880 RFEVFSDHKSLKYLFDQKELNMRQR----RWLELLKDYDFCLNYHPDKAKVVADALSRKT 935
RF + +D+K+ + + R++ RW L YDF + + V AD L T
Sbjct: 595 RFLIRTDNKNFTHFVNINLKGDRKQGRLVRWQMWLSQYDFDVEHIAGTKNVFADFLQENT 654
Query: 936 L 936
L
Sbjct: 655 L 655
>POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;
Protease (EC 3.4.23.-); Reverse transcriptase (EC
2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1886
Score = 174 bits (441), Expect = 2e-42
Identities = 159/634 (25%), Positives = 288/634 (45%), Gaps = 61/634 (9%)
Query: 348 AIIDTGATHCFIVL----ECAYKLGLIVSDMKGEMVVETPAK----GSVTTSLVCLRCPL 399
AI+DTGAT C I + E Y+ + + + + + T + G + R P+
Sbjct: 1217 AIVDTGATACLIQISAIPENYYEDAKVTVNFRSVLGIGTSTQMIKAGRILIGEQYFRMPV 1276
Query: 400 SMFGRDFEVDLVCLPLLGMDVIFGMNWLEYNRVCINCFNKTVHF----SSAEEESGAQFL 455
+ + +++ P G+ +I G +++ + + F +S E Q
Sbjct: 1277 T-----YVMNMGLSP--GIQMIIGCSFIRSLEGGLRIEKDIITFYKLVTSIETSRTTQVA 1329
Query: 456 TTKQLKQLERDGILMF--SLMASLSLENQVVIDRLPVVNEFHEVFPDEIPDVPPE----R 509
+ + +L D L S+ L+ + ++ E E+ I + P E
Sbjct: 1330 NSIEELELSEDEYLNIAASVETPSFLDQEFARKNKDLLKEMKEM--KYIGENPMEFWKNN 1387
Query: 510 EVEFSIDLV-PGAKLVSMAPYHMSASELAELKKQLEDLLDKKFVRPSVSPWGAPVLLV-- 566
+++ ++++ P K++ H++ + + +Q+ LL K +RPS S + +V
Sbjct: 1388 KIKCKLNIINPDIKIMGRPIKHVTPGDEEAMTRQINLLLQMKVIRPSESKHRSTAFIVRS 1447
Query: 567 ---------KKKDGSMRLCIDYRQLNKVTIKNRYPLPRIDDLMDQLVGAKVFSKIDLRSG 617
K+K G R+ +Y+ LN+ T ++Y LP I+ ++ ++ +K++SK DL+SG
Sbjct: 1448 GTEIDPITGKEKKGKERMVFNYKLLNENTESDQYSLPGINTIISKVGRSKIYSKFDLKSG 1507
Query: 618 YHQIKVKDEDMQKTAFRTRYGHYEYKVMPFGVTNAPGVFMEYMNRIFHAFLDKFVVVFID 677
+ Q+ +++E + TAF YE+ VMPFG+ NAP +F M+ +F +KF+ V+ID
Sbjct: 1508 FWQVAMEEESVPWTAFLAGNKLYEWLVMPFGLKNAPAIFQRKMDNVFKG-TEKFIAVYID 1566
Query: 678 DILIYSKIEEEHAKHLKIVLQILKERKLYAKLSKCEFWLSEVSFLGHVTSGNGIAVDPSK 737
DIL++S+ E+H++HL +LQ+ KE L +K + E+ FLG I + P
Sbjct: 1567 DILVFSETAEQHSQHLYTMLQLCKENGLILSPTKMKIGTPEIDFLGASLGCTKIKLQPHI 1626
Query: 738 VDAVSQWETPKSVTE--IRSFLGLAGYYRRFIEGFSKLALPLTQLTYKGKSFVWDAQCES 795
+ + + K T +RS+LG+ Y R +I+ KL PL Q + +
Sbjct: 1627 ISKICDFSDEKLATPEGMRSWLGILSYARNYIQDIGKLVQPLRQKMAPTGDKRMNPETWK 1686
Query: 796 SFNELKQRLTTAPILILPKPEEPFVVYCDASKFGLGGVL---------MQDGKVVAYASR 846
++K+++ P L LP + ++ D G G V ++ AYAS
Sbjct: 1687 MVRQIKEKVKNLPDLQLPPKDSFIIIETDGCMTGWGAVCKWKMSKHDPRSTERICAYASG 1746
Query: 847 QLRVHEKNSPTHDLELAAVVFVL-KIWRHYLYGSRFEVFSDHKSLKYLFDQKELNMRQR- 904
+ T D E+ A + L K +YL + SD +++ +++ N R
Sbjct: 1747 SFNPIKS---TIDAEIQAAIHGLDKFKIYYLDKKELIIRSDCEAIIKFYNKTNENKPSRV 1803
Query: 905 RWL---ELLKDYDFCLNY-HPD-KAKVVADALSR 933
RWL + L + + H D K +ADALSR
Sbjct: 1804 RWLTFSDFLTGLGITVTFEHIDGKHNGLADALSR 1837
>POL_SOCMV (P15629) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 692
Score = 160 bits (405), Expect = 2e-38
Identities = 140/532 (26%), Positives = 256/532 (47%), Gaps = 36/532 (6%)
Query: 347 IAIIDTGATHCFIVLECAYKLGLIVSDMKGEMVVETPAKGSVTTSLVCLRCPLSMFGRDF 406
+A IDTGAT CF + + ++ E+++ +K + ++ + L + ++F
Sbjct: 32 LAYIDTGATLCFGKRKISNNWEILKQPK--EIIIADKSKHYIREAISNVF--LKIENKEF 87
Query: 407 EVDLVCLPLLGMDVIFGMNWLEYNRVCINCFNKT-VHFSSAEEESGAQFLTTKQLKQLER 465
+ ++ L G+D+I G N+L+ + I + + + +Q ++TK L + E
Sbjct: 88 LIPIIYLHDSGLDLIIGNNFLKLYQPFIQRLETIELRWKNLNNPKESQMISTKILTKNE- 146
Query: 466 DGILMFSL-MASLSLENQVVIDRLPVVNEFHEVFPDEIPDVPPERE---VEFSI-DLVPG 520
+L S + LE + + + EV + D + +E + D +
Sbjct: 147 --VLKLSFEKIHICLEKYLFFKTIE--EQLEEVCSEHPLDETKNKNGLLIEIRLKDPLQE 202
Query: 521 AKLVSMAPYHMSASELAELKKQLEDLLDKKFVRPSVSPWGAPVLLVKK----KDGSMRLC 576
+ + PY + ++ E K++ EDLL K +R S SP AP V+ K G R+
Sbjct: 203 INVTNRIPY--TIRDVQEFKEECEDLLKKGLIRESQSPHSAPAFYVENHNEIKRGKRRMV 260
Query: 577 IDYRQLNKVTIKNRYPLPRIDDLMDQLVGAKVFSKIDLRSGYHQIKVKDEDMQKTAFR-T 635
I+Y+++N+ TI + Y LPR D +++++ G+ FS +D +SGY+Q+++ + TAF
Sbjct: 261 INYKKMNEATIGDSYKLPRKDFILEKIKGSLWFSSLDAKSGYYQLRLHENTKPLTAFSCP 320
Query: 636 RYGHYEYKVMPFGVTNAPGVFMEYMNRIFHAFLDKFVVVFIDDILIYSK-IEEEHAKHLK 694
HYE+ V+ FG+ AP ++ +M++ L+ + +IDDILI++K +E+H ++
Sbjct: 321 PQKHYEWNVLSFGLKQAPSIYQRFMDQSLKG-LEHICLAYIDDILIFTKGSKEQHVNDVR 379
Query: 695 IVLQILKERKLYAKLSKCEFWLSEVSFLGHVTSGNG-IAVDPSKVDAVSQW-ETPKSVTE 752
IVLQ +KE+ + K + E+ +LG GNG I + P + + Q+ + + +
Sbjct: 380 IVLQRIKEKGIIISKKKSKLIQQEIEYLGLKIQGNGEIDLSPHTQEKILQFPDELEDRKQ 439
Query: 753 IRSFLGLAGYYRRFIEGFSK-LALPLTQLTYK---GKSFVWDAQCESSFNELKQRLTTAP 808
I+ FLG Y EGF K LAL L K + WD +K ++ + P
Sbjct: 440 IQRFLGCINYIAN--EGFFKNLALERKHLQKKISVKNPWKWDTIDTKMVQSIKGKIQSLP 497
Query: 809 ILILPKPEEPFVVYCDASKFGLGGVLMQDGKVVAYASRQLRVHEKNSPTHDL 860
L ++ +V DAS+ G L + + +++ + E PT DL
Sbjct: 498 KLYNASIQDFLIVETDASQHSWSGCL----RALPKGKQKIGLDEFGIPTADL 545
>POL_BAEVM (P10272) Pol polyprotein [Contains: Protease (EC
3.4.23.-); Reverse transcriptase/ribonuclease H (EC
2.7.7.49) (EC 3.1.26.4) (RT); Integrase (IN)]
Length = 1189
Score = 152 bits (385), Expect = 5e-36
Identities = 147/568 (25%), Positives = 248/568 (42%), Gaps = 57/568 (10%)
Query: 345 PLIAIIDTGATHCFIVLECAYKLGLIVSDMKGEMVVETPAKGSVTTSLVCLRCPLSMFGR 404
P ++DTGA H + K +S + T K T+ + M
Sbjct: 21 PTTFLVDTGAQHSVLT-----KANGPLSSRTSWVQGATGRKMHKWTNRRTVNLGQGMVTH 75
Query: 405 DFEVDLVC-LPLLGMDVIFGMNWLEYNRVCINCFNKTVHFSSAEEESGAQFLTTKQLKQL 463
F V C PLLG D++ + +HFS E+GAQ L
Sbjct: 76 SFLVVPECPYPLLGRDLLTKLG-------------AQIHFS----EAGAQVL-------- 110
Query: 464 ERDGILMFSLMASLSLENQVVIDRLPVVNEFHEVFPDEIPDVPPER--------EVEFSI 515
+RDG + L SL E+++ +PV +V+ + P E + I
Sbjct: 111 DRDGQPIQILTVSLQDEHRLF--DIPVTTSLPDVWLQDFPQAWAETGGLGRAKCQAPIII 168
Query: 516 DLVPGAKLVSMAPYHMSASELAELKKQLEDLLDKKFVRPSVSPWGAPVLLVKKKDGS-MR 574
DL P A VS+ Y MS +++ + L+ +RP SPW P+L VKK R
Sbjct: 169 DLKPTAVPVSIKQYPMSLEAHMGIRQHIIKFLELGVLRPCRSPWNTPLLPVKKPGTQDYR 228
Query: 575 LCIDYRQLNKVTIKNRYPLPRIDDLMDQLV-GAKVFSKIDLRSGYHQIKVKDEDMQKTAF 633
D R++NK T+ +P +L+ L ++ +DL+ + + + + + AF
Sbjct: 229 PVQDLREINKRTVDIHPTVPNPYNLLSTLKPDYSWYTVLDLKDAFFCLPLAPQSQELFAF 288
Query: 634 RTR------YGHYEYKVMPFGVTNAPGVFMEYMNRIFHAFLDKFVVV----FIDDILIYS 683
+ G + +P G N+P +F E ++R F + V ++DD+L+ +
Sbjct: 289 EWKDPERGISGQLTWTRLPQGFKNSPTLFDEALHRDLTDFRTQHPEVTLLQYVDDLLLAA 348
Query: 684 KIEEEHAKHLKIVLQILKERKLYAKLSKCEFWLSEVSFLGHVTSGNGIAVDPSKVDAVSQ 743
++ + + +LQ L E+ A K + ++V++LG++ S + P +++ V++
Sbjct: 349 PTKKACTQGTRHLLQELGEKGYRASAKKAQICQTKVTYLGYILSEGKRWLTPGRIETVAR 408
Query: 744 WETPKSVTEIRSFLGLAGYYRRFIEGFSKLALPLTQLTYKGKSFVWDAQCESSFNELKQR 803
P++ E+R FLG AG+ R +I GF++LA PL LT + F W + + +F LK+
Sbjct: 409 IPPPRNPREVREFLGTAGFCRLWIPGFAELAAPLYALTKESTPFTWQTEHQLAFEALKKA 468
Query: 804 LTTAPILILPKPEEPFVVYCDASKFGLGGVLMQD----GKVVAYASRQLRVHEKNSPTHD 859
L +AP L LP +PF ++ D + GVL Q + VAY S++L P
Sbjct: 469 LLSAPALGLPDTSKPFTLFLDERQGIAKGVLTQKLGPWKRPVAYLSKKLDPVAAGWPPCL 528
Query: 860 LELAAVVFVLKIWRHYLYGSRFEVFSDH 887
+AA ++K G V + H
Sbjct: 529 RIMAATAMLVKDSAKLTLGQPLTVITPH 556
Score = 68.2 bits (165), Expect = 2e-10
Identities = 67/240 (27%), Positives = 103/240 (42%), Gaps = 14/240 (5%)
Query: 1018 GRICIPDNEELKKLILEEGHKSNLSIHLGATKMYQDLKKL-FWWSGLKKDVARFVYACLT 1076
G+I +P E L + + + HLG K+ ++K F + + AC
Sbjct: 828 GKIVLPQKEALAMI-----QQMHAWTHLGNRKLKLLIEKTDFLIPRASTLIEQVTSACKV 882
Query: 1077 CQKSKVEHQR-PAGLLTPLDVPEWKWDSISMDFVSSLPNTSRGHDSIWVVVDRLTKSAHF 1135
CQ+ R PAG T + P W+ +DF P+ + G+ + V VD +
Sbjct: 883 CQQVNAGATRVPAGKRTRGNRPGVYWE---IDFTEVKPHYA-GYKYLLVFVDTFSGWVEA 938
Query: 1136 IPINISYPVAQLAEIYIQNIVKLHGVPSSIVSDRDPRFTSRFWRSLQDALGSKLKLSSAY 1195
P +A+ ++ I G+P I SD P F S+ + L LG KL AY
Sbjct: 939 FPTR-QETAHIVAKKILEEIFPRFGLPKVIGSDNGPAFVSQVSQGLARILGINWKLHCAY 997
Query: 1196 HPQTDGQSERTIQSLEDLLRVCVLEQG-GAWDSHLPLIEFTYNNSYHSSIGMAPFEALYG 1254
PQ+ GQ ER +++++ L LE G W L L N+ + G+ P+E LYG
Sbjct: 998 RPQSSGQVERMNRTIKETLTKLTLETGLKDWRRLLSLALLRARNT-PNRFGLTPYEILYG 1056
>POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat
protein; Protease (EC 3.4.23.-); Reverse transcriptase
(EC 2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1675
Score = 150 bits (378), Expect = 3e-35
Identities = 110/400 (27%), Positives = 199/400 (49%), Gaps = 24/400 (6%)
Query: 528 PYHMSASELAELKKQLEDLLDKKFVRPS--VSPWGAPVLLVKKKDGSM----RLCIDYRQ 581
PY + E+ E KQ+++LLD K ++ + +V+ + R+ +Y++
Sbjct: 1190 PYTPADKEVFE--KQIKELLDNKLIKKADPTCRHRTAAFIVRNHSEEVAQKPRIVYNYKR 1247
Query: 582 LNKVTIKNRYPLPRIDDLMDQLVGAKVFSKIDLRSGYHQIKVKDEDMQKTAFRTRYGHYE 641
LN + + +P +++ + A +FSK DL++G+H +K+KD+ T F G Y
Sbjct: 1248 LNDNMHTDPFNIPHKISMINLIQKANIFSKFDLKAGFHHMKLKDDFKDWTTFTCSEGLYT 1307
Query: 642 YKVMPFGVTNAPGVFMEYMNRIFHAFLDKFVVVFIDDILIYSKIEEEHAKHLKIVLQILK 701
+ V PFG+ NAP F +M F KF +++IDDILI S E+EH +HLKI +K
Sbjct: 1308 WNVCPFGIANAPCAFQRFMQESFGDL--KFALLYIDDILIASNNEKEHIEHLKIFFNRVK 1365
Query: 702 ERKLYAKLSKCEFWLSEVSFLGHVTSGNGIAVDPSKVDAVSQWETPK--SVTEIRSFLGL 759
E K + +L EV +LG I++ P VD + +++ K ++ ++++LGL
Sbjct: 1366 EVGCVLSKKKSKMFLKEVEYLGVEIKEGKISLQPHIVDKIKKFDKNKLNTLKGLQAYLGL 1425
Query: 760 AGYYRRFIEGFSKLALPLTQLTYKGKSFVWDAQCESSFNELKQRLTTAPILILPKPEEPF 819
Y R +I+ SKL PL + T K +++ + + ++++ ++ L PK +
Sbjct: 1426 LNYARGYIKDLSKLVGPLYKKTGKNGQRIFNKEDWNIIFKIEREVSKIKPLERPKETDYI 1485
Query: 820 VVYCDASKFGLGGVLM---------QDGKVVAYASRQLRVHEKNSPTHDLELAAVVFVLK 870
++ DAS+ G G VL+ K+ YAS +K + D E+ A+ L
Sbjct: 1486 IIETDASEEGWGAVLVCKPDKYSGKDTEKIAGYASGNFG-EKKTWTSLDYEIEAINEALN 1544
Query: 871 IWRHYLYGSRFEVFSDHKSLKYLFDQKELNMRQR-RWLEL 909
++ YL F + +D +++ ++ R + RW++L
Sbjct: 1545 KFQIYL-DKDFTIRTDCEAIVKGIKTEDYKKRSKTRWIKL 1583
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.321 0.137 0.411
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 172,585,232
Number of Sequences: 164201
Number of extensions: 7562910
Number of successful extensions: 20070
Number of sequences better than 10.0: 242
Number of HSP's better than 10.0 without gapping: 120
Number of HSP's successfully gapped in prelim test: 123
Number of HSP's that attempted gapping in prelim test: 19056
Number of HSP's gapped (non-prelim): 676
length of query: 1451
length of database: 59,974,054
effective HSP length: 123
effective length of query: 1328
effective length of database: 39,777,331
effective search space: 52824295568
effective search space used: 52824295568
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 72 (32.3 bits)
Medicago: description of AC137666.4
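
Note on the statistics block: the effective lengths and the effective search space reported above follow from the raw query length, database size, and effective HSP length by simple arithmetic. The sketch below reproduces that arithmetic from the numbers printed in this report. The function and variable names are illustrative only and are not part of BLAST. The E-value helper uses the standard relation E = (search space) * 2^(-bit score); because the report prints rounded bit scores, it should only be expected to match the listed Expect values to within an order of magnitude.

```python
# Values copied from the statistics block of this report.
QUERY_LENGTH = 1451
DB_LETTERS = 59_974_054
DB_SEQUENCES = 164_201
EFFECTIVE_HSP_LENGTH = 123


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space).

    The HSP length correction is subtracted once from the query and once
    per database sequence from the database.
    """
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def approx_expect(bit_score, search_space):
    """Approximate E-value from a (rounded) bit score: E = N * 2**(-S')."""
    return search_space * 2.0 ** (-bit_score)


eff_query, eff_db, space = effective_search_space(
    QUERY_LENGTH, DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH
)
print(eff_query)  # 1328
print(eff_db)     # 39777331
print(space)      # 52824295568

# Rough check against the POL_CAMVS hit (211 bits, Expect = 1e-53).
print(approx_expect(211, space))
```

The three printed integers agree with the "effective length of query", "effective length of database", and "effective search space" lines above.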