
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0045.8
(1555 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 491 e-138
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 489 e-137
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 487 e-137
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 361 1e-98
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 354 9e-97
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 331 1e-89
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 322 7e-87
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 296 4e-79
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 254 1e-66
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 209 5e-53
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot... 209 6e-53
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 207 2e-52
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 207 2e-52
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 206 4e-52
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 203 3e-51
POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;... 181 2e-44
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot... 177 2e-43
M860_ARATH (P92523) Hypothetical mitochondrial protein AtMg00860... 151 1e-35
POL_SOCMV (P15629) Enzymatic polyprotein [Contains: Aspartic pro... 149 8e-35
POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat pr... 138 1e-31
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 491 bits (1265), Expect = e-138
Identities = 335/1037 (32%), Positives = 519/1037 (49%), Gaps = 76/1037 (7%)
Query: 435 LVDCGATSNFISQELVAELEIPVVATSEYVVEVGNGARERNSGVCKNLKLEVQGIPIIQH 494
L+D GA +N I++E V ++P S+ V+ G + N K +KL +
Sbjct: 269 LIDTGAQANIITEETVRAHKLPTRPWSKSVIYGGVYPNKINR---KTIKLNI-------- 317
Query: 495 FFILGLGGTELVLGMDWLASLGNIEANFQDLIIKWELNGQKMCMQGEPSFCKVAATWKSI 554
SL I + L++K K SF + I
Sbjct: 318 -------------------SLNGISIKTEFLVVK------KFSHPAAISFTTLYDNNIEI 352
Query: 555 KKTKHDEGEEYFLSYECSEEEPTANVTIPELWIKLLTEFPEVFQEPKELPPKRATDHAIL 614
+KH + +S E E +P+++ + E E P K L
Sbjct: 353 SSSKHTLSQMNKVSNIVKEPE------LPDIYKEFKDITAETNTEKLPKPIKGLEFEVEL 406
Query: 615 LQEGAPIPNIRPYRYPFYQKNEIEKLVKEMLAAGIIRHSTSPFSSPAILVKKKDGGWRFC 674
QE +P IR Y P + + + + L +GIIR S + + P + V KK+G R
Sbjct: 407 TQENYRLP-IRNYPLPPGKMQAMNDEINQGLKSGIIRESKAINACPVMFVPKKEGTLRMV 465
Query: 675 VDYRALNKVTIPDKFPIPIIDELLDEIGTAEIFSKLDLKSGYHQIRMREEDIPKTAFRTH 734
VDY+ LNK P+ +P+P+I++LL +I + IF+KLDLKS YH IR+R+ D K AFR
Sbjct: 466 VDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCP 525
Query: 735 EGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFYDILIYSNNVDLHKEHLREV 794
G +EYLV+P+G++ AP+ FQ +N +L V+ + DILI+S + H +H+++V
Sbjct: 526 RGVFEYLVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDV 585
Query: 795 LQVLRDNHLVANQKKCSFGQSELIYLGHVISKEGVAADPSKIKDMLNWPLPKDVKGLRGF 854
LQ L++ +L+ NQ KC F QS++ ++G+ IS++G I +L W PK+ K LR F
Sbjct: 586 LQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQF 645
Query: 855 LGLTGYYRRFVRNYSKLAQPLNQLLKKN-NFSWSAGATQAFDKLKEIMTTVPVLAVPDFQ 913
LG Y R+F+ S+L PLN LLKK+ + W+ TQA + +K+ + + PVL DF
Sbjct: 646 LGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFS 705
Query: 914 KTFVLETDASGKGLGAVLMQGG-----RPVAYMSKTLSERAQAKSVYERELMAVVLAVQK 968
K +LETDAS +GAVL Q PV Y S +S+ SV ++E++A++ +++
Sbjct: 706 KKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKH 765
Query: 969 WRHYLLGC--KFIVHTDQKSL--RFLAEQRLMGEEQQKWVSKLMGFDFEIKYKPGIENKA 1024
WRHYL F + TD ++L R E + +W L F+FEI Y+PG N
Sbjct: 766 WRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHI 825
Query: 1025 ADALSRKLQFSAISSVQCEDWE-DLETEI-LADDKYQKIIQEITTQGP-----------V 1071
ADALSR + + ED + +I + DD +++ E T V
Sbjct: 826 ADALSRIVDETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRV 885
Query: 1072 PAGYHMRRGRLL-YKNRIVLPKTSGKIPIILQEFHDSAVGGHAGIFRTYKRISALFFWEG 1130
++ G L+ K++I+LP + I++++H+ H GI I F W+G
Sbjct: 886 EENIQLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKG 945
Query: 1131 MKLDIQTYVQKCEICQRNKYETLNPAGYLQPLPIPSQVWSDISMDFIGGLPKTMGKDTIL 1190
++ IQ YVQ C CQ NK P G LQP+P + W +SMDFI LP++ G + +
Sbjct: 946 IRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALF 1005
Query: 1191 VVVDRFTKYAHFLALSHPYNAKEVAELFIKEIVKLHGFPTSIVSDRDRVFLSSFWSELFK 1250
VVVDRF+K A + + A++ A +F + ++ G P I++D D +F S W +
Sbjct: 1006 VVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAH 1065
Query: 1251 LAGTKLKFSSAYHPQTDGQTEVVNRCVETYLRCVTGAKPKQWPKWLSWAEFWYNTNYHSA 1310
+KFS Y PQTDGQTE N+ VE LRCV P W +S + YN HSA
Sbjct: 1066 KYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSA 1125
Query: 1311 IKTTPFQALYGREPPVIIKGTDSLASVNEVEKMTAERNLFLDTLKENLEKAQNRMKQQAN 1370
+ TPF+ ++ P + S + ++ ++ + E T+KE+L +MK+ +
Sbjct: 1126 TQMTPFEIVHRYSPALSPLELPSFS--DKTDENSQETIQVFQTVKEHLNTNNIKMKKYFD 1183
Query: 1371 KHRRDI-QLKVGDMVYLKIQPYKLKSLARRKNQKLSPRFYGPYPVIEKINAVAYKLQLPE 1429
++I + + GD+V +K + K+ K+ KL+P F GP+ V++K Y+L LP+
Sbjct: 1184 MKIQEIEEFQPGDLVMVK----RTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPD 1239
Query: 1430 G--SQVHPVFHVSLLKK 1444
FHVS L+K
Sbjct: 1240 SIKHMFSSTFHVSHLEK 1256
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 489 bits (1258), Expect = e-137
Identities = 334/1037 (32%), Positives = 519/1037 (49%), Gaps = 76/1037 (7%)
Query: 435 LVDCGATSNFISQELVAELEIPVVATSEYVVEVGNGARERNSGVCKNLKLEVQGIPIIQH 494
L+D GA +N I++E V ++P S+ V+ G + N K +KL +
Sbjct: 269 LIDTGAQANIITEETVRAHKLPTRPWSKSVIYGGVYPNKINR---KTIKLNI-------- 317
Query: 495 FFILGLGGTELVLGMDWLASLGNIEANFQDLIIKWELNGQKMCMQGEPSFCKVAATWKSI 554
SL I + L++K K SF + I
Sbjct: 318 -------------------SLNGISIKTEFLVVK------KFSHPAAISFTTLYDNNIEI 352
Query: 555 KKTKHDEGEEYFLSYECSEEEPTANVTIPELWIKLLTEFPEVFQEPKELPPKRATDHAIL 614
+KH + +S E E +P+++ + E E P K L
Sbjct: 353 SSSKHTLSQMNKVSNIVKEPE------LPDIYKEFKDITAETNTEKLPKPIKGLEFEVEL 406
Query: 615 LQEGAPIPNIRPYRYPFYQKNEIEKLVKEMLAAGIIRHSTSPFSSPAILVKKKDGGWRFC 674
QE +P IR Y P + + + + L +GIIR S + + P + V KK+G R
Sbjct: 407 TQENYRLP-IRNYPLPPGKMQAMNDEINQGLKSGIIRESKAINACPVMFVPKKEGTLRMV 465
Query: 675 VDYRALNKVTIPDKFPIPIIDELLDEIGTAEIFSKLDLKSGYHQIRMREEDIPKTAFRTH 734
VDY+ LNK P+ +P+P+I++LL +I + IF+KLDLKS YH IR+R+ D K AFR
Sbjct: 466 VDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCP 525
Query: 735 EGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFYDILIYSNNVDLHKEHLREV 794
G +EYLV+P+G++ AP+ FQ +N +L V+ + +ILI+S + H +H+++V
Sbjct: 526 RGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDV 585
Query: 795 LQVLRDNHLVANQKKCSFGQSELIYLGHVISKEGVAADPSKIKDMLNWPLPKDVKGLRGF 854
LQ L++ +L+ NQ KC F QS++ ++G+ IS++G I +L W PK+ K LR F
Sbjct: 586 LQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQF 645
Query: 855 LGLTGYYRRFVRNYSKLAQPLNQLLKKN-NFSWSAGATQAFDKLKEIMTTVPVLAVPDFQ 913
LG Y R+F+ S+L PLN LLKK+ + W+ TQA + +K+ + + PVL DF
Sbjct: 646 LGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFS 705
Query: 914 KTFVLETDASGKGLGAVLMQGG-----RPVAYMSKTLSERAQAKSVYERELMAVVLAVQK 968
K +LETDAS +GAVL Q PV Y S +S+ SV ++E++A++ +++
Sbjct: 706 KKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKH 765
Query: 969 WRHYLLGC--KFIVHTDQKSL--RFLAEQRLMGEEQQKWVSKLMGFDFEIKYKPGIENKA 1024
WRHYL F + TD ++L R E + +W L F+FEI Y+PG N
Sbjct: 766 WRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHI 825
Query: 1025 ADALSRKLQFSAISSVQCEDWE-DLETEI-LADDKYQKIIQEITTQGP-----------V 1071
ADALSR + + ED + +I + DD +++ E T V
Sbjct: 826 ADALSRIVDETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRV 885
Query: 1072 PAGYHMRRGRLL-YKNRIVLPKTSGKIPIILQEFHDSAVGGHAGIFRTYKRISALFFWEG 1130
++ G L+ K++I+LP + I++++H+ H GI I F W+G
Sbjct: 886 EENIQLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKG 945
Query: 1131 MKLDIQTYVQKCEICQRNKYETLNPAGYLQPLPIPSQVWSDISMDFIGGLPKTMGKDTIL 1190
++ IQ YVQ C CQ NK P G LQP+P + W +SMDFI LP++ G + +
Sbjct: 946 IRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALF 1005
Query: 1191 VVVDRFTKYAHFLALSHPYNAKEVAELFIKEIVKLHGFPTSIVSDRDRVFLSSFWSELFK 1250
VVVDRF+K A + + A++ A +F + ++ G P I++D D +F S W +
Sbjct: 1006 VVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAH 1065
Query: 1251 LAGTKLKFSSAYHPQTDGQTEVVNRCVETYLRCVTGAKPKQWPKWLSWAEFWYNTNYHSA 1310
+KFS Y PQTDGQTE N+ VE LRCV P W +S + YN HSA
Sbjct: 1066 KYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSA 1125
Query: 1311 IKTTPFQALYGREPPVIIKGTDSLASVNEVEKMTAERNLFLDTLKENLEKAQNRMKQQAN 1370
+ TPF+ ++ P + S + ++ ++ + E T+KE+L +MK+ +
Sbjct: 1126 TQMTPFEIVHRYSPALSPLELPSFS--DKTDENSQETIQVFQTVKEHLNTNNIKMKKYFD 1183
Query: 1371 KHRRDI-QLKVGDMVYLKIQPYKLKSLARRKNQKLSPRFYGPYPVIEKINAVAYKLQLPE 1429
++I + + GD+V +K + K+ K+ KL+P F GP+ V++K Y+L LP+
Sbjct: 1184 MKIQEIEEFQPGDLVMVK----RTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPD 1239
Query: 1430 G--SQVHPVFHVSLLKK 1444
FHVS L+K
Sbjct: 1240 SIKHMFSSTFHVSHLEK 1256
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 487 bits (1254), Expect = e-137
Identities = 333/1037 (32%), Positives = 518/1037 (49%), Gaps = 76/1037 (7%)
Query: 435 LVDCGATSNFISQELVAELEIPVVATSEYVVEVGNGARERNSGVCKNLKLEVQGIPIIQH 494
L+D G +N I++E V ++P S+ V+ G + N K +KL +
Sbjct: 269 LIDTGTQANIITEETVRAHKLPTRPWSKSVIYGGVYPNKINR---KTIKLNI-------- 317
Query: 495 FFILGLGGTELVLGMDWLASLGNIEANFQDLIIKWELNGQKMCMQGEPSFCKVAATWKSI 554
SL I + L++K K SF + I
Sbjct: 318 -------------------SLNGISIKTEFLVVK------KFSHPAAISFTTLYDNNIEI 352
Query: 555 KKTKHDEGEEYFLSYECSEEEPTANVTIPELWIKLLTEFPEVFQEPKELPPKRATDHAIL 614
+KH + +S E E +P+++ + E E P K L
Sbjct: 353 SSSKHTLSQMNKVSNIVKEPE------LPDIYKEFKDITAETNTEKLPKPIKGLEFEVEL 406
Query: 615 LQEGAPIPNIRPYRYPFYQKNEIEKLVKEMLAAGIIRHSTSPFSSPAILVKKKDGGWRFC 674
QE +P IR Y P + + + + L +GIIR S + + P + V KK+G R
Sbjct: 407 TQENYRLP-IRNYPLPPGKMQAMNDEINQGLKSGIIRESKAINACPVMFVPKKEGTLRMV 465
Query: 675 VDYRALNKVTIPDKFPIPIIDELLDEIGTAEIFSKLDLKSGYHQIRMREEDIPKTAFRTH 734
VDY+ LNK P+ +P+P+I++LL +I + IF+KLDLKS YH IR+R+ D K AFR
Sbjct: 466 VDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCP 525
Query: 735 EGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFYDILIYSNNVDLHKEHLREV 794
G +EYLV+P+G++ AP+ FQ +N +L V+ + +ILI+S + H +H+++V
Sbjct: 526 RGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDV 585
Query: 795 LQVLRDNHLVANQKKCSFGQSELIYLGHVISKEGVAADPSKIKDMLNWPLPKDVKGLRGF 854
LQ L++ +L+ NQ KC F QS++ ++G+ IS++G I +L W PK+ K LR F
Sbjct: 586 LQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQF 645
Query: 855 LGLTGYYRRFVRNYSKLAQPLNQLLKKN-NFSWSAGATQAFDKLKEIMTTVPVLAVPDFQ 913
LG Y R+F+ S+L PLN LLKK+ + W+ TQA + +K+ + + PVL DF
Sbjct: 646 LGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFS 705
Query: 914 KTFVLETDASGKGLGAVLMQGG-----RPVAYMSKTLSERAQAKSVYERELMAVVLAVQK 968
K +LETDAS +GAVL Q PV Y S +S+ SV ++E++A++ +++
Sbjct: 706 KKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKH 765
Query: 969 WRHYLLGC--KFIVHTDQKSL--RFLAEQRLMGEEQQKWVSKLMGFDFEIKYKPGIENKA 1024
WRHYL F + TD ++L R E + +W L F+FEI Y+PG N
Sbjct: 766 WRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHI 825
Query: 1025 ADALSRKLQFSAISSVQCEDWE-DLETEI-LADDKYQKIIQEITTQGP-----------V 1071
ADALSR + + ED + +I + DD +++ E T V
Sbjct: 826 ADALSRIVDETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRV 885
Query: 1072 PAGYHMRRGRLL-YKNRIVLPKTSGKIPIILQEFHDSAVGGHAGIFRTYKRISALFFWEG 1130
++ G L+ K++I+LP + I++++H+ H GI I F W+G
Sbjct: 886 EENIQLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKG 945
Query: 1131 MKLDIQTYVQKCEICQRNKYETLNPAGYLQPLPIPSQVWSDISMDFIGGLPKTMGKDTIL 1190
++ IQ YVQ C CQ NK P G LQP+P + W +SMDFI LP++ G + +
Sbjct: 946 IRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALF 1005
Query: 1191 VVVDRFTKYAHFLALSHPYNAKEVAELFIKEIVKLHGFPTSIVSDRDRVFLSSFWSELFK 1250
VVVDRF+K A + + A++ A +F + ++ G P I++D D +F S W +
Sbjct: 1006 VVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAH 1065
Query: 1251 LAGTKLKFSSAYHPQTDGQTEVVNRCVETYLRCVTGAKPKQWPKWLSWAEFWYNTNYHSA 1310
+KFS Y PQTDGQTE N+ VE LRCV P W +S + YN HSA
Sbjct: 1066 KYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSA 1125
Query: 1311 IKTTPFQALYGREPPVIIKGTDSLASVNEVEKMTAERNLFLDTLKENLEKAQNRMKQQAN 1370
+ TPF+ ++ P + S + ++ ++ + E T+KE+L +MK+ +
Sbjct: 1126 TQMTPFEIVHRYSPALSPLELPSFS--DKTDENSQETIQVFQTVKEHLNTNNIKMKKYFD 1183
Query: 1371 KHRRDI-QLKVGDMVYLKIQPYKLKSLARRKNQKLSPRFYGPYPVIEKINAVAYKLQLPE 1429
++I + + GD+V +K + K+ K+ KL+P F GP+ V++K Y+L LP+
Sbjct: 1184 MKIQEIEEFQPGDLVMVK----RTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPD 1239
Query: 1430 G--SQVHPVFHVSLLKK 1444
FHVS L+K
Sbjct: 1240 SIKHMFSSTFHVSHLEK 1256
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 361 bits (926), Expect = 1e-98
Identities = 190/452 (42%), Positives = 285/452 (63%), Gaps = 13/452 (2%)
Query: 589 LLTEFPEV-FQEPKELPPKRATDHAILLQEGAPIPNIRPYRYPFYQKNEIE--KLVKEML 645
LL +F + ++E ++L H + +PI + +YP Q +EIE V+EML
Sbjct: 174 LLNKFRNLEYKEGEKLTFTNTIKHVLNTTHNSPIYS---KQYPLAQTHEIEVENQVQEML 230
Query: 646 AAGIIRHSTSPFSSPAILVKKKDGG-----WRFCVDYRALNKVTIPDKFPIPIIDELLDE 700
G+IR S SP++SP +V KK +R +DYR LN++TIPD++PIP +DE+L +
Sbjct: 231 NQGLIRESNSPYNSPTWVVPKKPDASGANKYRVVIDYRKLNEITIPDRYPIPNMDEILGK 290
Query: 701 IGTAEIFSKLDLKSGYHQIRMREEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQ 760
+G + F+ +DL G+HQI M EE I KTAF T GHYEYL +PFGL NAP+TFQ MN
Sbjct: 291 LGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTKSGHYEYLRMPFGLRNAPATFQRCMNN 350
Query: 761 VLRPYLRKFVLVFFYDILIYSNNVDLHKEHLREVLQVLRDNHLVANQKKCSFGQSELIYL 820
+LRP L K LV+ DI+I+S ++ H ++ V L D +L KC F + E +L
Sbjct: 351 ILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLVFTKLADANLKLQLDKCEFLKKEANFL 410
Query: 821 GHVISKEGVAADPSKIKDMLNWPLPKDVKGLRGFLGLTGYYRRFVRNYSKLAQPLNQLLK 880
GH+++ +G+ +P K+K ++++P+P K +R FLGLTGYYR+F+ NY+ +A+P+ LK
Sbjct: 411 GHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAFLGLTGYYRKFIPNYADIAKPMTSCLK 470
Query: 881 KNN--FSWSAGATQAFDKLKEIMTTVPVLAVPDFQKTFVLETDASGKGLGAVLMQGGRPV 938
K + +AF+KLK ++ P+L +PDF+K FVL TDAS LGAVL Q G P+
Sbjct: 471 KRTKIDTQKLEYIEAFEKLKALIIRDPILQLPDFEKKFVLTTDASNLALGAVLSQNGHPI 530
Query: 939 AYMSKTLSERAQAKSVYERELMAVVLAVQKWRHYLLGCKFIVHTDQKSLRFLAEQRLMGE 998
+++S+TL++ S E+EL+A+V A + +RHYLLG +F++ +D + LR+L + G
Sbjct: 531 SFISRTLNDHELNYSAIEKELLAIVWATKTFRHYLLGRQFLIASDHQPLRWLHNLKEPGA 590
Query: 999 EQQKWVSKLMGFDFEIKYKPGIENKAADALSR 1030
+ ++W +L + F+I Y G EN ADALSR
Sbjct: 591 KLERWRVRLSEYQFKIDYIKGKENSVADALSR 622
Score = 35.8 bits (81), Expect = 0.94
Identities = 74/321 (23%), Positives = 124/321 (38%), Gaps = 36/321 (11%)
Query: 1099 IILQEFHDSAVGGHAGIFRTYKRISALFFWEGMKLDIQTYVQKCEICQRNKYETLNPAGY 1158
IILQ H+ + H GI + K F+ +L IQ + +C IC K E N
Sbjct: 753 IILQS-HEKLL--HPGIQKMTKLFKENHFFPNSQLLIQNIINECNICNLAKTEHRNTKMP 809
Query: 1159 LQPLPIPSQVWSDISMDFIGGLPKTMGKDTILVVVDRFTKYAHFLALSHPYNAKEVAEL- 1217
L+ P P F+ + + GK I +D ++K+A K+ E
Sbjct: 810 LKITPNPEH----CREKFVVDIYSSEGKHYI-SCIDIYSKFATL----EQIKTKDWIECR 860
Query: 1218 -FIKEIVKLHGFPTSIVSDRDRVFLSSFWSELFKLAGTKLKFSSAYHPQTDGQTEVVNRC 1276
+ I G P + +DRD F S + +L+ ++A + D E +++
Sbjct: 861 NALMRIFNQLGKPKLLKADRDGAFSSLALKRWLEEEEVELQLNTAKNGVAD--VERLHKT 918
Query: 1277 VETYLRCVTGAKPKQWPKWLSWAE---FWYNTNY-HSAIKTTPFQA-LYGREPPVIIKGT 1331
+ +R + + ++ LS E + YN H P Q LY P I T
Sbjct: 919 INEKIRIINSSDDEEVK--LSKIETILYTYNQKIKHDTTGQRPAQIFLYAGHP---ILDT 973
Query: 1332 DSLASVNEVEKMTAERNLFLDTLKENLEKAQNRMKQQANKHRRDIQLKVGDMVYLKI--- 1388
+ ++EK+ +R F + N K + + N + ++ D + KI
Sbjct: 974 QKIKE-KKIEKINEDRREF--NIDTNYRKGPLQKGKLENPFKPTKNVEQTDPDHYKITNR 1030
Query: 1389 ----QPYKLKSLARRKNQKLS 1405
YK + ++KN KLS
Sbjct: 1031 NRVTHYYKTQFKKQKKNNKLS 1051
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 354 bits (909), Expect = 9e-97
Identities = 224/647 (34%), Positives = 350/647 (53%), Gaps = 37/647 (5%)
Query: 426 KIGEREILILVDCGATSNFISQELVAELEIPVVATSEYVVEVGNGARERNSGVCKNLKLE 485
K E + L+D G+T N S+ + ++P+ TS ++ NG N + K+
Sbjct: 19 KYKENNLKCLIDTGSTVNMTSKNI---FDLPIQNTSTFI-HTSNGPLIVNKSIIIPSKIL 74
Query: 486 VQGIPIIQHFFILGLGGT-ELVLGMDWLASLGNIEANFQDLIIKWELNGQKM----CMQG 540
P F + +L+LG LA +++D + N K+
Sbjct: 75 ---FPTTNEFLLHPFSENYDLLLGRKLLAE-AKATISYRDQEVTLYNNKYKLIEGIATHE 130
Query: 541 EPSFCKVAATWKSIKKTKHD-----EGEEYFLSYECSEEEPTANVTIPELWIKLLTEFPE 595
+ F V ++ + + E + Y L + +EE+ + LL ++ +
Sbjct: 131 QSHFQNVNMIPDTMLRQPNKISPILESDLYRLEHLNNEEK--------QRLCALLQKYHD 182
Query: 596 V-FQEPKELPPKRATDHAILLQEGAPIPNIRPYRYPFYQKNEIEKLVKEMLAAGIIRHST 654
+ + E +L T H I + P+ + Y YP + E+E +++ML GIIR S
Sbjct: 183 IQYHEGDKLTFTNQTKHTINTKHNLPLYS--KYSYPQAYEQEVESQIQDMLNQGIIRTSN 240
Query: 655 SPFSSPAILVKKKDGG-----WRFCVDYRALNKVTIPDKFPIPIIDELLDEIGTAEIFSK 709
SP++SP +V KK +R +DYR LN++T+ D+ PIP +DE+L ++G F+
Sbjct: 241 SPYNSPIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDRHPIPNMDEILGKLGRCNYFTT 300
Query: 710 LDLKSGYHQIRMREEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKF 769
+DL G+HQI M E + KTAF T GHYEYL +PFGL NAP+TFQ MN +LRP L K
Sbjct: 301 IDLAKGFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKNAPATFQRCMNDILRPLLNKH 360
Query: 770 VLVFFYDILIYSNNVDLHKEHLREVLQVLRDNHLVANQKKCSFGQSELIYLGHVISKEGV 829
LV+ DI+++S ++D H + L V + L +L KC F + E +LGHV++ +G+
Sbjct: 361 CLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDKCEFLKQETTFLGHVLTPDGI 420
Query: 830 AADPSKIKDMLNWPLPKDVKGLRGFLGLTGYYRRFVRNYSKLAQPLNQLLKKNN--FSWS 887
+P KI+ + +P+P K ++ FLGLTGYYR+F+ N++ +A+P+ + LKKN + +
Sbjct: 421 KPNPEKIEAIQKYPIPTKPKEIKAFLGLTGYYRKFIPNFADIAKPMTKCLKKNMKIDTTN 480
Query: 888 AGATQAFDKLKEIMTTVPVLAVPDFQKTFVLETDASGKGLGAVLMQGGRPVAYMSKTLSE 947
AF KLK +++ P+L VPDF K F L TDAS LGAVL Q G P++Y+S+TL+E
Sbjct: 481 PEYDSAFKKLKYLISEDPILKVPDFTKKFTLTTDASDVALGAVLSQDGHPLSYISRTLNE 540
Query: 948 RAQAKSVYERELMAVVLAVQKWRHYLLGCKFIVHTDQKSLRFLAEQRLMGEEQQKWVSKL 1007
S E+EL+A+V A + +RHYLLG F + +D + L +L + + +W KL
Sbjct: 541 HEINYSTIEKELLAIVWATKTFRHYLLGRHFEISSDHQPLSWLYRMKDPNSKLTRWRVKL 600
Query: 1008 MGFDFEIKYKPGIENKAADALSR-KLQFSAISSVQCEDWEDLETEIL 1053
FDF+IKY G EN ADALSR KL+ + +S E+ ++++
Sbjct: 601 SEFDFDIKYIKGKENCVADALSRIKLEETYLSEQTQHSAEEDNSDLI 647
Score = 33.9 bits (76), Expect = 3.6
Identities = 58/301 (19%), Positives = 111/301 (36%), Gaps = 19/301 (6%)
Query: 1112 HAGIFRTYKRISALFFWEGMKLDIQTYVQKCEICQRNKYETLNPAGYLQPLPIPSQVWSD 1171
H GI +T K +++ +L IQ + +C IC K E N + P P
Sbjct: 763 HPGIQKTTKLFGETYYFPNSQLLIQNIINECSICNLAKTEHRNTDMPTKTTPKPEHCREK 822
Query: 1172 ISMDFIGGLPKTMGKDTILVVVDRFTKYAHFLALSHPYNAKEVAEL--FIKEIVKLHGFP 1229
+D + + GK + +D ++K+A K+ E + I G P
Sbjct: 823 FMID----IYSSEGKHYV-SCIDIYSKFATL----EEIKTKDWIECKNALMRIFNQLGKP 873
Query: 1230 TSIVSDRDRVFLSSFWSELFKLAGTKLKFSSAYHPQTDGQTEVVNRCVETYLRCVTGAKP 1289
+ +DRD F S + +L+ ++ D E +++ + +R + +
Sbjct: 874 KLLKADRDGAFSSLALKRWLESEEVELQLNTTKTGVAD--IERLHKTINEKIRIIKTSDD 931
Query: 1290 KQ--WPKWLSWAEFWYNTNYHSAIKTTPFQALYGREPPVIIKGTDSLASVNEVEKMTAER 1347
++ K + + + H TP P++ + +N++ E
Sbjct: 932 EETKLSKMETVLNIYNHKTKHDTTGQTPAHIFLYAGQPILDTQQNKENKINKINNDRVEY 991
Query: 1348 NLFLDTLKENLEKA--QNRMKQQANKHRRDI-QLKVGDMVYLKIQPYKLKSLARRKNQKL 1404
+ K L+K +N K N + D K+ + + YK + R+KN +L
Sbjct: 992 EVDTRYRKGPLQKGKLENPFKPTKNVEQTDSDHYKITNRNRI-THYYKTQFKKRKKNNQL 1050
Query: 1405 S 1405
S
Sbjct: 1051 S 1051
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 331 bits (848), Expect = 1e-89
Identities = 257/958 (26%), Positives = 455/958 (46%), Gaps = 89/958 (9%)
Query: 523 QDLIIKWELNGQKMCMQGEPSFCKVAATWKSIKKTKHDEGEEYFLSYECSEEEPTANVTI 582
QD+ ++ LN + F ++ S + K E E F + E N
Sbjct: 853 QDITVEEVLNDPTL-------FSEIETDTNSCEVVKTAETYERFTTI--CEHLKRENGDD 903
Query: 583 PELWIKLLTEFPEVFQ-EPKELPPKRATDHAILLQEGAPIPNIRPYRYPFYQKNEIEKLV 641
++W ++ +F +VF EL T+ I L+EGA +P P K EI K++
Sbjct: 904 RKIW-DVIEQFQDVFAISDDELGRNSGTECVIELKEGAEPIRQKPRPIPLALKPEIRKMI 962
Query: 642 KEMLAAGIIRHSTSPFSSPAILVKKKDGGWRFCVDYRALNKVTIPDKFPIPIIDELLDEI 701
++ML +IR S SP+SSP +LVKKKDG R C+DYR +NKV + P+P I+ L +
Sbjct: 963 QKMLNQKVIRESKSPWSSPVVLVKKKDGSIRMCIDYRKVNKVVKNNAHPLPNIEATLQSL 1022
Query: 702 GTAEIFSKLDLKSGYHQIRMREEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQV 761
++++ D+ +G+ QI + E+ TAF +E+ VLPFGL +P+ FQ M ++
Sbjct: 1023 AGKKLYTVFDMIAGFWQIPLDEKSKEITAFAIGSELFEWNVLPFGLVISPALFQGTMEEI 1082
Query: 762 LRPYLRKFVLVFFYDILIYSNNVDLHKEHLREVLQVLRDNHLVANQKKCSFGQSELIYLG 821
+ L V+ D+LI S +++ H + ++E L +R + + KC + E+ YLG
Sbjct: 1083 IGDLLGVCAFVYVDDLLIASKDMEQHLQDVKEALTRIRKSGMKLRASKCHIAKKEVEYLG 1142
Query: 822 HVISKEGVAADPSKIKDMLNWPLPKDVKGLRGFLGLTGYYRRFVRNYSKLAQPLNQLLK- 880
H ++ +GV K M + P +VK L+ FLGL GYYR+F+ N++++A L L+
Sbjct: 1143 HKVTLDGVETQEVKTDKMKQFSRPTNVKELQSFLGLVGYYRKFILNFAQIASSLTSLISA 1202
Query: 881 KNNFSWSAGATQAFDKLKEIMTTVPVLAVPDFQ------KTFVLETDASGKGLGAVLMQG 934
K + W AF +LK+++ PVLA PD + + F++ TDAS KG+GAVL Q
Sbjct: 1203 KVAWIWEKEQEIAFQELKKLVCQTPVLAQPDVEAALKGDRPFMIYTDASRKGIGAVLAQE 1262
Query: 935 G-----RPVAYMSKTLSERAQAKSVYERELMAVVLAVQKWRHYLLGCKFIVHTDQKSLRF 989
G P+A+ SK LS + + E +A++ A+++++ + G V TD K L
Sbjct: 1263 GPDGQQHPIAFASKALSPAETRYHITDLEALAMMFALRRFKTIIYGTAITVFTDHKPLIS 1322
Query: 990 LAEQRLMGEEQQKWVSKLMGFDFEIKYKPGIENKAADALSR-------------KLQFSA 1036
L + + + +W +++ FD +I Y G N ADALSR K S
Sbjct: 1323 LLKGSPLADRLWRWSIEILEFDVKIVYLAGKANAVADALSRGGCPPNELEEEQTKELTSI 1382
Query: 1037 ISSVQC-------------------EDWEDLETEILADDKYQKIIQEITTQGPVPAGYHM 1077
++++Q E W+++ L K + + + + + Y+
Sbjct: 1383 VNAIQTELPDILDSSCWLERLKGEDEGWKEV-IAALEGGKTKGTFKIVGIESEISLEYYK 1441
Query: 1078 RRGRLLYKNRIVLPKTSGKIP-----IILQEFHDSAVGGHAGIFRTYKRISALFFWEGMK 1132
G +L KN + ++ +P +L+E H+ + GH GI + ++ + F+W M+
Sbjct: 1442 IVGGVL-KNTEIEEQSRSVVPEKIRTPLLKELHEGMLAGHFGIKKMWRMVHRKFYWPQMR 1500
Query: 1133 LDIQTYVQKCEICQ-RNKYETLNPAGYLQPLPIPSQVWSDISMDFIGGLPKTMGKDTILV 1191
+ ++ V+ C C N + L + + P ++ ++ D + G IL
Sbjct: 1501 VCVENCVRTCAKCLCANDHSKLTSSLTPYRMTFPLEI---VACDLMDVGLSVQGNRYILT 1557
Query: 1192 VVDRFTKYAHFLALSHPYNAKEVAELFIKEIVKLHG-FPTSIVSDRDRVFLSSFWSELFK 1250
++D FTKY + + A+ V + F++ G P +++D+ + F++ +++
Sbjct: 1558 IIDLFTKYGTAVPIPDK-KAETVLKAFVERWAIGEGRIPLKLLTDQGKEFVNGLFAQFTH 1616
Query: 1251 LAGTKLKFSSAYHPQTDGQTEVVNRCVETYLRCVTGAKPKQWPKWLSWAEFWYNTNYHSA 1310
+ + + Y+ + +G E N+ + ++ T A P +W + +A + YN H
Sbjct: 1617 MLKIEHITTKGYNSRANGAVERFNKTIMHIMKKKT-AVPMEWDDQVVYAVYAYNNCVHEN 1675
Query: 1311 IKTTPFQALYGRE--PPVIIKGTDSL----ASVNEVEKMTAERNLFLDTL-KENLEKAQN 1363
TP ++GR+ P+ + G D++ A ++E + + + L + + KE+ + Q
Sbjct: 1676 TGETPMFLMHGRDVMGPLEMSGEDAVGINYADMDEYKHLLTQELLKVQKIAKEHAMREQE 1735
Query: 1364 R------MKQQANKHRRDIQLKVGDMVYLKIQPYKLKSLARRKNQKLSPRFYGPYPVI 1415
K + KHR + G V L+I KL + + KL ++ GPY VI
Sbjct: 1736 SYKSLFDQKYASKKHRFP---QPGSRVLLEIPSEKLGA----QCPKLVNKWSGPYRVI 1786
Score = 37.4 bits (85), Expect = 0.32
Identities = 40/151 (26%), Positives = 65/151 (42%), Gaps = 24/151 (15%)
Query: 325 NNEGKIPERKWNGGQRLTQTELQERSRRGLCFKCGEKWGREHICAKKNFQLILIEGEDEE 384
N K E + +G + +L+ R+ CF+C E C KKN E+
Sbjct: 565 NASQKCDECQQSGWHMASCFKLKNRA----CFRCNEMGHIAWNCPKKN--------ENTS 612
Query: 385 EEEEVFEEAEDGEFVLEGKVLQLSLNSKEGLTSNRSFKVKGKIGEREILILVDCGATSNF 444
E+E + E E V L + + K RS + KG+IG+ + IL+D GA+ +
Sbjct: 613 EKEAPVAKVETIEGVRMKDCLLMVKSEKSESEVTRSLE-KGQIGKANVEILLDSGASISL 671
Query: 445 ISQELVAELEIPVVATSEYVVEVGNGARERN 475
+S+ T E +VEV + E++
Sbjct: 672 MSKN-----------TWEKIVEVNGKSWEQD 691
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 322 bits (824), Expect = 7e-87
Identities = 181/470 (38%), Positives = 279/470 (58%), Gaps = 24/470 (5%)
Query: 584 ELWIKLLTEFPEVFQEPKE-LPPKRATDHAILLQEGAPIPNIRPYRYPFYQKNEIEKLVK 642
E+ LL EFP +F+ P + + A I PI + Y YP + E+E+ +
Sbjct: 86 EILNSLLGEFPRIFEPPLSGMSVETAVKAEIRTNTQDPI-YAKSYPYPVNMRGEVERQID 144
Query: 643 EMLAAGIIRHSTSPFSSPAILVKKK-----DGGWRFCVDYRALNKVTIPDKFPIPIIDEL 697
E+L GIIR S SP++SP +V KK + +R VD++ LN VTIPD +PIP I+
Sbjct: 145 ELLQDGIIRPSNSPYNSPIWIVPKKPKPNGEKQYRMVVDFKRLNTVTIPDTYPIPDINAT 204
Query: 698 LDEIGTAEIFSKLDLKSGYHQIRMREEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQAL 757
L +G A+ F+ LDL SG+HQI M+E DIPKTAF T G YE+L LPFGL NAP+ FQ +
Sbjct: 205 LASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPFGLKNAPAIFQRM 264
Query: 758 MNQVLRPYLRKFVLVFFYDILIYSNNVDLHKEHLREVLQVLRDNHLVANQKKCSFGQSEL 817
++ +LR ++ K V+ DI+++S + D H ++LR VL L +L N +K F +++
Sbjct: 265 IDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNLEKSHFLDTQV 324
Query: 818 IYLGHVISKEGVAADPSKIKDMLNWPLPKDVKGLRGFLGLTGYYRRFVRNYSKLAQPLNQ 877
+LG++++ +G+ ADP K++ + P P VK L+ FLG+T YYR+F+++Y+K+A+PL
Sbjct: 325 EFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYRKFIQDYAKVAKPLTN 384
Query: 878 LLK------------KNNFSWSAGATQAFDKLKEIMTTVPVLAVPDFQKTFVLETDASGK 925
L + K + A Q+F+ LK I+ + +LA P F K F L TDAS
Sbjct: 385 LTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKPFHLTTDASNW 444
Query: 926 GLGAVLMQG----GRPVAYMSKTLSERAQAKSVYERELMAVVLAVQKWRHYLLGCKFI-V 980
+GAVL Q RP+AY+S++L++ + + E+E++A++ ++ R YL G I V
Sbjct: 445 AIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLDNLRAYLYGAGTIKV 504
Query: 981 HTDQKSLRFLAEQRLMGEEQQKWVSKLMGFDFEIKYKPGIENKAADALSR 1030
+TD + L F R + ++W +++ ++ E+ YKPG N ADALSR
Sbjct: 505 YTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELIYKPGKSNVVADALSR 554
Score = 67.0 bits (162), Expect = 4e-10
Identities = 81/334 (24%), Positives = 136/334 (40%), Gaps = 37/334 (11%)
Query: 1082 LLYKNRI---VLPKTSGKIPI--ILQEFHDSAVGGHAGIFRTYKRISALFFWEGMKLDIQ 1136
LLYK RI ++ SG I I+++ H A H G ++ +++ M I+
Sbjct: 673 LLYKIRITQRLVADVSGAEEICEIIEKEHRRA---HRGPTEIRLQLLEKYYFPRMSSTIR 729
Query: 1137 TYVQKCEICQRNKYETLNPAGYLQPLPIPSQVWSDISMDFIGGLPKTMGKDTILVVVDRF 1196
C+ C+ KYE LQP PIP+ + +D + K L +D+F
Sbjct: 730 LQTSSCQCCKLYKYERHPNKPNLQPTPIPNYPCEILHIDIF-----ALEKRLYLSCIDKF 784
Query: 1197 TKYAHFLALSHPYNAKEVAELFIKE--IVKLHGF--PTSIVSDRDRVFLSSFWSELFKLA 1252
+K+A ++ + A + ++E + LH F P +VSD +R L +
Sbjct: 785 SKFAKL------FHLQSKASVHLRETLVEALHYFTAPKVLVSDNERGLLCPTVLNYLRSL 838
Query: 1253 GTKLKFSSAYHPQTDGQTEVVNRCVETYLRCVTGAKPKQWP-KWLSWAEFWYNTNYHSAI 1311
L ++ + +GQ E + RC+ P P + + A YNT+ HS
Sbjct: 839 DIDLYYAPTQKSEVNGQVERFHSTFLEIYRCLKDELPTFKPVELVHIAVDRYNTSVHSVT 898
Query: 1312 KTTPFQALYGREPPVIIKGTDSLASVNEVEKMTAERNLFLDTLKENLEKAQNRMKQQANK 1371
P + R V +G +T R L+ +K +E Q R NK
Sbjct: 899 NRKPADVFFDRSSRVNYQG------------LTDFRRQTLEDIKGLIEYKQIRGNMARNK 946
Query: 1372 HRRDIQ-LKVGDMVYLKIQPYKLKSLARRKNQKL 1404
+R + + GD V++ + K K AR + +K+
Sbjct: 947 NRDEPKSYGPGDEVFVANKQIKTKEKARFRCEKV 980
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 296 bits (757), Expect = 4e-79
Identities = 206/657 (31%), Positives = 335/657 (50%), Gaps = 74/657 (11%)
Query: 423 VKGKIGEREILILVDCGATSNFISQELVAELEIPVVATSEYVVEVGNGARERNSGVCKNL 482
++ ++ R + +L+D A N+I V EL+ + S + V +G+ E
Sbjct: 15 IERRLAGRTLKMLIDTDAAKNYIRP--VKELKNVMPVASPFSVSSIHGSTEIKH------ 66
Query: 483 KLEVQGIPIIQHFFIL-GLGGTELVLGMDWLASLGNIEANFQDLIIKWELNGQKMCMQGE 541
K ++ I FF+L L + ++G+D L G ++ N + ++++ +K+
Sbjct: 67 KCLMKVFKHISPFFLLDSLNAFDAIIGLDLLTQAG-VKLNLAEDSLEYQGIAEKL----- 120
Query: 542 PSFCKVAATWKSIKKTKHDEGEEYFLSYECSEEEPTANVTIPELWIKLLTEFPEVFQEPK 601
++ S ++ +P+ + EF + K
Sbjct: 121 -----------------------HYFSCPSVNFTDVNDIVVPD---SVKKEFKDTIIRRK 154
Query: 602 E--------LPPKRATDHAILLQEGAPIPNIRPYRYPFYQKNEIEKLVKEMLAAGIIRHS 653
+ LP A I + P+ + R Y + + VK++L GIIR S
Sbjct: 155 KAFSTTNEALPFNTAVTATIRTVDNEPVYS-RAYPTLMGVSDFVNNEVKQLLKDGIIRPS 213
Query: 654 TSPFSSPAILVKKK------DGGWRFCVDYRALNKVTIPDKFPIPIIDELLDEIGTAEIF 707
SP++SP +V KK + R +D+R LN+ TIPD++P+P I +L +G A+ F
Sbjct: 214 RSPYNSPTWVVDKKGTDAFGNPNKRLVIDFRKLNEKTIPDRYPMPSIPMILANLGKAKFF 273
Query: 708 SKLDLKSGYHQIRMREEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLR 767
+ LDLKSGYHQI + E D KT+F + G YE+ LPFGL NA S FQ ++ VLR +
Sbjct: 274 TTLDLKSGYHQIYLAEHDREKTSFSVNGGKYEFCRLPFGLRNASSIFQRALDDVLREQIG 333
Query: 768 KFVLVFFYDILIYSNNVDLHKEHLREVLQVLRDNHLVANQKKCSFGQSELIYLGHVISKE 827
K V+ D++I+S N H H+ VL+ L D ++ +Q+K F + + YLG ++SK+
Sbjct: 334 KICYVYVDDVIIFSENESDHVRHIDTVLKCLIDANMRVSQEKTRFFKESVEYLGFIVSKD 393
Query: 828 GVAADPSKIKDMLNWPLPKDVKGLRGFLGLTGYYRRFVRNYSKLAQPLNQLLKKNNFSWS 887
G +DP K+K + +P P V +R FLGL YYR F+++++ +A+P+ +LK N S S
Sbjct: 394 GTKSDPEKVKAIQEYPEPDCVYKVRSFLGLASYYRVFIKDFAAIARPITDILKGENGSVS 453
Query: 888 AGATQ------------AFDKLKEIMTTVPV-LAVPDFQKTFVLETDASGKGLGAVLMQG 934
++ AF +L+ I+ + V L PDF+K F L TDAS G+GAVL Q
Sbjct: 454 KHMSKKIPVEFNETQRNAFQRLRNILASEDVILKYPDFKKPFDLTTDASASGIGAVLSQE 513
Query: 935 GRPVAYMSKTLSERAQAKSVYERELMAVVLAVQKWRHYLLGCKFI-VHTDQKSLRFLAEQ 993
GRP+ +S+TL + Q + EREL+A+V A+ K +++L G + I + TD + L F
Sbjct: 514 GRPITMISRTLKQPEQNYATNERELLAIVWALGKLQNFLYGSREINIFTDHQPLTFAVAD 573
Query: 994 RLMGEEQQKWVSKLMGFDFEIKYKPGIENKAADALSRKLQFSAISSVQCEDWEDLET 1050
R + ++W S + + ++ YKPG EN ADALSR+ ++++Q E D T
Sbjct: 574 RNTNAKIKRWKSYIDQHNAKVFYKPGKENFVADALSRQ----NLNALQNEPQSDAAT 626
Score = 47.4 bits (111), Expect = 3e-04
Identities = 56/280 (20%), Positives = 106/280 (37%), Gaps = 37/280 (13%)
Query: 1112 HAGIFRTYKRISALFFWEGMKLDIQTYVQKCEICQRNKYETLNPAGYLQPLPIPSQVWSD 1171
H K++ +++ M + V C +C + KY+ L PIPS
Sbjct: 750 HRAAQENIKQVLRDYYFPKMGSLAKEVVANCRVCTQAKYDRHPKKQELGETPIPSYTGEM 809
Query: 1172 ISMDFIGGLPKTMGKDTILVVVDRFTKYAHFLALSHPYNAKEVAELFIKEIVKLHGFPT- 1230
+ +D + + L +D+F+KYA + P ++ + ++ + ++ FP
Sbjct: 810 VHIDIF-----STDRKLFLTCIDKFSKYA----IVQPVVSRTIVDITAPLLQIINLFPNI 860
Query: 1231 -SIVSDRDRVFLSSFWSELFKLA-GTKLKFSSAYHPQTDGQTEVVNRCVETYLRCVT-GA 1287
++ D + F S + + K + G + + H ++GQ E + + RC+
Sbjct: 861 KTVYCDNEPAFNSETVTSMLKNSFGIDIVNAPPLHSSSNGQVERFHSTLAEIARCLKLDK 920
Query: 1288 KPKQWPKWLSWAEFWYNTNYHSAIKTTPFQALYGREPPVIIKGTDSLASVNEVEKMTAER 1347
K + + A YN HS + P + V ER
Sbjct: 921 KTNDTVELILRATIEYNKTVHSVTRERPIEV---------------------VHPGAHER 959
Query: 1348 NLFLDTLKENLEKAQNRMKQQANKHRRDIQLKVGDMVYLK 1387
L +K L KAQ + N R++ +VG+ V++K
Sbjct: 960 CL---EIKARLVKAQQDSIGRNNPSRQNRVFEVGERVFVK 996
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 254 bits (650), Expect = 1e-66
Identities = 151/454 (33%), Positives = 241/454 (52%), Gaps = 13/454 (2%)
Query: 589 LLTEFPEVFQ-EPKELPPKRATDHAILLQEGAPIPNIRPYRYPFYQKNEIEKLVKEMLAA 647
+ +E+ ++F E + + + L++ P+ + YR P Q EI+ V++++
Sbjct: 282 ICSEYIDIFALESEPITVNNLYKQQLRLKDDEPVYT-KNYRSPHSQVEEIQAQVQKLIKD 340
Query: 648 GIIRHSTSPFSSPAILVKKKDGG------WRFCVDYRALNKVTIPDKFPIPIIDELLDEI 701
I+ S S ++SP +LV KK WR +DYR +NK + DKFP+P ID++LD++
Sbjct: 341 KIVEPSVSQYNSPLLLVPKKSSPNSDKKKWRLVIDYRQINKKLLADKFPLPRIDDILDQL 400
Query: 702 GTAEIFSKLDLKSGYHQIRMREEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQV 761
G A+ FS LDL SG+HQI + E T+F T G Y + LPFGL AP++FQ +M
Sbjct: 401 GRAKYFSCLDLMSGFHQIELDEGSRDITSFSTSNGSYRFTRLPFGLKIAPNSFQRMMTIA 460
Query: 762 LRPYLRKFVLVFFYDILIYSNNVDLHKEHLREVLQVLRDNHLVANQKKCSFGQSELIYLG 821
++ D+++ + ++L EV R+ +L + +KCSF E+ +LG
Sbjct: 461 FSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEVFGKCREYNLKLHPEKCSFFMHEVTFLG 520
Query: 822 HVISKEGVAADPSKIKDMLNWPLPKDVKGLRGFLGLTGYYRRFVRNYSKLAQPLNQLLKK 881
H + +G+ D K + N+P+P D R F+ YYRRF++N++ ++ + +L KK
Sbjct: 521 HKCTDKGILPDDKKYDVIQNYPVPHDADSARRFVAFCNYYRRFIKNFADYSRHITRLCKK 580
Query: 882 N-NFSWSAGATQAFDKLKEIMTTVPVLAVPDFQKTFVLETDASGKGLGAVLMQGGR---- 936
N F W+ +AF LK + +L PDF K F + TDAS + GAVL Q
Sbjct: 581 NVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSKEFCITTDASKQACGAVLTQNHNGHQL 640
Query: 937 PVAYMSKTLSERAQAKSVYERELMAVVLAVQKWRHYLLGCKFIVHTDQKSLRFLAEQRLM 996
PVAY S+ ++ KS E+EL A+ A+ +R Y+ G F V TD + L +L
Sbjct: 641 PVAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYIYGKHFTVKTDHRPLTYLFSMVNP 700
Query: 997 GEEQQKWVSKLMGFDFEIKYKPGIENKAADALSR 1030
+ + +L ++F ++Y G +N ADALSR
Sbjct: 701 SSKLTRIRLELEEYNFTVEYLKGKDNHVADALSR 734
Score = 113 bits (283), Expect = 4e-24
Identities = 80/321 (24%), Positives = 148/321 (45%), Gaps = 21/321 (6%)
Query: 1100 ILQEFHDSAV-GGHAGIFRTYKRISALFFWEGMKLDIQTYVQKCEICQRNKYETLNPAGY 1158
IL HD + GGH GI +T ++ ++W+ M I+ YV+KC+ CQ+ K T +
Sbjct: 896 ILSTLHDDPIQGGHTGITKTLAKVKRHYYWKNMSKYIKEYVRKCQKCQKAK-TTKHTKTP 954
Query: 1159 LQPLPIPSQVWSDISMDFIGGLPKTM-GKDTILVVVDRFTKYAHFLALSHPYNAKEVAEL 1217
+ P + + +D IG LPK+ G + + ++ TKY + +++ +AK VA+
Sbjct: 955 MTITETPEHAFDRVVVDTIGPLPKSENGNEYAVTLICDLTKYLVAIPIANK-SAKTVAKA 1013
Query: 1218 FIKEIVKLHGFPTSIVSDRDRVFLSSFWSELFKLAGTKLKFSSAYHPQTDGQTEVVNRCV 1277
+ + +G + ++D + +S ++L K K S+A+H QT G E +R +
Sbjct: 1014 IFESFILKYGPMKTFITDMGTEYKNSIITDLCKYLKIKNITSTAHHHQTVGVVERSHRTL 1073
Query: 1278 ETYLRCVTGAKPKQWPKWLSWAEFWYNTNYHSAIKTTPFQALYGREP--PVIIKGTDSLA 1335
Y+R W WL + + +NT P++ ++GR P S+
Sbjct: 1074 NEYIRSYISTDKTDWDVWLQYFVYCFNTTQSMVHNYCPYELVFGRTSNLPKHFNKLHSIE 1133
Query: 1336 SVNEVEKMTAERNLFLDT----LKENLEKAQNRMKQQANKHRRDIQLKVGDMVYLKIQ-- 1389
+ ++ E L+ ++ LE + + K+ + +DI+L+VGD V L+ +
Sbjct: 1134 PIYNIDDYAKESKYRLEVAYARARKLLEAHKEKNKENYDLKVKDIELEVGDKVLLRNEVG 1193
Query: 1390 ---------PYKLKSLARRKN 1401
PYK++S+ N
Sbjct: 1194 HKLDFKYTGPYKIESIGDNNN 1214
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 680
Score = 209 bits (532), Expect = 5e-53
Identities = 180/668 (26%), Positives = 317/668 (46%), Gaps = 49/668 (7%)
Query: 415 LTSNRSFKVKGKIGER-----EILILVDCGATSNFISQELVAELEIPVVATSEYVVEVGN 469
+T+ S +KG++ + E+ VD GA+ S+ ++ E E V A +V++ +
Sbjct: 19 VTNPNSIYIKGRLYFKGYKKIELHCFVDTGASLCIASKFVIPE-EHWVNAERPIMVKIAD 77
Query: 470 GARERNSGVCKNLKLEVQGIPIIQHFFILGLGGTELVLGMDWLASLGNIEANFQDLIIKW 529
G+ S VCK++ L + G+ G + ++G ++ L F D +I
Sbjct: 78 GSSITISKVCKDIDLIIVGVIFKIPTVYQQESGIDFIIGNNF-CQLYEPFIQFTDRVIFT 136
Query: 530 ELNGQKMCMQGEPSFCKVAATW--KSIKKTKHDEGEEYFLSYECSEEEPTANVTIPELWI 587
+ + + +V +S+KK + E E P + I
Sbjct: 137 KNKSYPVHIAKLTRAVRVGTEGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSEGR 196
Query: 588 KLLTE----FPEVFQEPKELPPKRATDH-------------AILLQEGAPIPNIRPYRYP 630
+L E + Q+ +EL K +++ +I L + + ++P +Y
Sbjct: 197 RLSEEKLFITQQRMQKTEELLEKVCSENPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYS 256
Query: 631 FYQKNEIEKLVKEMLAAGIIRHSTSPFSSPAILVKKKD----GGWRFCVDYRALNKVTIP 686
+ E +K +KE+L +I+ S SP +PA LV + G R V+Y+A+NK T+
Sbjct: 257 PMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAENGRGNKRMVVNYKAMNKATVG 316
Query: 687 DKFPIPIIDELLDEIGTAEIFSKLDLKSGYHQIRMREEDIPKTAFRTHEGHYEYLVLPFG 746
D + +P DELL I +IFS D KSG+ Q+ + +E P TAF +GHYE+ V+PFG
Sbjct: 317 DAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFG 376
Query: 747 LTNAPSTFQALMNQVLRPYLRKFVLVFFYDILIYSNNVDLHKEHLREVLQVLRDNHLVAN 806
L APS FQ M++ R + RKF V+ DI+++SNN + H H+ +LQ + ++ +
Sbjct: 377 LKQAPSIFQRHMDEAFRVF-RKFCCVYVDDIVVFSNNEEDHLLHVAMILQKCNQHGIILS 435
Query: 807 QKKCSFGQSELIYLGHVISKEGVAADPSKIKDMLN-WP-LPKDVKGLRGFLGLTGYYRRF 864
+KK + ++ +LG I EG I + +N +P +D K L+ FLG+ Y +
Sbjct: 436 KKKAQLFKKKINFLGLEID-EGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDY 494
Query: 865 VRNYSKLAQPLNQLLKKN-NFSWSAGATQAFDKLKEIMTTVPVLAVPDFQKTFVLETDAS 923
+ N +++ QPL LK+N + W+ T K+K+ + P L P ++ ++ETDAS
Sbjct: 495 IPNLAQMRQPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDAS 554
Query: 924 ----GKGLGAVLMQGGRPV----AYMSKTLSERAQAKSVYERELMAVVLAVQKWRHYLLG 975
G L A+ + G Y S + + ++E +AV+ ++K+ YL
Sbjct: 555 DDYWGGMLKAIKINEGTNTELICRYRSGSFKAAERNYHSNDKETLAVINTIKKFSIYLTP 614
Query: 976 CKFIVHTDQKSLRFLAEQRLMGEEQQ----KWVSKLMGFDFEIKYKPGIENKAADALSRK 1031
F++ TD + G+ + +W + L + F++++ G +N AD LSR
Sbjct: 615 VHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSR- 673
Query: 1032 LQFSAISS 1039
+F+ ++S
Sbjct: 674 -EFNKVNS 680
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 666
Score = 209 bits (531), Expect = 6e-53
Identities = 188/687 (27%), Positives = 325/687 (46%), Gaps = 75/687 (10%)
Query: 389 VFEEAEDGEFVLEGKVLQLSLNSKEGLTSNRSFKVKGKIGER-----EILILVDCGATSN 443
+F E E G F L K L LN +T+ S ++GK+ I VD GA+
Sbjct: 6 LFREGELGHFCLN-KQEMLHLN----VTNPNSIYIEGKLSFEGYKSFNIHCYVDTGASLC 60
Query: 444 FISQELVAELEIPVVATSEYVVEVGNGARERNSGVCKNLKLEVQGIPIIQHFFILGLGGT 503
S+ ++ E E+ + + V++ N + + VCKNLK++ G + F I +
Sbjct: 61 IASRYIIPE-ELWENSPKDIQVKIANQELIKITKVCKNLKVKFAG----KSFEIPTVYQQ 115
Query: 504 ELVLGMDWLASLGNIEANFQDLIIKWE------LNGQKMCMQ--------GEPSFCKVAA 549
E G+D+L +GN + I+WE L + + ++ PSF +
Sbjct: 116 ET--GIDFL--IGNNFCRLYNPFIQWEDRIAFHLKNEMVLIKKVTKAFSVSNPSFLE--- 168
Query: 550 TWKSIKKTKHDEG-----------EEYFLSYECSEEEPTANVTIPELWIKLLTEFPEVFQ 598
K KT+ G E YFL E ++ I +L K+ +E P
Sbjct: 169 NMKKDSKTEQIPGTNISKNIINPEERYFLITEKYQK-------IEQLLDKVCSENPI--- 218
Query: 599 EPKELPPKRATDHAILLQEGAPIPNIRPYRYPFYQKNEIEKLVKEMLAAGIIRHSTSPFS 658
+ K+ +I L + + ++P Y + K +KE+L G+I S S
Sbjct: 219 --DPIKSKQWMKASIKLIDPLKVIRVKPMSYSPQDREGFAKQIKELLDLGLIIPSKSQHM 276
Query: 659 SPAILVK----KKDGGWRFCVDYRALNKVTIPDKFPIPIIDELLDEIGTAEIFSKLDLKS 714
SPA LV+ ++ G R V+Y+A+N+ TI D +P + ELL + IFS D KS
Sbjct: 277 SPAFLVENEAERRRGKKRMVVNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKS 336
Query: 715 GYHQIRMREEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFF 774
G+ Q+ + EE TAF +GH+++ V+PFGL APS FQ M L KF +V+
Sbjct: 337 GFWQVVLDEESQKLTAFTCPQGHFQWKVVPFGLKQAPSIFQRHMQTALNG-ADKFCMVYV 395
Query: 775 YDILIYSNNVDLHKEHLREVLQVLRDNHLVANQKKCSFGQSELIYLGHVISKEGVAADPS 834
DI+++SN+ H H+ VL+++ ++ ++KK + + ++ +LG I K
Sbjct: 396 DDIIVFSNSELDHYNHVYAVLKIVEKYGIILSKKKANLFKEKINFLGLEIDKGTHCPQNH 455
Query: 835 KIKDMLNWP-LPKDVKGLRGFLGLTGYYRRFVRNYSKLAQPLNQLLKKN-NFSWSAGATQ 892
++++ +P +D K L+ FLG+ Y ++ +++ +PL LKK+ ++W+ +
Sbjct: 456 ILENIHKFPDRLEDKKHLQRFLGVLTYAETYIPKLAEIRKPLQVKLKKDVTWNWTQSDSD 515
Query: 893 AFDKLKEIMTTVPVLAVPDFQKTFVLETDASGKGLGAVL----MQGGRPVA-YMSKTLSE 947
K+K+ + + P L +P + ++ETDAS G VL + G + Y S + +
Sbjct: 516 YVKKIKKNLGSFPKLYLPKPEDHLIIETDASDSFWGGVLKARALDGVELICRYSSGSFKQ 575
Query: 948 RAQAKSVYERELMAVVLAVQKWRHYLLGCKFIVHTDQKSLRFLAEQRLMGEEQQ----KW 1003
+ ++EL+AV + K+ YL +F V TD K+ + L G+ +Q +W
Sbjct: 576 AEKNYHSNDKELLAVKQVITKFSAYLTPVRFTVRTDNKNFTYFLRINLKGDSKQGRLVRW 635
Query: 1004 VSKLMGFDFEIKYKPGIENKAADALSR 1030
+ + F++++ G++N AD L+R
Sbjct: 636 QNWFSKYQFDVEHLEGVKNVLADCLTR 662
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 207 bits (527), Expect = 2e-52
Identities = 184/681 (27%), Positives = 322/681 (47%), Gaps = 57/681 (8%)
Query: 406 QLSLNSKEGLTSNRSFKVKGKIGER-----EILILVDCGATSNFISQELVAELEIPVVAT 460
Q + +T+ S +KG++ + E+ VD GA+ S+ ++ E E V A
Sbjct: 9 QTQIEQVMNVTNPNSIYIKGRLYFKGYKKIELHCFVDTGASLCIASKFVIPE-EHWVNAE 67
Query: 461 SEYVVEVGNGARERNSGVCKNLKLEVQG----IPIIQHFFILGLGGTELVLGMDWLASLG 516
+V++ +G+ S VCK++ L + G IP + G + ++G ++ L
Sbjct: 68 RPIMVKIADGSSITISKVCKDIDLIIAGEIFKIPTVYQ----QESGIDFIIGNNF-CQLY 122
Query: 517 NIEANFQDLIIKWELNGQKMCMQGEPSFCKVAATW--KSIKKTKHDEGEEYFLSYECSEE 574
F D +I + + + +V +S+KK + E E
Sbjct: 123 EPFIQFTDRVIFTKNKSYPVHITKLTRAVRVGIEGFLESMKKRSKTQQPEPVNISTNKIE 182
Query: 575 EPTANVTIPELWIKLLTE----FPEVFQEPKELPPKRATDH-------------AILLQE 617
P + I +L E + Q+ +EL K +++ +I L +
Sbjct: 183 NPLEEIAILSEGRRLSEEKLFITQQRMQKIEELLEKVCSENPLDPNKTKQWMKASIKLSD 242
Query: 618 GAPIPNIRPYRYPFYQKNEIEKLVKEMLAAGIIRHSTSPFSSPAILV----KKKDGGWRF 673
+ ++P +Y + E +K +KE+L +I+ S SP +PA LV +K+ G R
Sbjct: 243 PSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRM 302
Query: 674 CVDYRALNKVTIPDKFPIPIIDELLDEIGTAEIFSKLDLKSGYHQIRMREEDIPKTAFRT 733
V+Y+A+NK TI D + +P DELL I +IFS D KSG+ Q+ + +E P TAF
Sbjct: 303 VVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTC 362
Query: 734 HEGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFYDILIYSNNVDLHKEHLRE 793
+GHYE+ V+PFGL APS FQ M++ R + RKF V+ DIL++SNN + H H+
Sbjct: 363 PQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHLLHVAM 421
Query: 794 VLQVLRDNHLVANQKKCSFGQSELIYLGHVISKEGVAADPSKIKDMLN-WP-LPKDVKGL 851
+LQ + ++ ++KK + ++ +LG I EG I + +N +P +D K L
Sbjct: 422 ILQKCNQHGIILSKKKAQLFKKKINFLGLEID-EGTHKPQGHILEHINKFPDTLEDKKQL 480
Query: 852 RGFLGLTGYYRRFVRNYSKLAQPLNQLLKKN-NFSWSAGATQAFDKLKEIMTTVPVLAVP 910
+ FLG+ Y ++ +++ +PL LK+N + W+ T K+K+ + P L P
Sbjct: 481 QRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHP 540
Query: 911 DFQKTFVLETDAS----GKGLGAVLMQGGRPV----AYMSKTLSERAQAKSVYERELMAV 962
++ ++ETDAS G L A+ + G Y S + + ++E +AV
Sbjct: 541 LPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNYHSNDKETLAV 600
Query: 963 VLAVQKWRHYLLGCKFIVHTDQKSLRFLAEQRLMGEEQQ----KWVSKLMGFDFEIKYKP 1018
+ ++K+ YL F++ TD + G+ + +W + L + F++++
Sbjct: 601 INTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIK 660
Query: 1019 GIENKAADALSRKLQFSAISS 1039
G +N AD LSR +F+ ++S
Sbjct: 661 GTDNHFADFLSR--EFNKVNS 679
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 207 bits (526), Expect = 2e-52
Identities = 183/672 (27%), Positives = 324/672 (47%), Gaps = 57/672 (8%)
Query: 415 LTSNRSFKVKGKIGER-----EILILVDCGATSNFISQELVAELEIPVVATSEYVVEVGN 469
+T+ S +KG++ + E+ VD GA+ S+ ++ E E V A +V++ +
Sbjct: 18 VTNPNSIYIKGRLYFKGYKKIELHCFVDTGASLCIASKFVIPE-EHWVNAERPIMVKIAD 76
Query: 470 GARERNSGVCKNLKLEVQG----IPIIQHFFILGLGGTELVLGMDWLASLGNIEANFQDL 525
G+ S VCK++ L + G IP + G + ++G ++ L F D
Sbjct: 77 GSSITISKVCKDIDLIIAGEIFRIPTVYQ----QESGIDFIIGNNF-CQLYEPFIQFTDR 131
Query: 526 IIKWELNGQKMCMQGEPSFCKVAATW--KSIKKTKHDEGEEYFLSYECSEEEPTANVTIP 583
+I + + + +V +S+KK + E E P + I
Sbjct: 132 VIFTKNKSYPVHIAKLTRAVRVGTEGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAIL 191
Query: 584 ELWIKLLTE----FPEVFQEPKELPPKRATDH-------------AILLQEGAPIPNIRP 626
+L E + Q+ +EL K +++ +I L + + ++P
Sbjct: 192 SEGRRLSEEKLFITQQRMQKIEELLEKVCSENPLDPNKTKQWMKASIKLSDPSKAIKVKP 251
Query: 627 YRYPFYQKNEIEKLVKEMLAAGIIRHSTSPFSSPAILV----KKKDGGWRFCVDYRALNK 682
+Y + E +K +KE+L +I+ S SP +PA LV +K+ G R V+Y+A+NK
Sbjct: 252 MKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNK 311
Query: 683 VTIPDKFPIPIIDELLDEIGTAEIFSKLDLKSGYHQIRMREEDIPKTAFRTHEGHYEYLV 742
T+ D + +P DELL I +IFS D KSG+ Q+ + +E P TAF +GHYE+ V
Sbjct: 312 ATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNV 371
Query: 743 LPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFYDILIYSNNVDLHKEHLREVLQVLRDNH 802
+PFGL APS FQ M++ R + RKF V+ DIL++SNN + H H+ +LQ +
Sbjct: 372 VPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHG 430
Query: 803 LVANQKKCSFGQSELIYLGHVISKEGVAADPSKIKDMLN-WP-LPKDVKGLRGFLGLTGY 860
++ ++KK + ++ +LG I EG I + +N +P +D K L+ FLG+ Y
Sbjct: 431 IILSKKKAQLFKKKINFLGLEID-EGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTY 489
Query: 861 YRRFVRNYSKLAQPLNQLLKKN-NFSWSAGATQAFDKLKEIMTTVPVLAVPDFQKTFVLE 919
++ +++ +PL LK+N + W+ T K+K+ + P L P ++ ++E
Sbjct: 490 ASDYIPKLAQIRKPLQAKLKENVPWRWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIE 549
Query: 920 TDAS----GKGLGAVLMQGGRPVAYMSKTLSE--RAQAKSVY--ERELMAVVLAVQKWRH 971
TDAS G L A+ + G + + S +A K+ + ++E +AV+ ++K+
Sbjct: 550 TDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKNYHSNDKETLAVINTIKKFSI 609
Query: 972 YLLGCKFIVHTDQKSLRFLAEQRLMGEEQQ----KWVSKLMGFDFEIKYKPGIENKAADA 1027
YL F++ TD + G+ + +W + L + F++++ G +N AD
Sbjct: 610 YLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADF 669
Query: 1028 LSRKLQFSAISS 1039
LSR +F+ ++S
Sbjct: 670 LSR--EFNKVNS 679
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 206 bits (524), Expect = 4e-52
Identities = 183/672 (27%), Positives = 320/672 (47%), Gaps = 57/672 (8%)
Query: 415 LTSNRSFKVKGKIGER-----EILILVDCGATSNFISQELVAELEIPVVATSEYVVEVGN 469
+T+ S +KG++ + E+ VD GA+ S+ ++ E E V A +V++ +
Sbjct: 18 VTNPNSIYIKGRLYFKGYKKIELHCFVDTGASLCIASKFVIPE-EHWVNAERPIMVKIAD 76
Query: 470 GARERNSGVCKNLKL----EVQGIPIIQHFFILGLGGTELVLGMDWLASLGNIEANFQDL 525
G+ S VCK++ L E+ IP + G + ++G ++ L F D
Sbjct: 77 GSSITISKVCKDIDLIIAREIFKIPTVYQ----QESGIDFIIGNNF-CQLYEPFIQFTDR 131
Query: 526 IIKWELNGQKMCMQGEPSFCKVAATW--KSIKKTKHDEGEEYFLSYECSEEEPTANVTIP 583
+I + + + +V +S+KK + E E P + I
Sbjct: 132 VIFTKNKSYPVHIAKLTRAVRVGTEGFLESMKKRSKTQQPEPVNISTNKIENPLKEIAIL 191
Query: 584 ELWIKLLTE----FPEVFQEPKELPPKRATDH-------------AILLQEGAPIPNIRP 626
+L E + Q+ +EL K +++ +I L + + ++P
Sbjct: 192 SEGRRLSEEKLFITQQRMQKIEELLEKVCSENPLDPNKTKQWMKASIKLSDPSKAIKVKP 251
Query: 627 YRYPFYQKNEIEKLVKEMLAAGIIRHSTSPFSSPAILV----KKKDGGWRFCVDYRALNK 682
+Y + E +K +KE+L +I+ S SP +PA LV +K+ G R V+Y+A+NK
Sbjct: 252 MKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNK 311
Query: 683 VTIPDKFPIPIIDELLDEIGTAEIFSKLDLKSGYHQIRMREEDIPKTAFRTHEGHYEYLV 742
TI D + +P DELL I +IFS D KSG+ Q+ + +E P TAF +GHYE+ V
Sbjct: 312 ATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNV 371
Query: 743 LPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFYDILIYSNNVDLHKEHLREVLQVLRDNH 802
+PFGL APS FQ M++ R + RKF V+ DIL++SNN + H H+ +LQ +
Sbjct: 372 VPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHG 430
Query: 803 LVANQKKCSFGQSELIYLGHVISKEGVAADPSKIKDMLN-WP-LPKDVKGLRGFLGLTGY 860
++ ++KK + ++ +LG I EG I + +N +P +D K L+ FLG+ Y
Sbjct: 431 IILSKKKAQLFKKKINFLGLEID-EGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTY 489
Query: 861 YRRFVRNYSKLAQPLNQLLKKN-NFSWSAGATQAFDKLKEIMTTVPVLAVPDFQKTFVLE 919
++ +++ +PL LK+N + W+ T K+K+ + P L P ++ ++E
Sbjct: 490 ASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIE 549
Query: 920 TDAS----GKGLGAVLMQGGRPV----AYMSKTLSERAQAKSVYERELMAVVLAVQKWRH 971
TDAS G L A+ + G Y S + + ++E +AV+ ++K+
Sbjct: 550 TDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKFSI 609
Query: 972 YLLGCKFIVHTDQKSLRFLAEQRLMGEEQQ----KWVSKLMGFDFEIKYKPGIENKAADA 1027
YL F++ TD + G+ + +W + L + F++++ G +N AD
Sbjct: 610 YLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADF 669
Query: 1028 LSRKLQFSAISS 1039
LSR +F+ ++S
Sbjct: 670 LSR--EFNKVNS 679
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 674
Score = 203 bits (517), Expect = 3e-51
Identities = 147/477 (30%), Positives = 251/477 (51%), Gaps = 28/477 (5%)
Query: 582 IPELWIKLLTEFPEVFQEPKELPPKRATDHAILLQEGAPIPNIRPYRYPFYQKNEIEKLV 641
I EL K+ +E P +P + K+ +I L + + ++P +Y + E +K +
Sbjct: 207 IEELLEKVCSENP---LDPNKT--KQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQI 261
Query: 642 KEMLAAGIIRHSTSPFSSPAILV----KKKDGGWRFCVDYRALNKVTIPDKFPIPIIDEL 697
KE+L +I+ S SP +PA LV +K+ G R V+Y+A+NK T+ D + P DEL
Sbjct: 262 KELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNPPNKDEL 321
Query: 698 LDEIGTAEIFSKLDLKSGYHQIRMREEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQAL 757
L I +IFS D KSG+ Q+ + +E P TAF +GHYE+ V+PFGL APS FQ
Sbjct: 322 LTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRH 381
Query: 758 MNQVLRPYLRKFVLVFFYDILIYSNNVDLHKEHLREVLQVLRDNHLVANQKKCSFGQSEL 817
M++ R + RKF V+ DIL++SNN + H H+ +LQ + ++ ++KK + ++
Sbjct: 382 MDEAFRVF-RKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKI 440
Query: 818 IYLGHVISKEGVAADPSKIKDMLN-WP-LPKDVKGLRGFLGLTGYYRRFVRNYSKLAQPL 875
+LG I EG I + +N +P +D K L+ FLG+ Y ++ +++ +PL
Sbjct: 441 NFLGLEID-EGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPL 499
Query: 876 NQLLKKN-NFSWSAGATQAFDKLKEIMTTVPVLAVPDFQKTFVLETDAS----GKGLGAV 930
LK+N + W+ T K+K+ + P L P ++ ++ETDAS G L A+
Sbjct: 500 QAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAI 559
Query: 931 LMQGGRPVAYMSKTLSE--RAQAKSVY--ERELMAVVLAVQKWRHYLLGCKFIVHTDQKS 986
+ G + + S +A K+ + ++E +AV+ ++K+ YL F++ TD
Sbjct: 560 KINEGTNTELICRYASGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTH 619
Query: 987 LRFLAEQRLMGEEQQ----KWVSKLMGFDFEIKYKPGIENKAADALSRKLQFSAISS 1039
+ G+ + +W + L + F++++ G +N AD LSR +F+ ++S
Sbjct: 620 FKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSR--EFNRVNS 674
>POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;
Protease (EC 3.4.23.-); Reverse transcriptase (EC
2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1886
Score = 181 bits (458), Expect = 2e-44
Identities = 137/484 (28%), Positives = 234/484 (48%), Gaps = 51/484 (10%)
Query: 587 IKLLTEFPEVFQEPKELPPKRATDHAILLQEGAPIPNIRPYRYPFYQKNEIEKLVKEMLA 646
+K + E P F + ++ K + + G PI ++ P + + + + +L
Sbjct: 1373 MKYIGENPMEFWKNNKIKCKLNIINPDIKIMGRPIKHVTPG-----DEEAMTRQINLLLQ 1427
Query: 647 AGIIRHSTSPFSSPAILV-----------KKKDGGWRFCVDYRALNKVTIPDKFPIPIID 695
+IR S S S A +V K+K G R +Y+ LN+ T D++ +P I+
Sbjct: 1428 MKVIRPSESKHRSTAFIVRSGTEIDPITGKEKKGKERMVFNYKLLNENTESDQYSLPGIN 1487
Query: 696 ELLDEIGTAEIFSKLDLKSGYHQIRMREEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQ 755
++ ++G ++I+SK DLKSG+ Q+ M EE +P TAF YE+LV+PFGL NAP+ FQ
Sbjct: 1488 TIISKVGRSKIYSKFDLKSGFWQVAMEEESVPWTAFLAGNKLYEWLVMPFGLKNAPAIFQ 1547
Query: 756 ALMNQVLRPYLRKFVLVFFYDILIYSNNVDLHKEHLREVLQVLRDNHLVANQKKCSFGQS 815
M+ V + KF+ V+ DIL++S + H +HL +LQ+ ++N L+ + K G
Sbjct: 1548 RKMDNVFKG-TEKFIAVYIDDILVFSETAEQHSQHLYTMLQLCKENGLILSPTKMKIGTP 1606
Query: 816 ELIYLGHVISKEGVAADP---SKIKDMLNWPLPKDVKGLRGFLGLTGYYRRFVRNYSKLA 872
E+ +LG + + P SKI D + L +G+R +LG+ Y R ++++ KL
Sbjct: 1607 EIDFLGASLGCTKIKLQPHIISKICDFSDEKLATP-EGMRSWLGILSYARNYIQDIGKLV 1665
Query: 873 QPLNQLL------KKNNFSWSAGATQAFDKLKEIMTTVPVLAVPDFQKTFVLETDASGKG 926
QPL Q + + N +W + ++KE + +P L +P ++ETD G
Sbjct: 1666 QPLRQKMAPTGDKRMNPETW-----KMVRQIKEKVKNLPDLQLPPKDSFIIIETDGCMTG 1720
Query: 927 LGAVL---------MQGGRPVAYMSKTLSERAQAKSVYERELMAVVLAVQKWRHYLLGCK 977
GAV R AY S + + KS + E+ A + + K++ Y L K
Sbjct: 1721 WGAVCKWKMSKHDPRSTERICAYASGSFN---PIKSTIDAEIQAAIHGLDKFKIYYLDKK 1777
Query: 978 -FIVHTD-QKSLRFLAEQRLMGEEQQKWVS-----KLMGFDFEIKYKPGIENKAADALSR 1030
I+ +D + ++F + + +W++ +G ++ G N ADALSR
Sbjct: 1778 ELIIRSDCEAIIKFYNKTNENKPSRVRWLTFSDFLTGLGITVTFEHIDGKHNGLADALSR 1837
Query: 1031 KLQF 1034
+ F
Sbjct: 1838 MINF 1841
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 659
Score = 177 bits (450), Expect = 2e-43
Identities = 124/425 (29%), Positives = 214/425 (50%), Gaps = 19/425 (4%)
Query: 621 IPNIRPYRYPFYQKNEIEKLVKEMLAAGIIRHSTSPFSSPAILVK----KKDGGWRFCVD 676
+ ++P Y + E ++ +KE+L +I+ S S SPA LV+ ++ G R V+
Sbjct: 228 VVKVKPMSYSPSDREEFDRQIKELLELKVIKPSKSTHMSPAFLVENEAERRRGKKRMVVN 287
Query: 677 YRALNKVTIPDKFPIPIIDELLDEIGTAEIFSKLDLKSGYHQIRMREEDIPKTAFRTHEG 736
Y+A+NK T D +P DELL + +I+S D KSG Q+ + +E TAF +G
Sbjct: 288 YKAMNKATKGDAHNLPNKDELLTLVRGKKIYSSFDCKSGLWQVLLDKESQLLTAFTCPQG 347
Query: 737 HYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFYDILIYSNNVDLHKEHLREVLQ 796
HY++ V+PFGL APS F K+ V+ DIL++SN KEH VL
Sbjct: 348 HYQWNVVPFGLKQAPSIFPKTYANSHSNQYSKYCCVYVDDILVFSNTG--RKEHYIHVLN 405
Query: 797 VLRDNH---LVANQKKCSFGQSELIYLGHVISKEGVAADPSKIKDMLNWP-LPKDVKGLR 852
+LR ++ ++KK + ++ +LG I + ++ + +P +D K L+
Sbjct: 406 ILRRCEKLGIILSKKKAQLFKEKINFLGLEIDQGTHCPQNHILEHIHKFPDRIEDKKQLQ 465
Query: 853 GFLGLTGYYRRFVRNYSKLAQPLNQLLKKNN-FSWSAGATQAFDKLKEIMTTVPVLAVPD 911
FLG+ Y ++ + + +PL LK+++ ++W+ +Q K+K+ + + P L P+
Sbjct: 466 RFLGILTYASDYIPKLASIRKPLQSKLKEDSTWTWNDTDSQYMAKIKKNLKSFPKLYHPE 525
Query: 912 FQKTFVLETDASGKGLGAVLMQGGRPVAYMSKTLSERAQAKS----VYERELMAVVLAVQ 967
V+ETDAS + G +L Y+ + S +A E+EL+AV+ ++
Sbjct: 526 PNDKLVIETDASEEFWGGILKAIHNSHEYICRYASGSFKAAERNYHSNEKELLAVIRVIK 585
Query: 968 KWRHYLLGCKFIVHTDQKSLRFLAEQRLMGEEQQ----KWVSKLMGFDFEIKYKPGIENK 1023
K+ YL +F++ TD K+ L G+ +Q +W L +DF++++ G +N
Sbjct: 586 KFSIYLTPSRFLIRTDNKNFTHFVNINLKGDRKQGRLVRWQMWLSQYDFDVEHIAGTKNV 645
Query: 1024 AADAL 1028
AD L
Sbjct: 646 FADFL 650
>M860_ARATH (P92523) Hypothetical mitochondrial protein AtMg00860
(ORF158)
Length = 158
Score = 151 bits (382), Expect = 1e-35
Identities = 72/130 (55%), Positives = 95/130 (72%), Gaps = 2/130 (1%)
Query: 790 HLREVLQVLRDNHLVANQKKCSFGQSELIYLGH--VISKEGVAADPSKIKDMLNWPLPKD 847
HL VLQ+ + AN+KKC+FGQ ++ YLGH +IS EGV+ADP+K++ M+ WP PK+
Sbjct: 3 HLGMVLQIWEQHQFYANRKKCAFGQPQIAYLGHRHIISGEGVSADPAKLEAMVGWPEPKN 62
Query: 848 VKGLRGFLGLTGYYRRFVRNYSKLAQPLNQLLKKNNFSWSAGATQAFDKLKEIMTTVPVL 907
LRGFLGLTGYYRRFV+NY K+ +PL +LLKKN+ W+ A AF LK +TT+PVL
Sbjct: 63 TTELRGFLGLTGYYRRFVKNYGKIVRPLTELLKKNSLKWTEMAALAFKALKGAVTTLPVL 122
Query: 908 AVPDFQKTFV 917
A+PD + FV
Sbjct: 123 ALPDLKLPFV 132
>POL_SOCMV (P15629) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 692
Score = 149 bits (375), Expect = 8e-35
Identities = 166/631 (26%), Positives = 286/631 (45%), Gaps = 59/631 (9%)
Query: 423 VKGKIGEREILILVDCGATSNFISQELVAELEIPVVATSEYVVEVGNGARERNSGVCKNL 482
+K IG+R L +D GAT F +++ EI + + + ++ N+
Sbjct: 22 IKVSIGKRNFLAYIDTGATLCFGKRKISNNWEI---LKQPKEIIIADKSKHYIREAISNV 78
Query: 483 KLEVQGIPIIQHFFILGLGGTELVLGMDWLASLGNIEANFQDLIIKWE-LNGQKMCMQGE 541
L+++ + L G +L++G ++L + + ++W+ LN K + +
Sbjct: 79 FLKIENKEFLIPIIYLHDSGLDLIIGNNFLKLYQPFIQRLETIELRWKNLNNPK---ESQ 135
Query: 542 PSFCKVAATWKSIKKTKHDEGEEYFLSYECSEEEPTANVTIPELWIKLLTEFP--EVFQE 599
K+ TK++ + F E+ TI E ++ +E P E +
Sbjct: 136 MISTKIL--------TKNEVLKLSFEKIHICLEKYLFFKTIEEQLEEVCSEHPLDETKNK 187
Query: 600 PKELPPKRATDHAILLQEGAPIPNIRPYRYPFYQKNEIEKLVKEMLAAGIIRHSTSPFSS 659
L R D LQE + N PY Q E ++ +++L G+IR S SP S+
Sbjct: 188 NGLLIEIRLKDP---LQE-INVTNRIPYTIRDVQ--EFKEECEDLLKKGLIRESQSPHSA 241
Query: 660 PAILVKK----KDGGWRFCVDYRALNKVTIPDKFPIPIIDELLDEIGTAEIFSKLDLKSG 715
PA V+ K G R ++Y+ +N+ TI D + +P D +L++I + FS LD KSG
Sbjct: 242 PAFYVENHNEIKRGKRRMVINYKKMNEATIGDSYKLPRKDFILEKIKGSLWFSSLDAKSG 301
Query: 716 YHQIRMREEDIPKTAFR-THEGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFF 774
Y+Q+R+ E P TAF + HYE+ VL FGL APS +Q M+Q L+ L L +
Sbjct: 302 YYQLRLHENTKPLTAFSCPPQKHYEWNVLSFGLKQAPSIYQRFMDQSLKG-LEHICLAYI 360
Query: 775 YDILIYS-NNVDLHKEHLREVLQVLRDNHLVANQKKCSFGQSELIYLGHVISKEG-VAAD 832
DILI++ + + H +R VLQ +++ ++ ++KK Q E+ YLG I G +
Sbjct: 361 DDILIFTKGSKEQHVNDVRIVLQRIKEKGIIISKKKSKLIQQEIEYLGLKIQGNGEIDLS 420
Query: 833 PSKIKDMLNWPLP-KDVKGLRGFLGLTGYYRR--FVRNYSKLAQPLNQLLK-KNNFSWSA 888
P + +L +P +D K ++ FLG Y F +N + + L + + KN + W
Sbjct: 421 PHTQEKILQFPDELEDRKQIQRFLGCINYIANEGFFKNLALERKHLQKKISVKNPWKWDT 480
Query: 889 GATQAFDKLKEIMTTVPVLAVPDFQKTFVLETDAS-----------GKGLGAV-LMQGGR 936
T+ +K + ++P L Q ++ETDAS KG + L + G
Sbjct: 481 IDTKMVQSIKGKIQSLPKLYNASIQDFLIVETDASQHSWSGCLRALPKGKQKIGLDEFGI 540
Query: 937 PVAYMSKTLSERAQAKSVYE--------RELMAVVLAVQKWRHYLLGCKFI--VHTDQKS 986
P A + S + S E ++ V ++K + LL CK++ TD ++
Sbjct: 541 PTADLCTGSSSASSDNSPAEIDKCHSASKQDTHVASKIKKLENELLLCKYVSGTFTDTET 600
Query: 987 LRFLAEQRLMG--EEQQKWVSKLMGFDFEIK 1015
+AE ++ + +KW L+ F ++
Sbjct: 601 RYPIAELEVLAGVKVLEKWRIDLLQTRFLLR 631
Score = 37.7 bits (86), Expect = 0.25
Identities = 23/96 (23%), Positives = 44/96 (44%), Gaps = 4/96 (4%)
Query: 940 YMSKTLSERAQAKSVYERELMAVVLAVQKWRHYLLGCKFIVHTDQKSLRFLAEQRLMGEE 999
Y+S T ++ + E E++A V ++KWR LL +F++ TD K + +
Sbjct: 590 YVSGTFTDTETRYPIAELEVLAGVKVLEKWRIDLLQTRFLLRTDSKYFAGFCRYNIKTDY 649
Query: 1000 QQ----KWVSKLMGFDFEIKYKPGIENKAADALSRK 1031
+ +W +L + ++ N AD L+R+
Sbjct: 650 RNGRLIRWQLRLQAYQPYVELIKSENNPFADTLTRE 685
>POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat
protein; Protease (EC 3.4.23.-); Reverse transcriptase
(EC 2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1675
Score = 138 bits (347), Expect = 1e-31
Identities = 119/413 (28%), Positives = 195/413 (46%), Gaps = 34/413 (8%)
Query: 634 KNEIEKLVKEMLAAGII-------RHSTSPF----SSPAILVKKKDGGWRFCVDYRALNK 682
K EK +KE+L +I RH T+ F S + K R +Y+ LN
Sbjct: 1196 KEVFEKQIKELLDNKLIKKADPTCRHRTAAFIVRNHSEEVAQKP-----RIVYNYKRLND 1250
Query: 683 VTIPDKFPIPIIDELLDEIGTAEIFSKLDLKSGYHQIRMREEDIPKTAFRTHEGHYEYLV 742
D F IP +++ I A IFSK DLK+G+H ++++++ T F EG Y + V
Sbjct: 1251 NMHTDPFNIPHKISMINLIQKANIFSKFDLKAGFHHMKLKDDFKDWTTFTCSEGLYTWNV 1310
Query: 743 LPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFYDILIYSNNVDLHKEHLREVLQVLRDNH 802
PFG+ NAP FQ M + KF L++ DILI SNN H EHL+ +++
Sbjct: 1311 CPFGIANAPCAFQRFMQESFGDL--KFALLYIDDILIASNNEKEHIEHLKIFFNRVKEVG 1368
Query: 803 LVANQKKCSFGQSELIYLGHVISKEGVAADPSKIKDMLNWPLPK--DVKGLRGFLGLTGY 860
V ++KK E+ YLG I + ++ P + + + K +KGL+ +LGL Y
Sbjct: 1369 CVLSKKKSKMFLKEVEYLGVEIKEGKISLQPHIVDKIKKFDKNKLNTLKGLQAYLGLLNY 1428
Query: 861 YRRFVRNYSKLAQPLNQLLKKNNFS-WSAGATQAFDKLKEIMTTVPVLAVPDFQKTFVLE 919
R ++++ SKL PL + KN ++ K++ ++ + L P ++E
Sbjct: 1429 ARGYIKDLSKLVGPLYKKTGKNGQRIFNKEDWNIIFKIEREVSKIKPLERPKETDYIIIE 1488
Query: 920 TDASGKGLGAVLM---------QGGRPVAYMSKTLSERAQAKSVYERELMAVVLAVQKWR 970
TDAS +G GAVL+ + Y S E+ S+ + E+ A+ A+ K++
Sbjct: 1489 TDASEEGWGAVLVCKPDKYSGKDTEKIAGYASGNFGEKKTWTSL-DYEIEAINEALNKFQ 1547
Query: 971 HYLLGCKFIVHTDQKSL-RFLAEQRLMGEEQQKWVSKLMGFDFEIKYKPGIEN 1022
Y L F + TD +++ + + + + +W+ KL + YKP E+
Sbjct: 1548 IY-LDKDFTIRTDCEAIVKGIKTEDYKKRSKTRWI-KLRDNLLKDGYKPTFEH 1598
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.318 0.136 0.403
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 189,802,723
Number of Sequences: 164201
Number of extensions: 8684757
Number of successful extensions: 28782
Number of sequences better than 10.0: 196
Number of HSP's better than 10.0 without gapping: 105
Number of HSP's successfully gapped in prelim test: 94
Number of HSP's that attempted gapping in prelim test: 28141
Number of HSP's gapped (non-prelim): 374
length of query: 1555
length of database: 59,974,054
effective HSP length: 124
effective length of query: 1431
effective length of database: 39,613,130
effective search space: 56686389030
effective search space used: 56686389030
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 73 (32.7 bits)
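The length figures in the statistics block above are consistent with the usual edge-effect correction, in which the effective HSP length is subtracted once from the query and once per database sequence. The following is a minimal sketch that re-derives the reported values under that assumption; the function name `effective_lengths` is hypothetical and is not part of BLAST.

```python
# Values copied from the statistics block of this report.
QUERY_LENGTH = 1555            # "length of query"
DB_LETTERS = 59_974_054        # "length of database"
DB_SEQUENCES = 164_201         # "Number of Sequences"
EFFECTIVE_HSP_LENGTH = 124     # "effective HSP length"


def effective_lengths(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective database length, search space).

    Assumes the standard correction: the HSP length is subtracted once from
    the query and once for every sequence in the database.
    """
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


if __name__ == "__main__":
    eff_query, eff_db, space = effective_lengths(
        QUERY_LENGTH, DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH
    )
    print("effective length of query:   ", eff_query)
    print("effective length of database:", eff_db)
    print("effective search space:      ", space)
```

Under this assumption the three computed values agree with the "effective length of query", "effective length of database", and "effective search space" lines reported above.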
Lotus: description of TM0045.8