
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0029a.5
(1393 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 502 e-141
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 499 e-140
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 498 e-140
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 364 1e-99
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 356 2e-97
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 349 3e-95
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 336 2e-91
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 329 4e-89
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 267 2e-70
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 231 1e-59
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 230 2e-59
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 230 2e-59
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 229 5e-59
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 227 2e-58
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot... 216 3e-55
POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;... 193 2e-48
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot... 190 2e-47
POL_SOCMV (P15629) Enzymatic polyprotein [Contains: Aspartic pro... 170 2e-41
M860_ARATH (P92523) Hypothetical mitochondrial protein AtMg00860... 158 9e-38
POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat pr... 154 1e-36
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 502 bits (1292), Expect = e-141
Identities = 339/1037 (32%), Positives = 528/1037 (50%), Gaps = 76/1037 (7%)
Query: 306 LIDCGATSNFISQDLVVELEIPVIATSEYVVEVGNGAKERNSGVCKNLKLEVQGISIMQH 365
LID GA +N I+++ V ++P S+ V+ G + N K L + + GISI
Sbjct: 269 LIDTGAQANIITEETVRAHKLPTRPWSKSVIYGGVYPNKINRKTIK-LNISLNGISIKTE 327
Query: 366 FFILGLGGTEVVLGMDWLASLGNIEANFQELIIQWVSQGQKMVLQGEPSVCRVTANWKSI 425
F ++ + L NIE + + + ++ K
Sbjct: 328 FLVVKKFSHPAAISFTTLYD-NNIEISSSKHTLSQMN--------------------KVS 366
Query: 426 KITEQQEAEGYYLSYEYQKEEEKTEAEVPEGMRKILEEYPEVFQEPKGLPPRRTTDHAIQ 485
I ++ E Y ++ E TE ++P+ + K LE E+ QE LP
Sbjct: 367 NIVKEPELPDIYKEFKDITAETNTE-KLPKPI-KGLEFEVELTQENYRLP---------- 414
Query: 486 LQEGASIPNIRPYRYPFYQKNEIEKLVKEMLNSGIIRHSTSPFSSPAILVKKKDGGWRFC 545
IR Y P + + + + L SGIIR S + + P + V KK+G R
Sbjct: 415 ---------IRNYPLPPGKMQAMNDEINQGLKSGIIRESKAINACPVMFVPKKEGTLRMV 465
Query: 546 VDYRALNKATIPDKFPIPIIDELLDEIGAAVVFSKLDLKSGYHQIRMKEEDIPKTAFRTH 605
VDY+ LNK P+ +P+P+I++LL +I + +F+KLDLKS YH IR+++ D K AFR
Sbjct: 466 VDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCP 525
Query: 606 EGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFDDILIYSKNEELHKDHLRIV 665
G +EYLV+P+G++ AP+ FQ +N +L V+ + DDILI+SK+E H H++ V
Sbjct: 526 RGVFEYLVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDV 585
Query: 666 LQVLKENNLVANQKKCSFGQPEIIYLGHVISQAGVAADPSKIKDMLDWPIPKEVKGLRGF 725
LQ LK NL+ NQ KC F Q ++ ++G+ IS+ G I +L W PK K LR F
Sbjct: 586 LQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQF 645
Query: 726 LGLTGYYRRFVKNYSKLAQPLNQLLKKN-SFQWTEGATQAFVKLKEVMTTVPVLVPPNFD 784
LG Y R+F+ S+L PLN LLKK+ ++WT TQA +K+ + + PVL +F
Sbjct: 646 LGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFS 705
Query: 785 KPFILETDASGKGLGAVLMQEG-----RPVAYMSKTLSDRAQAKSVYERELMAVVLAVQK 839
K +LETDAS +GAVL Q+ PV Y S +S SV ++E++A++ +++
Sbjct: 706 KKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKH 765
Query: 840 WRHYLLGS--KFVIHTDQRSL--RFLADQRIMGEEQQKWMSKLMGYDFEIKYKPGIENKA 895
WRHYL + F I TD R+L R + + +W L ++FEI Y+PG N
Sbjct: 766 WRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHI 825
Query: 896 ADALSRKL----------QFSAISSV-QCAEWADLEAEILEDERYRKVLQELATQGNSAV 944
ADALSR + + ++I+ V Q + D + +++ + L L + V
Sbjct: 826 ADALSRIVDETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRV 885
Query: 945 --GYQLKRGRLL-YKDRIVLPKGSTKILTVLKEFHDTALGGHAGIFRTYKRISALFYWEG 1001
QLK G L+ KD+I+LP + T++K++H+ H GI I F W+G
Sbjct: 886 EENIQLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKG 945
Query: 1002 MKLDIQNYVQKCEVCQRNKYEALNPAGFLQPLPIPSQGWTDISMDFIGGLPKAMGKDTIL 1061
++ IQ YVQ C CQ NK P G LQP+P + W +SMDFI LP++ G + +
Sbjct: 946 IRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALF 1005
Query: 1062 VVVDRFTKYAHFIALSHPYNAKEIAEVFIKEVVRLHGFPTSIVSDRDRVFLSTFWSEMFK 1121
VVVDRF+K A + + A++ A +F + V+ G P I++D D +F S W +
Sbjct: 1006 VVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAH 1065
Query: 1122 LAGTKLKFSSAYHPQTDGQTEVVNRCVETYLRCVTGSKPKQWPKWLSWAEFWYNTNYHSA 1181
+KFS Y PQTDGQTE N+ VE LRCV + P W +S + YN HSA
Sbjct: 1066 KYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSA 1125
Query: 1182 IKTTPFKALYGREPPVIFKGNDSLTSVDEVEKLTAERNLILEELKSNLEKAQNRMRQQAN 1241
+ TPF+ ++ P + S + D+ ++ + E + + +K +L +M++ +
Sbjct: 1126 TQMTPFEIVHRYSPALSPLELPSFS--DKTDENSQETIQVFQTVKEHLNTNNIKMKKYFD 1183
Query: 1242 KHRRDV-QYEVGDLVYLKIQPYKLKSLAKRSNQKLSPRYYGPYPIIAKINPAAYKLQLPE 1300
+++ +++ GDLV +K + K+ + KL+P + GP+ ++ K P Y+L LP+
Sbjct: 1184 MKIQEIEEFQPGDLVMVK----RTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPD 1239
Query: 1301 G--SQVHPVFHISLLKK 1315
FH+S L+K
Sbjct: 1240 SIKHMFSSTFHVSHLEK 1256
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 499 bits (1285), Expect = e-140
Identities = 338/1037 (32%), Positives = 528/1037 (50%), Gaps = 76/1037 (7%)
Query: 306 LIDCGATSNFISQDLVVELEIPVIATSEYVVEVGNGAKERNSGVCKNLKLEVQGISIMQH 365
LID GA +N I+++ V ++P S+ V+ G + N K L + + GISI
Sbjct: 269 LIDTGAQANIITEETVRAHKLPTRPWSKSVIYGGVYPNKINRKTIK-LNISLNGISIKTE 327
Query: 366 FFILGLGGTEVVLGMDWLASLGNIEANFQELIIQWVSQGQKMVLQGEPSVCRVTANWKSI 425
F ++ + L NIE + + + ++ K
Sbjct: 328 FLVVKKFSHPAAISFTTLYD-NNIEISSSKHTLSQMN--------------------KVS 366
Query: 426 KITEQQEAEGYYLSYEYQKEEEKTEAEVPEGMRKILEEYPEVFQEPKGLPPRRTTDHAIQ 485
I ++ E Y ++ E TE ++P+ + K LE E+ QE LP
Sbjct: 367 NIVKEPELPDIYKEFKDITAETNTE-KLPKPI-KGLEFEVELTQENYRLP---------- 414
Query: 486 LQEGASIPNIRPYRYPFYQKNEIEKLVKEMLNSGIIRHSTSPFSSPAILVKKKDGGWRFC 545
IR Y P + + + + L SGIIR S + + P + V KK+G R
Sbjct: 415 ---------IRNYPLPPGKMQAMNDEINQGLKSGIIRESKAINACPVMFVPKKEGTLRMV 465
Query: 546 VDYRALNKATIPDKFPIPIIDELLDEIGAAVVFSKLDLKSGYHQIRMKEEDIPKTAFRTH 605
VDY+ LNK P+ +P+P+I++LL +I + +F+KLDLKS YH IR+++ D K AFR
Sbjct: 466 VDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCP 525
Query: 606 EGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFDDILIYSKNEELHKDHLRIV 665
G +EYLV+P+G++ AP+ FQ +N +L V+ + D+ILI+SK+E H H++ V
Sbjct: 526 RGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDV 585
Query: 666 LQVLKENNLVANQKKCSFGQPEIIYLGHVISQAGVAADPSKIKDMLDWPIPKEVKGLRGF 725
LQ LK NL+ NQ KC F Q ++ ++G+ IS+ G I +L W PK K LR F
Sbjct: 586 LQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQF 645
Query: 726 LGLTGYYRRFVKNYSKLAQPLNQLLKKN-SFQWTEGATQAFVKLKEVMTTVPVLVPPNFD 784
LG Y R+F+ S+L PLN LLKK+ ++WT TQA +K+ + + PVL +F
Sbjct: 646 LGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFS 705
Query: 785 KPFILETDASGKGLGAVLMQEG-----RPVAYMSKTLSDRAQAKSVYERELMAVVLAVQK 839
K +LETDAS +GAVL Q+ PV Y S +S SV ++E++A++ +++
Sbjct: 706 KKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKH 765
Query: 840 WRHYLLGS--KFVIHTDQRSL--RFLADQRIMGEEQQKWMSKLMGYDFEIKYKPGIENKA 895
WRHYL + F I TD R+L R + + +W L ++FEI Y+PG N
Sbjct: 766 WRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHI 825
Query: 896 ADALSRKL----------QFSAISSV-QCAEWADLEAEILEDERYRKVLQELATQGNSAV 944
ADALSR + + ++I+ V Q + D + +++ + L L + V
Sbjct: 826 ADALSRIVDETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRV 885
Query: 945 --GYQLKRGRLL-YKDRIVLPKGSTKILTVLKEFHDTALGGHAGIFRTYKRISALFYWEG 1001
QLK G L+ KD+I+LP + T++K++H+ H GI I F W+G
Sbjct: 886 EENIQLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKG 945
Query: 1002 MKLDIQNYVQKCEVCQRNKYEALNPAGFLQPLPIPSQGWTDISMDFIGGLPKAMGKDTIL 1061
++ IQ YVQ C CQ NK P G LQP+P + W +SMDFI LP++ G + +
Sbjct: 946 IRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALF 1005
Query: 1062 VVVDRFTKYAHFIALSHPYNAKEIAEVFIKEVVRLHGFPTSIVSDRDRVFLSTFWSEMFK 1121
VVVDRF+K A + + A++ A +F + V+ G P I++D D +F S W +
Sbjct: 1006 VVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAH 1065
Query: 1122 LAGTKLKFSSAYHPQTDGQTEVVNRCVETYLRCVTGSKPKQWPKWLSWAEFWYNTNYHSA 1181
+KFS Y PQTDGQTE N+ VE LRCV + P W +S + YN HSA
Sbjct: 1066 KYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSA 1125
Query: 1182 IKTTPFKALYGREPPVIFKGNDSLTSVDEVEKLTAERNLILEELKSNLEKAQNRMRQQAN 1241
+ TPF+ ++ P + S + D+ ++ + E + + +K +L +M++ +
Sbjct: 1126 TQMTPFEIVHRYSPALSPLELPSFS--DKTDENSQETIQVFQTVKEHLNTNNIKMKKYFD 1183
Query: 1242 KHRRDV-QYEVGDLVYLKIQPYKLKSLAKRSNQKLSPRYYGPYPIIAKINPAAYKLQLPE 1300
+++ +++ GDLV +K + K+ + KL+P + GP+ ++ K P Y+L LP+
Sbjct: 1184 MKIQEIEEFQPGDLVMVK----RTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPD 1239
Query: 1301 G--SQVHPVFHISLLKK 1315
FH+S L+K
Sbjct: 1240 SIKHMFSSTFHVSHLEK 1256
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 498 bits (1281), Expect = e-140
Identities = 337/1037 (32%), Positives = 527/1037 (50%), Gaps = 76/1037 (7%)
Query: 306 LIDCGATSNFISQDLVVELEIPVIATSEYVVEVGNGAKERNSGVCKNLKLEVQGISIMQH 365
LID G +N I+++ V ++P S+ V+ G + N K L + + GISI
Sbjct: 269 LIDTGTQANIITEETVRAHKLPTRPWSKSVIYGGVYPNKINRKTIK-LNISLNGISIKTE 327
Query: 366 FFILGLGGTEVVLGMDWLASLGNIEANFQELIIQWVSQGQKMVLQGEPSVCRVTANWKSI 425
F ++ + L NIE + + + ++ K
Sbjct: 328 FLVVKKFSHPAAISFTTLYD-NNIEISSSKHTLSQMN--------------------KVS 366
Query: 426 KITEQQEAEGYYLSYEYQKEEEKTEAEVPEGMRKILEEYPEVFQEPKGLPPRRTTDHAIQ 485
I ++ E Y ++ E TE ++P+ + K LE E+ QE LP
Sbjct: 367 NIVKEPELPDIYKEFKDITAETNTE-KLPKPI-KGLEFEVELTQENYRLP---------- 414
Query: 486 LQEGASIPNIRPYRYPFYQKNEIEKLVKEMLNSGIIRHSTSPFSSPAILVKKKDGGWRFC 545
IR Y P + + + + L SGIIR S + + P + V KK+G R
Sbjct: 415 ---------IRNYPLPPGKMQAMNDEINQGLKSGIIRESKAINACPVMFVPKKEGTLRMV 465
Query: 546 VDYRALNKATIPDKFPIPIIDELLDEIGAAVVFSKLDLKSGYHQIRMKEEDIPKTAFRTH 605
VDY+ LNK P+ +P+P+I++LL +I + +F+KLDLKS YH IR+++ D K AFR
Sbjct: 466 VDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCP 525
Query: 606 EGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFDDILIYSKNEELHKDHLRIV 665
G +EYLV+P+G++ AP+ FQ +N +L V+ + D+ILI+SK+E H H++ V
Sbjct: 526 RGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDV 585
Query: 666 LQVLKENNLVANQKKCSFGQPEIIYLGHVISQAGVAADPSKIKDMLDWPIPKEVKGLRGF 725
LQ LK NL+ NQ KC F Q ++ ++G+ IS+ G I +L W PK K LR F
Sbjct: 586 LQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQF 645
Query: 726 LGLTGYYRRFVKNYSKLAQPLNQLLKKN-SFQWTEGATQAFVKLKEVMTTVPVLVPPNFD 784
LG Y R+F+ S+L PLN LLKK+ ++WT TQA +K+ + + PVL +F
Sbjct: 646 LGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFS 705
Query: 785 KPFILETDASGKGLGAVLMQEG-----RPVAYMSKTLSDRAQAKSVYERELMAVVLAVQK 839
K +LETDAS +GAVL Q+ PV Y S +S SV ++E++A++ +++
Sbjct: 706 KKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKH 765
Query: 840 WRHYLLGS--KFVIHTDQRSL--RFLADQRIMGEEQQKWMSKLMGYDFEIKYKPGIENKA 895
WRHYL + F I TD R+L R + + +W L ++FEI Y+PG N
Sbjct: 766 WRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHI 825
Query: 896 ADALSRKL----------QFSAISSV-QCAEWADLEAEILEDERYRKVLQELATQGNSAV 944
ADALSR + + ++I+ V Q + D + +++ + L L + V
Sbjct: 826 ADALSRIVDETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRV 885
Query: 945 --GYQLKRGRLL-YKDRIVLPKGSTKILTVLKEFHDTALGGHAGIFRTYKRISALFYWEG 1001
QLK G L+ KD+I+LP + T++K++H+ H GI I F W+G
Sbjct: 886 EENIQLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKG 945
Query: 1002 MKLDIQNYVQKCEVCQRNKYEALNPAGFLQPLPIPSQGWTDISMDFIGGLPKAMGKDTIL 1061
++ IQ YVQ C CQ NK P G LQP+P + W +SMDFI LP++ G + +
Sbjct: 946 IRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALF 1005
Query: 1062 VVVDRFTKYAHFIALSHPYNAKEIAEVFIKEVVRLHGFPTSIVSDRDRVFLSTFWSEMFK 1121
VVVDRF+K A + + A++ A +F + V+ G P I++D D +F S W +
Sbjct: 1006 VVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAH 1065
Query: 1122 LAGTKLKFSSAYHPQTDGQTEVVNRCVETYLRCVTGSKPKQWPKWLSWAEFWYNTNYHSA 1181
+KFS Y PQTDGQTE N+ VE LRCV + P W +S + YN HSA
Sbjct: 1066 KYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSA 1125
Query: 1182 IKTTPFKALYGREPPVIFKGNDSLTSVDEVEKLTAERNLILEELKSNLEKAQNRMRQQAN 1241
+ TPF+ ++ P + S + D+ ++ + E + + +K +L +M++ +
Sbjct: 1126 TQMTPFEIVHRYSPALSPLELPSFS--DKTDENSQETIQVFQTVKEHLNTNNIKMKKYFD 1183
Query: 1242 KHRRDV-QYEVGDLVYLKIQPYKLKSLAKRSNQKLSPRYYGPYPIIAKINPAAYKLQLPE 1300
+++ +++ GDLV +K + K+ + KL+P + GP+ ++ K P Y+L LP+
Sbjct: 1184 MKIQEIEEFQPGDLVMVK----RTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPD 1239
Query: 1301 G--SQVHPVFHISLLKK 1315
FH+S L+K
Sbjct: 1240 SIKHMFSSTFHVSHLEK 1256
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 364 bits (934), Expect = 1e-99
Identities = 192/443 (43%), Positives = 280/443 (62%), Gaps = 12/443 (2%)
Query: 468 FQEPKGLPPRRTTDHAIQLQEGASIPNIRPYRYPFYQKNEIE--KLVKEMLNSGIIRHST 525
++E + L T H + + I + +YP Q +EIE V+EMLN G+IR S
Sbjct: 183 YKEGEKLTFTNTIKHVLNTTHNSPIYS---KQYPLAQTHEIEVENQVQEMLNQGLIRESN 239
Query: 526 SPFSSPAILVKKKDGG-----WRFCVDYRALNKATIPDKFPIPIIDELLDEIGAAVVFSK 580
SP++SP +V KK +R +DYR LN+ TIPD++PIP +DE+L ++G F+
Sbjct: 240 SPYNSPTWVVPKKPDASGANKYRVVIDYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTT 299
Query: 581 LDLKSGYHQIRMKEEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKF 640
+DL G+HQI M EE I KTAF T GHYEYL +PFGL NAP+TFQ MN +LRP L K
Sbjct: 300 IDLAKGFHQIEMDEESISKTAFSTKSGHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKH 359
Query: 641 VLVFFDDILIYSKNEELHKDHLRIVLQVLKENNLVANQKKCSFGQPEIIYLGHVISQAGV 700
LV+ DDI+I+S + H + +++V L + NL KC F + E +LGH+++ G+
Sbjct: 360 CLVYLDDIIIFSTSLTEHLNSIQLVFTKLADANLKLQLDKCEFLKKEANFLGHIVTPDGI 419
Query: 701 AADPSKIKDMLDWPIPKEVKGLRGFLGLTGYYRRFVKNYSKLAQPLNQLLKKNSFQWTEG 760
+P K+K ++ +PIP + K +R FLGLTGYYR+F+ NY+ +A+P+ LKK + T+
Sbjct: 420 KPNPIKVKAIVSYPIPTKDKEIRAFLGLTGYYRKFIPNYADIAKPMTSCLKKRTKIDTQK 479
Query: 761 A--TQAFVKLKEVMTTVPVLVPPNFDKPFILETDASGKGLGAVLMQEGRPVAYMSKTLSD 818
+AF KLK ++ P+L P+F+K F+L TDAS LGAVL Q G P++++S+TL+D
Sbjct: 480 LEYIEAFEKLKALIIRDPILQLPDFEKKFVLTTDASNLALGAVLSQNGHPISFISRTLND 539
Query: 819 RAQAKSVYERELMAVVLAVQKWRHYLLGSKFVIHTDQRSLRFLADQRIMGEEQQKWMSKL 878
S E+EL+A+V A + +RHYLLG +F+I +D + LR+L + + G + ++W +L
Sbjct: 540 HELNYSAIEKELLAIVWATKTFRHYLLGRQFLIASDHQPLRWLHNLKEPGAKLERWRVRL 599
Query: 879 MGYDFEIKYKPGIENKAADALSR 901
Y F+I Y G EN ADALSR
Sbjct: 600 SEYQFKIDYIKGKENSVADALSR 622
Score = 34.7 bits (78), Expect = 1.9
Identities = 62/314 (19%), Positives = 118/314 (36%), Gaps = 23/314 (7%)
Query: 971 VLKEFHDTALGGHAGIFRTYKRISALFYWEGMKLDIQNYVQKCEVCQRNKYEALNPAGFL 1030
++ + H+ L H GI + K ++ +L IQN + +C +C K E N L
Sbjct: 753 IILQSHEKLL--HPGIQKMTKLFKENHFFPNSQLLIQNIINECNICNLAKTEHRNTKMPL 810
Query: 1031 QPLPIPSQGWTDISMDFIGGLPKAMGKDTILVVVDRFTKYAHFIALSHPYNAKEIAEV-- 1088
+ P P F+ + + GK I +D ++K+A K+ E
Sbjct: 811 KITPNPEH----CREKFVVDIYSSEGKHYI-SCIDIYSKFATL----EQIKTKDWIECRN 861
Query: 1089 FIKEVVRLHGFPTSIVSDRDRVFLSTFWSEMFKLAGTKLKFSSAYHPQTDGQTEVVNRCV 1148
+ + G P + +DRD F S + +L+ ++A + D E +++ +
Sbjct: 862 ALMRIFNQLGKPKLLKADRDGAFSSLALKRWLEEEEVELQLNTAKNGVAD--VERLHKTI 919
Query: 1149 ETYLRCVTGSKPKQWPKWLSWAE---FWYNTNY-HSAIKTTPFKALYGREPPVIFKGNDS 1204
+R + S ++ LS E + YN H P + P++
Sbjct: 920 NEKIRIINSSDDEEVK--LSKIETILYTYNQKIKHDTTGQRPAQIFLYAGHPILDTQKIK 977
Query: 1205 LTSVDEVEKLTAERNLILEELKSNLEKA--QNRMRQQANKHRRDVQYEVGDLVYLKIQPY 1262
++++ + E N+ K L+K +N + N + D + Y
Sbjct: 978 EKKIEKINEDRREFNIDTNYRKGPLQKGKLENPFKPTKNVEQTDPDHYKITNRNRVTHYY 1037
Query: 1263 KLKSLAKRSNQKLS 1276
K + ++ N KLS
Sbjct: 1038 KTQFKKQKKNNKLS 1051
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 356 bits (914), Expect = 2e-97
Identities = 199/493 (40%), Positives = 297/493 (59%), Gaps = 19/493 (3%)
Query: 426 KITEQQEAEGYYLSYEYQKEEEKTEAEVPEGMRKILEEYPEV-FQEPKGLPPRRTTDHAI 484
KI+ E++ Y L + +E+++ A +L++Y ++ + E L T H I
Sbjct: 150 KISPILESDLYRLEHLNNEEKQRLCA--------LLQKYHDIQYHEGDKLTFTNQTKHTI 201
Query: 485 QLQEGASIPNIRPYRYPFYQKNEIEKLVKEMLNSGIIRHSTSPFSSPAILVKKKDGG--- 541
+ ++P Y YP + E+E +++MLN GIIR S SP++SP +V KK
Sbjct: 202 NTKH--NLPLYSKYSYPQAYEQEVESQIQDMLNQGIIRTSNSPYNSPIWVVPKKQDASGK 259
Query: 542 --WRFCVDYRALNKATIPDKFPIPIIDELLDEIGAAVVFSKLDLKSGYHQIRMKEEDIPK 599
+R +DYR LN+ T+ D+ PIP +DE+L ++G F+ +DL G+HQI M E + K
Sbjct: 260 QKFRIVIDYRKLNEITVGDRHPIPNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSK 319
Query: 600 TAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFDDILIYSKNEELHK 659
TAF T GHYEYL +PFGL NAP+TFQ MN +LRP L K LV+ DDI+++S + + H
Sbjct: 320 TAFSTKHGHYEYLRMPFGLKNAPATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHL 379
Query: 660 DHLRIVLQVLKENNLVANQKKCSFGQPEIIYLGHVISQAGVAADPSKIKDMLDWPIPKEV 719
L +V + L + NL KC F + E +LGHV++ G+ +P KI+ + +PIP +
Sbjct: 380 QSLGLVFEKLAKANLKLQLDKCEFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKP 439
Query: 720 KGLRGFLGLTGYYRRFVKNYSKLAQPLNQLLKKNSFQWTEGA--TQAFVKLKEVMTTVPV 777
K ++ FLGLTGYYR+F+ N++ +A+P+ + LKKN T AF KLK +++ P+
Sbjct: 440 KEIKAFLGLTGYYRKFIPNFADIAKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPI 499
Query: 778 LVPPNFDKPFILETDASGKGLGAVLMQEGRPVAYMSKTLSDRAQAKSVYERELMAVVLAV 837
L P+F K F L TDAS LGAVL Q+G P++Y+S+TL++ S E+EL+A+V A
Sbjct: 500 LKVPDFTKKFTLTTDASDVALGAVLSQDGHPLSYISRTLNEHEINYSTIEKELLAIVWAT 559
Query: 838 QKWRHYLLGSKFVIHTDQRSLRFLADQRIMGEEQQKWMSKLMGYDFEIKYKPGIENKAAD 897
+ +RHYLLG F I +D + L +L + + +W KL +DF+IKY G EN AD
Sbjct: 560 KTFRHYLLGRHFEISSDHQPLSWLYRMKDPNSKLTRWRVKLSEFDFDIKYIKGKENCVAD 619
Query: 898 ALSR-KLQFSAIS 909
ALSR KL+ + +S
Sbjct: 620 ALSRIKLEETYLS 632
Score = 35.4 bits (80), Expect = 1.1
Identities = 64/305 (20%), Positives = 114/305 (36%), Gaps = 42/305 (13%)
Query: 967 KILTVLKEFHDTALGGHA-----GIFRTYKRISALFYWEGMKLDIQNYVQKCEVCQRNKY 1021
K +T EF + L H GI +T K +Y+ +L IQN + +C +C K
Sbjct: 742 KNITTYAEFKELILTAHEKLLHPGIQKTTKLFGETYYFPNSQLLIQNIINECSICNLAKT 801
Query: 1022 EALNPAGFLQPLPIPSQGWTDISMDFIGGLPKAMGKDTILVVVDRFTKYAHFIALSHPYN 1081
E N + P P +D + + GK + +D ++K+A
Sbjct: 802 EHRNTDMPTKTTPKPEHCREKFMID----IYSSEGKHYV-SCIDIYSKFATL----EEIK 852
Query: 1082 AKEIAEV--FIKEVVRLHGFPTSIVSDRDRVFLSTFWSEMFKLAGTKLKFSSAYHPQTDG 1139
K+ E + + G P + +DRD F S + +L+ ++ D
Sbjct: 853 TKDWIECKNALMRIFNQLGKPKLLKADRDGAFSSLALKRWLESEEVELQLNTTKTGVAD- 911
Query: 1140 QTEVVNRCVETYLRCVTGSKPKQWPKWLSWAEFWYNTNYHSAIKTTPFKALYGREPPVIF 1199
E +++ + +R + S ++ LS E N H T G+ P IF
Sbjct: 912 -IERLHKTINEKIRIIKTSDDEETK--LSKMETVLNIYNHKTKHDTT-----GQTPAHIF 963
Query: 1200 KGNDSLTSVDEVEKLTAERNLILEELKSNLEKAQNRMRQQANKHRRDVQYEVGDLVYLKI 1259
L A + ++ + + N E N++ ++ D +Y G L K+
Sbjct: 964 --------------LYAGQPIL--DTQQNKENKINKINNDRVEYEVDTRYRKGPLQKGKL 1007
Query: 1260 Q-PYK 1263
+ P+K
Sbjct: 1008 ENPFK 1012
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 349 bits (896), Expect = 3e-95
Identities = 260/922 (28%), Positives = 444/922 (47%), Gaps = 103/922 (11%)
Query: 460 ILEEYPEVFQ-EPKGLPPRRTTDHAIQLQEGASIPNIRPYRYPFYQKNEIEKLVKEMLNS 518
++E++ +VF L T+ I+L+EGA +P P K EI K++++MLN
Sbjct: 909 VIEQFQDVFAISDDELGRNSGTECVIELKEGAEPIRQKPRPIPLALKPEIRKMIQKMLNQ 968
Query: 519 GIIRHSTSPFSSPAILVKKKDGGWRFCVDYRALNKATIPDKFPIPIIDELLDEIGAAVVF 578
+IR S SP+SSP +LVKKKDG R C+DYR +NK + P+P I+ L + ++
Sbjct: 969 KVIRESKSPWSSPVVLVKKKDGSIRMCIDYRKVNKVVKNNAHPLPNIEATLQSLAGKKLY 1028
Query: 579 SKLDLKSGYHQIRMKEEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLR 638
+ D+ +G+ QI + E+ TAF +E+ VLPFGL +P+ FQ M +++ L
Sbjct: 1029 TVFDMIAGFWQIPLDEKSKEITAFAIGSELFEWNVLPFGLVISPALFQGTMEEIIGDLLG 1088
Query: 639 KFVLVFFDDILIYSKNEELHKDHLRIVLQVLKENNLVANQKKCSFGQPEIIYLGHVISQA 698
V+ DD+LI SK+ E H ++ L ++++ + KC + E+ YLGH ++
Sbjct: 1089 VCAFVYVDDLLIASKDMEQHLQDVKEALTRIRKSGMKLRASKCHIAKKEVEYLGHKVTLD 1148
Query: 699 GVAADPSKIKDMLDWPIPKEVKGLRGFLGLTGYYRRFVKNYSKLAQPLNQLLK-KNSFQW 757
GV K M + P VK L+ FLGL GYYR+F+ N++++A L L+ K ++ W
Sbjct: 1149 GVETQEVKTDKMKQFSRPTNVKELQSFLGLVGYYRKFILNFAQIASSLTSLISAKVAWIW 1208
Query: 758 TEGATQAFVKLKEVMTTVPVLVPPNF------DKPFILETDASGKGLGAVLMQEG----- 806
+ AF +LK+++ PVL P+ D+PF++ TDAS KG+GAVL QEG
Sbjct: 1209 EKEQEIAFQELKKLVCQTPVLAQPDVEAALKGDRPFMIYTDASRKGIGAVLAQEGPDGQQ 1268
Query: 807 RPVAYMSKTLSDRAQAKSVYERELMAVVLAVQKWRHYLLGSKFVIHTDQRSLRFLADQRI 866
P+A+ SK LS + + E +A++ A+++++ + G+ + TD + L L
Sbjct: 1269 HPIAFASKALSPAETRYHITDLEALAMMFALRRFKTIIYGTAITVFTDHKPLISLLKGSP 1328
Query: 867 MGEEQQKWMSKLMGYDFEIKYKPGIENKAADALSR-------------KLQFSAISSVQC 913
+ + +W +++ +D +I Y G N ADALSR K S ++++Q
Sbjct: 1329 LADRLWRWSIEILEFDVKIVYLAGKANAVADALSRGGCPPNELEEEQTKELTSIVNAIQ- 1387
Query: 914 AEWAD-------LEAEILEDERYRKVL------------------QELATQGNSAVGYQL 948
E D LE EDE +++V+ E++ + VG L
Sbjct: 1388 TELPDILDSSCWLERLKGEDEGWKEVIAALEGGKTKGTFKIVGIESEISLEYYKIVGGVL 1447
Query: 949 KRGRLLYKDRIVLPKGSTKILT-VLKEFHDTALGGHAGIFRTYKRISALFYWEGMKLDIQ 1007
K + + R V+P+ KI T +LKE H+ L GH GI + ++ + FYW M++ ++
Sbjct: 1448 KNTEIEEQSRSVVPE---KIRTPLLKELHEGMLAGHFGIKKMWRMVHRKFYWPQMRVCVE 1504
Query: 1008 NYVQKCEVC-----QRNKYEALNPAGFLQPLPIPSQGWTDISMDFIGGLPKAMGKDTILV 1062
N V+ C C +L P PL I + D+ + G IL
Sbjct: 1505 NCVRTCAKCLCANDHSKLTSSLTPYRMTFPLEIVACDLMDVGL-------SVQGNRYILT 1557
Query: 1063 VVDRFTKYAHFIALSHPYNAKEIAEVFIKEVVRLHG-FPTSIVSDRDRVFLSTFWSEMFK 1121
++D FTKY + + A+ + + F++ G P +++D+ + F++ +++
Sbjct: 1558 IIDLFTKYGTAVPIPDK-KAETVLKAFVERWAIGEGRIPLKLLTDQGKEFVNGLFAQFTH 1616
Query: 1122 LAGTKLKFSSAYHPQTDGQTEVVNRCVETYLRCVTGSKPKQWPKWLSWAEFWYNTNYHSA 1181
+ + + Y+ + +G E N+ + ++ T + P +W + +A + YN H
Sbjct: 1617 MLKIEHITTKGYNSRANGAVERFNKTIMHIMKKKT-AVPMEWDDQVVYAVYAYNNCVHEN 1675
Query: 1182 IKTTPFKALYGRE--PPVIFKGNDSL----TSVDEVEKLTAERNLILEELKSNLEKAQNR 1235
TP ++GR+ P+ G D++ +DE + L + L +++ + K
Sbjct: 1676 TGETPMFLMHGRDVMGPLEMSGEDAVGINYADMDEYKHLLTQELLKVQK----IAKEHAM 1731
Query: 1236 MRQQANKHRRDVQY--------EVGDLVYLKIQPYKLKSLAKRSNQKLSPRYYGPYPII- 1286
Q++ K D +Y + G V L+I KL + KL ++ GPY +I
Sbjct: 1732 REQESYKSLFDQKYASKKHRFPQPGSRVLLEIPSEKLGAQC----PKLVNKWSGPYRVIS 1787
Query: 1287 -----AKINPAAYK----LQLP 1299
A+I P K LQ+P
Sbjct: 1788 CSENSAEITPVLGKRKHILQIP 1809
Score = 33.9 bits (76), Expect = 3.2
Identities = 38/144 (26%), Positives = 59/144 (40%), Gaps = 21/144 (14%)
Query: 226 CFKCGDKWGKEHICSMKNYQLILMEVEEDEEEEEIFEEAEDGEFVLEGKVLQLSLNSKEG 285
CF+C + C KN E E+E + E E V L + + K
Sbjct: 591 CFRCNEMGHIAWNCPKKN--------ENTSEKEAPVAKVETIEGVRMKDCLLMVKSEKSE 642
Query: 286 LTSNRSFKVKGKIGNREVLILIDCGATSNFISQDLVVELEIPVIATSEYVVEVGNGAKER 345
RS + KG+IG V IL+D GA+ + +S++ T E +VEV NG
Sbjct: 643 SEVTRSLE-KGQIGKANVEILLDSGASISLMSKN-----------TWEKIVEV-NGKSWE 689
Query: 346 NSGVCKNLKLEVQGISIMQHFFIL 369
+ + L+ + + Q F +L
Sbjct: 690 QDQIYEELEYKTARTANNQLFTLL 713
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 336 bits (862), Expect = 2e-91
Identities = 290/1077 (26%), Positives = 483/1077 (43%), Gaps = 177/1077 (16%)
Query: 294 VKGKIGNREVLILIDCGATSNFISQDLVVELEIPVIATSEYVVEVGNGAKE-RNSGVCKN 352
++ ++ R + +LID A N+I V EL+ + S + V +G+ E ++ + K
Sbjct: 15 IERRLAGRTLKMLIDTDAAKNYIRP--VKELKNVMPVASPFSVSSIHGSTEIKHKCLMKV 72
Query: 353 LKLEVQGISIMQHFFILGLGGTEVVLGMDWLASLGNIEANFQELIIQWVSQGQKMVLQGE 412
K I F + L + ++G+D L G ++ N E +++ +K+
Sbjct: 73 FK------HISPFFLLDSLNAFDAIIGLDLLTQAG-VKLNLAEDSLEYQGIAEKLHYFSC 125
Query: 413 PSVCRVTANWKSIKITEQQEAEGYYLSYEYQKEEEKTEAEVPEGMRKILEEYPEVFQEPK 472
PSV N + VP+ ++K ++ + + K
Sbjct: 126 PSVNFTDVN----------------------------DIVVPDSVKKEFKD--TIIRRKK 155
Query: 473 GLPPRRTTDHAIQLQEGASIPNIRPYRYPFYQK---------NEIEKLVKEMLNSGIIRH 523
TT+ A+ + P Y + + + VK++L GIIR
Sbjct: 156 AFS---TTNEALPFNTAVTATIRTVDNEPVYSRAYPTLMGVSDFVNNEVKQLLKDGIIRP 212
Query: 524 STSPFSSPAILVKKK------DGGWRFCVDYRALNKATIPDKFPIPIIDELLDEIGAAVV 577
S SP++SP +V KK + R +D+R LN+ TIPD++P+P I +L +G A
Sbjct: 213 SRSPYNSPTWVVDKKGTDAFGNPNKRLVIDFRKLNEKTIPDRYPMPSIPMILANLGKAKF 272
Query: 578 FSKLDLKSGYHQIRMKEEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQVLRPYL 637
F+ LDLKSGYHQI + E D KT+F + G YE+ LPFGL NA S FQ ++ VLR +
Sbjct: 273 FTTLDLKSGYHQIYLAEHDREKTSFSVNGGKYEFCRLPFGLRNASSIFQRALDDVLREQI 332
Query: 638 RKFVLVFFDDILIYSKNEELHKDHLRIVLQVLKENNLVANQKKCSFGQPEIIYLGHVISQ 697
K V+ DD++I+S+NE H H+ VL+ L + N+ +Q+K F + + YLG ++S+
Sbjct: 333 GKICYVYVDDVIIFSENESDHVRHIDTVLKCLIDANMRVSQEKTRFFKESVEYLGFIVSK 392
Query: 698 AGVAADPSKIKDMLDWPIPKEVKGLRGFLGLTGYYRRFVKNYSKLAQPLNQLL------- 750
G +DP K+K + ++P P V +R FLGL YYR F+K+++ +A+P+ +L
Sbjct: 393 DGTKSDPEKVKAIQEYPEPDCVYKVRSFLGLASYYRVFIKDFAAIARPITDILKGENGSV 452
Query: 751 -----KKNSFQWTEGATQAFVKLKEVMTTVPVLVP-PNFDKPFILETDASGKGLGAVLMQ 804
KK ++ E AF +L+ ++ + V++ P+F KPF L TDAS G+GAVL Q
Sbjct: 453 SKHMSKKIPVEFNETQRNAFQRLRNILASEDVILKYPDFKKPFDLTTDASASGIGAVLSQ 512
Query: 805 EGRPVAYMSKTLSDRAQAKSVYERELMAVVLAVQKWRHYLLGSKFV-IHTDQRSLRFLAD 863
EGRP+ +S+TL Q + EREL+A+V A+ K +++L GS+ + I TD + L F
Sbjct: 513 EGRPITMISRTLKQPEQNYATNERELLAIVWALGKLQNFLYGSREINIFTDHQPLTFAVA 572
Query: 864 QRIMGEEQQKWMSKLMGYDFEIKYKPGIENKAADALSRKLQFSAISSVQCAEWADLEAEI 923
R + ++W S + ++ ++ YKPG EN ADALSR+ +A+ + ++ A + +E+
Sbjct: 573 DRNTNAKIKRWKSYIDQHNAKVFYKPGKENFVADALSRQ-NLNALQNEPQSDAATIHSEL 631
Query: 924 LEDERYRKVLQELATQGN----SAVGYQLKRGRLLYKDR---IVLPKGSTKILTVLKEF- 975
+ L N A + LKR +L++ + ++ + +L LKE
Sbjct: 632 SLTYTVETTDKPLNCFRNQIILEAARFPLKRNLVLFRSKSRHLISFTDKSWLLKTLKEVV 691
Query: 976 -------------------HDTALGGHAGIFR---------------------------- 988
HD A FR
Sbjct: 692 NPDVVNAIHCDLPTLASFQHDLIAHFPATQFRHCKNVVLDITDKNEQIEIVTAEHNRAHR 751
Query: 989 ----TYKRISALFYWEGMKLDIQNYVQKCEVCQRNKYEALNPAGFLQPLPIPSQGWTDIS 1044
K++ +Y+ M + V C VC + KY+ L PIPS +
Sbjct: 752 AAQENIKQVLRDYYFPKMGSLAKEVVANCRVCTQAKYDRHPKKQELGETPIPSYTGEMVH 811
Query: 1045 MDFIGGLPKAMGKDTILVVVDRFTKYAHFIALSHPYNAKEIAEVFIKEVVRLHGFPT--S 1102
+D + + L +D+F+KY A+ P ++ I ++ + ++ FP +
Sbjct: 812 IDIF-----STDRKLFLTCIDKFSKY----AIVQPVVSRTIVDITAPLLQIINLFPNIKT 862
Query: 1103 IVSDRDRVFLSTFWSEMFKLA-GTKLKFSSAYHPQTDGQTEVVNRCVETYLRCV-TGSKP 1160
+ D + F S + M K + G + + H ++GQ E + + RC+ K
Sbjct: 863 VYCDNEPAFNSETVTSMLKNSFGIDIVNAPPLHSSSNGQVERFHSTLAEIARCLKLDKKT 922
Query: 1161 KQWPKWLSWAEFWYNTNYHSAIKTTPFKALYGREPPVIFKGNDSLTSVDEVEKLTAERNL 1220
+ + A YN HS + P ++ V ER L
Sbjct: 923 NDTVELILRATIEYNKTVHSVTRERP---------------------IEVVHPGAHERCL 961
Query: 1221 ILEELKSNLEKAQNRMRQQANKHRRDVQYEVGDLVYLKIQPYKLKSLAKRSNQKLSP 1277
E+K+ L KAQ + N R++ +EVG+ V++K KR KL+P
Sbjct: 962 ---EIKARLVKAQQDSIGRNNPSRQNRVFEVGERVFVKNN--------KRLGNKLTP 1007
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 329 bits (843), Expect = 4e-89
Identities = 182/478 (38%), Positives = 283/478 (59%), Gaps = 28/478 (5%)
Query: 451 AEVPEGMRKILE----EYPEVFQEP-KGLPPRRTTDHAIQLQEGASIPNIRPYRYPFYQK 505
AE P+G ++IL E+P +F+ P G+ I+ I + Y YP +
Sbjct: 78 AEHPDGTQEILNSLLGEFPRIFEPPLSGMSVETAVKAEIRTNTQDPI-YAKSYPYPVNMR 136
Query: 506 NEIEKLVKEMLNSGIIRHSTSPFSSPAILVKKK-----DGGWRFCVDYRALNKATIPDKF 560
E+E+ + E+L GIIR S SP++SP +V KK + +R VD++ LN TIPD +
Sbjct: 137 GEVERQIDELLQDGIIRPSNSPYNSPIWIVPKKPKPNGEKQYRMVVDFKRLNTVTIPDTY 196
Query: 561 PIPIIDELLDEIGAAVVFSKLDLKSGYHQIRMKEEDIPKTAFRTHEGHYEYLVLPFGLTN 620
PIP I+ L +G A F+ LDL SG+HQI MKE DIPKTAF T G YE+L LPFGL N
Sbjct: 197 PIPDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPFGLKN 256
Query: 621 APSTFQALMNQVLRPYLRKFVLVFFDDILIYSKNEELHKDHLRIVLQVLKENNLVANQKK 680
AP+ FQ +++ +LR ++ K V+ DDI+++S++ + H +LR+VL L + NL N +K
Sbjct: 257 APAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNLEK 316
Query: 681 CSFGQPEIIYLGHVISQAGVAADPSKIKDMLDWPIPKEVKGLRGFLGLTGYYRRFVKNYS 740
F ++ +LG++++ G+ ADP K++ + + P P VK L+ FLG+T YYR+F+++Y+
Sbjct: 317 SHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYRKFIQDYA 376
Query: 741 KLAQPLNQLLK------------KNSFQWTEGATQAFVKLKEVMTTVPVLVPPNFDKPFI 788
K+A+PL L + K E A Q+F LK ++ + +L P F KPF
Sbjct: 377 KVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKPFH 436
Query: 789 LETDASGKGLGAVLMQE----GRPVAYMSKTLSDRAQAKSVYERELMAVVLAVQKWRHYL 844
L TDAS +GAVL Q+ RP+AY+S++L+ + + E+E++A++ ++ R YL
Sbjct: 437 LTTDASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLDNLRAYL 496
Query: 845 LGSKFV-IHTDQRSLRFLADQRIMGEEQQKWMSKLMGYDFEIKYKPGIENKAADALSR 901
G+ + ++TD + L F R + ++W +++ Y+ E+ YKPG N ADALSR
Sbjct: 497 YGAGTIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELIYKPGKSNVVADALSR 554
Score = 78.2 bits (191), Expect = 1e-13
Identities = 85/334 (25%), Positives = 138/334 (40%), Gaps = 37/334 (11%)
Query: 953 LLYKDRIVLP-----KGSTKILTVLKEFHDTALGGHAGIFRTYKRISALFYWEGMKLDIQ 1007
LLYK RI G+ +I ++++ H A H G ++ +Y+ M I+
Sbjct: 673 LLYKIRITQRLVADVSGAEEICEIIEKEHRRA---HRGPTEIRLQLLEKYYFPRMSSTIR 729
Query: 1008 NYVQKCEVCQRNKYEALNPAGFLQPLPIPSQGWTDISMDFIGGLPKAMGKDTILVVVDRF 1067
C+ C+ KYE LQP PIP+ + +D A+ K L +D+F
Sbjct: 730 LQTSSCQCCKLYKYERHPNKPNLQPTPIPNYPCEILHIDIF-----ALEKRLYLSCIDKF 784
Query: 1068 TKYAHFIALSHPYNAKEIAEVFIKE--VVRLHGF--PTSIVSDRDRVFLSTFWSEMFKLA 1123
+K+A ++ + A V ++E V LH F P +VSD +R L +
Sbjct: 785 SKFAKL------FHLQSKASVHLRETLVEALHYFTAPKVLVSDNERGLLCPTVLNYLRSL 838
Query: 1124 GTKLKFSSAYHPQTDGQTEVVNRCVETYLRCVTGSKPKQWP-KWLSWAEFWYNTNYHSAI 1182
L ++ + +GQ E + RC+ P P + + A YNT+ HS
Sbjct: 839 DIDLYYAPTQKSEVNGQVERFHSTFLEIYRCLKDELPTFKPVELVHIAVDRYNTSVHSVT 898
Query: 1183 KTTPFKALYGREPPVIFKGNDSLTSVDEVEKLTAERNLILEELKSNLEKAQNRMRQQANK 1242
P + R V ++G LT R LE++K +E Q R NK
Sbjct: 899 NRKPADVFFDRSSRVNYQG------------LTDFRRQTLEDIKGLIEYKQIRGNMARNK 946
Query: 1243 HRRDVQ-YEVGDLVYLKIQPYKLKSLAKRSNQKL 1275
+R + + Y GD V++ + K K A+ +K+
Sbjct: 947 NRDEPKSYGPGDEVFVANKQIKTKEKARFRCEKV 980
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 267 bits (682), Expect = 2e-70
Identities = 154/457 (33%), Positives = 242/457 (52%), Gaps = 13/457 (2%)
Query: 457 MRKILEEYPEVFQ-EPKGLPPRRTTDHAIQLQEGASIPNIRPYRYPFYQKNEIEKLVKEM 515
+ I EY ++F E + + ++L++ + + YR P Q EI+ V+++
Sbjct: 279 LENICSEYIDIFALESEPITVNNLYKQQLRLKDDEPVYT-KNYRSPHSQVEEIQAQVQKL 337
Query: 516 LNSGIIRHSTSPFSSPAILVKKKDGG------WRFCVDYRALNKATIPDKFPIPIIDELL 569
+ I+ S S ++SP +LV KK WR +DYR +NK + DKFP+P ID++L
Sbjct: 338 IKDKIVEPSVSQYNSPLLLVPKKSSPNSDKKKWRLVIDYRQINKKLLADKFPLPRIDDIL 397
Query: 570 DEIGAAVVFSKLDLKSGYHQIRMKEEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALM 629
D++G A FS LDL SG+HQI + E T+F T G Y + LPFGL AP++FQ +M
Sbjct: 398 DQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTSNGSYRFTRLPFGLKIAPNSFQRMM 457
Query: 630 NQVLRPYLRKFVLVFFDDILIYSKNEELHKDHLRIVLQVLKENNLVANQKKCSFGQPEII 689
++ DD+++ +E+ +L V +E NL + +KCSF E+
Sbjct: 458 TIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEVFGKCREYNLKLHPEKCSFFMHEVT 517
Query: 690 YLGHVISQAGVAADPSKIKDMLDWPIPKEVKGLRGFLGLTGYYRRFVKNYSKLAQPLNQL 749
+LGH + G+ D K + ++P+P + R F+ YYRRF+KN++ ++ + +L
Sbjct: 518 FLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRFVAFCNYYRRFIKNFADYSRHITRL 577
Query: 750 LKKN-SFQWTEGATQAFVKLKEVMTTVPVLVPPNFDKPFILETDASGKGLGAVLMQEGR- 807
KKN F+WT+ +AF+ LK + +L P+F K F + TDAS + GAVL Q
Sbjct: 578 CKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSKEFCITTDASKQACGAVLTQNHNG 637
Query: 808 ---PVAYMSKTLSDRAQAKSVYERELMAVVLAVQKWRHYLLGSKFVIHTDQRSLRFLADQ 864
PVAY S+ + KS E+EL A+ A+ +R Y+ G F + TD R L +L
Sbjct: 638 HQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYIYGKHFTVKTDHRPLTYLFSM 697
Query: 865 RIMGEEQQKWMSKLMGYDFEIKYKPGIENKAADALSR 901
+ + +L Y+F ++Y G +N ADALSR
Sbjct: 698 VNPSSKLTRIRLELEEYNFTVEYLKGKDNHVADALSR 734
Score = 112 bits (280), Expect = 7e-24
Identities = 102/419 (24%), Positives = 179/419 (42%), Gaps = 42/419 (10%)
Query: 881 YDFEIKYKPGIENKAADALSRKLQFSA----ISSVQCAEWADLEAEILEDERYRKVLQEL 936
YD Y GI + D ++L+ A IS ++ A W + + D+
Sbjct: 816 YDVGDLYTNGILD--LDQFLQRLELQAGIYDISQIKMAPWKKIFEHVSIDK--------F 865
Query: 937 ATQGNSAVGYQLKRGRLLYKDRIVLPKGSTKILTVLKEFHDTALGGHAGIFRTYKRISAL 996
GN + LK L +I K IL+ L + D GGH GI +T ++
Sbjct: 866 KNMGNKILK-NLKVALLNPVTQINNEKEKEAILSTLHD--DPIQGGHTGITKTLAKVKRH 922
Query: 997 FYWEGMKLDIQNYVQKCEVCQRNKYEALNPAGFLQPLPI---PSQGWTDISMDFIGGLPK 1053
+YW+ M I+ YV+KC+ CQ+ K P+ I P + + +D IG LPK
Sbjct: 923 YYWKNMSKYIKEYVRKCQKCQKAKTTKHTKT----PMTITETPEHAFDRVVVDTIGPLPK 978
Query: 1054 AM-GKDTILVVVDRFTKYAHFIALSHPYNAKEIAEVFIKEVVRLHGFPTSIVSDRDRVFL 1112
+ G + + ++ TKY I +++ +AK +A+ + + +G + ++D +
Sbjct: 979 SENGNEYAVTLICDLTKYLVAIPIANK-SAKTVAKAIFESFILKYGPMKTFITDMGTEYK 1037
Query: 1113 STFWSEMFKLAGTKLKFSSAYHPQTDGQTEVVNRCVETYLRCVTGSKPKQWPKWLSWAEF 1172
++ +++ K K S+A+H QT G E +R + Y+R + W WL + +
Sbjct: 1038 NSIITDLCKYLKIKNITSTAHHHQTVGVVERSHRTLNEYIRSYISTDKTDWDVWLQYFVY 1097
Query: 1173 WYNTNYHSAIKTTPFKALYGREP--PVIFKGNDSLTSVDEVEKLTAERNLILE----ELK 1226
+NT P++ ++GR P F S+ + ++ E LE +
Sbjct: 1098 CFNTTQSMVHNYCPYELVFGRTSNLPKHFNKLHSIEPIYNIDDYAKESKYRLEVAYARAR 1157
Query: 1227 SNLEKAQNRMRQQANKHRRDVQYEVGDLVYLKIQPYKLKSLAKRSNQKLSPRYYGPYPI 1285
LE + + ++ + +D++ EVGD V L+ KL +Y GPY I
Sbjct: 1158 KLLEAHKEKNKENYDLKVKDIELEVGDKVLLR----------NEVGHKLDFKYTGPYKI 1206
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 680
Score = 231 bits (588), Expect = 1e-59
Identities = 193/684 (28%), Positives = 328/684 (47%), Gaps = 81/684 (11%)
Query: 286 LTSNRSFKVKGKIGNR-----EVLILIDCGATSNFISQDLVVELEIPVIATSEYVVEVGN 340
+T+ S +KG++ + E+ +D GA S I+ V+ E V A +V++ +
Sbjct: 19 VTNPNSIYIKGRLYFKGYKKIELHCFVDTGA-SLCIASKFVIPEEHWVNAERPIMVKIAD 77
Query: 341 GAKERNSGVCKNLKLEVQGISIMQHFFILGLGGTEVVLGMDWLASLGNIEANFQELIIQW 400
G+ S VCK++ L + G+ G + ++G ++ L F + +I
Sbjct: 78 GSSITISKVCKDIDLIIVGVIFKIPTVYQQESGIDFIIGNNF-CQLYEPFIQFTDRVIFT 136
Query: 401 VSQGQKMVLQGEPSVCRVTA-----NWKSIKITEQQE-------------------AEGY 436
++ + + RV + K T+Q E +EG
Sbjct: 137 KNKSYPVHIAKLTRAVRVGTEGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSEGR 196
Query: 437 YLSYEY----QKEEEKTEAEVPEGMRKILEEYPEVFQEPKGLPPRRTTDH---AIQLQEG 489
LS E Q+ +KTE E + K+ E P L P +T +I+L +
Sbjct: 197 RLSEEKLFITQQRMQKTE----ELLEKVCSENP--------LDPNKTKQWMKASIKLSDP 244
Query: 490 ASIPNIRPYRYPFYQKNEIEKLVKEMLNSGIIRHSTSPFSSPAILVKKKD----GGWRFC 545
+ ++P +Y + E +K +KE+L+ +I+ S SP +PA LV + G R
Sbjct: 245 SKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAENGRGNKRMV 304
Query: 546 VDYRALNKATIPDKFPIPIIDELLDEIGAAVVFSKLDLKSGYHQIRMKEEDIPKTAFRTH 605
V+Y+A+NKAT+ D + +P DELL I +FS D KSG+ Q+ + +E P TAF
Sbjct: 305 VNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCP 364
Query: 606 EGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFDDILIYSKNEELHKDHLRIV 665
+GHYE+ V+PFGL APS FQ M++ R + RKF V+ DDI+++S NEE H H+ ++
Sbjct: 365 QGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDIVVFSNNEEDHLLHVAMI 423
Query: 666 LQVLKENNLVANQKKCSFGQPEIIYLGHVIS------QAGVAADPSKIKDMLDWPIPKEV 719
LQ ++ ++ ++KK + +I +LG I Q + +K D L+ +
Sbjct: 424 LQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLE-----DK 478
Query: 720 KGLRGFLGLTGYYRRFVKNYSKLAQPLNQLLKKN-SFQWTEGATQAFVKLKEVMTTVPVL 778
K L+ FLG+ Y ++ N +++ QPL LK+N ++WT+ T K+K+ + P L
Sbjct: 479 KQLQRFLGILTYASDYIPNLAQMRQPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPL 538
Query: 779 VPPNFDKPFILETDAS----GKGLGAVLMQEGRPV----AYMSKTLSDRAQAKSVYEREL 830
P ++ I+ETDAS G L A+ + EG Y S + + ++E
Sbjct: 539 HHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYRSGSFKAAERNYHSNDKET 598
Query: 831 MAVVLAVQKWRHYLLGSKFVIHTDQRSLRFLADQRIMGEEQQ----KWMSKLMGYDFEIK 886
+AV+ ++K+ YL F+I TD + + G+ + +W + L Y F+++
Sbjct: 599 LAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVE 658
Query: 887 YKPGIENKAADALSRKLQFSAISS 910
+ G +N AD LSR +F+ ++S
Sbjct: 659 HIKGTDNHFADFLSR--EFNKVNS 680
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 230 bits (587), Expect = 2e-59
Identities = 190/680 (27%), Positives = 330/680 (47%), Gaps = 73/680 (10%)
Query: 286 LTSNRSFKVKGKIGNR-----EVLILIDCGATSNFISQDLVVELEIPVIATSEYVVEVGN 340
+T+ S +KG++ + E+ +D GA S I+ V+ E V A +V++ +
Sbjct: 18 VTNPNSIYIKGRLYFKGYKKIELHCFVDTGA-SLCIASKFVIPEEHWVNAERPIMVKIAD 76
Query: 341 GAKERNSGVCKNLKLEVQGISIMQHFFILGLGGTEVVLGMDWLASLGNIEANFQELIIQW 400
G+ S VCK++ L + G G + ++G ++ L F + +I
Sbjct: 77 GSSITISKVCKDIDLIIAGEIFRIPTVYQQESGIDFIIGNNF-CQLYEPFIQFTDRVIFT 135
Query: 401 VSQGQKMVLQGEPSVCRVTA-----NWKSIKITEQQE-------------------AEGY 436
++ + + RV + K T+Q E +EG
Sbjct: 136 KNKSYPVHIAKLTRAVRVGTEGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSEGR 195
Query: 437 YLSYEYQKEEEKTEAEVPEGMRKILEEYPEVFQEPKGLPPRRTTDH---AIQLQEGASIP 493
LS E ++ ++ E + K+ E P L P +T +I+L + +
Sbjct: 196 RLSEEKLFITQQRMQKIEELLEKVCSENP--------LDPNKTKQWMKASIKLSDPSKAI 247
Query: 494 NIRPYRYPFYQKNEIEKLVKEMLNSGIIRHSTSPFSSPAILV----KKKDGGWRFCVDYR 549
++P +Y + E +K +KE+L+ +I+ S SP +PA LV +K+ G R V+Y+
Sbjct: 248 KVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYK 307
Query: 550 ALNKATIPDKFPIPIIDELLDEIGAAVVFSKLDLKSGYHQIRMKEEDIPKTAFRTHEGHY 609
A+NKAT+ D + +P DELL I +FS D KSG+ Q+ + +E P TAF +GHY
Sbjct: 308 AMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHY 367
Query: 610 EYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFDDILIYSKNEELHKDHLRIVLQVL 669
E+ V+PFGL APS FQ M++ R + RKF V+ DDIL++S NEE H H+ ++LQ
Sbjct: 368 EWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHLLHVAMILQKC 426
Query: 670 KENNLVANQKKCSFGQPEIIYLGHVIS------QAGVAADPSKIKDMLDWPIPKEVKGLR 723
++ ++ ++KK + +I +LG I Q + +K D L+ + K L+
Sbjct: 427 NQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLE-----DKKQLQ 481
Query: 724 GFLGLTGYYRRFVKNYSKLAQPLNQLLKKN-SFQWTEGATQAFVKLKEVMTTVPVLVPPN 782
FLG+ Y ++ +++ +PL LK+N ++WT+ T K+K+ + P L P
Sbjct: 482 RFLGILTYASDYIPKLAQIRKPLQAKLKENVPWRWTKEDTLYMQKVKKNLQGFPPLHHPL 541
Query: 783 FDKPFILETDAS----GKGLGAVLMQEGRPVAYMSKTLSD--RAQAKSVY--ERELMAVV 834
++ I+ETDAS G L A+ + EG + + S +A K+ + ++E +AV+
Sbjct: 542 PEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKNYHSNDKETLAVI 601
Query: 835 LAVQKWRHYLLGSKFVIHTDQRSLRFLADQRIMGEEQQ----KWMSKLMGYDFEIKYKPG 890
++K+ YL F+I TD + + G+ + +W + L Y F++++ G
Sbjct: 602 NTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKG 661
Query: 891 IENKAADALSRKLQFSAISS 910
+N AD LSR +F+ ++S
Sbjct: 662 TDNHFADFLSR--EFNKVNS 679
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 230 bits (587), Expect = 2e-59
Identities = 191/689 (27%), Positives = 329/689 (47%), Gaps = 73/689 (10%)
Query: 277 QLSLNSKEGLTSNRSFKVKGKIGNR-----EVLILIDCGATSNFISQDLVVELEIPVIAT 331
Q + +T+ S +KG++ + E+ +D GA S I+ V+ E V A
Sbjct: 9 QTQIEQVMNVTNPNSIYIKGRLYFKGYKKIELHCFVDTGA-SLCIASKFVIPEEHWVNAE 67
Query: 332 SEYVVEVGNGAKERNSGVCKNLKLEVQGISIMQHFFILGLGGTEVVLGMDWLASLGNIEA 391
+V++ +G+ S VCK++ L + G G + ++G ++ L
Sbjct: 68 RPIMVKIADGSSITISKVCKDIDLIIAGEIFKIPTVYQQESGIDFIIGNNF-CQLYEPFI 126
Query: 392 NFQELIIQWVSQGQKMVLQGEPSVCRV-----TANWKSIKITEQQE-------------- 432
F + +I ++ + + RV + K T+Q E
Sbjct: 127 QFTDRVIFTKNKSYPVHITKLTRAVRVGIEGFLESMKKRSKTQQPEPVNISTNKIENPLE 186
Query: 433 -----AEGYYLSYEYQKEEEKTEAEVPEGMRKILEEYPEVFQEPKGLPPRRTTDH---AI 484
+EG LS E ++ ++ E + K+ E P L P +T +I
Sbjct: 187 EIAILSEGRRLSEEKLFITQQRMQKIEELLEKVCSENP--------LDPNKTKQWMKASI 238
Query: 485 QLQEGASIPNIRPYRYPFYQKNEIEKLVKEMLNSGIIRHSTSPFSSPAILV----KKKDG 540
+L + + ++P +Y + E +K +KE+L+ +I+ S SP +PA LV +K+ G
Sbjct: 239 KLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRG 298
Query: 541 GWRFCVDYRALNKATIPDKFPIPIIDELLDEIGAAVVFSKLDLKSGYHQIRMKEEDIPKT 600
R V+Y+A+NKATI D + +P DELL I +FS D KSG+ Q+ + +E P T
Sbjct: 299 KKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLT 358
Query: 601 AFRTHEGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFDDILIYSKNEELHKD 660
AF +GHYE+ V+PFGL APS FQ M++ R + RKF V+ DDIL++S NEE H
Sbjct: 359 AFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHLL 417
Query: 661 HLRIVLQVLKENNLVANQKKCSFGQPEIIYLGHVIS------QAGVAADPSKIKDMLDWP 714
H+ ++LQ ++ ++ ++KK + +I +LG I Q + +K D L+
Sbjct: 418 HVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLE-- 475
Query: 715 IPKEVKGLRGFLGLTGYYRRFVKNYSKLAQPLNQLLKKN-SFQWTEGATQAFVKLKEVMT 773
+ K L+ FLG+ Y ++ +++ +PL LK+N ++WT+ T K+K+ +
Sbjct: 476 ---DKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQ 532
Query: 774 TVPVLVPPNFDKPFILETDAS----GKGLGAVLMQEGRPVAYMSKTLSDRAQAKS----V 825
P L P ++ I+ETDAS G L A+ + EG + + S +A
Sbjct: 533 GFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNYHS 592
Query: 826 YERELMAVVLAVQKWRHYLLGSKFVIHTDQRSLRFLADQRIMGEEQQ----KWMSKLMGY 881
++E +AV+ ++K+ YL F+I TD + + G+ + +W + L Y
Sbjct: 593 NDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHY 652
Query: 882 DFEIKYKPGIENKAADALSRKLQFSAISS 910
F++++ G +N AD LSR +F+ ++S
Sbjct: 653 SFDVEHIKGTDNHFADFLSR--EFNKVNS 679
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 674
Score = 229 bits (583), Expect = 5e-59
Identities = 193/701 (27%), Positives = 345/701 (48%), Gaps = 86/701 (12%)
Query: 268 EFVLEGKVLQLSLNSKEGLTSNRSFKVKGKIGNR-----EVLILIDCGATSNFISQDLVV 322
+ +L+ +Q +T+ S +KG++ + E+ +D GA S I+ V+
Sbjct: 2 DHLLQKTQIQNQTEQVMNITNPNSIYIKGRLYFKGYKKIELHCFVDTGA-SLCIASKFVI 60
Query: 323 ELEIPVIATSEYVVEVGNGAKERNSGVCKNLKLEVQGISIMQHFFILGLGGTEVVLGMDW 382
E + A +V++ +G+ + VC+++ L + G + F I + E G+D+
Sbjct: 61 PEEHWINAERPIMVKIADGSSITINKVCRDIDLIIAG----EIFHIPTVYQQES--GIDF 114
Query: 383 LASLGNIEANFQELIIQWVSQGQKMVLQGEPS----VCRVTA-----------NWKSIKI 427
+ +GN NF +L ++ +++ + + + ++T + K
Sbjct: 115 I--IGN---NFCQLYEPFIQFTDRVIFTKDRTYPVHIAKLTRAVRVGTEGFLESMKKRSK 169
Query: 428 TEQQE------------AEGYYLSYEYQKEEEKTEAEVPEGMRKILEEYPEVFQEPKGLP 475
T+Q E +EG LS E ++ ++ E + K+ E P L
Sbjct: 170 TQQPEPVNISTNKIAILSEGRRLSEEKLFITQQRMQKIEELLEKVCSENP--------LD 221
Query: 476 PRRTTDH---AIQLQEGASIPNIRPYRYPFYQKNEIEKLVKEMLNSGIIRHSTSPFSSPA 532
P +T +I+L + + ++P +Y + E +K +KE+L+ +I+ S SP +PA
Sbjct: 222 PNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPA 281
Query: 533 ILV----KKKDGGWRFCVDYRALNKATIPDKFPIPIIDELLDEIGAAVVFSKLDLKSGYH 588
LV +K+ G R V+Y+A+NKAT+ D + P DELL I +FS D KSG+
Sbjct: 282 FLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNPPNKDELLTLIRGKKIFSSFDCKSGFW 341
Query: 589 QIRMKEEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFDDI 648
Q+ + +E P TAF +GHYE+ V+PFGL APS FQ M++ R + RKF V+ DDI
Sbjct: 342 QVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDI 400
Query: 649 LIYSKNEELHKDHLRIVLQVLKENNLVANQKKCSFGQPEIIYLGHVIS------QAGVAA 702
L++S NEE H H+ ++LQ ++ ++ ++KK + +I +LG I Q +
Sbjct: 401 LVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILE 460
Query: 703 DPSKIKDMLDWPIPKEVKGLRGFLGLTGYYRRFVKNYSKLAQPLNQLLKKN-SFQWTEGA 761
+K D L+ + K L+ FLG+ Y ++ +++ +PL LK+N ++WT+
Sbjct: 461 HINKFPDTLE-----DKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKED 515
Query: 762 TQAFVKLKEVMTTVPVLVPPNFDKPFILETDAS----GKGLGAVLMQEGRPVAYMSKTLS 817
T K+K+ + P L P ++ I+ETDAS G L A+ + EG + + S
Sbjct: 516 TLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYAS 575
Query: 818 D--RAQAKSVY--ERELMAVVLAVQKWRHYLLGSKFVIHTDQRSLRFLADQRIMGEEQQ- 872
+A K+ + ++E +AV+ ++K+ YL F+I TD + + G+ +
Sbjct: 576 GSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLG 635
Query: 873 ---KWMSKLMGYDFEIKYKPGIENKAADALSRKLQFSAISS 910
+W + L Y F++++ G +N AD LSR +F+ ++S
Sbjct: 636 RNIRWQAWLSHYSFDVEHIKGTDNHFADFLSR--EFNRVNS 674
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 227 bits (578), Expect = 2e-58
Identities = 192/682 (28%), Positives = 334/682 (48%), Gaps = 77/682 (11%)
Query: 286 LTSNRSFKVKGKIGNR-----EVLILIDCGATSNFISQDLVVELEIPVIATSEYVVEVGN 340
+T+ S +KG++ + E+ +D GA S I+ V+ E V A +V++ +
Sbjct: 18 VTNPNSIYIKGRLYFKGYKKIELHCFVDTGA-SLCIASKFVIPEEHWVNAERPIMVKIAD 76
Query: 341 GAKERNSGVCKNLKLEVQGISIMQHFFILGLGGTEVVLGMDWLASLGNIEANFQELIIQW 400
G+ S VCK++ L I + F I + E G+D++ +GN NF +L +
Sbjct: 77 GSSITISKVCKDIDL----IIAREIFKIPTVYQQES--GIDFI--IGN---NFCQLYEPF 125
Query: 401 VSQGQKMVLQGEPSV-CRVTANWKSIKI------------TEQQEAEGYYLSYEYQKEEE 447
+ +++ S + +++++ ++ Q+ E +S +
Sbjct: 126 IQFTDRVIFTKNKSYPVHIAKLTRAVRVGTEGFLESMKKRSKTQQPEPVNISTNKIENPL 185
Query: 448 KTEAEVPEGMR-------------KILEEYPEVFQEPKGLPPRRTTDH---AIQLQEGAS 491
K A + EG R + +EE E L P +T +I+L + +
Sbjct: 186 KEIAILSEGRRLSEEKLFITQQRMQKIEELLEKVCSENPLDPNKTKQWMKASIKLSDPSK 245
Query: 492 IPNIRPYRYPFYQKNEIEKLVKEMLNSGIIRHSTSPFSSPAILV----KKKDGGWRFCVD 547
++P +Y + E +K +KE+L+ +I+ S SP +PA LV +K+ G R V+
Sbjct: 246 AIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVN 305
Query: 548 YRALNKATIPDKFPIPIIDELLDEIGAAVVFSKLDLKSGYHQIRMKEEDIPKTAFRTHEG 607
Y+A+NKATI D + +P DELL I +FS D KSG+ Q+ + +E P TAF +G
Sbjct: 306 YKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQG 365
Query: 608 HYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFDDILIYSKNEELHKDHLRIVLQ 667
HYE+ V+PFGL APS FQ M++ R + RKF V+ DDIL++S NEE H H+ ++LQ
Sbjct: 366 HYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHLLHVAMILQ 424
Query: 668 VLKENNLVANQKKCSFGQPEIIYLGHVIS------QAGVAADPSKIKDMLDWPIPKEVKG 721
++ ++ ++KK + +I +LG I Q + +K D L+ + K
Sbjct: 425 KCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLE-----DKKQ 479
Query: 722 LRGFLGLTGYYRRFVKNYSKLAQPLNQLLKKN-SFQWTEGATQAFVKLKEVMTTVPVLVP 780
L+ FLG+ Y ++ +++ +PL LK+N ++WT+ T K+K+ + P L
Sbjct: 480 LQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHH 539
Query: 781 PNFDKPFILETDAS----GKGLGAVLMQEGRPVAYMSKTLSDRAQAKS----VYERELMA 832
P ++ I+ETDAS G L A+ + EG + + S +A ++E +A
Sbjct: 540 PLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNYHSNDKETLA 599
Query: 833 VVLAVQKWRHYLLGSKFVIHTDQRSLRFLADQRIMGEEQQ----KWMSKLMGYDFEIKYK 888
V+ ++K+ YL F+I TD + + G+ + +W + L Y F++++
Sbjct: 600 VINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHI 659
Query: 889 PGIENKAADALSRKLQFSAISS 910
G +N AD LSR +F+ ++S
Sbjct: 660 KGTDNHFADFLSR--EFNKVNS 679
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 666
Score = 216 bits (551), Expect = 3e-55
Identities = 182/673 (27%), Positives = 328/673 (48%), Gaps = 47/673 (6%)
Query: 260 IFEEAEDGEFVLEGKVLQLSLNSKEGLTSNRSFKVKGKIG-----NREVLILIDCGATSN 314
+F E E G F L K L LN +T+ S ++GK+ + + +D GA+
Sbjct: 6 LFREGELGHFCLN-KQEMLHLN----VTNPNSIYIEGKLSFEGYKSFNIHCYVDTGASLC 60
Query: 315 FISQDLVVELEIPVIATSEYVVEVGNGAKERNSGVCKNLKLEVQGISIMQHFFILGLGGT 374
S+ ++ E E+ + + V++ N + + VCKNLK++ G S F I +
Sbjct: 61 IASRYIIPE-ELWENSPKDIQVKIANQELIKITKVCKNLKVKFAGKS----FEIPTVYQQ 115
Query: 375 EVVLGMDWLASLGNIEANFQELIIQWVSQ-----GQKMVLQGEPSVCRVTANWKSI---- 425
E G+D+L +GN IQW + +MVL + + +N +
Sbjct: 116 ET--GIDFL--IGNNFCRLYNPFIQWEDRIAFHLKNEMVLIKKVTKAFSVSNPSFLENMK 171
Query: 426 KITEQQEAEGYYLSYEYQKEEEKTEAEVPEGMRKILEEYPEVFQEPKGLP--PRRTTDHA 483
K ++ ++ G +S EE+ + E +KI + +V E P ++ +
Sbjct: 172 KDSKTEQIPGTNISKNIINPEERYFL-ITEKYQKIEQLLDKVCSENPIDPIKSKQWMKAS 230
Query: 484 IQLQEGASIPNIRPYRYPFYQKNEIEKLVKEMLNSGIIRHSTSPFSSPAILVK----KKD 539
I+L + + ++P Y + K +KE+L+ G+I S S SPA LV+ ++
Sbjct: 231 IKLIDPLKVIRVKPMSYSPQDREGFAKQIKELLDLGLIIPSKSQHMSPAFLVENEAERRR 290
Query: 540 GGWRFCVDYRALNKATIPDKFPIPIIDELLDEIGAAVVFSKLDLKSGYHQIRMKEEDIPK 599
G R V+Y+A+N+ATI D +P + ELL + +FS D KSG+ Q+ + EE
Sbjct: 291 GKKRMVVNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKSGFWQVVLDEESQKL 350
Query: 600 TAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFDDILIYSKNEELHK 659
TAF +GH+++ V+PFGL APS FQ M L KF +V+ DDI+++S +E H
Sbjct: 351 TAFTCPQGHFQWKVVPFGLKQAPSIFQRHMQTALNG-ADKFCMVYVDDIIVFSNSELDHY 409
Query: 660 DHLRIVLQVLKENNLVANQKKCSFGQPEIIYLGHVISQAGVAADPSKIKDMLDWPIP-KE 718
+H+ VL+++++ ++ ++KK + + +I +LG I + ++++ +P ++
Sbjct: 410 NHVYAVLKIVEKYGIILSKKKANLFKEKINFLGLEIDKGTHCPQNHILENIHKFPDRLED 469
Query: 719 VKGLRGFLGLTGYYRRFVKNYSKLAQPLNQLLKKN-SFQWTEGATQAFVKLKEVMTTVPV 777
K L+ FLG+ Y ++ +++ +PL LKK+ ++ WT+ + K+K+ + + P
Sbjct: 470 KKHLQRFLGVLTYAETYIPKLAEIRKPLQVKLKKDVTWNWTQSDSDYVKKIKKNLGSFPK 529
Query: 778 LVPPNFDKPFILETDASGKGLGAVLMQEGRP-----VAYMSKTLSDRAQAKSVYERELMA 832
L P + I+ETDAS G VL Y S + + ++EL+A
Sbjct: 530 LYLPKPEDHLIIETDASDSFWGGVLKARALDGVELICRYSSGSFKQAEKNYHSNDKELLA 589
Query: 833 VVLAVQKWRHYLLGSKFVIHTDQRSLRFLADQRIMGEEQQ----KWMSKLMGYDFEIKYK 888
V + K+ YL +F + TD ++ + + G+ +Q +W + Y F++++
Sbjct: 590 VKQVITKFSAYLTPVRFTVRTDNKNFTYFLRINLKGDSKQGRLVRWQNWFSKYQFDVEHL 649
Query: 889 PGIENKAADALSR 901
G++N AD L+R
Sbjct: 650 EGVKNVLADCLTR 662
>POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;
Protease (EC 3.4.23.-); Reverse transcriptase (EC
2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1886
Score = 193 bits (491), Expect = 2e-48
Identities = 133/421 (31%), Positives = 210/421 (49%), Gaps = 34/421 (8%)
Query: 515 MLNSGIIRHSTSPFSSPAILV-----------KKKDGGWRFCVDYRALNKATIPDKFPIP 563
+L +IR S S S A +V K+K G R +Y+ LN+ T D++ +P
Sbjct: 1425 LLQMKVIRPSESKHRSTAFIVRSGTEIDPITGKEKKGKERMVFNYKLLNENTESDQYSLP 1484
Query: 564 IIDELLDEIGAAVVFSKLDLKSGYHQIRMKEEDIPKTAFRTHEGHYEYLVLPFGLTNAPS 623
I+ ++ ++G + ++SK DLKSG+ Q+ M+EE +P TAF YE+LV+PFGL NAP+
Sbjct: 1485 GINTIISKVGRSKIYSKFDLKSGFWQVAMEEESVPWTAFLAGNKLYEWLVMPFGLKNAPA 1544
Query: 624 TFQALMNQVLRPYLRKFVLVFFDDILIYSKNEELHKDHLRIVLQVLKENNLVANQKKCSF 683
FQ M+ V + KF+ V+ DDIL++S+ E H HL +LQ+ KEN L+ + K
Sbjct: 1545 IFQRKMDNVFKG-TEKFIAVYIDDILVFSETAEQHSQHLYTMLQLCKENGLILSPTKMKI 1603
Query: 684 GQPEIIYLGHVISQAGVAADPSKIKDMLDWPIPK--EVKGLRGFLGLTGYYRRFVKNYSK 741
G PEI +LG + + P I + D+ K +G+R +LG+ Y R ++++ K
Sbjct: 1604 GTPEIDFLGASLGCTKIKLQPHIISKICDFSDEKLATPEGMRSWLGILSYARNYIQDIGK 1663
Query: 742 LAQPLNQLLKKNSFQWTEGATQAFVK-LKEVMTTVPVLVPPNFDKPFILETDASGKGLGA 800
L QPL Q + + T V+ +KE + +P L P D I+ETD G GA
Sbjct: 1664 LVQPLRQKMAPTGDKRMNPETWKMVRQIKEKVKNLPDLQLPPKDSFIIIETDGCMTGWGA 1723
Query: 801 VL---------MQEGRPVAYMSKTLSDRAQAKSVYERELMAVVLAVQKWRHYLLGSK-FV 850
V R AY S + + KS + E+ A + + K++ Y L K +
Sbjct: 1724 VCKWKMSKHDPRSTERICAYASGSFN---PIKSTIDAEIQAAIHGLDKFKIYYLDKKELI 1780
Query: 851 IHTD-QRSLRFLADQRIMGEEQQKWMS-----KLMGYDFEIKYKPGIENKAADALSRKLQ 904
I +D + ++F + +W++ +G ++ G N ADALSR +
Sbjct: 1781 IRSDCEAIIKFYNKTNENKPSRVRWLTFSDFLTGLGITVTFEHIDGKHNGLADALSRMIN 1840
Query: 905 F 905
F
Sbjct: 1841 F 1841
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 659
Score = 190 bits (483), Expect = 2e-47
Identities = 160/638 (25%), Positives = 293/638 (45%), Gaps = 37/638 (5%)
Query: 293 KVKGKIGNREVLILIDCGATSNFISQDLVVELEIPVIATSEYVVEVGNGAKERNSGVCKN 352
K G N ++ +D G++ S+ ++ E E A +++ NG + + VC
Sbjct: 19 KFPGYQTNLDLHCYVDTGSSLCMASKYVIPE-EYWQTAEKPLNIKIANGKIIQLTKVCSK 77
Query: 353 LKLEVQGISIMQHFFILGLGGTEVVLGMDWLASLGNI---------EANFQELIIQWVSQ 403
L + + G + G +++LG ++ N Q +II +++
Sbjct: 78 LPIRLGGERFLIPTLFQQESGIDLLLGNNFCQLYSPFIQYTDRIYFHLNKQSVIIGKITK 137
Query: 404 ----GQKMVLQGEPSVCRVTANWKSIKITEQQEAEGYYLSYEYQKEEEKTEAEVPEGMRK 459
G K L+ +V + I IT Q + E ++ E+
Sbjct: 138 AYQYGVKGFLESMKKKSKVNRP-EPINITSNQ----HLFLEEGGNHVDEMLYEIQISKFS 192
Query: 460 ILEEYPEVFQEPKGLPPRRTTDH---AIQLQEGASIPNIRPYRYPFYQKNEIEKLVKEML 516
+EE E + P ++ I+L + ++ ++P Y + E ++ +KE+L
Sbjct: 193 AIEEMLERVSSENPIDPEKSKQWMTATIELIDPKTVVKVKPMSYSPSDREEFDRQIKELL 252
Query: 517 NSGIIRHSTSPFSSPAILVK----KKDGGWRFCVDYRALNKATIPDKFPIPIIDELLDEI 572
+I+ S S SPA LV+ ++ G R V+Y+A+NKAT D +P DELL +
Sbjct: 253 ELKVIKPSKSTHMSPAFLVENEAERRRGKKRMVVNYKAMNKATKGDAHNLPNKDELLTLV 312
Query: 573 GAAVVFSKLDLKSGYHQIRMKEEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQV 632
++S D KSG Q+ + +E TAF +GHY++ V+PFGL APS F
Sbjct: 313 RGKKIYSSFDCKSGLWQVLLDKESQLLTAFTCPQGHYQWNVVPFGLKQAPSIFPKTYANS 372
Query: 633 LRPYLRKFVLVFFDDILIYSK-NEELHKDHLRIVLQVLKENNLVANQKKCSFGQPEIIYL 691
K+ V+ DDIL++S + H H+ +L+ ++ ++ ++KK + +I +L
Sbjct: 373 HSNQYSKYCCVYVDDILVFSNTGRKEHYIHVLNILRRCEKLGIILSKKKAQLFKEKINFL 432
Query: 692 GHVISQAGVAADPSKIKDMLDWPIP-KEVKGLRGFLGLTGYYRRFVKNYSKLAQPLNQLL 750
G I Q ++ + +P ++ K L+ FLG+ Y ++ + + +PL L
Sbjct: 433 GLEIDQGTHCPQNHILEHIHKFPDRIEDKKQLQRFLGILTYASDYIPKLASIRKPLQSKL 492
Query: 751 KKNS-FQWTEGATQAFVKLKEVMTTVPVLVPPNFDKPFILETDASGKGLGAVLMQEGRPV 809
K++S + W + +Q K+K+ + + P L P + ++ETDAS + G +L
Sbjct: 493 KEDSTWTWNDTDSQYMAKIKKNLKSFPKLYHPEPNDKLVIETDASEEFWGGILKAIHNSH 552
Query: 810 AYMSKTLSDRAQAKS----VYERELMAVVLAVQKWRHYLLGSKFVIHTDQRSLRFLADQR 865
Y+ + S +A E+EL+AV+ ++K+ YL S+F+I TD ++ +
Sbjct: 553 EYICRYASGSFKAAERNYHSNEKELLAVIRVIKKFSIYLTPSRFLIRTDNKNFTHFVNIN 612
Query: 866 IMGEEQQ----KWMSKLMGYDFEIKYKPGIENKAADAL 899
+ G+ +Q +W L YDF++++ G +N AD L
Sbjct: 613 LKGDRKQGRLVRWQMWLSQYDFDVEHIAGTKNVFADFL 650
>POL_SOCMV (P15629) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 692
Score = 170 bits (431), Expect = 2e-41
Identities = 177/689 (25%), Positives = 296/689 (42%), Gaps = 105/689 (15%)
Query: 294 VKGKIGNREVLILIDCGATSNFISQDLVVELEIPVIATSEYVVEVGNGAKERNSGVCKNL 353
+K IG R L ID GAT F + + EI + + + +K N+
Sbjct: 22 IKVSIGKRNFLAYIDTGATLCFGKRKISNNWEI---LKQPKEIIIADKSKHYIREAISNV 78
Query: 354 KLEVQGISIMQHFFILGLGGTEVVLGMDWLASLGNIEANFQELIIQWVSQGQKMVLQGEP 413
L+++ + L G ++++G ++L + + ++W + Q
Sbjct: 79 FLKIENKEFLIPIIYLHDSGLDLIIGNNFLKLYQPFIQRLETIELRWKNLNNPKESQ--- 135
Query: 414 SVCRVTANWKSIKITEQQEAEGYYLSYEYQK-----EEEKTEAEVPEGMRKILEEYP--E 466
S KI + E L ++K E+ + E + ++ E+P E
Sbjct: 136 --------MISTKILTKNEV----LKLSFEKIHICLEKYLFFKTIEEQLEEVCSEHPLDE 183
Query: 467 VFQEPKGLPPRRTTDHAIQLQEGASIPNIRPYRYPFYQKNEIEKLVKEMLNSGIIRHSTS 526
+ L R D ++ IP Y E ++ +++L G+IR S S
Sbjct: 184 TKNKNGLLIEIRLKDPLQEINVTNRIP------YTIRDVQEFKEECEDLLKKGLIRESQS 237
Query: 527 PFSSPAILVKK----KDGGWRFCVDYRALNKATIPDKFPIPIIDELLDEIGAAVVFSKLD 582
P S+PA V+ K G R ++Y+ +N+ATI D + +P D +L++I ++ FS LD
Sbjct: 238 PHSAPAFYVENHNEIKRGKRRMVINYKKMNEATIGDSYKLPRKDFILEKIKGSLWFSSLD 297
Query: 583 LKSGYHQIRMKEEDIPKTAFRTH-EGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFV 641
KSGY+Q+R+ E P TAF + HYE+ VL FGL APS +Q M+Q L+ L
Sbjct: 298 AKSGYYQLRLHENTKPLTAFSCPPQKHYEWNVLSFGLKQAPSIYQRFMDQSLKG-LEHIC 356
Query: 642 LVFFDDILIYSK-NEELHKDHLRIVLQVLKENNLVANQKKCSFGQPEIIYLGHVISQAG- 699
L + DDILI++K ++E H + +RIVLQ +KE ++ ++KK Q EI YLG I G
Sbjct: 357 LAYIDDILIFTKGSKEQHVNDVRIVLQRIKEKGIIISKKKSKLIQQEIEYLGLKIQGNGE 416
Query: 700 VAADPSKIKDMLDWPIP-KEVKGLRGFLGLTGYYRR--FVKNYSKLAQPLNQLLK-KNSF 755
+ P + +L +P ++ K ++ FLG Y F KN + + L + + KN +
Sbjct: 417 IDLSPHTQEKILQFPDELEDRKQIQRFLGCINYIANEGFFKNLALERKHLQKKISVKNPW 476
Query: 756 QWTEGATQAFVKLKEVMTTVPVLVPPNFDKPFILETDAS-----------GKGLGAVLMQ 804
+W T+ +K + ++P L + I+ETDAS KG + +
Sbjct: 477 KWDTIDTKMVQSIKGKIQSLPKLYNASIQDFLIVETDASQHSWSGCLRALPKGKQKIGLD 536
Query: 805 E-GRPVAYM----SKTLSDRAQAK------------------------------------ 823
E G P A + S SD + A+
Sbjct: 537 EFGIPTADLCTGSSSASSDNSPAEIDKCHSASKQDTHVASKIKKLENELLLCKYVSGTFT 596
Query: 824 ------SVYERELMAVVLAVQKWRHYLLGSKFVIHTDQRSLRFLADQRIMGEEQQ----K 873
+ E E++A V ++KWR LL ++F++ TD + I + + +
Sbjct: 597 DTETRYPIAELEVLAGVKVLEKWRIDLLQTRFLLRTDSKYFAGFCRYNIKTDYRNGRLIR 656
Query: 874 WMSKLMGYDFEIKYKPGIENKAADALSRK 902
W +L Y ++ N AD L+R+
Sbjct: 657 WQLRLQAYQPYVELIKSENNPFADTLTRE 685
>M860_ARATH (P92523) Hypothetical mitochondrial protein AtMg00860
(ORF158)
Length = 158
Score = 158 bits (400), Expect = 9e-38
Identities = 75/131 (57%), Positives = 98/131 (74%), Gaps = 2/131 (1%)
Query: 660 DHLRIVLQVLKENNLVANQKKCSFGQPEIIYLGH--VISQAGVAADPSKIKDMLDWPIPK 717
+HL +VLQ+ +++ AN+KKC+FGQP+I YLGH +IS GV+ADP+K++ M+ WP PK
Sbjct: 2 NHLGMVLQIWEQHQFYANRKKCAFGQPQIAYLGHRHIISGEGVSADPAKLEAMVGWPEPK 61
Query: 718 EVKGLRGFLGLTGYYRRFVKNYSKLAQPLNQLLKKNSFQWTEGATQAFVKLKEVMTTVPV 777
LRGFLGLTGYYRRFVKNY K+ +PL +LLKKNS +WTE A AF LK +TT+PV
Sbjct: 62 NTTELRGFLGLTGYYRRFVKNYGKIVRPLTELLKKNSLKWTEMAALAFKALKGAVTTLPV 121
Query: 778 LVPPNFDKPFI 788
L P+ PF+
Sbjct: 122 LALPDLKLPFV 132
>POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat
protein; Protease (EC 3.4.23.-); Reverse transcriptase
(EC 2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1675
Score = 154 bits (390), Expect = 1e-36
Identities = 127/414 (30%), Positives = 200/414 (47%), Gaps = 36/414 (8%)
Query: 505 KNEIEKLVKEMLNSGII-------RHSTSPF----SSPAILVKKKDGGWRFCVDYRALNK 553
K EK +KE+L++ +I RH T+ F S + K R +Y+ LN
Sbjct: 1196 KEVFEKQIKELLDNKLIKKADPTCRHRTAAFIVRNHSEEVAQKP-----RIVYNYKRLND 1250
Query: 554 ATIPDKFPIPIIDELLDEIGAAVVFSKLDLKSGYHQIRMKEEDIPKTAFRTHEGHYEYLV 613
D F IP +++ I A +FSK DLK+G+H +++K++ T F EG Y + V
Sbjct: 1251 NMHTDPFNIPHKISMINLIQKANIFSKFDLKAGFHHMKLKDDFKDWTTFTCSEGLYTWNV 1310
Query: 614 LPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFDDILIYSKNEELHKDHLRIVLQVLKENN 673
PFG+ NAP FQ M + KF L++ DDILI S NE+ H +HL+I +KE
Sbjct: 1311 CPFGIANAPCAFQRFMQESFGDL--KFALLYIDDILIASNNEKEHIEHLKIFFNRVKEVG 1368
Query: 674 LVANQKKCSFGQPEIIYLGHVISQAGVAADP---SKIKDMLDWPIPKEVKGLRGFLGLTG 730
V ++KK E+ YLG I + ++ P KIK D +KGL+ +LGL
Sbjct: 1369 CVLSKKKSKMFLKEVEYLGVEIKEGKISLQPHIVDKIK-KFDKNKLNTLKGLQAYLGLLN 1427
Query: 731 YYRRFVKNYSKLAQPLNQLLKKNSFQ-WTEGATQAFVKLKEVMTTVPVLVPPNFDKPFIL 789
Y R ++K+ SKL PL + KN + + + K++ ++ + L P I+
Sbjct: 1428 YARGYIKDLSKLVGPLYKKTGKNGQRIFNKEDWNIIFKIEREVSKIKPLERPKETDYIII 1487
Query: 790 ETDASGKGLGAVLM---------QEGRPVAYMSKTLSDRAQAKSVYERELMAVVLAVQKW 840
ETDAS +G GAVL+ + Y S ++ S+ + E+ A+ A+ K+
Sbjct: 1488 ETDASEEGWGAVLVCKPDKYSGKDTEKIAGYASGNFGEKKTWTSL-DYEIEAINEALNKF 1546
Query: 841 RHYLLGSKFVIHTD-QRSLRFLADQRIMGEEQQKWMSKLMGYDFEIKYKPGIEN 893
+ Y L F I TD + ++ + + + +W+ KL + YKP E+
Sbjct: 1547 QIY-LDKDFTIRTDCEAIVKGIKTEDYKKRSKTRWI-KLRDNLLKDGYKPTFEH 1598
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda      K        H
 0.318    0.136    0.403
Gapped
Lambda      K        H
 0.267    0.0410   0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 168,806,323
Number of Sequences: 164201
Number of extensions: 7616895
Number of successful extensions: 25179
Number of sequences better than 10.0: 161
Number of HSP's better than 10.0 without gapping: 103
Number of HSP's successfully gapped in prelim test: 59
Number of HSP's that attempted gapping in prelim test: 24610
Number of HSP's gapped (non-prelim): 294
length of query: 1393
length of database: 59,974,054
effective HSP length: 123
effective length of query: 1270
effective length of database: 39,777,331
effective search space: 50517210370
effective search space used: 50517210370
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 72 (32.3 bits)
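The footer statistics above can be cross-checked against the per-hit scores. The sketch below assumes the standard Karlin-Altschul relations: bit score = (Lambda * raw - ln K) / ln 2, and E = effective search space * 2^(-bit score). It also assumes the effective lengths are obtained by subtracting the effective HSP length once from the query and once per database sequence. All constants are copied from this report; the function names are illustrative and are not part of BLAST. Because the footer prints Lambda and K rounded, the recomputed bit score and E-value agree with the report only approximately.

```python
import math

# Values copied from the report footer.
QUERY_LEN = 1393          # length of query
DB_LETTERS = 59_974_054   # length of database
DB_SEQS = 164_201         # Number of Sequences
HSP_LEN = 123             # effective HSP length
GAPPED_LAMBDA = 0.267     # gapped Lambda (rounded in the report)
GAPPED_K = 0.0410         # gapped K (rounded in the report)


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    eff_q, eff_db, space = effective_search_space(
        QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN
    )
    print("effective length of query:   ", eff_q)
    print("effective length of database:", eff_db)
    print("effective search space:      ", space)

    # POL_CAMVN hit: Score = 231 bits (588), Expect = 1e-59
    bits = bit_score(588, GAPPED_LAMBDA, GAPPED_K)
    print(f"bit score for raw score 588:  {bits:.1f}")
    print(f"approximate E-value:          {expect_value(bits, space):.1e}")
```

The three effective-length figures are integer arithmetic and reproduce the footer exactly. The bit score and E-value for the POL_CAMVN hit come out close to the reported 231 bits and 1e-59, which is as close as the rounded Lambda and K allow.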
Lotus: description of TM0029a.5