
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0106.8
(1558 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 494 e-139
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 491 e-138
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 490 e-137
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 363 2e-99
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 358 6e-98
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 346 2e-94
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 330 2e-89
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 325 6e-88
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 267 1e-70
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 230 3e-59
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 229 3e-59
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 229 3e-59
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 229 6e-59
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 229 6e-59
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot... 214 1e-54
POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;... 189 7e-47
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot... 188 1e-46
POL_SOCMV (P15629) Enzymatic polyprotein [Contains: Aspartic pro... 168 9e-41
M860_ARATH (P92523) Hypothetical mitochondrial protein AtMg00860... 159 7e-38
POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat pr... 153 3e-36
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 494 bits (1272), Expect = e-139
Identities = 336/1037 (32%), Positives = 527/1037 (50%), Gaps = 76/1037 (7%)
Query: 439 LIDCGATINFISQDLVVELEIPVIATSEYVVEVGNGAKERNSGVCKNLKLEVQGIPIMQH 498
LID GA N I+++ V ++P S+ V+ G + N K L + + GI I
Sbjct: 269 LIDTGAQANIITEETVRAHKLPTRPWSKSVIYGGVYPNKINRKTIK-LNISLNGISIKTE 327
Query: 499 FFILGLGGTEVVLGMDWLASLGNIEANFQELIIQWVSQGQKMVLQGEPSVCKVAANWKSI 558
F ++ + L NIE + + + ++ K
Sbjct: 328 FLVVKKFSHPAAISFTTLYD-NNIEISSSKHTLSQMN--------------------KVS 366
Query: 559 KITEQQEAEGYYLSYEYQKEEEKTEAEVPEGMRKILEEYPEVFQEPKGLPPRRTTDHAIQ 618
I ++ E Y ++ E TE ++P+ + K LE E+ QE LP
Sbjct: 367 NIVKEPELPDIYKEFKDITAETNTE-KLPKPI-KGLEFEVELTQENYRLP---------- 414
Query: 619 LQEGASIPNIRPYRYPFYQKNEIEKLVKEMLNSGIIRHSTSPFSSPAILVKKKDGGWRFC 678
IR Y P + + + + L SGIIR S + + P + V KK+G R
Sbjct: 415 ---------IRNYPLPPGKMQAMNDEINQGLKSGIIRESKAINACPVMFVPKKEGTLRMV 465
Query: 679 VDYRAINKATIPDKFPIPIIDELLDEIGAAVVFSKLDLKSGYHQIRMKEEDIPKTAFRTH 738
VDY+ +NK P+ +P+P+I++LL +I + +F+KLDLKS YH IR+++ D K AFR
Sbjct: 466 VDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCP 525
Query: 739 EGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFYDILIYSKNEELHKDHLRIV 798
G +EYLV+P+G++ AP+ FQ +N +L V+ + DILI+SK+E H H++ V
Sbjct: 526 RGVFEYLVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDV 585
Query: 799 LQVLKENNLVANQKKCSFGQPEIIYLGHVISQAGVAADPSKIKDMLDWPIPKEVKGLRGF 858
LQ LK NL+ NQ KC F Q ++ ++G+ IS+ G I +L W PK K LR F
Sbjct: 586 LQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQF 645
Query: 859 LGLTGYYRRFVKNYSKLAQPLNQLLKKN-SFQWTEEATQAFVKLKEVMTTVPVLVPPNFD 917
LG Y R+F+ S+L PLN LLKK+ ++WT TQA +K+ + + PVL +F
Sbjct: 646 LGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFS 705
Query: 918 KPFILETDASGKGLGAVLMQEG-----RPVAYMSKTLSDRAQAKSVYERELMAVVLAVQK 972
K +LETDAS +GAVL Q+ PV Y S +S SV ++E++A++ +++
Sbjct: 706 KKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKH 765
Query: 973 WRHYLLGS--QFVIHTDQRSL--RFLADQRIMGEEQQKWMSKLMGYDFEIKYKPGIENKA 1028
WRHYL + F I TD R+L R + + +W L ++FEI Y+PG N
Sbjct: 766 WRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHI 825
Query: 1029 ADALSRKL----------QFSAISSV-QCAEWADLEAEILGDERYRKVLQELATQGNSAI 1077
ADALSR + + ++I+ V Q + D + +++ + L L + +
Sbjct: 826 ADALSRIVDETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRV 885
Query: 1078 --GYQLKRGRLL-YKDRIVLPKGSTKILTVLKEFHDTALGGHAGIFRTYKRISALFYWEG 1134
QLK G L+ KD+I+LP + T++K++H+ H GI I F W+G
Sbjct: 886 EENIQLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKG 945
Query: 1135 MKLDIQNYVQKCEVCQRNKYEALNPAGFLQPLPIPSQGWTDISMDFIGGLPKAMGKDTIL 1194
++ IQ YVQ C CQ NK P G LQP+P + W +SMDFI LP++ G + +
Sbjct: 946 IRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALF 1005
Query: 1195 VVVDRFTKYAHFIALSHPYNAKEIAEVFIKEVVKLHGFPTSIVSDRDRVFLSTFWSEMFK 1254
VVVDRF+K A + + A++ A +F + V+ G P I++D D +F S W +
Sbjct: 1006 VVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAH 1065
Query: 1255 LAGTKLKFSSAYHPQTDGQTEVVNRCVETYLRCVTGSKPKQWPKWLSWAEFWYNTNYHSA 1314
+KFS Y PQTDGQTE N+ VE LRCV + P W +S + YN HSA
Sbjct: 1066 KYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSA 1125
Query: 1315 IKTTPFKALYGRESPVIFKGNDSLTSVDEVEKWTAERNLILEELKSNLEKAQNRMRQQAN 1374
+ TPF+ ++ R SP + + + D+ ++ + E + + +K +L +M++ +
Sbjct: 1126 TQMTPFEIVH-RYSPAL-SPLELPSFSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFD 1183
Query: 1375 KHRRDV-QYEVGDLVYLKIQPYKLKSLAKRSNQKLSPRYYGPYPIIAKINPAAYKLQLPE 1433
+++ +++ GDLV +K + K+ + KL+P + GP+ ++ K P Y+L LP+
Sbjct: 1184 MKIQEIEEFQPGDLVMVK----RTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPD 1239
Query: 1434 G--SQVHPVFHISLLKK 1448
FH+S L+K
Sbjct: 1240 SIKHMFSSTFHVSHLEK 1256
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 491 bits (1265), Expect = e-138
Identities = 335/1037 (32%), Positives = 527/1037 (50%), Gaps = 76/1037 (7%)
Query: 439 LIDCGATINFISQDLVVELEIPVIATSEYVVEVGNGAKERNSGVCKNLKLEVQGIPIMQH 498
LID GA N I+++ V ++P S+ V+ G + N K L + + GI I
Sbjct: 269 LIDTGAQANIITEETVRAHKLPTRPWSKSVIYGGVYPNKINRKTIK-LNISLNGISIKTE 327
Query: 499 FFILGLGGTEVVLGMDWLASLGNIEANFQELIIQWVSQGQKMVLQGEPSVCKVAANWKSI 558
F ++ + L NIE + + + ++ K
Sbjct: 328 FLVVKKFSHPAAISFTTLYD-NNIEISSSKHTLSQMN--------------------KVS 366
Query: 559 KITEQQEAEGYYLSYEYQKEEEKTEAEVPEGMRKILEEYPEVFQEPKGLPPRRTTDHAIQ 618
I ++ E Y ++ E TE ++P+ + K LE E+ QE LP
Sbjct: 367 NIVKEPELPDIYKEFKDITAETNTE-KLPKPI-KGLEFEVELTQENYRLP---------- 414
Query: 619 LQEGASIPNIRPYRYPFYQKNEIEKLVKEMLNSGIIRHSTSPFSSPAILVKKKDGGWRFC 678
IR Y P + + + + L SGIIR S + + P + V KK+G R
Sbjct: 415 ---------IRNYPLPPGKMQAMNDEINQGLKSGIIRESKAINACPVMFVPKKEGTLRMV 465
Query: 679 VDYRAINKATIPDKFPIPIIDELLDEIGAAVVFSKLDLKSGYHQIRMKEEDIPKTAFRTH 738
VDY+ +NK P+ +P+P+I++LL +I + +F+KLDLKS YH IR+++ D K AFR
Sbjct: 466 VDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCP 525
Query: 739 EGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFYDILIYSKNEELHKDHLRIV 798
G +EYLV+P+G++ AP+ FQ +N +L V+ + +ILI+SK+E H H++ V
Sbjct: 526 RGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDV 585
Query: 799 LQVLKENNLVANQKKCSFGQPEIIYLGHVISQAGVAADPSKIKDMLDWPIPKEVKGLRGF 858
LQ LK NL+ NQ KC F Q ++ ++G+ IS+ G I +L W PK K LR F
Sbjct: 586 LQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQF 645
Query: 859 LGLTGYYRRFVKNYSKLAQPLNQLLKKN-SFQWTEEATQAFVKLKEVMTTVPVLVPPNFD 917
LG Y R+F+ S+L PLN LLKK+ ++WT TQA +K+ + + PVL +F
Sbjct: 646 LGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFS 705
Query: 918 KPFILETDASGKGLGAVLMQEG-----RPVAYMSKTLSDRAQAKSVYERELMAVVLAVQK 972
K +LETDAS +GAVL Q+ PV Y S +S SV ++E++A++ +++
Sbjct: 706 KKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKH 765
Query: 973 WRHYLLGS--QFVIHTDQRSL--RFLADQRIMGEEQQKWMSKLMGYDFEIKYKPGIENKA 1028
WRHYL + F I TD R+L R + + +W L ++FEI Y+PG N
Sbjct: 766 WRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHI 825
Query: 1029 ADALSRKL----------QFSAISSV-QCAEWADLEAEILGDERYRKVLQELATQGNSAI 1077
ADALSR + + ++I+ V Q + D + +++ + L L + +
Sbjct: 826 ADALSRIVDETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRV 885
Query: 1078 --GYQLKRGRLL-YKDRIVLPKGSTKILTVLKEFHDTALGGHAGIFRTYKRISALFYWEG 1134
QLK G L+ KD+I+LP + T++K++H+ H GI I F W+G
Sbjct: 886 EENIQLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKG 945
Query: 1135 MKLDIQNYVQKCEVCQRNKYEALNPAGFLQPLPIPSQGWTDISMDFIGGLPKAMGKDTIL 1194
++ IQ YVQ C CQ NK P G LQP+P + W +SMDFI LP++ G + +
Sbjct: 946 IRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALF 1005
Query: 1195 VVVDRFTKYAHFIALSHPYNAKEIAEVFIKEVVKLHGFPTSIVSDRDRVFLSTFWSEMFK 1254
VVVDRF+K A + + A++ A +F + V+ G P I++D D +F S W +
Sbjct: 1006 VVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAH 1065
Query: 1255 LAGTKLKFSSAYHPQTDGQTEVVNRCVETYLRCVTGSKPKQWPKWLSWAEFWYNTNYHSA 1314
+KFS Y PQTDGQTE N+ VE LRCV + P W +S + YN HSA
Sbjct: 1066 KYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSA 1125
Query: 1315 IKTTPFKALYGRESPVIFKGNDSLTSVDEVEKWTAERNLILEELKSNLEKAQNRMRQQAN 1374
+ TPF+ ++ R SP + + + D+ ++ + E + + +K +L +M++ +
Sbjct: 1126 TQMTPFEIVH-RYSPAL-SPLELPSFSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFD 1183
Query: 1375 KHRRDV-QYEVGDLVYLKIQPYKLKSLAKRSNQKLSPRYYGPYPIIAKINPAAYKLQLPE 1433
+++ +++ GDLV +K + K+ + KL+P + GP+ ++ K P Y+L LP+
Sbjct: 1184 MKIQEIEEFQPGDLVMVK----RTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPD 1239
Query: 1434 G--SQVHPVFHISLLKK 1448
FH+S L+K
Sbjct: 1240 SIKHMFSSTFHVSHLEK 1256
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 490 bits (1261), Expect = e-137
Identities = 334/1037 (32%), Positives = 526/1037 (50%), Gaps = 76/1037 (7%)
Query: 439 LIDCGATINFISQDLVVELEIPVIATSEYVVEVGNGAKERNSGVCKNLKLEVQGIPIMQH 498
LID G N I+++ V ++P S+ V+ G + N K L + + GI I
Sbjct: 269 LIDTGTQANIITEETVRAHKLPTRPWSKSVIYGGVYPNKINRKTIK-LNISLNGISIKTE 327
Query: 499 FFILGLGGTEVVLGMDWLASLGNIEANFQELIIQWVSQGQKMVLQGEPSVCKVAANWKSI 558
F ++ + L NIE + + + ++ K
Sbjct: 328 FLVVKKFSHPAAISFTTLYD-NNIEISSSKHTLSQMN--------------------KVS 366
Query: 559 KITEQQEAEGYYLSYEYQKEEEKTEAEVPEGMRKILEEYPEVFQEPKGLPPRRTTDHAIQ 618
I ++ E Y ++ E TE ++P+ + K LE E+ QE LP
Sbjct: 367 NIVKEPELPDIYKEFKDITAETNTE-KLPKPI-KGLEFEVELTQENYRLP---------- 414
Query: 619 LQEGASIPNIRPYRYPFYQKNEIEKLVKEMLNSGIIRHSTSPFSSPAILVKKKDGGWRFC 678
IR Y P + + + + L SGIIR S + + P + V KK+G R
Sbjct: 415 ---------IRNYPLPPGKMQAMNDEINQGLKSGIIRESKAINACPVMFVPKKEGTLRMV 465
Query: 679 VDYRAINKATIPDKFPIPIIDELLDEIGAAVVFSKLDLKSGYHQIRMKEEDIPKTAFRTH 738
VDY+ +NK P+ +P+P+I++LL +I + +F+KLDLKS YH IR+++ D K AFR
Sbjct: 466 VDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCP 525
Query: 739 EGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFYDILIYSKNEELHKDHLRIV 798
G +EYLV+P+G++ AP+ FQ +N +L V+ + +ILI+SK+E H H++ V
Sbjct: 526 RGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDV 585
Query: 799 LQVLKENNLVANQKKCSFGQPEIIYLGHVISQAGVAADPSKIKDMLDWPIPKEVKGLRGF 858
LQ LK NL+ NQ KC F Q ++ ++G+ IS+ G I +L W PK K LR F
Sbjct: 586 LQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQF 645
Query: 859 LGLTGYYRRFVKNYSKLAQPLNQLLKKN-SFQWTEEATQAFVKLKEVMTTVPVLVPPNFD 917
LG Y R+F+ S+L PLN LLKK+ ++WT TQA +K+ + + PVL +F
Sbjct: 646 LGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFS 705
Query: 918 KPFILETDASGKGLGAVLMQEG-----RPVAYMSKTLSDRAQAKSVYERELMAVVLAVQK 972
K +LETDAS +GAVL Q+ PV Y S +S SV ++E++A++ +++
Sbjct: 706 KKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKH 765
Query: 973 WRHYLLGS--QFVIHTDQRSL--RFLADQRIMGEEQQKWMSKLMGYDFEIKYKPGIENKA 1028
WRHYL + F I TD R+L R + + +W L ++FEI Y+PG N
Sbjct: 766 WRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHI 825
Query: 1029 ADALSRKL----------QFSAISSV-QCAEWADLEAEILGDERYRKVLQELATQGNSAI 1077
ADALSR + + ++I+ V Q + D + +++ + L L + +
Sbjct: 826 ADALSRIVDETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRV 885
Query: 1078 --GYQLKRGRLL-YKDRIVLPKGSTKILTVLKEFHDTALGGHAGIFRTYKRISALFYWEG 1134
QLK G L+ KD+I+LP + T++K++H+ H GI I F W+G
Sbjct: 886 EENIQLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKG 945
Query: 1135 MKLDIQNYVQKCEVCQRNKYEALNPAGFLQPLPIPSQGWTDISMDFIGGLPKAMGKDTIL 1194
++ IQ YVQ C CQ NK P G LQP+P + W +SMDFI LP++ G + +
Sbjct: 946 IRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALF 1005
Query: 1195 VVVDRFTKYAHFIALSHPYNAKEIAEVFIKEVVKLHGFPTSIVSDRDRVFLSTFWSEMFK 1254
VVVDRF+K A + + A++ A +F + V+ G P I++D D +F S W +
Sbjct: 1006 VVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAH 1065
Query: 1255 LAGTKLKFSSAYHPQTDGQTEVVNRCVETYLRCVTGSKPKQWPKWLSWAEFWYNTNYHSA 1314
+KFS Y PQTDGQTE N+ VE LRCV + P W +S + YN HSA
Sbjct: 1066 KYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSA 1125
Query: 1315 IKTTPFKALYGRESPVIFKGNDSLTSVDEVEKWTAERNLILEELKSNLEKAQNRMRQQAN 1374
+ TPF+ ++ R SP + + + D+ ++ + E + + +K +L +M++ +
Sbjct: 1126 TQMTPFEIVH-RYSPAL-SPLELPSFSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFD 1183
Query: 1375 KHRRDV-QYEVGDLVYLKIQPYKLKSLAKRSNQKLSPRYYGPYPIIAKINPAAYKLQLPE 1433
+++ +++ GDLV +K + K+ + KL+P + GP+ ++ K P Y+L LP+
Sbjct: 1184 MKIQEIEEFQPGDLVMVK----RTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPD 1239
Query: 1434 G--SQVHPVFHISLLKK 1448
FH+S L+K
Sbjct: 1240 SIKHMFSSTFHVSHLEK 1256
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 363 bits (933), Expect = 2e-99
Identities = 192/443 (43%), Positives = 280/443 (62%), Gaps = 12/443 (2%)
Query: 601 FQEPKGLPPRRTTDHAIQLQEGASIPNIRPYRYPFYQKNEIE--KLVKEMLNSGIIRHST 658
++E + L T H + + I + +YP Q +EIE V+EMLN G+IR S
Sbjct: 183 YKEGEKLTFTNTIKHVLNTTHNSPIYS---KQYPLAQTHEIEVENQVQEMLNQGLIRESN 239
Query: 659 SPFSSPAILVKKKDGG-----WRFCVDYRAINKATIPDKFPIPIIDELLDEIGAAVVFSK 713
SP++SP +V KK +R +DYR +N+ TIPD++PIP +DE+L ++G F+
Sbjct: 240 SPYNSPTWVVPKKPDASGANKYRVVIDYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTT 299
Query: 714 LDLKSGYHQIRMKEEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKF 773
+DL G+HQI M EE I KTAF T GHYEYL +PFGL NAP+TFQ MN +LRP L K
Sbjct: 300 IDLAKGFHQIEMDEESISKTAFSTKSGHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKH 359
Query: 774 VLVFFYDILIYSKNEELHKDHLRIVLQVLKENNLVANQKKCSFGQPEIIYLGHVISQAGV 833
LV+ DI+I+S + H + +++V L + NL KC F + E +LGH+++ G+
Sbjct: 360 CLVYLDDIIIFSTSLTEHLNSIQLVFTKLADANLKLQLDKCEFLKKEANFLGHIVTPDGI 419
Query: 834 AADPSKIKDMLDWPIPKEVKGLRGFLGLTGYYRRFVKNYSKLAQPLNQLLKKNSFQWTE- 892
+P K+K ++ +PIP + K +R FLGLTGYYR+F+ NY+ +A+P+ LKK + T+
Sbjct: 420 KPNPIKVKAIVSYPIPTKDKEIRAFLGLTGYYRKFIPNYADIAKPMTSCLKKRTKIDTQK 479
Query: 893 -EATQAFVKLKEVMTTVPVLVPPNFDKPFILETDASGKGLGAVLMQEGRPVAYMSKTLSD 951
E +AF KLK ++ P+L P+F+K F+L TDAS LGAVL Q G P++++S+TL+D
Sbjct: 480 LEYIEAFEKLKALIIRDPILQLPDFEKKFVLTTDASNLALGAVLSQNGHPISFISRTLND 539
Query: 952 RAQAKSVYERELMAVVLAVQKWRHYLLGSQFVIHTDQRSLRFLADQRIMGEEQQKWMSKL 1011
S E+EL+A+V A + +RHYLLG QF+I +D + LR+L + + G + ++W +L
Sbjct: 540 HELNYSAIEKELLAIVWATKTFRHYLLGRQFLIASDHQPLRWLHNLKEPGAKLERWRVRL 599
Query: 1012 MGYDFEIKYKPGIENKAADALSR 1034
Y F+I Y G EN ADALSR
Sbjct: 600 SEYQFKIDYIKGKENSVADALSR 622
Score = 35.0 bits (79), Expect = 1.6
Identities = 62/314 (19%), Positives = 118/314 (36%), Gaps = 23/314 (7%)
Query: 1104 VLKEFHDTALGGHAGIFRTYKRISALFYWEGMKLDIQNYVQKCEVCQRNKYEALNPAGFL 1163
++ + H+ L H GI + K ++ +L IQN + +C +C K E N L
Sbjct: 753 IILQSHEKLL--HPGIQKMTKLFKENHFFPNSQLLIQNIINECNICNLAKTEHRNTKMPL 810
Query: 1164 QPLPIPSQGWTDISMDFIGGLPKAMGKDTILVVVDRFTKYAHFIALSHPYNAKEIAEV-- 1221
+ P P F+ + + GK I +D ++K+A K+ E
Sbjct: 811 KITPNPEH----CREKFVVDIYSSEGKHYI-SCIDIYSKFATL----EQIKTKDWIECRN 861
Query: 1222 FIKEVVKLHGFPTSIVSDRDRVFLSTFWSEMFKLAGTKLKFSSAYHPQTDGQTEVVNRCV 1281
+ + G P + +DRD F S + +L+ ++A + D E +++ +
Sbjct: 862 ALMRIFNQLGKPKLLKADRDGAFSSLALKRWLEEEEVELQLNTAKNGVAD--VERLHKTI 919
Query: 1282 ETYLRCVTGSKPKQWPKWLSWAE---FWYNTNY-HSAIKTTPFKALYGRESPVIFKGNDS 1337
+R + S ++ LS E + YN H P + P++
Sbjct: 920 NEKIRIINSSDDEEVK--LSKIETILYTYNQKIKHDTTGQRPAQIFLYAGHPILDTQKIK 977
Query: 1338 LTSVDEVEKWTAERNLILEELKSNLEKA--QNRMRQQANKHRRDVQYEVGDLVYLKIQPY 1395
++++ + E N+ K L+K +N + N + D + Y
Sbjct: 978 EKKIEKINEDRREFNIDTNYRKGPLQKGKLENPFKPTKNVEQTDPDHYKITNRNRVTHYY 1037
Query: 1396 KLKSLAKRSNQKLS 1409
K + ++ N KLS
Sbjct: 1038 KTQFKKQKKNNKLS 1051
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 358 bits (919), Expect = 6e-98
Identities = 223/623 (35%), Positives = 347/623 (54%), Gaps = 37/623 (5%)
Query: 439 LIDCGATINFISQDLVVELEIPVIATSEYVVEVGNGAKERNSGVCKNLKLEVQGIPIMQH 498
LID G+T+N S+++ ++P+ TS ++ NG N + K+ P
Sbjct: 28 LIDTGSTVNMTSKNI---FDLPIQNTSTFI-HTSNGPLIVNKSIIIPSKIL---FPTTNE 80
Query: 499 FFILGLGGT-EVVLGMDWLASL-GNIEANFQEL--------IIQWVSQGQKMVLQGEPSV 548
F + +++LG LA I QE+ +I+ ++ ++ Q +
Sbjct: 81 FLLHPFSENYDLLLGRKLLAEAKATISYRDQEVTLYNNKYKLIEGIATHEQSHFQNVNMI 140
Query: 549 CKVAANWKSIKITEQQEAEGYYLSYEYQKEEEKTEAEVPEGMRKILEEYPEV-FQEPKGL 607
+ KI+ E++ Y L + +E+++ A +L++Y ++ + E L
Sbjct: 141 PDTMLRQPN-KISPILESDLYRLEHLNNEEKQRLCA--------LLQKYHDIQYHEGDKL 191
Query: 608 PPRRTTDHAIQLQEGASIPNIRPYRYPFYQKNEIEKLVKEMLNSGIIRHSTSPFSSPAIL 667
T H I + ++P Y YP + E+E +++MLN GIIR S SP++SP +
Sbjct: 192 TFTNQTKHTINTKH--NLPLYSKYSYPQAYEQEVESQIQDMLNQGIIRTSNSPYNSPIWV 249
Query: 668 VKKKDGG-----WRFCVDYRAINKATIPDKFPIPIIDELLDEIGAAVVFSKLDLKSGYHQ 722
V KK +R +DYR +N+ T+ D+ PIP +DE+L ++G F+ +DL G+HQ
Sbjct: 250 VPKKQDASGKQKFRIVIDYRKLNEITVGDRHPIPNMDEILGKLGRCNYFTTIDLAKGFHQ 309
Query: 723 IRMKEEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFYDIL 782
I M E + KTAF T GHYEYL +PFGL NAP+TFQ MN +LRP L K LV+ DI+
Sbjct: 310 IEMDPESVSKTAFSTKHGHYEYLRMPFGLKNAPATFQRCMNDILRPLLNKHCLVYLDDII 369
Query: 783 IYSKNEELHKDHLRIVLQVLKENNLVANQKKCSFGQPEIIYLGHVISQAGVAADPSKIKD 842
++S + + H L +V + L + NL KC F + E +LGHV++ G+ +P KI+
Sbjct: 370 VFSTSLDEHLQSLGLVFEKLAKANLKLQLDKCEFLKQETTFLGHVLTPDGIKPNPEKIEA 429
Query: 843 MLDWPIPKEVKGLRGFLGLTGYYRRFVKNYSKLAQPLNQLLKKNSFQWT--EEATQAFVK 900
+ +PIP + K ++ FLGLTGYYR+F+ N++ +A+P+ + LKKN T E AF K
Sbjct: 430 IQKYPIPTKPKEIKAFLGLTGYYRKFIPNFADIAKPMTKCLKKNMKIDTTNPEYDSAFKK 489
Query: 901 LKEVMTTVPVLVPPNFDKPFILETDASGKGLGAVLMQEGRPVAYMSKTLSDRAQAKSVYE 960
LK +++ P+L P+F K F L TDAS LGAVL Q+G P++Y+S+TL++ S E
Sbjct: 490 LKYLISEDPILKVPDFTKKFTLTTDASDVALGAVLSQDGHPLSYISRTLNEHEINYSTIE 549
Query: 961 RELMAVVLAVQKWRHYLLGSQFVIHTDQRSLRFLADQRIMGEEQQKWMSKLMGYDFEIKY 1020
+EL+A+V A + +RHYLLG F I +D + L +L + + +W KL +DF+IKY
Sbjct: 550 KELLAIVWATKTFRHYLLGRHFEISSDHQPLSWLYRMKDPNSKLTRWRVKLSEFDFDIKY 609
Query: 1021 KPGIENKAADALSR-KLQFSAIS 1042
G EN ADALSR KL+ + +S
Sbjct: 610 IKGKENCVADALSRIKLEETYLS 632
Score = 35.8 bits (81), Expect = 0.94
Identities = 60/322 (18%), Positives = 119/322 (36%), Gaps = 24/322 (7%)
Query: 1100 KILTVLKEFHDTALGGHA-----GIFRTYKRISALFYWEGMKLDIQNYVQKCEVCQRNKY 1154
K +T EF + L H GI +T K +Y+ +L IQN + +C +C K
Sbjct: 742 KNITTYAEFKELILTAHEKLLHPGIQKTTKLFGETYYFPNSQLLIQNIINECSICNLAKT 801
Query: 1155 EALNPAGFLQPLPIPSQGWTDISMDFIGGLPKAMGKDTILVVVDRFTKYAHFIALSHPYN 1214
E N + P P +D + + GK + +D ++K+A
Sbjct: 802 EHRNTDMPTKTTPKPEHCREKFMID----IYSSEGKHYV-SCIDIYSKFATL----EEIK 852
Query: 1215 AKEIAEV--FIKEVVKLHGFPTSIVSDRDRVFLSTFWSEMFKLAGTKLKFSSAYHPQTDG 1272
K+ E + + G P + +DRD F S + +L+ ++ D
Sbjct: 853 TKDWIECKNALMRIFNQLGKPKLLKADRDGAFSSLALKRWLESEEVELQLNTTKTGVAD- 911
Query: 1273 QTEVVNRCVETYLRCVTGSKPKQ--WPKWLSWAEFWYNTNYHSAIKTTPFKALYGRESPV 1330
E +++ + +R + S ++ K + + + H TP P+
Sbjct: 912 -IERLHKTINEKIRIIKTSDDEETKLSKMETVLNIYNHKTKHDTTGQTPAHIFLYAGQPI 970
Query: 1331 IFKGNDSLTSVDEVEKWTAERNLILEELKSNLEKA--QNRMRQQANKHRRDV-QYEVGDL 1387
+ + ++++ E + K L+K +N + N + D Y++ +
Sbjct: 971 LDTQQNKENKINKINNDRVEYEVDTRYRKGPLQKGKLENPFKPTKNVEQTDSDHYKITNR 1030
Query: 1388 VYLKIQPYKLKSLAKRSNQKLS 1409
+ YK + ++ N +LS
Sbjct: 1031 NRI-THYYKTQFKKRKKNNQLS 1051
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 346 bits (888), Expect = 2e-94
Identities = 267/972 (27%), Positives = 463/972 (47%), Gaps = 102/972 (10%)
Query: 541 VLQGEPSVCKVAANWKSIKITEQQEA-EGYYLSYEYQKEEEKTEAEVPEGMRKILEEYPE 599
VL ++ + S ++ + E E + E+ K E + ++ + ++E++ +
Sbjct: 860 VLNDPTLFSEIETDTNSCEVVKTAETYERFTTICEHLKRENGDDRKIWD----VIEQFQD 915
Query: 600 VFQ-EPKGLPPRRTTDHAIQLQEGASIPNIRPYRYPFYQKNEIEKLVKEMLNSGIIRHST 658
VF L T+ I+L+EGA +P P K EI K++++MLN +IR S
Sbjct: 916 VFAISDDELGRNSGTECVIELKEGAEPIRQKPRPIPLALKPEIRKMIQKMLNQKVIRESK 975
Query: 659 SPFSSPAILVKKKDGGWRFCVDYRAINKATIPDKFPIPIIDELLDEIGAAVVFSKLDLKS 718
SP+SSP +LVKKKDG R C+DYR +NK + P+P I+ L + +++ D+ +
Sbjct: 976 SPWSSPVVLVKKKDGSIRMCIDYRKVNKVVKNNAHPLPNIEATLQSLAGKKLYTVFDMIA 1035
Query: 719 GYHQIRMKEEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFF 778
G+ QI + E+ TAF +E+ VLPFGL +P+ FQ M +++ L V+
Sbjct: 1036 GFWQIPLDEKSKEITAFAIGSELFEWNVLPFGLVISPALFQGTMEEIIGDLLGVCAFVYV 1095
Query: 779 YDILIYSKNEELHKDHLRIVLQVLKENNLVANQKKCSFGQPEIIYLGHVISQAGVAADPS 838
D+LI SK+ E H ++ L ++++ + KC + E+ YLGH ++ GV
Sbjct: 1096 DDLLIASKDMEQHLQDVKEALTRIRKSGMKLRASKCHIAKKEVEYLGHKVTLDGVETQEV 1155
Query: 839 KIKDMLDWPIPKEVKGLRGFLGLTGYYRRFVKNYSKLAQPLNQLLK-KNSFQWTEEATQA 897
K M + P VK L+ FLGL GYYR+F+ N++++A L L+ K ++ W +E A
Sbjct: 1156 KTDKMKQFSRPTNVKELQSFLGLVGYYRKFILNFAQIASSLTSLISAKVAWIWEKEQEIA 1215
Query: 898 FVKLKEVMTTVPVLVPPNF------DKPFILETDASGKGLGAVLMQEG-----RPVAYMS 946
F +LK+++ PVL P+ D+PF++ TDAS KG+GAVL QEG P+A+ S
Sbjct: 1216 FQELKKLVCQTPVLAQPDVEAALKGDRPFMIYTDASRKGIGAVLAQEGPDGQQHPIAFAS 1275
Query: 947 KTLSDRAQAKSVYERELMAVVLAVQKWRHYLLGSQFVIHTDQRSLRFLADQRIMGEEQQK 1006
K LS + + E +A++ A+++++ + G+ + TD + L L + + +
Sbjct: 1276 KALSPAETRYHITDLEALAMMFALRRFKTIIYGTAITVFTDHKPLISLLKGSPLADRLWR 1335
Query: 1007 WMSKLMGYDFEIKYKPGIENKAADALSR-------------KLQFSAISSVQCAEWAD-- 1051
W +++ +D +I Y G N ADALSR K S ++++Q E D
Sbjct: 1336 WSIEILEFDVKIVYLAGKANAVADALSRGGCPPNELEEEQTKELTSIVNAIQ-TELPDIL 1394
Query: 1052 -----LEAEILGDERYRKVL------------------QELATQGNSAIGYQLKRGRLLY 1088
LE DE +++V+ E++ + +G LK +
Sbjct: 1395 DSSCWLERLKGEDEGWKEVIAALEGGKTKGTFKIVGIESEISLEYYKIVGGVLKNTEIEE 1454
Query: 1089 KDRIVLPKGSTKILT-VLKEFHDTALGGHAGIFRTYKRISALFYWEGMKLDIQNYVQKCE 1147
+ R V+P+ KI T +LKE H+ L GH GI + ++ + FYW M++ ++N V+ C
Sbjct: 1455 QSRSVVPE---KIRTPLLKELHEGMLAGHFGIKKMWRMVHRKFYWPQMRVCVENCVRTCA 1511
Query: 1148 VC-----QRNKYEALNPAGFLQPLPIPSQGWTDISMDFIGGLPKAMGKDTILVVVDRFTK 1202
C +L P PL I + D+ + G IL ++D FTK
Sbjct: 1512 KCLCANDHSKLTSSLTPYRMTFPLEIVACDLMDVGL-------SVQGNRYILTIIDLFTK 1564
Query: 1203 YAHFIALSHPYNAKEIAEVFIKEVVKLHG-FPTSIVSDRDRVFLSTFWSEMFKLAGTKLK 1261
Y + + A+ + + F++ G P +++D+ + F++ +++ + +
Sbjct: 1565 YGTAVPIPDK-KAETVLKAFVERWAIGEGRIPLKLLTDQGKEFVNGLFAQFTHMLKIEHI 1623
Query: 1262 FSSAYHPQTDGQTEVVNRCVETYLRCVTGSKPKQWPKWLSWAEFWYNTNYHSAIKTTPFK 1321
+ Y+ + +G E N+ + ++ T + P +W + +A + YN H TP
Sbjct: 1624 TTKGYNSRANGAVERFNKTIMHIMKKKT-AVPMEWDDQVVYAVYAYNNCVHENTGETPMF 1682
Query: 1322 ALYGRE--SPVIFKGNDSLTSVDEVEKWTAERNLILEELKSNLEKAQNRMRQQAN-KHRR 1378
++GR+ P+ G D++ ++ + + L E LK ++ MR+Q + K
Sbjct: 1683 LMHGRDVMGPLEMSGEDAV-GINYADMDEYKHLLTQELLKVQKIAKEHAMREQESYKSLF 1741
Query: 1379 DVQY--------EVGDLVYLKIQPYKLKSLAKRSNQKLSPRYYGPYPII------AKINP 1424
D +Y + G V L+I KL + KL ++ GPY +I A+I P
Sbjct: 1742 DQKYASKKHRFPQPGSRVLLEIPSEKLGAQC----PKLVNKWSGPYRVISCSENSAEITP 1797
Query: 1425 AAYK----LQLP 1432
K LQ+P
Sbjct: 1798 VLGKRKHILQIP 1809
Score = 35.4 bits (80), Expect = 1.2
Identities = 34/121 (28%), Positives = 52/121 (42%), Gaps = 20/121 (16%)
Query: 359 CFKCGDKWGKEHICSMKNYQLILMEVEEDEEEEEIFEEAEDGEFVLEGKVLQLSLNSKEG 418
CF+C + C KN E E+E + E E V L + + K
Sbjct: 591 CFRCNEMGHIAWNCPKKN--------ENTSEKEAPVAKVETIEGVRMKDCLLMVKSEKSE 642
Query: 419 LTSNRSFKVKGKIGNREVLILIDCGATINFISQDLVVELEIPVIATSEYVVEVGNGAKER 478
RS + KG+IG V IL+D GA+I+ +S++ T E +VEV + E+
Sbjct: 643 SEVTRSLE-KGQIGKANVEILLDSGASISLMSKN-----------TWEKIVEVNGKSWEQ 690
Query: 479 N 479
+
Sbjct: 691 D 691
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 330 bits (846), Expect = 2e-89
Identities = 288/1077 (26%), Positives = 482/1077 (44%), Gaps = 177/1077 (16%)
Query: 427 VKGKIGNREVLILIDCGATINFISQDLVVELEIPVIATSEYVVEVGNGAKE-RNSGVCKN 485
++ ++ R + +LID A N+I V EL+ + S + V +G+ E ++ + K
Sbjct: 15 IERRLAGRTLKMLIDTDAAKNYIRP--VKELKNVMPVASPFSVSSIHGSTEIKHKCLMKV 72
Query: 486 LKLEVQGIPIMQHFFILGLGGTEVVLGMDWLASLGNIEANFQELIIQWVSQGQKMVLQGE 545
K I F + L + ++G+D L G ++ N E +++ +K+
Sbjct: 73 FK------HISPFFLLDSLNAFDAIIGLDLLTQAG-VKLNLAEDSLEYQGIAEKLHYFSC 125
Query: 546 PSVCKVAANWKSIKITEQQEAEGYYLSYEYQKEEEKTEAEVPEGMRKILEEYPEVFQEPK 605
PSV N + VP+ ++K ++ + + K
Sbjct: 126 PSVNFTDVN----------------------------DIVVPDSVKKEFKD--TIIRRKK 155
Query: 606 GLPPRRTTDHAIQLQEGASIPNIRPYRYPFYQK---------NEIEKLVKEMLNSGIIRH 656
TT+ A+ + P Y + + + VK++L GIIR
Sbjct: 156 AFS---TTNEALPFNTAVTATIRTVDNEPVYSRAYPTLMGVSDFVNNEVKQLLKDGIIRP 212
Query: 657 STSPFSSPAILVKKK------DGGWRFCVDYRAINKATIPDKFPIPIIDELLDEIGAAVV 710
S SP++SP +V KK + R +D+R +N+ TIPD++P+P I +L +G A
Sbjct: 213 SRSPYNSPTWVVDKKGTDAFGNPNKRLVIDFRKLNEKTIPDRYPMPSIPMILANLGKAKF 272
Query: 711 FSKLDLKSGYHQIRMKEEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQVLRPYL 770
F+ LDLKSGYHQI + E D KT+F + G YE+ LPFGL NA S FQ ++ VLR +
Sbjct: 273 FTTLDLKSGYHQIYLAEHDREKTSFSVNGGKYEFCRLPFGLRNASSIFQRALDDVLREQI 332
Query: 771 RKFVLVFFYDILIYSKNEELHKDHLRIVLQVLKENNLVANQKKCSFGQPEIIYLGHVISQ 830
K V+ D++I+S+NE H H+ VL+ L + N+ +Q+K F + + YLG ++S+
Sbjct: 333 GKICYVYVDDVIIFSENESDHVRHIDTVLKCLIDANMRVSQEKTRFFKESVEYLGFIVSK 392
Query: 831 AGVAADPSKIKDMLDWPIPKEVKGLRGFLGLTGYYRRFVKNYSKLAQPLNQLL------- 883
G +DP K+K + ++P P V +R FLGL YYR F+K+++ +A+P+ +L
Sbjct: 393 DGTKSDPEKVKAIQEYPEPDCVYKVRSFLGLASYYRVFIKDFAAIARPITDILKGENGSV 452
Query: 884 -----KKNSFQWTEEATQAFVKLKEVMTTVPVLVP-PNFDKPFILETDASGKGLGAVLMQ 937
KK ++ E AF +L+ ++ + V++ P+F KPF L TDAS G+GAVL Q
Sbjct: 453 SKHMSKKIPVEFNETQRNAFQRLRNILASEDVILKYPDFKKPFDLTTDASASGIGAVLSQ 512
Query: 938 EGRPVAYMSKTLSDRAQAKSVYERELMAVVLAVQKWRHYLLGSQFV-IHTDQRSLRFLAD 996
EGRP+ +S+TL Q + EREL+A+V A+ K +++L GS+ + I TD + L F
Sbjct: 513 EGRPITMISRTLKQPEQNYATNERELLAIVWALGKLQNFLYGSREINIFTDHQPLTFAVA 572
Query: 997 QRIMGEEQQKWMSKLMGYDFEIKYKPGIENKAADALSRKLQFSAISSVQCAEWADLEAEI 1056
R + ++W S + ++ ++ YKPG EN ADALSR+ +A+ + ++ A + +E+
Sbjct: 573 DRNTNAKIKRWKSYIDQHNAKVFYKPGKENFVADALSRQ-NLNALQNEPQSDAATIHSEL 631
Query: 1057 LGDERYRKVLQELATQGN----SAIGYQLKRGRLLYKDR---IVLPKGSTKILTVLKEF- 1108
+ L N A + LKR +L++ + ++ + +L LKE
Sbjct: 632 SLTYTVETTDKPLNCFRNQIILEAARFPLKRNLVLFRSKSRHLISFTDKSWLLKTLKEVV 691
Query: 1109 -------------------HDTALGGHAGIFR---------------------------- 1121
HD A FR
Sbjct: 692 NPDVVNAIHCDLPTLASFQHDLIAHFPATQFRHCKNVVLDITDKNEQIEIVTAEHNRAHR 751
Query: 1122 ----TYKRISALFYWEGMKLDIQNYVQKCEVCQRNKYEALNPAGFLQPLPIPSQGWTDIS 1177
K++ +Y+ M + V C VC + KY+ L PIPS +
Sbjct: 752 AAQENIKQVLRDYYFPKMGSLAKEVVANCRVCTQAKYDRHPKKQELGETPIPSYTGEMVH 811
Query: 1178 MDFIGGLPKAMGKDTILVVVDRFTKYAHFIALSHPYNAKEIAEVFIKEVVKLHGFPT--S 1235
+D + + L +D+F+KY A+ P ++ I ++ + ++ FP +
Sbjct: 812 IDIF-----STDRKLFLTCIDKFSKY----AIVQPVVSRTIVDITAPLLQIINLFPNIKT 862
Query: 1236 IVSDRDRVFLSTFWSEMFKLA-GTKLKFSSAYHPQTDGQTEVVNRCVETYLRCV-TGSKP 1293
+ D + F S + M K + G + + H ++GQ E + + RC+ K
Sbjct: 863 VYCDNEPAFNSETVTSMLKNSFGIDIVNAPPLHSSSNGQVERFHSTLAEIARCLKLDKKT 922
Query: 1294 KQWPKWLSWAEFWYNTNYHSAIKTTPFKALYGRESPVIFKGNDSLTSVDEVEKWTAERNL 1353
+ + A YN HS + P ++ V ER L
Sbjct: 923 NDTVELILRATIEYNKTVHSVTRERP---------------------IEVVHPGAHERCL 961
Query: 1354 ILEELKSNLEKAQNRMRQQANKHRRDVQYEVGDLVYLKIQPYKLKSLAKRSNQKLSP 1410
E+K+ L KAQ + N R++ +EVG+ V++K KR KL+P
Sbjct: 962 ---EIKARLVKAQQDSIGRNNPSRQNRVFEVGERVFVKNN--------KRLGNKLTP 1007
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 325 bits (833), Expect = 6e-88
Identities = 180/478 (37%), Positives = 282/478 (58%), Gaps = 28/478 (5%)
Query: 584 AEVPEGMRKILE----EYPEVFQEP-KGLPPRRTTDHAIQLQEGASIPNIRPYRYPFYQK 638
AE P+G ++IL E+P +F+ P G+ I+ I + Y YP +
Sbjct: 78 AEHPDGTQEILNSLLGEFPRIFEPPLSGMSVETAVKAEIRTNTQDPI-YAKSYPYPVNMR 136
Query: 639 NEIEKLVKEMLNSGIIRHSTSPFSSPAILVKKK-----DGGWRFCVDYRAINKATIPDKF 693
E+E+ + E+L GIIR S SP++SP +V KK + +R VD++ +N TIPD +
Sbjct: 137 GEVERQIDELLQDGIIRPSNSPYNSPIWIVPKKPKPNGEKQYRMVVDFKRLNTVTIPDTY 196
Query: 694 PIPIIDELLDEIGAAVVFSKLDLKSGYHQIRMKEEDIPKTAFRTHEGHYEYLVLPFGLTN 753
PIP I+ L +G A F+ LDL SG+HQI MKE DIPKTAF T G YE+L LPFGL N
Sbjct: 197 PIPDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPFGLKN 256
Query: 754 APSTFQALMNQVLRPYLRKFVLVFFYDILIYSKNEELHKDHLRIVLQVLKENNLVANQKK 813
AP+ FQ +++ +LR ++ K V+ DI+++S++ + H +LR+VL L + NL N +K
Sbjct: 257 APAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNLEK 316
Query: 814 CSFGQPEIIYLGHVISQAGVAADPSKIKDMLDWPIPKEVKGLRGFLGLTGYYRRFVKNYS 873
F ++ +LG++++ G+ ADP K++ + + P P VK L+ FLG+T YYR+F+++Y+
Sbjct: 317 SHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYRKFIQDYA 376
Query: 874 KLAQPLNQLLK------------KNSFQWTEEATQAFVKLKEVMTTVPVLVPPNFDKPFI 921
K+A+PL L + K E A Q+F LK ++ + +L P F KPF
Sbjct: 377 KVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKPFH 436
Query: 922 LETDASGKGLGAVLMQE----GRPVAYMSKTLSDRAQAKSVYERELMAVVLAVQKWRHYL 977
L TDAS +GAVL Q+ RP+AY+S++L+ + + E+E++A++ ++ R YL
Sbjct: 437 LTTDASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLDNLRAYL 496
Query: 978 LGSQFV-IHTDQRSLRFLADQRIMGEEQQKWMSKLMGYDFEIKYKPGIENKAADALSR 1034
G+ + ++TD + L F R + ++W +++ Y+ E+ YKPG N ADALSR
Sbjct: 497 YGAGTIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELIYKPGKSNVVADALSR 554
Score = 77.8 bits (190), Expect = 2e-13
Identities = 85/334 (25%), Positives = 138/334 (40%), Gaps = 37/334 (11%)
Query: 1086 LLYKDRIVLP-----KGSTKILTVLKEFHDTALGGHAGIFRTYKRISALFYWEGMKLDIQ 1140
LLYK RI G+ +I ++++ H A H G ++ +Y+ M I+
Sbjct: 673 LLYKIRITQRLVADVSGAEEICEIIEKEHRRA---HRGPTEIRLQLLEKYYFPRMSSTIR 729
Query: 1141 NYVQKCEVCQRNKYEALNPAGFLQPLPIPSQGWTDISMDFIGGLPKAMGKDTILVVVDRF 1200
C+ C+ KYE LQP PIP+ + +D A+ K L +D+F
Sbjct: 730 LQTSSCQCCKLYKYERHPNKPNLQPTPIPNYPCEILHIDIF-----ALEKRLYLSCIDKF 784
Query: 1201 TKYAHFIALSHPYNAKEIAEVFIKE--VVKLHGF--PTSIVSDRDRVFLSTFWSEMFKLA 1256
+K+A ++ + A V ++E V LH F P +VSD +R L +
Sbjct: 785 SKFAKL------FHLQSKASVHLRETLVEALHYFTAPKVLVSDNERGLLCPTVLNYLRSL 838
Query: 1257 GTKLKFSSAYHPQTDGQTEVVNRCVETYLRCVTGSKPKQWP-KWLSWAEFWYNTNYHSAI 1315
L ++ + +GQ E + RC+ P P + + A YNT+ HS
Sbjct: 839 DIDLYYAPTQKSEVNGQVERFHSTFLEIYRCLKDELPTFKPVELVHIAVDRYNTSVHSVT 898
Query: 1316 KTTPFKALYGRESPVIFKGNDSLTSVDEVEKWTAERNLILEELKSNLEKAQNRMRQQANK 1375
P + R S V ++G T R LE++K +E Q R NK
Sbjct: 899 NRKPADVFFDRSSRVNYQG------------LTDFRRQTLEDIKGLIEYKQIRGNMARNK 946
Query: 1376 HRRDVQ-YEVGDLVYLKIQPYKLKSLAKRSNQKL 1408
+R + + Y GD V++ + K K A+ +K+
Sbjct: 947 NRDEPKSYGPGDEVFVANKQIKTKEKARFRCEKV 980
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 267 bits (683), Expect = 1e-70
Identities = 155/457 (33%), Positives = 242/457 (52%), Gaps = 13/457 (2%)
Query: 590 MRKILEEYPEVFQ-EPKGLPPRRTTDHAIQLQEGASIPNIRPYRYPFYQKNEIEKLVKEM 648
+ I EY ++F E + + ++L++ + + YR P Q EI+ V+++
Sbjct: 279 LENICSEYIDIFALESEPITVNNLYKQQLRLKDDEPVYT-KNYRSPHSQVEEIQAQVQKL 337
Query: 649 LNSGIIRHSTSPFSSPAILVKKKDGG------WRFCVDYRAINKATIPDKFPIPIIDELL 702
+ I+ S S ++SP +LV KK WR +DYR INK + DKFP+P ID++L
Sbjct: 338 IKDKIVEPSVSQYNSPLLLVPKKSSPNSDKKKWRLVIDYRQINKKLLADKFPLPRIDDIL 397
Query: 703 DEIGAAVVFSKLDLKSGYHQIRMKEEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALM 762
D++G A FS LDL SG+HQI + E T+F T G Y + LPFGL AP++FQ +M
Sbjct: 398 DQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTSNGSYRFTRLPFGLKIAPNSFQRMM 457
Query: 763 NQVLRPYLRKFVLVFFYDILIYSKNEELHKDHLRIVLQVLKENNLVANQKKCSFGQPEII 822
++ D+++ +E+ +L V +E NL + +KCSF E+
Sbjct: 458 TIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEVFGKCREYNLKLHPEKCSFFMHEVT 517
Query: 823 YLGHVISQAGVAADPSKIKDMLDWPIPKEVKGLRGFLGLTGYYRRFVKNYSKLAQPLNQL 882
+LGH + G+ D K + ++P+P + R F+ YYRRF+KN++ ++ + +L
Sbjct: 518 FLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRFVAFCNYYRRFIKNFADYSRHITRL 577
Query: 883 LKKN-SFQWTEEATQAFVKLKEVMTTVPVLVPPNFDKPFILETDASGKGLGAVLMQEGR- 940
KKN F+WT+E +AF+ LK + +L P+F K F + TDAS + GAVL Q
Sbjct: 578 CKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSKEFCITTDASKQACGAVLTQNHNG 637
Query: 941 ---PVAYMSKTLSDRAQAKSVYERELMAVVLAVQKWRHYLLGSQFVIHTDQRSLRFLADQ 997
PVAY S+ + KS E+EL A+ A+ +R Y+ G F + TD R L +L
Sbjct: 638 HQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYIYGKHFTVKTDHRPLTYLFSM 697
Query: 998 RIMGEEQQKWMSKLMGYDFEIKYKPGIENKAADALSR 1034
+ + +L Y+F ++Y G +N ADALSR
Sbjct: 698 VNPSSKLTRIRLELEEYNFTVEYLKGKDNHVADALSR 734
Score = 115 bits (288), Expect = 9e-25
Identities = 103/419 (24%), Positives = 181/419 (42%), Gaps = 42/419 (10%)
Query: 1014 YDFEIKYKPGIENKAADALSRKLQFSA----ISSVQCAEWADLEAEILGDERYRKVLQEL 1069
YD Y GI + D ++L+ A IS ++ A W + + D+
Sbjct: 816 YDVGDLYTNGILD--LDQFLQRLELQAGIYDISQIKMAPWKKIFEHVSIDK--------F 865
Query: 1070 ATQGNSAIGYQLKRGRLLYKDRIVLPKGSTKILTVLKEFHDTALGGHAGIFRTYKRISAL 1129
GN + LK L +I K IL+ L + D GGH GI +T ++
Sbjct: 866 KNMGNKILK-NLKVALLNPVTQINNEKEKEAILSTLHD--DPIQGGHTGITKTLAKVKRH 922
Query: 1130 FYWEGMKLDIQNYVQKCEVCQRNKYEALNPAGFLQPLPI---PSQGWTDISMDFIGGLPK 1186
+YW+ M I+ YV+KC+ CQ+ K P+ I P + + +D IG LPK
Sbjct: 923 YYWKNMSKYIKEYVRKCQKCQKAKTTKHTKT----PMTITETPEHAFDRVVVDTIGPLPK 978
Query: 1187 AM-GKDTILVVVDRFTKYAHFIALSHPYNAKEIAEVFIKEVVKLHGFPTSIVSDRDRVFL 1245
+ G + + ++ TKY I +++ +AK +A+ + + +G + ++D +
Sbjct: 979 SENGNEYAVTLICDLTKYLVAIPIANK-SAKTVAKAIFESFILKYGPMKTFITDMGTEYK 1037
Query: 1246 STFWSEMFKLAGTKLKFSSAYHPQTDGQTEVVNRCVETYLRCVTGSKPKQWPKWLSWAEF 1305
++ +++ K K S+A+H QT G E +R + Y+R + W WL + +
Sbjct: 1038 NSIITDLCKYLKIKNITSTAHHHQTVGVVERSHRTLNEYIRSYISTDKTDWDVWLQYFVY 1097
Query: 1306 WYNTNYHSAIKTTPFKALYGRES--PVIFKGNDSLTSVDEVEKWTAERNLILE----ELK 1359
+NT P++ ++GR S P F S+ + ++ + E LE +
Sbjct: 1098 CFNTTQSMVHNYCPYELVFGRTSNLPKHFNKLHSIEPIYNIDDYAKESKYRLEVAYARAR 1157
Query: 1360 SNLEKAQNRMRQQANKHRRDVQYEVGDLVYLKIQPYKLKSLAKRSNQKLSPRYYGPYPI 1418
LE + + ++ + +D++ EVGD V L+ KL +Y GPY I
Sbjct: 1158 KLLEAHKEKNKENYDLKVKDIELEVGDKVLLR----------NEVGHKLDFKYTGPYKI 1206
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 680
Score = 230 bits (586), Expect = 3e-59
Identities = 191/684 (27%), Positives = 330/684 (47%), Gaps = 81/684 (11%)
Query: 419 LTSNRSFKVKGKIGNR-----EVLILIDCGATINFISQDLVVELEIPVIATSEYVVEVGN 473
+T+ S +KG++ + E+ +D GA++ S+ ++ E E V A +V++ +
Sbjct: 19 VTNPNSIYIKGRLYFKGYKKIELHCFVDTGASLCIASKFVIPE-EHWVNAERPIMVKIAD 77
Query: 474 GAKERNSGVCKNLKLEVQGIPIMQHFFILGLGGTEVVLGMDWLASLGNIEANFQELIIQW 533
G+ S VCK++ L + G+ G + ++G ++ L F + +I
Sbjct: 78 GSSITISKVCKDIDLIIVGVIFKIPTVYQQESGIDFIIGNNF-CQLYEPFIQFTDRVIFT 136
Query: 534 VSQGQKMVLQGEPSVCKVAA-----NWKSIKITEQQE-------------------AEGY 569
++ + + +V + K T+Q E +EG
Sbjct: 137 KNKSYPVHIAKLTRAVRVGTEGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSEGR 196
Query: 570 YLSYEY----QKEEEKTEAEVPEGMRKILEEYPEVFQEPKGLPPRRTTDH---AIQLQEG 622
LS E Q+ +KTE E + K+ E P L P +T +I+L +
Sbjct: 197 RLSEEKLFITQQRMQKTE----ELLEKVCSENP--------LDPNKTKQWMKASIKLSDP 244
Query: 623 ASIPNIRPYRYPFYQKNEIEKLVKEMLNSGIIRHSTSPFSSPAILVKKKD----GGWRFC 678
+ ++P +Y + E +K +KE+L+ +I+ S SP +PA LV + G R
Sbjct: 245 SKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAENGRGNKRMV 304
Query: 679 VDYRAINKATIPDKFPIPIIDELLDEIGAAVVFSKLDLKSGYHQIRMKEEDIPKTAFRTH 738
V+Y+A+NKAT+ D + +P DELL I +FS D KSG+ Q+ + +E P TAF
Sbjct: 305 VNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCP 364
Query: 739 EGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFYDILIYSKNEELHKDHLRIV 798
+GHYE+ V+PFGL APS FQ M++ R + RKF V+ DI+++S NEE H H+ ++
Sbjct: 365 QGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDIVVFSNNEEDHLLHVAMI 423
Query: 799 LQVLKENNLVANQKKCSFGQPEIIYLGHVIS------QAGVAADPSKIKDMLDWPIPKEV 852
LQ ++ ++ ++KK + +I +LG I Q + +K D L+ +
Sbjct: 424 LQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLE-----DK 478
Query: 853 KGLRGFLGLTGYYRRFVKNYSKLAQPLNQLLKKN-SFQWTEEATQAFVKLKEVMTTVPVL 911
K L+ FLG+ Y ++ N +++ QPL LK+N ++WT+E T K+K+ + P L
Sbjct: 479 KQLQRFLGILTYASDYIPNLAQMRQPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPL 538
Query: 912 VPPNFDKPFILETDAS----GKGLGAVLMQEGRPV----AYMSKTLSDRAQAKSVYEREL 963
P ++ I+ETDAS G L A+ + EG Y S + + ++E
Sbjct: 539 HHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYRSGSFKAAERNYHSNDKET 598
Query: 964 MAVVLAVQKWRHYLLGSQFVIHTDQRSLRFLADQRIMGEEQQ----KWMSKLMGYDFEIK 1019
+AV+ ++K+ YL F+I TD + + G+ + +W + L Y F+++
Sbjct: 599 LAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVE 658
Query: 1020 YKPGIENKAADALSRKLQFSAISS 1043
+ G +N AD LSR +F+ ++S
Sbjct: 659 HIKGTDNHFADFLSR--EFNKVNS 680
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 229 bits (585), Expect = 3e-59
Identities = 188/680 (27%), Positives = 332/680 (48%), Gaps = 73/680 (10%)
Query: 419 LTSNRSFKVKGKIGNR-----EVLILIDCGATINFISQDLVVELEIPVIATSEYVVEVGN 473
+T+ S +KG++ + E+ +D GA++ S+ ++ E E V A +V++ +
Sbjct: 18 VTNPNSIYIKGRLYFKGYKKIELHCFVDTGASLCIASKFVIPE-EHWVNAERPIMVKIAD 76
Query: 474 GAKERNSGVCKNLKLEVQGIPIMQHFFILGLGGTEVVLGMDWLASLGNIEANFQELIIQW 533
G+ S VCK++ L + G G + ++G ++ L F + +I
Sbjct: 77 GSSITISKVCKDIDLIIAGEIFRIPTVYQQESGIDFIIGNNF-CQLYEPFIQFTDRVIFT 135
Query: 534 VSQGQKMVLQGEPSVCKVAA-----NWKSIKITEQQE-------------------AEGY 569
++ + + +V + K T+Q E +EG
Sbjct: 136 KNKSYPVHIAKLTRAVRVGTEGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSEGR 195
Query: 570 YLSYEYQKEEEKTEAEVPEGMRKILEEYPEVFQEPKGLPPRRTTDH---AIQLQEGASIP 626
LS E ++ ++ E + K+ E P L P +T +I+L + +
Sbjct: 196 RLSEEKLFITQQRMQKIEELLEKVCSENP--------LDPNKTKQWMKASIKLSDPSKAI 247
Query: 627 NIRPYRYPFYQKNEIEKLVKEMLNSGIIRHSTSPFSSPAILV----KKKDGGWRFCVDYR 682
++P +Y + E +K +KE+L+ +I+ S SP +PA LV +K+ G R V+Y+
Sbjct: 248 KVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYK 307
Query: 683 AINKATIPDKFPIPIIDELLDEIGAAVVFSKLDLKSGYHQIRMKEEDIPKTAFRTHEGHY 742
A+NKAT+ D + +P DELL I +FS D KSG+ Q+ + +E P TAF +GHY
Sbjct: 308 AMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHY 367
Query: 743 EYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFYDILIYSKNEELHKDHLRIVLQVL 802
E+ V+PFGL APS FQ M++ R + RKF V+ DIL++S NEE H H+ ++LQ
Sbjct: 368 EWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHLLHVAMILQKC 426
Query: 803 KENNLVANQKKCSFGQPEIIYLGHVIS------QAGVAADPSKIKDMLDWPIPKEVKGLR 856
++ ++ ++KK + +I +LG I Q + +K D L+ + K L+
Sbjct: 427 NQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLE-----DKKQLQ 481
Query: 857 GFLGLTGYYRRFVKNYSKLAQPLNQLLKKN-SFQWTEEATQAFVKLKEVMTTVPVLVPPN 915
FLG+ Y ++ +++ +PL LK+N ++WT+E T K+K+ + P L P
Sbjct: 482 RFLGILTYASDYIPKLAQIRKPLQAKLKENVPWRWTKEDTLYMQKVKKNLQGFPPLHHPL 541
Query: 916 FDKPFILETDAS----GKGLGAVLMQEGRPVAYMSKTLSD--RAQAKSVY--ERELMAVV 967
++ I+ETDAS G L A+ + EG + + S +A K+ + ++E +AV+
Sbjct: 542 PEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKNYHSNDKETLAVI 601
Query: 968 LAVQKWRHYLLGSQFVIHTDQRSLRFLADQRIMGEEQQ----KWMSKLMGYDFEIKYKPG 1023
++K+ YL F+I TD + + G+ + +W + L Y F++++ G
Sbjct: 602 NTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKG 661
Query: 1024 IENKAADALSRKLQFSAISS 1043
+N AD LSR +F+ ++S
Sbjct: 662 TDNHFADFLSR--EFNKVNS 679
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 229 bits (585), Expect = 3e-59
Identities = 195/700 (27%), Positives = 340/700 (47%), Gaps = 95/700 (13%)
Query: 410 QLSLNSKEGLTSNRSFKVKGKIGNR-----EVLILIDCGATINFISQDLVVELEIPVIAT 464
Q + +T+ S +KG++ + E+ +D GA++ S+ ++ E E V A
Sbjct: 9 QTQIEQVMNVTNPNSIYIKGRLYFKGYKKIELHCFVDTGASLCIASKFVIPE-EHWVNAE 67
Query: 465 SEYVVEVGNGAKERNSGVCKNLKLEVQGIPIMQHFFILGLGGTEVVLGMDWLASLGNIEA 524
+V++ +G+ S VCK++ L + G + F I + E G+D++ +GN
Sbjct: 68 RPIMVKIADGSSITISKVCKDIDLIIAG----EIFKIPTVYQQES--GIDFI--IGNNFC 119
Query: 525 NFQELIIQW------------------VSQGQKMVLQG------------EPSVCKVAAN 554
E IQ+ +++ ++ ++G +P ++ N
Sbjct: 120 QLYEPFIQFTDRVIFTKNKSYPVHITKLTRAVRVGIEGFLESMKKRSKTQQPEPVNISTN 179
Query: 555 WKSIKITEQQE-----AEGYYLSYEYQKEEEKTEAEVPEGMRKILEEYPEVFQEPKGLPP 609
KI E +EG LS E ++ ++ E + K+ E P L P
Sbjct: 180 ----KIENPLEEIAILSEGRRLSEEKLFITQQRMQKIEELLEKVCSENP--------LDP 227
Query: 610 RRTTDH---AIQLQEGASIPNIRPYRYPFYQKNEIEKLVKEMLNSGIIRHSTSPFSSPAI 666
+T +I+L + + ++P +Y + E +K +KE+L+ +I+ S SP +PA
Sbjct: 228 NKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAF 287
Query: 667 LV----KKKDGGWRFCVDYRAINKATIPDKFPIPIIDELLDEIGAAVVFSKLDLKSGYHQ 722
LV +K+ G R V+Y+A+NKATI D + +P DELL I +FS D KSG+ Q
Sbjct: 288 LVNNEAEKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQ 347
Query: 723 IRMKEEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFYDIL 782
+ + +E P TAF +GHYE+ V+PFGL APS FQ M++ R + RKF V+ DIL
Sbjct: 348 VLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDIL 406
Query: 783 IYSKNEELHKDHLRIVLQVLKENNLVANQKKCSFGQPEIIYLGHVIS------QAGVAAD 836
++S NEE H H+ ++LQ ++ ++ ++KK + +I +LG I Q +
Sbjct: 407 VFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEH 466
Query: 837 PSKIKDMLDWPIPKEVKGLRGFLGLTGYYRRFVKNYSKLAQPLNQLLKKN-SFQWTEEAT 895
+K D L+ + K L+ FLG+ Y ++ +++ +PL LK+N ++WT+E T
Sbjct: 467 INKFPDTLE-----DKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDT 521
Query: 896 QAFVKLKEVMTTVPVLVPPNFDKPFILETDAS----GKGLGAVLMQEGRPVAYMSKTLSD 951
K+K+ + P L P ++ I+ETDAS G L A+ + EG + + S
Sbjct: 522 LYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASG 581
Query: 952 RAQAKS----VYERELMAVVLAVQKWRHYLLGSQFVIHTDQRSLRFLADQRIMGEEQQ-- 1005
+A ++E +AV+ ++K+ YL F+I TD + + G+ +
Sbjct: 582 SFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGR 641
Query: 1006 --KWMSKLMGYDFEIKYKPGIENKAADALSRKLQFSAISS 1043
+W + L Y F++++ G +N AD LSR +F+ ++S
Sbjct: 642 NIRWQAWLSHYSFDVEHIKGTDNHFADFLSR--EFNKVNS 679
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 229 bits (583), Expect = 6e-59
Identities = 191/686 (27%), Positives = 337/686 (48%), Gaps = 85/686 (12%)
Query: 419 LTSNRSFKVKGKIGNR-----EVLILIDCGATINFISQDLVVELEIPVIATSEYVVEVGN 473
+T+ S +KG++ + E+ +D GA++ S+ ++ E E V A +V++ +
Sbjct: 18 VTNPNSIYIKGRLYFKGYKKIELHCFVDTGASLCIASKFVIPE-EHWVNAERPIMVKIAD 76
Query: 474 GAKERNSGVCKNLKL----EVQGIPIMQHFFILGLGGTEVVLGMDWLASLGNIEANFQEL 529
G+ S VCK++ L E+ IP + + G+D++ +GN NF +L
Sbjct: 77 GSSITISKVCKDIDLIIAREIFKIPTVY----------QQESGIDFI--IGN---NFCQL 121
Query: 530 IIQWVSQGQKMVLQGEPSV-CKVAANWKSIKI------------TEQQEAEGYYLSYEYQ 576
++ +++ S +A +++++ ++ Q+ E +S
Sbjct: 122 YEPFIQFTDRVIFTKNKSYPVHIAKLTRAVRVGTEGFLESMKKRSKTQQPEPVNISTNKI 181
Query: 577 KEEEKTEAEVPEGMR-------------KILEEYPEVFQEPKGLPPRRTTDH---AIQLQ 620
+ K A + EG R + +EE E L P +T +I+L
Sbjct: 182 ENPLKEIAILSEGRRLSEEKLFITQQRMQKIEELLEKVCSENPLDPNKTKQWMKASIKLS 241
Query: 621 EGASIPNIRPYRYPFYQKNEIEKLVKEMLNSGIIRHSTSPFSSPAILV----KKKDGGWR 676
+ + ++P +Y + E +K +KE+L+ +I+ S SP +PA LV +K+ G R
Sbjct: 242 DPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKR 301
Query: 677 FCVDYRAINKATIPDKFPIPIIDELLDEIGAAVVFSKLDLKSGYHQIRMKEEDIPKTAFR 736
V+Y+A+NKATI D + +P DELL I +FS D KSG+ Q+ + +E P TAF
Sbjct: 302 MVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFT 361
Query: 737 THEGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFYDILIYSKNEELHKDHLR 796
+GHYE+ V+PFGL APS FQ M++ R + RKF V+ DIL++S NEE H H+
Sbjct: 362 CPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHLLHVA 420
Query: 797 IVLQVLKENNLVANQKKCSFGQPEIIYLGHVIS------QAGVAADPSKIKDMLDWPIPK 850
++LQ ++ ++ ++KK + +I +LG I Q + +K D L+
Sbjct: 421 MILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLE----- 475
Query: 851 EVKGLRGFLGLTGYYRRFVKNYSKLAQPLNQLLKKN-SFQWTEEATQAFVKLKEVMTTVP 909
+ K L+ FLG+ Y ++ +++ +PL LK+N ++WT+E T K+K+ + P
Sbjct: 476 DKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFP 535
Query: 910 VLVPPNFDKPFILETDAS----GKGLGAVLMQEGRPVAYMSKTLSDRAQAKS----VYER 961
L P ++ I+ETDAS G L A+ + EG + + S +A ++
Sbjct: 536 PLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNYHSNDK 595
Query: 962 ELMAVVLAVQKWRHYLLGSQFVIHTDQRSLRFLADQRIMGEEQQ----KWMSKLMGYDFE 1017
E +AV+ ++K+ YL F+I TD + + G+ + +W + L Y F+
Sbjct: 596 ETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFD 655
Query: 1018 IKYKPGIENKAADALSRKLQFSAISS 1043
+++ G +N AD LSR +F+ ++S
Sbjct: 656 VEHIKGTDNHFADFLSR--EFNKVNS 679
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 674
Score = 229 bits (583), Expect = 6e-59
Identities = 191/701 (27%), Positives = 348/701 (49%), Gaps = 86/701 (12%)
Query: 401 EFVLEGKVLQLSLNSKEGLTSNRSFKVKGKIGNR-----EVLILIDCGATINFISQDLVV 455
+ +L+ +Q +T+ S +KG++ + E+ +D GA++ S+ ++
Sbjct: 2 DHLLQKTQIQNQTEQVMNITNPNSIYIKGRLYFKGYKKIELHCFVDTGASLCIASKFVIP 61
Query: 456 ELEIPVIATSEYVVEVGNGAKERNSGVCKNLKLEVQGIPIMQHFFILGLGGTEVVLGMDW 515
E E + A +V++ +G+ + VC+++ L + G + F I + E G+D+
Sbjct: 62 E-EHWINAERPIMVKIADGSSITINKVCRDIDLIIAG----EIFHIPTVYQQES--GIDF 114
Query: 516 LASLGNIEANFQELIIQWVSQGQKMVLQGEPSV-CKVAANWKSIKI-------------- 560
+ +GN NF +L ++ +++ + + +A +++++
Sbjct: 115 I--IGN---NFCQLYEPFIQFTDRVIFTKDRTYPVHIAKLTRAVRVGTEGFLESMKKRSK 169
Query: 561 TEQQE------------AEGYYLSYEYQKEEEKTEAEVPEGMRKILEEYPEVFQEPKGLP 608
T+Q E +EG LS E ++ ++ E + K+ E P L
Sbjct: 170 TQQPEPVNISTNKIAILSEGRRLSEEKLFITQQRMQKIEELLEKVCSENP--------LD 221
Query: 609 PRRTTDH---AIQLQEGASIPNIRPYRYPFYQKNEIEKLVKEMLNSGIIRHSTSPFSSPA 665
P +T +I+L + + ++P +Y + E +K +KE+L+ +I+ S SP +PA
Sbjct: 222 PNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPA 281
Query: 666 ILV----KKKDGGWRFCVDYRAINKATIPDKFPIPIIDELLDEIGAAVVFSKLDLKSGYH 721
LV +K+ G R V+Y+A+NKAT+ D + P DELL I +FS D KSG+
Sbjct: 282 FLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNPPNKDELLTLIRGKKIFSSFDCKSGFW 341
Query: 722 QIRMKEEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFYDI 781
Q+ + +E P TAF +GHYE+ V+PFGL APS FQ M++ R + RKF V+ DI
Sbjct: 342 QVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDI 400
Query: 782 LIYSKNEELHKDHLRIVLQVLKENNLVANQKKCSFGQPEIIYLGHVIS------QAGVAA 835
L++S NEE H H+ ++LQ ++ ++ ++KK + +I +LG I Q +
Sbjct: 401 LVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILE 460
Query: 836 DPSKIKDMLDWPIPKEVKGLRGFLGLTGYYRRFVKNYSKLAQPLNQLLKKN-SFQWTEEA 894
+K D L+ + K L+ FLG+ Y ++ +++ +PL LK+N ++WT+E
Sbjct: 461 HINKFPDTLE-----DKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKED 515
Query: 895 TQAFVKLKEVMTTVPVLVPPNFDKPFILETDAS----GKGLGAVLMQEGRPVAYMSKTLS 950
T K+K+ + P L P ++ I+ETDAS G L A+ + EG + + S
Sbjct: 516 TLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYAS 575
Query: 951 D--RAQAKSVY--ERELMAVVLAVQKWRHYLLGSQFVIHTDQRSLRFLADQRIMGEEQQ- 1005
+A K+ + ++E +AV+ ++K+ YL F+I TD + + G+ +
Sbjct: 576 GSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLG 635
Query: 1006 ---KWMSKLMGYDFEIKYKPGIENKAADALSRKLQFSAISS 1043
+W + L Y F++++ G +N AD LSR +F+ ++S
Sbjct: 636 RNIRWQAWLSHYSFDVEHIKGTDNHFADFLSR--EFNRVNS 674
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 666
Score = 214 bits (546), Expect = 1e-54
Identities = 181/673 (26%), Positives = 328/673 (47%), Gaps = 47/673 (6%)
Query: 393 IFEEAEDGEFVLEGKVLQLSLNSKEGLTSNRSFKVKGKIG-----NREVLILIDCGATIN 447
+F E E G F L K L LN +T+ S ++GK+ + + +D GA++
Sbjct: 6 LFREGELGHFCLN-KQEMLHLN----VTNPNSIYIEGKLSFEGYKSFNIHCYVDTGASLC 60
Query: 448 FISQDLVVELEIPVIATSEYVVEVGNGAKERNSGVCKNLKLEVQGIPIMQHFFILGLGGT 507
S+ ++ E E+ + + V++ N + + VCKNLK++ G + F I +
Sbjct: 61 IASRYIIPE-ELWENSPKDIQVKIANQELIKITKVCKNLKVKFAG----KSFEIPTVYQQ 115
Query: 508 EVVLGMDWLASLGNIEANFQELIIQWVSQ-----GQKMVLQGEPSVCKVAANWKSI---- 558
E G+D+L +GN IQW + +MVL + + +N +
Sbjct: 116 ET--GIDFL--IGNNFCRLYNPFIQWEDRIAFHLKNEMVLIKKVTKAFSVSNPSFLENMK 171
Query: 559 KITEQQEAEGYYLSYEYQKEEEKTEAEVPEGMRKILEEYPEVFQEPKGLP--PRRTTDHA 616
K ++ ++ G +S EE+ + E +KI + +V E P ++ +
Sbjct: 172 KDSKTEQIPGTNISKNIINPEERYFL-ITEKYQKIEQLLDKVCSENPIDPIKSKQWMKAS 230
Query: 617 IQLQEGASIPNIRPYRYPFYQKNEIEKLVKEMLNSGIIRHSTSPFSSPAILVK----KKD 672
I+L + + ++P Y + K +KE+L+ G+I S S SPA LV+ ++
Sbjct: 231 IKLIDPLKVIRVKPMSYSPQDREGFAKQIKELLDLGLIIPSKSQHMSPAFLVENEAERRR 290
Query: 673 GGWRFCVDYRAINKATIPDKFPIPIIDELLDEIGAAVVFSKLDLKSGYHQIRMKEEDIPK 732
G R V+Y+AIN+ATI D +P + ELL + +FS D KSG+ Q+ + EE
Sbjct: 291 GKKRMVVNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKSGFWQVVLDEESQKL 350
Query: 733 TAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFYDILIYSKNEELHK 792
TAF +GH+++ V+PFGL APS FQ M L KF +V+ DI+++S +E H
Sbjct: 351 TAFTCPQGHFQWKVVPFGLKQAPSIFQRHMQTALNG-ADKFCMVYVDDIIVFSNSELDHY 409
Query: 793 DHLRIVLQVLKENNLVANQKKCSFGQPEIIYLGHVISQAGVAADPSKIKDMLDWPIP-KE 851
+H+ VL+++++ ++ ++KK + + +I +LG I + ++++ +P ++
Sbjct: 410 NHVYAVLKIVEKYGIILSKKKANLFKEKINFLGLEIDKGTHCPQNHILENIHKFPDRLED 469
Query: 852 VKGLRGFLGLTGYYRRFVKNYSKLAQPLNQLLKKN-SFQWTEEATQAFVKLKEVMTTVPV 910
K L+ FLG+ Y ++ +++ +PL LKK+ ++ WT+ + K+K+ + + P
Sbjct: 470 KKHLQRFLGVLTYAETYIPKLAEIRKPLQVKLKKDVTWNWTQSDSDYVKKIKKNLGSFPK 529
Query: 911 LVPPNFDKPFILETDASGKGLGAVLMQEGRP-----VAYMSKTLSDRAQAKSVYERELMA 965
L P + I+ETDAS G VL Y S + + ++EL+A
Sbjct: 530 LYLPKPEDHLIIETDASDSFWGGVLKARALDGVELICRYSSGSFKQAEKNYHSNDKELLA 589
Query: 966 VVLAVQKWRHYLLGSQFVIHTDQRSLRFLADQRIMGEEQQ----KWMSKLMGYDFEIKYK 1021
V + K+ YL +F + TD ++ + + G+ +Q +W + Y F++++
Sbjct: 590 VKQVITKFSAYLTPVRFTVRTDNKNFTYFLRINLKGDSKQGRLVRWQNWFSKYQFDVEHL 649
Query: 1022 PGIENKAADALSR 1034
G++N AD L+R
Sbjct: 650 EGVKNVLADCLTR 662
>POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;
Protease (EC 3.4.23.-); Reverse transcriptase (EC
2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1886
Score = 189 bits (479), Expect = 7e-47
Identities = 130/421 (30%), Positives = 210/421 (49%), Gaps = 34/421 (8%)
Query: 648 MLNSGIIRHSTSPFSSPAILV-----------KKKDGGWRFCVDYRAINKATIPDKFPIP 696
+L +IR S S S A +V K+K G R +Y+ +N+ T D++ +P
Sbjct: 1425 LLQMKVIRPSESKHRSTAFIVRSGTEIDPITGKEKKGKERMVFNYKLLNENTESDQYSLP 1484
Query: 697 IIDELLDEIGAAVVFSKLDLKSGYHQIRMKEEDIPKTAFRTHEGHYEYLVLPFGLTNAPS 756
I+ ++ ++G + ++SK DLKSG+ Q+ M+EE +P TAF YE+LV+PFGL NAP+
Sbjct: 1485 GINTIISKVGRSKIYSKFDLKSGFWQVAMEEESVPWTAFLAGNKLYEWLVMPFGLKNAPA 1544
Query: 757 TFQALMNQVLRPYLRKFVLVFFYDILIYSKNEELHKDHLRIVLQVLKENNLVANQKKCSF 816
FQ M+ V + KF+ V+ DIL++S+ E H HL +LQ+ KEN L+ + K
Sbjct: 1545 IFQRKMDNVFKG-TEKFIAVYIDDILVFSETAEQHSQHLYTMLQLCKENGLILSPTKMKI 1603
Query: 817 GQPEIIYLGHVISQAGVAADPSKIKDMLDWPIPK--EVKGLRGFLGLTGYYRRFVKNYSK 874
G PEI +LG + + P I + D+ K +G+R +LG+ Y R ++++ K
Sbjct: 1604 GTPEIDFLGASLGCTKIKLQPHIISKICDFSDEKLATPEGMRSWLGILSYARNYIQDIGK 1663
Query: 875 LAQPLNQLLKKNSFQWTEEATQAFVK-LKEVMTTVPVLVPPNFDKPFILETDASGKGLGA 933
L QPL Q + + T V+ +KE + +P L P D I+ETD G GA
Sbjct: 1664 LVQPLRQKMAPTGDKRMNPETWKMVRQIKEKVKNLPDLQLPPKDSFIIIETDGCMTGWGA 1723
Query: 934 VL---------MQEGRPVAYMSKTLSDRAQAKSVYERELMAVVLAVQKWR-HYLLGSQFV 983
V R AY S + + KS + E+ A + + K++ +YL + +
Sbjct: 1724 VCKWKMSKHDPRSTERICAYASGSFN---PIKSTIDAEIQAAIHGLDKFKIYYLDKKELI 1780
Query: 984 IHTD-QRSLRFLADQRIMGEEQQKWMS-----KLMGYDFEIKYKPGIENKAADALSRKLQ 1037
I +D + ++F + +W++ +G ++ G N ADALSR +
Sbjct: 1781 IRSDCEAIIKFYNKTNENKPSRVRWLTFSDFLTGLGITVTFEHIDGKHNGLADALSRMIN 1840
Query: 1038 F 1038
F
Sbjct: 1841 F 1841
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 659
Score = 188 bits (477), Expect = 1e-46
Identities = 159/637 (24%), Positives = 294/637 (45%), Gaps = 35/637 (5%)
Query: 426 KVKGKIGNREVLILIDCGATINFISQDLVVELEIPVIATSEYVVEVGNGAKERNSGVCKN 485
K G N ++ +D G+++ S+ ++ E E A +++ NG + + VC
Sbjct: 19 KFPGYQTNLDLHCYVDTGSSLCMASKYVIPE-EYWQTAEKPLNIKIANGKIIQLTKVCSK 77
Query: 486 LKLEVQGIPIMQHFFILGLGGTEVVLGMDWLASLGNI---------EANFQELIIQWVSQ 536
L + + G + G +++LG ++ N Q +II +++
Sbjct: 78 LPIRLGGERFLIPTLFQQESGIDLLLGNNFCQLYSPFIQYTDRIYFHLNKQSVIIGKITK 137
Query: 537 GQKMVLQG--EPSVCKVAANW-KSIKITEQQEAEGYYLSYEYQKEEEKTEAEVPEGMRKI 593
+ ++G E K N + I IT Q + E ++ E+
Sbjct: 138 AYQYGVKGFLESMKKKSKVNRPEPINITSNQ----HLFLEEGGNHVDEMLYEIQISKFSA 193
Query: 594 LEEYPEVFQEPKGLPPRRTTDH---AIQLQEGASIPNIRPYRYPFYQKNEIEKLVKEMLN 650
+EE E + P ++ I+L + ++ ++P Y + E ++ +KE+L
Sbjct: 194 IEEMLERVSSENPIDPEKSKQWMTATIELIDPKTVVKVKPMSYSPSDREEFDRQIKELLE 253
Query: 651 SGIIRHSTSPFSSPAILVK----KKDGGWRFCVDYRAINKATIPDKFPIPIIDELLDEIG 706
+I+ S S SPA LV+ ++ G R V+Y+A+NKAT D +P DELL +
Sbjct: 254 LKVIKPSKSTHMSPAFLVENEAERRRGKKRMVVNYKAMNKATKGDAHNLPNKDELLTLVR 313
Query: 707 AAVVFSKLDLKSGYHQIRMKEEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQVL 766
++S D KSG Q+ + +E TAF +GHY++ V+PFGL APS F
Sbjct: 314 GKKIYSSFDCKSGLWQVLLDKESQLLTAFTCPQGHYQWNVVPFGLKQAPSIFPKTYANSH 373
Query: 767 RPYLRKFVLVFFYDILIYSK-NEELHKDHLRIVLQVLKENNLVANQKKCSFGQPEIIYLG 825
K+ V+ DIL++S + H H+ +L+ ++ ++ ++KK + +I +LG
Sbjct: 374 SNQYSKYCCVYVDDILVFSNTGRKEHYIHVLNILRRCEKLGIILSKKKAQLFKEKINFLG 433
Query: 826 HVISQAGVAADPSKIKDMLDWPIP-KEVKGLRGFLGLTGYYRRFVKNYSKLAQPLNQLLK 884
I Q ++ + +P ++ K L+ FLG+ Y ++ + + +PL LK
Sbjct: 434 LEIDQGTHCPQNHILEHIHKFPDRIEDKKQLQRFLGILTYASDYIPKLASIRKPLQSKLK 493
Query: 885 KNS-FQWTEEATQAFVKLKEVMTTVPVLVPPNFDKPFILETDASGKGLGAVLMQEGRPVA 943
++S + W + +Q K+K+ + + P L P + ++ETDAS + G +L
Sbjct: 494 EDSTWTWNDTDSQYMAKIKKNLKSFPKLYHPEPNDKLVIETDASEEFWGGILKAIHNSHE 553
Query: 944 YMSKTLSDRAQAKS----VYERELMAVVLAVQKWRHYLLGSQFVIHTDQRSLRFLADQRI 999
Y+ + S +A E+EL+AV+ ++K+ YL S+F+I TD ++ + +
Sbjct: 554 YICRYASGSFKAAERNYHSNEKELLAVIRVIKKFSIYLTPSRFLIRTDNKNFTHFVNINL 613
Query: 1000 MGEEQQ----KWMSKLMGYDFEIKYKPGIENKAADAL 1032
G+ +Q +W L YDF++++ G +N AD L
Sbjct: 614 KGDRKQGRLVRWQMWLSQYDFDVEHIAGTKNVFADFL 650
>POL_SOCMV (P15629) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 692
Score = 168 bits (426), Expect = 9e-41
Identities = 176/687 (25%), Positives = 302/687 (43%), Gaps = 101/687 (14%)
Query: 427 VKGKIGNREVLILIDCGATINFISQDLVVELEIPVIATSEYVVEVGNGAKERNSGVCKNL 486
+K IG R L ID GAT+ F + + EI + + + +K N+
Sbjct: 22 IKVSIGKRNFLAYIDTGATLCFGKRKISNNWEI---LKQPKEIIIADKSKHYIREAISNV 78
Query: 487 KLEVQGIPIMQHFFILGLGGTEVVLGMDWLASLGNIEANFQELIIQWVSQGQKMVLQGEP 546
L+++ + L G ++++G ++L + + ++W + + +
Sbjct: 79 FLKIENKEFLIPIIYLHDSGLDLIIGNNFLKLYQPFIQRLETIELRWKNLNNPK--ESQM 136
Query: 547 SVCKVAANWKSIKITEQQ-----EAEGYYLSYEYQKEEEKTEAEVPEGMRKILEEYPEVF 601
K+ + +K++ ++ E ++ + E Q EE +E + E K
Sbjct: 137 ISTKILTKNEVLKLSFEKIHICLEKYLFFKTIEEQLEEVCSEHPLDETKNKNGLLIEIRL 196
Query: 602 QEPKGLPPRRTTDHAIQLQEGASIPNIRPYRYPFYQKNEIEKLVKEMLNSGIIRHSTSPF 661
++P LQE ++ N PY Q E ++ +++L G+IR S SP
Sbjct: 197 KDP--------------LQE-INVTNRIPYTIRDVQ--EFKEECEDLLKKGLIRESQSPH 239
Query: 662 SSPAILVKK----KDGGWRFCVDYRAINKATIPDKFPIPIIDELLDEIGAAVVFSKLDLK 717
S+PA V+ K G R ++Y+ +N+ATI D + +P D +L++I ++ FS LD K
Sbjct: 240 SAPAFYVENHNEIKRGKRRMVINYKKMNEATIGDSYKLPRKDFILEKIKGSLWFSSLDAK 299
Query: 718 SGYHQIRMKEEDIPKTAFR-THEGHYEYLVLPFGLTNAPSTFQALMNQVLRPYLRKFVLV 776
SGY+Q+R+ E P TAF + HYE+ VL FGL APS +Q M+Q L+ L L
Sbjct: 300 SGYYQLRLHENTKPLTAFSCPPQKHYEWNVLSFGLKQAPSIYQRFMDQSLKG-LEHICLA 358
Query: 777 FFYDILIYSK-NEELHKDHLRIVLQVLKENNLVANQKKCSFGQPEIIYLGHVISQAG-VA 834
+ DILI++K ++E H + +RIVLQ +KE ++ ++KK Q EI YLG I G +
Sbjct: 359 YIDDILIFTKGSKEQHVNDVRIVLQRIKEKGIIISKKKSKLIQQEIEYLGLKIQGNGEID 418
Query: 835 ADPSKIKDMLDWPIP-KEVKGLRGFLGLTGYYRR--FVKNYSKLAQPLNQLLK-KNSFQW 890
P + +L +P ++ K ++ FLG Y F KN + + L + + KN ++W
Sbjct: 419 LSPHTQEKILQFPDELEDRKQIQRFLGCINYIANEGFFKNLALERKHLQKKISVKNPWKW 478
Query: 891 TEEATQAFVKLKEVMTTVPVLVPPNFDKPFILETDAS-----------GKGLGAVLMQE- 938
T+ +K + ++P L + I+ETDAS KG + + E
Sbjct: 479 DTIDTKMVQSIKGKIQSLPKLYNASIQDFLIVETDASQHSWSGCLRALPKGKQKIGLDEF 538
Query: 939 GRPVAYM----SKTLSDRAQAK-------------------------------------- 956
G P A + S SD + A+
Sbjct: 539 GIPTADLCTGSSSASSDNSPAEIDKCHSASKQDTHVASKIKKLENELLLCKYVSGTFTDT 598
Query: 957 ----SVYERELMAVVLAVQKWRHYLLGSQFVIHTDQRSLRFLADQRIMGEEQQ----KWM 1008
+ E E++A V ++KWR LL ++F++ TD + I + + +W
Sbjct: 599 ETRYPIAELEVLAGVKVLEKWRIDLLQTRFLLRTDSKYFAGFCRYNIKTDYRNGRLIRWQ 658
Query: 1009 SKLMGYDFEIKYKPGIENKAADALSRK 1035
+L Y ++ N AD L+R+
Sbjct: 659 LRLQAYQPYVELIKSENNPFADTLTRE 685
>M860_ARATH (P92523) Hypothetical mitochondrial protein AtMg00860
(ORF158)
Length = 158
Score = 159 bits (401), Expect = 7e-38
Identities = 75/131 (57%), Positives = 98/131 (74%), Gaps = 2/131 (1%)
Query: 793 DHLRIVLQVLKENNLVANQKKCSFGQPEIIYLGH--VISQAGVAADPSKIKDMLDWPIPK 850
+HL +VLQ+ +++ AN+KKC+FGQP+I YLGH +IS GV+ADP+K++ M+ WP PK
Sbjct: 2 NHLGMVLQIWEQHQFYANRKKCAFGQPQIAYLGHRHIISGEGVSADPAKLEAMVGWPEPK 61
Query: 851 EVKGLRGFLGLTGYYRRFVKNYSKLAQPLNQLLKKNSFQWTEEATQAFVKLKEVMTTVPV 910
LRGFLGLTGYYRRFVKNY K+ +PL +LLKKNS +WTE A AF LK +TT+PV
Sbjct: 62 NTTELRGFLGLTGYYRRFVKNYGKIVRPLTELLKKNSLKWTEMAALAFKALKGAVTTLPV 121
Query: 911 LVPPNFDKPFI 921
L P+ PF+
Sbjct: 122 LALPDLKLPFV 132
>POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat
protein; Protease (EC 3.4.23.-); Reverse transcriptase
(EC 2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1675
Score = 153 bits (387), Expect = 3e-36
Identities = 126/414 (30%), Positives = 200/414 (47%), Gaps = 36/414 (8%)
Query: 638 KNEIEKLVKEMLNSGII-------RHSTSPF----SSPAILVKKKDGGWRFCVDYRAINK 686
K EK +KE+L++ +I RH T+ F S + K R +Y+ +N
Sbjct: 1196 KEVFEKQIKELLDNKLIKKADPTCRHRTAAFIVRNHSEEVAQKP-----RIVYNYKRLND 1250
Query: 687 ATIPDKFPIPIIDELLDEIGAAVVFSKLDLKSGYHQIRMKEEDIPKTAFRTHEGHYEYLV 746
D F IP +++ I A +FSK DLK+G+H +++K++ T F EG Y + V
Sbjct: 1251 NMHTDPFNIPHKISMINLIQKANIFSKFDLKAGFHHMKLKDDFKDWTTFTCSEGLYTWNV 1310
Query: 747 LPFGLTNAPSTFQALMNQVLRPYLRKFVLVFFYDILIYSKNEELHKDHLRIVLQVLKENN 806
PFG+ NAP FQ M + KF L++ DILI S NE+ H +HL+I +KE
Sbjct: 1311 CPFGIANAPCAFQRFMQESFGDL--KFALLYIDDILIASNNEKEHIEHLKIFFNRVKEVG 1368
Query: 807 LVANQKKCSFGQPEIIYLGHVISQAGVAADP---SKIKDMLDWPIPKEVKGLRGFLGLTG 863
V ++KK E+ YLG I + ++ P KIK D +KGL+ +LGL
Sbjct: 1369 CVLSKKKSKMFLKEVEYLGVEIKEGKISLQPHIVDKIK-KFDKNKLNTLKGLQAYLGLLN 1427
Query: 864 YYRRFVKNYSKLAQPLNQLLKKNSFQ-WTEEATQAFVKLKEVMTTVPVLVPPNFDKPFIL 922
Y R ++K+ SKL PL + KN + + +E K++ ++ + L P I+
Sbjct: 1428 YARGYIKDLSKLVGPLYKKTGKNGQRIFNKEDWNIIFKIEREVSKIKPLERPKETDYIII 1487
Query: 923 ETDASGKGLGAVLM---------QEGRPVAYMSKTLSDRAQAKSVYERELMAVVLAVQKW 973
ETDAS +G GAVL+ + Y S ++ S+ + E+ A+ A+ K+
Sbjct: 1488 ETDASEEGWGAVLVCKPDKYSGKDTEKIAGYASGNFGEKKTWTSL-DYEIEAINEALNKF 1546
Query: 974 RHYLLGSQFVIHTD-QRSLRFLADQRIMGEEQQKWMSKLMGYDFEIKYKPGIEN 1026
+ Y L F I TD + ++ + + + +W+ KL + YKP E+
Sbjct: 1547 QIY-LDKDFTIRTDCEAIVKGIKTEDYKKRSKTRWI-KLRDNLLKDGYKPTFEH 1598
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.317 0.135 0.398
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 189,150,092
Number of Sequences: 164201
Number of extensions: 8623258
Number of successful extensions: 30550
Number of sequences better than 10.0: 224
Number of HSP's better than 10.0 without gapping: 117
Number of HSP's successfully gapped in prelim test: 110
Number of HSP's that attempted gapping in prelim test: 29735
Number of HSP's gapped (non-prelim): 539
length of query: 1558
length of database: 59,974,054
effective HSP length: 124
effective length of query: 1434
effective length of database: 39,613,130
effective search space: 56805228420
effective search space used: 56805228420
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 73 (32.7 bits)
Lotus: description of TM0106.8