
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0074.19
(2281 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 475 e-133
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 473 e-132
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 473 e-132
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 340 3e-92
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 340 3e-92
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 327 2e-88
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 325 9e-88
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 313 3e-84
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 274 2e-72
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 200 4e-50
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 198 1e-49
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 198 2e-49
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 197 3e-49
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 195 1e-48
POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;... 192 1e-47
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot... 191 3e-47
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot... 189 8e-47
POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat pr... 142 1e-32
M860_ARATH (P92523) Hypothetical mitochondrial protein AtMg00860... 137 3e-31
RRPO_OENBE (P31843) RNA-directed DNA polymerase homolog (Reverse... 132 1e-29
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 475 bits (1223), Expect = e-133
Identities = 297/897 (33%), Positives = 474/897 (52%), Gaps = 46/897 (5%)
Query: 1023 ELMETLEEFQEVFR--SKIQLP-PERSKVHQIKLFPEQETINVRPYRYPHHQKEEIERQV 1079
EL + +EF+++ + +LP P + +++L E + +R Y P + + + ++
Sbjct: 373 ELPDIYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEI 432
Query: 1080 AELMEAGIIRPSMSAYSSPVILVKKKDKSWRMCVDYRALNKATIPDKYPIPIVDELLDEL 1139
+ +++GIIR S + + PV+ V KK+ + RM VDY+ LNK P+ YP+P++++LL ++
Sbjct: 433 NQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKI 492
Query: 1140 NGASIFSKIDLKSGYHQIRVHEDDIEKTAFRTHNGHYEYLVMPFGLMNAPATFQAVMNDI 1199
G++IF+K+DLKS YH IRV + D K AFR G +EYLVMP+G+ APA FQ +N I
Sbjct: 493 QGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTI 552
Query: 1200 FRPYLRKFVLVFFDDILIYSKDLPEHLTHLKLVLSVLLANCFVANQTKCKFGCASIDYLG 1259
V+ + DDILI+SK EH+ H+K VL L + NQ KC+F + + ++G
Sbjct: 553 LGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIG 612
Query: 1260 HIISGAGMAVDPEKVKCIMDWPVPKNVKGVRGFLGLTGYYRKFIKDYGKMANPLTELTKK 1319
+ IS G E + ++ W PKN K +R FLG Y RKFI ++ +PL L KK
Sbjct: 613 YHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKK 672
Query: 1320 D-SFSWGPEADRAFLQLKRVMTSPPVLILPNFTLPFEVECDAAGRGIGAVLMQQRQ---- 1374
D + W P +A +K+ + SPPVL +F+ +E DA+ +GAVL Q+
Sbjct: 673 DVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKY 732
Query: 1375 -PLAFFSKALSAGNLAKSVYEKELMALVLSIQHWRHYLLG--KEFIVYTDHKSLKHFLQQ 1431
P+ ++S +S L SV +KE++A++ S++HWRHYL + F + TDH++L +
Sbjct: 733 YPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITN 792
Query: 1432 KVSSPDQQ---CWLAKLMGYQFQVKYKPGLENKAADALSRRYDEVE----------LHSL 1478
+ S P+ + W L + F++ Y+PG N ADALSR DE E ++ +
Sbjct: 793 E-SEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEPIPKDSEDNSINFV 851
Query: 1479 ISFPLWND-RKRLLEEITQDPYIQDLQSAVQKDPASKPGFAVQHGVLLYHGRLVLSPTSP 1537
+ +D + +++ E T D + +L + +D + ++ G+L+ +L P
Sbjct: 852 NQISITDDFKNQVVTEYTNDTKLLNLLN--NEDKRVEENIQLKDGLLINSKDQILLPNDT 909
Query: 1538 SIP-WLLEEFHGSPSGGHSGFLRTYRRLATTLYWVGMQKRVRDYVRACDVCQRHKYSALS 1596
+ +++++H H G + W G++K++++YV+ C CQ +K
Sbjct: 910 QLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHK 969
Query: 1597 PGGLLQPLPIPNAVWEDLSLDFITGLPKSKGFEAVLVVVDRLSKYSHFILLKHPYTAKSI 1656
P G LQP+P WE LS+DFIT LP+S G+ A+ VVVDR SK + + TA+
Sbjct: 970 PYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSITAEQT 1029
Query: 1657 AEVFVREVVRLHGIPNSVISDRDPIFVSHFWSELFKLQGTKLKMSSAYHPETDGQTEVIN 1716
A +F + V+ G P +I+D D IF S W + +K S Y P+TDGQTE N
Sbjct: 1030 ARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTN 1089
Query: 1717 RCLESYLRCFASDHPKSWSHWISWAEFWYNTTFHSSIGQTPFEVVYGRQAPPI----VKF 1772
+ +E LRC S HP +W IS + YN HS+ TPFE+V+ R +P + +
Sbjct: 1090 QTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVH-RYSPALSPLELPS 1148
Query: 1773 LSNETKVAAVALELSERDEALRQLRGHLQKAQEQMAIYANKKRRDL-SFAVGEWVFLKLR 1831
S++T + E + + ++ HL +M Y + K +++ F G+ V +K
Sbjct: 1149 FSDKTDENS-----QETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK-- 1201
Query: 1832 PHRQHSVVKRINQKLAARFYGPFQIEAKVGAVAYRLKLPAESK--IHPVFHISLLKK 1886
R + + KLA F GPF + K G Y L LP K FH+S L+K
Sbjct: 1202 --RTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEK 1256
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 473 bits (1216), Expect = e-132
Identities = 296/897 (32%), Positives = 474/897 (51%), Gaps = 46/897 (5%)
Query: 1023 ELMETLEEFQEVFR--SKIQLP-PERSKVHQIKLFPEQETINVRPYRYPHHQKEEIERQV 1079
EL + +EF+++ + +LP P + +++L E + +R Y P + + + ++
Sbjct: 373 ELPDIYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEI 432
Query: 1080 AELMEAGIIRPSMSAYSSPVILVKKKDKSWRMCVDYRALNKATIPDKYPIPIVDELLDEL 1139
+ +++GIIR S + + PV+ V KK+ + RM VDY+ LNK P+ YP+P++++LL ++
Sbjct: 433 NQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKI 492
Query: 1140 NGASIFSKIDLKSGYHQIRVHEDDIEKTAFRTHNGHYEYLVMPFGLMNAPATFQAVMNDI 1199
G++IF+K+DLKS YH IRV + D K AFR G +EYLVMP+G+ APA FQ +N I
Sbjct: 493 QGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTI 552
Query: 1200 FRPYLRKFVLVFFDDILIYSKDLPEHLTHLKLVLSVLLANCFVANQTKCKFGCASIDYLG 1259
V+ + D+ILI+SK EH+ H+K VL L + NQ KC+F + + ++G
Sbjct: 553 LGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIG 612
Query: 1260 HIISGAGMAVDPEKVKCIMDWPVPKNVKGVRGFLGLTGYYRKFIKDYGKMANPLTELTKK 1319
+ IS G E + ++ W PKN K +R FLG Y RKFI ++ +PL L KK
Sbjct: 613 YHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKK 672
Query: 1320 D-SFSWGPEADRAFLQLKRVMTSPPVLILPNFTLPFEVECDAAGRGIGAVLMQQRQ---- 1374
D + W P +A +K+ + SPPVL +F+ +E DA+ +GAVL Q+
Sbjct: 673 DVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKY 732
Query: 1375 -PLAFFSKALSAGNLAKSVYEKELMALVLSIQHWRHYLLG--KEFIVYTDHKSLKHFLQQ 1431
P+ ++S +S L SV +KE++A++ S++HWRHYL + F + TDH++L +
Sbjct: 733 YPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITN 792
Query: 1432 KVSSPDQQ---CWLAKLMGYQFQVKYKPGLENKAADALSRRYDEVE----------LHSL 1478
+ S P+ + W L + F++ Y+PG N ADALSR DE E ++ +
Sbjct: 793 E-SEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEPIPKDSEDNSINFV 851
Query: 1479 ISFPLWND-RKRLLEEITQDPYIQDLQSAVQKDPASKPGFAVQHGVLLYHGRLVLSPTSP 1537
+ +D + +++ E T D + +L + +D + ++ G+L+ +L P
Sbjct: 852 NQISITDDFKNQVVTEYTNDTKLLNLLN--NEDKRVEENIQLKDGLLINSKDQILLPNDT 909
Query: 1538 SIP-WLLEEFHGSPSGGHSGFLRTYRRLATTLYWVGMQKRVRDYVRACDVCQRHKYSALS 1596
+ +++++H H G + W G++K++++YV+ C CQ +K
Sbjct: 910 QLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHK 969
Query: 1597 PGGLLQPLPIPNAVWEDLSLDFITGLPKSKGFEAVLVVVDRLSKYSHFILLKHPYTAKSI 1656
P G LQP+P WE LS+DFIT LP+S G+ A+ VVVDR SK + + TA+
Sbjct: 970 PYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSITAEQT 1029
Query: 1657 AEVFVREVVRLHGIPNSVISDRDPIFVSHFWSELFKLQGTKLKMSSAYHPETDGQTEVIN 1716
A +F + V+ G P +I+D D IF S W + +K S Y P+TDGQTE N
Sbjct: 1030 ARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTN 1089
Query: 1717 RCLESYLRCFASDHPKSWSHWISWAEFWYNTTFHSSIGQTPFEVVYGRQAPPI----VKF 1772
+ +E LRC S HP +W IS + YN HS+ TPFE+V+ R +P + +
Sbjct: 1090 QTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVH-RYSPALSPLELPS 1148
Query: 1773 LSNETKVAAVALELSERDEALRQLRGHLQKAQEQMAIYANKKRRDL-SFAVGEWVFLKLR 1831
S++T + E + + ++ HL +M Y + K +++ F G+ V +K
Sbjct: 1149 FSDKTDENS-----QETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK-- 1201
Query: 1832 PHRQHSVVKRINQKLAARFYGPFQIEAKVGAVAYRLKLPAESK--IHPVFHISLLKK 1886
R + + KLA F GPF + K G Y L LP K FH+S L+K
Sbjct: 1202 --RTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEK 1256
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 473 bits (1216), Expect = e-132
Identities = 296/897 (32%), Positives = 474/897 (51%), Gaps = 46/897 (5%)
Query: 1023 ELMETLEEFQEVFR--SKIQLP-PERSKVHQIKLFPEQETINVRPYRYPHHQKEEIERQV 1079
EL + +EF+++ + +LP P + +++L E + +R Y P + + + ++
Sbjct: 373 ELPDIYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEI 432
Query: 1080 AELMEAGIIRPSMSAYSSPVILVKKKDKSWRMCVDYRALNKATIPDKYPIPIVDELLDEL 1139
+ +++GIIR S + + PV+ V KK+ + RM VDY+ LNK P+ YP+P++++LL ++
Sbjct: 433 NQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKI 492
Query: 1140 NGASIFSKIDLKSGYHQIRVHEDDIEKTAFRTHNGHYEYLVMPFGLMNAPATFQAVMNDI 1199
G++IF+K+DLKS YH IRV + D K AFR G +EYLVMP+G+ APA FQ +N I
Sbjct: 493 QGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTI 552
Query: 1200 FRPYLRKFVLVFFDDILIYSKDLPEHLTHLKLVLSVLLANCFVANQTKCKFGCASIDYLG 1259
V+ + D+ILI+SK EH+ H+K VL L + NQ KC+F + + ++G
Sbjct: 553 LGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIG 612
Query: 1260 HIISGAGMAVDPEKVKCIMDWPVPKNVKGVRGFLGLTGYYRKFIKDYGKMANPLTELTKK 1319
+ IS G E + ++ W PKN K +R FLG Y RKFI ++ +PL L KK
Sbjct: 613 YHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKK 672
Query: 1320 D-SFSWGPEADRAFLQLKRVMTSPPVLILPNFTLPFEVECDAAGRGIGAVLMQQRQ---- 1374
D + W P +A +K+ + SPPVL +F+ +E DA+ +GAVL Q+
Sbjct: 673 DVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKY 732
Query: 1375 -PLAFFSKALSAGNLAKSVYEKELMALVLSIQHWRHYLLG--KEFIVYTDHKSLKHFLQQ 1431
P+ ++S +S L SV +KE++A++ S++HWRHYL + F + TDH++L +
Sbjct: 733 YPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITN 792
Query: 1432 KVSSPDQQ---CWLAKLMGYQFQVKYKPGLENKAADALSRRYDEVE----------LHSL 1478
+ S P+ + W L + F++ Y+PG N ADALSR DE E ++ +
Sbjct: 793 E-SEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEPIPKDSEDNSINFV 851
Query: 1479 ISFPLWND-RKRLLEEITQDPYIQDLQSAVQKDPASKPGFAVQHGVLLYHGRLVLSPTSP 1537
+ +D + +++ E T D + +L + +D + ++ G+L+ +L P
Sbjct: 852 NQISITDDFKNQVVTEYTNDTKLLNLLN--NEDKRVEENIQLKDGLLINSKDQILLPNDT 909
Query: 1538 SIP-WLLEEFHGSPSGGHSGFLRTYRRLATTLYWVGMQKRVRDYVRACDVCQRHKYSALS 1596
+ +++++H H G + W G++K++++YV+ C CQ +K
Sbjct: 910 QLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHK 969
Query: 1597 PGGLLQPLPIPNAVWEDLSLDFITGLPKSKGFEAVLVVVDRLSKYSHFILLKHPYTAKSI 1656
P G LQP+P WE LS+DFIT LP+S G+ A+ VVVDR SK + + TA+
Sbjct: 970 PYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSITAEQT 1029
Query: 1657 AEVFVREVVRLHGIPNSVISDRDPIFVSHFWSELFKLQGTKLKMSSAYHPETDGQTEVIN 1716
A +F + V+ G P +I+D D IF S W + +K S Y P+TDGQTE N
Sbjct: 1030 ARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTN 1089
Query: 1717 RCLESYLRCFASDHPKSWSHWISWAEFWYNTTFHSSIGQTPFEVVYGRQAPPI----VKF 1772
+ +E LRC S HP +W IS + YN HS+ TPFE+V+ R +P + +
Sbjct: 1090 QTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVH-RYSPALSPLELPS 1148
Query: 1773 LSNETKVAAVALELSERDEALRQLRGHLQKAQEQMAIYANKKRRDL-SFAVGEWVFLKLR 1831
S++T + E + + ++ HL +M Y + K +++ F G+ V +K
Sbjct: 1149 FSDKTDENS-----QETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK-- 1201
Query: 1832 PHRQHSVVKRINQKLAARFYGPFQIEAKVGAVAYRLKLPAESK--IHPVFHISLLKK 1886
R + + KLA F GPF + K G Y L LP K FH+S L+K
Sbjct: 1202 --RTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEK 1256
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 340 bits (872), Expect = 3e-92
Identities = 174/411 (42%), Positives = 254/411 (61%), Gaps = 7/411 (1%)
Query: 1065 YRYPHHQKEEIERQVAELMEAGIIRPSMSAYSSPVILVKKKD-----KSWRMCVDYRALN 1119
Y YP ++E+E Q+ +++ GIIR S S Y+SP+ +V KK + +R+ +DYR LN
Sbjct: 213 YSYPQAYEQEVESQIQDMLNQGIIRTSNSPYNSPIWVVPKKQDASGKQKFRIVIDYRKLN 272
Query: 1120 KATIPDKYPIPIVDELLDELNGASIFSKIDLKSGYHQIRVHEDDIEKTAFRTHNGHYEYL 1179
+ T+ D++PIP +DE+L +L + F+ IDL G+HQI + + + KTAF T +GHYEYL
Sbjct: 273 EITVGDRHPIPNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYEYL 332
Query: 1180 VMPFGLMNAPATFQAVMNDIFRPYLRKFVLVFFDDILIYSKDLPEHLTHLKLVLSVLLAN 1239
MPFGL NAPATFQ MNDI RP L K LV+ DDI+++S L EHL L LV L
Sbjct: 333 RMPFGLKNAPATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKA 392
Query: 1240 CFVANQTKCKFGCASIDYLGHIISGAGMAVDPEKVKCIMDWPVPKNVKGVRGFLGLTGYY 1299
KC+F +LGH+++ G+ +PEK++ I +P+P K ++ FLGLTGYY
Sbjct: 393 NLKLQLDKCEFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTGYY 452
Query: 1300 RKFIKDYGKMANPLTELTKKDS--FSWGPEADRAFLQLKRVMTSPPVLILPNFTLPFEVE 1357
RKFI ++ +A P+T+ KK+ + PE D AF +LK +++ P+L +P+FT F +
Sbjct: 453 RKFIPNFADIAKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLT 512
Query: 1358 CDAAGRGIGAVLMQQRQPLAFFSKALSAGNLAKSVYEKELMALVLSIQHWRHYLLGKEFI 1417
DA+ +GAVL Q PL++ S+ L+ + S EKEL+A+V + + +RHYLLG+ F
Sbjct: 513 TDASDVALGAVLSQDGHPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGRHFE 572
Query: 1418 VYTDHKSLKHFLQQKVSSPDQQCWLAKLMGYQFQVKYKPGLENKAADALSR 1468
+ +DH+ L + K + W KL + F +KY G EN ADALSR
Sbjct: 573 ISSDHQPLSWLYRMKDPNSKLTRWRVKLSEFDFDIKYIKGKENCVADALSR 623
Score = 41.2 bits (95), Expect = 0.033
Identities = 49/254 (19%), Positives = 94/254 (36%), Gaps = 13/254 (5%)
Query: 1554 HSGFLRTYRRLATTLYWVGMQKRVRDYVRACDVCQRHKYSALSPGGLLQPLPIPNAVWED 1613
H G +T + T Y+ Q +++ + C +C K + + P P E
Sbjct: 763 HPGIQKTTKLFGETYYFPNSQLLIQNIINECSICNLAKTEHRNTDMPTKTTPKPEHCREK 822
Query: 1614 LSLDFITGLPKSKGFEAVLVVVDRLSKYSHFILLKHPYTAKSI-AEVFVREVVRLHGIPN 1672
+D + K V + YS F L+ T I + + + G P
Sbjct: 823 FMIDIYSSEGKH--------YVSCIDIYSKFATLEEIKTKDWIECKNALMRIFNQLGKPK 874
Query: 1673 SVISDRDPIFVSHFWSELFKLQGTKLKMSSAYHPETDGQTEVINRCLESYLRCF--ASDH 1730
+ +DRD F S + + +L++++ D E +++ + +R + D
Sbjct: 875 LLKADRDGAFSSLALKRWLESEEVELQLNTTKTGVAD--IERLHKTINEKIRIIKTSDDE 932
Query: 1731 PKSWSHWISWAEFWYNTTFHSSIGQTPFEVVYGRQAPPIVKFLSNETKVAAVALELSERD 1790
S + + + T H + GQTP + P + + E K+ + + E +
Sbjct: 933 ETKLSKMETVLNIYNHKTKHDTTGQTPAHIFLYAGQPILDTQQNKENKINKINNDRVEYE 992
Query: 1791 EALRQLRGHLQKAQ 1804
R +G LQK +
Sbjct: 993 VDTRYRKGPLQKGK 1006
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 340 bits (871), Expect = 3e-92
Identities = 186/421 (44%), Positives = 257/421 (60%), Gaps = 9/421 (2%)
Query: 1066 RYPHHQKEEIE--RQVAELMEAGIIRPSMSAYSSPVILVKKKDKS-----WRMCVDYRAL 1118
+YP Q EIE QV E++ G+IR S S Y+SP +V KK + +R+ +DYR L
Sbjct: 211 QYPLAQTHEIEVENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVVIDYRKL 270
Query: 1119 NKATIPDKYPIPIVDELLDELNGASIFSKIDLKSGYHQIRVHEDDIEKTAFRTHNGHYEY 1178
N+ TIPD+YPIP +DE+L +L F+ IDL G+HQI + E+ I KTAF T +GHYEY
Sbjct: 271 NEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTKSGHYEY 330
Query: 1179 LVMPFGLMNAPATFQAVMNDIFRPYLRKFVLVFFDDILIYSKDLPEHLTHLKLVLSVLLA 1238
L MPFGL NAPATFQ MN+I RP L K LV+ DDI+I+S L EHL ++LV + L
Sbjct: 331 LRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLVFTKLAD 390
Query: 1239 NCFVANQTKCKFGCASIDYLGHIISGAGMAVDPEKVKCIMDWPVPKNVKGVRGFLGLTGY 1298
KC+F ++LGHI++ G+ +P KVK I+ +P+P K +R FLGLTGY
Sbjct: 391 ANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAFLGLTGY 450
Query: 1299 YRKFIKDYGKMANPLTELTKKDS--FSWGPEADRAFLQLKRVMTSPPVLILPNFTLPFEV 1356
YRKFI +Y +A P+T KK + + E AF +LK ++ P+L LP+F F +
Sbjct: 451 YRKFIPNYADIAKPMTSCLKKRTKIDTQKLEYIEAFEKLKALIIRDPILQLPDFEKKFVL 510
Query: 1357 ECDAAGRGIGAVLMQQRQPLAFFSKALSAGNLAKSVYEKELMALVLSIQHWRHYLLGKEF 1416
DA+ +GAVL Q P++F S+ L+ L S EKEL+A+V + + +RHYLLG++F
Sbjct: 511 TTDASNLALGAVLSQNGHPISFISRTLNDHELNYSAIEKELLAIVWATKTFRHYLLGRQF 570
Query: 1417 IVYTDHKSLKHFLQQKVSSPDQQCWLAKLMGYQFQVKYKPGLENKAADALSRRYDEVELH 1476
++ +DH+ L+ K + W +L YQF++ Y G EN ADALSR E H
Sbjct: 571 LIASDHQPLRWLHNLKEPGAKLERWRVRLSEYQFKIDYIKGKENSVADALSRIKIEENHH 630
Query: 1477 S 1477
S
Sbjct: 631 S 631
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 327 bits (839), Expect = 2e-88
Identities = 187/471 (39%), Positives = 271/471 (56%), Gaps = 22/471 (4%)
Query: 1020 TPRELMETLEEFQEVFRSKIQLPPERSKVHQIKLFPEQETINVRPYRYPHHQKEEIERQV 1079
T L L EF +F + + V Q+ I + Y YP + + E+ERQ+
Sbjct: 84 TQEILNSLLGEFPRIFEPPLSGMSVETAVKAEIRTNTQDPIYAKSYPYPVNMRGEVERQI 143
Query: 1080 AELMEAGIIRPSMSAYSSPVILVKKK-----DKSWRMCVDYRALNKATIPDKYPIPIVDE 1134
EL++ GIIRPS S Y+SP+ +V KK +K +RM VD++ LN TIPD YPIP ++
Sbjct: 144 DELLQDGIIRPSNSPYNSPIWIVPKKPKPNGEKQYRMVVDFKRLNTVTIPDTYPIPDINA 203
Query: 1135 LLDELNGASIFSKIDLKSGYHQIRVHEDDIEKTAFRTHNGHYEYLVMPFGLMNAPATFQA 1194
L L A F+ +DL SG+HQI + E DI KTAF T NG YE+L +PFGL NAPA FQ
Sbjct: 204 TLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPFGLKNAPAIFQR 263
Query: 1195 VMNDIFRPYLRKFVLVFFDDILIYSKDLPEHLTHLKLVLSVLLANCFVANQTKCKFGCAS 1254
+++DI R ++ K V+ DDI+++S+D H +L+LVL+ L N K F
Sbjct: 264 MIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNLEKSHFLDTQ 323
Query: 1255 IDYLGHIISGAGMAVDPEKVKCIMDWPVPKNVKGVRGFLGLTGYYRKFIKDYGKMANPLT 1314
+++LG+I++ G+ DP+KV+ I + P P +VK ++ FLG+T YYRKFI+DY K+A PLT
Sbjct: 324 VEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYRKFIQDYAKVAKPLT 383
Query: 1315 ELTK------------KDSFSWGPEADRAFLQLKRVMTSPPVLILPNFTLPFEVECDAAG 1362
LT+ K + A ++F LK ++ S +L P FT PF + DA+
Sbjct: 384 NLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKPFHLTTDASN 443
Query: 1363 RGIGAVLMQQRQ----PLAFFSKALSAGNLAKSVYEKELMALVLSIQHWRHYLLGKEFI- 1417
IGAVL Q Q P+A+ S++L+ + EKE++A++ S+ + R YL G I
Sbjct: 444 WAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLDNLRAYLYGAGTIK 503
Query: 1418 VYTDHKSLKHFLQQKVSSPDQQCWLAKLMGYQFQVKYKPGLENKAADALSR 1468
VYTDH+ L L + + + W A++ Y ++ YKPG N ADALSR
Sbjct: 504 VYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELIYKPGKSNVVADALSR 554
Score = 53.5 bits (127), Expect = 7e-06
Identities = 74/303 (24%), Positives = 117/303 (38%), Gaps = 38/303 (12%)
Query: 1554 HSGFLRTYRRLATTLYWVGMQKRVRDYVRACDVCQRHKYSALSPGGLLQPLPIPNAVWED 1613
H G +L Y+ M +R +C C+ +KY LQP PIPN E
Sbjct: 705 HRGPTEIRLQLLEKYYFPRMSSTIRLQTSSCQCCKLYKYERHPNKPNLQPTPIPNYPCEI 764
Query: 1614 LSLDFITGLPKSKGFEAVLVVVDRLSKYSHFILLKHPYTAKSIAEVFVRE--VVRLH--G 1669
L +D I L K L +D+ SK++ L +S A V +RE V LH
Sbjct: 765 LHID-IFALEK----RLYLSCIDKFSKFAKLFHL------QSKASVHLRETLVEALHYFT 813
Query: 1670 IPNSVISDRDPIFVSHFWSELFKLQGTKLKMSSAYHPETDGQTEVINRCLESYLRCFASD 1729
P ++SD + + + L + E +GQ E + RC +
Sbjct: 814 APKVLVSDNERGLLCPTVLNYLRSLDIDLYYAPTQKSEVNGQVERFHSTFLEIYRCLKDE 873
Query: 1730 HPK-SWSHWISWAEFWYNTTFHSSIGQTPFEVVYGRQAPPIVKFLSNETKVAAVALELSE 1788
P + A YNT+ HS + P +V + R + + L++
Sbjct: 874 LPTFKPVELVHIAVDRYNTSVHSVTNRKPADVFFDRSSRVNYQGLTD------------F 921
Query: 1789 RDEALRQLRGHLQKAQEQMAIYANKKRRD-LSFAVGEWVFLKLRPHRQHSVVKRINQKLA 1847
R + L ++G ++ Q + + NK R + S+ G+ VF+ K+I K
Sbjct: 922 RRQTLEDIKGLIEYKQIRGNMARNKNRDEPKSYGPGDEVFV---------ANKQIKTKEK 972
Query: 1848 ARF 1850
ARF
Sbjct: 973 ARF 975
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 325 bits (833), Expect = 9e-88
Identities = 257/923 (27%), Positives = 420/923 (44%), Gaps = 132/923 (14%)
Query: 1022 RELMETLEEFQEVFRSKIQLPPERSKVHQIKLFPEQETINVRPYRYPHHQKEEIERQVAE 1081
+E +T+ ++ F + + P + V + E + R Y + + +V +
Sbjct: 144 KEFKDTIIRRKKAFSTTNEALPFNTAVTATIRTVDNEPVYSRAYPTLMGVSDFVNNEVKQ 203
Query: 1082 LMEAGIIRPSMSAYSSPVILVKKK------DKSWRMCVDYRALNKATIPDKYPIPIVDEL 1135
L++ GIIRPS S Y+SP +V KK + + R+ +D+R LN+ TIPD+YP+P + +
Sbjct: 204 LLKDGIIRPSRSPYNSPTWVVDKKGTDAFGNPNKRLVIDFRKLNEKTIPDRYPMPSIPMI 263
Query: 1136 LDELNGASIFSKIDLKSGYHQIRVHEDDIEKTAFRTHNGHYEYLVMPFGLMNAPATFQAV 1195
L L A F+ +DLKSGYHQI + E D EKT+F + G YE+ +PFGL NA + FQ
Sbjct: 264 LANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFSVNGGKYEFCRLPFGLRNASSIFQRA 323
Query: 1196 MNDIFRPYLRKFVLVFFDDILIYSKDLPEHLTHLKLVLSVLLANCFVANQTKCKFGCASI 1255
++D+ R + K V+ DD++I+S++ +H+ H+ VL L+ +Q K +F S+
Sbjct: 324 LDDVLREQIGKICYVYVDDVIIFSENESDHVRHIDTVLKCLIDANMRVSQEKTRFFKESV 383
Query: 1256 DYLGHIISGAGMAVDPEKVKCIMDWPVPKNVKGVRGFLGLTGYYRKFIKDYGKMANPLTE 1315
+YLG I+S G DPEKVK I ++P P V VR FLGL YYR FIKD+ +A P+T+
Sbjct: 384 EYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRSFLGLASYYRVFIKDFAAIARPITD 443
Query: 1316 LTKKDSFSWGPEADR------------AFLQLKRVMTSPPVLI-LPNFTLPFEVECDAAG 1362
+ K ++ S + AF +L+ ++ S V++ P+F PF++ DA+
Sbjct: 444 ILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNILASEDVILKYPDFKKPFDLTTDASA 503
Query: 1363 RGIGAVLMQQRQPLAFFSKALSAGNLAKSVYEKELMALVLSIQHWRHYLLG-KEFIVYTD 1421
GIGAVL Q+ +P+ S+ L + E+EL+A+V ++ +++L G +E ++TD
Sbjct: 504 SGIGAVLSQEGRPITMISRTLKQPEQNYATNERELLAIVWALGKLQNFLYGSREINIFTD 563
Query: 1422 HKSLKHFLQQKVSSPDQQCWLAKLMGYQFQVKYKPGLENKAADALSR----------RYD 1471
H+ L + + ++ + W + + + +V YKPG EN ADALSR + D
Sbjct: 564 HQPLTFAVADRNTNAKIKRWKSYIDQHNAKVFYKPGKENFVADALSRQNLNALQNEPQSD 623
Query: 1472 EVELHSLIS-------------------------FP------LWNDRKRLLEEITQDPYI 1500
+HS +S FP L+ + R L T ++
Sbjct: 624 AATIHSELSLTYTVETTDKPLNCFRNQIILEAARFPLKRNLVLFRSKSRHLISFTDKSWL 683
Query: 1501 ---------QDLQSAVQKDPASKPGFAVQHGVLLYH--------GRLVLSPTSPSIPWLL 1543
D+ +A+ D + F QH ++ + +VL T + +
Sbjct: 684 LKTLKEVVNPDVVNAIHCDLPTLASF--QHDLIAHFPATQFRHCKNVVLDITDKNEQ--I 739
Query: 1544 EEFHGSPSGGHSGFLRTYRRLATTLYWVGMQKRVRDYVRACDVCQRHKYSALSPGGLLQP 1603
E + H +++ Y+ M ++ V C VC + KY L
Sbjct: 740 EIVTAEHNRAHRAAQENIKQVLRDYYFPKMGSLAKEVVANCRVCTQAKYDRHPKKQELGE 799
Query: 1604 LPIPNAVWEDLSLDFITGLPKSKGFEAVLVVVDRLSKYSHFILLKHPYTAKSIAEVFVRE 1663
PIP+ E + +D S + L +D+ SKY+ + P +++I ++
Sbjct: 800 TPIPSYTGEMVHIDIF-----STDRKLFLTCIDKFSKYA----IVQPVVSRTIVDITAPL 850
Query: 1664 VVRLHGIPN--SVISDRDPIFVSHFWSELFKLQ-GTKLKMSSAYHPETDGQTEVINRCLE 1720
+ ++ PN +V D +P F S + + K G + + H ++GQ E + L
Sbjct: 851 LQIINLFPNIKTVYCDNEPAFNSETVTSMLKNSFGIDIVNAPPLHSSSNGQVERFHSTLA 910
Query: 1721 SYLRCFASDHPKSWS-HWISWAEFWYNTTFHSSIGQTPFEVVYGRQAPPIVKFLSNETKV 1779
RC D + + I A YN T HS + P EVV+
Sbjct: 911 EIARCLKLDKKTNDTVELILRATIEYNKTVHSVTRERPIEVVH----------------- 953
Query: 1780 AAVALELSERDEALRQLRGHLQKAQEQMAIYANKKRRDLSFAVGEWVFLKLRPHRQHSVV 1839
E +++ L KAQ+ N R++ F VGE VF+K
Sbjct: 954 -------PGAHERCLEIKARLVKAQQDSIGRNNPSRQNRVFEVGERVFVKNN-------- 998
Query: 1840 KRINQKLAARFYGPFQIEAKVGA 1862
KR+ KL P E KV A
Sbjct: 999 KRLGNKLT-----PLCTEQKVQA 1016
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 313 bits (802), Expect = 3e-84
Identities = 230/836 (27%), Positives = 397/836 (46%), Gaps = 67/836 (8%)
Query: 1022 RELMETLEEFQEVFR-SKIQLPPERSKVHQIKLFPEQETINVRPYRYPHHQKEEIERQVA 1080
R++ + +E+FQ+VF S +L I+L E I +P P K EI + +
Sbjct: 904 RKIWDVIEQFQDVFAISDDELGRNSGTECVIELKEGAEPIRQKPRPIPLALKPEIRKMIQ 963
Query: 1081 ELMEAGIIRPSMSAYSSPVILVKKKDKSWRMCVDYRALNKATIPDKYPIPIVDELLDELN 1140
+++ +IR S S +SSPV+LVKKKD S RMC+DYR +NK + +P+P ++ L L
Sbjct: 964 KMLNQKVIRESKSPWSSPVVLVKKKDGSIRMCIDYRKVNKVVKNNAHPLPNIEATLQSLA 1023
Query: 1141 GASIFSKIDLKSGYHQIRVHEDDIEKTAFRTHNGHYEYLVMPFGLMNAPATFQAVMNDIF 1200
G +++ D+ +G+ QI + E E TAF + +E+ V+PFGL+ +PA FQ M +I
Sbjct: 1024 GKKLYTVFDMIAGFWQIPLDEKSKEITAFAIGSELFEWNVLPFGLVISPALFQGTMEEII 1083
Query: 1201 RPYLRKFVLVFFDDILIYSKDLPEHLTHLKLVLSVLLANCFVANQTKCKFGCASIDYLGH 1260
L V+ DD+LI SKD+ +HL +K L+ + + +KC ++YLGH
Sbjct: 1084 GDLLGVCAFVYVDDLLIASKDMEQHLQDVKEALTRIRKSGMKLRASKCHIAKKEVEYLGH 1143
Query: 1261 IISGAGMAVDPEKVKCIMDWPVPKNVKGVRGFLGLTGYYRKFIKDYGKMANPLTEL-TKK 1319
++ G+ K + + P NVK ++ FLGL GYYRKFI ++ ++A+ LT L + K
Sbjct: 1144 KVTLDGVETQEVKTDKMKQFSRPTNVKELQSFLGLVGYYRKFILNFAQIASSLTSLISAK 1203
Query: 1320 DSFSWGPEADRAFLQLKRVMTSPPVLILPNFTL------PFEVECDAAGRGIGAVLMQ-- 1371
++ W E + AF +LK+++ PVL P+ PF + DA+ +GIGAVL Q
Sbjct: 1204 VAWIWEKEQEIAFQELKKLVCQTPVLAQPDVEAALKGDRPFMIYTDASRKGIGAVLAQEG 1263
Query: 1372 ---QRQPLAFFSKALSAGNLAKSVYEKELMALVLSIQHWRHYLLGKEFIVYTDHKSLKHF 1428
Q+ P+AF SKALS + + E +A++ +++ ++ + G V+TDHK L
Sbjct: 1264 PDGQQHPIAFASKALSPAETRYHITDLEALAMMFALRRFKTIIYGTAITVFTDHKPLISL 1323
Query: 1429 LQQKVSSPDQQCWLAKLMGYQFQVKYKPGLENKAADALSR-------------------- 1468
L+ + W +++ + ++ Y G N ADALSR
Sbjct: 1324 LKGSPLADRLWRWSIEILEFDVKIVYLAGKANAVADALSRGGCPPNELEEEQTKELTSIV 1383
Query: 1469 RYDEVELHSLISFPLWNDRKRLLEEITQDPY-------------IQDLQSAVQKDPASKP 1515
+ EL ++ W +R + +E ++ I ++S + +
Sbjct: 1384 NAIQTELPDILDSSCWLERLKGEDEGWKEVIAALEGGKTKGTFKIVGIESEISLEYYKIV 1443
Query: 1516 GFAVQHGVLLYHGRLVLSPTSPSIPWLLEEFHGSPSGGHSGFLRTYRRLATTLYWVGMQK 1575
G +++ + R V+ P P LL+E H GH G + +R + YW M+
Sbjct: 1444 GGVLKNTEIEEQSRSVV-PEKIRTP-LLKELHEGMLAGHFGIKKMWRMVHRKFYWPQMRV 1501
Query: 1576 RVRDYVRACDVC-----QRHKYSALSPGGLLQPLPIPNAVWEDLSLDFITGLPKSKGFEA 1630
V + VR C C S+L+P + PL I D+ L +G
Sbjct: 1502 CVENCVRTCAKCLCANDHSKLTSSLTPYRMTFPLEIVACDLMDVGLSV-------QGNRY 1554
Query: 1631 VLVVVDRLSKYSHFILLKHPYTAKSIAEVFV-REVVRLHGIPNSVISDRDPIFVSHFWSE 1689
+L ++D +KY + + A+++ + FV R + IP +++D+ FV+ +++
Sbjct: 1555 ILTIIDLFTKYGTAVPIPDK-KAETVLKAFVERWAIGEGRIPLKLLTDQGKEFVNGLFAQ 1613
Query: 1690 LFKLQGTKLKMSSAYHPETDGQTEVINRCLESYLRCFASDHPKSWSHWISWAEFWYNTTF 1749
+ + + Y+ +G E N+ + ++ + P W + +A + YN
Sbjct: 1614 FTHMLKIEHITTKGYNSRANGAVERFNKTIMHIMK-KKTAVPMEWDDQVVYAVYAYNNCV 1672
Query: 1750 HSSIGQTPFEVVYGRQAPPIVKFLSNETKVAAVALELSERDEALRQLRGHLQKAQE 1805
H + G+TP +++GR ++ + AV + ++ DE L L K Q+
Sbjct: 1673 HENTGETPMFLMHGRDVMGPLEMSGED----AVGINYADMDEYKHLLTQELLKVQK 1724
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 274 bits (700), Expect = 2e-72
Identities = 164/481 (34%), Positives = 252/481 (52%), Gaps = 28/481 (5%)
Query: 1015 NTEVETPRELMETLEEFQEVFRSKIQ-----------LPPERSKVH-----QIKLFPEQE 1058
N+E L + + F E+F+S+++ L E V+ Q++L + E
Sbjct: 255 NSEHRNKTVLSQLKKNFPELFKSQLENICSEYIDIFALESEPITVNNLYKQQLRL-KDDE 313
Query: 1059 TINVRPYRYPHHQKEEIERQVAELMEAGIIRPSMSAYSSPVILVKKKD------KSWRMC 1112
+ + YR PH Q EEI+ QV +L++ I+ PS+S Y+SP++LV KK K WR+
Sbjct: 314 PVYTKNYRSPHSQVEEIQAQVQKLIKDKIVEPSVSQYNSPLLLVPKKSSPNSDKKKWRLV 373
Query: 1113 VDYRALNKATIPDKYPIPIVDELLDELNGASIFSKIDLKSGYHQIRVHEDDIEKTAFRTH 1172
+DYR +NK + DK+P+P +D++LD+L A FS +DL SG+HQI + E + T+F T
Sbjct: 374 IDYRQINKKLLADKFPLPRIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTS 433
Query: 1173 NGHYEYLVMPFGLMNAPATFQAVMNDIFRPYLRKFVLVFFDDILIYSKDLPEHLTHLKLV 1232
NG Y + +PFGL AP +FQ +M F ++ DD+++ L +L V
Sbjct: 434 NGSYRFTRLPFGLKIAPNSFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEV 493
Query: 1233 LSVLLANCFVANQTKCKFGCASIDYLGHIISGAGMAVDPEKVKCIMDWPVPKNVKGVRGF 1292
+ KC F + +LGH + G+ D +K I ++PVP + R F
Sbjct: 494 FGKCREYNLKLHPEKCSFFMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRF 553
Query: 1293 LGLTGYYRKFIKDYGKMANPLTELTKKD-SFSWGPEADRAFLQLKRVMTSPPVLILPNFT 1351
+ YYR+FIK++ + +T L KK+ F W E +AF+ LK + +P +L P+F+
Sbjct: 554 VAFCNYYRRFIKNFADYSRHITRLCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFS 613
Query: 1352 LPFEVECDAAGRGIGAVLMQQRQ----PLAFFSKALSAGNLAKSVYEKELMALVLSIQHW 1407
F + DA+ + GAVL Q P+A+ S+A + G KS E+EL A+ +I H+
Sbjct: 614 KEFCITTDASKQACGAVLTQNHNGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHF 673
Query: 1408 RHYLLGKEFIVYTDHKSLKHFLQQKVSSPDQQCWLAKLMGYQFQVKYKPGLENKAADALS 1467
R Y+ GK F V TDH+ L + S +L Y F V+Y G +N ADALS
Sbjct: 674 RPYIYGKHFTVKTDHRPLTYLFSMVNPSSKLTRIRLELEEYNFTVEYLKGKDNHVADALS 733
Query: 1468 R 1468
R
Sbjct: 734 R 734
Score = 108 bits (270), Expect = 2e-22
Identities = 86/334 (25%), Positives = 154/334 (45%), Gaps = 38/334 (11%)
Query: 1542 LLEEFHGSP-SGGHSGFLRTYRRLATTLYWVGMQKRVRDYVRACDVCQRHKYSALSPGGL 1600
+L H P GGH+G +T ++ YW M K +++YVR C CQ+ K + +
Sbjct: 896 ILSTLHDDPIQGGHTGITKTLAKVKRHYYWKNMSKYIKEYVRKCQKCQKAKTTKHTK--- 952
Query: 1601 LQPLPI---PNAVWEDLSLDFITGLPKSK-GFEAVLVVVDRLSKYSHFILLKHPYTAKSI 1656
P+ I P ++ + +D I LPKS+ G E + ++ L+KY I + + +AK++
Sbjct: 953 -TPMTITETPEHAFDRVVVDTIGPLPKSENGNEYAVTLICDLTKYLVAIPIANK-SAKTV 1010
Query: 1657 AEVFVREVVRLHGIPNSVISDRDPIFVSHFWSELFKLQGTKLKMSSAYHPETDGQTEVIN 1716
A+ + +G + I+D + + ++L K K S+A+H +T G E +
Sbjct: 1011 AKAIFESFILKYGPMKTFITDMGTEYKNSIITDLCKYLKIKNITSTAHHHQTVGVVERSH 1070
Query: 1717 RCLESYLRCFASDHPKSWSHWISWAEFWYNTTFHSSIGQTPFEVVYGRQA---------- 1766
R L Y+R + S W W+ + + +NTT P+E+V+GR +
Sbjct: 1071 RTLNEYIRSYISTDKTDWDVWLQYFVYCFNTTQSMVHNYCPYELVFGRTSNLPKHFNKLH 1130
Query: 1767 --PPIVKFLSNETKVAAVALELSERDEALRQLRGHLQKAQEQMAIYANKKRRDLSFAVGE 1824
PI + + K + LE++ A + L H +K +E + K +D+ VG+
Sbjct: 1131 SIEPIYN-IDDYAKESKYRLEVAYA-RARKLLEAHKEKNKENYDL----KVKDIELEVGD 1184
Query: 1825 WVFLKLRPHRQHSVVKRINQKLAARFYGPFQIEA 1858
V L+ + KL ++ GP++IE+
Sbjct: 1185 KVLLR----------NEVGHKLDFKYTGPYKIES 1208
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 200 bits (508), Expect = 4e-50
Identities = 162/616 (26%), Positives = 280/616 (45%), Gaps = 54/616 (8%)
Query: 902 IRLGDGHRVVTQGVCKGIKARLGKIEVVIDALVLELGGLDMVLGVSWLSTLGKVVM--DW 959
+++ DG + VCK I + + I + + G+D ++G ++ + D
Sbjct: 72 VKIADGSSITISKVCKDIDLIIAREIFKIPTVYQQESGIDFIIGNNFCQLYEPFIQFTDR 131
Query: 960 KLLTMQFVHGNQVVKLQGLGGKGSNHSFLHSFLMDKQYRGGMEWWWSHLNSAEVTNTEVE 1019
+ T + + KL G+ FL S M K+ + ++ ++E
Sbjct: 132 VIFTKNKSYPVHIAKLTRAVRVGTE-GFLES--MKKRSKTQQP------EPVNISTNKIE 182
Query: 1020 TP---------------------RELMETLEEFQEVFRSKIQLPPERSKVHQ---IKLFP 1055
P ++ M+ +EE E S+ L P ++K IKL
Sbjct: 183 NPLKEIAILSEGRRLSEEKLFITQQRMQKIEELLEKVCSENPLDPNKTKQWMKASIKLSD 242
Query: 1056 EQETINVRPYRYPHHQKEEIERQVAELMEAGIIRPSMSAYSSPVILV----KKKDKSWRM 1111
+ I V+P +Y +EE ++Q+ EL++ +I+PS S + +P LV +K+ RM
Sbjct: 243 PSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRM 302
Query: 1112 CVDYRALNKATIPDKYPIPIVDELLDELNGASIFSKIDLKSGYHQIRVHEDDIEKTAFRT 1171
V+Y+A+NKATI D Y +P DELL + G IFS D KSG+ Q+ + ++ TAF
Sbjct: 303 VVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTC 362
Query: 1172 HNGHYEYLVMPFGLMNAPATFQAVMNDIFRPYLRKFVLVFFDDILIYSKDLPEHLTHLKL 1231
GHYE+ V+PFGL AP+ FQ M++ FR + RKF V+ DDIL++S + +HL H+ +
Sbjct: 363 PQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHLLHVAM 421
Query: 1232 VLSVLLANCFVANQTKCKFGCASIDYLGHIISGAGMAVDPEKVKCIMDWP-VPKNVKGVR 1290
+L + + ++ K + I++LG I ++ I +P ++ K ++
Sbjct: 422 ILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQ 481
Query: 1291 GFLGLTGYYRKFIKDYGKMANPL-TELTKKDSFSWGPEADRAFLQLKRVMTSPPVLILPN 1349
FLG+ Y +I ++ PL +L + + W E ++K+ + P L P
Sbjct: 482 RFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPL 541
Query: 1350 FTLPFEVECDAAGRGIGAVL--------MQQRQPLAFFSKALSAGNLAKSVYEKELMALV 1401
+E DA+ G +L + S + A +KE +A++
Sbjct: 542 PEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNYHSNDKETLAVI 601
Query: 1402 LSIQHWRHYLLGKEFIVYTDHKSLKHFLQQKVSSPDQQ----CWLAKLMGYQFQVKYKPG 1457
+I+ + YL F++ TD+ K F+ + W A L Y F V++ G
Sbjct: 602 NTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKG 661
Query: 1458 LENKAADALSRRYDEV 1473
+N AD LSR +++V
Sbjct: 662 TDNHFADFLSREFNKV 677
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 198 bits (504), Expect = 1e-49
Identities = 161/616 (26%), Positives = 276/616 (44%), Gaps = 54/616 (8%)
Query: 902 IRLGDGHRVVTQGVCKGIKARLGKIEVVIDALVLELGGLDMVLGVSWLSTLGKVVM--DW 959
+++ DG + VCK I + I + + G+D ++G ++ + D
Sbjct: 72 VKIADGSSITISKVCKDIDLIIAGEIFKIPTVYQQESGIDFIIGNNFCQLYEPFIQFTDR 131
Query: 960 KLLTMQFVHGNQVVKLQGLGGKGSNHSFLHSFLMDKQYRGGMEWWWSHLNSAEVTNTEVE 1019
+ T + + KL G + FL + R + ++ ++E
Sbjct: 132 VIFTKNKSYPVHITKLTRAVRVG-----IEGFLESMKKRSKTQ----QPEPVNISTNKIE 182
Query: 1020 TPRE---------------------LMETLEEFQEVFRSKIQLPPERSKVHQ---IKLFP 1055
P E M+ +EE E S+ L P ++K IKL
Sbjct: 183 NPLEEIAILSEGRRLSEEKLFITQQRMQKIEELLEKVCSENPLDPNKTKQWMKASIKLSD 242
Query: 1056 EQETINVRPYRYPHHQKEEIERQVAELMEAGIIRPSMSAYSSPVILV----KKKDKSWRM 1111
+ I V+P +Y +EE ++Q+ EL++ +I+PS S + +P LV +K+ RM
Sbjct: 243 PSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRM 302
Query: 1112 CVDYRALNKATIPDKYPIPIVDELLDELNGASIFSKIDLKSGYHQIRVHEDDIEKTAFRT 1171
V+Y+A+NKATI D Y +P DELL + G IFS D KSG+ Q+ + ++ TAF
Sbjct: 303 VVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTC 362
Query: 1172 HNGHYEYLVMPFGLMNAPATFQAVMNDIFRPYLRKFVLVFFDDILIYSKDLPEHLTHLKL 1231
GHYE+ V+PFGL AP+ FQ M++ FR + RKF V+ DDIL++S + +HL H+ +
Sbjct: 363 PQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHLLHVAM 421
Query: 1232 VLSVLLANCFVANQTKCKFGCASIDYLGHIISGAGMAVDPEKVKCIMDWP-VPKNVKGVR 1290
+L + + ++ K + I++LG I ++ I +P ++ K ++
Sbjct: 422 ILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQ 481
Query: 1291 GFLGLTGYYRKFIKDYGKMANPL-TELTKKDSFSWGPEADRAFLQLKRVMTSPPVLILPN 1349
FLG+ Y +I ++ PL +L + + W E ++K+ + P L P
Sbjct: 482 RFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPL 541
Query: 1350 FTLPFEVECDAAGRGIGAVL--------MQQRQPLAFFSKALSAGNLAKSVYEKELMALV 1401
+E DA+ G +L + S + A +KE +A++
Sbjct: 542 PEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNYHSNDKETLAVI 601
Query: 1402 LSIQHWRHYLLGKEFIVYTDHKSLKHFLQQKVSSPDQQ----CWLAKLMGYQFQVKYKPG 1457
+I+ + YL F++ TD+ K F+ + W A L Y F V++ G
Sbjct: 602 NTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKG 661
Query: 1458 LENKAADALSRRYDEV 1473
+N AD LSR +++V
Sbjct: 662 TDNHFADFLSREFNKV 677
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 680
Score = 198 bits (503), Expect = 2e-49
Identities = 157/608 (25%), Positives = 278/608 (44%), Gaps = 38/608 (6%)
Query: 902 IRLGDGHRVVTQGVCKGIKARLGKIEVVIDALVLELGGLDMVLGVSWLSTLGKVVM--DW 959
+++ DG + VCK I + + I + + G+D ++G ++ + D
Sbjct: 73 VKIADGSSITISKVCKDIDLIIVGVIFKIPTVYQQESGIDFIIGNNFCQLYEPFIQFTDR 132
Query: 960 KLLTMQFVHGNQVVKLQGLGGKGSNHSFLHSF-------------LMDKQYRGGMEWWWS 1006
+ T + + KL G+ FL S + + +E
Sbjct: 133 VIFTKNKSYPVHIAKLTRAVRVGT-EGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAI 191
Query: 1007 HLNSAEVTNTEVETPRELMETLEEFQEVFRSKIQLPPERSK---VHQIKLFPEQETINVR 1063
++ ++ ++ M+ EE E S+ L P ++K IKL + I V+
Sbjct: 192 LSEGRRLSEEKLFITQQRMQKTEELLEKVCSENPLDPNKTKQWMKASIKLSDPSKAIKVK 251
Query: 1064 PYRYPHHQKEEIERQVAELMEAGIIRPSMSAYSSPVILVKKKDKSW----RMCVDYRALN 1119
P +Y +EE ++Q+ EL++ +I+PS S + +P LV + ++ RM V+Y+A+N
Sbjct: 252 PMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAENGRGNKRMVVNYKAMN 311
Query: 1120 KATIPDKYPIPIVDELLDELNGASIFSKIDLKSGYHQIRVHEDDIEKTAFRTHNGHYEYL 1179
KAT+ D Y +P DELL + G IFS D KSG+ Q+ + ++ TAF GHYE+
Sbjct: 312 KATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWN 371
Query: 1180 VMPFGLMNAPATFQAVMNDIFRPYLRKFVLVFFDDILIYSKDLPEHLTHLKLVLSVLLAN 1239
V+PFGL AP+ FQ M++ FR + RKF V+ DDI+++S + +HL H+ ++L +
Sbjct: 372 VVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDIVVFSNNEEDHLLHVAMILQKCNQH 430
Query: 1240 CFVANQTKCKFGCASIDYLGHIISGAGMAVDPEKVKCIMDWP-VPKNVKGVRGFLGLTGY 1298
+ ++ K + I++LG I ++ I +P ++ K ++ FLG+ Y
Sbjct: 431 GIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTY 490
Query: 1299 YRKFIKDYGKMANPL-TELTKKDSFSWGPEADRAFLQLKRVMTSPPVLILPNFTLPFEVE 1357
+I + +M PL +L + + W E ++K+ + P L P +E
Sbjct: 491 ASDYIPNLAQMRQPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIE 550
Query: 1358 CDAAGRGIGAVL--------MQQRQPLAFFSKALSAGNLAKSVYEKELMALVLSIQHWRH 1409
DA+ G +L + S + A +KE +A++ +I+ +
Sbjct: 551 TDASDDYWGGMLKAIKINEGTNTELICRYRSGSFKAAERNYHSNDKETLAVINTIKKFSI 610
Query: 1410 YLLGKEFIVYTDHKSLKHFLQQKVSSPDQQ----CWLAKLMGYQFQVKYKPGLENKAADA 1465
YL F++ TD+ K F+ + W A L Y F V++ G +N AD
Sbjct: 611 YLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADF 670
Query: 1466 LSRRYDEV 1473
LSR +++V
Sbjct: 671 LSREFNKV 678
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 197 bits (501), Expect = 3e-49
Identities = 158/608 (25%), Positives = 277/608 (44%), Gaps = 38/608 (6%)
Query: 902 IRLGDGHRVVTQGVCKGIKARLGKIEVVIDALVLELGGLDMVLGVSWLSTLGKVVM--DW 959
+++ DG + VCK I + I + + G+D ++G ++ + D
Sbjct: 72 VKIADGSSITISKVCKDIDLIIAGEIFRIPTVYQQESGIDFIIGNNFCQLYEPFIQFTDR 131
Query: 960 KLLTMQFVHGNQVVKLQGLGGKGSNHSFLHSF-------------LMDKQYRGGMEWWWS 1006
+ T + + KL G+ FL S + + +E
Sbjct: 132 VIFTKNKSYPVHIAKLTRAVRVGTE-GFLESMKKRSKTQQPEPVNISTNKIENPLEEIAI 190
Query: 1007 HLNSAEVTNTEVETPRELMETLEEFQEVFRSKIQLPPERSKVHQ---IKLFPEQETINVR 1063
++ ++ ++ M+ +EE E S+ L P ++K IKL + I V+
Sbjct: 191 LSEGRRLSEEKLFITQQRMQKIEELLEKVCSENPLDPNKTKQWMKASIKLSDPSKAIKVK 250
Query: 1064 PYRYPHHQKEEIERQVAELMEAGIIRPSMSAYSSPVILV----KKKDKSWRMCVDYRALN 1119
P +Y +EE ++Q+ EL++ +I+PS S + +P LV +K+ RM V+Y+A+N
Sbjct: 251 PMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMN 310
Query: 1120 KATIPDKYPIPIVDELLDELNGASIFSKIDLKSGYHQIRVHEDDIEKTAFRTHNGHYEYL 1179
KAT+ D Y +P DELL + G IFS D KSG+ Q+ + ++ TAF GHYE+
Sbjct: 311 KATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWN 370
Query: 1180 VMPFGLMNAPATFQAVMNDIFRPYLRKFVLVFFDDILIYSKDLPEHLTHLKLVLSVLLAN 1239
V+PFGL AP+ FQ M++ FR + RKF V+ DDIL++S + +HL H+ ++L +
Sbjct: 371 VVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQH 429
Query: 1240 CFVANQTKCKFGCASIDYLGHIISGAGMAVDPEKVKCIMDWP-VPKNVKGVRGFLGLTGY 1298
+ ++ K + I++LG I ++ I +P ++ K ++ FLG+ Y
Sbjct: 430 GIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTY 489
Query: 1299 YRKFIKDYGKMANPL-TELTKKDSFSWGPEADRAFLQLKRVMTSPPVLILPNFTLPFEVE 1357
+I ++ PL +L + + W E ++K+ + P L P +E
Sbjct: 490 ASDYIPKLAQIRKPLQAKLKENVPWRWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIE 549
Query: 1358 CDAAGRGIGAVL--------MQQRQPLAFFSKALSAGNLAKSVYEKELMALVLSIQHWRH 1409
DA+ G +L + S + A +KE +A++ +I+ +
Sbjct: 550 TDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKNYHSNDKETLAVINTIKKFSI 609
Query: 1410 YLLGKEFIVYTDHKSLKHFLQQKVSSPDQQ----CWLAKLMGYQFQVKYKPGLENKAADA 1465
YL F++ TD+ K F+ + W A L Y F V++ G +N AD
Sbjct: 610 YLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADF 669
Query: 1466 LSRRYDEV 1473
LSR +++V
Sbjct: 670 LSREFNKV 677
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 674
Score = 195 bits (495), Expect = 1e-48
Identities = 139/473 (29%), Positives = 232/473 (48%), Gaps = 22/473 (4%)
Query: 1022 RELMETLEEFQEVFRSKIQLPPERSKVHQ---IKLFPEQETINVRPYRYPHHQKEEIERQ 1078
++ M+ +EE E S+ L P ++K IKL + I V+P +Y +EE ++Q
Sbjct: 201 QQRMQKIEELLEKVCSENPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQ 260
Query: 1079 VAELMEAGIIRPSMSAYSSPVILV----KKKDKSWRMCVDYRALNKATIPDKYPIPIVDE 1134
+ EL++ +I+PS S + +P LV +K+ RM V+Y+A+NKAT+ D Y P DE
Sbjct: 261 IKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNPPNKDE 320
Query: 1135 LLDELNGASIFSKIDLKSGYHQIRVHEDDIEKTAFRTHNGHYEYLVMPFGLMNAPATFQA 1194
LL + G IFS D KSG+ Q+ + ++ TAF GHYE+ V+PFGL AP+ FQ
Sbjct: 321 LLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQR 380
Query: 1195 VMNDIFRPYLRKFVLVFFDDILIYSKDLPEHLTHLKLVLSVLLANCFVANQTKCKFGCAS 1254
M++ FR + RKF V+ DDIL++S + +HL H+ ++L + + ++ K +
Sbjct: 381 HMDEAFRVF-RKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKK 439
Query: 1255 IDYLGHIISGAGMAVDPEKVKCIMDWP-VPKNVKGVRGFLGLTGYYRKFIKDYGKMANPL 1313
I++LG I ++ I +P ++ K ++ FLG+ Y +I ++ PL
Sbjct: 440 INFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPL 499
Query: 1314 -TELTKKDSFSWGPEADRAFLQLKRVMTSPPVLILPNFTLPFEVECDAAGRGIGAVL--- 1369
+L + + W E ++K+ + P L P +E DA+ G +L
Sbjct: 500 QAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAI 559
Query: 1370 -----MQQRQPLAFFSKALSAGNLAKSVYEKELMALVLSIQHWRHYLLGKEFIVYTDHKS 1424
+ S + A +KE +A++ +I+ + YL F++ TD+
Sbjct: 560 KINEGTNTELICRYASGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTH 619
Query: 1425 LKHFLQQKVSSPDQQ----CWLAKLMGYQFQVKYKPGLENKAADALSRRYDEV 1473
K F+ + W A L Y F V++ G +N AD LSR ++ V
Sbjct: 620 FKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSREFNRV 672
>POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;
Protease (EC 3.4.23.-); Reverse transcriptase (EC
2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1886
Score = 192 bits (487), Expect = 1e-47
Identities = 137/475 (28%), Positives = 237/475 (49%), Gaps = 29/475 (6%)
Query: 1022 RELMETLEEFQEVFRSKIQLPPERSKVHQIKLFPEQETINVRPYRYPHHQKEE-IERQVA 1080
++L++ ++E + + + ++ ++ + I RP ++ EE + RQ+
Sbjct: 1364 KDLLKEMKEMKYIGENPMEFWKNNKIKCKLNIINPDIKIMGRPIKHVTPGDEEAMTRQIN 1423
Query: 1081 ELMEAGIIRPSMSAYSSPVILV-----------KKKDKSWRMCVDYRALNKATIPDKYPI 1129
L++ +IRPS S + S +V K+K RM +Y+ LN+ T D+Y +
Sbjct: 1424 LLLQMKVIRPSESKHRSTAFIVRSGTEIDPITGKEKKGKERMVFNYKLLNENTESDQYSL 1483
Query: 1130 PIVDELLDELNGASIFSKIDLKSGYHQIRVHEDDIEKTAFRTHNGHYEYLVMPFGLMNAP 1189
P ++ ++ ++ + I+SK DLKSG+ Q+ + E+ + TAF N YE+LVMPFGL NAP
Sbjct: 1484 PGINTIISKVGRSKIYSKFDLKSGFWQVAMEEESVPWTAFLAGNKLYEWLVMPFGLKNAP 1543
Query: 1190 ATFQAVMNDIFRPYLRKFVLVFFDDILIYSKDLPEHLTHLKLVLSVLLANCFVANQTKCK 1249
A FQ M+++F+ KF+ V+ DDIL++S+ +H HL +L + N + + TK K
Sbjct: 1544 AIFQRKMDNVFKG-TEKFIAVYIDDILVFSETAEQHSQHLYTMLQLCKENGLILSPTKMK 1602
Query: 1250 FGCASIDYLGHIISGAGMAVDPEKVKCIMDWPVPK--NVKGVRGFLGLTGYYRKFIKDYG 1307
G ID+LG + + + P + I D+ K +G+R +LG+ Y R +I+D G
Sbjct: 1603 IGTPEIDFLGASLGCTKIKLQPHIISKICDFSDEKLATPEGMRSWLGILSYARNYIQDIG 1662
Query: 1308 KMANPLTE-LTKKDSFSWGPEADRAFLQLKRVMTSPPVLILPNFTLPFEVECDAAGRGIG 1366
K+ PL + + PE + Q+K + + P L LP +E D G G
Sbjct: 1663 KLVQPLRQKMAPTGDKRMNPETWKMVRQIKEKVKNLPDLQLPPKDSFIIIETDGCMTGWG 1722
Query: 1367 AVL---MQQRQPLA---FFSKALSAGNLAKSVYEKELMALVLSIQHWR-HYLLGKEFIVY 1419
AV M + P + + A + N KS + E+ A + + ++ +YL KE I+
Sbjct: 1723 AVCKWKMSKHDPRSTERICAYASGSFNPIKSTIDAEIQAAIHGLDKFKIYYLDKKELIIR 1782
Query: 1420 TDHKS-LKHFLQQKVSSPDQQCWLA-----KLMGYQFQVKYKPGLENKAADALSR 1468
+D ++ +K + + + P + WL +G ++ G N ADALSR
Sbjct: 1783 SDCEAIIKFYNKTNENKPSRVRWLTFSDFLTGLGITVTFEHIDGKHNGLADALSR 1837
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 666
Score = 191 bits (484), Expect = 3e-47
Identities = 154/594 (25%), Positives = 271/594 (44%), Gaps = 33/594 (5%)
Query: 902 IRLGDGHRVVTQGVCKGIKARLGKIEVVIDALVLELGGLDMVLGVSWLSTLGKVVMDWKL 961
+++ + + VCK +K + I + + G+D ++G ++ +
Sbjct: 81 VKIANQELIKITKVCKNLKVKFAGKSFEIPTVYQQETGIDFLIGNNFCRLYNPFIQWEDR 140
Query: 962 LTMQFVHGNQVVKLQGLGGKGSNHSFLHSFLMDKQYRGGMEWWWSHLNSAEVTNTEVETP 1021
+ + ++K SN SFL + D + + ++ +
Sbjct: 141 IAFHLKNEMVLIKKVTKAFSVSNPSFLENMKKDSKTE--------QIPGTNISKNIINPE 192
Query: 1022 RELMETLEEFQEVFR------SKIQLPPERSKVHQ---IKLFPEQETINVRPYRYPHHQK 1072
E++Q++ + S+ + P +SK IKL + I V+P Y +
Sbjct: 193 ERYFLITEKYQKIEQLLDKVCSENPIDPIKSKQWMKASIKLIDPLKVIRVKPMSYSPQDR 252
Query: 1073 EEIERQVAELMEAGIIRPSMSAYSSPVILVKKKDK----SWRMCVDYRALNKATIPDKYP 1128
E +Q+ EL++ G+I PS S + SP LV+ + + RM V+Y+A+N+ATI D +
Sbjct: 253 EGFAKQIKELLDLGLIIPSKSQHMSPAFLVENEAERRRGKKRMVVNYKAINQATIGDSHN 312
Query: 1129 IPIVDELLDELNGASIFSKIDLKSGYHQIRVHEDDIEKTAFRTHNGHYEYLVMPFGLMNA 1188
+P + ELL L G SIFS D KSG+ Q+ + E+ + TAF GH+++ V+PFGL A
Sbjct: 313 LPNMQELLTLLRGKSIFSSFDCKSGFWQVVLDEESQKLTAFTCPQGHFQWKVVPFGLKQA 372
Query: 1189 PATFQAVMNDIFRPYLRKFVLVFFDDILIYSKDLPEHLTHLKLVLSVLLANCFVANQTKC 1248
P+ FQ M KF +V+ DDI+++S +H H+ VL ++ + ++ K
Sbjct: 373 PSIFQRHMQTALNG-ADKFCMVYVDDIIVFSNSELDHYNHVYAVLKIVEKYGIILSKKKA 431
Query: 1249 KFGCASIDYLGHIISGAGMAVDPEKVKCIMDWPVP-KNVKGVRGFLGLTGYYRKFIKDYG 1307
I++LG I ++ I +P ++ K ++ FLG+ Y +I
Sbjct: 432 NLFKEKINFLGLEIDKGTHCPQNHILENIHKFPDRLEDKKHLQRFLGVLTYAETYIPKLA 491
Query: 1308 KMANPLTELTKKD-SFSWGPEADRAFLQLKRVMTSPPVLILPNFTLPFEVECDAAGRGIG 1366
++ PL KKD +++W ++K+ + S P L LP +E DA+ G
Sbjct: 492 EIRKPLQVKLKKDVTWNWTQSDSDYVKKIKKNLGSFPKLYLPKPEDHLIIETDASDSFWG 551
Query: 1367 AVLMQQRQPLAFFSKALSAGNL--AKSVY---EKELMALVLSIQHWRHYLLGKEFIVYTD 1421
VL + S+G+ A+ Y +KEL+A+ I + YL F V TD
Sbjct: 552 GVLKARALDGVELICRYSSGSFKQAEKNYHSNDKELLAVKQVITKFSAYLTPVRFTVRTD 611
Query: 1422 HKSLKHFLQQKVSSPDQQ----CWLAKLMGYQFQVKYKPGLENKAADALSRRYD 1471
+K+ +FL+ + +Q W YQF V++ G++N AD L+R ++
Sbjct: 612 NKNFTYFLRINLKGDSKQGRLVRWQNWFSKYQFDVEHLEGVKNVLADCLTRDFN 665
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 659
Score = 189 bits (480), Expect = 8e-47
Identities = 170/635 (26%), Positives = 283/635 (43%), Gaps = 28/635 (4%)
Query: 858 MKIEGQVDNVNLLVLIDSGASHNFISPAVTNALGLVITPIASRHIRLGDGHRVVTQGVCK 917
+K G N++L +D+G+S S V T +I++ +G + VC
Sbjct: 18 LKFPGYQTNLDLHCYVDTGSSLCMASKYVIPE-EYWQTAEKPLNIKIANGKIIQLTKVCS 76
Query: 918 GIKARLGKIEVVIDALVLELGGLDMVLGVSWLSTLGKVVMD----WKLLTMQFVHGNQVV 973
+ RLG +I L + G+D++LG ++ + + L Q V ++
Sbjct: 77 KLPIRLGGERFLIPTLFQQESGIDLLLGNNFCQLYSPFIQYTDRIYFHLNKQSVIIGKIT 136
Query: 974 KLQGLGGKGSNHSFLHSFLMDKQYRGGMEWWWSHLNSAEVTNTEVETPRELM----ETLE 1029
K G KG S +++ + HL E N E E+ +E
Sbjct: 137 KAYQYGVKGFLESMKKKSKVNRPEPINITSN-QHLFLEEGGNHVDEMLYEIQISKFSAIE 195
Query: 1030 EFQEVFRSKIQLPPERSK---VHQIKLFPEQETINVRPYRYPHHQKEEIERQVAELMEAG 1086
E E S+ + PE+SK I+L + + V+P Y +EE +RQ+ EL+E
Sbjct: 196 EMLERVSSENPIDPEKSKQWMTATIELIDPKTVVKVKPMSYSPSDREEFDRQIKELLELK 255
Query: 1087 IIRPSMSAYSSPVILVKKKDK----SWRMCVDYRALNKATIPDKYPIPIVDELLDELNGA 1142
+I+PS S + SP LV+ + + RM V+Y+A+NKAT D + +P DELL + G
Sbjct: 256 VIKPSKSTHMSPAFLVENEAERRRGKKRMVVNYKAMNKATKGDAHNLPNKDELLTLVRGK 315
Query: 1143 SIFSKIDLKSGYHQIRVHEDDIEKTAFRTHNGHYEYLVMPFGLMNAPATFQAVMNDIFRP 1202
I+S D KSG Q+ + ++ TAF GHY++ V+PFGL AP+ F +
Sbjct: 316 KIYSSFDCKSGLWQVLLDKESQLLTAFTCPQGHYQWNVVPFGLKQAPSIFPKTYANSHSN 375
Query: 1203 YLRKFVLVFFDDILIYSK-DLPEHLTHLKLVLSVLLANCFVANQTKCKFGCASIDYLGHI 1261
K+ V+ DDIL++S EH H+ +L + ++ K + I++LG
Sbjct: 376 QYSKYCCVYVDDILVFSNTGRKEHYIHVLNILRRCEKLGIILSKKKAQLFKEKINFLGLE 435
Query: 1262 ISGAGMAVDPEKVKCIMDWPVP-KNVKGVRGFLGLTGYYRKFIKDYGKMANPLTELTKKD 1320
I ++ I +P ++ K ++ FLG+ Y +I + PL K+D
Sbjct: 436 IDQGTHCPQNHILEHIHKFPDRIEDKKQLQRFLGILTYASDYIPKLASIRKPLQSKLKED 495
Query: 1321 S-FSWGPEADRAFLQLKRVMTSPPVLILPNFTLPFEVECDAAGRGIGAVLMQQRQPLAFF 1379
S ++W + ++K+ + S P L P +E DA+ G +L +
Sbjct: 496 STWTWNDTDSQYMAKIKKNLKSFPKLYHPEPNDKLVIETDASEEFWGGILKAIHNSHEYI 555
Query: 1380 SKALSAG-NLAKSVY---EKELMALVLSIQHWRHYLLGKEFIVYTDHKSLKHFLQQKVSS 1435
+ S A+ Y EKEL+A++ I+ + YL F++ TD+K+ HF+ +
Sbjct: 556 CRYASGSFKAAERNYHSNEKELLAVIRVIKKFSIYLTPSRFLIRTDNKNFTHFVNINLKG 615
Query: 1436 PDQQ----CWLAKLMGYQFQVKYKPGLENKAADAL 1466
+Q W L Y F V++ G +N AD L
Sbjct: 616 DRKQGRLVRWQMWLSQYDFDVEHIAGTKNVFADFL 650
>POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat
protein; Protease (EC 3.4.23.-); Reverse transcriptase
(EC 2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1675
Score = 142 bits (357), Expect = 1e-32
Identities = 115/412 (27%), Positives = 198/412 (47%), Gaps = 22/412 (5%)
Query: 1067 YPHHQKEEIERQVAELMEAGIIRPS--MSAYSSPVILVKKKDKSW----RMCVDYRALNK 1120
Y KE E+Q+ EL++ +I+ + + + +V+ + R+ +Y+ LN
Sbjct: 1191 YTPADKEVFEKQIKELLDNKLIKKADPTCRHRTAAFIVRNHSEEVAQKPRIVYNYKRLND 1250
Query: 1121 ATIPDKYPIPIVDELLDELNGASIFSKIDLKSGYHQIRVHEDDIEKTAFRTHNGHYEYLV 1180
D + IP +++ + A+IFSK DLK+G+H +++ +D + T F G Y + V
Sbjct: 1251 NMHTDPFNIPHKISMINLIQKANIFSKFDLKAGFHHMKLKDDFKDWTTFTCSEGLYTWNV 1310
Query: 1181 MPFGLMNAPATFQAVMNDIFRPYLRKFVLVFFDDILIYSKDLPEHLTHLKLVLSVLLANC 1240
PFG+ NAP FQ M + F KF L++ DDILI S + EH+ HLK+ + +
Sbjct: 1311 CPFGIANAPCAFQRFMQESFGDL--KFALLYIDDILIASNNEKEHIEHLKIFFNRVKEVG 1368
Query: 1241 FVANQTKCKFGCASIDYLGHIISGAGMAVDPEKVKCIMDWPVPK--NVKGVRGFLGLTGY 1298
V ++ K K ++YLG I +++ P V I + K +KG++ +LGL Y
Sbjct: 1369 CVLSKKKSKMFLKEVEYLGVEIKEGKISLQPHIVDKIKKFDKNKLNTLKGLQAYLGLLNY 1428
Query: 1299 YRKFIKDYGKMANPLTELTKKDSFS-WGPEADRAFLQLKRVMTSPPVLILPNFTLPFEVE 1357
R +IKD K+ PL + T K+ + E +++R ++ L P T +E
Sbjct: 1429 ARGYIKDLSKLVGPLYKKTGKNGQRIFNKEDWNIIFKIEREVSKIKPLERPKETDYIIIE 1488
Query: 1358 CDAAGRGIGAVLMQQRQPLAFFSKALSAGNLAKSVYEK--------ELMALVLSIQHWRH 1409
DA+ G GAVL+ + + AG + + EK E+ A+ ++ ++
Sbjct: 1489 TDASEEGWGAVLVCKPDKYSGKDTEKIAGYASGNFGEKKTWTSLDYEIEAINEALNKFQI 1548
Query: 1410 YLLGKEFIVYTDHKSL-KHFLQQKVSSPDQQCWLAKLMGYQFQVKYKPGLEN 1460
Y L K+F + TD +++ K + + W+ KL + YKP E+
Sbjct: 1549 Y-LDKDFTIRTDCEAIVKGIKTEDYKKRSKTRWI-KLRDNLLKDGYKPTFEH 1598
>M860_ARATH (P92523) Hypothetical mitochondrial protein AtMg00860
(ORF158)
Length = 158
Score = 137 bits (346), Expect = 3e-31
Identities = 68/131 (51%), Positives = 88/131 (66%), Gaps = 2/131 (1%)
Query: 1226 LTHLKLVLSVLLANCFVANQTKCKFGCASIDYLGH--IISGAGMAVDPEKVKCIMDWPVP 1283
+ HL +VL + + F AN+ KC FG I YLGH IISG G++ DP K++ ++ WP P
Sbjct: 1 MNHLGMVLQIWEQHQFYANRKKCAFGQPQIAYLGHRHIISGEGVSADPAKLEAMVGWPEP 60
Query: 1284 KNVKGVRGFLGLTGYYRKFIKDYGKMANPLTELTKKDSFSWGPEADRAFLQLKRVMTSPP 1343
KN +RGFLGLTGYYR+F+K+YGK+ PLTEL KK+S W A AF LK +T+ P
Sbjct: 61 KNTTELRGFLGLTGYYRRFVKNYGKIVRPLTELLKKNSLKWTEMAALAFKALKGAVTTLP 120
Query: 1344 VLILPNFTLPF 1354
VL LP+ LPF
Sbjct: 121 VLALPDLKLPF 131
>RRPO_OENBE (P31843) RNA-directed DNA polymerase homolog (Reverse
transcriptase homolog)
Length = 142
Score = 132 bits (332), Expect = 1e-29
Identities = 67/128 (52%), Positives = 89/128 (69%), Gaps = 3/128 (2%)
Query: 1108 SWRMCVDYRALNKATIPDKYPIPIVDELLDELNGASIFSKIDLKSGYHQIRVHEDDIEKT 1167
S RMC+DYRAL K TI +KYPIP VD+L D L A+ F+K+DL+SGY Q+R+ + D KT
Sbjct: 5 SLRMCIDYRALTKVTIKNKYPIPRVDDLFDRLAQATWFTKLDLRSGYWQVRIAKGDEPKT 64
Query: 1168 AFRTHNGHYEYLVMPFGLMNAPATFQAVMNDIFRPYLRKFVLVFFDDIL---IYSKDLPE 1224
T G +E+ VMPFGL NA ATF +MN++ YL FV+V+ DD++ IYS L E
Sbjct: 65 TCVTRYGSFEFRVMPFGLTNALATFCNLMNNVLYEYLDHFVVVYLDDLVVYTIYSNSLHE 124
Query: 1225 HLTHLKLV 1232
H+ HL++V
Sbjct: 125 HIKHLRVV 132
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.336 0.147 0.475
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 251,003,440
Number of Sequences: 164201
Number of extensions: 10534886
Number of successful extensions: 42839
Number of sequences better than 10.0: 145
Number of HSP's better than 10.0 without gapping: 85
Number of HSP's successfully gapped in prelim test: 60
Number of HSP's that attempted gapping in prelim test: 42303
Number of HSP's gapped (non-prelim): 305
length of query: 2281
length of database: 59,974,054
effective HSP length: 126
effective length of query: 2155
effective length of database: 39,284,728
effective search space: 84658588840
effective search space used: 84658588840
T: 11
A: 40
X1: 15 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.7 bits)
S2: 74 (33.1 bits)
Lotus: description of TM0074.19