
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC146683.16 + phase: 0
(1672 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 329 4e-89
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 302 6e-81
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 299 4e-80
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 299 4e-80
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 295 5e-79
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 289 4e-77
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 285 7e-76
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 264 1e-69
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 236 5e-61
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot... 175 8e-43
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot... 164 1e-39
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 156 5e-37
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 155 7e-37
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 155 7e-37
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 153 4e-36
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 152 1e-35
POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC 3.4.23... 135 7e-31
POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat pr... 124 3e-27
POL_FENV1 (P31792) Pol polyprotein [Contains: Reverse transcript... 124 3e-27
POL_MLVFP (P26808) Pol polyprotein [Contains: Protease (EC 3.4.2... 123 5e-27
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 329 bits (843), Expect = 4e-89
Identities = 281/1007 (27%), Positives = 466/1007 (45%), Gaps = 110/1007 (10%)
Query: 670 EGEAKAYEELLDRTLPMKGLSVEELVKEEPTLLPKEAPKVELKTLPSNLRYEFLGPNSTY 729
+GE +E L ++ + ++VEE++ + PTL E++T ++ E + TY
Sbjct: 838 KGETGGFEVLSNKA--EQDITVEEVLND-PTLFS------EIETDTNSC--EVVKTAETY 886
Query: 730 P----VIVNASLDEVETEKLLYVLKKYPKAIGYTIDDIKGINPSLCMHRILLEEDYKPSI 785
+ + + + K+ V++++ + D++ + + C+ I L+E +P
Sbjct: 887 ERFTTICEHLKRENGDDRKIWDVIEQFQDVFAISDDELGRNSGTECV--IELKEGAEPIR 944
Query: 786 EHQRRLNPNMKEVVKKEVLKLLDAGVIYPISDSKWVSPVQVVPKKGGLTVIKNDKNESIA 845
+ R + +K ++K + K+L+ VI S S W SPV +V KK G
Sbjct: 945 QKPRPIPLALKPEIRKMIQKMLNQKVIRE-SKSPWSSPVVLVKKKDGSI----------- 992
Query: 846 TRTVTGWRMCIDYRKLNKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPN 905
RMCIDYRK+NK + + PLP I+ L+ LA + D +GF+QIP+
Sbjct: 993 -------RMCIDYRKVNKVVKNNAHPLPNIEATLQSLAGKKLYTVFDMIAGFWQIPLDEK 1045
Query: 906 DQEKTTFTCPFGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNF 965
+E T F F + +PFGL +PA FQ M I D + V++DD + +
Sbjct: 1046 SKEITAFAIGSELFEWNVLPFGLVISPALFQGTMEEIIGDLLGVCAFVYVDDLLIASKDM 1105
Query: 966 DDCLTNLEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKIEIIKKMLP 1025
+ L ++++ L R + + L KCH +E LGH V G+E K + +K+
Sbjct: 1106 EQHLQDVKEALTRIRKSGMKLRASKCHIAKKEVEYLGHKVTLDGVETQEVKTDKMKQFSR 1165
Query: 1026 PTSVKEIRSFLGHAGFYRRFIKDFSSITKPLTSLLLKDADFTFDDSCLQAFCRLKEALIT 1085
PT+VKE++SFLG G+YR+FI +F+ I LTSL+ + ++ AF LK+ +
Sbjct: 1166 PTNVKELQSFLGLVGYYRKFILNFAQIASSLTSLISAKVAWIWEKEQEIAFQELKKLVCQ 1225
Query: 1086 APIIQPPD------WNLPFEIMCDASDYAVGAVLGQRN-DKKMHAIYYASKTLDGAQVNY 1138
P++ PD + PF I DAS +GAVL Q D + H I +ASK L A+ Y
Sbjct: 1226 TPVLAQPDVEAALKGDRPFMIYTDASRKGIGAVLAQEGPDGQQHPIAFASKALSPAETRY 1285
Query: 1139 ATTEKELLAVVYAIDKFRQYLVGSKIIVYTDHSAIKYLLNKKDAKPRLIRWILLLQEFDL 1198
T+ E LA+++A+ +F+ + G+ I V+TDH + LL RL RW + + EFD+
Sbjct: 1286 HITDLEALAMMFALRRFKTIIYGTAITVFTDHKPLISLLKGSPLADRLWRWSIEILEFDV 1345
Query: 1199 EIKDKKGVENVVADHLSR-------LRETNKDEL-----PLDDSFPD-----DQLFLLAQ 1241
+I G N VAD LSR L E EL + PD L L
Sbjct: 1346 KIVYLAGKANAVADALSRGGCPPNELEEEQTKELTSIVNAIQTELPDILDSSCWLERLKG 1405
Query: 1242 TDAPWYADFVNFLAAGV---------LPPELNYQQKKKFFNDLKHYYWDEPYLFRRGSDG 1292
D W + + L G + E++ + K LK+ +E
Sbjct: 1406 EDEGW-KEVIAALEGGKTKGTFKIVGIESEISLEYYKIVGGVLKNTEIEEQ--------- 1455
Query: 1293 IFRRCIPENEVSSILTHCHSSSYGGHASTQKTSFKILHSGFWWPSLFKDVHLFISKCDKC 1352
R +PE + +L H GH +K ++++H F+WP + V + C KC
Sbjct: 1456 -SRSVVPEKIRTPLLKELHEGMLAGHFGIKK-MWRMVHRKFYWPQMRVCVENCVRTCAKC 1513
Query: 1353 -------QRTGSITK-RNEMPLNNILEVEIFDVWGIDFMGPFPSSFGNQYILVAVDYVSK 1404
+ T S+T R PL ++ D M S GN+YIL +D +K
Sbjct: 1514 LCANDHSKLTSSLTPYRMTFPL---------EIVACDLMDVGLSVQGNRYILTIIDLFTK 1564
Query: 1405 WVEAIASPTNDAQVVIKMF-KKVIFPRFGVPRVVISDGGSHFISRHFEKLLQKLGVRHKI 1463
+ A+ P A+ V+K F ++ +P +++D G F++ F + L + H
Sbjct: 1565 YGTAVPIPDKKAETVLKAFVERWAIGEGRIPLKLLTDQGKEFVNGLFAQFTHMLKIEHIT 1624
Query: 1464 ATPYHPQTSGQVEVSNRQIKAILEKTVSTSRTDWSNKLDDALWAYRTAYKTPIGMTPFKL 1523
Y+ + +G VE N+ I I++K + +W +++ A++AY G TP L
Sbjct: 1625 TKGYNSRANGAVERFNKTIMHIMKKKTAVP-MEWDDQVVYAVYAYNNCVHENTGETPMFL 1683
Query: 1524 VYGKSCHLPVELEHKAYWAIRNLNLDPNLAGDKRKLQLNELEELRMDAYENARIYKERTK 1583
++G+ P+E+ + I ++D + + L EL +++ A E+A +E K
Sbjct: 1684 MHGRDVMGPLEMSGEDAVGINYADMD-----EYKHLLTQELLKVQKIAKEHAMREQESYK 1738
Query: 1584 TWHDKKII-KRHF--KSGDLVLLF--NSRLKLFPGKLRSRWSGPFQV 1625
+ D+K K+H + G VLL + +L KL ++WSGP++V
Sbjct: 1739 SLFDQKYASKKHRFPQPGSRVLLEIPSEKLGAQCPKLVNKWSGPYRV 1785
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 302 bits (773), Expect = 6e-81
Identities = 238/875 (27%), Positives = 413/875 (47%), Gaps = 81/875 (9%)
Query: 776 LLEEDYKPSIEHQRRLNPNMKEVVKKEVLKLLDAGVIYPISDSKWVS--PVQVVPKKGGL 833
L +E+Y+ I + L P + + E+ + L +G+I +SK ++ PV VPKK G
Sbjct: 406 LTQENYRLPIRNYP-LPPGKMQAMNDEINQGLKSGII---RESKAINACPVMFVPKKEGT 461
Query: 834 TVIKNDKNESIATRTVTGWRMCIDYRKLNKATRKDHFPLPFIDQMLERLAKHSHFCYLDG 893
RM +DY+ LNK + + +PLP I+Q+L ++ + F LD
Sbjct: 462 L------------------RMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDL 503
Query: 894 YSGFFQIPIHPNDQEKTTFTCPFGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEV 953
S + I + D+ K F CP G F Y MP+G+ APA FQ + +I + E +
Sbjct: 504 KSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESHVVC 563
Query: 954 FMDDFSVHGSNFDDCLTNLEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLVFDRGIEVD 1013
+MDD +H + + + +++ VL++ + NL++N KC F + +G+ + ++G
Sbjct: 564 YMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPC 623
Query: 1014 RAKIEIIKKMLPPTSVKEIRSFLGHAGFYRRFIKDFSSITKPLTSLLLKDADFTFDDSCL 1073
+ I+ + + P + KE+R FLG + R+FI S +T PL +LL KD + + +
Sbjct: 624 QENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQT 683
Query: 1074 QAFCRLKEALITAPIIQPPDWNLPFEIMCDASDYAVGAVLGQR-NDKKMHAIYYASKTLD 1132
QA +K+ L++ P+++ D++ + DASD AVGAVL Q+ +D K + + Y S +
Sbjct: 684 QAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMS 743
Query: 1133 GAQVNYATTEKELLAVVYAIDKFRQYLVGS--KIIVYTDH-SAIKYLLNKKDAK-PRLIR 1188
AQ+NY+ ++KE+LA++ ++ +R YL + + TDH + I + N+ + + RL R
Sbjct: 744 KAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLAR 803
Query: 1189 WILLLQEFDLEIKDKKGVENVVADHLSRLRETNKDELPLDDSFPDDQLFLLAQTDAPWYA 1248
W L LQ+F+ EI + G N +AD LSR+ + + P+ D+ +
Sbjct: 804 WQLFLQDFNFEINYRPGSANHIADALSRIVDETE---PIPKDSEDNSI------------ 848
Query: 1249 DFVNFLAAGVLPPELNYQQKKKFFNDLK--HYYWDEPYLFRRG---SDGIF-----RRCI 1298
+FVN ++ + + Q ++ ND K + +E DG+ + +
Sbjct: 849 NFVNQIS---ITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILL 905
Query: 1299 PENE--VSSILTHCHSSSYGGHASTQKTSFKILHSGFWWPSLFKDVHLFISKCDKCQRTG 1356
P + +I+ H H + + IL F W + K + ++ C CQ
Sbjct: 906 PNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINK 964
Query: 1357 SITKRNEMPLNNILEVE-IFDVWGIDFMGPFPSSFGNQYILVAVDYVSKWVEAI-ASPTN 1414
S + PL I E ++ +DF+ P S G + V VD SK + + +
Sbjct: 965 SRNHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSI 1024
Query: 1415 DAQVVIKMFKKVIFPRFGVPRVVISDGGSHFISRHFEKLLQKLGVRHKIATPYHPQTSGQ 1474
A+ +MF + + FG P+ +I+D F S+ ++ K K + PY PQT GQ
Sbjct: 1025 TAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQ 1084
Query: 1475 VEVSNRQIKAILEKTVSTSRTDWSNKLDDALWAYRTAYKTPIGMTPFKLVYGKSCHL-PV 1533
E +N+ ++ +L ST W + + +Y A + MTPF++V+ S L P+
Sbjct: 1085 TERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPL 1144
Query: 1534 ELEHKAYWAIRNLNLDPNLAGDKRKLQLNELEELRMDAYENARIYKERTKTWHDKKIIK- 1592
EL P+ + DK E ++ E+ + K + D KI +
Sbjct: 1145 EL--------------PSFS-DKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEI 1189
Query: 1593 RHFKSGDLVLLFNSRLKLF--PGKLRSRWSGPFQV 1625
F+ GDLV++ ++ KL ++GPF V
Sbjct: 1190 EEFQPGDLVMVKRTKTGFLHKSNKLAPSFAGPFYV 1224
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 299 bits (766), Expect = 4e-80
Identities = 237/875 (27%), Positives = 413/875 (47%), Gaps = 81/875 (9%)
Query: 776 LLEEDYKPSIEHQRRLNPNMKEVVKKEVLKLLDAGVIYPISDSKWVS--PVQVVPKKGGL 833
L +E+Y+ I + L P + + E+ + L +G+I +SK ++ PV VPKK G
Sbjct: 406 LTQENYRLPIRNYP-LPPGKMQAMNDEINQGLKSGII---RESKAINACPVMFVPKKEGT 461
Query: 834 TVIKNDKNESIATRTVTGWRMCIDYRKLNKATRKDHFPLPFIDQMLERLAKHSHFCYLDG 893
RM +DY+ LNK + + +PLP I+Q+L ++ + F LD
Sbjct: 462 L------------------RMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDL 503
Query: 894 YSGFFQIPIHPNDQEKTTFTCPFGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEV 953
S + I + D+ K F CP G F Y MP+G+ APA FQ + +I + E +
Sbjct: 504 KSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVC 563
Query: 954 FMDDFSVHGSNFDDCLTNLEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLVFDRGIEVD 1013
+MD+ +H + + + +++ VL++ + NL++N KC F + +G+ + ++G
Sbjct: 564 YMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPC 623
Query: 1014 RAKIEIIKKMLPPTSVKEIRSFLGHAGFYRRFIKDFSSITKPLTSLLLKDADFTFDDSCL 1073
+ I+ + + P + KE+R FLG + R+FI S +T PL +LL KD + + +
Sbjct: 624 QENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQT 683
Query: 1074 QAFCRLKEALITAPIIQPPDWNLPFEIMCDASDYAVGAVLGQR-NDKKMHAIYYASKTLD 1132
QA +K+ L++ P+++ D++ + DASD AVGAVL Q+ +D K + + Y S +
Sbjct: 684 QAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMS 743
Query: 1133 GAQVNYATTEKELLAVVYAIDKFRQYLVGS--KIIVYTDH-SAIKYLLNKKDAK-PRLIR 1188
AQ+NY+ ++KE+LA++ ++ +R YL + + TDH + I + N+ + + RL R
Sbjct: 744 KAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLAR 803
Query: 1189 WILLLQEFDLEIKDKKGVENVVADHLSRLRETNKDELPLDDSFPDDQLFLLAQTDAPWYA 1248
W L LQ+F+ EI + G N +AD LSR+ + + P+ D+ +
Sbjct: 804 WQLFLQDFNFEINYRPGSANHIADALSRIVDETE---PIPKDSEDNSI------------ 848
Query: 1249 DFVNFLAAGVLPPELNYQQKKKFFNDLK--HYYWDEPYLFRRG---SDGIF-----RRCI 1298
+FVN ++ + + Q ++ ND K + +E DG+ + +
Sbjct: 849 NFVNQIS---ITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILL 905
Query: 1299 PENE--VSSILTHCHSSSYGGHASTQKTSFKILHSGFWWPSLFKDVHLFISKCDKCQRTG 1356
P + +I+ H H + + IL F W + K + ++ C CQ
Sbjct: 906 PNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINK 964
Query: 1357 SITKRNEMPLNNILEVE-IFDVWGIDFMGPFPSSFGNQYILVAVDYVSKWVEAI-ASPTN 1414
S + PL I E ++ +DF+ P S G + V VD SK + + +
Sbjct: 965 SRNHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSI 1024
Query: 1415 DAQVVIKMFKKVIFPRFGVPRVVISDGGSHFISRHFEKLLQKLGVRHKIATPYHPQTSGQ 1474
A+ +MF + + FG P+ +I+D F S+ ++ K K + PY PQT GQ
Sbjct: 1025 TAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQ 1084
Query: 1475 VEVSNRQIKAILEKTVSTSRTDWSNKLDDALWAYRTAYKTPIGMTPFKLVYGKSCHL-PV 1533
E +N+ ++ +L ST W + + +Y A + MTPF++V+ S L P+
Sbjct: 1085 TERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPL 1144
Query: 1534 ELEHKAYWAIRNLNLDPNLAGDKRKLQLNELEELRMDAYENARIYKERTKTWHDKKIIK- 1592
EL P+ + DK E ++ E+ + K + D KI +
Sbjct: 1145 EL--------------PSFS-DKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEI 1189
Query: 1593 RHFKSGDLVLLFNSRLKLF--PGKLRSRWSGPFQV 1625
F+ GDLV++ ++ KL ++GPF V
Sbjct: 1190 EEFQPGDLVMVKRTKTGFLHKSNKLAPSFAGPFYV 1224
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 299 bits (766), Expect = 4e-80
Identities = 237/875 (27%), Positives = 413/875 (47%), Gaps = 81/875 (9%)
Query: 776 LLEEDYKPSIEHQRRLNPNMKEVVKKEVLKLLDAGVIYPISDSKWVS--PVQVVPKKGGL 833
L +E+Y+ I + L P + + E+ + L +G+I +SK ++ PV VPKK G
Sbjct: 406 LTQENYRLPIRNYP-LPPGKMQAMNDEINQGLKSGII---RESKAINACPVMFVPKKEGT 461
Query: 834 TVIKNDKNESIATRTVTGWRMCIDYRKLNKATRKDHFPLPFIDQMLERLAKHSHFCYLDG 893
RM +DY+ LNK + + +PLP I+Q+L ++ + F LD
Sbjct: 462 L------------------RMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDL 503
Query: 894 YSGFFQIPIHPNDQEKTTFTCPFGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEV 953
S + I + D+ K F CP G F Y MP+G+ APA FQ + +I + E +
Sbjct: 504 KSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVC 563
Query: 954 FMDDFSVHGSNFDDCLTNLEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLVFDRGIEVD 1013
+MD+ +H + + + +++ VL++ + NL++N KC F + +G+ + ++G
Sbjct: 564 YMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPC 623
Query: 1014 RAKIEIIKKMLPPTSVKEIRSFLGHAGFYRRFIKDFSSITKPLTSLLLKDADFTFDDSCL 1073
+ I+ + + P + KE+R FLG + R+FI S +T PL +LL KD + + +
Sbjct: 624 QENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQT 683
Query: 1074 QAFCRLKEALITAPIIQPPDWNLPFEIMCDASDYAVGAVLGQR-NDKKMHAIYYASKTLD 1132
QA +K+ L++ P+++ D++ + DASD AVGAVL Q+ +D K + + Y S +
Sbjct: 684 QAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMS 743
Query: 1133 GAQVNYATTEKELLAVVYAIDKFRQYLVGS--KIIVYTDH-SAIKYLLNKKDAK-PRLIR 1188
AQ+NY+ ++KE+LA++ ++ +R YL + + TDH + I + N+ + + RL R
Sbjct: 744 KAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLAR 803
Query: 1189 WILLLQEFDLEIKDKKGVENVVADHLSRLRETNKDELPLDDSFPDDQLFLLAQTDAPWYA 1248
W L LQ+F+ EI + G N +AD LSR+ + + P+ D+ +
Sbjct: 804 WQLFLQDFNFEINYRPGSANHIADALSRIVDETE---PIPKDSEDNSI------------ 848
Query: 1249 DFVNFLAAGVLPPELNYQQKKKFFNDLK--HYYWDEPYLFRRG---SDGIF-----RRCI 1298
+FVN ++ + + Q ++ ND K + +E DG+ + +
Sbjct: 849 NFVNQIS---ITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILL 905
Query: 1299 PENE--VSSILTHCHSSSYGGHASTQKTSFKILHSGFWWPSLFKDVHLFISKCDKCQRTG 1356
P + +I+ H H + + IL F W + K + ++ C CQ
Sbjct: 906 PNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINK 964
Query: 1357 SITKRNEMPLNNILEVE-IFDVWGIDFMGPFPSSFGNQYILVAVDYVSKWVEAI-ASPTN 1414
S + PL I E ++ +DF+ P S G + V VD SK + + +
Sbjct: 965 SRNHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSI 1024
Query: 1415 DAQVVIKMFKKVIFPRFGVPRVVISDGGSHFISRHFEKLLQKLGVRHKIATPYHPQTSGQ 1474
A+ +MF + + FG P+ +I+D F S+ ++ K K + PY PQT GQ
Sbjct: 1025 TAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQ 1084
Query: 1475 VEVSNRQIKAILEKTVSTSRTDWSNKLDDALWAYRTAYKTPIGMTPFKLVYGKSCHL-PV 1533
E +N+ ++ +L ST W + + +Y A + MTPF++V+ S L P+
Sbjct: 1085 TERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPL 1144
Query: 1534 ELEHKAYWAIRNLNLDPNLAGDKRKLQLNELEELRMDAYENARIYKERTKTWHDKKIIK- 1592
EL P+ + DK E ++ E+ + K + D KI +
Sbjct: 1145 EL--------------PSFS-DKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEI 1189
Query: 1593 RHFKSGDLVLLFNSRLKLF--PGKLRSRWSGPFQV 1625
F+ GDLV++ ++ KL ++GPF V
Sbjct: 1190 EEFQPGDLVMVKRTKTGFLHKSNKLAPSFAGPFYV 1224
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 295 bits (756), Expect = 5e-79
Identities = 241/793 (30%), Positives = 370/793 (46%), Gaps = 88/793 (11%)
Query: 725 PNSTYPVIVN-----ASLDEVETEKLLYVLKKYPKAIGYTIDDIKGINPSL----CMHRI 775
PN P++ + L+ E ++L +L+KY + D + N + H +
Sbjct: 148 PNKISPILESDLYRLEHLNNEEKQRLCALLQKYHDIQYHEGDKLTFTNQTKHTINTKHNL 207
Query: 776 LLEEDYKPSIEHQRRLNPNMKEVVKKEVLKLLDAGVIYPISDSKWVSPVQVVPKKGGLTV 835
L Y +++ V+ ++ +L+ G+I S+S + SP+ VVPKK +
Sbjct: 208 PLYSKYSYPQAYEQE--------VESQIQDMLNQGIIRT-SNSPYNSPIWVVPKKQDASG 258
Query: 836 IKNDKNESIATRTVTGWRMCIDYRKLNKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYS 895
+ +R+ IDYRKLN+ T D P+P +D++L +L + ++F +D
Sbjct: 259 KQK-------------FRIVIDYRKLNEITVGDRHPIPNMDEILGKLGRCNYFTTIDLAK 305
Query: 896 GFFQIPIHPNDQEKTTFTCPFGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFM 955
GF QI + P KT F+ G + Y RMPFGL NAPATFQRCM I + K V++
Sbjct: 306 GFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKNAPATFQRCMNDILRPLLNKHCLVYL 365
Query: 956 DDFSVHGSNFDDCLTNLEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRA 1015
DD V ++ D+ L +L V E+ + NL L +KC F+ +E LGH++ GI+ +
Sbjct: 366 DDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDKCEFLKQETTFLGHVLTPDGIKPNPE 425
Query: 1016 KIEIIKKMLPPTSVKEIRSFLGHAGFYRRFIKDFSSITKPLTSLLLKDADF-TFDDSCLQ 1074
KIE I+K PT KEI++FLG G+YR+FI +F+ I KP+T L K+ T +
Sbjct: 426 KIEAIQKYPIPTKPKEIKAFLGLTGYYRKFIPNFADIAKPMTKCLKKNMKIDTTNPEYDS 485
Query: 1075 AFCRLKEALITAPIIQPPDWNLPFEIMCDASDYAVGAVLGQRNDKKMHAIYYASKTLDGA 1134
AF +LK + PI++ PD+ F + DASD A+GAVL Q H + Y S+TL+
Sbjct: 486 AFKKLKYLISEDPILKVPDFTKKFTLTTDASDVALGAVLSQDG----HPLSYISRTLNEH 541
Query: 1135 QVNYATTEKELLAVVYAIDKFRQYLVGSKIIVYTDHSAIKYLLNKKDAKPRLIRWILLLQ 1194
++NY+T EKELLA+V+A FR YL+G + +DH + +L KD +L RW + L
Sbjct: 542 EINYSTIEKELLAIVWATKTFRHYLLGRHFEISSDHQPLSWLYRMKDPNSKLTRWRVKLS 601
Query: 1195 EFDLEIKDKKGVENVVADHLSR--LRETNKDELPLDDSFPDDQLFLLAQTDAPWYADFVN 1252
EFD +IK KG EN VAD LSR L ET E S +D L+ T+ P F
Sbjct: 602 EFDFDIKYIKGKENCVADALSRIKLEETYLSE-QTQHSAEEDNSDLIFITERP-LNTFNR 659
Query: 1253 FLAAGVLPPEL---NYQQK--KKFFNDLKHYYWDEPYLFRR----------GSDGIF--- 1294
+ PP++ Y +K + F D+ E YL SD F
Sbjct: 660 QVIFSKGPPDIKVTKYFKKHITQIFYDIMTREKAEQYLIDHFCGKKSALYIESDADFEVI 719
Query: 1295 ----------------------RRCIPENEVSSILTHCHSSSYGGHASTQKTSFKILHSG 1332
+ E ++ H H QKT+ K+
Sbjct: 720 QAAHKLAINTKYTKILRSTILLKNITTYAEFKELILTAHEKLL--HPGIQKTT-KLFGET 776
Query: 1333 FWWPSLFKDVHLFISKCDKCQRTGSITKRNEMPLNNILEVEIFDVWGIDFMGPFPSSFGN 1392
+++P+ + I++C C + + +MP + E FM SS G
Sbjct: 777 YYFPNSQLLIQNIINECSICNLAKTEHRNTDMPTKTTPKPEHCRE---KFMIDIYSSEGK 833
Query: 1393 QYILVAVDYVSKWVEAIASPTNDAQVVIKMFKKVIFPRFGVPRVVISDGGSHFISRHFEK 1452
Y+ +D SK+ T D + K IF + G P+++ +D F S ++
Sbjct: 834 HYV-SCIDIYSKFATLEEIKTKD-WIECKNALMRIFNQLGKPKLLKADRDGAFSSLALKR 891
Query: 1453 LLQKLGVRHKIAT 1465
L+ V ++ T
Sbjct: 892 WLESEEVELQLNT 904
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 289 bits (740), Expect = 4e-77
Identities = 236/769 (30%), Positives = 376/769 (48%), Gaps = 93/769 (12%)
Query: 799 VKKEVLKLLDAGVIYPISDSKWVSPVQVVPKKGGLTVIKNDKNESIATRTVTGWRMCIDY 858
V+ +V ++L+ G+I S+S + SP VVPKK + S A + +R+ IDY
Sbjct: 222 VENQVQEMLNQGLIRE-SNSPYNSPTWVVPKK---------PDASGANK----YRVVIDY 267
Query: 859 RKLNKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEKTTFTCPFGT 918
RKLN+ T D +P+P +D++L +L K +F +D GF QI + KT F+ G
Sbjct: 268 RKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTKSGH 327
Query: 919 FAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNFDDCLTNLEKVLER 978
+ Y RMPFGL NAPATFQRCM +I + K V++DD + ++ + L +++ V +
Sbjct: 328 YEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLVFTK 387
Query: 979 CEQVNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKIEIIKKMLPPTSVKEIRSFLGH 1038
NL L +KC F+ +E LGH+V GI+ + K++ I PT KEIR+FLG
Sbjct: 388 LADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAFLGL 447
Query: 1039 AGFYRRFIKDFSSITKPLTSLLLKDADF-TFDDSCLQAFCRLKEALITAPIIQPPDWNLP 1097
G+YR+FI +++ I KP+TS L K T ++AF +LK +I PI+Q PD+
Sbjct: 448 TGYYRKFIPNYADIAKPMTSCLKKRTKIDTQKLEYIEAFEKLKALIIRDPILQLPDFEKK 507
Query: 1098 FEIMCDASDYAVGAVLGQRNDKKMHAIYYASKTLDGAQVNYATTEKELLAVVYAIDKFRQ 1157
F + DAS+ A+GAVL Q H I + S+TL+ ++NY+ EKELLA+V+A FR
Sbjct: 508 FVLTTDASNLALGAVLSQNG----HPISFISRTLNDHELNYSAIEKELLAIVWATKTFRH 563
Query: 1158 YLVGSKIIVYTDHSAIKYLLNKKDAKPRLIRWILLLQEFDLEIKDKKGVENVVADHLSRL 1217
YL+G + ++ +DH +++L N K+ +L RW + L E+ +I KG EN VAD LSR+
Sbjct: 564 YLLGRQFLIASDHQPLRWLHNLKEPGAKLERWRVRLSEYQFKIDYIKGKENSVADALSRI 623
Query: 1218 R-ETNKDELPLDDSFPDDQLFLLAQTDAPWYADFVNFLAAGVL---PPELNYQQKKKFFN 1273
+ E N S +D L+ T+ P +N+ ++ + + K F N
Sbjct: 624 KIEENHHSEATQHSAEEDNSNLIHLTEKP-----INYFKKQIIFIKSDKNKVEHSKIFGN 678
Query: 1274 DLKHYYWDEPYLFRRGS---DGIFRRCIP---ENEVS-SILTHCH----SSSY------- 1315
+ +D L + D R I E++V I+ H +++Y
Sbjct: 679 SITTIQYDVMTLEKAKQILLDHFIHRNITIYIESDVDFEIVQRAHIEIVNTTYTKVIRSL 738
Query: 1316 ------GGHASTQ----KTSFKILHSGFW-WPSLFKDVHLF----------ISKCDKCQR 1354
G +A + ++ K+LH G LFK+ H F I++C+ C
Sbjct: 739 FLLKNVGSYAEFKEIILQSHEKLLHPGIQKMTKLFKENHFFPNSQLLIQNIINECNICNL 798
Query: 1355 TGSITKRNEMPL------NNILEVEIFDVWGIDFMGPFPSSFGNQYILVAVDYVSKWVEA 1408
+ + +MPL + E + D++ SS G YI +D SK+
Sbjct: 799 AKTEHRNTKMPLKITPNPEHCREKFVVDIY---------SSEGKHYI-SCIDIYSKFATL 848
Query: 1409 IASPTNDAQVVIKMFKKVIFPRFGVPRVVISDGGSHFISRHFEKLLQKLGVRHKIATPYH 1468
T D + + IF + G P+++ +D F S ++ L++ V ++ T
Sbjct: 849 EQIKTKD-WIECRNALMRIFNQLGKPKLLKADRDGAFSSLALKRWLEEEEVELQLNT--- 904
Query: 1469 PQTSGQVEVSNRQIKAILEKTVSTSRTDWS----NKLDDALWAYRTAYK 1513
+G +V R K I EK + +D +K++ L+ Y K
Sbjct: 905 -AKNGVADV-ERLHKTINEKIRIINSSDDEEVKLSKIETILYTYNQKIK 951
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 285 bits (729), Expect = 7e-76
Identities = 170/460 (36%), Positives = 264/460 (56%), Gaps = 28/460 (6%)
Query: 794 NMKEVVKKEVLKLLDAGVIYPISDSKWVSPVQVVPKKGGLTVIKNDKNESIATRTVTGWR 853
NM+ V++++ +LL G+I P S+S + SP+ +VPKK K +R
Sbjct: 134 NMRGEVERQIDELLQDGIIRP-SNSPYNSPIWIVPKKPKPNGEKQ-------------YR 179
Query: 854 MCIDYRKLNKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEKTTFT 913
M +D+++LN T D +P+P I+ L L +F LD SGF QI + +D KT F+
Sbjct: 180 MVVDFKRLNTVTIPDTYPIPDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFS 239
Query: 914 CPFGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNFDDCLTNLE 973
G + + R+PFGL NAPA FQR + I + + K+ V++DD V ++D NL
Sbjct: 240 TLNGKYEFLRLPFGLKNAPAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLR 299
Query: 974 KVLERCEQVNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKIEIIKKMLPPTSVKEIR 1033
VL + NL +N EK HF+ + LG++V GI+ D K+ I +M PPTSVKE++
Sbjct: 300 LVLASLSKANLQVNLEKSHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELK 359
Query: 1034 SFLGHAGFYRRFIKDFSSITKPLTSLL------LKDAD-----FTFDDSCLQAFCRLKEA 1082
FLG +YR+FI+D++ + KPLT+L +K + T D++ LQ+F LK
Sbjct: 360 RFLGMTSYYRKFIQDYAKVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSI 419
Query: 1083 LITAPIIQPPDWNLPFEIMCDASDYAVGAVLGQRNDKKMHAIYYASKTLDGAQVNYATTE 1142
L ++ I+ P + PF + DAS++A+GAVL Q + + I Y S++L+ + NYAT E
Sbjct: 420 LCSSEILAFPCFTKPFHLTTDASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIE 479
Query: 1143 KELLAVVYAIDKFRQYLVGSKII-VYTDHSAIKYLLNKKDAKPRLIRWILLLQEFDLEIK 1201
KE+LA+++++D R YL G+ I VYTDH + + L ++ +L RW ++E++ E+
Sbjct: 480 KEMLAIIWSLDNLRAYLYGAGTIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELI 539
Query: 1202 DKKGVENVVADHLSRLRETNKDELPLD-DSFPDDQLFLLA 1240
K G NVVAD LSR+ ++L D D+ P+D + LA
Sbjct: 540 YKPGKSNVVADALSRI-PPQLNQLSTDLDANPEDDMQSLA 578
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 264 bits (675), Expect = 1e-69
Identities = 170/507 (33%), Positives = 266/507 (51%), Gaps = 33/507 (6%)
Query: 712 KTLPSNLRYEFLGPNSTYPVIVNASLDEVETEKL-LYVLKKYPKAIGYTIDDIKGINPSL 770
KT+ S L+ F P + + L+ + +E + ++ L+ P + +L
Sbjct: 261 KTVLSQLKKNF-------PELFKSQLENICSEYIDIFALESEPITVN-----------NL 302
Query: 771 CMHRILLEEDYKPSIEHQRRLNPNMKEVVKKEVLKLLDAGVIYPISDSKWVSPVQVVPKK 830
++ L++D +P R + E ++ +V KL+ ++ P S S++ SP+ +VPKK
Sbjct: 303 YKQQLRLKDD-EPVYTKNYRSPHSQVEEIQAQVQKLIKDKIVEP-SVSQYNSPLLLVPKK 360
Query: 831 GGLTVIKNDKNESIATRTVTGWRMCIDYRKLNKATRKDHFPLPFIDQMLERLAKHSHFCY 890
+DK + WR+ IDYR++NK D FPLP ID +L++L + +F
Sbjct: 361 SSPN---SDKKK---------WRLVIDYRQINKKLLADKFPLPRIDDILDQLGRAKYFSC 408
Query: 891 LDGYSGFFQIPIHPNDQEKTTFTCPFGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKI 950
LD SGF QI + ++ T+F+ G++ + R+PFGL AP +FQR M FS
Sbjct: 409 LDLMSGFHQIELDEGSRDITSFSTSNGSYRFTRLPFGLKIAPNSFQRMMTIAFSGIEPSQ 468
Query: 951 MEVFMDDFSVHGSNFDDCLTNLEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLVFDRGI 1010
++MDD V G + L NL +V +C + NL L+ EKC F + E LGH D+GI
Sbjct: 469 AFLYMDDLIVIGCSEKHMLKNLTEVFGKCREYNLKLHPEKCSFFMHEVTFLGHKCTDKGI 528
Query: 1011 EVDRAKIEIIKKMLPPTSVKEIRSFLGHAGFYRRFIKDFSSITKPLTSLLLKDADFTFDD 1070
D K ++I+ P R F+ +YRRFIK+F+ ++ +T L K+ F + D
Sbjct: 529 LPDDKKYDVIQNYPVPHDADSARRFVAFCNYYRRFIKNFADYSRHITRLCKKNVPFEWTD 588
Query: 1071 SCLQAFCRLKEALITAPIIQPPDWNLPFEIMCDASDYAVGAVLGQRNDKKMHAIYYASKT 1130
C +AF LK LI ++Q PD++ F I DAS A GAVL Q ++ + YAS+
Sbjct: 589 ECQKAFIHLKSQLINPTLLQYPDFSKEFCITTDASKQACGAVLTQNHNGHQLPVAYASRA 648
Query: 1131 LDGAQVNYATTEKELLAVVYAIDKFRQYLVGSKIIVYTDHSAIKYLLNKKDAKPRLIRWI 1190
+ N +TTE+EL A+ +AI FR Y+ G V TDH + YL + + +L R
Sbjct: 649 FTKGESNKSTTEQELAAIHWAIIHFRPYIYGKHFTVKTDHRPLTYLFSMVNPSSKLTRIR 708
Query: 1191 LLLQEFDLEIKDKKGVENVVADHLSRL 1217
L L+E++ ++ KG +N VAD LSR+
Sbjct: 709 LELEEYNFTVEYLKGKDNHVADALSRI 735
Score = 125 bits (315), Expect = 7e-28
Identities = 95/364 (26%), Positives = 173/364 (47%), Gaps = 23/364 (6%)
Query: 1300 ENEVSSILTHCHSSSY-GGHASTQKTSFKILHSGFWWPSLFKDVHLFISKCDKCQRTGSI 1358
E E +IL+ H GGH KT K+ ++W ++ K + ++ KC KCQ+ +
Sbjct: 890 EKEKEAILSTLHDDPIQGGHTGITKTLAKVKRH-YYWKNMSKYIKEYVRKCQKCQKAKT- 947
Query: 1359 TKRNEMPLNNILEVE-IFDVWGIDFMGPFPSSF-GNQYILVAVDYVSKWVEAIASPTNDA 1416
TK + P+ E FD +D +GP P S GN+Y + + ++K++ AI A
Sbjct: 948 TKHTKTPMTITETPEHAFDRVVVDTIGPLPKSENGNEYAVTLICDLTKYLVAIPIANKSA 1007
Query: 1417 QVVIKMFKKVIFPRFGVPRVVISDGGSHFISRHFEKLLQKLGVRHKIATPYHPQTSGQVE 1476
+ V K + ++G + I+D G+ + + L + L +++ +T +H QT G VE
Sbjct: 1008 KTVAKAIFESFILKYGPMKTFITDMGTEYKNSIITDLCKYLKIKNITSTAHHHQTVGVVE 1067
Query: 1477 VSNRQIKAILEKTVSTSRTDWSNKLDDALWAYRTAYKTPIGMTPFKLVYGKSCHLPVELE 1536
S+R + + +ST +TDW L ++ + T P++LV+G++ +LP
Sbjct: 1068 RSHRTLNEYIRSYISTDKTDWDVWLQYFVYCFNTTQSMVHNYCPYELVFGRTSNLPKHF- 1126
Query: 1537 HKAYWAIRNLNLDPNLAGDKRKLQLNELEELRMDAYENAR----IYKERTKTWHDKKIIK 1592
+K + N+D K +L++ AY AR +KE+ K +D K+
Sbjct: 1127 NKLHSIEPIYNIDDYAKESKYRLEV---------AYARARKLLEAHKEKNKENYDLKVKD 1177
Query: 1593 RHFKSGDLVLLFNSRLKLFPGKLRSRWSGPFQVRTVYPYGAIEIFSEETGSFTVNGQRLK 1652
+ GD VLL N KL +++GP+++ ++ I + + + V+ RLK
Sbjct: 1178 IELEVGDKVLLRNE----VGHKLDFKYTGPYKIESIGDNNNITLLTNKNKKQIVHKDRLK 1233
Query: 1653 IYNT 1656
+++
Sbjct: 1234 KFHS 1237
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 236 bits (601), Expect = 5e-61
Identities = 151/435 (34%), Positives = 237/435 (53%), Gaps = 30/435 (6%)
Query: 795 MKEVVKKEVLKLLDAGVIYPISDSKWVSPVQVVPKKGGLTVIKNDKNESIATRTVTGWRM 854
+ + V EV +LL G+I P S S + SP VV KKG T + N+ R+
Sbjct: 193 VSDFVNNEVKQLLKDGIIRP-SRSPYNSPTWVVDKKG--TDAFGNPNK----------RL 239
Query: 855 CIDYRKLNKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEKTTFTC 914
ID+RKLN+ T D +P+P I +L L K F LD SG+ QI + +D+EKT+F+
Sbjct: 240 VIDFRKLNEKTIPDRYPMPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFSV 299
Query: 915 PFGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNFDDCLTNLEK 974
G + + R+PFGL NA + FQR + + + + KI V++DD + N D + +++
Sbjct: 300 NGGKYEFCRLPFGLRNASSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHIDT 359
Query: 975 VLERCEQVNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKIEIIKKMLPPTSVKEIRS 1034
VL+ N+ ++ EK F LG +V G + D K++ I++ P V ++RS
Sbjct: 360 VLKCLIDANMRVSQEKTRFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRS 419
Query: 1035 FLGHAGFYRRFIKDFSSITKPLTSLL-----------LKDADFTFDDSCLQAFCRLKEAL 1083
FLG A +YR FIKDF++I +P+T +L K F+++ AF RL+ L
Sbjct: 420 FLGLASYYRVFIKDFAAIARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNIL 479
Query: 1084 ITAPII-QPPDWNLPFEIMCDASDYAVGAVLGQRNDKKMHAIYYASKTLDGAQVNYATTE 1142
+ +I + PD+ PF++ DAS +GAVL Q I S+TL + NYAT E
Sbjct: 480 ASEDVILKYPDFKKPFDLTTDASASGIGAVLSQEG----RPITMISRTLKQPEQNYATNE 535
Query: 1143 KELLAVVYAIDKFRQYLVGSK-IIVYTDHSAIKYLLNKKDAKPRLIRWILLLQEFDLEIK 1201
+ELLA+V+A+ K + +L GS+ I ++TDH + + + ++ ++ RW + + + ++
Sbjct: 536 RELLAIVWALGKLQNFLYGSREINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQHNAKVF 595
Query: 1202 DKKGVENVVADHLSR 1216
K G EN VAD LSR
Sbjct: 596 YKPGKENFVADALSR 610
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 659
Score = 175 bits (444), Expect = 8e-43
Identities = 133/439 (30%), Positives = 221/439 (50%), Gaps = 27/439 (6%)
Query: 792 NPNMKEVVKKEVLKLLDAGVIYPISDSKWVSPVQVVPKKGGLTVIKNDKNESIATRTVTG 851
+P+ +E +++ +LL+ VI P S S +SP +V E+ A R
Sbjct: 237 SPSDREEFDRQIKELLELKVIKP-SKSTHMSPAFLV--------------ENEAERRRGK 281
Query: 852 WRMCIDYRKLNKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEKTT 911
RM ++Y+ +NKAT+ D LP D++L + + D SG +Q+ + Q T
Sbjct: 282 KRMVVNYKAMNKATKGDAHNLPNKDELLTLVRGKKIYSSFDCKSGLWQVLLDKESQLLTA 341
Query: 912 FTCPFGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNF-DDCLT 970
FTCP G + + +PFGL AP+ F + + S+ K V++DD V + +
Sbjct: 342 FTCPQGHYQWNVVPFGLKQAPSIFPKTYANSHSNQYSKYCCVYVDDILVFSNTGRKEHYI 401
Query: 971 NLEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKI-EIIKKMLPPTSV 1029
++ +L RCE++ ++L+ +K + LG L D+G + I E I K P +
Sbjct: 402 HVLNILRRCEKLGIILSKKKAQLFKEKINFLG-LEIDQGTHCPQNHILEHIHKF--PDRI 458
Query: 1030 ---KEIRSFLGHAGFYRRFIKDFSSITKPLTSLLLKDADFTFDDSCLQAFCRLKEALITA 1086
K+++ FLG + +I +SI KPL S L +D+ +T++D+ Q ++K+ L +
Sbjct: 459 EDKKQLQRFLGILTYASDYIPKLASIRKPLQSKLKEDSTWTWNDTDSQYMAKIKKNLKSF 518
Query: 1087 PIIQPPDWNLPFEIMCDASDYAVGAVLGQRNDKKMHAIYYASKTLDGAQVNYATTEKELL 1146
P + P+ N I DAS+ G +L ++ + YAS + A+ NY + EKELL
Sbjct: 519 PKLYHPEPNDKLVIETDASEEFWGGILKAIHNSHEYICRYASGSFKAAERNYHSNEKELL 578
Query: 1147 AVVYAIDKFRQYLVGSKIIVYTDHSAIKYLLN---KKDAKP-RLIRWILLLQEFDLEIKD 1202
AV+ I KF YL S+ ++ TD+ + +N K D K RL+RW + L ++D +++
Sbjct: 579 AVIRVIKKFSIYLTPSRFLIRTDNKNFTHFVNINLKGDRKQGRLVRWQMWLSQYDFDVEH 638
Query: 1203 KKGVENVVADHLSRLRETN 1221
G +NV AD L TN
Sbjct: 639 IAGTKNVFADFLQENTLTN 657
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 666
Score = 164 bits (416), Expect = 1e-39
Identities = 136/432 (31%), Positives = 213/432 (48%), Gaps = 24/432 (5%)
Query: 792 NPNMKEVVKKEVLKLLDAGVIYPISDSKWVSPVQVVPKKGGLTVIKNDKNESIATRTVTG 851
+P +E K++ +LLD G+I P S S+ +SP +V E+ A R
Sbjct: 248 SPQDREGFAKQIKELLDLGLIIP-SKSQHMSPAFLV--------------ENEAERRRGK 292
Query: 852 WRMCIDYRKLNKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEKTT 911
RM ++Y+ +N+AT D LP + ++L L S F D SGF+Q+ + Q+ T
Sbjct: 293 KRMVVNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKSGFWQVVLDEESQKLTA 352
Query: 912 FTCPFGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNFDDCLTN 971
FTCP G F ++ +PFGL AP+ FQR M + + +K V++DD V ++ D +
Sbjct: 353 FTCPQGHFQWKVVPFGLKQAPSIFQRHMQTALNG-ADKFCMVYVDDIIVFSNSELDHYNH 411
Query: 972 LEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKI-EIIKKMLPP-TSV 1029
+ VL+ E+ ++L+ +K + + +E I L D+G + I E I K
Sbjct: 412 VYAVLKIVEKYGIILSKKKAN-LFKEKINFLGLEIDKGTHCPQNHILENIHKFPDRLEDK 470
Query: 1030 KEIRSFLGHAGFYRRFIKDFSSITKPLTSLLLKDADFTFDDSCLQAFCRLKEALITAPII 1089
K ++ FLG + +I + I KPL L KD + + S ++K+ L + P +
Sbjct: 471 KHLQRFLGVLTYAETYIPKLAEIRKPLQVKLKKDVTWNWTQSDSDYVKKIKKNLGSFPKL 530
Query: 1090 QPPDWNLPFEIMCDASDYAVGAVLGQRNDKKMHAI-YYASKTLDGAQVNYATTEKELLAV 1148
P I DASD G VL R + I Y+S + A+ NY + +KELLAV
Sbjct: 531 YLPKPEDHLIIETDASDSFWGGVLKARALDGVELICRYSSGSFKQAEKNYHSNDKELLAV 590
Query: 1149 VYAIDKFRQYLVGSKIIVYTDHSAIKYLLN---KKDAKP-RLIRWILLLQEFDLEIKDKK 1204
I KF YL + V TD+ Y L K D+K RL+RW ++ +++ +
Sbjct: 591 KQVITKFSAYLTPVRFTVRTDNKNFTYFLRINLKGDSKQGRLVRWQNWFSKYQFDVEHLE 650
Query: 1205 GVENVVADHLSR 1216
GV+NV+AD L+R
Sbjct: 651 GVKNVLADCLTR 662
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 156 bits (394), Expect = 5e-37
Identities = 133/443 (30%), Positives = 207/443 (46%), Gaps = 29/443 (6%)
Query: 790 RLNPNMKEVVKKEVLKLLDAGVIYPISDSKWVSPVQVVPKKGGLTVIKNDKNESIATRTV 849
+ +P +E K++ +LLD VI P S S ++P +V NE+ R
Sbjct: 253 KYSPMDREEFDKQIKELLDLKVIKP-SKSPHMAPAFLV------------NNEAEKRRGK 299
Query: 850 TGWRMCIDYRKLNKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEK 909
RM ++Y+ +NKAT D + LP D++L + F D SGF+Q+ + +
Sbjct: 300 K--RMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPL 357
Query: 910 TTFTCPFGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNFDDCL 969
T FTCP G + + +PFGL AP+ FQR M F F K V++DD V +N +D L
Sbjct: 358 TAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHL 416
Query: 970 TNLEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKIEIIKKMLPPT-- 1027
++ +L++C Q ++L+ +K ++ LG L D G + I P T
Sbjct: 417 LHVAMILQKCNQHGIILSKKKAQLFKKKINFLG-LEIDEGTHKPQGHILEHINKFPDTLE 475
Query: 1028 SVKEIRSFLGHAGFYRRFIKDFSSITKPLTSLLLKDADFTFDDSCLQAFCRLKEALITAP 1087
K+++ FLG + +I + I KPL + L ++ + + ++K+ L P
Sbjct: 476 DKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWRWTKEDTLYMQKVKKNLQGFP 535
Query: 1088 IIQPPDWNLPFEIMCDASDYAVGAVLG----QRNDKKMHAIYYASKTLDGAQVNYATTEK 1143
+ P I DASD G +L YAS + A+ NY + +K
Sbjct: 536 PLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKNYHSNDK 595
Query: 1144 ELLAVVYAIDKFRQYLVGSKIIVYTDHSAIKYLLN---KKDAK-PRLIRWILLLQEFDLE 1199
E LAV+ I KF YL ++ TD++ K +N K D+K R IRW L + +
Sbjct: 596 ETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFD 655
Query: 1200 IKDKKGVENVVADHLSRLRETNK 1222
++ KG +N AD LS RE NK
Sbjct: 656 VEHIKGTDNHFADFLS--REFNK 676
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 155 bits (393), Expect = 7e-37
Identities = 133/443 (30%), Positives = 207/443 (46%), Gaps = 29/443 (6%)
Query: 790 RLNPNMKEVVKKEVLKLLDAGVIYPISDSKWVSPVQVVPKKGGLTVIKNDKNESIATRTV 849
+ +P +E K++ +LLD VI P S S ++P +V NE+ R
Sbjct: 253 KYSPMDREEFDKQIKELLDLKVIKP-SKSPHMAPAFLV------------NNEAEKRRGK 299
Query: 850 TGWRMCIDYRKLNKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEK 909
RM ++Y+ +NKAT D + LP D++L + F D SGF+Q+ + +
Sbjct: 300 K--RMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPL 357
Query: 910 TTFTCPFGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNFDDCL 969
T FTCP G + + +PFGL AP+ FQR M F F K V++DD V +N +D L
Sbjct: 358 TAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHL 416
Query: 970 TNLEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKIEIIKKMLPPT-- 1027
++ +L++C Q ++L+ +K ++ LG L D G + I P T
Sbjct: 417 LHVAMILQKCNQHGIILSKKKAQLFKKKINFLG-LEIDEGTHKPQGHILEHINKFPDTLE 475
Query: 1028 SVKEIRSFLGHAGFYRRFIKDFSSITKPLTSLLLKDADFTFDDSCLQAFCRLKEALITAP 1087
K+++ FLG + +I + I KPL + L ++ + + ++K+ L P
Sbjct: 476 DKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFP 535
Query: 1088 IIQPPDWNLPFEIMCDASDYAVGAVLG----QRNDKKMHAIYYASKTLDGAQVNYATTEK 1143
+ P I DASD G +L YAS + A+ NY + +K
Sbjct: 536 PLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNYHSNDK 595
Query: 1144 ELLAVVYAIDKFRQYLVGSKIIVYTDHSAIKYLLN---KKDAK-PRLIRWILLLQEFDLE 1199
E LAV+ I KF YL ++ TD++ K +N K D+K R IRW L + +
Sbjct: 596 ETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFD 655
Query: 1200 IKDKKGVENVVADHLSRLRETNK 1222
++ KG +N AD LS RE NK
Sbjct: 656 VEHIKGTDNHFADFLS--REFNK 676
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 155 bits (393), Expect = 7e-37
Identities = 133/443 (30%), Positives = 207/443 (46%), Gaps = 29/443 (6%)
Query: 790 RLNPNMKEVVKKEVLKLLDAGVIYPISDSKWVSPVQVVPKKGGLTVIKNDKNESIATRTV 849
+ +P +E K++ +LLD VI P S S ++P +V NE+ R
Sbjct: 253 KYSPMDREEFDKQIKELLDLKVIKP-SKSPHMAPAFLV------------NNEAEKRRGK 299
Query: 850 TGWRMCIDYRKLNKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEK 909
RM ++Y+ +NKAT D + LP D++L + F D SGF+Q+ + +
Sbjct: 300 K--RMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPL 357
Query: 910 TTFTCPFGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNFDDCL 969
T FTCP G + + +PFGL AP+ FQR M F F K V++DD V +N +D L
Sbjct: 358 TAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHL 416
Query: 970 TNLEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKIEIIKKMLPPT-- 1027
++ +L++C Q ++L+ +K ++ LG L D G + I P T
Sbjct: 417 LHVAMILQKCNQHGIILSKKKAQLFKKKINFLG-LEIDEGTHKPQGHILEHINKFPDTLE 475
Query: 1028 SVKEIRSFLGHAGFYRRFIKDFSSITKPLTSLLLKDADFTFDDSCLQAFCRLKEALITAP 1087
K+++ FLG + +I + I KPL + L ++ + + ++K+ L P
Sbjct: 476 DKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFP 535
Query: 1088 IIQPPDWNLPFEIMCDASDYAVGAVLG----QRNDKKMHAIYYASKTLDGAQVNYATTEK 1143
+ P I DASD G +L YAS + A+ NY + +K
Sbjct: 536 PLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNYHSNDK 595
Query: 1144 ELLAVVYAIDKFRQYLVGSKIIVYTDHSAIKYLLN---KKDAK-PRLIRWILLLQEFDLE 1199
E LAV+ I KF YL ++ TD++ K +N K D+K R IRW L + +
Sbjct: 596 ETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFD 655
Query: 1200 IKDKKGVENVVADHLSRLRETNK 1222
++ KG +N AD LS RE NK
Sbjct: 656 VEHIKGTDNHFADFLS--REFNK 676
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 674
Score = 153 bits (386), Expect = 4e-36
Identities = 129/437 (29%), Positives = 203/437 (45%), Gaps = 27/437 (6%)
Query: 790 RLNPNMKEVVKKEVLKLLDAGVIYPISDSKWVSPVQVVPKKGGLTVIKNDKNESIATRTV 849
+ +P +E K++ +LLD VI P S S ++P +V NE+ R
Sbjct: 248 KYSPMDREEFDKQIKELLDLKVIKP-SKSPHMAPAFLV------------NNEAEKRRGK 294
Query: 850 TGWRMCIDYRKLNKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEK 909
RM ++Y+ +NKAT D + P D++L + F D SGF+Q+ + +
Sbjct: 295 K--RMVVNYKAMNKATVGDAYNPPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPL 352
Query: 910 TTFTCPFGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNFDDCL 969
T FTCP G + + +PFGL AP+ FQR M F F K V++DD V +N +D L
Sbjct: 353 TAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHL 411
Query: 970 TNLEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKIEIIKKMLPPT-- 1027
++ +L++C Q ++L+ +K ++ LG L D G + I P T
Sbjct: 412 LHVAMILQKCNQHGIILSKKKAQLFKKKINFLG-LEIDEGTHKPQGHILEHINKFPDTLE 470
Query: 1028 SVKEIRSFLGHAGFYRRFIKDFSSITKPLTSLLLKDADFTFDDSCLQAFCRLKEALITAP 1087
K+++ FLG + +I + I KPL + L ++ + + ++K+ L P
Sbjct: 471 DKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFP 530
Query: 1088 IIQPPDWNLPFEIMCDASDYAVGAVLG----QRNDKKMHAIYYASKTLDGAQVNYATTEK 1143
+ P I DASD G +L YAS + A+ NY + +K
Sbjct: 531 PLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKNYHSNDK 590
Query: 1144 ELLAVVYAIDKFRQYLVGSKIIVYTDHSAIKYLLN---KKDAK-PRLIRWILLLQEFDLE 1199
E LAV+ I KF YL ++ TD++ K +N K D+K R IRW L + +
Sbjct: 591 ETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFD 650
Query: 1200 IKDKKGVENVVADHLSR 1216
++ KG +N AD LSR
Sbjct: 651 VEHIKGTDNHFADFLSR 667
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 680
Score = 152 bits (383), Expect = 1e-35
Identities = 128/443 (28%), Positives = 207/443 (45%), Gaps = 29/443 (6%)
Query: 790 RLNPNMKEVVKKEVLKLLDAGVIYPISDSKWVSPVQVVPKKGGLTVIKNDKNESIATRTV 849
+ +P +E K++ +LLD VI P S S ++P +V + +N +
Sbjct: 254 KYSPMDREEFDKQIKELLDLKVIKP-SKSPHMAPAFLVNNEA-----ENGRGNK------ 301
Query: 850 TGWRMCIDYRKLNKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEK 909
RM ++Y+ +NKAT D + LP D++L + F D SGF+Q+ + +
Sbjct: 302 ---RMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPL 358
Query: 910 TTFTCPFGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNFDDCL 969
T FTCP G + + +PFGL AP+ FQR M F F K V++DD V +N +D L
Sbjct: 359 TAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDIVVFSNNEEDHL 417
Query: 970 TNLEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKIEIIKKMLPPT-- 1027
++ +L++C Q ++L+ +K ++ LG L D G + I P T
Sbjct: 418 LHVAMILQKCNQHGIILSKKKAQLFKKKINFLG-LEIDEGTHKPQGHILEHINKFPDTLE 476
Query: 1028 SVKEIRSFLGHAGFYRRFIKDFSSITKPLTSLLLKDADFTFDDSCLQAFCRLKEALITAP 1087
K+++ FLG + +I + + + +PL + L ++ + + ++K+ L P
Sbjct: 477 DKKQLQRFLGILTYASDYIPNLAQMRQPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFP 536
Query: 1088 IIQPPDWNLPFEIMCDASDYAVGAVLG----QRNDKKMHAIYYASKTLDGAQVNYATTEK 1143
+ P I DASD G +L Y S + A+ NY + +K
Sbjct: 537 PLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYRSGSFKAAERNYHSNDK 596
Query: 1144 ELLAVVYAIDKFRQYLVGSKIIVYTDHSAIKYLLN---KKDAK-PRLIRWILLLQEFDLE 1199
E LAV+ I KF YL ++ TD++ K +N K D+K R IRW L + +
Sbjct: 597 ETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFD 656
Query: 1200 IKDKKGVENVVADHLSRLRETNK 1222
++ KG +N AD LS RE NK
Sbjct: 657 VEHIKGTDNHFADFLS--REFNK 677
>POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1165
Score = 135 bits (341), Expect = 7e-31
Identities = 105/406 (25%), Positives = 175/406 (42%), Gaps = 29/406 (7%)
Query: 777 LEEDYKPSIEHQRRLNPNMKEVVKKEVLKLLDAGVIYPISDSKWVSPVQVVPKKGGLTVI 836
L P Q ++ +E ++ + K LD GV+ P S W +P+ V K G
Sbjct: 169 LRSGASPVAVRQYPMSKEAREGIRPHIQKFLDLGVLVPCR-SPWNTPLLPVKKPG----- 222
Query: 837 KNDKNESIATRTVTGWRMCIDYRKLNKATRKDHFPLPFIDQMLERLA-KHSHFCYLDGYS 895
ND +R D R++NK + H +P +L L ++ + LD
Sbjct: 223 TND------------YRPVQDLREINKRVQDIHPTVPNPYNLLSSLPPSYTWYSVLDLKD 270
Query: 896 GFFQIPIHPNDQEKTTFTCP------FGTFAYRRMPFGLCNAPATFQRCMMSIFSDF--- 946
FF + +HPN Q F G + R+P G N+P F + + F
Sbjct: 271 AFFCLRLHPNSQPLFAFEWKDPEKGNTGQLTWTRLPQGFKNSPTLFDEALHRDLAPFRAL 330
Query: 947 -VEKIMEVFMDDFSVHGSNFDDCLTNLEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLV 1005
+ ++ ++DD V ++DC +K+L+ ++ ++ +K RE LG+L+
Sbjct: 331 NPQVVLLQYVDDLLVAAPTYEDCKKGTQKLLQELSKLGYRVSAKKAQLCQREVTYLGYLL 390
Query: 1006 FDRGIEVDRAKIEIIKKMLPPTSVKEIRSFLGHAGFYRRFIKDFSSITKPLTSLLLKDAD 1065
+ + A+ + K+ PT+ +++R FLG AGF R +I F+S+ PL L +
Sbjct: 391 KEGKRWLTPARKATVMKIPVPTTPRQVREFLGTAGFCRLWIPGFASLAAPLYPLTKESIP 450
Query: 1066 FTFDDSCLQAFCRLKEALITAPIIQPPDWNLPFEIMCDASDYAVGAVLGQRNDKKMHAIY 1125
F + + QAF +K+AL++AP + PD PF + D VL Q +
Sbjct: 451 FIWTEEHQQAFDHIKKALLSAPALALPDLTKPFTLYIDERAGVARGVLTQTLGPWRRPVA 510
Query: 1126 YASKTLDGAQVNYATTEKELLAVVYAIDKFRQYLVGSKIIVYTDHS 1171
Y SK LD + T K + AV + + +G + V HS
Sbjct: 511 YLSKKLDPVASGWPTCLKAVAAVALLLKDADKLTLGQNVTVIASHS 556
Score = 126 bits (317), Expect = 4e-28
Identities = 101/322 (31%), Positives = 143/322 (44%), Gaps = 37/322 (11%)
Query: 1318 HASTQKTSFKILHSGFWWPSLFKDVHLFISKCDKCQRTGSITKRNEMPLNNILEVEIFDV 1377
H +K + + P+L V S+C C T ++T E +
Sbjct: 821 HLGPEKLLQLVNRTSLLIPNLQSAVREVTSQCQACAMTNAVTTYRETGKRQRGDRPGV-Y 879
Query: 1378 WGIDFMGPFPSSFGNQYILVAVDYVSKWVEAIASPTNDAQVVIKMFKKVIFPRFGVPRVV 1437
W +DF P +GN+Y+LV +D S WVEA + T A +V K + I PRFG+P+V+
Sbjct: 880 WEVDFTEIKPGRYGNKYLLVFIDTFSGWVEAFPTKTETALIVCKKILEEILPRFGIPKVL 939
Query: 1438 ISDGGSHFISRHFEKLLQKLGVRHKIATPYHPQTSGQVEVSNRQIKAILEK-TVSTSRTD 1496
SD G F+++ + L +LG+ K+ Y PQ+SGQVE NR IK L K + T D
Sbjct: 940 GSDNGPAFVAQVSQGLATQLGINWKLHCAYRPQSSGQVERMNRTIKETLTKLALETGGKD 999
Query: 1497 WSNKLDDALWAYRTAYKTP--IGMTPFKLVYGKSCHLPVELEHKAYWAIRNLNLDPNLAG 1554
W L AL R TP G+TP++++YG + L L
Sbjct: 1000 WVTLLPLALLRAR---NTPGRFGLTPYEILYGGPPPI--------------LESGETLGP 1042
Query: 1555 DKRKL-----QLNELEELRMDAYENAR-IYKERTKTWHDKKIIKRHFKSGDLVLLFNSRL 1608
D R L L LE +R ++ + +YK T T I F+ GD VL+ R
Sbjct: 1043 DDRFLPVLFTHLKALEIVRTQIWDQIKEVYKPGTVT------IPHPFQVGDQVLVRRHR- 1095
Query: 1609 KLFPGKLRSRWSGPFQVRTVYP 1630
P L RW GP+ V P
Sbjct: 1096 ---PSSLEPRWKGPYLVLLTTP 1114
>POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat
protein; Protease (EC 3.4.23.-); Reverse transcriptase
(EC 2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1675
Score = 124 bits (310), Expect = 3e-27
Identities = 125/496 (25%), Positives = 222/496 (44%), Gaps = 40/496 (8%)
Query: 738 DEVETEKLLYVLKKYPKAIGYTIDDI-KGINPSLCMHRILLEEDYKPSIEHQRRL---NP 793
+E+ + L+ ++K+ +A+G+ DDI K +C +I+ P I P
Sbjct: 1140 NEIGNQSLITMVKEL-EALGFIGDDITKNRTTWVCDFKII-----NPDINITCATIPYTP 1193
Query: 794 NMKEVVKKEVLKLLDAGVIYPISDSKWVSPVQVVPKKGGLTVIKNDKNESIATRTVTGWR 853
KEV +K++ +LLD +K + + I + +E +A + R
Sbjct: 1194 ADKEVFEKQIKELLD---------NKLIKKADPTCRHRTAAFIVRNHSEEVAQKP----R 1240
Query: 854 MCIDYRKLNKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEKTTFT 913
+ +Y++LN D F +P M+ + K + F D +GF + + + ++ TTFT
Sbjct: 1241 IVYNYKRLNDNMHTDPFNIPHKISMINLIQKANIFSKFDLKAGFHHMKLKDDFKDWTTFT 1300
Query: 914 CPFGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNFDDCLTNLE 973
C G + + PFG+ NAP FQR M F D K +++DD + +N + + +L+
Sbjct: 1301 CSEGLYTWNVCPFGIANAPCAFQRFMQESFGDL--KFALLYIDDILIASNNEKEHIEHLK 1358
Query: 974 KVLERCEQVNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKIEIIKKM--LPPTSVKE 1031
R ++V VL+ +K ++E LG + + I + ++ IKK ++K
Sbjct: 1359 IFFNRVKEVGCVLSKKKSKMFLKEVEYLGVEIKEGKISLQPHIVDKIKKFDKNKLNTLKG 1418
Query: 1032 IRSFLGHAGFYRRFIKDFSSITKPLTSLLLKDADFTFDDSCLQAFCRLKEALITAPIIQP 1091
++++LG + R +IKD S + PL K+ F+ +++ + ++
Sbjct: 1419 LQAYLGLLNYARGYIKDLSKLVGPLYKKTGKNGQRIFNKEDWNIIFKIEREVSKIKPLER 1478
Query: 1092 PDWNLPFEIMCDASDYAVGAVLGQRNDK-----KMHAIYYASKTLDGAQVNYATTEKELL 1146
P I DAS+ GAVL + DK YAS G + + + + E+
Sbjct: 1479 PKETDYIIIETDASEEGWGAVLVCKPDKYSGKDTEKIAGYASGNF-GEKKTWTSLDYEIE 1537
Query: 1147 AVVYAIDKFRQYLVGSKIIVYTDHSAIKYLLNKKDAKPR-LIRWI-----LLLQEFDLEI 1200
A+ A++KF+ YL + TD AI + +D K R RWI LL +
Sbjct: 1538 AINEALNKFQIYL-DKDFTIRTDCEAIVKGIKTEDYKKRSKTRWIKLRDNLLKDGYKPTF 1596
Query: 1201 KDKKGVENVVADHLSR 1216
+ KG +N + + LSR
Sbjct: 1597 EHIKGNKNFLPNFLSR 1612
>POL_FENV1 (P31792) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)] (Fragment)
Length = 1046
Score = 124 bits (310), Expect = 3e-27
Identities = 109/438 (24%), Positives = 185/438 (41%), Gaps = 39/438 (8%)
Query: 749 LKKYPKAIGYTIDDIKGINPSLCMHRILLEEDYKPS-IEHQRRLNPNMKEV---VKKEVL 804
L+ +P+A T G+ + C I++ D KP+ + R P KE ++ +
Sbjct: 1 LQDFPQAWAET----GGLGRAKCQVPIII--DLKPTAMPVSIRQYPMSKEAHMGIQPHIT 54
Query: 805 KLLDAGVIYPISDSKWVSPVQVVPKKGGLTVIKNDKNESIATRTVTGWRMCIDYRKLNKA 864
+ L+ GV+ P S W +P+ V K G +R D R++NK
Sbjct: 55 RFLELGVLRPCR-SPWNTPLLPVKKPG-----------------TRDYRPVQDLREVNKR 96
Query: 865 TRKDHFPLPFIDQMLERLAK-HSHFCYLDGYSGFFQIPIHPNDQEKTTFTCP------FG 917
T H +P +L L+ + + LD FF +P+ P QE F G
Sbjct: 97 TMDIHPTVPNPYNLLSTLSPDRTWYTVLDLKDAFFCLPLAPQSQELFAFEWRDPERGISG 156
Query: 918 TFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEV----FMDDFSVHGSNFDDCLTNLE 973
+ R+P G N+P F + +DF + EV ++DD + + C+ +
Sbjct: 157 QLTWTRLPQGFKNSPTLFDEALHRDLTDFRTQHPEVTLLQYVDDLLLAAPTKEACIRGTK 216
Query: 974 KVLERCEQVNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKIEIIKKMLPPTSVKEIR 1033
+L + +K + LG+++ + + +IE + + PP + +E+R
Sbjct: 217 HLLRELGDKGYRASAKKAQICQTKVTYLGYILSEGKRWLTPGRIETVAHIPPPQNPREVR 276
Query: 1034 SFLGHAGFYRRFIKDFSSITKPLTSLLLKDADFTFDDSCLQAFCRLKEALITAPIIQPPD 1093
FLG AGF R +I F+ + PL +L + A FT+ + AF LKEAL++AP + PD
Sbjct: 277 EFLGTAGFCRLWIPGFAELAAPLYALTKESAPFTWQEKHQSAFEALKEALLSAPALGLPD 336
Query: 1094 WNLPFEIMCDASDYAVGAVLGQRNDKKMHAIYYASKTLDGAQVNYATTEKELLAVVYAID 1153
+ PF + D VL Q+ + Y SK LD + + + A +
Sbjct: 337 TSKPFTLFIDEKQGIAKGVLTQKLGPWKRPVAYLSKKLDPVAAGWPPCLRIMAATAMLVK 396
Query: 1154 KFRQYLVGSKIIVYTDHS 1171
+ +G + V T H+
Sbjct: 397 DSAKLTLGQPLTVITPHA 414
Score = 115 bits (288), Expect = 1e-24
Identities = 79/231 (34%), Positives = 113/231 (48%), Gaps = 8/231 (3%)
Query: 1298 IPENEVSSILTHCHSSSYGGHASTQKTSFKILHSGFWWPSLFKDVHLFISKCDKCQRTGS 1357
+P E +++ H+ + H S QK I + F P + S C CQ+ +
Sbjct: 689 LPRKEALAMIQQMHAWT---HLSNQKLKLLIEKTDFLIPKAGTLIEQVTSACKVCQQVNA 745
Query: 1358 -ITKRNEMPLNNILEVEIFDVWGIDFMGPFPSSFGNQYILVAVDYVSKWVEAIASPTNDA 1416
T+ E ++ W IDF P G +Y+LV VD S WVEA + A
Sbjct: 746 GATRVPEGKRTRGNRPGVY--WEIDFTEVKPHYAGYKYLLVFVDTFSGWVEAYPTRQETA 803
Query: 1417 QVVIKMFKKVIFPRFGVPRVVISDGGSHFISRHFEKLLQKLGVRHKIATPYHPQTSGQVE 1476
+V K + IFPRFG+P+V+ SD G F+S+ + L + LG+ K+ Y PQ+SGQVE
Sbjct: 804 HMVAKKILEEIFPRFGLPKVIGSDNGPAFVSQVSQGLARTLGINWKLHCAYRPQSSGQVE 863
Query: 1477 VSNRQIKAILEK-TVSTSRTDWSNKLDDALWAYRTAYKTPIGMTPFKLVYG 1526
NR IK L K T+ T DW L AL R G+TP++++YG
Sbjct: 864 RMNRTIKETLTKLTLETGLKDWRRLLSLALLRARNT-PNRFGLTPYEILYG 913
>POL_MLVFP (P26808) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1204
Score = 123 bits (308), Expect = 5e-27
Identities = 88/254 (34%), Positives = 128/254 (49%), Gaps = 18/254 (7%)
Query: 1378 WGIDFMGPFPSSFGNQYILVAVDYVSKWVEAIASPTNDAQVVIKMFKKVIFPRFGVPRVV 1437
W IDF P +G +Y+LV VD S WVEA + A+VV K + IFPRFG+P+V+
Sbjct: 918 WEIDFTEVKPGLYGYKYLLVFVDTFSGWVEAFPTKKETAKVVTKKLLEEIFPRFGMPQVL 977
Query: 1438 ISDGGSHFISRHFEKLLQKLGVRHKIATPYHPQTSGQVEVSNRQIKAILEK-TVSTSRTD 1496
+D G F+S+ + + LGV K+ Y PQ+SGQVE NR IK L K T++T D
Sbjct: 978 GTDNGPAFVSKVSQTVADLLGVDWKLHCAYRPQSSGQVERMNRTIKETLTKLTLATGSRD 1037
Query: 1497 WSNKLDDALWAYRTAYKTPIGMTPFKLVYGKSCHLPVELEHKAYWAIRNLNLDPNLAGDK 1556
W L AL+ R P G+TP++++YG P L + + + +P+L
Sbjct: 1038 WVLLLPLALYRARNT-PGPHGLTPYEILYG----APPPLVNFPDPDMAKVTHNPSLQAHL 1092
Query: 1557 RKLQLNELEELRMDAYENARIYKERTKTWHDKKIIKRHFKSGDLVLLFNSRLKLFPGKLR 1616
+ L L + E R A Y+E+ D+ ++ F+ GD V + + K L
Sbjct: 1093 QALYLVQHEVWR----PLAAAYQEQL----DRPVVPHPFRVGDTVWVRRHQTK----NLE 1140
Query: 1617 SRWSGPFQVRTVYP 1630
RW GP+ V P
Sbjct: 1141 PRWKGPYTVLLTTP 1154
Score = 116 bits (291), Expect = 5e-25
Identities = 113/487 (23%), Positives = 192/487 (39%), Gaps = 56/487 (11%)
Query: 799 VKKEVLKLLDAGVIYPISDSKWVSPVQVVPKKGGLTVIKNDKNESIATRTVTGWRMCIDY 858
+K + +LLD G++ P S W +P+ V K G ND +R D
Sbjct: 199 IKPHIQRLLDQGILVPCQ-SPWNTPLLPVKKPG-----TND------------YRPVQDL 240
Query: 859 RKLNKATRKDHFPLPFIDQMLERLA-KHSHFCYLDGYSGFFQIPIHPNDQEKTTFTCP-- 915
R++NK H +P +L L H + LD FF + +HP Q F
Sbjct: 241 REVNKRVEDIHPTVPNPYNLLSGLPPSHQWYTVLDLKDAFFCLRLHPTSQSLFAFEWRDP 300
Query: 916 ----FGTFAYRRMPFGLCNAPATFQRCMMSIFSDF----VEKIMEVFMDDFSVHGSNFDD 967
G + R+P G N+P F + +DF + I+ ++DD + ++ D
Sbjct: 301 EMGISGQLTWTRLPQGFKNSPTLFDEALHRDLADFRIQHPDLILLQYVDDLLLAATSELD 360
Query: 968 CLTNLEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKIEIIKKMLPPT 1027
C +L+ + + +K ++ LG+L+ + + A+ E + P
Sbjct: 361 CQQGTRALLQTLGDLGYRASAKKAQICQKQVKYLGYLLKEGQRWLTEARKETVMGQPTPK 420
Query: 1028 SVKEIRSFLGHAGFYRRFIKDFSSITKPLTSLLLKDADFTFDDSCLQAFCRLKEALITAP 1087
+ +++R FLG AGF R +I F+ + PL L F + +A+ +K+AL+TAP
Sbjct: 421 TPRQLREFLGTAGFCRLWIPGFAEMAAPLYPLTKTGTLFKWGPDQQKAYQEIKQALLTAP 480
Query: 1088 IIQPPDWNLPFEIMCDASDYAVGAVLGQRNDKKMHAIYYASKTLDGAQVNYATTEKELLA 1147
+ PD PFE+ D VL Q+ + Y SK LD + + + A
Sbjct: 481 ALGLPDLTKPFELFVDEKQGYAKGVLTQKLGPWRRPVAYLSKKLDPVAAGWPPCLRMVAA 540
Query: 1148 VVYAIDKFRQYLVGSKIIVYTDHSAIKYLLNKKD---AKPRLIRWILLLQEFD------- 1197
+ + +G +++ H+ + D + R+ + LL + D
Sbjct: 541 IAVLTKDAGKLTMGQPLVILAPHAVEALVKQPPDRWLSNARMTHYQALLLDTDRVQFGPI 600
Query: 1198 -------LEIKDKKGVENVVADHLSRLRETNKDELPLDDSFPDDQLFLLAQTDAPWYADF 1250
L ++G+++ D L+ T D D PD D WY D
Sbjct: 601 VTLNPATLLPLPEEGLQHDCLDILAEAHGTRPD--LTDQPLPD--------ADHTWYTDG 650
Query: 1251 VNFLAAG 1257
+FL G
Sbjct: 651 SSFLQEG 657
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.319 0.137 0.407
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 200,581,054
Number of Sequences: 164201
Number of extensions: 8896965
Number of successful extensions: 24106
Number of sequences better than 10.0: 180
Number of HSP's better than 10.0 without gapping: 127
Number of HSP's successfully gapped in prelim test: 57
Number of HSP's that attempted gapping in prelim test: 23547
Number of HSP's gapped (non-prelim): 399
length of query: 1672
length of database: 59,974,054
effective HSP length: 124
effective length of query: 1548
effective length of database: 39,613,130
effective search space: 61321125240
effective search space used: 61321125240
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 73 (32.7 bits)
Medicago: description of AC146683.16