
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC146705.12 - phase: 0 /pseudo
(811 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 293 2e-78
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 282 3e-75
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 279 2e-74
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 279 2e-74
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 271 4e-72
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 266 2e-70
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 257 8e-68
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 235 4e-61
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 211 8e-54
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot... 160 1e-38
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot... 144 7e-34
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 142 5e-33
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 141 6e-33
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 141 6e-33
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 139 4e-32
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 137 9e-32
POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC 3.4.23... 126 2e-28
POL_MLVFP (P26808) Pol polyprotein [Contains: Protease (EC 3.4.2... 123 2e-27
POL_MLVF5 (P26810) Pol polyprotein [Contains: Protease (EC 3.4.2... 123 2e-27
POL_MLVFF (P26809) Pol polyprotein [Contains: Protease (EC 3.4.2... 122 3e-27
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 293 bits (749), Expect = 2e-78
Identities = 233/811 (28%), Positives = 376/811 (45%), Gaps = 74/811 (9%)
Query: 1 NKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEKTTFTCPFGTFAY 60
NK + + PLP I+ L+ LA + D +GF+QIP+ +E T F F +
Sbjct: 1002 NKVVKNNAHPLPNIEATLQSLAGKKLYTVFDMIAGFWQIPLDEKSKEITAFAIGSELFEW 1061
Query: 61 RRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNFDDCLTNLEKVLERCEQ 120
+PFGL +PA FQ M I D + V++DD + + + L ++++ L R +
Sbjct: 1062 NVLPFGLVISPALFQGTMEEIIGDLLGVCAFVYVDDLLIASKDMEQHLQDVKEALTRIRK 1121
Query: 121 VNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKIEIIKKMLPPTSVKEIRSFLGHAGF 180
+ L KCH +E LGH V G+E K + +K+ PT+VKE++SFLG G+
Sbjct: 1122 SGMKLRASKCHIAKKEVEYLGHKVTLDGVETQEVKTDKMKQFSRPTNVKELQSFLGLVGY 1181
Query: 181 YRRFIKDFSSITKPLTSLLLKDADFTFDDSCLQAFCRLKEALITAPIIQPPD------WN 234
YR+FI +F+ I LTSL+ + ++ AF LK+ + P++ PD +
Sbjct: 1182 YRKFILNFAQIASSLTSLISAKVAWIWEKEQEIAFQELKKLVCQTPVLAQPDVEAALKGD 1241
Query: 235 LPFEIMCDASDYAVGAVLGQRN-DKKMHAIYYASKTLDGAQVNYATTEKELLAVVYAIDK 293
PF I DAS +GAVL Q D + H I +ASK L A+ Y T+ E LA+++A+ +
Sbjct: 1242 RPFMIYTDASRKGIGAVLAQEGPDGQQHPIAFASKALSPAETRYHITDLEALAMMFALRR 1301
Query: 294 FRQYLVGSKIIVYTDHSAIKYLLNKKDAKPRLIRWILLLQEFDLEIKDKKGVENVVADHL 353
F+ + G+ I V+TDH + LL RL RW + + EFD++I G N VAD L
Sbjct: 1302 FKTIIYGTAITVFTDHKPLISLLKGSPLADRLWRWSIEILEFDVKIVYLAGKANAVADAL 1361
Query: 354 SR-------LRETNKDEL-----PLDDSFPD-----DQLFLLAQTDAPWYADFVNFLAAG 396
SR L E EL + PD L L D W + + L G
Sbjct: 1362 SRGGCPPNELEEEQTKELTSIVNAIQTELPDILDSSCWLERLKGEDEGW-KEVIAALEGG 1420
Query: 397 V---------LPPELNYQQKKKFFNDLKHYYWDEPYLFRRGSDGIFRRCIPENEVSSILT 447
+ E++ + K LK+ +E R +PE + +L
Sbjct: 1421 KTKGTFKIVGIESEISLEYYKIVGGVLKNTEIEEQ----------SRSVVPEKIRTPLLK 1470
Query: 448 HCHSSSYGGHASTQKTSFKILHSGFWWPSLFKDVHLFISKCDKC-------QRTGSITK- 499
H GH +K ++++H F+WP + V + C KC + T S+T
Sbjct: 1471 ELHEGMLAGHFGIKK-MWRMVHRKFYWPQMRVCVENCVRTCAKCLCANDHSKLTSSLTPY 1529
Query: 500 RNEMPLNNILEVEIFDVWGIDFMGPFPSSFGNQYILVAVDYVSKWVEAIASPTNDAQVVI 559
R PL ++ D M S GN+YIL +D +K+ A+ P A+ V+
Sbjct: 1530 RMTFPL---------EIVACDLMDVGLSVQGNRYILTIIDLFTKYGTAVPIPDKKAETVL 1580
Query: 560 KMF-KKVIFPRFGVPRVVISDGGSHFISRHFEKLLQKLGVRHKIATPYHPQTSGQVEVSN 618
K F ++ +P +++D G F++ F + L + H Y+ + +G VE N
Sbjct: 1581 KAFVERWAIGEGRIPLKLLTDQGKEFVNGLFAQFTHMLKIEHITTKGYNSRANGAVERFN 1640
Query: 619 RQIKAILEKTVSTSRTDWSNKLDDALWAYRTAYKTPIGMTPFKLVYGKSCHLPVELEHKA 678
+ I I++K + +W +++ A++AY G TP L++G+ P+E+ +
Sbjct: 1641 KTIMHIMKKKTAVP-MEWDDQVVYAVYAYNNCVHENTGETPMFLMHGRDVMGPLEMSGED 1699
Query: 679 YWAIRNLNLDPNLAGDKRKLQLNELEELRMDAYENARIYKERTKTWHDKKII-KRHF--K 735
I ++D + + L EL +++ A E+A +E K+ D+K K+H +
Sbjct: 1700 AVGINYADMD-----EYKHLLTQELLKVQKIAKEHAMREQESYKSLFDQKYASKKHRFPQ 1754
Query: 736 SGDLVLLF--NSRLKLFPGKLRSRWSGPFQV 764
G VLL + +L KL ++WSGP++V
Sbjct: 1755 PGSRVLLEIPSEKLGAQCPKLVNKWSGPYRV 1785
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 282 bits (721), Expect = 3e-75
Identities = 214/787 (27%), Positives = 374/787 (47%), Gaps = 57/787 (7%)
Query: 1 NKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEKTTFTCPFGTFAY 60
NK + + +PLP I+Q+L ++ + F LD S + I + D+ K F CP G F Y
Sbjct: 472 NKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEY 531
Query: 61 RRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNFDDCLTNLEKVLERCEQ 120
MP+G+ APA FQ + +I + E + +MDD +H + + + +++ VL++ +
Sbjct: 532 LVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKN 591
Query: 121 VNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKIEIIKKMLPPTSVKEIRSFLGHAGF 180
NL++N KC F + +G+ + ++G + I+ + + P + KE+R FLG +
Sbjct: 592 ANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNY 651
Query: 181 YRRFIKDFSSITKPLTSLLLKDADFTFDDSCLQAFCRLKEALITAPIIQPPDWNLPFEIM 240
R+FI S +T PL +LL KD + + + QA +K+ L++ P+++ D++ +
Sbjct: 652 LRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLE 711
Query: 241 CDASDYAVGAVLGQR-NDKKMHAIYYASKTLDGAQVNYATTEKELLAVVYAIDKFRQYLV 299
DASD AVGAVL Q+ +D K + + Y S + AQ+NY+ ++KE+LA++ ++ +R YL
Sbjct: 712 TDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLE 771
Query: 300 GS--KIIVYTDH-SAIKYLLNKKDAK-PRLIRWILLLQEFDLEIKDKKGVENVVADHLSR 355
+ + TDH + I + N+ + + RL RW L LQ+F+ EI + G N +AD LSR
Sbjct: 772 STIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSR 831
Query: 356 LRETNKDELPLDDSFPDDQLFLLAQTDAPWYADFVNFLAAGVLPPELNYQQKKKFFNDLK 415
+ + + P+ D+ + +FVN ++ + + Q ++ ND K
Sbjct: 832 IVDETE---PIPKDSEDNSI------------NFVNQIS---ITDDFKNQVVTEYTNDTK 873
Query: 416 --HYYWDEPYLFRRG---SDGIF-----RRCIPENE--VSSILTHCHSSSYGGHASTQKT 463
+ +E DG+ + +P + +I+ H H +
Sbjct: 874 LLNLLNNEDKRVEENIQLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELL 933
Query: 464 SFKILHSGFWWPSLFKDVHLFISKCDKCQRTGSITKRNEMPLNNILEVE-IFDVWGIDFM 522
+ IL F W + K + ++ C CQ S + PL I E ++ +DF+
Sbjct: 934 TNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFI 992
Query: 523 GPFPSSFGNQYILVAVDYVSKWVEAI-ASPTNDAQVVIKMFKKVIFPRFGVPRVVISDGG 581
P S G + V VD SK + + + A+ +MF + + FG P+ +I+D
Sbjct: 993 TALPESSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADND 1052
Query: 582 SHFISRHFEKLLQKLGVRHKIATPYHPQTSGQVEVSNRQIKAILEKTVSTSRTDWSNKLD 641
F S+ ++ K K + PY PQT GQ E +N+ ++ +L ST W + +
Sbjct: 1053 HIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHIS 1112
Query: 642 DALWAYRTAYKTPIGMTPFKLVYGKSCHL-PVELEHKAYWAIRNLNLDPNLAGDKRKLQL 700
+Y A + MTPF++V+ S L P+EL P+ + DK
Sbjct: 1113 LVQQSYNNAIHSATQMTPFEIVHRYSPALSPLEL--------------PSFS-DKTDENS 1157
Query: 701 NELEELRMDAYENARIYKERTKTWHDKKIIK-RHFKSGDLVLLFNSRLKLF--PGKLRSR 757
E ++ E+ + K + D KI + F+ GDLV++ ++ KL
Sbjct: 1158 QETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVKRTKTGFLHKSNKLAPS 1217
Query: 758 WSGPFQV 764
++GPF V
Sbjct: 1218 FAGPFYV 1224
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 279 bits (714), Expect = 2e-74
Identities = 213/787 (27%), Positives = 374/787 (47%), Gaps = 57/787 (7%)
Query: 1 NKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEKTTFTCPFGTFAY 60
NK + + +PLP I+Q+L ++ + F LD S + I + D+ K F CP G F Y
Sbjct: 472 NKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEY 531
Query: 61 RRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNFDDCLTNLEKVLERCEQ 120
MP+G+ APA FQ + +I + E + +MD+ +H + + + +++ VL++ +
Sbjct: 532 LVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKN 591
Query: 121 VNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKIEIIKKMLPPTSVKEIRSFLGHAGF 180
NL++N KC F + +G+ + ++G + I+ + + P + KE+R FLG +
Sbjct: 592 ANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNY 651
Query: 181 YRRFIKDFSSITKPLTSLLLKDADFTFDDSCLQAFCRLKEALITAPIIQPPDWNLPFEIM 240
R+FI S +T PL +LL KD + + + QA +K+ L++ P+++ D++ +
Sbjct: 652 LRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLE 711
Query: 241 CDASDYAVGAVLGQR-NDKKMHAIYYASKTLDGAQVNYATTEKELLAVVYAIDKFRQYLV 299
DASD AVGAVL Q+ +D K + + Y S + AQ+NY+ ++KE+LA++ ++ +R YL
Sbjct: 712 TDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLE 771
Query: 300 GS--KIIVYTDH-SAIKYLLNKKDAK-PRLIRWILLLQEFDLEIKDKKGVENVVADHLSR 355
+ + TDH + I + N+ + + RL RW L LQ+F+ EI + G N +AD LSR
Sbjct: 772 STIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSR 831
Query: 356 LRETNKDELPLDDSFPDDQLFLLAQTDAPWYADFVNFLAAGVLPPELNYQQKKKFFNDLK 415
+ + + P+ D+ + +FVN ++ + + Q ++ ND K
Sbjct: 832 IVDETE---PIPKDSEDNSI------------NFVNQIS---ITDDFKNQVVTEYTNDTK 873
Query: 416 --HYYWDEPYLFRRG---SDGIF-----RRCIPENE--VSSILTHCHSSSYGGHASTQKT 463
+ +E DG+ + +P + +I+ H H +
Sbjct: 874 LLNLLNNEDKRVEENIQLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELL 933
Query: 464 SFKILHSGFWWPSLFKDVHLFISKCDKCQRTGSITKRNEMPLNNILEVE-IFDVWGIDFM 522
+ IL F W + K + ++ C CQ S + PL I E ++ +DF+
Sbjct: 934 TNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFI 992
Query: 523 GPFPSSFGNQYILVAVDYVSKWVEAI-ASPTNDAQVVIKMFKKVIFPRFGVPRVVISDGG 581
P S G + V VD SK + + + A+ +MF + + FG P+ +I+D
Sbjct: 993 TALPESSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADND 1052
Query: 582 SHFISRHFEKLLQKLGVRHKIATPYHPQTSGQVEVSNRQIKAILEKTVSTSRTDWSNKLD 641
F S+ ++ K K + PY PQT GQ E +N+ ++ +L ST W + +
Sbjct: 1053 HIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHIS 1112
Query: 642 DALWAYRTAYKTPIGMTPFKLVYGKSCHL-PVELEHKAYWAIRNLNLDPNLAGDKRKLQL 700
+Y A + MTPF++V+ S L P+EL P+ + DK
Sbjct: 1113 LVQQSYNNAIHSATQMTPFEIVHRYSPALSPLEL--------------PSFS-DKTDENS 1157
Query: 701 NELEELRMDAYENARIYKERTKTWHDKKIIK-RHFKSGDLVLLFNSRLKLF--PGKLRSR 757
E ++ E+ + K + D KI + F+ GDLV++ ++ KL
Sbjct: 1158 QETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVKRTKTGFLHKSNKLAPS 1217
Query: 758 WSGPFQV 764
++GPF V
Sbjct: 1218 FAGPFYV 1224
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 279 bits (714), Expect = 2e-74
Identities = 213/787 (27%), Positives = 374/787 (47%), Gaps = 57/787 (7%)
Query: 1 NKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEKTTFTCPFGTFAY 60
NK + + +PLP I+Q+L ++ + F LD S + I + D+ K F CP G F Y
Sbjct: 472 NKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEY 531
Query: 61 RRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNFDDCLTNLEKVLERCEQ 120
MP+G+ APA FQ + +I + E + +MD+ +H + + + +++ VL++ +
Sbjct: 532 LVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKN 591
Query: 121 VNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKIEIIKKMLPPTSVKEIRSFLGHAGF 180
NL++N KC F + +G+ + ++G + I+ + + P + KE+R FLG +
Sbjct: 592 ANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNY 651
Query: 181 YRRFIKDFSSITKPLTSLLLKDADFTFDDSCLQAFCRLKEALITAPIIQPPDWNLPFEIM 240
R+FI S +T PL +LL KD + + + QA +K+ L++ P+++ D++ +
Sbjct: 652 LRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLE 711
Query: 241 CDASDYAVGAVLGQR-NDKKMHAIYYASKTLDGAQVNYATTEKELLAVVYAIDKFRQYLV 299
DASD AVGAVL Q+ +D K + + Y S + AQ+NY+ ++KE+LA++ ++ +R YL
Sbjct: 712 TDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLE 771
Query: 300 GS--KIIVYTDH-SAIKYLLNKKDAK-PRLIRWILLLQEFDLEIKDKKGVENVVADHLSR 355
+ + TDH + I + N+ + + RL RW L LQ+F+ EI + G N +AD LSR
Sbjct: 772 STIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSR 831
Query: 356 LRETNKDELPLDDSFPDDQLFLLAQTDAPWYADFVNFLAAGVLPPELNYQQKKKFFNDLK 415
+ + + P+ D+ + +FVN ++ + + Q ++ ND K
Sbjct: 832 IVDETE---PIPKDSEDNSI------------NFVNQIS---ITDDFKNQVVTEYTNDTK 873
Query: 416 --HYYWDEPYLFRRG---SDGIF-----RRCIPENE--VSSILTHCHSSSYGGHASTQKT 463
+ +E DG+ + +P + +I+ H H +
Sbjct: 874 LLNLLNNEDKRVEENIQLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELL 933
Query: 464 SFKILHSGFWWPSLFKDVHLFISKCDKCQRTGSITKRNEMPLNNILEVE-IFDVWGIDFM 522
+ IL F W + K + ++ C CQ S + PL I E ++ +DF+
Sbjct: 934 TNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFI 992
Query: 523 GPFPSSFGNQYILVAVDYVSKWVEAI-ASPTNDAQVVIKMFKKVIFPRFGVPRVVISDGG 581
P S G + V VD SK + + + A+ +MF + + FG P+ +I+D
Sbjct: 993 TALPESSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADND 1052
Query: 582 SHFISRHFEKLLQKLGVRHKIATPYHPQTSGQVEVSNRQIKAILEKTVSTSRTDWSNKLD 641
F S+ ++ K K + PY PQT GQ E +N+ ++ +L ST W + +
Sbjct: 1053 HIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHIS 1112
Query: 642 DALWAYRTAYKTPIGMTPFKLVYGKSCHL-PVELEHKAYWAIRNLNLDPNLAGDKRKLQL 700
+Y A + MTPF++V+ S L P+EL P+ + DK
Sbjct: 1113 LVQQSYNNAIHSATQMTPFEIVHRYSPALSPLEL--------------PSFS-DKTDENS 1157
Query: 701 NELEELRMDAYENARIYKERTKTWHDKKIIK-RHFKSGDLVLLFNSRLKLF--PGKLRSR 757
E ++ E+ + K + D KI + F+ GDLV++ ++ KL
Sbjct: 1158 QETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVKRTKTGFLHKSNKLAPS 1217
Query: 758 WSGPFQV 764
++GPF V
Sbjct: 1218 FAGPFYV 1224
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 271 bits (694), Expect = 4e-72
Identities = 207/647 (31%), Positives = 308/647 (46%), Gaps = 57/647 (8%)
Query: 1 NKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEKTTFTCPFGTFAY 60
N+ T D P+P +D++L +L + ++F +D GF QI + P KT F+ G + Y
Sbjct: 272 NEITVGDRHPIPNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYEY 331
Query: 61 RRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNFDDCLTNLEKVLERCEQ 120
RMPFGL NAPATFQRCM I + K V++DD V ++ D+ L +L V E+ +
Sbjct: 332 LRMPFGLKNAPATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAK 391
Query: 121 VNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKIEIIKKMLPPTSVKEIRSFLGHAGF 180
NL L +KC F+ +E LGH++ GI+ + KIE I+K PT KEI++FLG G+
Sbjct: 392 ANLKLQLDKCEFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTGY 451
Query: 181 YRRFIKDFSSITKPLTSLLLKDADF-TFDDSCLQAFCRLKEALITAPIIQPPDWNLPFEI 239
YR+FI +F+ I KP+T L K+ T + AF +LK + PI++ PD+ F +
Sbjct: 452 YRKFIPNFADIAKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTL 511
Query: 240 MCDASDYAVGAVLGQRNDKKMHAIYYASKTLDGAQVNYATTEKELLAVVYAIDKFRQYLV 299
DASD A+GAVL Q H + Y S+TL+ ++NY+T EKELLA+V+A FR YL+
Sbjct: 512 TTDASDVALGAVLSQDG----HPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRHYLL 567
Query: 300 GSKIIVYTDHSAIKYLLNKKDAKPRLIRWILLLQEFDLEIKDKKGVENVVADHLSR--LR 357
G + +DH + +L KD +L RW + L EFD +IK KG EN VAD LSR L
Sbjct: 568 GRHFEISSDHQPLSWLYRMKDPNSKLTRWRVKLSEFDFDIKYIKGKENCVADALSRIKLE 627
Query: 358 ETNKDELPLDDSFPDDQLFLLAQTDAPWYADFVNFLAAGVLPPEL---NYQQK--KKFFN 412
ET E S +D L+ T+ P F + PP++ Y +K + F
Sbjct: 628 ETYLSE-QTQHSAEEDNSDLIFITERP-LNTFNRQVIFSKGPPDIKVTKYFKKHITQIFY 685
Query: 413 DLKHYYWDEPYLFRR----------GSDGIF-------------------------RRCI 437
D+ E YL SD F +
Sbjct: 686 DIMTREKAEQYLIDHFCGKKSALYIESDADFEVIQAAHKLAINTKYTKILRSTILLKNIT 745
Query: 438 PENEVSSILTHCHSSSYGGHASTQKTSFKILHSGFWWPSLFKDVHLFISKCDKCQRTGSI 497
E ++ H H QKT+ K+ +++P+ + I++C C +
Sbjct: 746 TYAEFKELILTAHEKLL--HPGIQKTT-KLFGETYYFPNSQLLIQNIINECSICNLAKTE 802
Query: 498 TKRNEMPLNNILEVEIFDVWGIDFMGPFPSSFGNQYILVAVDYVSKWVEAIASPTNDAQV 557
+ +MP + E FM SS G Y+ +D SK+ T D +
Sbjct: 803 HRNTDMPTKTTPKPEHCRE---KFMIDIYSSEGKHYV-SCIDIYSKFATLEEIKTKD-WI 857
Query: 558 VIKMFKKVIFPRFGVPRVVISDGGSHFISRHFEKLLQKLGVRHKIAT 604
K IF + G P+++ +D F S ++ L+ V ++ T
Sbjct: 858 ECKNALMRIFNQLGKPKLLKADRDGAFSSLALKRWLESEEVELQLNT 904
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 266 bits (679), Expect = 2e-70
Identities = 213/706 (30%), Positives = 341/706 (48%), Gaps = 79/706 (11%)
Query: 1 NKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEKTTFTCPFGTFAY 60
N+ T D +P+P +D++L +L K +F +D GF QI + KT F+ G + Y
Sbjct: 271 NEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTKSGHYEY 330
Query: 61 RRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNFDDCLTNLEKVLERCEQ 120
RMPFGL NAPATFQRCM +I + K V++DD + ++ + L +++ V +
Sbjct: 331 LRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLVFTKLAD 390
Query: 121 VNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKIEIIKKMLPPTSVKEIRSFLGHAGF 180
NL L +KC F+ +E LGH+V GI+ + K++ I PT KEIR+FLG G+
Sbjct: 391 ANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAFLGLTGY 450
Query: 181 YRRFIKDFSSITKPLTSLLLKDADF-TFDDSCLQAFCRLKEALITAPIIQPPDWNLPFEI 239
YR+FI +++ I KP+TS L K T ++AF +LK +I PI+Q PD+ F +
Sbjct: 451 YRKFIPNYADIAKPMTSCLKKRTKIDTQKLEYIEAFEKLKALIIRDPILQLPDFEKKFVL 510
Query: 240 MCDASDYAVGAVLGQRNDKKMHAIYYASKTLDGAQVNYATTEKELLAVVYAIDKFRQYLV 299
DAS+ A+GAVL Q H I + S+TL+ ++NY+ EKELLA+V+A FR YL+
Sbjct: 511 TTDASNLALGAVLSQNG----HPISFISRTLNDHELNYSAIEKELLAIVWATKTFRHYLL 566
Query: 300 GSKIIVYTDHSAIKYLLNKKDAKPRLIRWILLLQEFDLEIKDKKGVENVVADHLSRLR-E 358
G + ++ +DH +++L N K+ +L RW + L E+ +I KG EN VAD LSR++ E
Sbjct: 567 GRQFLIASDHQPLRWLHNLKEPGAKLERWRVRLSEYQFKIDYIKGKENSVADALSRIKIE 626
Query: 359 TNKDELPLDDSFPDDQLFLLAQTDAPWYADFVNFLAAGVL---PPELNYQQKKKFFNDLK 415
N S +D L+ T+ P +N+ ++ + + K F N +
Sbjct: 627 ENHHSEATQHSAEEDNSNLIHLTEKP-----INYFKKQIIFIKSDKNKVEHSKIFGNSIT 681
Query: 416 HYYWDEPYLFRRGS---DGIFRRCIP---ENEVS-SILTHCH----SSSY---------- 454
+D L + D R I E++V I+ H +++Y
Sbjct: 682 TIQYDVMTLEKAKQILLDHFIHRNITIYIESDVDFEIVQRAHIEIVNTTYTKVIRSLFLL 741
Query: 455 ---GGHASTQ----KTSFKILHSGFW-WPSLFKDVHLF----------ISKCDKCQRTGS 496
G +A + ++ K+LH G LFK+ H F I++C+ C +
Sbjct: 742 KNVGSYAEFKEIILQSHEKLLHPGIQKMTKLFKENHFFPNSQLLIQNIINECNICNLAKT 801
Query: 497 ITKRNEMPL------NNILEVEIFDVWGIDFMGPFPSSFGNQYILVAVDYVSKWVEAIAS 550
+ +MPL + E + D++ SS G YI +D SK+
Sbjct: 802 EHRNTKMPLKITPNPEHCREKFVVDIY---------SSEGKHYI-SCIDIYSKFATLEQI 851
Query: 551 PTNDAQVVIKMFKKVIFPRFGVPRVVISDGGSHFISRHFEKLLQKLGVRHKIATPYHPQT 610
T D + + IF + G P+++ +D F S ++ L++ V ++ T
Sbjct: 852 KTKD-WIECRNALMRIFNQLGKPKLLKADRDGAFSSLALKRWLEEEEVELQLNT----AK 906
Query: 611 SGQVEVSNRQIKAILEKTVSTSRTDWS----NKLDDALWAYRTAYK 652
+G +V R K I EK + +D +K++ L+ Y K
Sbjct: 907 NGVADV-ERLHKTINEKIRIINSSDDEEVKLSKIETILYTYNQKIK 951
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 257 bits (657), Expect = 8e-68
Identities = 149/392 (38%), Positives = 227/392 (57%), Gaps = 14/392 (3%)
Query: 1 NKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEKTTFTCPFGTFAY 60
N T D +P+P I+ L L +F LD SGF QI + +D KT F+ G + +
Sbjct: 188 NTVTIPDTYPIPDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEF 247
Query: 61 RRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNFDDCLTNLEKVLERCEQ 120
R+PFGL NAPA FQR + I + + K+ V++DD V ++D NL VL +
Sbjct: 248 LRLPFGLKNAPAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSK 307
Query: 121 VNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKIEIIKKMLPPTSVKEIRSFLGHAGF 180
NL +N EK HF+ + LG++V GI+ D K+ I +M PPTSVKE++ FLG +
Sbjct: 308 ANLQVNLEKSHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSY 367
Query: 181 YRRFIKDFSSITKPLTSLL------LKDAD-----FTFDDSCLQAFCRLKEALITAPIIQ 229
YR+FI+D++ + KPLT+L +K + T D++ LQ+F LK L ++ I+
Sbjct: 368 YRKFIQDYAKVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILA 427
Query: 230 PPDWNLPFEIMCDASDYAVGAVLGQRNDKKMHAIYYASKTLDGAQVNYATTEKELLAVVY 289
P + PF + DAS++A+GAVL Q + + I Y S++L+ + NYAT EKE+LA+++
Sbjct: 428 FPCFTKPFHLTTDASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIW 487
Query: 290 AIDKFRQYLVGSKII-VYTDHSAIKYLLNKKDAKPRLIRWILLLQEFDLEIKDKKGVENV 348
++D R YL G+ I VYTDH + + L ++ +L RW ++E++ E+ K G NV
Sbjct: 488 SLDNLRAYLYGAGTIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELIYKPGKSNV 547
Query: 349 VADHLSRLRETNKDELPLD-DSFPDDQLFLLA 379
VAD LSR+ ++L D D+ P+D + LA
Sbjct: 548 VADALSRI-PPQLNQLSTDLDANPEDDMQSLA 578
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 235 bits (599), Expect = 4e-61
Identities = 134/356 (37%), Positives = 196/356 (54%)
Query: 1 NKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEKTTFTCPFGTFAY 60
NK D FPLP ID +L++L + +F LD SGF QI + ++ T+F+ G++ +
Sbjct: 380 NKKLLADKFPLPRIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTSNGSYRF 439
Query: 61 RRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNFDDCLTNLEKVLERCEQ 120
R+PFGL AP +FQR M FS ++MDD V G + L NL +V +C +
Sbjct: 440 TRLPFGLKIAPNSFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEVFGKCRE 499
Query: 121 VNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKIEIIKKMLPPTSVKEIRSFLGHAGF 180
NL L+ EKC F + E LGH D+GI D K ++I+ P R F+ +
Sbjct: 500 YNLKLHPEKCSFFMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRFVAFCNY 559
Query: 181 YRRFIKDFSSITKPLTSLLLKDADFTFDDSCLQAFCRLKEALITAPIIQPPDWNLPFEIM 240
YRRFIK+F+ ++ +T L K+ F + D C +AF LK LI ++Q PD++ F I
Sbjct: 560 YRRFIKNFADYSRHITRLCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSKEFCIT 619
Query: 241 CDASDYAVGAVLGQRNDKKMHAIYYASKTLDGAQVNYATTEKELLAVVYAIDKFRQYLVG 300
DAS A GAVL Q ++ + YAS+ + N +TTE+EL A+ +AI FR Y+ G
Sbjct: 620 TDASKQACGAVLTQNHNGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYIYG 679
Query: 301 SKIIVYTDHSAIKYLLNKKDAKPRLIRWILLLQEFDLEIKDKKGVENVVADHLSRL 356
V TDH + YL + + +L R L L+E++ ++ KG +N VAD LSR+
Sbjct: 680 KHFTVKTDHRPLTYLFSMVNPSSKLTRIRLELEEYNFTVEYLKGKDNHVADALSRI 735
Score = 125 bits (315), Expect = 3e-28
Identities = 95/364 (26%), Positives = 173/364 (47%), Gaps = 23/364 (6%)
Query: 439 ENEVSSILTHCHSSSY-GGHASTQKTSFKILHSGFWWPSLFKDVHLFISKCDKCQRTGSI 497
E E +IL+ H GGH KT K+ ++W ++ K + ++ KC KCQ+ +
Sbjct: 890 EKEKEAILSTLHDDPIQGGHTGITKTLAKVKRH-YYWKNMSKYIKEYVRKCQKCQKAKT- 947
Query: 498 TKRNEMPLNNILEVE-IFDVWGIDFMGPFPSSF-GNQYILVAVDYVSKWVEAIASPTNDA 555
TK + P+ E FD +D +GP P S GN+Y + + ++K++ AI A
Sbjct: 948 TKHTKTPMTITETPEHAFDRVVVDTIGPLPKSENGNEYAVTLICDLTKYLVAIPIANKSA 1007
Query: 556 QVVIKMFKKVIFPRFGVPRVVISDGGSHFISRHFEKLLQKLGVRHKIATPYHPQTSGQVE 615
+ V K + ++G + I+D G+ + + L + L +++ +T +H QT G VE
Sbjct: 1008 KTVAKAIFESFILKYGPMKTFITDMGTEYKNSIITDLCKYLKIKNITSTAHHHQTVGVVE 1067
Query: 616 VSNRQIKAILEKTVSTSRTDWSNKLDDALWAYRTAYKTPIGMTPFKLVYGKSCHLPVELE 675
S+R + + +ST +TDW L ++ + T P++LV+G++ +LP
Sbjct: 1068 RSHRTLNEYIRSYISTDKTDWDVWLQYFVYCFNTTQSMVHNYCPYELVFGRTSNLPKHF- 1126
Query: 676 HKAYWAIRNLNLDPNLAGDKRKLQLNELEELRMDAYENAR----IYKERTKTWHDKKIIK 731
+K + N+D K +L++ AY AR +KE+ K +D K+
Sbjct: 1127 NKLHSIEPIYNIDDYAKESKYRLEV---------AYARARKLLEAHKEKNKENYDLKVKD 1177
Query: 732 RHFKSGDLVLLFNSRLKLFPGKLRSRWSGPFQVRTVYPYGAIEIFSEETGSFTVNGQRLK 791
+ GD VLL N KL +++GP+++ ++ I + + + V+ RLK
Sbjct: 1178 IELEVGDKVLLRNE----VGHKLDFKYTGPYKIESIGDNNNITLLTNKNKKQIVHKDRLK 1233
Query: 792 IYNT 795
+++
Sbjct: 1234 KFHS 1237
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 211 bits (536), Expect = 8e-54
Identities = 126/368 (34%), Positives = 203/368 (54%), Gaps = 17/368 (4%)
Query: 1 NKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEKTTFTCPFGTFAY 60
N+ T D +P+P I +L L K F LD SG+ QI + +D+EKT+F+ G + +
Sbjct: 247 NEKTIPDRYPMPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFSVNGGKYEF 306
Query: 61 RRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNFDDCLTNLEKVLERCEQ 120
R+PFGL NA + FQR + + + + KI V++DD + N D + +++ VL+
Sbjct: 307 CRLPFGLRNASSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHIDTVLKCLID 366
Query: 121 VNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKIEIIKKMLPPTSVKEIRSFLGHAGF 180
N+ ++ EK F LG +V G + D K++ I++ P V ++RSFLG A +
Sbjct: 367 ANMRVSQEKTRFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRSFLGLASY 426
Query: 181 YRRFIKDFSSITKPLTSLL-----------LKDADFTFDDSCLQAFCRLKEALITAPII- 228
YR FIKDF++I +P+T +L K F+++ AF RL+ L + +I
Sbjct: 427 YRVFIKDFAAIARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNILASEDVIL 486
Query: 229 QPPDWNLPFEIMCDASDYAVGAVLGQRNDKKMHAIYYASKTLDGAQVNYATTEKELLAVV 288
+ PD+ PF++ DAS +GAVL Q I S+TL + NYAT E+ELLA+V
Sbjct: 487 KYPDFKKPFDLTTDASASGIGAVLSQEG----RPITMISRTLKQPEQNYATNERELLAIV 542
Query: 289 YAIDKFRQYLVGSK-IIVYTDHSAIKYLLNKKDAKPRLIRWILLLQEFDLEIKDKKGVEN 347
+A+ K + +L GS+ I ++TDH + + + ++ ++ RW + + + ++ K G EN
Sbjct: 543 WALGKLQNFLYGSREINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQHNAKVFYKPGKEN 602
Query: 348 VVADHLSR 355
VAD LSR
Sbjct: 603 FVADALSR 610
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 659
Score = 160 bits (405), Expect = 1e-38
Identities = 115/369 (31%), Positives = 188/369 (50%), Gaps = 12/369 (3%)
Query: 1 NKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEKTTFTCPFGTFAY 60
NKAT+ D LP D++L + + D SG +Q+ + Q T FTCP G + +
Sbjct: 292 NKATKGDAHNLPNKDELLTLVRGKKIYSSFDCKSGLWQVLLDKESQLLTAFTCPQGHYQW 351
Query: 61 RRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNF-DDCLTNLEKVLERCE 119
+PFGL AP+ F + + S+ K V++DD V + + ++ +L RCE
Sbjct: 352 NVVPFGLKQAPSIFPKTYANSHSNQYSKYCCVYVDDILVFSNTGRKEHYIHVLNILRRCE 411
Query: 120 QVNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKI-EIIKKMLPPTSV---KEIRSFL 175
++ ++L+ +K + LG L D+G + I E I K P + K+++ FL
Sbjct: 412 KLGIILSKKKAQLFKEKINFLG-LEIDQGTHCPQNHILEHIHKF--PDRIEDKKQLQRFL 468
Query: 176 GHAGFYRRFIKDFSSITKPLTSLLLKDADFTFDDSCLQAFCRLKEALITAPIIQPPDWNL 235
G + +I +SI KPL S L +D+ +T++D+ Q ++K+ L + P + P+ N
Sbjct: 469 GILTYASDYIPKLASIRKPLQSKLKEDSTWTWNDTDSQYMAKIKKNLKSFPKLYHPEPND 528
Query: 236 PFEIMCDASDYAVGAVLGQRNDKKMHAIYYASKTLDGAQVNYATTEKELLAVVYAIDKFR 295
I DAS+ G +L ++ + YAS + A+ NY + EKELLAV+ I KF
Sbjct: 529 KLVIETDASEEFWGGILKAIHNSHEYICRYASGSFKAAERNYHSNEKELLAVIRVIKKFS 588
Query: 296 QYLVGSKIIVYTDHSAIKYLLN---KKDAKP-RLIRWILLLQEFDLEIKDKKGVENVVAD 351
YL S+ ++ TD+ + +N K D K RL+RW + L ++D +++ G +NV AD
Sbjct: 589 IYLTPSRFLIRTDNKNFTHFVNINLKGDRKQGRLVRWQMWLSQYDFDVEHIAGTKNVFAD 648
Query: 352 HLSRLRETN 360
L TN
Sbjct: 649 FLQENTLTN 657
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 666
Score = 144 bits (364), Expect = 7e-34
Identities = 116/362 (32%), Positives = 179/362 (49%), Gaps = 9/362 (2%)
Query: 1 NKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEKTTFTCPFGTFAY 60
N+AT D LP + ++L L S F D SGF+Q+ + Q+ T FTCP G F +
Sbjct: 303 NQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKSGFWQVVLDEESQKLTAFTCPQGHFQW 362
Query: 61 RRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNFDDCLTNLEKVLERCEQ 120
+ +PFGL AP+ FQR M + + +K V++DD V ++ D ++ VL+ E+
Sbjct: 363 KVVPFGLKQAPSIFQRHMQTALNG-ADKFCMVYVDDIIVFSNSELDHYNHVYAVLKIVEK 421
Query: 121 VNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKI-EIIKKMLPP-TSVKEIRSFLGHA 178
++L+ +K + + +E I L D+G + I E I K K ++ FLG
Sbjct: 422 YGIILSKKKAN-LFKEKINFLGLEIDKGTHCPQNHILENIHKFPDRLEDKKHLQRFLGVL 480
Query: 179 GFYRRFIKDFSSITKPLTSLLLKDADFTFDDSCLQAFCRLKEALITAPIIQPPDWNLPFE 238
+ +I + I KPL L KD + + S ++K+ L + P + P
Sbjct: 481 TYAETYIPKLAEIRKPLQVKLKKDVTWNWTQSDSDYVKKIKKNLGSFPKLYLPKPEDHLI 540
Query: 239 IMCDASDYAVGAVLGQRNDKKMHAI-YYASKTLDGAQVNYATTEKELLAVVYAIDKFRQY 297
I DASD G VL R + I Y+S + A+ NY + +KELLAV I KF Y
Sbjct: 541 IETDASDSFWGGVLKARALDGVELICRYSSGSFKQAEKNYHSNDKELLAVKQVITKFSAY 600
Query: 298 LVGSKIIVYTDHSAIKYLLN---KKDAKP-RLIRWILLLQEFDLEIKDKKGVENVVADHL 353
L + V TD+ Y L K D+K RL+RW ++ +++ +GV+NV+AD L
Sbjct: 601 LTPVRFTVRTDNKNFTYFLRINLKGDSKQGRLVRWQNWFSKYQFDVEHLEGVKNVLADCL 660
Query: 354 SR 355
+R
Sbjct: 661 TR 662
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 142 bits (357), Expect = 5e-33
Identities = 114/371 (30%), Positives = 174/371 (46%), Gaps = 14/371 (3%)
Query: 1 NKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEKTTFTCPFGTFAY 60
NKAT D + LP D++L + F D SGF+Q+ + + T FTCP G + +
Sbjct: 310 NKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEW 369
Query: 61 RRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNFDDCLTNLEKVLERCEQ 120
+PFGL AP+ FQR M F F K V++DD V +N +D L ++ +L++C Q
Sbjct: 370 NVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQ 428
Query: 121 VNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKIEIIKKMLPPT--SVKEIRSFLGHA 178
++L+ +K ++ LG L D G + I P T K+++ FLG
Sbjct: 429 HGIILSKKKAQLFKKKINFLG-LEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGIL 487
Query: 179 GFYRRFIKDFSSITKPLTSLLLKDADFTFDDSCLQAFCRLKEALITAPIIQPPDWNLPFE 238
+ +I + I KPL + L ++ + + ++K+ L P + P
Sbjct: 488 TYASDYIPKLAQIRKPLQAKLKENVPWRWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLI 547
Query: 239 IMCDASDYAVGAVLG----QRNDKKMHAIYYASKTLDGAQVNYATTEKELLAVVYAIDKF 294
I DASD G +L YAS + A+ NY + +KE LAV+ I KF
Sbjct: 548 IETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKNYHSNDKETLAVINTIKKF 607
Query: 295 RQYLVGSKIIVYTDHSAIKYLLN---KKDAK-PRLIRWILLLQEFDLEIKDKKGVENVVA 350
YL ++ TD++ K +N K D+K R IRW L + +++ KG +N A
Sbjct: 608 SIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFA 667
Query: 351 DHLSRLRETNK 361
D LS RE NK
Sbjct: 668 DFLS--REFNK 676
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 141 bits (356), Expect = 6e-33
Identities = 114/371 (30%), Positives = 174/371 (46%), Gaps = 14/371 (3%)
Query: 1 NKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEKTTFTCPFGTFAY 60
NKAT D + LP D++L + F D SGF+Q+ + + T FTCP G + +
Sbjct: 310 NKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEW 369
Query: 61 RRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNFDDCLTNLEKVLERCEQ 120
+PFGL AP+ FQR M F F K V++DD V +N +D L ++ +L++C Q
Sbjct: 370 NVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQ 428
Query: 121 VNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKIEIIKKMLPPT--SVKEIRSFLGHA 178
++L+ +K ++ LG L D G + I P T K+++ FLG
Sbjct: 429 HGIILSKKKAQLFKKKINFLG-LEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGIL 487
Query: 179 GFYRRFIKDFSSITKPLTSLLLKDADFTFDDSCLQAFCRLKEALITAPIIQPPDWNLPFE 238
+ +I + I KPL + L ++ + + ++K+ L P + P
Sbjct: 488 TYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLI 547
Query: 239 IMCDASDYAVGAVLG----QRNDKKMHAIYYASKTLDGAQVNYATTEKELLAVVYAIDKF 294
I DASD G +L YAS + A+ NY + +KE LAV+ I KF
Sbjct: 548 IETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKF 607
Query: 295 RQYLVGSKIIVYTDHSAIKYLLN---KKDAK-PRLIRWILLLQEFDLEIKDKKGVENVVA 350
YL ++ TD++ K +N K D+K R IRW L + +++ KG +N A
Sbjct: 608 SIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFA 667
Query: 351 DHLSRLRETNK 361
D LS RE NK
Sbjct: 668 DFLS--REFNK 676
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 141 bits (356), Expect = 6e-33
Identities = 114/371 (30%), Positives = 174/371 (46%), Gaps = 14/371 (3%)
Query: 1 NKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEKTTFTCPFGTFAY 60
NKAT D + LP D++L + F D SGF+Q+ + + T FTCP G + +
Sbjct: 310 NKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEW 369
Query: 61 RRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNFDDCLTNLEKVLERCEQ 120
+PFGL AP+ FQR M F F K V++DD V +N +D L ++ +L++C Q
Sbjct: 370 NVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQ 428
Query: 121 VNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKIEIIKKMLPPT--SVKEIRSFLGHA 178
++L+ +K ++ LG L D G + I P T K+++ FLG
Sbjct: 429 HGIILSKKKAQLFKKKINFLG-LEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGIL 487
Query: 179 GFYRRFIKDFSSITKPLTSLLLKDADFTFDDSCLQAFCRLKEALITAPIIQPPDWNLPFE 238
+ +I + I KPL + L ++ + + ++K+ L P + P
Sbjct: 488 TYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLI 547
Query: 239 IMCDASDYAVGAVLG----QRNDKKMHAIYYASKTLDGAQVNYATTEKELLAVVYAIDKF 294
I DASD G +L YAS + A+ NY + +KE LAV+ I KF
Sbjct: 548 IETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKF 607
Query: 295 RQYLVGSKIIVYTDHSAIKYLLN---KKDAK-PRLIRWILLLQEFDLEIKDKKGVENVVA 350
YL ++ TD++ K +N K D+K R IRW L + +++ KG +N A
Sbjct: 608 SIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFA 667
Query: 351 DHLSRLRETNK 361
D LS RE NK
Sbjct: 668 DFLS--REFNK 676
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 674
Score = 139 bits (349), Expect = 4e-32
Identities = 110/365 (30%), Positives = 170/365 (46%), Gaps = 12/365 (3%)
Query: 1 NKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEKTTFTCPFGTFAY 60
NKAT D + P D++L + F D SGF+Q+ + + T FTCP G + +
Sbjct: 305 NKATVGDAYNPPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEW 364
Query: 61 RRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNFDDCLTNLEKVLERCEQ 120
+PFGL AP+ FQR M F F K V++DD V +N +D L ++ +L++C Q
Sbjct: 365 NVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQ 423
Query: 121 VNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKIEIIKKMLPPT--SVKEIRSFLGHA 178
++L+ +K ++ LG L D G + I P T K+++ FLG
Sbjct: 424 HGIILSKKKAQLFKKKINFLG-LEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGIL 482
Query: 179 GFYRRFIKDFSSITKPLTSLLLKDADFTFDDSCLQAFCRLKEALITAPIIQPPDWNLPFE 238
+ +I + I KPL + L ++ + + ++K+ L P + P
Sbjct: 483 TYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLI 542
Query: 239 IMCDASDYAVGAVLG----QRNDKKMHAIYYASKTLDGAQVNYATTEKELLAVVYAIDKF 294
I DASD G +L YAS + A+ NY + +KE LAV+ I KF
Sbjct: 543 IETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKNYHSNDKETLAVINTIKKF 602
Query: 295 RQYLVGSKIIVYTDHSAIKYLLN---KKDAK-PRLIRWILLLQEFDLEIKDKKGVENVVA 350
YL ++ TD++ K +N K D+K R IRW L + +++ KG +N A
Sbjct: 603 SIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFA 662
Query: 351 DHLSR 355
D LSR
Sbjct: 663 DFLSR 667
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 680
Score = 137 bits (346), Expect = 9e-32
Identities = 111/371 (29%), Positives = 174/371 (45%), Gaps = 14/371 (3%)
Query: 1 NKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEKTTFTCPFGTFAY 60
NKAT D + LP D++L + F D SGF+Q+ + + T FTCP G + +
Sbjct: 311 NKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEW 370
Query: 61 RRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNFDDCLTNLEKVLERCEQ 120
+PFGL AP+ FQR M F F K V++DD V +N +D L ++ +L++C Q
Sbjct: 371 NVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDIVVFSNNEEDHLLHVAMILQKCNQ 429
Query: 121 VNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKIEIIKKMLPPT--SVKEIRSFLGHA 178
++L+ +K ++ LG L D G + I P T K+++ FLG
Sbjct: 430 HGIILSKKKAQLFKKKINFLG-LEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGIL 488
Query: 179 GFYRRFIKDFSSITKPLTSLLLKDADFTFDDSCLQAFCRLKEALITAPIIQPPDWNLPFE 238
+ +I + + + +PL + L ++ + + ++K+ L P + P
Sbjct: 489 TYASDYIPNLAQMRQPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLI 548
Query: 239 IMCDASDYAVGAVLG----QRNDKKMHAIYYASKTLDGAQVNYATTEKELLAVVYAIDKF 294
I DASD G +L Y S + A+ NY + +KE LAV+ I KF
Sbjct: 549 IETDASDDYWGGMLKAIKINEGTNTELICRYRSGSFKAAERNYHSNDKETLAVINTIKKF 608
Query: 295 RQYLVGSKIIVYTDHSAIKYLLN---KKDAK-PRLIRWILLLQEFDLEIKDKKGVENVVA 350
YL ++ TD++ K +N K D+K R IRW L + +++ KG +N A
Sbjct: 609 SIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFA 668
Query: 351 DHLSRLRETNK 361
D LS RE NK
Sbjct: 669 DFLS--REFNK 677
>POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1165
Score = 126 bits (317), Expect = 2e-28
Identities = 101/322 (31%), Positives = 143/322 (44%), Gaps = 37/322 (11%)
Query: 457 HASTQKTSFKILHSGFWWPSLFKDVHLFISKCDKCQRTGSITKRNEMPLNNILEVEIFDV 516
H +K + + P+L V S+C C T ++T E +
Sbjct: 821 HLGPEKLLQLVNRTSLLIPNLQSAVREVTSQCQACAMTNAVTTYRETGKRQRGDRPGV-Y 879
Query: 517 WGIDFMGPFPSSFGNQYILVAVDYVSKWVEAIASPTNDAQVVIKMFKKVIFPRFGVPRVV 576
W +DF P +GN+Y+LV +D S WVEA + T A +V K + I PRFG+P+V+
Sbjct: 880 WEVDFTEIKPGRYGNKYLLVFIDTFSGWVEAFPTKTETALIVCKKILEEILPRFGIPKVL 939
Query: 577 ISDGGSHFISRHFEKLLQKLGVRHKIATPYHPQTSGQVEVSNRQIKAILEK-TVSTSRTD 635
SD G F+++ + L +LG+ K+ Y PQ+SGQVE NR IK L K + T D
Sbjct: 940 GSDNGPAFVAQVSQGLATQLGINWKLHCAYRPQSSGQVERMNRTIKETLTKLALETGGKD 999
Query: 636 WSNKLDDALWAYRTAYKTP--IGMTPFKLVYGKSCHLPVELEHKAYWAIRNLNLDPNLAG 693
W L AL R TP G+TP++++YG + L L
Sbjct: 1000 WVTLLPLALLRAR---NTPGRFGLTPYEILYGGPPPI--------------LESGETLGP 1042
Query: 694 DKRKL-----QLNELEELRMDAYENAR-IYKERTKTWHDKKIIKRHFKSGDLVLLFNSRL 747
D R L L LE +R ++ + +YK T T I F+ GD VL+ R
Sbjct: 1043 DDRFLPVLFTHLKALEIVRTQIWDQIKEVYKPGTVT------IPHPFQVGDQVLVRRHR- 1095
Query: 748 KLFPGKLRSRWSGPFQVRTVYP 769
P L RW GP+ V P
Sbjct: 1096 ---PSSLEPRWKGPYLVLLTTP 1114
Score = 117 bits (294), Expect = 9e-26
Identities = 84/321 (26%), Positives = 142/321 (44%), Gaps = 11/321 (3%)
Query: 1 NKATRKDHFPLPFIDQMLERLA-KHSHFCYLDGYSGFFQIPIHPNDQEKTTFTCP----- 54
NK + H +P +L L ++ + LD FF + +HPN Q F
Sbjct: 236 NKRVQDIHPTVPNPYNLLSSLPPSYTWYSVLDLKDAFFCLRLHPNSQPLFAFEWKDPEKG 295
Query: 55 -FGTFAYRRMPFGLCNAPATFQRCMMSIFSDF----VEKIMEVFMDDFSVHGSNFDDCLT 109
G + R+P G N+P F + + F + ++ ++DD V ++DC
Sbjct: 296 NTGQLTWTRLPQGFKNSPTLFDEALHRDLAPFRALNPQVVLLQYVDDLLVAAPTYEDCKK 355
Query: 110 NLEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKIEIIKKMLPPTSVK 169
+K+L+ ++ ++ +K RE LG+L+ + + A+ + K+ PT+ +
Sbjct: 356 GTQKLLQELSKLGYRVSAKKAQLCQREVTYLGYLLKEGKRWLTPARKATVMKIPVPTTPR 415
Query: 170 EIRSFLGHAGFYRRFIKDFSSITKPLTSLLLKDADFTFDDSCLQAFCRLKEALITAPIIQ 229
++R FLG AGF R +I F+S+ PL L + F + + QAF +K+AL++AP +
Sbjct: 416 QVREFLGTAGFCRLWIPGFASLAAPLYPLTKESIPFIWTEEHQQAFDHIKKALLSAPALA 475
Query: 230 PPDWNLPFEIMCDASDYAVGAVLGQRNDKKMHAIYYASKTLDGAQVNYATTEKELLAVVY 289
PD PF + D VL Q + Y SK LD + T K + AV
Sbjct: 476 LPDLTKPFTLYIDERAGVARGVLTQTLGPWRRPVAYLSKKLDPVASGWPTCLKAVAAVAL 535
Query: 290 AIDKFRQYLVGSKIIVYTDHS 310
+ + +G + V HS
Sbjct: 536 LLKDADKLTLGQNVTVIASHS 556
>POL_MLVFP (P26808) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1204
Score = 123 bits (308), Expect = 2e-27
Identities = 88/254 (34%), Positives = 128/254 (49%), Gaps = 18/254 (7%)
Query: 517 WGIDFMGPFPSSFGNQYILVAVDYVSKWVEAIASPTNDAQVVIKMFKKVIFPRFGVPRVV 576
W IDF P +G +Y+LV VD S WVEA + A+VV K + IFPRFG+P+V+
Sbjct: 918 WEIDFTEVKPGLYGYKYLLVFVDTFSGWVEAFPTKKETAKVVTKKLLEEIFPRFGMPQVL 977
Query: 577 ISDGGSHFISRHFEKLLQKLGVRHKIATPYHPQTSGQVEVSNRQIKAILEK-TVSTSRTD 635
+D G F+S+ + + LGV K+ Y PQ+SGQVE NR IK L K T++T D
Sbjct: 978 GTDNGPAFVSKVSQTVADLLGVDWKLHCAYRPQSSGQVERMNRTIKETLTKLTLATGSRD 1037
Query: 636 WSNKLDDALWAYRTAYKTPIGMTPFKLVYGKSCHLPVELEHKAYWAIRNLNLDPNLAGDK 695
W L AL+ R P G+TP++++YG P L + + + +P+L
Sbjct: 1038 WVLLLPLALYRARNT-PGPHGLTPYEILYG----APPPLVNFPDPDMAKVTHNPSLQAHL 1092
Query: 696 RKLQLNELEELRMDAYENARIYKERTKTWHDKKIIKRHFKSGDLVLLFNSRLKLFPGKLR 755
+ L L + E R A Y+E+ D+ ++ F+ GD V + + K L
Sbjct: 1093 QALYLVQHEVWR----PLAAAYQEQL----DRPVVPHPFRVGDTVWVRRHQTK----NLE 1140
Query: 756 SRWSGPFQVRTVYP 769
RW GP+ V P
Sbjct: 1141 PRWKGPYTVLLTTP 1154
Score = 100 bits (250), Expect = 1e-20
Identities = 96/424 (22%), Positives = 165/424 (38%), Gaps = 38/424 (8%)
Query: 1 NKATRKDHFPLPFIDQMLERLA-KHSHFCYLDGYSGFFQIPIHPNDQEKTTFTCP----- 54
NK H +P +L L H + LD FF + +HP Q F
Sbjct: 244 NKRVEDIHPTVPNPYNLLSGLPPSHQWYTVLDLKDAFFCLRLHPTSQSLFAFEWRDPEMG 303
Query: 55 -FGTFAYRRMPFGLCNAPATFQRCMMSIFSDF----VEKIMEVFMDDFSVHGSNFDDCLT 109
G + R+P G N+P F + +DF + I+ ++DD + ++ DC
Sbjct: 304 ISGQLTWTRLPQGFKNSPTLFDEALHRDLADFRIQHPDLILLQYVDDLLLAATSELDCQQ 363
Query: 110 NLEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKIEIIKKMLPPTSVK 169
+L+ + + +K ++ LG+L+ + + A+ E + P + +
Sbjct: 364 GTRALLQTLGDLGYRASAKKAQICQKQVKYLGYLLKEGQRWLTEARKETVMGQPTPKTPR 423
Query: 170 EIRSFLGHAGFYRRFIKDFSSITKPLTSLLLKDADFTFDDSCLQAFCRLKEALITAPIIQ 229
++R FLG AGF R +I F+ + PL L F + +A+ +K+AL+TAP +
Sbjct: 424 QLREFLGTAGFCRLWIPGFAEMAAPLYPLTKTGTLFKWGPDQQKAYQEIKQALLTAPALG 483
Query: 230 PPDWNLPFEIMCDASDYAVGAVLGQRNDKKMHAIYYASKTLDGAQVNYATTEKELLAVVY 289
PD PFE+ D VL Q+ + Y SK LD + + + A+
Sbjct: 484 LPDLTKPFELFVDEKQGYAKGVLTQKLGPWRRPVAYLSKKLDPVAAGWPPCLRMVAAIAV 543
Query: 290 AIDKFRQYLVGSKIIVYTDHSAIKYLLNKKD---AKPRLIRWILLLQEFD---------- 336
+ +G +++ H+ + D + R+ + LL + D
Sbjct: 544 LTKDAGKLTMGQPLVILAPHAVEALVKQPPDRWLSNARMTHYQALLLDTDRVQFGPIVTL 603
Query: 337 ----LEIKDKKGVENVVADHLSRLRETNKDELPLDDSFPDDQLFLLAQTDAPWYADFVNF 392
L ++G+++ D L+ T D D PD D WY D +F
Sbjct: 604 NPATLLPLPEEGLQHDCLDILAEAHGTRPD--LTDQPLPD--------ADHTWYTDGSSF 653
Query: 393 LAAG 396
L G
Sbjct: 654 LQEG 657
>POL_MLVF5 (P26810) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1204
Score = 123 bits (308), Expect = 2e-27
Identities = 88/254 (34%), Positives = 128/254 (49%), Gaps = 18/254 (7%)
Query: 517 WGIDFMGPFPSSFGNQYILVAVDYVSKWVEAIASPTNDAQVVIKMFKKVIFPRFGVPRVV 576
W IDF P +G +Y+LV VD S WVEA + A+VV K + IFPRFG+P+V+
Sbjct: 918 WEIDFTEVKPGLYGYKYLLVFVDTFSGWVEAFPTKKETAKVVTKKLLEEIFPRFGMPQVL 977
Query: 577 ISDGGSHFISRHFEKLLQKLGVRHKIATPYHPQTSGQVEVSNRQIKAILEK-TVSTSRTD 635
+D G F+S+ + + LGV K+ Y PQ+SGQVE NR IK L K T++T D
Sbjct: 978 GTDNGPAFVSKVSQTVADLLGVDWKLHCAYRPQSSGQVERMNRTIKETLTKLTLATGSRD 1037
Query: 636 WSNKLDDALWAYRTAYKTPIGMTPFKLVYGKSCHLPVELEHKAYWAIRNLNLDPNLAGDK 695
W L AL+ R P G+TP++++YG P L + + + +P+L
Sbjct: 1038 WVLLLPLALYRARNT-PGPHGLTPYEILYG----APPPLVNFPDPDMAKVTHNPSLQAHL 1092
Query: 696 RKLQLNELEELRMDAYENARIYKERTKTWHDKKIIKRHFKSGDLVLLFNSRLKLFPGKLR 755
+ L L + E R A Y+E+ D+ ++ F+ GD V + + K L
Sbjct: 1093 QALYLVQHEVWR----PLAAAYQEQL----DRPVVPHPFRVGDTVWVRRHQTK----NLE 1140
Query: 756 SRWSGPFQVRTVYP 769
RW GP+ V P
Sbjct: 1141 PRWKGPYTVLLTTP 1154
Score = 99.4 bits (246), Expect = 3e-20
Identities = 95/424 (22%), Positives = 164/424 (38%), Gaps = 38/424 (8%)
Query: 1 NKATRKDHFPLPFIDQMLERLA-KHSHFCYLDGYSGFFQIPIHPNDQEKTTFTCP----- 54
NK H +P +L L H + LD FF + +HP Q F
Sbjct: 244 NKRVEDIHPTVPNPYNLLSGLPPSHQWYTVLDLKDAFFCLRLHPTSQSLFAFEWKDPEMG 303
Query: 55 -FGTFAYRRMPFGLCNAPATFQRCMMSIFSDF----VEKIMEVFMDDFSVHGSNFDDCLT 109
G + R+P G N+P F + +DF + I+ ++DD + ++ DC
Sbjct: 304 ISGQLTWTRLPQGFKNSPTLFDEALHRDLADFRIQHPDLILLQYVDDLLLAATSELDCQQ 363
Query: 110 NLEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKIEIIKKMLPPTSVK 169
+L+ + + +K ++ LG+L+ + + A+ E + P + +
Sbjct: 364 GTRALLQTLGDLGYRASAKKAQICQKQVKYLGYLLKEGQRWLTEARKETVMGQPTPKTPR 423
Query: 170 EIRSFLGHAGFYRRFIKDFSSITKPLTSLLLKDADFTFDDSCLQAFCRLKEALITAPIIQ 229
++R FLG AG R +I F+ + PL L F + +A+ +K+AL+TAP +
Sbjct: 424 QLREFLGTAGLCRLWIPGFAEMAAPLYPLTKTGTLFKWGPDQQKAYQEIKQALLTAPALG 483
Query: 230 PPDWNLPFEIMCDASDYAVGAVLGQRNDKKMHAIYYASKTLDGAQVNYATTEKELLAVVY 289
PD PFE+ D VL Q+ + Y SK LD + + + A+
Sbjct: 484 LPDLTKPFELFVDEKQGYAKGVLTQKLGPWRRPVAYLSKKLDPVAAGWPPCLRMVAAIAV 543
Query: 290 AIDKFRQYLVGSKIIVYTDHSAIKYLLNKKD---AKPRLIRWILLLQEFD---------- 336
+ +G +++ H+ + D + R+ + LL + D
Sbjct: 544 LTKDVGKLTMGQPLVILAPHAVEALVKQPPDRWLSNARMTHYQALLLDTDRVQFGPIVAL 603
Query: 337 ----LEIKDKKGVENVVADHLSRLRETNKDELPLDDSFPDDQLFLLAQTDAPWYADFVNF 392
L ++G+++ D L+ T D D PD D WY D +F
Sbjct: 604 NPATLLPLPEEGLQHDCLDILAEAHGTRPD--LTDQPLPD--------ADHTWYTDGSSF 653
Query: 393 LAAG 396
L G
Sbjct: 654 LQEG 657
>POL_MLVFF (P26809) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1204
Score = 122 bits (307), Expect = 3e-27
Identities = 87/254 (34%), Positives = 128/254 (50%), Gaps = 18/254 (7%)
Query: 517 WGIDFMGPFPSSFGNQYILVAVDYVSKWVEAIASPTNDAQVVIKMFKKVIFPRFGVPRVV 576
W IDF P +G +Y+LV +D S WVEA + A+VV K + IFPRFG+P+V+
Sbjct: 918 WEIDFTEVKPGLYGYKYLLVFIDTFSGWVEAFPTKKETAKVVTKKLLEEIFPRFGMPQVL 977
Query: 577 ISDGGSHFISRHFEKLLQKLGVRHKIATPYHPQTSGQVEVSNRQIKAILEK-TVSTSRTD 635
+D G F+S+ + + LGV K+ Y PQ+SGQVE NR IK L K T++T D
Sbjct: 978 GTDNGPAFVSKVSQTVADLLGVDWKLHCAYRPQSSGQVERMNRTIKETLTKLTLATGSRD 1037
Query: 636 WSNKLDDALWAYRTAYKTPIGMTPFKLVYGKSCHLPVELEHKAYWAIRNLNLDPNLAGDK 695
W L AL+ R P G+TP++++YG P L + + + +P+L
Sbjct: 1038 WVLLLPLALYRARNT-PGPHGLTPYEILYG----APPPLVNFPDPDMAKVTHNPSLQAHL 1092
Query: 696 RKLQLNELEELRMDAYENARIYKERTKTWHDKKIIKRHFKSGDLVLLFNSRLKLFPGKLR 755
+ L L + E R A Y+E+ D+ ++ F+ GD V + + K L
Sbjct: 1093 QALYLVQHEVWR----PLAAAYQEQL----DRPVVPHPFRVGDTVWVRRHQTK----NLE 1140
Query: 756 SRWSGPFQVRTVYP 769
RW GP+ V P
Sbjct: 1141 PRWKGPYTVLLTTP 1154
Score = 100 bits (250), Expect = 1e-20
Identities = 96/424 (22%), Positives = 165/424 (38%), Gaps = 38/424 (8%)
Query: 1 NKATRKDHFPLPFIDQMLERLA-KHSHFCYLDGYSGFFQIPIHPNDQEKTTFTCP----- 54
NK H +P +L L H + LD FF + +HP Q F
Sbjct: 244 NKRVEDIHPTVPNPYNLLSGLPPSHQWYTVLDLKDAFFCLRLHPTSQSLFAFEWRDPEMG 303
Query: 55 -FGTFAYRRMPFGLCNAPATFQRCMMSIFSDF----VEKIMEVFMDDFSVHGSNFDDCLT 109
G + R+P G N+P F + +DF + I+ ++DD + ++ DC
Sbjct: 304 ISGQLTWTRLPQGFKNSPTLFDEALHRDLADFRIQHPDLILLQYVDDLLLAATSELDCQQ 363
Query: 110 NLEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLVFDRGIEVDRAKIEIIKKMLPPTSVK 169
+L+ + + +K ++ LG+L+ + + A+ E + P + +
Sbjct: 364 GTRALLQTLGDLGYRASAKKAQICQKQVKYLGYLLKEGQRWLTEARKETVMGQPTPKTPR 423
Query: 170 EIRSFLGHAGFYRRFIKDFSSITKPLTSLLLKDADFTFDDSCLQAFCRLKEALITAPIIQ 229
++R FLG AGF R +I F+ + PL L F + +A+ +K+AL+TAP +
Sbjct: 424 QLREFLGTAGFCRLWIPGFAEMAAPLYPLTKTGTLFEWGPDQQKAYQEIKQALLTAPALG 483
Query: 230 PPDWNLPFEIMCDASDYAVGAVLGQRNDKKMHAIYYASKTLDGAQVNYATTEKELLAVVY 289
PD PFE+ D VL Q+ + Y SK LD + + + A+
Sbjct: 484 LPDLTKPFELFVDEKQGYAKGVLTQKLGPWRRPVAYLSKKLDPVAAGWPPCLRMVAAIAV 543
Query: 290 AIDKFRQYLVGSKIIVYTDHSAIKYLLNKKD---AKPRLIRWILLLQEFD---------- 336
+ +G +++ H+ + D + R+ + LL + D
Sbjct: 544 LTKDAGKLTMGQPLVILAPHAVEALVKQPPDRWLSNARMTHYQALLLDTDRVQFGPIVAL 603
Query: 337 ----LEIKDKKGVENVVADHLSRLRETNKDELPLDDSFPDDQLFLLAQTDAPWYADFVNF 392
L ++G+++ D L+ T D D PD D WY D +F
Sbjct: 604 NPATLLPLPEEGLQHDCLDILAEAHGTRPD--LTDQPLPD--------ADHTWYTDGSSF 653
Query: 393 LAAG 396
L G
Sbjct: 654 LQEG 657
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.323 0.140 0.431
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 99,647,153
Number of Sequences: 164201
Number of extensions: 4402883
Number of successful extensions: 9652
Number of sequences better than 10.0: 137
Number of HSP's better than 10.0 without gapping: 124
Number of HSP's successfully gapped in prelim test: 13
Number of HSP's that attempted gapping in prelim test: 9267
Number of HSP's gapped (non-prelim): 248
length of query: 811
length of database: 59,974,054
effective HSP length: 118
effective length of query: 693
effective length of database: 40,598,336
effective search space: 28134646848
effective search space used: 28134646848
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 70 (31.6 bits)
Medicago: description of AC146705.12