
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC138199.5 + phase: 0
(617 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 286 1e-76
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 286 1e-76
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 286 1e-76
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 113 2e-24
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 105 3e-22
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 103 2e-21
YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II 102 4e-21
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 97 1e-19
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 97 1e-19
POL_SFV3L (P27401) Pol polyprotein [Contains: Protease (EC 3.4.2... 85 5e-16
POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse transcript... 84 1e-15
POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23... 82 5e-15
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 79 4e-14
POL_MLVAV (P03356) Pol polyprotein [Contains: Protease (EC 3.4.2... 72 3e-12
POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC 3.4.23... 72 3e-12
POL_MLVRK (P31795) Pol polyprotein [Contains: Protease (EC 3.4.2... 70 1e-11
POL_BAEVM (P10272) Pol polyprotein [Contains: Protease (EC 3.4.2... 70 2e-11
POL_MLVRD (P11227) Pol polyprotein [Contains: Protease (EC 3.4.2... 69 4e-11
POL_MLVAK (P03357) Pol polyprotein [Contains: Reverse transcript... 69 5e-11
POL_AVIRE (P03360) Pol polyprotein [Contains: Reverse transcript... 66 2e-10
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 286 bits (732), Expect = 1e-76
Identities = 169/545 (31%), Positives = 287/545 (52%), Gaps = 29/545 (5%)
Query: 7 VAYASRQLRVHEKNYPTHDLELAAVVFVLKIWRHYLYGS--RFEVFSDHKSL--KYLFDQ 62
V Y S ++ + NY D E+ A++ LK WRHYL + F++ +DH++L + +
Sbjct: 735 VGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNES 794
Query: 63 KELNMRQRRWLELLKDYDFGLNYHPGKANVVADALSRKTLHMSAMMVREFELLEQFRDMS 122
+ N R RW L+D++F +NY PG AN +ADALSR +V E E + +
Sbjct: 795 EPENKRLARWQLFLQDFNFEINYRPGSANHIADALSR--------IVDETEPIPK----- 841
Query: 123 LVCEWSPQSVK-LGMLKIDSEFLRSIKEAQKVDVKFVDLLVARDQTEDSDFKFDDQGVLR 181
+ S+ + + I +F + D K ++LL D+ + + + D ++
Sbjct: 842 ---DSEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLIN 898
Query: 182 FRGRICIPDNEEIKKMILEESHRSSLSIHPGATKMYHDLKKIFWWSGLKRDVAQFVYSCL 241
+ +I +P++ ++ + I+++ H IHPG + + + + F W G+++ + ++V +C
Sbjct: 899 SKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCH 958
Query: 242 VCQKSKVEHQKPAGMMVPLDVPEWKWDSISMDFVTSLPNTPRGNDAIWVIVDRLTKSAHF 301
CQ +K + KP G + P+ E W+S+SMDF+T+LP + G +A++V+VDR +K A
Sbjct: 959 TCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPESS-GYNALFVVVDRFSKMAIL 1017
Query: 302 LPINISFPVAQLAEIYIKEIVKLHGVPSSIVSDRDPRFTSRFWKSLQEALGSKLRLSSAY 361
+P S Q A ++ + ++ G P I++D D FTS+ WK ++ S Y
Sbjct: 1018 VPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPY 1077
Query: 362 HPQTDGQSERTIQSLEDLLRICVLEQGGTWDSHLPLIEFTYNNSYHSSIGMAPFEALYGR 421
PQTDGQ+ERT Q++E LLR TW H+ L++ +YNN+ HS+ M PFE ++
Sbjct: 1078 RPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVH-- 1135
Query: 422 RCRIPLCWFESGERVVLGPAIVQQTTEKVQMIQEKIKASQSRQKTYHDKRRKDL-EFQEG 480
R L E Q+T + Q ++E + + + K Y D + +++ EFQ G
Sbjct: 1136 RYSPALSPLELPSFSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPG 1195
Query: 481 DHVFLRVTPMTGVGRALKSKKLTPKFIGPYQILERVGTVAYRVGLPPHLSNL-HNVFHVS 539
D V ++ T G KS KL P F GP+ +L++ G Y + LP + ++ + FHVS
Sbjct: 1196 DLVMVK---RTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVS 1252
Query: 540 QLQKY 544
L+KY
Sbjct: 1253 HLEKY 1257
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 286 bits (732), Expect = 1e-76
Identities = 169/545 (31%), Positives = 287/545 (52%), Gaps = 29/545 (5%)
Query: 7 VAYASRQLRVHEKNYPTHDLELAAVVFVLKIWRHYLYGS--RFEVFSDHKSL--KYLFDQ 62
V Y S ++ + NY D E+ A++ LK WRHYL + F++ +DH++L + +
Sbjct: 735 VGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNES 794
Query: 63 KELNMRQRRWLELLKDYDFGLNYHPGKANVVADALSRKTLHMSAMMVREFELLEQFRDMS 122
+ N R RW L+D++F +NY PG AN +ADALSR +V E E + +
Sbjct: 795 EPENKRLARWQLFLQDFNFEINYRPGSANHIADALSR--------IVDETEPIPK----- 841
Query: 123 LVCEWSPQSVK-LGMLKIDSEFLRSIKEAQKVDVKFVDLLVARDQTEDSDFKFDDQGVLR 181
+ S+ + + I +F + D K ++LL D+ + + + D ++
Sbjct: 842 ---DSEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLIN 898
Query: 182 FRGRICIPDNEEIKKMILEESHRSSLSIHPGATKMYHDLKKIFWWSGLKRDVAQFVYSCL 241
+ +I +P++ ++ + I+++ H IHPG + + + + F W G+++ + ++V +C
Sbjct: 899 SKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCH 958
Query: 242 VCQKSKVEHQKPAGMMVPLDVPEWKWDSISMDFVTSLPNTPRGNDAIWVIVDRLTKSAHF 301
CQ +K + KP G + P+ E W+S+SMDF+T+LP + G +A++V+VDR +K A
Sbjct: 959 TCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPESS-GYNALFVVVDRFSKMAIL 1017
Query: 302 LPINISFPVAQLAEIYIKEIVKLHGVPSSIVSDRDPRFTSRFWKSLQEALGSKLRLSSAY 361
+P S Q A ++ + ++ G P I++D D FTS+ WK ++ S Y
Sbjct: 1018 VPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPY 1077
Query: 362 HPQTDGQSERTIQSLEDLLRICVLEQGGTWDSHLPLIEFTYNNSYHSSIGMAPFEALYGR 421
PQTDGQ+ERT Q++E LLR TW H+ L++ +YNN+ HS+ M PFE ++
Sbjct: 1078 RPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVH-- 1135
Query: 422 RCRIPLCWFESGERVVLGPAIVQQTTEKVQMIQEKIKASQSRQKTYHDKRRKDL-EFQEG 480
R L E Q+T + Q ++E + + + K Y D + +++ EFQ G
Sbjct: 1136 RYSPALSPLELPSFSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPG 1195
Query: 481 DHVFLRVTPMTGVGRALKSKKLTPKFIGPYQILERVGTVAYRVGLPPHLSNL-HNVFHVS 539
D V ++ T G KS KL P F GP+ +L++ G Y + LP + ++ + FHVS
Sbjct: 1196 DLVMVK---RTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVS 1252
Query: 540 QLQKY 544
L+KY
Sbjct: 1253 HLEKY 1257
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 286 bits (732), Expect = 1e-76
Identities = 169/545 (31%), Positives = 287/545 (52%), Gaps = 29/545 (5%)
Query: 7 VAYASRQLRVHEKNYPTHDLELAAVVFVLKIWRHYLYGS--RFEVFSDHKSL--KYLFDQ 62
V Y S ++ + NY D E+ A++ LK WRHYL + F++ +DH++L + +
Sbjct: 735 VGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNES 794
Query: 63 KELNMRQRRWLELLKDYDFGLNYHPGKANVVADALSRKTLHMSAMMVREFELLEQFRDMS 122
+ N R RW L+D++F +NY PG AN +ADALSR +V E E + +
Sbjct: 795 EPENKRLARWQLFLQDFNFEINYRPGSANHIADALSR--------IVDETEPIPK----- 841
Query: 123 LVCEWSPQSVK-LGMLKIDSEFLRSIKEAQKVDVKFVDLLVARDQTEDSDFKFDDQGVLR 181
+ S+ + + I +F + D K ++LL D+ + + + D ++
Sbjct: 842 ---DSEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLIN 898
Query: 182 FRGRICIPDNEEIKKMILEESHRSSLSIHPGATKMYHDLKKIFWWSGLKRDVAQFVYSCL 241
+ +I +P++ ++ + I+++ H IHPG + + + + F W G+++ + ++V +C
Sbjct: 899 SKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCH 958
Query: 242 VCQKSKVEHQKPAGMMVPLDVPEWKWDSISMDFVTSLPNTPRGNDAIWVIVDRLTKSAHF 301
CQ +K + KP G + P+ E W+S+SMDF+T+LP + G +A++V+VDR +K A
Sbjct: 959 TCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPESS-GYNALFVVVDRFSKMAIL 1017
Query: 302 LPINISFPVAQLAEIYIKEIVKLHGVPSSIVSDRDPRFTSRFWKSLQEALGSKLRLSSAY 361
+P S Q A ++ + ++ G P I++D D FTS+ WK ++ S Y
Sbjct: 1018 VPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPY 1077
Query: 362 HPQTDGQSERTIQSLEDLLRICVLEQGGTWDSHLPLIEFTYNNSYHSSIGMAPFEALYGR 421
PQTDGQ+ERT Q++E LLR TW H+ L++ +YNN+ HS+ M PFE ++
Sbjct: 1078 RPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVH-- 1135
Query: 422 RCRIPLCWFESGERVVLGPAIVQQTTEKVQMIQEKIKASQSRQKTYHDKRRKDL-EFQEG 480
R L E Q+T + Q ++E + + + K Y D + +++ EFQ G
Sbjct: 1136 RYSPALSPLELPSFSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPG 1195
Query: 481 DHVFLRVTPMTGVGRALKSKKLTPKFIGPYQILERVGTVAYRVGLPPHLSNL-HNVFHVS 539
D V ++ T G KS KL P F GP+ +L++ G Y + LP + ++ + FHVS
Sbjct: 1196 DLVMVK---RTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVS 1252
Query: 540 QLQKY 544
L+KY
Sbjct: 1253 HLEKY 1257
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 113 bits (282), Expect = 2e-24
Identities = 129/527 (24%), Positives = 229/527 (42%), Gaps = 31/527 (5%)
Query: 7 VAYASRQLRVHEKNYPTHDLELAAVVFVLKIWRHYLYGSRFEVFSDHKSLKYLFDQKELN 66
+A+AS+ L E Y DLE A++F L+ ++ +YG+ VF+DHK L L L
Sbjct: 1271 IAFASKALSPAETRYHITDLEALAMMFALRRFKTIIYGTAITVFTDHKPLISLLKGSPLA 1330
Query: 67 MRQRRWLELLKDYDFGLNYHPGKANVVADALSRKTLHMSAMMVREFELLEQFRD--MSLV 124
R RW + ++D + Y GKAN VADALSR + + + + L + + +
Sbjct: 1331 DRLWRWSIEILEFDVKIVYLAGKANAVADALSRGGCPPNELEEEQTKELTSIVNAIQTEL 1390
Query: 125 CEWSPQSVKLGMLKIDSEFLRSIKEA-QKVDVKFVDLLVARDQTEDSDFKFDDQGVLRF- 182
+ S L LK + E + + A + K +V + ++ GVL+
Sbjct: 1391 PDILDSSCWLERLKGEDEGWKEVIAALEGGKTKGTFKIVGIESEISLEYYKIVGGVLKNT 1450
Query: 183 ----RGRICIPDNEEIKKMILEESHRSSLSIHPGATKMYHDLKKIFWWSGLKRDVAQFVY 238
+ R +P E+I+ +L+E H L+ H G KM+ + + F+W ++ V V
Sbjct: 1451 EIEEQSRSVVP--EKIRTPLLKELHEGMLAGHFGIKKMWRMVHRKFYWPQMRVCVENCVR 1508
Query: 239 SCLVCQKSKVEHQKPAGMMVPLDVPEWKWDSISMDFVTSLPNTPRGNDAIWVIVDRLTKS 298
+C C + +H K + P + + + ++ D + + + +GN I I+D TK
Sbjct: 1509 TCAKCLCAN-DHSKLTSSLTPYRM-TFPLEIVACDLM-DVGLSVQGNRYILTIIDLFTKY 1565
Query: 299 AHFLPINISFPVAQLAEIYIKEIVKLHGVPSSIVSDRDPRFTSRFWKSLQEALGSKLRLS 358
+PI L + + +P +++D+ F + + L + +
Sbjct: 1566 GTAVPIPDKKAETVLKAFVERWAIGEGRIPLKLLTDQGKEFVNGLFAQFTHMLKIEHITT 1625
Query: 359 SAYHPQTDGQSERTIQSLEDLLRICVLEQGGTWDSHLPLIEFTYNNSYHSSIGMAPFEAL 418
Y+ + +G ER +++ +++ WD + + YNN H + G P +
Sbjct: 1626 KGYNSRANGAVERFNKTIMHIMKKKTAVP-MEWDDQVVYAVYAYNNCVHENTGETPMFLM 1684
Query: 419 YGRRCRIPLCWFESGERVV-LGPA-------IVQQTTEKVQMI-QEKIKASQSRQKTYHD 469
+GR PL SGE V + A ++ Q KVQ I +E Q K+ D
Sbjct: 1685 HGRDVMGPL--EMSGEDAVGINYADMDEYKHLLTQELLKVQKIAKEHAMREQESYKSLFD 1742
Query: 470 KR---RKDLEFQEGDHVFLRVTPMTGVGRALKSKKLTPKFIGPYQIL 513
++ +K Q G V L + P +G + KL K+ GPY+++
Sbjct: 1743 QKYASKKHRFPQPGSRVLLEI-PSEKLG--AQCPKLVNKWSGPYRVI 1786
Score = 33.9 bits (76), Expect = 1.3
Identities = 38/160 (23%), Positives = 69/160 (42%), Gaps = 27/160 (16%)
Query: 431 ESGERVVLGPAIVQQTTEKVQMIQEKIKASQSRQKTYHDKRRKDLEFQEGDHVFLRVTPM 490
E E ++ I+ +TT K++ ++E+ DKR+K+ +F+E D R
Sbjct: 130 EKSEELMQKSQILVETTLKLKAVEEE-----------RDKRKKEEQFREAD---ARSNNY 175
Query: 491 TGVGRALKSKKLTPKFIGPYQILE--------RVGTVAYRVGLPPHLSNLHNVFHVSQLQ 542
G S + K QI++ R+ T A R+G SN+ N ++
Sbjct: 176 ARKGEI--SSNIEQKNHQNIQIMDTRCTTSSSRMNTPAQRIGENLSTSNVGNNVVRETVR 233
Query: 543 KYVPDPSHVIQSDDVQVRDNLTVE---TLPVRIDDRKVKT 579
+Y + +++ +V D++ E T VR D +V+T
Sbjct: 234 EYCEETGEILEDFEVNQNDSVLTERNVTGSVRNGDSQVQT 273
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 105 bits (263), Expect = 3e-22
Identities = 81/335 (24%), Positives = 155/335 (46%), Gaps = 19/335 (5%)
Query: 190 DNEEIKKMILEESHRSSLSI-HPGATKMYHDLKKIFWWSGLKRDVAQFVYSCLVCQKSKV 248
+NE+ K+ IL H + H G TK +K+ ++W + + + ++V C CQK+K
Sbjct: 888 NNEKEKEAILSTLHDDPIQGGHTGITKTLAKVKRHYYWKNMSKYIKEYVRKCQKCQKAKT 947
Query: 249 EHQKPAGMMVPLDVPEWKWDSISMDFVTSLPNTPRGNDAIWVIVDRLTKSAHFLPINISF 308
M + + PE +D + +D + LP + GN+ ++ LTK +PI +
Sbjct: 948 TKHTKTPMTIT-ETPEHAFDRVVVDTIGPLPKSENGNEYAVTLICDLTKYLVAIPI-ANK 1005
Query: 309 PVAQLAEIYIKEIVKLHGVPSSIVSDRDPRFTSRFWKSLQEALGSKLRLSSAYHPQTDGQ 368
+A+ + + +G + ++D + + L + L K S+A+H QT G
Sbjct: 1006 SAKTVAKAIFESFILKYGPMKTFITDMGTEYKNSIITDLCKYLKIKNITSTAHHHQTVGV 1065
Query: 369 SERTIQSLEDLLRICVLEQGGTWDSHLPLIEFTYNNSYHSSIGMAPFEALYGRRCRIPLC 428
ER+ ++L + +R + WD L + +N + P+E ++GR +P
Sbjct: 1066 VERSHRTLNEYIRSYISTDKTDWDVWLQYFVYCFNTTQSMVHNYCPYELVFGRTSNLPKH 1125
Query: 429 W--FESGERVVLGPAIVQQTTEKVQM----IQEKIKASQSRQKTYHDKRRKDLEFQEGDH 482
+ S E + +++ ++++ ++ ++A + + K +D + KD+E + GD
Sbjct: 1126 FNKLHSIEPIYNIDDYAKESKYRLEVAYARARKLLEAHKEKNKENYDLKVKDIELEVGDK 1185
Query: 483 VFLRVTPMTGVGRALKSKKLTPKFIGPYQILERVG 517
V LR VG KL K+ GPY+I E +G
Sbjct: 1186 VLLR----NEVGH-----KLDFKYTGPYKI-ESIG 1210
Score = 71.2 bits (173), Expect = 7e-12
Identities = 41/96 (42%), Positives = 56/96 (57%)
Query: 7 VAYASRQLRVHEKNYPTHDLELAAVVFVLKIWRHYLYGSRFEVFSDHKSLKYLFDQKELN 66
VAYASR E N T + ELAA+ + + +R Y+YG F V +DH+ L YLF +
Sbjct: 642 VAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYIYGKHFTVKTDHRPLTYLFSMVNPS 701
Query: 67 MRQRRWLELLKDYDFGLNYHPGKANVVADALSRKTL 102
+ R L++Y+F + Y GK N VADALSR T+
Sbjct: 702 SKLTRIRLELEEYNFTVEYLKGKDNHVADALSRITI 737
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 103 bits (256), Expect = 2e-21
Identities = 110/458 (24%), Positives = 187/458 (40%), Gaps = 69/458 (15%)
Query: 2 QEGKVVAYASRQLRVHEKNYPTHDLELAAVVFVLKIWRHYLYGSRFEVFSDHKSLKYLFD 61
Q+G ++Y SR L HE NY T + EL A+V+ K +RHYL G FE+ SDH+ L +L+
Sbjct: 526 QDGHPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGRHFEISSDHQPLSWLYR 585
Query: 62 QKELNMRQRRWLELLKDYDFGLNYHPGKANVVADALSRKTLHMSAMMVR-EFELLEQFRD 120
K+ N + RW L ++DF + Y GK N VADALSR L + + + + E D
Sbjct: 586 MKDPNSKLTRWRVKLSEFDFDIKYIKGKENCVADALSRIKLEETYLSEQTQHSAEEDNSD 645
Query: 121 MSLVCE-------------WSPQSVKLG--MLKIDSEFLRSIKEAQKVDVKFVDLLVARD 165
+ + E P +K+ K ++ I +K + +D +
Sbjct: 646 LIFITERPLNTFNRQVIFSKGPPDIKVTKYFKKHITQIFYDIMTREKAEQYLIDHFCGKK 705
Query: 166 QT----EDSDFK---------FDDQGVLRFRGRIC---IPDNEEIKKMILEESHRSSLSI 209
D+DF+ + + R I I E K++IL + +
Sbjct: 706 SALYIESDADFEVIQAAHKLAINTKYTKILRSTILLKNITTYAEFKELILTAHEK---LL 762
Query: 210 HPGATKMYHDLKKIFWWSGLKRDVAQFVYSCLVCQKSKVEHQKPAGMMVPLDVPEWKWDS 269
HPG K + +++ + + + C +C +K EH+ PE +
Sbjct: 763 HPGIQKTTKLFGETYYFPNSQLLIQNIINECSICNLAKTEHRNTDMPTKTTPKPEHCREK 822
Query: 270 ISMDFVTSLPNTPRGNDAIWVIVDRLTKSAHFLP-INISFPVAQLAEIYIKEIVKLH--- 325
+D +S + H++ I+I A L EI K+ ++
Sbjct: 823 FMIDIYSS-------------------EGKHYVSCIDIYSKFATLEEIKTKDWIECKNAL 863
Query: 326 -------GVPSSIVSDRDPRFTSRFWKSLQEALGSKLRLSSAYHPQTDGQSERTIQSLED 378
G P + +DRD F+S K E+ +L+L++ D ER +++ +
Sbjct: 864 MRIFNQLGKPKLLKADRDGAFSSLALKRWLESEEVELQLNTTKTGVAD--IERLHKTINE 921
Query: 379 LLRIC-VLEQGGTWDSHLPLIEFTYNN-SYHSSIGMAP 414
+RI + T S + + YN+ + H + G P
Sbjct: 922 KIRIIKTSDDEETKLSKMETVLNIYNHKTKHDTTGQTP 959
>YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II
Length = 1268
Score = 102 bits (253), Expect = 4e-21
Identities = 78/297 (26%), Positives = 138/297 (46%), Gaps = 25/297 (8%)
Query: 178 GVLRFRGRICIPDNEEIKKMILEESHRSSLSIHPGATKMYHDLKKIFWWSGLKRDVAQFV 237
G L R+ +P + ++K++L++ H HPG +M + +W GL D+ V
Sbjct: 768 GCLLLDDRVIVP--KSLQKIVLKQLHEG----HPGIVQMKQKARSFVFWRGLDSDIENMV 821
Query: 238 YSCLVCQK-SKVEHQKPAGMMVPLDVPEWKWDSISMDFVTSLPNTPRGNDAIWVIVDRLT 296
C CQ+ SK+ P P VPE W I +DF P + V+VD T
Sbjct: 822 RHCNNCQENSKMPRVVPLN---PWPVPEAPWKRIHIDFAG-----PLNGCYLLVVVDAKT 873
Query: 297 KSAHFLPINISFPVAQLAEI-YIKEIVKLHGVPSSIVSDRDPRFTSRFWKSLQEALGSKL 355
K A + ++ ++ + I ++EI +HG P +I+SD + TS + + ++ G +
Sbjct: 874 KYAE---VKLTRSISAVTTIDLLEEIFSIHGYPETIISDNGTQLTSHLFAQMCQSHGIEH 930
Query: 356 RLSSAYHPQTDGQSERTIQSLEDLLRICVLEQGGTWDSHLPLIEFTYNNSYHSSI-GMAP 414
+ S+ Y+P+++G +ER + +L+ + + +G L +Y N+ HS++ G P
Sbjct: 931 KTSAVYYPRSNGAAERFVDTLKRGI-AKIKGEGSVNQQILNKFLISYRNTPHSALNGSTP 989
Query: 415 FEALYGRRCRIPLCWFESGERVVLGPAIVQQTTEKVQMIQ----EKIKASQSRQKTY 467
E +GR+ R + +RV+ P + Q + + KA Q QK Y
Sbjct: 990 AECHFGRKIRTTMSLLMPTDRVLKVPKLTQYQQNMKHHYELRNGARAKAFQVNQKVY 1046
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 97.4 bits (241), Expect = 1e-19
Identities = 110/464 (23%), Positives = 191/464 (40%), Gaps = 80/464 (17%)
Query: 2 QEGKVVAYASRQLRVHEKNYPTHDLELAAVVFVLKIWRHYLYGSRFEVFSDHKSLKYLFD 61
Q G +++ SR L HE NY + EL A+V+ K +RHYL G +F + SDH+ L++L +
Sbjct: 525 QNGHPISFISRTLNDHELNYSAIEKELLAIVWATKTFRHYLLGRQFLIASDHQPLRWLHN 584
Query: 62 QKELNMRQRRWLELLKDYDFGLNYHPGKANVVADALSR---------------------K 100
KE + RW L +Y F ++Y GK N VADALSR
Sbjct: 585 LKEPGAKLERWRVRLSEYQFKIDYIKGKENSVADALSRIKIEENHHSEATQHSAEEDNSN 644
Query: 101 TLHMSAMMVREF--------------ELLEQFRDMSLVCEWSPQSV-KLGMLKIDSEFLR 145
+H++ + F E + F + ++ ++ K + +D R
Sbjct: 645 LIHLTEKPINYFKKQIIFIKSDKNKVEHSKIFGNSITTIQYDVMTLEKAKQILLDHFIHR 704
Query: 146 SIKEAQKVDVKFVDLLVARDQTEDSDFKFDDQGVLRFRGRICIPDNEEIKKMILEESHRS 205
+I + DV F + A + ++ + + + + + E K++IL +SH
Sbjct: 705 NITIYIESDVDFEIVQRAHIEIVNTTYTKVIRSLFLLKN---VGSYAEFKEIIL-QSHEK 760
Query: 206 SLSIHPGATKMYHDLKKIFWWSGLKRDVAQFVYSCLVCQKSKVEHQKPAGMMVPLDVPEW 265
L HPG KM K+ ++ + + + C +C +K EH+ + PE
Sbjct: 761 LL--HPGIQKMTKLFKENHFFPNSQLLIQNIINECNICNLAKTEHRNTKMPLKITPNPEH 818
Query: 266 KWDSISMDFVTSLPNTPRGNDAIWVIVDRLTKSAHFLP-INISFPVAQLAEIYIKEIVKL 324
+ +D +S + H++ I+I A L +I K+ ++
Sbjct: 819 CREKFVVDIYSS-------------------EGKHYISCIDIYSKFATLEQIKTKDWIEC 859
Query: 325 H----------GVPSSIVSDRDPRFTSRFWKSLQEALGSKLRLSSAYHPQTDGQSERTIQ 374
G P + +DRD F+S K E +L+L++A + D ER +
Sbjct: 860 RNALMRIFNQLGKPKLLKADRDGAFSSLALKRWLEEEEVELQLNTAKNGVAD--VERLHK 917
Query: 375 SLEDLLRICVLEQGGTWDSHLPLIE---FTYNNSY-HSSIGMAP 414
++ + +RI + + L IE +TYN H + G P
Sbjct: 918 TINEKIRI--INSSDDEEVKLSKIETILYTYNQKIKHDTTGQRP 959
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 97.1 bits (240), Expect = 1e-19
Identities = 100/455 (21%), Positives = 192/455 (41%), Gaps = 50/455 (10%)
Query: 2 QEGKVVAYASRQLRVHEKNYPTHDLELAAVVFVLKIWRHYLYGSR-FEVFSDHKSLKYLF 60
QEG+ + SR L+ E+NY T++ EL A+V+ L +++LYGSR +F+DH+ L +
Sbjct: 512 QEGRPITMISRTLKQPEQNYATNERELLAIVWALGKLQNFLYGSREINIFTDHQPLTFAV 571
Query: 61 DQKELNMRQRRWLELLKDYDFGLNYHPGKANVVADALSRK--------------TLHMSA 106
+ N + +RW + ++ + Y PGK N VADALSR+ T+H
Sbjct: 572 ADRNTNAKIKRWKSYIDQHNAKVFYKPGKENFVADALSRQNLNALQNEPQSDAATIHSEL 631
Query: 107 MMVREFELLEQ----FRDMSLVCEWSPQSVKLGMLKIDSE------------FLRSIKEA 150
+ E ++ FR+ ++ E + +K ++ S+ L+++KE
Sbjct: 632 SLTYTVETTDKPLNCFRN-QIILEAARFPLKRNLVLFRSKSRHLISFTDKSWLLKTLKEV 690
Query: 151 QKVDVK---FVDLLVARDQTEDSDFKFDDQGVLRFRGRIC-IPDNEEIKKMILEESHRSS 206
DV DL D F + + I D E +++ E +R+
Sbjct: 691 VNPDVVNAIHCDLPTLASFQHDLIAHFPATQFRHCKNVVLDITDKNEQIEIVTAEHNRA- 749
Query: 207 LSIHPGATKMYHDLKKIFWWSGLKRDVAQFVYSCLVCQKSKVEHQKPAGMMVPLDVPEWK 266
H A + + + +++ + + V +C VC ++K + + +P +
Sbjct: 750 ---HRAAQENIKQVLRDYYFPKMGSLAKEVVANCRVCTQAKYDRHPKKQELGETPIPSYT 806
Query: 267 WDSISMDFVTSLPNTPRGNDAIWVIVDRLTKSAHFLPINISFPVAQLAEIYIKEIVKLHG 326
+ + +D ++ +D+ +K A P+ +S + + + +I+ L
Sbjct: 807 GEMVHIDIFST------DRKLFLTCIDKFSKYAIVQPV-VSRTIVDITAPLL-QIINLFP 858
Query: 327 VPSSIVSDRDPRFTSRFWKS-LQEALGSKLRLSSAYHPQTDGQSERTIQSLEDLLRICVL 385
++ D +P F S S L+ + G + + H ++GQ ER +L ++ R L
Sbjct: 859 NIKTVYCDNEPAFNSETVTSMLKNSFGIDIVNAPPLHSSSNGQVERFHSTLAEIARCLKL 918
Query: 386 EQGGTWDSHLPL-IEFTYNNSYHSSIGMAPFEALY 419
++ L L YN + HS P E ++
Sbjct: 919 DKKTNDTVELILRATIEYNKTVHSVTRERPIEVVH 953
>POL_SFV3L (P27401) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1157
Score = 85.1 bits (209), Expect = 5e-16
Identities = 60/242 (24%), Positives = 107/242 (43%), Gaps = 9/242 (3%)
Query: 179 VLRFRGRICIPDNEEIKKMILEESHRSSLSIHPGATKMYHDLKKIFWWSGLKRDVAQFVY 238
V R G+ IP + ++IL+ + + H G + + +WW L++DV + +
Sbjct: 803 VTRPNGKRIIPPKSDRPQIILQAHNIA----HTGRDSTFLKVSSKYWWPNLRKDVVKVIR 858
Query: 239 SCLVCQKSKVEHQKPAGMMVPLDVPEWKWDSISMDFVTSLPNTPRGNDAIWVIVDRLTKS 298
C C + ++ P + P +D +D++ LP + G + V+VD +T
Sbjct: 859 QCKQCLVTNAATLAAPPILRP-ERPVKPFDKFFIDYIGPLPPS-NGYLHVLVVVDSMTGF 916
Query: 299 AHFLPINISFPVAQLAEIYIKEIVKLHGVPSSIVSDRDPRFTSRFWKSLQEALGSKLRLS 358
P A + + + + VP I SD+ FTS + + G +L S
Sbjct: 917 VWLYPTKAPSTSATVKALNMLTSI---AVPKVIHSDQGAAFTSATFADWAKNKGIQLEFS 973
Query: 359 SAYHPQTDGQSERTIQSLEDLLRICVLEQGGTWDSHLPLIEFTYNNSYHSSIGMAPFEAL 418
+ YHPQ+ G+ ER ++ LL ++ + W LP+++ NNSY S P + L
Sbjct: 974 TPYHPQSSGKVERKNSDIKRLLTKLLVGRPAKWYDLLPVVQLALNNSYSPSSKYTPHQLL 1033
Query: 419 YG 420
+G
Sbjct: 1034 FG 1035
>POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)]
Length = 886
Score = 83.6 bits (205), Expect = 1e-15
Identities = 57/242 (23%), Positives = 110/242 (44%), Gaps = 9/242 (3%)
Query: 179 VLRFRGRICIPDNEEIKKMILEESHRSSLSIHPGATKMYHDLKKIFWWSGLKRDVAQFVY 238
V R G IP + +K++L+ + + H G + ++WW +++DV + +
Sbjct: 593 VSRPEGVKIIPPQSDRQKIVLQAHNLA----HTGREATLLKIANLYWWPNMRKDVVKQLG 648
Query: 239 SCLVCQKSKVEHQKPAGMMVPLDVPEWKWDSISMDFVTSLPNTPRGNDAIWVIVDRLTKS 298
C C + + K +G ++ D P+ +D +D++ LP + +G + V+VD +T
Sbjct: 649 RCQQCLITNASN-KASGPILRPDRPQKPFDKFFIDYIGPLPPS-QGYLYVLVVVDGMTGF 706
Query: 299 AHFLPINISFPVAQLAEIYIKEIVKLHGVPSSIVSDRDPRFTSRFWKSLQEALGSKLRLS 358
P A + + + + +P I SD+ FTS + + G L S
Sbjct: 707 TWLYPTKAPSTSATVKSLNVLTSI---AIPKVIHSDQGAAFTSSTFAEWAKERGIHLEFS 763
Query: 359 SAYHPQTDGQSERTIQSLEDLLRICVLEQGGTWDSHLPLIEFTYNNSYHSSIGMAPFEAL 418
+ YHPQ+ + ER ++ LL ++ + W LP+++ NN+Y + P + L
Sbjct: 764 TPYHPQSGSKVERKNSDIKRLLTKLLVGRPTKWYDLLPVVQLALNNTYSPVLKYTPHQLL 823
Query: 419 YG 420
+G
Sbjct: 824 FG 825
>POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1161
Score = 81.6 bits (200), Expect = 5e-15
Identities = 53/226 (23%), Positives = 101/226 (44%), Gaps = 8/226 (3%)
Query: 195 KKMILEESHRSSLSIHPGATKMYHDLKKIFWWSGLKRDVAQFVYSCLVCQKSKVEHQKPA 254
++ I+ +H + H G + + +WW L++DV + + C C + +
Sbjct: 816 REKIISTAHNIA---HTGRDATFLKVSSKYWWPNLRKDVVKSIRQCKQCLVTNATNLTSP 872
Query: 255 GMMVPLDVPEWKWDSISMDFVTSLPNTPRGNDAIWVIVDRLTKSAHFLPINISFPVAQLA 314
++ P+ P +D +D++ LP + G + V+VD +T P A +
Sbjct: 873 PILRPVK-PLKPFDKFYIDYIGPLPPS-NGYLHVLVVVDSMTGFVWLYPTKAPSTSATVK 930
Query: 315 EIYIKEIVKLHGVPSSIVSDRDPRFTSRFWKSLQEALGSKLRLSSAYHPQTDGQSERTIQ 374
+ + + +P + SD+ FTS + + G +L S+ YHPQ+ G+ ER
Sbjct: 931 ALNMLTSI---AIPKVLHSDQGAAFTSSTFADWAKEKGIQLEFSTPYHPQSSGKVERKNS 987
Query: 375 SLEDLLRICVLEQGGTWDSHLPLIEFTYNNSYHSSIGMAPFEALYG 420
++ LL ++ + W LP+++ NNSY S P + L+G
Sbjct: 988 DIKRLLTKLLIGRPAKWYDLLPVVQLALNNSYSPSSKYTPHQLLFG 1033
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 79.0 bits (193), Expect = 4e-14
Identities = 39/94 (41%), Positives = 60/94 (63%), Gaps = 1/94 (1%)
Query: 7 VAYASRQLRVHEKNYPTHDLELAAVVFVLKIWRHYLYGS-RFEVFSDHKSLKYLFDQKEL 65
+AY SR L E+NY T + E+ A+++ L R YLYG+ +V++DH+ L + +
Sbjct: 461 IAYISRSLNKTEENYATIEKEMLAIIWSLDNLRAYLYGAGTIKVYTDHQPLTFALGNRNF 520
Query: 66 NMRQRRWLELLKDYDFGLNYHPGKANVVADALSR 99
N + +RW +++Y+ L Y PGK+NVVADALSR
Sbjct: 521 NAKLKRWKARIEEYNCELIYKPGKSNVVADALSR 554
Score = 39.7 bits (91), Expect = 0.024
Identities = 61/299 (20%), Positives = 116/299 (38%), Gaps = 43/299 (14%)
Query: 198 ILEESHRSSLSIHPGATKMYHDLKKIFWWSGLKRDVAQFVYSCLVCQKSKVEHQKPAGMM 257
I+E+ HR + H G T++ L + +++ + + SC C+ K E +
Sbjct: 696 IIEKEHRRA---HRGPTEIRLQLLEKYYFPRMSSTIRLQTSSCQCCKLYKYERHPNKPNL 752
Query: 258 VPLDVPEWKWDSISMDFVTSLPNTPRGNDAIWVIVDRLTKSA--HFLPINISFPVAQLAE 315
P +P + + + +D I+ + RL S F F + A
Sbjct: 753 QPTPIPNYPCEILHID--------------IFALEKRLYLSCIDKFSKFAKLFHLQSKAS 798
Query: 316 IYIKE--IVKLH--GVPSSIVSDRDPRFTSRFWKSLQEALGSKLRLSSAYHPQTDGQSER 371
++++E + LH P +VSD + + +L L + + +GQ ER
Sbjct: 799 VHLRETLVEALHYFTAPKVLVSDNERGLLCPTVLNYLRSLDIDLYYAPTQKSEVNGQVER 858
Query: 372 TIQSLEDLLRICVLEQGGTWDSHLPLIEFT---YNNSYHSSIGMAPFEALYGRRCRIPLC 428
+ ++ R C+ ++ T+ + L+ YN S HS P + + R R+
Sbjct: 859 FHSTFLEIYR-CLKDELPTF-KPVELVHIAVDRYNTSVHSVTNRKPADVFFDRSSRVNYQ 916
Query: 429 WFESGERVVLGPAIVQQTTEKVQMIQE--KIKASQSRQKTYHDKRRKDLEFQEGDHVFL 485
R QT E ++ + E +I+ + +R K R + + GD VF+
Sbjct: 917 GLTDFRR---------QTLEDIKGLIEYKQIRGNMARNK----NRDEPKSYGPGDEVFV 962
>POL_MLVAV (P03356) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1196
Score = 72.4 bits (176), Expect = 3e-12
Identities = 89/344 (25%), Positives = 146/344 (41%), Gaps = 29/344 (8%)
Query: 177 QGVLRFRGRICIPDNEEIKKMILEESHRSSLSIHPGATKMYHDLKK---IFWWSGLKRDV 233
+G F+G+ +PD + +L+ HR + H G KM L + ++ + +
Sbjct: 824 KGYWVFQGKPVMPDQFVFE--LLDSLHRLT---HLGYQKMKALLDRGESPYYMLNRDKTL 878
Query: 234 AQFVYSCLVCQKSKVEHQK-PAGMMVPLDVPEWKWDSISMDFVTSLPNTPRGNDAIWVIV 292
SC VC + K AG+ V P W+ +DF P G + V V
Sbjct: 879 QYVADSCTVCAQVNASKAKIGAGVRVRGHRPGSHWE---IDFTEVKPGL-YGYKYLLVFV 934
Query: 293 DRLTKSAHFLPINISFPVAQLAEIYIKEIVKLHGVPSSIVSDRDPRFTSRFWKSLQEALG 352
D + P +++ ++EI G+P + SD P FTS+ +S+ + LG
Sbjct: 935 DTFSGWVEAFPTKRE-TARVVSKKLLEEIFPRFGMPQVLGSDNGPAFTSQVSQSVADLLG 993
Query: 353 SKLRLSSAYHPQTDGQSERTIQSLEDLLRICVLEQG-GTWDSHLPLIEFTYNNSYHSSIG 411
+L AY PQ+ GQ ER +++++ L L G W LPL + N+ G
Sbjct: 994 IDWKLHCAYRPQSSGQVERMNRTIKETLTKLTLAAGTRDWVLLLPLALYRARNT-PGPHG 1052
Query: 412 MAPFEALYGRRCRIPLCWFESGERVVL-GPAIVQQTTEKVQMIQEKI-KASQSRQKTYHD 469
+ P+E LYG PL F + L +Q + +Q +Q +I K + D
Sbjct: 1053 LTPYEILYG--APPPLVNFHDPDMSELTNSPSLQAHLQALQTVQREIWKPLAEAYRDQLD 1110
Query: 470 KRRKDLEFQEGDHVFLRVTPMTGVGRALKSKKLTPKFIGPYQIL 513
+ F+ GD V++ R ++K L P++ GPY +L
Sbjct: 1111 QPVIPHPFRIGDSVWV---------RRHQTKNLEPRWKGPYTVL 1145
>POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1165
Score = 72.4 bits (176), Expect = 3e-12
Identities = 84/339 (24%), Positives = 135/339 (39%), Gaps = 41/339 (12%)
Query: 229 LKRDVAQFVYSCLVCQKSK-VEHQKPAGMMVPLDVPEWKWDSISMDFVTSLPNTPRGNDA 287
L+ V + C C + V + G D P W+ +DF P GN
Sbjct: 841 LQSAVREVTSQCQACAMTNAVTTYRETGKRQRGDRPGVYWE---VDFTEIKPGR-YGNKY 896
Query: 288 IWVIVDRLTKSAHFLPINISFPVAQLAEIYIKEIVKLHGVPSSIVSDRDPRFTSRFWKSL 347
+ V +D + P + +I ++EI+ G+P + SD P F ++ + L
Sbjct: 897 LLVFIDTFSGWVEAFPTKTETALIVCKKI-LEEILPRFGIPKVLGSDNGPAFVAQVSQGL 955
Query: 348 QEALGSKLRLSSAYHPQTDGQSERTIQSLEDLLRICVLEQGG-TWDSHLPLIEFTYNNSY 406
LG +L AY PQ+ GQ ER +++++ L LE GG W + LPL N+
Sbjct: 956 ATQLGINWKLHCAYRPQSSGQVERMNRTIKETLTKLALETGGKDWVTLLPLALLRARNT- 1014
Query: 407 HSSIGMAPFEALYGRRCRIPLCWFESGERVVLGPAIVQQTTEKVQMIQEKIKASQSRQKT 466
G+ P+E LYG P ESGE LGP + ++ +KA + +
Sbjct: 1015 PGRFGLTPYEILYGG----PPPILESGE--TLGP-----DDRFLPVLFTHLKALEIVRTQ 1063
Query: 467 YHDKRRKDLE---------FQEGDHVFLRVTPMTGVGRALKSKKLTPKFIGPYQILERVG 517
D+ ++ + FQ GD V + R + L P++ GPY +L
Sbjct: 1064 IWDQIKEVYKPGTVTIPHPFQVGDQVLV---------RRHRPSSLEPRWKGPYLVLLTTP 1114
Query: 518 TVAYRVGLPPHLSNLHNVFHVSQLQKYVPDPSHVIQSDD 556
T G+ + + H+ PD S ++ D
Sbjct: 1115 TAVKVDGIAAWV----HASHLKPAPPSAPDESWELEKTD 1149
>POL_MLVRK (P31795) Pol polyprotein [Contains: Protease (EC
3.4.23.-); Reverse transcriptase/ribonuclease H (EC
2.7.7.49) (EC 3.1.26.4) (RT); Integrase (IN)] (Fragment)
Length = 581
Score = 70.5 bits (171), Expect = 1e-11
Identities = 88/345 (25%), Positives = 147/345 (42%), Gaps = 31/345 (8%)
Query: 177 QGVLRFRGRICIPDNEEIKKMILEESHRSSLSIHPGATKMYHDLKK---IFWWSGLKRDV 233
QG F+G+ +PD + +L+ HR + H G KM L + ++ + +
Sbjct: 209 QGYWVFQGKPVMPDQFVFE--LLDSLHRLT---HLGYQKMKALLDRGESPYYMLNRDKTL 263
Query: 234 AQFVYSCLVCQKSKVEHQK-PAGMMVPLDVPEWKWDSISMDFVTSLPNTPRGNDAIWVIV 292
SC VC + K AG+ V P W+ +DF P G + V V
Sbjct: 264 QYVADSCTVCAQVNASKAKIGAGVRVRGHRPGTHWE---IDFTEVKPGL-YGYKYLLVFV 319
Query: 293 DRLTKSAHFLPINISFPVAQLAEIYIKEIVKLHGVPSSIVSDRDPRFTSRFWKSLQEALG 352
D + P ++ ++EI G+P + +D P F S+ +S+ + LG
Sbjct: 320 DTFSGWVEAFPTKHETAKIVTKKL-LEEIFPRFGMPQVLGTDNGPAFVSQVSQSVAKLLG 378
Query: 353 SKLRLSSAYHPQTDGQSERTIQSLEDLLRICVLEQG-GTWDSHLPLIEFTYNNSYHSSIG 411
+L AY PQ+ GQ ER +++++ L L G W LPL + N+ G
Sbjct: 379 IDWKLHCAYRPQSSGQVERMNRTIKETLTKLTLATGTRDWVLLLPLALYRARNT-PGPHG 437
Query: 412 MAPFEALYGRRCRIPLCWFESGE--RVVLGPAIVQQTTEKVQMIQEKI-KASQSRQKTYH 468
+ P+E LYG PL F E + P++ Q + +Q +Q ++ K + +
Sbjct: 438 LTPYEILYG--APPPLVNFHDPEMSKFTNSPSL-QAHLQALQAVQREVWKPLAAAYQDQL 494
Query: 469 DKRRKDLEFQEGDHVFLRVTPMTGVGRALKSKKLTPKFIGPYQIL 513
D+ F+ GD V++ R ++K L P++ GPY +L
Sbjct: 495 DQPVIPHPFRVGDTVWV---------RRHQTKNLEPRWKGPYTVL 530
>POL_BAEVM (P10272) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1189
Score = 69.7 bits (169), Expect = 2e-11
Identities = 88/339 (25%), Positives = 144/339 (41%), Gaps = 38/339 (11%)
Query: 184 GRICIPDNEEIKKMILEESHRSSLSIHPGATKMYHDLKKI-FWWSGLKRDVAQFVYSCLV 242
G+I +P E + ++++ H + H G K+ ++K F + Q +C V
Sbjct: 828 GKIVLPQKEALA--MIQQMHAWT---HLGNRKLKLLIEKTDFLIPRASTLIEQVTSACKV 882
Query: 243 CQKSKVEHQK-PAGMMVPLDVPEWKWDSISMDFVTSLPNTPRGNDAIWVIVDRLTKSAHF 301
CQ+ + PAG + P W+ +DF P+ G + V VD +
Sbjct: 883 CQQVNAGATRVPAGKRTRGNRPGVYWE---IDFTEVKPHYA-GYKYLLVFVDTFSGWVE- 937
Query: 302 LPINISFPVAQ-----LAEIYIKEIVKLHGVPSSIVSDRDPRFTSRFWKSLQEALGSKLR 356
+FP Q +A+ ++EI G+P I SD P F S+ + L LG +
Sbjct: 938 -----AFPTRQETAHIVAKKILEEIFPRFGLPKVIGSDNGPAFVSQVSQGLARILGINWK 992
Query: 357 LSSAYHPQTDGQSERTIQSLEDLLRICVLEQG-GTWDSHLPLIEFTYNNSYHSSIGMAPF 415
L AY PQ+ GQ ER +++++ L LE G W L L N+ + G+ P+
Sbjct: 993 LHCAYRPQSSGQVERMNRTIKETLTKLTLETGLKDWRRLLSLALLRARNT-PNRFGLTPY 1051
Query: 416 EALYGRRCRIPLCWFESGERVVLGPAIVQQTTEKVQMIQEKIKASQSR-QKTYHDKRRKD 474
E LYG PL + +Q + +Q +Q +I A + + H +
Sbjct: 1052 EILYGG--PPPLSTLLNSFSPSNSKTDLQARLKGLQAVQAQIWAPLAELYRPGHSQTSH- 1108
Query: 475 LEFQEGDHVFLRVTPMTGVGRALKSKKLTPKFIGPYQIL 513
FQ GD V++ R +S+ L P++ GPY +L
Sbjct: 1109 -PFQVGDSVYV---------RRHRSQGLEPRWKGPYIVL 1137
>POL_MLVRD (P11227) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1196
Score = 68.9 bits (167), Expect = 4e-11
Identities = 87/345 (25%), Positives = 147/345 (42%), Gaps = 31/345 (8%)
Query: 177 QGVLRFRGRICIPDNEEIKKMILEESHRSSLSIHPGATKMYHDLKK---IFWWSGLKRDV 233
+G F+G+ +PD + +L+ HR + H G KM L + ++ + +
Sbjct: 824 KGYWVFQGKPVMPDQFVFE--LLDSLHRLT---HLGYQKMKALLDRGESPYYMLNRDKTL 878
Query: 234 AQFVYSCLVCQKSKVEHQK-PAGMMVPLDVPEWKWDSISMDFVTSLPNTPRGNDAIWVIV 292
SC VC + K AG+ V P W+ +DF P G + V V
Sbjct: 879 QYVADSCTVCAQVNASKAKIGAGVRVRGHRPGTHWE---IDFTEVKPGL-YGYKYLLVFV 934
Query: 293 DRLTKSAHFLPINISFPVAQLAEIYIKEIVKLHGVPSSIVSDRDPRFTSRFWKSLQEALG 352
D + P ++ ++EI G+P + +D P F S+ +S+ + LG
Sbjct: 935 DTFSGWVEAFPTKHETAKIVTKKL-LEEIFPRFGMPQVLGTDNGPAFVSQVSQSVAKLLG 993
Query: 353 SKLRLSSAYHPQTDGQSERTIQSLEDLLRICVLEQG-GTWDSHLPLIEFTYNNSYHSSIG 411
+L AY PQ+ GQ ER +++++ L L G W LPL + N+ G
Sbjct: 994 IDWKLHCAYRPQSSGQVERMNRTIKETLTKLTLATGTRDWVLLLPLALYRARNT-PGPHG 1052
Query: 412 MAPFEALYGRRCRIPLCWFESGE--RVVLGPAIVQQTTEKVQMIQEKI-KASQSRQKTYH 468
+ P+E LYG PL F E + P++ Q + +Q +Q ++ K + +
Sbjct: 1053 LTPYEILYG--APPPLVNFHDPEMSKFTNSPSL-QAHLQALQAVQREVWKPLAAAYQDQL 1109
Query: 469 DKRRKDLEFQEGDHVFLRVTPMTGVGRALKSKKLTPKFIGPYQIL 513
D+ F+ GD V++ R ++K L P++ GPY +L
Sbjct: 1110 DQPVIPHPFRVGDTVWV---------RRHQTKNLEPRWKGPYTVL 1145
>POL_MLVAK (P03357) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)] (Fragment)
Length = 843
Score = 68.6 bits (166), Expect = 5e-11
Identities = 89/344 (25%), Positives = 146/344 (41%), Gaps = 30/344 (8%)
Query: 177 QGVLRFRGRICIPDNEEIKKMILEESHRSSLSIHPGATKMYHDLKK---IFWWSGLKRDV 233
+G F+G+ +PD + +L+ HR + H G KM L + ++ + +
Sbjct: 472 KGYWVFQGKPVMPDQFVFE--LLDSLHRLT---HLGYQKMKALLDRGESPYYMLNRDKTL 526
Query: 234 AQFVYSCLVCQKSKVEHQK-PAGMMVPLDVPEWKWDSISMDFVTSLPNTPRGNDAIWVIV 292
SC VC + K AG+ V P W+ +DF P G + V V
Sbjct: 527 QYVADSCTVCAQVNASKAKIGAGVRVRGHRPGSHWE---IDFTEVKPGL-YGYKYLLVFV 582
Query: 293 DRLTKSAHFLPINISFPVAQLAEIYIKEIVKLHGVPSSIVSDRDPRFTSRFWKSLQEALG 352
D + P +++ ++EI G+P + SD P FTS+ +S+ + LG
Sbjct: 583 DTFSGWVEAFPTKRE-TARVVSKKLLEEIFPRFGMPQVLGSDNGPAFTSQVSQSVADLLG 641
Query: 353 SKLRLSSAYHPQTDGQSERTIQSLEDLLRICVLEQG-GTWDSHLPLIEFTYNNSYHSSIG 411
+L AY PQ+ GQ ER +++++ L L G W LPL + N+ G
Sbjct: 642 ID-KLHCAYRPQSSGQVERMNRTIKETLTKLTLAAGTRDWVLLLPLALYRARNT-PGPHG 699
Query: 412 MAPFEALYGRRCRIPLCWFESGERVVL-GPAIVQQTTEKVQMIQEKI-KASQSRQKTYHD 469
+ P+E LYG PL F + L +Q + +Q +Q +I K + D
Sbjct: 700 LTPYEILYG--APPPLVNFHDPDMSELTNSPSLQAHLQALQTVQREIWKPLAEAYRDQLD 757
Query: 470 KRRKDLEFQEGDHVFLRVTPMTGVGRALKSKKLTPKFIGPYQIL 513
+ F+ GD V++ R ++K L P++ GPY +L
Sbjct: 758 QPVIPHPFRIGDSVWV---------RRHQTKNLEPRWKGPYTVL 792
>POL_AVIRE (P03360) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)] (Fragment)
Length = 473
Score = 66.2 bits (160), Expect = 2e-10
Identities = 63/241 (26%), Positives = 103/241 (42%), Gaps = 16/241 (6%)
Query: 184 GRICIPDNEEIKKMILEESHRSSLSIHPGATKMYHDLKKIFWWSGLKRDVAQFVYSCLVC 243
GR+ +P + + +LE++HR++ H G +K+ ++K + G+ R C+ C
Sbjct: 111 GRLLLP--RAVGRKVLEQTHRAT---HLGESKLTELVRKHYPICGIYRAARDITTRCVAC 165
Query: 244 QKSKVEH---QKPAGMMVPLDVPEWKWDSISMDFVTSLPNTPRGNDAIWVIVDRLTKSAH 300
+ +K + P W+ +DF T + G + V+VD +
Sbjct: 166 AQVNPRAAPVEKGLNSRIRGAAPGEHWE---VDF-TEMITAKGGYKYLLVLVDTFSGWVE 221
Query: 301 FLPINISFPVAQLAEIYIKEIVKLHGVPSSIVSDRDPRFTSRFWKSLQEALGSKLRLSSA 360
P + + I +I+ G+P I SD P F ++ + L EAL +L A
Sbjct: 222 AYPAKRETSQVVIKHL-ILDIIPRFGLPVQIGSDNGPAFVAKVTQQLCEALNVSWKLHCA 280
Query: 361 YHPQTDGQSERTIQSLEDLLRICVLEQGGTWDSHLPLIE-FTYNNSYHSSIGMAPFEALY 419
Y PQ+ GQ ER ++L+ I LE LP F Y G++PFE LY
Sbjct: 281 YRPQSSGQVERMNRTLKK--AIAKLEDRDRRGLGLPPPSGFAPGTVYPGREGLSPFEILY 338
Query: 420 G 420
G
Sbjct: 339 G 339
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.321 0.137 0.411
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 72,519,280
Number of Sequences: 164201
Number of extensions: 3054602
Number of successful extensions: 7661
Number of sequences better than 10.0: 66
Number of HSP's better than 10.0 without gapping: 44
Number of HSP's successfully gapped in prelim test: 22
Number of HSP's that attempted gapping in prelim test: 7558
Number of HSP's gapped (non-prelim): 81
length of query: 617
length of database: 59,974,054
effective HSP length: 116
effective length of query: 501
effective length of database: 40,926,738
effective search space: 20504295738
effective search space used: 20504295738
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 69 (31.2 bits)
Medicago: description of AC138199.5