
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0046b.4
(1706 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Sequences producing significant alignments:    Score (bits)    E Value
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 202 5e-51
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 202 8e-51
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 201 2e-50
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 181 2e-44
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 174 1e-42
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 162 1e-38
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 158 1e-37
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 158 1e-37
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 145 9e-34
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 130 4e-29
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 129 5e-29
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 129 5e-29
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 127 3e-28
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 126 6e-28
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot... 125 8e-28
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot... 120 3e-26
YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II 118 1e-25
POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse transcript... 106 6e-22
POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC 3.4.23... 101 2e-20
POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23... 100 3e-20
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 202 bits (515), Expect = 5e-51
Identities = 124/363 (34%), Positives = 182/363 (49%), Gaps = 6/363 (1%)
Query: 745 PTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYS 804
P W+ K+R+ DY LN++ D +P+PN+D+++ + +D
Sbjct: 246 PIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDRHPIPNMDEILGKLGRCNYFTTIDLAK 305
Query: 805 GYNQIMMHPSDEESTTFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYV 864
G++QI M P T F T +Y Y MPFGLKNA AT+QR M+ I + ++ VY+
Sbjct: 306 GFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKNAPATFQRCMNDILRPLLNKHCLVYL 365
Query: 865 DDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPD 924
DD+IV S +H L F++L +KL +KC F Q FLG +LT GI+ NP+
Sbjct: 366 DDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDKCEFLKQETTFLGHVLTPDGIKPNPE 425
Query: 925 KGRAILEMKSPTSVKEVQRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTE-ECEQ 983
K AI + PT KE++ G +F+P D A P CLKKN K T E +
Sbjct: 426 KIEAIQKYPIPTKPKEIKAFLGLTGYYRKFIPNFADIAKPMTKCLKKNMKIDTTNPEYDS 485
Query: 984 AFTKLKETLATLPVLSKPTPSVPLVLYLAVTDKAVSTVLLQEEGKKQKVIYFVSHTLQGA 1043
AF KLK ++ P+L P + L +D A+ VL Q+ + ++S TL
Sbjct: 486 AFKKLKYLISEDPILKVPDFTKKFTLTTDASDVALGAVLSQD----GHPLSYISRTLNEH 541
Query: 1044 ELRYQKIEKAALAILKTARRLRPYFQSFQVKIKTD-VPLRQVLQKPDLSGRLVSWSVELS 1102
E+ Y IEK LAI+ + R Y +I +D PL + + D + +L W V+LS
Sbjct: 542 EINYSTIEKELLAIVWATKTFRHYLLGRHFEISSDHQPLSWLYRMKDPNSKLTRWRVKLS 601
Query: 1103 EYD 1105
E+D
Sbjct: 602 EFD 604
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 202 bits (513), Expect = 8e-51
Identities = 124/414 (29%), Positives = 209/414 (49%), Gaps = 9/414 (2%)
Query: 702 LAIRPGATPVIQPRRRMSEEKNKAVQLETEKLIKARFIREVQYPTWLANVVMVKKANGKW 761
+ ++ GA P+ Q R + ++ +K++ + IRE + P W + VV+VKK +G
Sbjct: 934 IELKEGAEPIRQKPRPIPLALKPEIRKMIQKMLNQKVIRESKSP-WSSPVVLVKKKDGSI 992
Query: 762 RMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTTF 821
RMC DY +NKV +++PLPN++ + +G +L ++ D +G+ QI + +E T F
Sbjct: 993 RMCIDYRKVNKVVKNNAHPLPNIEATLQSLAGKKLYTVFDMIAGFWQIPLDEKSKEITAF 1052
Query: 822 MTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDL 881
+ + +PFGL + A +Q M++I +G VYVDD+++ S H D+
Sbjct: 1053 AIGSELFEWNVLPFGLVISPALFQGTMEEIIGDLLGVCAFVYVDDLLIASKDMEQHLQDV 1112
Query: 882 KEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEV 941
KEA ++R MKL KC + ++LG +T G+E K + + PT+VKE+
Sbjct: 1113 KEALTRIRKSGMKLRASKCHIAKKEVEYLGHKVTLDGVETQEVKTDKMKQFSRPTNVKEL 1172
Query: 942 QRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEECEQAFTKLKETLATLPVLSKP 1001
Q G + +F+ A+ + + + W +E E AF +LK+ + PVL++P
Sbjct: 1173 QSFLGLVGYYRKFILNFAQIASSLTSLISAKVAWIWEKEQEIAFQELKKLVCQTPVLAQP 1232
Query: 1002 TPSV------PLVLYLAVTDKAVSTVLLQE-EGKKQKVIYFVSHTLQGAELRYQKIEKAA 1054
P ++Y + K + VL QE +Q I F S L AE RY + A
Sbjct: 1233 DVEAALKGDRPFMIYTDASRKGIGAVLAQEGPDGQQHPIAFASKALSPAETRYHITDLEA 1292
Query: 1055 LAILKTARRLRPYFQSFQVKIKTD-VPLRQVLQKPDLSGRLVSWSVELSEYDIQ 1107
LA++ RR + + + TD PL +L+ L+ RL WS+E+ E+D++
Sbjct: 1293 LAMMFALRRFKTIIYGTAITVFTDHKPLISLLKGSPLADRLWRWSIEILEFDVK 1346
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 201 bits (510), Expect = 2e-50
Identities = 128/408 (31%), Positives = 201/408 (48%), Gaps = 12/408 (2%)
Query: 709 TPVIQPRRRMSEEKNKAVQLETEKLIKARFIRE----VQYPTWLANVVMVKKANGKWRMC 764
+P+ + +++ V+ + ++++ IRE PTW+ K+R+
Sbjct: 205 SPIYSKQYPLAQTHEIEVENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVV 264
Query: 765 TDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTTFMTN 824
DY LN++ D YP+PN+D+++ + + +D G++QI M T F T
Sbjct: 265 IDYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTK 324
Query: 825 QANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDLKEA 884
+Y Y MPFGL+NA AT+QR M+ I + ++ VY+DD+I+ S ++H ++
Sbjct: 325 SGHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLV 384
Query: 885 FDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRL 944
F +L +KL +KC F + FLG ++T GI+ NP K +AI+ PT KE++
Sbjct: 385 FTKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAF 444
Query: 945 TGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEECE--QAFTKLKETLATLPVLSKPT 1002
G +F+P D A P +CLKK +K T++ E +AF KLK + P+L P
Sbjct: 445 LGLTGYYRKFIPNYADIAKPMTSCLKKRTKID-TQKLEYIEAFEKLKALIIRDPILQLPD 503
Query: 1003 PSVPLVLYLAVTDKAVSTVLLQEEGKKQKVIYFVSHTLQGAELRYQKIEKAALAILKTAR 1062
VL ++ A+ VL Q I F+S TL EL Y IEK LAI+ +
Sbjct: 504 FEKKFVLTTDASNLALGAVLSQ----NGHPISFISRTLNDHELNYSAIEKELLAIVWATK 559
Query: 1063 RLRPYFQSFQVKIKTD-VPLRQVLQKPDLSGRLVSWSVELSEYDIQYE 1109
R Y Q I +D PLR + + +L W V LSEY + +
Sbjct: 560 TFRHYLLGRQFLIASDHQPLRWLHNLKEPGAKLERWRVRLSEYQFKID 607
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 181 bits (459), Expect = 2e-44
Identities = 126/406 (31%), Positives = 205/406 (50%), Gaps = 23/406 (5%)
Query: 726 VQLETEKLIKARFIRE----VQYPTWLANVVMVKKANGK--WRMCTDYTSLNKVCPKDSY 779
V+ + ++L++ IR P W+ V K NG+ +RM D+ LN V D+Y
Sbjct: 139 VERQIDELLQDGIIRPSNSPYNSPIWI--VPKKPKPNGEKQYRMVVDFKRLNTVTIPDTY 196
Query: 780 PLPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTTFMTNQANYCYKTMPFGLKN 839
P+P+++ + + + +D SG++QI M SD T F T Y + +PFGLKN
Sbjct: 197 PIPDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPFGLKN 256
Query: 840 AGATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEK 899
A A +QR++D I + +G+ VY+DD+IV S H +L+ L +++N EK
Sbjct: 257 APAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNLEK 316
Query: 900 CSFGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRLTGRMAALSRFLPMAG 959
F +FLG+++T+ GI+ +P K RAI EM PTSVKE++R G + +F+
Sbjct: 317 SHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYRKFIQDYA 376
Query: 960 DKAAPFFTCLK---------KNSKFQWT--EECEQAFTKLKETLATLPVLSKPTPSVPLV 1008
A P + ++SK T E Q+F LK L + +L+ P + P
Sbjct: 377 KVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKPFH 436
Query: 1009 LYLAVTDKAVSTVLLQEEGKKQKVIYFVSHTLQGAELRYQKIEKAALAILKTARRLRPY- 1067
L ++ A+ VL Q++ + + I ++S +L E Y IEK LAI+ + LR Y
Sbjct: 437 LTTDASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLDNLRAYL 496
Query: 1068 FQSFQVKIKTD-VPLRQVLQKPDLSGRLVSWSVELSEYDIQ--YEP 1110
+ + +K+ TD PL L + + +L W + EY+ + Y+P
Sbjct: 497 YGAGTIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELIYKP 542
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 174 bits (442), Expect = 1e-42
Identities = 119/406 (29%), Positives = 177/406 (43%), Gaps = 6/406 (1%)
Query: 710 PVIQPRRRMSEEKNKAVQLETEKLIKARFIREV--QYPTWLANVVMVKKANG---KWRMC 764
PV R + + +Q + +KLIK + + QY + L V N KWR+
Sbjct: 314 PVYTKNYRSPHSQVEEIQAQVQKLIKDKIVEPSVSQYNSPLLLVPKKSSPNSDKKKWRLV 373
Query: 765 TDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTTFMTN 824
DY +NK D +PLP +D ++D + S +D SG++QI + + T+F T+
Sbjct: 374 IDYRQINKKLLADKFPLPRIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTS 433
Query: 825 QANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDLKEA 884
+Y + +PFGLK A ++QR+M FS +Y+DD+IV +L E
Sbjct: 434 NGSYRFTRLPFGLKIAPNSFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEV 493
Query: 885 FDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRL 944
F + R Y +KL+PEKCSF + FLG T +GI + K I P +R
Sbjct: 494 FGKCREYNLKLHPEKCSFFMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRF 553
Query: 945 TGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEECEQAFTKLKETLATLPVLSKPTPS 1004
RF+ D + KKN F+WT+EC++AF LK L +L P S
Sbjct: 554 VAFCNYYRRFIKNFADYSRHITRLCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFS 613
Query: 1005 VPLVLYLAVTDKAVSTVLLQEEGKKQKVIYFVSHTLQGAELRYQKIEKAALAILKTARRL 1064
+ + +A VL Q Q + + S E E+ AI
Sbjct: 614 KEFCITTDASKQACGAVLTQNHNGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHF 673
Query: 1065 RPYFQSFQVKIKTD-VPLRQVLQKPDLSGRLVSWSVELSEYDIQYE 1109
RPY +KTD PL + + S +L +EL EY+ E
Sbjct: 674 RPYIYGKHFTVKTDHRPLTYLFSMVNPSSKLTRIRLELEEYNFTVE 719
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 162 bits (409), Expect = 1e-38
Identities = 110/397 (27%), Positives = 192/397 (47%), Gaps = 9/397 (2%)
Query: 722 KNKAVQLETEKLIKARFIREVQYPTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPL 781
K +A+ E + +K+ IRE + V+ V K G RM DY LNK + YPL
Sbjct: 424 KMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPL 482
Query: 782 PNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTTFMTNQANYCYKTMPFGLKNAG 841
P +++L+ G+ + + +D S Y+ I + DE F + + Y MP+G+ A
Sbjct: 483 PLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAP 542
Query: 842 ATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCS 901
A +Q ++ I + ++ Y+DD+++ S S+H +K+ +L+ + +N KC
Sbjct: 543 AHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCE 602
Query: 902 FGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRLTGRMAALSRFLPMAGDK 961
F KF+G+ ++ +G + +L+ K P + KE+++ G + L +F+P
Sbjct: 603 FHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQL 662
Query: 962 AAPFFTCLKKNSKFQWTEECEQAFTKLKETLATLPVLSKPTPSVPLVLYLAVTDKAVSTV 1021
P LKK+ +++WT QA +K+ L + PVL S ++L +D AV V
Sbjct: 663 THPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAV 722
Query: 1022 LLQE-EGKKQKVIYFVSHTLQGAELRYQKIEKAALAILKTARRLRPYFQSF--QVKIKTD 1078
L Q+ + K + + S + A+L Y +K LAI+K+ + R Y +S KI TD
Sbjct: 723 LSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTD 782
Query: 1079 ---VPLRQVLQKPDLSGRLVSWSVELSE--YDIQYEP 1110
+ R + + RL W + L + ++I Y P
Sbjct: 783 HRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRP 819
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 158 bits (400), Expect = 1e-37
Identities = 109/397 (27%), Positives = 192/397 (47%), Gaps = 9/397 (2%)
Query: 722 KNKAVQLETEKLIKARFIREVQYPTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPL 781
K +A+ E + +K+ IRE + V+ V K G RM DY LNK + YPL
Sbjct: 424 KMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPL 482
Query: 782 PNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTTFMTNQANYCYKTMPFGLKNAG 841
P +++L+ G+ + + +D S Y+ I + DE F + + Y MP+G+ A
Sbjct: 483 PLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAP 542
Query: 842 ATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCS 901
A +Q ++ I + ++ Y+D++++ S S+H +K+ +L+ + +N KC
Sbjct: 543 AHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCE 602
Query: 902 FGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRLTGRMAALSRFLPMAGDK 961
F KF+G+ ++ +G + +L+ K P + KE+++ G + L +F+P
Sbjct: 603 FHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQL 662
Query: 962 AAPFFTCLKKNSKFQWTEECEQAFTKLKETLATLPVLSKPTPSVPLVLYLAVTDKAVSTV 1021
P LKK+ +++WT QA +K+ L + PVL S ++L +D AV V
Sbjct: 663 THPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAV 722
Query: 1022 LLQE-EGKKQKVIYFVSHTLQGAELRYQKIEKAALAILKTARRLRPYFQSF--QVKIKTD 1078
L Q+ + K + + S + A+L Y +K LAI+K+ + R Y +S KI TD
Sbjct: 723 LSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTD 782
Query: 1079 ---VPLRQVLQKPDLSGRLVSWSVELSE--YDIQYEP 1110
+ R + + RL W + L + ++I Y P
Sbjct: 783 HRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRP 819
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 158 bits (400), Expect = 1e-37
Identities = 109/397 (27%), Positives = 192/397 (47%), Gaps = 9/397 (2%)
Query: 722 KNKAVQLETEKLIKARFIREVQYPTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPL 781
K +A+ E + +K+ IRE + V+ V K G RM DY LNK + YPL
Sbjct: 424 KMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPL 482
Query: 782 PNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTTFMTNQANYCYKTMPFGLKNAG 841
P +++L+ G+ + + +D S Y+ I + DE F + + Y MP+G+ A
Sbjct: 483 PLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAP 542
Query: 842 ATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCS 901
A +Q ++ I + ++ Y+D++++ S S+H +K+ +L+ + +N KC
Sbjct: 543 AHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCE 602
Query: 902 FGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRLTGRMAALSRFLPMAGDK 961
F KF+G+ ++ +G + +L+ K P + KE+++ G + L +F+P
Sbjct: 603 FHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQL 662
Query: 962 AAPFFTCLKKNSKFQWTEECEQAFTKLKETLATLPVLSKPTPSVPLVLYLAVTDKAVSTV 1021
P LKK+ +++WT QA +K+ L + PVL S ++L +D AV V
Sbjct: 663 THPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAV 722
Query: 1022 LLQE-EGKKQKVIYFVSHTLQGAELRYQKIEKAALAILKTARRLRPYFQSF--QVKIKTD 1078
L Q+ + K + + S + A+L Y +K LAI+K+ + R Y +S KI TD
Sbjct: 723 LSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTD 782
Query: 1079 ---VPLRQVLQKPDLSGRLVSWSVELSE--YDIQYEP 1110
+ R + + RL W + L + ++I Y P
Sbjct: 783 HRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRP 819
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 145 bits (366), Expect = 9e-34
Identities = 111/406 (27%), Positives = 190/406 (46%), Gaps = 25/406 (6%)
Query: 726 VQLETEKLIKARFIREVQYPTWLANVVMVKKA-----NGKWRMCTDYTSLNKVCPKDSYP 780
V E ++L+K IR + P V+ KK N R+ D+ LN+ D YP
Sbjct: 197 VNNEVKQLLKDGIIRPSRSPYNSPTWVVDKKGTDAFGNPNKRLVIDFRKLNEKTIPDRYP 256
Query: 781 LPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTTFMTNQANYCYKTMPFGLKNA 840
+P++ ++ + + +D SGY+QI + D E T+F N Y + +PFGL+NA
Sbjct: 257 MPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFSVNGGKYEFCRLPFGLRNA 316
Query: 841 GATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKC 900
+ +QR +D + +Q+G+ VYVDD+I+ S SDH + L M+++ EK
Sbjct: 317 SSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHIDTVLKCLIDANMRVSQEKT 376
Query: 901 SFGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRLTGRMAALSRFLPMAGD 960
F + ++LGF+++ G + +P+K +AI E P V +V+ G + F+
Sbjct: 377 RFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRSFLGLASYYRVFIKDFAA 436
Query: 961 KAAPFFTCLK-----------KNSKFQWTEECEQAFTKLKETLATLPVLSK-PTPSVPLV 1008
A P LK K ++ E AF +L+ LA+ V+ K P P
Sbjct: 437 IARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNILASEDVILKYPDFKKPFD 496
Query: 1009 LYLAVTDKAVSTVLLQEEGKKQKVIYFVSHTLQGAELRYQKIEKAALAILKTARRLRPY- 1067
L + + VL QE + I +S TL+ E Y E+ LAI+ +L+ +
Sbjct: 497 LTTDASASGIGAVLSQE----GRPITMISRTLKQPEQNYATNERELLAIVWALGKLQNFL 552
Query: 1068 FQSFQVKIKTD-VPLRQVLQKPDLSGRLVSWSVELSEYD--IQYEP 1110
+ S ++ I TD PL + + + ++ W + +++ + Y+P
Sbjct: 553 YGSREINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQHNAKVFYKP 598
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 130 bits (326), Expect = 4e-29
Identities = 100/378 (26%), Positives = 168/378 (43%), Gaps = 18/378 (4%)
Query: 745 PTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYS 804
P +L N +K GK RM +Y ++NK D+Y LPN D+L+ G ++ S D S
Sbjct: 285 PAFLVNNE-AEKRRGKKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKS 343
Query: 805 GYNQIMMHPSDEESTTFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYV 864
G+ Q+++ T F Q +Y + +PFGLK A + +QR MD+ F + + VYV
Sbjct: 344 GFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF-RVFRKFCCVYV 402
Query: 865 DDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPD 924
DD++V S DH + + + + L+ +K + FLG + +
Sbjct: 403 DDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDE---GTHKP 459
Query: 925 KGRAILEM-KSPTSV---KEVQRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEE 980
+G + + K P ++ K++QR G + S ++P P LK+N ++WT+E
Sbjct: 460 QGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWRWTKE 519
Query: 981 CEQAFTKLKETLATLPVLSKPTPSVPLVLYLAVTDK----AVSTVLLQEEGKKQKVIYFV 1036
K+K+ L P L P P L++ +D + + + E + + +
Sbjct: 520 DTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYA 579
Query: 1037 SHTLQGAELRYQKIEKAALAILKTARRLRPYFQSFQVKIKTD----VPLRQVLQKPDLS- 1091
S + + AE Y +K LA++ T ++ Y I+TD + K D
Sbjct: 580 SGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKL 639
Query: 1092 GRLVSWSVELSEYDIQYE 1109
GR + W LS Y E
Sbjct: 640 GRNIRWQAWLSHYSFDVE 657
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 129 bits (325), Expect = 5e-29
Identities = 100/378 (26%), Positives = 168/378 (43%), Gaps = 18/378 (4%)
Query: 745 PTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYS 804
P +L N +K GK RM +Y ++NK D+Y LPN D+L+ G ++ S D S
Sbjct: 285 PAFLVNNE-AEKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKS 343
Query: 805 GYNQIMMHPSDEESTTFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYV 864
G+ Q+++ T F Q +Y + +PFGLK A + +QR MD+ F + + VYV
Sbjct: 344 GFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF-RVFRKFCCVYV 402
Query: 865 DDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPD 924
DD++V S DH + + + + L+ +K + FLG + +
Sbjct: 403 DDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDE---GTHKP 459
Query: 925 KGRAILEM-KSPTSV---KEVQRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEE 980
+G + + K P ++ K++QR G + S ++P P LK+N ++WT+E
Sbjct: 460 QGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKE 519
Query: 981 CEQAFTKLKETLATLPVLSKPTPSVPLVLYLAVTDK----AVSTVLLQEEGKKQKVIYFV 1036
K+K+ L P L P P L++ +D + + + E + + +
Sbjct: 520 DTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYA 579
Query: 1037 SHTLQGAELRYQKIEKAALAILKTARRLRPYFQSFQVKIKTD----VPLRQVLQKPDLS- 1091
S + + AE Y +K LA++ T ++ Y I+TD + K D
Sbjct: 580 SGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKL 639
Query: 1092 GRLVSWSVELSEYDIQYE 1109
GR + W LS Y E
Sbjct: 640 GRNIRWQAWLSHYSFDVE 657
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 129 bits (325), Expect = 5e-29
Identities = 100/378 (26%), Positives = 168/378 (43%), Gaps = 18/378 (4%)
Query: 745 PTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYS 804
P +L N +K GK RM +Y ++NK D+Y LPN D+L+ G ++ S D S
Sbjct: 285 PAFLVNNE-AEKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKS 343
Query: 805 GYNQIMMHPSDEESTTFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYV 864
G+ Q+++ T F Q +Y + +PFGLK A + +QR MD+ F + + VYV
Sbjct: 344 GFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF-RVFRKFCCVYV 402
Query: 865 DDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPD 924
DD++V S DH + + + + L+ +K + FLG + +
Sbjct: 403 DDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDE---GTHKP 459
Query: 925 KGRAILEM-KSPTSV---KEVQRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEE 980
+G + + K P ++ K++QR G + S ++P P LK+N ++WT+E
Sbjct: 460 QGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKE 519
Query: 981 CEQAFTKLKETLATLPVLSKPTPSVPLVLYLAVTDK----AVSTVLLQEEGKKQKVIYFV 1036
K+K+ L P L P P L++ +D + + + E + + +
Sbjct: 520 DTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYA 579
Query: 1037 SHTLQGAELRYQKIEKAALAILKTARRLRPYFQSFQVKIKTD----VPLRQVLQKPDLS- 1091
S + + AE Y +K LA++ T ++ Y I+TD + K D
Sbjct: 580 SGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKL 639
Query: 1092 GRLVSWSVELSEYDIQYE 1109
GR + W LS Y E
Sbjct: 640 GRNIRWQAWLSHYSFDVE 657
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 674
Score = 127 bits (319), Expect = 3e-28
Identities = 99/378 (26%), Positives = 167/378 (43%), Gaps = 18/378 (4%)
Query: 745 PTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYS 804
P +L N +K GK RM +Y ++NK D+Y PN D+L+ G ++ S D S
Sbjct: 280 PAFLVNNE-AEKRRGKKRMVVNYKAMNKATVGDAYNPPNKDELLTLIRGKKIFSSFDCKS 338
Query: 805 GYNQIMMHPSDEESTTFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYV 864
G+ Q+++ T F Q +Y + +PFGLK A + +QR MD+ F + + VYV
Sbjct: 339 GFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF-RVFRKFCCVYV 397
Query: 865 DDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPD 924
DD++V S DH + + + + L+ +K + FLG + +
Sbjct: 398 DDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDE---GTHKP 454
Query: 925 KGRAILEM-KSPTSV---KEVQRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEE 980
+G + + K P ++ K++QR G + S ++P P LK+N ++WT+E
Sbjct: 455 QGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKE 514
Query: 981 CEQAFTKLKETLATLPVLSKPTPSVPLVLYLAVTDK----AVSTVLLQEEGKKQKVIYFV 1036
K+K+ L P L P P L++ +D + + + E + + +
Sbjct: 515 DTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYA 574
Query: 1037 SHTLQGAELRYQKIEKAALAILKTARRLRPYFQSFQVKIKTD----VPLRQVLQKPDLS- 1091
S + + AE Y +K LA++ T ++ Y I+TD + K D
Sbjct: 575 SGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKL 634
Query: 1092 GRLVSWSVELSEYDIQYE 1109
GR + W LS Y E
Sbjct: 635 GRNIRWQAWLSHYSFDVE 652
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 680
Score = 126 bits (316), Expect = 6e-28
Identities = 98/378 (25%), Positives = 166/378 (42%), Gaps = 18/378 (4%)
Query: 745 PTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYS 804
P +L N + G RM +Y ++NK D+Y LPN D+L+ G ++ S D S
Sbjct: 286 PAFLVNNE-AENGRGNKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKS 344
Query: 805 GYNQIMMHPSDEESTTFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYV 864
G+ Q+++ T F Q +Y + +PFGLK A + +QR MD+ F + + VYV
Sbjct: 345 GFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF-RVFRKFCCVYV 403
Query: 865 DDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPD 924
DD++V S DH + + + + L+ +K + FLG + +
Sbjct: 404 DDIVVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDE---GTHKP 460
Query: 925 KGRAILEM-KSPTSV---KEVQRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEE 980
+G + + K P ++ K++QR G + S ++P P LK+N ++WT+E
Sbjct: 461 QGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPNLAQMRQPLQAKLKENVPWKWTKE 520
Query: 981 CEQAFTKLKETLATLPVLSKPTPSVPLVLYLAVTDK----AVSTVLLQEEGKKQKVIYFV 1036
K+K+ L P L P P L++ +D + + + E + + +
Sbjct: 521 DTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYR 580
Query: 1037 SHTLQGAELRYQKIEKAALAILKTARRLRPYFQSFQVKIKTD----VPLRQVLQKPDLS- 1091
S + + AE Y +K LA++ T ++ Y I+TD + K D
Sbjct: 581 SGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKL 640
Query: 1092 GRLVSWSVELSEYDIQYE 1109
GR + W LS Y E
Sbjct: 641 GRNIRWQAWLSHYSFDVE 658
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 659
Score = 125 bits (315), Expect = 8e-28
Identities = 100/386 (25%), Positives = 172/386 (43%), Gaps = 16/386 (4%)
Query: 755 KKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPS 814
++ GK RM +Y ++NK D++ LPN D+L+ G ++ S D SG Q+++
Sbjct: 276 ERRRGKKRMVVNYKAMNKATKGDAHNLPNKDELLTLVRGKKIYSSFDCKSGLWQVLLDKE 335
Query: 815 DEESTTFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYVDDMIV-KSAR 873
+ T F Q +Y + +PFGLK A + + + S Q + VYVDD++V +
Sbjct: 336 SQLLTAFTCPQGHYQWNVVPFGLKQAPSIFPKTYANSHSNQYSKYCCVYVDDILVFSNTG 395
Query: 874 ASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPDKGRAILE-- 931
+H + + + L+ +K + FLG + +G + ILE
Sbjct: 396 RKEHYIHVLNILRRCEKLGIILSKKKAQLFKEKINFLGLEI-DQGTHCPQNH---ILEHI 451
Query: 932 MKSPTSV---KEVQRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEECEQAFTKL 988
K P + K++QR G + S ++P P + LK++S + W + Q K+
Sbjct: 452 HKFPDRIEDKKQLQRFLGILTYASDYIPKLASIRKPLQSKLKEDSTWTWNDTDSQYMAKI 511
Query: 989 KETLATLPVLSKPTPSVPLVLYLAVTDKAVSTVLLQEEGKKQKVIYFVSHTLQGAELRYQ 1048
K+ L + P L P P+ LV+ +++ +L + + + S + + AE Y
Sbjct: 512 KKNLKSFPKLYHPEPNDKLVIETDASEEFWGGILKAIHNSHEYICRYASGSFKAAERNYH 571
Query: 1049 KIEKAALAILKTARRLRPYFQSFQVKIKTDVP-----LRQVLQKPDLSGRLVSWSVELSE 1103
EK LA+++ ++ Y + I+TD + L+ GRLV W + LS+
Sbjct: 572 SNEKELLAVIRVIKKFSIYLTPSRFLIRTDNKNFTHFVNINLKGDRKQGRLVRWQMWLSQ 631
Query: 1104 YDIQYEPRGQVTVQSLIDFVAELTPT 1129
YD E T DF+ E T T
Sbjct: 632 YDFDVEHIAG-TKNVFADFLQENTLT 656
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 666
Score = 120 bits (301), Expect = 3e-26
Identities = 96/371 (25%), Positives = 160/371 (42%), Gaps = 26/371 (7%)
Query: 755 KKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPS 814
++ GK RM +Y ++N+ DS+ LPN+ +L+ G + S D SG+ Q+++
Sbjct: 287 ERRRGKKRMVVNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKSGFWQVVLDEE 346
Query: 815 DEESTTFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARA 874
++ T F Q ++ +K +PFGLK A + +QR M + + VYVDD+IV S
Sbjct: 347 SQKLTAFTCPQGHFQWKVVPFGLKQAPSIFQRHMQTALN-GADKFCMVYVDDIIVFSNSE 405
Query: 875 SDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTS----------RGIEVNPD 924
DH + + Y + L+ +K + + FLG + I PD
Sbjct: 406 LDHYNHVYAVLKIVEKYGIILSKKKANLFKEKINFLGLEIDKGTHCPQNHILENIHKFPD 465
Query: 925 KGRAILEMKSPTSVKEVQRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEECEQA 984
+ LE K K +QR G + ++P + P LKK+ + WT+
Sbjct: 466 R----LEDK-----KHLQRFLGVLTYAETYIPKLAEIRKPLQVKLKKDVTWNWTQSDSDY 516
Query: 985 FTKLKETLATLPVLSKPTPSVPLVLYLAVTDKAVSTVL-LQEEGKKQKVIYFVSHTLQGA 1043
K+K+ L + P L P P L++ +D VL + + + + S + + A
Sbjct: 517 VKKIKKNLGSFPKLYLPKPEDHLIIETDASDSFWGGVLKARALDGVELICRYSSGSFKQA 576
Query: 1044 ELRYQKIEKAALAILKTARRLRPYFQSFQVKIKTDVP-----LRQVLQKPDLSGRLVSWS 1098
E Y +K LA+ + + Y + ++TD LR L+ GRLV W
Sbjct: 577 EKNYHSNDKELLAVKQVITKFSAYLTPVRFTVRTDNKNFTYFLRINLKGDSKQGRLVRWQ 636
Query: 1099 VELSEYDIQYE 1109
S+Y E
Sbjct: 637 NWFSKYQFDVE 647
>YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II
Length = 1268
Score = 118 bits (296), Expect = 1e-25
Identities = 76/251 (30%), Positives = 128/251 (50%), Gaps = 7/251 (2%)
Query: 708 ATPVIQPRRRMSEEKNKAVQLETEKLIKARFIREVQYPTWLANVVMVKK-ANGKWRMCTD 766
A PV + R + +AV+ E +L + I + Y W A +V++KK GK R+C D
Sbjct: 438 AVPVFKRARPVPYGSLEAVETELNRLQEMGVIVPITYAKWAAPIVVIKKKGTGKIRVCAD 497
Query: 767 Y--TSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTTFMTN 824
+ + LN + +PLP + + G + S +D Y Q+ + ++ T+
Sbjct: 498 FKCSGLNAALKDEFHPLPTSEDIFSRLKGT-VYSQIDLKDAYLQVELDEEAQKLAVINTH 556
Query: 825 QANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDLKEA 884
+ + Y M FGLK A A++Q++MDK+ S G + VY DD+I+ ++ +H L+E
Sbjct: 557 RGIFKYLRMTFGLKPAPASFQKIMDKMVSGLTG--VAVYWDDIIISASSIEEHEKILREL 614
Query: 885 FDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRL 944
F++ + Y +++ EKC+F + FLGF + G + K AI MK+PT K++
Sbjct: 615 FERFKEYGFRVSAEKCAFAQKQVTFLGF-VDEHGRRPDSKKTEAIRSMKAPTDQKQLASF 673
Query: 945 TGRMAALSRFL 955
G LSR +
Sbjct: 674 LGAADWLSRMM 684
>POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)]
Length = 886
Score = 106 bits (264), Expect = 6e-22
Identities = 98/438 (22%), Positives = 176/438 (39%), Gaps = 28/438 (6%)
Query: 751 VVMVKKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYSGYNQIM 810
V V K +G+WRM DY +NK P + + ++ + + +D +G+
Sbjct: 5 VYPVPKPDGRWRMVLDYREVNKTIPLTAAQNQHSAGILATIVRQKYKTTLDLANGF---W 61
Query: 811 MHPSDEES---TTFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYVDDM 867
HP ES T F YC+ +P G N+ A + D + + N++VYVDD+
Sbjct: 62 AHPITPESYWLTAFTWQGKQYCWTRLPQGFLNSPALFTA--DVVDLLKEIPNVQVYVDDI 119
Query: 868 IVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPDKGR 927
+ +H L++ F L ++ +K G + +FLGF +T G +
Sbjct: 120 YLSHDDPKEHVQQLEKVFQILLQAGYVVSLKKSEIGQKTVEFLGFNITKEGRGLTDTFKT 179
Query: 928 AILEMKSPTSVKEVQRLTGRMAALSRFLPMAGDKAAPFFTCL--KKNSKFQWTEECEQAF 985
+L + P +K++Q + G + F+P + P + + K +W+EE +
Sbjct: 180 KLLNITPPKDLKQLQSILGLLNFARNFIPNFAELVQPLYNLIASAKGKYIEWSEENTKQL 239
Query: 986 TKLKETLATLPVLSKPTPSVPLVLYLAVTDKAVSTVLLQEEGKKQKVIYFVSHTLQGAEL 1045
+ E L T L + P LV+ + + A E GKK I ++++ AEL
Sbjct: 240 NMVIEALNTASNLEERLPEQRLVIKVNTSPSAGYVRYYNETGKKP--IMYLNYVFSKAEL 297
Query: 1046 RYQKIEKAALAILKTARRLRPYFQSFQVKIKTDVPLRQVLQKPDLSG------RLVSWSV 1099
++ +EK + K + ++ + + + +QK L R ++W
Sbjct: 298 KFSMLEKLLTTMHKALIKAMDLAMGQEILVYSPIVSMTKIQKTPLPERKALPIRWITWMT 357
Query: 1100 ELSEYDIQYE-PRGQVTVQSLIDFVAELTPTEGEKTQGEWVLSVDGS---------SNNT 1149
L + IQ+ + ++ + D +Q E V DGS SNN
Sbjct: 358 YLEDPRIQFHYDKTLPELKHIPDVYTSSQSPVKHPSQYEGVFYTDGSAIKSPDPTKSNNA 417
Query: 1150 GSGAGITIESPDKMIIEQ 1167
G G P+ ++ Q
Sbjct: 418 GMGIVHATYKPEYQVLNQ 435
>POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1165
Score = 101 bits (252), Expect = 2e-20
Identities = 95/467 (20%), Positives = 189/467 (40%), Gaps = 19/467 (4%)
Query: 680 LDLF--AWTINDVPGIDPKVITHKLAIRPGATPVIQPRRRMSEEKNKAVQLETEKLIKAR 737
L LF W G+ +V + +R GA+PV + MS+E + ++ +K +
Sbjct: 143 LQLFPTVWAERAGMGLANQVPPVVVELRSGASPVAVRQYPMSKEAREGIRPHIQKFLDLG 202
Query: 738 FIREVQYPTWLANVVMVKK-ANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNEL 796
+ + P W ++ VKK +R D +NK +PN L+ +
Sbjct: 203 VLVPCRSP-WNTPLLPVKKPGTNDYRPVQDLREINKRVQDIHPTVPNPYNLLSSLPPSYT 261
Query: 797 -LSLMDAYSGYNQIMMHPSDEESTTF------MTNQANYCYKTMPFGLKNAGATYQRLMD 849
S++D + + +HP+ + F N + +P G KN+ + +
Sbjct: 262 WYSVLDLKDAFFCLRLHPNSQPLFAFEWKDPEKGNTGQLTWTRLPQGFKNSPTLFDEALH 321
Query: 850 KIFSKQVGRNMEV----YVDDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQ 905
+ + N +V YVDD++V + D ++ +L +++ +K +
Sbjct: 322 RDLAPFRALNPQVVLLQYVDDLLVAAPTYEDCKKGTQKLLQELSKLGYRVSAKKAQLCQR 381
Query: 906 GGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRLTGRMAALSRFLPMAGDKAAPF 965
+LG++L + P + ++++ PT+ ++V+ G ++P AAP
Sbjct: 382 EVTYLGYLLKEGKRWLTPARKATVMKIPVPTTPRQVREFLGTAGFCRLWIPGFASLAAPL 441
Query: 966 FTCLKKNSKFQWTEECEQAFTKLKETLATLPVLSKPTPSVPLVLYLAVTDKAVSTVLLQE 1025
+ K++ F WTEE +QAF +K+ L + P L+ P + P LY+ VL Q
Sbjct: 442 YPLTKESIPFIWTEEHQQAFDHIKKALLSAPALALPDLTKPFTLYIDERAGVARGVLTQT 501
Query: 1026 EGKKQKVIYFVSHTLQGAELRYQKIEKAALAILKTARRLRPYFQSFQVKIKTDVPLRQVL 1085
G ++ + ++S L + KA A+ + V + L ++
Sbjct: 502 LGPWRRPVAYLSKKLDPVASGWPTCLKAVAAVALLLKDADKLTLGQNVTVIASHSLESIV 561
Query: 1086 QKPD----LSGRLVSWSVELSEYDIQYEPRGQVTVQSLIDFVAELTP 1128
++P + R+ + L + + P + +L+ +E TP
Sbjct: 562 RQPPDRWMTNARMTHYQSLLLNERVSFAPPAVLNPATLLPVESEATP 608
>POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1161
Score = 100 bits (250), Expect = 3e-20
Identities = 105/492 (21%), Positives = 198/492 (39%), Gaps = 35/492 (7%)
Query: 697 VITHKLAIRPGATPVIQPRRRMSEEKNKAVQLETEKLIKARFIREVQYPTWLANVVMVKK 756
+ T LA RP I P+ + S +Q+ + L+K + + Q T V V K
Sbjct: 167 IATGTLAPRPQKQYPINPKAKPS------IQIVIDDLLKQGVLIQ-QNSTMNTPVYPVPK 219
Query: 757 ANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDE 816
+GKWRM DY +NK P + + ++ + + +D +G+ HP
Sbjct: 220 PDGKWRMVLDYREVNKTIPLIAAQNQHSAGILSSIYRGKYKTTLDLTNGF---WAHPITP 276
Query: 817 ES---TTFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYVDDMIVKSAR 873
ES T F YC+ +P G N+ A + D + + N++ YVDD+ +
Sbjct: 277 ESYWLTAFTWQGKQYCWTRLPQGFLNSPALFTA--DVVDLLKEIPNVQAYVDDIYISHDD 334
Query: 874 ASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPDKGRAILEMK 933
+H L++ F L ++ +K + +FLGF +T G + + +L +
Sbjct: 335 PQEHLEQLEKIFSILLNAGYVVSLKKSEIAQREVEFLGFNITKEGRGLTDTFKQKLLNIT 394
Query: 934 SPTSVKEVQRLTGRMAALSRFLPMAGDKAAPFFTCL-KKNSKF-QWTEECEQAFTKLKET 991
P +K++Q + G + F+P + P +T + N KF WTE+ +
Sbjct: 395 PPKDLKQLQSILGLLNFARNFIPNYSELVKPLYTIVANANGKFISWTEDNSNQLQHIISV 454
Query: 992 LATLPVLSKPTPSVPLVLYLAVTDKAVSTVLLQEEGKKQKVIYFVSHTLQGAELRYQKIE 1051
L L + P L++ + + A + EG K+ ++Y V++ AE ++ + E
Sbjct: 455 LNQADNLEERNPETRLIIKVNSSPSA-GYIRYYNEGSKRPIMY-VNYIFSKAEAKFTQTE 512
Query: 1052 KAALAILKTARRLRPYFQSFQVKIKTDVPLRQVLQKPDLSG------RLVSWSVELSEYD 1105
K + K + ++ + + + +Q+ L R ++W L +
Sbjct: 513 KLLTTMHKGLIKAMDLAMGQEILVYSPIVSMTKIQRTPLPERKALPVRWITWMTYLEDPR 572
Query: 1106 IQYE-PRGQVTVQSLIDFVAELTPTEGEKTQGEWVLSVDGS---------SNNTGSGAGI 1155
IQ+ + +Q + + ++ ++ V DGS S++ G G
Sbjct: 573 IQFHYDKSLPELQQIPNVTEDVIAKTKHPSEFAMVFYTDGSAIKHPDVNKSHSAGMGIAQ 632
Query: 1156 TIESPDKMIIEQ 1167
P+ I+ Q
Sbjct: 633 VQFIPEYKIVHQ 644
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.324 0.139 0.418
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 196,268,927
Number of Sequences: 164201
Number of extensions: 8668786
Number of successful extensions: 31774
Number of sequences better than 10.0: 168
Number of HSP's better than 10.0 without gapping: 59
Number of HSP's successfully gapped in prelim test: 112
Number of HSP's that attempted gapping in prelim test: 31468
Number of HSP's gapped (non-prelim): 325
length of query: 1706
length of database: 59,974,054
effective HSP length: 124
effective length of query: 1582
effective length of database: 39,613,130
effective search space: 62667971660
effective search space used: 62667971660
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 73 (32.7 bits)
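Note: the effective lengths and search space reported above can be re-derived from the other figures in this report. The sketch below is only an arithmetic check, assuming the usual BLAST convention that one effective HSP length is subtracted from the query and from every database sequence. The variable names are illustrative and are not part of BLAST.

```python
# Values copied from the statistics block of this report.
query_length = 1706
db_letters = 59_974_054
db_sequences = 164_201
effective_hsp_length = 124

# Assumption: the query and each database sequence lose one
# effective HSP length from their usable span.
effective_query_length = query_length - effective_hsp_length
effective_db_length = db_letters - db_sequences * effective_hsp_length

# The search space is the product of the two effective lengths.
effective_search_space = effective_query_length * effective_db_length

print(effective_query_length)   # 1582
print(effective_db_length)      # 39613130
print(effective_search_space)   # 62667971660
```

The three printed values match the "effective length of query", "effective length of database" and "effective search space" lines above.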