
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0114.12
(1414 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 202 4e-51
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 202 4e-51
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 201 2e-50
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 182 7e-45
POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse transcript... 175 9e-43
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 172 4e-42
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 161 1e-38
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 158 1e-37
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 158 1e-37
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 144 2e-33
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 130 3e-29
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 130 3e-29
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 130 3e-29
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 127 2e-28
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot... 127 2e-28
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 126 4e-28
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot... 121 1e-26
YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II 118 1e-25
POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23... 104 2e-21
POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC 3.4.23... 102 1e-20
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 202 bits (515), Expect = 4e-51
Identities = 126/414 (30%), Positives = 210/414 (50%), Gaps = 9/414 (2%)
Query: 409 LAIRPGATPVIQPMRRMSEEKHKAVQLETEKLIKARFIREVQYPTWLANVVMVKKANGKW 468
+ ++ GA P+ Q R + ++ +K++ + IRE + P W + VV+VKK +G
Sbjct: 934 IELKEGAEPIRQKPRPIPLALKPEIRKMIQKMLNQKVIRESKSP-WSSPVVLVKKKDGSI 992
Query: 469 RMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTAF 528
RMC DY +NKV +++PLPN++ + +G +L ++ D +G+ QI + +E TAF
Sbjct: 993 RMCIDYRKVNKVVKNNAHPLPNIEATLQSLAGKKLYTVFDMIAGFWQIPLDEKSKEITAF 1052
Query: 529 MTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDL 588
+ + +PFGL + A +Q M++I +G VYVDD+++ S H D+
Sbjct: 1053 AIGSELFEWNVLPFGLVISPALFQGTMEEIIGDLLGVCAFVYVDDLLIASKDMEQHLQDV 1112
Query: 589 KEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEV 648
KEA ++R MKL KC + ++LG +T G+E K + + PT+VKE+
Sbjct: 1113 KEALTRIRKSGMKLRASKCHIAKKEVEYLGHKVTLDGVETQEVKTDKMKQFSRPTNVKEL 1172
Query: 649 QRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEECEQAFTKLKETLATLPVLSKP 708
Q G + +F+ A+ + + + W +E E AF +LK+ + PVL++P
Sbjct: 1173 QSFLGLVGYYRKFILNFAQIASSLTSLISAKVAWIWEKEQEIAFQELKKLVCQTPVLAQP 1232
Query: 709 TPGV------PLVLYLAVTDKAVSTVLLQE-EGKKQKVIYFVSHTLQGAELRYQKIEKAA 761
P ++Y + K + VL QE +Q I F S L AE RY + A
Sbjct: 1233 DVEAALKGDRPFMIYTDASRKGIGAVLAQEGPDGQQHPIAFASKALSPAETRYHITDLEA 1292
Query: 762 LAILKTARRLRPYFQSFQVKVKTD-VPLRQVLQKPDLSGRLVSWSVELSEYDIQ 814
LA++ RR + + V TD PL +L+ L+ RL WS+E+ E+D++
Sbjct: 1293 LAMMFALRRFKTIIYGTAITVFTDHKPLISLLKGSPLADRLWRWSIEILEFDVK 1346
Score = 110 bits (274), Expect = 4e-23
Identities = 144/606 (23%), Positives = 244/606 (39%), Gaps = 64/606 (10%)
Query: 815 YEPRGQVTVQSLIDFVAELT----PTEGEKTQGEWVLSVDGSSNNTGSGAGITIESPDKM 870
+E ++ Q L V + P +G+ + ++ G GA + E PD
Sbjct: 1208 WEKEQEIAFQELKKLVCQTPVLAQPDVEAALKGDRPFMIYTDASRKGIGAVLAQEGPDGQ 1267
Query: 871 IIEQSLKFEFKA-SNNQSEYEAL-IAGLRLAIELGVQKLFIKG-------DSQLVVKQVK 921
+ + F KA S ++ Y + L + L K I G D + ++ +K
Sbjct: 1268 --QHPIAFASKALSPAETRYHITDLEALAMMFALRRFKTIIYGTAITVFTDHKPLISLLK 1325
Query: 922 GEYQVKDPQLSKYLEVVRRLMMEVKE--IKIEHVPRGQNERADVLAKLA---------ST 970
G S + + R +E+ E +KI ++ N AD L++ T
Sbjct: 1326 G---------SPLADRLWRWSIEILEFDVKIVYLAGKANAVADALSRGGCPPNELEEEQT 1376
Query: 971 GRLGNYQTVIQETLPRPSIDLVEIK--LKVVKSVNEGELPWMESIKTFLENPPKEDDLNT 1028
L + IQ LP D+++ L+ +K +EG W E I LE +
Sbjct: 1377 KELTSIVNAIQTELP----DILDSSCWLERLKGEDEG---WKEVIAA-LEGGKTKGTFKI 1428
Query: 1029 RTKRREAS--FYTLVDGELYRRGIMSPMLKCVDTKDALGIMAEVHEGVCSSHIGGRSLAV 1086
E S +Y +V G L I V K ++ E+HEG+ + H G + +
Sbjct: 1429 VGIESEISLEYYKIVGGVLKNTEIEEQSRSVVPEKIRTPLLKELHEGMLAGHFGIKKMW- 1487
Query: 1087 KVIRAGFYWPTMKKDCLEYVKKCEKCQVFSDLHKAPPEELTTMMAPWPFAMWGTDILGPF 1146
+++ FYWP M+ V+ C KC +D H LT +P + D++
Sbjct: 1488 RMVHRKFYWPQMRVCVENCVRTCAKCLCAND-HSKLTSSLTPYRMTFPLEIVACDLMD-V 1545
Query: 1147 PVAKAQMKYIIVAVDYFTKWIEAEAVATITAAKVRNFLWQRIVCRFG-VPMALVMDNGTQ 1205
++ +YI+ +D FTK+ A + A V +R G +P+ L+ D G +
Sbjct: 1546 GLSVQGNRYILTIIDLFTKYGTAVPIPDKKAETVLKAFVERWAIGEGRIPLKLLTDQGKE 1605
Query: 1206 FTSSVTREFCAEMGIEMRFASVEHPQTNGQAESANKVILKGLKKKLDEAKGLWAEELPGV 1265
F + + +F + IE + + NG E NK I+ +KKK W +++
Sbjct: 1606 FVNGLFAQFTHMLKIEHITTKGYNSRANGAVERFNKTIMHIMKKKTAVPME-WDDQVVYA 1664
Query: 1266 LWAYNTTEQSSTKETPYRLTYGTDAMLSVEIENQSWRVARFNENDNGENLIANLIMLPEE 1325
++AYN +T ETP L +G D M +E+ + + + D ++L+ ++ ++
Sbjct: 1665 VYAYNNCVHENTGETPMFLMHGRDVMGPLEMSGEDAVGINYADMDEYKHLLTQELLKVQK 1724
Query: 1326 QREAHIRNEAGKVKVARKFSTKVVPRKMRV---GDLVL*KNTIPDKH-----NKLSPNWG 1377
+ H E K F K +K R G VL + IP + KL W
Sbjct: 1725 IAKEHAMREQESYK--SLFDQKYASKKHRFPQPGSRVLLE--IPSEKLGAQCPKLVNKWS 1780
Query: 1378 GPYRII 1383
GPYR+I
Sbjct: 1781 GPYRVI 1786
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 202 bits (515), Expect = 4e-51
Identities = 124/363 (34%), Positives = 182/363 (49%), Gaps = 6/363 (1%)
Query: 452 PTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYS 511
P W+ K+R+ DY LN++ D +P+PN+D+++ + +D
Sbjct: 246 PIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDRHPIPNMDEILGKLGRCNYFTTIDLAK 305
Query: 512 GYNQIMMHPSDEESTAFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYV 571
G++QI M P TAF T +Y Y MPFGLKNA AT+QR M+ I + ++ VY+
Sbjct: 306 GFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKNAPATFQRCMNDILRPLLNKHCLVYL 365
Query: 572 DDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPD 631
DD+IV S +H L F++L +KL +KC F Q FLG +LT GI+ NP+
Sbjct: 366 DDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDKCEFLKQETTFLGHVLTPDGIKPNPE 425
Query: 632 KGRAILEMKSPTSVKEVQRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTE-ECEQ 690
K AI + PT KE++ G +F+P D A P CLKKN K T E +
Sbjct: 426 KIEAIQKYPIPTKPKEIKAFLGLTGYYRKFIPNFADIAKPMTKCLKKNMKIDTTNPEYDS 485
Query: 691 AFTKLKETLATLPVLSKPTPGVPLVLYLAVTDKAVSTVLLQEEGKKQKVIYFVSHTLQGA 750
AF KLK ++ P+L P L +D A+ VL Q+ + ++S TL
Sbjct: 486 AFKKLKYLISEDPILKVPDFTKKFTLTTDASDVALGAVLSQD----GHPLSYISRTLNEH 541
Query: 751 ELRYQKIEKAALAILKTARRLRPYFQSFQVKVKTD-VPLRQVLQKPDLSGRLVSWSVELS 809
E+ Y IEK LAI+ + R Y ++ +D PL + + D + +L W V+LS
Sbjct: 542 EINYSTIEKELLAIVWATKTFRHYLLGRHFEISSDHQPLSWLYRMKDPNSKLTRWRVKLS 601
Query: 810 EYD 812
E+D
Sbjct: 602 EFD 604
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 201 bits (510), Expect = 2e-50
Identities = 128/408 (31%), Positives = 201/408 (48%), Gaps = 12/408 (2%)
Query: 416 TPVIQPMRRMSEEKHKAVQLETEKLIKARFIRE----VQYPTWLANVVMVKKANGKWRMC 471
+P+ +++ V+ + ++++ IRE PTW+ K+R+
Sbjct: 205 SPIYSKQYPLAQTHEIEVENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVV 264
Query: 472 TDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTAFMTN 531
DY LN++ D YP+PN+D+++ + + +D G++QI M TAF T
Sbjct: 265 IDYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTK 324
Query: 532 QANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDLKEA 591
+Y Y MPFGL+NA AT+QR M+ I + ++ VY+DD+I+ S ++H ++
Sbjct: 325 SGHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLV 384
Query: 592 FDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRL 651
F +L +KL +KC F + FLG ++T GI+ NP K +AI+ PT KE++
Sbjct: 385 FTKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAF 444
Query: 652 TGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEECE--QAFTKLKETLATLPVLSKPT 709
G +F+P D A P +CLKK +K T++ E +AF KLK + P+L P
Sbjct: 445 LGLTGYYRKFIPNYADIAKPMTSCLKKRTKID-TQKLEYIEAFEKLKALIIRDPILQLPD 503
Query: 710 PGVPLVLYLAVTDKAVSTVLLQEEGKKQKVIYFVSHTLQGAELRYQKIEKAALAILKTAR 769
VL ++ A+ VL Q I F+S TL EL Y IEK LAI+ +
Sbjct: 504 FEKKFVLTTDASNLALGAVLSQ----NGHPISFISRTLNDHELNYSAIEKELLAIVWATK 559
Query: 770 RLRPYFQSFQVKVKTD-VPLRQVLQKPDLSGRLVSWSVELSEYDIQYE 816
R Y Q + +D PLR + + +L W V LSEY + +
Sbjct: 560 TFRHYLLGRQFLIASDHQPLRWLHNLKEPGAKLERWRVRLSEYQFKID 607
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 182 bits (461), Expect = 7e-45
Identities = 128/406 (31%), Positives = 205/406 (49%), Gaps = 23/406 (5%)
Query: 433 VQLETEKLIKARFIRE----VQYPTWLANVVMVKKANGK--WRMCTDYTSLNKVCPKDSY 486
V+ + ++L++ IR P W+ V K NG+ +RM D+ LN V D+Y
Sbjct: 139 VERQIDELLQDGIIRPSNSPYNSPIWI--VPKKPKPNGEKQYRMVVDFKRLNTVTIPDTY 196
Query: 487 PLPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTAFMTNQANYCYKTMPFGLKN 546
P+P+++ + + + +D SG++QI M SD TAF T Y + +PFGLKN
Sbjct: 197 PIPDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPFGLKN 256
Query: 547 AGATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEK 606
A A +QR++D I + +G+ VY+DD+IV S H +L+ L +++N EK
Sbjct: 257 APAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNLEK 316
Query: 607 CSFGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRLTGRMAALSRFLPMAG 666
F +FLG+++T+ GI+ +P K RAI EM PTSVKE++R G + +F+
Sbjct: 317 SHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYRKFIQDYA 376
Query: 667 DKAAPFFTCLK---------KNSKFQWT--EECEQAFTKLKETLATLPVLSKPTPGVPLV 715
A P + ++SK T E Q+F LK L + +L+ P P
Sbjct: 377 KVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKPFH 436
Query: 716 LYLAVTDKAVSTVLLQEEGKKQKVIYFVSHTLQGAELRYQKIEKAALAILKTARRLRPY- 774
L ++ A+ VL Q++ + + I ++S +L E Y IEK LAI+ + LR Y
Sbjct: 437 LTTDASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLDNLRAYL 496
Query: 775 FQSFQVKVKTD-VPLRQVLQKPDLSGRLVSWSVELSEYDIQ--YEP 817
+ + +KV TD PL L + + +L W + EY+ + Y+P
Sbjct: 497 YGAGTIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELIYKP 542
Score = 44.3 bits (103), Expect = 0.002
Identities = 41/207 (19%), Positives = 83/207 (39%), Gaps = 10/207 (4%)
Query: 1077 SHIGGRSLAVKVIRAGFYWPTMKKDCLEYVKKCEKCQVFS-DLHKAPPEELTTMMAPWPF 1135
+H G + ++++ +Y+P M C+ C+++ + H P T + +P
Sbjct: 704 AHRGPTEIRLQLLEK-YYFPRMSSTIRLQTSSCQCCKLYKYERHPNKPNLQPTPIPNYPC 762
Query: 1136 AMWGTDILGPFPVAKAQMKYIIVAVDYFTKWIEAEAVATITAAKVRNFLWQRIVCRFGVP 1195
+ DI + + + +D F+K+ + + + + +R L + + F P
Sbjct: 763 EILHIDIFA------LEKRLYLSCIDKFSKFAKLFHLQSKASVHLRETLVEALHY-FTAP 815
Query: 1196 MALVMDNGTQFTSSVTREFCAEMGIEMRFASVEHPQTNGQAESANKVILKGLKKKLDEAK 1255
LV DN + + I++ +A + + NGQ E + L+ + DE
Sbjct: 816 KVLVSDNERGLLCPTVLNYLRSLDIDLYYAPTQKSEVNGQVERFHSTFLEIYRCLKDELP 875
Query: 1256 GLWAEELPGV-LWAYNTTEQSSTKETP 1281
EL + + YNT+ S T P
Sbjct: 876 TFKPVELVHIAVDRYNTSVHSVTNRKP 902
>POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)]
Length = 886
Score = 175 bits (443), Expect = 9e-43
Identities = 201/881 (22%), Positives = 362/881 (40%), Gaps = 105/881 (11%)
Query: 458 VVMVKKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYSGYNQIM 517
V V K +G+WRM DY +NK P + + ++ + + +D +G+
Sbjct: 5 VYPVPKPDGRWRMVLDYREVNKTIPLTAAQNQHSAGILATIVRQKYKTTLDLANGF---W 61
Query: 518 MHPSDEES---TAFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYVDDM 574
HP ES TAF YC+ +P G N+ A + D + + N++VYVDD+
Sbjct: 62 AHPITPESYWLTAFTWQGKQYCWTRLPQGFLNSPALFTA--DVVDLLKEIPNVQVYVDDI 119
Query: 575 IVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPDKGR 634
+ +H L++ F L ++ +K G + +FLGF +T G +
Sbjct: 120 YLSHDDPKEHVQQLEKVFQILLQAGYVVSLKKSEIGQKTVEFLGFNITKEGRGLTDTFKT 179
Query: 635 AILEMKSPTSVKEVQRLTGRMAALSRFLPMAGDKAAPFFTCLK--KNSKFQWTEECEQAF 692
+L + P +K++Q + G + F+P + P + + K +W+EE +
Sbjct: 180 KLLNITPPKDLKQLQSILGLLNFARNFIPNFAELVQPLYNLIASAKGKYIEWSEENTKQL 239
Query: 693 TKLKETLATLPVLSKPTPGVPLVLYLAVTDKAVSTVLLQEEGKKQKVIYFVSHTLQGAEL 752
+ E L T L + P LV+ + + A E GKK I ++++ AEL
Sbjct: 240 NMVIEALNTASNLEERLPEQRLVIKVNTSPSAGYVRYYNETGKKP--IMYLNYVFSKAEL 297
Query: 753 RYQKIEKAALAILKTARRLRPYFQSFQVKVKTDVPLRQVLQKPDLSGRL------VSWSV 806
++ +EK + K + ++ V + + +QK L R ++W
Sbjct: 298 KFSMLEKLLTTMHKALIKAMDLAMGQEILVYSPIVSMTKIQKTPLPERKALPIRWITWMT 357
Query: 807 ELSEYDIQYE-PRGQVTVQSLIDFVAELTPTEGEKTQGEWVLSVDGS---------SNNT 856
L + IQ+ + ++ + D +Q E V DGS SNN
Sbjct: 358 YLEDPRIQFHYDKTLPELKHIPDVYTSSQSPVKHPSQYEGVFYTDGSAIKSPDPTKSNNA 417
Query: 857 GSGAGITIESPDKMIIEQSLKFEFKASNNQSEYEALIAGLRLAIELGVQ---KLFIKGDS 913
G G P+ ++ ++ N+ ++ A IA + A + ++ + + DS
Sbjct: 418 GMGIVHATYKPEYQVLN---QWSIPLGNHTAQM-AEIAAVEFACKKALKIPGPVLVITDS 473
Query: 914 QLVVKQVKGEYQV----------KDP--QLSKYLEVVRRLMMEVKEIKIEHVPRGQNERA 961
V + E K P +SK+ + L M+ +I I+H +G + +
Sbjct: 474 FYVAESANKELPYWKSNGFVNNKKKPLKHISKWKSIAECLSMK-PDITIQH-EKGISLQI 531
Query: 962 DVLA--------KLASTGRLGNYQTVIQETLPRPSIDLVEIKLKVVKSVNEGELPWMESI 1013
V KLA+ G V+ +P++D + + +G
Sbjct: 532 PVFILKGNALADKLATQGSY-----VVNCNTKKPNLDAE------LDQLLQGH------- 573
Query: 1014 KTFLENPPKEDDLNTRTKRREASFYTLVDGELYRRGIMSPM-LKCVDTK-DALGIMAEVH 1071
+++ PK+ Y L DG++ + P +K + + D I+ + H
Sbjct: 574 --YIKGYPKQYT------------YFLEDGKVK---VSRPEGVKIIPPQSDRQKIVLQAH 616
Query: 1072 EGVCSSHIGGRSLAVKVIRAGFYWPTMKKDCLEYVKKCEKCQVFSDLHKAPPEELTTMMA 1131
+H G + +K+ ++WP M+KD ++ + +C++C + + +KA L
Sbjct: 617 N---LAHTGREATLLKIANL-YWWPNMRKDVVKQLGRCQQCLITNASNKASGPILRPDRP 672
Query: 1132 PWPFAMWGTDILGPFPVAKAQMKYIIVAVDYFT--KWIEAEAVATITAAKVRNFLWQRIV 1189
PF + D +GP P ++ + Y++V VD T W+ A T+A V++ ++
Sbjct: 673 QKPFDKFFIDYIGPLPPSQGYL-YVLVVVDGMTGFTWLYPTK-APSTSATVKSL---NVL 727
Query: 1190 CRFGVPMALVMDNGTQFTSSVTREFCAEMGIEMRFASVEHPQTNGQAESANKVILKGLKK 1249
+P + D G FTSS E+ E GI + F++ HPQ+ + E N I + L K
Sbjct: 728 TSIAIPKVIHSDQGAAFTSSTFAEWAKERGIHLEFSTPYHPQSGSKVERKNSDIKRLLTK 787
Query: 1250 KLDEAKGLWAEELPGVLWAYNTTEQSSTKETPYRLTYGTDA 1290
L W + LP V A N T K TP++L +G D+
Sbjct: 788 LLVGRPTKWYDLLPVVQLALNNTYSPVLKYTPHQLLFGIDS 828
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 172 bits (437), Expect = 4e-42
Identities = 119/406 (29%), Positives = 176/406 (43%), Gaps = 6/406 (1%)
Query: 417 PVIQPMRRMSEEKHKAVQLETEKLIKARFIREV--QYPTWLANVVMVKKANG---KWRMC 471
PV R + + +Q + +KLIK + + QY + L V N KWR+
Sbjct: 314 PVYTKNYRSPHSQVEEIQAQVQKLIKDKIVEPSVSQYNSPLLLVPKKSSPNSDKKKWRLV 373
Query: 472 TDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTAFMTN 531
DY +NK D +PLP +D ++D + S +D SG++QI + + T+F T+
Sbjct: 374 IDYRQINKKLLADKFPLPRIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTS 433
Query: 532 QANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDLKEA 591
+Y + +PFGLK A ++QR+M FS +Y+DD+IV +L E
Sbjct: 434 NGSYRFTRLPFGLKIAPNSFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEV 493
Query: 592 FDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRL 651
F + R Y +KL+PEKCSF + FLG T +GI + K I P +R
Sbjct: 494 FGKCREYNLKLHPEKCSFFMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRF 553
Query: 652 TGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEECEQAFTKLKETLATLPVLSKPTPG 711
RF+ D + KKN F+WT+EC++AF LK L +L P
Sbjct: 554 VAFCNYYRRFIKNFADYSRHITRLCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFS 613
Query: 712 VPLVLYLAVTDKAVSTVLLQEEGKKQKVIYFVSHTLQGAELRYQKIEKAALAILKTARRL 771
+ + +A VL Q Q + + S E E+ AI
Sbjct: 614 KEFCITTDASKQACGAVLTQNHNGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHF 673
Query: 772 RPYFQSFQVKVKTD-VPLRQVLQKPDLSGRLVSWSVELSEYDIQYE 816
RPY VKTD PL + + S +L +EL EY+ E
Sbjct: 674 RPYIYGKHFTVKTDHRPLTYLFSMVNPSSKLTRIRLELEEYNFTVE 719
Score = 110 bits (276), Expect = 2e-23
Identities = 78/338 (23%), Positives = 148/338 (43%), Gaps = 8/338 (2%)
Query: 1050 IMSPMLKCVDTKDALGIMAEVHEGVCSSHIGGRSLAVKVIRAGFYWPTMKKDCLEYVKKC 1109
+++P+ + + K+ I++ +H+ G + + ++ +YW M K EYV+KC
Sbjct: 880 LLNPVTQINNEKEKEAILSTLHDDPIQGGHTGITKTLAKVKRHYYWKNMSKYIKEYVRKC 939
Query: 1110 EKCQVFSDLHKAPPEELTTMMAPWPFAMWGTDILGPFPVAKAQMKYIIVAVDYFTKWIEA 1169
+KCQ T F D +GP P ++ +Y + + TK++ A
Sbjct: 940 QKCQKAKTTKHTKTPMTITETPEHAFDRVVVDTIGPLPKSENGNEYAVTLICDLTKYLVA 999
Query: 1170 EAVATITAAKVRNFLWQRIVCRFGVPMALVMDNGTQFTSSVTREFCAEMGIEMRFASVEH 1229
+A +A V +++ + ++G + D GT++ +S+ + C + I+ ++ H
Sbjct: 1000 IPIANKSAKTVAKAIFESFILKYGPMKTFITDMGTEYKNSIITDLCKYLKIKNITSTAHH 1059
Query: 1230 PQTNGQAESANKVILKGLKKKLDEAKGLWAEELPGVLWAYNTTEQSSTKETPYRLTYGTD 1289
QT G E +++ + + ++ + K W L ++ +NTT+ PY L +G
Sbjct: 1060 HQTVGVVERSHRTLNEYIRSYISTDKTDWDVWLQYFVYCFNTTQSMVHNYCPYELVFGRT 1119
Query: 1290 AMLSVEIENQSWRVARFNENDNGENLIANLIMLPEEQREAHIRNEAGKVKVARKFSTKVV 1349
+ L +N +D + + L A EA K K + KV
Sbjct: 1120 SNLPKHFNKLHSIEPIYNIDDYAKE---SKYRLEVAYARARKLLEAHKEKNKENYDLKVK 1176
Query: 1350 PRKMRVGDLVL*KNTIPDKHNKLSPNWGGPYRI--IGD 1385
++ VGD VL +N + +KL + GPY+I IGD
Sbjct: 1177 DIELEVGDKVLLRNEV---GHKLDFKYTGPYKIESIGD 1211
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 161 bits (408), Expect = 1e-38
Identities = 109/397 (27%), Positives = 192/397 (47%), Gaps = 9/397 (2%)
Query: 429 KHKAVQLETEKLIKARFIREVQYPTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPL 488
K +A+ E + +K+ IRE + V+ V K G RM DY LNK + YPL
Sbjct: 424 KMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPL 482
Query: 489 PNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTAFMTNQANYCYKTMPFGLKNAG 548
P +++L+ G+ + + +D S Y+ I + DE AF + + Y MP+G+ A
Sbjct: 483 PLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAP 542
Query: 549 ATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCS 608
A +Q ++ I + ++ Y+DD+++ S S+H +K+ +L+ + +N KC
Sbjct: 543 AHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCE 602
Query: 609 FGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRLTGRMAALSRFLPMAGDK 668
F KF+G+ ++ +G + +L+ K P + KE+++ G + L +F+P
Sbjct: 603 FHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQL 662
Query: 669 AAPFFTCLKKNSKFQWTEECEQAFTKLKETLATLPVLSKPTPGVPLVLYLAVTDKAVSTV 728
P LKK+ +++WT QA +K+ L + PVL ++L +D AV V
Sbjct: 663 THPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAV 722
Query: 729 LLQE-EGKKQKVIYFVSHTLQGAELRYQKIEKAALAILKTARRLRPYFQSF--QVKVKTD 785
L Q+ + K + + S + A+L Y +K LAI+K+ + R Y +S K+ TD
Sbjct: 723 LSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTD 782
Query: 786 ---VPLRQVLQKPDLSGRLVSWSVELSE--YDIQYEP 817
+ R + + RL W + L + ++I Y P
Sbjct: 783 HRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRP 819
Score = 124 bits (311), Expect = 2e-27
Identities = 132/556 (23%), Positives = 236/556 (41%), Gaps = 61/556 (10%)
Query: 877 KFEFKASNNQSEYEALIAGL---RLAIELGVQKLFIKGDSQLVVKQVKGEYQVKDPQLSK 933
K + S + E A+I L R +E ++ I D + ++ ++ E + ++ +L++
Sbjct: 744 KAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLAR 803
Query: 934 YLEVVRRLMMEVKEIKIEHVPRGQNERADVLAKLASTGRLGNYQTVIQETLPRPSIDLVE 993
+ +L ++ +I + P N AD L++ ++ ET P P D +
Sbjct: 804 W-----QLFLQDFNFEINYRPGSANHIADALSR------------IVDETEPIPK-DSED 845
Query: 994 IKLKVVKSVNEGELPWMESIKTFLENPPKEDDLNTRTKRREASFYTLVDGELYRRGIMSP 1053
+ V ++ + + + + + + LN KR E + L DG L
Sbjct: 846 NSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQ-LKDGLLINS--KDQ 902
Query: 1054 MLKCVDTKDALGIMAEVHEGVCSSHIGGRSLAVKVIRAGFYWPTMKKDCLEYVKKCEKCQ 1113
+L DT+ I+ + HE H G L +I F W ++K EYV+ C CQ
Sbjct: 903 ILLPNDTQLTRTIIKKYHEEGKLIH-PGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQ 961
Query: 1114 V--------FSDLHKAPPEELTTMMAPWPFAMWGTDILGPFPVAKAQMKYIIVAVDYFTK 1165
+ + L PP E P+ D + P + + V VD F+K
Sbjct: 962 INKSRNHKPYGPLQPIPPSER-------PWESLSMDFITALPESSGY-NALFVVVDRFSK 1013
Query: 1166 W-IEAEAVATITAAKVRNFLWQRIVCRFGVPMALVMDNGTQFTSSVTREFCAEMGIEMRF 1224
I +ITA + QR++ FG P ++ DN FTS ++F + M+F
Sbjct: 1014 MAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKF 1073
Query: 1225 ASVEHPQTNGQAESANKVILKGLKKKLDEAKGLWAEELPGVLWAYNTTEQSSTKETPYRL 1284
+ PQT+GQ E N+ + K L+ W + + V +YN S+T+ TP+ +
Sbjct: 1074 SLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEI 1133
Query: 1285 TYGTDAMLS-VEIENQSWRVARFNENDNGENLIANLIMLPEEQREAHIRNEAGKVKVARK 1343
+ LS +E+ + S + ++N + I + E H+ +K+ +
Sbjct: 1134 VHRYSPALSPLELPSFSDKT-----DENSQETIQVFQTVKE-----HL--NTNNIKMKKY 1181
Query: 1344 FSTKVVP-RKMRVGDLVL*KNT---IPDKHNKLSPNWGGPYRIIGDVGGEAYKLEQLSGQ 1399
F K+ + + GDLV+ K T K NKL+P++ GP+ ++ G Y+L+
Sbjct: 1182 FDMKIQEIEEFQPGDLVMVKRTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSI 1241
Query: 1400 K--VPRTWNASHLKQY 1413
K T++ SHL++Y
Sbjct: 1242 KHMFSSTFHVSHLEKY 1257
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 158 bits (399), Expect = 1e-37
Identities = 108/397 (27%), Positives = 192/397 (48%), Gaps = 9/397 (2%)
Query: 429 KHKAVQLETEKLIKARFIREVQYPTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPL 488
K +A+ E + +K+ IRE + V+ V K G RM DY LNK + YPL
Sbjct: 424 KMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPL 482
Query: 489 PNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTAFMTNQANYCYKTMPFGLKNAG 548
P +++L+ G+ + + +D S Y+ I + DE AF + + Y MP+G+ A
Sbjct: 483 PLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAP 542
Query: 549 ATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCS 608
A +Q ++ I + ++ Y+D++++ S S+H +K+ +L+ + +N KC
Sbjct: 543 AHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCE 602
Query: 609 FGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRLTGRMAALSRFLPMAGDK 668
F KF+G+ ++ +G + +L+ K P + KE+++ G + L +F+P
Sbjct: 603 FHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQL 662
Query: 669 AAPFFTCLKKNSKFQWTEECEQAFTKLKETLATLPVLSKPTPGVPLVLYLAVTDKAVSTV 728
P LKK+ +++WT QA +K+ L + PVL ++L +D AV V
Sbjct: 663 THPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAV 722
Query: 729 LLQE-EGKKQKVIYFVSHTLQGAELRYQKIEKAALAILKTARRLRPYFQSF--QVKVKTD 785
L Q+ + K + + S + A+L Y +K LAI+K+ + R Y +S K+ TD
Sbjct: 723 LSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTD 782
Query: 786 ---VPLRQVLQKPDLSGRLVSWSVELSE--YDIQYEP 817
+ R + + RL W + L + ++I Y P
Sbjct: 783 HRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRP 819
Score = 124 bits (311), Expect = 2e-27
Identities = 132/556 (23%), Positives = 236/556 (41%), Gaps = 61/556 (10%)
Query: 877 KFEFKASNNQSEYEALIAGL---RLAIELGVQKLFIKGDSQLVVKQVKGEYQVKDPQLSK 933
K + S + E A+I L R +E ++ I D + ++ ++ E + ++ +L++
Sbjct: 744 KAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLAR 803
Query: 934 YLEVVRRLMMEVKEIKIEHVPRGQNERADVLAKLASTGRLGNYQTVIQETLPRPSIDLVE 993
+ +L ++ +I + P N AD L++ ++ ET P P D +
Sbjct: 804 W-----QLFLQDFNFEINYRPGSANHIADALSR------------IVDETEPIPK-DSED 845
Query: 994 IKLKVVKSVNEGELPWMESIKTFLENPPKEDDLNTRTKRREASFYTLVDGELYRRGIMSP 1053
+ V ++ + + + + + + LN KR E + L DG L
Sbjct: 846 NSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQ-LKDGLLINS--KDQ 902
Query: 1054 MLKCVDTKDALGIMAEVHEGVCSSHIGGRSLAVKVIRAGFYWPTMKKDCLEYVKKCEKCQ 1113
+L DT+ I+ + HE H G L +I F W ++K EYV+ C CQ
Sbjct: 903 ILLPNDTQLTRTIIKKYHEEGKLIH-PGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQ 961
Query: 1114 V--------FSDLHKAPPEELTTMMAPWPFAMWGTDILGPFPVAKAQMKYIIVAVDYFTK 1165
+ + L PP E P+ D + P + + V VD F+K
Sbjct: 962 INKSRNHKPYGPLQPIPPSER-------PWESLSMDFITALPESSGY-NALFVVVDRFSK 1013
Query: 1166 W-IEAEAVATITAAKVRNFLWQRIVCRFGVPMALVMDNGTQFTSSVTREFCAEMGIEMRF 1224
I +ITA + QR++ FG P ++ DN FTS ++F + M+F
Sbjct: 1014 MAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKF 1073
Query: 1225 ASVEHPQTNGQAESANKVILKGLKKKLDEAKGLWAEELPGVLWAYNTTEQSSTKETPYRL 1284
+ PQT+GQ E N+ + K L+ W + + V +YN S+T+ TP+ +
Sbjct: 1074 SLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEI 1133
Query: 1285 TYGTDAMLS-VEIENQSWRVARFNENDNGENLIANLIMLPEEQREAHIRNEAGKVKVARK 1343
+ LS +E+ + S + ++N + I + E H+ +K+ +
Sbjct: 1134 VHRYSPALSPLELPSFSDKT-----DENSQETIQVFQTVKE-----HL--NTNNIKMKKY 1181
Query: 1344 FSTKVVP-RKMRVGDLVL*KNT---IPDKHNKLSPNWGGPYRIIGDVGGEAYKLEQLSGQ 1399
F K+ + + GDLV+ K T K NKL+P++ GP+ ++ G Y+L+
Sbjct: 1182 FDMKIQEIEEFQPGDLVMVKRTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSI 1241
Query: 1400 K--VPRTWNASHLKQY 1413
K T++ SHL++Y
Sbjct: 1242 KHMFSSTFHVSHLEKY 1257
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 158 bits (399), Expect = 1e-37
Identities = 108/397 (27%), Positives = 192/397 (48%), Gaps = 9/397 (2%)
Query: 429 KHKAVQLETEKLIKARFIREVQYPTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPL 488
K +A+ E + +K+ IRE + V+ V K G RM DY LNK + YPL
Sbjct: 424 KMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPL 482
Query: 489 PNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTAFMTNQANYCYKTMPFGLKNAG 548
P +++L+ G+ + + +D S Y+ I + DE AF + + Y MP+G+ A
Sbjct: 483 PLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAP 542
Query: 549 ATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCS 608
A +Q ++ I + ++ Y+D++++ S S+H +K+ +L+ + +N KC
Sbjct: 543 AHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCE 602
Query: 609 FGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRLTGRMAALSRFLPMAGDK 668
F KF+G+ ++ +G + +L+ K P + KE+++ G + L +F+P
Sbjct: 603 FHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQL 662
Query: 669 AAPFFTCLKKNSKFQWTEECEQAFTKLKETLATLPVLSKPTPGVPLVLYLAVTDKAVSTV 728
P LKK+ +++WT QA +K+ L + PVL ++L +D AV V
Sbjct: 663 THPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAV 722
Query: 729 LLQE-EGKKQKVIYFVSHTLQGAELRYQKIEKAALAILKTARRLRPYFQSF--QVKVKTD 785
L Q+ + K + + S + A+L Y +K LAI+K+ + R Y +S K+ TD
Sbjct: 723 LSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTD 782
Query: 786 ---VPLRQVLQKPDLSGRLVSWSVELSE--YDIQYEP 817
+ R + + RL W + L + ++I Y P
Sbjct: 783 HRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRP 819
Score = 124 bits (311), Expect = 2e-27
Identities = 132/556 (23%), Positives = 236/556 (41%), Gaps = 61/556 (10%)
Query: 877 KFEFKASNNQSEYEALIAGL---RLAIELGVQKLFIKGDSQLVVKQVKGEYQVKDPQLSK 933
K + S + E A+I L R +E ++ I D + ++ ++ E + ++ +L++
Sbjct: 744 KAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLAR 803
Query: 934 YLEVVRRLMMEVKEIKIEHVPRGQNERADVLAKLASTGRLGNYQTVIQETLPRPSIDLVE 993
+ +L ++ +I + P N AD L++ ++ ET P P D +
Sbjct: 804 W-----QLFLQDFNFEINYRPGSANHIADALSR------------IVDETEPIPK-DSED 845
Query: 994 IKLKVVKSVNEGELPWMESIKTFLENPPKEDDLNTRTKRREASFYTLVDGELYRRGIMSP 1053
+ V ++ + + + + + + LN KR E + L DG L
Sbjct: 846 NSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQ-LKDGLLINS--KDQ 902
Query: 1054 MLKCVDTKDALGIMAEVHEGVCSSHIGGRSLAVKVIRAGFYWPTMKKDCLEYVKKCEKCQ 1113
+L DT+ I+ + HE H G L +I F W ++K EYV+ C CQ
Sbjct: 903 ILLPNDTQLTRTIIKKYHEEGKLIH-PGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQ 961
Query: 1114 V--------FSDLHKAPPEELTTMMAPWPFAMWGTDILGPFPVAKAQMKYIIVAVDYFTK 1165
+ + L PP E P+ D + P + + V VD F+K
Sbjct: 962 INKSRNHKPYGPLQPIPPSER-------PWESLSMDFITALPESSGY-NALFVVVDRFSK 1013
Query: 1166 W-IEAEAVATITAAKVRNFLWQRIVCRFGVPMALVMDNGTQFTSSVTREFCAEMGIEMRF 1224
I +ITA + QR++ FG P ++ DN FTS ++F + M+F
Sbjct: 1014 MAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKF 1073
Query: 1225 ASVEHPQTNGQAESANKVILKGLKKKLDEAKGLWAEELPGVLWAYNTTEQSSTKETPYRL 1284
+ PQT+GQ E N+ + K L+ W + + V +YN S+T+ TP+ +
Sbjct: 1074 SLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEI 1133
Query: 1285 TYGTDAMLS-VEIENQSWRVARFNENDNGENLIANLIMLPEEQREAHIRNEAGKVKVARK 1343
+ LS +E+ + S + ++N + I + E H+ +K+ +
Sbjct: 1134 VHRYSPALSPLELPSFSDKT-----DENSQETIQVFQTVKE-----HL--NTNNIKMKKY 1181
Query: 1344 FSTKVVP-RKMRVGDLVL*KNT---IPDKHNKLSPNWGGPYRIIGDVGGEAYKLEQLSGQ 1399
F K+ + + GDLV+ K T K NKL+P++ GP+ ++ G Y+L+
Sbjct: 1182 FDMKIQEIEEFQPGDLVMVKRTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSI 1241
Query: 1400 K--VPRTWNASHLKQY 1413
K T++ SHL++Y
Sbjct: 1242 KHMFSSTFHVSHLEKY 1257
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 144 bits (363), Expect = 2e-33
Identities = 110/406 (27%), Positives = 190/406 (46%), Gaps = 25/406 (6%)
Query: 433 VQLETEKLIKARFIREVQYPTWLANVVMVKKA-----NGKWRMCTDYTSLNKVCPKDSYP 487
V E ++L+K IR + P V+ KK N R+ D+ LN+ D YP
Sbjct: 197 VNNEVKQLLKDGIIRPSRSPYNSPTWVVDKKGTDAFGNPNKRLVIDFRKLNEKTIPDRYP 256
Query: 488 LPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTAFMTNQANYCYKTMPFGLKNA 547
+P++ ++ + + +D SGY+QI + D E T+F N Y + +PFGL+NA
Sbjct: 257 MPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFSVNGGKYEFCRLPFGLRNA 316
Query: 548 GATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKC 607
+ +QR +D + +Q+G+ VYVDD+I+ S SDH + L M+++ EK
Sbjct: 317 SSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHIDTVLKCLIDANMRVSQEKT 376
Query: 608 SFGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRLTGRMAALSRFLPMAGD 667
F + ++LGF+++ G + +P+K +AI E P V +V+ G + F+
Sbjct: 377 RFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRSFLGLASYYRVFIKDFAA 436
Query: 668 KAAPFFTCLK-----------KNSKFQWTEECEQAFTKLKETLATLPVLSK-PTPGVPLV 715
A P LK K ++ E AF +L+ LA+ V+ K P P
Sbjct: 437 IARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNILASEDVILKYPDFKKPFD 496
Query: 716 LYLAVTDKAVSTVLLQEEGKKQKVIYFVSHTLQGAELRYQKIEKAALAILKTARRLRPY- 774
L + + VL QE + I +S TL+ E Y E+ LAI+ +L+ +
Sbjct: 497 LTTDASASGIGAVLSQE----GRPITMISRTLKQPEQNYATNERELLAIVWALGKLQNFL 552
Query: 775 FQSFQVKVKTD-VPLRQVLQKPDLSGRLVSWSVELSEYD--IQYEP 817
+ S ++ + TD PL + + + ++ W + +++ + Y+P
Sbjct: 553 YGSREINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQHNAKVFYKP 598
Score = 48.1 bits (113), Expect = 2e-04
Identities = 70/294 (23%), Positives = 112/294 (37%), Gaps = 26/294 (8%)
Query: 1086 VKVIRAGFYWPTMKKDCLEYVKKCEKC-QVFSDLHKAPPEELTTMMAPWPFAMWGTDILG 1144
+K + +Y+P M E V C C Q D H E T + + M DI
Sbjct: 757 IKQVLRDYYFPKMGSLAKEVVANCRVCTQAKYDRHPKKQELGETPIPSYTGEMVHIDIF- 815
Query: 1145 PFPVAKAQMKYIIVAVDYFTKWIEAEAVATITAAKVRNFLWQRIVCRFGVPMALVMDNGT 1204
K + +D F+K+ + V + T + L Q I+ F + DN
Sbjct: 816 -----STDRKLFLTCIDKFSKYAIVQPVVSRTIVDITAPLLQ-IINLFPNIKTVYCDNEP 869
Query: 1205 QFTS-SVTREFCAEMGIEMRFASVEHPQTNGQAESANKVILKGLK-KKLDEAKGLWAEEL 1262
F S +VT GI++ A H +NGQ E + + + + KLD+ E +
Sbjct: 870 AFNSETVTSMLKNSFGIDIVNAPPLHSSSNGQVERFHSTLAEIARCLKLDKKTNDTVELI 929
Query: 1263 PGVLWAYNTTEQSSTKETPYRLTYGTDAMLSVEIENQSWRVARFNENDNGENLIANLIML 1322
YN T S T+E P + + +EI+ R+ + ++ G N
Sbjct: 930 LRATIEYNKTVHSVTRERPIEVVHPGAHERCLEIKA---RLVKAQQDSIGRN-------N 979
Query: 1323 PEEQREAHIRNEAGKVKVARKFSTKVVP----RKMR--VGDLVL*KNTIPDKHN 1370
P Q E VK ++ K+ P +K++ +G VL K + K N
Sbjct: 980 PSRQNRVFEVGERVFVKNNKRLGNKLTPLCTEQKVQADLGTSVLIKGRVVHKDN 1033
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 130 bits (327), Expect = 3e-29
Identities = 100/378 (26%), Positives = 169/378 (44%), Gaps = 18/378 (4%)
Query: 452 PTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYS 511
P +L N +K GK RM +Y ++NK D+Y LPN D+L+ G ++ S D S
Sbjct: 285 PAFLVNNE-AEKRRGKKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKS 343
Query: 512 GYNQIMMHPSDEESTAFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYV 571
G+ Q+++ TAF Q +Y + +PFGLK A + +QR MD+ F + + VYV
Sbjct: 344 GFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF-RVFRKFCCVYV 402
Query: 572 DDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPD 631
DD++V S DH + + + + L+ +K + FLG + +
Sbjct: 403 DDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDE---GTHKP 459
Query: 632 KGRAILEM-KSPTSV---KEVQRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEE 687
+G + + K P ++ K++QR G + S ++P P LK+N ++WT+E
Sbjct: 460 QGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWRWTKE 519
Query: 688 CEQAFTKLKETLATLPVLSKPTPGVPLVLYLAVTDK----AVSTVLLQEEGKKQKVIYFV 743
K+K+ L P L P P L++ +D + + + E + + +
Sbjct: 520 DTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYA 579
Query: 744 SHTLQGAELRYQKIEKAALAILKTARRLRPYFQSFQVKVKTD----VPLRQVLQKPDLS- 798
S + + AE Y +K LA++ T ++ Y ++TD + K D
Sbjct: 580 SGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKL 639
Query: 799 GRLVSWSVELSEYDIQYE 816
GR + W LS Y E
Sbjct: 640 GRNIRWQAWLSHYSFDVE 657
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 130 bits (326), Expect = 3e-29
Identities = 100/378 (26%), Positives = 169/378 (44%), Gaps = 18/378 (4%)
Query: 452 PTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYS 511
P +L N +K GK RM +Y ++NK D+Y LPN D+L+ G ++ S D S
Sbjct: 285 PAFLVNNE-AEKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKS 343
Query: 512 GYNQIMMHPSDEESTAFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYV 571
G+ Q+++ TAF Q +Y + +PFGLK A + +QR MD+ F + + VYV
Sbjct: 344 GFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF-RVFRKFCCVYV 402
Query: 572 DDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPD 631
DD++V S DH + + + + L+ +K + FLG + +
Sbjct: 403 DDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDE---GTHKP 459
Query: 632 KGRAILEM-KSPTSV---KEVQRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEE 687
+G + + K P ++ K++QR G + S ++P P LK+N ++WT+E
Sbjct: 460 QGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKE 519
Query: 688 CEQAFTKLKETLATLPVLSKPTPGVPLVLYLAVTDK----AVSTVLLQEEGKKQKVIYFV 743
K+K+ L P L P P L++ +D + + + E + + +
Sbjct: 520 DTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYA 579
Query: 744 SHTLQGAELRYQKIEKAALAILKTARRLRPYFQSFQVKVKTD----VPLRQVLQKPDLS- 798
S + + AE Y +K LA++ T ++ Y ++TD + K D
Sbjct: 580 SGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKL 639
Query: 799 GRLVSWSVELSEYDIQYE 816
GR + W LS Y E
Sbjct: 640 GRNIRWQAWLSHYSFDVE 657
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 130 bits (326), Expect = 3e-29
Identities = 100/378 (26%), Positives = 169/378 (44%), Gaps = 18/378 (4%)
Query: 452 PTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYS 511
P +L N +K GK RM +Y ++NK D+Y LPN D+L+ G ++ S D S
Sbjct: 285 PAFLVNNE-AEKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKS 343
Query: 512 GYNQIMMHPSDEESTAFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYV 571
G+ Q+++ TAF Q +Y + +PFGLK A + +QR MD+ F + + VYV
Sbjct: 344 GFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF-RVFRKFCCVYV 402
Query: 572 DDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPD 631
DD++V S DH + + + + L+ +K + FLG + +
Sbjct: 403 DDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDE---GTHKP 459
Query: 632 KGRAILEM-KSPTSV---KEVQRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEE 687
+G + + K P ++ K++QR G + S ++P P LK+N ++WT+E
Sbjct: 460 QGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKE 519
Query: 688 CEQAFTKLKETLATLPVLSKPTPGVPLVLYLAVTDK----AVSTVLLQEEGKKQKVIYFV 743
K+K+ L P L P P L++ +D + + + E + + +
Sbjct: 520 DTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYA 579
Query: 744 SHTLQGAELRYQKIEKAALAILKTARRLRPYFQSFQVKVKTD----VPLRQVLQKPDLS- 798
S + + AE Y +K LA++ T ++ Y ++TD + K D
Sbjct: 580 SGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKL 639
Query: 799 GRLVSWSVELSEYDIQYE 816
GR + W LS Y E
Sbjct: 640 GRNIRWQAWLSHYSFDVE 657
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 674
Score = 127 bits (320), Expect = 2e-28
Identities = 99/378 (26%), Positives = 168/378 (44%), Gaps = 18/378 (4%)
Query: 452 PTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYS 511
P +L N +K GK RM +Y ++NK D+Y PN D+L+ G ++ S D S
Sbjct: 280 PAFLVNNE-AEKRRGKKRMVVNYKAMNKATVGDAYNPPNKDELLTLIRGKKIFSSFDCKS 338
Query: 512 GYNQIMMHPSDEESTAFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYV 571
G+ Q+++ TAF Q +Y + +PFGLK A + +QR MD+ F + + VYV
Sbjct: 339 GFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF-RVFRKFCCVYV 397
Query: 572 DDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPD 631
DD++V S DH + + + + L+ +K + FLG + +
Sbjct: 398 DDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDE---GTHKP 454
Query: 632 KGRAILEM-KSPTSV---KEVQRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEE 687
+G + + K P ++ K++QR G + S ++P P LK+N ++WT+E
Sbjct: 455 QGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKE 514
Query: 688 CEQAFTKLKETLATLPVLSKPTPGVPLVLYLAVTDK----AVSTVLLQEEGKKQKVIYFV 743
K+K+ L P L P P L++ +D + + + E + + +
Sbjct: 515 DTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYA 574
Query: 744 SHTLQGAELRYQKIEKAALAILKTARRLRPYFQSFQVKVKTD----VPLRQVLQKPDLS- 798
S + + AE Y +K LA++ T ++ Y ++TD + K D
Sbjct: 575 SGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKL 634
Query: 799 GRLVSWSVELSEYDIQYE 816
GR + W LS Y E
Sbjct: 635 GRNIRWQAWLSHYSFDVE 652
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 659
Score = 127 bits (319), Expect = 2e-28
Identities = 111/460 (24%), Positives = 201/460 (43%), Gaps = 33/460 (7%)
Query: 392 WTINDVPGIDPKVITHKLAIRPGATPVIQPMRRMSEEKHKAVQLETEKLIKARFIREVQY 451
W + IDPK + ++PM ++ + + ++L++ + I+ +
Sbjct: 215 WMTATIELIDPKTVVK-----------VKPMSYSPSDREE-FDRQIKELLELKVIKPSK- 261
Query: 452 PTWLANVVMVK----KANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLM 507
T ++ +V+ + GK RM +Y ++NK D++ LPN D+L+ G ++ S
Sbjct: 262 STHMSPAFLVENEAERRRGKKRMVVNYKAMNKATKGDAHNLPNKDELLTLVRGKKIYSSF 321
Query: 508 DAYSGYNQIMMHPSDEESTAFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNM 567
D SG Q+++ + TAF Q +Y + +PFGLK A + + + S Q +
Sbjct: 322 DCKSGLWQVLLDKESQLLTAFTCPQGHYQWNVVPFGLKQAPSIFPKTYANSHSNQYSKYC 381
Query: 568 EVYVDDMIV-KSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGI 626
VYVDD++V + +H + + + L+ +K + FLG + +G
Sbjct: 382 CVYVDDILVFSNTGRKEHYIHVLNILRRCEKLGIILSKKKAQLFKEKINFLGLEI-DQGT 440
Query: 627 EVNPDKGRAILE--MKSPTSV---KEVQRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSK 681
+ ILE K P + K++QR G + S ++P P + LK++S
Sbjct: 441 HCPQNH---ILEHIHKFPDRIEDKKQLQRFLGILTYASDYIPKLASIRKPLQSKLKEDST 497
Query: 682 FQWTEECEQAFTKLKETLATLPVLSKPTPGVPLVLYLAVTDKAVSTVLLQEEGKKQKVIY 741
+ W + Q K+K+ L + P L P P LV+ +++ +L + +
Sbjct: 498 WTWNDTDSQYMAKIKKNLKSFPKLYHPEPNDKLVIETDASEEFWGGILKAIHNSHEYICR 557
Query: 742 FVSHTLQGAELRYQKIEKAALAILKTARRLRPYFQSFQVKVKTDVP-----LRQVLQKPD 796
+ S + + AE Y EK LA+++ ++ Y + ++TD + L+
Sbjct: 558 YASGSFKAAERNYHSNEKELLAVIRVIKKFSIYLTPSRFLIRTDNKNFTHFVNINLKGDR 617
Query: 797 LSGRLVSWSVELSEYDIQYEPRGQVTVQSLIDFVAELTPT 836
GRLV W + LS+YD E T DF+ E T T
Sbjct: 618 KQGRLVRWQMWLSQYDFDVEHIAG-TKNVFADFLQENTLT 656
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 680
Score = 126 bits (317), Expect = 4e-28
Identities = 98/378 (25%), Positives = 167/378 (43%), Gaps = 18/378 (4%)
Query: 452 PTWLANVVMVKKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYS 511
P +L N + G RM +Y ++NK D+Y LPN D+L+ G ++ S D S
Sbjct: 286 PAFLVNNE-AENGRGNKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKS 344
Query: 512 GYNQIMMHPSDEESTAFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYV 571
G+ Q+++ TAF Q +Y + +PFGLK A + +QR MD+ F + + VYV
Sbjct: 345 GFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF-RVFRKFCCVYV 403
Query: 572 DDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPD 631
DD++V S DH + + + + L+ +K + FLG + +
Sbjct: 404 DDIVVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDE---GTHKP 460
Query: 632 KGRAILEM-KSPTSV---KEVQRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEE 687
+G + + K P ++ K++QR G + S ++P P LK+N ++WT+E
Sbjct: 461 QGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPNLAQMRQPLQAKLKENVPWKWTKE 520
Query: 688 CEQAFTKLKETLATLPVLSKPTPGVPLVLYLAVTDK----AVSTVLLQEEGKKQKVIYFV 743
K+K+ L P L P P L++ +D + + + E + + +
Sbjct: 521 DTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYR 580
Query: 744 SHTLQGAELRYQKIEKAALAILKTARRLRPYFQSFQVKVKTD----VPLRQVLQKPDLS- 798
S + + AE Y +K LA++ T ++ Y ++TD + K D
Sbjct: 581 SGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKL 640
Query: 799 GRLVSWSVELSEYDIQYE 816
GR + W LS Y E
Sbjct: 641 GRNIRWQAWLSHYSFDVE 658
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 666
Score = 121 bits (304), Expect = 1e-26
Identities = 98/371 (26%), Positives = 161/371 (42%), Gaps = 26/371 (7%)
Query: 462 KKANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPS 521
++ GK RM +Y ++N+ DS+ LPN+ +L+ G + S D SG+ Q+++
Sbjct: 287 ERRRGKKRMVVNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKSGFWQVVLDEE 346
Query: 522 DEESTAFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARA 581
++ TAF Q ++ +K +PFGLK A + +QR M + + VYVDD+IV S
Sbjct: 347 SQKLTAFTCPQGHFQWKVVPFGLKQAPSIFQRHMQTALN-GADKFCMVYVDDIIVFSNSE 405
Query: 582 SDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTS----------RGIEVNPD 631
DH + + Y + L+ +K + + FLG + I PD
Sbjct: 406 LDHYNHVYAVLKIVEKYGIILSKKKANLFKEKINFLGLEIDKGTHCPQNHILENIHKFPD 465
Query: 632 KGRAILEMKSPTSVKEVQRLTGRMAALSRFLPMAGDKAAPFFTCLKKNSKFQWTEECEQA 691
+ LE K K +QR G + ++P + P LKK+ + WT+
Sbjct: 466 R----LEDK-----KHLQRFLGVLTYAETYIPKLAEIRKPLQVKLKKDVTWNWTQSDSDY 516
Query: 692 FTKLKETLATLPVLSKPTPGVPLVLYLAVTDKAVSTVL-LQEEGKKQKVIYFVSHTLQGA 750
K+K+ L + P L P P L++ +D VL + + + + S + + A
Sbjct: 517 VKKIKKNLGSFPKLYLPKPEDHLIIETDASDSFWGGVLKARALDGVELICRYSSGSFKQA 576
Query: 751 ELRYQKIEKAALAILKTARRLRPYFQSFQVKVKTDVP-----LRQVLQKPDLSGRLVSWS 805
E Y +K LA+ + + Y + V+TD LR L+ GRLV W
Sbjct: 577 EKNYHSNDKELLAVKQVITKFSAYLTPVRFTVRTDNKNFTYFLRINLKGDSKQGRLVRWQ 636
Query: 806 VELSEYDIQYE 816
S+Y E
Sbjct: 637 NWFSKYQFDVE 647
>YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II
Length = 1268
Score = 118 bits (296), Expect = 1e-25
Identities = 76/251 (30%), Positives = 128/251 (50%), Gaps = 7/251 (2%)
Query: 415 ATPVIQPMRRMSEEKHKAVQLETEKLIKARFIREVQYPTWLANVVMVKK-ANGKWRMCTD 473
A PV + R + +AV+ E +L + I + Y W A +V++KK GK R+C D
Sbjct: 438 AVPVFKRARPVPYGSLEAVETELNRLQEMGVIVPITYAKWAAPIVVIKKKGTGKIRVCAD 497
Query: 474 Y--TSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDEESTAFMTN 531
+ + LN + +PLP + + G + S +D Y Q+ + ++ T+
Sbjct: 498 FKCSGLNAALKDEFHPLPTSEDIFSRLKGT-VYSQIDLKDAYLQVELDEEAQKLAVINTH 556
Query: 532 QANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYVDDMIVKSARASDHGGDLKEA 591
+ + Y M FGLK A A++Q++MDK+ S G + VY DD+I+ ++ +H L+E
Sbjct: 557 RGIFKYLRMTFGLKPAPASFQKIMDKMVSGLTG--VAVYWDDIIISASSIEEHEKILREL 614
Query: 592 FDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRL 651
F++ + Y +++ EKC+F + FLGF + G + K AI MK+PT K++
Sbjct: 615 FERFKEYGFRVSAEKCAFAQKQVTFLGF-VDEHGRRPDSKKTEAIRSMKAPTDQKQLASF 673
Query: 652 TGRMAALSRFL 662
G LSR +
Sbjct: 674 LGAADWLSRMM 684
Score = 77.0 bits (188), Expect = 3e-13
Identities = 64/228 (28%), Positives = 102/228 (44%), Gaps = 24/228 (10%)
Query: 1066 IMAEVHEGVCSSHIGGRSLAVKVIRAGFYWPTMKKDCLEYVKKCEKCQVFSDLHKAPPEE 1125
++ ++HEG H G + K R+ +W + D V+ C CQ S + + P
Sbjct: 786 VLKQLHEG----HPGIVQMKQKA-RSFVFWRGLDSDIENMVRHCNNCQENSKMPRVVP-- 838
Query: 1126 LTTMMAPWPF--AMWGT---DILGPFPVAKAQMKYIIVAVDYFTKWIEAEAVATITAAKV 1180
+ PWP A W D GP Y++V VD TK+ E + +I+A
Sbjct: 839 ----LNPWPVPEAPWKRIHIDFAGPLNGC-----YLLVVVDAKTKYAEVKLTRSISAVTT 889
Query: 1181 RNFLWQRIVCRFGVPMALVMDNGTQFTSSVTREFCAEMGIEMRFASVEHPQTNGQAESAN 1240
+ L + I G P ++ DNGTQ TS + + C GIE + ++V +P++NG AE
Sbjct: 890 IDLL-EEIFSIHGYPETIISDNGTQLTSHLFAQMCQSHGIEHKTSAVYYPRSNGAAERFV 948
Query: 1241 KVILKGLKKKLDEAKGLWAEELPGVLWAYNTTEQSSTK-ETPYRLTYG 1287
+ +G+ K E + + L L +Y T S+ TP +G
Sbjct: 949 DTLKRGIAKIKGEG-SVNQQILNKFLISYRNTPHSALNGSTPAECHFG 995
>POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1161
Score = 104 bits (259), Expect = 2e-21
Identities = 61/213 (28%), Positives = 106/213 (49%), Gaps = 9/213 (4%)
Query: 1081 GRSLAVKVIRAGFYWPTMKKDCLEYVKKCEKCQVFSDLHKAPPEELTTMMAPWPFAMWGT 1140
GR + + ++WP ++KD ++ +++C++C V + + P L + PF +
Sbjct: 830 GRDATFLKVSSKYWWPNLRKDVVKSIRQCKQCLVTNATNLTSPPILRPVKPLKPFDKFYI 889
Query: 1141 DILGPFPVAKAQMKYIIVAVDYFTKWI---EAEAVATITAAKVRNFLWQRIVCRFGVPMA 1197
D +GP P + + +++V VD T ++ +A +T K N L +P
Sbjct: 890 DYIGPLPPSNGYL-HVLVVVDSMTGFVWLYPTKAPSTSATVKALNMLTS-----IAIPKV 943
Query: 1198 LVMDNGTQFTSSVTREFCAEMGIEMRFASVEHPQTNGQAESANKVILKGLKKKLDEAKGL 1257
L D G FTSS ++ E GI++ F++ HPQ++G+ E N I + L K L
Sbjct: 944 LHSDQGAAFTSSTFADWAKEKGIQLEFSTPYHPQSSGKVERKNSDIKRLLTKLLIGRPAK 1003
Query: 1258 WAEELPGVLWAYNTTEQSSTKETPYRLTYGTDA 1290
W + LP V A N + S+K TP++L +G D+
Sbjct: 1004 WYDLLPVVQLALNNSYSPSSKYTPHQLLFGVDS 1036
Score = 100 bits (250), Expect = 2e-20
Identities = 107/492 (21%), Positives = 198/492 (39%), Gaps = 35/492 (7%)
Query: 404 VITHKLAIRPGATPVIQPMRRMSEEKHKAVQLETEKLIKARFIREVQYPTWLANVVMVKK 463
+ T LA RP I P + S +Q+ + L+K + + Q T V V K
Sbjct: 167 IATGTLAPRPQKQYPINPKAKPS------IQIVIDDLLKQGVLIQ-QNSTMNTPVYPVPK 219
Query: 464 ANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNELLSLMDAYSGYNQIMMHPSDE 523
+GKWRM DY +NK P + + ++ + + +D +G+ HP
Sbjct: 220 PDGKWRMVLDYREVNKTIPLIAAQNQHSAGILSSIYRGKYKTTLDLTNGF---WAHPITP 276
Query: 524 ES---TAFMTNQANYCYKTMPFGLKNAGATYQRLMDKIFSKQVGRNMEVYVDDMIVKSAR 580
ES TAF YC+ +P G N+ A + D + + N++ YVDD+ +
Sbjct: 277 ESYWLTAFTWQGKQYCWTRLPQGFLNSPALFTA--DVVDLLKEIPNVQAYVDDIYISHDD 334
Query: 581 ASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQGGKFLGFMLTSRGIEVNPDKGRAILEMK 640
+H L++ F L ++ +K + +FLGF +T G + + +L +
Sbjct: 335 PQEHLEQLEKIFSILLNAGYVVSLKKSEIAQREVEFLGFNITKEGRGLTDTFKQKLLNIT 394
Query: 641 SPTSVKEVQRLTGRMAALSRFLPMAGDKAAPFFTCL-KKNSKF-QWTEECEQAFTKLKET 698
P +K++Q + G + F+P + P +T + N KF WTE+ +
Sbjct: 395 PPKDLKQLQSILGLLNFARNFIPNYSELVKPLYTIVANANGKFISWTEDNSNQLQHIISV 454
Query: 699 LATLPVLSKPTPGVPLVLYLAVTDKAVSTVLLQEEGKKQKVIYFVSHTLQGAELRYQKIE 758
L L + P L++ + + A + EG K+ ++Y V++ AE ++ + E
Sbjct: 455 LNQADNLEERNPETRLIIKVNSSPSA-GYIRYYNEGSKRPIMY-VNYIFSKAEAKFTQTE 512
Query: 759 KAALAILKTARRLRPYFQSFQVKVKTDVPLRQVLQKPDLSG------RLVSWSVELSEYD 812
K + K + ++ V + + +Q+ L R ++W L +
Sbjct: 513 KLLTTMHKGLIKAMDLAMGQEILVYSPIVSMTKIQRTPLPERKALPVRWITWMTYLEDPR 572
Query: 813 IQYE-PRGQVTVQSLIDFVAELTPTEGEKTQGEWVLSVDGS---------SNNTGSGAGI 862
IQ+ + +Q + + ++ ++ V DGS S++ G G
Sbjct: 573 IQFHYDKSLPELQQIPNVTEDVIAKTKHPSEFAMVFYTDGSAIKHPDVNKSHSAGMGIAQ 632
Query: 863 TIESPDKMIIEQ 874
P+ I+ Q
Sbjct: 633 VQFIPEYKIVHQ 644
>POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC
3.4.23.-); Reverse transcriptase/ribonuclease H (EC
2.7.7.49) (EC 3.1.26.4) (RT); Integrase (IN)]
Length = 1165
Score = 102 bits (253), Expect = 1e-20
Identities = 97/467 (20%), Positives = 188/467 (39%), Gaps = 19/467 (4%)
Query: 387 LDLF--AWTINDVPGIDPKVITHKLAIRPGATPVIQPMRRMSEEKHKAVQLETEKLIKAR 444
L LF W G+ +V + +R GA+PV MS+E + ++ +K +
Sbjct: 143 LQLFPTVWAERAGMGLANQVPPVVVELRSGASPVAVRQYPMSKEAREGIRPHIQKFLDLG 202
Query: 445 FIREVQYPTWLANVVMVKK-ANGKWRMCTDYTSLNKVCPKDSYPLPNVDKLVDGASGNEL 503
+ + P W ++ VKK +R D +NK +PN L+ +
Sbjct: 203 VLVPCRSP-WNTPLLPVKKPGTNDYRPVQDLREINKRVQDIHPTVPNPYNLLSSLPPSYT 261
Query: 504 -LSLMDAYSGYNQIMMHPSDEESTAF------MTNQANYCYKTMPFGLKNAGATYQRLMD 556
S++D + + +HP+ + AF N + +P G KN+ + +
Sbjct: 262 WYSVLDLKDAFFCLRLHPNSQPLFAFEWKDPEKGNTGQLTWTRLPQGFKNSPTLFDEALH 321
Query: 557 KIFSKQVGRNMEV----YVDDMIVKSARASDHGGDLKEAFDQLRTYQMKLNPEKCSFGIQ 612
+ + N +V YVDD++V + D ++ +L +++ +K +
Sbjct: 322 RDLAPFRALNPQVVLLQYVDDLLVAAPTYEDCKKGTQKLLQELSKLGYRVSAKKAQLCQR 381
Query: 613 GGKFLGFMLTSRGIEVNPDKGRAILEMKSPTSVKEVQRLTGRMAALSRFLPMAGDKAAPF 672
+LG++L + P + ++++ PT+ ++V+ G ++P AAP
Sbjct: 382 EVTYLGYLLKEGKRWLTPARKATVMKIPVPTTPRQVREFLGTAGFCRLWIPGFASLAAPL 441
Query: 673 FTCLKKNSKFQWTEECEQAFTKLKETLATLPVLSKPTPGVPLVLYLAVTDKAVSTVLLQE 732
+ K++ F WTEE +QAF +K+ L + P L+ P P LY+ VL Q
Sbjct: 442 YPLTKESIPFIWTEEHQQAFDHIKKALLSAPALALPDLTKPFTLYIDERAGVARGVLTQT 501
Query: 733 EGKKQKVIYFVSHTLQGAELRYQKIEKAALAILKTARRLRPYFQSFQVKVKTDVPLRQVL 792
G ++ + ++S L + KA A+ + V V L ++
Sbjct: 502 LGPWRRPVAYLSKKLDPVASGWPTCLKAVAAVALLLKDADKLTLGQNVTVIASHSLESIV 561
Query: 793 QKPD----LSGRLVSWSVELSEYDIQYEPRGQVTVQSLIDFVAELTP 835
++P + R+ + L + + P + +L+ +E TP
Sbjct: 562 RQPPDRWMTNARMTHYQSLLLNERVSFAPPAVLNPATLLPVESEATP 608
Score = 94.7 bits (234), Expect = 2e-18
Identities = 84/314 (26%), Positives = 131/314 (40%), Gaps = 30/314 (9%)
Query: 1077 SHIGGRSLAVKVIRAGFYWPTMKKDCLEYVKKCEKCQVFSDLHKAPPEELTTMMAPWPFA 1136
+H+G L V R P ++ E +C+ C + + + E P
Sbjct: 820 THLGPEKLLQLVNRTSLLIPNLQSAVREVTSQCQACAMTNAV-TTYRETGKRQRGDRPGV 878
Query: 1137 MWGTDILGPFPVAKAQMKYIIVAVDYFTKWIEAEAVATITAAKVRNFLWQRIVCRFGVPM 1196
W D P + KY++V +D F+ W+EA T TA V + + I+ RFG+P
Sbjct: 879 YWEVDFTEIKP-GRYGNKYLLVFIDTFSGWVEAFPTKTETALIVCKKILEEILPRFGIPK 937
Query: 1197 ALVMDNGTQFTSSVTREFCAEMGIEMRFASVEHPQTNGQAESANKVILKGLKKKLDEAKG 1256
L DNG F + V++ ++GI + PQ++GQ E N+ I + L K E G
Sbjct: 938 VLGSDNGPAFVAQVSQGLATQLGINWKLHCAYRPQSSGQVERMNRTIKETLTKLALETGG 997
Query: 1257 L-WAEELP-GVLWAYNTTEQSSTKETPYRLTYGTDAMLSVEIENQSWRVARFNENDNGEN 1314
W LP +L A NT + TPY + YG + ++GE
Sbjct: 998 KDWVTLLPLALLRARNTPGRFGL--TPYEILYGGPPPIL----------------ESGET 1039
Query: 1315 LIANLIMLP----EEQREAHIRNEA-GKVKVARKFSTKVVPRKMRVGDLVL*KNTIPDKH 1369
L + LP + +R + ++K K T +P +VGD VL + P
Sbjct: 1040 LGPDDRFLPVLFTHLKALEIVRTQIWDQIKEVYKPGTVTIPHPFQVGDQVLVRRHRP--- 1096
Query: 1370 NKLSPNWGGPYRII 1383
+ L P W GPY ++
Sbjct: 1097 SSLEPRWKGPYLVL 1110
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.318 0.135 0.399
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 166,959,019
Number of Sequences: 164201
Number of extensions: 7280459
Number of successful extensions: 18322
Number of sequences better than 10.0: 156
Number of HSP's better than 10.0 without gapping: 95
Number of HSP's successfully gapped in prelim test: 61
Number of HSP's that attempted gapping in prelim test: 17917
Number of HSP's gapped (non-prelim): 317
length of query: 1414
length of database: 59,974,054
effective HSP length: 123
effective length of query: 1291
effective length of database: 39,777,331
effective search space: 51352534321
effective search space used: 51352534321
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 72 (32.3 bits)
Lotus: description of TM0114.12