
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0029b.1
(1377 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 206 5e-52
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 204 2e-51
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 189 3e-47
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 189 4e-47
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 187 1e-46
POL_SFV3L (P27401) Pol polyprotein [Contains: Protease (EC 3.4.2... 180 2e-44
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 176 3e-43
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 173 3e-42
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 173 3e-42
POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse transcript... 170 3e-41
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 151 1e-35
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 132 8e-30
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 131 1e-29
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 130 3e-29
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 128 9e-29
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 126 4e-28
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot... 124 2e-27
POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23... 122 7e-27
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot... 114 2e-24
YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II 113 3e-24
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 206 bits (523), Expect = 5e-52
Identities = 131/385 (34%), Positives = 199/385 (51%), Gaps = 10/385 (2%)
Query: 403 VQQEVDKLLAAEFIRE----VKYPTWLANVVMVKKANGKWRMCVDYTDLNKACPKDSYPL 458
V+ ++ +L IR P W+ K+R+ +DY LN+ D +P+
Sbjct: 223 VESQIQDMLNQGIIRTSNSPYNSPIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDRHPI 282
Query: 459 PSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADEDKTAFMTARVNYCYRTMPFGLKNAG 518
P++D ++ + +D G+HQI M P KTAF T +Y Y MPFGLKNA
Sbjct: 283 PNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKNAP 342
Query: 519 ATYQRLMDRVFARQVGRNMEVYVDDMIVKSVRGLDHHQDLEEAFGEIRKHSMRLNPEKCS 578
AT+QR M+ + + ++ VY+DD+IV S +H Q L F ++ K +++L +KC
Sbjct: 343 ATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDKCE 402
Query: 579 FGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEVQRLTGRIAALSRFLPKSGDR 638
F Q FLG ++T GI+ NPEK +AIQ+ P+ KE++ G +F+P D
Sbjct: 403 FLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTGYYRKFIPNFADI 462
Query: 639 SFPFFKCLRKNVVFEWT-AECEEAFVRLKELLSSPPILSKPIQGHPLHLYFAVSDSALSS 697
+ P KCL+KN+ + T E + AF +LK L+S PIL P L SD AL +
Sbjct: 463 AKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLTTDASDVALGA 522
Query: 698 VMLQEIDGEHRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRLRPYFQSFPVKVRTD-L 756
V+ Q DG H + Y +S TL E+ Y IEK LA++ + R Y ++ +D
Sbjct: 523 VLSQ--DG-HPLSY-ISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGRHFEISSDHQ 578
Query: 757 PLRQVLQKPDLSGRLVAWSVELSEY 781
PL + + D + +L W V+LSE+
Sbjct: 579 PLSWLYRMKDPNSKLTRWRVKLSEF 603
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 204 bits (518), Expect = 2e-51
Identities = 125/414 (30%), Positives = 211/414 (50%), Gaps = 9/414 (2%)
Query: 379 LALNPSVKPVSQLRRRLGGDKGKAVQQEVDKLLAAEFIREVKYPTWLANVVMVKKANGKW 438
+ L +P+ Q R + +++ + K+L + IRE K P W + VV+VKK +G
Sbjct: 934 IELKEGAEPIRQKPRPIPLALKPEIRKMIQKMLNQKVIRESKSP-WSSPVVLVKKKDGSI 992
Query: 439 RMCVDYTDLNKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADEDKTAF 498
RMC+DY +NK +++PLP+I++ + +G +L ++ D +G+ QI + ++ TAF
Sbjct: 993 RMCIDYRKVNKVVKNNAHPLPNIEATLQSLAGKKLYTVFDMIAGFWQIPLDEKSKEITAF 1052
Query: 499 MTARVNYCYRTMPFGLKNAGATYQRLMDRVFARQVGRNMEVYVDDMIVKSVRGLDHHQDL 558
+ + +PFGL + A +Q M+ + +G VYVDD+++ S H QD+
Sbjct: 1053 AIGSELFEWNVLPFGLVISPALFQGTMEEIIGDLLGVCAFVYVDDLLIASKDMEQHLQDV 1112
Query: 559 EEAFGEIRKHSMRLNPEKCSFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEV 618
+EA IRK M+L KC + ++LG +T G+E K ++Q P+NVKE+
Sbjct: 1113 KEALTRIRKSGMKLRASKCHIAKKEVEYLGHKVTLDGVETQEVKTDKMKQFSRPTNVKEL 1172
Query: 619 QRLTGRIAALSRFLPKSGDRSFPFFKCLRKNVVFEWTAECEEAFVRLKELLSSPPILSKP 678
Q G + +F+ + + V + W E E AF LK+L+ P+L++P
Sbjct: 1173 QSFLGLVGYYRKFILNFAQIASSLTSLISAKVAWIWEKEQEIAFQELKKLVCQTPVLAQP 1232
Query: 679 -----IQG-HPLHLYFAVSDSALSSVMLQE-IDGEHRIVYFVSHTLQGAEVRYQKIEKAA 731
++G P +Y S + +V+ QE DG+ + F S L AE RY + A
Sbjct: 1233 DVEAALKGDRPFMIYTDASRKGIGAVLAQEGPDGQQHPIAFASKALSPAETRYHITDLEA 1292
Query: 732 LAVLVTARRLRPYFQSFPVKVRTD-LPLRQVLQKPDLSGRLVAWSVELSEYGLQ 784
LA++ RR + + V TD PL +L+ L+ RL WS+E+ E+ ++
Sbjct: 1293 LAMMFALRRFKTIIYGTAITVFTDHKPLISLLKGSPLADRLWRWSIEILEFDVK 1346
Score = 104 bits (259), Expect = 2e-21
Identities = 127/571 (22%), Positives = 237/571 (41%), Gaps = 47/571 (8%)
Query: 812 DTQWTLFVDGSSNSSGSGAGVALEGPGELVLEQSLKFEFKATN------NQAEYEALIAG 865
D + ++ D S G GA +A EGP + + F KA + + + EAL
Sbjct: 1241 DRPFMIYTDASRK--GIGAVLAQEGPDGQ--QHPIAFASKALSPAETRYHITDLEALAMM 1296
Query: 866 LKLAREVKI---RSLLIRTDSQLVENQVKGTFQVKDPNLIKYLERV-RYLMTLFQ-EVVV 920
L R I ++ + TD + + + +KG+ +R+ R+ + + + +V +
Sbjct: 1297 FALRRFKTIIYGTAITVFTDHKPLISLLKGS---------PLADRLWRWSIEILEFDVKI 1347
Query: 921 EYVPRAENQRADALAKLASTRKPGNNKSVIQETLAYPSIEGELMSCVNRG---------- 970
Y+ N ADAL++ + + T +I+ EL ++
Sbjct: 1348 VYLAGKANAVADALSRGGCPPNELEEEQTKELTSIVNAIQTELPDILDSSCWLERLKGED 1407
Query: 971 RTWMDPIISILAGDPAEVEQCTKEQQREASHYTLIDGHLYRRGFSTPLLKCVSPEKYEA- 1029
W + I ++ G + + + Y I G + + + V PEK
Sbjct: 1408 EGWKEVIAALEGGKTKGTFKIVGIESEISLEYYKIVGGVLKNTEIEEQSRSVVPEKIRTP 1467
Query: 1030 IMSEVHEGVCASHIGGRSLACKVLRAGFYWPTLRKDCMDFVKQCKECQVFADLSKAPPKE 1089
++ E+HEG+ A H G + + +++ FYWP +R + V+ C +C D SK
Sbjct: 1468 LLKELHEGMLAGHFGIKKM-WRMVHRKFYWPQMRVCVENCVRTCAKCLCANDHSKLT-SS 1525
Query: 1090 LVTMSAPWPFAMWGVDLVGPFPTARAQMKFILVAVDYFTKWIEAEPLAKITSAKIVNFYW 1149
L +P + DL+ + + ++IL +D FTK+ A P+ + ++ +
Sbjct: 1526 LTPYRMTFPLEIVACDLMDVGLSVQGN-RYILTIIDLFTKYGTAVPIPDKKAETVLKAFV 1584
Query: 1150 KRIVCRFG-IPRAIVSDNGTQFSSSQTREFCKEMGIQMRFASVEHPQTNGQVESANRVIL 1208
+R G IP +++D G +F + +F + I+ + + NG VE N+ I+
Sbjct: 1585 ERWAIGEGRIPLKLLTDQGKEFVNGLFAQFTHMLKIEHITTKGYNSRANGAVERFNKTIM 1644
Query: 1209 RGLRRRLAEAKGAWLDELPAVLWSYNTTEQSTTRETPFRMTYGVDAMLPVEI--DNFTWR 1266
++++ A W D++ +++YN T ETP + +G D M P+E+ ++
Sbjct: 1645 HIMKKKTAVPM-EWDDQVVYAVYAYNNCVHENTGETPMFLMHGRDVMGPLEMSGEDAVGI 1703
Query: 1267 TRPGFEEENQANMAVELDLLSETRDEAHIRETAMKQRVAAKFNSRVRVRDMQVGDLVL-- 1324
+E L + ++ A + + K K+ S+ + R Q G VL
Sbjct: 1704 NYADMDEYKHLLTQELLKVQKIAKEHAMREQESYKSLFDQKYASK-KHRFPQPGSRVLLE 1762
Query: 1325 --KWRSGAPGNKLTPNWEGPYRIVKVLGNGA 1353
+ GA KL W GPYR++ N A
Sbjct: 1763 IPSEKLGAQCPKLVNKWSGPYRVISCSENSA 1793
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 189 bits (481), Expect = 3e-47
Identities = 122/390 (31%), Positives = 189/390 (48%), Gaps = 10/390 (2%)
Query: 403 VQQEVDKLLAAEFIRE----VKYPTWLANVVMVKKANGKWRMCVDYTDLNKACPKDSYPL 458
V+ +V ++L IRE PTW+ K+R+ +DY LN+ D YP+
Sbjct: 222 VENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVVIDYRKLNEITIPDRYPI 281
Query: 459 PSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADEDKTAFMTARVNYCYRTMPFGLKNAG 518
P++D ++ + + +D G+HQI M KTAF T +Y Y MPFGL+NA
Sbjct: 282 PNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTKSGHYEYLRMPFGLRNAP 341
Query: 519 ATYQRLMDRVFARQVGRNMEVYVDDMIVKSVRGLDHHQDLEEAFGEIRKHSMRLNPEKCS 578
AT+QR M+ + + ++ VY+DD+I+ S +H ++ F ++ +++L +KC
Sbjct: 342 ATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLVFTKLADANLKLQLDKCE 401
Query: 579 FGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEVQRLTGRIAALSRFLPKSGDR 638
F + FLG ++T GI+ NP K KAI P+ KE++ G +F+P D
Sbjct: 402 FLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAFLGLTGYYRKFIPNYADI 461
Query: 639 SFPFFKCLRKNVVFE-WTAECEEAFVRLKELLSSPPILSKPIQGHPLHLYFAVSDSALSS 697
+ P CL+K + E EAF +LK L+ PIL P L S+ AL +
Sbjct: 462 AKPMTSCLKKRTKIDTQKLEYIEAFEKLKALIIRDPILQLPDFEKKFVLTTDASNLALGA 521
Query: 698 VMLQEIDGEHRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRLRPYFQSFPVKVRTD-L 756
V+ Q + F+S TL E+ Y IEK LA++ + R Y + +D
Sbjct: 522 VLSQ----NGHPISFISRTLNDHELNYSAIEKELLAIVWATKTFRHYLLGRQFLIASDHQ 577
Query: 757 PLRQVLQKPDLSGRLVAWSVELSEYGLQYD 786
PLR + + +L W V LSEY + D
Sbjct: 578 PLRWLHNLKEPGAKLERWRVRLSEYQFKID 607
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 189 bits (480), Expect = 4e-47
Identities = 124/401 (30%), Positives = 202/401 (49%), Gaps = 21/401 (5%)
Query: 403 VQQEVDKLLAAEFIRE----VKYPTWLANVVMVKKANGK--WRMCVDYTDLNKACPKDSY 456
V++++D+LL IR P W+ V K NG+ +RM VD+ LN D+Y
Sbjct: 139 VERQIDELLQDGIIRPSNSPYNSPIWI--VPKKPKPNGEKQYRMVVDFKRLNTVTIPDTY 196
Query: 457 PLPSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADEDKTAFMTARVNYCYRTMPFGLKN 516
P+P I++ + + + +D SG+HQI M +D KTAF T Y + +PFGLKN
Sbjct: 197 PIPDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPFGLKN 256
Query: 517 AGATYQRLMDRVFARQVGRNMEVYVDDMIVKSVRGLDHHQDLEEAFGEIRKHSMRLNPEK 576
A A +QR++D + +G+ VY+DD+IV S H ++L + K ++++N EK
Sbjct: 257 APAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNLEK 316
Query: 577 CSFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEVQRLTGRIAALSRFLPKSG 636
F +FLG+++T+ GI+ +P+K +AI +M P++VKE++R G + +F+
Sbjct: 317 SHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYRKFIQDYA 376
Query: 637 DRSFPFFKCLR-----------KNVVFEWTAECEEAFVRLKELLSSPPILSKPIQGHPLH 685
+ P R V ++F LK +L S IL+ P P H
Sbjct: 377 KVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKPFH 436
Query: 686 LYFAVSDSALSSVMLQEIDGEHRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRLRPY- 744
L S+ A+ +V+ Q+ G R + ++S +L E Y IEK LA++ + LR Y
Sbjct: 437 LTTDASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLDNLRAYL 496
Query: 745 FQSFPVKVRTD-LPLRQVLQKPDLSGRLVAWSVELSEYGLQ 784
+ + +KV TD PL L + + +L W + EY +
Sbjct: 497 YGAGTIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCE 537
Score = 35.4 bits (80), Expect = 1.1
Identities = 42/209 (20%), Positives = 81/209 (38%), Gaps = 14/209 (6%)
Query: 1041 SHIGGRSLACKVLRAGFYWPTLRKDCMDFVKQCKECQVFA-DLSKAPPKELVTMSAPWPF 1099
+H G + ++L +Y+P + C+ C+++ + P T +P
Sbjct: 704 AHRGPTEIRLQLLEK-YYFPRMSSTIRLQTSSCQCCKLYKYERHPNKPNLQPTPIPNYPC 762
Query: 1100 AMWGVDLVGPFPTARAQMKFILVAVDYFTKWIEAEPLAKITSAKIVNFYWKRIVCRFGIP 1159
+ +D+ + + L +D F+K+ + L S + + + F P
Sbjct: 763 EILHIDIFA------LEKRLYLSCIDKFSKFAKLFHLQSKASVHLRETLVEALHY-FTAP 815
Query: 1160 RAIVSDNGTQFSSSQTREFCKEMGIQMRFASVEHPQTNGQVESANRVIL---RGLRRRLA 1216
+ +VSDN + + + I + +A + + NGQVE + L R L+ L
Sbjct: 816 KVLVSDNERGLLCPTVLNYLRSLDIDLYYAPTQKSEVNGQVERFHSTFLEIYRCLKDELP 875
Query: 1217 EAKGAWLDELPAVLWSYNTTEQSTTRETP 1245
K L + + YNT+ S T P
Sbjct: 876 TFKPVELVHI--AVDRYNTSVHSVTNRKP 902
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 187 bits (476), Expect = 1e-46
Identities = 128/443 (28%), Positives = 207/443 (45%), Gaps = 11/443 (2%)
Query: 346 ETRLTKLLGENLDLFAWSCKDMPGIDPNFICHRLALNPSVKPVSQLRRRLGGDKGKAVQQ 405
+++L + E +D+FA + P N +L L +PV R + + +Q
Sbjct: 276 KSQLENICSEYIDIFALESE--PITVNNLYKQQLRLKDD-EPVYTKNYRSPHSQVEEIQA 332
Query: 406 EVDKLLAAEFIREVKYPTWLANVVMVKKANG------KWRMCVDYTDLNKACPKDSYPLP 459
+V KL+ + + E + + +++V K + KWR+ +DY +NK D +PLP
Sbjct: 333 QVQKLIKDKIV-EPSVSQYNSPLLLVPKKSSPNSDKKKWRLVIDYRQINKKLLADKFPLP 391
Query: 460 SIDSLVDGASGNELLSLMDAYSGYHQIRMHPADEDKTAFMTARVNYCYRTMPFGLKNAGA 519
ID ++D + S +D SG+HQI + D T+F T+ +Y + +PFGLK A
Sbjct: 392 RIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTSNGSYRFTRLPFGLKIAPN 451
Query: 520 TYQRLMDRVFARQVGRNMEVYVDDMIVKSVRGLDHHQDLEEAFGEIRKHSMRLNPEKCSF 579
++QR+M F+ +Y+DD+IV ++L E FG+ R+++++L+PEKCSF
Sbjct: 452 SFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEVFGKCREYNLKLHPEKCSF 511
Query: 580 GVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEVQRLTGRIAALSRFLPKSGDRS 639
+ FLG T +GI + +K IQ P + +R RF+ D S
Sbjct: 512 FMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRFVAFCNYYRRFIKNFADYS 571
Query: 640 FPFFKCLRKNVVFEWTAECEEAFVRLKELLSSPPILSKPIQGHPLHLYFAVSDSALSSVM 699
+ +KNV FEWT EC++AF+ LK L +P +L P + S A +V+
Sbjct: 572 RHITRLCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSKEFCITTDASKQACGAVL 631
Query: 700 LQEIDGEHRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRLRPYFQSFPVKVRTD-LPL 758
Q +G V + S E E+ A+ RPY V+TD PL
Sbjct: 632 TQNHNGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYIYGKHFTVKTDHRPL 691
Query: 759 RQVLQKPDLSGRLVAWSVELSEY 781
+ + S +L +EL EY
Sbjct: 692 TYLFSMVNPSSKLTRIRLELEEY 714
Score = 108 bits (269), Expect = 1e-22
Identities = 76/336 (22%), Positives = 157/336 (46%), Gaps = 7/336 (2%)
Query: 1017 PLLKCVSPEKYEAIMSEVHEG-VCASHIGGRSLACKVLRAGFYWPTLRKDCMDFVKQCKE 1075
P+ + + ++ EAI+S +H+ + H G KV R +YW + K ++V++C++
Sbjct: 883 PVTQINNEKEKEAILSTLHDDPIQGGHTGITKTLAKVKRH-YYWKNMSKYIKEYVRKCQK 941
Query: 1076 CQVFADLSKAPPKELVTMSAPWPFAMWGVDLVGPFPTARAQMKFILVAVDYFTKWIEAEP 1135
CQ +T + F VD +GP P + ++ + + TK++ A P
Sbjct: 942 CQKAKTTKHTKTPMTITETPEHAFDRVVVDTIGPLPKSENGNEYAVTLICDLTKYLVAIP 1001
Query: 1136 LAKITSAKIVNFYWKRIVCRFGIPRAIVSDNGTQFSSSQTREFCKEMGIQMRFASVEHPQ 1195
+A ++ + ++ + ++G + ++D GT++ +S + CK + I+ ++ H Q
Sbjct: 1002 IANKSAKTVAKAIFESFILKYGPMKTFITDMGTEYKNSIITDLCKYLKIKNITSTAHHHQ 1061
Query: 1196 TNGQVESANRVILRGLRRRLAEAKGAWLDELPAVLWSYNTTEQSTTRETPFRMTYGVDAM 1255
T G VE ++R + +R ++ K W L ++ +NTT+ P+ + +G +
Sbjct: 1062 TVGVVERSHRTLNEYIRSYISTDKTDWDVWLQYFVYCFNTTQSMVHNYCPYELVFGRTSN 1121
Query: 1256 LPVEIDNFTWRTRPGFEEENQANMAVELDLLSETRDEAHIRETAMKQRVAAKFNSRVRVR 1315
LP N P + ++ A + ++ R + A K++ ++ +V+
Sbjct: 1122 LPKHF-NKLHSIEPIYNIDDYAKESKYRLEVAYARARKLLE--AHKEKNKENYDLKVKDI 1178
Query: 1316 DMQVGDLVLKWRSGAPGNKLTPNWEGPYRIVKVLGN 1351
+++VGD VL G+KL + GPY+I + N
Sbjct: 1179 ELEVGDKVL--LRNEVGHKLDFKYTGPYKIESIGDN 1212
>POL_SFV3L (P27401) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1157
Score = 180 bits (457), Expect = 2e-44
Identities = 227/993 (22%), Positives = 404/993 (39%), Gaps = 120/993 (12%)
Query: 428 VVMVKKANGKWRMCVDYTDLNKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGYHQIR 487
V V K +GKWRM +DY ++NK P + ++ + + +D +G+
Sbjct: 216 VYPVPKPDGKWRMVLDYREVNKTIPLIAAQNQHSAGILSSIFRGKYKTTLDLSNGFWAHS 275
Query: 488 MHPADEDKTAFMTARVNYCYRTMPFGLKNAGATYQRLMDRVFARQVGRNMEVYVDDMIVK 547
+ P TAF YC+ +P G N+ A + D V + N++VYVDD+ +
Sbjct: 276 ITPESYWLTAFTWLGQQYCWTRLPQGFLNSPALFTA--DVVDLLKEVPNVQVYVDDIYIS 333
Query: 548 SVRGLDHHQDLEEAFGEIRKHSMRLNPEKCSFGVQGGKFLGFMITSRGIEINPEKCKAIQ 607
+H + LE+ F + ++ +K +FLGF IT G + + +
Sbjct: 334 HDDPREHLEQLEKVFSLLLNAGYVVSLKKSEIAQHEVEFLGFNITKEGRGLTETFKQKLL 393
Query: 608 QMKSPSNVKEVQRLTGRIAALSRFLPKSGDRSFPFFKCLR--KNVVFEWTAECEEAFVRL 665
+ P ++K++Q + G + F+P + P + + WT + + +
Sbjct: 394 NITPPRDLKQLQSILGLLNFARNFIPNFSELVKPLYNIIATANGKYITWTTDNSQQLQNI 453
Query: 666 KELLSSPPILSKPIQGHPLHLYFAVSDSALSSVMLQEIDGEHRIVYFVSHTLQGAEVRYQ 725
+L+S L + + + L V+ S + + + R + ++++ AEV++
Sbjct: 454 ISMLNSAENLEE--RNPEVRLIMKVNTSPSAGYIRFYNEFAKRPIMYLNYVYTKAEVKFT 511
Query: 726 KIEKAALAV---LVTARRL---RPYFQSFPVKVRTDLPLRQVLQKPDLSGRLVAWSVELS 779
EK + L+ A L + P+ T + + ++ L R + W L
Sbjct: 512 NTEKLLTTIHKGLIKALDLGMGQEILVYSPIVSMTKIQKTPLPERKALPIRWITWMSYLE 571
Query: 780 EYGLQYDKRGTVGAQSLA-----DFVVELT-PDRFERVDTQWTLFVDGSS-------NSS 826
+ +Q+ T+ D + ++ P F V + DGS+ S
Sbjct: 572 DPRIQFHYDKTLPELQQVPTVTDDIIAKIKHPSEFSMV-----FYTDGSAIKHPNVNKSH 626
Query: 827 GSGAGVALEGPGELVLEQSLKFEFKATN-------NQAEYEALIAGLKLAREVKIR---S 876
+G G+A + K EF N + A +A ++ A + ++
Sbjct: 627 NAGMGIA---------QVQFKPEFTVINTWSIPLGDHTAQLAEVAAVEFACKKALKIDGP 677
Query: 877 LLIRTDSQLVENQVK---------GTFQVKDPNLIKYLERVRYLMTLFQEVVVEYVPRAE 927
+LI TDS V V G F K L K++ + + + Q + +
Sbjct: 678 VLIVTDSFYVAESVNKELPYWQSNGFFNNKKKPL-KHVSKWKSIADCIQLKPDIIIIHEK 736
Query: 928 NQRADALAKLASTRKPGNNKSVIQETLAYPSIEGELMSCVNRGRTWMDPIISILAGDPAE 987
+ A ++ GNN + + LA +G + +N P + AE
Sbjct: 737 GHQPTA----STFHTEGNN---LADKLA---TQGSYVVNINT-----TPSLD------AE 775
Query: 988 VEQCTKEQQREAS----HYTLIDGHLYRRGFSTPLLKCVSPEKYEA--IMSEVHEGVCAS 1041
++Q + Q + Y L +G + + P K + P K + I+ + H +
Sbjct: 776 LDQLLQGQYPKGFPKHYQYQLENGQVM---VTRPNGKRIIPPKSDRPQIILQAHN---IA 829
Query: 1042 HIGGRSLACKVLRAGFYWPTLRKDCMDFVKQCKECQVFADLSKAPPKELVTMSAPWPFAM 1101
H G S KV + ++WP LRKD + ++QCK+C V + A P L PF
Sbjct: 830 HTGRDSTFLKV-SSKYWWPNLRKDVVKVIRQCKQCLVTNAATLAAPPILRPERPVKPFDK 888
Query: 1102 WGVDLVGPFPTARAQMKFILVAVDYFTKWIEAEPL-AKITSAKIVNFYWKRIVCRFGIPR 1160
+ +D +GP P + + +LV VD T ++ P A TSA + ++ +P+
Sbjct: 889 FFIDYIGPLPPSNGYLH-VLVVVDSMTGFVWLYPTKAPSTSATVKAL---NMLTSIAVPK 944
Query: 1161 AIVSDNGTQFSSSQTREFCKEMGIQMRFASVEHPQTNGQVESANRVILRGLRRRLAEAKG 1220
I SD G F+S+ ++ K GIQ+ F++ HPQ++G+VE N I R L + L
Sbjct: 945 VIHSDQGAAFTSATFADWAKNKGIQLEFSTPYHPQSSGKVERKNSDIKRLLTKLLVGRPA 1004
Query: 1221 AWLDELPAVLWSYNTTEQSTTRETPFRMTYGVDAMLPVEIDNFTWRTRPGFEEENQANMA 1280
W D LP V + N + +++ TP ++ +G+D+ P + +R EE
Sbjct: 1005 KWYDLLPVVQLALNNSYSPSSKYTPHQLLFGIDSNTPFANSDTLDLSR---EE------- 1054
Query: 1281 VELDLLSETRDEAHIRETAMKQRVAAKFNSRVRVRDMQVGDLVLKWRSGAPGNKLTPNWE 1340
EL LL E R ++ T + +R VG LV + R P + L P W
Sbjct: 1055 -ELSLLQEIRSSLYLPSTP---------PASIRAWSPSVGQLVQE-RVARPAS-LRPRWH 1102
Query: 1341 GPYRIVKVLGNGAYHLEELDGRRLPRSFNGLSL 1373
P +++V+ A + + G R S + L L
Sbjct: 1103 KPTPVLEVINPRAVVILDHLGNRRTVSVDNLKL 1135
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 176 bits (447), Expect = 3e-43
Identities = 119/416 (28%), Positives = 207/416 (49%), Gaps = 10/416 (2%)
Query: 399 KGKAVQQEVDKLLAAEFIREVKYPTWLANVVMVKKANGKWRMCVDYTDLNKACPKDSYPL 458
K +A+ E+++ L + IRE K V+ V K G RM VDY LNK + YPL
Sbjct: 424 KMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPL 482
Query: 459 PSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADEDKTAFMTARVNYCYRTMPFGLKNAG 518
P I+ L+ G+ + + +D S YH IR+ DE K AF R + Y MP+G+ A
Sbjct: 483 PLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAP 542
Query: 519 ATYQRLMDRVFARQVGRNMEVYVDDMIVKSVRGLDHHQDLEEAFGEIRKHSMRLNPEKCS 578
A +Q ++ + ++ Y+DD+++ S +H + +++ +++ ++ +N KC
Sbjct: 543 AHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCE 602
Query: 579 FGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEVQRLTGRIAALSRFLPKSGDR 638
F KF+G+ I+ +G E + Q K P N KE+++ G + L +F+PK+
Sbjct: 603 FHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQL 662
Query: 639 SFPFFKCLRKNVVFEWTAECEEAFVRLKELLSSPPILSKPIQGHPLHLYFAVSDSALSSV 698
+ P L+K+V ++WT +A +K+ L SPP+L + L SD A+ +V
Sbjct: 663 THPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAV 722
Query: 699 MLQEIDGE-HRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRLRPYFQSF--PVKVRTD 755
+ Q+ D + + V + S + A++ Y +K LA++ + + R Y +S P K+ TD
Sbjct: 723 LSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTD 782
Query: 756 ---LPLRQVLQKPDLSGRLVAWSVELSEYGLQYDKR-GTVG--AQSLADFVVELTP 805
L R + + RL W + L ++ + + R G+ A +L+ V E P
Sbjct: 783 HRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEP 838
Score = 127 bits (319), Expect = 2e-28
Identities = 108/455 (23%), Positives = 189/455 (40%), Gaps = 50/455 (10%)
Query: 920 VEYVPRAENQRADALAKLASTRKPGNNKSVIQETLAYPSIEGELMSCVNR---GRTWMDP 976
+ Y P + N ADAL+++ +P S E ++ VN+ + +
Sbjct: 815 INYRPGSANHIADALSRIVDETEPIPKDS-----------EDNSINFVNQISITDDFKNQ 863
Query: 977 IISILAGDPAEVEQCTKEQQREASHYTLIDGHLYRRGFSTPLLKCVSPEKYEAIMSEVHE 1036
+++ D + E +R + L DG L +L + I+ + HE
Sbjct: 864 VVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINS--KDQILLPNDTQLTRTIIKKYHE 921
Query: 1037 GVCASHIGGRSLACKVLRAGFYWPTLRKDCMDFVKQCKECQV--------FADLSKAPPK 1088
H G L +LR F W +RK ++V+ C CQ+ + L PP
Sbjct: 922 EGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPS 980
Query: 1089 ELVTMSAPWPFAMWGVDLVGPFPTARAQMKFILVAVDYFTKWIEAEPLAK-ITSAKIVNF 1147
E P+ +D + P + + V VD F+K P K IT+ +
Sbjct: 981 ER-------PWESLSMDFITALPESSGY-NALFVVVDRFSKMAILVPCTKSITAEQTARM 1032
Query: 1148 YWKRIVCRFGIPRAIVSDNGTQFSSSQTREFCKEMGIQMRFASVEHPQTNGQVESANRVI 1207
+ +R++ FG P+ I++DN F+S ++F + M+F+ PQT+GQ E N+ +
Sbjct: 1033 FDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTV 1092
Query: 1208 LRGLRRRLAEAKGAWLDELPAVLWSYNTTEQSTTRETPFRMTYGVD-AMLPVEIDNFTWR 1266
+ LR + W+D + V SYN S T+ TPF + + A+ P+E+
Sbjct: 1093 EKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLEL------ 1146
Query: 1267 TRPGFEEENQANMAVELDLLSETRDEAHIRETAMKQRVAAKFNSRVRVRDMQVGDLVLKW 1326
P F ++ N + + ++ + MK+ K + + Q GDLV+
Sbjct: 1147 --PSFSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQ---EIEEFQPGDLVMVK 1201
Query: 1327 RSGA----PGNKLTPNWEGPYRIVKVLGNGAYHLE 1357
R+ NKL P++ GP+ +++ G Y L+
Sbjct: 1202 RTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELD 1236
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 173 bits (438), Expect = 3e-42
Identities = 118/416 (28%), Positives = 207/416 (49%), Gaps = 10/416 (2%)
Query: 399 KGKAVQQEVDKLLAAEFIREVKYPTWLANVVMVKKANGKWRMCVDYTDLNKACPKDSYPL 458
K +A+ E+++ L + IRE K V+ V K G RM VDY LNK + YPL
Sbjct: 424 KMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPL 482
Query: 459 PSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADEDKTAFMTARVNYCYRTMPFGLKNAG 518
P I+ L+ G+ + + +D S YH IR+ DE K AF R + Y MP+G+ A
Sbjct: 483 PLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAP 542
Query: 519 ATYQRLMDRVFARQVGRNMEVYVDDMIVKSVRGLDHHQDLEEAFGEIRKHSMRLNPEKCS 578
A +Q ++ + ++ Y+D++++ S +H + +++ +++ ++ +N KC
Sbjct: 543 AHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCE 602
Query: 579 FGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEVQRLTGRIAALSRFLPKSGDR 638
F KF+G+ I+ +G E + Q K P N KE+++ G + L +F+PK+
Sbjct: 603 FHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQL 662
Query: 639 SFPFFKCLRKNVVFEWTAECEEAFVRLKELLSSPPILSKPIQGHPLHLYFAVSDSALSSV 698
+ P L+K+V ++WT +A +K+ L SPP+L + L SD A+ +V
Sbjct: 663 THPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAV 722
Query: 699 MLQEIDGE-HRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRLRPYFQSF--PVKVRTD 755
+ Q+ D + + V + S + A++ Y +K LA++ + + R Y +S P K+ TD
Sbjct: 723 LSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTD 782
Query: 756 ---LPLRQVLQKPDLSGRLVAWSVELSEYGLQYDKR-GTVG--AQSLADFVVELTP 805
L R + + RL W + L ++ + + R G+ A +L+ V E P
Sbjct: 783 HRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEP 838
Score = 127 bits (319), Expect = 2e-28
Identities = 108/455 (23%), Positives = 189/455 (40%), Gaps = 50/455 (10%)
Query: 920 VEYVPRAENQRADALAKLASTRKPGNNKSVIQETLAYPSIEGELMSCVNR---GRTWMDP 976
+ Y P + N ADAL+++ +P S E ++ VN+ + +
Sbjct: 815 INYRPGSANHIADALSRIVDETEPIPKDS-----------EDNSINFVNQISITDDFKNQ 863
Query: 977 IISILAGDPAEVEQCTKEQQREASHYTLIDGHLYRRGFSTPLLKCVSPEKYEAIMSEVHE 1036
+++ D + E +R + L DG L +L + I+ + HE
Sbjct: 864 VVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINS--KDQILLPNDTQLTRTIIKKYHE 921
Query: 1037 GVCASHIGGRSLACKVLRAGFYWPTLRKDCMDFVKQCKECQV--------FADLSKAPPK 1088
H G L +LR F W +RK ++V+ C CQ+ + L PP
Sbjct: 922 EGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPS 980
Query: 1089 ELVTMSAPWPFAMWGVDLVGPFPTARAQMKFILVAVDYFTKWIEAEPLAK-ITSAKIVNF 1147
E P+ +D + P + + V VD F+K P K IT+ +
Sbjct: 981 ER-------PWESLSMDFITALPESSGY-NALFVVVDRFSKMAILVPCTKSITAEQTARM 1032
Query: 1148 YWKRIVCRFGIPRAIVSDNGTQFSSSQTREFCKEMGIQMRFASVEHPQTNGQVESANRVI 1207
+ +R++ FG P+ I++DN F+S ++F + M+F+ PQT+GQ E N+ +
Sbjct: 1033 FDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTV 1092
Query: 1208 LRGLRRRLAEAKGAWLDELPAVLWSYNTTEQSTTRETPFRMTYGVD-AMLPVEIDNFTWR 1266
+ LR + W+D + V SYN S T+ TPF + + A+ P+E+
Sbjct: 1093 EKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLEL------ 1146
Query: 1267 TRPGFEEENQANMAVELDLLSETRDEAHIRETAMKQRVAAKFNSRVRVRDMQVGDLVLKW 1326
P F ++ N + + ++ + MK+ K + + Q GDLV+
Sbjct: 1147 --PSFSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQ---EIEEFQPGDLVMVK 1201
Query: 1327 RSGA----PGNKLTPNWEGPYRIVKVLGNGAYHLE 1357
R+ NKL P++ GP+ +++ G Y L+
Sbjct: 1202 RTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELD 1236
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 173 bits (438), Expect = 3e-42
Identities = 118/416 (28%), Positives = 207/416 (49%), Gaps = 10/416 (2%)
Query: 399 KGKAVQQEVDKLLAAEFIREVKYPTWLANVVMVKKANGKWRMCVDYTDLNKACPKDSYPL 458
K +A+ E+++ L + IRE K V+ V K G RM VDY LNK + YPL
Sbjct: 424 KMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPL 482
Query: 459 PSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADEDKTAFMTARVNYCYRTMPFGLKNAG 518
P I+ L+ G+ + + +D S YH IR+ DE K AF R + Y MP+G+ A
Sbjct: 483 PLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAP 542
Query: 519 ATYQRLMDRVFARQVGRNMEVYVDDMIVKSVRGLDHHQDLEEAFGEIRKHSMRLNPEKCS 578
A +Q ++ + ++ Y+D++++ S +H + +++ +++ ++ +N KC
Sbjct: 543 AHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCE 602
Query: 579 FGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEVQRLTGRIAALSRFLPKSGDR 638
F KF+G+ I+ +G E + Q K P N KE+++ G + L +F+PK+
Sbjct: 603 FHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQL 662
Query: 639 SFPFFKCLRKNVVFEWTAECEEAFVRLKELLSSPPILSKPIQGHPLHLYFAVSDSALSSV 698
+ P L+K+V ++WT +A +K+ L SPP+L + L SD A+ +V
Sbjct: 663 THPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAV 722
Query: 699 MLQEIDGE-HRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRLRPYFQSF--PVKVRTD 755
+ Q+ D + + V + S + A++ Y +K LA++ + + R Y +S P K+ TD
Sbjct: 723 LSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTD 782
Query: 756 ---LPLRQVLQKPDLSGRLVAWSVELSEYGLQYDKR-GTVG--AQSLADFVVELTP 805
L R + + RL W + L ++ + + R G+ A +L+ V E P
Sbjct: 783 HRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEP 838
Score = 127 bits (319), Expect = 2e-28
Identities = 108/455 (23%), Positives = 189/455 (40%), Gaps = 50/455 (10%)
Query: 920 VEYVPRAENQRADALAKLASTRKPGNNKSVIQETLAYPSIEGELMSCVNR---GRTWMDP 976
+ Y P + N ADAL+++ +P S E ++ VN+ + +
Sbjct: 815 INYRPGSANHIADALSRIVDETEPIPKDS-----------EDNSINFVNQISITDDFKNQ 863
Query: 977 IISILAGDPAEVEQCTKEQQREASHYTLIDGHLYRRGFSTPLLKCVSPEKYEAIMSEVHE 1036
+++ D + E +R + L DG L +L + I+ + HE
Sbjct: 864 VVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINS--KDQILLPNDTQLTRTIIKKYHE 921
Query: 1037 GVCASHIGGRSLACKVLRAGFYWPTLRKDCMDFVKQCKECQV--------FADLSKAPPK 1088
H G L +LR F W +RK ++V+ C CQ+ + L PP
Sbjct: 922 EGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPS 980
Query: 1089 ELVTMSAPWPFAMWGVDLVGPFPTARAQMKFILVAVDYFTKWIEAEPLAK-ITSAKIVNF 1147
E P+ +D + P + + V VD F+K P K IT+ +
Sbjct: 981 ER-------PWESLSMDFITALPESSGY-NALFVVVDRFSKMAILVPCTKSITAEQTARM 1032
Query: 1148 YWKRIVCRFGIPRAIVSDNGTQFSSSQTREFCKEMGIQMRFASVEHPQTNGQVESANRVI 1207
+ +R++ FG P+ I++DN F+S ++F + M+F+ PQT+GQ E N+ +
Sbjct: 1033 FDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTV 1092
Query: 1208 LRGLRRRLAEAKGAWLDELPAVLWSYNTTEQSTTRETPFRMTYGVD-AMLPVEIDNFTWR 1266
+ LR + W+D + V SYN S T+ TPF + + A+ P+E+
Sbjct: 1093 EKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLEL------ 1146
Query: 1267 TRPGFEEENQANMAVELDLLSETRDEAHIRETAMKQRVAAKFNSRVRVRDMQVGDLVLKW 1326
P F ++ N + + ++ + MK+ K + + Q GDLV+
Sbjct: 1147 --PSFSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQ---EIEEFQPGDLVMVK 1201
Query: 1327 RSGA----PGNKLTPNWEGPYRIVKVLGNGAYHLE 1357
R+ NKL P++ GP+ +++ G Y L+
Sbjct: 1202 RTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELD 1236
>POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)]
Length = 886
Score = 170 bits (430), Expect = 3e-41
Identities = 200/872 (22%), Positives = 353/872 (39%), Gaps = 87/872 (9%)
Query: 428 VVMVKKANGKWRMCVDYTDLNKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGYHQIR 487
V V K +G+WRM +DY ++NK P + ++ + + +D +G+
Sbjct: 5 VYPVPKPDGRWRMVLDYREVNKTIPLTAAQNQHSAGILATIVRQKYKTTLDLANGFWAHP 64
Query: 488 MHPADEDKTAFMTARVNYCYRTMPFGLKNAGATYQRLMDRVFARQVGRNMEVYVDDMIVK 547
+ P TAF YC+ +P G N+ A + D V + N++VYVDD+ +
Sbjct: 65 ITPESYWLTAFTWQGKQYCWTRLPQGFLNSPALFTA--DVVDLLKEIPNVQVYVDDIYLS 122
Query: 548 SVRGLDHHQDLEEAFGEIRKHSMRLNPEKCSFGVQGGKFLGFMITSRGIEINPEKCKAIQ 607
+H Q LE+ F + + ++ +K G + +FLGF IT G + +
Sbjct: 123 HDDPKEHVQQLEKVFQILLQAGYVVSLKKSEIGQKTVEFLGFNITKEGRGLTDTFKTKLL 182
Query: 608 QMKSPSNVKEVQRLTGRIAALSRFLPKSGDRSFPFFKCLR--KNVVFEWTAECEEAFVRL 665
+ P ++K++Q + G + F+P + P + + K EW+ E + +
Sbjct: 183 NITPPKDLKQLQSILGLLNFARNFIPNFAELVQPLYNLIASAKGKYIEWSEENTKQLNMV 242
Query: 666 KELLSSPPILSKPIQGHPLHLYFAVSDSALSSVMLQEIDGEHRIVYFVSHTLQGAEVRYQ 725
E L++ L + + L + S SA V G+ I+Y +++ AE+++
Sbjct: 243 IEALNTASNLEERLPEQRLVIKVNTSPSA-GYVRYYNETGKKPIMY-LNYVFSKAELKFS 300
Query: 726 KIEKAALAV---LVTARRL---RPYFQSFPVKVRTDLPLRQVLQKPDLSGRLVAWSVELS 779
+EK + L+ A L + P+ T + + ++ L R + W L
Sbjct: 301 MLEKLLTTMHKALIKAMDLAMGQEILVYSPIVSMTKIQKTPLPERKALPIRWITWMTYLE 360
Query: 780 EYGLQYDKRGTVGAQSLADFVVELT------PDRFERVDTQWTLFVDGS---------SN 824
+ +Q+ T+ V + P ++E V + DGS SN
Sbjct: 361 DPRIQFHYDKTLPELKHIPDVYTSSQSPVKHPSQYEGV-----FYTDGSAIKSPDPTKSN 415
Query: 825 SSGSGAGVALEGPGELVLEQSLKFEFKATNNQAEYEALIAGLKLAREVKIRSLLIRTDSQ 884
++G G A P VL Q T AE A+ K A ++ L+I
Sbjct: 416 NAGMGIVHATYKPEYQVLNQWSIPLGNHTAQMAEIAAVEFACKKALKIPGPVLVITDSFY 475
Query: 885 LVENQVKGTFQVKDPNLIKYLERVRYLMTLFQEVVVEYVPRAENQRADALAKLASTRKPG 944
+ E+ K K + ++ P + ++A+ S +
Sbjct: 476 VAESANKELPYWKSNGFVNNKKK----------------PLKHISKWKSIAECLSMKPD- 518
Query: 945 NNKSVIQETLAYPSIEGELMSCVNRGRTWMDPIISILAGDPAEVEQC-TKEQQREASHYT 1003
IQ I ++ + +G D LA + V C TK+ +A
Sbjct: 519 ---ITIQHE---KGISLQIPVFILKGNALADK----LATQGSYVVNCNTKKPNLDAELDQ 568
Query: 1004 LIDGHLYRRGFSTPL----------------LKCVSPEK-YEAIMSEVHEGVCASHIGGR 1046
L+ GH Y +G+ +K + P+ + I+ + H +H G
Sbjct: 569 LLQGH-YIKGYPKQYTYFLEDGKVKVSRPEGVKIIPPQSDRQKIVLQAHN---LAHTGRE 624
Query: 1047 SLACKVLRAGFYWPTLRKDCMDFVKQCKECQVFADLSKAPPKELVTMSAPWPFAMWGVDL 1106
+ K+ ++WP +RKD + + +C++C + +KA L PF + +D
Sbjct: 625 ATLLKIANL-YWWPNMRKDVVKQLGRCQQCLITNASNKASGPILRPDRPQKPFDKFFIDY 683
Query: 1107 VGPFPTARAQMKFILVAVDYFTKWIEAEPL-AKITSAKIVNFYWKRIVCRFGIPRAIVSD 1165
+GP P ++ + ++LV VD T + P A TSA + + ++ IP+ I SD
Sbjct: 684 IGPLPPSQGYL-YVLVVVDGMTGFTWLYPTKAPSTSATVKSL---NVLTSIAIPKVIHSD 739
Query: 1166 NGTQFSSSQTREFCKEMGIQMRFASVEHPQTNGQVESANRVILRGLRRRLAEAKGAWLDE 1225
G F+SS E+ KE GI + F++ HPQ+ +VE N I R L + L W D
Sbjct: 740 QGAAFTSSTFAEWAKERGIHLEFSTPYHPQSGSKVERKNSDIKRLLTKLLVGRPTKWYDL 799
Query: 1226 LPAVLWSYNTTEQSTTRETPFRMTYGVDAMLP 1257
LP V + N T + TP ++ +G+D+ P
Sbjct: 800 LPVVQLALNNTYSPVLKYTPHQLLFGIDSNTP 831
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 151 bits (381), Expect = 1e-35
Identities = 111/398 (27%), Positives = 189/398 (46%), Gaps = 23/398 (5%)
Query: 403 VQQEVDKLLAAEFIREVKYPTWLANVVMVKKA-----NGKWRMCVDYTDLNKACPKDSYP 457
V EV +LL IR + P V+ KK N R+ +D+ LN+ D YP
Sbjct: 197 VNNEVKQLLKDGIIRPSRSPYNSPTWVVDKKGTDAFGNPNKRLVIDFRKLNEKTIPDRYP 256
Query: 458 LPSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADEDKTAFMTARVNYCYRTMPFGLKNA 517
+PSI ++ + + +D SGYHQI + D +KT+F Y + +PFGL+NA
Sbjct: 257 MPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFSVNGGKYEFCRLPFGLRNA 316
Query: 518 GATYQRLMDRVFARQVGRNMEVYVDDMIVKSVRGLDHHQDLEEAFGEIRKHSMRLNPEKC 577
+ +QR +D V Q+G+ VYVDD+I+ S DH + ++ + +MR++ EK
Sbjct: 317 SSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHIDTVLKCLIDANMRVSQEKT 376
Query: 578 SFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEVQRLTGRIAALSRFLPKSGD 637
F + ++LGF+++ G + +PEK KAIQ+ P V +V+ G + F+
Sbjct: 377 RFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRSFLGLASYYRVFIKDFAA 436
Query: 638 RSFPFFKCLR-----------KNVVFEWTAECEEAFVRLKELLSSPPILSK-PIQGHPLH 685
+ P L+ K + E+ AF RL+ +L+S ++ K P P
Sbjct: 437 IARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNILASEDVILKYPDFKKPFD 496
Query: 686 LYFAVSDSALSSVMLQEIDGEHRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRLRPY- 744
L S S + +V+ Q E R + +S TL+ E Y E+ LA++ +L+ +
Sbjct: 497 LTTDASASGIGAVLSQ----EGRPITMISRTLKQPEQNYATNERELLAIVWALGKLQNFL 552
Query: 745 FQSFPVKVRTD-LPLRQVLQKPDLSGRLVAWSVELSEY 781
+ S + + TD PL + + + ++ W + ++
Sbjct: 553 YGSREINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQH 590
Score = 44.3 bits (103), Expect = 0.002
Identities = 66/289 (22%), Positives = 108/289 (36%), Gaps = 42/289 (14%)
Query: 1051 KVLRAGFYWPTLRKDCMDFVKQCKECQVFADLSKAPPKELVTMSAPWPFAMWGVDLVGPF 1110
+VLR +Y+P + + V C+ C A + P K+ + P P + + F
Sbjct: 759 QVLR-DYYFPKMGSLAKEVVANCRVCTQ-AKYDRHPKKQELG-ETPIPSYTGEMVHIDIF 815
Query: 1111 PTARAQMKFILVAVDYFTKWIEAEPLAKITSAKIVNFYWKRIVCRFGIPRAIVSDNGTQF 1170
T R K L +D F+K+ +P+ T I + I+ F + + DN F
Sbjct: 816 STDR---KLFLTCIDKFSKYAIVQPVVSRTIVDITAPLLQ-IINLFPNIKTVYCDNEPAF 871
Query: 1171 SSSQTREFCKE-MGIQMRFASVEHPQTNGQVESANRVILRGLR-RRLAEAKGAWLDELPA 1228
+S K GI + A H +NGQVE + + R +L + ++ +
Sbjct: 872 NSETVTSMLKNSFGIDIVNAPPLHSSSNGQVERFHSTLAEIARCLKLDKKTNDTVELILR 931
Query: 1229 VLWSYNTTEQSTTRETPFRMTYGVDAMLPVEIDNFTWRTRPGFEEENQANMAVELDLLSE 1288
YN T S TRE P + + PG E + ++ L+
Sbjct: 932 ATIEYNKTVHSVTRERPIEVVH------------------PGAHER---CLEIKARLVKA 970
Query: 1289 TRDEAHIRETAMKQRVAAKFNSRVRVRDMQVGDLVLKWRSGAPGNKLTP 1337
+D + + RV +VG+ V + GNKLTP
Sbjct: 971 QQDSIGRNNPSRQNRV------------FEVGERVFVKNNKRLGNKLTP 1007
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 132 bits (331), Expect = 8e-30
Identities = 127/498 (25%), Positives = 214/498 (42%), Gaps = 29/498 (5%)
Query: 321 PIEETKALKFGDRTLKIGTRLTEEQETRLTKLLGENLDLFAWSCKDMPGIDPN----FIC 376
P+EE L G R + +T+++ ++ +LL + C + P +DPN ++
Sbjct: 184 PLEEIAILSEGRRLSEEKLFITQQRMQKIEELLEK-------VCSENP-LDPNKTKQWMK 235
Query: 377 HRLALNPSVKPVSQLRRRLGGDKGKAVQQEVDKLLAAEFIREVKYPTWLANVVM---VKK 433
+ L+ K + + + +++ +LL + I+ K P ++ +K
Sbjct: 236 ASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEK 295
Query: 434 ANGKWRMCVDYTDLNKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADE 493
GK RM V+Y +NKA D+Y LP+ D L+ G ++ S D SG+ Q+ +
Sbjct: 296 RRGKKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESR 355
Query: 494 DKTAFMTARVNYCYRTMPFGLKNAGATYQRLMDRVFARQVGRNMEVYVDDMIVKSVRGLD 553
TAF + +Y + +PFGLK A + +QR MD F R + VYVDD++V S D
Sbjct: 356 PLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF-RVFRKFCCVYVDDILVFSNNEED 414
Query: 554 HHQDLEEAFGEIRKHSMRLNPEKCSFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSP- 612
H + + +H + L+ +K + FLG I + + I +
Sbjct: 415 HLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTL 474
Query: 613 SNVKEVQRLTGRIAALSRFLPKSGDRSFPFFKCLRKNVVFEWTAECEEAFVRLKELLSSP 672
+ K++QR G + S ++PK P L++NV + WT E ++K+ L
Sbjct: 475 EDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWRWTKEDTLYMQKVKKNLQGF 534
Query: 673 PILSKPIQGHPLHLYFAVSD----SALSSVMLQEIDGEHRIVYFVSHTLQGAEVRYQKIE 728
P L P+ L + SD L ++ + E I + S + + AE Y +
Sbjct: 535 PPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKNYHSND 594
Query: 729 KAALAVLVTARRLRPYFQSFPVKVRTD----LPLRQVLQKPDLS-GRLVAWSVELSEYGL 783
K LAV+ T ++ Y +RTD + K D GR + W LS Y
Sbjct: 595 KETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSF 654
Query: 784 QYDK-RGTVGAQSLADFV 800
+ +GT ADF+
Sbjct: 655 DVEHIKGT--DNHFADFL 670
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 131 bits (330), Expect = 1e-29
Identities = 127/498 (25%), Positives = 215/498 (42%), Gaps = 29/498 (5%)
Query: 321 PIEETKALKFGDRTLKIGTRLTEEQETRLTKLLGENLDLFAWSCKDMPGIDPN----FIC 376
P+EE L G R + +T+++ ++ +LL + C + P +DPN ++
Sbjct: 184 PLEEIAILSEGRRLSEEKLFITQQRMQKIEELLEK-------VCSENP-LDPNKTKQWMK 235
Query: 377 HRLALNPSVKPVSQLRRRLGGDKGKAVQQEVDKLLAAEFIREVKYPTWLANVVM---VKK 433
+ L+ K + + + +++ +LL + I+ K P ++ +K
Sbjct: 236 ASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEK 295
Query: 434 ANGKWRMCVDYTDLNKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADE 493
GK RM V+Y +NKA D+Y LP+ D L+ G ++ S D SG+ Q+ +
Sbjct: 296 RRGKKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESR 355
Query: 494 DKTAFMTARVNYCYRTMPFGLKNAGATYQRLMDRVFARQVGRNMEVYVDDMIVKSVRGLD 553
TAF + +Y + +PFGLK A + +QR MD F R + VYVDD++V S D
Sbjct: 356 PLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF-RVFRKFCCVYVDDILVFSNNEED 414
Query: 554 HHQDLEEAFGEIRKHSMRLNPEKCSFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSP- 612
H + + +H + L+ +K + FLG I + + I +
Sbjct: 415 HLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTL 474
Query: 613 SNVKEVQRLTGRIAALSRFLPKSGDRSFPFFKCLRKNVVFEWTAECEEAFVRLKELLSSP 672
+ K++QR G + S ++PK P L++NV ++WT E ++K+ L
Sbjct: 475 EDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGF 534
Query: 673 PILSKPIQGHPLHLYFAVSD----SALSSVMLQEIDGEHRIVYFVSHTLQGAEVRYQKIE 728
P L P+ L + SD L ++ + E I + S + + AE Y +
Sbjct: 535 PPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNYHSND 594
Query: 729 KAALAVLVTARRLRPYFQSFPVKVRTD----LPLRQVLQKPDLS-GRLVAWSVELSEYGL 783
K LAV+ T ++ Y +RTD + K D GR + W LS Y
Sbjct: 595 KETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSF 654
Query: 784 QYDK-RGTVGAQSLADFV 800
+ +GT ADF+
Sbjct: 655 DVEHIKGT--DNHFADFL 670
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 130 bits (326), Expect = 3e-29
Identities = 125/484 (25%), Positives = 209/484 (42%), Gaps = 22/484 (4%)
Query: 335 LKIGTRLTEEQETRLTKLLGENLDLFAWSCKDMPGIDPN----FICHRLALNPSVKPVSQ 390
L G RL+EE+ + + + +L C + P +DPN ++ + L+ K +
Sbjct: 191 LSEGRRLSEEKLFITQQRMQKIEELLEKVCSENP-LDPNKTKQWMKASIKLSDPSKAIKV 249
Query: 391 LRRRLGGDKGKAVQQEVDKLLAAEFIREVKYPTWLANVVM---VKKANGKWRMCVDYTDL 447
+ + +++ +LL + I+ K P ++ +K GK RM V+Y +
Sbjct: 250 KPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAM 309
Query: 448 NKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADEDKTAFMTARVNYCY 507
NKA D+Y LP+ D L+ G ++ S D SG+ Q+ + TAF + +Y +
Sbjct: 310 NKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEW 369
Query: 508 RTMPFGLKNAGATYQRLMDRVFARQVGRNMEVYVDDMIVKSVRGLDHHQDLEEAFGEIRK 567
+PFGLK A + +QR MD F R + VYVDD++V S DH + + +
Sbjct: 370 NVVPFGLKQAPSIFQRHMDEAF-RVFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQ 428
Query: 568 HSMRLNPEKCSFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSP-SNVKEVQRLTGRIA 626
H + L+ +K + FLG I + + I + + K++QR G +
Sbjct: 429 HGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILT 488
Query: 627 ALSRFLPKSGDRSFPFFKCLRKNVVFEWTAECEEAFVRLKELLSSPPILSKPIQGHPLHL 686
S ++PK P L++NV ++WT E ++K+ L P L P+ L +
Sbjct: 489 YASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLII 548
Query: 687 YFAVSD----SALSSVMLQEIDGEHRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRLR 742
SD L ++ + E I + S + + AE Y +K LAV+ T ++
Sbjct: 549 ETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKFS 608
Query: 743 PYFQSFPVKVRTD----LPLRQVLQKPDLS-GRLVAWSVELSEYGLQYDK-RGTVGAQSL 796
Y +RTD + K D GR + W LS Y + +GT
Sbjct: 609 IYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGT--DNHF 666
Query: 797 ADFV 800
ADF+
Sbjct: 667 ADFL 670
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 674
Score = 128 bits (322), Expect = 9e-29
Identities = 129/513 (25%), Positives = 217/513 (42%), Gaps = 27/513 (5%)
Query: 306 ENLDPRGEGRVNRPTPIEETKALKFGDRTLKIGTRLTEEQETRLTKLLGENLDLFAWSCK 365
E++ R + + P I K L G RL+EE+ + + + +L C
Sbjct: 162 ESMKKRSKTQQPEPVNISTNKIA-----ILSEGRRLSEEKLFITQQRMQKIEELLEKVCS 216
Query: 366 DMPGIDPN----FICHRLALNPSVKPVSQLRRRLGGDKGKAVQQEVDKLLAAEFIREVKY 421
+ P +DPN ++ + L+ K + + + +++ +LL + I+ K
Sbjct: 217 ENP-LDPNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKS 275
Query: 422 PTWLANVVM---VKKANGKWRMCVDYTDLNKACPKDSYPLPSIDSLVDGASGNELLSLMD 478
P ++ +K GK RM V+Y +NKA D+Y P+ D L+ G ++ S D
Sbjct: 276 PHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNPPNKDELLTLIRGKKIFSSFD 335
Query: 479 AYSGYHQIRMHPADEDKTAFMTARVNYCYRTMPFGLKNAGATYQRLMDRVFARQVGRNME 538
SG+ Q+ + TAF + +Y + +PFGLK A + +QR MD F R +
Sbjct: 336 CKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF-RVFRKFCC 394
Query: 539 VYVDDMIVKSVRGLDHHQDLEEAFGEIRKHSMRLNPEKCSFGVQGGKFLGFMITSRGIEI 598
VYVDD++V S DH + + +H + L+ +K + FLG I +
Sbjct: 395 VYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKP 454
Query: 599 NPEKCKAIQQMKSP-SNVKEVQRLTGRIAALSRFLPKSGDRSFPFFKCLRKNVVFEWTAE 657
+ I + + K++QR G + S ++PK P L++NV ++WT E
Sbjct: 455 QGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKE 514
Query: 658 CEEAFVRLKELLSSPPILSKPIQGHPLHLYFAVSD----SALSSVMLQEIDGEHRIVYFV 713
++K+ L P L P+ L + SD L ++ + E I +
Sbjct: 515 DTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYA 574
Query: 714 SHTLQGAEVRYQKIEKAALAVLVTARRLRPYFQSFPVKVRTD----LPLRQVLQKPDLS- 768
S + + AE Y +K LAV+ T ++ Y +RTD + K D
Sbjct: 575 SGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKL 634
Query: 769 GRLVAWSVELSEYGLQYDK-RGTVGAQSLADFV 800
GR + W LS Y + +GT ADF+
Sbjct: 635 GRNIRWQAWLSHYSFDVEHIKGT--DNHFADFL 665
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 680
Score = 126 bits (317), Expect = 4e-28
Identities = 123/484 (25%), Positives = 207/484 (42%), Gaps = 22/484 (4%)
Query: 335 LKIGTRLTEEQETRLTKLLGENLDLFAWSCKDMPGIDPN----FICHRLALNPSVKPVSQ 390
L G RL+EE+ + + + +L C + P +DPN ++ + L+ K +
Sbjct: 192 LSEGRRLSEEKLFITQQRMQKTEELLEKVCSENP-LDPNKTKQWMKASIKLSDPSKAIKV 250
Query: 391 LRRRLGGDKGKAVQQEVDKLLAAEFIREVKYPTWLANVVMVKKAN---GKWRMCVDYTDL 447
+ + +++ +LL + I+ K P ++ +A G RM V+Y +
Sbjct: 251 KPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAENGRGNKRMVVNYKAM 310
Query: 448 NKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADEDKTAFMTARVNYCY 507
NKA D+Y LP+ D L+ G ++ S D SG+ Q+ + TAF + +Y +
Sbjct: 311 NKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEW 370
Query: 508 RTMPFGLKNAGATYQRLMDRVFARQVGRNMEVYVDDMIVKSVRGLDHHQDLEEAFGEIRK 567
+PFGLK A + +QR MD F R + VYVDD++V S DH + + +
Sbjct: 371 NVVPFGLKQAPSIFQRHMDEAF-RVFRKFCCVYVDDIVVFSNNEEDHLLHVAMILQKCNQ 429
Query: 568 HSMRLNPEKCSFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSP-SNVKEVQRLTGRIA 626
H + L+ +K + FLG I + + I + + K++QR G +
Sbjct: 430 HGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILT 489
Query: 627 ALSRFLPKSGDRSFPFFKCLRKNVVFEWTAECEEAFVRLKELLSSPPILSKPIQGHPLHL 686
S ++P P L++NV ++WT E ++K+ L P L P+ L +
Sbjct: 490 YASDYIPNLAQMRQPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLII 549
Query: 687 YFAVSD----SALSSVMLQEIDGEHRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRLR 742
SD L ++ + E I + S + + AE Y +K LAV+ T ++
Sbjct: 550 ETDASDDYWGGMLKAIKINEGTNTELICRYRSGSFKAAERNYHSNDKETLAVINTIKKFS 609
Query: 743 PYFQSFPVKVRTD----LPLRQVLQKPDLS-GRLVAWSVELSEYGLQYDK-RGTVGAQSL 796
Y +RTD + K D GR + W LS Y + +GT
Sbjct: 610 IYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGT--DNHF 667
Query: 797 ADFV 800
ADF+
Sbjct: 668 ADFL 671
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 666
Score = 124 bits (311), Expect = 2e-27
Identities = 108/377 (28%), Positives = 170/377 (44%), Gaps = 15/377 (3%)
Query: 432 KKANGKWRMCVDYTDLNKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGYHQIRMHPA 491
++ GK RM V+Y +N+A DS+ LP++ L+ G + S D SG+ Q+ +
Sbjct: 287 ERRRGKKRMVVNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKSGFWQVVLDEE 346
Query: 492 DEDKTAFMTARVNYCYRTMPFGLKNAGATYQRLMDRVFARQVGRNMEVYVDDMIVKSVRG 551
+ TAF + ++ ++ +PFGLK A + +QR M + VYVDD+IV S
Sbjct: 347 SQKLTAFTCPQGHFQWKVVPFGLKQAPSIFQRHMQTAL-NGADKFCMVYVDDIIVFSNSE 405
Query: 552 LDHHQDLEEAFGEIRKHSMRLNPEKCSFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKS 611
LDH+ + + K+ + L+ +K + + FLG I +G P+ K
Sbjct: 406 LDHYNHVYAVLKIVEKYGIILSKKKANLFKEKINFLGLEI-DKGTHC-PQNHILENIHKF 463
Query: 612 PSNV---KEVQRLTGRIAALSRFLPKSGDRSFPFFKCLRKNVVFEWTAECEEAFVRLKEL 668
P + K +QR G + ++PK + P L+K+V + WT + ++K+
Sbjct: 464 PDRLEDKKHLQRFLGVLTYAETYIPKLAEIRKPLQVKLKKDVTWNWTQSDSDYVKKIKKN 523
Query: 669 LSSPPILSKPIQGHPLHLYFAVSDSALSSVM-LQEIDGEHRIVYFVSHTLQGAEVRYQKI 727
L S P L P L + SDS V+ + +DG I + S + + AE Y
Sbjct: 524 LGSFPKLYLPKPEDHLIIETDASDSFWGGVLKARALDGVELICRYSSGSFKQAEKNYHSN 583
Query: 728 EKAALAVLVTARRLRPYFQSFPVKVRTDLP-----LRQVLQKPDLSGRLVAWSVELSEYG 782
+K LAV + Y VRTD LR L+ GRLV W S+Y
Sbjct: 584 DKELLAVKQVITKFSAYLTPVRFTVRTDNKNFTYFLRINLKGDSKQGRLVRWQNWFSKY- 642
Query: 783 LQYDKRGTVGAQS-LAD 798
Q+D G ++ LAD
Sbjct: 643 -QFDVEHLEGVKNVLAD 658
>POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1161
Score = 122 bits (306), Expect = 7e-27
Identities = 104/347 (29%), Positives = 160/347 (45%), Gaps = 31/347 (8%)
Query: 1028 EAIMSEVHEGVCASHIGGRSLACKVLRAGFYWPTLRKDCMDFVKQCKECQVFADLSKAPP 1087
E I+S H +H G + KV + ++WP LRKD + ++QCK+C V + P
Sbjct: 817 EKIISTAHN---IAHTGRDATFLKV-SSKYWWPNLRKDVVKSIRQCKQCLVTNATNLTSP 872
Query: 1088 KELVTMSAPWPFAMWGVDLVGPFPTARAQMKFILVAVDYFTKWIEAEPL-AKITSAKIVN 1146
L + PF + +D +GP P + + +LV VD T ++ P A TSA +
Sbjct: 873 PILRPVKPLKPFDKFYIDYIGPLPPSNGYLH-VLVVVDSMTGFVWLYPTKAPSTSATVKA 931
Query: 1147 FYWKRIVCRFGIPRAIVSDNGTQFSSSQTREFCKEMGIQMRFASVEHPQTNGQVESANRV 1206
++ IP+ + SD G F+SS ++ KE GIQ+ F++ HPQ++G+VE N
Sbjct: 932 L---NMLTSIAIPKVLHSDQGAAFTSSTFADWAKEKGIQLEFSTPYHPQSSGKVERKNSD 988
Query: 1207 ILRGLRRRLAEAKGAWLDELPAVLWSYNTTEQSTTRETPFRMTYGVDAMLPVEIDNFTWR 1266
I R L + L W D LP V + N + +++ TP ++ +GVD+ P +
Sbjct: 989 IKRLLTKLLIGRPAKWYDLLPVVQLALNNSYSPSSKYTPHQLLFGVDSNTPFANSDTLDL 1048
Query: 1267 TRPGFEEENQANMAVELDLLSETRDEAHIRETAMKQRVAAKFNSRVRVRDMQVGDLVLKW 1326
+R EE EL LL E R H Q + +S R VG LV +
Sbjct: 1049 SR---EE--------ELSLLQEIRSSLH-------QPTSPPASS--RSWSPSVGQLVQE- 1087
Query: 1327 RSGAPGNKLTPNWEGPYRIVKVLGNGAYHLEELDGRRLPRSFNGLSL 1373
R P + L P W P I++V+ + + G R S + L L
Sbjct: 1088 RVARPAS-LRPRWHKPTAILEVVNPRTVIILDHLGNRRTVSVDNLKL 1133
Score = 87.4 bits (215), Expect = 2e-16
Identities = 87/417 (20%), Positives = 167/417 (39%), Gaps = 29/417 (6%)
Query: 381 LNPSVKPVSQLRRRLGGDKGKAVQQEVDKLLAAEFIREVKYPTWLANVVMVKKANGKWRM 440
+NP KP ++Q +D LL + + + T V V K +GKWRM
Sbjct: 182 INPKAKP--------------SIQIVIDDLLKQGVLIQ-QNSTMNTPVYPVPKPDGKWRM 226
Query: 441 CVDYTDLNKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADEDKTAFMT 500
+DY ++NK P + ++ + + +D +G+ + P TAF
Sbjct: 227 VLDYREVNKTIPLIAAQNQHSAGILSSIYRGKYKTTLDLTNGFWAHPITPESYWLTAFTW 286
Query: 501 ARVNYCYRTMPFGLKNAGATYQRLMDRVFARQVGRNMEVYVDDMIVKSVRGLDHHQDLEE 560
YC+ +P G N+ A + D V + N++ YVDD+ + +H + LE+
Sbjct: 287 QGKQYCWTRLPQGFLNSPALFTA--DVVDLLKEIPNVQAYVDDIYISHDDPQEHLEQLEK 344
Query: 561 AFGEIRKHSMRLNPEKCSFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEVQR 620
F + ++ +K + +FLGF IT G + + + + P ++K++Q
Sbjct: 345 IFSILLNAGYVVSLKKSEIAQREVEFLGFNITKEGRGLTDTFKQKLLNITPPKDLKQLQS 404
Query: 621 LTGRIAALSRFLPKSGDRSFPFFKCL--RKNVVFEWTAECEEAFVRLKELLSSPPILSKP 678
+ G + F+P + P + + WT + + +L+ L +
Sbjct: 405 ILGLLNFARNFIPNYSELVKPLYTIVANANGKFISWTEDNSNQLQHIISVLNQADNLEE- 463
Query: 679 IQGHPLHLYFAVSDSALSSVMLQEIDGEHRIVYFVSHTLQGAEVRYQKIEKAALAV---L 735
+ L V+ S + + +G R + +V++ AE ++ + EK + L
Sbjct: 464 -RNPETRLIIKVNSSPSAGYIRYYNEGSKRPIMYVNYIFSKAEAKFTQTEKLLTTMHKGL 522
Query: 736 VTARRL---RPYFQSFPVKVRTDLPLRQVLQKPDLSGRLVAWSVELSEYGLQ--YDK 787
+ A L + P+ T + + ++ L R + W L + +Q YDK
Sbjct: 523 IKAMDLAMGQEILVYSPIVSMTKIQRTPLPERKALPVRWITWMTYLEDPRIQFHYDK 579
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 659
Score = 114 bits (284), Expect = 2e-24
Identities = 133/545 (24%), Positives = 228/545 (41%), Gaps = 46/545 (8%)
Query: 289 KKSALVGH--RCYEIEASDENLDPRGEGRVNRPTPIEET--KALKFGDRTLKIGTRLTEE 344
K+S ++G + Y+ + + +VNRP PI T + L + + L E
Sbjct: 127 KQSVIIGKITKAYQYGVKGFLESMKKKSKVNRPEPINITSNQHLFLEEGGNHVDEMLYEI 186
Query: 345 Q-------ETRLTKLLGEN-LD---LFAWSCKDMPGIDPNFICHRLALNPSVKPVSQLRR 393
Q E L ++ EN +D W + IDP + VKP+S
Sbjct: 187 QISKFSAIEEMLERVSSENPIDPEKSKQWMTATIELIDPKTVV-------KVKPMSY--- 236
Query: 394 RLGGDKGKAVQQEVDKLLAAEFIREVKYPTWLANVVMVK----KANGKWRMCVDYTDLNK 449
+ +++ +LL + I+ K T ++ +V+ + GK RM V+Y +NK
Sbjct: 237 --SPSDREEFDRQIKELLELKVIKPSK-STHMSPAFLVENEAERRRGKKRMVVNYKAMNK 293
Query: 450 ACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADEDKTAFMTARVNYCYRT 509
A D++ LP+ D L+ G ++ S D SG Q+ + + TAF + +Y +
Sbjct: 294 ATKGDAHNLPNKDELLTLVRGKKIYSSFDCKSGLWQVLLDKESQLLTAFTCPQGHYQWNV 353
Query: 510 MPFGLKNAGATYQRLMDRVFARQVGRNMEVYVDDMIVKSVRG-LDHHQDLEEAFGEIRKH 568
+PFGLK A + + + + Q + VYVDD++V S G +H+ + K
Sbjct: 354 VPFGLKQAPSIFPKTYANSHSNQYSKYCCVYVDDILVFSNTGRKEHYIHVLNILRRCEKL 413
Query: 569 SMRLNPEKCSFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNV---KEVQRLTGRI 625
+ L+ +K + FLG I +G P+ K P + K++QR G +
Sbjct: 414 GIILSKKKAQLFKEKINFLGLEI-DQGTHC-PQNHILEHIHKFPDRIEDKKQLQRFLGIL 471
Query: 626 AALSRFLPKSGDRSFPFFKCLRKNVVFEWTAECEEAFVRLKELLSSPPILSKPIQGHPLH 685
S ++PK P L+++ + W + ++K+ L S P L P L
Sbjct: 472 TYASDYIPKLASIRKPLQSKLKEDSTWTWNDTDSQYMAKIKKNLKSFPKLYHPEPNDKLV 531
Query: 686 LYFAVSDSALSSVMLQEIDGEHRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRLRPYF 745
+ S+ ++ + I + S + + AE Y EK LAV+ ++ Y
Sbjct: 532 IETDASEEFWGGILKAIHNSHEYICRYASGSFKAAERNYHSNEKELLAVIRVIKKFSIYL 591
Query: 746 QSFPVKVRTDLP-----LRQVLQKPDLSGRLVAWSVELSEYGLQYDKRGTVGAQSL-ADF 799
+RTD + L+ GRLV W + LS+Y +D G +++ ADF
Sbjct: 592 TPSRFLIRTDNKNFTHFVNINLKGDRKQGRLVRWQMWLSQY--DFDVEHIAGTKNVFADF 649
Query: 800 VVELT 804
+ E T
Sbjct: 650 LQENT 654
>YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II
Length = 1268
Score = 113 bits (283), Expect = 3e-24
Identities = 74/249 (29%), Positives = 130/249 (51%), Gaps = 7/249 (2%)
Query: 387 PVSQLRRRLGGDKGKAVQQEVDKLLAAEFIREVKYPTWLANVVMVKK-ANGKWRMCVDY- 444
PV + R + +AV+ E+++L I + Y W A +V++KK GK R+C D+
Sbjct: 440 PVFKRARPVPYGSLEAVETELNRLQEMGVIVPITYAKWAAPIVVIKKKGTGKIRVCADFK 499
Query: 445 -TDLNKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADEDKTAFMTARV 503
+ LN A + +PLP+ + + G + S +D Y Q+ + + T R
Sbjct: 500 CSGLNAALKDEFHPLPTSEDIFSRLKGT-VYSQIDLKDAYLQVELDEEAQKLAVINTHRG 558
Query: 504 NYCYRTMPFGLKNAGATYQRLMDRVFARQVGRNMEVYVDDMIVKSVRGLDHHQDLEEAFG 563
+ Y M FGLK A A++Q++MD++ + G + VY DD+I+ + +H + L E F
Sbjct: 559 IFKYLRMTFGLKPAPASFQKIMDKMVSGLTG--VAVYWDDIIISASSIEEHEKILRELFE 616
Query: 564 EIRKHSMRLNPEKCSFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEVQRLTG 623
+++ R++ EKC+F + FLGF + G + +K +AI+ MK+P++ K++ G
Sbjct: 617 RFKEYGFRVSAEKCAFAQKQVTFLGF-VDEHGRRPDSKKTEAIRSMKAPTDQKQLASFLG 675
Query: 624 RIAALSRFL 632
LSR +
Sbjct: 676 AADWLSRMM 684
Score = 74.7 bits (182), Expect = 2e-12
Identities = 62/228 (27%), Positives = 100/228 (43%), Gaps = 24/228 (10%)
Query: 1030 IMSEVHEGVCASHIGGRSLACKVLRAGFYWPTLRKDCMDFVKQCKECQVFADLSKAPPKE 1089
++ ++HEG H G + K R+ +W L D + V+ C CQ + + + P
Sbjct: 786 VLKQLHEG----HPGIVQMKQKA-RSFVFWRGLDSDIENMVRHCNNCQENSKMPRVVPLN 840
Query: 1090 LVTMSAPWPF--AMWG---VDLVGPFPTARAQMKFILVAVDYFTKWIEAEPLAKITSAKI 1144
PWP A W +D GP ++LV VD TK+ E + I++
Sbjct: 841 ------PWPVPEAPWKRIHIDFAGPLNGC-----YLLVVVDAKTKYAEVKLTRSISAVTT 889
Query: 1145 VNFYWKRIVCRFGIPRAIVSDNGTQFSSSQTREFCKEMGIQMRFASVEHPQTNGQVESAN 1204
++ + I G P I+SDNGTQ +S + C+ GI+ + ++V +P++NG E
Sbjct: 890 IDLL-EEIFSIHGYPETIISDNGTQLTSHLFAQMCQSHGIEHKTSAVYYPRSNGAAERFV 948
Query: 1205 RVILRGLRRRLAEAKGAWLDELPAVLWSY-NTTEQSTTRETPFRMTYG 1251
+ RG+ + E L L SY NT + TP +G
Sbjct: 949 DTLKRGIAKIKGEG-SVNQQILNKFLISYRNTPHSALNGSTPAECHFG 995
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.320 0.136 0.406
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 161,866,890
Number of Sequences: 164201
Number of extensions: 6976136
Number of successful extensions: 17018
Number of sequences better than 10.0: 159
Number of HSP's better than 10.0 without gapping: 102
Number of HSP's successfully gapped in prelim test: 57
Number of HSP's that attempted gapping in prelim test: 16653
Number of HSP's gapped (non-prelim): 294
length of query: 1377
length of database: 59,974,054
effective HSP length: 123
effective length of query: 1254
effective length of database: 39,777,331
effective search space: 49880773074
effective search space used: 49880773074
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 72 (32.3 bits)
Lotus: description of TM0029b.1
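
The figures in the statistics block above can be cross-checked with a little arithmetic. The sketch below is not part of the BLAST output. It assumes the usual Karlin-Altschul relations: effective lengths are the raw lengths minus the effective HSP length (once for the query, once per database sequence), the bit score is S' = (Lambda * S - ln K) / ln 2 using the gapped Lambda and K, and E = search space * 2^(-S'). All function and constant names are illustrative. Because the report prints rounded Lambda and K values, the results should only be expected to agree approximately with the reported ones.

```python
import math

# Values copied from the statistics block of this report.
QUERY_LEN = 1377          # "length of query"
DB_LETTERS = 59_974_054   # "Number of letters in database"
DB_SEQS = 164_201         # "Number of sequences in database"
EFF_HSP_LEN = 123         # "effective HSP length"
GAPPED_LAMBDA = 0.267     # gapped Lambda (rounded in the report)
GAPPED_K = 0.0410         # gapped K (rounded in the report)


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective database length, search space)."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits scoring at least `bits` bits."""
    return search_space * 2.0 ** (-bits)


eff_q, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LETTERS, DB_SEQS, EFF_HSP_LEN
)
print(eff_q, eff_db, space)

# Raw score 381 is the first POLY_DROME HSP ("Score = 151 bits (381)").
bits = bit_score(381)
print(round(bits), expect_value(bits, space))
```

Under these assumptions the three effective-size values reproduce the report exactly (1254, 39,777,331 and 49880773074), and the raw score of 381 comes out at about 151 bits with an E-value on the order of 1e-35, in line with the POLY_DROME entry.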