
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0590c.9
(1706 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 206 4e-52
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 206 6e-52
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 191 1e-47
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 190 3e-47
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 189 6e-47
POL_SFV3L (P27401) Pol polyprotein [Contains: Protease (EC 3.4.2... 178 1e-43
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 175 1e-42
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 171 1e-41
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 171 1e-41
POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse transcript... 168 1e-40
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 151 2e-35
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 129 7e-29
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 129 9e-29
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 127 3e-28
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 125 8e-28
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 125 1e-27
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot... 125 1e-27
POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23... 122 8e-27
YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II 116 5e-25
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot... 112 7e-24
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 206 bits (524), Expect = 4e-52
Identities = 131/385 (34%), Positives = 199/385 (51%), Gaps = 10/385 (2%)
Query: 731 VQQEVDKLLAAEFIRE----VKYPTWLANVVMVKKANGKWRMCVDYTDLNKACPKDSYPL 786
V+ ++ +L IR P W+ K+R+ +DY LN+ D +P+
Sbjct: 223 VESQIQDMLNQGIIRTSNSPYNSPIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDRHPI 282
Query: 787 PSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADEDKTAFMTARVNYCYRTMPFGLKNAG 846
P++D ++ + +D G+HQI M P KTAF T +Y Y MPFGLKNA
Sbjct: 283 PNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKNAP 342
Query: 847 ATYQRLMDRVFAGQVGRNMEVYVDDMIVKSVRGLDHHQDLEEAFGEIRKHSMRLNPEKCS 906
AT+QR M+ + + ++ VY+DD+IV S +H Q L F ++ K +++L +KC
Sbjct: 343 ATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDKCE 402
Query: 907 FGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEVQRLTGRIAALSRFLPQSGDR 966
F Q FLG ++T GI+ NPEK +AIQ+ P+ KE++ G +F+P D
Sbjct: 403 FLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTGYYRKFIPNFADI 462
Query: 967 SFPFFKCLRKNVAFEWT-AECEEAFVRLKELLSSPPILSKPIQGHPLHLYFAVSDSALSS 1025
+ P KCL+KN+ + T E + AF +LK L+S PIL P L SD AL +
Sbjct: 463 AKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLTTDASDVALGA 522
Query: 1026 VMLQEIDGEHRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRLRPYFQSFPVKVRTD-L 1084
V+ Q DG H + Y +S TL E+ Y IEK LA++ + R Y ++ +D
Sbjct: 523 VLSQ--DG-HPLSY-ISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGRHFEISSDHQ 578
Query: 1085 PLRQVLQKPDLSGRLVAWSVELSEY 1109
PL + + D + +L W V+LSE+
Sbjct: 579 PLSWLYRMKDPNSKLTRWRVKLSEF 603
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 206 bits (523), Expect = 6e-52
Identities = 126/414 (30%), Positives = 212/414 (50%), Gaps = 9/414 (2%)
Query: 707 LALNPSVKPVSQLRRRLGGDKGKAVQQEVDKLLAAEFIREVKYPTWLANVVMVKKANGKW 766
+ L +P+ Q R + +++ + K+L + IRE K P W + VV+VKK +G
Sbjct: 934 IELKEGAEPIRQKPRPIPLALKPEIRKMIQKMLNQKVIRESKSP-WSSPVVLVKKKDGSI 992
Query: 767 RMCVDYTDLNKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADEDKTAF 826
RMC+DY +NK +++PLP+I++ + +G +L ++ D +G+ QI + ++ TAF
Sbjct: 993 RMCIDYRKVNKVVKNNAHPLPNIEATLQSLAGKKLYTVFDMIAGFWQIPLDEKSKEITAF 1052
Query: 827 MTARVNYCYRTMPFGLKNAGATYQRLMDRVFAGQVGRNMEVYVDDMIVKSVRGLDHHQDL 886
+ + +PFGL + A +Q M+ + +G VYVDD+++ S H QD+
Sbjct: 1053 AIGSELFEWNVLPFGLVISPALFQGTMEEIIGDLLGVCAFVYVDDLLIASKDMEQHLQDV 1112
Query: 887 EEAFGEIRKHSMRLNPEKCSFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEV 946
+EA IRK M+L KC + ++LG +T G+E K ++Q P+NVKE+
Sbjct: 1113 KEALTRIRKSGMKLRASKCHIAKKEVEYLGHKVTLDGVETQEVKTDKMKQFSRPTNVKEL 1172
Query: 947 QRLTGRIAALSRFLPQSGDRSFPFFKCLRKNVAFEWTAECEEAFVRLKELLSSPPILSKP 1006
Q G + +F+ + + VA+ W E E AF LK+L+ P+L++P
Sbjct: 1173 QSFLGLVGYYRKFILNFAQIASSLTSLISAKVAWIWEKEQEIAFQELKKLVCQTPVLAQP 1232
Query: 1007 -----IQG-HPLHLYFAVSDSALSSVMLQE-IDGEHRIVYFVSHTLQGAEVRYQKIEKAA 1059
++G P +Y S + +V+ QE DG+ + F S L AE RY + A
Sbjct: 1233 DVEAALKGDRPFMIYTDASRKGIGAVLAQEGPDGQQHPIAFASKALSPAETRYHITDLEA 1292
Query: 1060 LAVLVTARRLRPYFQSFPVKVRTD-LPLRQVLQKPDLSGRLVAWSVELSEYGLQ 1112
LA++ RR + + V TD PL +L+ L+ RL WS+E+ E+ ++
Sbjct: 1293 LAMMFALRRFKTIIYGTAITVFTDHKPLISLLKGSPLADRLWRWSIEILEFDVK 1346
Score = 102 bits (254), Expect = 9e-21
Identities = 102/455 (22%), Positives = 189/455 (41%), Gaps = 23/455 (5%)
Query: 1245 EVVVEYVPRAENQRADALAKLASTRKPGNNKSVIQETLAYPSIEGELMACVNRG------ 1298
+V + Y+ N ADAL++ + + T +I+ EL ++
Sbjct: 1344 DVKIVYLAGKANAVADALSRGGCPPNELEEEQTKELTSIVNAIQTELPDILDSSCWLERL 1403
Query: 1299 ----RTWMDPIISILAGDPAEVEQCTKEQQREASHYTLIDGHLYRRGFSTPLLKCVSPEK 1354
W + I ++ G + + + Y I G + + + V PEK
Sbjct: 1404 KGEDEGWKEVIAALEGGKTKGTFKIVGIESEISLEYYKIVGGVLKNTEIEEQSRSVVPEK 1463
Query: 1355 YEA-IMSEVHEGVCASHIGGRSLACKVLRAGFYWPTLRKDCMDFVKQCKECQVFADLSKA 1413
++ E+HEG+ A H G + + +++ FYWP +R + V+ C +C D SK
Sbjct: 1464 IRTPLLKELHEGMLAGHFGIKKM-WRMVHRKFYWPQMRVCVENCVRTCAKCLCANDHSKL 1522
Query: 1414 PPKELVTMSAPWPFAMWGVDLVGPFPTARAQMKFILVAVDYFTKWIEAEPLAKITSAKIV 1473
L +P + DL+ + + ++IL +D FTK+ A P+ + ++
Sbjct: 1523 T-SSLTPYRMTFPLEIVACDLMDVGLSVQGN-RYILTIIDLFTKYGTAVPIPDKKAETVL 1580
Query: 1474 NFYWKRIVCRFG-IPRAIVSDNGTQFSSSQTREFCKEMGIQMRFASVEHPQTNGQVESAN 1532
+ +R G IP +++D G +F + +F + I+ + + NG VE N
Sbjct: 1581 KAFVERWAIGEGRIPLKLLTDQGKEFVNGLFAQFTHMLKIEHITTKGYNSRANGAVERFN 1640
Query: 1533 RVILRGLRRRLAEAKGAWLDELPAVLWSYNTTEQSTTRETPFRMTYGVDAMLPVEI--DN 1590
+ I+ ++++ A W D++ +++YN T ETP + +G D M P+E+ ++
Sbjct: 1641 KTIMHIMKKKTAVPM-EWDDQVVYAVYAYNNCVHENTGETPMFLMHGRDVMGPLEMSGED 1699
Query: 1591 FTWRTRPGFEEENQANMAVELDLLSETRDEAHIRETAMKQRVAAKFNSRVRVRDMQVGDL 1650
+E L + ++ A + + K K+ S+ + R Q G
Sbjct: 1700 AVGINYADMDEYKHLLTQELLKVQKIAKEHAMREQESYKSLFDQKYASK-KHRFPQPGSR 1758
Query: 1651 VL----KWRSGAPGNKLTPNWEGPYRIVKVLGNGA 1681
VL + GA KL W GPYR++ N A
Sbjct: 1759 VLLEIPSEKLGAQCPKLVNKWSGPYRVISCSENSA 1793
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 191 bits (485), Expect = 1e-47
Identities = 129/443 (29%), Positives = 208/443 (46%), Gaps = 11/443 (2%)
Query: 674 ETRLTKLLGENLDLFAWSCKDMPGIDPNFICHRLALNPSVKPVSQLRRRLGGDKGKAVQQ 733
+++L + E +D+FA + P N +L L +PV R + + +Q
Sbjct: 276 KSQLENICSEYIDIFALESE--PITVNNLYKQQLRLKDD-EPVYTKNYRSPHSQVEEIQA 332
Query: 734 EVDKLLAAEFIREVKYPTWLANVVMVKKANG------KWRMCVDYTDLNKACPKDSYPLP 787
+V KL+ + + E + + +++V K + KWR+ +DY +NK D +PLP
Sbjct: 333 QVQKLIKDKIV-EPSVSQYNSPLLLVPKKSSPNSDKKKWRLVIDYRQINKKLLADKFPLP 391
Query: 788 SIDSLVDGASGNELLSLMDAYSGYHQIRMHPADEDKTAFMTARVNYCYRTMPFGLKNAGA 847
ID ++D + S +D SG+HQI + D T+F T+ +Y + +PFGLK A
Sbjct: 392 RIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTSNGSYRFTRLPFGLKIAPN 451
Query: 848 TYQRLMDRVFAGQVGRNMEVYVDDMIVKSVRGLDHHQDLEEAFGEIRKHSMRLNPEKCSF 907
++QR+M F+G +Y+DD+IV ++L E FG+ R+++++L+PEKCSF
Sbjct: 452 SFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEVFGKCREYNLKLHPEKCSF 511
Query: 908 GVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEVQRLTGRIAALSRFLPQSGDRS 967
+ FLG T +GI + +K IQ P + +R RF+ D S
Sbjct: 512 FMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRFVAFCNYYRRFIKNFADYS 571
Query: 968 FPFFKCLRKNVAFEWTAECEEAFVRLKELLSSPPILSKPIQGHPLHLYFAVSDSALSSVM 1027
+ +KNV FEWT EC++AF+ LK L +P +L P + S A +V+
Sbjct: 572 RHITRLCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSKEFCITTDASKQACGAVL 631
Query: 1028 LQEIDGEHRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRLRPYFQSFPVKVRTD-LPL 1086
Q +G V + S E E+ A+ RPY V+TD PL
Sbjct: 632 TQNHNGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYIYGKHFTVKTDHRPL 691
Query: 1087 RQVLQKPDLSGRLVAWSVELSEY 1109
+ + S +L +EL EY
Sbjct: 692 TYLFSMVNPSSKLTRIRLELEEY 714
Score = 108 bits (269), Expect = 2e-22
Identities = 76/336 (22%), Positives = 157/336 (46%), Gaps = 7/336 (2%)
Query: 1345 PLLKCVSPEKYEAIMSEVHEG-VCASHIGGRSLACKVLRAGFYWPTLRKDCMDFVKQCKE 1403
P+ + + ++ EAI+S +H+ + H G KV R +YW + K ++V++C++
Sbjct: 883 PVTQINNEKEKEAILSTLHDDPIQGGHTGITKTLAKVKRH-YYWKNMSKYIKEYVRKCQK 941
Query: 1404 CQVFADLSKAPPKELVTMSAPWPFAMWGVDLVGPFPTARAQMKFILVAVDYFTKWIEAEP 1463
CQ +T + F VD +GP P + ++ + + TK++ A P
Sbjct: 942 CQKAKTTKHTKTPMTITETPEHAFDRVVVDTIGPLPKSENGNEYAVTLICDLTKYLVAIP 1001
Query: 1464 LAKITSAKIVNFYWKRIVCRFGIPRAIVSDNGTQFSSSQTREFCKEMGIQMRFASVEHPQ 1523
+A ++ + ++ + ++G + ++D GT++ +S + CK + I+ ++ H Q
Sbjct: 1002 IANKSAKTVAKAIFESFILKYGPMKTFITDMGTEYKNSIITDLCKYLKIKNITSTAHHHQ 1061
Query: 1524 TNGQVESANRVILRGLRRRLAEAKGAWLDELPAVLWSYNTTEQSTTRETPFRMTYGVDAM 1583
T G VE ++R + +R ++ K W L ++ +NTT+ P+ + +G +
Sbjct: 1062 TVGVVERSHRTLNEYIRSYISTDKTDWDVWLQYFVYCFNTTQSMVHNYCPYELVFGRTSN 1121
Query: 1584 LPVEIDNFTWRTRPGFEEENQANMAVELDLLSETRDEAHIRETAMKQRVAAKFNSRVRVR 1643
LP N P + ++ A + ++ R + A K++ ++ +V+
Sbjct: 1122 LPKHF-NKLHSIEPIYNIDDYAKESKYRLEVAYARARKLLE--AHKEKNKENYDLKVKDI 1178
Query: 1644 DMQVGDLVLKWRSGAPGNKLTPNWEGPYRIVKVLGN 1679
+++VGD VL G+KL + GPY+I + N
Sbjct: 1179 ELEVGDKVL--LRNEVGHKLDFKYTGPYKIESIGDN 1212
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 190 bits (482), Expect = 3e-47
Identities = 122/390 (31%), Positives = 189/390 (48%), Gaps = 10/390 (2%)
Query: 731 VQQEVDKLLAAEFIRE----VKYPTWLANVVMVKKANGKWRMCVDYTDLNKACPKDSYPL 786
V+ +V ++L IRE PTW+ K+R+ +DY LN+ D YP+
Sbjct: 222 VENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVVIDYRKLNEITIPDRYPI 281
Query: 787 PSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADEDKTAFMTARVNYCYRTMPFGLKNAG 846
P++D ++ + + +D G+HQI M KTAF T +Y Y MPFGL+NA
Sbjct: 282 PNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTKSGHYEYLRMPFGLRNAP 341
Query: 847 ATYQRLMDRVFAGQVGRNMEVYVDDMIVKSVRGLDHHQDLEEAFGEIRKHSMRLNPEKCS 906
AT+QR M+ + + ++ VY+DD+I+ S +H ++ F ++ +++L +KC
Sbjct: 342 ATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLVFTKLADANLKLQLDKCE 401
Query: 907 FGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEVQRLTGRIAALSRFLPQSGDR 966
F + FLG ++T GI+ NP K KAI P+ KE++ G +F+P D
Sbjct: 402 FLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAFLGLTGYYRKFIPNYADI 461
Query: 967 SFPFFKCLRKNVAFE-WTAECEEAFVRLKELLSSPPILSKPIQGHPLHLYFAVSDSALSS 1025
+ P CL+K + E EAF +LK L+ PIL P L S+ AL +
Sbjct: 462 AKPMTSCLKKRTKIDTQKLEYIEAFEKLKALIIRDPILQLPDFEKKFVLTTDASNLALGA 521
Query: 1026 VMLQEIDGEHRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRLRPYFQSFPVKVRTD-L 1084
V+ Q + F+S TL E+ Y IEK LA++ + R Y + +D
Sbjct: 522 VLSQ----NGHPISFISRTLNDHELNYSAIEKELLAIVWATKTFRHYLLGRQFLIASDHQ 577
Query: 1085 PLRQVLQKPDLSGRLVAWSVELSEYGLQYD 1114
PLR + + +L W V LSEY + D
Sbjct: 578 PLRWLHNLKEPGAKLERWRVRLSEYQFKID 607
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 189 bits (480), Expect = 6e-47
Identities = 124/401 (30%), Positives = 202/401 (49%), Gaps = 21/401 (5%)
Query: 731 VQQEVDKLLAAEFIRE----VKYPTWLANVVMVKKANGK--WRMCVDYTDLNKACPKDSY 784
V++++D+LL IR P W+ V K NG+ +RM VD+ LN D+Y
Sbjct: 139 VERQIDELLQDGIIRPSNSPYNSPIWI--VPKKPKPNGEKQYRMVVDFKRLNTVTIPDTY 196
Query: 785 PLPSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADEDKTAFMTARVNYCYRTMPFGLKN 844
P+P I++ + + + +D SG+HQI M +D KTAF T Y + +PFGLKN
Sbjct: 197 PIPDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPFGLKN 256
Query: 845 AGATYQRLMDRVFAGQVGRNMEVYVDDMIVKSVRGLDHHQDLEEAFGEIRKHSMRLNPEK 904
A A +QR++D + +G+ VY+DD+IV S H ++L + K ++++N EK
Sbjct: 257 APAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNLEK 316
Query: 905 CSFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEVQRLTGRIAALSRFLPQSG 964
F +FLG+++T+ GI+ +P+K +AI +M P++VKE++R G + +F+
Sbjct: 317 SHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYRKFIQDYA 376
Query: 965 DRSFPFFKCLR-----------KNVAFEWTAECEEAFVRLKELLSSPPILSKPIQGHPLH 1013
+ P R V ++F LK +L S IL+ P P H
Sbjct: 377 KVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKPFH 436
Query: 1014 LYFAVSDSALSSVMLQEIDGEHRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRLRPY- 1072
L S+ A+ +V+ Q+ G R + ++S +L E Y IEK LA++ + LR Y
Sbjct: 437 LTTDASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLDNLRAYL 496
Query: 1073 FQSFPVKVRTD-LPLRQVLQKPDLSGRLVAWSVELSEYGLQ 1112
+ + +KV TD PL L + + +L W + EY +
Sbjct: 497 YGAGTIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCE 537
Score = 35.4 bits (80), Expect = 1.4
Identities = 42/209 (20%), Positives = 81/209 (38%), Gaps = 14/209 (6%)
Query: 1369 SHIGGRSLACKVLRAGFYWPTLRKDCMDFVKQCKECQVFA-DLSKAPPKELVTMSAPWPF 1427
+H G + ++L +Y+P + C+ C+++ + P T +P
Sbjct: 704 AHRGPTEIRLQLLEK-YYFPRMSSTIRLQTSSCQCCKLYKYERHPNKPNLQPTPIPNYPC 762
Query: 1428 AMWGVDLVGPFPTARAQMKFILVAVDYFTKWIEAEPLAKITSAKIVNFYWKRIVCRFGIP 1487
+ +D+ + + L +D F+K+ + L S + + + F P
Sbjct: 763 EILHIDIFA------LEKRLYLSCIDKFSKFAKLFHLQSKASVHLRETLVEALHY-FTAP 815
Query: 1488 RAIVSDNGTQFSSSQTREFCKEMGIQMRFASVEHPQTNGQVESANRVIL---RGLRRRLA 1544
+ +VSDN + + + I + +A + + NGQVE + L R L+ L
Sbjct: 816 KVLVSDNERGLLCPTVLNYLRSLDIDLYYAPTQKSEVNGQVERFHSTFLEIYRCLKDELP 875
Query: 1545 EAKGAWLDELPAVLWSYNTTEQSTTRETP 1573
K L + + YNT+ S T P
Sbjct: 876 TFKPVELVHI--AVDRYNTSVHSVTNRKP 902
>POL_SFV3L (P27401) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1157
Score = 178 bits (452), Expect = 1e-43
Identities = 226/993 (22%), Positives = 403/993 (39%), Gaps = 120/993 (12%)
Query: 756 VVMVKKANGKWRMCVDYTDLNKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGYHQIR 815
V V K +GKWRM +DY ++NK P + ++ + + +D +G+
Sbjct: 216 VYPVPKPDGKWRMVLDYREVNKTIPLIAAQNQHSAGILSSIFRGKYKTTLDLSNGFWAHS 275
Query: 816 MHPADEDKTAFMTARVNYCYRTMPFGLKNAGATYQRLMDRVFAGQVGRNMEVYVDDMIVK 875
+ P TAF YC+ +P G N+ A + D V + N++VYVDD+ +
Sbjct: 276 ITPESYWLTAFTWLGQQYCWTRLPQGFLNSPALFTA--DVVDLLKEVPNVQVYVDDIYIS 333
Query: 876 SVRGLDHHQDLEEAFGEIRKHSMRLNPEKCSFGVQGGKFLGFMITSRGIEINPEKCKAIQ 935
+H + LE+ F + ++ +K +FLGF IT G + + +
Sbjct: 334 HDDPREHLEQLEKVFSLLLNAGYVVSLKKSEIAQHEVEFLGFNITKEGRGLTETFKQKLL 393
Query: 936 QMKSPSNVKEVQRLTGRIAALSRFLPQSGDRSFPFFKCLR--KNVAFEWTAECEEAFVRL 993
+ P ++K++Q + G + F+P + P + + WT + + +
Sbjct: 394 NITPPRDLKQLQSILGLLNFARNFIPNFSELVKPLYNIIATANGKYITWTTDNSQQLQNI 453
Query: 994 KELLSSPPILSKPIQGHPLHLYFAVSDSALSSVMLQEIDGEHRIVYFVSHTLQGAEVRYQ 1053
+L+S L + + + L V+ S + + + R + ++++ AEV++
Sbjct: 454 ISMLNSAENLEE--RNPEVRLIMKVNTSPSAGYIRFYNEFAKRPIMYLNYVYTKAEVKFT 511
Query: 1054 KIEKAALAV---LVTARRL---RPYFQSFPVKVRTDLPLRQVLQKPDLSGRLVAWSVELS 1107
EK + L+ A L + P+ T + + ++ L R + W L
Sbjct: 512 NTEKLLTTIHKGLIKALDLGMGQEILVYSPIVSMTKIQKTPLPERKALPIRWITWMSYLE 571
Query: 1108 EYGLQYDKRGTVGAQSLA-----DFVVELT-PDRFERVDTQWTLFVDGSS-------NSS 1154
+ +Q+ T+ D + ++ P F V + DGS+ S
Sbjct: 572 DPRIQFHYDKTLPELQQVPTVTDDIIAKIKHPSEFSMV-----FYTDGSAIKHPNVNKSH 626
Query: 1155 GSGAGVTLEGPGELVLEQSLKFEFKATN-------NQAEYEALIAGLKLAREVKIR---S 1204
+G G+ + K EF N + A +A ++ A + ++
Sbjct: 627 NAGMGIA---------QVQFKPEFTVINTWSIPLGDHTAQLAEVAAVEFACKKALKIDGP 677
Query: 1205 LLIRTDSQLVENQVK---------GTFQVKDPNLIKYLERVRYLMTLFQEVVVEYVPRAE 1255
+LI TDS V V G F K L K++ + + + Q + +
Sbjct: 678 VLIVTDSFYVAESVNKELPYWQSNGFFNNKKKPL-KHVSKWKSIADCIQLKPDIIIIHEK 736
Query: 1256 NQRADALAKLASTRKPGNNKSVIQETLAYPSIEGELMACVNRGRTWMDPIISILAGDPAE 1315
+ A ++ GNN + + LA +G + +N P + AE
Sbjct: 737 GHQPTA----STFHTEGNN---LADKLA---TQGSYVVNINT-----TPSLD------AE 775
Query: 1316 VEQCTKEQQREAS----HYTLIDGHLYRRGFSTPLLKCVSPEKYEA--IMSEVHEGVCAS 1369
++Q + Q + Y L +G + + P K + P K + I+ + H +
Sbjct: 776 LDQLLQGQYPKGFPKHYQYQLENGQVM---VTRPNGKRIIPPKSDRPQIILQAHN---IA 829
Query: 1370 HIGGRSLACKVLRAGFYWPTLRKDCMDFVKQCKECQVFADLSKAPPKELVTMSAPWPFAM 1429
H G S KV + ++WP LRKD + ++QCK+C V + A P L PF
Sbjct: 830 HTGRDSTFLKV-SSKYWWPNLRKDVVKVIRQCKQCLVTNAATLAAPPILRPERPVKPFDK 888
Query: 1430 WGVDLVGPFPTARAQMKFILVAVDYFTKWIEAEPL-AKITSAKIVNFYWKRIVCRFGIPR 1488
+ +D +GP P + + +LV VD T ++ P A TSA + ++ +P+
Sbjct: 889 FFIDYIGPLPPSNGYLH-VLVVVDSMTGFVWLYPTKAPSTSATVKAL---NMLTSIAVPK 944
Query: 1489 AIVSDNGTQFSSSQTREFCKEMGIQMRFASVEHPQTNGQVESANRVILRGLRRRLAEAKG 1548
I SD G F+S+ ++ K GIQ+ F++ HPQ++G+VE N I R L + L
Sbjct: 945 VIHSDQGAAFTSATFADWAKNKGIQLEFSTPYHPQSSGKVERKNSDIKRLLTKLLVGRPA 1004
Query: 1549 AWLDELPAVLWSYNTTEQSTTRETPFRMTYGVDAMLPVEIDNFTWRTRPGFEEENQANMA 1608
W D LP V + N + +++ TP ++ +G+D+ P + +R EE
Sbjct: 1005 KWYDLLPVVQLALNNSYSPSSKYTPHQLLFGIDSNTPFANSDTLDLSR---EE------- 1054
Query: 1609 VELDLLSETRDEAHIRETAMKQRVAAKFNSRVRVRDMQVGDLVLKWRSGAPGNKLTPNWE 1668
EL LL E R ++ T + +R VG LV + R P + L P W
Sbjct: 1055 -ELSLLQEIRSSLYLPSTP---------PASIRAWSPSVGQLVQE-RVARPAS-LRPRWH 1102
Query: 1669 GPYRIVKVLGNGAYHLEELDGRRLPRSFNGLSL 1701
P +++V+ A + + G R S + L L
Sbjct: 1103 KPTPVLEVINPRAVVILDHLGNRRTVSVDNLKL 1135
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 175 bits (443), Expect = 1e-42
Identities = 118/416 (28%), Positives = 207/416 (49%), Gaps = 10/416 (2%)
Query: 727 KGKAVQQEVDKLLAAEFIREVKYPTWLANVVMVKKANGKWRMCVDYTDLNKACPKDSYPL 786
K +A+ E+++ L + IRE K V+ V K G RM VDY LNK + YPL
Sbjct: 424 KMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPL 482
Query: 787 PSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADEDKTAFMTARVNYCYRTMPFGLKNAG 846
P I+ L+ G+ + + +D S YH IR+ DE K AF R + Y MP+G+ A
Sbjct: 483 PLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAP 542
Query: 847 ATYQRLMDRVFAGQVGRNMEVYVDDMIVKSVRGLDHHQDLEEAFGEIRKHSMRLNPEKCS 906
A +Q ++ + ++ Y+DD+++ S +H + +++ +++ ++ +N KC
Sbjct: 543 AHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCE 602
Query: 907 FGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEVQRLTGRIAALSRFLPQSGDR 966
F KF+G+ I+ +G E + Q K P N KE+++ G + L +F+P++
Sbjct: 603 FHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQL 662
Query: 967 SFPFFKCLRKNVAFEWTAECEEAFVRLKELLSSPPILSKPIQGHPLHLYFAVSDSALSSV 1026
+ P L+K+V ++WT +A +K+ L SPP+L + L SD A+ +V
Sbjct: 663 THPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAV 722
Query: 1027 MLQEIDGE-HRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRLRPYFQSF--PVKVRTD 1083
+ Q+ D + + V + S + A++ Y +K LA++ + + R Y +S P K+ TD
Sbjct: 723 LSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTD 782
Query: 1084 ---LPLRQVLQKPDLSGRLVAWSVELSEYGLQYDKR-GTVG--AQSLADFVVELTP 1133
L R + + RL W + L ++ + + R G+ A +L+ V E P
Sbjct: 783 HRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEP 838
Score = 126 bits (316), Expect = 6e-28
Identities = 108/455 (23%), Positives = 188/455 (40%), Gaps = 50/455 (10%)
Query: 1248 VEYVPRAENQRADALAKLASTRKPGNNKSVIQETLAYPSIEGELMACVNR---GRTWMDP 1304
+ Y P + N ADAL+++ +P S E + VN+ + +
Sbjct: 815 INYRPGSANHIADALSRIVDETEPIPKDS-----------EDNSINFVNQISITDDFKNQ 863
Query: 1305 IISILAGDPAEVEQCTKEQQREASHYTLIDGHLYRRGFSTPLLKCVSPEKYEAIMSEVHE 1364
+++ D + E +R + L DG L +L + I+ + HE
Sbjct: 864 VVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINS--KDQILLPNDTQLTRTIIKKYHE 921
Query: 1365 GVCASHIGGRSLACKVLRAGFYWPTLRKDCMDFVKQCKECQV--------FADLSKAPPK 1416
H G L +LR F W +RK ++V+ C CQ+ + L PP
Sbjct: 922 EGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPS 980
Query: 1417 ELVTMSAPWPFAMWGVDLVGPFPTARAQMKFILVAVDYFTKWIEAEPLAK-ITSAKIVNF 1475
E P+ +D + P + + V VD F+K P K IT+ +
Sbjct: 981 ER-------PWESLSMDFITALPESSGY-NALFVVVDRFSKMAILVPCTKSITAEQTARM 1032
Query: 1476 YWKRIVCRFGIPRAIVSDNGTQFSSSQTREFCKEMGIQMRFASVEHPQTNGQVESANRVI 1535
+ +R++ FG P+ I++DN F+S ++F + M+F+ PQT+GQ E N+ +
Sbjct: 1033 FDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTV 1092
Query: 1536 LRGLRRRLAEAKGAWLDELPAVLWSYNTTEQSTTRETPFRMTYGVD-AMLPVEIDNFTWR 1594
+ LR + W+D + V SYN S T+ TPF + + A+ P+E+
Sbjct: 1093 EKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLEL------ 1146
Query: 1595 TRPGFEEENQANMAVELDLLSETRDEAHIRETAMKQRVAAKFNSRVRVRDMQVGDLVLKW 1654
P F ++ N + + ++ + MK+ K + + Q GDLV+
Sbjct: 1147 --PSFSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQ---EIEEFQPGDLVMVK 1201
Query: 1655 RSGA----PGNKLTPNWEGPYRIVKVLGNGAYHLE 1685
R+ NKL P++ GP+ +++ G Y L+
Sbjct: 1202 RTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELD 1236
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 171 bits (434), Expect = 1e-41
Identities = 117/416 (28%), Positives = 207/416 (49%), Gaps = 10/416 (2%)
Query: 727 KGKAVQQEVDKLLAAEFIREVKYPTWLANVVMVKKANGKWRMCVDYTDLNKACPKDSYPL 786
K +A+ E+++ L + IRE K V+ V K G RM VDY LNK + YPL
Sbjct: 424 KMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPL 482
Query: 787 PSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADEDKTAFMTARVNYCYRTMPFGLKNAG 846
P I+ L+ G+ + + +D S YH IR+ DE K AF R + Y MP+G+ A
Sbjct: 483 PLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAP 542
Query: 847 ATYQRLMDRVFAGQVGRNMEVYVDDMIVKSVRGLDHHQDLEEAFGEIRKHSMRLNPEKCS 906
A +Q ++ + ++ Y+D++++ S +H + +++ +++ ++ +N KC
Sbjct: 543 AHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCE 602
Query: 907 FGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEVQRLTGRIAALSRFLPQSGDR 966
F KF+G+ I+ +G E + Q K P N KE+++ G + L +F+P++
Sbjct: 603 FHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQL 662
Query: 967 SFPFFKCLRKNVAFEWTAECEEAFVRLKELLSSPPILSKPIQGHPLHLYFAVSDSALSSV 1026
+ P L+K+V ++WT +A +K+ L SPP+L + L SD A+ +V
Sbjct: 663 THPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAV 722
Query: 1027 MLQEIDGE-HRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRLRPYFQSF--PVKVRTD 1083
+ Q+ D + + V + S + A++ Y +K LA++ + + R Y +S P K+ TD
Sbjct: 723 LSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTD 782
Query: 1084 ---LPLRQVLQKPDLSGRLVAWSVELSEYGLQYDKR-GTVG--AQSLADFVVELTP 1133
L R + + RL W + L ++ + + R G+ A +L+ V E P
Sbjct: 783 HRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEP 838
Score = 126 bits (316), Expect = 6e-28
Identities = 108/455 (23%), Positives = 188/455 (40%), Gaps = 50/455 (10%)
Query: 1248 VEYVPRAENQRADALAKLASTRKPGNNKSVIQETLAYPSIEGELMACVNR---GRTWMDP 1304
+ Y P + N ADAL+++ +P S E + VN+ + +
Sbjct: 815 INYRPGSANHIADALSRIVDETEPIPKDS-----------EDNSINFVNQISITDDFKNQ 863
Query: 1305 IISILAGDPAEVEQCTKEQQREASHYTLIDGHLYRRGFSTPLLKCVSPEKYEAIMSEVHE 1364
+++ D + E +R + L DG L +L + I+ + HE
Sbjct: 864 VVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINS--KDQILLPNDTQLTRTIIKKYHE 921
Query: 1365 GVCASHIGGRSLACKVLRAGFYWPTLRKDCMDFVKQCKECQV--------FADLSKAPPK 1416
H G L +LR F W +RK ++V+ C CQ+ + L PP
Sbjct: 922 EGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPS 980
Query: 1417 ELVTMSAPWPFAMWGVDLVGPFPTARAQMKFILVAVDYFTKWIEAEPLAK-ITSAKIVNF 1475
E P+ +D + P + + V VD F+K P K IT+ +
Sbjct: 981 ER-------PWESLSMDFITALPESSGY-NALFVVVDRFSKMAILVPCTKSITAEQTARM 1032
Query: 1476 YWKRIVCRFGIPRAIVSDNGTQFSSSQTREFCKEMGIQMRFASVEHPQTNGQVESANRVI 1535
+ +R++ FG P+ I++DN F+S ++F + M+F+ PQT+GQ E N+ +
Sbjct: 1033 FDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTV 1092
Query: 1536 LRGLRRRLAEAKGAWLDELPAVLWSYNTTEQSTTRETPFRMTYGVD-AMLPVEIDNFTWR 1594
+ LR + W+D + V SYN S T+ TPF + + A+ P+E+
Sbjct: 1093 EKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLEL------ 1146
Query: 1595 TRPGFEEENQANMAVELDLLSETRDEAHIRETAMKQRVAAKFNSRVRVRDMQVGDLVLKW 1654
P F ++ N + + ++ + MK+ K + + Q GDLV+
Sbjct: 1147 --PSFSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQ---EIEEFQPGDLVMVK 1201
Query: 1655 RSGA----PGNKLTPNWEGPYRIVKVLGNGAYHLE 1685
R+ NKL P++ GP+ +++ G Y L+
Sbjct: 1202 RTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELD 1236
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 171 bits (434), Expect = 1e-41
Identities = 117/416 (28%), Positives = 207/416 (49%), Gaps = 10/416 (2%)
Query: 727 KGKAVQQEVDKLLAAEFIREVKYPTWLANVVMVKKANGKWRMCVDYTDLNKACPKDSYPL 786
K +A+ E+++ L + IRE K V+ V K G RM VDY LNK + YPL
Sbjct: 424 KMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPL 482
Query: 787 PSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADEDKTAFMTARVNYCYRTMPFGLKNAG 846
P I+ L+ G+ + + +D S YH IR+ DE K AF R + Y MP+G+ A
Sbjct: 483 PLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAP 542
Query: 847 ATYQRLMDRVFAGQVGRNMEVYVDDMIVKSVRGLDHHQDLEEAFGEIRKHSMRLNPEKCS 906
A +Q ++ + ++ Y+D++++ S +H + +++ +++ ++ +N KC
Sbjct: 543 AHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCE 602
Query: 907 FGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEVQRLTGRIAALSRFLPQSGDR 966
F KF+G+ I+ +G E + Q K P N KE+++ G + L +F+P++
Sbjct: 603 FHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQL 662
Query: 967 SFPFFKCLRKNVAFEWTAECEEAFVRLKELLSSPPILSKPIQGHPLHLYFAVSDSALSSV 1026
+ P L+K+V ++WT +A +K+ L SPP+L + L SD A+ +V
Sbjct: 663 THPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAV 722
Query: 1027 MLQEIDGE-HRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRLRPYFQSF--PVKVRTD 1083
+ Q+ D + + V + S + A++ Y +K LA++ + + R Y +S P K+ TD
Sbjct: 723 LSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTD 782
Query: 1084 ---LPLRQVLQKPDLSGRLVAWSVELSEYGLQYDKR-GTVG--AQSLADFVVELTP 1133
L R + + RL W + L ++ + + R G+ A +L+ V E P
Sbjct: 783 HRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEP 838
Score = 126 bits (316), Expect = 6e-28
Identities = 108/455 (23%), Positives = 188/455 (40%), Gaps = 50/455 (10%)
Query: 1248 VEYVPRAENQRADALAKLASTRKPGNNKSVIQETLAYPSIEGELMACVNR---GRTWMDP 1304
+ Y P + N ADAL+++ +P S E + VN+ + +
Sbjct: 815 INYRPGSANHIADALSRIVDETEPIPKDS-----------EDNSINFVNQISITDDFKNQ 863
Query: 1305 IISILAGDPAEVEQCTKEQQREASHYTLIDGHLYRRGFSTPLLKCVSPEKYEAIMSEVHE 1364
+++ D + E +R + L DG L +L + I+ + HE
Sbjct: 864 VVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINS--KDQILLPNDTQLTRTIIKKYHE 921
Query: 1365 GVCASHIGGRSLACKVLRAGFYWPTLRKDCMDFVKQCKECQV--------FADLSKAPPK 1416
H G L +LR F W +RK ++V+ C CQ+ + L PP
Sbjct: 922 EGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPS 980
Query: 1417 ELVTMSAPWPFAMWGVDLVGPFPTARAQMKFILVAVDYFTKWIEAEPLAK-ITSAKIVNF 1475
E P+ +D + P + + V VD F+K P K IT+ +
Sbjct: 981 ER-------PWESLSMDFITALPESSGY-NALFVVVDRFSKMAILVPCTKSITAEQTARM 1032
Query: 1476 YWKRIVCRFGIPRAIVSDNGTQFSSSQTREFCKEMGIQMRFASVEHPQTNGQVESANRVI 1535
+ +R++ FG P+ I++DN F+S ++F + M+F+ PQT+GQ E N+ +
Sbjct: 1033 FDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTV 1092
Query: 1536 LRGLRRRLAEAKGAWLDELPAVLWSYNTTEQSTTRETPFRMTYGVD-AMLPVEIDNFTWR 1594
+ LR + W+D + V SYN S T+ TPF + + A+ P+E+
Sbjct: 1093 EKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLEL------ 1146
Query: 1595 TRPGFEEENQANMAVELDLLSETRDEAHIRETAMKQRVAAKFNSRVRVRDMQVGDLVLKW 1654
P F ++ N + + ++ + MK+ K + + Q GDLV+
Sbjct: 1147 --PSFSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQ---EIEEFQPGDLVMVK 1201
Query: 1655 RSGA----PGNKLTPNWEGPYRIVKVLGNGAYHLE 1685
R+ NKL P++ GP+ +++ G Y L+
Sbjct: 1202 RTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELD 1236
>POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)]
Length = 886
Score = 168 bits (425), Expect = 1e-40
Identities = 199/872 (22%), Positives = 352/872 (39%), Gaps = 87/872 (9%)
Query: 756 VVMVKKANGKWRMCVDYTDLNKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGYHQIR 815
V V K +G+WRM +DY ++NK P + ++ + + +D +G+
Sbjct: 5 VYPVPKPDGRWRMVLDYREVNKTIPLTAAQNQHSAGILATIVRQKYKTTLDLANGFWAHP 64
Query: 816 MHPADEDKTAFMTARVNYCYRTMPFGLKNAGATYQRLMDRVFAGQVGRNMEVYVDDMIVK 875
+ P TAF YC+ +P G N+ A + D V + N++VYVDD+ +
Sbjct: 65 ITPESYWLTAFTWQGKQYCWTRLPQGFLNSPALFTA--DVVDLLKEIPNVQVYVDDIYLS 122
Query: 876 SVRGLDHHQDLEEAFGEIRKHSMRLNPEKCSFGVQGGKFLGFMITSRGIEINPEKCKAIQ 935
+H Q LE+ F + + ++ +K G + +FLGF IT G + +
Sbjct: 123 HDDPKEHVQQLEKVFQILLQAGYVVSLKKSEIGQKTVEFLGFNITKEGRGLTDTFKTKLL 182
Query: 936 QMKSPSNVKEVQRLTGRIAALSRFLPQSGDRSFPFFKCLR--KNVAFEWTAECEEAFVRL 993
+ P ++K++Q + G + F+P + P + + K EW+ E + +
Sbjct: 183 NITPPKDLKQLQSILGLLNFARNFIPNFAELVQPLYNLIASAKGKYIEWSEENTKQLNMV 242
Query: 994 KELLSSPPILSKPIQGHPLHLYFAVSDSALSSVMLQEIDGEHRIVYFVSHTLQGAEVRYQ 1053
E L++ L + + L + S SA V G+ I+Y +++ AE+++
Sbjct: 243 IEALNTASNLEERLPEQRLVIKVNTSPSA-GYVRYYNETGKKPIMY-LNYVFSKAELKFS 300
Query: 1054 KIEKAALAV---LVTARRL---RPYFQSFPVKVRTDLPLRQVLQKPDLSGRLVAWSVELS 1107
+EK + L+ A L + P+ T + + ++ L R + W L
Sbjct: 301 MLEKLLTTMHKALIKAMDLAMGQEILVYSPIVSMTKIQKTPLPERKALPIRWITWMTYLE 360
Query: 1108 EYGLQYDKRGTVGAQSLADFVVELT------PDRFERVDTQWTLFVDGS---------SN 1152
+ +Q+ T+ V + P ++E V + DGS SN
Sbjct: 361 DPRIQFHYDKTLPELKHIPDVYTSSQSPVKHPSQYEGV-----FYTDGSAIKSPDPTKSN 415
Query: 1153 SSGSGAGVTLEGPGELVLEQSLKFEFKATNNQAEYEALIAGLKLAREVKIRSLLIRTDSQ 1212
++G G P VL Q T AE A+ K A ++ L+I
Sbjct: 416 NAGMGIVHATYKPEYQVLNQWSIPLGNHTAQMAEIAAVEFACKKALKIPGPVLVITDSFY 475
Query: 1213 LVENQVKGTFQVKDPNLIKYLERVRYLMTLFQEVVVEYVPRAENQRADALAKLASTRKPG 1272
+ E+ K K + ++ P + ++A+ S +
Sbjct: 476 VAESANKELPYWKSNGFVNNKKK----------------PLKHISKWKSIAECLSMKPD- 518
Query: 1273 NNKSVIQETLAYPSIEGELMACVNRGRTWMDPIISILAGDPAEVEQC-TKEQQREASHYT 1331
IQ I ++ + +G D LA + V C TK+ +A
Sbjct: 519 ---ITIQHE---KGISLQIPVFILKGNALADK----LATQGSYVVNCNTKKPNLDAELDQ 568
Query: 1332 LIDGHLYRRGFSTPL----------------LKCVSPEK-YEAIMSEVHEGVCASHIGGR 1374
L+ GH Y +G+ +K + P+ + I+ + H +H G
Sbjct: 569 LLQGH-YIKGYPKQYTYFLEDGKVKVSRPEGVKIIPPQSDRQKIVLQAHN---LAHTGRE 624
Query: 1375 SLACKVLRAGFYWPTLRKDCMDFVKQCKECQVFADLSKAPPKELVTMSAPWPFAMWGVDL 1434
+ K+ ++WP +RKD + + +C++C + +KA L PF + +D
Sbjct: 625 ATLLKIANL-YWWPNMRKDVVKQLGRCQQCLITNASNKASGPILRPDRPQKPFDKFFIDY 683
Query: 1435 VGPFPTARAQMKFILVAVDYFTKWIEAEPL-AKITSAKIVNFYWKRIVCRFGIPRAIVSD 1493
+GP P ++ + ++LV VD T + P A TSA + + ++ IP+ I SD
Sbjct: 684 IGPLPPSQGYL-YVLVVVDGMTGFTWLYPTKAPSTSATVKSL---NVLTSIAIPKVIHSD 739
Query: 1494 NGTQFSSSQTREFCKEMGIQMRFASVEHPQTNGQVESANRVILRGLRRRLAEAKGAWLDE 1553
G F+SS E+ KE GI + F++ HPQ+ +VE N I R L + L W D
Sbjct: 740 QGAAFTSSTFAEWAKERGIHLEFSTPYHPQSGSKVERKNSDIKRLLTKLLVGRPTKWYDL 799
Query: 1554 LPAVLWSYNTTEQSTTRETPFRMTYGVDAMLP 1585
LP V + N T + TP ++ +G+D+ P
Sbjct: 800 LPVVQLALNNTYSPVLKYTPHQLLFGIDSNTP 831
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 151 bits (381), Expect = 2e-35
Identities = 111/398 (27%), Positives = 189/398 (46%), Gaps = 23/398 (5%)
Query: 731 VQQEVDKLLAAEFIREVKYPTWLANVVMVKKA-----NGKWRMCVDYTDLNKACPKDSYP 785
V EV +LL IR + P V+ KK N R+ +D+ LN+ D YP
Sbjct: 197 VNNEVKQLLKDGIIRPSRSPYNSPTWVVDKKGTDAFGNPNKRLVIDFRKLNEKTIPDRYP 256
Query: 786 LPSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADEDKTAFMTARVNYCYRTMPFGLKNA 845
+PSI ++ + + +D SGYHQI + D +KT+F Y + +PFGL+NA
Sbjct: 257 MPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFSVNGGKYEFCRLPFGLRNA 316
Query: 846 GATYQRLMDRVFAGQVGRNMEVYVDDMIVKSVRGLDHHQDLEEAFGEIRKHSMRLNPEKC 905
+ +QR +D V Q+G+ VYVDD+I+ S DH + ++ + +MR++ EK
Sbjct: 317 SSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHIDTVLKCLIDANMRVSQEKT 376
Query: 906 SFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEVQRLTGRIAALSRFLPQSGD 965
F + ++LGF+++ G + +PEK KAIQ+ P V +V+ G + F+
Sbjct: 377 RFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRSFLGLASYYRVFIKDFAA 436
Query: 966 RSFPFFKCLR-----------KNVAFEWTAECEEAFVRLKELLSSPPILSK-PIQGHPLH 1013
+ P L+ K + E+ AF RL+ +L+S ++ K P P
Sbjct: 437 IARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNILASEDVILKYPDFKKPFD 496
Query: 1014 LYFAVSDSALSSVMLQEIDGEHRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRLRPY- 1072
L S S + +V+ Q E R + +S TL+ E Y E+ LA++ +L+ +
Sbjct: 497 LTTDASASGIGAVLSQ----EGRPITMISRTLKQPEQNYATNERELLAIVWALGKLQNFL 552
Query: 1073 FQSFPVKVRTD-LPLRQVLQKPDLSGRLVAWSVELSEY 1109
+ S + + TD PL + + + ++ W + ++
Sbjct: 553 YGSREINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQH 590
Score = 44.3 bits (103), Expect = 0.003
Identities = 66/289 (22%), Positives = 108/289 (36%), Gaps = 42/289 (14%)
Query: 1379 KVLRAGFYWPTLRKDCMDFVKQCKECQVFADLSKAPPKELVTMSAPWPFAMWGVDLVGPF 1438
+VLR +Y+P + + V C+ C A + P K+ + P P + + F
Sbjct: 759 QVLR-DYYFPKMGSLAKEVVANCRVCTQ-AKYDRHPKKQELG-ETPIPSYTGEMVHIDIF 815
Query: 1439 PTARAQMKFILVAVDYFTKWIEAEPLAKITSAKIVNFYWKRIVCRFGIPRAIVSDNGTQF 1498
T R K L +D F+K+ +P+ T I + I+ F + + DN F
Sbjct: 816 STDR---KLFLTCIDKFSKYAIVQPVVSRTIVDITAPLLQ-IINLFPNIKTVYCDNEPAF 871
Query: 1499 SSSQTREFCKE-MGIQMRFASVEHPQTNGQVESANRVILRGLR-RRLAEAKGAWLDELPA 1556
+S K GI + A H +NGQVE + + R +L + ++ +
Sbjct: 872 NSETVTSMLKNSFGIDIVNAPPLHSSSNGQVERFHSTLAEIARCLKLDKKTNDTVELILR 931
Query: 1557 VLWSYNTTEQSTTRETPFRMTYGVDAMLPVEIDNFTWRTRPGFEEENQANMAVELDLLSE 1616
YN T S TRE P + + PG E + ++ L+
Sbjct: 932 ATIEYNKTVHSVTRERPIEVVH------------------PGAHER---CLEIKARLVKA 970
Query: 1617 TRDEAHIRETAMKQRVAAKFNSRVRVRDMQVGDLVLKWRSGAPGNKLTP 1665
+D + + RV +VG+ V + GNKLTP
Sbjct: 971 QQDSIGRNNPSRQNRV------------FEVGERVFVKNNKRLGNKLTP 1007
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 129 bits (324), Expect = 7e-29
Identities = 127/499 (25%), Positives = 215/499 (42%), Gaps = 31/499 (6%)
Query: 649 PIEETKALKFGDRTLKIGTRLTEEQETRLTKLLGENLDLFAWSCKDMPGIDPN----FIC 704
P+EE L G R + +T+++ ++ +LL + C + P +DPN ++
Sbjct: 184 PLEEIAILSEGRRLSEEKLFITQQRMQKIEELLEK-------VCSENP-LDPNKTKQWMK 235
Query: 705 HRLALNPSVKPVSQLRRRLGGDKGKAVQQEVDKLLAAEFIREVKYPTWLANVVM---VKK 761
+ L+ K + + + +++ +LL + I+ K P ++ +K
Sbjct: 236 ASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEK 295
Query: 762 ANGKWRMCVDYTDLNKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADE 821
GK RM V+Y +NKA D+Y LP+ D L+ G ++ S D SG+ Q+ +
Sbjct: 296 RRGKKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESR 355
Query: 822 DKTAFMTARVNYCYRTMPFGLKNAGATYQRLMDRVFAGQVGRNM-EVYVDDMIVKSVRGL 880
TAF + +Y + +PFGLK A + +QR MD F +V R VYVDD++V S
Sbjct: 356 PLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF--RVFRKFCCVYVDDILVFSNNEE 413
Query: 881 DHHQDLEEAFGEIRKHSMRLNPEKCSFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSP 940
DH + + +H + L+ +K + FLG I + + I +
Sbjct: 414 DHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDT 473
Query: 941 -SNVKEVQRLTGRIAALSRFLPQSGDRSFPFFKCLRKNVAFEWTAECEEAFVRLKELLSS 999
+ K++QR G + S ++P+ P L++NV + WT E ++K+ L
Sbjct: 474 LEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWRWTKEDTLYMQKVKKNLQG 533
Query: 1000 PPILSKPIQGHPLHLYFAVSD----SALSSVMLQEIDGEHRIVYFVSHTLQGAEVRYQKI 1055
P L P+ L + SD L ++ + E I + S + + AE Y
Sbjct: 534 FPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKNYHSN 593
Query: 1056 EKAALAVLVTARRLRPYFQSFPVKVRTD----LPLRQVLQKPDLS-GRLVAWSVELSEYG 1110
+K LAV+ T ++ Y +RTD + K D GR + W LS Y
Sbjct: 594 DKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYS 653
Query: 1111 LQYDK-RGTVGAQSLADFV 1128
+ +GT ADF+
Sbjct: 654 FDVEHIKGT--DNHFADFL 670
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 129 bits (323), Expect = 9e-29
Identities = 127/499 (25%), Positives = 216/499 (42%), Gaps = 31/499 (6%)
Query: 649 PIEETKALKFGDRTLKIGTRLTEEQETRLTKLLGENLDLFAWSCKDMPGIDPN----FIC 704
P+EE L G R + +T+++ ++ +LL + C + P +DPN ++
Sbjct: 184 PLEEIAILSEGRRLSEEKLFITQQRMQKIEELLEK-------VCSENP-LDPNKTKQWMK 235
Query: 705 HRLALNPSVKPVSQLRRRLGGDKGKAVQQEVDKLLAAEFIREVKYPTWLANVVM---VKK 761
+ L+ K + + + +++ +LL + I+ K P ++ +K
Sbjct: 236 ASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEK 295
Query: 762 ANGKWRMCVDYTDLNKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADE 821
GK RM V+Y +NKA D+Y LP+ D L+ G ++ S D SG+ Q+ +
Sbjct: 296 RRGKKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESR 355
Query: 822 DKTAFMTARVNYCYRTMPFGLKNAGATYQRLMDRVFAGQVGRNM-EVYVDDMIVKSVRGL 880
TAF + +Y + +PFGLK A + +QR MD F +V R VYVDD++V S
Sbjct: 356 PLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF--RVFRKFCCVYVDDILVFSNNEE 413
Query: 881 DHHQDLEEAFGEIRKHSMRLNPEKCSFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSP 940
DH + + +H + L+ +K + FLG I + + I +
Sbjct: 414 DHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDT 473
Query: 941 -SNVKEVQRLTGRIAALSRFLPQSGDRSFPFFKCLRKNVAFEWTAECEEAFVRLKELLSS 999
+ K++QR G + S ++P+ P L++NV ++WT E ++K+ L
Sbjct: 474 LEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQG 533
Query: 1000 PPILSKPIQGHPLHLYFAVSD----SALSSVMLQEIDGEHRIVYFVSHTLQGAEVRYQKI 1055
P L P+ L + SD L ++ + E I + S + + AE Y
Sbjct: 534 FPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNYHSN 593
Query: 1056 EKAALAVLVTARRLRPYFQSFPVKVRTD----LPLRQVLQKPDLS-GRLVAWSVELSEYG 1110
+K LAV+ T ++ Y +RTD + K D GR + W LS Y
Sbjct: 594 DKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYS 653
Query: 1111 LQYDK-RGTVGAQSLADFV 1128
+ +GT ADF+
Sbjct: 654 FDVEHIKGT--DNHFADFL 670
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 127 bits (319), Expect = 3e-28
Identities = 125/485 (25%), Positives = 210/485 (42%), Gaps = 24/485 (4%)
Query: 663 LKIGTRLTEEQETRLTKLLGENLDLFAWSCKDMPGIDPN----FICHRLALNPSVKPVSQ 718
L G RL+EE+ + + + +L C + P +DPN ++ + L+ K +
Sbjct: 191 LSEGRRLSEEKLFITQQRMQKIEELLEKVCSENP-LDPNKTKQWMKASIKLSDPSKAIKV 249
Query: 719 LRRRLGGDKGKAVQQEVDKLLAAEFIREVKYPTWLANVVM---VKKANGKWRMCVDYTDL 775
+ + +++ +LL + I+ K P ++ +K GK RM V+Y +
Sbjct: 250 KPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAM 309
Query: 776 NKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADEDKTAFMTARVNYCY 835
NKA D+Y LP+ D L+ G ++ S D SG+ Q+ + TAF + +Y +
Sbjct: 310 NKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEW 369
Query: 836 RTMPFGLKNAGATYQRLMDRVFAGQVGRNM-EVYVDDMIVKSVRGLDHHQDLEEAFGEIR 894
+PFGLK A + +QR MD F +V R VYVDD++V S DH + +
Sbjct: 370 NVVPFGLKQAPSIFQRHMDEAF--RVFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCN 427
Query: 895 KHSMRLNPEKCSFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSP-SNVKEVQRLTGRI 953
+H + L+ +K + FLG I + + I + + K++QR G +
Sbjct: 428 QHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGIL 487
Query: 954 AALSRFLPQSGDRSFPFFKCLRKNVAFEWTAECEEAFVRLKELLSSPPILSKPIQGHPLH 1013
S ++P+ P L++NV ++WT E ++K+ L P L P+ L
Sbjct: 488 TYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLI 547
Query: 1014 LYFAVSD----SALSSVMLQEIDGEHRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRL 1069
+ SD L ++ + E I + S + + AE Y +K LAV+ T ++
Sbjct: 548 IETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKF 607
Query: 1070 RPYFQSFPVKVRTD----LPLRQVLQKPDLS-GRLVAWSVELSEYGLQYDK-RGTVGAQS 1123
Y +RTD + K D GR + W LS Y + +GT
Sbjct: 608 SIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGT--DNH 665
Query: 1124 LADFV 1128
ADF+
Sbjct: 666 FADFL 670
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 674
Score = 125 bits (315), Expect = 8e-28
Identities = 129/514 (25%), Positives = 218/514 (42%), Gaps = 29/514 (5%)
Query: 634 ENLDPRGEGRVNRPTPIEETKALKFGDRTLKIGTRLTEEQETRLTKLLGENLDLFAWSCK 693
E++ R + + P I K L G RL+EE+ + + + +L C
Sbjct: 162 ESMKKRSKTQQPEPVNISTNKIA-----ILSEGRRLSEEKLFITQQRMQKIEELLEKVCS 216
Query: 694 DMPGIDPN----FICHRLALNPSVKPVSQLRRRLGGDKGKAVQQEVDKLLAAEFIREVKY 749
+ P +DPN ++ + L+ K + + + +++ +LL + I+ K
Sbjct: 217 ENP-LDPNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKS 275
Query: 750 PTWLANVVM---VKKANGKWRMCVDYTDLNKACPKDSYPLPSIDSLVDGASGNELLSLMD 806
P ++ +K GK RM V+Y +NKA D+Y P+ D L+ G ++ S D
Sbjct: 276 PHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNPPNKDELLTLIRGKKIFSSFD 335
Query: 807 AYSGYHQIRMHPADEDKTAFMTARVNYCYRTMPFGLKNAGATYQRLMDRVFAGQVGRNM- 865
SG+ Q+ + TAF + +Y + +PFGLK A + +QR MD F +V R
Sbjct: 336 CKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF--RVFRKFC 393
Query: 866 EVYVDDMIVKSVRGLDHHQDLEEAFGEIRKHSMRLNPEKCSFGVQGGKFLGFMITSRGIE 925
VYVDD++V S DH + + +H + L+ +K + FLG I +
Sbjct: 394 CVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHK 453
Query: 926 INPEKCKAIQQMKSP-SNVKEVQRLTGRIAALSRFLPQSGDRSFPFFKCLRKNVAFEWTA 984
+ I + + K++QR G + S ++P+ P L++NV ++WT
Sbjct: 454 PQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTK 513
Query: 985 ECEEAFVRLKELLSSPPILSKPIQGHPLHLYFAVSD----SALSSVMLQEIDGEHRIVYF 1040
E ++K+ L P L P+ L + SD L ++ + E I +
Sbjct: 514 EDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRY 573
Query: 1041 VSHTLQGAEVRYQKIEKAALAVLVTARRLRPYFQSFPVKVRTD----LPLRQVLQKPDLS 1096
S + + AE Y +K LAV+ T ++ Y +RTD + K D
Sbjct: 574 ASGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSK 633
Query: 1097 -GRLVAWSVELSEYGLQYDK-RGTVGAQSLADFV 1128
GR + W LS Y + +GT ADF+
Sbjct: 634 LGRNIRWQAWLSHYSFDVEHIKGT--DNHFADFL 665
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 680
Score = 125 bits (314), Expect = 1e-27
Identities = 124/485 (25%), Positives = 208/485 (42%), Gaps = 24/485 (4%)
Query: 663 LKIGTRLTEEQETRLTKLLGENLDLFAWSCKDMPGIDPN----FICHRLALNPSVKPVSQ 718
L G RL+EE+ + + + +L C + P +DPN ++ + L+ K +
Sbjct: 192 LSEGRRLSEEKLFITQQRMQKTEELLEKVCSENP-LDPNKTKQWMKASIKLSDPSKAIKV 250
Query: 719 LRRRLGGDKGKAVQQEVDKLLAAEFIREVKYPTWLANVVMVKKAN---GKWRMCVDYTDL 775
+ + +++ +LL + I+ K P ++ +A G RM V+Y +
Sbjct: 251 KPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAENGRGNKRMVVNYKAM 310
Query: 776 NKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADEDKTAFMTARVNYCY 835
NKA D+Y LP+ D L+ G ++ S D SG+ Q+ + TAF + +Y +
Sbjct: 311 NKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEW 370
Query: 836 RTMPFGLKNAGATYQRLMDRVFAGQVGRNM-EVYVDDMIVKSVRGLDHHQDLEEAFGEIR 894
+PFGLK A + +QR MD F +V R VYVDD++V S DH + +
Sbjct: 371 NVVPFGLKQAPSIFQRHMDEAF--RVFRKFCCVYVDDIVVFSNNEEDHLLHVAMILQKCN 428
Query: 895 KHSMRLNPEKCSFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSP-SNVKEVQRLTGRI 953
+H + L+ +K + FLG I + + I + + K++QR G +
Sbjct: 429 QHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGIL 488
Query: 954 AALSRFLPQSGDRSFPFFKCLRKNVAFEWTAECEEAFVRLKELLSSPPILSKPIQGHPLH 1013
S ++P P L++NV ++WT E ++K+ L P L P+ L
Sbjct: 489 TYASDYIPNLAQMRQPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLI 548
Query: 1014 LYFAVSD----SALSSVMLQEIDGEHRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRL 1069
+ SD L ++ + E I + S + + AE Y +K LAV+ T ++
Sbjct: 549 IETDASDDYWGGMLKAIKINEGTNTELICRYRSGSFKAAERNYHSNDKETLAVINTIKKF 608
Query: 1070 RPYFQSFPVKVRTD----LPLRQVLQKPDLS-GRLVAWSVELSEYGLQYDK-RGTVGAQS 1123
Y +RTD + K D GR + W LS Y + +GT
Sbjct: 609 SIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGT--DNH 666
Query: 1124 LADFV 1128
ADF+
Sbjct: 667 FADFL 671
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 666
Score = 125 bits (313), Expect = 1e-27
Identities = 108/377 (28%), Positives = 171/377 (44%), Gaps = 15/377 (3%)
Query: 760 KKANGKWRMCVDYTDLNKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGYHQIRMHPA 819
++ GK RM V+Y +N+A DS+ LP++ L+ G + S D SG+ Q+ +
Sbjct: 287 ERRRGKKRMVVNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKSGFWQVVLDEE 346
Query: 820 DEDKTAFMTARVNYCYRTMPFGLKNAGATYQRLMDRVFAGQVGRNMEVYVDDMIVKSVRG 879
+ TAF + ++ ++ +PFGLK A + +QR M G + VYVDD+IV S
Sbjct: 347 SQKLTAFTCPQGHFQWKVVPFGLKQAPSIFQRHMQTALNG-ADKFCMVYVDDIIVFSNSE 405
Query: 880 LDHHQDLEEAFGEIRKHSMRLNPEKCSFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKS 939
LDH+ + + K+ + L+ +K + + FLG I +G P+ K
Sbjct: 406 LDHYNHVYAVLKIVEKYGIILSKKKANLFKEKINFLGLEI-DKGTHC-PQNHILENIHKF 463
Query: 940 PSNV---KEVQRLTGRIAALSRFLPQSGDRSFPFFKCLRKNVAFEWTAECEEAFVRLKEL 996
P + K +QR G + ++P+ + P L+K+V + WT + ++K+
Sbjct: 464 PDRLEDKKHLQRFLGVLTYAETYIPKLAEIRKPLQVKLKKDVTWNWTQSDSDYVKKIKKN 523
Query: 997 LSSPPILSKPIQGHPLHLYFAVSDSALSSVM-LQEIDGEHRIVYFVSHTLQGAEVRYQKI 1055
L S P L P L + SDS V+ + +DG I + S + + AE Y
Sbjct: 524 LGSFPKLYLPKPEDHLIIETDASDSFWGGVLKARALDGVELICRYSSGSFKQAEKNYHSN 583
Query: 1056 EKAALAVLVTARRLRPYFQSFPVKVRTDLP-----LRQVLQKPDLSGRLVAWSVELSEYG 1110
+K LAV + Y VRTD LR L+ GRLV W S+Y
Sbjct: 584 DKELLAVKQVITKFSAYLTPVRFTVRTDNKNFTYFLRINLKGDSKQGRLVRWQNWFSKY- 642
Query: 1111 LQYDKRGTVGAQS-LAD 1126
Q+D G ++ LAD
Sbjct: 643 -QFDVEHLEGVKNVLAD 658
>POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1161
Score = 122 bits (306), Expect = 8e-27
Identities = 104/347 (29%), Positives = 160/347 (45%), Gaps = 31/347 (8%)
Query: 1356 EAIMSEVHEGVCASHIGGRSLACKVLRAGFYWPTLRKDCMDFVKQCKECQVFADLSKAPP 1415
E I+S H +H G + KV + ++WP LRKD + ++QCK+C V + P
Sbjct: 817 EKIISTAHN---IAHTGRDATFLKV-SSKYWWPNLRKDVVKSIRQCKQCLVTNATNLTSP 872
Query: 1416 KELVTMSAPWPFAMWGVDLVGPFPTARAQMKFILVAVDYFTKWIEAEPL-AKITSAKIVN 1474
L + PF + +D +GP P + + +LV VD T ++ P A TSA +
Sbjct: 873 PILRPVKPLKPFDKFYIDYIGPLPPSNGYLH-VLVVVDSMTGFVWLYPTKAPSTSATVKA 931
Query: 1475 FYWKRIVCRFGIPRAIVSDNGTQFSSSQTREFCKEMGIQMRFASVEHPQTNGQVESANRV 1534
++ IP+ + SD G F+SS ++ KE GIQ+ F++ HPQ++G+VE N
Sbjct: 932 L---NMLTSIAIPKVLHSDQGAAFTSSTFADWAKEKGIQLEFSTPYHPQSSGKVERKNSD 988
Query: 1535 ILRGLRRRLAEAKGAWLDELPAVLWSYNTTEQSTTRETPFRMTYGVDAMLPVEIDNFTWR 1594
I R L + L W D LP V + N + +++ TP ++ +GVD+ P +
Sbjct: 989 IKRLLTKLLIGRPAKWYDLLPVVQLALNNSYSPSSKYTPHQLLFGVDSNTPFANSDTLDL 1048
Query: 1595 TRPGFEEENQANMAVELDLLSETRDEAHIRETAMKQRVAAKFNSRVRVRDMQVGDLVLKW 1654
+R EE EL LL E R H Q + +S R VG LV +
Sbjct: 1049 SR---EE--------ELSLLQEIRSSLH-------QPTSPPASS--RSWSPSVGQLVQE- 1087
Query: 1655 RSGAPGNKLTPNWEGPYRIVKVLGNGAYHLEELDGRRLPRSFNGLSL 1701
R P + L P W P I++V+ + + G R S + L L
Sbjct: 1088 RVARPAS-LRPRWHKPTAILEVVNPRTVIILDHLGNRRTVSVDNLKL 1133
Score = 86.7 bits (213), Expect = 5e-16
Identities = 89/417 (21%), Positives = 169/417 (40%), Gaps = 29/417 (6%)
Query: 709 LNPSVKPVSQLRRRLGGDKGKAVQQEVDKLLAAEFIREVKYPTWLANVVMVKKANGKWRM 768
+NP KP ++Q +D LL + + + T V V K +GKWRM
Sbjct: 182 INPKAKP--------------SIQIVIDDLLKQGVLIQ-QNSTMNTPVYPVPKPDGKWRM 226
Query: 769 CVDYTDLNKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADEDKTAFMT 828
+DY ++NK P + ++ + + +D +G+ + P TAF
Sbjct: 227 VLDYREVNKTIPLIAAQNQHSAGILSSIYRGKYKTTLDLTNGFWAHPITPESYWLTAFTW 286
Query: 829 ARVNYCYRTMPFGLKNAGATYQRLMDRVFAGQVGRNMEVYVDDMIVKSVRGLDHHQDLEE 888
YC+ +P G N+ A + D V + N++ YVDD+ + +H + LE+
Sbjct: 287 QGKQYCWTRLPQGFLNSPALFTA--DVVDLLKEIPNVQAYVDDIYISHDDPQEHLEQLEK 344
Query: 889 AFGEIRKHSMRLNPEKCSFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEVQR 948
F + ++ +K + +FLGF IT G + + + + P ++K++Q
Sbjct: 345 IFSILLNAGYVVSLKKSEIAQREVEFLGFNITKEGRGLTDTFKQKLLNITPPKDLKQLQS 404
Query: 949 LTGRIAALSRFLPQSGDRSFPFFKCL-RKNVAF-EWTAECEEAFVRLKELLSSPPILSKP 1006
+ G + F+P + P + + N F WT + + +L+ L +
Sbjct: 405 ILGLLNFARNFIPNYSELVKPLYTIVANANGKFISWTEDNSNQLQHIISVLNQADNLEE- 463
Query: 1007 IQGHPLHLYFAVSDSALSSVMLQEIDGEHRIVYFVSHTLQGAEVRYQKIEKAALAV---L 1063
+ L V+ S + + +G R + +V++ AE ++ + EK + L
Sbjct: 464 -RNPETRLIIKVNSSPSAGYIRYYNEGSKRPIMYVNYIFSKAEAKFTQTEKLLTTMHKGL 522
Query: 1064 VTARRL---RPYFQSFPVKVRTDLPLRQVLQKPDLSGRLVAWSVELSEYGLQ--YDK 1115
+ A L + P+ T + + ++ L R + W L + +Q YDK
Sbjct: 523 IKAMDLAMGQEILVYSPIVSMTKIQRTPLPERKALPVRWITWMTYLEDPRIQFHYDK 579
>YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II
Length = 1268
Score = 116 bits (291), Expect = 5e-25
Identities = 75/249 (30%), Positives = 131/249 (52%), Gaps = 7/249 (2%)
Query: 715 PVSQLRRRLGGDKGKAVQQEVDKLLAAEFIREVKYPTWLANVVMVKK-ANGKWRMCVDY- 772
PV + R + +AV+ E+++L I + Y W A +V++KK GK R+C D+
Sbjct: 440 PVFKRARPVPYGSLEAVETELNRLQEMGVIVPITYAKWAAPIVVIKKKGTGKIRVCADFK 499
Query: 773 -TDLNKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADEDKTAFMTARV 831
+ LN A + +PLP+ + + G + S +D Y Q+ + + T R
Sbjct: 500 CSGLNAALKDEFHPLPTSEDIFSRLKGT-VYSQIDLKDAYLQVELDEEAQKLAVINTHRG 558
Query: 832 NYCYRTMPFGLKNAGATYQRLMDRVFAGQVGRNMEVYVDDMIVKSVRGLDHHQDLEEAFG 891
+ Y M FGLK A A++Q++MD++ +G G + VY DD+I+ + +H + L E F
Sbjct: 559 IFKYLRMTFGLKPAPASFQKIMDKMVSGLTG--VAVYWDDIIISASSIEEHEKILRELFE 616
Query: 892 EIRKHSMRLNPEKCSFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEVQRLTG 951
+++ R++ EKC+F + FLGF + G + +K +AI+ MK+P++ K++ G
Sbjct: 617 RFKEYGFRVSAEKCAFAQKQVTFLGF-VDEHGRRPDSKKTEAIRSMKAPTDQKQLASFLG 675
Query: 952 RIAALSRFL 960
LSR +
Sbjct: 676 AADWLSRMM 684
Score = 74.7 bits (182), Expect = 2e-12
Identities = 62/228 (27%), Positives = 100/228 (43%), Gaps = 24/228 (10%)
Query: 1358 IMSEVHEGVCASHIGGRSLACKVLRAGFYWPTLRKDCMDFVKQCKECQVFADLSKAPPKE 1417
++ ++HEG H G + K R+ +W L D + V+ C CQ + + + P
Sbjct: 786 VLKQLHEG----HPGIVQMKQKA-RSFVFWRGLDSDIENMVRHCNNCQENSKMPRVVPLN 840
Query: 1418 LVTMSAPWPF--AMWG---VDLVGPFPTARAQMKFILVAVDYFTKWIEAEPLAKITSAKI 1472
PWP A W +D GP ++LV VD TK+ E + I++
Sbjct: 841 ------PWPVPEAPWKRIHIDFAGPLNGC-----YLLVVVDAKTKYAEVKLTRSISAVTT 889
Query: 1473 VNFYWKRIVCRFGIPRAIVSDNGTQFSSSQTREFCKEMGIQMRFASVEHPQTNGQVESAN 1532
++ + I G P I+SDNGTQ +S + C+ GI+ + ++V +P++NG E
Sbjct: 890 IDLL-EEIFSIHGYPETIISDNGTQLTSHLFAQMCQSHGIEHKTSAVYYPRSNGAAERFV 948
Query: 1533 RVILRGLRRRLAEAKGAWLDELPAVLWSY-NTTEQSTTRETPFRMTYG 1579
+ RG+ + E L L SY NT + TP +G
Sbjct: 949 DTLKRGIAKIKGEG-SVNQQILNKFLISYRNTPHSALNGSTPAECHFG 995
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 659
Score = 112 bits (281), Expect = 7e-24
Identities = 134/555 (24%), Positives = 233/555 (41%), Gaps = 47/555 (8%)
Query: 608 YNNCLNFY-GKKSALVGH--RCYEIEASDENLDPRGEGRVNRPTPIEET--KALKFGDRT 662
Y + + F+ K+S ++G + Y+ + + +VNRP PI T + L +
Sbjct: 117 YTDRIYFHLNKQSVIIGKITKAYQYGVKGFLESMKKKSKVNRPEPINITSNQHLFLEEGG 176
Query: 663 LKIGTRLTEEQ-------ETRLTKLLGEN-LD---LFAWSCKDMPGIDPNFICHRLALNP 711
+ L E Q E L ++ EN +D W + IDP +
Sbjct: 177 NHVDEMLYEIQISKFSAIEEMLERVSSENPIDPEKSKQWMTATIELIDPKTVV------- 229
Query: 712 SVKPVSQLRRRLGGDKGKAVQQEVDKLLAAEFIREVKYPTWLANVVMVK----KANGKWR 767
VKP+S + +++ +LL + I+ K T ++ +V+ + GK R
Sbjct: 230 KVKPMSY-----SPSDREEFDRQIKELLELKVIKPSK-STHMSPAFLVENEAERRRGKKR 283
Query: 768 MCVDYTDLNKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADEDKTAFM 827
M V+Y +NKA D++ LP+ D L+ G ++ S D SG Q+ + + TAF
Sbjct: 284 MVVNYKAMNKATKGDAHNLPNKDELLTLVRGKKIYSSFDCKSGLWQVLLDKESQLLTAFT 343
Query: 828 TARVNYCYRTMPFGLKNAGATYQRLMDRVFAGQVGRNMEVYVDDMIVKSVRG-LDHHQDL 886
+ +Y + +PFGLK A + + + + Q + VYVDD++V S G +H+ +
Sbjct: 344 CPQGHYQWNVVPFGLKQAPSIFPKTYANSHSNQYSKYCCVYVDDILVFSNTGRKEHYIHV 403
Query: 887 EEAFGEIRKHSMRLNPEKCSFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNV--- 943
K + L+ +K + FLG I +G P+ K P +
Sbjct: 404 LNILRRCEKLGIILSKKKAQLFKEKINFLGLEI-DQGTHC-PQNHILEHIHKFPDRIEDK 461
Query: 944 KEVQRLTGRIAALSRFLPQSGDRSFPFFKCLRKNVAFEWTAECEEAFVRLKELLSSPPIL 1003
K++QR G + S ++P+ P L+++ + W + ++K+ L S P L
Sbjct: 462 KQLQRFLGILTYASDYIPKLASIRKPLQSKLKEDSTWTWNDTDSQYMAKIKKNLKSFPKL 521
Query: 1004 SKPIQGHPLHLYFAVSDSALSSVMLQEIDGEHRIVYFVSHTLQGAEVRYQKIEKAALAVL 1063
P L + S+ ++ + I + S + + AE Y EK LAV+
Sbjct: 522 YHPEPNDKLVIETDASEEFWGGILKAIHNSHEYICRYASGSFKAAERNYHSNEKELLAVI 581
Query: 1064 VTARRLRPYFQSFPVKVRTDLP-----LRQVLQKPDLSGRLVAWSVELSEYGLQYDKRGT 1118
++ Y +RTD + L+ GRLV W + LS+Y +D
Sbjct: 582 RVIKKFSIYLTPSRFLIRTDNKNFTHFVNINLKGDRKQGRLVRWQMWLSQY--DFDVEHI 639
Query: 1119 VGAQSL-ADFVVELT 1132
G +++ ADF+ E T
Sbjct: 640 AGTKNVFADFLQENT 654
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.319 0.135 0.399
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 201,931,104
Number of Sequences: 164201
Number of extensions: 8855200
Number of successful extensions: 22944
Number of sequences better than 10.0: 222
Number of HSP's better than 10.0 without gapping: 102
Number of HSP's successfully gapped in prelim test: 122
Number of HSP's that attempted gapping in prelim test: 22401
Number of HSP's gapped (non-prelim): 485
length of query: 1706
length of database: 59,974,054
effective HSP length: 124
effective length of query: 1582
effective length of database: 39,613,130
effective search space: 62667971660
effective search space used: 62667971660
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 73 (32.7 bits)
Lotus: description of TM0590c.9