
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0335b.5
(1648 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
                                                                    Score  E
Sequences producing significant alignments:                         (bits) Value
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III   316   3e-85
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran...  294   1e-78
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei...  289   4e-77
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran...  289   5e-77
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei...  287   2e-76
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei...  287   2e-76
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran...  278   1e-73
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran...  268   1e-70
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran...  233   3e-60
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro...  158   1e-37
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro...  158   1e-37
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro...  157   3e-37
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro...  154   2e-36
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro...  152   6e-36
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot...  151   1e-35
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot...  151   1e-35
POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;...  131   1e-29
POL_FENV1 (P31792) Pol polyprotein [Contains: Reverse transcript...  125   7e-28
POL_BAEVM (P10272) Pol polyprotein [Contains: Protease (EC 3.4.2...  121   2e-26
POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat pr...  118   2e-25
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 316 bits (810), Expect = 3e-85
Identities = 272/995 (27%), Positives = 462/995 (46%), Gaps = 104/995 (10%)
Query: 658 ESVSSLATKSIWKEELTRDEEIPIEEKSELKSLPSSLKYAYLEEGENKPVILNSVLTPLK 717
E +S+ A + I EE+ D + SE+++ +S + E + ++ LK
Sbjct: 845 EVLSNKAEQDITVEEVLNDPTL----FSEIETDTNSCEVVKTAETYER---FTTICEHLK 897
Query: 718 EE-----KLLKVLRDHKSALGWTIDDIKGISPAICMHKILLEENYKPIVQPQRRLNPSMK 772
E K+ V+ + + D++ S C+ I L+E +PI Q R + ++K
Sbjct: 898 RENGDDRKIWDVIEQFQDVFAISDDELGRNSGTECV--IELKEGAEPIRQKPRPIPLALK 955
Query: 773 DVVRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGITVVANENNELIPTRQVTKWRVCI 832
+RK I K+L+ VI S S W SPV +V KK G R+CI
Sbjct: 956 PEIRKMIQKMLNQKVIRE-SKSPWSSPVVLVKKKDGSI------------------RMCI 996
Query: 833 DYRRLNSVTRKDHFPLPFIDQMLDRLAGHQYYCFLDGYSGYNQICVAPEDQEKTAFTCPY 892
DYR++N V + + PLP I+ L LAG + Y D +G+ QI + + +E TAF
Sbjct: 997 DYRKVNKVVKNNAHPLPNIEATLQSLAGKKLYTVFDMIAGFWQIPLDEKSKEITAFAIGS 1056
Query: 893 GVFAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIFMDDFSVFGPNFDACLGNLALVL 952
+F + +PFGL +PA FQ M I DL+ C +++DD + + + L ++ L
Sbjct: 1057 ELFEWNVLPFGLVISPALFQGTMEEIIGDLLGVCAFVYVDDLLIASKDMEQHLQDVKEAL 1116
Query: 953 KRCQETNLVLNWEKCHFMVRDGIVLGHKVSEKGIEVDRAKIEVIEKLTPPTNIKGIRSFL 1012
R +++ + L KCH ++ LGHKV+ G+E K + +++ + PTN+K ++SFL
Sbjct: 1117 TRIRKSGMKLRASKCHIAKKEVEYLGHKVTLDGVETQEVKTDKMKQFSRPTNVKELQSFL 1176
Query: 1013 GHAGFYRRFIKDFSKLAKPMTNLLEKEAPFTFDENCLRAFESIKESLVTAPVIVAPD--- 1069
G G+YR+FI +F+++A +T+L+ + + +++ AF+ +K+ + PV+ PD
Sbjct: 1177 GLVGYYRKFILNFAQIASSLTSLISAKVAWIWEKEQEIAFQELKKLVCQTPVLAQPDVEA 1236
Query: 1070 ---WSLPFEIMCDASDLALGAVLCQK-KERVLYVIYYASRVLNEAQRNYTTTEKELLGVV 1125
PF I DAS +GAVL Q+ + + I +AS+ L+ A+ Y T+ E L ++
Sbjct: 1237 ALKGDRPFMIYTDASRKGIGAVLAQEGPDGQQHPIAFASKALSPAETRYHITDLEALAMM 1296
Query: 1126 FACEKFRPYILGFKVIVHTDHAALRHLFAKQDSKPRLIRWVLLLQEFDLEIIDRRGKDNS 1185
FA +F+ I G + V TDH L L RL RW + + EFD++I+ GK N+
Sbjct: 1297 FALRRFKTIIYGTAITVFTDHKPLISLLKGSPLADRLWRWSIEILEFDVKIVYLAGKANA 1356
Query: 1186 VADHLSRLEGGACSPIPIQEEFSDEKLLAVSTKEPLPWYVHFANFRVAGLIPHDLTWQQK 1245
VAD LSR G C P ++EE + E V+ + + ++ + L D W++
Sbjct: 1357 VADALSR---GGCPPNELEEEQTKELTSIVNAIQTELPDILDSSCWLERLKGEDEGWKEV 1413
Query: 1246 KKFLHDAKSYLWDDPFLFKICS-------------DGVIRRCITEVDFEK---------- 1282
L K+ FKI GV++ TE++ +
Sbjct: 1414 IAALEGGKT-----KGTFKIVGIESEISLEYYKIVGGVLKN--TEIEEQSRSVVPEKIRT 1466
Query: 1283 -ILWHCHGSSYGGHFSGERTAAKVLQSGFYWPTLHRNSRAFVESCDRCQRTGNISR---- 1337
+L H GHF G + +++ FYWP + V +C +C + S+
Sbjct: 1467 PLLKELHEGMLAGHF-GIKKMWRMVHRKFYWPQMRVCVENCVRTCAKCLCANDHSKLTSS 1525
Query: 1338 ----RNEMPLKNILEIELFDVWGIDFMGPFPPSFGC*YILVAVDYVSKWVEASALSTNDS 1393
R PL+ I+ +L DV G+ G YIL +D +K+ A + +
Sbjct: 1526 LTPYRMTFPLE-IVACDLMDV-GLSVQGNR-------YILTIIDLFTKYGTAVPIPDKKA 1576
Query: 1394 KVVV-AFLKKNIFTRFGVPRAIISDGGTHFCNRAFESLLEKYGVKHKVSTPYHPQTSGQV 1452
+ V+ AF+++ +P +++D G F N F ++H + Y+ + +G V
Sbjct: 1577 ETVLKAFVERWAIGEGRIPLKLLTDQGKEFVNGLFAQFTHMLKIEHITTKGYNSRANGAV 1636
Query: 1453 EISNRELKRILEKVVNSSRKDWSRKLDDALWAYRTAFKTPIGTSPFHLVFGKACHLPVEL 1512
E N+ + I++K + +W ++ A++AY G +P L+ G+ P+E+
Sbjct: 1637 ERFNKTIMHIMKK-KTAVPMEWDDQVVYAVYAYNNCVHENTGETPMFLMHGRDVMGPLEM 1695
Query: 1513 EHKAYWAIRKLNFDWKVASEKRLLQLNELDEFRLRAYESASIYKEKTKKWHDRKILNREF 1572
+ I + D E + L EL + + A E A +E K D+K +++
Sbjct: 1696 SGEDAVGINYADMD-----EYKHLLTQELLKVQKIAKEHAMREQESYKSLFDQKYASKKH 1750
Query: 1573 ---VSGQLVLLF--NSRLRLFPGKLKSRWSGPFVV 1602
G VLL + +L KL ++WSGP+ V
Sbjct: 1751 RFPQPGSRVLLEIPSEKLGAQCPKLVNKWSGPYRV 1785
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 294 bits (753), Expect = 1e-78
Identities = 233/771 (30%), Positives = 367/771 (47%), Gaps = 76/771 (9%)
Query: 775 VRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGITVVANENNELIPTRQVTKWRVCIDY 834
V +I +L+ G+I S+S + SP+ VVPKK + K+R+ IDY
Sbjct: 223 VESQIQDMLNQGIIRT-SNSPYNSPIWVVPKKQDAS-------------GKQKFRIVIDY 268
Query: 835 RRLNSVTRKDHFPLPFIDQMLDRLAGHQYYCFLDGYSGYNQICVAPEDQEKTAFTCPYGV 894
R+LN +T D P+P +D++L +L Y+ +D G++QI + PE KTAF+ +G
Sbjct: 269 RKLNEITVGDRHPIPNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGH 328
Query: 895 FAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIFMDDFSVFGPNFDACLGNLALVLKR 954
+ Y RMPFGL NAPATFQRCM I L+ +++DD VF + D L +L LV ++
Sbjct: 329 YEYLRMPFGLKNAPATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEK 388
Query: 955 CQETNLVLNWEKCHFMVRDGIVLGHKVSEKGIEVDRAKIEVIEKLTPPTNIKGIRSFLGH 1014
+ NL L +KC F+ ++ LGH ++ GI+ + KIE I+K PT K I++FLG
Sbjct: 389 LAKANLKLQLDKCEFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGL 448
Query: 1015 AGFYRRFIKDFSKLAKPMTNLLEKEAPF-TFDENCLRAFESIKESLVTAPVIVAPDWSLP 1073
G+YR+FI +F+ +AKPMT L+K T + AF+ +K + P++ PD++
Sbjct: 449 TGYYRKFIPNFADIAKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKK 508
Query: 1074 FEIMCDASDLALGAVLCQKKERVLYVIYYASRVLNEAQRNYTTTEKELLGVVFACEKFRP 1133
F + DASD+ALGAVL Q + Y+ SR LNE + NY+T EKELL +V+A + FR
Sbjct: 509 FTLTTDASDVALGAVLSQDGHPLSYI----SRTLNEHEINYSTIEKELLAIVWATKTFRH 564
Query: 1134 YILGFKVIVHTDHAALRHLFAKQDSKPRLIRWVLLLQEFDLEIIDRRGKDNSVADHLSR- 1192
Y+LG + +DH L L+ +D +L RW + L EFD +I +GK+N VAD LSR
Sbjct: 565 YLLGRHFEISSDHQPLSWLYRMKDPNSKLTRWRVKLSEFDFDIKYIKGKENCVADALSRI 624
Query: 1193 -LEGGACSPIPIQEEFSDEKLLAVSTKEPLPWY---------------VHFANFRVAGLI 1236
LE S D L T+ PL + + + +
Sbjct: 625 KLEETYLSEQTQHSAEEDNSDLIFITERPLNTFNRQVIFSKGPPDIKVTKYFKKHITQIF 684
Query: 1237 PHDLTWQQKKKFLHD----AKSYLW------------------DDPFLFKICSDGVIRRC 1274
+T ++ +++L D KS L+ + + + S +++
Sbjct: 685 YDIMTREKAEQYLIDHFCGKKSALYIESDADFEVIQAAHKLAINTKYTKILRSTILLKNI 744
Query: 1275 ITEVDFEKILWHCHGSSYGGHFSGERTAAKVLQSGFYWPTLHRNSRAFVESCDRCQRTGN 1334
T +F++++ H G + K+ +Y+P + + C C
Sbjct: 745 TTYAEFKELILTAHEKLL---HPGIQKTTKLFGETYYFPNSQLLIQNIINECSICNLAKT 801
Query: 1335 ISRRNEMPLKNILEIELFDVWGIDFMGPFPPSFGC*YILVAVDYVSKWVEASALSTNDSK 1394
R +MP K + E FM S G Y+ +D SK+ + T D
Sbjct: 802 EHRNTDMPTKTTPKPEHCRE---KFMIDIYSSEGKHYV-SCIDIYSKFATLEEIKTKDWI 857
Query: 1395 VVVAFLKKNIFTRFGVPRAIISDGGTHFCNRAFESLLEKYGVKHKVSTPYHPQTSGQVEI 1454
L + IF + G P+ + +D F + A + LE V+ +++T T V
Sbjct: 858 ECKNALMR-IFNQLGKPKLLKADRDGAFSSLALKRWLESEEVELQLNT-----TKTGVAD 911
Query: 1455 SNRELKRILEK--VVNSSRKDWSR--KLDDALWAYRTAFK-TPIGTSPFHL 1500
R K I EK ++ +S + ++ K++ L Y K G +P H+
Sbjct: 912 IERLHKTINEKIRIIKTSDDEETKLSKMETVLNIYNHKTKHDTTGQTPAHI 962
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 289 bits (740), Expect = 4e-77
Identities = 259/960 (26%), Positives = 436/960 (44%), Gaps = 78/960 (8%)
Query: 709 LNSVLTPLKEEKLLKVLRDHKSALGWTIDD-----IKGISPAICMHKILLEENYKPIVQP 763
+N V +KE +L + ++ K T + IKG+ + L +ENY+ ++
Sbjct: 362 MNKVSNIVKEPELPDIYKEFKDITAETNTEKLPKPIKGLEFEV----ELTQENYRLPIR- 416
Query: 764 QRRLNPSMKDVVRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGITVVANENNELIPTR 823
L P + EI + L +G+I S + PV VPKK G
Sbjct: 417 NYPLPPGKMQAMNDEINQGLKSGIIRE-SKAINACPVMFVPKKEGTL------------- 462
Query: 824 QVTKWRVCIDYRRLNSVTRKDHFPLPFIDQMLDRLAGHQYYCFLDGYSGYNQICVAPEDQ 883
R+ +DY+ LN + + +PLP I+Q+L ++ G + LD S Y+ I V D+
Sbjct: 463 -----RMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDE 517
Query: 884 EKTAFTCPYGVFAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIFMDDFSVFGPNFDA 943
K AF CP GVF Y MP+G+ APA FQ + I + E+ + +MDD + +
Sbjct: 518 HKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKSESE 577
Query: 944 CLGNLALVLKRCQETNLVLNWEKCHFMVRDGIVLGHKVSEKGIEVDRAKIEVIEKLTPPT 1003
+ ++ VL++ + NL++N KC F +G+ +SEKG + I+ + + P
Sbjct: 578 HVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPK 637
Query: 1004 NIKGIRSFLGHAGFYRRFIKDFSKLAKPMTNLLEKEAPFTFDENCLRAFESIKESLVTAP 1063
N K +R FLG + R+FI S+L P+ NLL+K+ + + +A E+IK+ LV+ P
Sbjct: 638 NRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPP 697
Query: 1064 VIVAPDWSLPFEIMCDASDLALGAVLCQK-KERVLYVIYYASRVLNEAQRNYTTTEKELL 1122
V+ D+S + DASD+A+GAVL QK + Y + Y S +++AQ NY+ ++KE+L
Sbjct: 698 VLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEML 757
Query: 1123 GVVFACEKFRPY----ILGFKVIVHTDHAAL--RHLFAKQDSKPRLIRWVLLLQEFDLEI 1176
++ + + +R Y I FK++ TDH L R + RL RW L LQ+F+ EI
Sbjct: 758 AIIKSLKHWRHYLESTIEPFKIL--TDHRNLIGRITNESEPENKRLARWQLFLQDFNFEI 815
Query: 1177 IDRRGKDNSVADHLSRLEGGACSPIPIQEEFSDEKLL-AVSTKEPLPWYV---HFANFRV 1232
R G N +AD LSR+ PIP E + + +S + V + + ++
Sbjct: 816 NYRPGSANHIADALSRIV-DETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTNDTKL 874
Query: 1233 AGLIPHDLTWQQKKKFLHDAKSYLWDDPFLFKICSDGVIRRCITEVDFEKILWHCHGSSY 1292
L+ ++ ++ L D D L + +D + R I + +H G
Sbjct: 875 LNLLNNEDKRVEENIQLKDGLLINSKDQIL--LPNDTQLTRTIIK------KYHEEGKLI 926
Query: 1293 GGHFSGERTAAKVLQSGFYWPTLHRNSRAFVESCDRCQRTGNISRRNEMPLKNILEIEL- 1351
G ++ F W + + + +V++C CQ + + + PL+ I E
Sbjct: 927 ---HPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERP 983
Query: 1352 FDVWGIDFMGPFPPSFGC*YILVAVDYVSKW-VEASALSTNDSKVVVAFLKKNIFTRFGV 1410
++ +DF+ P S G + V VD SK + + ++ + + FG
Sbjct: 984 WESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGN 1043
Query: 1411 PRAIISDGGTHFCNRAFESLLEKYGVKHKVSTPYHPQTSGQVEISNRELKRILEKVVNSS 1470
P+ II+D F ++ ++ KY K S PY PQT GQ E +N+ ++++L V ++
Sbjct: 1044 PKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTH 1103
Query: 1471 RKDWSRKLDDALWAYRTAFKTPIGTSPFHLVFGKACHL-PVELEHKAYWAIRKLNFDWKV 1529
W + +Y A + +PF +V + L P+EL
Sbjct: 1104 PNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPS--------------- 1148
Query: 1530 ASEKRLLQLNELDEFRLRAYESASIYKEKTKKWHDRKILN-REFVSGQLVLLFNSRLRLF 1588
S+K E + E + K KK+ D KI EF G LV++ ++
Sbjct: 1149 FSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVKRTKTGFL 1208
Query: 1589 --PGKLKSRWSGPFVVKRVFPHGAVEVENPET-KNIF--TVNGQRLKVYQGGEVLKLETM 1643
KL ++GPF V + E++ P++ K++F T + L+ Y+ L T+
Sbjct: 1209 HKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEKYRHNSELNYATI 1268
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 289 bits (739), Expect = 5e-77
Identities = 237/827 (28%), Positives = 390/827 (46%), Gaps = 88/827 (10%)
Query: 751 ILLEENYKPIVQPQRRLNPSMKDVVRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGIT 810
+L + PI Q L + + V ++ ++L+ G+I S+S + SP VVPKK +
Sbjct: 198 VLNTTHNSPIYSKQYPLAQTHEIEVENQVQEMLNQGLIRE-SNSPYNSPTWVVPKKPDAS 256
Query: 811 VVANENNELIPTRQVTKWRVCIDYRRLNSVTRKDHFPLPFIDQMLDRLAGHQYYCFLDGY 870
K+RV IDYR+LN +T D +P+P +D++L +L QY+ +D
Sbjct: 257 -------------GANKYRVVIDYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLA 303
Query: 871 SGYNQICVAPEDQEKTAFTCPYGVFAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIF 930
G++QI + E KTAF+ G + Y RMPFGL NAPATFQRCM I L+ ++
Sbjct: 304 KGFHQIEMDEESISKTAFSTKSGHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVY 363
Query: 931 MDDFSVFGPNFDACLGNLALVLKRCQETNLVLNWEKCHFMVRDGIVLGHKVSEKGIEVDR 990
+DD +F + L ++ LV + + NL L +KC F+ ++ LGH V+ GI+ +
Sbjct: 364 LDDIIIFSTSLTEHLNSIQLVFTKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNP 423
Query: 991 AKIEVIEKLTPPTNIKGIRSFLGHAGFYRRFIKDFSKLAKPMTNLLEKEAPF-TFDENCL 1049
K++ I PT K IR+FLG G+YR+FI +++ +AKPMT+ L+K T +
Sbjct: 424 IKVKAIVSYPIPTKDKEIRAFLGLTGYYRKFIPNYADIAKPMTSCLKKRTKIDTQKLEYI 483
Query: 1050 RAFESIKESLVTAPVIVAPDWSLPFEIMCDASDLALGAVLCQKKERVLYVIYYASRVLNE 1109
AFE +K ++ P++ PD+ F + DAS+LALGAVL Q + ++ SR LN+
Sbjct: 484 EAFEKLKALIIRDPILQLPDFEKKFVLTTDASNLALGAVLSQNGHPISFI----SRTLND 539
Query: 1110 AQRNYTTTEKELLGVVFACEKFRPYILGFKVIVHTDHAALRHLFAKQDSKPRLIRWVLLL 1169
+ NY+ EKELL +V+A + FR Y+LG + ++ +DH LR L ++ +L RW + L
Sbjct: 540 HELNYSAIEKELLAIVWATKTFRHYLLGRQFLIASDHQPLRWLHNLKEPGAKLERWRVRL 599
Query: 1170 QEFDLEIIDRRGKDNSVADHLSR--LEGGACSPIPIQEEFSDEKLLAVSTKEPLPWYVHF 1227
E+ +I +GK+NSVAD LSR +E S D L T++P+ ++
Sbjct: 600 SEYQFKIDYIKGKENSVADALSRIKIEENHHSEATQHSAEEDNSNLIHLTEKPINYFKKQ 659
Query: 1228 ANF-----------RVAG----LIPHDLTWQQKKK------FLHDAKSYLWDDPFLFKIC 1266
F ++ G I +D+ +K K F+H + + F+I
Sbjct: 660 IIFIKSDKNKVEHSKIFGNSITTIQYDVMTLEKAKQILLDHFIHRNITIYIESDVDFEIV 719
Query: 1267 SDG---VIRRCITEV--------------DFEKILWHCHGSSYGGHFSGERTAAKVLQSG 1309
++ T+V +F++I+ H G + K+ +
Sbjct: 720 QRAHIEIVNTTYTKVIRSLFLLKNVGSYAEFKEIILQSHEKLL---HPGIQKMTKLFKEN 776
Query: 1310 FYWPTLHRNSRAFVESCDRCQRTGNISRRNEMPLK------NILEIELFDVWGIDFMGPF 1363
++P + + C+ C R +MPLK + E + D++
Sbjct: 777 HFFPNSQLLIQNIINECNICNLAKTEHRNTKMPLKITPNPEHCREKFVVDIYS------- 829
Query: 1364 PPSFGC*YILVAVDYVSKWVEASALSTNDSKVVVAFLKKNIFTRFGVPRAIISDGGTHFC 1423
S G YI +D SK+ + T D + IF + G P+ + +D F
Sbjct: 830 --SEGKHYI-SCIDIYSKFATLEQIKTKD-WIECRNALMRIFNQLGKPKLLKADRDGAFS 885
Query: 1424 NRAFESLLEKYGVKHKVSTPYHPQTSGQVEISNRELKRILEK--VVNSSRKDWSR--KLD 1479
+ A + LE+ V+ +++T +G ++ R K I EK ++NSS + + K++
Sbjct: 886 SLALKRWLEEEEVELQLNT----AKNGVADV-ERLHKTINEKIRIINSSDDEEVKLSKIE 940
Query: 1480 DALWAYRTAFKTPIGTSPFHLVFGKACHLPVELEHKAYWAIRKLNFD 1526
L+ Y K +F A H ++ + I K+N D
Sbjct: 941 TILYTYNQKIKHDTTGQRPAQIFLYAGHPILDTQKIKEKKIEKINED 987
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 287 bits (734), Expect = 2e-76
Identities = 258/960 (26%), Positives = 437/960 (44%), Gaps = 78/960 (8%)
Query: 709 LNSVLTPLKEEKLLKVLRDHKSALGWTIDD-----IKGISPAICMHKILLEENYKPIVQP 763
+N V +KE +L + ++ K T + IKG+ + L +ENY+ ++
Sbjct: 362 MNKVSNIVKEPELPDIYKEFKDITAETNTEKLPKPIKGLEFEV----ELTQENYRLPIR- 416
Query: 764 QRRLNPSMKDVVRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGITVVANENNELIPTR 823
L P + EI + L +G+I S + PV VPKK G
Sbjct: 417 NYPLPPGKMQAMNDEINQGLKSGIIRE-SKAINACPVMFVPKKEGTL------------- 462
Query: 824 QVTKWRVCIDYRRLNSVTRKDHFPLPFIDQMLDRLAGHQYYCFLDGYSGYNQICVAPEDQ 883
R+ +DY+ LN + + +PLP I+Q+L ++ G + LD S Y+ I V D+
Sbjct: 463 -----RMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDE 517
Query: 884 EKTAFTCPYGVFAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIFMDDFSVFGPNFDA 943
K AF CP GVF Y MP+G+ APA FQ + I ++ E+ + +MD+ + +
Sbjct: 518 HKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKSESE 577
Query: 944 CLGNLALVLKRCQETNLVLNWEKCHFMVRDGIVLGHKVSEKGIEVDRAKIEVIEKLTPPT 1003
+ ++ VL++ + NL++N KC F +G+ +SEKG + I+ + + P
Sbjct: 578 HVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPK 637
Query: 1004 NIKGIRSFLGHAGFYRRFIKDFSKLAKPMTNLLEKEAPFTFDENCLRAFESIKESLVTAP 1063
N K +R FLG + R+FI S+L P+ NLL+K+ + + +A E+IK+ LV+ P
Sbjct: 638 NRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPP 697
Query: 1064 VIVAPDWSLPFEIMCDASDLALGAVLCQK-KERVLYVIYYASRVLNEAQRNYTTTEKELL 1122
V+ D+S + DASD+A+GAVL QK + Y + Y S +++AQ NY+ ++KE+L
Sbjct: 698 VLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEML 757
Query: 1123 GVVFACEKFRPY----ILGFKVIVHTDHAAL--RHLFAKQDSKPRLIRWVLLLQEFDLEI 1176
++ + + +R Y I FK++ TDH L R + RL RW L LQ+F+ EI
Sbjct: 758 AIIKSLKHWRHYLESTIEPFKIL--TDHRNLIGRITNESEPENKRLARWQLFLQDFNFEI 815
Query: 1177 IDRRGKDNSVADHLSRLEGGACSPIPIQEEFSDEKLL-AVSTKEPLPWYV---HFANFRV 1232
R G N +AD LSR+ PIP E + + +S + V + + ++
Sbjct: 816 NYRPGSANHIADALSRIV-DETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTNDTKL 874
Query: 1233 AGLIPHDLTWQQKKKFLHDAKSYLWDDPFLFKICSDGVIRRCITEVDFEKILWHCHGSSY 1292
L+ ++ ++ L D D L + +D + R I + +H G
Sbjct: 875 LNLLNNEDKRVEENIQLKDGLLINSKDQIL--LPNDTQLTRTIIK------KYHEEGKLI 926
Query: 1293 GGHFSGERTAAKVLQSGFYWPTLHRNSRAFVESCDRCQRTGNISRRNEMPLKNILEIEL- 1351
G ++ F W + + + +V++C CQ + + + PL+ I E
Sbjct: 927 ---HPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERP 983
Query: 1352 FDVWGIDFMGPFPPSFGC*YILVAVDYVSKW-VEASALSTNDSKVVVAFLKKNIFTRFGV 1410
++ +DF+ P S G + V VD SK + + ++ + + FG
Sbjct: 984 WESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGN 1043
Query: 1411 PRAIISDGGTHFCNRAFESLLEKYGVKHKVSTPYHPQTSGQVEISNRELKRILEKVVNSS 1470
P+ II+D F ++ ++ KY K S PY PQT GQ E +N+ ++++L V ++
Sbjct: 1044 PKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTH 1103
Query: 1471 RKDWSRKLDDALWAYRTAFKTPIGTSPFHLVFGKACHL-PVELEHKAYWAIRKLNFDWKV 1529
W + +Y A + +PF +V + L P+EL
Sbjct: 1104 PNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPS--------------- 1148
Query: 1530 ASEKRLLQLNELDEFRLRAYESASIYKEKTKKWHDRKILN-REFVSGQLVLLFNSRLRLF 1588
S+K E + E + K KK+ D KI EF G LV++ ++
Sbjct: 1149 FSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVKRTKTGFL 1208
Query: 1589 --PGKLKSRWSGPFVVKRVFPHGAVEVENPET-KNIF--TVNGQRLKVYQGGEVLKLETM 1643
KL ++GPF V + E++ P++ K++F T + L+ Y+ L T+
Sbjct: 1209 HKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEKYRHNSELNYATI 1268
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 287 bits (734), Expect = 2e-76
Identities = 258/960 (26%), Positives = 437/960 (44%), Gaps = 78/960 (8%)
Query: 709 LNSVLTPLKEEKLLKVLRDHKSALGWTIDD-----IKGISPAICMHKILLEENYKPIVQP 763
+N V +KE +L + ++ K T + IKG+ + L +ENY+ ++
Sbjct: 362 MNKVSNIVKEPELPDIYKEFKDITAETNTEKLPKPIKGLEFEV----ELTQENYRLPIR- 416
Query: 764 QRRLNPSMKDVVRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGITVVANENNELIPTR 823
L P + EI + L +G+I S + PV VPKK G
Sbjct: 417 NYPLPPGKMQAMNDEINQGLKSGIIRE-SKAINACPVMFVPKKEGTL------------- 462
Query: 824 QVTKWRVCIDYRRLNSVTRKDHFPLPFIDQMLDRLAGHQYYCFLDGYSGYNQICVAPEDQ 883
R+ +DY+ LN + + +PLP I+Q+L ++ G + LD S Y+ I V D+
Sbjct: 463 -----RMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDE 517
Query: 884 EKTAFTCPYGVFAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIFMDDFSVFGPNFDA 943
K AF CP GVF Y MP+G+ APA FQ + I ++ E+ + +MD+ + +
Sbjct: 518 HKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKSESE 577
Query: 944 CLGNLALVLKRCQETNLVLNWEKCHFMVRDGIVLGHKVSEKGIEVDRAKIEVIEKLTPPT 1003
+ ++ VL++ + NL++N KC F +G+ +SEKG + I+ + + P
Sbjct: 578 HVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPK 637
Query: 1004 NIKGIRSFLGHAGFYRRFIKDFSKLAKPMTNLLEKEAPFTFDENCLRAFESIKESLVTAP 1063
N K +R FLG + R+FI S+L P+ NLL+K+ + + +A E+IK+ LV+ P
Sbjct: 638 NRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPP 697
Query: 1064 VIVAPDWSLPFEIMCDASDLALGAVLCQK-KERVLYVIYYASRVLNEAQRNYTTTEKELL 1122
V+ D+S + DASD+A+GAVL QK + Y + Y S +++AQ NY+ ++KE+L
Sbjct: 698 VLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEML 757
Query: 1123 GVVFACEKFRPY----ILGFKVIVHTDHAAL--RHLFAKQDSKPRLIRWVLLLQEFDLEI 1176
++ + + +R Y I FK++ TDH L R + RL RW L LQ+F+ EI
Sbjct: 758 AIIKSLKHWRHYLESTIEPFKIL--TDHRNLIGRITNESEPENKRLARWQLFLQDFNFEI 815
Query: 1177 IDRRGKDNSVADHLSRLEGGACSPIPIQEEFSDEKLL-AVSTKEPLPWYV---HFANFRV 1232
R G N +AD LSR+ PIP E + + +S + V + + ++
Sbjct: 816 NYRPGSANHIADALSRIV-DETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTNDTKL 874
Query: 1233 AGLIPHDLTWQQKKKFLHDAKSYLWDDPFLFKICSDGVIRRCITEVDFEKILWHCHGSSY 1292
L+ ++ ++ L D D L + +D + R I + +H G
Sbjct: 875 LNLLNNEDKRVEENIQLKDGLLINSKDQIL--LPNDTQLTRTIIK------KYHEEGKLI 926
Query: 1293 GGHFSGERTAAKVLQSGFYWPTLHRNSRAFVESCDRCQRTGNISRRNEMPLKNILEIEL- 1351
G ++ F W + + + +V++C CQ + + + PL+ I E
Sbjct: 927 ---HPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERP 983
Query: 1352 FDVWGIDFMGPFPPSFGC*YILVAVDYVSKW-VEASALSTNDSKVVVAFLKKNIFTRFGV 1410
++ +DF+ P S G + V VD SK + + ++ + + FG
Sbjct: 984 WESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGN 1043
Query: 1411 PRAIISDGGTHFCNRAFESLLEKYGVKHKVSTPYHPQTSGQVEISNRELKRILEKVVNSS 1470
P+ II+D F ++ ++ KY K S PY PQT GQ E +N+ ++++L V ++
Sbjct: 1044 PKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTH 1103
Query: 1471 RKDWSRKLDDALWAYRTAFKTPIGTSPFHLVFGKACHL-PVELEHKAYWAIRKLNFDWKV 1529
W + +Y A + +PF +V + L P+EL
Sbjct: 1104 PNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPS--------------- 1148
Query: 1530 ASEKRLLQLNELDEFRLRAYESASIYKEKTKKWHDRKILN-REFVSGQLVLLFNSRLRLF 1588
S+K E + E + K KK+ D KI EF G LV++ ++
Sbjct: 1149 FSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVKRTKTGFL 1208
Query: 1589 --PGKLKSRWSGPFVVKRVFPHGAVEVENPET-KNIF--TVNGQRLKVYQGGEVLKLETM 1643
KL ++GPF V + E++ P++ K++F T + L+ Y+ L T+
Sbjct: 1209 HKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEKYRHNSELNYATI 1268
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 278 bits (710), Expect = 1e-73
Identities = 163/467 (34%), Positives = 257/467 (54%), Gaps = 26/467 (5%)
Query: 739 IKGISPAICMHKILLEENYKPIVQPQRRLNPSMKDVVRKEIIKLLDAGVIYPISDSEWVS 798
+ G+S + + PI +M+ V ++I +LL G+I P S+S + S
Sbjct: 103 LSGMSVETAVKAEIRTNTQDPIYAKSYPYPVNMRGEVERQIDELLQDGIIRP-SNSPYNS 161
Query: 799 PVQVVPKKGGITVVANENNELIPTRQVTKWRVCIDYRRLNSVTRKDHFPLPFIDQMLDRL 858
P+ +VPKK N E ++R+ +D++RLN+VT D +P+P I+ L L
Sbjct: 162 PIWIVPKK------PKPNGE-------KQYRMVVDFKRLNTVTIPDTYPIPDINATLASL 208
Query: 859 AGHQYYCFLDGYSGYNQICVAPEDQEKTAFTCPYGVFAYKRMPFGLCNAPATFQRCMFAI 918
+Y+ LD SG++QI + D KTAF+ G + + R+PFGL NAPA FQR + I
Sbjct: 209 GNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPFGLKNAPAIFQRMIDDI 268
Query: 919 FSDLIETCIEIFMDDFSVFGPNFDACLGNLALVLKRCQETNLVLNWEKCHFMVRDGIVLG 978
+ I +++DD VF ++D NL LVL + NL +N EK HF+ LG
Sbjct: 269 LREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNLEKSHFLDTQVEFLG 328
Query: 979 HKVSEKGIEVDRAKIEVIEKLTPPTNIKGIRSFLGHAGFYRRFIKDFSKLAKPMTNLL-- 1036
+ V+ GI+ D K+ I ++ PPT++K ++ FLG +YR+FI+D++K+AKP+TNL
Sbjct: 329 YIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYRKFIQDYAKVAKPLTNLTRG 388
Query: 1037 ---------EKEAPFTFDENCLRAFESIKESLVTAPVIVAPDWSLPFEIMCDASDLALGA 1087
+ P T DE L++F +K L ++ ++ P ++ PF + DAS+ A+GA
Sbjct: 389 LYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKPFHLTTDASNWAIGA 448
Query: 1088 VLCQKKERVLYVIYYASRVLNEAQRNYTTTEKELLGVVFACEKFRPYILGFKVI-VHTDH 1146
VL Q + I Y SR LN+ + NY T EKE+L ++++ + R Y+ G I V+TDH
Sbjct: 449 VLSQDDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLDNLRAYLYGAGTIKVYTDH 508
Query: 1147 AALRHLFAKQDSKPRLIRWVLLLQEFDLEIIDRRGKDNSVADHLSRL 1193
L ++ +L RW ++E++ E+I + GK N VAD LSR+
Sbjct: 509 QPLTFALGNRNFNAKLKRWKARIEEYNCELIYKPGKSNVVADALSRI 555
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 268 bits (684), Expect = 1e-70
Identities = 158/436 (36%), Positives = 238/436 (54%), Gaps = 13/436 (2%)
Query: 758 KPIVQPQRRLNPSMKDVVRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGITVVANENN 817
+P+ R S + ++ ++ KL+ ++ P S S++ SP+ +VPKK
Sbjct: 313 EPVYTKNYRSPHSQVEEIQAQVQKLIKDKIVEP-SVSQYNSPLLLVPKKSS--------- 362
Query: 818 ELIPTRQVTKWRVCIDYRRLNSVTRKDHFPLPFIDQMLDRLAGHQYYCFLDGYSGYNQIC 877
P KWR+ IDYR++N D FPLP ID +LD+L +Y+ LD SG++QI
Sbjct: 363 ---PNSDKKKWRLVIDYRQINKKLLADKFPLPRIDDILDQLGRAKYFSCLDLMSGFHQIE 419
Query: 878 VAPEDQEKTAFTCPYGVFAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIFMDDFSVF 937
+ ++ T+F+ G + + R+PFGL AP +FQR M FS + + ++MDD V
Sbjct: 420 LDEGSRDITSFSTSNGSYRFTRLPFGLKIAPNSFQRMMTIAFSGIEPSQAFLYMDDLIVI 479
Query: 938 GPNFDACLGNLALVLKRCQETNLVLNWEKCHFMVRDGIVLGHKVSEKGIEVDRAKIEVIE 997
G + L NL V +C+E NL L+ EKC F + + LGHK ++KGI D K +VI+
Sbjct: 480 GCSEKHMLKNLTEVFGKCREYNLKLHPEKCSFFMHEVTFLGHKCTDKGILPDDKKYDVIQ 539
Query: 998 KLTPPTNIKGIRSFLGHAGFYRRFIKDFSKLAKPMTNLLEKEAPFTFDENCLRAFESIKE 1057
P + R F+ +YRRFIK+F+ ++ +T L +K PF + + C +AF +K
Sbjct: 540 NYPVPHDADSARRFVAFCNYYRRFIKNFADYSRHITRLCKKNVPFEWTDECQKAFIHLKS 599
Query: 1058 SLVTAPVIVAPDWSLPFEIMCDASDLALGAVLCQKKERVLYVIYYASRVLNEAQRNYTTT 1117
L+ ++ PD+S F I DAS A GAVL Q + YASR + + N +TT
Sbjct: 600 QLINPTLLQYPDFSKEFCITTDASKQACGAVLTQNHNGHQLPVAYASRAFTKGESNKSTT 659
Query: 1118 EKELLGVVFACEKFRPYILGFKVIVHTDHAALRHLFAKQDSKPRLIRWVLLLQEFDLEII 1177
E+EL + +A FRPYI G V TDH L +LF+ + +L R L L+E++ +
Sbjct: 660 EQELAAIHWAIIHFRPYIYGKHFTVKTDHRPLTYLFSMVNPSSKLTRIRLELEEYNFTVE 719
Query: 1178 DRRGKDNSVADHLSRL 1193
+GKDN VAD LSR+
Sbjct: 720 YLKGKDNHVADALSRI 735
Score = 109 bits (273), Expect = 5e-23
Identities = 95/361 (26%), Positives = 166/361 (45%), Gaps = 20/361 (5%)
Query: 1277 EVDFEKILWHCHGSSY-GGHFSGERTAAKVLQSGFYWPTLHRNSRAFVESCDRCQRTGNI 1335
E + E IL H GGH +T AKV + +YW + + + +V C +CQ+
Sbjct: 890 EKEKEAILSTLHDDPIQGGHTGITKTLAKVKRH-YYWKNMSKYIKEYVRKCQKCQKA-KT 947
Query: 1336 SRRNEMPLKNILEI--ELFDVWGIDFMGPFPPSF-GC*YILVAVDYVSKWVEASALSTND 1392
++ + P+ I E FD +D +GP P S G Y + + ++K++ A ++
Sbjct: 948 TKHTKTPM-TITETPEHAFDRVVVDTIGPLPKSENGNEYAVTLICDLTKYLVAIPIANKS 1006
Query: 1393 SKVVVAFLKKNIFTRFGVPRAIISDGGTHFCNRAFESLLEKYGVKHKVSTPYHPQTSGQV 1452
+K V + ++ ++G + I+D GT + N L + +K+ ST +H QT G V
Sbjct: 1007 AKTVAKAIFESFILKYGPMKTFITDMGTEYKNSIITDLCKYLKIKNITSTAHHHQTVGVV 1066
Query: 1453 EISNRELKRILEKVVNSSRKDWSRKLDDALWAYRTAFKTPIGTSPFHLVFGKACHLPVEL 1512
E S+R L + +++ + DW L ++ + T P+ LVFG+ +LP
Sbjct: 1067 ERSHRTLNEYIRSYISTDKTDWDVWLQYFVYCFNTTQSMVHNYCPYELVFGRTSNLPKHF 1126
Query: 1513 EHKAYWAIRKLNFDWKVASEKRLLQLNELDEFRLRAYESASIYKEKTKKWHDRKILNREF 1572
+K + N D A E + L+ RA + +KEK K+ +D K+ + E
Sbjct: 1127 -NKLHSIEPIYNID-DYAKESKY----RLEVAYARARKLLEAHKEKNKENYDLKVKDIEL 1180
Query: 1573 VSGQLVLLFNSRLRLFPGKLKSRWSGPFVVKRVFPHGAVE-VENPETKNIFTVNGQRLKV 1631
G VLL N KL +++GP+ ++ + + + + N K I V+ RLK
Sbjct: 1181 EVGDKVLLRNE----VGHKLDFKYTGPYKIESIGDNNNITLLTNKNKKQI--VHKDRLKK 1234
Query: 1632 Y 1632
+
Sbjct: 1235 F 1235
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 233 bits (594), Expect = 3e-60
Identities = 146/435 (33%), Positives = 236/435 (53%), Gaps = 30/435 (6%)
Query: 771 MKDVVRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGITVVANENNELIPTRQVTKWRV 830
+ D V E+ +LL G+I P S S + SP VV KKG N N L+
Sbjct: 193 VSDFVNNEVKQLLKDGIIRP-SRSPYNSPTWVVDKKG-TDAFGNPNKRLV---------- 240
Query: 831 CIDYRRLNSVTRKDHFPLPFIDQMLDRLAGHQYYCFLDGYSGYNQICVAPEDQEKTAFTC 890
ID+R+LN T D +P+P I +L L +++ LD SGY+QI +A D+EKT+F+
Sbjct: 241 -IDFRKLNEKTIPDRYPMPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFSV 299
Query: 891 PYGVFAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIFMDDFSVFGPNFDACLGNLAL 950
G + + R+PFGL NA + FQR + + + I +++DD +F N + ++
Sbjct: 300 NGGKYEFCRLPFGLRNASSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHIDT 359
Query: 951 VLKRCQETNLVLNWEKCHFMVRDGIVLGHKVSEKGIEVDRAKIEVIEKLTPPTNIKGIRS 1010
VLK + N+ ++ EK F LG VS+ G + D K++ I++ P + +RS
Sbjct: 360 VLKCLIDANMRVSQEKTRFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRS 419
Query: 1011 FLGHAGFYRRFIKDFSKLAKPMTNLLE-----------KEAPFTFDENCLRAFESIKESL 1059
FLG A +YR FIKDF+ +A+P+T++L+ K+ P F+E AF+ ++ L
Sbjct: 420 FLGLASYYRVFIKDFAAIARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNIL 479
Query: 1060 VTAPVIVA-PDWSLPFEIMCDASDLALGAVLCQKKERVLYVIYYASRVLNEAQRNYTTTE 1118
+ VI+ PD+ PF++ DAS +GAVL Q+ + + SR L + ++NY T E
Sbjct: 480 ASEDVILKYPDFKKPFDLTTDASASGIGAVLSQEGRPITMI----SRTLKQPEQNYATNE 535
Query: 1119 KELLGVVFACEKFRPYILGFKVI-VHTDHAALRHLFAKQDSKPRLIRWVLLLQEFDLEII 1177
+ELL +V+A K + ++ G + I + TDH L A +++ ++ RW + + + ++
Sbjct: 536 RELLAIVWALGKLQNFLYGSREINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQHNAKVF 595
Query: 1178 DRRGKDNSVADHLSR 1192
+ GK+N VAD LSR
Sbjct: 596 YKPGKENFVADALSR 610
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 158 bits (399), Expect = 1e-37
Identities = 129/452 (28%), Positives = 216/452 (47%), Gaps = 27/452 (5%)
Query: 751 ILLEENYKPIVQPQRRLNPSMKDVVRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGIT 810
I L + K I + +P ++ K+I +LLD VI P S S ++P +V
Sbjct: 238 IKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKP-SKSPHMAPAFLV------- 289
Query: 811 VVANENNELIPTRQVTKWRVCIDYRRLNSVTRKDHFPLPFIDQMLDRLAGHQYYCFLDGY 870
NNE R K R+ ++Y+ +N T D + LP D++L + G + + D
Sbjct: 290 -----NNEAEKRRG--KKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCK 342
Query: 871 SGYNQICVAPEDQEKTAFTCPYGVFAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIF 930
SG+ Q+ + E + TAFTCP G + + +PFGL AP+ FQR M F + C ++
Sbjct: 343 SGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVFRKFCC-VY 401
Query: 931 MDDFSVFGPNFDACLGNLALVLKRCQETNLVLNWEKCHFMVRDGIVLGHKVSEKGIEVDR 990
+DD VF N + L ++A++L++C + ++L+ +K + LG ++ E +
Sbjct: 402 VDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQG 461
Query: 991 AKIEVIEKLTPPT--NIKGIRSFLGHAGFYRRFIKDFSKLAKPMTNLLEKEAPFTFDENC 1048
+E I K P T + K ++ FLG + +I +++ KP+ L++ P+ + +
Sbjct: 462 HILEHINKF-PDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKED 520
Query: 1049 LRAFESIKESLVTAPVIVAPDWSLPFEIMCDASDLALG----AVLCQKKERVLYVIYYAS 1104
+ +K++L P + P I DASD G A+ + + YAS
Sbjct: 521 TLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYAS 580
Query: 1105 RVLNEAQRNYTTTEKELLGVVFACEKFRPYILGFKVIVHTDHAALR---HLFAKQDSK-P 1160
A+RNY + +KE L V+ +KF Y+ ++ TD+ + +L K DSK
Sbjct: 581 GSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLG 640
Query: 1161 RLIRWVLLLQEFDLEIIDRRGKDNSVADHLSR 1192
R IRW L + ++ +G DN AD LSR
Sbjct: 641 RNIRWQAWLSHYSFDVEHIKGTDNHFADFLSR 672
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 158 bits (399), Expect = 1e-37
Identities = 129/452 (28%), Positives = 216/452 (47%), Gaps = 27/452 (5%)
Query: 751 ILLEENYKPIVQPQRRLNPSMKDVVRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGIT 810
I L + K I + +P ++ K+I +LLD VI P S S ++P +V
Sbjct: 238 IKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKP-SKSPHMAPAFLV------- 289
Query: 811 VVANENNELIPTRQVTKWRVCIDYRRLNSVTRKDHFPLPFIDQMLDRLAGHQYYCFLDGY 870
NNE R K R+ ++Y+ +N T D + LP D++L + G + + D
Sbjct: 290 -----NNEAEKRRG--KKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCK 342
Query: 871 SGYNQICVAPEDQEKTAFTCPYGVFAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIF 930
SG+ Q+ + E + TAFTCP G + + +PFGL AP+ FQR M F + C ++
Sbjct: 343 SGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVFRKFCC-VY 401
Query: 931 MDDFSVFGPNFDACLGNLALVLKRCQETNLVLNWEKCHFMVRDGIVLGHKVSEKGIEVDR 990
+DD VF N + L ++A++L++C + ++L+ +K + LG ++ E +
Sbjct: 402 VDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQG 461
Query: 991 AKIEVIEKLTPPT--NIKGIRSFLGHAGFYRRFIKDFSKLAKPMTNLLEKEAPFTFDENC 1048
+E I K P T + K ++ FLG + +I +++ KP+ L++ P+ + +
Sbjct: 462 HILEHINKF-PDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKED 520
Query: 1049 LRAFESIKESLVTAPVIVAPDWSLPFEIMCDASDLALG----AVLCQKKERVLYVIYYAS 1104
+ +K++L P + P I DASD G A+ + + YAS
Sbjct: 521 TLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYAS 580
Query: 1105 RVLNEAQRNYTTTEKELLGVVFACEKFRPYILGFKVIVHTDHAALR---HLFAKQDSK-P 1160
A+RNY + +KE L V+ +KF Y+ ++ TD+ + +L K DSK
Sbjct: 581 GSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLG 640
Query: 1161 RLIRWVLLLQEFDLEIIDRRGKDNSVADHLSR 1192
R IRW L + ++ +G DN AD LSR
Sbjct: 641 RNIRWQAWLSHYSFDVEHIKGTDNHFADFLSR 672
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 157 bits (396), Expect = 3e-37
Identities = 128/452 (28%), Positives = 216/452 (47%), Gaps = 27/452 (5%)
Query: 751 ILLEENYKPIVQPQRRLNPSMKDVVRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGIT 810
I L + K I + +P ++ K+I +LLD VI P S S ++P +V
Sbjct: 238 IKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKP-SKSPHMAPAFLV------- 289
Query: 811 VVANENNELIPTRQVTKWRVCIDYRRLNSVTRKDHFPLPFIDQMLDRLAGHQYYCFLDGY 870
NNE R K R+ ++Y+ +N T D + LP D++L + G + + D
Sbjct: 290 -----NNEAEKRRG--KKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCK 342
Query: 871 SGYNQICVAPEDQEKTAFTCPYGVFAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIF 930
SG+ Q+ + E + TAFTCP G + + +PFGL AP+ FQR M F + C ++
Sbjct: 343 SGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVFRKFCC-VY 401
Query: 931 MDDFSVFGPNFDACLGNLALVLKRCQETNLVLNWEKCHFMVRDGIVLGHKVSEKGIEVDR 990
+DD VF N + L ++A++L++C + ++L+ +K + LG ++ E +
Sbjct: 402 VDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQG 461
Query: 991 AKIEVIEKLTPPT--NIKGIRSFLGHAGFYRRFIKDFSKLAKPMTNLLEKEAPFTFDENC 1048
+E I K P T + K ++ FLG + +I +++ KP+ L++ P+ + +
Sbjct: 462 HILEHINKF-PDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWRWTKED 520
Query: 1049 LRAFESIKESLVTAPVIVAPDWSLPFEIMCDASDLALG----AVLCQKKERVLYVIYYAS 1104
+ +K++L P + P I DASD G A+ + + YAS
Sbjct: 521 TLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYAS 580
Query: 1105 RVLNEAQRNYTTTEKELLGVVFACEKFRPYILGFKVIVHTDHAALR---HLFAKQDSK-P 1160
A++NY + +KE L V+ +KF Y+ ++ TD+ + +L K DSK
Sbjct: 581 GSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLG 640
Query: 1161 RLIRWVLLLQEFDLEIIDRRGKDNSVADHLSR 1192
R IRW L + ++ +G DN AD LSR
Sbjct: 641 RNIRWQAWLSHYSFDVEHIKGTDNHFADFLSR 672
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 674
Score = 154 bits (389), Expect = 2e-36
Identities = 127/452 (28%), Positives = 215/452 (47%), Gaps = 27/452 (5%)
Query: 751 ILLEENYKPIVQPQRRLNPSMKDVVRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGIT 810
I L + K I + +P ++ K+I +LLD VI P S S ++P +V
Sbjct: 233 IKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKP-SKSPHMAPAFLV------- 284
Query: 811 VVANENNELIPTRQVTKWRVCIDYRRLNSVTRKDHFPLPFIDQMLDRLAGHQYYCFLDGY 870
NNE R K R+ ++Y+ +N T D + P D++L + G + + D
Sbjct: 285 -----NNEAEKRRG--KKRMVVNYKAMNKATVGDAYNPPNKDELLTLIRGKKIFSSFDCK 337
Query: 871 SGYNQICVAPEDQEKTAFTCPYGVFAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIF 930
SG+ Q+ + E + TAFTCP G + + +PFGL AP+ FQR M F + C ++
Sbjct: 338 SGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVFRKFCC-VY 396
Query: 931 MDDFSVFGPNFDACLGNLALVLKRCQETNLVLNWEKCHFMVRDGIVLGHKVSEKGIEVDR 990
+DD VF N + L ++A++L++C + ++L+ +K + LG ++ E +
Sbjct: 397 VDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQG 456
Query: 991 AKIEVIEKLTPPT--NIKGIRSFLGHAGFYRRFIKDFSKLAKPMTNLLEKEAPFTFDENC 1048
+E I K P T + K ++ FLG + +I +++ KP+ L++ P+ + +
Sbjct: 457 HILEHINKF-PDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKED 515
Query: 1049 LRAFESIKESLVTAPVIVAPDWSLPFEIMCDASDLALG----AVLCQKKERVLYVIYYAS 1104
+ +K++L P + P I DASD G A+ + + YAS
Sbjct: 516 TLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYAS 575
Query: 1105 RVLNEAQRNYTTTEKELLGVVFACEKFRPYILGFKVIVHTDHAALR---HLFAKQDSK-P 1160
A++NY + +KE L V+ +KF Y+ ++ TD+ + +L K DSK
Sbjct: 576 GSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLG 635
Query: 1161 RLIRWVLLLQEFDLEIIDRRGKDNSVADHLSR 1192
R IRW L + ++ +G DN AD LSR
Sbjct: 636 RNIRWQAWLSHYSFDVEHIKGTDNHFADFLSR 667
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 680
Score = 152 bits (385), Expect = 6e-36
Identities = 126/452 (27%), Positives = 215/452 (46%), Gaps = 27/452 (5%)
Query: 751 ILLEENYKPIVQPQRRLNPSMKDVVRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGIT 810
I L + K I + +P ++ K+I +LLD VI P S S ++P +V
Sbjct: 239 IKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKP-SKSPHMAPAFLV------- 290
Query: 811 VVANENNELIPTRQVTKWRVCIDYRRLNSVTRKDHFPLPFIDQMLDRLAGHQYYCFLDGY 870
NNE R R+ ++Y+ +N T D + LP D++L + G + + D
Sbjct: 291 -----NNEAENGRG--NKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCK 343
Query: 871 SGYNQICVAPEDQEKTAFTCPYGVFAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIF 930
SG+ Q+ + E + TAFTCP G + + +PFGL AP+ FQR M F + C ++
Sbjct: 344 SGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVFRKFCC-VY 402
Query: 931 MDDFSVFGPNFDACLGNLALVLKRCQETNLVLNWEKCHFMVRDGIVLGHKVSEKGIEVDR 990
+DD VF N + L ++A++L++C + ++L+ +K + LG ++ E +
Sbjct: 403 VDDIVVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQG 462
Query: 991 AKIEVIEKLTPPT--NIKGIRSFLGHAGFYRRFIKDFSKLAKPMTNLLEKEAPFTFDENC 1048
+E I K P T + K ++ FLG + +I + +++ +P+ L++ P+ + +
Sbjct: 463 HILEHINKF-PDTLEDKKQLQRFLGILTYASDYIPNLAQMRQPLQAKLKENVPWKWTKED 521
Query: 1049 LRAFESIKESLVTAPVIVAPDWSLPFEIMCDASDLALG----AVLCQKKERVLYVIYYAS 1104
+ +K++L P + P I DASD G A+ + + Y S
Sbjct: 522 TLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYRS 581
Query: 1105 RVLNEAQRNYTTTEKELLGVVFACEKFRPYILGFKVIVHTDHAALR---HLFAKQDSK-P 1160
A+RNY + +KE L V+ +KF Y+ ++ TD+ + +L K DSK
Sbjct: 582 GSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLG 641
Query: 1161 RLIRWVLLLQEFDLEIIDRRGKDNSVADHLSR 1192
R IRW L + ++ +G DN AD LSR
Sbjct: 642 RNIRWQAWLSHYSFDVEHIKGTDNHFADFLSR 673
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 666
Score = 151 bits (382), Expect = 1e-35
Identities = 121/438 (27%), Positives = 211/438 (47%), Gaps = 36/438 (8%)
Query: 768 NPSMKDVVRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGITVVANENNELIPTRQVTK 827
+P ++ K+I +LLD G+I P S S+ +SP +V + R+ K
Sbjct: 248 SPQDREGFAKQIKELLDLGLIIP-SKSQHMSPAFLVENEA--------------ERRRGK 292
Query: 828 WRVCIDYRRLNSVTRKDHFPLPFIDQMLDRLAGHQYYCFLDGYSGYNQICVAPEDQEKTA 887
R+ ++Y+ +N T D LP + ++L L G + D SG+ Q+ + E Q+ TA
Sbjct: 293 KRMVVNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKSGFWQVVLDEESQKLTA 352
Query: 888 FTCPYGVFAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIFMDDFSVFGPNFDACLGN 947
FTCP G F +K +PFGL AP+ FQR M + + C+ +++DD VF + +
Sbjct: 353 FTCPQGHFQWKVVPFGLKQAPSIFQRHMQTALNGADKFCM-VYVDDIIVFSNSELDHYNH 411
Query: 948 LALVLKRCQETNLVLNWEKCHFMVRDGIVLGHKVSEKGIEVDRAK-------IEVIEKLT 1000
+ VLK ++ ++L+ +K + + K++ G+E+D+ +E I K
Sbjct: 412 VYAVLKIVEKYGIILSKKKAN-------LFKEKINFLGLEIDKGTHCPQNHILENIHKFP 464
Query: 1001 PP-TNIKGIRSFLGHAGFYRRFIKDFSKLAKPMTNLLEKEAPFTFDENCLRAFESIKESL 1059
+ K ++ FLG + +I +++ KP+ L+K+ + + ++ + IK++L
Sbjct: 465 DRLEDKKHLQRFLGVLTYAETYIPKLAEIRKPLQVKLKKDVTWNWTQSDSDYVKKIKKNL 524
Query: 1060 VTAPVIVAPDWSLPFEIMCDASDLALGAVL-CQKKERVLYVIYYASRVLNEAQRNYTTTE 1118
+ P + P I DASD G VL + + V + Y+S +A++NY + +
Sbjct: 525 GSFPKLYLPKPEDHLIIETDASDSFWGGVLKARALDGVELICRYSSGSFKQAEKNYHSND 584
Query: 1119 KELLGVVFACEKFRPYILGFKVIVHTDHAALRHLF---AKQDSKP-RLIRWVLLLQEFDL 1174
KELL V KF Y+ + V TD+ + K DSK RL+RW ++
Sbjct: 585 KELLAVKQVITKFSAYLTPVRFTVRTDNKNFTYFLRINLKGDSKQGRLVRWQNWFSKYQF 644
Query: 1175 EIIDRRGKDNSVADHLSR 1192
++ G N +AD L+R
Sbjct: 645 DVEHLEGVKNVLADCLTR 662
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 659
Score = 151 bits (382), Expect = 1e-35
Identities = 119/438 (27%), Positives = 210/438 (47%), Gaps = 39/438 (8%)
Query: 768 NPSMKDVVRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGITVVANENNELIPTRQVTK 827
+PS ++ ++I +LL+ VI P S S +SP +V + R+ K
Sbjct: 237 SPSDREEFDRQIKELLELKVIKP-SKSTHMSPAFLVENEA--------------ERRRGK 281
Query: 828 WRVCIDYRRLNSVTRKDHFPLPFIDQMLDRLAGHQYYCFLDGYSGYNQICVAPEDQEKTA 887
R+ ++Y+ +N T+ D LP D++L + G + Y D SG Q+ + E Q TA
Sbjct: 282 KRMVVNYKAMNKATKGDAHNLPNKDELLTLVRGKKIYSSFDCKSGLWQVLLDKESQLLTA 341
Query: 888 FTCPYGVFAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIFMDDFSVFG-PNFDACLG 946
FTCP G + + +PFGL AP+ F + S+ +++DD VF
Sbjct: 342 FTCPQGHYQWNVVPFGLKQAPSIFPKTYANSHSNQYSKYCCVYVDDILVFSNTGRKEHYI 401
Query: 947 NLALVLKRCQETNLVLNWEKCHFMVRDGIVLGHKVSEKGIEVDRAK-------IEVIEKL 999
++ +L+RC++ ++L+ +K + K++ G+E+D+ +E I K
Sbjct: 402 HVLNILRRCEKLGIILSKKKAQ-------LFKEKINFLGLEIDQGTHCPQNHILEHIHKF 454
Query: 1000 TPPTNI---KGIRSFLGHAGFYRRFIKDFSKLAKPMTNLLEKEAPFTFDENCLRAFESIK 1056
P I K ++ FLG + +I + + KP+ + L++++ +T+++ + IK
Sbjct: 455 --PDRIEDKKQLQRFLGILTYASDYIPKLASIRKPLQSKLKEDSTWTWNDTDSQYMAKIK 512
Query: 1057 ESLVTAPVIVAPDWSLPFEIMCDASDLALGAVLCQKKERVLYVIYYASRVLNEAQRNYTT 1116
++L + P + P+ + I DAS+ G +L Y+ YAS A+RNY +
Sbjct: 513 KNLKSFPKLYHPEPNDKLVIETDASEEFWGGILKAIHNSHEYICRYASGSFKAAERNYHS 572
Query: 1117 TEKELLGVVFACEKFRPYILGFKVIVHTDHAALRH---LFAKQDSKP-RLIRWVLLLQEF 1172
EKELL V+ +KF Y+ + ++ TD+ H + K D K RL+RW + L ++
Sbjct: 573 NEKELLAVIRVIKKFSIYLTPSRFLIRTDNKNFTHFVNINLKGDRKQGRLVRWQMWLSQY 632
Query: 1173 DLEIIDRRGKDNSVADHL 1190
D ++ G N AD L
Sbjct: 633 DFDVEHIAGTKNVFADFL 650
>POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;
Protease (EC 3.4.23.-); Reverse transcriptase (EC
2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1886
Score = 131 bits (330), Expect = 1e-29
Identities = 148/575 (25%), Positives = 250/575 (42%), Gaps = 62/575 (10%)
Query: 639 LERELDSLYHEVNATLSQLESVSSLATKSIWKEELTRDEEIPIEEKSELKSLPSSLKYAY 698
+E+++ + Y V + + S ++ SI + EL+ DE + I E PS L +
Sbjct: 1306 IEKDIITFYKLVTSIET---SRTTQVANSIEELELSEDEYLNIAASVET---PSFLDQEF 1359
Query: 699 LEEGENKPVILNSVLTPLKEEKLLKVLRDHKSALGWTIDDIKGISPAICMHKILLEENYK 758
+ ++ LKE K +K + ++ W + IK C I+ + K
Sbjct: 1360 ARKNKDL----------LKEMKEMKYIGENPMEF-WKNNKIK------CKLNII-NPDIK 1401
Query: 759 PIVQPQRRLNPSMKDVVRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGITVVANENNE 818
+ +P + + P ++ + ++I LL VI P S+S+ S +V I + + +
Sbjct: 1402 IMGRPIKHVTPGDEEAMTRQINLLLQMKVIRP-SESKHRSTAFIVRSGTEIDPITGKEKK 1460
Query: 819 LIPTRQVTKWRVCIDYRRLNSVTRKDHFPLPFIDQMLDRLAGHQYYCFLDGYSGYNQICV 878
K R+ +Y+ LN T D + LP I+ ++ ++ + Y D SG+ Q+ +
Sbjct: 1461 -------GKERMVFNYKLLNENTESDQYSLPGINTIISKVGRSKIYSKFDLKSGFWQVAM 1513
Query: 879 APEDQEKTAFTCPYGVFAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIFMDDFSVFG 938
E TAF ++ + MPFGL NAPA FQR M +F E I +++DD VF
Sbjct: 1514 EEESVPWTAFLAGNKLYEWLVMPFGLKNAPAIFQRKMDNVFKG-TEKFIAVYIDDILVFS 1572
Query: 939 PNFDACLGNLALVLKRCQETNLVLNWEKCHFMVRDGIVLGHKVSEKGIEVDRAKIEVI-- 996
+ +L +L+ C+E L+L+ K + LG + I++ I I
Sbjct: 1573 ETAEQHSQHLYTMLQLCKENGLILSPTKMKIGTPEIDFLGASLGCTKIKLQPHIISKICD 1632
Query: 997 ---EKLTPPTNIKGIRSFLGHAGFYRRFIKDFSKLAKPMTNLLEKEAPFTFDENCLRAFE 1053
EKL P +G+RS+LG + R +I+D KL +P+ + + +
Sbjct: 1633 FSDEKLATP---EGMRSWLGILSYARNYIQDIGKLVQPLRQKMAPTGDKRMNPETWKMVR 1689
Query: 1054 SIKESLVTAPVIVAPDWSLPFEIMCDASDLALGAVLCQKKER-----VLYVIYYASRVLN 1108
IKE + P + P I D GAV K + + YAS N
Sbjct: 1690 QIKEKVKNLPDLQLPPKDSFIIIETDGCMTGWGAVCKWKMSKHDPRSTERICAYASGSFN 1749
Query: 1109 EAQRNYTTTEKELLGVVFACEKFRPYILGFK-VIVHTDHAALRHLFAK-QDSKPRLIRWV 1166
+ +T + E+ + +KF+ Y L K +I+ +D A+ + K ++KP +RW
Sbjct: 1750 PIK---STIDAEIQAAIHGLDKFKIYYLDKKELIIRSDCEAIIKFYNKTNENKPSRVRW- 1805
Query: 1167 LLLQEF--------DLEIIDRRGKDNSVADHLSRL 1193
L +F E ID GK N +AD LSR+
Sbjct: 1806 LTFSDFLTGLGITVTFEHID--GKHNGLADALSRM 1838
>POL_FENV1 (P31792) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)] (Fragment)
Length = 1046
Score = 125 bits (315), Expect = 7e-28
Identities = 112/422 (26%), Positives = 181/422 (42%), Gaps = 35/422 (8%)
Query: 741 GISPAICMHKILLEENYKPIVQPQR-RLNPSMKDV---VRKEIIKLLDAGVIYPISDSEW 796
G+ A C I+++ KP P R P K+ ++ I + L+ GV+ P S W
Sbjct: 13 GLGRAKCQVPIIID--LKPTAMPVSIRQYPMSKEAHMGIQPHITRFLELGVLRPCR-SPW 69
Query: 797 VSPVQVVPKKGGITVVANENNELIPTRQVTKWRVCIDYRRLNSVTRKDHFPLPFIDQMLD 856
+P+ V K G TR +R D R +N T H +P +L
Sbjct: 70 NTPLLPVKKPG--------------TRD---YRPVQDLREVNKRTMDIHPTVPNPYNLLS 112
Query: 857 RLAGHQ-YYCFLDGYSGYNQICVAPEDQEKTAFTCP------YGVFAYKRMPFGLCNAPA 909
L+ + +Y LD + + +AP+ QE AF G + R+P G N+P
Sbjct: 113 TLSPDRTWYTVLDLKDAFFCLPLAPQSQELFAFEWRDPERGISGQLTWTRLPQGFKNSPT 172
Query: 910 TFQRCMFAIFSDLIETCIEI----FMDDFSVFGPNFDACLGNLALVLKRCQETNLVLNWE 965
F + +D E+ ++DD + P +AC+ +L+ + + +
Sbjct: 173 LFDEALHRDLTDFRTQHPEVTLLQYVDDLLLAAPTKEACIRGTKHLLRELGDKGYRASAK 232
Query: 966 KCHFMVRDGIVLGHKVSEKGIEVDRAKIEVIEKLTPPTNIKGIRSFLGHAGFYRRFIKDF 1025
K LG+ +SE + +IE + + PP N + +R FLG AGF R +I F
Sbjct: 233 KAQICQTKVTYLGYILSEGKRWLTPGRIETVAHIPPPQNPREVREFLGTAGFCRLWIPGF 292
Query: 1026 SKLAKPMTNLLEKEAPFTFDENCLRAFESIKESLVTAPVIVAPDWSLPFEIMCDASDLAL 1085
++LA P+ L ++ APFT+ E AFE++KE+L++AP + PD S PF + D
Sbjct: 293 AELAAPLYALTKESAPFTWQEKHQSAFEALKEALLSAPALGLPDTSKPFTLFIDEKQGIA 352
Query: 1086 GAVLCQKKERVLYVIYYASRVLNEAQRNYTTTEKELLGVVFACEKFRPYILGFKVIVHTD 1145
VL QK + Y S+ L+ + + + + LG + V T
Sbjct: 353 KGVLTQKLGPWKRPVAYLSKKLDPVAAGWPPCLRIMAATAMLVKDSAKLTLGQPLTVITP 412
Query: 1146 HA 1147
HA
Sbjct: 413 HA 414
Score = 91.7 bits (226), Expect = 2e-17
Identities = 55/150 (36%), Positives = 77/150 (50%), Gaps = 2/150 (1%)
Query: 1355 WGIDFMGPFPPSFGC*YILVAVDYVSKWVEASALSTNDSKVVVAFLKKNIFTRFGVPRAI 1414
W IDF P G Y+LV VD S WVEA + +V + + IF RFG+P+ I
Sbjct: 765 WEIDFTEVKPHYAGYKYLLVFVDTFSGWVEAYPTRQETAHMVAKKILEEIFPRFGLPKVI 824
Query: 1415 ISDGGTHFCNRAFESLLEKYGVKHKVSTPYHPQTSGQVEISNRELKRILEKV-VNSSRKD 1473
SD G F ++ + L G+ K+ Y PQ+SGQVE NR +K L K+ + + KD
Sbjct: 825 GSDNGPAFVSQVSQGLARTLGINWKLHCAYRPQSSGQVERMNRTIKETLTKLTLETGLKD 884
Query: 1474 WSRKLDDALWAYRTAFKTPIGTSPFHLVFG 1503
W R L AL R G +P+ +++G
Sbjct: 885 WRRLLSLALLRARNT-PNRFGLTPYEILYG 913
>POL_BAEVM (P10272) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1189
Score = 121 bits (303), Expect = 2e-26
Identities = 115/451 (25%), Positives = 187/451 (40%), Gaps = 38/451 (8%)
Query: 741 GISPAICMHKILLEENYKPIVQP----QRRLNPSMKDVVRKEIIKLLDAGVIYPISDSEW 796
G+ A C I+++ KP P Q ++ +R+ IIK L+ GV+ P S W
Sbjct: 156 GLGRAKCQAPIIID--LKPTAVPVSIKQYPMSLEAHMGIRQHIIKFLELGVLRPCR-SPW 212
Query: 797 VSPVQVVPKKGGITVVANENNELIPTRQVTKWRVCIDYRRLNSVTRKDHFPLPFIDQMLD 856
+P+ V K G +R D R +N T H +P +L
Sbjct: 213 NTPLLPVKKPG-----------------TQDYRPVQDLREINKRTVDIHPTVPNPYNLLS 255
Query: 857 RLA-GHQYYCFLDGYSGYNQICVAPEDQEKTAFTCP------YGVFAYKRMPFGLCNAPA 909
L + +Y LD + + +AP+ QE AF G + R+P G N+P
Sbjct: 256 TLKPDYSWYTVLDLKDAFFCLPLAPQSQELFAFEWKDPERGISGQLTWTRLPQGFKNSPT 315
Query: 910 TFQRCMFAIFSDLIETCIEI----FMDDFSVFGPNFDACLGNLALVLKRCQETNLVLNWE 965
F + +D E+ ++DD + P AC +L+ E + +
Sbjct: 316 LFDEALHRDLTDFRTQHPEVTLLQYVDDLLLAAPTKKACTQGTRHLLQELGEKGYRASAK 375
Query: 966 KCHFMVRDGIVLGHKVSEKGIEVDRAKIEVIEKLTPPTNIKGIRSFLGHAGFYRRFIKDF 1025
K LG+ +SE + +IE + ++ PP N + +R FLG AGF R +I F
Sbjct: 376 KAQICQTKVTYLGYILSEGKRWLTPGRIETVARIPPPRNPREVREFLGTAGFCRLWIPGF 435
Query: 1026 SKLAKPMTNLLEKEAPFTFDENCLRAFESIKESLVTAPVIVAPDWSLPFEIMCDASDLAL 1085
++LA P+ L ++ PFT+ AFE++K++L++AP + PD S PF + D
Sbjct: 436 AELAAPLYALTKESTPFTWQTEHQLAFEALKKALLSAPALGLPDTSKPFTLFLDERQGIA 495
Query: 1086 GAVLCQKKERVLYVIYYASRVLNEAQRNYTTTEKELLGVVFACEKFRPYILGFKVIV--- 1142
VL QK + Y S+ L+ + + + + LG + V
Sbjct: 496 KGVLTQKLGPWKRPVAYLSKKLDPVAAGWPPCLRIMAATAMLVKDSAKLTLGQPLTVITP 555
Query: 1143 HTDHAALRHLFAKQDSKPRLIRWVLLLQEFD 1173
HT A +R + + RL + LL + D
Sbjct: 556 HTLEAIVRQPPDRWITNARLTHYQALLLDTD 586
Score = 91.7 bits (226), Expect = 2e-17
Identities = 55/150 (36%), Positives = 77/150 (50%), Gaps = 2/150 (1%)
Query: 1355 WGIDFMGPFPPSFGC*YILVAVDYVSKWVEASALSTNDSKVVVAFLKKNIFTRFGVPRAI 1414
W IDF P G Y+LV VD S WVEA + +V + + IF RFG+P+ I
Sbjct: 908 WEIDFTEVKPHYAGYKYLLVFVDTFSGWVEAFPTRQETAHIVAKKILEEIFPRFGLPKVI 967
Query: 1415 ISDGGTHFCNRAFESLLEKYGVKHKVSTPYHPQTSGQVEISNRELKRILEKV-VNSSRKD 1473
SD G F ++ + L G+ K+ Y PQ+SGQVE NR +K L K+ + + KD
Sbjct: 968 GSDNGPAFVSQVSQGLARILGINWKLHCAYRPQSSGQVERMNRTIKETLTKLTLETGLKD 1027
Query: 1474 WSRKLDDALWAYRTAFKTPIGTSPFHLVFG 1503
W R L AL R G +P+ +++G
Sbjct: 1028 WRRLLSLALLRARNT-PNRFGLTPYEILYG 1056
>POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat
protein; Protease (EC 3.4.23.-); Reverse transcriptase
(EC 2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1675
Score = 118 bits (295), Expect = 2e-25
Identities = 132/546 (24%), Positives = 239/546 (43%), Gaps = 41/546 (7%)
Query: 695 KYAYLEEGENKPVILNSVLTPLKEEKLLKVLRDHKSALGWTIDDI-KGISPAICMHKILL 753
K +L + +P++ + + L+ ++++ + ALG+ DDI K + +C KI+
Sbjct: 1123 KIPHLHSYQPQPIL--GYKNEIGNQSLITMVKELE-ALGFIGDDITKNRTTWVCDFKIIN 1179
Query: 754 EE-NYKPIVQPQRRLNPSMKDVVRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGITVV 812
+ N P P+ K+V K+I +LLD +I + + +V
Sbjct: 1180 PDINITCATIPY---TPADKEVFEKQIKELLDNKLIKKADPT--------CRHRTAAFIV 1228
Query: 813 ANENNELIPTRQVTKWRVCIDYRRLNSVTRKDHFPLPFIDQMLDRLAGHQYYCFLDGYSG 872
N + E+ K R+ +Y+RLN D F +P M++ + + D +G
Sbjct: 1229 RNHSEEV-----AQKPRIVYNYKRLNDNMHTDPFNIPHKISMINLIQKANIFSKFDLKAG 1283
Query: 873 YNQICVAPEDQEKTAFTCPYGVFAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIFMD 932
++ + + + ++ T FTC G++ + PFG+ NAP FQR M F DL +++D
Sbjct: 1284 FHHMKLKDDFKDWTTFTCSEGLYTWNVCPFGIANAPCAFQRFMQESFGDL--KFALLYID 1341
Query: 933 DFSVFGPNFDACLGNLALVLKRCQETNLVLNWEKCHFMVRDGIVLGHKVSEKGIEVDRAK 992
D + N + +L + R +E VL+ +K +++ LG ++ E I +
Sbjct: 1342 DILIASNNEKEHIEHLKIFFNRVKEVGCVLSKKKSKMFLKEVEYLGVEIKEGKISLQPHI 1401
Query: 993 IEVIEKL--TPPTNIKGIRSFLGHAGFYRRFIKDFSKLAKPMTNLLEKEAPFTFDENCLR 1050
++ I+K +KG++++LG + R +IKD SKL P+ K F++
Sbjct: 1402 VDKIKKFDKNKLNTLKGLQAYLGLLNYARGYIKDLSKLVGPLYKKTGKNGQRIFNKEDWN 1461
Query: 1051 AFESIKESLVTAPVIVAPDWSLPFEIMCDASDLALGAVLCQKKER-----VLYVIYYASR 1105
I+ + + P + I DAS+ GAVL K ++ + YAS
Sbjct: 1462 IIFKIEREVSKIKPLERPKETDYIIIETDASEEGWGAVLVCKPDKYSGKDTEKIAGYASG 1521
Query: 1106 VLNEAQRNYTTTEKELLGVVFACEKFRPYILGFKVIVHTDHAALRHLFAKQDSKPR-LIR 1164
E ++ +T+ + E+ + A KF+ Y L + TD A+ +D K R R
Sbjct: 1522 NFGE-KKTWTSLDYEIEAINEALNKFQIY-LDKDFTIRTDCEAIVKGIKTEDYKKRSKTR 1579
Query: 1165 WV-----LLLQEFDLEIIDRRGKDNSVADHLSRLEGGACSPIPIQEEFSDEKLLAVSTKE 1219
W+ LL + +G N + + LSR EG +Q S E ++ + E
Sbjct: 1580 WIKLRDNLLKDGYKPTFEHIKGNKNFLPNFLSR-EGDFILKC-LQNPDSTES-YSIDSSE 1636
Query: 1220 PLPWYV 1225
+P Y+
Sbjct: 1637 SIPLYI 1642
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.320 0.137 0.406
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 193,860,764
Number of Sequences: 164201
Number of extensions: 8444329
Number of successful extensions: 24500
Number of sequences better than 10.0: 233
Number of HSP's better than 10.0 without gapping: 126
Number of HSP's successfully gapped in prelim test: 113
Number of HSP's that attempted gapping in prelim test: 23800
Number of HSP's gapped (non-prelim): 617
length of query: 1648
length of database: 59,974,054
effective HSP length: 124
effective length of query: 1524
effective length of database: 39,613,130
effective search space: 60370410120
effective search space used: 60370410120
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 73 (32.7 bits)
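Note on the statistics above: the effective lengths, the bit scores and the Expect values can be recomputed from the figures printed in this footer. The sketch below is not part of the BLAST output. It assumes the standard Karlin-Altschul relations, bit score = (Lambda * raw score - ln K) / ln 2 and E = search space * 2^(-bit score), and it assumes the effective lengths are obtained by subtracting the effective HSP length once from the query and once per database sequence, which is consistent with the numbers reported here. All function and constant names are illustrative. Because Lambda and K are printed to only three significant digits, the Expect value is reproduced only approximately.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LEN = 1648
DB_LETTERS = 59_974_054
DB_SEQUENCES = 164_201
EFFECTIVE_HSP_LEN = 124
GAPPED_LAMBDA = 0.267   # "Gapped Lambda" as printed (rounded)
GAPPED_K = 0.0410       # "Gapped K" as printed (rounded)


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space).

    Assumption: the HSP length is subtracted once from the query and
    once for every database sequence.
    """
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    eff_q, eff_db, space = effective_search_space(
        QUERY_LEN, DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LEN
    )
    print("effective query length:", eff_q)
    print("effective database length:", eff_db)
    print("effective search space:", space)

    # POL_CAMVS hit: "Score = 157 bits (396), Expect = 3e-37"
    bits = bit_score(396, GAPPED_LAMBDA, GAPPED_K)
    print("bit score for raw score 396: %.1f" % bits)
    print("approximate Expect: %.1e" % expect_value(bits, space))
```

The same bit_score function applied to the S2 raw score of 73 gives the 32.7 bits printed above. The S1 value (41 raw, 21.8 bits) corresponds to the ungapped Lambda and K (0.320 and 0.137) rather than the gapped ones.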
Lotus: description of TM0335b.5