
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0330.2
(1649 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 321 1e-86
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 292 6e-78
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 283 2e-75
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 281 1e-74
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 278 7e-74
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 278 7e-74
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 276 4e-73
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 266 5e-70
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 230 3e-59
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 158 1e-37
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 158 1e-37
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 157 2e-37
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 154 1e-36
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot... 154 3e-36
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 153 4e-36
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot... 152 1e-35
POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;... 130 2e-29
POL_FENV1 (P31792) Pol polyprotein [Contains: Reverse transcript... 124 2e-27
POL_BAEVM (P10272) Pol polyprotein [Contains: Protease (EC 3.4.2... 122 8e-27
POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23... 115 8e-25
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 321 bits (822), Expect = 1e-86
Identities = 263/932 (28%), Positives = 436/932 (46%), Gaps = 94/932 (10%)
Query: 718 EEEKLLRVLRDHKSALGWTIDDIKGISPAICMHKILLEENYKPIVQPQRRLNPSMKDVVR 777
++ K+ V+ + + D++ S C+ I L+E +PI Q R + ++K +R
Sbjct: 902 DDRKIWDVIEQFQDVFAISDDELGRNSGTECV--IELKEGAEPIRQKPRPIPLALKPEIR 959
Query: 778 KEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGITVVANENNELIPTRQVTKWRVCIDYRR 837
K I K+L+ VI S S W SPV +V KK G R+CIDYR+
Sbjct: 960 KMIQKMLNQKVIRE-SKSPWSSPVVLVKKKDGSI------------------RMCIDYRK 1000
Query: 838 LNSVTRKDHFPLPFIDQMLDKLAGHQYYCFLDGYSGYNQICVAPEDQEKTAFTCPYDVFA 897
+N V + + PLP I+ L LAG + Y D +G+ QI + + +E TAF ++F
Sbjct: 1001 VNKVVKNNAHPLPNIEATLQSLAGKKLYTVFDMIAGFWQIPLDEKSKEITAFAIGSELFE 1060
Query: 898 YKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIFMDDFSVFGPNFDACLGNLALVLKRCQ 957
+ +PFGL +PA FQ M I DL+ C +++DD + + + L ++ L R +
Sbjct: 1061 WNVLPFGLVISPALFQGTMEEIIGDLLGVCAFVYVDDLLIASKDMEQHLQDVKEALTRIR 1120
Query: 958 ETNLVLNWEKCHFMVRDGIVLGHKVSERGIEVDRAKIEVIEKLPPPTNIKGIRSFLGHAG 1017
++ + L KCH ++ LGHKV+ G+E K + +++ PTN+K ++SFLG G
Sbjct: 1121 KSGMKLRASKCHIAKKEVEYLGHKVTLDGVETQEVKTDKMKQFSRPTNVKELQSFLGLVG 1180
Query: 1018 FYRRFIKDFSKLAKPMTNLLEKEAPFTFDENCLKAFESIKKSLVTAPVIVAPD------W 1071
+YR+FI +F+++A +T+L+ + + +++ AF+ +KK + PV+ PD
Sbjct: 1181 YYRKFILNFAQIASSLTSLISAKVAWIWEKEQEIAFQELKKLVCQTPVLAQPDVEAALKG 1240
Query: 1072 SLPFEIMCDASDLALGAVLCQK-KERVLYVIYYASTVLNEAQRNYTTTEKELLGVVFACE 1130
PF I DAS +GAVL Q+ + + I +AS L+ A+ Y T+ E L ++FA
Sbjct: 1241 DRPFMIYTDASRKGIGAVLAQEGPDGQQHPIAFASKALSPAETRYHITDLEALAMMFALR 1300
Query: 1131 KFRPYILGFKVVVHTDHAALRHLFAKQDSKPRLIRWVLLLQEFDLEIIDRRGKDNGVADH 1190
+F+ I G + V TDH L L RL RW + + EFD++I+ GK N VAD
Sbjct: 1301 RFKTIIYGTAITVFTDHKPLISLLKGSPLADRLWRWSIEILEFDVKIVYLAGKANAVADA 1360
Query: 1191 LSRLEGGACSPIPIQEEFPDEK---LLAVSTEEPLPWYVHFANFRVAGLIPHDLTWQQKK 1247
LSR G C P ++EE E + A+ TE P + ++ + L D W++
Sbjct: 1361 LSR---GGCPPNELEEEQTKELTSIVNAIQTELP---DILDSSCWLERLKGEDEGWKEVI 1414
Query: 1248 KFLHDAKSYLWDDPFLFKICS-------------DGVI---------RRCIPEVNFEKIL 1285
L K+ FKI GV+ R +PE +L
Sbjct: 1415 AALEGGKT-----KGTFKIVGIESEISLEYYKIVGGVLKNTEIEEQSRSVVPEKIRTPLL 1469
Query: 1286 WYCHGSSYGGHFSGERTAAKVLQSGFYWPTLNRDSRAFVESCDRCQRTGNISR------- 1338
H GHF G + +++ FYWP + V +C +C + S+
Sbjct: 1470 KELHEGMLAGHF-GIKKMWRMVHRKFYWPQMRVCVENCVRTCAKCLCANDHSKLTSSLTP 1528
Query: 1339 -RNEMPLKNILEIELFDVWGIDFMGPFPPSFGCQYILLAVDYVSKWVEAAALSTNDSKVV 1397
R PL+ I+ +L DV G+ G +YIL +D +K+ A + ++ V
Sbjct: 1529 YRMTFPLE-IVACDLMDV-GLSVQGN-------RYILTIIDLFTKYGTAVPIPDKKAETV 1579
Query: 1398 V-AFLKKNIFTRFGVPRAIISDGGTHFCNRAFESLLEKYGVKHKVSTPYHPQTSGQVEIS 1456
+ AF+++ +P +++D G F N F ++H + Y+ + +G VE
Sbjct: 1580 LKAFVERWAIGEGRIPLKLLTDQGKEFVNGLFAQFTHMLKIEHITTKGYNSRANGAVERF 1639
Query: 1457 NRELKRILEKVVDSSRKDWSRKLDDALWAYRTAFKTPIGTSPFHLVFGKACHLPVELEHK 1516
N+ + I++K + +W ++ A++AY G +P L+ G+ P+E+ +
Sbjct: 1640 NKTIMHIMKKKT-AVPMEWDDQVVYAVYAYNNCVHENTGETPMFLMHGRDVMGPLEMSGE 1698
Query: 1517 AYWAIRKLNFDWKVASEKRLLQLNELDEFRLRAYESASIYKEKTKKWHDRKILNREF--- 1573
I + D E + L EL + + A E A +E K D+K +++
Sbjct: 1699 DAVGINYADMD-----EYKHLLTQELLKVQKIAKEHAMREQESYKSLFDQKYASKKHRFP 1753
Query: 1574 VSGQLVLLF--NSRLRLFPGKLKSRWSGPFVV 1603
G VLL + +L KL ++WSGP+ V
Sbjct: 1754 QPGSRVLLEIPSEKLGAQCPKLVNKWSGPYRV 1785
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 292 bits (747), Expect = 6e-78
Identities = 233/772 (30%), Positives = 363/772 (46%), Gaps = 78/772 (10%)
Query: 776 VRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGITVVANENNELIPTRQVTKWRVCIDY 835
V +I +L+ G+I S+S + SP+ VVPKK + K+R+ IDY
Sbjct: 223 VESQIQDMLNQGIIRT-SNSPYNSPIWVVPKKQDAS-------------GKQKFRIVIDY 268
Query: 836 RRLNSVTRKDHFPLPFIDQMLDKLAGHQYYCFLDGYSGYNQICVAPEDQEKTAFTCPYDV 895
R+LN +T D P+P +D++L KL Y+ +D G++QI + PE KTAF+ +
Sbjct: 269 RKLNEITVGDRHPIPNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGH 328
Query: 896 FAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIFMDDFSVFGPNFDACLGNLALVLKR 955
+ Y RMPFGL NAPATFQRCM I L+ +++DD VF + D L +L LV ++
Sbjct: 329 YEYLRMPFGLKNAPATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEK 388
Query: 956 CQETNLVLNWEKCHFMVRDGIVLGHKVSERGIEVDRAKIEVIEKLPPPTNIKGIRSFLGH 1015
+ NL L +KC F+ ++ LGH ++ GI+ + KIE I+K P PT K I++FLG
Sbjct: 389 LAKANLKLQLDKCEFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGL 448
Query: 1016 AGFYRRFIKDFSKLAKPMTNLLEKEAPF-TFDENCLKAFESIKKSLVTAPVIVAPDWSLP 1074
G+YR+FI +F+ +AKPMT L+K T + AF+ +K + P++ PD++
Sbjct: 449 TGYYRKFIPNFADIAKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKK 508
Query: 1075 FEIMCDASDLALGAVLCQKKERVLYVIYYASTVLNEAQRNYTTTEKELLGVVFACEKFRP 1134
F + DASD+ALGAVL Q + Y+ S LNE + NY+T EKELL +V+A + FR
Sbjct: 509 FTLTTDASDVALGAVLSQDGHPLSYI----SRTLNEHEINYSTIEKELLAIVWATKTFRH 564
Query: 1135 YILGFKVVVHTDHAALRHLFAKQDSKPRLIRWVLLLQEFDLEIIDRRGKDNGVADHLSR- 1193
Y+LG + +DH L L+ +D +L RW + L EFD +I +GK+N VAD LSR
Sbjct: 565 YLLGRHFEISSDHQPLSWLYRMKDPNSKLTRWRVKLSEFDFDIKYIKGKENCVADALSRI 624
Query: 1194 -LEGGACSPIPIQEEFPDEKLLAVSTEEPLPWY---------------VHFANFRVAGLI 1237
LE S D L TE PL + + + +
Sbjct: 625 KLEETYLSEQTQHSAEEDNSDLIFITERPLNTFNRQVIFSKGPPDIKVTKYFKKHITQIF 684
Query: 1238 PHDLTWQQKKKFLHD----AKSYLW------------------DDPFLFKICSDGVIRRC 1275
+T ++ +++L D KS L+ + + + S +++
Sbjct: 685 YDIMTREKAEQYLIDHFCGKKSALYIESDADFEVIQAAHKLAINTKYTKILRSTILLKNI 744
Query: 1276 IPEVNFEKILWYCHGSSYGGHFSGERTAAKVLQSGFYWPTLNRDSRAFVESCDRCQRTGN 1335
F++++ H G + K+ +Y+P + + C C
Sbjct: 745 TTYAEFKELILTAHEKLL---HPGIQKTTKLFGETYYFPNSQLLIQNIINECSICNLAKT 801
Query: 1336 ISRRNEMPLKNILEIELFDVWGIDFMGPFPPSFGCQYILLAVDYVSKWVEAAALSTNDSK 1395
R +MP K + E FM S G Y+ +D SK+ + T D
Sbjct: 802 EHRNTDMPTKTTPKPEHCRE---KFMIDIYSSEGKHYV-SCIDIYSKFATLEEIKTKDWI 857
Query: 1396 VVVAFLKKNIFTRFGVPRAIISDGGTHFCNRAFESLLEKYGVKHKVSTPYHPQTSGQVEI 1455
L + IF + G P+ + +D F + A + LE V+ +++T T V
Sbjct: 858 ECKNALMR-IFNQLGKPKLLKADRDGAFSSLALKRWLESEEVELQLNT-----TKTGVAD 911
Query: 1456 SNRELKRILEKV-VDSSRKDWSRKLDDA-----LWAYRTAFKTPIGTSPFHL 1501
R K I EK+ + + D KL ++ ++T T G +P H+
Sbjct: 912 IERLHKTINEKIRIIKTSDDEETKLSKMETVLNIYNHKTKHDT-TGQTPAHI 962
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 283 bits (725), Expect = 2e-75
Identities = 238/827 (28%), Positives = 387/827 (46%), Gaps = 88/827 (10%)
Query: 752 ILLEENYKPIVQPQRRLNPSMKDVVRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGIT 811
+L + PI Q L + + V ++ ++L+ G+I S+S + SP VVPKK +
Sbjct: 198 VLNTTHNSPIYSKQYPLAQTHEIEVENQVQEMLNQGLIRE-SNSPYNSPTWVVPKKPDAS 256
Query: 812 VVANENNELIPTRQVTKWRVCIDYRRLNSVTRKDHFPLPFIDQMLDKLAGHQYYCFLDGY 871
K+RV IDYR+LN +T D +P+P +D++L KL QY+ +D
Sbjct: 257 -------------GANKYRVVIDYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLA 303
Query: 872 SGYNQICVAPEDQEKTAFTCPYDVFAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIF 931
G++QI + E KTAF+ + Y RMPFGL NAPATFQRCM I L+ ++
Sbjct: 304 KGFHQIEMDEESISKTAFSTKSGHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVY 363
Query: 932 MDDFSVFGPNFDACLGNLALVLKRCQETNLVLNWEKCHFMVRDGIVLGHKVSERGIEVDR 991
+DD +F + L ++ LV + + NL L +KC F+ ++ LGH V+ GI+ +
Sbjct: 364 LDDIIIFSTSLTEHLNSIQLVFTKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNP 423
Query: 992 AKIEVIEKLPPPTNIKGIRSFLGHAGFYRRFIKDFSKLAKPMTNLLEKEAPF-TFDENCL 1050
K++ I P PT K IR+FLG G+YR+FI +++ +AKPMT+ L+K T +
Sbjct: 424 IKVKAIVSYPIPTKDKEIRAFLGLTGYYRKFIPNYADIAKPMTSCLKKRTKIDTQKLEYI 483
Query: 1051 KAFESIKKSLVTAPVIVAPDWSLPFEIMCDASDLALGAVLCQKKERVLYVIYYASTVLNE 1110
+AFE +K ++ P++ PD+ F + DAS+LALGAVL Q + ++ S LN+
Sbjct: 484 EAFEKLKALIIRDPILQLPDFEKKFVLTTDASNLALGAVLSQNGHPISFI----SRTLND 539
Query: 1111 AQRNYTTTEKELLGVVFACEKFRPYILGFKVVVHTDHAALRHLFAKQDSKPRLIRWVLLL 1170
+ NY+ EKELL +V+A + FR Y+LG + ++ +DH LR L ++ +L RW + L
Sbjct: 540 HELNYSAIEKELLAIVWATKTFRHYLLGRQFLIASDHQPLRWLHNLKEPGAKLERWRVRL 599
Query: 1171 QEFDLEIIDRRGKDNGVADHLSR--LEGGACSPIPIQEEFPDEKLLAVSTEEPLPWYVHF 1228
E+ +I +GK+N VAD LSR +E S D L TE+P+ ++
Sbjct: 600 SEYQFKIDYIKGKENSVADALSRIKIEENHHSEATQHSAEEDNSNLIHLTEKPINYFKKQ 659
Query: 1229 ANF-----------RVAG----LIPHDLTWQQKKK------FLHDAKSYLWDDPFLFKIC 1267
F ++ G I +D+ +K K F+H + + F+I
Sbjct: 660 IIFIKSDKNKVEHSKIFGNSITTIQYDVMTLEKAKQILLDHFIHRNITIYIESDVDFEIV 719
Query: 1268 SDG-----------VIRRCIPEVN------FEKILWYCHGSSYGGHFSGERTAAKVLQSG 1310
VIR N F++I+ H G + K+ +
Sbjct: 720 QRAHIEIVNTTYTKVIRSLFLLKNVGSYAEFKEIILQSHEKLL---HPGIQKMTKLFKEN 776
Query: 1311 FYWPTLNRDSRAFVESCDRCQRTGNISRRNEMPLK------NILEIELFDVWGIDFMGPF 1364
++P + + C+ C R +MPLK + E + D++
Sbjct: 777 HFFPNSQLLIQNIINECNICNLAKTEHRNTKMPLKITPNPEHCREKFVVDIYS------- 829
Query: 1365 PPSFGCQYILLAVDYVSKWVEAAALSTNDSKVVVAFLKKNIFTRFGVPRAIISDGGTHFC 1424
S G YI +D SK+ + T D + IF + G P+ + +D F
Sbjct: 830 --SEGKHYI-SCIDIYSKFATLEQIKTKD-WIECRNALMRIFNQLGKPKLLKADRDGAFS 885
Query: 1425 NRAFESLLEKYGVKHKVSTPYHPQTSGQVEISNRELKRILEK--VVDSSRKDWSR--KLD 1480
+ A + LE+ V+ +++T +G ++ R K I EK +++SS + + K++
Sbjct: 886 SLALKRWLEEEEVELQLNT----AKNGVADV-ERLHKTINEKIRIINSSDDEEVKLSKIE 940
Query: 1481 DALWAYRTAFKTPIGTSPFHLVFGKACHLPVELEHKAYWAIRKLNFD 1527
L+ Y K +F A H ++ + I K+N D
Sbjct: 941 TILYTYNQKIKHDTTGQRPAQIFLYAGHPILDTQKIKEKKIEKINED 987
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 281 bits (718), Expect = 1e-74
Identities = 249/925 (26%), Positives = 416/925 (44%), Gaps = 95/925 (10%)
Query: 753 LLEENYKPIVQPQRRLNPSMKDVVRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGITV 812
L +ENY+ ++ L P + EI + L +G+I S + PV VPKK G
Sbjct: 406 LTQENYRLPIR-NYPLPPGKMQAMNDEINQGLKSGIIRE-SKAINACPVMFVPKKEGTL- 462
Query: 813 VANENNELIPTRQVTKWRVCIDYRRLNSVTRKDHFPLPFIDQMLDKLAGHQYYCFLDGYS 872
R+ +DY+ LN + + +PLP I+Q+L K+ G + LD S
Sbjct: 463 -----------------RMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKS 505
Query: 873 GYNQICVAPEDQEKTAFTCPYDVFAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIFM 932
Y+ I V D+ K AF CP VF Y MP+G+ APA FQ + I + E+ + +M
Sbjct: 506 AYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESHVVCYM 565
Query: 933 DDFSVFGPNFDACLGNLALVLKRCQETNLVLNWEKCHFMVRDGIVLGHKVSERGIEVDRA 992
DD + + + ++ VL++ + NL++N KC F +G+ +SE+G +
Sbjct: 566 DDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQE 625
Query: 993 KIEVIEKLPPPTNIKGIRSFLGHAGFYRRFIKDFSKLAKPMTNLLEKEAPFTFDENCLKA 1052
I+ + + P N K +R FLG + R+FI S+L P+ NLL+K+ + + +A
Sbjct: 626 NIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQA 685
Query: 1053 FESIKKSLVTAPVIVAPDWSLPFEIMCDASDLALGAVLCQK-KERVLYVIYYASTVLNEA 1111
E+IK+ LV+ PV+ D+S + DASD+A+GAVL QK + Y + Y S +++A
Sbjct: 686 IENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKA 745
Query: 1112 QRNYTTTEKELLGVVFACEKFRPY----ILGFKVVVHTDHAAL--RHLFAKQDSKPRLIR 1165
Q NY+ ++KE+L ++ + + +R Y I FK++ TDH L R + RL R
Sbjct: 746 QLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKIL--TDHRNLIGRITNESEPENKRLAR 803
Query: 1166 WVLLLQEFDLEIIDRRGKDNGVADHLSRLEGGACSPIP---------------IQEEFPD 1210
W L LQ+F+ EI R G N +AD LSR+ PIP I ++F +
Sbjct: 804 WQLFLQDFNFEINYRPGSANHIADALSRIV-DETEPIPKDSEDNSINFVNQISITDDFKN 862
Query: 1211 EKLLAVSTEEPLPWYVHFANFRVAGLIPHDLTWQQKKKFLHDAKSYLWDDPFLFKICSDG 1270
+ + + + L ++ + RV I Q K L ++K + + +D
Sbjct: 863 QVVTEYTNDTKLLNLLNNEDKRVEENI------QLKDGLLINSKDQI-------LLPNDT 909
Query: 1271 VIRRCIPEVNFE--KILWYCHGSSYGGHFSGERTAAKVLQSGFYWPTLNRDSRAFVESCD 1328
+ R I + E K++ G ++ F W + + + +V++C
Sbjct: 910 QLTRTIIKKYHEEGKLI-----------HPGIELLTNIILRRFTWKGIRKQIQEYVQNCH 958
Query: 1329 RCQRTGNISRRNEMPLKNILEIEL-FDVWGIDFMGPFPPSFGCQYILLAVDYVSKW-VEA 1386
CQ + + + PL+ I E ++ +DF+ P S G + + VD SK +
Sbjct: 959 TCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILV 1018
Query: 1387 AALSTNDSKVVVAFLKKNIFTRFGVPRAIISDGGTHFCNRAFESLLEKYGVKHKVSTPYH 1446
+ ++ + + FG P+ II+D F ++ ++ KY K S PY
Sbjct: 1019 PCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYR 1078
Query: 1447 PQTSGQVEISNRELKRILEKVVDSSRKDWSRKLDDALWAYRTAFKTPIGTSPFHLVFGKA 1506
PQT GQ E +N+ ++++L V + W + +Y A + +PF +V +
Sbjct: 1079 PQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYS 1138
Query: 1507 CHL-PVELEHKAYWAIRKLNFDWKVASEKRLLQLNELDEFRLRAYESASIYKEKTKKWHD 1565
L P+EL S+K E + E + K KK+ D
Sbjct: 1139 PALSPLELPS---------------FSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFD 1183
Query: 1566 RKILN-REFVSGQLVLLFNSRLRLF--PGKLKSRWSGPFVVKRVFPHGAVEVENPET-KN 1621
KI EF G LV++ ++ KL ++GPF V + E++ P++ K+
Sbjct: 1184 MKIQEIEEFQPGDLVMVKRTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKH 1243
Query: 1622 TF--TVNGQRLKVYHGGEVLKLETM 1644
F T + L+ Y L T+
Sbjct: 1244 MFSSTFHVSHLEKYRHNSELNYATI 1268
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 278 bits (712), Expect = 7e-74
Identities = 248/925 (26%), Positives = 417/925 (44%), Gaps = 95/925 (10%)
Query: 753 LLEENYKPIVQPQRRLNPSMKDVVRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGITV 812
L +ENY+ ++ L P + EI + L +G+I S + PV VPKK G
Sbjct: 406 LTQENYRLPIR-NYPLPPGKMQAMNDEINQGLKSGIIRE-SKAINACPVMFVPKKEGTL- 462
Query: 813 VANENNELIPTRQVTKWRVCIDYRRLNSVTRKDHFPLPFIDQMLDKLAGHQYYCFLDGYS 872
R+ +DY+ LN + + +PLP I+Q+L K+ G + LD S
Sbjct: 463 -----------------RMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKS 505
Query: 873 GYNQICVAPEDQEKTAFTCPYDVFAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIFM 932
Y+ I V D+ K AF CP VF Y MP+G+ APA FQ + I ++ E+ + +M
Sbjct: 506 AYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYM 565
Query: 933 DDFSVFGPNFDACLGNLALVLKRCQETNLVLNWEKCHFMVRDGIVLGHKVSERGIEVDRA 992
D+ + + + ++ VL++ + NL++N KC F +G+ +SE+G +
Sbjct: 566 DNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQE 625
Query: 993 KIEVIEKLPPPTNIKGIRSFLGHAGFYRRFIKDFSKLAKPMTNLLEKEAPFTFDENCLKA 1052
I+ + + P N K +R FLG + R+FI S+L P+ NLL+K+ + + +A
Sbjct: 626 NIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQA 685
Query: 1053 FESIKKSLVTAPVIVAPDWSLPFEIMCDASDLALGAVLCQK-KERVLYVIYYASTVLNEA 1111
E+IK+ LV+ PV+ D+S + DASD+A+GAVL QK + Y + Y S +++A
Sbjct: 686 IENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKA 745
Query: 1112 QRNYTTTEKELLGVVFACEKFRPY----ILGFKVVVHTDHAAL--RHLFAKQDSKPRLIR 1165
Q NY+ ++KE+L ++ + + +R Y I FK++ TDH L R + RL R
Sbjct: 746 QLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKIL--TDHRNLIGRITNESEPENKRLAR 803
Query: 1166 WVLLLQEFDLEIIDRRGKDNGVADHLSRLEGGACSPIP---------------IQEEFPD 1210
W L LQ+F+ EI R G N +AD LSR+ PIP I ++F +
Sbjct: 804 WQLFLQDFNFEINYRPGSANHIADALSRIV-DETEPIPKDSEDNSINFVNQISITDDFKN 862
Query: 1211 EKLLAVSTEEPLPWYVHFANFRVAGLIPHDLTWQQKKKFLHDAKSYLWDDPFLFKICSDG 1270
+ + + + L ++ + RV I Q K L ++K + + +D
Sbjct: 863 QVVTEYTNDTKLLNLLNNEDKRVEENI------QLKDGLLINSKDQI-------LLPNDT 909
Query: 1271 VIRRCIPEVNFE--KILWYCHGSSYGGHFSGERTAAKVLQSGFYWPTLNRDSRAFVESCD 1328
+ R I + E K++ G ++ F W + + + +V++C
Sbjct: 910 QLTRTIIKKYHEEGKLI-----------HPGIELLTNIILRRFTWKGIRKQIQEYVQNCH 958
Query: 1329 RCQRTGNISRRNEMPLKNILEIEL-FDVWGIDFMGPFPPSFGCQYILLAVDYVSKW-VEA 1386
CQ + + + PL+ I E ++ +DF+ P S G + + VD SK +
Sbjct: 959 TCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILV 1018
Query: 1387 AALSTNDSKVVVAFLKKNIFTRFGVPRAIISDGGTHFCNRAFESLLEKYGVKHKVSTPYH 1446
+ ++ + + FG P+ II+D F ++ ++ KY K S PY
Sbjct: 1019 PCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYR 1078
Query: 1447 PQTSGQVEISNRELKRILEKVVDSSRKDWSRKLDDALWAYRTAFKTPIGTSPFHLVFGKA 1506
PQT GQ E +N+ ++++L V + W + +Y A + +PF +V +
Sbjct: 1079 PQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYS 1138
Query: 1507 CHL-PVELEHKAYWAIRKLNFDWKVASEKRLLQLNELDEFRLRAYESASIYKEKTKKWHD 1565
L P+EL S+K E + E + K KK+ D
Sbjct: 1139 PALSPLELPS---------------FSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFD 1183
Query: 1566 RKILN-REFVSGQLVLLFNSRLRLF--PGKLKSRWSGPFVVKRVFPHGAVEVENPET-KN 1621
KI EF G LV++ ++ KL ++GPF V + E++ P++ K+
Sbjct: 1184 MKIQEIEEFQPGDLVMVKRTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKH 1243
Query: 1622 TF--TVNGQRLKVYHGGEVLKLETM 1644
F T + L+ Y L T+
Sbjct: 1244 MFSSTFHVSHLEKYRHNSELNYATI 1268
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 278 bits (712), Expect = 7e-74
Identities = 248/925 (26%), Positives = 417/925 (44%), Gaps = 95/925 (10%)
Query: 753 LLEENYKPIVQPQRRLNPSMKDVVRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGITV 812
L +ENY+ ++ L P + EI + L +G+I S + PV VPKK G
Sbjct: 406 LTQENYRLPIR-NYPLPPGKMQAMNDEINQGLKSGIIRE-SKAINACPVMFVPKKEGTL- 462
Query: 813 VANENNELIPTRQVTKWRVCIDYRRLNSVTRKDHFPLPFIDQMLDKLAGHQYYCFLDGYS 872
R+ +DY+ LN + + +PLP I+Q+L K+ G + LD S
Sbjct: 463 -----------------RMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKS 505
Query: 873 GYNQICVAPEDQEKTAFTCPYDVFAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIFM 932
Y+ I V D+ K AF CP VF Y MP+G+ APA FQ + I ++ E+ + +M
Sbjct: 506 AYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYM 565
Query: 933 DDFSVFGPNFDACLGNLALVLKRCQETNLVLNWEKCHFMVRDGIVLGHKVSERGIEVDRA 992
D+ + + + ++ VL++ + NL++N KC F +G+ +SE+G +
Sbjct: 566 DNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQE 625
Query: 993 KIEVIEKLPPPTNIKGIRSFLGHAGFYRRFIKDFSKLAKPMTNLLEKEAPFTFDENCLKA 1052
I+ + + P N K +R FLG + R+FI S+L P+ NLL+K+ + + +A
Sbjct: 626 NIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQA 685
Query: 1053 FESIKKSLVTAPVIVAPDWSLPFEIMCDASDLALGAVLCQK-KERVLYVIYYASTVLNEA 1111
E+IK+ LV+ PV+ D+S + DASD+A+GAVL QK + Y + Y S +++A
Sbjct: 686 IENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKA 745
Query: 1112 QRNYTTTEKELLGVVFACEKFRPY----ILGFKVVVHTDHAAL--RHLFAKQDSKPRLIR 1165
Q NY+ ++KE+L ++ + + +R Y I FK++ TDH L R + RL R
Sbjct: 746 QLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKIL--TDHRNLIGRITNESEPENKRLAR 803
Query: 1166 WVLLLQEFDLEIIDRRGKDNGVADHLSRLEGGACSPIP---------------IQEEFPD 1210
W L LQ+F+ EI R G N +AD LSR+ PIP I ++F +
Sbjct: 804 WQLFLQDFNFEINYRPGSANHIADALSRIV-DETEPIPKDSEDNSINFVNQISITDDFKN 862
Query: 1211 EKLLAVSTEEPLPWYVHFANFRVAGLIPHDLTWQQKKKFLHDAKSYLWDDPFLFKICSDG 1270
+ + + + L ++ + RV I Q K L ++K + + +D
Sbjct: 863 QVVTEYTNDTKLLNLLNNEDKRVEENI------QLKDGLLINSKDQI-------LLPNDT 909
Query: 1271 VIRRCIPEVNFE--KILWYCHGSSYGGHFSGERTAAKVLQSGFYWPTLNRDSRAFVESCD 1328
+ R I + E K++ G ++ F W + + + +V++C
Sbjct: 910 QLTRTIIKKYHEEGKLI-----------HPGIELLTNIILRRFTWKGIRKQIQEYVQNCH 958
Query: 1329 RCQRTGNISRRNEMPLKNILEIEL-FDVWGIDFMGPFPPSFGCQYILLAVDYVSKW-VEA 1386
CQ + + + PL+ I E ++ +DF+ P S G + + VD SK +
Sbjct: 959 TCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILV 1018
Query: 1387 AALSTNDSKVVVAFLKKNIFTRFGVPRAIISDGGTHFCNRAFESLLEKYGVKHKVSTPYH 1446
+ ++ + + FG P+ II+D F ++ ++ KY K S PY
Sbjct: 1019 PCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYR 1078
Query: 1447 PQTSGQVEISNRELKRILEKVVDSSRKDWSRKLDDALWAYRTAFKTPIGTSPFHLVFGKA 1506
PQT GQ E +N+ ++++L V + W + +Y A + +PF +V +
Sbjct: 1079 PQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYS 1138
Query: 1507 CHL-PVELEHKAYWAIRKLNFDWKVASEKRLLQLNELDEFRLRAYESASIYKEKTKKWHD 1565
L P+EL S+K E + E + K KK+ D
Sbjct: 1139 PALSPLELPS---------------FSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFD 1183
Query: 1566 RKILN-REFVSGQLVLLFNSRLRLF--PGKLKSRWSGPFVVKRVFPHGAVEVENPET-KN 1621
KI EF G LV++ ++ KL ++GPF V + E++ P++ K+
Sbjct: 1184 MKIQEIEEFQPGDLVMVKRTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKH 1243
Query: 1622 TF--TVNGQRLKVYHGGEVLKLETM 1644
F T + L+ Y L T+
Sbjct: 1244 MFSSTFHVSHLEKYRHNSELNYATI 1268
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 276 bits (705), Expect = 4e-73
Identities = 169/512 (33%), Positives = 272/512 (53%), Gaps = 33/512 (6%)
Query: 740 IKGISPAICMHKILLEENYKPIVQPQRRLNPSMKDVVRKEIIKLLDAGVIYPISDSEWVS 799
+ G+S + + PI +M+ V ++I +LL G+I P S+S + S
Sbjct: 103 LSGMSVETAVKAEIRTNTQDPIYAKSYPYPVNMRGEVERQIDELLQDGIIRP-SNSPYNS 161
Query: 800 PVQVVPKKGGITVVANENNELIPTRQVTKWRVCIDYRRLNSVTRKDHFPLPFIDQMLDKL 859
P+ +VPKK N E ++R+ +D++RLN+VT D +P+P I+ L L
Sbjct: 162 PIWIVPKK------PKPNGE-------KQYRMVVDFKRLNTVTIPDTYPIPDINATLASL 208
Query: 860 AGHQYYCFLDGYSGYNQICVAPEDQEKTAFTCPYDVFAYKRMPFGLCNAPATFQRCMFAI 919
+Y+ LD SG++QI + D KTAF+ + + R+PFGL NAPA FQR + I
Sbjct: 209 GNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPFGLKNAPAIFQRMIDDI 268
Query: 920 FSDLIETCIEIFMDDFSVFGPNFDACLGNLALVLKRCQETNLVLNWEKCHFMVRDGIVLG 979
+ I +++DD VF ++D NL LVL + NL +N EK HF+ LG
Sbjct: 269 LREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNLEKSHFLDTQVEFLG 328
Query: 980 HKVSERGIEVDRAKIEVIEKLPPPTNIKGIRSFLGHAGFYRRFIKDFSKLAKPMTNLL-- 1037
+ V+ GI+ D K+ I ++PPPT++K ++ FLG +YR+FI+D++K+AKP+TNL
Sbjct: 329 YIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYRKFIQDYAKVAKPLTNLTRG 388
Query: 1038 ---------EKEAPFTFDENCLKAFESIKKSLVTAPVIVAPDWSLPFEIMCDASDLALGA 1088
+ P T DE L++F +K L ++ ++ P ++ PF + DAS+ A+GA
Sbjct: 389 LYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKPFHLTTDASNWAIGA 448
Query: 1089 VLCQKKERVLYVIYYASTVLNEAQRNYTTTEKELLGVVFACEKFRPYILGFKVV-VHTDH 1147
VL Q + I Y S LN+ + NY T EKE+L ++++ + R Y+ G + V+TDH
Sbjct: 449 VLSQDDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLDNLRAYLYGAGTIKVYTDH 508
Query: 1148 AALRHLFAKQDSKPRLIRWVLLLQEFDLEIIDRRGKDNGVADHLSRLEGGACSPIPIQEE 1207
L ++ +L RW ++E++ E+I + GK N VAD LSR+ +
Sbjct: 509 QPLTFALGNRNFNAKLKRWKARIEEYNCELIYKPGKSNVVADALSRIPPQLNQLSTDLDA 568
Query: 1208 FPDEKLLAVSTEEPLPWYVHFANFRVAGLIPH 1239
P++ + +++T H A + LIPH
Sbjct: 569 NPEDDMQSLAT-------AHSALHDSSRLIPH 593
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 266 bits (679), Expect = 5e-70
Identities = 157/436 (36%), Positives = 237/436 (54%), Gaps = 13/436 (2%)
Query: 759 KPIVQPQRRLNPSMKDVVRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGITVVANENN 818
+P+ R S + ++ ++ KL+ ++ P S S++ SP+ +VPKK
Sbjct: 313 EPVYTKNYRSPHSQVEEIQAQVQKLIKDKIVEP-SVSQYNSPLLLVPKKSS--------- 362
Query: 819 ELIPTRQVTKWRVCIDYRRLNSVTRKDHFPLPFIDQMLDKLAGHQYYCFLDGYSGYNQIC 878
P KWR+ IDYR++N D FPLP ID +LD+L +Y+ LD SG++QI
Sbjct: 363 ---PNSDKKKWRLVIDYRQINKKLLADKFPLPRIDDILDQLGRAKYFSCLDLMSGFHQIE 419
Query: 879 VAPEDQEKTAFTCPYDVFAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIFMDDFSVF 938
+ ++ T+F+ + + R+PFGL AP +FQR M FS + + ++MDD V
Sbjct: 420 LDEGSRDITSFSTSNGSYRFTRLPFGLKIAPNSFQRMMTIAFSGIEPSQAFLYMDDLIVI 479
Query: 939 GPNFDACLGNLALVLKRCQETNLVLNWEKCHFMVRDGIVLGHKVSERGIEVDRAKIEVIE 998
G + L NL V +C+E NL L+ EKC F + + LGHK +++GI D K +VI+
Sbjct: 480 GCSEKHMLKNLTEVFGKCREYNLKLHPEKCSFFMHEVTFLGHKCTDKGILPDDKKYDVIQ 539
Query: 999 KLPPPTNIKGIRSFLGHAGFYRRFIKDFSKLAKPMTNLLEKEAPFTFDENCLKAFESIKK 1058
P P + R F+ +YRRFIK+F+ ++ +T L +K PF + + C KAF +K
Sbjct: 540 NYPVPHDADSARRFVAFCNYYRRFIKNFADYSRHITRLCKKNVPFEWTDECQKAFIHLKS 599
Query: 1059 SLVTAPVIVAPDWSLPFEIMCDASDLALGAVLCQKKERVLYVIYYASTVLNEAQRNYTTT 1118
L+ ++ PD+S F I DAS A GAVL Q + YAS + + N +TT
Sbjct: 600 QLINPTLLQYPDFSKEFCITTDASKQACGAVLTQNHNGHQLPVAYASRAFTKGESNKSTT 659
Query: 1119 EKELLGVVFACEKFRPYILGFKVVVHTDHAALRHLFAKQDSKPRLIRWVLLLQEFDLEII 1178
E+EL + +A FRPYI G V TDH L +LF+ + +L R L L+E++ +
Sbjct: 660 EQELAAIHWAIIHFRPYIYGKHFTVKTDHRPLTYLFSMVNPSSKLTRIRLELEEYNFTVE 719
Query: 1179 DRRGKDNGVADHLSRL 1194
+GKDN VAD LSR+
Sbjct: 720 YLKGKDNHVADALSRI 735
Score = 115 bits (287), Expect = 1e-24
Identities = 89/344 (25%), Positives = 161/344 (45%), Gaps = 17/344 (4%)
Query: 1294 GGHFSGERTAAKVLQSGFYWPTLNRDSRAFVESCDRCQRTGNISRRNEMPLKNILEI--E 1351
GGH +T AKV + +YW +++ + +V C +CQ+ ++ + P+ I E
Sbjct: 907 GGHTGITKTLAKVKRH-YYWKNMSKYIKEYVRKCQKCQKA-KTTKHTKTPM-TITETPEH 963
Query: 1352 LFDVWGIDFMGPFPPSF-GCQYILLAVDYVSKWVEAAALSTNDSKVVVAFLKKNIFTRFG 1410
FD +D +GP P S G +Y + + ++K++ A ++ +K V + ++ ++G
Sbjct: 964 AFDRVVVDTIGPLPKSENGNEYAVTLICDLTKYLVAIPIANKSAKTVAKAIFESFILKYG 1023
Query: 1411 VPRAIISDGGTHFCNRAFESLLEKYGVKHKVSTPYHPQTSGQVEISNRELKRILEKVVDS 1470
+ I+D GT + N L + +K+ ST +H QT G VE S+R L + + +
Sbjct: 1024 PMKTFITDMGTEYKNSIITDLCKYLKIKNITSTAHHHQTVGVVERSHRTLNEYIRSYIST 1083
Query: 1471 SRKDWSRKLDDALWAYRTAFKTPIGTSPFHLVFGKACHLPVELEHKAYWAIRKLNFDWKV 1530
+ DW L ++ + T P+ LVFG+ +LP +K + N D
Sbjct: 1084 DKTDWDVWLQYFVYCFNTTQSMVHNYCPYELVFGRTSNLPKHF-NKLHSIEPIYNID-DY 1141
Query: 1531 ASEKRLLQLNELDEFRLRAYESASIYKEKTKKWHDRKILNREFVSGQLVLLFNSRLRLFP 1590
A E + L+ RA + +KEK K+ +D K+ + E G VLL N
Sbjct: 1142 AKESKY----RLEVAYARARKLLEAHKEKNKENYDLKVKDIELEVGDKVLLRNE----VG 1193
Query: 1591 GKLKSRWSGPFVVKRVFPHGAVEVENPETKNTFTVNGQRLKVYH 1634
KL +++GP+ ++ + + + + + K V+ RLK +H
Sbjct: 1194 HKLDFKYTGPYKIESIGDNNNITLLTNKNKKQI-VHKDRLKKFH 1236
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 230 bits (586), Expect = 3e-59
Identities = 144/435 (33%), Positives = 235/435 (53%), Gaps = 30/435 (6%)
Query: 772 MKDVVRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGITVVANENNELIPTRQVTKWRV 831
+ D V E+ +LL G+I P S S + SP VV KKG N N L+
Sbjct: 193 VSDFVNNEVKQLLKDGIIRP-SRSPYNSPTWVVDKKG-TDAFGNPNKRLV---------- 240
Query: 832 CIDYRRLNSVTRKDHFPLPFIDQMLDKLAGHQYYCFLDGYSGYNQICVAPEDQEKTAFTC 891
ID+R+LN T D +P+P I +L L +++ LD SGY+QI +A D+EKT+F+
Sbjct: 241 -IDFRKLNEKTIPDRYPMPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFSV 299
Query: 892 PYDVFAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIFMDDFSVFGPNFDACLGNLAL 951
+ + R+PFGL NA + FQR + + + I +++DD +F N + ++
Sbjct: 300 NGGKYEFCRLPFGLRNASSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHIDT 359
Query: 952 VLKRCQETNLVLNWEKCHFMVRDGIVLGHKVSERGIEVDRAKIEVIEKLPPPTNIKGIRS 1011
VLK + N+ ++ EK F LG VS+ G + D K++ I++ P P + +RS
Sbjct: 360 VLKCLIDANMRVSQEKTRFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRS 419
Query: 1012 FLGHAGFYRRFIKDFSKLAKPMTNLLE-----------KEAPFTFDENCLKAFESIKKSL 1060
FLG A +YR FIKDF+ +A+P+T++L+ K+ P F+E AF+ ++ L
Sbjct: 420 FLGLASYYRVFIKDFAAIARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNIL 479
Query: 1061 VTAPVIVA-PDWSLPFEIMCDASDLALGAVLCQKKERVLYVIYYASTVLNEAQRNYTTTE 1119
+ VI+ PD+ PF++ DAS +GAVL Q+ + + S L + ++NY T E
Sbjct: 480 ASEDVILKYPDFKKPFDLTTDASASGIGAVLSQEGRPITMI----SRTLKQPEQNYATNE 535
Query: 1120 KELLGVVFACEKFRPYILGFKVV-VHTDHAALRHLFAKQDSKPRLIRWVLLLQEFDLEII 1178
+ELL +V+A K + ++ G + + + TDH L A +++ ++ RW + + + ++
Sbjct: 536 RELLAIVWALGKLQNFLYGSREINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQHNAKVF 595
Query: 1179 DRRGKDNGVADHLSR 1193
+ GK+N VAD LSR
Sbjct: 596 YKPGKENFVADALSR 610
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 158 bits (400), Expect = 1e-37
Identities = 128/451 (28%), Positives = 214/451 (47%), Gaps = 25/451 (5%)
Query: 752 ILLEENYKPIVQPQRRLNPSMKDVVRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGIT 811
I L + K I + +P ++ K+I +LLD VI P S S ++P +V
Sbjct: 238 IKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKP-SKSPHMAPAFLV------- 289
Query: 812 VVANENNELIPTRQVTKWRVCIDYRRLNSVTRKDHFPLPFIDQMLDKLAGHQYYCFLDGY 871
NNE R K R+ ++Y+ +N T D + LP D++L + G + + D
Sbjct: 290 -----NNEAEKRRG--KKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCK 342
Query: 872 SGYNQICVAPEDQEKTAFTCPYDVFAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIF 931
SG+ Q+ + E + TAFTCP + + +PFGL AP+ FQR M F + C ++
Sbjct: 343 SGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVFRKFCC-VY 401
Query: 932 MDDFSVFGPNFDACLGNLALVLKRCQETNLVLNWEKCHFMVRDGIVLGHKVSERGIEVDR 991
+DD VF N + L ++A++L++C + ++L+ +K + LG ++ E +
Sbjct: 402 VDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQG 461
Query: 992 AKIEVIEKLPPP-TNIKGIRSFLGHAGFYRRFIKDFSKLAKPMTNLLEKEAPFTFDENCL 1050
+E I K P + K ++ FLG + +I +++ KP+ L++ P+ + +
Sbjct: 462 HILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDT 521
Query: 1051 KAFESIKKSLVTAPVIVAPDWSLPFEIMCDASDLALG----AVLCQKKERVLYVIYYAST 1106
+ +KK+L P + P I DASD G A+ + + YAS
Sbjct: 522 LYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASG 581
Query: 1107 VLNEAQRNYTTTEKELLGVVFACEKFRPYILGFKVVVHTDHAALR---HLFAKQDSK-PR 1162
A+RNY + +KE L V+ +KF Y+ ++ TD+ + +L K DSK R
Sbjct: 582 SFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGR 641
Query: 1163 LIRWVLLLQEFDLEIIDRRGKDNGVADHLSR 1193
IRW L + ++ +G DN AD LSR
Sbjct: 642 NIRWQAWLSHYSFDVEHIKGTDNHFADFLSR 672
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 158 bits (400), Expect = 1e-37
Identities = 128/451 (28%), Positives = 214/451 (47%), Gaps = 25/451 (5%)
Query: 752 ILLEENYKPIVQPQRRLNPSMKDVVRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGIT 811
I L + K I + +P ++ K+I +LLD VI P S S ++P +V
Sbjct: 238 IKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKP-SKSPHMAPAFLV------- 289
Query: 812 VVANENNELIPTRQVTKWRVCIDYRRLNSVTRKDHFPLPFIDQMLDKLAGHQYYCFLDGY 871
NNE R K R+ ++Y+ +N T D + LP D++L + G + + D
Sbjct: 290 -----NNEAEKRRG--KKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCK 342
Query: 872 SGYNQICVAPEDQEKTAFTCPYDVFAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIF 931
SG+ Q+ + E + TAFTCP + + +PFGL AP+ FQR M F + C ++
Sbjct: 343 SGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVFRKFCC-VY 401
Query: 932 MDDFSVFGPNFDACLGNLALVLKRCQETNLVLNWEKCHFMVRDGIVLGHKVSERGIEVDR 991
+DD VF N + L ++A++L++C + ++L+ +K + LG ++ E +
Sbjct: 402 VDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQG 461
Query: 992 AKIEVIEKLPPP-TNIKGIRSFLGHAGFYRRFIKDFSKLAKPMTNLLEKEAPFTFDENCL 1050
+E I K P + K ++ FLG + +I +++ KP+ L++ P+ + +
Sbjct: 462 HILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDT 521
Query: 1051 KAFESIKKSLVTAPVIVAPDWSLPFEIMCDASDLALG----AVLCQKKERVLYVIYYAST 1106
+ +KK+L P + P I DASD G A+ + + YAS
Sbjct: 522 LYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASG 581
Query: 1107 VLNEAQRNYTTTEKELLGVVFACEKFRPYILGFKVVVHTDHAALR---HLFAKQDSK-PR 1162
A+RNY + +KE L V+ +KF Y+ ++ TD+ + +L K DSK R
Sbjct: 582 SFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGR 641
Query: 1163 LIRWVLLLQEFDLEIIDRRGKDNGVADHLSR 1193
IRW L + ++ +G DN AD LSR
Sbjct: 642 NIRWQAWLSHYSFDVEHIKGTDNHFADFLSR 672
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 157 bits (397), Expect = 2e-37
Identities = 127/451 (28%), Positives = 214/451 (47%), Gaps = 25/451 (5%)
Query: 752 ILLEENYKPIVQPQRRLNPSMKDVVRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGIT 811
I L + K I + +P ++ K+I +LLD VI P S S ++P +V
Sbjct: 238 IKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKP-SKSPHMAPAFLV------- 289
Query: 812 VVANENNELIPTRQVTKWRVCIDYRRLNSVTRKDHFPLPFIDQMLDKLAGHQYYCFLDGY 871
NNE R K R+ ++Y+ +N T D + LP D++L + G + + D
Sbjct: 290 -----NNEAEKRRG--KKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCK 342
Query: 872 SGYNQICVAPEDQEKTAFTCPYDVFAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIF 931
SG+ Q+ + E + TAFTCP + + +PFGL AP+ FQR M F + C ++
Sbjct: 343 SGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVFRKFCC-VY 401
Query: 932 MDDFSVFGPNFDACLGNLALVLKRCQETNLVLNWEKCHFMVRDGIVLGHKVSERGIEVDR 991
+DD VF N + L ++A++L++C + ++L+ +K + LG ++ E +
Sbjct: 402 VDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQG 461
Query: 992 AKIEVIEKLPPP-TNIKGIRSFLGHAGFYRRFIKDFSKLAKPMTNLLEKEAPFTFDENCL 1050
+E I K P + K ++ FLG + +I +++ KP+ L++ P+ + +
Sbjct: 462 HILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWRWTKEDT 521
Query: 1051 KAFESIKKSLVTAPVIVAPDWSLPFEIMCDASDLALG----AVLCQKKERVLYVIYYAST 1106
+ +KK+L P + P I DASD G A+ + + YAS
Sbjct: 522 LYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASG 581
Query: 1107 VLNEAQRNYTTTEKELLGVVFACEKFRPYILGFKVVVHTDHAALR---HLFAKQDSK-PR 1162
A++NY + +KE L V+ +KF Y+ ++ TD+ + +L K DSK R
Sbjct: 582 SFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGR 641
Query: 1163 LIRWVLLLQEFDLEIIDRRGKDNGVADHLSR 1193
IRW L + ++ +G DN AD LSR
Sbjct: 642 NIRWQAWLSHYSFDVEHIKGTDNHFADFLSR 672
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 674
Score = 154 bits (390), Expect = 1e-36
Identities = 126/451 (27%), Positives = 213/451 (46%), Gaps = 25/451 (5%)
Query: 752 ILLEENYKPIVQPQRRLNPSMKDVVRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGIT 811
I L + K I + +P ++ K+I +LLD VI P S S ++P +V
Sbjct: 233 IKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKP-SKSPHMAPAFLV------- 284
Query: 812 VVANENNELIPTRQVTKWRVCIDYRRLNSVTRKDHFPLPFIDQMLDKLAGHQYYCFLDGY 871
NNE R K R+ ++Y+ +N T D + P D++L + G + + D
Sbjct: 285 -----NNEAEKRRG--KKRMVVNYKAMNKATVGDAYNPPNKDELLTLIRGKKIFSSFDCK 337
Query: 872 SGYNQICVAPEDQEKTAFTCPYDVFAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIF 931
SG+ Q+ + E + TAFTCP + + +PFGL AP+ FQR M F + C ++
Sbjct: 338 SGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVFRKFCC-VY 396
Query: 932 MDDFSVFGPNFDACLGNLALVLKRCQETNLVLNWEKCHFMVRDGIVLGHKVSERGIEVDR 991
+DD VF N + L ++A++L++C + ++L+ +K + LG ++ E +
Sbjct: 397 VDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQG 456
Query: 992 AKIEVIEKLPPP-TNIKGIRSFLGHAGFYRRFIKDFSKLAKPMTNLLEKEAPFTFDENCL 1050
+E I K P + K ++ FLG + +I +++ KP+ L++ P+ + +
Sbjct: 457 HILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDT 516
Query: 1051 KAFESIKKSLVTAPVIVAPDWSLPFEIMCDASDLALG----AVLCQKKERVLYVIYYAST 1106
+ +KK+L P + P I DASD G A+ + + YAS
Sbjct: 517 LYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASG 576
Query: 1107 VLNEAQRNYTTTEKELLGVVFACEKFRPYILGFKVVVHTDHAALR---HLFAKQDSK-PR 1162
A++NY + +KE L V+ +KF Y+ ++ TD+ + +L K DSK R
Sbjct: 577 SFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGR 636
Query: 1163 LIRWVLLLQEFDLEIIDRRGKDNGVADHLSR 1193
IRW L + ++ +G DN AD LSR
Sbjct: 637 NIRWQAWLSHYSFDVEHIKGTDNHFADFLSR 667
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 666
Score = 154 bits (388), Expect = 3e-36
Identities = 122/438 (27%), Positives = 211/438 (47%), Gaps = 36/438 (8%)
Query: 769 NPSMKDVVRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGITVVANENNELIPTRQVTK 828
+P ++ K+I +LLD G+I P S S+ +SP +V + R+ K
Sbjct: 248 SPQDREGFAKQIKELLDLGLIIP-SKSQHMSPAFLVENEA--------------ERRRGK 292
Query: 829 WRVCIDYRRLNSVTRKDHFPLPFIDQMLDKLAGHQYYCFLDGYSGYNQICVAPEDQEKTA 888
R+ ++Y+ +N T D LP + ++L L G + D SG+ Q+ + E Q+ TA
Sbjct: 293 KRMVVNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKSGFWQVVLDEESQKLTA 352
Query: 889 FTCPYDVFAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIFMDDFSVFGPNFDACLGN 948
FTCP F +K +PFGL AP+ FQR M + + C+ +++DD VF + +
Sbjct: 353 FTCPQGHFQWKVVPFGLKQAPSIFQRHMQTALNGADKFCM-VYVDDIIVFSNSELDHYNH 411
Query: 949 LALVLKRCQETNLVLNWEKCHFMVRDGIVLGHKVSERGIEVDRAK-------IEVIEKLP 1001
+ VLK ++ ++L+ +K + + K++ G+E+D+ +E I K P
Sbjct: 412 VYAVLKIVEKYGIILSKKKAN-------LFKEKINFLGLEIDKGTHCPQNHILENIHKFP 464
Query: 1002 PP-TNIKGIRSFLGHAGFYRRFIKDFSKLAKPMTNLLEKEAPFTFDENCLKAFESIKKSL 1060
+ K ++ FLG + +I +++ KP+ L+K+ + + ++ + IKK+L
Sbjct: 465 DRLEDKKHLQRFLGVLTYAETYIPKLAEIRKPLQVKLKKDVTWNWTQSDSDYVKKIKKNL 524
Query: 1061 VTAPVIVAPDWSLPFEIMCDASDLALGAVL-CQKKERVLYVIYYASTVLNEAQRNYTTTE 1119
+ P + P I DASD G VL + + V + Y+S +A++NY + +
Sbjct: 525 GSFPKLYLPKPEDHLIIETDASDSFWGGVLKARALDGVELICRYSSGSFKQAEKNYHSND 584
Query: 1120 KELLGVVFACEKFRPYILGFKVVVHTDHAALRHLF---AKQDSKP-RLIRWVLLLQEFDL 1175
KELL V KF Y+ + V TD+ + K DSK RL+RW ++
Sbjct: 585 KELLAVKQVITKFSAYLTPVRFTVRTDNKNFTYFLRINLKGDSKQGRLVRWQNWFSKYQF 644
Query: 1176 EIIDRRGKDNGVADHLSR 1193
++ G N +AD L+R
Sbjct: 645 DVEHLEGVKNVLADCLTR 662
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 680
Score = 153 bits (386), Expect = 4e-36
Identities = 125/451 (27%), Positives = 213/451 (46%), Gaps = 25/451 (5%)
Query: 752 ILLEENYKPIVQPQRRLNPSMKDVVRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGIT 811
I L + K I + +P ++ K+I +LLD VI P S S ++P +V
Sbjct: 239 IKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKP-SKSPHMAPAFLV------- 290
Query: 812 VVANENNELIPTRQVTKWRVCIDYRRLNSVTRKDHFPLPFIDQMLDKLAGHQYYCFLDGY 871
NNE R R+ ++Y+ +N T D + LP D++L + G + + D
Sbjct: 291 -----NNEAENGRG--NKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCK 343
Query: 872 SGYNQICVAPEDQEKTAFTCPYDVFAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIF 931
SG+ Q+ + E + TAFTCP + + +PFGL AP+ FQR M F + C ++
Sbjct: 344 SGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVFRKFCC-VY 402
Query: 932 MDDFSVFGPNFDACLGNLALVLKRCQETNLVLNWEKCHFMVRDGIVLGHKVSERGIEVDR 991
+DD VF N + L ++A++L++C + ++L+ +K + LG ++ E +
Sbjct: 403 VDDIVVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQG 462
Query: 992 AKIEVIEKLPPP-TNIKGIRSFLGHAGFYRRFIKDFSKLAKPMTNLLEKEAPFTFDENCL 1050
+E I K P + K ++ FLG + +I + +++ +P+ L++ P+ + +
Sbjct: 463 HILEHINKFPDTLEDKKQLQRFLGILTYASDYIPNLAQMRQPLQAKLKENVPWKWTKEDT 522
Query: 1051 KAFESIKKSLVTAPVIVAPDWSLPFEIMCDASDLALG----AVLCQKKERVLYVIYYAST 1106
+ +KK+L P + P I DASD G A+ + + Y S
Sbjct: 523 LYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYRSG 582
Query: 1107 VLNEAQRNYTTTEKELLGVVFACEKFRPYILGFKVVVHTDHAALR---HLFAKQDSK-PR 1162
A+RNY + +KE L V+ +KF Y+ ++ TD+ + +L K DSK R
Sbjct: 583 SFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGR 642
Query: 1163 LIRWVLLLQEFDLEIIDRRGKDNGVADHLSR 1193
IRW L + ++ +G DN AD LSR
Sbjct: 643 NIRWQAWLSHYSFDVEHIKGTDNHFADFLSR 673
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 659
Score = 152 bits (383), Expect = 1e-35
Identities = 118/436 (27%), Positives = 209/436 (47%), Gaps = 35/436 (8%)
Query: 769 NPSMKDVVRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGITVVANENNELIPTRQVTK 828
+PS ++ ++I +LL+ VI P S S +SP +V + R+ K
Sbjct: 237 SPSDREEFDRQIKELLELKVIKP-SKSTHMSPAFLVENEA--------------ERRRGK 281
Query: 829 WRVCIDYRRLNSVTRKDHFPLPFIDQMLDKLAGHQYYCFLDGYSGYNQICVAPEDQEKTA 888
R+ ++Y+ +N T+ D LP D++L + G + Y D SG Q+ + E Q TA
Sbjct: 282 KRMVVNYKAMNKATKGDAHNLPNKDELLTLVRGKKIYSSFDCKSGLWQVLLDKESQLLTA 341
Query: 889 FTCPYDVFAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIFMDDFSVFG-PNFDACLG 947
FTCP + + +PFGL AP+ F + S+ +++DD VF
Sbjct: 342 FTCPQGHYQWNVVPFGLKQAPSIFPKTYANSHSNQYSKYCCVYVDDILVFSNTGRKEHYI 401
Query: 948 NLALVLKRCQETNLVLNWEKCHFMVRDGIVLGHKVSERGIEVDRAK-------IEVIEKL 1000
++ +L+RC++ ++L+ +K + K++ G+E+D+ +E I K
Sbjct: 402 HVLNILRRCEKLGIILSKKKAQ-------LFKEKINFLGLEIDQGTHCPQNHILEHIHKF 454
Query: 1001 PPPT-NIKGIRSFLGHAGFYRRFIKDFSKLAKPMTNLLEKEAPFTFDENCLKAFESIKKS 1059
P + K ++ FLG + +I + + KP+ + L++++ +T+++ + IKK+
Sbjct: 455 PDRIEDKKQLQRFLGILTYASDYIPKLASIRKPLQSKLKEDSTWTWNDTDSQYMAKIKKN 514
Query: 1060 LVTAPVIVAPDWSLPFEIMCDASDLALGAVLCQKKERVLYVIYYASTVLNEAQRNYTTTE 1119
L + P + P+ + I DAS+ G +L Y+ YAS A+RNY + E
Sbjct: 515 LKSFPKLYHPEPNDKLVIETDASEEFWGGILKAIHNSHEYICRYASGSFKAAERNYHSNE 574
Query: 1120 KELLGVVFACEKFRPYILGFKVVVHTDHAALRH---LFAKQDSKP-RLIRWVLLLQEFDL 1175
KELL V+ +KF Y+ + ++ TD+ H + K D K RL+RW + L ++D
Sbjct: 575 KELLAVIRVIKKFSIYLTPSRFLIRTDNKNFTHFVNINLKGDRKQGRLVRWQMWLSQYDF 634
Query: 1176 EIIDRRGKDNGVADHL 1191
++ G N AD L
Sbjct: 635 DVEHIAGTKNVFADFL 650
>POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;
Protease (EC 3.4.23.-); Reverse transcriptase (EC
2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1886
Score = 130 bits (328), Expect = 2e-29
Identities = 124/462 (26%), Positives = 206/462 (43%), Gaps = 38/462 (8%)
Query: 753 LLEENYKPIVQPQRRLNPSMKDVVRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGITV 812
++ + K + +P + + P ++ + ++I LL VI P S+S+ S +V I
Sbjct: 1395 IINPDIKIMGRPIKHVTPGDEEAMTRQINLLLQMKVIRP-SESKHRSTAFIVRSGTEIDP 1453
Query: 813 VANENNELIPTRQVTKWRVCIDYRRLNSVTRKDHFPLPFIDQMLDKLAGHQYYCFLDGYS 872
+ + + K R+ +Y+ LN T D + LP I+ ++ K+ + Y D S
Sbjct: 1454 ITGKEKK-------GKERMVFNYKLLNENTESDQYSLPGINTIISKVGRSKIYSKFDLKS 1506
Query: 873 GYNQICVAPEDQEKTAFTCPYDVFAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIFM 932
G+ Q+ + E TAF ++ + MPFGL NAPA FQR M +F E I +++
Sbjct: 1507 GFWQVAMEEESVPWTAFLAGNKLYEWLVMPFGLKNAPAIFQRKMDNVFKG-TEKFIAVYI 1565
Query: 933 DDFSVFGPNFDACLGNLALVLKRCQETNLVLNWEKCHFMVRDGIVLGHKVSERGIEVDRA 992
DD VF + +L +L+ C+E L+L+ K + LG + I++
Sbjct: 1566 DDILVFSETAEQHSQHLYTMLQLCKENGLILSPTKMKIGTPEIDFLGASLGCTKIKLQPH 1625
Query: 993 KIEVI-----EKLPPPTNIKGIRSFLGHAGFYRRFIKDFSKLAKPMTNLLEKEAPFTFDE 1047
I I EKL P +G+RS+LG + R +I+D KL +P+ + +
Sbjct: 1626 IISKICDFSDEKLATP---EGMRSWLGILSYARNYIQDIGKLVQPLRQKMAPTGDKRMNP 1682
Query: 1048 NCLKAFESIKKSLVTAPVIVAPDWSLPFEIMCDASDLALGAVLCQKKER-----VLYVIY 1102
K IK+ + P + P I D GAV K + +
Sbjct: 1683 ETWKMVRQIKEKVKNLPDLQLPPKDSFIIIETDGCMTGWGAVCKWKMSKHDPRSTERICA 1742
Query: 1103 YASTVLNEAQRNYTTTEKELLGVVFACEKFRPYILGFK-VVVHTDHAALRHLFAK-QDSK 1160
YAS N + +T + E+ + +KF+ Y L K +++ +D A+ + K ++K
Sbjct: 1743 YASGSFNPIK---STIDAEIQAAIHGLDKFKIYYLDKKELIIRSDCEAIIKFYNKTNENK 1799
Query: 1161 PRLIRWVLLLQEF--------DLEIIDRRGKDNGVADHLSRL 1194
P +RW L +F E ID GK NG+AD LSR+
Sbjct: 1800 PSRVRW-LTFSDFLTGLGITVTFEHID--GKHNGLADALSRM 1838
>POL_FENV1 (P31792) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)] (Fragment)
Length = 1046
Score = 124 bits (311), Expect = 2e-27
Identities = 111/422 (26%), Positives = 180/422 (42%), Gaps = 35/422 (8%)
Query: 742 GISPAICMHKILLEENYKPIVQPQR-RLNPSMKDV---VRKEIIKLLDAGVIYPISDSEW 797
G+ A C I+++ KP P R P K+ ++ I + L+ GV+ P S W
Sbjct: 13 GLGRAKCQVPIIID--LKPTAMPVSIRQYPMSKEAHMGIQPHITRFLELGVLRPCR-SPW 69
Query: 798 VSPVQVVPKKGGITVVANENNELIPTRQVTKWRVCIDYRRLNSVTRKDHFPLPFIDQMLD 857
+P+ V K G TR +R D R +N T H +P +L
Sbjct: 70 NTPLLPVKKPG--------------TRD---YRPVQDLREVNKRTMDIHPTVPNPYNLLS 112
Query: 858 KLAGHQ-YYCFLDGYSGYNQICVAPEDQEKTAFTCP------YDVFAYKRMPFGLCNAPA 910
L+ + +Y LD + + +AP+ QE AF + R+P G N+P
Sbjct: 113 TLSPDRTWYTVLDLKDAFFCLPLAPQSQELFAFEWRDPERGISGQLTWTRLPQGFKNSPT 172
Query: 911 TFQRCMFAIFSDLIETCIEI----FMDDFSVFGPNFDACLGNLALVLKRCQETNLVLNWE 966
F + +D E+ ++DD + P +AC+ +L+ + + +
Sbjct: 173 LFDEALHRDLTDFRTQHPEVTLLQYVDDLLLAAPTKEACIRGTKHLLRELGDKGYRASAK 232
Query: 967 KCHFMVRDGIVLGHKVSERGIEVDRAKIEVIEKLPPPTNIKGIRSFLGHAGFYRRFIKDF 1026
K LG+ +SE + +IE + +PPP N + +R FLG AGF R +I F
Sbjct: 233 KAQICQTKVTYLGYILSEGKRWLTPGRIETVAHIPPPQNPREVREFLGTAGFCRLWIPGF 292
Query: 1027 SKLAKPMTNLLEKEAPFTFDENCLKAFESIKKSLVTAPVIVAPDWSLPFEIMCDASDLAL 1086
++LA P+ L ++ APFT+ E AFE++K++L++AP + PD S PF + D
Sbjct: 293 AELAAPLYALTKESAPFTWQEKHQSAFEALKEALLSAPALGLPDTSKPFTLFIDEKQGIA 352
Query: 1087 GAVLCQKKERVLYVIYYASTVLNEAQRNYTTTEKELLGVVFACEKFRPYILGFKVVVHTD 1146
VL QK + Y S L+ + + + + LG + V T
Sbjct: 353 KGVLTQKLGPWKRPVAYLSKKLDPVAAGWPPCLRIMAATAMLVKDSAKLTLGQPLTVITP 412
Query: 1147 HA 1148
HA
Sbjct: 413 HA 414
Score = 93.2 bits (230), Expect = 5e-18
Identities = 54/150 (36%), Positives = 79/150 (52%), Gaps = 2/150 (1%)
Query: 1356 WGIDFMGPFPPSFGCQYILLAVDYVSKWVEAAALSTNDSKVVVAFLKKNIFTRFGVPRAI 1415
W IDF P G +Y+L+ VD S WVEA + +V + + IF RFG+P+ I
Sbjct: 765 WEIDFTEVKPHYAGYKYLLVFVDTFSGWVEAYPTRQETAHMVAKKILEEIFPRFGLPKVI 824
Query: 1416 ISDGGTHFCNRAFESLLEKYGVKHKVSTPYHPQTSGQVEISNRELKRILEKV-VDSSRKD 1474
SD G F ++ + L G+ K+ Y PQ+SGQVE NR +K L K+ +++ KD
Sbjct: 825 GSDNGPAFVSQVSQGLARTLGINWKLHCAYRPQSSGQVERMNRTIKETLTKLTLETGLKD 884
Query: 1475 WSRKLDDALWAYRTAFKTPIGTSPFHLVFG 1504
W R L AL R G +P+ +++G
Sbjct: 885 WRRLLSLALLRARNT-PNRFGLTPYEILYG 913
>POL_BAEVM (P10272) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1189
Score = 122 bits (306), Expect = 8e-27
Identities = 116/451 (25%), Positives = 186/451 (40%), Gaps = 38/451 (8%)
Query: 742 GISPAICMHKILLEENYKPIVQP----QRRLNPSMKDVVRKEIIKLLDAGVIYPISDSEW 797
G+ A C I+++ KP P Q ++ +R+ IIK L+ GV+ P S W
Sbjct: 156 GLGRAKCQAPIIID--LKPTAVPVSIKQYPMSLEAHMGIRQHIIKFLELGVLRPCR-SPW 212
Query: 798 VSPVQVVPKKGGITVVANENNELIPTRQVTKWRVCIDYRRLNSVTRKDHFPLPFIDQMLD 857
+P+ V K G +R D R +N T H +P +L
Sbjct: 213 NTPLLPVKKPG-----------------TQDYRPVQDLREINKRTVDIHPTVPNPYNLLS 255
Query: 858 KLA-GHQYYCFLDGYSGYNQICVAPEDQEKTAFTCP------YDVFAYKRMPFGLCNAPA 910
L + +Y LD + + +AP+ QE AF + R+P G N+P
Sbjct: 256 TLKPDYSWYTVLDLKDAFFCLPLAPQSQELFAFEWKDPERGISGQLTWTRLPQGFKNSPT 315
Query: 911 TFQRCMFAIFSDLIETCIEI----FMDDFSVFGPNFDACLGNLALVLKRCQETNLVLNWE 966
F + +D E+ ++DD + P AC +L+ E + +
Sbjct: 316 LFDEALHRDLTDFRTQHPEVTLLQYVDDLLLAAPTKKACTQGTRHLLQELGEKGYRASAK 375
Query: 967 KCHFMVRDGIVLGHKVSERGIEVDRAKIEVIEKLPPPTNIKGIRSFLGHAGFYRRFIKDF 1026
K LG+ +SE + +IE + ++PPP N + +R FLG AGF R +I F
Sbjct: 376 KAQICQTKVTYLGYILSEGKRWLTPGRIETVARIPPPRNPREVREFLGTAGFCRLWIPGF 435
Query: 1027 SKLAKPMTNLLEKEAPFTFDENCLKAFESIKKSLVTAPVIVAPDWSLPFEIMCDASDLAL 1086
++LA P+ L ++ PFT+ AFE++KK+L++AP + PD S PF + D
Sbjct: 436 AELAAPLYALTKESTPFTWQTEHQLAFEALKKALLSAPALGLPDTSKPFTLFLDERQGIA 495
Query: 1087 GAVLCQKKERVLYVIYYASTVLNEAQRNYTTTEKELLGVVFACEKFRPYILGFKVVV--- 1143
VL QK + Y S L+ + + + + LG + V
Sbjct: 496 KGVLTQKLGPWKRPVAYLSKKLDPVAAGWPPCLRIMAATAMLVKDSAKLTLGQPLTVITP 555
Query: 1144 HTDHAALRHLFAKQDSKPRLIRWVLLLQEFD 1174
HT A +R + + RL + LL + D
Sbjct: 556 HTLEAIVRQPPDRWITNARLTHYQALLLDTD 586
Score = 93.2 bits (230), Expect = 5e-18
Identities = 54/150 (36%), Positives = 79/150 (52%), Gaps = 2/150 (1%)
Query: 1356 WGIDFMGPFPPSFGCQYILLAVDYVSKWVEAAALSTNDSKVVVAFLKKNIFTRFGVPRAI 1415
W IDF P G +Y+L+ VD S WVEA + +V + + IF RFG+P+ I
Sbjct: 908 WEIDFTEVKPHYAGYKYLLVFVDTFSGWVEAFPTRQETAHIVAKKILEEIFPRFGLPKVI 967
Query: 1416 ISDGGTHFCNRAFESLLEKYGVKHKVSTPYHPQTSGQVEISNRELKRILEKV-VDSSRKD 1474
SD G F ++ + L G+ K+ Y PQ+SGQVE NR +K L K+ +++ KD
Sbjct: 968 GSDNGPAFVSQVSQGLARILGINWKLHCAYRPQSSGQVERMNRTIKETLTKLTLETGLKD 1027
Query: 1475 WSRKLDDALWAYRTAFKTPIGTSPFHLVFG 1504
W R L AL R G +P+ +++G
Sbjct: 1028 WRRLLSLALLRARNT-PNRFGLTPYEILYG 1056
>POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1161
Score = 115 bits (289), Expect = 8e-25
Identities = 95/363 (26%), Positives = 160/363 (43%), Gaps = 34/363 (9%)
Query: 1272 IRRCIPEVNFEKILWYCHGSSYGGHFSGERTAAKVLQSGFYWPTLNRDSRAFVESCDRCQ 1331
IR P+ + EKI+ H ++ G + T KV S ++WP L +D + C +C
Sbjct: 807 IRIVPPKADREKIISTAHNIAHTGR---DATFLKV-SSKYWWPNLRKDVVKSIRQCKQCL 862
Query: 1332 RTGNISRRNEMPLKNILEIELFDVWGIDFMGPFPPSFGCQYILLAVDYVSKWVEAAALST 1391
T + + L+ + ++ FD + ID++GP PPS G ++L+ VD ++ +V
Sbjct: 863 VTNATNLTSPPILRPVKPLKPFDKFYIDYIGPLPPSNGYLHVLVVVDSMTGFVWLYPTKA 922
Query: 1392 NDSKVVVAFLKKNIFTRFGVPRAIISDGGTHFCNRAFESLLEKYGVKHKVSTPYHPQTSG 1451
+ V L N+ T +P+ + SD G F + F ++ G++ + STPYHPQ+SG
Sbjct: 923 PSTSATVKAL--NMLTSIAIPKVLHSDQGAAFTSSTFADWAKEKGIQLEFSTPYHPQSSG 980
Query: 1452 QVEISNRELKRILEKVVDSSRKDWSRKLDDALWAYRTAFKTPIGTSPFHLVFGKACHLPV 1511
+VE N ++KR+L K++ W L A ++ +P L+FG + P
Sbjct: 981 KVERKNSDIKRLLTKLLIGRPAKWYDLLPVVQLALNNSYSPSSKYTPHQLLFGVDSNTPF 1040
Query: 1512 ELEHKAYWAIRKLNFDWKVASEKRLLQLNELDEFRLRAYESASIYKEKTKKWHDRKILNR 1571
N D S + L L L E R ++ S ++ W
Sbjct: 1041 ------------ANSDTLDLSREEELSL--LQEIRSSLHQPTS-PPASSRSWSPS----- 1080
Query: 1572 EFVSGQLVLLFNSRLRLFPGKLKSRWSGPFVVKRVF-PHGAVEVENPETKNTFTVNGQRL 1630
GQLV +R P L+ RW P + V P + +++ + T +V+ +L
Sbjct: 1081 ---VGQLVQERVAR----PASLRPRWHKPTAILEVVNPRTVIILDHLGNRRTVSVDNLKL 1133
Query: 1631 KVY 1633
Y
Sbjct: 1134 TAY 1136
Score = 84.7 bits (208), Expect = 2e-15
Identities = 90/414 (21%), Positives = 170/414 (40%), Gaps = 31/414 (7%)
Query: 760 PIVQPQRRLNPSMKDVVRKEIIKLLDAGVIYPISDSEWVSPVQVVPKKGGITVVANENNE 819
P Q Q +NP K ++ I LL GV+ +S +PV VPK G
Sbjct: 174 PRPQKQYPINPKAKPSIQIVIDDLLKQGVLIQ-QNSTMNTPVYPVPKPDG---------- 222
Query: 820 LIPTRQVTKWRVCIDYRRLNSVTRKDHFPLPFIDQMLDKLAGHQYYCFLDGYSGYNQICV 879
KWR+ +DYR +N +L + +Y LD +G+ +
Sbjct: 223 --------KWRMVLDYREVNKTIPLIAAQNQHSAGILSSIYRGKYKTTLDLTNGFWAHPI 274
Query: 880 APEDQEKTAFTCPYDVFAYKRMPFGLCNAPATFQRCMFAIFSDLIETCIEIFMDDFSVFG 939
PE TAFT + + R+P G N+PA F + + ++ ++ ++DD +
Sbjct: 275 TPESYWLTAFTWQGKQYCWTRLPQGFLNSPALFTADVVDLLKEIPN--VQAYVDDIYISH 332
Query: 940 PNFDACLGNLALVLKRCQETNLVLNWEKCHFMVRDGIVLGHKVSERGIEVDRAKIEVIEK 999
+ L L + V++ +K R+ LG +++ G + + +
Sbjct: 333 DDPQEHLEQLEKIFSILLNAGYVVSLKKSEIAQREVEFLGFNITKEGRGLTDTFKQKLLN 392
Query: 1000 LPPPTNIKGIRSFLGHAGFYRRFIKDFSKLAKPMTNLLEKEAP--FTFDENCLKAFESIK 1057
+ PP ++K ++S LG F R FI ++S+L KP+ ++ ++ E+ + I
Sbjct: 393 ITPPKDLKQLQSILGLLNFARNFIPNYSELVKPLYTIVANANGKFISWTEDNSNQLQHII 452
Query: 1058 KSLVTAPVIVA--PDWSLPFEIMCDASDLALGAVLCQKKERVLYVIYYASTVLNEAQRNY 1115
L A + P+ L ++ S + K ++YV Y + ++A+ +
Sbjct: 453 SVLNQADNLEERNPETRLIIKVNSSPSAGYIRYYNEGSKRPIMYVNY----IFSKAEAKF 508
Query: 1116 TTTEKELLGVVFACEKFRPYILGFKVVVHTDHAALRHL--FAKQDSKPRLIRWV 1167
T TEK L + K +G +++V++ ++ + + K +RW+
Sbjct: 509 TQTEKLLTTMHKGLIKAMDLAMGQEILVYSPIVSMTKIQRTPLPERKALPVRWI 562
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.320 0.137 0.409
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 196,977,786
Number of Sequences: 164201
Number of extensions: 8684336
Number of successful extensions: 23870
Number of sequences better than 10.0: 191
Number of HSP's better than 10.0 without gapping: 126
Number of HSP's successfully gapped in prelim test: 67
Number of HSP's that attempted gapping in prelim test: 23347
Number of HSP's gapped (non-prelim): 435
length of query: 1649
length of database: 59,974,054
effective HSP length: 124
effective length of query: 1525
effective length of database: 39,613,130
effective search space: 60410023250
effective search space used: 60410023250
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 73 (32.7 bits)
Lotus: description of TM0330.2