
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC141323.18 - phase: 0
(1638 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 200 3e-50
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 188 1e-46
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 184 2e-45
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 176 4e-43
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 162 9e-39
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 152 7e-36
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 149 8e-35
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 149 8e-35
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 139 5e-32
POL_BAEVM (P10272) Pol polyprotein [Contains: Protease (EC 3.4.2... 135 7e-31
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot... 125 1e-27
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 124 2e-27
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 124 2e-27
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 124 2e-27
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 122 8e-27
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 122 8e-27
YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II 120 2e-26
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot... 106 6e-22
POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;... 101 2e-20
POL_MLVRK (P31795) Pol polyprotein [Contains: Protease (EC 3.4.2... 92 9e-18
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 200 bits (508), Expect = 3e-50
Identities = 122/435 (28%), Positives = 220/435 (50%), Gaps = 11/435 (2%)
Query: 626 DLFAWAPSDMPGIDIGVACHHLAVRTSVKPVVQRKRKMGEEKRKAVDEEVKKLQEAHFIC 685
D+FA + ++ G + G C + ++ +P+ Q+ R + + + + ++K+ I
Sbjct: 915 DVFAISDDEL-GRNSGTECV-IELKEGAEPIRQKPRPIPLALKPEIRKMIQKMLNQKVIR 972
Query: 686 EIKYPTWLANTVLVKKASGKWRMCVDYTDLNMACPKDPYPLPSIDHLIDNASGYKTLSFM 745
E K P W + VLVKK G RMC+DY +N + +PLP+I+ + + +G K +
Sbjct: 973 ESKSP-WSSPVVLVKKKDGSIRMCIDYRKVNKVVKNNAHPLPNIEATLQSLAGKKLYTVF 1031
Query: 746 DAYSGYNQIKMDPLDAPKTAFMTNQKNYHYRVMSFGLRNAGATFQRSMDTIFSNQIGRNL 805
D +G+ QI +D TAF + + + V+ FGL + A FQ +M+ I + +G
Sbjct: 1032 DMIAGFWQIPLDEKSKEITAFAIGSELFEWNVLPFGLVISPALFQGTMEEIIGDLLGVCA 1091
Query: 806 EVYIDDLVVKTSEKQSHSVDLKEIFQQIRKFSMRLNPTKCTFGVQAGKFLGFLLTKKGIE 865
VY+DDL++ + + + H D+KE +IRK M+L +KC + ++LG +T G+E
Sbjct: 1092 FVYVDDLLIASKDMEQHLQDVKEALTRIRKSGMKLRASKCHIAKKEVEYLGHKVTLDGVE 1151
Query: 866 ANPDKCQAIINMRNPCNIREVQQLTGRLAALSRFLSCAGDKAFAFFATIKKKEEFEWNQE 925
K + P N++E+Q G + +F+ A + + I K + W +E
Sbjct: 1152 TQEVKTDKMKQFSRPTNVKELQSFLGLVGYYRKFILNFAQIASSLTSLISAKVAWIWEKE 1211
Query: 926 CDEAFGKIKQFLTTPPILHRPTKGAGL------FLYLSVSENALSSVLVEES-DEREKPI 978
+ AF ++K+ + P+L +P A L +Y S + +VL +E D ++ PI
Sbjct: 1212 QEIAFQELKKLVCQTPVLAQPDVEAALKGDRPFMIYTDASRKGIGAVLAQEGPDGQQHPI 1271
Query: 979 YFVSRVLKGAELRYQKIEKLALAVIITARKLRPYFQSHKVVIRTNY-PVKQILGKLDLAG 1037
F S+ L AE RY + ALA++ R+ + + + T++ P+ +L LA
Sbjct: 1272 AFASKALSPAETRYHITDLEALAMMFALRRFKTIIYGTAITVFTDHKPLISLLKGSPLAD 1331
Query: 1038 RMLSWSVELSEYDIQ 1052
R+ WS+E+ E+D++
Sbjct: 1332 RLWRWSIEILEFDVK 1346
Score = 103 bits (258), Expect = 3e-21
Identities = 105/449 (23%), Positives = 198/449 (43%), Gaps = 32/449 (7%)
Query: 1189 IYVPRESNSRADLLAK--LASTKKPGNNRTVIQEVISAPSTDEKAVFE-------LNQEP 1239
+Y+ ++N+ AD L++ + + +++A T+ + + L E
Sbjct: 1348 VYLAGKANAVADALSRGGCPPNELEEEQTKELTSIVNAIQTELPDILDSSCWLERLKGED 1407
Query: 1240 EGWMTPLLKFLTGSFVAKNDEYAQLVRRRATKFVVIAGKLYKRGRASPLLRCLGEGETEL 1299
EGW ++ L G + + + ++ I G + K R + +
Sbjct: 1408 EGWKE-VIAALEGGKTKGTFKIVGIESEISLEYYKIVGGVLKNTEIEEQSRSVVPEKIRT 1466
Query: 1300 VLL-EVHEGVCGSHIGGRSLAAKLLRAGYYWPRMAHDCCEFVKKCDKCQRFSDKKNAPAN 1358
LL E+HEG+ H G + + +++ +YWP+M V+ C KC +D +
Sbjct: 1467 PLLKELHEGMLAGHFGIKKMW-RMVHRKFYWPQMRVCVENCVRTCAKCLCANDH-----S 1520
Query: 1359 ELTSVFSPW----PFHKWGVDIVGPFPQAPGQLKFLIVDVDYFTKWVEAEAVSKITAERV 1414
+LTS +P+ P D++ G ++++ +D FTK+ A + AE V
Sbjct: 1521 KLTSSLTPYRMTFPLEIVACDLMDVGLSVQGN-RYILTIIDLFTKYGTAVPIPDKKAETV 1579
Query: 1415 VKFYWKKIICHFG-LPKYIVTDNGTQFASSKVVNFCKQLGIETKFVSVIHPQANGQAESA 1473
+K + ++ G +P ++TD G +F + F L IE + +ANG E
Sbjct: 1580 LKAFVERWAIGEGRIPLKLLTDQGKEFVNGLFAQFTHMLKIEHITTKGYNSRANGAVERF 1639
Query: 1474 NKMIVNDIKKKLEDAKGLWAEQLHEVLWSYHTTPHSTTGETPFTMVYGADAMLPVEIDTP 1533
NK I++ +KKK W +Q+ +++Y+ H TGETP +++G D M P+E+
Sbjct: 1640 NKTIMHIMKKKTAVPME-WDDQVVYAVYAYNNCVHENTGETPMFLMHGRDVMGPLEMSGE 1698
Query: 1534 TWRREHFSEESNEVGIRCTMDMIDE---VREAAHIREFAAKQRAARRYNSKVIPRSMKEG 1590
+++ + +E T +++ +E A + + K ++Y SK R + G
Sbjct: 1699 DAVGINYA-DMDEYKHLLTQELLKVQKIAKEHAMREQESYKSLFDQKYASKK-HRFPQPG 1756
Query: 1591 DLVLKQVVAP---TRIGKLLPSWKGPYRV 1616
VL ++ + + KL+ W GPYRV
Sbjct: 1757 SRVLLEIPSEKLGAQCPKLVNKWSGPYRV 1785
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 188 bits (477), Expect = 1e-46
Identities = 117/413 (28%), Positives = 201/413 (48%), Gaps = 10/413 (2%)
Query: 646 HLAVRTSVKPVVQRKRKMGEEKRKAVDEEVKKLQEAHFICE----IKYPTWLANTVLVKK 701
H+ T P+ ++ + + V+ +V+++ I E PTW+
Sbjct: 197 HVLNTTHNSPIYSKQYPLAQTHEIEVENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDAS 256
Query: 702 ASGKWRMCVDYTDLNMACPKDPYPLPSIDHLIDNASGYKTLSFMDAYSGYNQIKMDPLDA 761
+ K+R+ +DY LN D YP+P++D ++ + + +D G++QI+MD
Sbjct: 257 GANKYRVVIDYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESI 316
Query: 762 PKTAFMTNQKNYHYRVMSFGLRNAGATFQRSMDTIFSNQIGRNLEVYIDDLVVKTSEKQS 821
KTAF T +Y Y M FGLRNA ATFQR M+ I + ++ VY+DD+++ ++
Sbjct: 317 SKTAFSTKSGHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTE 376
Query: 822 HSVDLKEIFQQIRKFSMRLNPTKCTFGVQAGKFLGFLLTKKGIEANPDKCQAIINMRNPC 881
H ++ +F ++ +++L KC F + FLG ++T GI+ NP K +AI++ P
Sbjct: 377 HLNSIQLVFTKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPT 436
Query: 882 NIREVQQLTGRLAALSRFLSCAGDKAFAFFATIKKKEEFEWNQ-ECDEAFGKIKQFLTTP 940
+E++ G +F+ D A + +KK+ + + + E EAF K+K +
Sbjct: 437 KDKEIRAFLGLTGYYRKFIPNYADIAKPMTSCLKKRTKIDTQKLEYIEAFEKLKALIIRD 496
Query: 941 PILHRPTKGAGLFLYLSVSENALSSVLVEESDEREKPIYFVSRVLKGAELRYQKIEKLAL 1000
PIL P L S AL +VL + PI F+SR L EL Y IEK L
Sbjct: 497 PILQLPDFEKKFVLTTDASNLALGAVL----SQNGHPISFISRTLNDHELNYSAIEKELL 552
Query: 1001 AVIITARKLRPYFQSHKVVIRTNY-PVKQILGKLDLAGRMLSWSVELSEYDIQ 1052
A++ + R Y + +I +++ P++ + + ++ W V LSEY +
Sbjct: 553 AIVWATKTFRHYLLGRQFLIASDHQPLRWLHNLKEPGAKLERWRVRLSEYQFK 605
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 184 bits (467), Expect = 2e-45
Identities = 112/363 (30%), Positives = 180/363 (48%), Gaps = 6/363 (1%)
Query: 690 PTWLANTVLVKKASGKWRMCVDYTDLNMACPKDPYPLPSIDHLIDNASGYKTLSFMDAYS 749
P W+ K+R+ +DY LN D +P+P++D ++ + +D
Sbjct: 246 PIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDRHPIPNMDEILGKLGRCNYFTTIDLAK 305
Query: 750 GYNQIKMDPLDAPKTAFMTNQKNYHYRVMSFGLRNAGATFQRSMDTIFSNQIGRNLEVYI 809
G++QI+MDP KTAF T +Y Y M FGL+NA ATFQR M+ I + ++ VY+
Sbjct: 306 GFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKNAPATFQRCMNDILRPLLNKHCLVYL 365
Query: 810 DDLVVKTSEKQSHSVDLKEIFQQIRKFSMRLNPTKCTFGVQAGKFLGFLLTKKGIEANPD 869
DD++V ++ H L +F+++ K +++L KC F Q FLG +LT GI+ NP+
Sbjct: 366 DDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDKCEFLKQETTFLGHVLTPDGIKPNPE 425
Query: 870 KCQAIINMRNPCNIREVQQLTGRLAALSRFLSCAGDKAFAFFATIKKKEEFE-WNQECDE 928
K +AI P +E++ G +F+ D A +KK + + N E D
Sbjct: 426 KIEAIQKYPIPTKPKEIKAFLGLTGYYRKFIPNFADIAKPMTKCLKKNMKIDTTNPEYDS 485
Query: 929 AFGKIKQFLTTPPILHRPTKGAGLFLYLSVSENALSSVLVEESDEREKPIYFVSRVLKGA 988
AF K+K ++ PIL P L S+ AL +VL ++ P+ ++SR L
Sbjct: 486 AFKKLKYLISEDPILKVPDFTKKFTLTTDASDVALGAVLSQDG----HPLSYISRTLNEH 541
Query: 989 ELRYQKIEKLALAVIITARKLRPYFQSHKVVIRTNY-PVKQILGKLDLAGRMLSWSVELS 1047
E+ Y IEK LA++ + R Y I +++ P+ + D ++ W V+LS
Sbjct: 542 EINYSTIEKELLAIVWATKTFRHYLLGRHFEISSDHQPLSWLYRMKDPNSKLTRWRVKLS 601
Query: 1048 EYD 1050
E+D
Sbjct: 602 EFD 604
Score = 33.5 bits (75), Expect = 4.9
Identities = 51/226 (22%), Positives = 89/226 (38%), Gaps = 21/226 (9%)
Query: 1296 ETELVLLEVHEGVCGSHIGGRSLAAKLLRAGYYWPRMAHDCCEFVKKCDKCQRFSDKKNA 1355
E + ++L HE + G KL YY+P + +C C K
Sbjct: 749 EFKELILTAHEKLLHP---GIQKTTKLFGETYYFPNSQLLIQNIINECSICNLA--KTEH 803
Query: 1356 PANELTSVFSPWPFH---KWGVDIVGPFPQAPGQLKFLIVDVDYFTKWVEAEAVSKITAE 1412
++ + +P P H K+ +DI + K + +D ++K+ E + T +
Sbjct: 804 RNTDMPTKTTPKPEHCREKFMIDIYS------SEGKHYVSCIDIYSKFATLEEIK--TKD 855
Query: 1413 RV-VKFYWKKIICHFGLPKYIVTDNGTQFASSKVVNFCKQLGIETKFVSVIHPQANGQAE 1471
+ K +I G PK + D F+S + + + +E + + A+ E
Sbjct: 856 WIECKNALMRIFNQLGKPKLLKADRDGAFSSLALKRWLESEEVELQLNTTKTGVAD--IE 913
Query: 1472 SANKMIVNDIKK-KLEDAKGLWAEQLHEVLWSY-HTTPHSTTGETP 1515
+K I I+ K D + ++ VL Y H T H TTG+TP
Sbjct: 914 RLHKTINEKIRIIKTSDDEETKLSKMETVLNIYNHKTKHDTTGQTP 959
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 176 bits (447), Expect = 4e-43
Identities = 132/471 (28%), Positives = 223/471 (47%), Gaps = 23/471 (4%)
Query: 614 EEILVQLLRENVDLFAWAPSDMPGIDIGVACHHLAVRTSVK-PVVQRKRKMGEEKRKAVD 672
+EIL LL E +F S M ++ V +RT+ + P+ + R V+
Sbjct: 85 QEILNSLLGEFPRIFEPPLSGM-SVETAVKAE---IRTNTQDPIYAKSYPYPVNMRGEVE 140
Query: 673 EEVKKLQEAHFI----CEIKYPTWLANTVLVKKASGKWRMCVDYTDLNMACPKDPYPLPS 728
++ +L + I P W+ ++RM VD+ LN D YP+P
Sbjct: 141 RQIDELLQDGIIRPSNSPYNSPIWIVPKKPKPNGEKQYRMVVDFKRLNTVTIPDTYPIPD 200
Query: 729 IDHLIDNASGYKTLSFMDAYSGYNQIKMDPLDAPKTAFMTNQKNYHYRVMSFGLRNAGAT 788
I+ + + K + +D SG++QI M D PKTAF T Y + + FGL+NA A
Sbjct: 201 INATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPFGLKNAPAI 260
Query: 789 FQRSMDTIFSNQIGRNLEVYIDDLVVKTSEKQSHSVDLKEIFQQIRKFSMRLNPTKCTFG 848
FQR +D I IG+ VYIDD++V + + +H +L+ + + K ++++N K F
Sbjct: 261 FQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNLEKSHFL 320
Query: 849 VQAGKFLGFLLTKKGIEANPDKCQAIINMRNPCNIREVQQLTGRLAALSRFLSCAGDKA- 907
+FLG+++T GI+A+P K +AI M P +++E+++ G + +F+ A
Sbjct: 321 DTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYRKFIQDYAKVAK 380
Query: 908 ------FAFFATIKKKEEFEWNQECDE----AFGKIKQFLTTPPILHRPTKGAGLFLYLS 957
+A IK + + DE +F +K L + IL P L
Sbjct: 381 PLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKPFHLTTD 440
Query: 958 VSENALSSVLVEESDEREKPIYFVSRVLKGAELRYQKIEKLALAVIITARKLRPY-FQSH 1016
S A+ +VL ++ R++PI ++SR L E Y IEK LA+I + LR Y + +
Sbjct: 441 ASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLDNLRAYLYGAG 500
Query: 1017 KVVIRTNY-PVKQILGKLDLAGRMLSWSVELSEYDIQFAPRNNIKSQVLAD 1066
+ + T++ P+ LG + ++ W + EY+ + + KS V+AD
Sbjct: 501 TIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELIYKPG-KSNVVAD 550
Score = 44.3 bits (103), Expect = 0.003
Identities = 45/208 (21%), Positives = 88/208 (41%), Gaps = 12/208 (5%)
Query: 1311 SHIGGRSLAAKLLRAGYYWPRMAHDCCEFVKKCDKCQRFSDKKNAPANELTSVFSP-WPF 1369
+H G + +LL YY+PRM+ C C+ + +++ L P +P
Sbjct: 704 AHRGPTEIRLQLLEK-YYFPRMSSTIRLQTSSCQCCKLYKYERHPNKPNLQPTPIPNYPC 762
Query: 1370 HKWGVDIVGPFPQAPGQLKFLIVDVDYFTKWVEAEAVSKITAERVVKFYWKKIICHFGLP 1429
+DI + + + +D F+K+ + + A ++ + + +F P
Sbjct: 763 EILHIDIFAL------EKRLYLSCIDKFSKFAKLFHLQS-KASVHLRETLVEALHYFTAP 815
Query: 1430 KYIVTDNGTQFASSKVVNFCKQLGIETKFVSVIHPQANGQAESANKMIVNDIKKKLEDAK 1489
K +V+DN V+N+ + L I+ + + NGQ E + + +I + L+D
Sbjct: 816 KVLVSDNERGLLCPTVLNYLRSLDIDLYYAPTQKSEVNGQVERFHSTFL-EIYRCLKDEL 874
Query: 1490 GLW--AEQLHEVLWSYHTTPHSTTGETP 1515
+ E +H + Y+T+ HS T P
Sbjct: 875 PTFKPVELVHIAVDRYNTSVHSVTNRKP 902
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 162 bits (409), Expect = 9e-39
Identities = 106/404 (26%), Positives = 185/404 (45%), Gaps = 8/404 (1%)
Query: 654 KPVVQRKRKMGEEKRKAVDEEVKKLQEAHFICEIKYPTWLANTVLVKKASG------KWR 707
+PV + + + + + +V+KL + + E + + +LV K S KWR
Sbjct: 313 EPVYTKNYRSPHSQVEEIQAQVQKLIKDKIV-EPSVSQYNSPLLLVPKKSSPNSDKKKWR 371
Query: 708 MCVDYTDLNMACPKDPYPLPSIDHLIDNASGYKTLSFMDAYSGYNQIKMDPLDAPKTAFM 767
+ +DY +N D +PLP ID ++D K S +D SG++QI++D T+F
Sbjct: 372 LVIDYRQINKKLLADKFPLPRIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFS 431
Query: 768 TNQKNYHYRVMSFGLRNAGATFQRSMDTIFSNQIGRNLEVYIDDLVVKTSEKQSHSVDLK 827
T+ +Y + + FGL+ A +FQR M FS +Y+DDL+V ++ +L
Sbjct: 432 TSNGSYRFTRLPFGLKIAPNSFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLT 491
Query: 828 EIFQQIRKFSMRLNPTKCTFGVQAGKFLGFLLTKKGIEANPDKCQAIINMRNPCNIREVQ 887
E+F + R+++++L+P KC+F + FLG T KGI + K I N P + +
Sbjct: 492 EVFGKCREYNLKLHPEKCSFFMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVPHDADSAR 551
Query: 888 QLTGRLAALSRFLSCAGDKAFAFFATIKKKEEFEWNQECDEAFGKIKQFLTTPPILHRPT 947
+ RF+ D + KK FEW EC +AF +K L P +L P
Sbjct: 552 RFVAFCNYYRRFIKNFADYSRHITRLCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPD 611
Query: 948 KGAGLFLYLSVSENALSSVLVEESDEREKPIYFVSRVLKGAELRYQKIEKLALAVIITAR 1007
+ S+ A +VL + + + P+ + SR E E+ A+
Sbjct: 612 FSKEFCITTDASKQACGAVLTQNHNGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAII 671
Query: 1008 KLRPYFQSHKVVIRTNY-PVKQILGKLDLAGRMLSWSVELSEYD 1050
RPY ++T++ P+ + ++ + ++ +EL EY+
Sbjct: 672 HFRPYIYGKHFTVKTDHRPLTYLFSMVNPSSKLTRIRLELEEYN 715
Score = 110 bits (276), Expect = 2e-23
Identities = 87/360 (24%), Positives = 163/360 (45%), Gaps = 29/360 (8%)
Query: 1271 KFVVIAGKLYKRGRAS---PLLRCLGEGETELVLLEVHEG-VCGSHIGGRSLAAKLLRAG 1326
KF + K+ K + + P+ + E E E +L +H+ + G H G AK+ R
Sbjct: 864 KFKNMGNKILKNLKVALLNPVTQINNEKEKEAILSTLHDDPIQGGHTGITKTLAKVKRH- 922
Query: 1327 YYWPRMAHDCCEFVKKCDKCQRFSDKKNAPANELTSVFSPWPFHKWGVDIVGPFPQAPGQ 1386
YYW M+ E+V+KC KCQ+ K+ + F + VD +GP P++
Sbjct: 923 YYWKNMSKYIKEYVRKCQKCQKAKTTKHTKTPMTITETPEHAFDRVVVDTIGPLPKSENG 982
Query: 1387 LKFLIVDVDYFTKWVEAEAVSKITAERVVKFYWKKIICHFGLPKYIVTDNGTQFASSKVV 1446
++ + + TK++ A ++ +A+ V K ++ I +G K +TD GT++ +S +
Sbjct: 983 NEYAVTLICDLTKYLVAIPIANKSAKTVAKAIFESFILKYGPMKTFITDMGTEYKNSIIT 1042
Query: 1447 NFCKQLGIETKFVSVIHPQANGQAESANKMIVNDIKKKLEDAKGLWAEQLHEVLWSYHTT 1506
+ CK L I+ + H Q G E +++ + I+ + K W L ++ ++TT
Sbjct: 1043 DLCKYLKIKNITSTAHHHQTVGVVERSHRTLNEYIRSYISTDKTDWDVWLQYFVYCFNTT 1102
Query: 1507 PHSTTGETPFTMVYGADAMLPVEID-----TPTWRREHFSEESN---EVGIRCTMDMIDE 1558
P+ +V+G + LP + P + + +++ES EV +++
Sbjct: 1103 QSMVHNYCPYELVFGRTSNLPKHFNKLHSIEPIYNIDDYAKESKYRLEVAYARARKLLE- 1161
Query: 1559 VREAAHIREFAAKQRAARRYNSKVIPRSMKEGDLVLKQVVAPTRIG-KLLPSWKGPYRVK 1617
A K++ Y+ KV ++ GD VL + +G KL + GPY+++
Sbjct: 1162 ----------AHKEKNKENYDLKVKDIELEVGDKVLLR----NEVGHKLDFKYTGPYKIE 1207
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 152 bits (384), Expect = 7e-36
Identities = 114/497 (22%), Positives = 224/497 (44%), Gaps = 19/497 (3%)
Query: 592 DVVIGPLPHQITKIGTSLSGLEEEILVQLLRENVDLFAWA-----PSDMPGIDIGVACHH 646
++ I H ++++ + ++E L + +E D+ A P + G++ V
Sbjct: 349 NIEISSSKHTLSQMNKVSNIVKEPELPDIYKEFKDITAETNTEKLPKPIKGLEFEVELTQ 408
Query: 647 LAVRTSVKPVVQRKRKMGEEKRKAVDEEVKKLQEAHFICEIKYPTWLANTVLVKKASGKW 706
R + R + K +A+++E+ + ++ I E K + V K G
Sbjct: 409 ENYRLPI-----RNYPLPPGKMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTL 462
Query: 707 RMCVDYTDLNMACPKDPYPLPSIDHLIDNASGYKTLSFMDAYSGYNQIKMDPLDAPKTAF 766
RM VDY LN + YPLP I+ L+ G + +D S Y+ I++ D K AF
Sbjct: 463 RMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAF 522
Query: 767 MTNQKNYHYRVMSFGLRNAGATFQRSMDTIFSNQIGRNLEVYIDDLVVKTSEKQSHSVDL 826
+ + Y VM +G+ A A FQ ++TI ++ Y+DD+++ + + H +
Sbjct: 523 RCPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHV 582
Query: 827 KEIFQQIRKFSMRLNPTKCTFGVQAGKFLGFLLTKKGIEANPDKCQAIINMRNPCNIREV 886
K++ Q+++ ++ +N KC F KF+G+ +++KG + ++ + P N +E+
Sbjct: 583 KDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKEL 642
Query: 887 QQLTGRLAALSRFLSCAGDKAFAFFATIKKKEEFEWNQECDEAFGKIKQFLTTPPILHRP 946
+Q G + L +F+ +KK ++W +A IKQ L +PP+L
Sbjct: 643 RQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHF 702
Query: 947 TKGAGLFLYLSVSENALSSVLVEE-SDEREKPIYFVSRVLKGAELRYQKIEKLALAVIIT 1005
+ L S+ A+ +VL ++ D++ P+ + S + A+L Y +K LA+I +
Sbjct: 703 DFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKS 762
Query: 1006 ARKLRPYFQSHKVVIRTNYPVKQILGKLDLAG-----RMLSWSVELSE--YDIQFAPRNN 1058
+ R Y +S + + ++G++ R+ W + L + ++I + P +
Sbjct: 763 LKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSA 822
Query: 1059 IKSQVLADFVVEFTSPI 1075
+V+ T PI
Sbjct: 823 NHIADALSRIVDETEPI 839
Score = 107 bits (267), Expect = 3e-22
Identities = 117/488 (23%), Positives = 206/488 (41%), Gaps = 58/488 (11%)
Query: 1160 GEYQTKDQQLSKYLARVRKLAGDFQFFEAIYVPRESNSRADLLAKLASTKKP------GN 1213
G + + +K LAR + DF F E Y P +N AD L+++ +P N
Sbjct: 788 GRITNESEPENKRLARWQLFLQDFNF-EINYRPGSANHIADALSRIVDETEPIPKDSEDN 846
Query: 1214 NRTVIQEVISAPSTDEKAVFELNQEPEGWMTPLLKFLTGSFVAKNDEYAQLVRRRATKFV 1273
+ + ++ + V E + T LL L N++ +R + +
Sbjct: 847 SINFVNQISITDDFKNQVVTEYTND-----TKLLNLLN------NED------KRVEENI 889
Query: 1274 VIAGKLYKRGRASPLLRCLGEGETEL---VLLEVHEGVCGSHIGGRSLAAKLLRAGYYWP 1330
+ L + LL +T+L ++ + HE H G L +LR + W
Sbjct: 890 QLKDGLLINSKDQILL----PNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRR-FTWK 944
Query: 1331 RMAHDCCEFVKKCDKCQRFSDKKNAPANELTSVF-SPWPFHKWGVDIVGPFPQAPGQLKF 1389
+ E+V+ C CQ + + P L + S P+ +D + P++ G
Sbjct: 945 GIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNAL 1004
Query: 1390 LIVDVDYFTKW-VEAEAVSKITAERVVKFYWKKIICHFGLPKYIVTDNGTQFASSKVVNF 1448
+V VD F+K + ITAE+ + + +++I +FG PK I+ DN F S +F
Sbjct: 1005 FVV-VDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDF 1063
Query: 1449 CKQLGIETKFVSVIHPQANGQAESANKMIVNDIKKKLEDAKGLWAEQLHEVLWSYHTTPH 1508
+ KF PQ +GQ E N+ + ++ W + + V SY+ H
Sbjct: 1064 AHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIH 1123
Query: 1509 STTGETPFTMVYG-ADAMLPVEIDTPTWRREHFSEESNEVGIRCTMDMIDEVREAAHIRE 1567
S T TPF +V+ + A+ P+E+ + FS++++E + T+ + V+E +
Sbjct: 1124 SATQMTPFEIVHRYSPALSPLELPS-------FSDKTDE-NSQETIQVFQTVKEHLNTNN 1175
Query: 1568 FAAKQRAARRYNSKVIP-RSMKEGDLVLKQVVAPTRIG------KLLPSWKGPYRVKEKL 1620
K + ++ K+ + GDLV+ V T+ G KL PS+ GP+ V +K
Sbjct: 1176 IKMK----KYFDMKIQEIEEFQPGDLVM---VKRTKTGFLHKSNKLAPSFAGPFYVLQKS 1228
Query: 1621 QHGAYKLE 1628
Y+L+
Sbjct: 1229 GPNNYELD 1236
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 149 bits (375), Expect = 8e-35
Identities = 113/497 (22%), Positives = 224/497 (44%), Gaps = 19/497 (3%)
Query: 592 DVVIGPLPHQITKIGTSLSGLEEEILVQLLRENVDLFAWA-----PSDMPGIDIGVACHH 646
++ I H ++++ + ++E L + +E D+ A P + G++ V
Sbjct: 349 NIEISSSKHTLSQMNKVSNIVKEPELPDIYKEFKDITAETNTEKLPKPIKGLEFEVELTQ 408
Query: 647 LAVRTSVKPVVQRKRKMGEEKRKAVDEEVKKLQEAHFICEIKYPTWLANTVLVKKASGKW 706
R + R + K +A+++E+ + ++ I E K + V K G
Sbjct: 409 ENYRLPI-----RNYPLPPGKMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTL 462
Query: 707 RMCVDYTDLNMACPKDPYPLPSIDHLIDNASGYKTLSFMDAYSGYNQIKMDPLDAPKTAF 766
RM VDY LN + YPLP I+ L+ G + +D S Y+ I++ D K AF
Sbjct: 463 RMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAF 522
Query: 767 MTNQKNYHYRVMSFGLRNAGATFQRSMDTIFSNQIGRNLEVYIDDLVVKTSEKQSHSVDL 826
+ + Y VM +G+ A A FQ ++TI ++ Y+D++++ + + H +
Sbjct: 523 RCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHV 582
Query: 827 KEIFQQIRKFSMRLNPTKCTFGVQAGKFLGFLLTKKGIEANPDKCQAIINMRNPCNIREV 886
K++ Q+++ ++ +N KC F KF+G+ +++KG + ++ + P N +E+
Sbjct: 583 KDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKEL 642
Query: 887 QQLTGRLAALSRFLSCAGDKAFAFFATIKKKEEFEWNQECDEAFGKIKQFLTTPPILHRP 946
+Q G + L +F+ +KK ++W +A IKQ L +PP+L
Sbjct: 643 RQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHF 702
Query: 947 TKGAGLFLYLSVSENALSSVLVEE-SDEREKPIYFVSRVLKGAELRYQKIEKLALAVIIT 1005
+ L S+ A+ +VL ++ D++ P+ + S + A+L Y +K LA+I +
Sbjct: 703 DFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKS 762
Query: 1006 ARKLRPYFQSHKVVIRTNYPVKQILGKLDLAG-----RMLSWSVELSE--YDIQFAPRNN 1058
+ R Y +S + + ++G++ R+ W + L + ++I + P +
Sbjct: 763 LKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSA 822
Query: 1059 IKSQVLADFVVEFTSPI 1075
+V+ T PI
Sbjct: 823 NHIADALSRIVDETEPI 839
Score = 107 bits (267), Expect = 3e-22
Identities = 117/488 (23%), Positives = 206/488 (41%), Gaps = 58/488 (11%)
Query: 1160 GEYQTKDQQLSKYLARVRKLAGDFQFFEAIYVPRESNSRADLLAKLASTKKP------GN 1213
G + + +K LAR + DF F E Y P +N AD L+++ +P N
Sbjct: 788 GRITNESEPENKRLARWQLFLQDFNF-EINYRPGSANHIADALSRIVDETEPIPKDSEDN 846
Query: 1214 NRTVIQEVISAPSTDEKAVFELNQEPEGWMTPLLKFLTGSFVAKNDEYAQLVRRRATKFV 1273
+ + ++ + V E + T LL L N++ +R + +
Sbjct: 847 SINFVNQISITDDFKNQVVTEYTND-----TKLLNLLN------NED------KRVEENI 889
Query: 1274 VIAGKLYKRGRASPLLRCLGEGETEL---VLLEVHEGVCGSHIGGRSLAAKLLRAGYYWP 1330
+ L + LL +T+L ++ + HE H G L +LR + W
Sbjct: 890 QLKDGLLINSKDQILL----PNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRR-FTWK 944
Query: 1331 RMAHDCCEFVKKCDKCQRFSDKKNAPANELTSVF-SPWPFHKWGVDIVGPFPQAPGQLKF 1389
+ E+V+ C CQ + + P L + S P+ +D + P++ G
Sbjct: 945 GIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNAL 1004
Query: 1390 LIVDVDYFTKW-VEAEAVSKITAERVVKFYWKKIICHFGLPKYIVTDNGTQFASSKVVNF 1448
+V VD F+K + ITAE+ + + +++I +FG PK I+ DN F S +F
Sbjct: 1005 FVV-VDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDF 1063
Query: 1449 CKQLGIETKFVSVIHPQANGQAESANKMIVNDIKKKLEDAKGLWAEQLHEVLWSYHTTPH 1508
+ KF PQ +GQ E N+ + ++ W + + V SY+ H
Sbjct: 1064 AHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIH 1123
Query: 1509 STTGETPFTMVYG-ADAMLPVEIDTPTWRREHFSEESNEVGIRCTMDMIDEVREAAHIRE 1567
S T TPF +V+ + A+ P+E+ + FS++++E + T+ + V+E +
Sbjct: 1124 SATQMTPFEIVHRYSPALSPLELPS-------FSDKTDE-NSQETIQVFQTVKEHLNTNN 1175
Query: 1568 FAAKQRAARRYNSKVIP-RSMKEGDLVLKQVVAPTRIG------KLLPSWKGPYRVKEKL 1620
K + ++ K+ + GDLV+ V T+ G KL PS+ GP+ V +K
Sbjct: 1176 IKMK----KYFDMKIQEIEEFQPGDLVM---VKRTKTGFLHKSNKLAPSFAGPFYVLQKS 1228
Query: 1621 QHGAYKLE 1628
Y+L+
Sbjct: 1229 GPNNYELD 1236
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 149 bits (375), Expect = 8e-35
Identities = 113/497 (22%), Positives = 224/497 (44%), Gaps = 19/497 (3%)
Query: 592 DVVIGPLPHQITKIGTSLSGLEEEILVQLLRENVDLFAWA-----PSDMPGIDIGVACHH 646
++ I H ++++ + ++E L + +E D+ A P + G++ V
Sbjct: 349 NIEISSSKHTLSQMNKVSNIVKEPELPDIYKEFKDITAETNTEKLPKPIKGLEFEVELTQ 408
Query: 647 LAVRTSVKPVVQRKRKMGEEKRKAVDEEVKKLQEAHFICEIKYPTWLANTVLVKKASGKW 706
R + R + K +A+++E+ + ++ I E K + V K G
Sbjct: 409 ENYRLPI-----RNYPLPPGKMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTL 462
Query: 707 RMCVDYTDLNMACPKDPYPLPSIDHLIDNASGYKTLSFMDAYSGYNQIKMDPLDAPKTAF 766
RM VDY LN + YPLP I+ L+ G + +D S Y+ I++ D K AF
Sbjct: 463 RMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAF 522
Query: 767 MTNQKNYHYRVMSFGLRNAGATFQRSMDTIFSNQIGRNLEVYIDDLVVKTSEKQSHSVDL 826
+ + Y VM +G+ A A FQ ++TI ++ Y+D++++ + + H +
Sbjct: 523 RCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHV 582
Query: 827 KEIFQQIRKFSMRLNPTKCTFGVQAGKFLGFLLTKKGIEANPDKCQAIINMRNPCNIREV 886
K++ Q+++ ++ +N KC F KF+G+ +++KG + ++ + P N +E+
Sbjct: 583 KDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKEL 642
Query: 887 QQLTGRLAALSRFLSCAGDKAFAFFATIKKKEEFEWNQECDEAFGKIKQFLTTPPILHRP 946
+Q G + L +F+ +KK ++W +A IKQ L +PP+L
Sbjct: 643 RQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHF 702
Query: 947 TKGAGLFLYLSVSENALSSVLVEE-SDEREKPIYFVSRVLKGAELRYQKIEKLALAVIIT 1005
+ L S+ A+ +VL ++ D++ P+ + S + A+L Y +K LA+I +
Sbjct: 703 DFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKS 762
Query: 1006 ARKLRPYFQSHKVVIRTNYPVKQILGKLDLAG-----RMLSWSVELSE--YDIQFAPRNN 1058
+ R Y +S + + ++G++ R+ W + L + ++I + P +
Sbjct: 763 LKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSA 822
Query: 1059 IKSQVLADFVVEFTSPI 1075
+V+ T PI
Sbjct: 823 NHIADALSRIVDETEPI 839
Score = 107 bits (267), Expect = 3e-22
Identities = 117/488 (23%), Positives = 206/488 (41%), Gaps = 58/488 (11%)
Query: 1160 GEYQTKDQQLSKYLARVRKLAGDFQFFEAIYVPRESNSRADLLAKLASTKKP------GN 1213
G + + +K LAR + DF F E Y P +N AD L+++ +P N
Sbjct: 788 GRITNESEPENKRLARWQLFLQDFNF-EINYRPGSANHIADALSRIVDETEPIPKDSEDN 846
Query: 1214 NRTVIQEVISAPSTDEKAVFELNQEPEGWMTPLLKFLTGSFVAKNDEYAQLVRRRATKFV 1273
+ + ++ + V E + T LL L N++ +R + +
Sbjct: 847 SINFVNQISITDDFKNQVVTEYTND-----TKLLNLLN------NED------KRVEENI 889
Query: 1274 VIAGKLYKRGRASPLLRCLGEGETEL---VLLEVHEGVCGSHIGGRSLAAKLLRAGYYWP 1330
+ L + LL +T+L ++ + HE H G L +LR + W
Sbjct: 890 QLKDGLLINSKDQILL----PNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRR-FTWK 944
Query: 1331 RMAHDCCEFVKKCDKCQRFSDKKNAPANELTSVF-SPWPFHKWGVDIVGPFPQAPGQLKF 1389
+ E+V+ C CQ + + P L + S P+ +D + P++ G
Sbjct: 945 GIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNAL 1004
Query: 1390 LIVDVDYFTKW-VEAEAVSKITAERVVKFYWKKIICHFGLPKYIVTDNGTQFASSKVVNF 1448
+V VD F+K + ITAE+ + + +++I +FG PK I+ DN F S +F
Sbjct: 1005 FVV-VDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDF 1063
Query: 1449 CKQLGIETKFVSVIHPQANGQAESANKMIVNDIKKKLEDAKGLWAEQLHEVLWSYHTTPH 1508
+ KF PQ +GQ E N+ + ++ W + + V SY+ H
Sbjct: 1064 AHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIH 1123
Query: 1509 STTGETPFTMVYG-ADAMLPVEIDTPTWRREHFSEESNEVGIRCTMDMIDEVREAAHIRE 1567
S T TPF +V+ + A+ P+E+ + FS++++E + T+ + V+E +
Sbjct: 1124 SATQMTPFEIVHRYSPALSPLELPS-------FSDKTDE-NSQETIQVFQTVKEHLNTNN 1175
Query: 1568 FAAKQRAARRYNSKVIP-RSMKEGDLVLKQVVAPTRIG------KLLPSWKGPYRVKEKL 1620
K + ++ K+ + GDLV+ V T+ G KL PS+ GP+ V +K
Sbjct: 1176 IKMK----KYFDMKIQEIEEFQPGDLVM---VKRTKTGFLHKSNKLAPSFAGPFYVLQKS 1228
Query: 1621 QHGAYKLE 1628
Y+L+
Sbjct: 1229 GPNNYELD 1236
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 139 bits (351), Expect = 5e-32
Identities = 106/404 (26%), Positives = 193/404 (47%), Gaps = 33/404 (8%)
Query: 671 VDEEVKKLQEAHFICEIKYPTWLANTVLVKKASGKW-----RMCVDYTDLNMACPKDPYP 725
V+ EVK+L + I + P V+ KK + + R+ +D+ LN D YP
Sbjct: 197 VNNEVKQLLKDGIIRPSRSPYNSPTWVVDKKGTDAFGNPNKRLVIDFRKLNEKTIPDRYP 256
Query: 726 LPSIDHLIDNASGYKTLSFMDAYSGYNQIKMDPLDAPKTAFMTNQKNYHYRVMSFGLRNA 785
+PSI ++ N K + +D SGY+QI + D KT+F N Y + + FGLRNA
Sbjct: 257 MPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFSVNGGKYEFCRLPFGLRNA 316
Query: 786 GATFQRSMDTIFSNQIGRNLEVYIDDLVVKTSEKQSHSVDLKEIFQQIRKFSMRLNPTKC 845
+ FQR++D + QIG+ VY+DD+++ + + H + + + + +MR++ K
Sbjct: 317 SSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHIDTVLKCLIDANMRVSQEKT 376
Query: 846 TFGVQAGKFLGFLLTKKGIEANPDKCQAIINMRNPCNIREVQQLTGRLAALSRFLSCAGD 905
F ++ ++LGF+++K G +++P+K +AI P + +V+ G + F+
Sbjct: 377 RFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRSFLGLASYYRVFI----- 431
Query: 906 KAFAFFAT----------------IKKKEEFEWNQECDEAFGKIKQFLTTPP-ILHRPTK 948
K FA A + KK E+N+ AF +++ L + IL P
Sbjct: 432 KDFAAIARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNILASEDVILKYPDF 491
Query: 949 GAGLFLYLSVSENALSSVLVEESDEREKPIYFVSRVLKGAELRYQKIEKLALAVIITARK 1008
L S + + +VL +E +PI +SR LK E Y E+ LA++ K
Sbjct: 492 KKPFDLTTDASASGIGAVLSQEG----RPITMISRTLKQPEQNYATNERELLAIVWALGK 547
Query: 1009 LRPY-FQSHKVVIRTNY-PVKQILGKLDLAGRMLSWSVELSEYD 1050
L+ + + S ++ I T++ P+ + + ++ W + +++
Sbjct: 548 LQNFLYGSREINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQHN 591
Score = 43.9 bits (102), Expect = 0.004
Identities = 49/201 (24%), Positives = 79/201 (38%), Gaps = 18/201 (8%)
Query: 1327 YYWPRMAHDCCEFVKKCDKCQRFSDKKNAPANELTSVFSPWPFHKWGVDIVGPFPQAPGQ 1386
YY+P+M E V C C + ++ EL +P P + + + F
Sbjct: 764 YYFPKMGSLAKEVVANCRVCTQAKYDRHPKKQELGE--TPIPSYTGEMVHIDIFST---D 818
Query: 1387 LKFLIVDVDYFTKW-----VEAEAVSKITAERVVKFYWKKIICHFGLPKYIVTDNGTQFA 1441
K + +D F+K+ V + + ITA + +II F K + DN F
Sbjct: 819 RKLFLTCIDKFSKYAIVQPVVSRTIVDITAPLL------QIINLFPNIKTVYCDNEPAFN 872
Query: 1442 SSKVVNFCK-QLGIETKFVSVIHPQANGQAESANKMIVNDIK-KKLEDAKGLWAEQLHEV 1499
S V + K GI+ +H +NGQ E + + + KL+ E +
Sbjct: 873 SETVTSMLKNSFGIDIVNAPPLHSSSNGQVERFHSTLAEIARCLKLDKKTNDTVELILRA 932
Query: 1500 LWSYHTTPHSTTGETPFTMVY 1520
Y+ T HS T E P +V+
Sbjct: 933 TIEYNKTVHSVTRERPIEVVH 953
>POL_BAEVM (P10272) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1189
Score = 135 bits (341), Expect = 7e-31
Identities = 212/1019 (20%), Positives = 381/1019 (36%), Gaps = 103/1019 (10%)
Query: 649 VRTSVKPVVQRKRKMGEEKRKAVDEEVKKLQEAHFICEIKYPTWLANTVLVKKASGK-WR 707
++ + PV ++ M E + + + K E + + P W + VKK + +R
Sbjct: 170 LKPTAVPVSIKQYPMSLEAHMGIRQHIIKFLELGVLRPCRSP-WNTPLLPVKKPGTQDYR 228
Query: 708 MCVDYTDLNMAC----PKDPYPLPSIDHLIDNASGYKTLSFMDAYSGYNQIKMDPLDAPK 763
D ++N P P P + L + S Y L DA+ + + P
Sbjct: 229 PVQDLREINKRTVDIHPTVPNPYNLLSTLKPDYSWYTVLDLKDAFFC---LPLAPQSQEL 285
Query: 764 TAFMTNQKN------YHYRVMSFGLRNAGATFQRSMDTIFSNQIGRNLEV----YIDDLV 813
AF + + G +N+ F ++ ++ ++ EV Y+DDL+
Sbjct: 286 FAFEWKDPERGISGQLTWTRLPQGFKNSPTLFDEALHRDLTDFRTQHPEVTLLQYVDDLL 345
Query: 814 VKTSEKQSHSVDLKEIFQQIRKFSMRLNPTKCTFGVQAGKFLGFLLTKKGIEANPDKCQA 873
+ K++ + + + Q++ + R + K +LG++L++ P + +
Sbjct: 346 LAAPTKKACTQGTRHLLQELGEKGYRASAKKAQICQTKVTYLGYILSEGKRWLTPGRIET 405
Query: 874 IINMRNPCNIREVQQLTGRLAALSRFLSCAGDKAFAFFATIKKKEEFEWNQECDEAFGKI 933
+ + P N REV++ G ++ + A +A K+ F W E AF +
Sbjct: 406 VARIPPPRNPREVREFLGTAGFCRLWIPGFAELAAPLYALTKESTPFTWQTEHQLAFEAL 465
Query: 934 KQFLTTPPILHRPTKGAGLFLYLSVSENALSSVLVEESDEREKPIYFVSRVLKGAELRYQ 993
K+ L + P L P L+L + VL ++ ++P+ ++S+ L +
Sbjct: 466 KKALLSAPALGLPDTSKPFTLFLDERQGIAKGVLTQKLGPWKRPVAYLSKKLDPVAAGWP 525
Query: 994 KIEKLALAVIITARKLRPYFQSHKVVIRTNYPVKQIL----GKLDLAGRMLSWSVELSEY 1049
++ A + + + + T + ++ I+ + R+ + L +
Sbjct: 526 PCLRIMAATAMLVKDSAKLTLGQPLTVITPHTLEAIVRQPPDRWITNARLTHYQALLLDT 585
Query: 1050 D-IQFAPRNNIKSQVLADFVVEFTSPIQ-----------------EAVP---HVWLLSVD 1088
D +QF P + L SP + +P H W D
Sbjct: 586 DRVQFGPPVTLNPATLLPVPENQPSPHDCRQVLAETHGTREDLKDQELPDADHTWY--TD 643
Query: 1089 GSSNIKGSG--AGIILEGPGDLLIEQSLKFDFKASNNQAEYEALIAGMLLAQEMGAKNLR 1146
GSS + AG + + + QSL A +AE AL + L++ A N+
Sbjct: 644 GSSYLDSGTRRAGAAVVDGHNTIWAQSLPPGTSAQ--KAELIALTKALELSKGKKA-NIY 700
Query: 1147 ARSDSQLMTNQISGEYQTKDQQLSKYLARVRKLAGDFQFFEAIYVPRESNSRADLLAKLA 1206
S T G + L+ ++ A +A+++P+E +A
Sbjct: 701 TDSRYAFATAHTHGSIYERRGLLTSEGKEIKNKAEIIALLKALFLPQE----------VA 750
Query: 1207 STKKPGNNRTVIQEVISAPSTDEKA-------VFELNQEPEGWMTPLLKFLTGSFVAKND 1259
PG+ + + D A V L EP+ ++ ++ +++
Sbjct: 751 IIHCPGHQKGQDPVAVGNRQADRVARQAAMAEVLTLATEPDNTSHITIEH---TYTSEDQ 807
Query: 1260 EYAQLVRRRATKFVVIAGKLYKRGRASPLLRCLGEGETELVLLEVHEGVCGSHIGGRSLA 1319
E A+ + K K G+ L + E ++ ++H +H+G R L
Sbjct: 808 EEARAIGATENKDT---RNWEKEGKI-----VLPQKEALAMIQQMH---AWTHLGNRKLK 856
Query: 1320 AKLLRAGYYWPRMAHDCCEFVKKCDKCQRFS-DKKNAPANELTSVFSPWPFHKWGVDIVG 1378
+ + + PR + + C CQ+ + PA + T P + W +D
Sbjct: 857 LLIEKTDFLIPRASTLIEQVTSACKVCQQVNAGATRVPAGKRTRGNRPGVY--WEIDFTE 914
Query: 1379 PFPQAPGQLKFLIVDVDYFTKWVEAEAVSKITAERVVKFYWKKIICHFGLPKYIVTDNGT 1438
P G K+L+V VD F+ WVEA + TA V K ++I FGLPK I +DNG
Sbjct: 915 VKPHYAGY-KYLLVFVDTFSGWVEAFPTRQETAHIVAKKILEEIFPRFGLPKVIGSDNGP 973
Query: 1439 QFASSKVVNFCKQLGIETKFVSVIHPQANGQAESANKMIVNDIKK-KLEDAKGLWAEQLH 1497
F S + LGI K PQ++GQ E N+ I + K LE W L
Sbjct: 974 AFVSQVSQGLARILGINWKLHCAYRPQSSGQVERMNRTIKETLTKLTLETGLKDWRRLLS 1033
Query: 1498 EVLWSYHTTPHSTTGETPFTMVYGADAMLPVEIDTPTWRREHFSEESNEVGIRCTMDMID 1557
L TP + G TP+ ++YG L +++ FS +++ ++ + +
Sbjct: 1034 LALLRARNTP-NRFGLTPYEILYGGPPPLSTLLNS-------FSPSNSKTDLQARLKGLQ 1085
Query: 1558 EVREAAHIREFAAKQRAARRYNSKVIPRSMKEGDLVLKQVVAPTRIGKLLPSWKGPYRV 1616
V+ + A R + GD V V R L P WKGPY V
Sbjct: 1086 AVQ-----AQIWAPLAELYRPGHSQTSHPFQVGDSV---YVRRHRSQGLEPRWKGPYIV 1136
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 659
Score = 125 bits (313), Expect = 1e-27
Identities = 112/425 (26%), Positives = 182/425 (42%), Gaps = 31/425 (7%)
Query: 668 RKAVDEEVKKLQEAHFICEIKYPTWLANTVLVKKAS----GKWRMCVDYTDLNMACPKDP 723
R+ D ++K+L E I K T ++ LV+ + GK RM V+Y +N A D
Sbjct: 241 REEFDRQIKELLELKVIKPSK-STHMSPAFLVENEAERRRGKKRMVVNYKAMNKATKGDA 299
Query: 724 YPLPSIDHLIDNASGYKTLSFMDAYSGYNQIKMDPLDAPKTAFMTNQKNYHYRVMSFGLR 783
+ LP+ D L+ G K S D SG Q+ +D TAF Q +Y + V+ FGL+
Sbjct: 300 HNLPNKDELLTLVRGKKIYSSFDCKSGLWQVLLDKESQLLTAFTCPQGHYQWNVVPFGLK 359
Query: 784 NAGATFQRSMDTIFSNQIGRNLEVYIDD-LVVKTSEKQSHSVDLKEIFQQIRKFSMRLNP 842
A + F ++ SNQ + VY+DD LV + ++ H + + I ++ K + L+
Sbjct: 360 QAPSIFPKTYANSHSNQYSKYCCVYVDDILVFSNTGRKEHYIHVLNILRRCEKLGIILSK 419
Query: 843 TKCTFGVQAGKFLGFLLTK----------KGIEANPDKCQAIINMRNPCNIREVQQLTGR 892
K + FLG + + + I PD+ + + +++Q+ G
Sbjct: 420 KKAQLFKEKINFLGLEIDQGTHCPQNHILEHIHKFPDRIE---------DKKQLQRFLGI 470
Query: 893 LAALSRFLSCAGDKAFAFFATIKKKEEFEWNQECDEAFGKIKQFLTTPPILHRPTKGAGL 952
L S ++ + +K+ + WN + KIK+ L + P L+ P L
Sbjct: 471 LTYASDYIPKLASIRKPLQSKLKEDSTWTWNDTDSQYMAKIKKNLKSFPKLYHPEPNDKL 530
Query: 953 FLYLSVSENALSSVLVEESDEREKPIYFVSRVLKGAELRYQKIEKLALAVIITARKLRPY 1012
+ SE +L + E + S K AE Y EK LAVI +K Y
Sbjct: 531 VIETDASEEFWGGILKAIHNSHEYICRYASGSFKAAERNYHSNEKELLAVIRVIKKFSIY 590
Query: 1013 FQSHKVVIRTNYPVKQILGKLDL-----AGRMLSWSVELSEYDIQFAPRNNIKSQVLADF 1067
+ +IRT+ ++L GR++ W + LS+YD K+ V ADF
Sbjct: 591 LTPSRFLIRTDNKNFTHFVNINLKGDRKQGRLVRWQMWLSQYDFDVEHIAGTKN-VFADF 649
Query: 1068 VVEFT 1072
+ E T
Sbjct: 650 LQENT 654
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 124 bits (312), Expect = 2e-27
Identities = 109/410 (26%), Positives = 178/410 (42%), Gaps = 16/410 (3%)
Query: 654 KPVVQRKRKMGEEKRKAVDEEVKKLQEAHFICEIKYPTWLANTVLV----KKASGKWRMC 709
K + + K R+ D+++K+L + I K P +A LV +K GK RM
Sbjct: 245 KAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPH-MAPAFLVNNEAEKRRGKKRMV 303
Query: 710 VDYTDLNMACPKDPYPLPSIDHLIDNASGYKTLSFMDAYSGYNQIKMDPLDAPKTAFMTN 769
V+Y +N A D Y LP+ D L+ G K S D SG+ Q+ +D P TAF
Sbjct: 304 VNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCP 363
Query: 770 QKNYHYRVMSFGLRNAGATFQRSMDTIFSNQIGRNLEVYIDDLVVKTSEKQSHSVDLKEI 829
Q +Y + V+ FGL+ A + FQR MD F + VY+DD++V ++ ++ H + + I
Sbjct: 364 QGHYEWNVVPFGLKQAPSIFQRHMDEAF-RVFRKFCCVYVDDILVFSNNEEDHLLHVAMI 422
Query: 830 FQQIRKFSMRLNPTKCTFGVQAGKFLGFLLTKKGIEANPDKCQAIINMRNPC-NIREVQQ 888
Q+ + + L+ K + FLG + + + + I + + +++Q+
Sbjct: 423 LQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQR 482
Query: 889 LTGRLAALSRFLSCAGDKAFAFFATIKKKEEFEWNQECDEAFGKIKQFLTTPPILHRPTK 948
G L S ++ A +K+ + W +E K+K+ L P LH P
Sbjct: 483 FLGILTYASDYIPKLAQIRKPLQAKLKENVPWRWTKEDTLYMQKVKKNLQGFPPLHHPLP 542
Query: 949 GAGLFLYLSVSEN----ALSSVLVEESDEREKPIYFVSRVLKGAELRYQKIEKLALAVII 1004
L + S++ L ++ + E E + S K AE Y +K LAVI
Sbjct: 543 EEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKNYHSNDKETLAVIN 602
Query: 1005 TARKLRPYFQSHKVVIRTNYPVKQILGKLDL-----AGRMLSWSVELSEY 1049
T +K Y +IRT+ + L+ GR + W LS Y
Sbjct: 603 TIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHY 652
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 124 bits (312), Expect = 2e-27
Identities = 109/410 (26%), Positives = 179/410 (43%), Gaps = 16/410 (3%)
Query: 654 KPVVQRKRKMGEEKRKAVDEEVKKLQEAHFICEIKYPTWLANTVLV----KKASGKWRMC 709
K + + K R+ D+++K+L + I K P +A LV +K GK RM
Sbjct: 245 KAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPH-MAPAFLVNNEAEKRRGKKRMV 303
Query: 710 VDYTDLNMACPKDPYPLPSIDHLIDNASGYKTLSFMDAYSGYNQIKMDPLDAPKTAFMTN 769
V+Y +N A D Y LP+ D L+ G K S D SG+ Q+ +D P TAF
Sbjct: 304 VNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCP 363
Query: 770 QKNYHYRVMSFGLRNAGATFQRSMDTIFSNQIGRNLEVYIDDLVVKTSEKQSHSVDLKEI 829
Q +Y + V+ FGL+ A + FQR MD F + VY+DD++V ++ ++ H + + I
Sbjct: 364 QGHYEWNVVPFGLKQAPSIFQRHMDEAF-RVFRKFCCVYVDDILVFSNNEEDHLLHVAMI 422
Query: 830 FQQIRKFSMRLNPTKCTFGVQAGKFLGFLLTKKGIEANPDKCQAIINMRNPC-NIREVQQ 888
Q+ + + L+ K + FLG + + + + I + + +++Q+
Sbjct: 423 LQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQR 482
Query: 889 LTGRLAALSRFLSCAGDKAFAFFATIKKKEEFEWNQECDEAFGKIKQFLTTPPILHRPTK 948
G L S ++ A +K+ ++W +E K+K+ L P LH P
Sbjct: 483 FLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLP 542
Query: 949 GAGLFLYLSVSEN----ALSSVLVEESDEREKPIYFVSRVLKGAELRYQKIEKLALAVII 1004
L + S++ L ++ + E E + S K AE Y +K LAVI
Sbjct: 543 EEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNYHSNDKETLAVIN 602
Query: 1005 TARKLRPYFQSHKVVIRTNYPVKQILGKLDL-----AGRMLSWSVELSEY 1049
T +K Y +IRT+ + L+ GR + W LS Y
Sbjct: 603 TIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHY 652
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 124 bits (312), Expect = 2e-27
Identities = 109/410 (26%), Positives = 179/410 (43%), Gaps = 16/410 (3%)
Query: 654 KPVVQRKRKMGEEKRKAVDEEVKKLQEAHFICEIKYPTWLANTVLV----KKASGKWRMC 709
K + + K R+ D+++K+L + I K P +A LV +K GK RM
Sbjct: 245 KAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPH-MAPAFLVNNEAEKRRGKKRMV 303
Query: 710 VDYTDLNMACPKDPYPLPSIDHLIDNASGYKTLSFMDAYSGYNQIKMDPLDAPKTAFMTN 769
V+Y +N A D Y LP+ D L+ G K S D SG+ Q+ +D P TAF
Sbjct: 304 VNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCP 363
Query: 770 QKNYHYRVMSFGLRNAGATFQRSMDTIFSNQIGRNLEVYIDDLVVKTSEKQSHSVDLKEI 829
Q +Y + V+ FGL+ A + FQR MD F + VY+DD++V ++ ++ H + + I
Sbjct: 364 QGHYEWNVVPFGLKQAPSIFQRHMDEAF-RVFRKFCCVYVDDILVFSNNEEDHLLHVAMI 422
Query: 830 FQQIRKFSMRLNPTKCTFGVQAGKFLGFLLTKKGIEANPDKCQAIINMRNPC-NIREVQQ 888
Q+ + + L+ K + FLG + + + + I + + +++Q+
Sbjct: 423 LQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQR 482
Query: 889 LTGRLAALSRFLSCAGDKAFAFFATIKKKEEFEWNQECDEAFGKIKQFLTTPPILHRPTK 948
G L S ++ A +K+ ++W +E K+K+ L P LH P
Sbjct: 483 FLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLP 542
Query: 949 GAGLFLYLSVSEN----ALSSVLVEESDEREKPIYFVSRVLKGAELRYQKIEKLALAVII 1004
L + S++ L ++ + E E + S K AE Y +K LAVI
Sbjct: 543 EEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNYHSNDKETLAVIN 602
Query: 1005 TARKLRPYFQSHKVVIRTNYPVKQILGKLDL-----AGRMLSWSVELSEY 1049
T +K Y +IRT+ + L+ GR + W LS Y
Sbjct: 603 TIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHY 652
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 680
Score = 122 bits (306), Expect = 8e-27
Identities = 108/410 (26%), Positives = 177/410 (42%), Gaps = 16/410 (3%)
Query: 654 KPVVQRKRKMGEEKRKAVDEEVKKLQEAHFICEIKYPTWLANTVLVKKAS----GKWRMC 709
K + + K R+ D+++K+L + I K P +A LV + G RM
Sbjct: 246 KAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPH-MAPAFLVNNEAENGRGNKRMV 304
Query: 710 VDYTDLNMACPKDPYPLPSIDHLIDNASGYKTLSFMDAYSGYNQIKMDPLDAPKTAFMTN 769
V+Y +N A D Y LP+ D L+ G K S D SG+ Q+ +D P TAF
Sbjct: 305 VNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCP 364
Query: 770 QKNYHYRVMSFGLRNAGATFQRSMDTIFSNQIGRNLEVYIDDLVVKTSEKQSHSVDLKEI 829
Q +Y + V+ FGL+ A + FQR MD F + VY+DD+VV ++ ++ H + + I
Sbjct: 365 QGHYEWNVVPFGLKQAPSIFQRHMDEAF-RVFRKFCCVYVDDIVVFSNNEEDHLLHVAMI 423
Query: 830 FQQIRKFSMRLNPTKCTFGVQAGKFLGFLLTKKGIEANPDKCQAIINMRNPC-NIREVQQ 888
Q+ + + L+ K + FLG + + + + I + + +++Q+
Sbjct: 424 LQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQR 483
Query: 889 LTGRLAALSRFLSCAGDKAFAFFATIKKKEEFEWNQECDEAFGKIKQFLTTPPILHRPTK 948
G L S ++ A +K+ ++W +E K+K+ L P LH P
Sbjct: 484 FLGILTYASDYIPNLAQMRQPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLP 543
Query: 949 GAGLFLYLSVSEN----ALSSVLVEESDEREKPIYFVSRVLKGAELRYQKIEKLALAVII 1004
L + S++ L ++ + E E + S K AE Y +K LAVI
Sbjct: 544 EEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYRSGSFKAAERNYHSNDKETLAVIN 603
Query: 1005 TARKLRPYFQSHKVVIRTNYPVKQILGKLDL-----AGRMLSWSVELSEY 1049
T +K Y +IRT+ + L+ GR + W LS Y
Sbjct: 604 TIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHY 653
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 674
Score = 122 bits (306), Expect = 8e-27
Identities = 108/410 (26%), Positives = 178/410 (43%), Gaps = 16/410 (3%)
Query: 654 KPVVQRKRKMGEEKRKAVDEEVKKLQEAHFICEIKYPTWLANTVLV----KKASGKWRMC 709
K + + K R+ D+++K+L + I K P +A LV +K GK RM
Sbjct: 240 KAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPH-MAPAFLVNNEAEKRRGKKRMV 298
Query: 710 VDYTDLNMACPKDPYPLPSIDHLIDNASGYKTLSFMDAYSGYNQIKMDPLDAPKTAFMTN 769
V+Y +N A D Y P+ D L+ G K S D SG+ Q+ +D P TAF
Sbjct: 299 VNYKAMNKATVGDAYNPPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCP 358
Query: 770 QKNYHYRVMSFGLRNAGATFQRSMDTIFSNQIGRNLEVYIDDLVVKTSEKQSHSVDLKEI 829
Q +Y + V+ FGL+ A + FQR MD F + VY+DD++V ++ ++ H + + I
Sbjct: 359 QGHYEWNVVPFGLKQAPSIFQRHMDEAF-RVFRKFCCVYVDDILVFSNNEEDHLLHVAMI 417
Query: 830 FQQIRKFSMRLNPTKCTFGVQAGKFLGFLLTKKGIEANPDKCQAIINMRNPC-NIREVQQ 888
Q+ + + L+ K + FLG + + + + I + + +++Q+
Sbjct: 418 LQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQR 477
Query: 889 LTGRLAALSRFLSCAGDKAFAFFATIKKKEEFEWNQECDEAFGKIKQFLTTPPILHRPTK 948
G L S ++ A +K+ ++W +E K+K+ L P LH P
Sbjct: 478 FLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLP 537
Query: 949 GAGLFLYLSVSEN----ALSSVLVEESDEREKPIYFVSRVLKGAELRYQKIEKLALAVII 1004
L + S++ L ++ + E E + S K AE Y +K LAVI
Sbjct: 538 EEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKNYHSNDKETLAVIN 597
Query: 1005 TARKLRPYFQSHKVVIRTNYPVKQILGKLDL-----AGRMLSWSVELSEY 1049
T +K Y +IRT+ + L+ GR + W LS Y
Sbjct: 598 TIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHY 647
>YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II
Length = 1268
Score = 120 bits (302), Expect = 2e-26
Identities = 76/249 (30%), Positives = 131/249 (52%), Gaps = 7/249 (2%)
Query: 655 PVVQRKRKMGEEKRKAVDEEVKKLQEAHFICEIKYPTWLANTVLVKK-ASGKWRMCVDY- 712
PV +R R + +AV+ E+ +LQE I I Y W A V++KK +GK R+C D+
Sbjct: 440 PVFKRARPVPYGSLEAVETELNRLQEMGVIVPITYAKWAAPIVVIKKKGTGKIRVCADFK 499
Query: 713 -TDLNMACPKDPYPLPSIDHLIDNASGYKTLSFMDAYSGYNQIKMDPLDAPKTAFMTNQK 771
+ LN A + +PLP+ + + G S +D Y Q+++D T++
Sbjct: 500 CSGLNAALKDEFHPLPTSEDIFSRLKG-TVYSQIDLKDAYLQVELDEEAQKLAVINTHRG 558
Query: 772 NYHYRVMSFGLRNAGATFQRSMDTIFSNQIGRNLEVYIDDLVVKTSEKQSHSVDLKEIFQ 831
+ Y M+FGL+ A A+FQ+ MD + S G + VY DD+++ S + H L+E+F+
Sbjct: 559 IFKYLRMTFGLKPAPASFQKIMDKMVSGLTG--VAVYWDDIIISASSIEEHEKILRELFE 616
Query: 832 QIRKFSMRLNPTKCTFGVQAGKFLGFLLTKKGIEANPDKCQAIINMRNPCNIREVQQLTG 891
+ +++ R++ KC F + FLGF + + G + K +AI +M+ P + +++ G
Sbjct: 617 RFKEYGFRVSAEKCAFAQKQVTFLGF-VDEHGRRPDSKKTEAIRSMKAPTDQKQLASFLG 675
Query: 892 RLAALSRFL 900
LSR +
Sbjct: 676 AADWLSRMM 684
Score = 87.0 bits (214), Expect = 4e-16
Identities = 72/244 (29%), Positives = 112/244 (45%), Gaps = 22/244 (9%)
Query: 1298 ELVLLEVHEGVCGSHIGGRSLAAKLLRAGYYWPRMAHDCCEFVKKCDKCQRFSDKKNA-P 1356
++VL ++HEG H G + K R+ +W + D V+ C+ CQ S P
Sbjct: 784 KIVLKQLHEG----HPGIVQMKQKA-RSFVFWRGLDSDIENMVRHCNNCQENSKMPRVVP 838
Query: 1357 ANELTSVFSPWPFHKWGVDIVGPFPQAPGQLKFLIVDVDYFTKWVEAEAVSKITAERVVK 1416
N +PW + +D GP +L+V VD TK+ E + I+A +
Sbjct: 839 LNPWPVPEAPWK--RIHIDFAGPLNGC-----YLLVVVDAKTKYAEVKLTRSISAVTTID 891
Query: 1417 FYWKKIICHFGLPKYIVTDNGTQFASSKVVNFCKQLGIETKFVSVIHPQANGQAESANKM 1476
++I G P+ I++DNGTQ S C+ GIE K +V +P++NG AE
Sbjct: 892 LL-EEIFSIHGYPETIISDNGTQLTSHLFAQMCQSHGIEHKTSAVYYPRSNGAAE----R 946
Query: 1477 IVNDIKKKLEDAKG---LWAEQLHEVLWSYHTTPHST-TGETPFTMVYGADAMLPVEIDT 1532
V+ +K+ + KG + + L++ L SY TPHS G TP +G + +
Sbjct: 947 FVDTLKRGIAKIKGEGSVNQQILNKFLISYRNTPHSALNGSTPAECHFGRKIRTTMSLLM 1006
Query: 1533 PTWR 1536
PT R
Sbjct: 1007 PTDR 1010
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 666
Score = 106 bits (264), Expect = 6e-22
Identities = 102/421 (24%), Positives = 176/421 (41%), Gaps = 32/421 (7%)
Query: 666 EKRKAVDEEVKKLQEAHFICEIKY----PTWLANTVLVKKASGKWRMCVDYTDLNMACPK 721
+ R+ +++K+L + I K P +L ++ GK RM V+Y +N A
Sbjct: 250 QDREGFAKQIKELLDLGLIIPSKSQHMSPAFLVENE-AERRRGKKRMVVNYKAINQATIG 308
Query: 722 DPYPLPSIDHLIDNASGYKTLSFMDAYSGYNQIKMDPLDAPKTAFMTNQKNYHYRVMSFG 781
D + LP++ L+ G S D SG+ Q+ +D TAF Q ++ ++V+ FG
Sbjct: 309 DSHNLPNMQELLTLLRGKSIFSSFDCKSGFWQVVLDEESQKLTAFTCPQGHFQWKVVPFG 368
Query: 782 LRNAGATFQRSMDTIFSNQIGRNLEVYIDDLVVKTSEKQSHSVDLKEIFQQIRKFSMRLN 841
L+ A + FQR M T N + VY+DD++V ++ + H + + + + K+ + L+
Sbjct: 369 LKQAPSIFQRHMQTAL-NGADKFCMVYVDDIIVFSNSELDHYNHVYAVLKIVEKYGIILS 427
Query: 842 PTKCTFGVQAGKFLGFLLTK----------KGIEANPDKCQAIINMRNPCNIREVQQLTG 891
K + FLG + K + I PD+ + + + +Q+ G
Sbjct: 428 KKKANLFKEKINFLGLEIDKGTHCPQNHILENIHKFPDRLE---------DKKHLQRFLG 478
Query: 892 RLAALSRFLSCAGDKAFAFFATIKKKEEFEWNQECDEAFGKIKQFLTTPPILHRPTKGAG 951
L ++ + +KK + W Q + KIK+ L + P L+ P
Sbjct: 479 VLTYAETYIPKLAEIRKPLQVKLKKDVTWNWTQSDSDYVKKIKKNLGSFPKLYLPKPEDH 538
Query: 952 LFLYLSVSENALSSVLVEES-DEREKPIYFVSRVLKGAELRYQKIEKLALAVIITARKLR 1010
L + S++ VL + D E + S K AE Y +K LAV K
Sbjct: 539 LIIETDASDSFWGGVLKARALDGVELICRYSSGSFKQAEKNYHSNDKELLAVKQVITKFS 598
Query: 1011 PYFQSHKVVIRTN-----YPVKQILGKLDLAGRMLSWSVELSEYDIQFAPRNNIKSQVLA 1065
Y + +RT+ Y ++ L GR++ W S+Y +K+ VLA
Sbjct: 599 AYLTPVRFTVRTDNKNFTYFLRINLKGDSKQGRLVRWQNWFSKYQFDVEHLEGVKN-VLA 657
Query: 1066 D 1066
D
Sbjct: 658 D 658
>POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;
Protease (EC 3.4.23.-); Reverse transcriptase (EC
2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1886
Score = 101 bits (251), Expect = 2e-20
Identities = 62/169 (36%), Positives = 85/169 (49%), Gaps = 1/169 (0%)
Query: 700 KKASGKWRMCVDYTDLNMACPKDPYPLPSIDHLIDNASGYKTLSFMDAYSGYNQIKMDPL 759
K+ GK RM +Y LN D Y LP I+ +I K S D SG+ Q+ M+
Sbjct: 1457 KEKKGKERMVFNYKLLNENTESDQYSLPGINTIISKVGRSKIYSKFDLKSGFWQVAMEEE 1516
Query: 760 DAPKTAFMTNQKNYHYRVMSFGLRNAGATFQRSMDTIFSNQIGRNLEVYIDDLVVKTSEK 819
P TAF+ K Y + VM FGL+NA A FQR MD +F + + VYIDD++V +
Sbjct: 1517 SVPWTAFLAGNKLYEWLVMPFGLKNAPAIFQRKMDNVFKG-TEKFIAVYIDDILVFSETA 1575
Query: 820 QSHSVDLKEIFQQIRKFSMRLNPTKCTFGVQAGKFLGFLLTKKGIEANP 868
+ HS L + Q ++ + L+PTK G FLG L I+ P
Sbjct: 1576 EQHSQHLYTMLQLCKENGLILSPTKMKIGTPEIDFLGASLGCTKIKLQP 1624
>POL_MLVRK (P31795) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)] (Fragment)
Length = 581
Score = 92.4 bits (228), Expect = 9e-18
Identities = 90/309 (29%), Positives = 128/309 (41%), Gaps = 18/309 (5%)
Query: 1311 SHIGGRSLAAKLLR--AGYYWPRMAHDCCEFVKKCDKCQRFSDKKNAPANELTSVFSPWP 1368
+H+G + + A L R + YY C C + + K A V P
Sbjct: 236 THLGYQKMKALLDRGESPYYMLNRDKTLQYVADSCTVCAQVNASK-AKIGAGVRVRGHRP 294
Query: 1369 FHKWGVDIVGPFPQAPGQLKFLIVDVDYFTKWVEAEAVSKITAERVVKFYWKKIICHFGL 1428
W +D P G K+L+V VD F+ WVEA TA+ V K ++I FG+
Sbjct: 295 GTHWEIDFTEVKPGLYGY-KYLLVFVDTFSGWVEAFPTKHETAKIVTKKLLEEIFPRFGM 353
Query: 1429 PKYIVTDNGTQFASSKVVNFCKQLGIETKFVSVIHPQANGQAESANKMIVNDIKK-KLED 1487
P+ + TDNG F S + K LGI+ K PQ++GQ E N+ I + K L
Sbjct: 354 PQVLGTDNGPAFVSQVSQSVAKLLGIDWKLHCAYRPQSSGQVERMNRTIKETLTKLTLAT 413
Query: 1488 AKGLWAEQLHEVLWSYHTTPHSTTGETPFTMVYGADAMLPVEIDTPTWRREHFSEESNEV 1547
W L L+ TP G TP+ ++YGA L V P S+ +N
Sbjct: 414 GTRDWVLLLPLALYRARNTP-GPHGLTPYEILYGAPPPL-VNFHDP-----EMSKFTNSP 466
Query: 1548 GIRCTMDMIDEVREAAHIREFAAKQRAARRYNSKVIPRSMKEGDLVLKQVVAPTRIGKLL 1607
++ + + V+ AA Q + + VIP + GD V V + L
Sbjct: 467 SLQAHLQALQAVQREVWKPLAAAYQ---DQLDQPVIPHPFRVGDTVW---VRRHQTKNLE 520
Query: 1608 PSWKGPYRV 1616
P WKGPY V
Sbjct: 521 PRWKGPYTV 529
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.318 0.135 0.399
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 197,653,919
Number of Sequences: 164201
Number of extensions: 8901839
Number of successful extensions: 22874
Number of sequences better than 10.0: 163
Number of HSP's better than 10.0 without gapping: 106
Number of HSP's successfully gapped in prelim test: 57
Number of HSP's that attempted gapping in prelim test: 22467
Number of HSP's gapped (non-prelim): 343
length of query: 1638
length of database: 59,974,054
effective HSP length: 124
effective length of query: 1514
effective length of database: 39,613,130
effective search space: 59974278820
effective search space used: 59974278820
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 73 (32.7 bits)
Medicago: description of AC141323.18