
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC135959.8 + phase: 0 /pseudo/partial
(1454 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
                                                                   Score   E
Sequences producing significant alignments:                       (bits) Value
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III  297 1e-79
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 294 1e-78
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 287 1e-76
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 286 3e-76
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 280 3e-74
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 277 2e-73
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 277 2e-73
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 265 5e-70
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 237 2e-61
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot... 170 3e-41
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot... 163 3e-39
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 154 1e-36
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 154 2e-36
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 154 2e-36
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 151 1e-35
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 150 2e-35
POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC 3.4.23... 135 6e-31
POL_FENV1 (P31792) Pol polyprotein [Contains: Reverse transcript... 124 2e-27
POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat pr... 122 5e-27
POL_BAEVM (P10272) Pol polyprotein [Contains: Protease (EC 3.4.2... 122 7e-27
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 297 bits (761), Expect = 1e-79
Identities = 237/861 (27%), Positives = 392/861 (45%), Gaps = 91/861 (10%)
Query: 626 EGEAKAYEVLLDRTPPIKDLSVEELIKEEPTLQPKEAPKVELKTLPSNLRYEFLGPNSTY 685
+GE +EVL ++ +D++VEE++ + E + + + YE
Sbjct: 838 KGETGGFEVLSNKAE--QDITVEEVLNDPTLFSEIETDTNSCEVVKTAETYERFTT---- 891
Query: 686 PVIVNASLDEVETEKLLYVLKKYPKAIGYTIDDIKGINPSLCMHRILLEDDYKPSIEHQR 745
+ + + + K+ V++++ + D++ + + C+ I L++ +P + R
Sbjct: 892 -ICEHLKRENGDDRKIWDVIEQFQDVFAISDDELGRNSGTECV--IELKEGAEPIRQKPR 948
Query: 746 RLNPNMKEVVKKEVLKLLDAGVIYPISDSKWVSPVQVVPKKGGLTVIKNDKNESIATRTV 805
+ +K ++K + K+L+ VI S S W SPV +V KK G
Sbjct: 949 PIPLALKPEIRKMIQKMLNQKVIRE-SKSPWSSPVVLVKKKDGSI--------------- 992
Query: 806 TGWRMCIDYRKLNKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEK 865
RMCIDYRK+NK + + PLP I+ L+ LA + D +GF+QIP+ +E
Sbjct: 993 ---RMCIDYRKVNKVVKNNAHPLPNIEATLQSLAGKKLYTVFDMIAGFWQIPLDEKSKEI 1049
Query: 866 TTFTCPFGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNFDDCL 925
T F F + +PFGL +PA FQ M I D + V++DD + + + L
Sbjct: 1050 TAFAIGSELFEWNVLPFGLVISPALFQGTMEEIIGDLLGVCAFVYVDDLLIASKDMEQHL 1109
Query: 926 TNLEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLVSDRGIEVDRAKVEIIEKMLPPTSV 985
++++ L R + + L KCH +E LGH V+ G+E K + +++ PT+V
Sbjct: 1110 QDVKEALTRIRKSGMKLRASKCHIAKKEVEYLGHKVTLDGVETQEVKTDKMKQFSRPTNV 1169
Query: 986 KEIRSFLGHAGFYRRFIKDFSSITKPLTNLLLKDADFTFDDSCLQAFCRLKEALITAPII 1045
KE++SFLG G+YR+FI +F+ I LT+L+ + ++ AF LK+ + P++
Sbjct: 1170 KELQSFLGLVGYYRKFILNFAQIASSLTSLISAKVAWIWEKEQEIAFQELKKLVCQTPVL 1229
Query: 1046 QPPD------WNLPFEIMCDASDYAVGVVLGQR-KDKKMHAIYYASKTLDGAQVNYATTE 1098
PD + PF I DAS +G VL Q D + H I +ASK L A+ Y T+
Sbjct: 1230 AQPDVEAALKGDRPFMIYTDASRKGIGAVLAQEGPDGQQHPIAFASKALSPAETRYHITD 1289
Query: 1099 KELLAVVYAIDKFRQYLVGSKIIVYTDHSAIKYLLNKKDAKPRLIRWILLLQEFDLEIKD 1158
E LA+++A+ +F+ + G+ I V+TDH + LL RL RW + + EFD++I
Sbjct: 1290 LEALAMMFALRRFKTIIYGTAITVFTDHKPLISLLKGSPLADRLWRWSIEILEFDVKIVY 1349
Query: 1159 KKGVENVVADHLSR-------LREANKDEL-----PLDDSFPD-----DQLFLLAQTDAP 1201
G N VAD LSR L E EL + PD L L D
Sbjct: 1350 LAGKANAVADALSRGGCPPNELEEEQTKELTSIVNAIQTELPDILDSSCWLERLKGEDEG 1409
Query: 1202 WYADFVNFLAAGV---------LPPELNYQQKKKFFNDLKHYYWDEPYLFRRGSDGIFRR 1252
W + + L G + E++ + K LK+ +E R
Sbjct: 1410 W-KEVIAALEGGKTKGTFKIVGIESEISLEYYKIVGGVLKNTEIEEQ----------SRS 1458
Query: 1253 CIPESEVSSILTHCHSSSYGGHASTQKTSFKILHSGFWWPSLFKDVHLFISKCDKC---- 1308
+PE + +L H GH +K ++++H F+WP + V + C KC
Sbjct: 1459 VVPEKIRTPLLKELHEGMLAGHFGIKK-MWRMVHRKFYWPQMRVCVENCVRTCAKCLCAN 1517
Query: 1309 ---QRTGSITK-RNEMPLNNILEVEIFDVWGVDFMGPFPSSFGNQYILVAVDYVSKWVEA 1364
+ T S+T R PL ++ D M S GN+YIL +D +K+ A
Sbjct: 1518 DHSKLTSSLTPYRMTFPL---------EIVACDLMDVGLSVQGNRYILTIIDLFTKYGTA 1568
Query: 1365 IASPTNDAQVVIKMF-KKVIFPRFGVPRVVISDGGSHFISKHFEKLLQKLGVRHKVATPY 1423
+ P A+ V+K F ++ +P +++D G F++ F + L + H Y
Sbjct: 1569 VPIPDKKAETVLKAFVERWAIGEGRIPLKLLTDQGKEFVNGLFAQFTHMLKIEHITTKGY 1628
Query: 1424 HPQTSGQVEVSNRQIKAILEK 1444
+ + +G VE N+ I I++K
Sbjct: 1629 NSRANGAVERFNKTIMHIMKK 1649
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 294 bits (752), Expect = 1e-78
Identities = 236/792 (29%), Positives = 370/792 (45%), Gaps = 86/792 (10%)
Query: 681 PNSTYPVIVN-----ASLDEVETEKLLYVLKKYPKAIGYTIDDIKGINPSL----CMHRI 731
PN P++ + L+ E ++L +L+KY + D + N + H +
Sbjct: 148 PNKISPILESDLYRLEHLNNEEKQRLCALLQKYHDIQYHEGDKLTFTNQTKHTINTKHNL 207
Query: 732 LLEDDYKPSIEHQRRLNPNMKEVVKKEVLKLLDAGVIYPISDSKWVSPVQVVPKKGGLTV 791
L Y +++ V+ ++ +L+ G+I S+S + SP+ VVPKK +
Sbjct: 208 PLYSKYSYPQAYEQE--------VESQIQDMLNQGIIRT-SNSPYNSPIWVVPKKQDASG 258
Query: 792 IKNDKNESIATRTVTGWRMCIDYRKLNKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYS 851
+ +R+ IDYRKLN+ T D P+P +D++L +L + ++F +D
Sbjct: 259 KQK-------------FRIVIDYRKLNEITVGDRHPIPNMDEILGKLGRCNYFTTIDLAK 305
Query: 852 GFFQIPIHPNDQEKTTFTCPFGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFM 911
GF QI + P KT F+ G + Y RMPFGL NAPATFQRCM I + K V++
Sbjct: 306 GFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKNAPATFQRCMNDILRPLLNKHCLVYL 365
Query: 912 DDFSVHGSNFDDCLTNLEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLVSDRGIEVDRA 971
DD V ++ D+ L +L V E+ + NL L +KC F+ +E LGH+++ GI+ +
Sbjct: 366 DDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDKCEFLKQETTFLGHVLTPDGIKPNPE 425
Query: 972 KVEIIEKMLPPTSVKEIRSFLGHAGFYRRFIKDFSSITKPLTNLLLKDADF-TFDDSCLQ 1030
K+E I+K PT KEI++FLG G+YR+FI +F+ I KP+T L K+ T +
Sbjct: 426 KIEAIQKYPIPTKPKEIKAFLGLTGYYRKFIPNFADIAKPMTKCLKKNMKIDTTNPEYDS 485
Query: 1031 AFCRLKEALITAPIIQPPDWNLPFEIMCDASDYAVGVVLGQRKDKKMHAIYYASKTLDGA 1090
AF +LK + PI++ PD+ F + DASD A+G VL Q H + Y S+TL+
Sbjct: 486 AFKKLKYLISEDPILKVPDFTKKFTLTTDASDVALGAVLSQ----DGHPLSYISRTLNEH 541
Query: 1091 QVNYATTEKELLAVVYAIDKFRQYLVGSKIIVYTDHSAIKYLLNKKDAKPRLIRWILLLQ 1150
++NY+T EKELLA+V+A FR YL+G + +DH + +L KD +L RW + L
Sbjct: 542 EINYSTIEKELLAIVWATKTFRHYLLGRHFEISSDHQPLSWLYRMKDPNSKLTRWRVKLS 601
Query: 1151 EFDLEIKDKKGVENVVADHLSRLR-EANKDELPLDDSFPDDQLFLLAQTDAPWYADFVNF 1209
EFD +IK KG EN VAD LSR++ E S +D L+ T+ P F
Sbjct: 602 EFDFDIKYIKGKENCVADALSRIKLEETYLSEQTQHSAEEDNSDLIFITERP-LNTFNRQ 660
Query: 1210 LAAGVLPPEL---NYQQK--KKFFNDLKHYYWDEPYLFRR----------GSDGIF---- 1250
+ PP++ Y +K + F D+ E YL SD F
Sbjct: 661 VIFSKGPPDIKVTKYFKKHITQIFYDIMTREKAEQYLIDHFCGKKSALYIESDADFEVIQ 720
Query: 1251 ---------------------RRCIPESEVSSILTHCHSSSYGGHASTQKTSFKILHSGF 1289
+ +E ++ H H QKT+ K+ +
Sbjct: 721 AAHKLAINTKYTKILRSTILLKNITTYAEFKELILTAHEKLL--HPGIQKTT-KLFGETY 777
Query: 1290 WWPSLFKDVHLFISKCDKCQRTGSITKRNEMPLNNILEVEIFDVWGVDFMGPFPSSFGNQ 1349
++P+ + I++C C + + +MP + E FM SS G
Sbjct: 778 YFPNSQLLIQNIINECSICNLAKTEHRNTDMPTKTTPKPE---HCREKFMIDIYSSEGKH 834
Query: 1350 YILVAVDYVSKWVEAIASPTNDAQVVIKMFKKVIFPRFGVPRVVISDGGSHFISKHFEKL 1409
Y+ +D SK+ T D + K IF + G P+++ +D F S ++
Sbjct: 835 YV-SCIDIYSKFATLEEIKTKD-WIECKNALMRIFNQLGKPKLLKADRDGAFSSLALKRW 892
Query: 1410 LQKLGVRHKVAT 1421
L+ V ++ T
Sbjct: 893 LESEEVELQLNT 904
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 287 bits (735), Expect = 1e-76
Identities = 224/717 (31%), Positives = 356/717 (49%), Gaps = 84/717 (11%)
Query: 755 VKKEVLKLLDAGVIYPISDSKWVSPVQVVPKKGGLTVIKNDKNESIATRTVTGWRMCIDY 814
V+ +V ++L+ G+I S+S + SP VVPKK + S A + +R+ IDY
Sbjct: 222 VENQVQEMLNQGLIRE-SNSPYNSPTWVVPKK---------PDASGANK----YRVVIDY 267
Query: 815 RKLNKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEKTTFTCPFGT 874
RKLN+ T D +P+P +D++L +L K +F +D GF QI + KT F+ G
Sbjct: 268 RKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTKSGH 327
Query: 875 FAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNFDDCLTNLEKVLER 934
+ Y RMPFGL NAPATFQRCM +I + K V++DD + ++ + L +++ V +
Sbjct: 328 YEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLVFTK 387
Query: 935 CEQVNLVLNWEKCHFMVREGIVLGHLVSDRGIEVDRAKVEIIEKMLPPTSVKEIRSFLGH 994
NL L +KC F+ +E LGH+V+ GI+ + KV+ I PT KEIR+FLG
Sbjct: 388 LADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAFLGL 447
Query: 995 AGFYRRFIKDFSSITKPLTNLLLKDADF-TFDDSCLQAFCRLKEALITAPIIQPPDWNLP 1053
G+YR+FI +++ I KP+T+ L K T ++AF +LK +I PI+Q PD+
Sbjct: 448 TGYYRKFIPNYADIAKPMTSCLKKRTKIDTQKLEYIEAFEKLKALIIRDPILQLPDFEKK 507
Query: 1054 FEIMCDASDYAVGVVLGQRKDKKMHAIYYASKTLDGAQVNYATTEKELLAVVYAIDKFRQ 1113
F + DAS+ A+G VL Q H I + S+TL+ ++NY+ EKELLA+V+A FR
Sbjct: 508 FVLTTDASNLALGAVLSQNG----HPISFISRTLNDHELNYSAIEKELLAIVWATKTFRH 563
Query: 1114 YLVGSKIIVYTDHSAIKYLLNKKDAKPRLIRWILLLQEFDLEIKDKKGVENVVADHLSRL 1173
YL+G + ++ +DH +++L N K+ +L RW + L E+ +I KG EN VAD LSR+
Sbjct: 564 YLLGRQFLIASDHQPLRWLHNLKEPGAKLERWRVRLSEYQFKIDYIKGKENSVADALSRI 623
Query: 1174 R-EANKDELPLDDSFPDDQLFLLAQTDAPWYADFVNFLAAGVL---PPELNYQQKKKFFN 1229
+ E N S +D L+ T+ P +N+ ++ + + K F N
Sbjct: 624 KIEENHHSEATQHSAEEDNSNLIHLTEKP-----INYFKKQIIFIKSDKNKVEHSKIFGN 678
Query: 1230 DLKHYYWDEPYLFRRGS---DGIFRRCIP---ESEVS-SILTHCH----SSSY------- 1271
+ +D L + D R I ES+V I+ H +++Y
Sbjct: 679 SITTIQYDVMTLEKAKQILLDHFIHRNITIYIESDVDFEIVQRAHIEIVNTTYTKVIRSL 738
Query: 1272 ------GGHASTQ----KTSFKILHSGFW-WPSLFKDVHLF----------ISKCDKCQR 1310
G +A + ++ K+LH G LFK+ H F I++C+ C
Sbjct: 739 FLLKNVGSYAEFKEIILQSHEKLLHPGIQKMTKLFKENHFFPNSQLLIQNIINECNICNL 798
Query: 1311 TGSITKRNEMPL------NNILEVEIFDVWGVDFMGPFPSSFGNQYILVAVDYVSKWVEA 1364
+ + +MPL + E + D++ SS G YI +D SK+
Sbjct: 799 AKTEHRNTKMPLKITPNPEHCREKFVVDIY---------SSEGKHYI-SCIDIYSKFATL 848
Query: 1365 IASPTNDAQVVIKMFKKVIFPRFGVPRVVISDGGSHFISKHFEKLLQKLGVRHKVAT 1421
T D + + IF + G P+++ +D F S ++ L++ V ++ T
Sbjct: 849 EQIKTKD-WIECRNALMRIFNQLGKPKLLKADRDGAFSSLALKRWLEEEEVELQLNT 904
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 286 bits (732), Expect = 3e-76
Identities = 171/460 (37%), Positives = 263/460 (57%), Gaps = 28/460 (6%)
Query: 750 NMKEVVKKEVLKLLDAGVIYPISDSKWVSPVQVVPKKGGLTVIKNDKNESIATRTVTGWR 809
NM+ V++++ +LL G+I P S+S + SP+ +VPKK K +R
Sbjct: 134 NMRGEVERQIDELLQDGIIRP-SNSPYNSPIWIVPKKPKPNGEKQ-------------YR 179
Query: 810 MCIDYRKLNKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEKTTFT 869
M +D+++LN T D +P+P I+ L L +F LD SGF QI + +D KT F+
Sbjct: 180 MVVDFKRLNTVTIPDTYPIPDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFS 239
Query: 870 CPFGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNFDDCLTNLE 929
G + + R+PFGL NAPA FQR + I + + K+ V++DD V ++D NL
Sbjct: 240 TLNGKYEFLRLPFGLKNAPAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLR 299
Query: 930 KVLERCEQVNLVLNWEKCHFMVREGIVLGHLVSDRGIEVDRAKVEIIEKMLPPTSVKEIR 989
VL + NL +N EK HF+ + LG++V+ GI+ D KV I +M PPTSVKE++
Sbjct: 300 LVLASLSKANLQVNLEKSHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELK 359
Query: 990 SFLGHAGFYRRFIKDFSSITKPLTNLL------LKDAD-----FTFDDSCLQAFCRLKEA 1038
FLG +YR+FI+D++ + KPLTNL +K + T D++ LQ+F LK
Sbjct: 360 RFLGMTSYYRKFIQDYAKVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSI 419
Query: 1039 LITAPIIQPPDWNLPFEIMCDASDYAVGVVLGQRKDKKMHAIYYASKTLDGAQVNYATTE 1098
L ++ I+ P + PF + DAS++A+G VL Q + I Y S++L+ + NYAT E
Sbjct: 420 LCSSEILAFPCFTKPFHLTTDASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIE 479
Query: 1099 KELLAVVYAIDKFRQYLVGSKII-VYTDHSAIKYLLNKKDAKPRLIRWILLLQEFDLEIK 1157
KE+LA+++++D R YL G+ I VYTDH + + L ++ +L RW ++E++ E+
Sbjct: 480 KEMLAIIWSLDNLRAYLYGAGTIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELI 539
Query: 1158 DKKGVENVVADHLSRLREANKDELPLD-DSFPDDQLFLLA 1196
K G NVVAD LSR+ ++L D D+ P+D + LA
Sbjct: 540 YKPGKSNVVADALSRI-PPQLNQLSTDLDANPEDDMQSLA 578
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 280 bits (715), Expect = 3e-74
Identities = 205/743 (27%), Positives = 356/743 (47%), Gaps = 62/743 (8%)
Query: 732 LLEDDYKPSIEHQRRLNPNMKEVVKKEVLKLLDAGVIYPISDSKWVS--PVQVVPKKGGL 789
L +++Y+ I + L P + + E+ + L +G+I +SK ++ PV VPKK G
Sbjct: 406 LTQENYRLPIRNYP-LPPGKMQAMNDEINQGLKSGII---RESKAINACPVMFVPKKEGT 461
Query: 790 TVIKNDKNESIATRTVTGWRMCIDYRKLNKATRKDHFPLPFIDQMLERLAKHSHFCYLDG 849
RM +DY+ LNK + + +PLP I+Q+L ++ + F LD
Sbjct: 462 L------------------RMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDL 503
Query: 850 YSGFFQIPIHPNDQEKTTFTCPFGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEV 909
S + I + D+ K F CP G F Y MP+G+ APA FQ + +I + E +
Sbjct: 504 KSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESHVVC 563
Query: 910 FMDDFSVHGSNFDDCLTNLEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLVSDRGIEVD 969
+MDD +H + + + +++ VL++ + NL++N KC F + +G+ +S++G
Sbjct: 564 YMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPC 623
Query: 970 RAKVEIIEKMLPPTSVKEIRSFLGHAGFYRRFIKDFSSITKPLTNLLLKDADFTFDDSCL 1029
+ ++ + + P + KE+R FLG + R+FI S +T PL NLL KD + + +
Sbjct: 624 QENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQT 683
Query: 1030 QAFCRLKEALITAPIIQPPDWNLPFEIMCDASDYAVGVVLGQR-KDKKMHAIYYASKTLD 1088
QA +K+ L++ P+++ D++ + DASD AVG VL Q+ D K + + Y S +
Sbjct: 684 QAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMS 743
Query: 1089 GAQVNYATTEKELLAVVYAIDKFRQYLVGS--KIIVYTDH-SAIKYLLNKKDAK-PRLIR 1144
AQ+NY+ ++KE+LA++ ++ +R YL + + TDH + I + N+ + + RL R
Sbjct: 744 KAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLAR 803
Query: 1145 WILLLQEFDLEIKDKKGVENVVADHLSRLREANKDELPLDDSFPDDQLFLLAQTDAPWYA 1204
W L LQ+F+ EI + G N +AD LSR+ + + P+ D+ +
Sbjct: 804 WQLFLQDFNFEINYRPGSANHIADALSRIVDETE---PIPKDSEDNSI------------ 848
Query: 1205 DFVNFLAAGVLPPELNYQQKKKFFNDLK--HYYWDEPYLFRRG---SDGIF-----RRCI 1254
+FVN ++ + + Q ++ ND K + +E DG+ + +
Sbjct: 849 NFVNQIS---ITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILL 905
Query: 1255 PESE--VSSILTHCHSSSYGGHASTQKTSFKILHSGFWWPSLFKDVHLFISKCDKCQRTG 1312
P +I+ H H + + IL F W + K + ++ C CQ
Sbjct: 906 PNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINK 964
Query: 1313 SITKRNEMPLNNILEVE-IFDVWGVDFMGPFPSSFGNQYILVAVDYVSKWVEAI-ASPTN 1370
S + PL I E ++ +DF+ P S G + V VD SK + + +
Sbjct: 965 SRNHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSI 1024
Query: 1371 DAQVVIKMFKKVIFPRFGVPRVVISDGGSHFISKHFEKLLQKLGVRHKVATPYHPQTSGQ 1430
A+ +MF + + FG P+ +I+D F S+ ++ K K + PY PQT GQ
Sbjct: 1025 TAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQ 1084
Query: 1431 VEVSNRQIKAILEKTVSTSRTDW 1453
E +N+ ++ +L ST W
Sbjct: 1085 TERTNQTVEKLLRCVCSTHPNTW 1107
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 277 bits (708), Expect = 2e-73
Identities = 204/743 (27%), Positives = 356/743 (47%), Gaps = 62/743 (8%)
Query: 732 LLEDDYKPSIEHQRRLNPNMKEVVKKEVLKLLDAGVIYPISDSKWVS--PVQVVPKKGGL 789
L +++Y+ I + L P + + E+ + L +G+I +SK ++ PV VPKK G
Sbjct: 406 LTQENYRLPIRNYP-LPPGKMQAMNDEINQGLKSGII---RESKAINACPVMFVPKKEGT 461
Query: 790 TVIKNDKNESIATRTVTGWRMCIDYRKLNKATRKDHFPLPFIDQMLERLAKHSHFCYLDG 849
RM +DY+ LNK + + +PLP I+Q+L ++ + F LD
Sbjct: 462 L------------------RMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDL 503
Query: 850 YSGFFQIPIHPNDQEKTTFTCPFGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEV 909
S + I + D+ K F CP G F Y MP+G+ APA FQ + +I + E +
Sbjct: 504 KSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVC 563
Query: 910 FMDDFSVHGSNFDDCLTNLEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLVSDRGIEVD 969
+MD+ +H + + + +++ VL++ + NL++N KC F + +G+ +S++G
Sbjct: 564 YMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPC 623
Query: 970 RAKVEIIEKMLPPTSVKEIRSFLGHAGFYRRFIKDFSSITKPLTNLLLKDADFTFDDSCL 1029
+ ++ + + P + KE+R FLG + R+FI S +T PL NLL KD + + +
Sbjct: 624 QENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQT 683
Query: 1030 QAFCRLKEALITAPIIQPPDWNLPFEIMCDASDYAVGVVLGQR-KDKKMHAIYYASKTLD 1088
QA +K+ L++ P+++ D++ + DASD AVG VL Q+ D K + + Y S +
Sbjct: 684 QAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMS 743
Query: 1089 GAQVNYATTEKELLAVVYAIDKFRQYLVGS--KIIVYTDH-SAIKYLLNKKDAK-PRLIR 1144
AQ+NY+ ++KE+LA++ ++ +R YL + + TDH + I + N+ + + RL R
Sbjct: 744 KAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLAR 803
Query: 1145 WILLLQEFDLEIKDKKGVENVVADHLSRLREANKDELPLDDSFPDDQLFLLAQTDAPWYA 1204
W L LQ+F+ EI + G N +AD LSR+ + + P+ D+ +
Sbjct: 804 WQLFLQDFNFEINYRPGSANHIADALSRIVDETE---PIPKDSEDNSI------------ 848
Query: 1205 DFVNFLAAGVLPPELNYQQKKKFFNDLK--HYYWDEPYLFRRG---SDGIF-----RRCI 1254
+FVN ++ + + Q ++ ND K + +E DG+ + +
Sbjct: 849 NFVNQIS---ITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILL 905
Query: 1255 PESE--VSSILTHCHSSSYGGHASTQKTSFKILHSGFWWPSLFKDVHLFISKCDKCQRTG 1312
P +I+ H H + + IL F W + K + ++ C CQ
Sbjct: 906 PNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINK 964
Query: 1313 SITKRNEMPLNNILEVE-IFDVWGVDFMGPFPSSFGNQYILVAVDYVSKWVEAI-ASPTN 1370
S + PL I E ++ +DF+ P S G + V VD SK + + +
Sbjct: 965 SRNHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSI 1024
Query: 1371 DAQVVIKMFKKVIFPRFGVPRVVISDGGSHFISKHFEKLLQKLGVRHKVATPYHPQTSGQ 1430
A+ +MF + + FG P+ +I+D F S+ ++ K K + PY PQT GQ
Sbjct: 1025 TAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQ 1084
Query: 1431 VEVSNRQIKAILEKTVSTSRTDW 1453
E +N+ ++ +L ST W
Sbjct: 1085 TERTNQTVEKLLRCVCSTHPNTW 1107
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 277 bits (708), Expect = 2e-73
Identities = 204/743 (27%), Positives = 356/743 (47%), Gaps = 62/743 (8%)
Query: 732 LLEDDYKPSIEHQRRLNPNMKEVVKKEVLKLLDAGVIYPISDSKWVS--PVQVVPKKGGL 789
L +++Y+ I + L P + + E+ + L +G+I +SK ++ PV VPKK G
Sbjct: 406 LTQENYRLPIRNYP-LPPGKMQAMNDEINQGLKSGII---RESKAINACPVMFVPKKEGT 461
Query: 790 TVIKNDKNESIATRTVTGWRMCIDYRKLNKATRKDHFPLPFIDQMLERLAKHSHFCYLDG 849
RM +DY+ LNK + + +PLP I+Q+L ++ + F LD
Sbjct: 462 L------------------RMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDL 503
Query: 850 YSGFFQIPIHPNDQEKTTFTCPFGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEV 909
S + I + D+ K F CP G F Y MP+G+ APA FQ + +I + E +
Sbjct: 504 KSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVC 563
Query: 910 FMDDFSVHGSNFDDCLTNLEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLVSDRGIEVD 969
+MD+ +H + + + +++ VL++ + NL++N KC F + +G+ +S++G
Sbjct: 564 YMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPC 623
Query: 970 RAKVEIIEKMLPPTSVKEIRSFLGHAGFYRRFIKDFSSITKPLTNLLLKDADFTFDDSCL 1029
+ ++ + + P + KE+R FLG + R+FI S +T PL NLL KD + + +
Sbjct: 624 QENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQT 683
Query: 1030 QAFCRLKEALITAPIIQPPDWNLPFEIMCDASDYAVGVVLGQR-KDKKMHAIYYASKTLD 1088
QA +K+ L++ P+++ D++ + DASD AVG VL Q+ D K + + Y S +
Sbjct: 684 QAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMS 743
Query: 1089 GAQVNYATTEKELLAVVYAIDKFRQYLVGS--KIIVYTDH-SAIKYLLNKKDAK-PRLIR 1144
AQ+NY+ ++KE+LA++ ++ +R YL + + TDH + I + N+ + + RL R
Sbjct: 744 KAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLAR 803
Query: 1145 WILLLQEFDLEIKDKKGVENVVADHLSRLREANKDELPLDDSFPDDQLFLLAQTDAPWYA 1204
W L LQ+F+ EI + G N +AD LSR+ + + P+ D+ +
Sbjct: 804 WQLFLQDFNFEINYRPGSANHIADALSRIVDETE---PIPKDSEDNSI------------ 848
Query: 1205 DFVNFLAAGVLPPELNYQQKKKFFNDLK--HYYWDEPYLFRRG---SDGIF-----RRCI 1254
+FVN ++ + + Q ++ ND K + +E DG+ + +
Sbjct: 849 NFVNQIS---ITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILL 905
Query: 1255 PESE--VSSILTHCHSSSYGGHASTQKTSFKILHSGFWWPSLFKDVHLFISKCDKCQRTG 1312
P +I+ H H + + IL F W + K + ++ C CQ
Sbjct: 906 PNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINK 964
Query: 1313 SITKRNEMPLNNILEVE-IFDVWGVDFMGPFPSSFGNQYILVAVDYVSKWVEAI-ASPTN 1370
S + PL I E ++ +DF+ P S G + V VD SK + + +
Sbjct: 965 SRNHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSI 1024
Query: 1371 DAQVVIKMFKKVIFPRFGVPRVVISDGGSHFISKHFEKLLQKLGVRHKVATPYHPQTSGQ 1430
A+ +MF + + FG P+ +I+D F S+ ++ K K + PY PQT GQ
Sbjct: 1025 TAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQ 1084
Query: 1431 VEVSNRQIKAILEKTVSTSRTDW 1453
E +N+ ++ +L ST W
Sbjct: 1085 TERTNQTVEKLLRCVCSTHPNTW 1107
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 265 bits (678), Expect = 5e-70
Identities = 170/507 (33%), Positives = 265/507 (51%), Gaps = 33/507 (6%)
Query: 668 KTLPSNLRYEFLGPNSTYPVIVNASLDEVETEKL-LYVLKKYPKAIGYTIDDIKGINPSL 726
KT+ S L+ F P + + L+ + +E + ++ L+ P + +L
Sbjct: 261 KTVLSQLKKNF-------PELFKSQLENICSEYIDIFALESEPITVN-----------NL 302
Query: 727 CMHRILLEDDYKPSIEHQRRLNPNMKEVVKKEVLKLLDAGVIYPISDSKWVSPVQVVPKK 786
++ L+DD +P R + E ++ +V KL+ ++ P S S++ SP+ +VPKK
Sbjct: 303 YKQQLRLKDD-EPVYTKNYRSPHSQVEEIQAQVQKLIKDKIVEP-SVSQYNSPLLLVPKK 360
Query: 787 GGLTVIKNDKNESIATRTVTGWRMCIDYRKLNKATRKDHFPLPFIDQMLERLAKHSHFCY 846
+DK + WR+ IDYR++NK D FPLP ID +L++L + +F
Sbjct: 361 SSPN---SDKKK---------WRLVIDYRQINKKLLADKFPLPRIDDILDQLGRAKYFSC 408
Query: 847 LDGYSGFFQIPIHPNDQEKTTFTCPFGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKI 906
LD SGF QI + ++ T+F+ G++ + R+PFGL AP +FQR M FS
Sbjct: 409 LDLMSGFHQIELDEGSRDITSFSTSNGSYRFTRLPFGLKIAPNSFQRMMTIAFSGIEPSQ 468
Query: 907 MEVFMDDFSVHGSNFDDCLTNLEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLVSDRGI 966
++MDD V G + L NL +V +C + NL L+ EKC F + E LGH +D+GI
Sbjct: 469 AFLYMDDLIVIGCSEKHMLKNLTEVFGKCREYNLKLHPEKCSFFMHEVTFLGHKCTDKGI 528
Query: 967 EVDRAKVEIIEKMLPPTSVKEIRSFLGHAGFYRRFIKDFSSITKPLTNLLLKDADFTFDD 1026
D K ++I+ P R F+ +YRRFIK+F+ ++ +T L K+ F + D
Sbjct: 529 LPDDKKYDVIQNYPVPHDADSARRFVAFCNYYRRFIKNFADYSRHITRLCKKNVPFEWTD 588
Query: 1027 SCLQAFCRLKEALITAPIIQPPDWNLPFEIMCDASDYAVGVVLGQRKDKKMHAIYYASKT 1086
C +AF LK LI ++Q PD++ F I DAS A G VL Q + + YAS+
Sbjct: 589 ECQKAFIHLKSQLINPTLLQYPDFSKEFCITTDASKQACGAVLTQNHNGHQLPVAYASRA 648
Query: 1087 LDGAQVNYATTEKELLAVVYAIDKFRQYLVGSKIIVYTDHSAIKYLLNKKDAKPRLIRWI 1146
+ N +TTE+EL A+ +AI FR Y+ G V TDH + YL + + +L R
Sbjct: 649 FTKGESNKSTTEQELAAIHWAIIHFRPYIYGKHFTVKTDHRPLTYLFSMVNPSSKLTRIR 708
Query: 1147 LLLQEFDLEIKDKKGVENVVADHLSRL 1173
L L+E++ ++ KG +N VAD LSR+
Sbjct: 709 LELEEYNFTVEYLKGKDNHVADALSRI 735
Score = 92.4 bits (228), Expect = 8e-18
Identities = 59/201 (29%), Positives = 101/201 (49%), Gaps = 5/201 (2%)
Query: 1256 ESEVSSILTHCHSSSY-GGHASTQKTSFKILHSGFWWPSLFKDVHLFISKCDKCQRTGSI 1314
E E +IL+ H GGH KT K+ ++W ++ K + ++ KC KCQ+ +
Sbjct: 890 EKEKEAILSTLHDDPIQGGHTGITKTLAKVKRH-YYWKNMSKYIKEYVRKCQKCQKAKT- 947
Query: 1315 TKRNEMPLNNILEVE-IFDVWGVDFMGPFPSSF-GNQYILVAVDYVSKWVEAIASPTNDA 1372
TK + P+ E FD VD +GP P S GN+Y + + ++K++ AI A
Sbjct: 948 TKHTKTPMTITETPEHAFDRVVVDTIGPLPKSENGNEYAVTLICDLTKYLVAIPIANKSA 1007
Query: 1373 QVVIKMFKKVIFPRFGVPRVVISDGGSHFISKHFEKLLQKLGVRHKVATPYHPQTSGQVE 1432
+ V K + ++G + I+D G+ + + L + L +++ +T +H QT G VE
Sbjct: 1008 KTVAKAIFESFILKYGPMKTFITDMGTEYKNSIITDLCKYLKIKNITSTAHHHQTVGVVE 1067
Query: 1433 VSNRQIKAILEKTVSTSRTDW 1453
S+R + + +ST +TDW
Sbjct: 1068 RSHRTLNEYIRSYISTDKTDW 1088
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 237 bits (604), Expect = 2e-61
Identities = 152/435 (34%), Positives = 239/435 (54%), Gaps = 30/435 (6%)
Query: 751 MKEVVKKEVLKLLDAGVIYPISDSKWVSPVQVVPKKGGLTVIKNDKNESIATRTVTGWRM 810
+ + V EV +LL G+I P S S + SP VV KKG T + N+ R+
Sbjct: 193 VSDFVNNEVKQLLKDGIIRP-SRSPYNSPTWVVDKKG--TDAFGNPNK----------RL 239
Query: 811 CIDYRKLNKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEKTTFTC 870
ID+RKLN+ T D +P+P I +L L K F LD SG+ QI + +D+EKT+F+
Sbjct: 240 VIDFRKLNEKTIPDRYPMPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFSV 299
Query: 871 PFGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNFDDCLTNLEK 930
G + + R+PFGL NA + FQR + + + + KI V++DD + N D + +++
Sbjct: 300 NGGKYEFCRLPFGLRNASSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHIDT 359
Query: 931 VLERCEQVNLVLNWEKCHFMVREGIVLGHLVSDRGIEVDRAKVEIIEKMLPPTSVKEIRS 990
VL+ N+ ++ EK F LG +VS G + D KV+ I++ P V ++RS
Sbjct: 360 VLKCLIDANMRVSQEKTRFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRS 419
Query: 991 FLGHAGFYRRFIKDFSSITKPLTNLL-----------LKDADFTFDDSCLQAFCRLKEAL 1039
FLG A +YR FIKDF++I +P+T++L K F+++ AF RL+ L
Sbjct: 420 FLGLASYYRVFIKDFAAIARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNIL 479
Query: 1040 ITAPII-QPPDWNLPFEIMCDASDYAVGVVLGQRKDKKMHAIYYASKTLDGAQVNYATTE 1098
+ +I + PD+ PF++ DAS +G VL Q + I S+TL + NYAT E
Sbjct: 480 ASEDVILKYPDFKKPFDLTTDASASGIGAVLSQ----EGRPITMISRTLKQPEQNYATNE 535
Query: 1099 KELLAVVYAIDKFRQYLVGSK-IIVYTDHSAIKYLLNKKDAKPRLIRWILLLQEFDLEIK 1157
+ELLA+V+A+ K + +L GS+ I ++TDH + + + ++ ++ RW + + + ++
Sbjct: 536 RELLAIVWALGKLQNFLYGSREINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQHNAKVF 595
Query: 1158 DKKGVENVVADHLSR 1172
K G EN VAD LSR
Sbjct: 596 YKPGKENFVADALSR 610
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 659
Score = 170 bits (430), Expect = 3e-41
Identities = 129/432 (29%), Positives = 218/432 (49%), Gaps = 27/432 (6%)
Query: 748 NPNMKEVVKKEVLKLLDAGVIYPISDSKWVSPVQVVPKKGGLTVIKNDKNESIATRTVTG 807
+P+ +E +++ +LL+ VI P S S +SP +V E+ A R
Sbjct: 237 SPSDREEFDRQIKELLELKVIKP-SKSTHMSPAFLV--------------ENEAERRRGK 281
Query: 808 WRMCIDYRKLNKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEKTT 867
RM ++Y+ +NKAT+ D LP D++L + + D SG +Q+ + Q T
Sbjct: 282 KRMVVNYKAMNKATKGDAHNLPNKDELLTLVRGKKIYSSFDCKSGLWQVLLDKESQLLTA 341
Query: 868 FTCPFGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNF-DDCLT 926
FTCP G + + +PFGL AP+ F + + S+ K V++DD V + +
Sbjct: 342 FTCPQGHYQWNVVPFGLKQAPSIFPKTYANSHSNQYSKYCCVYVDDILVFSNTGRKEHYI 401
Query: 927 NLEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLVSDRGIEVDRAKV-EIIEKMLPPTSV 985
++ +L RCE++ ++L+ +K + LG L D+G + + E I K P +
Sbjct: 402 HVLNILRRCEKLGIILSKKKAQLFKEKINFLG-LEIDQGTHCPQNHILEHIHKF--PDRI 458
Query: 986 ---KEIRSFLGHAGFYRRFIKDFSSITKPLTNLLLKDADFTFDDSCLQAFCRLKEALITA 1042
K+++ FLG + +I +SI KPL + L +D+ +T++D+ Q ++K+ L +
Sbjct: 459 EDKKQLQRFLGILTYASDYIPKLASIRKPLQSKLKEDSTWTWNDTDSQYMAKIKKNLKSF 518
Query: 1043 PIIQPPDWNLPFEIMCDASDYAVGVVLGQRKDKKMHAIYYASKTLDGAQVNYATTEKELL 1102
P + P+ N I DAS+ G +L + + YAS + A+ NY + EKELL
Sbjct: 519 PKLYHPEPNDKLVIETDASEEFWGGILKAIHNSHEYICRYASGSFKAAERNYHSNEKELL 578
Query: 1103 AVVYAIDKFRQYLVGSKIIVYTDHSAIKYLLN---KKDAKP-RLIRWILLLQEFDLEIKD 1158
AV+ I KF YL S+ ++ TD+ + +N K D K RL+RW + L ++D +++
Sbjct: 579 AVIRVIKKFSIYLTPSRFLIRTDNKNFTHFVNINLKGDRKQGRLVRWQMWLSQYDFDVEH 638
Query: 1159 KKGVENVVADHL 1170
G +NV AD L
Sbjct: 639 IAGTKNVFADFL 650
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 666
Score = 163 bits (413), Expect = 3e-39
Identities = 147/488 (30%), Positives = 230/488 (47%), Gaps = 32/488 (6%)
Query: 699 EKLLYVLKKYPKAIGYTIDDIKGINP-------SLCMHRILLEDDYKPSIEHQRRLNPNM 751
E+ + +KY K I +D + NP I L D K +P
Sbjct: 193 ERYFLITEKYQK-IEQLLDKVCSENPIDPIKSKQWMKASIKLIDPLKVIRVKPMSYSPQD 251
Query: 752 KEVVKKEVLKLLDAGVIYPISDSKWVSPVQVVPKKGGLTVIKNDKNESIATRTVTGWRMC 811
+E K++ +LLD G+I P S S+ +SP +V E+ A R RM
Sbjct: 252 REGFAKQIKELLDLGLIIP-SKSQHMSPAFLV--------------ENEAERRRGKKRMV 296
Query: 812 IDYRKLNKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEKTTFTCP 871
++Y+ +N+AT D LP + ++L L S F D SGF+Q+ + Q+ T FTCP
Sbjct: 297 VNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKSGFWQVVLDEESQKLTAFTCP 356
Query: 872 FGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNFDDCLTNLEKV 931
G F ++ +PFGL AP+ FQR M + + +K V++DD V ++ D ++ V
Sbjct: 357 QGHFQWKVVPFGLKQAPSIFQRHMQTALNG-ADKFCMVYVDDIIVFSNSELDHYNHVYAV 415
Query: 932 LERCEQVNLVLNWEKCHFMVREGIVLGHLVSDRGIEVDRAKV-EIIEKMLPP-TSVKEIR 989
L+ E+ ++L+ +K + + +E I L D+G + + E I K K ++
Sbjct: 416 LKIVEKYGIILSKKKAN-LFKEKINFLGLEIDKGTHCPQNHILENIHKFPDRLEDKKHLQ 474
Query: 990 SFLGHAGFYRRFIKDFSSITKPLTNLLLKDADFTFDDSCLQAFCRLKEALITAPIIQPPD 1049
FLG + +I + I KPL L KD + + S ++K+ L + P + P
Sbjct: 475 RFLGVLTYAETYIPKLAEIRKPLQVKLKKDVTWNWTQSDSDYVKKIKKNLGSFPKLYLPK 534
Query: 1050 WNLPFEIMCDASDYAVGVVLGQRKDKKMHAI-YYASKTLDGAQVNYATTEKELLAVVYAI 1108
I DASD G VL R + I Y+S + A+ NY + +KELLAV I
Sbjct: 535 PEDHLIIETDASDSFWGGVLKARALDGVELICRYSSGSFKQAEKNYHSNDKELLAVKQVI 594
Query: 1109 DKFRQYLVGSKIIVYTDHSAIKYLLN---KKDAKP-RLIRWILLLQEFDLEIKDKKGVEN 1164
KF YL + V TD+ Y L K D+K RL+RW ++ +++ +GV+N
Sbjct: 595 TKFSAYLTPVRFTVRTDNKNFTYFLRINLKGDSKQGRLVRWQNWFSKYQFDVEHLEGVKN 654
Query: 1165 VVADHLSR 1172
V+AD L+R
Sbjct: 655 VLADCLTR 662
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 154 bits (390), Expect = 1e-36
Identities = 136/458 (29%), Positives = 215/458 (46%), Gaps = 29/458 (6%)
Query: 731 ILLEDDYKPSIEHQRRLNPNMKEVVKKEVLKLLDAGVIYPISDSKWVSPVQVVPKKGGLT 790
I L D K + +P +E K++ +LLD VI P S S ++P +V
Sbjct: 238 IKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKP-SKSPHMAPAFLV------- 289
Query: 791 VIKNDKNESIATRTVTGWRMCIDYRKLNKATRKDHFPLPFIDQMLERLAKHSHFCYLDGY 850
NE+ R RM ++Y+ +NKAT D + LP D++L + F D
Sbjct: 290 -----NNEAEKRRGKK--RMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCK 342
Query: 851 SGFFQIPIHPNDQEKTTFTCPFGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVF 910
SGF+Q+ + + T FTCP G + + +PFGL AP+ FQR M F F K V+
Sbjct: 343 SGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVY 401
Query: 911 MDDFSVHGSNFDDCLTNLEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLVSDRGIEVDR 970
+DD V +N +D L ++ +L++C Q ++L+ +K ++ LG L D G +
Sbjct: 402 VDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLG-LEIDEGTHKPQ 460
Query: 971 AKVEIIEKMLPPT--SVKEIRSFLGHAGFYRRFIKDFSSITKPLTNLLLKDADFTFDDSC 1028
+ P T K+++ FLG + +I + I KPL L ++ + +
Sbjct: 461 GHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWRWTKED 520
Query: 1029 LQAFCRLKEALITAPIIQPPDWNLPFEIMCDAS-DYAVGVVLGQRKDKKMHA---IYYAS 1084
++K+ L P + P I DAS DY G++ + ++ + YAS
Sbjct: 521 TLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYAS 580
Query: 1085 KTLDGAQVNYATTEKELLAVVYAIDKFRQYLVGSKIIVYTDHSAIKYLLN---KKDAK-P 1140
+ A+ NY + +KE LAV+ I KF YL ++ TD++ K +N K D+K
Sbjct: 581 GSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLG 640
Query: 1141 RLIRWILLLQEFDLEIKDKKGVENVVADHLSRLREANK 1178
R IRW L + +++ KG +N AD LS RE NK
Sbjct: 641 RNIRWQAWLSHYSFDVEHIKGTDNHFADFLS--REFNK 676
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 154 bits (389), Expect = 2e-36
Identities = 136/458 (29%), Positives = 215/458 (46%), Gaps = 29/458 (6%)
Query: 731 ILLEDDYKPSIEHQRRLNPNMKEVVKKEVLKLLDAGVIYPISDSKWVSPVQVVPKKGGLT 790
I L D K + +P +E K++ +LLD VI P S S ++P +V
Sbjct: 238 IKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKP-SKSPHMAPAFLV------- 289
Query: 791 VIKNDKNESIATRTVTGWRMCIDYRKLNKATRKDHFPLPFIDQMLERLAKHSHFCYLDGY 850
NE+ R RM ++Y+ +NKAT D + LP D++L + F D
Sbjct: 290 -----NNEAEKRRGKK--RMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCK 342
Query: 851 SGFFQIPIHPNDQEKTTFTCPFGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVF 910
SGF+Q+ + + T FTCP G + + +PFGL AP+ FQR M F F K V+
Sbjct: 343 SGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVY 401
Query: 911 MDDFSVHGSNFDDCLTNLEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLVSDRGIEVDR 970
+DD V +N +D L ++ +L++C Q ++L+ +K ++ LG L D G +
Sbjct: 402 VDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLG-LEIDEGTHKPQ 460
Query: 971 AKVEIIEKMLPPT--SVKEIRSFLGHAGFYRRFIKDFSSITKPLTNLLLKDADFTFDDSC 1028
+ P T K+++ FLG + +I + I KPL L ++ + +
Sbjct: 461 GHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKED 520
Query: 1029 LQAFCRLKEALITAPIIQPPDWNLPFEIMCDAS-DYAVGVVLGQRKDKKMHA---IYYAS 1084
++K+ L P + P I DAS DY G++ + ++ + YAS
Sbjct: 521 TLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYAS 580
Query: 1085 KTLDGAQVNYATTEKELLAVVYAIDKFRQYLVGSKIIVYTDHSAIKYLLN---KKDAK-P 1140
+ A+ NY + +KE LAV+ I KF YL ++ TD++ K +N K D+K
Sbjct: 581 GSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLG 640
Query: 1141 RLIRWILLLQEFDLEIKDKKGVENVVADHLSRLREANK 1178
R IRW L + +++ KG +N AD LS RE NK
Sbjct: 641 RNIRWQAWLSHYSFDVEHIKGTDNHFADFLS--REFNK 676
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 154 bits (389), Expect = 2e-36
Identities = 136/458 (29%), Positives = 215/458 (46%), Gaps = 29/458 (6%)
Query: 731 ILLEDDYKPSIEHQRRLNPNMKEVVKKEVLKLLDAGVIYPISDSKWVSPVQVVPKKGGLT 790
I L D K + +P +E K++ +LLD VI P S S ++P +V
Sbjct: 238 IKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKP-SKSPHMAPAFLV------- 289
Query: 791 VIKNDKNESIATRTVTGWRMCIDYRKLNKATRKDHFPLPFIDQMLERLAKHSHFCYLDGY 850
NE+ R RM ++Y+ +NKAT D + LP D++L + F D
Sbjct: 290 -----NNEAEKRRGKK--RMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCK 342
Query: 851 SGFFQIPIHPNDQEKTTFTCPFGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVF 910
SGF+Q+ + + T FTCP G + + +PFGL AP+ FQR M F F K V+
Sbjct: 343 SGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVY 401
Query: 911 MDDFSVHGSNFDDCLTNLEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLVSDRGIEVDR 970
+DD V +N +D L ++ +L++C Q ++L+ +K ++ LG L D G +
Sbjct: 402 VDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLG-LEIDEGTHKPQ 460
Query: 971 AKVEIIEKMLPPT--SVKEIRSFLGHAGFYRRFIKDFSSITKPLTNLLLKDADFTFDDSC 1028
+ P T K+++ FLG + +I + I KPL L ++ + +
Sbjct: 461 GHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKED 520
Query: 1029 LQAFCRLKEALITAPIIQPPDWNLPFEIMCDAS-DYAVGVVLGQRKDKKMHA---IYYAS 1084
++K+ L P + P I DAS DY G++ + ++ + YAS
Sbjct: 521 TLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYAS 580
Query: 1085 KTLDGAQVNYATTEKELLAVVYAIDKFRQYLVGSKIIVYTDHSAIKYLLN---KKDAK-P 1140
+ A+ NY + +KE LAV+ I KF YL ++ TD++ K +N K D+K
Sbjct: 581 GSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLG 640
Query: 1141 RLIRWILLLQEFDLEIKDKKGVENVVADHLSRLREANK 1178
R IRW L + +++ KG +N AD LS RE NK
Sbjct: 641 RNIRWQAWLSHYSFDVEHIKGTDNHFADFLS--REFNK 676
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 674
Score = 151 bits (382), Expect = 1e-35
Identities = 132/452 (29%), Positives = 211/452 (46%), Gaps = 27/452 (5%)
Query: 731 ILLEDDYKPSIEHQRRLNPNMKEVVKKEVLKLLDAGVIYPISDSKWVSPVQVVPKKGGLT 790
I L D K + +P +E K++ +LLD VI P S S ++P +V
Sbjct: 233 IKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKP-SKSPHMAPAFLV------- 284
Query: 791 VIKNDKNESIATRTVTGWRMCIDYRKLNKATRKDHFPLPFIDQMLERLAKHSHFCYLDGY 850
NE+ R RM ++Y+ +NKAT D + P D++L + F D
Sbjct: 285 -----NNEAEKRRGKK--RMVVNYKAMNKATVGDAYNPPNKDELLTLIRGKKIFSSFDCK 337
Query: 851 SGFFQIPIHPNDQEKTTFTCPFGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVF 910
SGF+Q+ + + T FTCP G + + +PFGL AP+ FQR M F F K V+
Sbjct: 338 SGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVY 396
Query: 911 MDDFSVHGSNFDDCLTNLEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLVSDRGIEVDR 970
+DD V +N +D L ++ +L++C Q ++L+ +K ++ LG L D G +
Sbjct: 397 VDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLG-LEIDEGTHKPQ 455
Query: 971 AKVEIIEKMLPPT--SVKEIRSFLGHAGFYRRFIKDFSSITKPLTNLLLKDADFTFDDSC 1028
+ P T K+++ FLG + +I + I KPL L ++ + +
Sbjct: 456 GHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKED 515
Query: 1029 LQAFCRLKEALITAPIIQPPDWNLPFEIMCDAS-DYAVGVVLGQRKDKKMHA---IYYAS 1084
++K+ L P + P I DAS DY G++ + ++ + YAS
Sbjct: 516 TLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYAS 575
Query: 1085 KTLDGAQVNYATTEKELLAVVYAIDKFRQYLVGSKIIVYTDHSAIKYLLN---KKDAK-P 1140
+ A+ NY + +KE LAV+ I KF YL ++ TD++ K +N K D+K
Sbjct: 576 GSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLG 635
Query: 1141 RLIRWILLLQEFDLEIKDKKGVENVVADHLSR 1172
R IRW L + +++ KG +N AD LSR
Sbjct: 636 RNIRWQAWLSHYSFDVEHIKGTDNHFADFLSR 667
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 680
Score = 150 bits (379), Expect = 2e-35
Identities = 131/458 (28%), Positives = 215/458 (46%), Gaps = 29/458 (6%)
Query: 731 ILLEDDYKPSIEHQRRLNPNMKEVVKKEVLKLLDAGVIYPISDSKWVSPVQVVPKKGGLT 790
I L D K + +P +E K++ +LLD VI P S S ++P +V +
Sbjct: 239 IKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKP-SKSPHMAPAFLVNNEA--- 294
Query: 791 VIKNDKNESIATRTVTGWRMCIDYRKLNKATRKDHFPLPFIDQMLERLAKHSHFCYLDGY 850
+N + RM ++Y+ +NKAT D + LP D++L + F D
Sbjct: 295 --ENGRGNK---------RMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCK 343
Query: 851 SGFFQIPIHPNDQEKTTFTCPFGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVF 910
SGF+Q+ + + T FTCP G + + +PFGL AP+ FQR M F F K V+
Sbjct: 344 SGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVY 402
Query: 911 MDDFSVHGSNFDDCLTNLEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLVSDRGIEVDR 970
+DD V +N +D L ++ +L++C Q ++L+ +K ++ LG L D G +
Sbjct: 403 VDDIVVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLG-LEIDEGTHKPQ 461
Query: 971 AKVEIIEKMLPPT--SVKEIRSFLGHAGFYRRFIKDFSSITKPLTNLLLKDADFTFDDSC 1028
+ P T K+++ FLG + +I + + + +PL L ++ + +
Sbjct: 462 GHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPNLAQMRQPLQAKLKENVPWKWTKED 521
Query: 1029 LQAFCRLKEALITAPIIQPPDWNLPFEIMCDAS-DYAVGVVLGQRKDKKMHA---IYYAS 1084
++K+ L P + P I DAS DY G++ + ++ + Y S
Sbjct: 522 TLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYRS 581
Query: 1085 KTLDGAQVNYATTEKELLAVVYAIDKFRQYLVGSKIIVYTDHSAIKYLLN---KKDAK-P 1140
+ A+ NY + +KE LAV+ I KF YL ++ TD++ K +N K D+K
Sbjct: 582 GSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLG 641
Query: 1141 RLIRWILLLQEFDLEIKDKKGVENVVADHLSRLREANK 1178
R IRW L + +++ KG +N AD LS RE NK
Sbjct: 642 RNIRWQAWLSHYSFDVEHIKGTDNHFADFLS--REFNK 677
>POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1165
Score = 135 bits (341), Expect = 6e-31
Identities = 105/406 (25%), Positives = 175/406 (42%), Gaps = 29/406 (7%)
Query: 733 LEDDYKPSIEHQRRLNPNMKEVVKKEVLKLLDAGVIYPISDSKWVSPVQVVPKKGGLTVI 792
L P Q ++ +E ++ + K LD GV+ P S W +P+ V K G
Sbjct: 169 LRSGASPVAVRQYPMSKEAREGIRPHIQKFLDLGVLVPCR-SPWNTPLLPVKKPG----- 222
Query: 793 KNDKNESIATRTVTGWRMCIDYRKLNKATRKDHFPLPFIDQMLERLA-KHSHFCYLDGYS 851
ND +R D R++NK + H +P +L L ++ + LD
Sbjct: 223 TND------------YRPVQDLREINKRVQDIHPTVPNPYNLLSSLPPSYTWYSVLDLKD 270
Query: 852 GFFQIPIHPNDQEKTTFTCP------FGTFAYRRMPFGLCNAPATFQRCMMSIFSDF--- 902
FF + +HPN Q F G + R+P G N+P F + + F
Sbjct: 271 AFFCLRLHPNSQPLFAFEWKDPEKGNTGQLTWTRLPQGFKNSPTLFDEALHRDLAPFRAL 330
Query: 903 -VEKIMEVFMDDFSVHGSNFDDCLTNLEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLV 961
+ ++ ++DD V ++DC +K+L+ ++ ++ +K RE LG+L+
Sbjct: 331 NPQVVLLQYVDDLLVAAPTYEDCKKGTQKLLQELSKLGYRVSAKKAQLCQREVTYLGYLL 390
Query: 962 SDRGIEVDRAKVEIIEKMLPPTSVKEIRSFLGHAGFYRRFIKDFSSITKPLTNLLLKDAD 1021
+ + A+ + K+ PT+ +++R FLG AGF R +I F+S+ PL L +
Sbjct: 391 KEGKRWLTPARKATVMKIPVPTTPRQVREFLGTAGFCRLWIPGFASLAAPLYPLTKESIP 450
Query: 1022 FTFDDSCLQAFCRLKEALITAPIIQPPDWNLPFEIMCDASDYAVGVVLGQRKDKKMHAIY 1081
F + + QAF +K+AL++AP + PD PF + D VL Q +
Sbjct: 451 FIWTEEHQQAFDHIKKALLSAPALALPDLTKPFTLYIDERAGVARGVLTQTLGPWRRPVA 510
Query: 1082 YASKTLDGAQVNYATTEKELLAVVYAIDKFRQYLVGSKIIVYTDHS 1127
Y SK LD + T K + AV + + +G + V HS
Sbjct: 511 YLSKKLDPVASGWPTCLKAVAAVALLLKDADKLTLGQNVTVIASHS 556
Score = 103 bits (257), Expect = 3e-21
Identities = 63/181 (34%), Positives = 90/181 (48%), Gaps = 2/181 (1%)
Query: 1274 HASTQKTSFKILHSGFWWPSLFKDVHLFISKCDKCQRTGSITKRNEMPLNNILEVEIFDV 1333
H +K + + P+L V S+C C T ++T E +
Sbjct: 821 HLGPEKLLQLVNRTSLLIPNLQSAVREVTSQCQACAMTNAVTTYRETGKRQRGDRPGV-Y 879
Query: 1334 WGVDFMGPFPSSFGNQYILVAVDYVSKWVEAIASPTNDAQVVIKMFKKVIFPRFGVPRVV 1393
W VDF P +GN+Y+LV +D S WVEA + T A +V K + I PRFG+P+V+
Sbjct: 880 WEVDFTEIKPGRYGNKYLLVFIDTFSGWVEAFPTKTETALIVCKKILEEILPRFGIPKVL 939
Query: 1394 ISDGGSHFISKHFEKLLQKLGVRHKVATPYHPQTSGQVEVSNRQIKAILEK-TVSTSRTD 1452
SD G F+++ + L +LG+ K+ Y PQ+SGQVE NR IK L K + T D
Sbjct: 940 GSDNGPAFVAQVSQGLATQLGINWKLHCAYRPQSSGQVERMNRTIKETLTKLALETGGKD 999
Query: 1453 W 1453
W
Sbjct: 1000 W 1000
>POL_FENV1 (P31792) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)] (Fragment)
Length = 1046
Score = 124 bits (310), Expect = 2e-27
Identities = 109/438 (24%), Positives = 185/438 (41%), Gaps = 39/438 (8%)
Query: 705 LKKYPKAIGYTIDDIKGINPSLCMHRILLEDDYKPS-IEHQRRLNPNMKEV---VKKEVL 760
L+ +P+A T G+ + C I++ D KP+ + R P KE ++ +
Sbjct: 1 LQDFPQAWAET----GGLGRAKCQVPIII--DLKPTAMPVSIRQYPMSKEAHMGIQPHIT 54
Query: 761 KLLDAGVIYPISDSKWVSPVQVVPKKGGLTVIKNDKNESIATRTVTGWRMCIDYRKLNKA 820
+ L+ GV+ P S W +P+ V K G +R D R++NK
Sbjct: 55 RFLELGVLRPCR-SPWNTPLLPVKKPG-----------------TRDYRPVQDLREVNKR 96
Query: 821 TRKDHFPLPFIDQMLERLAK-HSHFCYLDGYSGFFQIPIHPNDQEKTTFTCP------FG 873
T H +P +L L+ + + LD FF +P+ P QE F G
Sbjct: 97 TMDIHPTVPNPYNLLSTLSPDRTWYTVLDLKDAFFCLPLAPQSQELFAFEWRDPERGISG 156
Query: 874 TFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEV----FMDDFSVHGSNFDDCLTNLE 929
+ R+P G N+P F + +DF + EV ++DD + + C+ +
Sbjct: 157 QLTWTRLPQGFKNSPTLFDEALHRDLTDFRTQHPEVTLLQYVDDLLLAAPTKEACIRGTK 216
Query: 930 KVLERCEQVNLVLNWEKCHFMVREGIVLGHLVSDRGIEVDRAKVEIIEKMLPPTSVKEIR 989
+L + +K + LG+++S+ + ++E + + PP + +E+R
Sbjct: 217 HLLRELGDKGYRASAKKAQICQTKVTYLGYILSEGKRWLTPGRIETVAHIPPPQNPREVR 276
Query: 990 SFLGHAGFYRRFIKDFSSITKPLTNLLLKDADFTFDDSCLQAFCRLKEALITAPIIQPPD 1049
FLG AGF R +I F+ + PL L + A FT+ + AF LKEAL++AP + PD
Sbjct: 277 EFLGTAGFCRLWIPGFAELAAPLYALTKESAPFTWQEKHQSAFEALKEALLSAPALGLPD 336
Query: 1050 WNLPFEIMCDASDYAVGVVLGQRKDKKMHAIYYASKTLDGAQVNYATTEKELLAVVYAID 1109
+ PF + D VL Q+ + Y SK LD + + + A +
Sbjct: 337 TSKPFTLFIDEKQGIAKGVLTQKLGPWKRPVAYLSKKLDPVAAGWPPCLRIMAATAMLVK 396
Query: 1110 KFRQYLVGSKIIVYTDHS 1127
+ +G + V T H+
Sbjct: 397 DSAKLTLGQPLTVITPHA 414
Score = 103 bits (256), Expect = 4e-21
Identities = 69/202 (34%), Positives = 99/202 (48%), Gaps = 7/202 (3%)
Query: 1254 IPESEVSSILTHCHSSSYGGHASTQKTSFKILHSGFWWPSLFKDVHLFISKCDKCQRTGS 1313
+P E +++ H+ + H S QK I + F P + S C CQ+ +
Sbjct: 689 LPRKEALAMIQQMHAWT---HLSNQKLKLLIEKTDFLIPKAGTLIEQVTSACKVCQQVNA 745
Query: 1314 -ITKRNEMPLNNILEVEIFDVWGVDFMGPFPSSFGNQYILVAVDYVSKWVEAIASPTNDA 1372
T+ E ++ W +DF P G +Y+LV VD S WVEA + A
Sbjct: 746 GATRVPEGKRTRGNRPGVY--WEIDFTEVKPHYAGYKYLLVFVDTFSGWVEAYPTRQETA 803
Query: 1373 QVVIKMFKKVIFPRFGVPRVVISDGGSHFISKHFEKLLQKLGVRHKVATPYHPQTSGQVE 1432
+V K + IFPRFG+P+V+ SD G F+S+ + L + LG+ K+ Y PQ+SGQVE
Sbjct: 804 HMVAKKILEEIFPRFGLPKVIGSDNGPAFVSQVSQGLARTLGINWKLHCAYRPQSSGQVE 863
Query: 1433 VSNRQIKAILEK-TVSTSRTDW 1453
NR IK L K T+ T DW
Sbjct: 864 RMNRTIKETLTKLTLETGLKDW 885
>POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat
protein; Protease (EC 3.4.23.-); Reverse transcriptase
(EC 2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1675
Score = 122 bits (307), Expect = 5e-27
Identities = 123/493 (24%), Positives = 220/493 (43%), Gaps = 34/493 (6%)
Query: 694 DEVETEKLLYVLKKYPKAIGYTIDDI-KGINPSLCMHRILLEDDYKPSIEHQRRLNPNMK 752
+E+ + L+ ++K+ +A+G+ DDI K +C +I+ D P K
Sbjct: 1140 NEIGNQSLITMVKEL-EALGFIGDDITKNRTTWVCDFKIINPDINITCATIP--YTPADK 1196
Query: 753 EVVKKEVLKLLDAGVIYPISDSKWVSPVQVVPKKGGLTVIKNDKNESIATRTVTGWRMCI 812
EV +K++ +LLD +K + + I + +E +A + R+
Sbjct: 1197 EVFEKQIKELLD---------NKLIKKADPTCRHRTAAFIVRNHSEEVAQKP----RIVY 1243
Query: 813 DYRKLNKATRKDHFPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPNDQEKTTFTCPF 872
+Y++LN D F +P M+ + K + F D +GF + + + ++ TTFTC
Sbjct: 1244 NYKRLNDNMHTDPFNIPHKISMINLIQKANIFSKFDLKAGFHHMKLKDDFKDWTTFTCSE 1303
Query: 873 GTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVHGSNFDDCLTNLEKVL 932
G + + PFG+ NAP FQR M F D K +++DD + +N + + +L+
Sbjct: 1304 GLYTWNVCPFGIANAPCAFQRFMQESFGDL--KFALLYIDDILIASNNEKEHIEHLKIFF 1361
Query: 933 ERCEQVNLVLNWEKCHFMVREGIVLGHLVSDRGIEVDRAKVEIIEKM--LPPTSVKEIRS 990
R ++V VL+ +K ++E LG + + I + V+ I+K ++K +++
Sbjct: 1362 NRVKEVGCVLSKKKSKMFLKEVEYLGVEIKEGKISLQPHIVDKIKKFDKNKLNTLKGLQA 1421
Query: 991 FLGHAGFYRRFIKDFSSITKPLTNLLLKDADFTFDDSCLQAFCRLKEALITAPIIQPPDW 1050
+LG + R +IKD S + PL K+ F+ +++ + ++ P
Sbjct: 1422 YLGLLNYARGYIKDLSKLVGPLYKKTGKNGQRIFNKEDWNIIFKIEREVSKIKPLERPKE 1481
Query: 1051 NLPFEIMCDASDYAVGVVLGQRKDK-----KMHAIYYASKTLDGAQVNYATTEKELLAVV 1105
I DAS+ G VL + DK YAS G + + + + E+ A+
Sbjct: 1482 TDYIIIETDASEEGWGAVLVCKPDKYSGKDTEKIAGYASGNF-GEKKTWTSLDYEIEAIN 1540
Query: 1106 YAIDKFRQYLVGSKIIVYTDHSAIKYLLNKKDAKPR-LIRWI-----LLLQEFDLEIKDK 1159
A++KF+ YL + TD AI + +D K R RWI LL + +
Sbjct: 1541 EALNKFQIYL-DKDFTIRTDCEAIVKGIKTEDYKKRSKTRWIKLRDNLLKDGYKPTFEHI 1599
Query: 1160 KGVENVVADHLSR 1172
KG +N + + LSR
Sbjct: 1600 KGNKNFLPNFLSR 1612
>POL_BAEVM (P10272) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1189
Score = 122 bits (306), Expect = 7e-27
Identities = 128/543 (23%), Positives = 217/543 (39%), Gaps = 69/543 (12%)
Query: 705 LKKYPKAIGYTIDDIKGINPSLCMHRILLEDDYKPSI------EHQRRLNPNMKEVVKKE 758
L+ +P+A T G+ + C I++ D KP+ ++ L +M +++
Sbjct: 144 LQDFPQAWAET----GGLGRAKCQAPIII--DLKPTAVPVSIKQYPMSLEAHMG--IRQH 195
Query: 759 VLKLLDAGVIYPISDSKWVSPVQVVPKKGGLTVIKNDKNESIATRTVTGWRMCIDYRKLN 818
++K L+ GV+ P S W +P+ V K G +R D R++N
Sbjct: 196 IIKFLELGVLRPCR-SPWNTPLLPVKKPG-----------------TQDYRPVQDLREIN 237
Query: 819 KATRKDHFPLPFIDQMLERLAK-HSHFCYLDGYSGFFQIPIHPNDQEKTTFTCP------ 871
K T H +P +L L +S + LD FF +P+ P QE F
Sbjct: 238 KRTVDIHPTVPNPYNLLSTLKPDYSWYTVLDLKDAFFCLPLAPQSQELFAFEWKDPERGI 297
Query: 872 FGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEV----FMDDFSVHGSNFDDCLTN 927
G + R+P G N+P F + +DF + EV ++DD + C
Sbjct: 298 SGQLTWTRLPQGFKNSPTLFDEALHRDLTDFRTQHPEVTLLQYVDDLLLAAPTKKACTQG 357
Query: 928 LEKVLERCEQVNLVLNWEKCHFMVREGIVLGHLVSDRGIEVDRAKVEIIEKMLPPTSVKE 987
+L+ + + +K + LG+++S+ + ++E + ++ PP + +E
Sbjct: 358 TRHLLQELGEKGYRASAKKAQICQTKVTYLGYILSEGKRWLTPGRIETVARIPPPRNPRE 417
Query: 988 IRSFLGHAGFYRRFIKDFSSITKPLTNLLLKDADFTFDDSCLQAFCRLKEALITAPIIQP 1047
+R FLG AGF R +I F+ + PL L + FT+ AF LK+AL++AP +
Sbjct: 418 VREFLGTAGFCRLWIPGFAELAAPLYALTKESTPFTWQTEHQLAFEALKKALLSAPALGL 477
Query: 1048 PDWNLPFEIMCDASDYAVGVVLGQRKDKKMHAIYYASKTLDGAQVNYATTEKELLAVVYA 1107
PD + PF + D VL Q+ + Y SK LD + + + A
Sbjct: 478 PDTSKPFTLFLDERQGIAKGVLTQKLGPWKRPVAYLSKKLDPVAAGWPPCLRIMAATAML 537
Query: 1108 IDKFRQYLVGSKIIVYTDHSAIKYLLNKKDAKPRLIRWI--LLLQEFDLEIKDKKGVENV 1165
+ + +G + V T H+ + D RWI L + + D + V
Sbjct: 538 VKDSAKLTLGQPLTVITPHTLEAIVRQPPD------RWITNARLTHYQALLLD---TDRV 588
Query: 1166 VADHLSRLREANKDELPLDDSFPDDQLFLLAQT---------------DAPWYADFVNFL 1210
L A +P + P D +LA+T D WY D ++L
Sbjct: 589 QFGPPVTLNPATLLPVPENQPSPHDCRQVLAETHGTREDLKDQELPDADHTWYTDGSSYL 648
Query: 1211 AAG 1213
+G
Sbjct: 649 DSG 651
Score = 101 bits (252), Expect = 1e-20
Identities = 68/202 (33%), Positives = 98/202 (47%), Gaps = 7/202 (3%)
Query: 1254 IPESEVSSILTHCHSSSYGGHASTQKTSFKILHSGFWWPSLFKDVHLFISKCDKCQRTGS 1313
+P+ E +++ H+ + H +K I + F P + S C CQ+ +
Sbjct: 832 LPQKEALAMIQQMHAWT---HLGNRKLKLLIEKTDFLIPRASTLIEQVTSACKVCQQVNA 888
Query: 1314 ITKRNEMPLNNILEVEIFDV-WGVDFMGPFPSSFGNQYILVAVDYVSKWVEAIASPTNDA 1372
R +P V W +DF P G +Y+LV VD S WVEA + A
Sbjct: 889 GATR--VPAGKRTRGNRPGVYWEIDFTEVKPHYAGYKYLLVFVDTFSGWVEAFPTRQETA 946
Query: 1373 QVVIKMFKKVIFPRFGVPRVVISDGGSHFISKHFEKLLQKLGVRHKVATPYHPQTSGQVE 1432
+V K + IFPRFG+P+V+ SD G F+S+ + L + LG+ K+ Y PQ+SGQVE
Sbjct: 947 HIVAKKILEEIFPRFGLPKVIGSDNGPAFVSQVSQGLARILGINWKLHCAYRPQSSGQVE 1006
Query: 1433 VSNRQIKAILEK-TVSTSRTDW 1453
NR IK L K T+ T DW
Sbjct: 1007 RMNRTIKETLTKLTLETGLKDW 1028
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.319 0.137 0.406
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 173,502,184
Number of Sequences: 164201
Number of extensions: 7668758
Number of successful extensions: 21112
Number of sequences better than 10.0: 173
Number of HSP's better than 10.0 without gapping: 127
Number of HSP's successfully gapped in prelim test: 46
Number of HSP's that attempted gapping in prelim test: 20589
Number of HSP's gapped (non-prelim): 363
length of query: 1454
length of database: 59,974,054
effective HSP length: 123
effective length of query: 1331
effective length of database: 39,777,331
effective search space: 52943627561
effective search space used: 52943627561
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 72 (32.3 bits)
Medicago: description of AC135959.8
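
Note on the statistics trailer: the figures above can be cross-checked with the standard Karlin-Altschul relations, bit score S' = (lambda*S - ln K) / ln 2 and E = (effective search space) * 2^(-S'). The sketch below is not part of the BLAST output. It assumes that the gapped Lambda and K printed in the trailer are the ones used for the reported scores, and that the printed values are rounded, so results agree with the report only approximately. All function and constant names are illustrative.

```python
import math

# Values copied from the trailer of this report.
GAPPED_LAMBDA = 0.267          # "Gapped" Lambda (printed value, presumably rounded)
GAPPED_K = 0.0410              # "Gapped" K (printed value, presumably rounded)
QUERY_LENGTH = 1454            # "length of query"
EFFECTIVE_HSP_LENGTH = 123     # "effective HSP length"
EFFECTIVE_DB_LENGTH = 39_777_331  # "effective length of database"


def effective_search_space(query_len, hsp_len, eff_db_len):
    """Effective query length times effective database length."""
    return (query_len - hsp_len) * eff_db_len


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score (the number in parentheses) to bits."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space):
    """Expected number of chance hits scoring at least raw_score."""
    return search_space * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    space = effective_search_space(
        QUERY_LENGTH, EFFECTIVE_HSP_LENGTH, EFFECTIVE_DB_LENGTH
    )
    print("effective search space:", space)
    # Raw scores taken from the POL_CAMVE (389) and POL_CAMVD (382) hits.
    for raw in (389, 382):
        print(raw, round(bit_score(raw), 2), f"{expect_value(raw, space):.1e}")
```

With these inputs the computed search space equals the "effective search space" line exactly. For the hits in this section, dropping the fractional part of the computed bit score gives the integer printed in each "Score =" line, and the computed E-values round to the printed "Expect" values. Small discrepancies are to be expected because the trailer shows Lambda and K to only three significant digits.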