
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0026.5
(1710 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 227 2e-58
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 225 9e-58
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 194 2e-48
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 191 1e-47
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 191 1e-47
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 191 2e-47
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 190 3e-47
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 185 8e-46
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 155 1e-36
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 146 5e-34
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 143 5e-33
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 143 5e-33
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 142 8e-33
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 140 4e-32
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot... 127 3e-28
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot... 121 1e-26
YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II 120 4e-26
POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;... 117 2e-25
POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse transcript... 116 6e-25
POL_SFV3L (P27401) Pol polyprotein [Contains: Protease (EC 3.4.2... 115 1e-24
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 227 bits (579), Expect = 2e-58
Identities = 290/1319 (21%), Positives = 554/1319 (41%), Gaps = 160/1319 (12%)
Query: 430 SVEDERAIFQRPTEQMKSHLKPLFVWAKVDEKGVNKVLVDGGATINLMP----------- 478
SVE++R ++ T ++ F+ AK K V L+D GA I+++
Sbjct: 29 SVEEDRRVY---TINYNLNIFSTFIHAKTGVKLV--FLLDTGADISILKENSDKFSNIQI 83
Query: 479 --KFMLKKLGKTEAD--------------LIPHDMVLSD--YEGKTGSSLGAIML---NI 517
K ++ +G+ + +IPHD L D + +G + N
Sbjct: 84 TNKINIQGIGQQKIQSRGQTFIEIQTGKYVIPHDFHLVDKNFPIPCDGIIGIDFIKKYNC 143
Query: 518 TVGTVARSTLFIVVPSKANYNLLLGREWIHGVGAV----PSTLHQRISIWKLDG--VVEN 571
+ FI+ P+ + + + + G+ S + +R+ + D ++ N
Sbjct: 144 QIDLNQEEDWFIIRPNNLKFPIYIPIAYSSGINTTLLPARSQVVRRLIVSSKDDNILIPN 203
Query: 572 VQADQSYYLAETGYVGKKNFEKSLATIAPLDTVANQYFNPYSEYSVMLDPIRGLNLNEAY 631
+ Y+A T F + L T T ++Q N + + +P+ N+ +A
Sbjct: 204 QEIQTGIYVANTIATSSNTFVRILNT-----TDSDQLVNMDT---LKYEPLSNYNVVQAN 255
Query: 632 KPDAMSGWHNDEEKDATTEWQNEMVKLLKEFKDCFAWDYDEMPGLSRDLVELKLPIKEDK 691
+ +K+ +++++ + E+ D FA + + P +L + +L +K+D+
Sbjct: 256 SEHRNKTVLSQLKKNFPELFKSQLENICSEYIDIFALESE--PITVNNLYKQQLRLKDDE 313
Query: 692 KPVKQLPRRFHPDVLVKIKEEIERLLKCKFIRTARYVDWLANVVPVIKKNG------KMR 745
+ R H V +I+ ++++L+K K + + + + ++ V KK+ K R
Sbjct: 314 PVYTKNYRSPHSQV-EEIQAQVQKLIKDKIVEPS-VSQYNSPLLLVPKKSSPNSDKKKWR 371
Query: 746 VCIDFRDLNAATPKDEYHMPVAEMMVDSTAGHEYLSLLDGYSGYNQIFIAEEDVSKTAFR 805
+ ID+R +N D++ +P + ++D +Y S LD SG++QI + E T+F
Sbjct: 372 LVIDYRQINKKLLADKFPLPRIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFS 431
Query: 806 CPGALGTYEWVVMPFGLKNAGATYQRVMNTIFHDFIETFMQVYIDDIVVKSPSRDDHLAH 865
G+Y + +PFGLK A ++QR+M F + +Y+DD++V S L +
Sbjct: 432 TSN--GSYRFTRLPFGLKIAPNSFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKN 489
Query: 866 LRKSFERMRKYGLKMNPLKCAFGVIAGDFLGFVVHKKGIEINKNKAKAILDTSPPTSKKQ 925
L + F + R+Y LK++P KC+F + FLG KGI + K I + P
Sbjct: 490 LTEVFGKCREYNLKLHPEKCSFFMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVPHDADS 549
Query: 926 LQSLLGKINFLRRFIANLSEKTKSFSPLLRLKKEDAFRWEAEHQKAFDELKVYLSSPHVM 985
+ + N+ RRFI N ++ ++ + L KK F W E QKAF LK L +P ++
Sbjct: 550 ARRFVAFCNYYRRFIKNFADYSRHITRL--CKKNVPFEWTDECQKAFIHLKSQLINPTLL 607
Query: 986 APPIRGKPMKLYISATDGTIGSMLAQEDEDSKERAIFYLSRVLNDAETRYTMIEKLCLCL 1045
P K + A+ G++L Q + + + + Y SR E+ + E+ +
Sbjct: 608 QYPDFSKEFCITTDASKQACGAVLTQ-NHNGHQLPVAYASRAFTKGESNKSTTEQELAAI 666
Query: 1046 YFSCVKLKYYIKPIDVMVFSHYDIIKHMLSKPILHSRIGKWALALTEYSLTYAPLKAVKG 1105
+++ + + YI V + + + ++ S S++ + L L EY+ T LK K
Sbjct: 667 HWAIIHFRPYIYGKHFTVKTDHRPLTYLFSMVNPSSKLTRIRLELEEYNFTVEYLKG-KD 725
Query: 1106 QAIADFLVDHTLPKEIVTYVGIQPWKLFFDGSSHKNGTGIGMFIVSPGGIPTKFKFRIKK 1165
+AD L T+ KE+ K+ TG + + T+F+ R +K
Sbjct: 726 NHVADALSRITI-KEL------------------KDITGNIL------KVTTRFQSR-QK 759
Query: 1166 NCSNNEAEYEALISGLEILIAFGAKNVVIKGDSELVIKQLTKEYKCVSENLARYYTKANN 1225
+C+ E + + EI V+ + V+ + C+ +
Sbjct: 760 SCAGKE-QLDLQKQTKEIASEPNVYEVITNDEVRKVVTLQLNDSICL-------FKHGKK 811
Query: 1226 LLAKFDEARLSHVSRVDNQEANELAQIASGYMVDKCRLKELIEVKEKLNLSDLNILVIDN 1285
++A++D L +D + + ++ +G + ++ ++K
Sbjct: 812 IIARYDVGDLYTNGILDLDQFLQRLELQAG-------IYDISQIK--------------- 849
Query: 1286 MAPNDWRKPIVDYLQNPVGTTDRKTKYRAMSYVIMGNELFKKNVDGTLLKCL----SEDD 1341
MAP W+K I +++ + D+ + MGN++ KN+ LL + +E +
Sbjct: 850 MAP--WKK-IFEHV-----SIDK--------FKNMGNKIL-KNLKVALLNPVTQINNEKE 892
Query: 1342 AFIAISAVHDGLCGAHQAGIKMKWILFRQGMYWPTIMKDCMEYDKGCQDFQRHAGIQHVP 1401
+S +HD GI ++ YW + K EY + CQ Q+ +H
Sbjct: 893 KEAILSTLHDDPIQGGHTGITKTLAKVKRHYYWKNMSKYIKEYVRKCQKCQKAKTTKHTK 952
Query: 1402 ASELHSIIKPWPFRGWALDLIGEINPCSSRQHKYIIVAIDYFTKWVEAIPLQNVTQDTVI 1461
+ F +D IG + P S ++Y + I TK++ AIP+ N + TV
Sbjct: 953 TPMTITETPEHAFDRVVVDTIGPL-PKSENGNEYAVTLICDLTKYLVAIPIANKSAKTVA 1011
Query: 1462 DFIQNHIVYRFGLPESLTTDQGTVFVGQKVASFAESWGIKLLNSTPYYAQANGQVEAANK 1521
I + ++G ++ TD GT + + + IK + ST ++ Q G VE +++
Sbjct: 1012 KAIFESFILKYGPMKTFITDMGTEYKNSIITDLCKYLKIKNITSTAHHHQTVGVVERSHR 1071
Query: 1522 TLISLIKKHVGRKPKRWHQTLGQVLWAYRNSPKEATGATPFRLAYGQEAVLPAEVYLQSC 1581
TL I+ ++ W L ++ + + P+ L +G+ + LP
Sbjct: 1072 TLNEYIRSYISTDKTDWDVWLQYFVYCFNTTQSMVHNYCPYELVFGRTSNLPKHFN---- 1127
Query: 1582 RIQRQEEIPSEDYWNMMLDELVNLDEERLSALDILTRQKDRVAKAYNKKVRAKSFMPGDY 1641
++ E I + D + + L+ A +L K++ + Y+ KV+ GD
Sbjct: 1128 KLHSIEPIYNID--DYAKESKYRLEVAYARARKLLEAHKEKNKENYDLKVKDIELEVGD- 1184
Query: 1642 VWKVVLPVDKRDKRYGKWAPNWEGPFTVEKILLNNAYSIKELGGRNRQMTVNGKYLKTY 1700
KV+L R++ K + GP+ +E I NN +I L +N++ V+ LK +
Sbjct: 1185 --KVLL----RNEVGHKLDFKYTGPYKIESIGDNN--NITLLTNKNKKQIVHKDRLKKF 1235
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 225 bits (573), Expect = 9e-58
Identities = 144/461 (31%), Positives = 246/461 (53%), Gaps = 14/461 (3%)
Query: 658 LLKEFKDCFAWDYDEMPGLSRDLVELKLPIKEDKKPVKQLPRRFHPDVLVKIKEEIERLL 717
++++F+D FA DE+ S E + +KE +P++Q PR + +I++ I+++L
Sbjct: 909 VIEQFQDVFAISDDELGRNSG--TECVIELKEGAEPIRQKPRPIPLALKPEIRKMIQKML 966
Query: 718 KCKFIRTARYVDWLANVVPVIKKNGKMRVCIDFRDLNAATPKDEYHMPVAEMMVDSTAGH 777
K IR ++ W + VV V KK+G +R+CID+R +N + + +P E + S AG
Sbjct: 967 NQKVIRESKS-PWSSPVVLVKKKDGSIRMCIDYRKVNKVVKNNAHPLPNIEATLQSLAGK 1025
Query: 778 EYLSLLDGYSGYNQIFIAEEDVSKTAFRCPGALGTYEWVVMPFGLKNAGATYQRVMNTIF 837
+ ++ D +G+ QI + E+ TAF L +EW V+PFGL + A +Q M I
Sbjct: 1026 KLYTVFDMIAGFWQIPLDEKSKEITAFAIGSEL--FEWNVLPFGLVISPALFQGTMEEII 1083
Query: 838 HDFIETFMQVYIDDIVVKSPSRDDHLAHLRKSFERMRKYGLKMNPLKCAFGVIAGDFLGF 897
D + VY+DD+++ S + HL ++++ R+RK G+K+ KC ++LG
Sbjct: 1084 GDLLGVCAFVYVDDLLIASKDMEQHLQDVKEALTRIRKSGMKLRASKCHIAKKEVEYLGH 1143
Query: 898 VVHKKGIEINKNKAKAILDTSPPTSKKQLQSLLGKINFLRRFIANLSEKTKSFSPLLRLK 957
V G+E + K + S PT+ K+LQS LG + + R+FI N ++ S + L+ K
Sbjct: 1144 KVTLDGVETQEVKTDKMKQFSRPTNVKELQSFLGLVGYYRKFILNFAQIASSLTSLISAK 1203
Query: 958 KEDAFRWEAEHQKAFDELKVYLSSPHVMAPP-----IRG-KPMKLYISATDGTIGSMLAQ 1011
A+ WE E + AF ELK + V+A P ++G +P +Y A+ IG++LAQ
Sbjct: 1204 V--AWIWEKEQEIAFQELKKLVCQTPVLAQPDVEAALKGDRPFMIYTDASRKGIGAVLAQ 1261
Query: 1012 EDEDSKERAIFYLSRVLNDAETRYTMIEKLCLCLYFSCVKLKYYIKPIDVMVFSHYDIIK 1071
E D ++ I + S+ L+ AETRY + + L + F+ + K I + VF+ + +
Sbjct: 1262 EGPDGQQHPIAFASKALSPAETRYHITDLEALAMMFALRRFKTIIYGTAITVFTDHKPLI 1321
Query: 1072 HMLSKPILHSRIGKWALALTEYSLTYAPLKAVKGQAIADFL 1112
+L L R+ +W++ + E+ + L A K A+AD L
Sbjct: 1322 SLLKGSPLADRLWRWSIEILEFDVKIVYL-AGKANAVADAL 1361
Score = 116 bits (290), Expect = 6e-25
Identities = 98/400 (24%), Positives = 179/400 (44%), Gaps = 21/400 (5%)
Query: 1317 YVIMGNELFKKNVDGTLLKCLSEDDAFIAISAVHDGLCGAHQAGIKMKWILFRQGMYWPT 1376
Y I+G L ++ + E + +H+G+ H GIK W + + YWP
Sbjct: 1440 YKIVGGVLKNTEIEEQSRSVVPEKIRTPLLKELHEGMLAGH-FGIKKMWRMVHRKFYWPQ 1498
Query: 1377 I---MKDCMEYDKGCQDFQRHAGIQHVPASELHSIIKPWPFRGWALDLIGEINPCSSRQH 1433
+ +++C+ C H+ + S L +P A DL+ S + +
Sbjct: 1499 MRVCVENCVRTCAKCLCANDHSKL----TSSLTPYRMTFPLEIVACDLMDV--GLSVQGN 1552
Query: 1434 KYIIVAIDYFTKWVEAIPLQNVTQDTVID-FIQNHIVYRFGLPESLTTDQGTVFVGQKVA 1492
+YI+ ID FTK+ A+P+ + +TV+ F++ + +P L TDQG FV A
Sbjct: 1553 RYILTIIDLFTKYGTAVPIPDKKAETVLKAFVERWAIGEGRIPLKLLTDQGKEFVNGLFA 1612
Query: 1493 SFAESWGIKLLNSTPYYAQANGQVEAANKTLISLIKKHVGRKPKRWHQTLGQVLWAYRNS 1552
F I+ + + Y ++ANG VE NKT++ ++KK P W + ++AY N
Sbjct: 1613 QFTHMLKIEHITTKGYNSRANGAVERFNKTIMHIMKKKTA-VPMEWDDQVVYAVYAYNNC 1671
Query: 1553 PKEATGATPFRLAYGQEAVLPAEVYLQSCRIQRQEEIPSEDYWNMMLDELVNLDEERLSA 1612
E TG TP L +G++ + P E+ + ++ ++Y +++ EL+ + + A
Sbjct: 1672 VHENTGETPMFLMHGRDVMGPLEMSGEDAVGINYADM--DEYKHLLTQELLKVQK---IA 1726
Query: 1613 LDILTRQKDRVAKAYNKKVRAKSF---MPGDYVWKVVLPVDKRDKRYGKWAPNWEGPFTV 1669
+ R+++ +++K +K PG V + +P +K + K W GP+ V
Sbjct: 1727 KEHAMREQESYKSLFDQKYASKKHRFPQPGSRV-LLEIPSEKLGAQCPKLVNKWSGPYRV 1785
Query: 1670 EKILLNNAYSIKELGGRNRQMTVNGKYLKTYKPTVHEINI 1709
N+A LG R + + + L+ + +I I
Sbjct: 1786 ISCSENSAEITPVLGKRKHILQIPFENLRVIPEAMPDILI 1825
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 194 bits (493), Expect = 2e-48
Identities = 133/475 (28%), Positives = 246/475 (51%), Gaps = 19/475 (4%)
Query: 654 EMVKLLKEFKDCFA-WDYDEMPGLSRDL-VELKLPIKEDKKPVKQLPRRFHPDVLVKIKE 711
E+ + KEFKD A + +++P + L E++L + + P++ P P + + +
Sbjct: 373 ELPDIYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRNYP--LPPGKMQAMND 430
Query: 712 EIERLLKCKFIRTARYVDWLANVVPVIKKNGKMRVCIDFRDLNAATPKDEYHMPVAEMMV 771
EI + LK IR ++ ++ V+ V KK G +R+ +D++ LN + Y +P+ E ++
Sbjct: 431 EINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLL 489
Query: 772 DSTAGHEYLSLLDGYSGYNQIFIAEEDVSKTAFRCPGALGTYEWVVMPFGLKNAGATYQR 831
G + LD S Y+ I + + D K AFRCP G +E++VMP+G+ A A +Q
Sbjct: 490 AKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPR--GVFEYLVMPYGISTAPAHFQY 547
Query: 832 VMNTIFHDFIETFMQVYIDDIVVKSPSRDDHLAHLRKSFERMRKYGLKMNPLKCAFGVIA 891
+NTI + E+ + Y+DDI++ S S +H+ H++ ++++ L +N KC F
Sbjct: 548 FINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQ 607
Query: 892 GDFLGFVVHKKGIEINKNKAKAILDTSPPTSKKQLQSLLGKINFLRRFIANLSEKTKSFS 951
F+G+ + +KG + +L P ++K+L+ LG +N+LR+FI S+ T +
Sbjct: 608 VKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLN 667
Query: 952 PLLRLKKEDAFRWEAEHQKAFDELKVYLSSPHVMAPPIRGKPMKLYISATDGTIGSMLAQ 1011
L LKK+ ++W +A + +K L SP V+ K + L A+D +G++L+Q
Sbjct: 668 NL--LKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQ 725
Query: 1012 EDEDSKERAIFYLSRVLNDAETRYTMIEKLCLCLYFSCVKLKYY----IKPIDVMVFSHY 1067
+ +D K + Y S ++ A+ Y++ +K L + S ++Y I+P ++ H
Sbjct: 726 KHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILT-DHR 784
Query: 1068 DIIKHML--SKPILHSRIGKWALALTEYS--LTYAPLKAVKGQAIADFLVDHTLP 1118
++I + S+P + R+ +W L L +++ + Y P A +VD T P
Sbjct: 785 NLIGRITNESEP-ENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEP 838
Score = 87.8 bits (216), Expect = 2e-16
Identities = 81/362 (22%), Positives = 147/362 (40%), Gaps = 46/362 (12%)
Query: 1360 GIKMKWILFRQGMYWPTIMKDCMEYDKGCQDFQRHAGIQHVPASELHSII---KPWPFRG 1416
GI++ + + W I K EY + C Q + H P L I +PW
Sbjct: 929 GIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPW--ES 986
Query: 1417 WALDLIGEINPCSSRQHKYIIVAIDYFTKWVEAIPL-QNVTQDTVIDFIQNHIVYRFGLP 1475
++D I + S + + V +D F+K +P +++T + ++ FG P
Sbjct: 987 LSMDFITALPESSG--YNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNP 1044
Query: 1476 ESLTTDQGTVFVGQKVASFAESWGIKLLNSTPYYAQANGQVEAANKTLISLIKKHVGRKP 1535
+ + D +F Q FA + + S PY Q +GQ E N+T+ L++ P
Sbjct: 1045 KEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHP 1104
Query: 1536 KRWHQTLGQVLWAYRNSPKEATGATPFRLAYG-QEAVLPAEVYLQSCRIQRQEEIPSEDY 1594
W + V +Y N+ AT TPF + + A+ P E+ S + + + +
Sbjct: 1105 NTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPSFSDKTDENSQETIQVF 1164
Query: 1595 WNMMLDELVNLDEERLSALDILTRQKDRVAKAYNKKVR-AKSFMPGDYVWKVVLPVDKRD 1653
+ +E L+ +I ++ K ++ K++ + F PGD V + KR
Sbjct: 1165 QTV---------KEHLNTNNI------KMKKYFDMKIQEIEEFQPGDLV------MVKRT 1203
Query: 1654 K-----RYGKWAPNWEGPFTVEKILLNNAYSIKELGGRNRQMTVNGKYLKTYKPTVHEIN 1708
K + K AP++ GPF Y +++ G N ++ + + T H +
Sbjct: 1204 KTGFLHKSNKLAPSFAGPF----------YVLQKSGPNNYELDLPDSIKHMFSSTFHVSH 1253
Query: 1709 IE 1710
+E
Sbjct: 1254 LE 1255
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 191 bits (486), Expect = 1e-47
Identities = 132/475 (27%), Positives = 246/475 (51%), Gaps = 19/475 (4%)
Query: 654 EMVKLLKEFKDCFA-WDYDEMPGLSRDL-VELKLPIKEDKKPVKQLPRRFHPDVLVKIKE 711
E+ + KEFKD A + +++P + L E++L + + P++ P P + + +
Sbjct: 373 ELPDIYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRNYP--LPPGKMQAMND 430
Query: 712 EIERLLKCKFIRTARYVDWLANVVPVIKKNGKMRVCIDFRDLNAATPKDEYHMPVAEMMV 771
EI + LK IR ++ ++ V+ V KK G +R+ +D++ LN + Y +P+ E ++
Sbjct: 431 EINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLL 489
Query: 772 DSTAGHEYLSLLDGYSGYNQIFIAEEDVSKTAFRCPGALGTYEWVVMPFGLKNAGATYQR 831
G + LD S Y+ I + + D K AFRCP G +E++VMP+G+ A A +Q
Sbjct: 490 AKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPR--GVFEYLVMPYGISIAPAHFQY 547
Query: 832 VMNTIFHDFIETFMQVYIDDIVVKSPSRDDHLAHLRKSFERMRKYGLKMNPLKCAFGVIA 891
+NTI + E+ + Y+D+I++ S S +H+ H++ ++++ L +N KC F
Sbjct: 548 FINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQ 607
Query: 892 GDFLGFVVHKKGIEINKNKAKAILDTSPPTSKKQLQSLLGKINFLRRFIANLSEKTKSFS 951
F+G+ + +KG + +L P ++K+L+ LG +N+LR+FI S+ T +
Sbjct: 608 VKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLN 667
Query: 952 PLLRLKKEDAFRWEAEHQKAFDELKVYLSSPHVMAPPIRGKPMKLYISATDGTIGSMLAQ 1011
L LKK+ ++W +A + +K L SP V+ K + L A+D +G++L+Q
Sbjct: 668 NL--LKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQ 725
Query: 1012 EDEDSKERAIFYLSRVLNDAETRYTMIEKLCLCLYFSCVKLKYY----IKPIDVMVFSHY 1067
+ +D K + Y S ++ A+ Y++ +K L + S ++Y I+P ++ H
Sbjct: 726 KHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILT-DHR 784
Query: 1068 DIIKHML--SKPILHSRIGKWALALTEYS--LTYAPLKAVKGQAIADFLVDHTLP 1118
++I + S+P + R+ +W L L +++ + Y P A +VD T P
Sbjct: 785 NLIGRITNESEP-ENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEP 838
Score = 87.8 bits (216), Expect = 2e-16
Identities = 81/362 (22%), Positives = 147/362 (40%), Gaps = 46/362 (12%)
Query: 1360 GIKMKWILFRQGMYWPTIMKDCMEYDKGCQDFQRHAGIQHVPASELHSII---KPWPFRG 1416
GI++ + + W I K EY + C Q + H P L I +PW
Sbjct: 929 GIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPW--ES 986
Query: 1417 WALDLIGEINPCSSRQHKYIIVAIDYFTKWVEAIPL-QNVTQDTVIDFIQNHIVYRFGLP 1475
++D I + S + + V +D F+K +P +++T + ++ FG P
Sbjct: 987 LSMDFITALPESSG--YNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNP 1044
Query: 1476 ESLTTDQGTVFVGQKVASFAESWGIKLLNSTPYYAQANGQVEAANKTLISLIKKHVGRKP 1535
+ + D +F Q FA + + S PY Q +GQ E N+T+ L++ P
Sbjct: 1045 KEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHP 1104
Query: 1536 KRWHQTLGQVLWAYRNSPKEATGATPFRLAYG-QEAVLPAEVYLQSCRIQRQEEIPSEDY 1594
W + V +Y N+ AT TPF + + A+ P E+ S + + + +
Sbjct: 1105 NTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPSFSDKTDENSQETIQVF 1164
Query: 1595 WNMMLDELVNLDEERLSALDILTRQKDRVAKAYNKKVR-AKSFMPGDYVWKVVLPVDKRD 1653
+ +E L+ +I ++ K ++ K++ + F PGD V + KR
Sbjct: 1165 QTV---------KEHLNTNNI------KMKKYFDMKIQEIEEFQPGDLV------MVKRT 1203
Query: 1654 K-----RYGKWAPNWEGPFTVEKILLNNAYSIKELGGRNRQMTVNGKYLKTYKPTVHEIN 1708
K + K AP++ GPF Y +++ G N ++ + + T H +
Sbjct: 1204 KTGFLHKSNKLAPSFAGPF----------YVLQKSGPNNYELDLPDSIKHMFSSTFHVSH 1253
Query: 1709 IE 1710
+E
Sbjct: 1254 LE 1255
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 191 bits (486), Expect = 1e-47
Identities = 132/475 (27%), Positives = 246/475 (51%), Gaps = 19/475 (4%)
Query: 654 EMVKLLKEFKDCFA-WDYDEMPGLSRDL-VELKLPIKEDKKPVKQLPRRFHPDVLVKIKE 711
E+ + KEFKD A + +++P + L E++L + + P++ P P + + +
Sbjct: 373 ELPDIYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRNYP--LPPGKMQAMND 430
Query: 712 EIERLLKCKFIRTARYVDWLANVVPVIKKNGKMRVCIDFRDLNAATPKDEYHMPVAEMMV 771
EI + LK IR ++ ++ V+ V KK G +R+ +D++ LN + Y +P+ E ++
Sbjct: 431 EINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLL 489
Query: 772 DSTAGHEYLSLLDGYSGYNQIFIAEEDVSKTAFRCPGALGTYEWVVMPFGLKNAGATYQR 831
G + LD S Y+ I + + D K AFRCP G +E++VMP+G+ A A +Q
Sbjct: 490 AKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPR--GVFEYLVMPYGISIAPAHFQY 547
Query: 832 VMNTIFHDFIETFMQVYIDDIVVKSPSRDDHLAHLRKSFERMRKYGLKMNPLKCAFGVIA 891
+NTI + E+ + Y+D+I++ S S +H+ H++ ++++ L +N KC F
Sbjct: 548 FINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQ 607
Query: 892 GDFLGFVVHKKGIEINKNKAKAILDTSPPTSKKQLQSLLGKINFLRRFIANLSEKTKSFS 951
F+G+ + +KG + +L P ++K+L+ LG +N+LR+FI S+ T +
Sbjct: 608 VKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLN 667
Query: 952 PLLRLKKEDAFRWEAEHQKAFDELKVYLSSPHVMAPPIRGKPMKLYISATDGTIGSMLAQ 1011
L LKK+ ++W +A + +K L SP V+ K + L A+D +G++L+Q
Sbjct: 668 NL--LKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQ 725
Query: 1012 EDEDSKERAIFYLSRVLNDAETRYTMIEKLCLCLYFSCVKLKYY----IKPIDVMVFSHY 1067
+ +D K + Y S ++ A+ Y++ +K L + S ++Y I+P ++ H
Sbjct: 726 KHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILT-DHR 784
Query: 1068 DIIKHML--SKPILHSRIGKWALALTEYS--LTYAPLKAVKGQAIADFLVDHTLP 1118
++I + S+P + R+ +W L L +++ + Y P A +VD T P
Sbjct: 785 NLIGRITNESEP-ENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEP 838
Score = 87.8 bits (216), Expect = 2e-16
Identities = 81/362 (22%), Positives = 147/362 (40%), Gaps = 46/362 (12%)
Query: 1360 GIKMKWILFRQGMYWPTIMKDCMEYDKGCQDFQRHAGIQHVPASELHSII---KPWPFRG 1416
GI++ + + W I K EY + C Q + H P L I +PW
Sbjct: 929 GIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPW--ES 986
Query: 1417 WALDLIGEINPCSSRQHKYIIVAIDYFTKWVEAIPL-QNVTQDTVIDFIQNHIVYRFGLP 1475
++D I + S + + V +D F+K +P +++T + ++ FG P
Sbjct: 987 LSMDFITALPESSG--YNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNP 1044
Query: 1476 ESLTTDQGTVFVGQKVASFAESWGIKLLNSTPYYAQANGQVEAANKTLISLIKKHVGRKP 1535
+ + D +F Q FA + + S PY Q +GQ E N+T+ L++ P
Sbjct: 1045 KEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHP 1104
Query: 1536 KRWHQTLGQVLWAYRNSPKEATGATPFRLAYG-QEAVLPAEVYLQSCRIQRQEEIPSEDY 1594
W + V +Y N+ AT TPF + + A+ P E+ S + + + +
Sbjct: 1105 NTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPSFSDKTDENSQETIQVF 1164
Query: 1595 WNMMLDELVNLDEERLSALDILTRQKDRVAKAYNKKVR-AKSFMPGDYVWKVVLPVDKRD 1653
+ +E L+ +I ++ K ++ K++ + F PGD V + KR
Sbjct: 1165 QTV---------KEHLNTNNI------KMKKYFDMKIQEIEEFQPGDLV------MVKRT 1203
Query: 1654 K-----RYGKWAPNWEGPFTVEKILLNNAYSIKELGGRNRQMTVNGKYLKTYKPTVHEIN 1708
K + K AP++ GPF Y +++ G N ++ + + T H +
Sbjct: 1204 KTGFLHKSNKLAPSFAGPF----------YVLQKSGPNNYELDLPDSIKHMFSSTFHVSH 1253
Query: 1709 IE 1710
+E
Sbjct: 1254 LE 1255
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 191 bits (484), Expect = 2e-47
Identities = 148/489 (30%), Positives = 242/489 (49%), Gaps = 34/489 (6%)
Query: 643 EEKDATTEWQNEMVKLLKEFKDCFAWDYDEMPGLSRDLVE--LKLPIKEDKK-PVKQLPR 699
E D T E N LL EF F P LS VE +K I+ + + P+
Sbjct: 79 EHPDGTQEILNS---LLGEFPRIFE------PPLSGMSVETAVKAEIRTNTQDPIYAKSY 129
Query: 700 RFHPDVLVKIKEEIERLLKCKFIRTARYVD----WLANVVPVIKKNGKMRVCIDFRDLNA 755
+ ++ +++ +I+ LL+ IR + W+ P + R+ +DF+ LN
Sbjct: 130 PYPVNMRGEVERQIDELLQDGIIRPSNSPYNSPIWIVPKKPKPNGEKQYRMVVDFKRLNT 189
Query: 756 ATPKDEYHMPVAEMMVDSTAGHEYLSLLDGYSGYNQIFIAEEDVSKTAFRCPGALGTYEW 815
T D Y +P + S +Y + LD SG++QI + E D+ KTAF G YE+
Sbjct: 190 VTIPDTYPIPDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLN--GKYEF 247
Query: 816 VVMPFGLKNAGATYQRVMNTIFHDFIETFMQVYIDDIVVKSPSRDDHLAHLRKSFERMRK 875
+ +PFGLKNA A +QR+++ I + I VYIDDI+V S D H +LR + K
Sbjct: 248 LRLPFGLKNAPAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSK 307
Query: 876 YGLKMNPLKCAFGVIAGDFLGFVVHKKGIEINKNKAKAILDTSPPTSKKQLQSLLGKINF 935
L++N K F +FLG++V GI+ + K +AI + PPTS K+L+ LG ++
Sbjct: 308 ANLQVNLEKSHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSY 367
Query: 936 LRRFIANLSEKTKSFSPLLR-----LKKEDAFR----WEAEHQKAFDELKVYLSSPHVMA 986
R+FI + ++ K + L R +K + + + ++F++LK L S ++A
Sbjct: 368 YRKFIQDYAKVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILA 427
Query: 987 PPIRGKPMKLYISATDGTIGSMLAQEDEDSKERAIFYLSRVLNDAETRYTMIEKLCLCLY 1046
P KP L A++ IG++L+Q+D+ ++R I Y+SR LN E Y IEK L +
Sbjct: 428 FPCFTKPFHLTTDASNWAIGAVLSQDDQ-GRDRPIAYISRSLNKTEENYATIEKEMLAII 486
Query: 1047 FSCVKLKYYIKPI-DVMVFSHYDIIKHMLSKPILHSRIGKWALALTEYS--LTYAPLKAV 1103
+S L+ Y+ + V++ + + L ++++ +W + EY+ L Y P
Sbjct: 487 WSLDNLRAYLYGAGTIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELIYKP---G 543
Query: 1104 KGQAIADFL 1112
K +AD L
Sbjct: 544 KSNVVADAL 552
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 190 bits (483), Expect = 3e-47
Identities = 171/672 (25%), Positives = 306/672 (45%), Gaps = 71/672 (10%)
Query: 450 KPLFVWAKVDEKGVNKVLVDGGATINLMPKFMLKKLGKTEADLIPHDMVLSDYEGKTGSS 509
KP ++ K E + K L+D G+T+N+ K + D+ + + +S
Sbjct: 12 KPQYITIKYKENNL-KCLIDTGSTVNMTSKNIF-------------DLPIQNTSTFIHTS 57
Query: 510 LGAIMLNITVGTVAR-----STLFIVVPSKANYNLLLGREWIHGVGAVPSTLHQRISIWK 564
G +++N ++ ++ + F++ P NY+LLLGR+ + A S Q ++++
Sbjct: 58 NGPLIVNKSIIIPSKILFPTTNEFLLHPFSENYDLLLGRKLLAEAKATISYRDQEVTLY- 116
Query: 565 LDGVVENVQADQSYYLAETGYVGKKNFEKSLATIAPLDTVANQYFNPYSEYSVMLDPIRG 624
+ Y L E +++ +++ I DT+ Q ++ S +L+
Sbjct: 117 ----------NNKYKLIEGIATHEQSHFQNVNMIP--DTMLRQP----NKISPILE---- 156
Query: 625 LNLNEAYKPDAMSGWHNDEEKDATTEWQNEMVKLLKEFKDCFAWDYDEMPGLSRDLVELK 684
++ Y+ + + N+EEK + LL+++ D + D++ ++ +
Sbjct: 157 ---SDLYRLEHL----NNEEKQ-------RLCALLQKYHDIQYHEGDKLTFTNQTKHTIN 202
Query: 685 LPIKEDKKPVKQLPRRFHPDVLVKIKEEIERLLKCKFIRTARYVD----WLANVVPVIKK 740
P+ + +V + +I+ +L IRT+ W+
Sbjct: 203 TKHNLPLYSKYSYPQAYEQEV----ESQIQDMLNQGIIRTSNSPYNSPIWVVPKKQDASG 258
Query: 741 NGKMRVCIDFRDLNAATPKDEYHMPVAEMMVDSTAGHEYLSLLDGYSGYNQIFIAEEDVS 800
K R+ ID+R LN T D + +P + ++ Y + +D G++QI + E VS
Sbjct: 259 KQKFRIVIDYRKLNEITVGDRHPIPNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVS 318
Query: 801 KTAFRCPGALGTYEWVVMPFGLKNAGATYQRVMNTIFHDFIETFMQVYIDDIVVKSPSRD 860
KTAF G YE++ MPFGLKNA AT+QR MN I + VY+DDI+V S S D
Sbjct: 319 KTAFSTKH--GHYEYLRMPFGLKNAPATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLD 376
Query: 861 DHLAHLRKSFERMRKYGLKMNPLKCAFGVIAGDFLGFVVHKKGIEINKNKAKAILDTSPP 920
+HL L FE++ K LK+ KC F FLG V+ GI+ N K +AI P
Sbjct: 377 EHLQSLGLVFEKLAKANLKLQLDKCEFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIP 436
Query: 921 TSKKQLQSLLGKINFLRRFIANLSEKTKSFSPLLRLKKEDAFRWEAEHQKAFDELKVYLS 980
T K++++ LG + R+FI N ++ K + L+ K E+ AF +LK +S
Sbjct: 437 TKPKEIKAFLGLTGYYRKFIPNFADIAKPMTKCLK-KNMKIDTTNPEYDSAFKKLKYLIS 495
Query: 981 SPHVMAPPIRGKPMKLYISATDGTIGSMLAQEDEDSKERAIFYLSRVLNDAETRYTMIEK 1040
++ P K L A+D +G++L+Q+ + Y+SR LN+ E Y+ IEK
Sbjct: 496 EDPILKVPDFTKKFTLTTDASDVALGAVLSQDG-----HPLSYISRTLNEHEINYSTIEK 550
Query: 1041 LCLCLYFSCVKLKYYIKPIDVMVFSHYDIIKHMLSKPILHSRIGKWALALTEYSLTYAPL 1100
L + ++ ++Y+ + S + + + +S++ +W + L+E+ +
Sbjct: 551 ELLAIVWATKTFRHYLLGRHFEISSDHQPLSWLYRMKDPNSKLTRWRVKLSEFDFDIKYI 610
Query: 1101 KAVKGQAIADFL 1112
K K +AD L
Sbjct: 611 KG-KENCVADAL 621
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 185 bits (470), Expect = 8e-46
Identities = 122/412 (29%), Positives = 211/412 (50%), Gaps = 17/412 (4%)
Query: 707 VKIKEEIERLLKCKFIRTARYV----DWLANVVPVIKKNGKMRVCIDFRDLNAATPKDEY 762
++++ +++ +L IR + W+ P K RV ID+R LN T D Y
Sbjct: 220 IEVENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVVIDYRKLNEITIPDRY 279
Query: 763 HMPVAEMMVDSTAGHEYLSLLDGYSGYNQIFIAEEDVSKTAFRCPGALGTYEWVVMPFGL 822
+P + ++ +Y + +D G++QI + EE +SKTAF G YE++ MPFGL
Sbjct: 280 PIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTKS--GHYEYLRMPFGL 337
Query: 823 KNAGATYQRVMNTIFHDFIETFMQVYIDDIVVKSPSRDDHLAHLRKSFERMRKYGLKMNP 882
+NA AT+QR MN I + VY+DDI++ S S +HL ++ F ++ LK+
Sbjct: 338 RNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLVFTKLADANLKLQL 397
Query: 883 LKCAFGVIAGDFLGFVVHKKGIEINKNKAKAILDTSPPTSKKQLQSLLGKINFLRRFIAN 942
KC F +FLG +V GI+ N K KAI+ PT K++++ LG + R+FI N
Sbjct: 398 DKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAFLGLTGYYRKFIPN 457
Query: 943 LSEKTKSFSPLLRLKKEDAFRWEAEHQKAFDELKVYLSSPHVMAPPIRGKPMKLYISATD 1002
++ K + L+ K+ + E+ +AF++LK + ++ P K L A++
Sbjct: 458 YADIAKPMTSCLK-KRTKIDTQKLEYIEAFEKLKALIIRDPILQLPDFEKKFVLTTDASN 516
Query: 1003 GTIGSMLAQEDEDSKERAIFYLSRVLNDAETRYTMIEKLCLCLYFSCVKLKYYIKPIDVM 1062
+G++L+Q I ++SR LND E Y+ IEK L + ++ ++Y+ +
Sbjct: 517 LALGAVLSQNG-----HPISFISRTLNDHELNYSAIEKELLAIVWATKTFRHYLLGRQFL 571
Query: 1063 VFSHYDIIK--HMLSKPILHSRIGKWALALTEYSLTYAPLKAVKGQAIADFL 1112
+ S + ++ H L +P +++ +W + L+EY +K K ++AD L
Sbjct: 572 IASDHQPLRWLHNLKEP--GAKLERWRVRLSEYQFKIDYIKG-KENSVADAL 620
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 155 bits (391), Expect = 1e-36
Identities = 109/403 (27%), Positives = 202/403 (50%), Gaps = 25/403 (6%)
Query: 709 IKEEIERLLKCKFIRTARYVDWLANVVPVIKK------NGKMRVCIDFRDLNAATPKDEY 762
+ E+++LLK IR +R + + V KK N R+ IDFR LN T D Y
Sbjct: 197 VNNEVKQLLKDGIIRPSRS-PYNSPTWVVDKKGTDAFGNPNKRLVIDFRKLNEKTIPDRY 255
Query: 763 HMPVAEMMVDSTAGHEYLSLLDGYSGYNQIFIAEEDVSKTAFRCPGALGTYEWVVMPFGL 822
MP M++ + ++ + LD SGY+QI++AE D KT+F G G YE+ +PFGL
Sbjct: 256 PMPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFSVNG--GKYEFCRLPFGL 313
Query: 823 KNAGATYQRVMNTIFHDFIETFMQVYIDDIVVKSPSRDDHLAHLRKSFERMRKYGLKMNP 882
+NA + +QR ++ + + I VY+DD+++ S + DH+ H+ + + ++++
Sbjct: 314 RNASSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHIDTVLKCLIDANMRVSQ 373
Query: 883 LKCAFGVIAGDFLGFVVHKKGIEINKNKAKAILDTSPPTSKKQLQSLLGKINFLRRFIAN 942
K F + ++LGF+V K G + + K KAI + P +++S LG ++ R FI +
Sbjct: 374 EKTRFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRSFLGLASYYRVFIKD 433
Query: 943 LSEKTKSFSPLLR---------LKKEDAFRWEAEHQKAFDELKVYLSSPHVMAP-PIRGK 992
+ + + +L+ + K+ + + AF L+ L+S V+ P K
Sbjct: 434 FAAIARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNILASEDVILKYPDFKK 493
Query: 993 PMKLYISATDGTIGSMLAQEDEDSKERAIFYLSRVLNDAETRYTMIEKLCLCLYFSCVKL 1052
P L A+ IG++L+QE R I +SR L E Y E+ L + ++ KL
Sbjct: 494 PFDLTTDASASGIGAVLSQEG-----RPITMISRTLKQPEQNYATNERELLAIVWALGKL 548
Query: 1053 KYYI-KPIDVMVFSHYDIIKHMLSKPILHSRIGKWALALTEYS 1094
+ ++ ++ +F+ + + ++ +++I +W + +++
Sbjct: 549 QNFLYGSREINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQHN 591
Score = 40.0 bits (92), Expect = 0.055
Identities = 28/102 (27%), Positives = 49/102 (47%), Gaps = 6/102 (5%)
Query: 1430 SRQHKYIIVAIDYFTKWVEAIPLQNVTQDTVIDFIQN--HIVYRFGLPESLTTDQGTVFV 1487
S K + ID F+K+ P V T++D I+ F +++ D F
Sbjct: 816 STDRKLFLTCIDKFSKYAIVQP---VVSRTIVDITAPLLQIINLFPNIKTVYCDNEPAFN 872
Query: 1488 GQKVASFAE-SWGIKLLNSTPYYAQANGQVEAANKTLISLIK 1528
+ V S + S+GI ++N+ P ++ +NGQVE + TL + +
Sbjct: 873 SETVTSMLKNSFGIDIVNAPPLHSSSNGQVERFHSTLAEIAR 914
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 146 bits (368), Expect = 5e-34
Identities = 158/698 (22%), Positives = 297/698 (41%), Gaps = 60/698 (8%)
Query: 439 QRPTEQMKSHLKP--LFVWAKVDEKGVNKV----LVDGGATINLMPKFMLKKLGKTEADL 492
Q TEQ+ + P +++ ++ KG K+ VD GA++ + KF++ + A+
Sbjct: 9 QTQTEQVMNVTNPNSIYIKGRLYFKGYKKIELHCFVDTGASLCIASKFVIPEEHWVNAER 68
Query: 493 IPHDMVLSDYEGKTGSSLGAIMLNITVGTVARSTLFIVVPSKANYNLLLGREWIHGVGAV 552
P + ++D T S + + I G + R + V ++ + ++G +
Sbjct: 69 -PIMVKIADGSSITISKVCKDIDLIIAGEIFR--IPTVYQQESGIDFIIGNNFCQLYEPF 125
Query: 553 PSTLHQRISIWKLDGVVENVQADQSYYLAETGYVGKKNFEKSLATIAPLDTVANQYFNPY 612
+ I V + ++ + G++ P++ N+ NP
Sbjct: 126 IQFTDRVIFTKNKSYPVHIAKLTRAVRVGTEGFLESMKKRSKTQQPEPVNISTNKIENPL 185
Query: 613 SEYSVMLDPIRGLNLNEAYKPDAMSGWHNDEEKDATTEWQNEMVKLLKEFKDCFAWDYDE 672
E +++ + G EEK T+ + + ++ L E K C D
Sbjct: 186 EEIAILSE-----------------GRRLSEEKLFITQQRMQKIEELLE-KVCSENPLD- 226
Query: 673 MPGLSRDLVELKLPIKEDKKPVKQLPRRFHPDVLVKIKEEIERLLKCKFIRTARYVDWLA 732
P ++ ++ + + + K +K P ++ P + ++I+ LL K I+ ++
Sbjct: 227 -PNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKS----P 281
Query: 733 NVVPVI-------KKNGKMRVCIDFRDLNAATPKDEYHMPVAEMMVDSTAGHEYLSLLDG 785
++ P K+ GK R+ ++++ +N AT D Y++P + ++ G + S D
Sbjct: 282 HMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDC 341
Query: 786 YSGYNQIFIAEEDVSKTAFRCPGALGTYEWVVMPFGLKNAGATYQRVMNTIFHDFIETFM 845
SG+ Q+ + +E TAF CP G YEW V+PFGLK A + +QR M+ F F F
Sbjct: 342 KSGFWQVLLDQESRPLTAFTCP--QGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFC 398
Query: 846 QVYIDDIVVKSPSRDDHLAHLRKSFERMRKYGLKMNPLKCAFGVIAGDFLGFVVHKKGIE 905
VY+DDI+V S + +DHL H+ ++ ++G+ ++ K +FLG + +G
Sbjct: 399 CVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEI-DEGTH 457
Query: 906 INKNKAKAILDTSPPT--SKKQLQSLLGKINFLRRFIANLSEKTKSFSPLLRLKKEDAFR 963
+ ++ P T KKQLQ LG + + +I L++ K +LK+ +R
Sbjct: 458 KPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQ--AKLKENVPWR 515
Query: 964 WEAEHQKAFDELKVYLSSPHVMAPPIRGKPMKLYISATDGTIGSMLAQ---EDEDSKERA 1020
W E ++K L + P+ + + + A+D G ML + + E
Sbjct: 516 WTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELI 575
Query: 1021 IFYLSRVLNDAETRYTMIEKLCLCLYFSCVKLKYYIKPIDVMV---FSHYDIIKHMLSKP 1077
Y S AE Y +K L + + K Y+ P+ ++ +H+ ++ K
Sbjct: 576 CRYASGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKG 635
Query: 1078 ILHSRIG---KWALALTEYSLTYAPLKAVKGQAIADFL 1112
S++G +W L+ YS +K ADFL
Sbjct: 636 --DSKLGRNIRWQAWLSHYSFDVEHIKGTDNH-FADFL 670
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 680
Score = 143 bits (360), Expect = 5e-33
Identities = 157/698 (22%), Positives = 296/698 (41%), Gaps = 60/698 (8%)
Query: 439 QRPTEQMKSHLKP--LFVWAKVDEKGVNKV----LVDGGATINLMPKFMLKKLGKTEADL 492
Q TEQ+ + P +++ ++ KG K+ VD GA++ + KF++ + A+
Sbjct: 10 QTQTEQVMNVTNPNSIYIKGRLYFKGYKKIELHCFVDTGASLCIASKFVIPEEHWVNAER 69
Query: 493 IPHDMVLSDYEGKTGSSLGAIMLNITVGTVARSTLFIVVPSKANYNLLLGREWIHGVGAV 552
P + ++D T S + + I VG + + + V ++ + ++G +
Sbjct: 70 -PIMVKIADGSSITISKVCKDIDLIIVGVIFK--IPTVYQQESGIDFIIGNNFCQLYEPF 126
Query: 553 PSTLHQRISIWKLDGVVENVQADQSYYLAETGYVGKKNFEKSLATIAPLDTVANQYFNPY 612
+ I V + ++ + G++ P++ N+ NP
Sbjct: 127 IQFTDRVIFTKNKSYPVHIAKLTRAVRVGTEGFLESMKKRSKTQQPEPVNISTNKIENPL 186
Query: 613 SEYSVMLDPIRGLNLNEAYKPDAMSGWHNDEEKDATTEWQNEMVKLLKEFKDCFAWDYDE 672
E +++ + G EEK T+ + + + L E K C D
Sbjct: 187 EEIAILSE-----------------GRRLSEEKLFITQQRMQKTEELLE-KVCSENPLD- 227
Query: 673 MPGLSRDLVELKLPIKEDKKPVKQLPRRFHPDVLVKIKEEIERLLKCKFIRTARYVDWLA 732
P ++ ++ + + + K +K P ++ P + ++I+ LL K I+ ++
Sbjct: 228 -PNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKS----P 282
Query: 733 NVVPVIKKN-------GKMRVCIDFRDLNAATPKDEYHMPVAEMMVDSTAGHEYLSLLDG 785
++ P N G R+ ++++ +N AT D Y++P + ++ G + S D
Sbjct: 283 HMAPAFLVNNEAENGRGNKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDC 342
Query: 786 YSGYNQIFIAEEDVSKTAFRCPGALGTYEWVVMPFGLKNAGATYQRVMNTIFHDFIETFM 845
SG+ Q+ + +E TAF CP G YEW V+PFGLK A + +QR M+ F F F
Sbjct: 343 KSGFWQVLLDQESRPLTAFTCP--QGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFC 399
Query: 846 QVYIDDIVVKSPSRDDHLAHLRKSFERMRKYGLKMNPLKCAFGVIAGDFLGFVVHKKGIE 905
VY+DDIVV S + +DHL H+ ++ ++G+ ++ K +FLG + +G
Sbjct: 400 CVYVDDIVVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEI-DEGTH 458
Query: 906 INKNKAKAILDTSPPT--SKKQLQSLLGKINFLRRFIANLSEKTKSFSPLLRLKKEDAFR 963
+ ++ P T KKQLQ LG + + +I NL++ + +LK+ ++
Sbjct: 459 KPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPNLAQMRQPLQ--AKLKENVPWK 516
Query: 964 WEAEHQKAFDELKVYLSSPHVMAPPIRGKPMKLYISATDGTIGSMLAQ---EDEDSKERA 1020
W E ++K L + P+ + + + A+D G ML + + E
Sbjct: 517 WTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELI 576
Query: 1021 IFYLSRVLNDAETRYTMIEKLCLCLYFSCVKLKYYIKPIDVMV---FSHYDIIKHMLSKP 1077
Y S AE Y +K L + + K Y+ P+ ++ +H+ ++ K
Sbjct: 577 CRYRSGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKG 636
Query: 1078 ILHSRIG---KWALALTEYSLTYAPLKAVKGQAIADFL 1112
S++G +W L+ YS +K ADFL
Sbjct: 637 --DSKLGRNIRWQAWLSHYSFDVEHIKGTDNH-FADFL 671
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 143 bits (360), Expect = 5e-33
Identities = 156/701 (22%), Positives = 297/701 (42%), Gaps = 66/701 (9%)
Query: 439 QRPTEQMKSHLKP--LFVWAKVDEKGVNKV----LVDGGATINLMPKFMLKKLGKTEADL 492
Q TEQ+ + P +++ ++ KG K+ VD GA++ + KF++ + A+
Sbjct: 9 QTQTEQVMNVTNPNSIYIKGRLYFKGYKKIELHCFVDTGASLCIASKFVIPEEHWVNAER 68
Query: 493 IPHDMVLSDYEGKTGSSLGAIMLNITVGTVARSTLF---IVVPSKANYNLLLGREWIHGV 549
P + ++D GSS+ + + + +F V ++ + ++G +
Sbjct: 69 -PIMVKIAD-----GSSITISKVCKDIDLIIAREIFKIPTVYQQESGIDFIIGNNFCQLY 122
Query: 550 GAVPSTLHQRISIWKLDGVVENVQADQSYYLAETGYVGKKNFEKSLATIAPLDTVANQYF 609
+ I V + ++ + G++ P++ N+
Sbjct: 123 EPFIQFTDRVIFTKNKSYPVHIAKLTRAVRVGTEGFLESMKKRSKTQQPEPVNISTNKIE 182
Query: 610 NPYSEYSVMLDPIRGLNLNEAYKPDAMSGWHNDEEKDATTEWQNEMVKLLKEFKDCFAWD 669
NP E +++ + G EEK T+ + + ++ L E K C
Sbjct: 183 NPLKEIAILSE-----------------GRRLSEEKLFITQQRMQKIEELLE-KVCSENP 224
Query: 670 YDEMPGLSRDLVELKLPIKEDKKPVKQLPRRFHPDVLVKIKEEIERLLKCKFIRTARYVD 729
D P ++ ++ + + + K +K P ++ P + ++I+ LL K I+ ++
Sbjct: 225 LD--PNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKS-- 280
Query: 730 WLANVVPVI-------KKNGKMRVCIDFRDLNAATPKDEYHMPVAEMMVDSTAGHEYLSL 782
++ P K+ GK R+ ++++ +N AT D Y++P + ++ G + S
Sbjct: 281 --PHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSS 338
Query: 783 LDGYSGYNQIFIAEEDVSKTAFRCPGALGTYEWVVMPFGLKNAGATYQRVMNTIFHDFIE 842
D SG+ Q+ + +E TAF CP G YEW V+PFGLK A + +QR M+ F F
Sbjct: 339 FDCKSGFWQVLLDQESRPLTAFTCP--QGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-R 395
Query: 843 TFMQVYIDDIVVKSPSRDDHLAHLRKSFERMRKYGLKMNPLKCAFGVIAGDFLGFVVHKK 902
F VY+DDI+V S + +DHL H+ ++ ++G+ ++ K +FLG + +
Sbjct: 396 KFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEI-DE 454
Query: 903 GIEINKNKAKAILDTSPPT--SKKQLQSLLGKINFLRRFIANLSEKTKSFSPLLRLKKED 960
G + ++ P T KKQLQ LG + + +I L++ K +LK+
Sbjct: 455 GTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQ--AKLKENV 512
Query: 961 AFRWEAEHQKAFDELKVYLSSPHVMAPPIRGKPMKLYISATDGTIGSMLAQ---EDEDSK 1017
++W E ++K L + P+ + + + A+D G ML + +
Sbjct: 513 PWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNT 572
Query: 1018 ERAIFYLSRVLNDAETRYTMIEKLCLCLYFSCVKLKYYIKPIDVMV---FSHYDIIKHML 1074
E Y S AE Y +K L + + K Y+ P+ ++ +H+ ++
Sbjct: 573 ELICRYASGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLN 632
Query: 1075 SKPILHSRIG---KWALALTEYSLTYAPLKAVKGQAIADFL 1112
K S++G +W L+ YS +K ADFL
Sbjct: 633 YKG--DSKLGRNIRWQAWLSHYSFDVEHIKGTDNH-FADFL 670
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 142 bits (358), Expect = 8e-33
Identities = 130/531 (24%), Positives = 235/531 (43%), Gaps = 51/531 (9%)
Query: 600 PLDTVANQYFNPYSEYSVMLDPIRGLNLNEAYKPDAMSGWHNDEEKDATTEWQNEMVKLL 659
P++ N+ NP E +++ + G EEK T+ + + ++ L
Sbjct: 173 PVNISTNKIENPLEEIAILSE-----------------GRRLSEEKLFITQQRMQKIEEL 215
Query: 660 KEFKDCFAWDYDEMPGLSRDLVELKLPIKEDKKPVKQLPRRFHPDVLVKIKEEIERLLKC 719
E K C D P ++ ++ + + + K +K P ++ P + ++I+ LL
Sbjct: 216 LE-KVCSENPLD--PNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDL 272
Query: 720 KFIRTARYVDWLANVVPVI-------KKNGKMRVCIDFRDLNAATPKDEYHMPVAEMMVD 772
K I+ ++ ++ P K+ GK R+ ++++ +N AT D Y++P + ++
Sbjct: 273 KVIKPSKS----PHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLT 328
Query: 773 STAGHEYLSLLDGYSGYNQIFIAEEDVSKTAFRCPGALGTYEWVVMPFGLKNAGATYQRV 832
G + S D SG+ Q+ + +E TAF CP G YEW V+PFGLK A + +QR
Sbjct: 329 LIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCP--QGHYEWNVVPFGLKQAPSIFQRH 386
Query: 833 MNTIFHDFIETFMQVYIDDIVVKSPSRDDHLAHLRKSFERMRKYGLKMNPLKCAFGVIAG 892
M+ F F F VY+DDI+V S + +DHL H+ ++ ++G+ ++ K
Sbjct: 387 MDEAFRVF-RKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKI 445
Query: 893 DFLGFVVHKKGIEINKNKAKAILDTSPPT--SKKQLQSLLGKINFLRRFIANLSEKTKSF 950
+FLG + +G + ++ P T KKQLQ LG + + +I L++ K
Sbjct: 446 NFLGLEI-DEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPL 504
Query: 951 SPLLRLKKEDAFRWEAEHQKAFDELKVYLSSPHVMAPPIRGKPMKLYISATDGTIGSMLA 1010
+LK+ ++W E ++K L + P+ + + + A+D G ML
Sbjct: 505 Q--AKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLK 562
Query: 1011 Q---EDEDSKERAIFYLSRVLNDAETRYTMIEKLCLCLYFSCVKLKYYIKPIDVMV---F 1064
+ + E Y S AE Y +K L + + K Y+ P+ ++
Sbjct: 563 AIKINEGTNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDN 622
Query: 1065 SHYDIIKHMLSKPILHSRIG---KWALALTEYSLTYAPLKAVKGQAIADFL 1112
+H+ ++ K S++G +W L+ YS +K ADFL
Sbjct: 623 THFKSFVNLNYKG--DSKLGRNIRWQAWLSHYSFDVEHIKGTDNH-FADFL 670
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 674
Score = 140 bits (352), Expect = 4e-32
Identities = 124/488 (25%), Positives = 221/488 (44%), Gaps = 34/488 (6%)
Query: 643 EEKDATTEWQNEMVKLLKEFKDCFAWDYDEMPGLSRDLVELKLPIKEDKKPVKQLPRRFH 702
EEK T+ + + ++ L E K C D P ++ ++ + + + K +K P ++
Sbjct: 194 EEKLFITQQRMQKIEELLE-KVCSENPLD--PNKTKQWMKASIKLSDPSKAIKVKPMKYS 250
Query: 703 PDVLVKIKEEIERLLKCKFIRTARYVDWLANVVPVI-------KKNGKMRVCIDFRDLNA 755
P + ++I+ LL K I+ ++ ++ P K+ GK R+ ++++ +N
Sbjct: 251 PMDREEFDKQIKELLDLKVIKPSKS----PHMAPAFLVNNEAEKRRGKKRMVVNYKAMNK 306
Query: 756 ATPKDEYHMPVAEMMVDSTAGHEYLSLLDGYSGYNQIFIAEEDVSKTAFRCPGALGTYEW 815
AT D Y+ P + ++ G + S D SG+ Q+ + +E TAF CP G YEW
Sbjct: 307 ATVGDAYNPPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCP--QGHYEW 364
Query: 816 VVMPFGLKNAGATYQRVMNTIFHDFIETFMQVYIDDIVVKSPSRDDHLAHLRKSFERMRK 875
V+PFGLK A + +QR M+ F F F VY+DDI+V S + +DHL H+ ++ +
Sbjct: 365 NVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQ 423
Query: 876 YGLKMNPLKCAFGVIAGDFLGFVVHKKGIEINKNKAKAILDTSPPT--SKKQLQSLLGKI 933
+G+ ++ K +FLG + +G + ++ P T KKQLQ LG +
Sbjct: 424 HGIILSKKKAQLFKKKINFLGLEI-DEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGIL 482
Query: 934 NFLRRFIANLSEKTKSFSPLLRLKKEDAFRWEAEHQKAFDELKVYLSSPHVMAPPIRGKP 993
+ +I L++ K +LK+ ++W E ++K L + P+ +
Sbjct: 483 TYASDYIPKLAQIRKPLQ--AKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEK 540
Query: 994 MKLYISATDGTIGSMLAQ---EDEDSKERAIFYLSRVLNDAETRYTMIEKLCLCLYFSCV 1050
+ + A+D G ML + + E Y S AE Y +K L + +
Sbjct: 541 LIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKNYHSNDKETLAVINTIK 600
Query: 1051 KLKYYIKPIDVMV---FSHYDIIKHMLSKPILHSRIG---KWALALTEYSLTYAPLKAVK 1104
K Y+ P+ ++ +H+ ++ K S++G +W L+ YS +K
Sbjct: 601 KFSIYLTPVHFLIRTDNTHFKSFVNLNYKG--DSKLGRNIRWQAWLSHYSFDVEHIKGTD 658
Query: 1105 GQAIADFL 1112
ADFL
Sbjct: 659 NH-FADFL 665
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 666
Score = 127 bits (319), Expect = 3e-28
Identities = 103/382 (26%), Positives = 169/382 (43%), Gaps = 17/382 (4%)
Query: 739 KKNGKMRVCIDFRDLNAATPKDEYHMPVAEMMVDSTAGHEYLSLLDGYSGYNQIFIAEED 798
++ GK R+ ++++ +N AT D +++P + ++ G S D SG+ Q+ + EE
Sbjct: 288 RRRGKKRMVVNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKSGFWQVVLDEES 347
Query: 799 VSKTAFRCPGALGTYEWVVMPFGLKNAGATYQRVMNTIFHDFIETFMQVYIDDIVVKSPS 858
TAF CP G ++W V+PFGLK A + +QR M T + + F VY+DDI+V S S
Sbjct: 348 QKLTAFTCP--QGHFQWKVVPFGLKQAPSIFQRHMQTALNG-ADKFCMVYVDDIIVFSNS 404
Query: 859 RDDHLAHLRKSFERMRKYGLKMNPLKCAFGVIAGDFLGFVVHKKGIEINKNKAKAILDTS 918
DH H+ + + KYG+ ++ K +FLG + KG +N +
Sbjct: 405 ELDHYNHVYAVLKIVEKYGIILSKKKANLFKEKINFLGLEI-DKGTHCPQNHILENIHKF 463
Query: 919 PP--TSKKQLQSLLGKINFLRRFIANLSEKTKSFSPLLRLKKEDAFRWEAEHQKAFDELK 976
P KK LQ LG + + +I L+E K ++LKK+ + W ++K
Sbjct: 464 PDRLEDKKHLQRFLGVLTYAETYIPKLAEIRKPLQ--VKLKKDVTWNWTQSDSDYVKKIK 521
Query: 977 VYLSSPHVMAPPIRGKPMKLYISATDGTIGSMLAQEDEDSKERAIFYLSRVLNDAETRYT 1036
L S + P + + A+D G +L D E Y S AE Y
Sbjct: 522 KNLGSFPKLYLPKPEDHLIIETDASDSFWGGVLKARALDGVELICRYSSGSFKQAEKNYH 581
Query: 1037 MIEKLCLCLYFSCVKLKYYIKPIDVMV------FSHYDIIKHMLSKPILHSRIGKWALAL 1090
+K L + K Y+ P+ V F+++ ++ L R+ +W
Sbjct: 582 SNDKELLAVKQVITKFSAYLTPVRFTVRTDNKNFTYF--LRINLKGDSKQGRLVRWQNWF 639
Query: 1091 TEYSLTYAPLKAVKGQAIADFL 1112
++Y L+ VK +AD L
Sbjct: 640 SKYQFDVEHLEGVK-NVLADCL 660
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 659
Score = 121 bits (304), Expect = 1e-26
Identities = 108/456 (23%), Positives = 194/456 (41%), Gaps = 21/456 (4%)
Query: 674 PGLSRDLVELKLPIKEDKKPVKQLPRRFHPDVLVKIKEEIERLLKCKFIRTARYVDWLAN 733
P S+ + + + + K VK P + P + +I+ LL+ K I+ ++
Sbjct: 209 PEKSKQWMTATIELIDPKTVVKVKPMSYSPSDREEFDRQIKELLELKVIKPSKSTHMSPA 268
Query: 734 VV---PVIKKNGKMRVCIDFRDLNAATPKDEYHMPVAEMMVDSTAGHEYLSLLDGYSGYN 790
+ ++ GK R+ ++++ +N AT D +++P + ++ G + S D SG
Sbjct: 269 FLVENEAERRRGKKRMVVNYKAMNKATKGDAHNLPNKDELLTLVRGKKIYSSFDCKSGLW 328
Query: 791 QIFIAEEDVSKTAFRCPGALGTYEWVVMPFGLKNAGATYQRVMNTIFHDFIETFMQVYID 850
Q+ + +E TAF CP G Y+W V+PFGLK A + + + + + VY+D
Sbjct: 329 QVLLDKESQLLTAFTCP--QGHYQWNVVPFGLKQAPSIFPKTYANSHSNQYSKYCCVYVD 386
Query: 851 DIVV-KSPSRDDHLAHLRKSFERMRKYGLKMNPLKCAFGVIAGDFLGFVVHKKGIEINKN 909
DI+V + R +H H+ R K G+ ++ K +FLG + +G +N
Sbjct: 387 DILVFSNTGRKEHYIHVLNILRRCEKLGIILSKKKAQLFKEKINFLGLEI-DQGTHCPQN 445
Query: 910 KAKAILDTSPP--TSKKQLQSLLGKINFLRRFIANLSEKTKSFSPLLRLKKEDAFRWEAE 967
+ P KKQLQ LG + + +I L+ K +LK++ + W
Sbjct: 446 HILEHIHKFPDRIEDKKQLQRFLGILTYASDYIPKLASIRKPLQS--KLKEDSTWTWNDT 503
Query: 968 HQKAFDELKVYLSSPHVMAPPIRGKPMKLYISATDGTIGSMLAQEDEDSKERAIFYLSRV 1027
+ ++K L S + P + + A++ G +L + +S E Y S
Sbjct: 504 DSQYMAKIKKNLKSFPKLYHPEPNDKLVIETDASEEFWGGIL-KAIHNSHEYICRYASGS 562
Query: 1028 LNDAETRYTMIEKLCLCLYFSCVKLKYYIKPIDVMV------FSHYDIIKHMLSKPILHS 1081
AE Y EK L + K Y+ P ++ F+H+ + L
Sbjct: 563 FKAAERNYHSNEKELLAVIRVIKKFSIYLTPSRFLIRTDNKNFTHF--VNINLKGDRKQG 620
Query: 1082 RIGKWALALTEYSLTYAPLKAVKGQAIADFLVDHTL 1117
R+ +W + L++Y + K ADFL ++TL
Sbjct: 621 RLVRWQMWLSQYDFDVEHIAGTK-NVFADFLQENTL 655
>YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II
Length = 1268
Score = 120 bits (300), Expect = 4e-26
Identities = 96/296 (32%), Positives = 150/296 (50%), Gaps = 16/296 (5%)
Query: 649 TEWQNEMVKLLKEFKDCFAWDYDEMPGLSRDLVELKLPIKEDKKPVKQLPRRFHPDVLVK 708
TE V L +F + F D + +++ E + +E+ PV + R L
Sbjct: 401 TEASRLEVMLKNDFPEVFK---DGLGLCTKEKAEFRT--EENAVPVFKRARPVPYGSLEA 455
Query: 709 IKEEIERLLKCKFIRTARYVDWLANVVPVIKKN-GKMRVCIDFR--DLNAATPKDEYH-M 764
++ E+ RL + I Y W A +V + KK GK+RVC DF+ LNAA KDE+H +
Sbjct: 456 VETELNRLQEMGVIVPITYAKWAAPIVVIKKKGTGKIRVCADFKCSGLNAAL-KDEFHPL 514
Query: 765 PVAEMMVDSTAGHEYLSLLDGYSGYNQIFIAEEDVSKTAFRCPGALGTYEWVVMPFGLKN 824
P +E + G Y S +D Y Q+ + EE G ++++ M FGLK
Sbjct: 515 PTSEDIFSRLKGTVY-SQIDLKDAYLQVELDEEAQKLAVINTHR--GIFKYLRMTFGLKP 571
Query: 825 AGATYQRVMNTIFHDFIETFMQVYIDDIVVKSPSRDDHLAHLRKSFERMRKYGLKMNPLK 884
A A++Q++M+ + T + VY DDI++ + S ++H LR+ FER ++YG +++ K
Sbjct: 572 APASFQKIMDKMVSGL--TGVAVYWDDIIISASSIEEHEKILRELFERFKEYGFRVSAEK 629
Query: 885 CAFGVIAGDFLGFVVHKKGIEINKNKAKAILDTSPPTSKKQLQSLLGKINFLRRFI 940
CAF FLGF V + G + K +AI PT +KQL S LG ++L R +
Sbjct: 630 CAFAQKQVTFLGF-VDEHGRRPDSKKTEAIRSMKAPTDQKQLASFLGAADWLSRMM 684
Score = 75.1 bits (183), Expect = 2e-12
Identities = 62/232 (26%), Positives = 104/232 (44%), Gaps = 25/232 (10%)
Query: 1344 IAISAVHDGLCGAHQAGIKMKWILFRQGMYWPTIMKDCMEYDKGCQDFQRHAGIQHVPAS 1403
I + +H+G H ++MK R ++W + D + C + Q ++ + V
Sbjct: 785 IVLKQLHEG----HPGIVQMKQKA-RSFVFWRGLDSDIENMVRHCNNCQENSKMPRVVP- 838
Query: 1404 ELHSIIKPWP-----FRGWALDLIGEINPCSSRQHKYIIVAIDYFTKWVEAIPLQNVTQD 1458
+ PWP ++ +D G +N C Y++V +D TK+ E ++++
Sbjct: 839 -----LNPWPVPEAPWKRIHIDFAGPLNGC------YLLVVVDAKTKYAEVKLTRSISAV 887
Query: 1459 TVIDFIQNHIVYRFGLPESLTTDQGTVFVGQKVASFAESWGIKLLNSTPYYAQANGQVEA 1518
T ID ++ I G PE++ +D GT A +S GI+ S YY ++NG E
Sbjct: 888 TTIDLLEE-IFSIHGYPETIISDNGTQLTSHLFAQMCQSHGIEHKTSAVYYPRSNGAAER 946
Query: 1519 ANKTLISLIKKHVGRKPKRWHQTLGQVLWAYRNSPKEA-TGATPFRLAYGQE 1569
TL I K G Q L + L +YRN+P A G+TP +G++
Sbjct: 947 FVDTLKRGIAKIKGEGSVN-QQILNKFLISYRNTPHSALNGSTPAECHFGRK 997
>POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;
Protease (EC 3.4.23.-); Reverse transcriptase (EC
2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1886
Score = 117 bits (294), Expect = 2e-25
Identities = 114/402 (28%), Positives = 180/402 (44%), Gaps = 32/402 (7%)
Query: 650 EWQNEMVKLLKEFKDCFAWDYDEMPGLSRDLVELKLPI-KEDKKPVKQLPRRFHPDVLVK 708
E+ + LLKE K+ + M + ++ KL I D K + + + P
Sbjct: 1358 EFARKNKDLLKEMKEMKYIGENPMEFWKNNKIKCKLNIINPDIKIMGRPIKHVTPGDEEA 1417
Query: 709 IKEEIERLLKCKFIR--------TARYVDWLANVVPVI--KKNGKMRVCIDFRDLNAATP 758
+ +I LL+ K IR TA V + P+ +K GK R+ +++ LN T
Sbjct: 1418 MTRQINLLLQMKVIRPSESKHRSTAFIVRSGTEIDPITGKEKKGKERMVFNYKLLNENTE 1477
Query: 759 KDEYHMPVAEMMVDSTAGHEYLSLLDGYSGYNQIFIAEEDVSKTAFRCPGALGTYEWVVM 818
D+Y +P ++ + S D SG+ Q+ + EE V TAF L YEW+VM
Sbjct: 1478 SDQYSLPGINTIISKVGRSKIYSKFDLKSGFWQVAMEEESVPWTAFLAGNKL--YEWLVM 1535
Query: 819 PFGLKNAGATYQRVMNTIFHDFIETFMQVYIDDIVVKSPSRDDHLAHLRKSFERMRKYGL 878
PFGLKNA A +QR M+ +F E F+ VYIDDI+V S + + H HL + ++ GL
Sbjct: 1536 PFGLKNAPAIFQRKMDNVFKG-TEKFIAVYIDDILVFSETAEQHSQHLYTMLQLCKENGL 1594
Query: 879 KMNPLKCAFGVIAGDFLGFVVHKKGIEINKNKAKAILDTSPP--TSKKQLQSLLGKINFL 936
++P K G DFLG + I++ + I D S + + ++S LG +++
Sbjct: 1595 ILSPTKMKIGTPEIDFLGASLGCTKIKLQPHIISKICDFSDEKLATPEGMRSWLGILSYA 1654
Query: 937 RRFIANLSEKTKSFSPL-LRLKKEDAFRWEAEHQKAFDELKVYLSS-PHVMAPPIRGKPM 994
R +I ++ K PL ++ R E K ++K + + P + PP
Sbjct: 1655 RNYIQDIG---KLVQPLRQKMAPTGDKRMNPETWKMVRQIKEKVKNLPDLQLPP----KD 1707
Query: 995 KLYISATDGTIGS-------MLAQEDEDSKERAIFYLSRVLN 1029
I TDG + +++ D S ER Y S N
Sbjct: 1708 SFIIIETDGCMTGWGAVCKWKMSKHDPRSTERICAYASGSFN 1749
>POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)]
Length = 886
Score = 116 bits (290), Expect = 6e-25
Identities = 86/332 (25%), Positives = 153/332 (45%), Gaps = 7/332 (2%)
Query: 734 VVPVIKKNGKMRVCIDFRDLNAATPKDEYHMPVAEMMVDSTAGHEYLSLLDGYSGYNQIF 793
V PV K +G+ R+ +D+R++N P + ++ + +Y + LD +G+
Sbjct: 5 VYPVPKPDGRWRMVLDYREVNKTIPLTAAQNQHSAGILATIVRQKYKTTLDLANGFWAHP 64
Query: 794 IAEEDVSKTAFRCPGALGTYEWVVMPFGLKNAGATYQRVMNTIFHDFIETFMQVYIDDIV 853
I E TAF G Y W +P G N+ A + + + + +QVY+DDI
Sbjct: 65 ITPESYWLTAFTWQGK--QYCWTRLPQGFLNSPALFTADVVDLLKEIPN--VQVYVDDIY 120
Query: 854 VKSPSRDDHLAHLRKSFERMRKYGLKMNPLKCAFGVIAGDFLGFVVHKKGIEINKNKAKA 913
+ +H+ L K F+ + + G ++ K G +FLGF + K+G +
Sbjct: 121 LSHDDPKEHVQQLEKVFQILLQAGYVVSLKKSEIGQKTVEFLGFNITKEGRGLTDTFKTK 180
Query: 914 ILDTSPPTSKKQLQSLLGKINFLRRFIANLSEKTKSFSPLLRLKKEDAFRWEAEHQKAFD 973
+L+ +PP KQLQS+LG +NF R FI N +E + L+ K W E+ K +
Sbjct: 181 LLNITPPKDLKQLQSILGLLNFARNFIPNFAELVQPLYNLIASAKGKYIEWSEENTKQLN 240
Query: 974 ELKVYLSSPHVMAPPIRGKPMKLYISATDGTIGSMLAQEDEDSKERAIFYLSRVLNDAET 1033
+ L++ + R +L I + +E K + I YL+ V + AE
Sbjct: 241 MVIEALNTASNLEE--RLPEQRLVIKVNTSPSAGYVRYYNETGK-KPIMYLNYVFSKAEL 297
Query: 1034 RYTMIEKLCLCLYFSCVKLKYYIKPIDVMVFS 1065
+++M+EKL ++ + +K +++V+S
Sbjct: 298 KFSMLEKLLTTMHKALIKAMDLAMGQEILVYS 329
Score = 75.5 bits (184), Expect = 1e-12
Identities = 50/201 (24%), Positives = 90/201 (43%), Gaps = 4/201 (1%)
Query: 1373 YWPTIMKDCMEYDKGCQDFQRHAGIQHVPASELHSIIKPWPFRGWALDLIGEINPCSSRQ 1432
+WP + KD ++ CQ L PF + +D IG + P S+
Sbjct: 635 WWPNMRKDVVKQLGRCQQCLITNASNKASGPILRPDRPQKPFDKFFIDYIGPLPP--SQG 692
Query: 1433 HKYIIVAIDYFTKWVEAIPLQNVTQDTVIDFIQNHIVYRFGLPESLTTDQGTVFVGQKVA 1492
+ Y++V +D T + P + + + + +++ +P+ + +DQG F A
Sbjct: 693 YLYVLVVVDGMTGFTWLYPTKAPSTSATVKSL--NVLTSIAIPKVIHSDQGAAFTSSTFA 750
Query: 1493 SFAESWGIKLLNSTPYYAQANGQVEAANKTLISLIKKHVGRKPKRWHQTLGQVLWAYRNS 1552
+A+ GI L STPY+ Q+ +VE N + L+ K + +P +W+ L V A N+
Sbjct: 751 EWAKERGIHLEFSTPYHPQSGSKVERKNSDIKRLLTKLLVGRPTKWYDLLPVVQLALNNT 810
Query: 1553 PKEATGATPFRLAYGQEAVLP 1573
TP +L +G ++ P
Sbjct: 811 YSPVLKYTPHQLLFGIDSNTP 831
>POL_SFV3L (P27401) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1157
Score = 115 bits (288), Expect = 1e-24
Identities = 97/374 (25%), Positives = 163/374 (42%), Gaps = 10/374 (2%)
Query: 692 KPVKQLPRRFHPDVLVKIKEEIERLLKCKFIRTARYVDWLANVVPVIKKNGKMRVCIDFR 751
+P KQ P +P I+ I LLK + + V PV K +GK R+ +D+R
Sbjct: 177 RPQKQYP--INPKAKASIQTVINDLLKQGVLIQQNSI-MNTPVYPVPKPDGKWRMVLDYR 233
Query: 752 DLNAATPKDEYHMPVAEMMVDSTAGHEYLSLLDGYSGYNQIFIAEEDVSKTAFRCPGALG 811
++N P + ++ S +Y + LD +G+ I E TAF G
Sbjct: 234 EVNKTIPLIAAQNQHSAGILSSIFRGKYKTTLDLSNGFWAHSITPESYWLTAFTWLGQ-- 291
Query: 812 TYEWVVMPFGLKNAGATYQRVMNTIFHDFIETFMQVYIDDIVVKSPSRDDHLAHLRKSFE 871
Y W +P G N+ A + + + + +QVY+DDI + +HL L K F
Sbjct: 292 QYCWTRLPQGFLNSPALFTADVVDLLKEVPN--VQVYVDDIYISHDDPREHLEQLEKVFS 349
Query: 872 RMRKYGLKMNPLKCAFGVIAGDFLGFVVHKKGIEINKNKAKAILDTSPPTSKKQLQSLLG 931
+ G ++ K +FLGF + K+G + + + +L+ +PP KQLQS+LG
Sbjct: 350 LLLNAGYVVSLKKSEIAQHEVEFLGFNITKEGRGLTETFKQKLLNITPPRDLKQLQSILG 409
Query: 932 KINFLRRFIANLSEKTKSFSPLLRLKKEDAFRWEAEHQKAFDELKVYLSSPHVMAPPIRG 991
+NF R FI N SE K ++ W ++ + + L+S + R
Sbjct: 410 LLNFARNFIPNFSELVKPLYNIIATANGKYITWTTDNSQQLQNIISMLNSAENLEE--RN 467
Query: 992 KPMKLYISATDGTIGSMLAQEDEDSKERAIFYLSRVLNDAETRYTMIEKLCLCLYFSCVK 1051
++L + + +E +K R I YL+ V AE ++T EKL ++ +K
Sbjct: 468 PEVRLIMKVNTSPSAGYIRFYNEFAK-RPIMYLNYVYTKAEVKFTNTEKLLTTIHKGLIK 526
Query: 1052 LKYYIKPIDVMVFS 1065
+++V+S
Sbjct: 527 ALDLGMGQEILVYS 540
Score = 84.3 bits (207), Expect = 3e-15
Identities = 77/331 (23%), Positives = 144/331 (43%), Gaps = 38/331 (11%)
Query: 1373 YWPTIMKDCMEYDKGCQD--FQRHAGIQHVPASELHSIIKPWPFRGWALDLIGEINPCSS 1430
+WP + KD ++ + C+ A + P +KP F + +D IG + P +
Sbjct: 845 WWPNLRKDVVKVIRQCKQCLVTNAATLAAPPILRPERPVKP--FDKFFIDYIGPLPPSNG 902
Query: 1431 RQHKYIIVAIDYFTKWVEAIPLQNVTQDTVIDFIQNHIVYRFGLPESLTTDQGTVFVGQK 1490
H ++V +D T +V P + + + + +++ +P+ + +DQG F
Sbjct: 903 YLH--VLVVVDSMTGFVWLYPTKAPSTSATVKAL--NMLTSIAVPKVIHSDQGAAFTSAT 958
Query: 1491 VASFAESWGIKLLNSTPYYAQANGQVEAANKTLISLIKKHVGRKPKRWHQTLGQVLWAYR 1550
A +A++ GI+L STPY+ Q++G+VE N + L+ K + +P +W+ L V A
Sbjct: 959 FADWAKNKGIQLEFSTPYHPQSSGKVERKNSDIKRLLTKLLVGRPAKWYDLLPVVQLALN 1018
Query: 1551 NSPKEATGATPFRLAYGQEAVLPAEVYLQSCRIQRQEEIPSEDYWNMMLDELVNLDEERL 1610
NS ++ TP +L +G ++ P + D ++ +E ++L +E
Sbjct: 1019 NSYSPSSKYTPHQLLFGIDSNTP---------------FANSDTLDLSREEELSLLQEIR 1063
Query: 1611 SALDILTRQKDRVAKAYNKKVRAKSFMPGDYVWKVVLPVDKRDKRYGKWAPNWEGPFTVE 1670
S+L + + +RA S G V +R R P W P V
Sbjct: 1064 SSLYLPSTPP--------ASIRAWSPSVGQL-------VQERVARPASLRPRWHKPTPVL 1108
Query: 1671 KILLNNAYSIKELGGRNRQMTVNGKYLKTYK 1701
+++ A I + G R ++V+ L Y+
Sbjct: 1109 EVINPRAVVILDHLGNRRTVSVDNLKLTAYQ 1139
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.319 0.137 0.411
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 210,985,095
Number of Sequences: 164201
Number of extensions: 9574724
Number of successful extensions: 26063
Number of sequences better than 10.0: 163
Number of HSP's better than 10.0 without gapping: 89
Number of HSP's successfully gapped in prelim test: 74
Number of HSP's that attempted gapping in prelim test: 25529
Number of HSP's gapped (non-prelim): 344
length of query: 1710
length of database: 59,974,054
effective HSP length: 124
effective length of query: 1586
effective length of database: 39,613,130
effective search space: 62826424180
effective search space used: 62826424180
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 73 (32.7 bits)
Lotus: description of TM0026.5