
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0545.3
(2087 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 231 2e-59
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 227 3e-58
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 184 2e-45
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 181 1e-44
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 181 1e-44
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 180 4e-44
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 179 7e-44
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 177 4e-43
POL_SFV3L (P27401) Pol polyprotein [Contains: Protease (EC 3.4.2... 174 2e-42
POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse transcript... 167 3e-40
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 142 7e-33
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 141 2e-32
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 141 2e-32
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 139 6e-32
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 139 8e-32
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 137 2e-31
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot... 121 2e-26
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot... 120 4e-26
POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;... 112 1e-23
YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II 109 7e-23
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 231 bits (589), Expect = 2e-59
Identities = 181/605 (29%), Positives = 296/605 (48%), Gaps = 43/605 (7%)
Query: 1003 EEVDLGDGLEKRPTYISALIDPELKDRMV-KLLKEFKDCFAWDYDEMPGLNRELVELKLP 1061
E V + E+ T L DR + ++++F+D FA DE+ G N E +
Sbjct: 878 EVVKTAETYERFTTICEHLKRENGDDRKIWDVIEQFQDVFAISDDEL-GRNSG-TECVIE 935
Query: 1062 IKEDKKPVKQLPRRFHPDVLVKIKEEIERLLKCKFIRTARYVDWLANVVPVIKKNGKMRV 1121
+KE +P++Q PR + +I++ I+++L K IR ++ W + VV V KK+G +R+
Sbjct: 936 LKEGAEPIRQKPRPIPLALKPEIRKMIQKMLNQKVIRESKS-PWSSPVVLVKKKDGSIRM 994
Query: 1122 CIDFRDLNAATPKDEYHMPIAEMMVDSAAGHEYLSLLDGYSGYNQIFIA*EDVSKTAFRC 1181
CID+R +N + + +P E + S AG + ++ D +G+ QI + + TAF
Sbjct: 995 CIDYRKVNKVVKNNAHPLPNIEATLQSLAGKKLYTVFDMIAGFWQIPLDEKSKEITAFAI 1054
Query: 1182 PGALGTYEWVVMPFGLKNAGLKNAGATYQRVMNTIFHDFIETFMQVYIDDIVVKSPSRDG 1241
L +EW V+PFGL + A +Q M I D + VY+DD+++ S +
Sbjct: 1055 GSEL--FEWNVLPFGLVISP-----ALFQGTMEEIIGDLLGVCAFVYVDDLLIASKDMEQ 1107
Query: 1242 HLLHLRKSFERMRKYGLKMNPLKCAFGVIAGDFLGFVVHKKGIEINKNKAKAILDTSPPT 1301
HL ++++ R+RK G+K+ KC ++LG V G+E + K + S PT
Sbjct: 1108 HLQDVKEALTRIRKSGMKLRASKCHIAKKEVEYLGHKVTLDGVETQEVKTDKMKQFSRPT 1167
Query: 1302 SKKQLQSLLGKINFLRRFITNLSEKTKSFSSLLRLKKEDVFRWEAEHQKAFDELKKYLSN 1361
+ K+LQS LG + + R+FI N ++ S +SL+ K + WE E + AF ELKK +
Sbjct: 1168 NVKELQSFLGLVGYYRKFILNFAQIASSLTSLISAKV--AWIWEKEQEIAFQELKKLVCQ 1225
Query: 1362 PPVMIPP-----IKG-RPMKLYISATDETIGSMLAQEDEDGNERAIFYLSRVLNGAETRY 1415
PV+ P +KG RP +Y A+ + IG++LAQE DG + I + S+ L+ AETRY
Sbjct: 1226 TPVLAQPDVEAALKGDRPFMIYTDASRKGIGAVLAQEGPDGQQHPIAFASKALSPAETRY 1285
Query: 1416 TMIEKLCLCLYFSCVKLKYYIKPIDVMVFSHYDIIKHMLSKPILHSRIGKWALALTEYSL 1475
+ + L + F+ + K I + VF+ + + +L L R+ +W++ + E+ +
Sbjct: 1286 HITDLEALAMMFALRRFKTIIYGTAITVFTDHKPLISLLKGSPLADRLWRWSIEILEFDV 1345
Query: 1476 TYAPLKAIKGQAVADFLADHTVP---------KEIVTYVGIQPWKL--YFDGSS-----H 1519
L A K AVAD L+ P KE+ + V +L D S
Sbjct: 1346 KIVYL-AGKANAVADALSRGGCPPNELEEEQTKELTSIVNAIQTELPDILDSSCWLERLK 1404
Query: 1520 KNGTGIGMFIMSPQGAPTKFKFRIEGNCSNNEVEYEALISGLEILLALGAKNVVIKGDSE 1579
G I + +G TK F+I G S +EY ++ G+ KN I+ S
Sbjct: 1405 GEDEGWKEVIAALEGGKTKGTFKIVGIESEISLEYYKIVGGV-------LKNTEIEEQSR 1457
Query: 1580 LVVKQ 1584
VV +
Sbjct: 1458 SVVPE 1462
Score = 118 bits (296), Expect = 2e-25
Identities = 106/447 (23%), Positives = 194/447 (42%), Gaps = 32/447 (7%)
Query: 1642 KLTELIKIKEKLSPSDLDIMC-IDNLTSND--WRKPIVEYLQNPVGSTDRKVKYRALS-- 1696
+LT ++ + P LD C ++ L D W K ++ L+ G T K +
Sbjct: 1378 ELTSIVNAIQTELPDILDSSCWLERLKGEDEGW-KEVIAALEG--GKTKGTFKIVGIESE 1434
Query: 1697 -----YTILGNELFKKNINGTLLKCLSENDAFMAVSAAHDGLCGAHQAGAKMKWILFRQG 1751
Y I+G L I + E + H+G+ H G K W + +
Sbjct: 1435 ISLEYYKIVGGVLKNTEIEEQSRSVVPEKIRTPLLKELHEGMLAGH-FGIKKMWRMVHRK 1493
Query: 1752 LYWPTI---MKDCIEYARGCQDCQKHSGIQHVPASELHSIIKPWPFRGWAIDLIGEIHPA 1808
YWP + +++C+ C HS + S L +P A DL+
Sbjct: 1494 FYWPQMRVCVENCVRTCAKCLCANDHSKL----TSSLTPYRMTFPLEIVACDLMDV--GL 1547
Query: 1809 SSKQHKYIIVAVDYFTKWVEAIPLQNVTQETVIE-FIQNHIVYRFGLPESITTDQGTVFV 1867
S + ++YI+ +D FTK+ A+P+ + ETV++ F++ + +P + TDQG FV
Sbjct: 1548 SVQGNRYILTIIDLFTKYGTAVPIPDKKAETVLKAFVERWAIGEGRIPLKLLTDQGKEFV 1607
Query: 1868 GRKVAAFAESWGIKLLTSTPYYAQANGQVEAANKILISLIKKHVGRKPKRWHESLSQVLW 1927
A F I+ +T+ Y ++ANG VE NK ++ ++KK P W + + ++
Sbjct: 1608 NGLFAQFTHMLKIEHITTKGYNSRANGAVERFNKTIMHIMKKKTA-VPMEWDDQVVYAVY 1666
Query: 1928 AYRNSPTEATGTTPFRLAYGQEAVLPAEVYLQSCRIQRQEEIPSEVYWNMMLDEMVNLDE 1987
AY N E TG TP L +G++ + P E+ + ++ + Y +++ E++ + +
Sbjct: 1667 AYNNCVHENTGETPMFLMHGRDVMGPLEMSGEDAVGINYADM--DEYKHLLTQELLKVQK 1724
Query: 1988 ERLLALDVLTRQKDRIAKAYNKKVRDRSFITGDYVWKVIL--PMDKKDRVYGKWTPNWEG 2045
+A + R+++ +++K + +V+L P +K K W G
Sbjct: 1725 ---IAKEHAMREQESYKSLFDQKYASKKHRFPQPGSRVLLEIPSEKLGAQCPKLVNKWSG 1781
Query: 2046 PFTVEKTLPNNAYVIKELGNQRQCVTI 2072
P+ V N+A + LG ++ + I
Sbjct: 1782 PYRVISCSENSAEITPVLGKRKHILQI 1808
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 227 bits (578), Expect = 3e-58
Identities = 252/1061 (23%), Positives = 451/1061 (41%), Gaps = 119/1061 (11%)
Query: 1024 PEL-KDRMVKLLKEFKDCFAWDYDEMPGLNRELVELKLPIKEDKKPVKQLPRRFHPDVLV 1082
PEL K ++ + E+ D FA + + + N +L+L +D +PV R +
Sbjct: 272 PELFKSQLENICSEYIDIFALESEPITVNNLYKQQLRL---KDDEPVYTKNYRSPHSQVE 328
Query: 1083 KIKEEIERLLKCKFIRTARYVDWLANVVPVIKKNG------KMRVCIDFRDLNAATPKDE 1136
+I+ ++++L+K K + + + + ++ V KK+ K R+ ID+R +N D+
Sbjct: 329 EIQAQVQKLIKDKIVEPS-VSQYNSPLLLVPKKSSPNSDKKKWRLVIDYRQINKKLLADK 387
Query: 1137 YHMPIAEMMVDSAAGHEYLSLLDGYSGYNQIFIA*EDVSKTAFRCPGALGTYEWVVMPFG 1196
+ +P + ++D +Y S LD SG++QI + T+F G+Y + +PFG
Sbjct: 388 FPLPRIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTSN--GSYRFTRLPFG 445
Query: 1197 LKNAGLKNAGATYQRVMNTIFHDFIETFMQVYIDDIVVKSPSRDGHLLHLRKSFERMRKY 1256
LK A ++QR+M F + +Y+DD++V S L +L + F + R+Y
Sbjct: 446 LKIAP-----NSFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEVFGKCREY 500
Query: 1257 GLKMNPLKCAFGVIAGDFLGFVVHKKGIEINKNKAKAILDTSPPTSKKQLQSLLGKINFL 1316
LK++P KC+F + FLG KGI + K I + P + + N+
Sbjct: 501 NLKLHPEKCSFFMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRFVAFCNYY 560
Query: 1317 RRFITNLSEKTKSFSSLLRLKKEDVFRWEAEHQKAFDELKKYLSNPPVMIPPIKGRPMKL 1376
RRFI N ++ ++ + L KK F W E QKAF LK L NP ++ P + +
Sbjct: 561 RRFIKNFADYSRHITRL--CKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSKEFCI 618
Query: 1377 YISATDETIGSMLAQEDEDGNERAIFYLSRVLNGAETRYTMIEKLCLCLYFSCVKLKYYI 1436
A+ + G++L Q + +G++ + Y SR E+ + E+ ++++ + + YI
Sbjct: 619 TTDASKQACGAVLTQ-NHNGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYI 677
Query: 1437 KPIDVMVFSHYDIIKHMLSKPILHSRIGKWALALTEYSLTYAPLKAIKGQAVADFLADHT 1496
V + + + ++ S S++ + L L EY+ T LK K VAD L+ T
Sbjct: 678 YGKHFTVKTDHRPLTYLFSMVNPSSKLTRIRLELEEYNFTVEYLKG-KDNHVADALSRIT 736
Query: 1497 VPKEIVTYVGIQPWKLYFDGSSHKNGTGIGMFIMSPQGAPTKFKFRIEGNCSNNEVEYEA 1556
+ KE+ K+ TG + + T+F+ R + +C+ E + +
Sbjct: 737 I-KEL------------------KDITGNILKV------TTRFQSR-QKSCAGKE-QLDL 769
Query: 1557 LISGLEILLALGAKNVVIKGDSELVVKQLTKEYKCVSEHLAKYYVKANNLLAKFNEIGIG 1616
EI V+ + VV + C+ +H ++A++ ++G
Sbjct: 770 QKQTKEIASEPNVYEVITNDEVRKVVTLQLNDSICLFKH-------GKKIIARY-DVGDL 821
Query: 1617 HVPRI--ENQEANELAQIASGYMVDKLKLTELIKIKEKLSPSDLDIMCIDNLTSNDWRKP 1674
+ I +Q L A Y + ++K+ KI E +S M N K
Sbjct: 822 YTNGILDLDQFLQRLELQAGIYDISQIKMAPWKKIFEHVSIDKFKNM------GNKILKN 875
Query: 1675 IVEYLQNPVGSTDRKVKYRALSYTILGNELFKKNINGTLLKCLSENDAFMAVSAAHDGLC 1734
+ L NPV T + NE K+ I TL HD
Sbjct: 876 LKVALLNPV--------------TQINNEKEKEAILSTL----------------HDDPI 905
Query: 1735 GAHQAGAKMKWILFRQGLYWPTIMKDCIEYARGCQDCQKHSGIQHVPASELHSIIKPWPF 1794
G ++ YW + K EY R CQ CQK +H + F
Sbjct: 906 QGGHTGITKTLAKVKRHYYWKNMSKYIKEYVRKCQKCQKAKTTKHTKTPMTITETPEHAF 965
Query: 1795 RGWAIDLIGEIHPASSKQHKYIIVAVDYFTKWVEAIPLQNVTQETVIEFIQNHIVYRFGL 1854
+D IG + P S ++Y + + TK++ AIP+ N + +TV + I + ++G
Sbjct: 966 DRVVVDTIGPL-PKSENGNEYAVTLICDLTKYLVAIPIANKSAKTVAKAIFESFILKYGP 1024
Query: 1855 PESITTDQGTVFVGRKVAAFAESWGIKLLTSTPYYAQANGQVEAANKILISLIKKHVGRK 1914
++ TD GT + + + IK +TST ++ Q G VE +++ L I+ ++
Sbjct: 1025 MKTFITDMGTEYKNSIITDLCKYLKIKNITSTAHHHQTVGVVERSHRTLNEYIRSYISTD 1084
Query: 1915 PKRWHESLSQVLWAYRNSPTEATGTTPFRLAYGQEAVLPAEVYLQSCRIQRQEEIPSEVY 1974
W L ++ + + + P+ L +G+ + LP ++ E I +
Sbjct: 1085 KTDWDVWLQYFVYCFNTTQSMVHNYCPYELVFGRTSNLPKHFN----KLHSIEPIYN--- 1137
Query: 1975 WNMMLDEMVNLDEERL-----LALDVLTRQKDRIAKAYNKKVRDRSFITGDYVWKVILPM 2029
+D+ + RL A +L K++ + Y+ KV+D GD KV+L
Sbjct: 1138 ----IDDYAKESKYRLEVAYARARKLLEAHKEKNKENYDLKVKDIELEVGD---KVLLRN 1190
Query: 2030 DKKDRVYGKWTPNWEGPFTVEKTLPNNAYVIKELGNQRQCV 2070
+ ++ K+T GP+ +E NN + N++Q V
Sbjct: 1191 EVGHKLDFKYT----GPYKIESIGDNNNITLLTNKNKKQIV 1227
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 184 bits (467), Expect = 2e-45
Identities = 133/486 (27%), Positives = 248/486 (50%), Gaps = 28/486 (5%)
Query: 1023 DPELKDRMVKLLKEFKDCFA-WDYDEMPGLNREL-VELKLPIKEDKKPVKQLPRRFHPDV 1080
+PEL D + KEFKD A + +++P + L E++L + + P++ P P
Sbjct: 371 EPELPD----IYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRNYP--LPPGK 424
Query: 1081 LVKIKEEIERLLKCKFIRTARYVDWLANVVPVIKKNGKMRVCIDFRDLNAATPKDEYHMP 1140
+ + +EI + LK IR ++ ++ V+ V KK G +R+ +D++ LN + Y +P
Sbjct: 425 MQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLP 483
Query: 1141 IAEMMVDSAAGHEYLSLLDGYSGYNQIFIA*EDVSKTAFRCPGALGTYEWVVMPFGLKNA 1200
+ E ++ G + LD S Y+ I + D K AFRCP G +E++VMP+G+ A
Sbjct: 484 LIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPR--GVFEYLVMPYGISTA 541
Query: 1201 GLKNAGATYQRVMNTIFHDFIETFMQVYIDDIVVKSPSRDGHLLHLRKSFERMRKYGLKM 1260
A +Q +NTI + E+ + Y+DDI++ S S H+ H++ ++++ L +
Sbjct: 542 P-----AHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLII 596
Query: 1261 NPLKCAFGVIAGDFLGFVVHKKGIEINKNKAKAILDTSPPTSKKQLQSLLGKINFLRRFI 1320
N KC F F+G+ + +KG + +L P ++K+L+ LG +N+LR+FI
Sbjct: 597 NQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFI 656
Query: 1321 TNLSEKTKSFSSLLRLKKEDVFRWEAEHQKAFDELKKYLSNPPVMIPPIKGRPMKLYISA 1380
S+ T ++L LKK+ ++W +A + +K+ L +PPV+ + + L A
Sbjct: 657 PKTSQLTHPLNNL--LKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDA 714
Query: 1381 TDETIGSMLAQEDEDGNERAIFYLSRVLNGAETRYTMIEKLCLCLYFSCVKLKYY----I 1436
+D +G++L+Q+ +D + Y S ++ A+ Y++ +K L + S ++Y I
Sbjct: 715 SDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTI 774
Query: 1437 KPIDVMVFSHYDIIKHML--SKPILHSRIGKWALALTEYS--LTYAPLKAIKGQAVADFL 1492
+P ++ H ++I + S+P + R+ +W L L +++ + Y P A +
Sbjct: 775 EPFKILT-DHRNLIGRITNESEP-ENKRLARWQLFLQDFNFEINYRPGSANHIADALSRI 832
Query: 1493 ADHTVP 1498
D T P
Sbjct: 833 VDETEP 838
Score = 89.4 bits (220), Expect = 1e-16
Identities = 77/315 (24%), Positives = 130/315 (40%), Gaps = 37/315 (11%)
Query: 1754 WPTIMKDCIEYARGCQDCQKHSGIQHVPASELHSII---KPWPFRGWAIDLIGEIHPASS 1810
W I K EY + C CQ + H P L I +PW ++D I + +S
Sbjct: 943 WKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPW--ESLSMDFITALPESSG 1000
Query: 1811 KQHKYIIVAVDYFTKWVEAIPL-QNVTQETVIEFIQNHIVYRFGLPESITTDQGTVFVGR 1869
+ + V VD F+K +P +++T E ++ FG P+ I D +F +
Sbjct: 1001 --YNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQ 1058
Query: 1870 KVAAFAESWGIKLLTSTPYYAQANGQVEAANKILISLIKKHVGRKPKRWHESLSQVLWAY 1929
FA + + S PY Q +GQ E N+ + L++ P W + +S V +Y
Sbjct: 1059 TWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSY 1118
Query: 1930 RNSPTEATGTTPFRLAYG-QEAVLPAEVYLQSCRIQRQEEIPSEVYWNMMLDEMVNLDEE 1988
N+ AT TPF + + A+ P E+ S + + +V+ +
Sbjct: 1119 NNAIHSATQMTPFEIVHRYSPALSPLELPSFSDKTDENSQETIQVFQTVK---------- 1168
Query: 1989 RLLALDVLTRQKDRIAKAYNKKVRD-RSFITGDYVWKVILPMDKKDRV-----YGKWTPN 2042
+ L ++ K ++ K+++ F GD V M K+ + K P+
Sbjct: 1169 -----EHLNTNNIKMKKYFDMKIQEIEEFQPGDLV------MVKRTKTGFLHKSNKLAPS 1217
Query: 2043 WEGPFTV-EKTLPNN 2056
+ GPF V +K+ PNN
Sbjct: 1218 FAGPFYVLQKSGPNN 1232
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 181 bits (460), Expect = 1e-44
Identities = 132/486 (27%), Positives = 248/486 (50%), Gaps = 28/486 (5%)
Query: 1023 DPELKDRMVKLLKEFKDCFA-WDYDEMPGLNREL-VELKLPIKEDKKPVKQLPRRFHPDV 1080
+PEL D + KEFKD A + +++P + L E++L + + P++ P P
Sbjct: 371 EPELPD----IYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRNYP--LPPGK 424
Query: 1081 LVKIKEEIERLLKCKFIRTARYVDWLANVVPVIKKNGKMRVCIDFRDLNAATPKDEYHMP 1140
+ + +EI + LK IR ++ ++ V+ V KK G +R+ +D++ LN + Y +P
Sbjct: 425 MQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLP 483
Query: 1141 IAEMMVDSAAGHEYLSLLDGYSGYNQIFIA*EDVSKTAFRCPGALGTYEWVVMPFGLKNA 1200
+ E ++ G + LD S Y+ I + D K AFRCP G +E++VMP+G+ A
Sbjct: 484 LIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPR--GVFEYLVMPYGISIA 541
Query: 1201 GLKNAGATYQRVMNTIFHDFIETFMQVYIDDIVVKSPSRDGHLLHLRKSFERMRKYGLKM 1260
A +Q +NTI + E+ + Y+D+I++ S S H+ H++ ++++ L +
Sbjct: 542 P-----AHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLII 596
Query: 1261 NPLKCAFGVIAGDFLGFVVHKKGIEINKNKAKAILDTSPPTSKKQLQSLLGKINFLRRFI 1320
N KC F F+G+ + +KG + +L P ++K+L+ LG +N+LR+FI
Sbjct: 597 NQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFI 656
Query: 1321 TNLSEKTKSFSSLLRLKKEDVFRWEAEHQKAFDELKKYLSNPPVMIPPIKGRPMKLYISA 1380
S+ T ++L LKK+ ++W +A + +K+ L +PPV+ + + L A
Sbjct: 657 PKTSQLTHPLNNL--LKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDA 714
Query: 1381 TDETIGSMLAQEDEDGNERAIFYLSRVLNGAETRYTMIEKLCLCLYFSCVKLKYY----I 1436
+D +G++L+Q+ +D + Y S ++ A+ Y++ +K L + S ++Y I
Sbjct: 715 SDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTI 774
Query: 1437 KPIDVMVFSHYDIIKHML--SKPILHSRIGKWALALTEYS--LTYAPLKAIKGQAVADFL 1492
+P ++ H ++I + S+P + R+ +W L L +++ + Y P A +
Sbjct: 775 EPFKILT-DHRNLIGRITNESEP-ENKRLARWQLFLQDFNFEINYRPGSANHIADALSRI 832
Query: 1493 ADHTVP 1498
D T P
Sbjct: 833 VDETEP 838
Score = 89.4 bits (220), Expect = 1e-16
Identities = 77/315 (24%), Positives = 130/315 (40%), Gaps = 37/315 (11%)
Query: 1754 WPTIMKDCIEYARGCQDCQKHSGIQHVPASELHSII---KPWPFRGWAIDLIGEIHPASS 1810
W I K EY + C CQ + H P L I +PW ++D I + +S
Sbjct: 943 WKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPW--ESLSMDFITALPESSG 1000
Query: 1811 KQHKYIIVAVDYFTKWVEAIPL-QNVTQETVIEFIQNHIVYRFGLPESITTDQGTVFVGR 1869
+ + V VD F+K +P +++T E ++ FG P+ I D +F +
Sbjct: 1001 --YNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQ 1058
Query: 1870 KVAAFAESWGIKLLTSTPYYAQANGQVEAANKILISLIKKHVGRKPKRWHESLSQVLWAY 1929
FA + + S PY Q +GQ E N+ + L++ P W + +S V +Y
Sbjct: 1059 TWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSY 1118
Query: 1930 RNSPTEATGTTPFRLAYG-QEAVLPAEVYLQSCRIQRQEEIPSEVYWNMMLDEMVNLDEE 1988
N+ AT TPF + + A+ P E+ S + + +V+ +
Sbjct: 1119 NNAIHSATQMTPFEIVHRYSPALSPLELPSFSDKTDENSQETIQVFQTVK---------- 1168
Query: 1989 RLLALDVLTRQKDRIAKAYNKKVRD-RSFITGDYVWKVILPMDKKDRV-----YGKWTPN 2042
+ L ++ K ++ K+++ F GD V M K+ + K P+
Sbjct: 1169 -----EHLNTNNIKMKKYFDMKIQEIEEFQPGDLV------MVKRTKTGFLHKSNKLAPS 1217
Query: 2043 WEGPFTV-EKTLPNN 2056
+ GPF V +K+ PNN
Sbjct: 1218 FAGPFYVLQKSGPNN 1232
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 181 bits (460), Expect = 1e-44
Identities = 132/486 (27%), Positives = 248/486 (50%), Gaps = 28/486 (5%)
Query: 1023 DPELKDRMVKLLKEFKDCFA-WDYDEMPGLNREL-VELKLPIKEDKKPVKQLPRRFHPDV 1080
+PEL D + KEFKD A + +++P + L E++L + + P++ P P
Sbjct: 371 EPELPD----IYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRNYP--LPPGK 424
Query: 1081 LVKIKEEIERLLKCKFIRTARYVDWLANVVPVIKKNGKMRVCIDFRDLNAATPKDEYHMP 1140
+ + +EI + LK IR ++ ++ V+ V KK G +R+ +D++ LN + Y +P
Sbjct: 425 MQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLP 483
Query: 1141 IAEMMVDSAAGHEYLSLLDGYSGYNQIFIA*EDVSKTAFRCPGALGTYEWVVMPFGLKNA 1200
+ E ++ G + LD S Y+ I + D K AFRCP G +E++VMP+G+ A
Sbjct: 484 LIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPR--GVFEYLVMPYGISIA 541
Query: 1201 GLKNAGATYQRVMNTIFHDFIETFMQVYIDDIVVKSPSRDGHLLHLRKSFERMRKYGLKM 1260
A +Q +NTI + E+ + Y+D+I++ S S H+ H++ ++++ L +
Sbjct: 542 P-----AHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLII 596
Query: 1261 NPLKCAFGVIAGDFLGFVVHKKGIEINKNKAKAILDTSPPTSKKQLQSLLGKINFLRRFI 1320
N KC F F+G+ + +KG + +L P ++K+L+ LG +N+LR+FI
Sbjct: 597 NQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFI 656
Query: 1321 TNLSEKTKSFSSLLRLKKEDVFRWEAEHQKAFDELKKYLSNPPVMIPPIKGRPMKLYISA 1380
S+ T ++L LKK+ ++W +A + +K+ L +PPV+ + + L A
Sbjct: 657 PKTSQLTHPLNNL--LKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDA 714
Query: 1381 TDETIGSMLAQEDEDGNERAIFYLSRVLNGAETRYTMIEKLCLCLYFSCVKLKYY----I 1436
+D +G++L+Q+ +D + Y S ++ A+ Y++ +K L + S ++Y I
Sbjct: 715 SDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTI 774
Query: 1437 KPIDVMVFSHYDIIKHML--SKPILHSRIGKWALALTEYS--LTYAPLKAIKGQAVADFL 1492
+P ++ H ++I + S+P + R+ +W L L +++ + Y P A +
Sbjct: 775 EPFKILT-DHRNLIGRITNESEP-ENKRLARWQLFLQDFNFEINYRPGSANHIADALSRI 832
Query: 1493 ADHTVP 1498
D T P
Sbjct: 833 VDETEP 838
Score = 89.4 bits (220), Expect = 1e-16
Identities = 77/315 (24%), Positives = 130/315 (40%), Gaps = 37/315 (11%)
Query: 1754 WPTIMKDCIEYARGCQDCQKHSGIQHVPASELHSII---KPWPFRGWAIDLIGEIHPASS 1810
W I K EY + C CQ + H P L I +PW ++D I + +S
Sbjct: 943 WKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPW--ESLSMDFITALPESSG 1000
Query: 1811 KQHKYIIVAVDYFTKWVEAIPL-QNVTQETVIEFIQNHIVYRFGLPESITTDQGTVFVGR 1869
+ + V VD F+K +P +++T E ++ FG P+ I D +F +
Sbjct: 1001 --YNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQ 1058
Query: 1870 KVAAFAESWGIKLLTSTPYYAQANGQVEAANKILISLIKKHVGRKPKRWHESLSQVLWAY 1929
FA + + S PY Q +GQ E N+ + L++ P W + +S V +Y
Sbjct: 1059 TWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSY 1118
Query: 1930 RNSPTEATGTTPFRLAYG-QEAVLPAEVYLQSCRIQRQEEIPSEVYWNMMLDEMVNLDEE 1988
N+ AT TPF + + A+ P E+ S + + +V+ +
Sbjct: 1119 NNAIHSATQMTPFEIVHRYSPALSPLELPSFSDKTDENSQETIQVFQTVK---------- 1168
Query: 1989 RLLALDVLTRQKDRIAKAYNKKVRD-RSFITGDYVWKVILPMDKKDRV-----YGKWTPN 2042
+ L ++ K ++ K+++ F GD V M K+ + K P+
Sbjct: 1169 -----EHLNTNNIKMKKYFDMKIQEIEEFQPGDLV------MVKRTKTGFLHKSNKLAPS 1217
Query: 2043 WEGPFTV-EKTLPNN 2056
+ GPF V +K+ PNN
Sbjct: 1218 FAGPFYVLQKSGPNN 1232
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 180 bits (456), Expect = 4e-44
Identities = 143/501 (28%), Positives = 238/501 (46%), Gaps = 36/501 (7%)
Query: 1009 DGLEKRPTYISALIDPEL----------KDRMVKLLKEFKDCFAWDYDEMPGLNRELVEL 1058
D + ++P IS +++ +L K R+ LL+++ D + D++ N+ +
Sbjct: 142 DTMLRQPNKISPILESDLYRLEHLNNEEKQRLCALLQKYHDIQYHEGDKLTFTNQTKHTI 201
Query: 1059 KLPIKEDKKPVKQLPRRFHPDVLVKIKEEIERLLKCKFIRTARYVD----WLANVVPVIK 1114
P+ + +V + +I+ +L IRT+ W+
Sbjct: 202 NTKHNLPLYSKYSYPQAYEQEV----ESQIQDMLNQGIIRTSNSPYNSPIWVVPKKQDAS 257
Query: 1115 KNGKMRVCIDFRDLNAATPKDEYHMPIAEMMVDSAAGHEYLSLLDGYSGYNQIFIA*EDV 1174
K R+ ID+R LN T D + +P + ++ Y + +D G++QI + E V
Sbjct: 258 GKQKFRIVIDYRKLNEITVGDRHPIPNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESV 317
Query: 1175 SKTAFRCPGALGTYEWVVMPFGLKNAGLKNAGATYQRVMNTIFHDFIETFMQVYIDDIVV 1234
SKTAF G YE++ MPFGLKNA AT+QR MN I + VY+DDI+V
Sbjct: 318 SKTAFSTKH--GHYEYLRMPFGLKNAP-----ATFQRCMNDILRPLLNKHCLVYLDDIIV 370
Query: 1235 KSPSRDGHLLHLRKSFERMRKYGLKMNPLKCAFGVIAGDFLGFVVHKKGIEINKNKAKAI 1294
S S D HL L FE++ K LK+ KC F FLG V+ GI+ N K +AI
Sbjct: 371 FSTSLDEHLQSLGLVFEKLAKANLKLQLDKCEFLKQETTFLGHVLTPDGIKPNPEKIEAI 430
Query: 1295 LDTSPPTSKKQLQSLLGKINFLRRFITNLSEKTKSFSSLLRLKKEDVFRWEAEHQKAFDE 1354
PT K++++ LG + R+FI N ++ K + L+ K + E+ AF +
Sbjct: 431 QKYPIPTKPKEIKAFLGLTGYYRKFIPNFADIAKPMTKCLK-KNMKIDTTNPEYDSAFKK 489
Query: 1355 LKKYLSNPPVMIPPIKGRPMKLYISATDETIGSMLAQEDEDGNERAIFYLSRVLNGAETR 1414
LK +S P++ P + L A+D +G++L+Q DG+ + Y+SR LN E
Sbjct: 490 LKYLISEDPILKVPDFTKKFTLTTDASDVALGAVLSQ---DGH--PLSYISRTLNEHEIN 544
Query: 1415 YTMIEKLCLCLYFSCVKLKYYIKPIDVMVFSHYDIIKHMLSKPILHSRIGKWALALTEYS 1474
Y+ IEK L + ++ ++Y+ + S + + + +S++ +W + L+E+
Sbjct: 545 YSTIEKELLAIVWATKTFRHYLLGRHFEISSDHQPLSWLYRMKDPNSKLTRWRVKLSEFD 604
Query: 1475 LTYAPLKAIKGQ--AVADFLA 1493
+K IKG+ VAD L+
Sbjct: 605 F---DIKYIKGKENCVADALS 622
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 179 bits (454), Expect = 7e-44
Identities = 123/418 (29%), Positives = 213/418 (50%), Gaps = 22/418 (5%)
Query: 1082 VKIKEEIERLLKCKFIRTARYV----DWLANVVPVIKKNGKMRVCIDFRDLNAATPKDEY 1137
++++ +++ +L IR + W+ P K RV ID+R LN T D Y
Sbjct: 220 IEVENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVVIDYRKLNEITIPDRY 279
Query: 1138 HMPIAEMMVDSAAGHEYLSLLDGYSGYNQIFIA*EDVSKTAFRCPGALGTYEWVVMPFGL 1197
+P + ++ +Y + +D G++QI + E +SKTAF G YE++ MPFGL
Sbjct: 280 PIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTKS--GHYEYLRMPFGL 337
Query: 1198 KNAGLKNAGATYQRVMNTIFHDFIETFMQVYIDDIVVKSPSRDGHLLHLRKSFERMRKYG 1257
+NA AT+QR MN I + VY+DDI++ S S HL ++ F ++
Sbjct: 338 RNAP-----ATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLVFTKLADAN 392
Query: 1258 LKMNPLKCAFGVIAGDFLGFVVHKKGIEINKNKAKAILDTSPPTSKKQLQSLLGKINFLR 1317
LK+ KC F +FLG +V GI+ N K KAI+ PT K++++ LG + R
Sbjct: 393 LKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAFLGLTGYYR 452
Query: 1318 RFITNLSEKTKSFSSLLRLKKEDVFRWEAEHQKAFDELKKYLSNPPVMIPPIKGRPMKLY 1377
+FI N ++ K +S L+ K+ + + E+ +AF++LK + P++ P + L
Sbjct: 453 KFIPNYADIAKPMTSCLK-KRTKIDTQKLEYIEAFEKLKALIIRDPILQLPDFEKKFVLT 511
Query: 1378 ISATDETIGSMLAQEDEDGNERAIFYLSRVLNGAETRYTMIEKLCLCLYFSCVKLKYYIK 1437
A++ +G++L+Q N I ++SR LN E Y+ IEK L + ++ ++Y+
Sbjct: 512 TDASNLALGAVLSQ-----NGHPISFISRTLNDHELNYSAIEKELLAIVWATKTFRHYLL 566
Query: 1438 PIDVMVFSHYDIIK--HMLSKPILHSRIGKWALALTEYSLTYAPLKAIKGQAVADFLA 1493
++ S + ++ H L +P +++ +W + L+EY +K K +VAD L+
Sbjct: 567 GRQFLIASDHQPLRWLHNLKEP--GAKLERWRVRLSEYQFKIDYIKG-KENSVADALS 621
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 177 bits (448), Expect = 4e-43
Identities = 127/427 (29%), Positives = 215/427 (49%), Gaps = 27/427 (6%)
Query: 1083 KIKEEIERLLKCKFIRTARYVD----WLANVVPVIKKNGKMRVCIDFRDLNAATPKDEYH 1138
+++ +I+ LL+ IR + W+ P + R+ +DF+ LN T D Y
Sbjct: 138 EVERQIDELLQDGIIRPSNSPYNSPIWIVPKKPKPNGEKQYRMVVDFKRLNTVTIPDTYP 197
Query: 1139 MPIAEMMVDSAAGHEYLSLLDGYSGYNQIFIA*EDVSKTAFRCPGALGTYEWVVMPFGLK 1198
+P + S +Y + LD SG++QI + D+ KTAF G YE++ +PFGLK
Sbjct: 198 IPDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLN--GKYEFLRLPFGLK 255
Query: 1199 NAGLKNAGATYQRVMNTIFHDFIETFMQVYIDDIVVKSPSRDGHLLHLRKSFERMRKYGL 1258
NA A +QR+++ I + I VYIDDI+V S D H +LR + K L
Sbjct: 256 NAP-----AIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANL 310
Query: 1259 KMNPLKCAFGVIAGDFLGFVVHKKGIEINKNKAKAILDTSPPTSKKQLQSLLGKINFLRR 1318
++N K F +FLG++V GI+ + K +AI + PPTS K+L+ LG ++ R+
Sbjct: 311 QVNLEKSHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYRK 370
Query: 1319 FITNLSEKTKSFSSLLRLKKEDV---------FRWEAEHQKAFDELKKYLSNPPVMIPPI 1369
FI + ++ K ++L R ++ + ++F++LK L + ++ P
Sbjct: 371 FIQDYAKVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPC 430
Query: 1370 KGRPMKLYISATDETIGSMLAQEDEDGNERAIFYLSRVLNGAETRYTMIEKLCLCLYFSC 1429
+P L A++ IG++L+Q+D+ G +R I Y+SR LN E Y IEK L + +S
Sbjct: 431 FTKPFHLTTDASNWAIGAVLSQDDQ-GRDRPIAYISRSLNKTEENYATIEKEMLAIIWSL 489
Query: 1430 VKLKYYIKPI-DVMVFSHYDIIKHMLSKPILHSRIGKWALALTEYS--LTYAPLKAIKGQ 1486
L+ Y+ + V++ + + L ++++ +W + EY+ L Y P K+
Sbjct: 490 DNLRAYLYGAGTIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELIYKPGKS---N 546
Query: 1487 AVADFLA 1493
VAD L+
Sbjct: 547 VVADALS 553
>POL_SFV3L (P27401) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1157
Score = 174 bits (441), Expect = 2e-42
Identities = 220/943 (23%), Positives = 377/943 (39%), Gaps = 103/943 (10%)
Query: 1067 KPVKQLPRRFHPDVLVKIKEEIERLLKCKFIRTARYVDWLANVVPVIKKNGKMRVCIDFR 1126
+P KQ P +P I+ I LLK + + V PV K +GK R+ +D+R
Sbjct: 177 RPQKQYP--INPKAKASIQTVINDLLKQGVLIQQNSI-MNTPVYPVPKPDGKWRMVLDYR 233
Query: 1127 DLNAATPKDEYHMPIAEMMVDSAAGHEYLSLLDGYSGYNQIFIA*EDVSKTAFRCPGALG 1186
++N P + ++ S +Y + LD +G+ I E TAF G
Sbjct: 234 EVNKTIPLIAAQNQHSAGILSSIFRGKYKTTLDLSNGFWAHSITPESYWLTAFTWLGQ-- 291
Query: 1187 TYEWVVMPFGLKNAGLKNAGATYQRVMNTIFHDFIETFMQVYIDDIVVKSPSRDGHLLHL 1246
Y W +P G N+ A + + + + +QVY+DDI + HL L
Sbjct: 292 QYCWTRLP-----QGFLNSPALFTADVVDLLKEVPN--VQVYVDDIYISHDDPREHLEQL 344
Query: 1247 RKSFERMRKYGLKMNPLKCAFGVIAGDFLGFVVHKKGIEINKNKAKAILDTSPPTSKKQL 1306
K F + G ++ K +FLGF + K+G + + + +L+ +PP KQL
Sbjct: 345 EKVFSLLLNAGYVVSLKKSEIAQHEVEFLGFNITKEGRGLTETFKQKLLNITPPRDLKQL 404
Query: 1307 QSLLGKINFLRRFITNLSEKTKSFSSLLRLKKEDVFRWEAEHQKAFDELKKYLS------ 1360
QS+LG +NF R FI N SE K +++ W ++ + + L+
Sbjct: 405 QSILGLLNFARNFIPNFSELVKPLYNIIATANGKYITWTTDNSQQLQNIISMLNSAENLE 464
Query: 1361 --NPPV-MIPPIKGRPMKLYISATDETIGSMLAQEDEDGNERAIFYLSRVLNGAETRYTM 1417
NP V +I + P YI +E +R I YL+ V AE ++T
Sbjct: 465 ERNPEVRLIMKVNTSPSAGYIRFYNEFA------------KRPIMYLNYVYTKAEVKFTN 512
Query: 1418 IEKLCLCLYFSCVKLKYYIKPIDVMVFSHYDIIKHMLSKPI-----LHSRIGKWALALTE 1472
EKL ++ +K +++V+S + + P+ L R W L +
Sbjct: 513 TEKLLTTIHKGLIKALDLGMGQEILVYSPIVSMTKIQKTPLPERKALPIRWITWMSYLED 572
Query: 1473 YSLTYAPLKAIKGQAVADFLADHTVPKEIVTYVGIQPWKLYFDGS---------SHKNGT 1523
+ + K + + D + K + + Y DGS SH G
Sbjct: 573 PRIQFHYDKTLPELQQVPTVTDDIIAK--IKHPSEFSMVFYTDGSAIKHPNVNKSHNAGM 630
Query: 1524 GIGMFIMSPQGAPTKFKFRIEGNCSNNEVEYEALISGLEILLALGAKNVVIKGDSELVVK 1583
GI P+ G+ + E A+ + L + V+I DS V +
Sbjct: 631 GIAQVQFKPEFTVINTWSIPLGDHTAQLAEVAAVEFACKKALKIDGP-VLIVTDSFYVAE 689
Query: 1584 QLTKEY------------KCVSEHLAKYYVKANNLLAKFNEIGI---GHVPRIE--NQEA 1626
+ KE K +H++K+ A+ + K + I I GH P + E
Sbjct: 690 SVNKELPYWQSNGFFNNKKKPLKHVSKWKSIADCIQLKPDIIIIHEKGHQPTASTFHTEG 749
Query: 1627 NELAQIASGYMVDKLKLTELIKIKEKLSPSDLDIMCIDNLTSNDWRKPIVEYLQNPVGST 1686
N LA DKL + +PS LD +D L + K ++ Q
Sbjct: 750 NNLA--------DKLATQGSYVVNINTTPS-LDAE-LDQLLQGQYPKGFPKHYQ------ 793
Query: 1687 DRKVKYRALSYTILGNELFKKNINGTLLKCLSENDAFMAVSAAHDGLCGAHQAGAKMKWI 1746
Y + ++ NG + ++D + AH+ G ++
Sbjct: 794 ----------YQLENGQVMVTRPNGKRI-IPPKSDRPQIILQAHN----IAHTGRDSTFL 838
Query: 1747 LFRQGLYWPTIMKDCIEYARGCQDCQKHSGIQHVPASELHSIIKPWPFRGWAIDLIGEIH 1806
+WP + KD ++ R C+ C + L PF + ID IG +
Sbjct: 839 KVSSKYWWPNLRKDVVKVIRQCKQCLVTNAATLAAPPILRPERPVKPFDKFFIDYIGPLP 898
Query: 1807 PASSKQHKYIIVAVDYFTKWVEAIPLQNVTQETVIEFIQNHIVYRFGLPESITTDQGTVF 1866
P++ H ++V VD T +V P + + ++ + +++ +P+ I +DQG F
Sbjct: 899 PSNGYLH--VLVVVDSMTGFVWLYPTKAPSTSATVKAL--NMLTSIAVPKVIHSDQGAAF 954
Query: 1867 VGRKVAAFAESWGIKLLTSTPYYAQANGQVEAANKILISLIKKHVGRKPKRWHESLSQVL 1926
A +A++ GI+L STPY+ Q++G+VE N + L+ K + +P +W++ L V
Sbjct: 955 TSATFADWAKNKGIQLEFSTPYHPQSSGKVERKNSDIKRLLTKLLVGRPAKWYDLLPVVQ 1014
Query: 1927 WAYRNSPTEATGTTPFRLAYGQEAVLPAEVYLQSCRIQRQEEI 1969
A NS + ++ TP +L +G ++ P + + R+EE+
Sbjct: 1015 LALNNSYSPSSKYTPHQLLFGIDSNTPF-ANSDTLDLSREEEL 1056
>POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)]
Length = 886
Score = 167 bits (423), Expect = 3e-40
Identities = 210/885 (23%), Positives = 354/885 (39%), Gaps = 98/885 (11%)
Query: 1109 VVPVIKKNGKMRVCIDFRDLNAATPKDEYHMPIAEMMVDSAAGHEYLSLLDGYSGYNQIF 1168
V PV K +G+ R+ +D+R++N P + ++ + +Y + LD +G+
Sbjct: 5 VYPVPKPDGRWRMVLDYREVNKTIPLTAAQNQHSAGILATIVRQKYKTTLDLANGFWAHP 64
Query: 1169 IA*EDVSKTAFRCPGALGTYEWVVMPFGLKNAGLKNAGATYQRVMNTIFHDFIETFMQVY 1228
I E TAF G Y W +P G N+ A + + + + +QVY
Sbjct: 65 ITPESYWLTAFTWQGK--QYCWTRLP-----QGFLNSPALFTADVVDLLKEIPN--VQVY 115
Query: 1229 IDDIVVKSPSRDGHLLHLRKSFERMRKYGLKMNPLKCAFGVIAGDFLGFVVHKKGIEINK 1288
+DDI + H+ L K F+ + + G ++ K G +FLGF + K+G +
Sbjct: 116 VDDIYLSHDDPKEHVQQLEKVFQILLQAGYVVSLKKSEIGQKTVEFLGFNITKEGRGLTD 175
Query: 1289 NKAKAILDTSPPTSKKQLQSLLGKINFLRRFITNLSEKTKSFSSLLRLKKEDVFRWEAEH 1348
+L+ +PP KQLQS+LG +NF R FI N +E + +L+ K W E+
Sbjct: 176 TFKTKLLNITPPKDLKQLQSILGLLNFARNFIPNFAELVQPLYNLIASAKGKYIEWSEEN 235
Query: 1349 QKAFD---ELKKYLSNPPVMIPPIKGRPMKLYISATDETIGSMLAQEDEDGNERAIFYLS 1405
K + E SN +P +L I + +E G ++ I YL+
Sbjct: 236 TKQLNMVIEALNTASNLEERLP-----EQRLVIKVNTSPSAGYVRYYNETG-KKPIMYLN 289
Query: 1406 RVLNGAETRYTMIEKLCLCLYFSCVKLKYYIKPIDVMVFSHYDIIKHMLSKPI-----LH 1460
V + AE +++M+EKL ++ + +K +++V+S + + P+ L
Sbjct: 290 YVFSKAELKFSMLEKLLTTMHKALIKAMDLAMGQEILVYSPIVSMTKIQKTPLPERKALP 349
Query: 1461 SRIGKWALALTE------YSLTYAPLKAIKGQAVADFLADHTVPKEIVTYVGIQPWKLYF 1514
R W L + Y T LK I + + + K Y G+ Y
Sbjct: 350 IRWITWMTYLEDPRIQFHYDKTLPELKHIPDV----YTSSQSPVKHPSQYEGV----FYT 401
Query: 1515 DGSSHK-------NGTGIGMFIMSPQGAPTKFKFRIEGNCS---NNEVEYEALISGLEIL 1564
DGS+ K N G+G+ A K ++++ S N A I+ +E
Sbjct: 402 DGSAIKSPDPTKSNNAGMGIV-----HATYKPEYQVLNQWSIPLGNHTAQMAEIAAVEFA 456
Query: 1565 LALGAK---NVVIKGDSELVVKQLTKEY------------KCVSEHLAKYYVKANNLLAK 1609
K V++ DS V + KE K +H++K+ A L K
Sbjct: 457 CKKALKIPGPVLVITDSFYVAESANKELPYWKSNGFVNNKKKPLKHISKWKSIAECLSMK 516
Query: 1610 FNEIGIGHVPRIENQEANELAQIASGYMVDKLKLTELIKIKEKLSPSDLDIMCIDNLTSN 1669
+I I H I Q + + + DKL + +LD +D L
Sbjct: 517 -PDITIQHEKGISLQ--IPVFILKGNALADKLATQGSYVVNCNTKKPNLDAE-LDQLLQG 572
Query: 1670 DWRKPIVEYLQNPVGSTDRKVKY-RALSYTILGNELFKKNINGTLLKCLSENDAFMAVSA 1728
+ K + Q D KVK R I+ + ++ I A +
Sbjct: 573 HYIKGYPK--QYTYFLEDGKVKVSRPEGVKIIPPQSDRQKI------------VLQAHNL 618
Query: 1729 AHDGLCGAHQAGAKMKWILFRQGLYWPTIMKDCIEYARGCQDCQKHSGIQHVPASELHSI 1788
AH G A + W WP + KD ++ CQ C + L
Sbjct: 619 AHTGREATLLKIANLYW--------WPNMRKDVVKQLGRCQQCLITNASNKASGPILRPD 670
Query: 1789 IKPWPFRGWAIDLIGEIHPASSKQHKYIIVAVDYFTKWVEAIPLQNVTQETVIEFIQNHI 1848
PF + ID IG + P S+ + Y++V VD T + P + + ++ + ++
Sbjct: 671 RPQKPFDKFFIDYIGPLPP--SQGYLYVLVVVDGMTGFTWLYPTKAPSTSATVKSL--NV 726
Query: 1849 VYRFGLPESITTDQGTVFVGRKVAAFAESWGIKLLTSTPYYAQANGQVEAANKILISLIK 1908
+ +P+ I +DQG F A +A+ GI L STPY+ Q+ +VE N + L+
Sbjct: 727 LTSIAIPKVIHSDQGAAFTSSTFAEWAKERGIHLEFSTPYHPQSGSKVERKNSDIKRLLT 786
Query: 1909 KHVGRKPKRWHESLSQVLWAYRNSPTEATGTTPFRLAYGQEAVLP 1953
K + +P +W++ L V A N+ + TP +L +G ++ P
Sbjct: 787 KLLVGRPTKWYDLLPVVQLALNNTYSPVLKYTPHQLLFGIDSNTP 831
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 142 bits (359), Expect = 7e-33
Identities = 136/526 (25%), Positives = 243/526 (45%), Gaps = 45/526 (8%)
Query: 987 KPVVDVSKKMEAQDPLEEVD-LGDGLEKRPTYISALIDPELKDRMVKLLKEFKDCFAWDY 1045
+PV + K+E +PLEE+ L +G +R + I + ++ +LL+ K C
Sbjct: 172 EPVNISTNKIE--NPLEEIAILSEG--RRLSEEKLFITQQRMQKIEELLE--KVCSENPL 225
Query: 1046 DEMPGLNRELVELKLPIKEDKKPVKQLPRRFHPDVLVKIKEEIERLLKCKFIRTARYVDW 1105
D P ++ ++ + + + K +K P ++ P + ++I+ LL K I+ ++
Sbjct: 226 D--PNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKS--- 280
Query: 1106 LANVVPVI-------KKNGKMRVCIDFRDLNAATPKDEYHMPIAEMMVDSAAGHEYLSLL 1158
++ P K+ GK R+ ++++ +N AT D Y++P + ++ G + S
Sbjct: 281 -PHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSF 339
Query: 1159 DGYSGYNQIFIA*EDVSKTAFRCPGALGTYEWVVMPFGLKNAGLKNAGATYQRVMNTIFH 1218
D SG+ Q+ + E TAF CP G YEW V+PF GLK A + +QR M+ F
Sbjct: 340 DCKSGFWQVLLDQESRPLTAFTCP--QGHYEWNVVPF-----GLKQAPSIFQRHMDEAFR 392
Query: 1219 DFIETFMQVYIDDIVVKSPSRDGHLLHLRKSFERMRKYGLKMNPLKCAFGVIAGDFLGFV 1278
F F VY+DDI+V S + + HLLH+ ++ ++G+ ++ K +FLG
Sbjct: 393 VF-RKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLE 451
Query: 1279 VHKKGIEINKNKAKAILDTSPPT--SKKQLQSLLGKINFLRRFITNLSEKTKSFSSLLRL 1336
+ +G + ++ P T KKQLQ LG + + +I L++ K + +L
Sbjct: 452 I-DEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQA--KL 508
Query: 1337 KKEDVFRWEAEHQKAFDELKKYLSNPPVMIPPIKGRPMKLYISATDETIGSML-AQEDED 1395
K+ +RW E ++KK L P + P+ + + A+D+ G ML A + +
Sbjct: 509 KENVPWRWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINE 568
Query: 1396 GNERAIF--YLSRVLNGAETRYTMIEKLCLCLYFSCVKLKYYIKPIDVMV---FSHYDII 1450
G + Y S AE Y +K L + + K Y+ P+ ++ +H+
Sbjct: 569 GTNTELICRYASGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSF 628
Query: 1451 KHMLSKPILHSRIG---KWALALTEYSLTYAPLKAIKGQAVADFLA 1493
++ K S++G +W L+ YS +K ADFL+
Sbjct: 629 VNLNYKG--DSKLGRNIRWQAWLSHYSFDVEHIKGTDNH-FADFLS 671
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 141 bits (356), Expect = 2e-32
Identities = 105/408 (25%), Positives = 201/408 (48%), Gaps = 30/408 (7%)
Query: 1084 IKEEIERLLKCKFIRTARYVDWLANVVPVIKK------NGKMRVCIDFRDLNAATPKDEY 1137
+ E+++LLK IR +R + + V KK N R+ IDFR LN T D Y
Sbjct: 197 VNNEVKQLLKDGIIRPSRS-PYNSPTWVVDKKGTDAFGNPNKRLVIDFRKLNEKTIPDRY 255
Query: 1138 HMPIAEMMVDSAAGHEYLSLLDGYSGYNQIFIA*EDVSKTAFRCPGALGTYEWVVMPFGL 1197
MP M++ + ++ + LD SGY+QI++A D KT+F G G YE+ +PFGL
Sbjct: 256 PMPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFSVNG--GKYEFCRLPFGL 313
Query: 1198 KNAGLKNAGATYQRVMNTIFHDFIETFMQVYIDDIVVKSPSRDGHLLHLRKSFERMRKYG 1257
+NA + +QR ++ + + I VY+DD+++ S + H+ H+ + +
Sbjct: 314 RNAS-----SIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHIDTVLKCLIDAN 368
Query: 1258 LKMNPLKCAFGVIAGDFLGFVVHKKGIEINKNKAKAILDTSPPTSKKQLQSLLGKINFLR 1317
++++ K F + ++LGF+V K G + + K KAI + P +++S LG ++ R
Sbjct: 369 MRVSQEKTRFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRSFLGLASYYR 428
Query: 1318 RFITNLSEKTKSFSSLLR---------LKKEDVFRWEAEHQKAFDELKKYLSNPPVMIP- 1367
FI + + + + +L+ + K+ + + AF L+ L++ V++
Sbjct: 429 VFIKDFAAIARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNILASEDVILKY 488
Query: 1368 PIKGRPMKLYISATDETIGSMLAQEDEDGNERAIFYLSRVLNGAETRYTMIEKLCLCLYF 1427
P +P L A+ IG++L+QE R I +SR L E Y E+ L + +
Sbjct: 489 PDFKKPFDLTTDASASGIGAVLSQEG-----RPITMISRTLKQPEQNYATNERELLAIVW 543
Query: 1428 SCVKLKYYI-KPIDVMVFSHYDIIKHMLSKPILHSRIGKWALALTEYS 1474
+ KL+ ++ ++ +F+ + + ++ +++I +W + +++
Sbjct: 544 ALGKLQNFLYGSREINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQHN 591
Score = 38.9 bits (89), Expect = 0.15
Identities = 43/206 (20%), Positives = 86/206 (40%), Gaps = 18/206 (8%)
Query: 1707 KNINGTLLKCLSENDAFMAVSAAHDGLCGAHQAGAK-MKWILFRQGLYWPTIMKDCIEYA 1765
++ +L +N+ V+A H+ AH+A + +K +L + Y+P + E
Sbjct: 723 RHCKNVVLDITDKNEQIEIVTAEHNR---AHRAAQENIKQVL--RDYYFPKMGSLAKEVV 777
Query: 1766 RGCQDCQKHSGIQHVPASELHSIIKPWPFRGWAIDLIGEIHPASSKQHKYIIVAVDYFTK 1825
C+ C + +H EL P + G + + S K + +D F+K
Sbjct: 778 ANCRVCTQAKYDRHPKKQELGETPIP-SYTGEMVHI-----DIFSTDRKLFLTCIDKFSK 831
Query: 1826 WVEAIPLQNVTQETVIEFIQN--HIVYRFGLPESITTDQGTVFVGRKVAAFAE-SWGIKL 1882
+ P V T+++ I+ F +++ D F V + + S+GI +
Sbjct: 832 YAIVQP---VVSRTIVDITAPLLQIINLFPNIKTVYCDNEPAFNSETVTSMLKNSFGIDI 888
Query: 1883 LTSTPYYAQANGQVEAANKILISLIK 1908
+ + P ++ +NGQVE + L + +
Sbjct: 889 VNAPPLHSSSNGQVERFHSTLAEIAR 914
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 141 bits (355), Expect = 2e-32
Identities = 135/526 (25%), Positives = 243/526 (45%), Gaps = 45/526 (8%)
Query: 987 KPVVDVSKKMEAQDPLEEVD-LGDGLEKRPTYISALIDPELKDRMVKLLKEFKDCFAWDY 1045
+PV + K+E +PLEE+ L +G +R + I + ++ +LL+ K C
Sbjct: 172 EPVNISTNKIE--NPLEEIAILSEG--RRLSEEKLFITQQRMQKIEELLE--KVCSENPL 225
Query: 1046 DEMPGLNRELVELKLPIKEDKKPVKQLPRRFHPDVLVKIKEEIERLLKCKFIRTARYVDW 1105
D P ++ ++ + + + K +K P ++ P + ++I+ LL K I+ ++
Sbjct: 226 D--PNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKS--- 280
Query: 1106 LANVVPVI-------KKNGKMRVCIDFRDLNAATPKDEYHMPIAEMMVDSAAGHEYLSLL 1158
++ P K+ GK R+ ++++ +N AT D Y++P + ++ G + S
Sbjct: 281 -PHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSF 339
Query: 1159 DGYSGYNQIFIA*EDVSKTAFRCPGALGTYEWVVMPFGLKNAGLKNAGATYQRVMNTIFH 1218
D SG+ Q+ + E TAF CP G YEW V+PF GLK A + +QR M+ F
Sbjct: 340 DCKSGFWQVLLDQESRPLTAFTCP--QGHYEWNVVPF-----GLKQAPSIFQRHMDEAFR 392
Query: 1219 DFIETFMQVYIDDIVVKSPSRDGHLLHLRKSFERMRKYGLKMNPLKCAFGVIAGDFLGFV 1278
F F VY+DDI+V S + + HLLH+ ++ ++G+ ++ K +FLG
Sbjct: 393 VF-RKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLE 451
Query: 1279 VHKKGIEINKNKAKAILDTSPPT--SKKQLQSLLGKINFLRRFITNLSEKTKSFSSLLRL 1336
+ +G + ++ P T KKQLQ LG + + +I L++ K + +L
Sbjct: 452 I-DEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQA--KL 508
Query: 1337 KKEDVFRWEAEHQKAFDELKKYLSNPPVMIPPIKGRPMKLYISATDETIGSML-AQEDED 1395
K+ ++W E ++KK L P + P+ + + A+D+ G ML A + +
Sbjct: 509 KENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINE 568
Query: 1396 GNERAIF--YLSRVLNGAETRYTMIEKLCLCLYFSCVKLKYYIKPIDVMV---FSHYDII 1450
G + Y S AE Y +K L + + K Y+ P+ ++ +H+
Sbjct: 569 GTNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSF 628
Query: 1451 KHMLSKPILHSRIG---KWALALTEYSLTYAPLKAIKGQAVADFLA 1493
++ K S++G +W L+ YS +K ADFL+
Sbjct: 629 VNLNYKG--DSKLGRNIRWQAWLSHYSFDVEHIKGTDNH-FADFLS 671
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 139 bits (351), Expect = 6e-32
Identities = 134/526 (25%), Positives = 243/526 (45%), Gaps = 45/526 (8%)
Query: 987 KPVVDVSKKMEAQDPLEEVD-LGDGLEKRPTYISALIDPELKDRMVKLLKEFKDCFAWDY 1045
+PV + K+E +PL+E+ L +G +R + I + ++ +LL+ K C
Sbjct: 172 EPVNISTNKIE--NPLKEIAILSEG--RRLSEEKLFITQQRMQKIEELLE--KVCSENPL 225
Query: 1046 DEMPGLNRELVELKLPIKEDKKPVKQLPRRFHPDVLVKIKEEIERLLKCKFIRTARYVDW 1105
D P ++ ++ + + + K +K P ++ P + ++I+ LL K I+ ++
Sbjct: 226 D--PNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKS--- 280
Query: 1106 LANVVPVI-------KKNGKMRVCIDFRDLNAATPKDEYHMPIAEMMVDSAAGHEYLSLL 1158
++ P K+ GK R+ ++++ +N AT D Y++P + ++ G + S
Sbjct: 281 -PHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSF 339
Query: 1159 DGYSGYNQIFIA*EDVSKTAFRCPGALGTYEWVVMPFGLKNAGLKNAGATYQRVMNTIFH 1218
D SG+ Q+ + E TAF CP G YEW V+PF GLK A + +QR M+ F
Sbjct: 340 DCKSGFWQVLLDQESRPLTAFTCP--QGHYEWNVVPF-----GLKQAPSIFQRHMDEAFR 392
Query: 1219 DFIETFMQVYIDDIVVKSPSRDGHLLHLRKSFERMRKYGLKMNPLKCAFGVIAGDFLGFV 1278
F F VY+DDI+V S + + HLLH+ ++ ++G+ ++ K +FLG
Sbjct: 393 VF-RKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLE 451
Query: 1279 VHKKGIEINKNKAKAILDTSPPT--SKKQLQSLLGKINFLRRFITNLSEKTKSFSSLLRL 1336
+ +G + ++ P T KKQLQ LG + + +I L++ K + +L
Sbjct: 452 I-DEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQA--KL 508
Query: 1337 KKEDVFRWEAEHQKAFDELKKYLSNPPVMIPPIKGRPMKLYISATDETIGSML-AQEDED 1395
K+ ++W E ++KK L P + P+ + + A+D+ G ML A + +
Sbjct: 509 KENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINE 568
Query: 1396 GNERAIF--YLSRVLNGAETRYTMIEKLCLCLYFSCVKLKYYIKPIDVMV---FSHYDII 1450
G + Y S AE Y +K L + + K Y+ P+ ++ +H+
Sbjct: 569 GTNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSF 628
Query: 1451 KHMLSKPILHSRIG---KWALALTEYSLTYAPLKAIKGQAVADFLA 1493
++ K S++G +W L+ YS +K ADFL+
Sbjct: 629 VNLNYKG--DSKLGRNIRWQAWLSHYSFDVEHIKGTDNH-FADFLS 671
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 680
Score = 139 bits (350), Expect = 8e-32
Identities = 135/526 (25%), Positives = 241/526 (45%), Gaps = 45/526 (8%)
Query: 987 KPVVDVSKKMEAQDPLEEVD-LGDGLEKRPTYISALIDPELKDRMVKLLKEFKDCFAWDY 1045
+PV + K+E +PLEE+ L +G +R + I + + +LL+ K C
Sbjct: 173 EPVNISTNKIE--NPLEEIAILSEG--RRLSEEKLFITQQRMQKTEELLE--KVCSENPL 226
Query: 1046 DEMPGLNRELVELKLPIKEDKKPVKQLPRRFHPDVLVKIKEEIERLLKCKFIRTARYVDW 1105
D P ++ ++ + + + K +K P ++ P + ++I+ LL K I+ ++
Sbjct: 227 D--PNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKS--- 281
Query: 1106 LANVVPVIKKN-------GKMRVCIDFRDLNAATPKDEYHMPIAEMMVDSAAGHEYLSLL 1158
++ P N G R+ ++++ +N AT D Y++P + ++ G + S
Sbjct: 282 -PHMAPAFLVNNEAENGRGNKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSF 340
Query: 1159 DGYSGYNQIFIA*EDVSKTAFRCPGALGTYEWVVMPFGLKNAGLKNAGATYQRVMNTIFH 1218
D SG+ Q+ + E TAF CP G YEW V+PF GLK A + +QR M+ F
Sbjct: 341 DCKSGFWQVLLDQESRPLTAFTCP--QGHYEWNVVPF-----GLKQAPSIFQRHMDEAFR 393
Query: 1219 DFIETFMQVYIDDIVVKSPSRDGHLLHLRKSFERMRKYGLKMNPLKCAFGVIAGDFLGFV 1278
F F VY+DDIVV S + + HLLH+ ++ ++G+ ++ K +FLG
Sbjct: 394 VF-RKFCCVYVDDIVVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLE 452
Query: 1279 VHKKGIEINKNKAKAILDTSPPT--SKKQLQSLLGKINFLRRFITNLSEKTKSFSSLLRL 1336
+ +G + ++ P T KKQLQ LG + + +I NL++ + + +L
Sbjct: 453 I-DEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPNLAQMRQPLQA--KL 509
Query: 1337 KKEDVFRWEAEHQKAFDELKKYLSNPPVMIPPIKGRPMKLYISATDETIGSML-AQEDED 1395
K+ ++W E ++KK L P + P+ + + A+D+ G ML A + +
Sbjct: 510 KENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINE 569
Query: 1396 GNERAIF--YLSRVLNGAETRYTMIEKLCLCLYFSCVKLKYYIKPIDVMV---FSHYDII 1450
G + Y S AE Y +K L + + K Y+ P+ ++ +H+
Sbjct: 570 GTNTELICRYRSGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSF 629
Query: 1451 KHMLSKPILHSRIG---KWALALTEYSLTYAPLKAIKGQAVADFLA 1493
++ K S++G +W L+ YS +K ADFL+
Sbjct: 630 VNLNYKG--DSKLGRNIRWQAWLSHYSFDVEHIKGTDNH-FADFLS 672
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 674
Score = 137 bits (346), Expect = 2e-31
Identities = 118/463 (25%), Positives = 212/463 (45%), Gaps = 36/463 (7%)
Query: 1049 PGLNRELVELKLPIKEDKKPVKQLPRRFHPDVLVKIKEEIERLLKCKFIRTARYVDWLAN 1108
P ++ ++ + + + K +K P ++ P + ++I+ LL K I+ ++ +
Sbjct: 222 PNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKS----PH 277
Query: 1109 VVPVI-------KKNGKMRVCIDFRDLNAATPKDEYHMPIAEMMVDSAAGHEYLSLLDGY 1161
+ P K+ GK R+ ++++ +N AT D Y+ P + ++ G + S D
Sbjct: 278 MAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNPPNKDELLTLIRGKKIFSSFDCK 337
Query: 1162 SGYNQIFIA*EDVSKTAFRCPGALGTYEWVVMPFGLKNAGLKNAGATYQRVMNTIFHDFI 1221
SG+ Q+ + E TAF CP G YEW V+PF GLK A + +QR M+ F F
Sbjct: 338 SGFWQVLLDQESRPLTAFTCP--QGHYEWNVVPF-----GLKQAPSIFQRHMDEAFRVF- 389
Query: 1222 ETFMQVYIDDIVVKSPSRDGHLLHLRKSFERMRKYGLKMNPLKCAFGVIAGDFLGFVVHK 1281
F VY+DDI+V S + + HLLH+ ++ ++G+ ++ K +FLG +
Sbjct: 390 RKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEI-D 448
Query: 1282 KGIEINKNKAKAILDTSPPT--SKKQLQSLLGKINFLRRFITNLSEKTKSFSSLLRLKKE 1339
+G + ++ P T KKQLQ LG + + +I L++ K + +LK+
Sbjct: 449 EGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQA--KLKEN 506
Query: 1340 DVFRWEAEHQKAFDELKKYLSNPPVMIPPIKGRPMKLYISATDETIGSML-AQEDEDGNE 1398
++W E ++KK L P + P+ + + A+D+ G ML A + +G
Sbjct: 507 VPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTN 566
Query: 1399 RAIF--YLSRVLNGAETRYTMIEKLCLCLYFSCVKLKYYIKPIDVMV---FSHYDIIKHM 1453
+ Y S AE Y +K L + + K Y+ P+ ++ +H+ ++
Sbjct: 567 TELICRYASGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNL 626
Query: 1454 LSKPILHSRIG---KWALALTEYSLTYAPLKAIKGQAVADFLA 1493
K S++G +W L+ YS +K ADFL+
Sbjct: 627 NYKG--DSKLGRNIRWQAWLSHYSFDVEHIKGTDNH-FADFLS 666
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 659
Score = 121 bits (303), Expect = 2e-26
Identities = 108/461 (23%), Positives = 198/461 (42%), Gaps = 26/461 (5%)
Query: 1049 PGLNRELVELKLPIKEDKKPVKQLPRRFHPDVLVKIKEEIERLLKCKFIRTARYVDWLAN 1108
P +++ + + + + K VK P + P + +I+ LL+ K I+ ++
Sbjct: 209 PEKSKQWMTATIELIDPKTVVKVKPMSYSPSDREEFDRQIKELLELKVIKPSKSTHMSPA 268
Query: 1109 VV---PVIKKNGKMRVCIDFRDLNAATPKDEYHMPIAEMMVDSAAGHEYLSLLDGYSGYN 1165
+ ++ GK R+ ++++ +N AT D +++P + ++ G + S D SG
Sbjct: 269 FLVENEAERRRGKKRMVVNYKAMNKATKGDAHNLPNKDELLTLVRGKKIYSSFDCKSGLW 328
Query: 1166 QIFIA*EDVSKTAFRCPGALGTYEWVVMPFGLKNAGLKNAGATYQRVMNTIFHDFIETFM 1225
Q+ + E TAF CP G Y+W V+PF GLK A + + + + +
Sbjct: 329 QVLLDKESQLLTAFTCP--QGHYQWNVVPF-----GLKQAPSIFPKTYANSHSNQYSKYC 381
Query: 1226 QVYIDDIVV-KSPSRDGHLLHLRKSFERMRKYGLKMNPLKCAFGVIAGDFLGFVVHKKGI 1284
VY+DDI+V + R H +H+ R K G+ ++ K +FLG + +G
Sbjct: 382 CVYVDDILVFSNTGRKEHYIHVLNILRRCEKLGIILSKKKAQLFKEKINFLGLEI-DQGT 440
Query: 1285 EINKNKAKAILDTSPP--TSKKQLQSLLGKINFLRRFITNLSEKTKSFSSLLRLKKEDVF 1342
+N + P KKQLQ LG + + +I L+ K S +LK++ +
Sbjct: 441 HCPQNHILEHIHKFPDRIEDKKQLQRFLGILTYASDYIPKLASIRKPLQS--KLKEDSTW 498
Query: 1343 RWEAEHQKAFDELKKYLSNPPVMIPPIKGRPMKLYISATDETIGSMLAQEDEDGNERAIF 1402
W + ++KK L + P + P + + A++E G +L + + +E
Sbjct: 499 TWNDTDSQYMAKIKKNLKSFPKLYHPEPNDKLVIETDASEEFWGGIL-KAIHNSHEYICR 557
Query: 1403 YLSRVLNGAETRYTMIEKLCLCLYFSCVKLKYYIKPIDVMV------FSHYDIIKHMLSK 1456
Y S AE Y EK L + K Y+ P ++ F+H+ + L
Sbjct: 558 YASGSFKAAERNYHSNEKELLAVIRVIKKFSIYLTPSRFLIRTDNKNFTHF--VNINLKG 615
Query: 1457 PILHSRIGKWALALTEYSLTYAPLKAIKGQAVADFLADHTV 1497
R+ +W + L++Y + K ADFL ++T+
Sbjct: 616 DRKQGRLVRWQMWLSQYDFDVEHIAGTK-NVFADFLQENTL 655
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 666
Score = 120 bits (301), Expect = 4e-26
Identities = 102/387 (26%), Positives = 170/387 (43%), Gaps = 22/387 (5%)
Query: 1114 KKNGKMRVCIDFRDLNAATPKDEYHMPIAEMMVDSAAGHEYLSLLDGYSGYNQIFIA*ED 1173
++ GK R+ ++++ +N AT D +++P + ++ G S D SG+ Q+ + E
Sbjct: 288 RRRGKKRMVVNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKSGFWQVVLDEES 347
Query: 1174 VSKTAFRCPGALGTYEWVVMPFGLKNAGLKNAGATYQRVMNTIFHDFIETFMQVYIDDIV 1233
TAF CP G ++W V+PF GLK A + +QR M T + + F VY+DDI+
Sbjct: 348 QKLTAFTCP--QGHFQWKVVPF-----GLKQAPSIFQRHMQTALNG-ADKFCMVYVDDII 399
Query: 1234 VKSPSRDGHLLHLRKSFERMRKYGLKMNPLKCAFGVIAGDFLGFVVHKKGIEINKNKAKA 1293
V S S H H+ + + KYG+ ++ K +FLG + KG +N
Sbjct: 400 VFSNSELDHYNHVYAVLKIVEKYGIILSKKKANLFKEKINFLGLEI-DKGTHCPQNHILE 458
Query: 1294 ILDTSPP--TSKKQLQSLLGKINFLRRFITNLSEKTKSFSSLLRLKKEDVFRWEAEHQKA 1351
+ P KK LQ LG + + +I L+E K ++LKK+ + W
Sbjct: 459 NIHKFPDRLEDKKHLQRFLGVLTYAETYIPKLAEIRKPLQ--VKLKKDVTWNWTQSDSDY 516
Query: 1352 FDELKKYLSNPPVMIPPIKGRPMKLYISATDETIGSMLAQEDEDGNERAIFYLSRVLNGA 1411
++KK L + P + P + + A+D G +L DG E Y S A
Sbjct: 517 VKKIKKNLGSFPKLYLPKPEDHLIIETDASDSFWGGVLKARALDGVELICRYSSGSFKQA 576
Query: 1412 ETRYTMIEKLCLCLYFSCVKLKYYIKPIDVMV------FSHYDIIKHMLSKPILHSRIGK 1465
E Y +K L + K Y+ P+ V F+++ ++ L R+ +
Sbjct: 577 EKNYHSNDKELLAVKQVITKFSAYLTPVRFTVRTDNKNFTYF--LRINLKGDSKQGRLVR 634
Query: 1466 WALALTEYSLTYAPLKAIKGQAVADFL 1492
W ++Y L+ +K +AD L
Sbjct: 635 WQNWFSKYQFDVEHLEGVK-NVLADCL 660
>POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;
Protease (EC 3.4.23.-); Reverse transcriptase (EC
2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1886
Score = 112 bits (279), Expect = 1e-23
Identities = 105/396 (26%), Positives = 179/396 (44%), Gaps = 27/396 (6%)
Query: 990 VDVSKKMEAQDPLEEVDLGDGLEKR---PTYISALIDPELKDRMVKLLKEFKDCFAWDYD 1046
++ S+ + + +EE++L + + +D E + LLKE K+ +
Sbjct: 1320 IETSRTTQVANSIEELELSEDEYLNIAASVETPSFLDQEFARKNKDLLKEMKEMKYIGEN 1379
Query: 1047 EMPGLNRELVELKLPI-KEDKKPVKQLPRRFHPDVLVKIKEEIERLLKCKFIR------- 1098
M ++ KL I D K + + + P + +I LL+ K IR
Sbjct: 1380 PMEFWKNNKIKCKLNIINPDIKIMGRPIKHVTPGDEEAMTRQINLLLQMKVIRPSESKHR 1439
Query: 1099 -TARYVDWLANVVPVI--KKNGKMRVCIDFRDLNAATPKDEYHMPIAEMMVDSAAGHEYL 1155
TA V + P+ +K GK R+ +++ LN T D+Y +P ++ +
Sbjct: 1440 STAFIVRSGTEIDPITGKEKKGKERMVFNYKLLNENTESDQYSLPGINTIISKVGRSKIY 1499
Query: 1156 SLLDGYSGYNQIFIA*EDVSKTAFRCPGALGTYEWVVMPFGLKNAGLKNAGATYQRVMNT 1215
S D SG+ Q+ + E V TAF L YEW+VMPFGLKNA A +QR M+
Sbjct: 1500 SKFDLKSGFWQVAMEEESVPWTAFLAGNKL--YEWLVMPFGLKNAP-----AIFQRKMDN 1552
Query: 1216 IFHDFIETFMQVYIDDIVVKSPSRDGHLLHLRKSFERMRKYGLKMNPLKCAFGVIAGDFL 1275
+F E F+ VYIDDI+V S + + H HL + ++ GL ++P K G DFL
Sbjct: 1553 VFKG-TEKFIAVYIDDILVFSETAEQHSQHLYTMLQLCKENGLILSPTKMKIGTPEIDFL 1611
Query: 1276 GFVVHKKGIEINKNKAKAILDTSPP--TSKKQLQSLLGKINFLRRFITNLSEKTKSFSSL 1333
G + I++ + I D S + + ++S LG +++ R +I ++ + +
Sbjct: 1612 GASLGCTKIKLQPHIISKICDFSDEKLATPEGMRSWLGILSYARNYIQDIGKLVQPLRQ- 1670
Query: 1334 LRLKKEDVFRWEAEHQKAFDELKKYLSN-PPVMIPP 1368
++ R E K ++K+ + N P + +PP
Sbjct: 1671 -KMAPTGDKRMNPETWKMVRQIKEKVKNLPDLQLPP 1705
>YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II
Length = 1268
Score = 109 bits (273), Expect = 7e-23
Identities = 96/294 (32%), Positives = 148/294 (49%), Gaps = 21/294 (7%)
Query: 1031 VKLLKEFKDCFAWDYDEMPGLNRELVELKLPIKEDKKPVKQLPRRFHPDVLVKIKEEIER 1090
V L +F + F D + +E E + +E+ PV + R L ++ E+ R
Sbjct: 408 VMLKNDFPEVFK---DGLGLCTKEKAEFRT--EENAVPVFKRARPVPYGSLEAVETELNR 462
Query: 1091 LLKCKFIRTARYVDWLANVVPVIKKN-GKMRVCIDFR--DLNAATPKDEYH-MPIAEMMV 1146
L + I Y W A +V + KK GK+RVC DF+ LNAA KDE+H +P +E +
Sbjct: 463 LQEMGVIVPITYAKWAAPIVVIKKKGTGKIRVCADFKCSGLNAAL-KDEFHPLPTSEDIF 521
Query: 1147 DSAAGHEYLSLLDGYSGYNQIFIA*EDVSKTAFRCPGALGTYEWVVMPFGLKNAGLKNAG 1206
G Y S +D Y Q+ + E+ K A G ++++ M FGLK A
Sbjct: 522 SRLKGTVY-SQIDLKDAYLQVELD-EEAQKLAV-INTHRGIFKYLRMTFGLKPAP----- 573
Query: 1207 ATYQRVMNTIFHDFIETFMQVYIDDIVVKSPSRDGHLLHLRKSFERMRKYGLKMNPLKCA 1266
A++Q++M+ + T + VY DDI++ + S + H LR+ FER ++YG +++ KCA
Sbjct: 574 ASFQKIMDKMVSGL--TGVAVYWDDIIISASSIEEHEKILRELFERFKEYGFRVSAEKCA 631
Query: 1267 FGVIAGDFLGFVVHKKGIEINKNKAKAILDTSPPTSKKQLQSLLGKINFLRRFI 1320
F FLGF V + G + K +AI PT +KQL S LG ++L R +
Sbjct: 632 FAQKQVTFLGF-VDEHGRRPDSKKTEAIRSMKAPTDQKQLASFLGAADWLSRMM 684
Score = 74.3 bits (181), Expect = 3e-12
Identities = 78/304 (25%), Positives = 129/304 (41%), Gaps = 40/304 (13%)
Query: 1654 SPSDLDIMCIDNLTSND-WR-KPIVEYLQNPVGSTDRKVKYRALSYTILGNELFKKNING 1711
S D ++ + L ND W+ KP E ++ + DR +K + + K++
Sbjct: 726 SQKDHEVSSVVKLVRNDSWKPKPSTEIEKHWIRYRDR-LKLIHGCLLLDDRVIVPKSLQK 784
Query: 1712 TLLKCLSENDAFMAVSAAHDGLCGAHQAGAKMKWILFRQGLYWPTIMKDCIEYARGCQDC 1771
+LK L E H G+ Q R ++W + D R C +C
Sbjct: 785 IVLKQLHEG---------HPGIVQMKQKA--------RSFVFWRGLDSDIENMVRHCNNC 827
Query: 1772 QKHSGIQHVPASELHSIIKPWP-----FRGWAIDLIGEIHPASSKQHKYIIVAVDYFTKW 1826
Q++S + V + PWP ++ ID G ++ Y++V VD TK+
Sbjct: 828 QENSKMPRVVP------LNPWPVPEAPWKRIHIDFAGPLNGC------YLLVVVDAKTKY 875
Query: 1827 VEAIPLQNVTQETVIEFIQNHIVYRFGLPESITTDQGTVFVGRKVAAFAESWGIKLLTST 1886
E ++++ T I+ ++ I G PE+I +D GT A +S GI+ TS
Sbjct: 876 AEVKLTRSISAVTTIDLLEE-IFSIHGYPETIISDNGTQLTSHLFAQMCQSHGIEHKTSA 934
Query: 1887 PYYAQANGQVEAANKILISLIKKHVGRKPKRWHESLSQVLWAYRNSPTEA-TGTTPFRLA 1945
YY ++NG E L I K G + L++ L +YRN+P A G+TP
Sbjct: 935 VYYPRSNGAAERFVDTLKRGIAKIKGEGSVN-QQILNKFLISYRNTPHSALNGSTPAECH 993
Query: 1946 YGQE 1949
+G++
Sbjct: 994 FGRK 997
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.333 0.143 0.452
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 225,925,579
Number of Sequences: 164201
Number of extensions: 9227306
Number of successful extensions: 34476
Number of sequences better than 10.0: 150
Number of HSP's better than 10.0 without gapping: 80
Number of HSP's successfully gapped in prelim test: 71
Number of HSP's that attempted gapping in prelim test: 34041
Number of HSP's gapped (non-prelim): 339
length of query: 2087
length of database: 59,974,054
effective HSP length: 126
effective length of query: 1961
effective length of database: 39,284,728
effective search space: 77037351608
effective search space used: 77037351608
T: 11
A: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.5 bits)
S2: 74 (33.1 bits)
Lotus: description of TM0545.3