
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC146664.7 - phase: 0
(2305 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 231 1e-59
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 219 5e-56
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 208 2e-52
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 203 5e-51
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 201 3e-50
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 201 3e-50
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 196 6e-49
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 188 2e-46
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 180 4e-44
POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse transcript... 167 3e-40
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 129 7e-29
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 129 9e-29
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 129 1e-28
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 127 4e-28
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 125 1e-27
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot... 125 2e-27
YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II 120 6e-26
POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;... 119 1e-25
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot... 112 1e-23
POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat pr... 107 5e-22
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 231 bits (590), Expect = 1e-59
Identities = 142/473 (30%), Positives = 251/473 (53%), Gaps = 22/473 (4%)
Query: 1231 KKRVIELIREYVDIFAWSYKDMPGLD-PDVVEHRLPLKPECPPVKQKLRRSHPDM-ALKI 1288
K+R+ L+++Y DI Y + L + +H + K P + S+P ++
Sbjct: 170 KQRLCALLQKYHDI---QYHEGDKLTFTNQTKHTINTKHNLPLYS---KYSYPQAYEQEV 223
Query: 1289 KEEVRKQIDAGFLVTSEYPQWLANIVPVPKKDG-----KIRMCVDYRDLNKASPKDNFPL 1343
+ +++ ++ G + TS P + + I VPKK K R+ +DYR LN+ + D P+
Sbjct: 224 ESQIQDMLNQGIIRTSNSP-YNSPIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDRHPI 282
Query: 1344 PHIDVLVDNTAKCKVFSFMDGFSGYNQIRMAPEDREKTSFITPWGAFCYVVMPFGLINAG 1403
P++D ++ +C F+ +D G++QI M PE KT+F T G + Y+ MPFGL NA
Sbjct: 283 PNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKNAP 342
Query: 1404 ATYQRGMTKIFHDMIHKEIEVYVDDMIVKSGTEEEHVEYLLKMFQRLRKYKLRLNPNKCT 1463
AT+QR M I +++K VY+DD+IV S + +EH++ L +F++L K L+L +KC
Sbjct: 343 ATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDKCE 402
Query: 1464 FGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPVPKTEKQVRGFLGRLNYISRFISHMTAT 1523
F + LG +++ GI+ +P+K+ AI++ P+P K+++ FLG Y +FI +
Sbjct: 403 FLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTGYYRKFIPNFADI 462
Query: 1524 CGPIFKLLRKDQGV-KWNDDCQKAFDQIKEYLLEPPILVPPVDGRPLIMYLTVLEDSMGC 1582
P+ K L+K+ + N + AF ++K + E PIL P + + + ++G
Sbjct: 463 AKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLTTDASDVALGA 522
Query: 1583 VLGQQDETGKKEHVIYYLSKKFTDCESRYSVLEKTCCALAWAAKRLRHYMINHTTWLISK 1642
VL Q H + Y+S+ + E YS +EK A+ WA K RHY++ + S
Sbjct: 523 VLSQDG------HPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGRHFEISSD 576
Query: 1643 MDPIKYIFEKPALTGRIARWQMLLSEYDIEYRTQKAVKGSILAEHLAHQPIED 1695
P+ +++ ++ RW++ LSE+D + + K K + +A+ L+ +E+
Sbjct: 577 HQPLSWLYRMKDPNSKLTRWRVKLSEFDFDIKYIKG-KENCVADALSRIKLEE 628
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 219 bits (559), Expect = 5e-56
Identities = 129/419 (30%), Positives = 218/419 (51%), Gaps = 20/419 (4%)
Query: 1286 LKIKEEVRKQIDAGFLVTSEYPQ----WLANIVPVPKKDGKIRMCVDYRDLNKASPKDNF 1341
++++ +V++ ++ G + S P W+ P K R+ +DYR LN+ + D +
Sbjct: 220 IEVENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVVIDYRKLNEITIPDRY 279
Query: 1342 PLPHIDVLVDNTAKCKVFSFMDGFSGYNQIRMAPEDREKTSFITPWGAFCYVVMPFGLIN 1401
P+P++D ++ KC+ F+ +D G++QI M E KT+F T G + Y+ MPFGL N
Sbjct: 280 PIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTKSGHYEYLRMPFGLRN 339
Query: 1402 AGATYQRGMTKIFHDMIHKEIEVYVDDMIVKSGTEEEHVEYLLKMFQRLRKYKLRLNPNK 1461
A AT+QR M I +++K VY+DD+I+ S + EH+ + +F +L L+L +K
Sbjct: 340 APATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLVFTKLADANLKLQLDK 399
Query: 1462 CTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPVPKTEKQVRGFLGRLNYISRFISHMT 1521
C F + LG IV+ GI+ +P KV+AI P+P +K++R FLG Y +FI +
Sbjct: 400 CEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAFLGLTGYYRKFIPNYA 459
Query: 1522 ATCGPIFKLLRKDQGVKWNDDCQK-----AFDQIKEYLLEPPILVPPVDGRPLIMYLTVL 1576
P+ L+K + D QK AF+++K ++ PIL P + ++
Sbjct: 460 DIAKPMTSCLKKRTKI----DTQKLEYIEAFEKLKALIIRDPILQLPDFEKKFVLTTDAS 515
Query: 1577 EDSMGCVLGQQDETGKKEHVIYYLSKKFTDCESRYSVLEKTCCALAWAAKRLRHYMINHT 1636
++G VL Q H I ++S+ D E YS +EK A+ WA K RHY++
Sbjct: 516 NLALGAVLSQNG------HPISFISRTLNDHELNYSAIEKELLAIVWATKTFRHYLLGRQ 569
Query: 1637 TWLISKMDPIKYIFEKPALTGRIARWQMLLSEYDIEYRTQKAVKGSILAEHLAHQPIED 1695
+ S P++++ ++ RW++ LSEY + K + S+ A+ L+ IE+
Sbjct: 570 FLIASDHQPLRWLHNLKEPGAKLERWRVRLSEYQFKIDYIKGKENSV-ADALSRIKIEE 627
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 208 bits (529), Expect = 2e-52
Identities = 144/475 (30%), Positives = 238/475 (49%), Gaps = 13/475 (2%)
Query: 1231 KKRVIELIREYVDIFAWSYKDMPGLDPDVVEHRLPLKPECPPVKQKLRRSHPDMALKIKE 1290
K ++ + EY+DIFA + P ++ + +L LK + P + R H + +I+
Sbjct: 276 KSQLENICSEYIDIFA--LESEPITVNNLYKQQLRLKDDEPVYTKNYRSPHSQVE-EIQA 332
Query: 1291 EVRKQIDAGFLVTSEYPQWLANIVPVPKKDG------KIRMCVDYRDLNKASPKDNFPLP 1344
+V+K I +V Q+ + ++ VPKK K R+ +DYR +NK D FPLP
Sbjct: 333 QVQKLIKDK-IVEPSVSQYNSPLLLVPKKSSPNSDKKKWRLVIDYRQINKKLLADKFPLP 391
Query: 1345 HIDVLVDNTAKCKVFSFMDGFSGYNQIRMAPEDREKTSFITPWGAFCYVVMPFGLINAGA 1404
ID ++D + K FS +D SG++QI + R+ TSF T G++ + +PFGL A
Sbjct: 392 RIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTSNGSYRFTRLPFGLKIAPN 451
Query: 1405 TYQRGMTKIFHDMIHKEIEVYVDDMIVKSGTEEEHVEYLLKMFQRLRKYKLRLNPNKCTF 1464
++QR MT F + + +Y+DD+IV +E+ ++ L ++F + R+Y L+L+P KC+F
Sbjct: 452 SFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEVFGKCREYNLKLHPEKCSF 511
Query: 1465 GVRSGKLLGFIVSQKGIEVDPDKVRAIREMPVPKTEKQVRGFLGRLNYISRFISHMTATC 1524
+ LG + KGI D K I+ PVP R F+ NY RFI +
Sbjct: 512 FMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRFVAFCNYYRRFIKNFADYS 571
Query: 1525 GPIFKLLRKDQGVKWNDDCQKAFDQIKEYLLEPPILVPPVDGRPLIMYLTVLEDSMGCVL 1584
I +L +K+ +W D+CQKAF +K L+ P +L P + + + + G VL
Sbjct: 572 RHITRLCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSKEFCITTDASKQACGAVL 631
Query: 1585 GQQDETGKKEHVIYYLSKKFTDCESRYSVLEKTCCALAWAAKRLRHYMINHTTWLISKMD 1644
Q+ G + V Y S+ FT ES S E+ A+ WA R Y+ + +
Sbjct: 632 -TQNHNGHQLPVA-YASRAFTKGESNKSTTEQELAAIHWAIIHFRPYIYGKHFTVKTDHR 689
Query: 1645 PIKYIFEKPALTGRIARWQMLLSEYDIEYRTQKAVKGSILAEHLAHQPIEDYQPI 1699
P+ Y+F + ++ R ++ L EY+ K K + +A+ L+ I++ + I
Sbjct: 690 PLTYLFSMVNPSSKLTRIRLELEEYNFTVEYLKG-KDNHVADALSRITIKELKDI 743
Score = 110 bits (274), Expect = 6e-23
Identities = 95/398 (23%), Positives = 174/398 (42%), Gaps = 23/398 (5%)
Query: 1913 KKTLRKLSSRFFLNEDVLYKRNFDGVLLRCV----DKHEAEKLMCEIHEGSFGTHSCGHA 1968
KK +S F N +N LL V ++ E E ++ +H+ G
Sbjct: 854 KKIFEHVSIDKFKNMGNKILKNLKVALLNPVTQINNEKEKEAILSTLHDDPIQGGHTGIT 913
Query: 1969 MAKKILRAGYYWITMHADCYNHAKRCHKCQIYADKIHIPPSMLNVISSPWPFSMWGIDMI 2028
++ YYW M + ++C KCQ H M + F +D I
Sbjct: 914 KTLAKVKRHYYWKNMSKYIKEYVRKCQKCQKAKTTKHTKTPMTITETPEHAFDRVVVDTI 973
Query: 2029 GRIEPKASNGHRFILVAIDYFTKWVEAASYANVTKQVVVRFIKNNLISRYGVPNRIITDN 2088
G + PK+ NG+ + + I TK++ A AN + + V + I + I +YG ITD
Sbjct: 974 GPL-PKSENGNEYAVTLICDLTKYLVAIPIANKSAKTVAKAIFESFILKYGPMKTFITDM 1032
Query: 2089 GTNLNNNMMKELCDDFKIQHHNSSPYRPQMNGAVEAANKNIKKIIQKMVVTYK-DWHEML 2147
GT N+++ +LC KI++ S+ + Q G VE +++ + + I+ + T K DW L
Sbjct: 1033 GTEYKNSIITDLCKYLKIKNITSTAHHHQTVGVVERSHRTLNEYIRSYISTDKTDWDVWL 1092
Query: 2148 PYALYGYRTSVRTSTGATPFSLVYGMEAVLPVEV-EIPSLRVLMEAELSEAEWCQSRYDQ 2206
Y +Y + T+ P+ LV+G + LP ++ S+ + + ++ + +
Sbjct: 1093 QYFVYCFNTTQSMVHNYCPYELVFGRTSNLPKHFNKLHSIEPIYNID----DYAKESKYR 1148
Query: 2207 LNLIEEKRMAALCHGQLYQSRMKQAFDKGVHPREFKEGDLVLKCIKSFQPDPRGKWTPNY 2266
L + + L + ++ + K+ +D V E + GD VL + + K Y
Sbjct: 1149 LEVAYARARKLL---EAHKEKNKENYDLKVKDIELEVGDKVL-----LRNEVGHKLDFKY 1200
Query: 2267 EGPYVVKR-AFSGGALILTNMDGEELPRPVNSDAVKKY 2303
GPY ++ + +LTN + +++ V+ D +KK+
Sbjct: 1201 TGPYKIESIGDNNNITLLTNKNKKQI---VHKDRLKKF 1235
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 203 bits (516), Expect = 5e-51
Identities = 143/500 (28%), Positives = 250/500 (49%), Gaps = 48/500 (9%)
Query: 1236 ELIREYVDIFAWSY-----KDMPGLDPDVV----EHRLPLKP-ECPPVKQKLRRSHPDMA 1285
++ +E+ DI A + K + GL+ +V +RLP++ PP K +
Sbjct: 376 DIYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRNYPLPPGKMQA-------- 427
Query: 1286 LKIKEEVRKQIDAGFLVTSEYPQWLANIVPVPKKDGKIRMCVDYRDLNKASPKDNFPLPH 1345
+ +E+ + + +G + S+ ++ VPKK+G +RM VDY+ LNK + +PLP
Sbjct: 428 --MNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPL 484
Query: 1346 IDVLVDNTAKCKVFSFMDGFSGYNQIRMAPEDREKTSFITPWGAFCYVVMPFGLINAGAT 1405
I+ L+ +F+ +D S Y+ IR+ D K +F P G F Y+VMP+G+ A A
Sbjct: 485 IEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAH 544
Query: 1406 YQRGMTKIFHDMIHKEIEVYVDDMIVKSGTEEEHVEYLLKMFQRLRKYKLRLNPNKCTFG 1465
+Q + I + + Y+DD+++ S +E EHV+++ + Q+L+ L +N KC F
Sbjct: 545 FQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFH 604
Query: 1466 VRSGKLLGFIVSQKGIEVDPDKVRAIREMPVPKTEKQVRGFLGRLNYISRFISHMTATCG 1525
K +G+ +S+KG + + + + PK K++R FLG +NY+ +FI +
Sbjct: 605 QSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTH 664
Query: 1526 PIFKLLRKDQGVKWNDDCQKAFDQIKEYLLEPPILVPPVDGRPLIMYLTVLEDSMGCVLG 1585
P+ LL+KD KW +A + IK+ L+ PP+L + +++ + ++G VL
Sbjct: 665 PLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLS 724
Query: 1586 QQDETGKKEHVIYYLSKKFTDCESRYSVLEKTCCALAWAAKRLRHYMINHTTWLISKMDP 1645
Q+ + K V YY S K + + YSV +K A+ + K RHY L S ++P
Sbjct: 725 QKHDDDKYYPVGYY-SAKMSKAQLNYSVSDKEMLAIIKSLKHWRHY-------LESTIEP 776
Query: 1646 IKYIFEKPALTGRI-----------ARWQMLLSEYDIEYRTQKAVKGSILAEHLA---HQ 1691
K + + L GRI ARWQ+ L +++ E + GS A H+A +
Sbjct: 777 FKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYR---PGS--ANHIADALSR 831
Query: 1692 PIEDYQPIKFDFPDKEVMYL 1711
+++ +PI D D + ++
Sbjct: 832 IVDETEPIPKDSEDNSINFV 851
Score = 97.1 bits (240), Expect = 5e-19
Identities = 113/492 (22%), Positives = 199/492 (39%), Gaps = 42/492 (8%)
Query: 1786 IDLRIKKIVIYGDSALVINQIKGEWETRHPGLIPYRDYARRLLTFFNKVELHHVPRDENQ 1845
++ I+ I D +I +I E E + L ++ + L FN E+++ P N
Sbjct: 770 LESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLF----LQDFN-FEINYRPGSANH 824
Query: 1846 MADALATLSSMINVNGHNTVPVINVQFLDRPAYVFVAEAIDDDKPWYHDIQVFLQTQKYP 1905
+ADAL+ + T P+ + +V DD K + + K
Sbjct: 825 IADALSRIVD-------ETEPIPKDSEDNSINFVNQISITDDFKNQV--VTEYTNDTKLL 875
Query: 1906 PGASNKDKKTLRKLSSRFFLNEDVLYKRNFDGVLLRCVDKHEAEKLMCEIHEGSFGTHSC 1965
+N+DK+ + + D L + D +LL D ++ + HE H
Sbjct: 876 NLLNNEDKRVEENIQLK-----DGLLINSKDQILLPN-DTQLTRTIIKKYHEEGKLIHPG 929
Query: 1966 GHAMAKKILRAGYYWITMHADCYNHAKRCHKCQIYADKIHIPPSMLNVIS-SPWPFSMWG 2024
+ ILR + W + + + CH CQI + H P L I S P+
Sbjct: 930 IELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLS 988
Query: 2025 IDMIGRIEPKASNGHRFILVAIDYFTKW-VEAASYANVTKQVVVRFIKNNLISRYGVPNR 2083
+D I + S+G+ + V +D F+K + ++T + R +I+ +G P
Sbjct: 989 MDFITALPE--SSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKE 1046
Query: 2084 IITDNGTNLNNNMMKELCDDFKIQHHNSSPYRPQMNGAVEAANKNIKKIIQKMVVTYKD- 2142
II DN + K+ + S PYRPQ +G E N+ ++K+++ + T+ +
Sbjct: 1047 IIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNT 1106
Query: 2143 WHEMLPYALYGYRTSVRTSTGATPFSLVYGMEAVLPVEVEIPSLRVLMEAELSEAEWCQS 2202
W + + Y ++ ++T TPF +V+ L +E+PS + E Q
Sbjct: 1107 WVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS-PLELPSFSDKTD------ENSQE 1159
Query: 2203 RYDQLNLIEEKRMAALCHGQLYQSRMKQAFDKGVHP-REFKEGDLVL-KCIKSFQPDPRG 2260
++E H +MK+ FD + EF+ GDLV+ K K+
Sbjct: 1160 TIQVFQTVKE-------HLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVKRTKTGFLHKSN 1212
Query: 2261 KWTPNYEGPYVV 2272
K P++ GP+ V
Sbjct: 1213 KLAPSFAGPFYV 1224
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 201 bits (510), Expect = 3e-50
Identities = 142/500 (28%), Positives = 251/500 (49%), Gaps = 48/500 (9%)
Query: 1236 ELIREYVDIFAWSY-----KDMPGLDPDVV----EHRLPLKP-ECPPVKQKLRRSHPDMA 1285
++ +E+ DI A + K + GL+ +V +RLP++ PP K +
Sbjct: 376 DIYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRNYPLPPGKMQA-------- 427
Query: 1286 LKIKEEVRKQIDAGFLVTSEYPQWLANIVPVPKKDGKIRMCVDYRDLNKASPKDNFPLPH 1345
+ +E+ + + +G + S+ ++ VPKK+G +RM VDY+ LNK + +PLP
Sbjct: 428 --MNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPL 484
Query: 1346 IDVLVDNTAKCKVFSFMDGFSGYNQIRMAPEDREKTSFITPWGAFCYVVMPFGLINAGAT 1405
I+ L+ +F+ +D S Y+ IR+ D K +F P G F Y+VMP+G+ A A
Sbjct: 485 IEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAPAH 544
Query: 1406 YQRGMTKIFHDMIHKEIEVYVDDMIVKSGTEEEHVEYLLKMFQRLRKYKLRLNPNKCTFG 1465
+Q + I ++ + Y+D++++ S +E EHV+++ + Q+L+ L +N KC F
Sbjct: 545 FQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFH 604
Query: 1466 VRSGKLLGFIVSQKGIEVDPDKVRAIREMPVPKTEKQVRGFLGRLNYISRFISHMTATCG 1525
K +G+ +S+KG + + + + PK K++R FLG +NY+ +FI +
Sbjct: 605 QSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTH 664
Query: 1526 PIFKLLRKDQGVKWNDDCQKAFDQIKEYLLEPPILVPPVDGRPLIMYLTVLEDSMGCVLG 1585
P+ LL+KD KW +A + IK+ L+ PP+L + +++ + ++G VL
Sbjct: 665 PLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLS 724
Query: 1586 QQDETGKKEHVIYYLSKKFTDCESRYSVLEKTCCALAWAAKRLRHYMINHTTWLISKMDP 1645
Q+ + K V YY S K + + YSV +K A+ + K RHY L S ++P
Sbjct: 725 QKHDDDKYYPVGYY-SAKMSKAQLNYSVSDKEMLAIIKSLKHWRHY-------LESTIEP 776
Query: 1646 IKYIFEKPALTGRI-----------ARWQMLLSEYDIEYRTQKAVKGSILAEHLA---HQ 1691
K + + L GRI ARWQ+ L +++ E + GS A H+A +
Sbjct: 777 FKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYR---PGS--ANHIADALSR 831
Query: 1692 PIEDYQPIKFDFPDKEVMYL 1711
+++ +PI D D + ++
Sbjct: 832 IVDETEPIPKDSEDNSINFV 851
Score = 97.1 bits (240), Expect = 5e-19
Identities = 113/492 (22%), Positives = 199/492 (39%), Gaps = 42/492 (8%)
Query: 1786 IDLRIKKIVIYGDSALVINQIKGEWETRHPGLIPYRDYARRLLTFFNKVELHHVPRDENQ 1845
++ I+ I D +I +I E E + L ++ + L FN E+++ P N
Sbjct: 770 LESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLF----LQDFN-FEINYRPGSANH 824
Query: 1846 MADALATLSSMINVNGHNTVPVINVQFLDRPAYVFVAEAIDDDKPWYHDIQVFLQTQKYP 1905
+ADAL+ + T P+ + +V DD K + + K
Sbjct: 825 IADALSRIVD-------ETEPIPKDSEDNSINFVNQISITDDFKNQV--VTEYTNDTKLL 875
Query: 1906 PGASNKDKKTLRKLSSRFFLNEDVLYKRNFDGVLLRCVDKHEAEKLMCEIHEGSFGTHSC 1965
+N+DK+ + + D L + D +LL D ++ + HE H
Sbjct: 876 NLLNNEDKRVEENIQLK-----DGLLINSKDQILLPN-DTQLTRTIIKKYHEEGKLIHPG 929
Query: 1966 GHAMAKKILRAGYYWITMHADCYNHAKRCHKCQIYADKIHIPPSMLNVIS-SPWPFSMWG 2024
+ ILR + W + + + CH CQI + H P L I S P+
Sbjct: 930 IELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLS 988
Query: 2025 IDMIGRIEPKASNGHRFILVAIDYFTKW-VEAASYANVTKQVVVRFIKNNLISRYGVPNR 2083
+D I + S+G+ + V +D F+K + ++T + R +I+ +G P
Sbjct: 989 MDFITALPE--SSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKE 1046
Query: 2084 IITDNGTNLNNNMMKELCDDFKIQHHNSSPYRPQMNGAVEAANKNIKKIIQKMVVTYKD- 2142
II DN + K+ + S PYRPQ +G E N+ ++K+++ + T+ +
Sbjct: 1047 IIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNT 1106
Query: 2143 WHEMLPYALYGYRTSVRTSTGATPFSLVYGMEAVLPVEVEIPSLRVLMEAELSEAEWCQS 2202
W + + Y ++ ++T TPF +V+ L +E+PS + E Q
Sbjct: 1107 WVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS-PLELPSFSDKTD------ENSQE 1159
Query: 2203 RYDQLNLIEEKRMAALCHGQLYQSRMKQAFDKGVHP-REFKEGDLVL-KCIKSFQPDPRG 2260
++E H +MK+ FD + EF+ GDLV+ K K+
Sbjct: 1160 TIQVFQTVKE-------HLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVKRTKTGFLHKSN 1212
Query: 2261 KWTPNYEGPYVV 2272
K P++ GP+ V
Sbjct: 1213 KLAPSFAGPFYV 1224
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 201 bits (510), Expect = 3e-50
Identities = 142/500 (28%), Positives = 251/500 (49%), Gaps = 48/500 (9%)
Query: 1236 ELIREYVDIFAWSY-----KDMPGLDPDVV----EHRLPLKP-ECPPVKQKLRRSHPDMA 1285
++ +E+ DI A + K + GL+ +V +RLP++ PP K +
Sbjct: 376 DIYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRNYPLPPGKMQA-------- 427
Query: 1286 LKIKEEVRKQIDAGFLVTSEYPQWLANIVPVPKKDGKIRMCVDYRDLNKASPKDNFPLPH 1345
+ +E+ + + +G + S+ ++ VPKK+G +RM VDY+ LNK + +PLP
Sbjct: 428 --MNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPL 484
Query: 1346 IDVLVDNTAKCKVFSFMDGFSGYNQIRMAPEDREKTSFITPWGAFCYVVMPFGLINAGAT 1405
I+ L+ +F+ +D S Y+ IR+ D K +F P G F Y+VMP+G+ A A
Sbjct: 485 IEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAPAH 544
Query: 1406 YQRGMTKIFHDMIHKEIEVYVDDMIVKSGTEEEHVEYLLKMFQRLRKYKLRLNPNKCTFG 1465
+Q + I ++ + Y+D++++ S +E EHV+++ + Q+L+ L +N KC F
Sbjct: 545 FQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFH 604
Query: 1466 VRSGKLLGFIVSQKGIEVDPDKVRAIREMPVPKTEKQVRGFLGRLNYISRFISHMTATCG 1525
K +G+ +S+KG + + + + PK K++R FLG +NY+ +FI +
Sbjct: 605 QSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTH 664
Query: 1526 PIFKLLRKDQGVKWNDDCQKAFDQIKEYLLEPPILVPPVDGRPLIMYLTVLEDSMGCVLG 1585
P+ LL+KD KW +A + IK+ L+ PP+L + +++ + ++G VL
Sbjct: 665 PLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLS 724
Query: 1586 QQDETGKKEHVIYYLSKKFTDCESRYSVLEKTCCALAWAAKRLRHYMINHTTWLISKMDP 1645
Q+ + K V YY S K + + YSV +K A+ + K RHY L S ++P
Sbjct: 725 QKHDDDKYYPVGYY-SAKMSKAQLNYSVSDKEMLAIIKSLKHWRHY-------LESTIEP 776
Query: 1646 IKYIFEKPALTGRI-----------ARWQMLLSEYDIEYRTQKAVKGSILAEHLA---HQ 1691
K + + L GRI ARWQ+ L +++ E + GS A H+A +
Sbjct: 777 FKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYR---PGS--ANHIADALSR 831
Query: 1692 PIEDYQPIKFDFPDKEVMYL 1711
+++ +PI D D + ++
Sbjct: 832 IVDETEPIPKDSEDNSINFV 851
Score = 97.1 bits (240), Expect = 5e-19
Identities = 113/492 (22%), Positives = 199/492 (39%), Gaps = 42/492 (8%)
Query: 1786 IDLRIKKIVIYGDSALVINQIKGEWETRHPGLIPYRDYARRLLTFFNKVELHHVPRDENQ 1845
++ I+ I D +I +I E E + L ++ + L FN E+++ P N
Sbjct: 770 LESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLF----LQDFN-FEINYRPGSANH 824
Query: 1846 MADALATLSSMINVNGHNTVPVINVQFLDRPAYVFVAEAIDDDKPWYHDIQVFLQTQKYP 1905
+ADAL+ + T P+ + +V DD K + + K
Sbjct: 825 IADALSRIVD-------ETEPIPKDSEDNSINFVNQISITDDFKNQV--VTEYTNDTKLL 875
Query: 1906 PGASNKDKKTLRKLSSRFFLNEDVLYKRNFDGVLLRCVDKHEAEKLMCEIHEGSFGTHSC 1965
+N+DK+ + + D L + D +LL D ++ + HE H
Sbjct: 876 NLLNNEDKRVEENIQLK-----DGLLINSKDQILLPN-DTQLTRTIIKKYHEEGKLIHPG 929
Query: 1966 GHAMAKKILRAGYYWITMHADCYNHAKRCHKCQIYADKIHIPPSMLNVIS-SPWPFSMWG 2024
+ ILR + W + + + CH CQI + H P L I S P+
Sbjct: 930 IELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLS 988
Query: 2025 IDMIGRIEPKASNGHRFILVAIDYFTKW-VEAASYANVTKQVVVRFIKNNLISRYGVPNR 2083
+D I + S+G+ + V +D F+K + ++T + R +I+ +G P
Sbjct: 989 MDFITALPE--SSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKE 1046
Query: 2084 IITDNGTNLNNNMMKELCDDFKIQHHNSSPYRPQMNGAVEAANKNIKKIIQKMVVTYKD- 2142
II DN + K+ + S PYRPQ +G E N+ ++K+++ + T+ +
Sbjct: 1047 IIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNT 1106
Query: 2143 WHEMLPYALYGYRTSVRTSTGATPFSLVYGMEAVLPVEVEIPSLRVLMEAELSEAEWCQS 2202
W + + Y ++ ++T TPF +V+ L +E+PS + E Q
Sbjct: 1107 WVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS-PLELPSFSDKTD------ENSQE 1159
Query: 2203 RYDQLNLIEEKRMAALCHGQLYQSRMKQAFDKGVHP-REFKEGDLVL-KCIKSFQPDPRG 2260
++E H +MK+ FD + EF+ GDLV+ K K+
Sbjct: 1160 TIQVFQTVKE-------HLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVKRTKTGFLHKSN 1212
Query: 2261 KWTPNYEGPYVV 2272
K P++ GP+ V
Sbjct: 1213 KLAPSFAGPFYV 1224
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 196 bits (498), Expect = 6e-49
Identities = 118/447 (26%), Positives = 230/447 (51%), Gaps = 10/447 (2%)
Query: 1232 KRVIELIREYVDIFAWSYKDMPGLDPDVVEHRLPLKPECPPVKQKLRRSHPDMALKIKEE 1291
+++ ++I ++ D+FA S ++ G + E + LK P++QK R + +I++
Sbjct: 904 RKIWDVIEQFQDVFAISDDEL-GRNSGT-ECVIELKEGAEPIRQKPRPIPLALKPEIRKM 961
Query: 1292 VRKQIDAGFLVTSEYPQWLANIVPVPKKDGKIRMCVDYRDLNKASPKDNFPLPHIDVLVD 1351
++K ++ + S+ P W + +V V KKDG IRMC+DYR +NK + PLP+I+ +
Sbjct: 962 IQKMLNQKVIRESKSP-WSSPVVLVKKKDGSIRMCIDYRKVNKVVKNNAHPLPNIEATLQ 1020
Query: 1352 NTAKCKVFSFMDGFSGYNQIRMAPEDREKTSFITPWGAFCYVVMPFGLINAGATYQRGMT 1411
+ A K+++ D +G+ QI + + +E T+F F + V+PFGL+ + A +Q M
Sbjct: 1021 SLAGKKLYTVFDMIAGFWQIPLDEKSKEITAFAIGSELFEWNVLPFGLVISPALFQGTME 1080
Query: 1412 KIFHDMIHKEIEVYVDDMIVKSGTEEEHVEYLLKMFQRLRKYKLRLNPNKCTFGVRSGKL 1471
+I D++ VYVDD+++ S E+H++ + + R+RK ++L +KC + +
Sbjct: 1081 EIIGDLLGVCAFVYVDDLLIASKDMEQHLQDVKEALTRIRKSGMKLRASKCHIAKKEVEY 1140
Query: 1472 LGFIVSQKGIEVDPDKVRAIREMPVPKTEKQVRGFLGRLNYISRFISHMTATCGPIFKLL 1531
LG V+ G+E K +++ P K+++ FLG + Y +FI + + L+
Sbjct: 1141 LGHKVTLDGVETQEVKTDKMKQFSRPTNVKELQSFLGLVGYYRKFILNFAQIASSLTSLI 1200
Query: 1532 RKDQGVKWNDDCQKAFDQIKEYLLEPPILVPP------VDGRPLIMYLTVLEDSMGCVLG 1585
W + + AF ++K+ + + P+L P RP ++Y +G VL
Sbjct: 1201 SAKVAWIWEKEQEIAFQELKKLVCQTPVLAQPDVEAALKGDRPFMIYTDASRKGIGAVLA 1260
Query: 1586 QQDETGKKEHVIYYLSKKFTDCESRYSVLEKTCCALAWAAKRLRHYMINHTTWLISKMDP 1645
Q+ G ++H I + SK + E+RY + + A+ +A +R + + + + P
Sbjct: 1261 QEGPDG-QQHPIAFASKALSPAETRYHITDLEALAMMFALRRFKTIIYGTAITVFTDHKP 1319
Query: 1646 IKYIFEKPALTGRIARWQMLLSEYDIE 1672
+ + + L R+ RW + + E+D++
Sbjct: 1320 LISLLKGSPLADRLWRWSIEILEFDVK 1346
Score = 105 bits (263), Expect = 1e-21
Identities = 87/328 (26%), Positives = 147/328 (44%), Gaps = 16/328 (4%)
Query: 1951 LMCEIHEGSFGTHSCGHAMAKKILRAGYYWITMHADCYNHAKRCHKCQIYADKIHIPPSM 2010
L+ E+HEG H M + + R +YW M N + C KC D + S
Sbjct: 1468 LLKELHEGMLAGHFGIKKMWRMVHRK-FYWPQMRVCVENCVRTCAKCLCANDHSKLTSS- 1525
Query: 2011 LNVISSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYANVTKQVVVR-F 2069
L +P + D++ + G+R+IL ID FTK+ A + + V++ F
Sbjct: 1526 LTPYRMTFPLEIVACDLMD--VGLSVQGNRYILTIIDLFTKYGTAVPIPDKKAETVLKAF 1583
Query: 2070 IKNNLISRYGVPNRIITDNGTNLNNNMMKELCDDFKIQHHNSSPYRPQMNGAVEAANKNI 2129
++ I +P +++TD G N + + KI+H + Y + NGAVE NK I
Sbjct: 1584 VERWAIGEGRIPLKLLTDQGKEFVNGLFAQFTHMLKIEHITTKGYNSRANGAVERFNKTI 1643
Query: 2130 KKIIQKMVVTYKDWHEMLPYALYGYRTSVRTSTGATPFSLVYGMEAVLPVEVEIPSLRVL 2189
I++K +W + + YA+Y Y V +TG TP L++G + + P+E+ +
Sbjct: 1644 MHIMKKKTAVPMEWDDQVVYAVYAYNNCVHENTGETPMFLMHGRDVMGPLEMSGEDAVGI 1703
Query: 2190 MEAELSEAEWCQSRYDQLNLIEEKRMAALCHGQLYQSRMKQAFDKGVHPREFKEGDLVLK 2249
A++ E + ++ + L + + + A+ + Y+S Q + H R + G VL
Sbjct: 1704 NYADMDEYKHLLTQ-ELLKVQKIAKEHAMREQESYKSLFDQKYASKKH-RFPQPGSRVLL 1761
Query: 2250 CIKSFQ-----PDPRGKWTPNYEGPYVV 2272
I S + P KW+ GPY V
Sbjct: 1762 EIPSEKLGAQCPKLVNKWS----GPYRV 1785
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 188 bits (477), Expect = 2e-46
Identities = 127/434 (29%), Positives = 210/434 (48%), Gaps = 24/434 (5%)
Query: 1279 RSHPDMALKIKEEVRKQIDA----GFLVTSEYPQ----WLANIVPVPKKDGKIRMCVDYR 1330
+S+P + ++ EV +QID G + S P W+ P P + + RM VD++
Sbjct: 127 KSYP-YPVNMRGEVERQIDELLQDGIIRPSNSPYNSPIWIVPKKPKPNGEKQYRMVVDFK 185
Query: 1331 DLNKASPKDNFPLPHIDVLVDNTAKCKVFSFMDGFSGYNQIRMAPEDREKTSFITPWGAF 1390
LN + D +P+P I+ + + K F+ +D SG++QI M D KT+F T G +
Sbjct: 186 RLNTVTIPDTYPIPDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKY 245
Query: 1391 CYVVMPFGLINAGATYQRGMTKIFHDMIHKEIEVYVDDMIVKSGTEEEHVEYLLKMFQRL 1450
++ +PFGL NA A +QR + I + I K VY+DD+IV S + H + L + L
Sbjct: 246 EFLRLPFGLKNAPAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASL 305
Query: 1451 RKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPVPKTEKQVRGFLGRL 1510
K L++N K F + LG+IV+ GI+ DP KVRAI EMP P + K+++ FLG
Sbjct: 306 SKANLQVNLEKSHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMT 365
Query: 1511 NYISRFISHMTATCGPIFKLLR-----------KDQGVKWNDDCQKAFDQIKEYLLEPPI 1559
+Y +FI P+ L R + ++ ++F+ +K L I
Sbjct: 366 SYYRKFIQDYAKVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEI 425
Query: 1560 LVPPVDGRPLIMYLTVLEDSMGCVLGQQDETGKKEHVIYYLSKKFTDCESRYSVLEKTCC 1619
L P +P + ++G VL Q D+ ++ I Y+S+ E Y+ +EK
Sbjct: 426 LAFPCFTKPFHLTTDASNWAIGAVLSQDDQ--GRDRPIAYISRSLNKTEENYATIEKEML 483
Query: 1620 ALAWAAKRLRHYMIN-HTTWLISKMDPIKYIFEKPALTGRIARWQMLLSEYDIEYRTQKA 1678
A+ W+ LR Y+ T + + P+ + ++ RW+ + EY+ E K
Sbjct: 484 AIIWSLDNLRAYLYGAGTIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCEL-IYKP 542
Query: 1679 VKGSILAEHLAHQP 1692
K +++A+ L+ P
Sbjct: 543 GKSNVVADALSRIP 556
Score = 34.7 bits (78), Expect = 3.2
Identities = 47/226 (20%), Positives = 89/226 (38%), Gaps = 17/226 (7%)
Query: 1952 MCEIHEGSFGTHSCGHAMAKKILRAGYYWITMHADCYNHAKRCHKCQIYADKIHIPPSML 2011
+CEI E G + L YY+ M + C C++Y + H P+
Sbjct: 693 ICEIIEKEHRRAHRGPTEIRLQLLEKYYFPRMSSTIRLQTSSCQCCKLYKYERH--PNKP 750
Query: 2012 NVISSP---WPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYANVTKQVVVR 2068
N+ +P +P + ID+ + R L ID F+K+ + + V +R
Sbjct: 751 NLQPTPIPNYPCEILHIDIFALEK-------RLYLSCIDKFSKFAK-LFHLQSKASVHLR 802
Query: 2069 FIKNNLISRYGVPNRIITDNGTNLNNNMMKELCDDFKIQHHNSSPYRPQMNGAVEAANK- 2127
+ + P +++DN L + I + + + ++NG VE +
Sbjct: 803 ETLVEALHYFTAPKVLVSDNERGLLCPTVLNYLRSLDIDLYYAPTQKSEVNGQVERFHST 862
Query: 2128 --NIKKIIQKMVVTYKDWHEMLPYALYGYRTSVRTSTGATPFSLVY 2171
I + ++ + T+K E++ A+ Y TSV + T P + +
Sbjct: 863 FLEIYRCLKDELPTFKP-VELVHIAVDRYNTSVHSVTNRKPADVFF 907
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 180 bits (457), Expect = 4e-44
Identities = 115/387 (29%), Positives = 196/387 (49%), Gaps = 20/387 (5%)
Query: 1324 RMCVDYRDLNKASPKDNFPLPHIDVLVDNTAKCKVFSFMDGFSGYNQIRMAPEDREKTSF 1383
R+ +D+R LN+ + D +P+P I +++ N K K F+ +D SGY+QI +A DREKTSF
Sbjct: 238 RLVIDFRKLNEKTIPDRYPMPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSF 297
Query: 1384 ITPWGAFCYVVMPFGLINAGATYQRGMTKIFHDMIHKEIEVYVDDMIVKSGTEEEHVEYL 1443
G + + +PFGL NA + +QR + + + I K VYVDD+I+ S E +HV ++
Sbjct: 298 SVNGGKYEFCRLPFGLRNASSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHI 357
Query: 1444 LKMFQRLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPVPKTEKQV 1503
+ + L +R++ K F S + LGFIVS+ G + DP+KV+AI+E P P +V
Sbjct: 358 DTVLKCLIDANMRVSQEKTRFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKV 417
Query: 1504 RGFLGRLNYISRFISHMTATCGPIFKLLRKDQG-----------VKWNDDCQKAFDQIKE 1552
R FLG +Y FI A PI +L+ + G V++N+ + AF +++
Sbjct: 418 RSFLGLASYYRVFIKDFAAIARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRN 477
Query: 1553 YLL-EPPILVPPVDGRPLIMYLTVLEDSMGCVLGQQDETGKKEHVIYYLSKKFTDCESRY 1611
L E IL P +P + +G VL Q+ I +S+ E Y
Sbjct: 478 ILASEDVILKYPDFKKPFDLTTDASASGIGAVLSQEGRP------ITMISRTLKQPEQNY 531
Query: 1612 SVLEKTCCALAWAAKRLRHYMI-NHTTWLISKMDPIKYIFEKPALTGRIARWQMLLSEYD 1670
+ E+ A+ WA +L++++ + + + P+ + +I RW+ + +++
Sbjct: 532 ATNERELLAIVWALGKLQNFLYGSREINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQHN 591
Query: 1671 IEYRTQKAVKGSILAEHLAHQPIEDYQ 1697
+ K K + +A+ L+ Q + Q
Sbjct: 592 AKV-FYKPGKENFVADALSRQNLNALQ 617
>POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)]
Length = 886
Score = 167 bits (423), Expect = 3e-40
Identities = 198/902 (21%), Positives = 366/902 (39%), Gaps = 78/902 (8%)
Query: 1313 IVPVPKKDGKIRMCVDYRDLNKASPKDNFPLPHIDVLVDNTAKCKVFSFMDGFSGYNQIR 1372
+ PVPK DG+ RM +DYR++NK P H ++ + K + +D +G+
Sbjct: 5 VYPVPKPDGRWRMVLDYREVNKTIPLTAAQNQHSAGILATIVRQKYKTTLDLANGFWAHP 64
Query: 1373 MAPEDREKTSFITPWGAFCYVVMPFGLINAGATYQRGMTKIFHDMIHKEIEVYVDDMIVK 1432
+ PE T+F +C+ +P G +N+ A + + + ++ ++VYVDD+ +
Sbjct: 65 ITPESYWLTAFTWQGKQYCWTRLPQGFLNSPALFTADVVDLLKEI--PNVQVYVDDIYLS 122
Query: 1433 SGTEEEHVEYLLKMFQRLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEV-DPDKVRAI 1491
+EHV+ L K+FQ L + ++ K G ++ + LGF ++++G + D K + +
Sbjct: 123 HDDPKEHVQQLEKVFQILLQAGYVVSLKKSEIGQKTVEFLGFNITKEGRGLTDTFKTKLL 182
Query: 1492 REMPVPKTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKDQG--VKWNDDCQKAFDQ 1549
P PK KQ++ LG LN+ FI + P++ L+ +G ++W+++ K +
Sbjct: 183 NITP-PKDLKQLQSILGLLNFARNFIPNFAELVQPLYNLIASAKGKYIEWSEENTKQLNM 241
Query: 1550 IKEYLLEPPILVPPVDGRPLIMYLTVLEDSMGCVLGQQDETGKKEHVIYYLSKKFTDCES 1609
+ E L L + + L++ + S G V +ETGKK I YL+ F+ E
Sbjct: 242 VIEALNTASNLEERLPEQRLVIKVNT-SPSAGYV-RYYNETGKKP--IMYLNYVFSKAEL 297
Query: 1610 RYSVLEKTCCALAWAAKRLRHYMINHTTWLISKMDPIKYIFEKP-----ALTGRIARWQM 1664
++S+LEK + A + + + S + + I + P AL R W
Sbjct: 298 KFSMLEKLLTTMHKALIKAMDLAMGQEILVYSPIVSMTKIQKTPLPERKALPIRWITWMT 357
Query: 1665 LLSEYDIEYRTQKAVKGSILAEHLAHQPIEDYQPIKFDFPDKEVMYLKAKDCDEPVFGEG 1724
L + I++ K + +H+ P+K + V Y +
Sbjct: 358 YLEDPRIQFHYDKTLPE---LKHIPDVYTSSQSPVKHPSQYEGVFYTDGSAI------KS 408
Query: 1725 PDPESEWGLIFDGAVNVYGSGIGAVLITPKGTHIPFTARLRFDCTNNIVEYEACIMGIEE 1784
PDP N G GI P+ + + + T + E A ++
Sbjct: 409 PDPTKS---------NNAGMGIVHATYKPEYQVLNQWSIPLGNHTAQMAEIAAVEFACKK 459
Query: 1785 AIDLRIKKIVIYGDSALVINQIKGEWETRHPGLIPYRDYARRLLTFFNKVELHHVPRDEN 1844
A+ + +++ DS V E +PY + K L H+ +
Sbjct: 460 ALKIP-GPVLVITDSFYVAESANKE--------LPY--WKSNGFVNNKKKPLKHISK--- 505
Query: 1845 QMADALATLSSMINVNGHNTVPVINVQFLDRPAYVFVAEAIDDDKPWYHDIQVFLQTQKY 1904
+++ +++ T+ L P ++ A+ D V T+K
Sbjct: 506 -----WKSIAECLSMKPDITIQHEKGISLQIPVFILKGNALADKLATQGSYVVNCNTKK- 559
Query: 1905 PPGASNKDKKTLRKLSSR----------FFLNEDVLYKRNFDGVLLRCVDKHEAEKLMCE 1954
N D + + L +FL + + +GV + + + +K++ +
Sbjct: 560 ----PNLDAELDQLLQGHYIKGYPKQYTYFLEDGKVKVSRPEGVKI-IPPQSDRQKIVLQ 614
Query: 1955 IHEGSFGTHSCGHAMAKKILRAGYYWITMHADCYNHAKRCHKCQIYADKIHIPPSMLNVI 2014
H + H+ A KI Y+W M D RC +C I +L
Sbjct: 615 AHNLA---HTGREATLLKIANL-YWWPNMRKDVVKQLGRCQQCLITNASNKASGPILRPD 670
Query: 2015 SSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYANVTKQVVVRFIKNNL 2074
PF + ID IG + P S G+ ++LV +D T + + V+ + N+
Sbjct: 671 RPQKPFDKFFIDYIGPLPP--SQGYLYVLVVVDGMTGFTWLYPTKAPSTSATVKSL--NV 726
Query: 2075 ISRYGVPNRIITDNGTNLNNNMMKELCDDFKIQHHNSSPYRPQMNGAVEAANKNIKKIIQ 2134
++ +P I +D G ++ E + I S+PY PQ VE N +IK+++
Sbjct: 727 LTSIAIPKVIHSDQGAAFTSSTFAEWAKERGIHLEFSTPYHPQSGSKVERKNSDIKRLLT 786
Query: 2135 KMVV-TYKDWHEMLPYALYGYRTSVRTSTGATPFSLVYGMEAVLPVEVEIPSLRVLMEAE 2193
K++V W+++LP + TP L++G+++ P + +L + E E
Sbjct: 787 KLLVGRPTKWYDLLPVVQLALNNTYSPVLKYTPHQLLFGIDSNTPFANQ-DTLDLTREEE 845
Query: 2194 LS 2195
LS
Sbjct: 846 LS 847
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 129 bits (325), Expect = 7e-29
Identities = 133/511 (26%), Positives = 224/511 (43%), Gaps = 43/511 (8%)
Query: 1192 LEQERKTIQPHQEEIEIINLGTEEDKKEIKIGASLDVSIKKRVIELIREYVDIFAWSYKD 1251
+++ KT QP E +N+ T + + +K +++I L E + I +
Sbjct: 162 MKKRSKTQQP-----EPVNISTNKIENPLK-----EIAILSEGRRLSEEKLFITQQRMQK 211
Query: 1252 MPGLDPDVVEHRLPLKPECPP--VKQKLRRSHPDMALKIK---------EEVRKQI---- 1296
+ L V PL P +K ++ S P A+K+K EE KQI
Sbjct: 212 IEELLEKVCSEN-PLDPNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELL 270
Query: 1297 DAGFLVTSEYPQWLANIV---PVPKKDGKIRMCVDYRDLNKASPKDNFPLPHIDVLVDNT 1353
D + S+ P + K+ GK RM V+Y+ +NKA+ D + LP+ D L+
Sbjct: 271 DLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLTLI 330
Query: 1354 AKCKVFSFMDGFSGYNQIRMAPEDREKTSFITPWGAFCYVVMPFGLINAGATYQRGMTKI 1413
K+FS D SG+ Q+ + E R T+F P G + + V+PFGL A + +QR M +
Sbjct: 331 RGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEA 390
Query: 1414 FHDMIHKEIEVYVDDMIVKSGTEEEHVEYLLKMFQRLRKYKLRLNPNKCTFGVRSGKLLG 1473
F + K VYVDD++V S EE+H+ ++ + Q+ ++ + L+ K + LG
Sbjct: 391 FR-VFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLG 449
Query: 1474 FIVSQKGIEVDPDKVRAIREMP-VPKTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLR 1532
+ + + + I + P + +KQ++ FLG L Y S +I + P+ L+
Sbjct: 450 LEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLK 509
Query: 1533 KDQGVKWNDDCQKAFDQIKEYLLEPPILVPPVDGRPLIMYLTVLEDSMGCVLG--QQDET 1590
++ KW + ++K+ L P L P+ LI+ +D G +L + +E
Sbjct: 510 ENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEG 569
Query: 1591 GKKEHVIYYLSKKFTDCESRYSVLEKTCCALAWAAKRLR------HYMINHTTWLISKMD 1644
E + Y S F E Y +K A+ K+ H++I
Sbjct: 570 TNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFV 629
Query: 1645 PIKYIFEKPALTGRIARWQMLLSEY--DIEY 1673
+ Y + + GR RWQ LS Y D+E+
Sbjct: 630 NLNY--KGDSKLGRNIRWQAWLSHYSFDVEH 658
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 129 bits (324), Expect = 9e-29
Identities = 116/428 (27%), Positives = 193/428 (44%), Gaps = 30/428 (7%)
Query: 1273 VKQKLRRSHPDMALKIK---------EEVRKQI----DAGFLVTSEYPQWLANIV---PV 1316
+K ++ S P A+K+K EE KQI D + S+ P +
Sbjct: 234 MKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEA 293
Query: 1317 PKKDGKIRMCVDYRDLNKASPKDNFPLPHIDVLVDNTAKCKVFSFMDGFSGYNQIRMAPE 1376
K+ GK RM V+Y+ +NKA+ D + LP+ D L+ K+FS D SG+ Q+ + E
Sbjct: 294 EKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQE 353
Query: 1377 DREKTSFITPWGAFCYVVMPFGLINAGATYQRGMTKIFHDMIHKEIEVYVDDMIVKSGTE 1436
R T+F P G + + V+PFGL A + +QR M + F + K VYVDD++V S E
Sbjct: 354 SRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFR-VFRKFCCVYVDDILVFSNNE 412
Query: 1437 EEHVEYLLKMFQRLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP- 1495
E+H+ ++ + Q+ ++ + L+ K + LG + + + + I + P
Sbjct: 413 EDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPD 472
Query: 1496 VPKTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKDQGVKWNDDCQKAFDQIKEYLL 1555
+ +KQ++ FLG L Y S +I + P+ L+++ KW + ++K+ L
Sbjct: 473 TLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQ 532
Query: 1556 EPPILVPPVDGRPLIMYLTVLEDSMGCVLG--QQDETGKKEHVIYYLSKKFTDCESRYSV 1613
P L P+ LI+ +D G +L + +E E + Y S F E Y
Sbjct: 533 GFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNYHS 592
Query: 1614 LEKTCCALAWAAKRLR------HYMINHTTWLISKMDPIKYIFEKPALTGRIARWQMLLS 1667
+K A+ K+ H++I + Y + + GR RWQ LS
Sbjct: 593 NDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNY--KGDSKLGRNIRWQAWLS 650
Query: 1668 EY--DIEY 1673
Y D+E+
Sbjct: 651 HYSFDVEH 658
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 129 bits (323), Expect = 1e-28
Identities = 115/428 (26%), Positives = 193/428 (44%), Gaps = 30/428 (7%)
Query: 1273 VKQKLRRSHPDMALKIK---------EEVRKQI----DAGFLVTSEYPQWLANIV---PV 1316
+K ++ S P A+K+K EE KQI D + S+ P +
Sbjct: 234 MKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEA 293
Query: 1317 PKKDGKIRMCVDYRDLNKASPKDNFPLPHIDVLVDNTAKCKVFSFMDGFSGYNQIRMAPE 1376
K+ GK RM V+Y+ +NKA+ D + LP+ D L+ K+FS D SG+ Q+ + E
Sbjct: 294 EKRRGKKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQE 353
Query: 1377 DREKTSFITPWGAFCYVVMPFGLINAGATYQRGMTKIFHDMIHKEIEVYVDDMIVKSGTE 1436
R T+F P G + + V+PFGL A + +QR M + F + K VYVDD++V S E
Sbjct: 354 SRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFR-VFRKFCCVYVDDILVFSNNE 412
Query: 1437 EEHVEYLLKMFQRLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP- 1495
E+H+ ++ + Q+ ++ + L+ K + LG + + + + I + P
Sbjct: 413 EDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPD 472
Query: 1496 VPKTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKDQGVKWNDDCQKAFDQIKEYLL 1555
+ +KQ++ FLG L Y S +I + P+ L+++ +W + ++K+ L
Sbjct: 473 TLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWRWTKEDTLYMQKVKKNLQ 532
Query: 1556 EPPILVPPVDGRPLIMYLTVLEDSMGCVLG--QQDETGKKEHVIYYLSKKFTDCESRYSV 1613
P L P+ LI+ +D G +L + +E E + Y S F E Y
Sbjct: 533 GFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKNYHS 592
Query: 1614 LEKTCCALAWAAKRLR------HYMINHTTWLISKMDPIKYIFEKPALTGRIARWQMLLS 1667
+K A+ K+ H++I + Y + + GR RWQ LS
Sbjct: 593 NDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNY--KGDSKLGRNIRWQAWLS 650
Query: 1668 EY--DIEY 1673
Y D+E+
Sbjct: 651 HYSFDVEH 658
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 674
Score = 127 bits (319), Expect = 4e-28
Identities = 115/428 (26%), Positives = 192/428 (43%), Gaps = 30/428 (7%)
Query: 1273 VKQKLRRSHPDMALKIK---------EEVRKQI----DAGFLVTSEYPQWLANIV---PV 1316
+K ++ S P A+K+K EE KQI D + S+ P +
Sbjct: 229 MKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEA 288
Query: 1317 PKKDGKIRMCVDYRDLNKASPKDNFPLPHIDVLVDNTAKCKVFSFMDGFSGYNQIRMAPE 1376
K+ GK RM V+Y+ +NKA+ D + P+ D L+ K+FS D SG+ Q+ + E
Sbjct: 289 EKRRGKKRMVVNYKAMNKATVGDAYNPPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQE 348
Query: 1377 DREKTSFITPWGAFCYVVMPFGLINAGATYQRGMTKIFHDMIHKEIEVYVDDMIVKSGTE 1436
R T+F P G + + V+PFGL A + +QR M + F + K VYVDD++V S E
Sbjct: 349 SRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFR-VFRKFCCVYVDDILVFSNNE 407
Query: 1437 EEHVEYLLKMFQRLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP- 1495
E+H+ ++ + Q+ ++ + L+ K + LG + + + + I + P
Sbjct: 408 EDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPD 467
Query: 1496 VPKTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKDQGVKWNDDCQKAFDQIKEYLL 1555
+ +KQ++ FLG L Y S +I + P+ L+++ KW + ++K+ L
Sbjct: 468 TLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQ 527
Query: 1556 EPPILVPPVDGRPLIMYLTVLEDSMGCVLG--QQDETGKKEHVIYYLSKKFTDCESRYSV 1613
P L P+ LI+ +D G +L + +E E + Y S F E Y
Sbjct: 528 GFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKNYHS 587
Query: 1614 LEKTCCALAWAAKRLR------HYMINHTTWLISKMDPIKYIFEKPALTGRIARWQMLLS 1667
+K A+ K+ H++I + Y + + GR RWQ LS
Sbjct: 588 NDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNY--KGDSKLGRNIRWQAWLS 645
Query: 1668 EY--DIEY 1673
Y D+E+
Sbjct: 646 HYSFDVEH 653
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 680
Score = 125 bits (314), Expect = 1e-27
Identities = 114/428 (26%), Positives = 193/428 (44%), Gaps = 30/428 (7%)
Query: 1273 VKQKLRRSHPDMALKIK---------EEVRKQI----DAGFLVTSEYPQWLANIVPVPKK 1319
+K ++ S P A+K+K EE KQI D + S+ P + +
Sbjct: 235 MKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEA 294
Query: 1320 D---GKIRMCVDYRDLNKASPKDNFPLPHIDVLVDNTAKCKVFSFMDGFSGYNQIRMAPE 1376
+ G RM V+Y+ +NKA+ D + LP+ D L+ K+FS D SG+ Q+ + E
Sbjct: 295 ENGRGNKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQE 354
Query: 1377 DREKTSFITPWGAFCYVVMPFGLINAGATYQRGMTKIFHDMIHKEIEVYVDDMIVKSGTE 1436
R T+F P G + + V+PFGL A + +QR M + F + K VYVDD++V S E
Sbjct: 355 SRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFR-VFRKFCCVYVDDIVVFSNNE 413
Query: 1437 EEHVEYLLKMFQRLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP- 1495
E+H+ ++ + Q+ ++ + L+ K + LG + + + + I + P
Sbjct: 414 EDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPD 473
Query: 1496 VPKTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKDQGVKWNDDCQKAFDQIKEYLL 1555
+ +KQ++ FLG L Y S +I ++ P+ L+++ KW + ++K+ L
Sbjct: 474 TLEDKKQLQRFLGILTYASDYIPNLAQMRQPLQAKLKENVPWKWTKEDTLYMQKVKKNLQ 533
Query: 1556 EPPILVPPVDGRPLIMYLTVLEDSMGCVLG--QQDETGKKEHVIYYLSKKFTDCESRYSV 1613
P L P+ LI+ +D G +L + +E E + Y S F E Y
Sbjct: 534 GFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYRSGSFKAAERNYHS 593
Query: 1614 LEKTCCALAWAAKRLR------HYMINHTTWLISKMDPIKYIFEKPALTGRIARWQMLLS 1667
+K A+ K+ H++I + Y + + GR RWQ LS
Sbjct: 594 NDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNY--KGDSKLGRNIRWQAWLS 651
Query: 1668 EY--DIEY 1673
Y D+E+
Sbjct: 652 HYSFDVEH 659
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 659
Score = 125 bits (313), Expect = 2e-27
Identities = 119/467 (25%), Positives = 198/467 (41%), Gaps = 40/467 (8%)
Query: 1247 WSYKDMPGLDPDVVEHRLPLKPECPPVKQKLRRSHPDMALK-IKEEVRKQIDAGFLVTSE 1305
W + +DP V P+ ++ R+ + LK IK + FLV +E
Sbjct: 215 WMTATIELIDPKTVVKVKPMSYSPSDREEFDRQIKELLELKVIKPSKSTHMSPAFLVENE 274
Query: 1306 YPQWLANIVPVPKKDGKIRMCVDYRDLNKASPKDNFPLPHIDVLVDNTAKCKVFSFMDGF 1365
++ GK RM V+Y+ +NKA+ D LP+ D L+ K++S D
Sbjct: 275 ----------AERRRGKKRMVVNYKAMNKATKGDAHNLPNKDELLTLVRGKKIYSSFDCK 324
Query: 1366 SGYNQIRMAPEDREKTSFITPWGAFCYVVMPFGLINAGATYQRGMTKIFHDMIHKEIEVY 1425
SG Q+ + E + T+F P G + + V+PFGL A + + + + K VY
Sbjct: 325 SGLWQVLLDKESQLLTAFTCPQGHYQWNVVPFGLKQAPSIFPKTYANSHSNQYSKYCCVY 384
Query: 1426 VDDMIVKSGT-EEEHVEYLLKMFQRLRKYKLRLNPNKCTFGVRSGKLLGFIVSQ------ 1478
VDD++V S T +EH ++L + +R K + L+ K LG + Q
Sbjct: 385 VDDILVFSNTGRKEHYIHVLNILRRCEKLGIILSKKKAQLFKEKINFLGLEIDQGTHCPQ 444
Query: 1479 ----KGIEVDPDKVRAIREMPVPKTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKD 1534
+ I PD++ + +KQ++ FLG L Y S +I + + P+ L++D
Sbjct: 445 NHILEHIHKFPDRI---------EDKKQLQRFLGILTYASDYIPKLASIRKPLQSKLKED 495
Query: 1535 QGVKWNDDCQKAFDQIKEYLLEPPILVPPVDGRPLIMYLTVLEDSMGCVLGQQDETGKKE 1594
WND + +IK+ L P L P L++ E+ G +L + E
Sbjct: 496 STWTWNDTDSQYMAKIKKNLKSFPKLYHPEPNDKLVIETDASEEFWGGIL--KAIHNSHE 553
Query: 1595 HVIYYLSKKFTDCESRYSVLEKTCCALAWAAKRLRHYMINHTTWLISKMDP-----IKYI 1649
++ Y S F E Y EK A+ K+ Y + + +LI + +
Sbjct: 554 YICRYASGSFKAAERNYHSNEKELLAVIRVIKKFSIY-LTPSRFLIRTDNKNFTHFVNIN 612
Query: 1650 FEKPALTGRIARWQMLLSEYDIEYRTQKAVKGSILAEHLAHQPIEDY 1696
+ GR+ RWQM LS+YD + K ++ A+ L + +Y
Sbjct: 613 LKGDRKQGRLVRWQMWLSQYDFDVEHIAGTK-NVFADFLQENTLTNY 658
>YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II
Length = 1268
Score = 120 bits (300), Expect = 6e-26
Identities = 77/233 (33%), Positives = 124/233 (53%), Gaps = 7/233 (3%)
Query: 1288 IKEEVRKQIDAGFLVTSEYPQWLANIVPVPKKD-GKIRMCVDYR--DLNKASPKDNFPLP 1344
++ E+ + + G +V Y +W A IV + KK GKIR+C D++ LN A + PLP
Sbjct: 456 VETELNRLQEMGVIVPITYAKWAAPIVVIKKKGTGKIRVCADFKCSGLNAALKDEFHPLP 515
Query: 1345 HIDVLVDNTAKCKVFSFMDGFSGYNQIRMAPEDREKTSFITPWGAFCYVVMPFGLINAGA 1404
+ + K V+S +D Y Q+ + E ++ T G F Y+ M FGL A A
Sbjct: 516 TSEDIFSRL-KGTVYSQIDLKDAYLQVELDEEAQKLAVINTHRGIFKYLRMTFGLKPAPA 574
Query: 1405 TYQRGMTKIFHDMIHKEIEVYVDDMIVKSGTEEEHVEYLLKMFQRLRKYKLRLNPNKCTF 1464
++Q+ M K+ + + VY DD+I+ + + EEH + L ++F+R ++Y R++ KC F
Sbjct: 575 SFQKIMDKMVSGLTG--VAVYWDDIIISASSIEEHEKILRELFERFKEYGFRVSAEKCAF 632
Query: 1465 GVRSGKLLGFIVSQKGIEVDPDKVRAIREMPVPKTEKQVRGFLGRLNYISRFI 1517
+ LGF V + G D K AIR M P +KQ+ FLG +++SR +
Sbjct: 633 AQKQVTFLGF-VDEHGRRPDSKKTEAIRSMKAPTDQKQLASFLGAADWLSRMM 684
Score = 84.0 bits (206), Expect = 5e-15
Identities = 66/260 (25%), Positives = 122/260 (46%), Gaps = 24/260 (9%)
Query: 1943 VDKHEAEKLMCEIHEGSFGTHSCGHAMAKKILRAGYYWITMHADCYNHAKRCHKCQIYAD 2002
V K + ++ ++HEG G K+ R+ +W + +D N + C+ CQ +
Sbjct: 778 VPKSLQKIVLKQLHEGHPGI-----VQMKQKARSFVFWRGLDSDIENMVRHCNNCQENSK 832
Query: 2003 KIHIPPSMLNVISSPWPF--SMWG---IDMIGRIEPKASNGHRFILVAIDYFTKWVEAAS 2057
+ P +PWP + W ID G + NG ++LV +D TK+ E
Sbjct: 833 MPRVVPL------NPWPVPEAPWKRIHIDFAGPL-----NGC-YLLVVVDAKTKYAEVKL 880
Query: 2058 YANVTKQVVVRFIKNNLISRYGVPNRIITDNGTNLNNNMMKELCDDFKIQHHNSSPYRPQ 2117
+++ + ++ + S +G P II+DNGT L +++ ++C I+H S+ Y P+
Sbjct: 881 TRSISAVTTIDLLEE-IFSIHGYPETIISDNGTQLTSHLFAQMCQSHGIEHKTSAVYYPR 939
Query: 2118 MNGAVEAANKNIKKIIQKMVVTYKDWHEMLPYALYGYRTSVRTS-TGATPFSLVYGMEAV 2176
NGA E +K+ I K+ ++L L YR + ++ G+TP +G +
Sbjct: 940 SNGAAERFVDTLKRGIAKIKGEGSVNQQILNKFLISYRNTPHSALNGSTPAECHFGRKIR 999
Query: 2177 LPVEVEIPSLRVLMEAELSE 2196
+ + +P+ RVL +L++
Sbjct: 1000 TTMSLLMPTDRVLKVPKLTQ 1019
>POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;
Protease (EC 3.4.23.-); Reverse transcriptase (EC
2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1886
Score = 119 bits (297), Expect = 1e-25
Identities = 95/339 (28%), Positives = 162/339 (47%), Gaps = 22/339 (6%)
Query: 1255 LDPDVVEHRLPLKPECPPVKQKLRRSHPDMALKIK---EEVRKQIDAGFLVTSEYPQWLA 1311
++PD+ P+K P ++ + R ++ L++K K F+V S
Sbjct: 1396 INPDIKIMGRPIKHVTPGDEEAMTRQI-NLLLQMKVIRPSESKHRSTAFIVRSG-----T 1449
Query: 1312 NIVPVPKKD--GKIRMCVDYRDLNKASPKDNFPLPHIDVLVDNTAKCKVFSFMDGFSGYN 1369
I P+ K+ GK RM +Y+ LN+ + D + LP I+ ++ + K++S D SG+
Sbjct: 1450 EIDPITGKEKKGKERMVFNYKLLNENTESDQYSLPGINTIISKVGRSKIYSKFDLKSGFW 1509
Query: 1370 QIRMAPEDREKTSFITPWGAFCYVVMPFGLINAGATYQRGMTKIFHDMIHKEIEVYVDDM 1429
Q+ M E T+F+ + ++VMPFGL NA A +QR M +F K I VY+DD+
Sbjct: 1510 QVAMEEESVPWTAFLAGNKLYEWLVMPFGLKNAPAIFQRKMDNVFKG-TEKFIAVYIDDI 1568
Query: 1430 IVKSGTEEEHVEYLLKMFQRLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVR 1489
+V S T E+H ++L M Q ++ L L+P K G LG + I++ P +
Sbjct: 1569 LVFSETAEQHSQHLYTMLQLCKENGLILSPTKMKIGTPEIDFLGASLGCTKIKLQPHIIS 1628
Query: 1490 AIREMPVPK--TEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKDQGVKWNDDCQKAF 1547
I + K T + +R +LG L+Y +I + P+ + + + N + K
Sbjct: 1629 KICDFSDEKLATPEGMRSWLGILSYARNYIQDIGKLVQPLRQKMAPTGDKRMNPETWKMV 1688
Query: 1548 DQIKEYLLE-PPILVPPVDGRPLIMYLTVLEDSMGCVLG 1585
QIKE + P + +PP D ++ ++ GC+ G
Sbjct: 1689 RQIKEKVKNLPDLQLPPKDS-------FIIIETDGCMTG 1720
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 666
Score = 112 bits (280), Expect = 1e-23
Identities = 106/409 (25%), Positives = 184/409 (44%), Gaps = 15/409 (3%)
Query: 1290 EEVRKQIDAGFLVTSEY----PQWLANIVPVPKKDGKIRMCVDYRDLNKASPKDNFPLPH 1345
+++++ +D G ++ S+ P +L ++ GK RM V+Y+ +N+A+ D+ LP+
Sbjct: 257 KQIKELLDLGLIIPSKSQHMSPAFLVEN-EAERRRGKKRMVVNYKAINQATIGDSHNLPN 315
Query: 1346 IDVLVDNTAKCKVFSFMDGFSGYNQIRMAPEDREKTSFITPWGAFCYVVMPFGLINAGAT 1405
+ L+ +FS D SG+ Q+ + E ++ T+F P G F + V+PFGL A +
Sbjct: 316 MQELLTLLRGKSIFSSFDCKSGFWQVVLDEESQKLTAFTCPQGHFQWKVVPFGLKQAPSI 375
Query: 1406 YQRGMTKIFHDMIHKEIEVYVDDMIVKSGTEEEHVEYLLKMFQRLRKYKLRLNPNKCTFG 1465
+QR M + K VYVDD+IV S +E +H ++ + + + KY + L+ K
Sbjct: 376 FQRHMQTALNG-ADKFCMVYVDDIIVFSNSELDHYNHVYAVLKIVEKYGIILSKKKANLF 434
Query: 1466 VRSGKLLGFIVSQKGIEVDPDKV-RAIREMPVP-KTEKQVRGFLGRLNYISRFISHMTAT 1523
LG + KG + + I + P + +K ++ FLG L Y +I +
Sbjct: 435 KEKINFLGLEI-DKGTHCPQNHILENIHKFPDRLEDKKHLQRFLGVLTYAETYIPKLAEI 493
Query: 1524 CGPIFKLLRKDQGVKWNDDCQKAFDQIKEYLLEPPILVPPVDGRPLIMYLTVLEDSMGCV 1583
P+ L+KD W +IK+ L P L P LI+ + G V
Sbjct: 494 RKPLQVKLKKDVTWNWTQSDSDYVKKIKKNLGSFPKLYLPKPEDHLIIETDASDSFWGGV 553
Query: 1584 LGQQDETGKKEHVIYYLSKKFTDCESRYSVLEKTCCALAWAAKRLRHYM--INHTTWLIS 1641
L + G E + Y S F E Y +K A+ + Y+ + T +
Sbjct: 554 LKARALDG-VELICRYSSGSFKQAEKNYHSNDKELLAVKQVITKFSAYLTPVRFTVRTDN 612
Query: 1642 KMDP--IKYIFEKPALTGRIARWQMLLSEYDIEYRTQKAVKGSILAEHL 1688
K ++ + + GR+ RWQ S+Y + + VK ++LA+ L
Sbjct: 613 KNFTYFLRINLKGDSKQGRLVRWQNWFSKYQFDVEHLEGVK-NVLADCL 660
>POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat
protein; Protease (EC 3.4.23.-); Reverse transcriptase
(EC 2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1675
Score = 107 bits (266), Expect = 5e-22
Identities = 70/265 (26%), Positives = 133/265 (49%), Gaps = 4/265 (1%)
Query: 1322 KIRMCVDYRDLNKASPKDNFPLPHIDVLVDNTAKCKVFSFMDGFSGYNQIRMAPEDREKT 1381
K R+ +Y+ LN D F +PH +++ K +FS D +G++ +++ + ++ T
Sbjct: 1238 KPRIVYNYKRLNDNMHTDPFNIPHKISMINLIQKANIFSKFDLKAGFHHMKLKDDFKDWT 1297
Query: 1382 SFITPWGAFCYVVMPFGLINAGATYQRGMTKIFHDMIHKEIEVYVDDMIVKSGTEEEHVE 1441
+F G + + V PFG+ NA +QR M + F D+ K +Y+DD+++ S E+EH+E
Sbjct: 1298 TFTCSEGLYTWNVCPFGIANAPCAFQRFMQESFGDL--KFALLYIDDILIASNNEKEHIE 1355
Query: 1442 YLLKMFQRLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPVPK--T 1499
+L F R+++ L+ K ++ + LG + + I + P V I++ K T
Sbjct: 1356 HLKIFFNRVKEVGCVLSKKKSKMFLKEVEYLGVEIKEGKISLQPHIVDKIKKFDKNKLNT 1415
Query: 1500 EKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKDQGVKWNDDCQKAFDQIKEYLLEPPI 1559
K ++ +LG LNY +I ++ GP++K K+ +N + +I+ + +
Sbjct: 1416 LKGLQAYLGLLNYARGYIKDLSKLVGPLYKKTGKNGQRIFNKEDWNIIFKIEREVSKIKP 1475
Query: 1560 LVPPVDGRPLIMYLTVLEDSMGCVL 1584
L P + +I+ E+ G VL
Sbjct: 1476 LERPKETDYIIIETDASEEGWGAVL 1500
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.319 0.136 0.411
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 289,493,528
Number of Sequences: 164201
Number of extensions: 13560576
Number of successful extensions: 50967
Number of sequences better than 10.0: 563
Number of HSP's better than 10.0 without gapping: 231
Number of HSP's successfully gapped in prelim test: 359
Number of HSP's that attempted gapping in prelim test: 42171
Number of HSP's gapped (non-prelim): 3545
length of query: 2305
length of database: 59,974,054
effective HSP length: 126
effective length of query: 2179
effective length of database: 39,284,728
effective search space: 85601422312
effective search space used: 85601422312
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 74 (33.1 bits)
Medicago: description of AC146664.7