
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC147714.3 - phase: 0 /pseudo
(1586 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 204 1e-51
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 192 6e-48
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 191 1e-47
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 190 3e-47
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 181 1e-44
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 174 2e-42
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 172 9e-42
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 172 9e-42
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 171 1e-41
YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II 120 2e-26
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 120 3e-26
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 119 5e-26
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 119 9e-26
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 115 1e-24
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 114 2e-24
POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse transcript... 110 3e-23
POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat pr... 109 7e-23
POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;... 103 5e-21
POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC 3.4.23... 100 3e-20
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot... 100 4e-20
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 204 bits (520), Expect = 1e-51
Identities = 133/428 (31%), Positives = 218/428 (50%), Gaps = 21/428 (4%)
Query: 504 KRKIFQLL*EYPDIFAWSYEDMPGLD-PKIVEHRIPTKPECPPVRQKLRRTHPDM-ALKI 561
K+++ LL +Y DI Y + L +H I TK P + ++P ++
Sbjct: 170 KQRLCALLQKYHDI---QYHEGDKLTFTNQTKHTINTKHNLPLYS---KYSYPQAYEQEV 223
Query: 562 KSEV*KQIDAGFLMTIEYPEWVANIVPVPKKDG-----KVRMCVDFRDLNKASPKDNFPL 616
+S++ ++ G + T P + + I VPKK K R+ +D+R LN+ + D P+
Sbjct: 224 ESQIQDMLNQGIIRTSNSP-YNSPIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDRHPI 282
Query: 617 PHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSFITPWGTFCYKVMPFGLINAG 676
P++D ++ + F+ +D G++QI+M PE KT+F T G + Y MPFGL NA
Sbjct: 283 PNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKNAP 342
Query: 677 ATYQRGMTTLFHDMIHKEVEVYVDDMIVKSKDEEQHVEYLTKMFERLRKYKLRLNPNKCT 736
AT+QR M + +++K VY+DD+IV S ++H++ L +FE+L K L+L +KC
Sbjct: 343 ATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDKCE 402
Query: 737 FGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPQTEKQVRGFLRRLNYISRFISHMTAT 796
F + LG +++ GI+ +P+K+ AI++ P P K+++ FL Y +FI +
Sbjct: 403 FLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTGYYRKFIPNFADI 462
Query: 797 CGPIFKLLRKNQPI-VWNDECQGAFDSIKNYLLEPPILVPPMEGKPLIMYLSVFDESVGC 855
P+ K L+KN I N E AF +K + E PIL P K + D ++G
Sbjct: 463 AKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLTTDASDVALGA 522
Query: 856 VLGQQDETGKKEHAIYYLSKKFTDCETRYMMLEKTCCAPAWAAKRLRHYLVNHTTWLISR 915
VL Q H + Y+S+ + E Y +EK A WA K RHYL+ + S
Sbjct: 523 VLSQDG------HPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGRHFEISSD 576
Query: 916 MDPIKYIF 923
P+ +++
Sbjct: 577 HQPLSWLY 584
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 192 bits (488), Expect = 6e-48
Identities = 115/409 (28%), Positives = 210/409 (51%), Gaps = 10/409 (2%)
Query: 500 EEGVKRKIFQLL*EYPDIFAWSYEDMPGLDPKIVEHRIPTKPECPPVRQKLRRTHPDMAL 559
E G RKI+ ++ ++ D+FA S +++ E I K P+RQK R +
Sbjct: 899 ENGDDRKIWDVIEQFQDVFAISDDELGRNSG--TECVIELKEGAEPIRQKPRPIPLALKP 956
Query: 560 KIKSEV*KQIDAGFLMTIEYPEWVANIVPVPKKDGKVRMCVDFRDLNKASPKDNFPLPHI 619
+I+ + K ++ + + P W + +V V KKDG +RMC+D+R +NK + PLP+I
Sbjct: 957 EIRKMIQKMLNQKVIRESKSP-WSSPVVLVKKKDGSIRMCIDYRKVNKVVKNNAHPLPNI 1015
Query: 620 DVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSFITPWGTFCYKVMPFGLINAGATY 679
+ + + A K+++ D +G+ QI + + +E T+F F + V+PFGL+ + A +
Sbjct: 1016 EATLQSLAGKKLYTVFDMIAGFWQIPLDEKSKEITAFAIGSELFEWNVLPFGLVISPALF 1075
Query: 680 QRGMTTLFHDMIHKEVEVYVDDMIVKSKDEEQHVEYLTKMFERLRKYKLRLNPNKCTFGV 739
Q M + D++ VYVDD+++ SKD EQH++ + + R+RK ++L +KC
Sbjct: 1076 QGTMEEIIGDLLGVCAFVYVDDLLIASKDMEQHLQDVKEALTRIRKSGMKLRASKCHIAK 1135
Query: 740 RSGKLLGFIVSQKGIEVDPDKVRAIREMPAPQTEKQVRGFLRRLNYISRFISHMTATCGP 799
+ + LG V+ G+E K +++ P K+++ FL + Y +FI +
Sbjct: 1136 KEVEYLGHKVTLDGVETQEVKTDKMKQFSRPTNVKELQSFLGLVGYYRKFILNFAQIASS 1195
Query: 800 IFKLLRKNQPIVWNDECQGAFDSIKNYLLEPPILVPP-----MEG-KPLIMYLSVFDESV 853
+ L+ +W E + AF +K + + P+L P ++G +P ++Y + +
Sbjct: 1196 LTSLISAKVAWIWEKEQEIAFQELKKLVCQTPVLAQPDVEAALKGDRPFMIYTDASRKGI 1255
Query: 854 GCVLGQQDETGKKEHAIYYLSKKFTDCETRYMMLEKTCCAPAWAAKRLR 902
G VL Q+ G ++H I + SK + ETRY + + A +A +R +
Sbjct: 1256 GAVLAQEGPDG-QQHPIAFASKALSPAETRYHITDLEALAMMFALRRFK 1303
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 191 bits (485), Expect = 1e-47
Identities = 129/426 (30%), Positives = 209/426 (48%), Gaps = 12/426 (2%)
Query: 504 KRKIFQLL*EYPDIFAWSYEDMPGLDPKIVEHRIPTKPECPPVRQKLRRTHPDMALKIKS 563
K ++ + EY DIFA E P + + ++ K + P + R H + +I++
Sbjct: 276 KSQLENICSEYIDIFA--LESEPITVNNLYKQQLRLKDDEPVYTKNYRSPHSQVE-EIQA 332
Query: 564 EV*KQIDAGFLMTIEYPEWVANIVPVPKKDG------KVRMCVDFRDLNKASPKDNFPLP 617
+V K I ++ ++ + ++ VPKK K R+ +D+R +NK D FPLP
Sbjct: 333 QVQKLIKDK-IVEPSVSQYNSPLLLVPKKSSPNSDKKKWRLVIDYRQINKKLLADKFPLP 391
Query: 618 HIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSFITPWGTFCYKVMPFGLINAGA 677
ID ++D ++K FS +D SG++QI++ R+ TSF T G++ + +PFGL A
Sbjct: 392 RIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTSNGSYRFTRLPFGLKIAPN 451
Query: 678 TYQRGMTTLFHDMIHKEVEVYVDDMIVKSKDEEQHVEYLTKMFERLRKYKLRLNPNKCTF 737
++QR MT F + + +Y+DD+IV E+ ++ LT++F + R+Y L+L+P KC+F
Sbjct: 452 SFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEVFGKCREYNLKLHPEKCSF 511
Query: 738 GVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPQTEKQVRGFLRRLNYISRFISHMTATC 797
+ LG + KGI D K I+ P P R F+ NY RFI +
Sbjct: 512 FMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRFVAFCNYYRRFIKNFADYS 571
Query: 798 GPIFKLLRKNQPIVWNDECQGAFDSIKNYLLEPPILVPPMEGKPLIMYLSVFDESVGCVL 857
I +L +KN P W DECQ AF +K+ L+ P +L P K + ++ G VL
Sbjct: 572 RHITRLCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSKEFCITTDASKQACGAVL 631
Query: 858 GQQDETGKKEHAIYYLSKKFTDCETRYMMLEKTCCAPAWAAKRLRHYLVNHTTWLISRMD 917
Q + + Y S+ FT E+ E+ A WA R Y+ + +
Sbjct: 632 TQNH--NGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYIYGKHFTVKTDHR 689
Query: 918 PIKYIF 923
P+ Y+F
Sbjct: 690 PLTYLF 695
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 190 bits (482), Expect = 3e-47
Identities = 120/409 (29%), Positives = 204/409 (49%), Gaps = 18/409 (4%)
Query: 533 VEHRIPTKPECPPVRQK--LRRTHPDMALKIKSEV*KQIDAGFLMT----IEYPEWVANI 586
++H + T P ++ L +TH ++++++V + ++ G + P WV
Sbjct: 195 IKHVLNTTHNSPIYSKQYPLAQTHE---IEVENQVQEMLNQGLIRESNSPYNSPTWVVPK 251
Query: 587 VPVPKKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKM 646
P K R+ +D+R LN+ + D +P+P++D ++ + + F+ +D G++QI+M
Sbjct: 252 KPDASGANKYRVVIDYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEM 311
Query: 647 SPEDREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKS 706
E KT+F T G + Y MPFGL NA AT+QR M + +++K VY+DD+I+ S
Sbjct: 312 DEESISKTAFSTKSGHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFS 371
Query: 707 KDEEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIRE 766
+H+ + +F +L L+L +KC F + LG IV+ GI+ +P KV+AI
Sbjct: 372 TSLTEHLNSIQLVFTKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVS 431
Query: 767 MPAPQTEKQVRGFLRRLNYISRFISHMTATCGPIFKLLRKNQPI-VWNDECQGAFDSIKN 825
P P +K++R FL Y +FI + P+ L+K I E AF+ +K
Sbjct: 432 YPIPTKDKEIRAFLGLTGYYRKFIPNYADIAKPMTSCLKKRTKIDTQKLEYIEAFEKLKA 491
Query: 826 YLLEPPILVPPMEGKPLIMYLSVFDESVGCVLGQQDETGKKEHAIYYLSKKFTDCETRYM 885
++ PIL P K ++ + ++G VL Q H I ++S+ D E Y
Sbjct: 492 LIIRDPILQLPDFEKKFVLTTDASNLALGAVLSQNG------HPISFISRTLNDHELNYS 545
Query: 886 MLEKTCCAPAWAAKRLRHYLVNHTTWLISRMDPIKYI--FEKPAVLLGR 932
+EK A WA K RHYL+ + S P++++ ++P L R
Sbjct: 546 AIEKELLAIVWATKTFRHYLLGRQFLIASDHQPLRWLHNLKEPGAKLER 594
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 181 bits (460), Expect = 1e-44
Identities = 128/424 (30%), Positives = 201/424 (47%), Gaps = 28/424 (6%)
Query: 501 EGVKRKIFQLL*EYPDIFAWSYEDMPGLDPKIVEHRIPTKPECPPVRQKLRRTHPDMALK 560
+G + + LL E+P IF P L VE + + +++P +
Sbjct: 82 DGTQEILNSLLGEFPRIFE------PPLSGMSVETAVKAEIRTNTQDPIYAKSYP-YPVN 134
Query: 561 IKSEV*KQIDA----GFLMTIEYPE----WVANIVPVPKKDGKVRMCVDFRDLNKASPKD 612
++ EV +QID G + P W+ P P + + RM VDF+ LN + D
Sbjct: 135 MRGEVERQIDELLQDGIIRPSNSPYNSPIWIVPKKPKPNGEKQYRMVVDFKRLNTVTIPD 194
Query: 613 NFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSFITPWGTFCYKVMPFGL 672
+P+P I+ + + +K F+ +D SG++QI M D KT+F T G + + +PFGL
Sbjct: 195 TYPIPDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPFGL 254
Query: 673 INAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSKDEEQHVEYLTKMFERLRKYKLRLNP 732
NA A +QR + + + I K VY+DD+IV S+D + H + L + L K L++N
Sbjct: 255 KNAPAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNL 314
Query: 733 NKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPQTEKQVRGFLRRLNYISRFISH 792
K F + LG+IV+ GI+ DP KVRAI EMP P + K+++ FL +Y +FI
Sbjct: 315 EKSHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYRKFIQD 374
Query: 793 MTATCGPIFKLLR-----------KNQPIVWNDECQGAFDSIKNYLLEPPILVPPMEGKP 841
P+ L R PI ++ +F+ +K+ L IL P KP
Sbjct: 375 YAKVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKP 434
Query: 842 LIMYLSVFDESVGCVLGQQDETGKKEHAIYYLSKKFTDCETRYMMLEKTCCAPAWAAKRL 901
+ + ++G VL Q D+ ++ I Y+S+ E Y +EK A W+ L
Sbjct: 435 FHLTTDASNWAIGAVLSQDDQ--GRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLDNL 492
Query: 902 RHYL 905
R YL
Sbjct: 493 RAYL 496
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 174 bits (441), Expect = 2e-42
Identities = 105/345 (30%), Positives = 178/345 (51%), Gaps = 9/345 (2%)
Query: 589 VPKKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSP 648
VPKK+G +RM VD++ LNK + +PLP I+ L+ S +F+ +D S Y+ I++
Sbjct: 455 VPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRK 514
Query: 649 EDREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSKD 708
D K +F P G F Y VMP+G+ A A +Q + T+ + V Y+DD+++ SK
Sbjct: 515 GDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKS 574
Query: 709 EEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP 768
E +HV+++ + ++L+ L +N KC F K +G+ +S+KG + + + +
Sbjct: 575 ESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWK 634
Query: 769 APQTEKQVRGFLRRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQGAFDSIKNYLL 828
P+ K++R FL +NY+ +FI + P+ LL+K+ W A ++IK L+
Sbjct: 635 QPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLV 694
Query: 829 EPPILVPPMEGKPLIMYLSVFDESVGCVLGQQDETGKKEHAIYYLSKKFTDCETRYMMLE 888
PP+L K +++ D +VG VL Q+ + K + + Y S K + + Y + +
Sbjct: 695 SPPVLRHFDFSKKILLETDASDVAVGAVLSQKHD-DDKYYPVGYYSAKMSKAQLNYSVSD 753
Query: 889 KTCCAPAWAAKRLRHYLVNHTTWLISRMDPIKYIFEKPAVLLGRL 933
K A + K RHYL S ++P K I L+GR+
Sbjct: 754 KEMLAIIKSLKHWRHYLE-------STIEPFK-ILTDHRNLIGRI 790
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 172 bits (435), Expect = 9e-42
Identities = 104/345 (30%), Positives = 179/345 (51%), Gaps = 9/345 (2%)
Query: 589 VPKKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSP 648
VPKK+G +RM VD++ LNK + +PLP I+ L+ S +F+ +D S Y+ I++
Sbjct: 455 VPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRK 514
Query: 649 EDREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSKD 708
D K +F P G F Y VMP+G+ A A +Q + T+ ++ V Y+D++++ SK
Sbjct: 515 GDEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKS 574
Query: 709 EEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP 768
E +HV+++ + ++L+ L +N KC F K +G+ +S+KG + + + +
Sbjct: 575 ESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWK 634
Query: 769 APQTEKQVRGFLRRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQGAFDSIKNYLL 828
P+ K++R FL +NY+ +FI + P+ LL+K+ W A ++IK L+
Sbjct: 635 QPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLV 694
Query: 829 EPPILVPPMEGKPLIMYLSVFDESVGCVLGQQDETGKKEHAIYYLSKKFTDCETRYMMLE 888
PP+L K +++ D +VG VL Q+ + K + + Y S K + + Y + +
Sbjct: 695 SPPVLRHFDFSKKILLETDASDVAVGAVLSQKHD-DDKYYPVGYYSAKMSKAQLNYSVSD 753
Query: 889 KTCCAPAWAAKRLRHYLVNHTTWLISRMDPIKYIFEKPAVLLGRL 933
K A + K RHYL S ++P K I L+GR+
Sbjct: 754 KEMLAIIKSLKHWRHYLE-------STIEPFK-ILTDHRNLIGRI 790
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 172 bits (435), Expect = 9e-42
Identities = 104/345 (30%), Positives = 179/345 (51%), Gaps = 9/345 (2%)
Query: 589 VPKKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSP 648
VPKK+G +RM VD++ LNK + +PLP I+ L+ S +F+ +D S Y+ I++
Sbjct: 455 VPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRK 514
Query: 649 EDREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSKD 708
D K +F P G F Y VMP+G+ A A +Q + T+ ++ V Y+D++++ SK
Sbjct: 515 GDEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKS 574
Query: 709 EEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP 768
E +HV+++ + ++L+ L +N KC F K +G+ +S+KG + + + +
Sbjct: 575 ESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWK 634
Query: 769 APQTEKQVRGFLRRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQGAFDSIKNYLL 828
P+ K++R FL +NY+ +FI + P+ LL+K+ W A ++IK L+
Sbjct: 635 QPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLV 694
Query: 829 EPPILVPPMEGKPLIMYLSVFDESVGCVLGQQDETGKKEHAIYYLSKKFTDCETRYMMLE 888
PP+L K +++ D +VG VL Q+ + K + + Y S K + + Y + +
Sbjct: 695 SPPVLRHFDFSKKILLETDASDVAVGAVLSQKHD-DDKYYPVGYYSAKMSKAQLNYSVSD 753
Query: 889 KTCCAPAWAAKRLRHYLVNHTTWLISRMDPIKYIFEKPAVLLGRL 933
K A + K RHYL S ++P K I L+GR+
Sbjct: 754 KEMLAIIKSLKHWRHYLE-------STIEPFK-ILTDHRNLIGRI 790
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 171 bits (433), Expect = 1e-41
Identities = 106/321 (33%), Positives = 167/321 (52%), Gaps = 18/321 (5%)
Query: 597 RMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSF 656
R+ +DFR LN+ + D +P+P I +++ N ++K F+ +D SGY+QI ++ DREKTSF
Sbjct: 238 RLVIDFRKLNEKTIPDRYPMPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSF 297
Query: 657 ITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSKDEEQHVEYL 716
G + + +PFGL NA + +QR + + + I K VYVDD+I+ S++E HV ++
Sbjct: 298 SVNGGKYEFCRLPFGLRNASSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHI 357
Query: 717 TKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPQTEKQV 776
+ + L +R++ K F S + LGFIVS+ G + DP+KV+AI+E P P +V
Sbjct: 358 DTVLKCLIDANMRVSQEKTRFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKV 417
Query: 777 RGFLRRLNYISRFISHMTATCGPIFKLLR-----------KNQPIVWNDECQGAFDSIKN 825
R FL +Y FI A PI +L+ K P+ +N+ + AF ++N
Sbjct: 418 RSFLGLASYYRVFIKDFAAIARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRN 477
Query: 826 YLL-EPPILVPPMEGKPLIMYLSVFDESVGCVLGQQDETGKKEHAIYYLSKKFTDCETRY 884
L E IL P KP + +G VL Q+ I +S+ E Y
Sbjct: 478 ILASEDVILKYPDFKKPFDLTTDASASGIGAVLSQEG------RPITMISRTLKQPEQNY 531
Query: 885 MMLEKTCCAPAWAAKRLRHYL 905
E+ A WA +L+++L
Sbjct: 532 ATNERELLAIVWALGKLQNFL 552
>YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II
Length = 1268
Score = 120 bits (302), Expect = 2e-26
Identities = 86/281 (30%), Positives = 143/281 (50%), Gaps = 12/281 (4%)
Query: 513 EYPDIFAWSYEDMPGLDPKIVEHRIPTKPECPPVRQKLRRTHPDMALKIKSEV*KQIDAG 572
++P++F +D GL K + T+ PV ++ R +++E+ + + G
Sbjct: 413 DFPEVF----KDGLGLCTK-EKAEFRTEENAVPVFKRARPVPYGSLEAVETELNRLQEMG 467
Query: 573 FLMTIEYPEWVANIVPVPKKD-GKVRMCVDFR--DLNKASPKDNFPLPHIDVLVDNTAQS 629
++ I Y +W A IV + KK GK+R+C DF+ LN A + PLP + + +
Sbjct: 468 VIVPITYAKWAAPIVVIKKKGTGKIRVCADFKCSGLNAALKDEFHPLPTSEDIFSRL-KG 526
Query: 630 KVFSFMDGFSGYNQIKMSPEDREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHD 689
V+S +D Y Q+++ E ++ T G F Y M FGL A A++Q+ M +
Sbjct: 527 TVYSQIDLKDAYLQVELDEEAQKLAVINTHRGIFKYLRMTFGLKPAPASFQKIMDKMVSG 586
Query: 690 MIHKEVEVYVDDMIVKSKDEEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIV 749
+ V VY DD+I+ + E+H + L ++FER ++Y R++ KC F + LGF V
Sbjct: 587 LTG--VAVYWDDIIISASSIEEHEKILRELFERFKEYGFRVSAEKCAFAQKQVTFLGF-V 643
Query: 750 SQKGIEVDPDKVRAIREMPAPQTEKQVRGFLRRLNYISRFI 790
+ G D K AIR M AP +KQ+ FL +++SR +
Sbjct: 644 DEHGRRPDSKKTEAIRSMKAPTDQKQLASFLGAADWLSRMM 684
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 120 bits (301), Expect = 3e-26
Identities = 125/493 (25%), Positives = 213/493 (42%), Gaps = 40/493 (8%)
Query: 438 FPLNFEFPVYEAEDEEGDDIPYEITRLLEQERKAIQPHQEEIELINLGTEENKREIKVGA 497
F N +PV+ A+ + E LE +K + Q E +N+ T + + +K A
Sbjct: 134 FTKNKSYPVHIAKLTRAVRVGTE--GFLESMKKRSKTQQPEP--VNISTNKIENPLKEIA 189
Query: 498 ALEEGVK---RKIF---QLL*EYPDIFAWSYEDMPGLDPKIVEHRIPTKPECPPVRQKLR 551
L EG + K+F Q + + ++ + P LDP + + + ++
Sbjct: 190 ILSEGRRLSEEKLFITQQRMQKIEELLEKVCSENP-LDPNKTKQWM---------KASIK 239
Query: 552 RTHPDMALKIK---------SEV*KQIDAGFLMTIEYPEWVANIVPV-------PKKDGK 595
+ P A+K+K E KQI + + P ++ P K+ GK
Sbjct: 240 LSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGK 299
Query: 596 VRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTS 655
RM V+++ +NKA+ D + LP+ D L+ K+FS D SG+ Q+ + E R T+
Sbjct: 300 KRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTA 359
Query: 656 FITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSKDEEQHVEY 715
F P G + + V+PFGL A + +QR M F + K VYVDD++V S +EE H+ +
Sbjct: 360 FTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFR-VFRKFCCVYVDDILVFSNNEEDHLLH 418
Query: 716 LTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP-APQTEK 774
+ + ++ ++ + L+ K + LG + + + + I + P + +K
Sbjct: 419 VAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKK 478
Query: 775 QVRGFLRRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQGAFDSIKNYLLEPPILV 834
Q++ FL L Y S +I + P+ L++N P W E +K L P L
Sbjct: 479 QLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLH 538
Query: 835 PPMEGKPLIMYLSVFDESVGCVLG--QQDETGKKEHAIYYLSKKFTDCETRYMMLEKTCC 892
P+ + LI+ D+ G +L + +E E Y S F E Y +K
Sbjct: 539 HPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNYHSNDKETL 598
Query: 893 APAWAAKRLRHYL 905
A K+ YL
Sbjct: 599 AVINTIKKFSIYL 611
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 119 bits (299), Expect = 5e-26
Identities = 121/482 (25%), Positives = 207/482 (42%), Gaps = 45/482 (9%)
Query: 458 PYEITRLLEQERKAIQPHQEEI---------ELINLGTEENKREIKVGAALEEGVK---R 505
P IT+L R I+ E + E +N+ T + + ++ A L EG +
Sbjct: 141 PVHITKLTRAVRVGIEGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSEGRRLSEE 200
Query: 506 KIF---QLL*EYPDIFAWSYEDMPGLDPKIVEHRIPTKPECPPVRQKLRRTHPDMALKIK 562
K+F Q + + ++ + P LDP + + + ++ + P A+K+K
Sbjct: 201 KLFITQQRMQKIEELLEKVCSENP-LDPNKTKQWM---------KASIKLSDPSKAIKVK 250
Query: 563 ---------SEV*KQIDAGFLMTIEYPEWVANIVPV-------PKKDGKVRMCVDFRDLN 606
E KQI + + P ++ P K+ GK RM V+++ +N
Sbjct: 251 PMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMN 310
Query: 607 KASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSFITPWGTFCYK 666
KA+ D + LP+ D L+ K+FS D SG+ Q+ + E R T+F P G + +
Sbjct: 311 KATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWN 370
Query: 667 VMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSKDEEQHVEYLTKMFERLRKY 726
V+PFGL A + +QR M F + K VYVDD++V S +EE H+ ++ + ++ ++
Sbjct: 371 VVPFGLKQAPSIFQRHMDEAFR-VFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQH 429
Query: 727 KLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP-APQTEKQVRGFLRRLNY 785
+ L+ K + LG + + + + I + P + +KQ++ FL L Y
Sbjct: 430 GIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTY 489
Query: 786 ISRFISHMTATCGPIFKLLRKNQPIVWNDECQGAFDSIKNYLLEPPILVPPMEGKPLIMY 845
S +I + P+ L++N P W E +K L P L P+ + LI+
Sbjct: 490 ASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIE 549
Query: 846 LSVFDESVGCVLG--QQDETGKKEHAIYYLSKKFTDCETRYMMLEKTCCAPAWAAKRLRH 903
D+ G +L + +E E Y S F E Y +K A K+
Sbjct: 550 TDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKFSI 609
Query: 904 YL 905
YL
Sbjct: 610 YL 611
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 119 bits (297), Expect = 9e-26
Identities = 124/493 (25%), Positives = 213/493 (43%), Gaps = 40/493 (8%)
Query: 438 FPLNFEFPVYEAEDEEGDDIPYEITRLLEQERKAIQPHQEEIELINLGTEENKREIKVGA 497
F N +PV+ A+ + E LE +K + Q E +N+ T + + ++ A
Sbjct: 134 FTKNKSYPVHIAKLTRAVRVGTE--GFLESMKKRSKTQQPEP--VNISTNKIENPLEEIA 189
Query: 498 ALEEGVK---RKIF---QLL*EYPDIFAWSYEDMPGLDPKIVEHRIPTKPECPPVRQKLR 551
L EG + K+F Q + + ++ + P LDP + + + ++
Sbjct: 190 ILSEGRRLSEEKLFITQQRMQKIEELLEKVCSENP-LDPNKTKQWM---------KASIK 239
Query: 552 RTHPDMALKIK---------SEV*KQIDAGFLMTIEYPEWVANIVPV-------PKKDGK 595
+ P A+K+K E KQI + + P ++ P K+ GK
Sbjct: 240 LSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGK 299
Query: 596 VRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTS 655
RM V+++ +NKA+ D + LP+ D L+ K+FS D SG+ Q+ + E R T+
Sbjct: 300 KRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTA 359
Query: 656 FITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSKDEEQHVEY 715
F P G + + V+PFGL A + +QR M F + K VYVDD++V S +EE H+ +
Sbjct: 360 FTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFR-VFRKFCCVYVDDILVFSNNEEDHLLH 418
Query: 716 LTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP-APQTEK 774
+ + ++ ++ + L+ K + LG + + + + I + P + +K
Sbjct: 419 VAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKK 478
Query: 775 QVRGFLRRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQGAFDSIKNYLLEPPILV 834
Q++ FL L Y S +I + P+ L++N P W E +K L P L
Sbjct: 479 QLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWRWTKEDTLYMQKVKKNLQGFPPLH 538
Query: 835 PPMEGKPLIMYLSVFDESVGCVLG--QQDETGKKEHAIYYLSKKFTDCETRYMMLEKTCC 892
P+ + LI+ D+ G +L + +E E Y S F E Y +K
Sbjct: 539 HPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKNYHSNDKETL 598
Query: 893 APAWAAKRLRHYL 905
A K+ YL
Sbjct: 599 AVINTIKKFSIYL 611
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 674
Score = 115 bits (288), Expect = 1e-24
Identities = 99/379 (26%), Positives = 168/379 (44%), Gaps = 20/379 (5%)
Query: 546 VRQKLRRTHPDMALKIK---------SEV*KQIDAGFLMTIEYPEWVANIVPV------- 589
++ ++ + P A+K+K E KQI + + P ++ P
Sbjct: 229 MKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEA 288
Query: 590 PKKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPE 649
K+ GK RM V+++ +NKA+ D + P+ D L+ K+FS D SG+ Q+ + E
Sbjct: 289 EKRRGKKRMVVNYKAMNKATVGDAYNPPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQE 348
Query: 650 DREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSKDE 709
R T+F P G + + V+PFGL A + +QR M F + K VYVDD++V S +E
Sbjct: 349 SRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFR-VFRKFCCVYVDDILVFSNNE 407
Query: 710 EQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP- 768
E H+ ++ + ++ ++ + L+ K + LG + + + + I + P
Sbjct: 408 EDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPD 467
Query: 769 APQTEKQVRGFLRRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQGAFDSIKNYLL 828
+ +KQ++ FL L Y S +I + P+ L++N P W E +K L
Sbjct: 468 TLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQ 527
Query: 829 EPPILVPPMEGKPLIMYLSVFDESVGCVLG--QQDETGKKEHAIYYLSKKFTDCETRYMM 886
P L P+ + LI+ D+ G +L + +E E Y S F E Y
Sbjct: 528 GFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKNYHS 587
Query: 887 LEKTCCAPAWAAKRLRHYL 905
+K A K+ YL
Sbjct: 588 NDKETLAVINTIKKFSIYL 606
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 680
Score = 114 bits (285), Expect = 2e-24
Identities = 122/493 (24%), Positives = 211/493 (42%), Gaps = 40/493 (8%)
Query: 438 FPLNFEFPVYEAEDEEGDDIPYEITRLLEQERKAIQPHQEEIELINLGTEENKREIKVGA 497
F N +PV+ A+ + E LE +K + Q E +N+ T + + ++ A
Sbjct: 135 FTKNKSYPVHIAKLTRAVRVGTE--GFLESMKKRSKTQQPEP--VNISTNKIENPLEEIA 190
Query: 498 ALEEGVK---RKIF---QLL*EYPDIFAWSYEDMPGLDPKIVEHRIPTKPECPPVRQKLR 551
L EG + K+F Q + + ++ + P LDP + + + ++
Sbjct: 191 ILSEGRRLSEEKLFITQQRMQKTEELLEKVCSENP-LDPNKTKQWM---------KASIK 240
Query: 552 RTHPDMALKIK---------SEV*KQIDAGFLMTIEYPEWVANIVPV-------PKKDGK 595
+ P A+K+K E KQI + + P ++ P G
Sbjct: 241 LSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAENGRGN 300
Query: 596 VRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTS 655
RM V+++ +NKA+ D + LP+ D L+ K+FS D SG+ Q+ + E R T+
Sbjct: 301 KRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTA 360
Query: 656 FITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSKDEEQHVEY 715
F P G + + V+PFGL A + +QR M F + K VYVDD++V S +EE H+ +
Sbjct: 361 FTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFR-VFRKFCCVYVDDIVVFSNNEEDHLLH 419
Query: 716 LTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP-APQTEK 774
+ + ++ ++ + L+ K + LG + + + + I + P + +K
Sbjct: 420 VAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKK 479
Query: 775 QVRGFLRRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQGAFDSIKNYLLEPPILV 834
Q++ FL L Y S +I ++ P+ L++N P W E +K L P L
Sbjct: 480 QLQRFLGILTYASDYIPNLAQMRQPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLH 539
Query: 835 PPMEGKPLIMYLSVFDESVGCVLG--QQDETGKKEHAIYYLSKKFTDCETRYMMLEKTCC 892
P+ + LI+ D+ G +L + +E E Y S F E Y +K
Sbjct: 540 HPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYRSGSFKAAERNYHSNDKETL 599
Query: 893 APAWAAKRLRHYL 905
A K+ YL
Sbjct: 600 AVINTIKKFSIYL 612
>POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)]
Length = 886
Score = 110 bits (275), Expect = 3e-23
Identities = 80/306 (26%), Positives = 150/306 (48%), Gaps = 8/306 (2%)
Query: 586 IVPVPKKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIK 645
+ PVPK DG+ RM +D+R++NK P H ++ + K + +D +G+
Sbjct: 5 VYPVPKPDGRWRMVLDYREVNKTIPLTAAQNQHSAGILATIVRQKYKTTLDLANGFWAHP 64
Query: 646 MSPEDREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVK 705
++PE T+F +C+ +P G +N+ A + + L ++ V+VYVDD+ +
Sbjct: 65 ITPESYWLTAFTWQGKQYCWTRLPQGFLNSPALFTADVVDLLKEI--PNVQVYVDDIYLS 122
Query: 706 SKDEEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIR 765
D ++HV+ L K+F+ L + ++ K G ++ + LGF ++++G + +
Sbjct: 123 HDDPKEHVQQLEKVFQILLQAGYVVSLKKSEIGQKTVEFLGFNITKEGRGLTDTFKTKLL 182
Query: 766 EMPAPQTEKQVRGFLRRLNYISRFISHMTATCGPIFKLL--RKNQPIVWNDECQGAFDSI 823
+ P+ KQ++ L LN+ FI + P++ L+ K + I W++E + +
Sbjct: 183 NITPPKDLKQLQSILGLLNFARNFIPNFAELVQPLYNLIASAKGKYIEWSEENTKQLNMV 242
Query: 824 KNYLLEPPILVPPMEGKPLIMYLSVFDESVGCVLGQQDETGKKEHAIYYLSKKFTDCETR 883
L L + + L++ ++ S G V +ETGKK I YL+ F+ E +
Sbjct: 243 IEALNTASNLEERLPEQRLVIKVNT-SPSAGYV-RYYNETGKK--PIMYLNYVFSKAELK 298
Query: 884 YMMLEK 889
+ MLEK
Sbjct: 299 FSMLEK 304
>POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat
protein; Protease (EC 3.4.23.-); Reverse transcriptase
(EC 2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1675
Score = 109 bits (272), Expect = 7e-23
Identities = 69/265 (26%), Positives = 133/265 (50%), Gaps = 4/265 (1%)
Query: 595 KVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKT 654
K R+ +++ LN D F +PH +++ ++ +FS D +G++ +K+ + ++ T
Sbjct: 1238 KPRIVYNYKRLNDNMHTDPFNIPHKISMINLIQKANIFSKFDLKAGFHHMKLKDDFKDWT 1297
Query: 655 SFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSKDEEQHVE 714
+F G + + V PFG+ NA +QR M F D+ K +Y+DD+++ S +E++H+E
Sbjct: 1298 TFTCSEGLYTWNVCPFGIANAPCAFQRFMQESFGDL--KFALLYIDDILIASNNEKEHIE 1355
Query: 715 YLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPQ--T 772
+L F R+++ L+ K ++ + LG + + I + P V I++ + T
Sbjct: 1356 HLKIFFNRVKEVGCVLSKKKSKMFLKEVEYLGVEIKEGKISLQPHIVDKIKKFDKNKLNT 1415
Query: 773 EKQVRGFLRRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQGAFDSIKNYLLEPPI 832
K ++ +L LNY +I ++ GP++K KN ++N E I+ + +
Sbjct: 1416 LKGLQAYLGLLNYARGYIKDLSKLVGPLYKKTGKNGQRIFNKEDWNIIFKIEREVSKIKP 1475
Query: 833 LVPPMEGKPLIMYLSVFDESVGCVL 857
L P E +I+ +E G VL
Sbjct: 1476 LERPKETDYIIIETDASEEGWGAVL 1500
>POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;
Protease (EC 3.4.23.-); Reverse transcriptase (EC
2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1886
Score = 103 bits (256), Expect = 5e-21
Identities = 77/275 (28%), Positives = 134/275 (48%), Gaps = 19/275 (6%)
Query: 591 KKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPED 650
+K GK RM +++ LN+ + D + LP I+ ++ +SK++S D SG+ Q+ M E
Sbjct: 1458 EKKGKERMVFNYKLLNENTESDQYSLPGINTIISKVGRSKIYSKFDLKSGFWQVAMEEES 1517
Query: 651 REKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSKDEE 710
T+F+ + + VMPFGL NA A +QR M +F K + VY+DD++V S+ E
Sbjct: 1518 VPWTAFLAGNKLYEWLVMPFGLKNAPAIFQRKMDNVFKG-TEKFIAVYIDDILVFSETAE 1576
Query: 711 QHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAP 770
QH ++L M + ++ L L+P K G LG + I++ P + I +
Sbjct: 1577 QHSQHLYTMLQLCKENGLILSPTKMKIGTPEIDFLGASLGCTKIKLQPHIISKICDFSDE 1636
Query: 771 Q--TEKQVRGFLRRLNYISRFISHMTATCGPIFKLL-----RKNQPIVWNDECQGAFDSI 823
+ T + +R +L L+Y +I + P+ + + ++ P W Q + +
Sbjct: 1637 KLATPEGMRSWLGILSYARNYIQDIGKLVQPLRQKMAPTGDKRMNPETWKMVRQ-IKEKV 1695
Query: 824 KNYLLEPPILVPPMEGKPLIMYLSVFDESVGCVLG 858
KN P + +PP + +I E+ GC+ G
Sbjct: 1696 KNL---PDLQLPPKDSFIII-------ETDGCMTG 1720
>POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC
3.4.23.-); Reverse transcriptase/ribonuclease H (EC
2.7.7.49) (EC 3.1.26.4) (RT); Integrase (IN)]
Length = 1165
Score = 100 bits (249), Expect = 3e-20
Identities = 95/392 (24%), Positives = 163/392 (41%), Gaps = 48/392 (12%)
Query: 542 ECPPVRQKLRRTHPDMALK-----------IKSEV*KQIDAGFLMTIEYPEWVANIVPVP 590
+ PPV +LR +A++ I+ + K +D G L+ P W ++PV
Sbjct: 161 QVPPVVVELRSGASPVAVRQYPMSKEAREGIRPHIQKFLDLGVLVPCRSP-WNTPLLPVK 219
Query: 591 KKD-GKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKV-FSFMDGFSGYNQIKMSP 648
K R D R++NK + +P+ L+ + S +S +D + +++ P
Sbjct: 220 KPGTNDYRPVQDLREINKRVQDIHPTVPNPYNLLSSLPPSYTWYSVLDLKDAFFCLRLHP 279
Query: 649 EDREKTSFITPW--------GTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEV--- 697
+ +F W G + +P G N+ TLF + +H+++
Sbjct: 280 NSQPLFAF--EWKDPEKGNTGQLTWTRLPQGFKNS--------PTLFDEALHRDLAPFRA 329
Query: 698 ---------YVDDMIVKSKDEEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFI 748
YVDD++V + E + K+ + L K R++ K R LG++
Sbjct: 330 LNPQVVLLQYVDDLLVAAPTYEDCKKGTQKLLQELSKLGYRVSAKKAQLCQREVTYLGYL 389
Query: 749 VSQKGIEVDPDKVRAIREMPAPQTEKQVRGFLRRLNYISRFISHMTATCGPIFKLLRKNQ 808
+ + + P + + ++P P T +QVR FL + +I + P++ L +++
Sbjct: 390 LKEGKRWLTPARKATVMKIPVPTTPRQVREFLGTAGFCRLWIPGFASLAAPLYPLTKESI 449
Query: 809 PIVWNDECQGAFDSIKNYLLEPPILVPPMEGKPLIMYLSVFDESVGCVLGQQDET-GKKE 867
P +W +E Q AFD IK LL P L P KP +Y+ DE G G +T G
Sbjct: 450 PFIWTEEHQQAFDHIKKALLSAPALALPDLTKPFTLYI---DERAGVARGVLTQTLGPWR 506
Query: 868 HAIYYLSKKFTDCETRYMMLEKTCCAPAWAAK 899
+ YLSKK + + K A A K
Sbjct: 507 RPVAYLSKKLDPVASGWPTCLKAVAAVALLLK 538
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 666
Score = 100 bits (248), Expect = 4e-20
Identities = 88/317 (27%), Positives = 137/317 (42%), Gaps = 5/317 (1%)
Query: 591 KKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPED 650
++ GK RM V+++ +N+A+ D+ LP++ L+ +FS D SG+ Q+ + E
Sbjct: 288 RRRGKKRMVVNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKSGFWQVVLDEES 347
Query: 651 REKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSKDEE 710
++ T+F P G F +KV+PFGL A + +QR M T + K VYVDD+IV S E
Sbjct: 348 QKLTAFTCPQGHFQWKVVPFGLKQAPSIFQRHMQTALNG-ADKFCMVYVDDIIVFSNSEL 406
Query: 711 QHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKV-RAIREMP- 768
H ++ + + + KY + L+ K LG + KG + + I + P
Sbjct: 407 DHYNHVYAVLKIVEKYGIILSKKKANLFKEKINFLGLEI-DKGTHCPQNHILENIHKFPD 465
Query: 769 APQTEKQVRGFLRRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQGAFDSIKNYLL 828
+ +K ++ FL L Y +I + P+ L+K+ W IK L
Sbjct: 466 RLEDKKHLQRFLGVLTYAETYIPKLAEIRKPLQVKLKKDVTWNWTQSDSDYVKKIKKNLG 525
Query: 829 EPPILVPPMEGKPLIMYLSVFDESVGCVLGQQDETGKKEHAIYYLSKKFTDCETRYMMLE 888
P L P LI+ D G VL + G E Y S F E Y +
Sbjct: 526 SFPKLYLPKPEDHLIIETDASDSFWGGVLKARALDG-VELICRYSSGSFKQAEKNYHSND 584
Query: 889 KTCCAPAWAAKRLRHYL 905
K A + YL
Sbjct: 585 KELLAVKQVITKFSAYL 601
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.346 0.153 0.549
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 171,384,975
Number of Sequences: 164201
Number of extensions: 6992072
Number of successful extensions: 24669
Number of sequences better than 10.0: 105
Number of HSP's better than 10.0 without gapping: 37
Number of HSP's successfully gapped in prelim test: 68
Number of HSP's that attempted gapping in prelim test: 24524
Number of HSP's gapped (non-prelim): 139
length of query: 1586
length of database: 59,974,054
effective HSP length: 124
effective length of query: 1462
effective length of database: 39,613,130
effective search space: 57914396060
effective search space used: 57914396060
T: 11
A: 40
X1: 15 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 38 (21.7 bits)
S2: 73 (32.7 bits)
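
The footer statistics above can be cross-checked against each other. The sketch below is not part of the BLAST output; it assumes the standard Karlin-Altschul relations (bit score = (lambda * S - ln K) / ln 2, E = search space * 2^-bits) and the usual length correction in which the effective HSP length is subtracted from the query and from every database sequence. The function names are hypothetical and chosen only for illustration; the numbers are copied from the report.

```python
import math

# Values copied from the report footer.
QUERY_LEN = 1586
DB_LETTERS = 59_974_054
DB_SEQS = 164_201
HSP_LEN = 124  # "effective HSP length"

# (lambda, K) pairs from the two footer tables.
UNGAPPED = (0.346, 0.153)
GAPPED = (0.267, 0.0410)


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective database length, product)."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw, lam, k):
    """Convert a raw alignment score to bits."""
    return (lam * raw - math.log(k)) / math.log(2)


def expect(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


eff_q, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN
)
print(eff_q, eff_db, space)  # 1462 39613130 57914396060

# Cutoffs S1 (ungapped parameters) and S2 (gapped parameters).
print(f"{bit_score(38, *UNGAPPED):.1f}")  # 21.7
print(f"{bit_score(73, *GAPPED):.1f}")    # 32.7

# Top hit POL3_DROME, raw score 520, gapped parameters.
top_bits = bit_score(520, *GAPPED)
print(top_bits, expect(top_bits, space))
```

The three lengths reproduce the footer exactly, and S1 and S2 reproduce the bit values printed beside them. For the top hit the recomputed values land close to, but not exactly on, the reported 204 bits and 1e-51, presumably because the footer prints lambda and K rounded to three significant digits.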
Medicago: description of AC147714.3