
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC145220.4 - phase: 0 /pseudo
(1391 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Sequences producing significant alignments:    Score (bits)    E value
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 231 1e-59
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 214 2e-54
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 207 2e-52
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 204 1e-51
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 197 1e-49
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 186 5e-46
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 183 2e-45
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 183 2e-45
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 178 8e-44
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 129 4e-29
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 129 7e-29
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 129 7e-29
YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II 126 4e-28
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 126 4e-28
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 124 1e-27
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot... 115 6e-25
POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat pr... 115 1e-24
POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse transcript... 113 3e-24
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot... 107 2e-22
POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC 3.4.23... 107 3e-22
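
The summary rows above all share one layout: entry name, accession in parentheses, a (possibly truncated) description, the bit score, and the E value. As a minimal sketch, assuming that layout holds for every row, the lines could be split into fields as follows. The names `Hit`, `HIT_LINE`, and `parse_hit_line` are hypothetical helpers introduced for illustration; they are not part of BLAST or of any BLAST parsing library.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    """One row of the 'Sequences producing significant alignments' table."""
    entry: str        # e.g. POL3_DROME
    accession: str    # e.g. P04323
    description: str  # free text, possibly truncated with "..."
    bits: float       # bit score
    evalue: float     # expectation value


# Assumed row layout: ENTRY (ACCESSION) description... BITS EVALUE
HIT_LINE = re.compile(
    r"^(?P<entry>\S+)\s+\((?P<acc>[A-Z0-9]+)\)\s+(?P<desc>.*?)\s+"
    r"(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)$"
)


def parse_hit_line(line: str) -> Optional[Hit]:
    """Return a Hit for a summary row, or None if the line is not a row."""
    m = HIT_LINE.match(line.strip())
    if m is None:
        return None
    ev = m.group("evalue")
    # Assumption: some BLAST versions print very small values as "e-100";
    # prefix a mantissa so float() accepts them.
    if ev.startswith("e"):
        ev = "1" + ev
    try:
        evalue = float(ev)
    except ValueError:
        return None
    return Hit(m.group("entry"), m.group("acc"), m.group("desc"),
               float(m.group("bits")), evalue)


# Three rows copied from the table above, plus one non-row line.
SAMPLE = """\
Sequences producing significant alignments: (bits) Value
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 231 1e-59
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 204 1e-51
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 186 5e-46
"""

hits = []
for raw in SAMPLE.splitlines():
    hit = parse_hit_line(raw)
    if hit is not None:
        hits.append(hit)

for hit in hits:
    print(hit.entry, hit.accession, hit.bits, hit.evalue)
```

The description is matched non-greedily and the score and E value are anchored to the end of the line, so numbers inside a description (such as "155 kDa") are not mistaken for the score.
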
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 231 bits (588), Expect = 1e-59
Identities = 148/473 (31%), Positives = 246/473 (51%), Gaps = 22/473 (4%)
Query: 585 KKKIIQLLREYPDIFAWSYEDMPGLD-PKIVEHRIPTKPECPPVRQKLRRTHPDM-ALKI 642
K+++ LL++Y DI Y + L +H I TK P + ++P ++
Sbjct: 170 KQRLCALLQKYHDI---QYHEGDKLTFTNQTKHTINTKHNLPLYS---KYSYPQAYEQEV 223
Query: 643 KSEVQKQIDAGFLMTVEYPEWVANIVPVPKKDG-----KVRMCVDFRDLNKASPKDNFPL 697
+S++Q ++ G + T P + + I VPKK K R+ +D+R LN+ + D P+
Sbjct: 224 ESQIQDMLNQGIIRTSNSP-YNSPIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDRHPI 282
Query: 698 PHIDMLVDNTAQPKVFSFMDGFSGYNQIKMSPEDREKTSFITPWGTFCYKVMPFGLINAG 757
P++D ++ + F+ +D G++QI+M PE KT+F T G + Y MPFGL NA
Sbjct: 283 PNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKNAP 342
Query: 758 ATYQRGMTTLFHDMIHKEVEVYVDDMIVKSTDEEQHVEYLTKMFERLRKYKLRLNPNKCT 817
AT+QR M + +++K VY+DD+IV ST ++H++ L +FE+L K L+L +KC
Sbjct: 343 ATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDKCE 402
Query: 818 FGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPRTEKQVRGFLGRLNYISRFISHMTAT 877
F + LG +++ GI+ +P+K+ AI++ P P K+++ FLG Y +FI +
Sbjct: 403 FLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTGYYRKFIPNFADI 462
Query: 878 CGPIFKLLRKNQPI-VWNDECQEAFDSIKNYLLKPPILVPPVEGRPLIMYLVVFDESMGC 936
P+ K L+KN I N E AF +K + + PIL P + + D ++G
Sbjct: 463 AKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLTTDASDVALGA 522
Query: 937 VLGQQDETGKKEHAIYYLSKKFTDCETRYTMLEKTCCALAWAAKRLRHYLVNHTTWLISR 996
VL Q H + Y+S+ + E Y+ +EK A+ WA K RHYL+ + S
Sbjct: 523 VLSQDG------HPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGRHFEISSD 576
Query: 997 MDPIKYIFEKAAVTGKNARWQMLLSEYDIVFKTQKAIKGSILADHLAYQPLDD 1049
P+ +++ K RW++ LSE+D K K K + +AD L+ L++
Sbjct: 577 HQPLSWLYRMKDPNSKLTRWRVKLSEFDFDIKYIKG-KENCVADALSRIKLEE 628
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 214 bits (544), Expect = 2e-54
Identities = 132/437 (30%), Positives = 223/437 (50%), Gaps = 17/437 (3%)
Query: 614 VEHRIPTKPECPPVRQK--LRRTHPDMALKIKSEVQKQIDAGFLMT----VEYPEWVANI 667
++H + T P ++ L +TH ++++++VQ+ ++ G + P WV
Sbjct: 195 IKHVLNTTHNSPIYSKQYPLAQTHE---IEVENQVQEMLNQGLIRESNSPYNSPTWVVPK 251
Query: 668 VPVPKKDGKVRMCVDFRDLNKASPKDNFPLPHIDMLVDNTAQPKVFSFMDGFSGYNQIKM 727
P K R+ +D+R LN+ + D +P+P++D ++ + + F+ +D G++QI+M
Sbjct: 252 KPDASGANKYRVVIDYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEM 311
Query: 728 SPEDREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKS 787
E KT+F T G + Y MPFGL NA AT+QR M + +++K VY+DD+I+ S
Sbjct: 312 DEESISKTAFSTKSGHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFS 371
Query: 788 TDEEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIRE 847
T +H+ + +F +L L+L +KC F + LG IV+ GI+ +P KV+AI
Sbjct: 372 TSLTEHLNSIQLVFTKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVS 431
Query: 848 MPAPRTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPI-VWNDECQEAFDSIKN 906
P P +K++R FLG Y +FI + P+ L+K I E EAF+ +K
Sbjct: 432 YPIPTKDKEIRAFLGLTGYYRKFIPNYADIAKPMTSCLKKRTKIDTQKLEYIEAFEKLKA 491
Query: 907 YLLKPPILVPPVEGRPLIMYLVVFDESMGCVLGQQDETGKKEHAIYYLSKKFTDCETRYT 966
+++ PIL P + ++ + ++G VL Q H I ++S+ D E Y+
Sbjct: 492 LIIRDPILQLPDFEKKFVLTTDASNLALGAVLSQNG------HPISFISRTLNDHELNYS 545
Query: 967 MLEKTCCALAWAAKRLRHYLVNHTTWLISRMDPIKYIFEKAAVTGKNARWQMLLSEYDIV 1026
+EK A+ WA K RHYL+ + S P++++ K RW++ LSEY
Sbjct: 546 AIEKELLAIVWATKTFRHYLLGRQFLIASDHQPLRWLHNLKEPGAKLERWRVRLSEYQFK 605
Query: 1027 FKTQKAIKGSILADHLA 1043
K + S+ AD L+
Sbjct: 606 IDYIKGKENSV-ADALS 621
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 207 bits (526), Expect = 2e-52
Identities = 139/465 (29%), Positives = 230/465 (48%), Gaps = 13/465 (2%)
Query: 585 KKKIIQLLREYPDIFAWSYEDMPGLDPKIVEHRIPTKPECPPVRQKLRRTHPDMALKIKS 644
K ++ + EY DIFA E P + + ++ K + P + R H + +I++
Sbjct: 276 KSQLENICSEYIDIFA--LESEPITVNNLYKQQLRLKDDEPVYTKNYRSPHSQVE-EIQA 332
Query: 645 EVQKQIDAGFLMTVEYPEWVANIVPVPKKDG------KVRMCVDFRDLNKASPKDNFPLP 698
+VQK I ++ ++ + ++ VPKK K R+ +D+R +NK D FPLP
Sbjct: 333 QVQKLIKDK-IVEPSVSQYNSPLLLVPKKSSPNSDKKKWRLVIDYRQINKKLLADKFPLP 391
Query: 699 HIDMLVDNTAQPKVFSFMDGFSGYNQIKMSPEDREKTSFITPWGTFCYKVMPFGLINAGA 758
ID ++D + K FS +D SG++QI++ R+ TSF T G++ + +PFGL A
Sbjct: 392 RIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTSNGSYRFTRLPFGLKIAPN 451
Query: 759 TYQRGMTTLFHDMIHKEVEVYVDDMIVKSTDEEQHVEYLTKMFERLRKYKLRLNPNKCTF 818
++QR MT F + + +Y+DD+IV E+ ++ LT++F + R+Y L+L+P KC+F
Sbjct: 452 SFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEVFGKCREYNLKLHPEKCSF 511
Query: 819 GVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPRTEKQVRGFLGRLNYISRFISHMTATC 878
+ LG + KGI D K I+ P P R F+ NY RFI +
Sbjct: 512 FMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRFVAFCNYYRRFIKNFADYS 571
Query: 879 GPIFKLLRKNQPIVWNDECQEAFDSIKNYLLKPPILVPPVEGRPLIMYLVVFDESMGCVL 938
I +L +KN P W DECQ+AF +K+ L+ P +L P + + ++ G VL
Sbjct: 572 RHITRLCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSKEFCITTDASKQACGAVL 631
Query: 939 GQQDETGKKEHAIYYLSKKFTDCETRYTMLEKTCCALAWAAKRLRHYLVNHTTWLISRMD 998
Q + + Y S+ FT E+ + E+ A+ WA R Y+ + +
Sbjct: 632 TQNH--NGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYIYGKHFTVKTDHR 689
Query: 999 PIKYIFEKAAVTGKNARWQMLLSEYDIVFKTQKAIKGSILADHLA 1043
P+ Y+F + K R ++ L EY+ + K K + +AD L+
Sbjct: 690 PLTYLFSMVNPSSKLTRIRLELEEYNFTVEYLKG-KDNHVADALS 733
Score = 40.4 bits (93), Expect = 0.034
Identities = 22/70 (31%), Positives = 33/70 (46%), Gaps = 4/70 (5%)
Query: 1295 RNYDMVLLRCV----DEHEAEQLMHDVHDGTFGTHATGHTMSRKLLRAGYYWMTMEHDCY 1350
+N + LL V +E E E ++ +HD TG T + ++ YYW M
Sbjct: 874 KNLKVALLNPVTQINNEKEKEAILSTLHDDPIQGGHTGITKTLAKVKRHYYWKNMSKYIK 933
Query: 1351 QHARKCHKCQ 1360
++ RKC KCQ
Sbjct: 934 EYVRKCQKCQ 943
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 204 bits (520), Expect = 1e-51
Identities = 127/469 (27%), Positives = 236/469 (50%), Gaps = 11/469 (2%)
Query: 581 EEGVKKKIIQLLREYPDIFAWSYEDMPGLDPKIVEHRIPTKPECPPVRQKLRRTHPDMAL 640
E G +KI ++ ++ D+FA S +++ E I K P+RQK R +
Sbjct: 899 ENGDDRKIWDVIEQFQDVFAISDDELGRNSG--TECVIELKEGAEPIRQKPRPIPLALKP 956
Query: 641 KIKSEVQKQIDAGFLMTVEYPEWVANIVPVPKKDGKVRMCVDFRDLNKASPKDNFPLPHI 700
+I+ +QK ++ + + P W + +V V KKDG +RMC+D+R +NK + PLP+I
Sbjct: 957 EIRKMIQKMLNQKVIRESKSP-WSSPVVLVKKKDGSIRMCIDYRKVNKVVKNNAHPLPNI 1015
Query: 701 DMLVDNTAQPKVFSFMDGFSGYNQIKMSPEDREKTSFITPWGTFCYKVMPFGLINAGATY 760
+ + + A K+++ D +G+ QI + + +E T+F F + V+PFGL+ + A +
Sbjct: 1016 EATLQSLAGKKLYTVFDMIAGFWQIPLDEKSKEITAFAIGSELFEWNVLPFGLVISPALF 1075
Query: 761 QRGMTTLFHDMIHKEVEVYVDDMIVKSTDEEQHVEYLTKMFERLRKYKLRLNPNKCTFGV 820
Q M + D++ VYVDD+++ S D EQH++ + + R+RK ++L +KC
Sbjct: 1076 QGTMEEIIGDLLGVCAFVYVDDLLIASKDMEQHLQDVKEALTRIRKSGMKLRASKCHIAK 1135
Query: 821 RSGKLLGFIVSQKGIEVDPDKVRAIREMPAPRTEKQVRGFLGRLNYISRFISHMTATCGP 880
+ + LG V+ G+E K +++ P K+++ FLG + Y +FI +
Sbjct: 1136 KEVEYLGHKVTLDGVETQEVKTDKMKQFSRPTNVKELQSFLGLVGYYRKFILNFAQIASS 1195
Query: 881 IFKLLRKNQPIVWNDECQEAFDSIKNYLLKPPILV-PPVEG-----RPLIMYLVVFDESM 934
+ L+ +W E + AF +K + + P+L P VE RP ++Y + +
Sbjct: 1196 LTSLISAKVAWIWEKEQEIAFQELKKLVCQTPVLAQPDVEAALKGDRPFMIYTDASRKGI 1255
Query: 935 GCVLGQQDETGKKEHAIYYLSKKFTDCETRYTMLEKTCCALAWAAKRLRHYLVNHTTWLI 994
G VL Q+ G ++H I + SK + ETRY + + A+ +A +R + + +
Sbjct: 1256 GAVLAQEGPDG-QQHPIAFASKALSPAETRYHITDLEALAMMFALRRFKTIIYGTAITVF 1314
Query: 995 SRMDPIKYIFEKAAVTGKNARWQMLLSEYDIVFKTQKAIKGSILADHLA 1043
+ P+ + + + + + RW + + E+D+ A K + +AD L+
Sbjct: 1315 TDHKPLISLLKGSPLADRLWRWSIEILEFDVKI-VYLAGKANAVADALS 1362
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 197 bits (502), Expect = 1e-49
Identities = 142/493 (28%), Positives = 234/493 (46%), Gaps = 34/493 (6%)
Query: 576 IGAALEEGVKKKIIQLLREYPDIFAWSYEDMPGLDPKIVEHRIPTKPECPPVRQKLRRTH 635
+ A +G ++ + LL E+P IF P L VE + + +++
Sbjct: 76 LAAEHPDGTQEILNSLLGEFPRIFE------PPLSGMSVETAVKAEIRTNTQDPIYAKSY 129
Query: 636 PDMALKIKSEVQKQIDA----GFLMTVEYPE----WVANIVPVPKKDGKVRMCVDFRDLN 687
P + ++ EV++QID G + P W+ P P + + RM VDF+ LN
Sbjct: 130 P-YPVNMRGEVERQIDELLQDGIIRPSNSPYNSPIWIVPKKPKPNGEKQYRMVVDFKRLN 188
Query: 688 KASPKDNFPLPHIDMLVDNTAQPKVFSFMDGFSGYNQIKMSPEDREKTSFITPWGTFCYK 747
+ D +P+P I+ + + K F+ +D SG++QI M D KT+F T G + +
Sbjct: 189 TVTIPDTYPIPDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFL 248
Query: 748 VMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSTDEEQHVEYLTKMFERLRKY 807
+PFGL NA A +QR + + + I K VY+DD+IV S D + H + L + L K
Sbjct: 249 RLPFGLKNAPAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKA 308
Query: 808 KLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPRTEKQVRGFLGRLNYI 867
L++N K F + LG+IV+ GI+ DP KVRAI EMP P + K+++ FLG +Y
Sbjct: 309 NLQVNLEKSHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYY 368
Query: 868 SRFISHMTATCGPIFKLLR-----------KNQPIVWNDECQEAFDSIKNYLLKPPILVP 916
+FI P+ L R PI ++ ++F+ +K+ L IL
Sbjct: 369 RKFIQDYAKVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAF 428
Query: 917 PVEGRPLIMYLVVFDESMGCVLGQQDETGKKEHAIYYLSKKFTDCETRYTMLEKTCCALA 976
P +P + + ++G VL Q D+ ++ I Y+S+ E Y +EK A+
Sbjct: 429 PCFTKPFHLTTDASNWAIGAVLSQDDQ--GRDRPIAYISRSLNKTEENYATIEKEMLAII 486
Query: 977 WAAKRLRHYLVN-HTTWLISRMDPIKYIFEKAAVTGKNARWQMLLSEY--DIVFKTQKAI 1033
W+ LR YL T + + P+ + K RW+ + EY ++++K K+
Sbjct: 487 WSLDNLRAYLYGAGTIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELIYKPGKS- 545
Query: 1034 KGSILADHLAYQP 1046
+++AD L+ P
Sbjct: 546 --NVVADALSRIP 556
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 186 bits (471), Expect = 5e-46
Identities = 113/407 (27%), Positives = 207/407 (50%), Gaps = 21/407 (5%)
Query: 670 VPKKDGKVRMCVDFRDLNKASPKDNFPLPHIDMLVDNTAQPKVFSFMDGFSGYNQIKMSP 729
VPKK+G +RM VD++ LNK + +PLP I+ L+ +F+ +D S Y+ I++
Sbjct: 455 VPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRK 514
Query: 730 EDREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSTD 789
D K +F P G F Y VMP+G+ A A +Q + T+ + V Y+DD+++ S
Sbjct: 515 GDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKS 574
Query: 790 EEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP 849
E +HV+++ + ++L+ L +N KC F K +G+ +S+KG + + + +
Sbjct: 575 ESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWK 634
Query: 850 APRTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQEAFDSIKNYLL 909
P+ K++R FLG +NY+ +FI + P+ LL+K+ W +A ++IK L+
Sbjct: 635 QPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLV 694
Query: 910 KPPILVPPVEGRPLIMYLVVFDESMGCVLGQQDETGKKEHAIYYLSKKFTDCETRYTMLE 969
PP+L + +++ D ++G VL Q+ + K + + Y S K + + Y++ +
Sbjct: 695 SPPVLRHFDFSKKILLETDASDVAVGAVLSQKHD-DDKYYPVGYYSAKMSKAQLNYSVSD 753
Query: 970 KTCCALAWAAKRLRHYLVNHTTWLISRMDPIKYIFEKAAVTG-----------KNARWQM 1018
K A+ + K RHYL S ++P K + + + G + ARWQ+
Sbjct: 754 KEMLAIIKSLKHWRHYLE-------STIEPFKILTDHRNLIGRITNESEPENKRLARWQL 806
Query: 1019 LLSEYDIVFKTQKAIKGSILADHLAYQPLDDYQPIEFDFPDEEIMYL 1065
L +++ + + +AD L+ + +D+ +PI D D I ++
Sbjct: 807 FLQDFNFEINYRPG-SANHIADALS-RIVDETEPIPKDSEDNSINFV 851
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 183 bits (465), Expect = 2e-45
Identities = 112/407 (27%), Positives = 208/407 (50%), Gaps = 21/407 (5%)
Query: 670 VPKKDGKVRMCVDFRDLNKASPKDNFPLPHIDMLVDNTAQPKVFSFMDGFSGYNQIKMSP 729
VPKK+G +RM VD++ LNK + +PLP I+ L+ +F+ +D S Y+ I++
Sbjct: 455 VPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRK 514
Query: 730 EDREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSTD 789
D K +F P G F Y VMP+G+ A A +Q + T+ ++ V Y+D++++ S
Sbjct: 515 GDEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKS 574
Query: 790 EEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP 849
E +HV+++ + ++L+ L +N KC F K +G+ +S+KG + + + +
Sbjct: 575 ESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWK 634
Query: 850 APRTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQEAFDSIKNYLL 909
P+ K++R FLG +NY+ +FI + P+ LL+K+ W +A ++IK L+
Sbjct: 635 QPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLV 694
Query: 910 KPPILVPPVEGRPLIMYLVVFDESMGCVLGQQDETGKKEHAIYYLSKKFTDCETRYTMLE 969
PP+L + +++ D ++G VL Q+ + K + + Y S K + + Y++ +
Sbjct: 695 SPPVLRHFDFSKKILLETDASDVAVGAVLSQKHD-DDKYYPVGYYSAKMSKAQLNYSVSD 753
Query: 970 KTCCALAWAAKRLRHYLVNHTTWLISRMDPIKYIFEKAAVTG-----------KNARWQM 1018
K A+ + K RHYL S ++P K + + + G + ARWQ+
Sbjct: 754 KEMLAIIKSLKHWRHYLE-------STIEPFKILTDHRNLIGRITNESEPENKRLARWQL 806
Query: 1019 LLSEYDIVFKTQKAIKGSILADHLAYQPLDDYQPIEFDFPDEEIMYL 1065
L +++ + + +AD L+ + +D+ +PI D D I ++
Sbjct: 807 FLQDFNFEINYRPG-SANHIADALS-RIVDETEPIPKDSEDNSINFV 851
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 183 bits (465), Expect = 2e-45
Identities = 112/407 (27%), Positives = 208/407 (50%), Gaps = 21/407 (5%)
Query: 670 VPKKDGKVRMCVDFRDLNKASPKDNFPLPHIDMLVDNTAQPKVFSFMDGFSGYNQIKMSP 729
VPKK+G +RM VD++ LNK + +PLP I+ L+ +F+ +D S Y+ I++
Sbjct: 455 VPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRK 514
Query: 730 EDREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSTD 789
D K +F P G F Y VMP+G+ A A +Q + T+ ++ V Y+D++++ S
Sbjct: 515 GDEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKS 574
Query: 790 EEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP 849
E +HV+++ + ++L+ L +N KC F K +G+ +S+KG + + + +
Sbjct: 575 ESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWK 634
Query: 850 APRTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQEAFDSIKNYLL 909
P+ K++R FLG +NY+ +FI + P+ LL+K+ W +A ++IK L+
Sbjct: 635 QPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLV 694
Query: 910 KPPILVPPVEGRPLIMYLVVFDESMGCVLGQQDETGKKEHAIYYLSKKFTDCETRYTMLE 969
PP+L + +++ D ++G VL Q+ + K + + Y S K + + Y++ +
Sbjct: 695 SPPVLRHFDFSKKILLETDASDVAVGAVLSQKHD-DDKYYPVGYYSAKMSKAQLNYSVSD 753
Query: 970 KTCCALAWAAKRLRHYLVNHTTWLISRMDPIKYIFEKAAVTG-----------KNARWQM 1018
K A+ + K RHYL S ++P K + + + G + ARWQ+
Sbjct: 754 KEMLAIIKSLKHWRHYLE-------STIEPFKILTDHRNLIGRITNESEPENKRLARWQL 806
Query: 1019 LLSEYDIVFKTQKAIKGSILADHLAYQPLDDYQPIEFDFPDEEIMYL 1065
L +++ + + +AD L+ + +D+ +PI D D I ++
Sbjct: 807 FLQDFNFEINYRPG-SANHIADALS-RIVDETEPIPKDSEDNSINFV 851
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 178 bits (452), Expect = 8e-44
Identities = 118/388 (30%), Positives = 195/388 (49%), Gaps = 22/388 (5%)
Query: 678 RMCVDFRDLNKASPKDNFPLPHIDMLVDNTAQPKVFSFMDGFSGYNQIKMSPEDREKTSF 737
R+ +DFR LN+ + D +P+P I M++ N + K F+ +D SGY+QI ++ DREKTSF
Sbjct: 238 RLVIDFRKLNEKTIPDRYPMPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSF 297
Query: 738 ITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSTDEEQHVEYL 797
G + + +PFGL NA + +QR + + + I K VYVDD+I+ S +E HV ++
Sbjct: 298 SVNGGKYEFCRLPFGLRNASSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHI 357
Query: 798 TKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPRTEKQV 857
+ + L +R++ K F S + LGFIVS+ G + DP+KV+AI+E P P +V
Sbjct: 358 DTVLKCLIDANMRVSQEKTRFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKV 417
Query: 858 RGFLGRLNYISRFISHMTATCGPIFKLLR-----------KNQPIVWNDECQEAFDSIKN 906
R FLG +Y FI A PI +L+ K P+ +N+ + AF ++N
Sbjct: 418 RSFLGLASYYRVFIKDFAAIARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRN 477
Query: 907 YLLKPPILVP-PVEGRPLIMYLVVFDESMGCVLGQQDETGKKEHAIYYLSKKFTDCETRY 965
L +++ P +P + +G VL Q+ I +S+ E Y
Sbjct: 478 ILASEDVILKYPDFKKPFDLTTDASASGIGAVLSQEG------RPITMISRTLKQPEQNY 531
Query: 966 TMLEKTCCALAWAAKRLRHYLV-NHTTWLISRMDPIKYIFEKAAVTGKNARWQMLLSEYD 1024
E+ A+ WA +L+++L + + + P+ + K RW+ + +++
Sbjct: 532 ATNERELLAIVWALGKLQNFLYGSREINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQHN 591
Query: 1025 I-VFKTQKAIKGSILADHLAYQPLDDYQ 1051
VF K K + +AD L+ Q L+ Q
Sbjct: 592 AKVF--YKPGKENFVADALSRQNLNALQ 617
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 129 bits (325), Expect = 4e-29
Identities = 131/509 (25%), Positives = 219/509 (42%), Gaps = 46/509 (9%)
Query: 546 LEQEKKAIQPHQEEIELINIGIEENKREIKIGAALEEGVK---KKII---QLLREYPDIF 599
LE KK + Q E +NI + + +K A L EG + +K+ Q +++ ++
Sbjct: 159 LESMKKRSKTQQPEP--VNISTNKIENPLKEIAILSEGRRLSEEKLFITQQRMQKIEELL 216
Query: 600 AWSYEDMPGLDPKIVEHRIPTKPECPPVRQKLRRTHPDMALKIK---------SEVQKQI 650
+ P LDP + + + ++ + P A+K+K E KQI
Sbjct: 217 EKVCSENP-LDPNKTKQWM---------KASIKLSDPSKAIKVKPMKYSPMDREEFDKQI 266
Query: 651 DAGFLMTVEYPEWVANIVPV-------PKKDGKVRMCVDFRDLNKASPKDNFPLPHIDML 703
+ V P ++ P K+ GK RM V+++ +NKA+ D + LP+ D L
Sbjct: 267 KELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATIGDAYNLPNKDEL 326
Query: 704 VDNTAQPKVFSFMDGFSGYNQIKMSPEDREKTSFITPWGTFCYKVMPFGLINAGATYQRG 763
+ K+FS D SG+ Q+ + E R T+F P G + + V+PFGL A + +QR
Sbjct: 327 LTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRH 386
Query: 764 MTTLFHDMIHKEVEVYVDDMIVKSTDEEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSG 823
M F + K VYVDD++V S +EE H+ ++ + ++ ++ + L+ K +
Sbjct: 387 MDEAFR-VFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKI 445
Query: 824 KLLGFIVSQKGIEVDPDKVRAIREMP-APRTEKQVRGFLGRLNYISRFISHMTATCGPIF 882
LG + + + + I + P +KQ++ FLG L Y S +I + P+
Sbjct: 446 NFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQ 505
Query: 883 KLLRKNQPIVWNDECQEAFDSIKNYLLKPPILVPPVEGRPLIMYLVVFDESMGCVLG--Q 940
L++N P W E +K L P L P+ LI+ D+ G +L +
Sbjct: 506 AKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIK 565
Query: 941 QDETGKKEHAIYYLSKKFTDCETRYTMLEKTCCALAWAAKRLRHYLVNHTTWLISRMDP- 999
+E E Y S F E Y +K A+ K+ YL + R D
Sbjct: 566 INEGTNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKFSIYLT--PVHFLIRTDNT 623
Query: 1000 -----IKYIFEKAAVTGKNARWQMLLSEY 1023
+ ++ + G+N RWQ LS Y
Sbjct: 624 HFKSFVNLNYKGDSKLGRNIRWQAWLSHY 652
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 129 bits (323), Expect = 7e-29
Identities = 112/422 (26%), Positives = 185/422 (43%), Gaps = 28/422 (6%)
Query: 627 VRQKLRRTHPDMALKIK---------SEVQKQIDAGFLMTVEYPEWVANIVPV------- 670
++ ++ + P A+K+K E KQI + V P ++ P
Sbjct: 234 MKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEA 293
Query: 671 PKKDGKVRMCVDFRDLNKASPKDNFPLPHIDMLVDNTAQPKVFSFMDGFSGYNQIKMSPE 730
K+ GK RM V+++ +NKA+ D + LP+ D L+ K+FS D SG+ Q+ + E
Sbjct: 294 EKRRGKKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQE 353
Query: 731 DREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSTDE 790
R T+F P G + + V+PFGL A + +QR M F + K VYVDD++V S +E
Sbjct: 354 SRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFR-VFRKFCCVYVDDILVFSNNE 412
Query: 791 EQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP- 849
E H+ ++ + ++ ++ + L+ K + LG + + + + I + P
Sbjct: 413 EDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPD 472
Query: 850 APRTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQEAFDSIKNYLL 909
+KQ++ FLG L Y S +I + P+ L++N P W E +K L
Sbjct: 473 TLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWRWTKEDTLYMQKVKKNLQ 532
Query: 910 KPPILVPPVEGRPLIMYLVVFDESMGCVLG--QQDETGKKEHAIYYLSKKFTDCETRYTM 967
P L P+ LI+ D+ G +L + +E E Y S F E Y
Sbjct: 533 GFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKNYHS 592
Query: 968 LEKTCCALAWAAKRLRHYLVNHTTWLISRMDP------IKYIFEKAAVTGKNARWQMLLS 1021
+K A+ K+ YL + R D + ++ + G+N RWQ LS
Sbjct: 593 NDKETLAVINTIKKFSIYLT--PVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLS 650
Query: 1022 EY 1023
Y
Sbjct: 651 HY 652
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 129 bits (323), Expect = 7e-29
Identities = 112/422 (26%), Positives = 185/422 (43%), Gaps = 28/422 (6%)
Query: 627 VRQKLRRTHPDMALKIK---------SEVQKQIDAGFLMTVEYPEWVANIVPV------- 670
++ ++ + P A+K+K E KQI + V P ++ P
Sbjct: 234 MKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEA 293
Query: 671 PKKDGKVRMCVDFRDLNKASPKDNFPLPHIDMLVDNTAQPKVFSFMDGFSGYNQIKMSPE 730
K+ GK RM V+++ +NKA+ D + LP+ D L+ K+FS D SG+ Q+ + E
Sbjct: 294 EKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQE 353
Query: 731 DREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSTDE 790
R T+F P G + + V+PFGL A + +QR M F + K VYVDD++V S +E
Sbjct: 354 SRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFR-VFRKFCCVYVDDILVFSNNE 412
Query: 791 EQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP- 849
E H+ ++ + ++ ++ + L+ K + LG + + + + I + P
Sbjct: 413 EDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPD 472
Query: 850 APRTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQEAFDSIKNYLL 909
+KQ++ FLG L Y S +I + P+ L++N P W E +K L
Sbjct: 473 TLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQ 532
Query: 910 KPPILVPPVEGRPLIMYLVVFDESMGCVLG--QQDETGKKEHAIYYLSKKFTDCETRYTM 967
P L P+ LI+ D+ G +L + +E E Y S F E Y
Sbjct: 533 GFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNYHS 592
Query: 968 LEKTCCALAWAAKRLRHYLVNHTTWLISRMDP------IKYIFEKAAVTGKNARWQMLLS 1021
+K A+ K+ YL + R D + ++ + G+N RWQ LS
Sbjct: 593 NDKETLAVINTIKKFSIYLT--PVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLS 650
Query: 1022 EY 1023
Y
Sbjct: 651 HY 652
>YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II
Length = 1268
Score = 126 bits (317), Expect = 4e-28
Identities = 87/286 (30%), Positives = 147/286 (50%), Gaps = 12/286 (4%)
Query: 589 IQLLREYPDIFAWSYEDMPGLDPKIVEHRIPTKPECPPVRQKLRRTHPDMALKIKSEVQK 648
+ L ++P++F +D GL K + T+ PV ++ R +++E+ +
Sbjct: 408 VMLKNDFPEVF----KDGLGLCTK-EKAEFRTEENAVPVFKRARPVPYGSLEAVETELNR 462
Query: 649 QIDAGFLMTVEYPEWVANIVPVPKKD-GKVRMCVDFR--DLNKASPKDNFPLPHIDMLVD 705
+ G ++ + Y +W A IV + KK GK+R+C DF+ LN A + PLP + +
Sbjct: 463 LQEMGVIVPITYAKWAAPIVVIKKKGTGKIRVCADFKCSGLNAALKDEFHPLPTSEDIFS 522
Query: 706 NTAQPKVFSFMDGFSGYNQIKMSPEDREKTSFITPWGTFCYKVMPFGLINAGATYQRGMT 765
+ V+S +D Y Q+++ E ++ T G F Y M FGL A A++Q+ M
Sbjct: 523 RL-KGTVYSQIDLKDAYLQVELDEEAQKLAVINTHRGIFKYLRMTFGLKPAPASFQKIMD 581
Query: 766 TLFHDMIHKEVEVYVDDMIVKSTDEEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKL 825
+ + V VY DD+I+ ++ E+H + L ++FER ++Y R++ KC F +
Sbjct: 582 KMVSGLTG--VAVYWDDIIISASSIEEHEKILRELFERFKEYGFRVSAEKCAFAQKQVTF 639
Query: 826 LGFIVSQKGIEVDPDKVRAIREMPAPRTEKQVRGFLGRLNYISRFI 871
LGF V + G D K AIR M AP +KQ+ FLG +++SR +
Sbjct: 640 LGF-VDEHGRRPDSKKTEAIRSMKAPTDQKQLASFLGAADWLSRMM 684
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 674
Score = 126 bits (317), Expect = 4e-28
Identities = 111/422 (26%), Positives = 184/422 (43%), Gaps = 28/422 (6%)
Query: 627 VRQKLRRTHPDMALKIK---------SEVQKQIDAGFLMTVEYPEWVANIVPV------- 670
++ ++ + P A+K+K E KQI + V P ++ P
Sbjct: 229 MKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEA 288
Query: 671 PKKDGKVRMCVDFRDLNKASPKDNFPLPHIDMLVDNTAQPKVFSFMDGFSGYNQIKMSPE 730
K+ GK RM V+++ +NKA+ D + P+ D L+ K+FS D SG+ Q+ + E
Sbjct: 289 EKRRGKKRMVVNYKAMNKATVGDAYNPPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQE 348
Query: 731 DREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSTDE 790
R T+F P G + + V+PFGL A + +QR M F + K VYVDD++V S +E
Sbjct: 349 SRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFR-VFRKFCCVYVDDILVFSNNE 407
Query: 791 EQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP- 849
E H+ ++ + ++ ++ + L+ K + LG + + + + I + P
Sbjct: 408 EDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPD 467
Query: 850 APRTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQEAFDSIKNYLL 909
+KQ++ FLG L Y S +I + P+ L++N P W E +K L
Sbjct: 468 TLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQ 527
Query: 910 KPPILVPPVEGRPLIMYLVVFDESMGCVLG--QQDETGKKEHAIYYLSKKFTDCETRYTM 967
P L P+ LI+ D+ G +L + +E E Y S F E Y
Sbjct: 528 GFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKNYHS 587
Query: 968 LEKTCCALAWAAKRLRHYLVNHTTWLISRMDP------IKYIFEKAAVTGKNARWQMLLS 1021
+K A+ K+ YL + R D + ++ + G+N RWQ LS
Sbjct: 588 NDKETLAVINTIKKFSIYLT--PVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLS 645
Query: 1022 EY 1023
Y
Sbjct: 646 HY 647
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 680
Score = 124 bits (312), Expect = 1e-27
Identities = 110/422 (26%), Positives = 183/422 (43%), Gaps = 28/422 (6%)
Query: 627 VRQKLRRTHPDMALKIK---------SEVQKQIDAGFLMTVEYPEWVANIVPV------- 670
++ ++ + P A+K+K E KQI + V P ++ P
Sbjct: 235 MKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEA 294
Query: 671 PKKDGKVRMCVDFRDLNKASPKDNFPLPHIDMLVDNTAQPKVFSFMDGFSGYNQIKMSPE 730
G RM V+++ +NKA+ D + LP+ D L+ K+FS D SG+ Q+ + E
Sbjct: 295 ENGRGNKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQE 354
Query: 731 DREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSTDE 790
R T+F P G + + V+PFGL A + +QR M F + K VYVDD++V S +E
Sbjct: 355 SRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFR-VFRKFCCVYVDDIVVFSNNE 413
Query: 791 EQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP- 849
E H+ ++ + ++ ++ + L+ K + LG + + + + I + P
Sbjct: 414 EDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPD 473
Query: 850 APRTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQEAFDSIKNYLL 909
+KQ++ FLG L Y S +I ++ P+ L++N P W E +K L
Sbjct: 474 TLEDKKQLQRFLGILTYASDYIPNLAQMRQPLQAKLKENVPWKWTKEDTLYMQKVKKNLQ 533
Query: 910 KPPILVPPVEGRPLIMYLVVFDESMGCVLG--QQDETGKKEHAIYYLSKKFTDCETRYTM 967
P L P+ LI+ D+ G +L + +E E Y S F E Y
Sbjct: 534 GFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYRSGSFKAAERNYHS 593
Query: 968 LEKTCCALAWAAKRLRHYLVNHTTWLISRMDP------IKYIFEKAAVTGKNARWQMLLS 1021
+K A+ K+ YL + R D + ++ + G+N RWQ LS
Sbjct: 594 NDKETLAVINTIKKFSIYLT--PVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLS 651
Query: 1022 EY 1023
Y
Sbjct: 652 HY 653
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 659
Score = 115 bits (289), Expect = 6e-25
Identities = 123/496 (24%), Positives = 207/496 (40%), Gaps = 28/496 (5%)
Query: 567 IEENKREIKIG--AALEEGVKKKIIQLLREYPDIFAWSYEDMPGLDPKIVEHRIPTKPEC 624
++E EI+I +A+EE +++ + + W + +DPK V ++
Sbjct: 179 VDEMLYEIQISKFSAIEEMLERVSSENPIDPEKSKQWMTATIELIDPKTVV-KVKPMSYS 237
Query: 625 PPVRQKLRRTHPDMA-LK-IKSEVQKQIDAGFLMTVEYPEWVANIVPVPKKDGKVRMCVD 682
P R++ R ++ LK IK + FL+ E ++ GK RM V+
Sbjct: 238 PSDREEFDRQIKELLELKVIKPSKSTHMSPAFLVENE----------AERRRGKKRMVVN 287
Query: 683 FRDLNKASPKDNFPLPHIDMLVDNTAQPKVFSFMDGFSGYNQIKMSPEDREKTSFITPWG 742
++ +NKA+ D LP+ D L+ K++S D SG Q+ + E + T+F P G
Sbjct: 288 YKAMNKATKGDAHNLPNKDELLTLVRGKKIYSSFDCKSGLWQVLLDKESQLLTAFTCPQG 347
Query: 743 TFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIV-KSTDEEQHVEYLTKMF 801
+ + V+PFGL A + + + + K VYVDD++V +T ++H ++ +
Sbjct: 348 HYQWNVVPFGLKQAPSIFPKTYANSHSNQYSKYCCVYVDDILVFSNTGRKEHYIHVLNIL 407
Query: 802 ERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP-APRTEKQVRGF 860
R K + L+ K LG + Q + I + P +KQ++ F
Sbjct: 408 RRCEKLGIILSKKKAQLFKEKINFLGLEIDQGTHCPQNHILEHIHKFPDRIEDKKQLQRF 467
Query: 861 LGRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQEAFDSIKNYLLKPPILVPPVEG 920
LG L Y S +I + + P+ L+++ WND + IK L P L P
Sbjct: 468 LGILTYASDYIPKLASIRKPLQSKLKEDSTWTWNDTDSQYMAKIKKNLKSFPKLYHPEPN 527
Query: 921 RPLIMYLVVFDESMGCVLGQQDETGKKEHAIYYLSKKFTDCETRYTMLEKTCCALAWAAK 980
L++ +E G +L + E+ Y S F E Y EK A+ K
Sbjct: 528 DKLVIETDASEEFWGGIL--KAIHNSHEYICRYASGSFKAAERNYHSNEKELLAVIRVIK 585
Query: 981 RLRHYLVNHTTWLISRMDP------IKYIFEKAAVTGKNARWQMLLSEYDIVFKTQKAIK 1034
+ YL + + R D + + G+ RWQM LS+YD + K
Sbjct: 586 KFSIYLT--PSRFLIRTDNKNFTHFVNINLKGDRKQGRLVRWQMWLSQYDFDVEHIAGTK 643
Query: 1035 GSILADHLAYQPLDDY 1050
++ AD L L +Y
Sbjct: 644 -NVFADFLQENTLTNY 658
>POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat
protein; Protease (EC 3.4.23.-); Reverse transcriptase
(EC 2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1675
Score = 115 bits (287), Expect = 1e-24
Identities = 84/316 (26%), Positives = 156/316 (48%), Gaps = 8/316 (2%)
Query: 676 KVRMCVDFRDLNKASPKDNFPLPHIDMLVDNTAQPKVFSFMDGFSGYNQIKMSPEDREKT 735
K R+ +++ LN D F +PH +++ + +FS D +G++ +K+ + ++ T
Sbjct: 1238 KPRIVYNYKRLNDNMHTDPFNIPHKISMINLIQKANIFSKFDLKAGFHHMKLKDDFKDWT 1297
Query: 736 SFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSTDEEQHVE 795
+F G + + V PFG+ NA +QR M F D+ K +Y+DD+++ S +E++H+E
Sbjct: 1298 TFTCSEGLYTWNVCPFGIANAPCAFQRFMQESFGDL--KFALLYIDDILIASNNEKEHIE 1355
Query: 796 YLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPR--T 853
+L F R+++ L+ K ++ + LG + + I + P V I++ + T
Sbjct: 1356 HLKIFFNRVKEVGCVLSKKKSKMFLKEVEYLGVEIKEGKISLQPHIVDKIKKFDKNKLNT 1415
Query: 854 EKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQEAFDSIKNYLLKPPI 913
K ++ +LG LNY +I ++ GP++K KN ++N E I+ + K
Sbjct: 1416 LKGLQAYLGLLNYARGYIKDLSKLVGPLYKKTGKNGQRIFNKEDWNIIFKIEREVSKIKP 1475
Query: 914 LVPPVEGRPLIMYLVVFDESMGCVLGQQDE--TGKKEHAIY-YLSKKFTDCETRYTMLEK 970
L P E +I+ +E G VL + + +GK I Y S F + +T +T L+
Sbjct: 1476 LERPKETDYIIIETDASEEGWGAVLVCKPDKYSGKDTEKIAGYASGNFGEKKT-WTSLDY 1534
Query: 971 TCCALAWAAKRLRHYL 986
A+ A + + YL
Sbjct: 1535 EIEAINEALNKFQIYL 1550
>POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)]
Length = 886
Score = 113 bits (283), Expect = 3e-24
Identities = 82/306 (26%), Positives = 152/306 (48%), Gaps = 8/306 (2%)
Query: 667 IVPVPKKDGKVRMCVDFRDLNKASPKDNFPLPHIDMLVDNTAQPKVFSFMDGFSGYNQIK 726
+ PVPK DG+ RM +D+R++NK P H ++ + K + +D +G+
Sbjct: 5 VYPVPKPDGRWRMVLDYREVNKTIPLTAAQNQHSAGILATIVRQKYKTTLDLANGFWAHP 64
Query: 727 MSPEDREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVK 786
++PE T+F +C+ +P G +N+ A + + L ++ V+VYVDD+ +
Sbjct: 65 ITPESYWLTAFTWQGKQYCWTRLPQGFLNSPALFTADVVDLLKEI--PNVQVYVDDIYLS 122
Query: 787 STDEEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIR 846
D ++HV+ L K+F+ L + ++ K G ++ + LGF ++++G + +
Sbjct: 123 HDDPKEHVQQLEKVFQILLQAGYVVSLKKSEIGQKTVEFLGFNITKEGRGLTDTFKTKLL 182
Query: 847 EMPAPRTEKQVRGFLGRLNYISRFISHMTATCGPIFKLL--RKNQPIVWNDECQEAFDSI 904
+ P+ KQ++ LG LN+ FI + P++ L+ K + I W++E + + +
Sbjct: 183 NITPPKDLKQLQSILGLLNFARNFIPNFAELVQPLYNLIASAKGKYIEWSEENTKQLNMV 242
Query: 905 KNYLLKPPILVPPVEGRPLIMYLVVFDESMGCVLGQQDETGKKEHAIYYLSKKFTDCETR 964
L L + + L++ V S G V +ETGKK I YL+ F+ E +
Sbjct: 243 IEALNTASNLEERLPEQRLVI-KVNTSPSAGYV-RYYNETGKK--PIMYLNYVFSKAELK 298
Query: 965 YTMLEK 970
++MLEK
Sbjct: 299 FSMLEK 304
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 666
Score = 107 bits (267), Expect = 2e-22
Identities = 101/377 (26%), Positives = 162/377 (42%), Gaps = 10/377 (2%)
Query: 672 KKDGKVRMCVDFRDLNKASPKDNFPLPHIDMLVDNTAQPKVFSFMDGFSGYNQIKMSPED 731
++ GK RM V+++ +N+A+ D+ LP++ L+ +FS D SG+ Q+ + E
Sbjct: 288 RRRGKKRMVVNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKSGFWQVVLDEES 347
Query: 732 REKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSTDEE 791
++ T+F P G F +KV+PFGL A + +QR M T + K VYVDD+IV S E
Sbjct: 348 QKLTAFTCPQGHFQWKVVPFGLKQAPSIFQRHMQTALNG-ADKFCMVYVDDIIVFSNSEL 406
Query: 792 QHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKV-RAIREMP- 849
H ++ + + + KY + L+ K LG + KG + + I + P
Sbjct: 407 DHYNHVYAVLKIVEKYGIILSKKKANLFKEKINFLGLEI-DKGTHCPQNHILENIHKFPD 465
Query: 850 APRTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQEAFDSIKNYLL 909
+K ++ FLG L Y +I + P+ L+K+ W + IK L
Sbjct: 466 RLEDKKHLQRFLGVLTYAETYIPKLAEIRKPLQVKLKKDVTWNWTQSDSDYVKKIKKNLG 525
Query: 910 KPPILVPPVEGRPLIMYLVVFDESMGCVLGQQDETGKKEHAIYYLSKKFTDCETRYTMLE 969
P L P LI+ D G VL + G E Y S F E Y +
Sbjct: 526 SFPKLYLPKPEDHLIIETDASDSFWGGVLKARALDG-VELICRYSSGSFKQAEKNYHSND 584
Query: 970 KTCCALAWAAKRLRHYLVNHTTWLISRMDPIKYI----FEKAAVTGKNARWQMLLSEYDI 1025
K A+ + YL + + Y + + G+ RWQ S+Y
Sbjct: 585 KELLAVKQVITKFSAYLTPVRFTVRTDNKNFTYFLRINLKGDSKQGRLVRWQNWFSKYQF 644
Query: 1026 VFKTQKAIKGSILADHL 1042
+ + +K ++LAD L
Sbjct: 645 DVEHLEGVK-NVLADCL 660
>POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC
3.4.23.-); Reverse transcriptase/ribonuclease H (EC
2.7.7.49) (EC 3.1.26.4) (RT); Integrase (IN)]
Length = 1165
Score = 107 bits (266), Expect = 3e-22
Identities = 95/392 (24%), Positives = 166/392 (42%), Gaps = 48/392 (12%)
Query: 623 ECPPVRQKLRRTHPDMALK-----------IKSEVQKQIDAGFLMTVEYPEWVANIVPVP 671
+ PPV +LR +A++ I+ +QK +D G L+ P W ++PV
Sbjct: 161 QVPPVVVELRSGASPVAVRQYPMSKEAREGIRPHIQKFLDLGVLVPCRSP-WNTPLLPVK 219
Query: 672 KKD-GKVRMCVDFRDLNKASPKDNFPLPHIDMLVDNTAQPKV-FSFMDGFSGYNQIKMSP 729
K R D R++NK + +P+ L+ + +S +D + +++ P
Sbjct: 220 KPGTNDYRPVQDLREINKRVQDIHPTVPNPYNLLSSLPPSYTWYSVLDLKDAFFCLRLHP 279
Query: 730 EDREKTSFITPW--------GTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEV--- 778
+ +F W G + +P G N+ TLF + +H+++
Sbjct: 280 NSQPLFAF--EWKDPEKGNTGQLTWTRLPQGFKNS--------PTLFDEALHRDLAPFRA 329
Query: 779 ---------YVDDMIVKSTDEEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFI 829
YVDD++V + E + K+ + L K R++ K R LG++
Sbjct: 330 LNPQVVLLQYVDDLLVAAPTYEDCKKGTQKLLQELSKLGYRVSAKKAQLCQREVTYLGYL 389
Query: 830 VSQKGIEVDPDKVRAIREMPAPRTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQ 889
+ + + P + + ++P P T +QVR FLG + +I + P++ L +++
Sbjct: 390 LKEGKRWLTPARKATVMKIPVPTTPRQVREFLGTAGFCRLWIPGFASLAAPLYPLTKESI 449
Query: 890 PIVWNDECQEAFDSIKNYLLKPPILVPPVEGRPLIMYLVVFDESMGCVLGQQDET-GKKE 948
P +W +E Q+AFD IK LL P L P +P +Y+ DE G G +T G
Sbjct: 450 PFIWTEEHQQAFDHIKKALLSAPALALPDLTKPFTLYI---DERAGVARGVLTQTLGPWR 506
Query: 949 HAIYYLSKKFTDCETRYTMLEKTCCALAWAAK 980
+ YLSKK + + K A+A K
Sbjct: 507 RPVAYLSKKLDPVASGWPTCLKAVAAVALLLK 538
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda     K      H
   0.323    0.140    0.427
Gapped
Lambda     K      H
   0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 167,224,704
Number of Sequences: 164201
Number of extensions: 7496085
Number of successful extensions: 17770
Number of sequences better than 10.0: 115
Number of HSP's better than 10.0 without gapping: 40
Number of HSP's successfully gapped in prelim test: 75
Number of HSP's that attempted gapping in prelim test: 17625
Number of HSP's gapped (non-prelim): 163
length of query: 1391
length of database: 59,974,054
effective HSP length: 123
effective length of query: 1268
effective length of database: 39,777,331
effective search space: 50437655708
effective search space used: 50437655708
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 72 (32.3 bits)
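
The figures in the statistics block above can be cross-checked with a few lines of arithmetic. The sketch below is not part of the BLAST output. It assumes the usual Karlin-Altschul relations: bit score S' = (Lambda*S - ln K) / ln 2 and E = (effective search space) * 2^(-S'), with the effective lengths obtained by subtracting the effective HSP length from the query and from every database sequence. The function names are hypothetical, and because Lambda and K are printed in rounded form, the results only approximate the reported values.

```python
import math

# Values copied from the statistics block of this report.
QUERY_LEN = 1391
DB_LETTERS = 59_974_054
DB_SEQS = 164_201
HSP_LEN = 123                             # "effective HSP length"
UNGAPPED_LAMBDA = 0.323
GAPPED_LAMBDA, GAPPED_K = 0.267, 0.0410   # as printed, i.e. rounded


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, lam, k, search_space):
    """Expected number of chance hits scoring at least raw_score."""
    return search_space * 2.0 ** (-bit_score(raw_score, lam, k))


def dropoff_bits(raw_dropoff, lam):
    """Convert an X-dropoff threshold from raw score units to bits."""
    return raw_dropoff * lam / math.log(2)


eff_q, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN
)
print("effective query length:", eff_q)
print("effective database length:", eff_db)
print("effective search space:", space)

# Raw scores of the POL_FOAMV, POL_FMVD and POL_GALV hits shown above.
for raw in (283, 267, 266):
    bits = bit_score(raw, GAPPED_LAMBDA, GAPPED_K)
    e_val = expect_value(raw, GAPPED_LAMBDA, GAPPED_K, space)
    print(f"raw {raw}: {bits:.1f} bits, E = {e_val:.1e}")

# X1 applies to ungapped extension, X2 and X3 to gapped extension.
print("X1:", round(dropoff_bits(16, UNGAPPED_LAMBDA), 1), "bits")
print("X2:", round(dropoff_bits(38, GAPPED_LAMBDA), 1), "bits")
print("X3:", round(dropoff_bits(64, GAPPED_LAMBDA), 1), "bits")
```

The three effective-length figures reproduce the report exactly (1268, 39,777,331 and 50437655708). The bit scores and E-values agree with the reported ones up to the rounding of Lambda and K.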