
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC147000.7 - phase: 0
(1185 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
POLX_TOBAC (P10978) Retrovirus-related Pol polyprotein from tran... 688 0.0
COPI_DROME (P04146) Copia protein (Gag-int-pol protein) [Contain... 334 1e-90
YCH4_YEAST (P25600) Transposon Ty5-1 34.5 kDa hypothetical protein 171 1e-41
M810_ARATH (P92519) Hypothetical mitochondrial protein AtMg00810... 142 4e-33
YCB9_YEAST (P25384) Transposon Ty2 protein B (Ty1-17 protein B) 132 4e-30
YJL3_YEAST (P47024) Transposon Ty4 207.7 kDa hypothetical protein 130 2e-29
YMT5_YEAST (Q04214) Transposon Ty1 protein B 128 8e-29
YMU0_YEAST (Q04670) Transposon Ty1 protein B 128 1e-28
YJZ9_YEAST (P47100) Transposon Ty1 protein B 128 1e-28
YME4_YEAST (Q04711) Transposon Ty1 protein B 125 5e-28
YMD9_YEAST (Q03434) Transposon Ty1 protein B 125 7e-28
YJZ7_YEAST (P47098) Transposon Ty1 protein B 124 2e-27
M820_ARATH (P92520) Hypothetical mitochondrial protein AtMg00820... 100 4e-20
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 61 2e-08
POL_MLVFF (P26809) Pol polyprotein [Contains: Protease (EC 3.4.2... 61 2e-08
POL_MLVMO (P03355) Pol polyprotein [Contains: Protease (EC 3.4.2... 59 6e-08
POL3_MOUSE (P11367) Retrovirus-related Pol polyprotein (Endonucl... 59 6e-08
POL_MLVFP (P26808) Pol polyprotein [Contains: Protease (EC 3.4.2... 59 8e-08
POL_MLVF5 (P26810) Pol polyprotein [Contains: Protease (EC 3.4.2... 59 8e-08
POL_FENV1 (P31792) Pol polyprotein [Contains: Reverse transcript... 58 1e-07
>POLX_TOBAC (P10978) Retrovirus-related Pol polyprotein from
transposon TNT 1-94 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1328
Score = 688 bits (1775), Expect = 0.0
Identities = 404/1217 (33%), Positives = 649/1217 (53%), Gaps = 89/1217 (7%)
Query: 25 GMALPEQFQIAVIIDKLPPAWKDFKSLLRHKTKEFSLESLITRLRIEEEARKQEQNEEVF 84
G+ + E+ + ++++ LP ++ + + + H L+ + + L + E+ RK+ +N+
Sbjct: 136 GVKIEEEDKAILLLNSLPSSYDNLATTILHGKTTIELKDVTSALLLNEKMRKKPENQGQA 195
Query: 85 VVSNNNTKKKFVGAVLKPAGKPFKNQNRPMNKNSNRNKTGNNSRPQIQQPPKNDAAPPFN 144
+++ G+ ++ + ++ R K+ N S+ +++ N
Sbjct: 196 LITEGR-------------GRSYQRSSNNYGRSGARGKSKNRSKSRVR-----------N 231
Query: 145 CYNCGQADHMARKCRNRTNRPAQA-------HMATDAAPDEPYVAMITE----INMIAGS 193
CYNC Q H R C N + + A ++ V I E +++
Sbjct: 232 CYNCNQPGHFKRDCPNPRKGKGETSGQKNDDNTAAMVQNNDNVVLFINEEEECMHLSGPE 291
Query: 194 DGWWVDTGASRHVCYDRDMFKTYTACDDQKVLLGDSHSTDVVGIGDIELKFTSEKTLILK 253
W VDT AS H RD+F Y A D V +G++ + + GIGDI +K TL+LK
Sbjct: 292 SEWVVDTAASHHATPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLK 351
Query: 254 DVLHTPKIRKNLVSGFLLNKAGFTQSIGADLYTITKNGIFVGKGYATDGMFKLNIDMNKI 313
DV H P +R NL+SG L++ G+ + +TK + + KG A +++ N ++ +
Sbjct: 352 DVRHVPDLRMNLISGIALDRDGYESYFANQKWRLTKGSLVIAKGVARGTLYRTNAEICQG 411
Query: 314 S-SSAYMLCDFNIWHSRLCHVNKRIISNMSGLGLIPKISLNDFEKCQFCSQAKINKESHK 372
++A ++WH R+ H++++ + ++ LI + C +C K ++ S +
Sbjct: 412 ELNAAQDEISVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQ 471
Query: 373 -SVTRITEPFELIHSDLCELDGNLTRNGKRYFITFIDDCSDYTHVYLMRNKNEALDIFKQ 431
S R +L++SD+C + G +YF+TFIDD S VY+++ K++ +F++
Sbjct: 472 TSSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQK 531
Query: 432 YVKEIENQFNIRIKRFRSDRGTEYGSHIFNEYYKELGIIHETTAPYSPEMNGKAERKNRT 491
+ +E + ++KR RSD G EY S F EY GI HE T P +P+ NG AER NRT
Sbjct: 532 FHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRT 591
Query: 492 FTELVVATMLNSGAAPHWWGEILLTVCYVLNRVPKTKNKIS-PYEILKKRQPNLSYFRTW 550
E V + + + +WGE + T CY++NR P P + ++ + S+ + +
Sbjct: 592 IVEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVF 651
Query: 551 GCLAYVRKPDPKRVKLASRAYECAFIGYALNSKAYRFYDLKSKTIIESNDVDFYENKFP- 609
GC A+ P +R KL ++ C FIGY YR +D K +I S DV F E++
Sbjct: 652 GCRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRESEVRT 711
Query: 610 ---------------FKSGDSGGNSGGTDNSVLD-------QPSEIITSNENIERDVIEP 647
F + S N+ + S D QP E+I E ++ V E
Sbjct: 712 AADMSEKVKNGIIPNFVTIPSTSNNPTSAESTTDEVSEQGEQPGEVIEQGEQLDEGVEEV 771
Query: 648 G-------------RGKRARIAKEYGP--EYVAYTIEEDPSSIKEALSSIDADLWQEAIN 692
R +R R+ P EYV + + +P S+KE LS + + +A+
Sbjct: 772 EHPTQGEEQHQPLRRSERPRVESRRYPSTEYVLISDDREPESLKEVLSHPEKNQLMKAMQ 831
Query: 693 DEMDSLMSNETWHLTDLPPGCKTIGCKWILKKKLKPDGSIDKYKARLVAKGFRQRENVDF 752
+EM+SL N T+ L +LP G + + CKW+ K K D + +YKARLV KGF Q++ +DF
Sbjct: 832 EEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGIDF 891
Query: 753 FDTYSPVTRITSIRVLISLAAIHNLIVHQMDVKTAFLNGELEEEIYMDQPEGFVIHGQEN 812
+ +SPV ++TSIR ++SLAA +L V Q+DVKTAFL+G+LEEEIYM+QPEGF + G+++
Sbjct: 892 DEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGKKH 951
Query: 813 KVCKLDKSLYGLKQAPKQWHEKFDNLMIENEFKVNESDKCIYSK-YENNTCTIICLYVDD 871
VCKL+KSLYGLKQAP+QW+ KFD+ M + SD C+Y K + N I+ LYVDD
Sbjct: 952 MVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYVDD 1011
Query: 872 LLIFGSNLNAIKDVKSLLCHNFDMKDLGKADVILGIKIT--RTDNGISLNQSHYVEKILR 929
+LI G + I +K L +FDMKDLG A ILG+KI RT + L+Q Y+E++L
Sbjct: 1012 MLIVGKDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERVLE 1071
Query: 930 KYNYFYCKPASTPCDPSVKLFK-------NTGDSVRQTEYASIIGSLRYATDCTRPDISY 982
++N KP STP +KL K ++ + Y+S +GSL YA CTRPDI++
Sbjct: 1072 RFNMKNAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRPDIAH 1131
Query: 983 AVGLLCKFTSRPSMEHWQAIERVMRYLKKTMTLGLHYQRYPAVLEGYSDADWNNLSDDSK 1042
AVG++ +F P EHW+A++ ++RYL+ T L + +L+GY+DAD D+ K
Sbjct: 1132 AVGVVSRFLENPGKEHWEAVKWILRYLRGTTGDCLCFGGSDPILKGYTDADMAGDIDNRK 1191
Query: 1043 ATSGYIFSIAGGAVSWKSKKQTILAQSTMESEMIALAAASEEASWLRCLLSEIPLWERPL 1102
+++GY+F+ +GGA+SW+SK Q +A ST E+E IA +E WL+ L E+ L ++
Sbjct: 1192 SSTGYLFTFSGGAISWQSKLQKCVALSTTEAEYIAATETGKEMIWLKRFLQELGLHQK-- 1249
Query: 1103 PAVLIHCDSTAAIAKIENRYYNGKRRQIRRKHSTIREYLSNGTVRVDFVRTNENLADPLT 1162
+++CDS +AI +N Y+ + + I ++ IRE + + +++V + TNEN AD LT
Sbjct: 1250 -EYVVYCDSQSAIDLSKNSMYHARTKHIDVRYHWIREMVDDESLKVLKISTNENPADMLT 1308
Query: 1163 KGLNREKVANTSSRMGL 1179
K + R K +G+
Sbjct: 1309 KVVPRNKFELCKELVGM 1325
>COPI_DROME (P04146) Copia protein (Gag-int-pol protein) [Contains:
Copia VLP protein; Copia protease (EC 3.4.23.-)]
Length = 1409
Score = 334 bits (856), Expect = 1e-90
Identities = 195/527 (37%), Positives = 297/527 (56%), Gaps = 13/527 (2%)
Query: 665 AYTIEED-PSSIKEALSSIDADLWQEAINDEMDSLMSNETWHLTDLPPGCKTIGCKWILK 723
A+TI D P+S E D W+EAIN E+++ N TW +T P + +W+
Sbjct: 883 AHTIFNDVPNSFDEIQYRDDKSSWEEAINTELNAHKINNTWTITKRPENKNIVDSRWVFS 942
Query: 724 KKLKPDGSIDKYKARLVAKGFRQRENVDFFDTYSPVTRITSIRVLISLAAIHNLIVHQMD 783
K G+ +YKARLVA+GF Q+ +D+ +T++PV RI+S R ++SL +NL VHQMD
Sbjct: 943 VKYNELGNPIRYKARLVARGFTQKYQIDYEETFAPVARISSFRFILSLVIQYNLKVHQMD 1002
Query: 784 VKTAFLNGELEEEIYMDQPEGFVIHGQENKVCKLDKSLYGLKQAPKQWHEKFDNLMIENE 843
VKTAFLNG L+EEIYM P+G I + VCKL+K++YGLKQA + W E F+ + E E
Sbjct: 1003 VKTAFLNGTLKEEIYMRLPQG--ISCNSDNVCKLNKAIYGLKQAARCWFEVFEQALKECE 1060
Query: 844 FKVNESDKCIY--SKYENNTCTIICLYVDDLLIFGSNLNAIKDVKSLLCHNFDMKDLGKA 901
F + D+CIY K N + LYVDD++I ++ + + K L F M DL +
Sbjct: 1061 FVNSSVDRCIYILDKGNINENIYVLLYVDDVVIATGDMTRMNNFKRYLMEKFRMTDLNEI 1120
Query: 902 DVILGIKITRTDNGISLNQSHYVEKILRKYNYFYCKPASTPCDPSVKLFKNTGDSVRQTE 961
+GI+I ++ I L+QS YV+KIL K+N C STP + D T
Sbjct: 1121 KHFIGIRIEMQEDKIYLSQSAYVKKILSKFNMENCNAVSTPLPSKINYELLNSDEDCNTP 1180
Query: 962 YASIIGSLRYATDCTRPDISYAVGLLCKFTSRPSMEHWQAIERVMRYLKKTMTLGLHYQR 1021
S+IG L Y CTRPD++ AV +L +++S+ + E WQ ++RV+RYLK T+ + L +++
Sbjct: 1181 CRSLIGCLMYIMLCTRPDLTTAVNILSRYSSKNNSELWQNLKRVLRYLKGTIDMKLIFKK 1240
Query: 1022 ---YPAVLEGYSDADWNNLSDDSKATSGYIFSIAG-GAVSWKSKKQTILAQSTMESEMIA 1077
+ + GY D+DW D K+T+GY+F + + W +K+Q +A S+ E+E +A
Sbjct: 1241 NLAFENKIIGYVDSDWAGSEIDRKSTTGYLFKMFDFNLICWNTKRQNSVAASSTEAEYMA 1300
Query: 1078 LAAASEEASWLRCLLSEIPL-WERPLPAVLIHCDSTAAIAKIENRYYNGKRRQIRRKHST 1136
L A EA WL+ LL+ I + E P + I+ D+ I+ N + + + I K+
Sbjct: 1301 LFEAVREALWLKFLLTSINIKLENP---IKIYEDNQGCISIANNPSCHKRAKHIDIKYHF 1357
Query: 1137 IREYLSNGTVRVDFVRTNENLADPLTKGLNREKVANTSSRMGLMPID 1183
RE + N + ++++ T LAD TK L + ++GL+ D
Sbjct: 1358 AREQVQNNVICLEYIPTENQLADIFTKPLPAARFVELRDKLGLLQDD 1404
Score = 198 bits (504), Expect = 6e-50
Identities = 148/623 (23%), Positives = 285/623 (44%), Gaps = 51/623 (8%)
Query: 1 MSDDKSVEAQSHELQQIAHEIIAEGMALPEQFQIAVIIDKLPPAWKDFKSLLRHKTKEFS 60
+S + S+ + H ++ E++A G + E +I+ ++ LP + + + ++E
Sbjct: 108 LSSEMSLLSHFHIFDELISELLAAGAKIEEMDKISHLLITLPSCYDGIITAIETLSEENL 167
Query: 61 LESLITRLRIEEEARKQEQNEEVFVVSNNNTKKKFVGAVLKPAGKPFKNQNRPMNKNSNR 120
+ + +++E + + + N+T KK + A++ N N+ +
Sbjct: 168 TLAFVKNRLLDQEIKIKNDH--------NDTSKKVMNAIVHN------------NNNTYK 207
Query: 121 NKTGNNSRPQIQQPPKNDAAPPFNCYNCGQADHMARKC----RNRTNRPAQAHMATDAAP 176
N N + ++ K ++ C++CG+ H+ + C R N+ + A
Sbjct: 208 NNLFKNRVTKPKKIFKGNSKYKVKCHHCGREGHIKKDCFHYKRILNNKNKENEKQVQTAT 267
Query: 177 DEPYVAMITEINMIAGSD--GWWVDTGASRHVCYDRDMFKTYTACDDQKVLLGDSHSTDV 234
M+ E+N + D G+ +D+GAS H+ D ++ + +
Sbjct: 268 SHGIAFMVKEVNNTSVMDNCGFVLDSGASDHLINDESLYTDSVEVVPPLKIAVAKQGEFI 327
Query: 235 VGIGDIELKFTSEKTLILKDVLHTPKIRKNLVSGFLLNKAGFTQSIGADLYTITKNGIFV 294
++ ++ + L+DVL + NL+S L +AG + TI+KNG+ V
Sbjct: 328 YATKRGIVRLRNDHEITLEDVLFCKEAAGNLMSVKRLQEAGMSIEFDKSGVTISKNGLMV 387
Query: 295 GKGYATDGMFKLN----IDMNKISSSAYMLCDFNIWHSRLCHVN---------KRIISNM 341
K GM LN I+ S +A +F +WH R H++ K + S+
Sbjct: 388 VKN---SGM--LNNVPVINFQAYSINAKHKNNFRLWHERFGHISDGKLLEIKRKNMFSDQ 442
Query: 342 SGLGLIPKISLNDFEKCQFCSQAKINKESHKSVTRITEPFELIHSDLCELDGNLTRNGKR 401
S L + ++S E C QA++ + K T I P ++HSD+C +T + K
Sbjct: 443 SLLNNL-ELSCEICEPCLNGKQARLPFKQLKDKTHIKRPLFVVHSDVCGPITPVTLDDKN 501
Query: 402 YFITFIDDCSDYTHVYLMRNKNEALDIFKQYVKEIENQFNIRIKRFRSDRGTEYGSHIFN 461
YF+ F+D + Y YL++ K++ +F+ +V + E FN+++ D G EY S+
Sbjct: 502 YFVIFVDQFTHYCVTYLIKYKSDVFSMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMR 561
Query: 462 EYYKELGIIHETTAPYSPEMNGKAERKNRTFTELVVATMLNSGAAPHWWGEILLTVCYVL 521
++ + GI + T P++P++NG +ER RT TE + + +WGE +LT Y++
Sbjct: 562 QFCVKKGISYHLTVPHTPQLNGVSERMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLI 621
Query: 522 NRVPK---TKNKISPYEILKKRQPNLSYFRTWGCLAYVRKPDPKRVKLASRAYECAFIGY 578
NR+P + +PYE+ ++P L + R +G YV + K+ K ++++ F+GY
Sbjct: 622 NRIPSRALVDSSKTPYEMWHNKKPYLKHLRVFGATVYVHIKN-KQGKFDDKSFKSIFVGY 680
Query: 579 ALNSKAYRFYDLKSKTIIESNDV 601
N ++ +D ++ I + DV
Sbjct: 681 EPN--GFKLWDAVNEKFIVARDV 701
>YCH4_YEAST (P25600) Transposon Ty5-1 34.5 kDa hypothetical protein
Length = 308
Score = 171 bits (432), Expect = 1e-41
Identities = 109/299 (36%), Positives = 159/299 (52%), Gaps = 4/299 (1%)
Query: 782 MDVKTAFLNGELEEEIYMDQPEGFVIHGQENKVCKLDKSLYGLKQAPKQWHEKFDNLMIE 841
MDV TAFLN ++E IY+ QP GFV + V +L +YGLKQAP W+E +N + +
Sbjct: 1 MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60
Query: 842 NEFKVNESDKCIYSKYENNTCTIICLYVDDLLIFGSNLNAIKDVKSLLCHNFDMKDLGKA 901
F +E + +Y + ++ I +YVDDLL+ + VK L + MKDLGK
Sbjct: 61 IGFCRHEGEHGLYFRSTSDGPIYIGVYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGKV 120
Query: 902 DVILGIKITRTDNG-ISLNQSHYVEKILRKYNYFYCKPASTPCDPSVKLFKNTGDSVRQ- 959
D LG+ I ++ NG I+L+ Y+ K + K TP S LF+ T ++
Sbjct: 121 DKFLGLNIHQSTNGDITLSLQDYIAKAASESEINTFKLTQTPLCNSKPLFETTSPHLKDI 180
Query: 960 TEYASIIGSLRYATDCTRPDISYAVGLLCKFTSRPSMEHWQAIERVMRYLKKTMTLGLHY 1019
T Y SI+G L + + RPDISY V LL +F P H ++ RV+RYL T ++ L Y
Sbjct: 181 TPYQSIVGQLLFCANTGRPDISYPVSLLSRFLREPRAIHLESARRVLRYLYTTRSMCLKY 240
Query: 1020 QRYPAV-LEGYSDADWNNLSDDSKATSGYIFSIAGGAVSWKSKK-QTILAQSTMESEMI 1076
+ V L Y DA + D +T GY+ +AG V+W SKK + ++ + E+E I
Sbjct: 241 RSGSQVALTVYCDASHGAIHDLPHSTGGYVTLLAGAPVTWSSKKLKGVIPVPSTEAEYI 299
>M810_ARATH (P92519) Hypothetical mitochondrial protein AtMg00810
(ORF240b)
Length = 240
Score = 142 bits (359), Expect = 4e-33
Identities = 77/233 (33%), Positives = 131/233 (56%), Gaps = 2/233 (0%)
Query: 865 ICLYVDDLLIFGSNLNAIKDVKSLLCHNFDMKDLGKADVILGIKITRTDNGISLNQSHYV 924
+ LYVDD+L+ GS+ + + L F MKDLG LGI+I +G+ L+Q+ Y
Sbjct: 3 LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYA 62
Query: 925 EKILRKYNYFYCKPASTPCDPSVKLFKNTGDSVRQTEYASIIGSLRYATDCTRPDISYAV 984
E+IL CKP STP + +T +++ SI+G+L+Y T TRPDISYAV
Sbjct: 63 EQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYPDPSDFRSIVGALQYLT-LTRPDISYAV 121
Query: 985 GLLCKFTSRPSMEHWQAIERVMRYLKKTMTLGLHYQRYPAV-LEGYSDADWNNLSDDSKA 1043
++C+ P++ + ++RV+RY+K T+ GL+ + + ++ + D+DW + ++
Sbjct: 122 NIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRS 181
Query: 1044 TSGYIFSIAGGAVSWKSKKQTILAQSTMESEMIALAAASEEASWLRCLLSEIP 1096
T+G+ + +SW +K+Q +++S+ E+E ALA + E +W S P
Sbjct: 182 TTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTWSSASRSRDP 234
>YCB9_YEAST (P25384) Transposon Ty2 protein B (Ty1-17 protein B)
Length = 1770
Score = 132 bits (333), Expect = 4e-30
Identities = 147/580 (25%), Positives = 257/580 (43%), Gaps = 59/580 (10%)
Query: 623 DNSVLDQPSEIITSNENIERDVIEPGRGKR-----ARIAKEYGPEYVAYTIEEDPSSIKE 677
DN + S +N+N+ +EP R K+ A I + V T+ D +I
Sbjct: 1199 DNETEIEVSRDTWNNKNMRS--LEPPRSKKRINLIAAIKGVKSIKPVRTTLRYD-EAITY 1255
Query: 678 ALSSIDADLWQEAINDEMDSLMSNETWHLT------DLPPGCKTIGCKWILKKKLKPDGS 731
+ + D + EA + E+ L+ TW D+ P K I +I KK DG+
Sbjct: 1256 NKDNKEKDRYVEAYHKEISQLLKMNTWDTNKYYDRNDIDPK-KVINSMFIFNKKR--DGT 1312
Query: 732 IDKYKARLVAKGFRQRENVDFFDTYSPVTRITSIRVLISLAAIHNLIVHQMDVKTAFLNG 791
+KAR VA+G Q + D S ++ +S+A ++ + Q+D+ +A+L
Sbjct: 1313 ---HKARFVARGDIQHPDTYDSDMQSNTVHHYALMTSLSIALDNDYYITQLDISSAYLYA 1369
Query: 792 ELEEEIYMDQPEGFVIHGQENKVCKLDKSLYGLKQAPKQWHEKFDNLMIE-NEFKVNESD 850
+++EE+Y+ P G +K+ +L KSLYGLKQ+ W+E + +I + +
Sbjct: 1370 DIKEELYIRPPPHL---GLNDKLLRLRKSLYGLKQSGANWYETIKSYLINCCDMQEVRGW 1426
Query: 851 KCIYSKYENNTCTIICLYVDDLLIFGSNLNAIKDVKSLLCHNFDMK--DLGKADVILGIK 908
C++ N+ ICL+VDD+++F +LNA K + + L +D K +LG++D +
Sbjct: 1427 SCVF----KNSQVTICLFVDDMILFSKDLNANKKIITTLKKQYDTKIINLGESDNEIQYD 1482
Query: 909 ITRTDNGISLNQSHYVEKILRKYNYFYCKPASTPCDPSVKLFKNTGD----------SVR 958
I + I +S Y++ + K + P +P K + G +
Sbjct: 1483 ILGLE--IKYQRSKYMKLGMEKSLTEKLPKLNVPLNPKGKKLRAPGQPGHYIDQDELEID 1540
Query: 959 QTEY-------ASIIGSLRYATDCTRPDISYAVGLLCKFTSRPSMEHWQAIERVMRYLKK 1011
+ EY +IG Y R D+ Y + L + PS + +++++
Sbjct: 1541 EDEYKEKVHEMQKLIGLASYVGYKFRFDLLYYINTLAQHILFPSRQVLDMTYELIQFMWD 1600
Query: 1012 TMTLGLHYQRYPAV-----LEGYSDADWNNLSDDSKATSGYIFSIAGGAVSWKSKKQTIL 1066
T L + + L SDA + N K+ G IF + G + KS K ++
Sbjct: 1601 TRDKQLIWHKNKPTKPDNKLVAISDASYGN-QPYYKSQIGNIFLLNGKVIGGKSTKASLT 1659
Query: 1067 AQSTMESEMIALAAASEEASWLRCLLSEIPLWERP-LPAVLIHCDSTAAIAKIENRYYNG 1125
ST E+E+ A++ A + L L+ E L ++P + +L ST +I K N
Sbjct: 1660 CTSTTEAEIHAVSEAIPLLNNLSHLVQE--LNKKPIIKGLLTDSRSTISIIKSTNE-EKF 1716
Query: 1126 KRRQIRRKHSTIREYLSNGTVRVDFVRTNENLADPLTKGL 1165
+ R K +R+ +S + V ++ T +N+AD +TK L
Sbjct: 1717 RNRFFGTKAMRLRDEVSGNNLYVYYIETKKNIADVMTKPL 1756
Score = 99.0 bits (245), Expect = 7e-20
Identities = 108/442 (24%), Positives = 184/442 (41%), Gaps = 41/442 (9%)
Query: 198 VDTGASRHVCYDRDMFKTYTACDDQKVLLGDSHSTDVV--GIGDIELKFTSEKTLILKDV 255
+D+GAS+ + R + A + ++ + D+ D+ IG++ F + +K
Sbjct: 456 IDSGASQTLV--RSAHYLHHATPNSEINIVDAQKQDIPINAIGNLHFNFQNGTKTSIK-A 512
Query: 256 LHTPKIRKNLVSGFLLNKAGFT---------QSIGADLYTITKNGIF--VGKGYATDG-M 303
LHTP I +L+S L T +S G L I K+G F + K Y +
Sbjct: 513 LHTPNIAYDLLSLSELANQNITACFTRNTLERSDGTVLAPIVKHGDFYWLSKKYLIPSHI 572
Query: 304 FKLNIDMNKISSSAYMLCDFNIWHSRLCHVNKRIISNMSGLGLIPKISLNDFE------- 356
KL I+ N S + + + H L H N R I + + +D E
Sbjct: 573 SKLTIN-NVNKSKSVNKYPYPLIHRMLGHANFRSIQKSLKKNAVTYLKESDIEWSNASTY 631
Query: 357 KCQFCSQAKINKESHKSVTRIT-----EPFELIHSDLCELDGNLTRNGKRYFITFIDDCS 411
+C C K K H +R+ EPF+ +H+D+ +L ++ YFI+F D+ +
Sbjct: 632 QCPDCLIGKSTKHRHVKGSRLKYQESYEPFQYLHTDIFGPVHHLPKSAPSYFISFTDEKT 691
Query: 412 DYTHVYLMRNKNEA--LDIFKQYVKEIENQFNIRIKRFRSDRGTEYGSHIFNEYYKELGI 469
+ VY + ++ E L++F + I+NQFN R+ + DRG+EY + ++++ GI
Sbjct: 692 RFQWVYPLHDRREESILNVFTSILAFIKNQFNARVLVIQMDRGSEYTNKTLHKFFTNRGI 751
Query: 470 IHETTAPYSPEMNGKAERKNRTFTELVVATMLNSGAAPHWWGEILLTVCYVLNRVPKTKN 529
T +G AER NRT + SG H W + + N + KN
Sbjct: 752 TACYTTTADSRAHGVAERLNRTLLNDCRTLLHCSGLPNHLWFSAVEFSTIIRNSLVSPKN 811
Query: 530 KISPYEILKKRQPNLSYFRTWGCLAYVRKPDPKRVKLASRAYECAFIGYAL----NSKAY 585
S + +++ +G V +P S+ + GYAL NS Y
Sbjct: 812 DKSARQHAGLAGLDITTILPFGQPVIVNNHNPD-----SKIHPRGIPGYALHPSRNSYGY 866
Query: 586 RFYDLKSKTIIESNDVDFYENK 607
Y K +++ + ++K
Sbjct: 867 IIYLPSLKKTVDTTNYVILQDK 888
>YJL3_YEAST (P47024) Transposon Ty4 207.7 kDa hypothetical protein
Length = 1803
Score = 130 bits (327), Expect = 2e-29
Identities = 120/454 (26%), Positives = 204/454 (44%), Gaps = 41/454 (9%)
Query: 735 YKARLVAKGFRQRENVDFFDTYSPVTRIT----SIRVLISLAAIHNLIVHQMDVKTAFLN 790
YKAR+V +G Q DTYS +T + I++ + +A N+ + +D+ AFL
Sbjct: 1337 YKARIVCRGDTQSP-----DTYSVITTESLNHNHIKIFLMIANNRNMFMKTLDINHAFLY 1391
Query: 791 GELEEEIYMDQPEGFVIHGQENKVCKLDKSLYGLKQAPKQWHEKFDNLMIENEFKVNESD 850
+LEEEIY+ P V KL+K+LYGLKQ+PK+W++ + K N
Sbjct: 1392 AKLEEEIYIPHPH------DRRCVVKLNKALYGLKQSPKEWNDHLRQYLNGIGLKDNSYT 1445
Query: 851 KCIYSKYENNTCTIICLYVDDLLIFGSNLNAIKDVKSLLCHNFDMKDLGKA-DVILGIKI 909
+Y + N +I +YVDD +I SN + + + L NF++K G D +L I
Sbjct: 1446 PGLYQTEDKNL--MIAVYVDDCVIAASNEQRLDEFINKLKSNFELKITGTLIDDVLDTDI 1503
Query: 910 TRTD-------NGISLNQSHYVEKILRKYNYFYCK--PASTP------CDPSVKLFKNTG 954
D I L ++ ++ +KYN K +S P DP + + +
Sbjct: 1504 LGMDLVYNKRLGTIDLTLKSFINRMDKKYNEELKKIRKSSIPHMSTYKIDPKKDVLQMSE 1563
Query: 955 DSVRQ--TEYASIIGSLRYATDCTRPDISYAVGLLCKFTSRPSMEHWQAIERVMRYLKKT 1012
+ RQ + ++G L Y R DI +AV + + + P + I ++++YL +
Sbjct: 1564 EEFRQGVLKLQQLLGELNYVRHKCRYDIEFAVKKVARLVNYPHERVFYMIYKIIQYLVRY 1623
Query: 1013 MTLGLHYQR---YPAVLEGYSDADWNNLSDDSKATSGYIFSIAGGAVSWKSKKQTILAQS 1069
+G+HY R + +DA + D+++ G I + S K T S
Sbjct: 1624 KDIGIHYDRDCNKDKKVIAITDASVGS-EYDAQSRIGVILWYGMNIFNVYSNKSTNRCVS 1682
Query: 1070 TMESEMIALAAASEEASWLRCLLSEIPLWERPLPAVLIHCDSTAAIAKIENRYYNGKRRQ 1129
+ E+E+ A+ ++ L+ L E L E +++ DS AI + Y K +
Sbjct: 1683 STEAELHAIYEGYADSETLKVTLKE--LGEGDNNDIVMITDSKPAIQGLNRSYQQPKEKF 1740
Query: 1130 IRRKHSTIREYLSNGTVRVDFVRTNENLADPLTK 1163
K I+E + ++++ + N+AD LTK
Sbjct: 1741 TWIKTEIIKEKIKEKSIKLLKITGKGNIADLLTK 1774
Score = 72.8 bits (177), Expect = 5e-12
Identities = 103/446 (23%), Positives = 182/446 (40%), Gaps = 57/446 (12%)
Query: 198 VDTGASRHVCYDRDMFKTYTACDDQKVL--LGDSHSTDVVGIGDIELKF----TSEKTLI 251
+DTG+ ++ D+ + Y + +G + S V G G I++K T K L+
Sbjct: 414 IDTGSGVNITNDKTLLHNYEDSNRSTRFFGIGKNSSVSVKGYGYIKIKNGHNNTDNKCLL 473
Query: 252 LKDVLHTPKIRKNLVSGFLLNKAGFTQSIGADLYTITKNGIFVGKGYATDGMFKLNIDMN 311
+ P+ ++S + L K T+ + + YT N I K +G+ +++ MN
Sbjct: 474 ---TYYVPEEESTIISCYDLAKK--TKMVLSRKYTRLGNKIIKIKTKIVNGV--IHVKMN 526
Query: 312 KI----------------SSSAYMLCDFNIW----HSRLCHVNKRIISNM-------SGL 344
++ SS + L +I H R+ H + I N L
Sbjct: 527 ELIERPSDDSKINAIKPTSSPGFKLNKRSITLEDAHKRMGHTGIQQIENSIKHNHYEESL 586
Query: 345 GLIPKISLNDFEKCQFCSQAKINKESHK--SVTRITEPFELIHSDLCELDGNLTRNG--- 399
LI + N+F CQ C +K K +H S+ + E S ++ G ++ +
Sbjct: 587 DLIKEP--NEFW-CQTCKISKATKRNHYTGSMNNHSTDHEPGSSWCMDIFGPVSSSNADT 643
Query: 400 KRYFITFIDDCSDY--THVYLMRNKNEALDIFKQYVKEIENQFNIRIKRFRSDRGTEYGS 457
KRY + +D+ + Y T + +N L ++ ++ +E QF+ +++ SDRGTE+ +
Sbjct: 644 KRYMLIMVDNNTRYCMTSTHFNKNAETILAQVRKNIQYVETQFDRKVREINSDRGTEFTN 703
Query: 458 HIFNEYYKELGIIHETTAPYSPEMNGKAERKNRTFTELVVATMLNSGAAPHWWGEILLTV 517
EY+ GI H T+ NG+AER RT + S +W + +
Sbjct: 704 DQIEEYFISKGIHHILTSTQDHAANGRAERYIRTIITDATTLLRQSNLRVKFWEYAVTSA 763
Query: 518 CYVLNRVPKTKNKISPYEILKKRQP---NLSYFRTWGCLAYVRKPDPKRVKLASRAYECA 574
+ N + P + + RQP L F +G + + K KL
Sbjct: 764 TNIRNYLEHKSTGKLPLKAI-SRQPVTVRLMSFLPFGEKGIIWNHNHK--KLKPSGLPSI 820
Query: 575 FIGYALNSKAYRFYDLKSKTIIESND 600
+ NS Y+F+ + SK I ++D
Sbjct: 821 ILCKDPNSYGYKFF-IPSKNKIVTSD 845
>YMT5_YEAST (Q04214) Transposon Ty1 protein B
Length = 1328
Score = 128 bits (322), Expect = 8e-29
Identities = 128/504 (25%), Positives = 231/504 (45%), Gaps = 41/504 (8%)
Query: 689 EAINDEMDSLMSNETWHLTDLPPGCKTIGCKWILKKKL----KPDGSIDKYKARLVAKGF 744
EA + E++ L+ +TW TD K I K ++ K DG+ +KAR VA+G
Sbjct: 825 EAYHKEVNQLLKMKTWD-TDKYYDRKEIDPKRVINSMFIFNRKRDGT---HKARFVARGD 880
Query: 745 RQRENVDFFDTYSPVTRITSIRVLISLAAIHNLIVHQMDVKTAFLNGELEEEIYMDQPEG 804
Q + S ++ +SLA +N + Q+D+ +A+L +++EE+Y+ P
Sbjct: 881 IQHPDTYDSGMQSNTVHHYALMTSLSLALDNNYYITQLDISSAYLYADIKEELYIRPPPH 940
Query: 805 FVIHGQENKVCKLDKSLYGLKQAPKQWHEKFDNLMIEN-EFKVNESDKCIYSKYENNTCT 863
G +K+ +L KSLYGLKQ+ W+E + +I+ + C+ +EN+ T
Sbjct: 941 L---GMNDKLIRLKKSLYGLKQSGANWYETIKSYLIKQCGMEEVRGWSCV---FENSQVT 994
Query: 864 IICLYVDDLLIFGSNLNAIKDVKSLLCHNFDMK--DLGKADV-----ILGIKIT-RTDNG 915
ICL+VDD+++F NLN+ K + L +D K +LG++D ILG++I +
Sbjct: 995 -ICLFVDDMVLFSKNLNSNKRIIDKLKMQYDTKIINLGESDEEIQYDILGLEIKYQRGKY 1053
Query: 916 ISLNQSHYVEKILRKYNYFY---CKPASTPCDPSVKL------FKNTGDSVRQTEYASII 966
+ L + + + + K N + S P P + + + ++ E +I
Sbjct: 1054 MKLGMENSLTEKIPKLNVPLNPKGRKLSAPGQPGLYIDQQELELEEDDYKMKVHEMQKLI 1113
Query: 967 GSLRYATDCTRPDISYAVGLLCKFTSRPSMEHWQAIERVMRYLKKTMTLGLHYQRYPAV- 1025
G Y R D+ Y + L + PS + +++++ T L + + V
Sbjct: 1114 GLASYVGYKFRFDLLYYINTLAQHILFPSKQVLDMTYELIQFIWNTRDKQLIWHKSKPVK 1173
Query: 1026 ----LEGYSDADWNNLSDDSKATSGYIFSIAGGAVSWKSKKQTILAQSTMESEMIALAAA 1081
L SDA + N K+ G I+ + G + KS K ++ ST E+E+ A++ +
Sbjct: 1174 PTNKLVVISDASYGN-QPYYKSQIGNIYLLNGKVIGGKSTKASLTCTSTTEAEIHAISES 1232
Query: 1082 SEEASWLRCLLSEIPLWERPLPAVLIHCDSTAAIAKIENRYYNGKRRQIRRKHSTIREYL 1141
+ L L+ E+ ++P+ L+ + I N + R K +R+ +
Sbjct: 1233 VPLLNNLSYLIQELD--KKPITKGLLTDSKSTISIIISNNEEKFRNRFFGTKAMRLRDEV 1290
Query: 1142 SNGTVRVDFVRTNENLADPLTKGL 1165
S + V ++ T +N+AD +TK L
Sbjct: 1291 SGNHLHVCYIETKKNIADVMTKPL 1314
Score = 106 bits (264), Expect = 4e-22
Identities = 110/442 (24%), Positives = 182/442 (40%), Gaps = 41/442 (9%)
Query: 198 VDTGASRHVCYDRDMFKTYTACDDQKVLLGDSHSTDVVGIGDIELKFTSEKTLILKDVLH 257
+D+GASR + + ++ D V+ + + IGD++ F +K VLH
Sbjct: 33 LDSGASRTLIRSAHHIHSASSNPDINVVDAQKRNIPINAIGDLQFHFQDNTKTSIK-VLH 91
Query: 258 TPKIRKNLVSGFLLNK-------AGFTQSI-----GADLYTITKNGIF--VGKGYATDGM 303
TP I +L+S LN+ A FT+++ G L I K G F V K Y
Sbjct: 92 TPNIAYDLLS---LNELAAVDITACFTKNVLERSDGTVLAPIVKYGDFYWVSKKYLLPSN 148
Query: 304 FKLNIDMNKISSSAYMLCDFNIWHSRLCHVNKRIISNMSGLGLIPKISLNDFE------- 356
+ N +S + + H L H N + I I + +D +
Sbjct: 149 ISVPTINNVHTSESTRKYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDWSSAIDY 208
Query: 357 KCQFCSQAKINKESHKSVTRIT-----EPFELIHSDLCELDGNLTRNGKRYFITFIDDCS 411
+C C K K H +R+ EPF+ +H+D+ NL ++ YFI+F D+ +
Sbjct: 209 QCPDCLIGKSTKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISFTDETT 268
Query: 412 DYTHVYLMRNKNE--ALDIFKQYVKEIENQFNIRIKRFRSDRGTEYGSHIFNEYYKELGI 469
+ VY + ++ E LD+F + I+NQF + + DRG+EY + +++ ++ GI
Sbjct: 269 KFRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFLEKNGI 328
Query: 470 IHETTAPYSPEMNGKAERKNRTFTELVVATMLNSGAAPHWWGEILLTVCYVLNRVPKTKN 529
T +G AER NRT + + SG H W + V N + K+
Sbjct: 329 TPCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFSTIVRNSLASPKS 388
Query: 530 KISPYEILKKRQPNLSYFRTWGCLAYVRKPDPKRVKLASRAYECAFIGYAL----NSKAY 585
K S + ++S +G V +P S+ + GYAL NS Y
Sbjct: 389 KKSARQHAGLAGLDISTLLPFGQPVIVNDHNPN-----SKIHPRGIPGYALHPSRNSYGY 443
Query: 586 RFYDLKSKTIIESNDVDFYENK 607
Y K +++ + + K
Sbjct: 444 IIYLPSLKKTVDTTNYVILQGK 465
>YMU0_YEAST (Q04670) Transposon Ty1 protein B
Length = 1328
Score = 128 bits (321), Expect = 1e-28
Identities = 128/509 (25%), Positives = 234/509 (45%), Gaps = 51/509 (10%)
Query: 689 EAINDEMDSLMSNETWHLTDLPPGCKTIGCKWILKKKL----KPDGSIDKYKARLVAKGF 744
+A + E++ L+ +TW TD K I K ++ K DG+ +KAR VA+G
Sbjct: 825 QAYHKEVNQLLKMKTWD-TDRYYDRKEIDPKRVINSMFIFNRKRDGT---HKARFVARG- 879
Query: 745 RQRENVDFFDTYSPVTRITSIR-----VLISLAAIHNLIVHQMDVKTAFLNGELEEEIYM 799
++ DTY P + ++ +SLA +N + Q+D+ +A+L +++EE+Y+
Sbjct: 880 ----DIQHPDTYDPGMQSNTVHHYALMTSLSLALDNNYYITQLDISSAYLYADIKEELYI 935
Query: 800 DQPEGFVIHGQENKVCKLDKSLYGLKQAPKQWHEKFDNLMIEN-EFKVNESDKCIYSKYE 858
P G +K+ +L KSLYGLKQ+ W+E + +I+ + C++
Sbjct: 936 RPPPHL---GMNDKLIRLKKSLYGLKQSGANWYETIKSYLIKQCGMEEVRGWSCVFK--- 989
Query: 859 NNTCTIICLYVDDLLIFGSNLNAIKDVKSLLCHNFDMK--DLGKAD-----VILGIKIT- 910
N+ ICL+VDD+++F +LNA K + + L +D K +LG++D ILG++I
Sbjct: 990 -NSQVTICLFVDDMILFSKDLNANKKIITTLKKQYDTKIINLGESDNEIQYDILGLEIKY 1048
Query: 911 RTDNGISLNQSHYVEKILRKYNYFY---CKPASTPCDPSVKL------FKNTGDSVRQTE 961
+ + L + + + + K N + S P P + + + ++ E
Sbjct: 1049 QRGKYMKLGMENSLTEKIPKLNVPLNPKGRKLSAPGQPGLYIDQQELELEEDDYKMKVHE 1108
Query: 962 YASIIGSLRYATDCTRPDISYAVGLLCKFTSRPSMEHWQAIERVMRYLKKTMTLGLHYQR 1021
+IG Y R D+ Y + L + PS + +++++ T L + +
Sbjct: 1109 MQKLIGLASYVGYKFRFDLLYYINTLAQHILFPSKQVLDMTYELIQFIWNTRDKQLIWHK 1168
Query: 1022 YPAV-----LEGYSDADWNNLSDDSKATSGYIFSIAGGAVSWKSKKQTILAQSTMESEMI 1076
V L SDA + N K+ G I+ + G + KS K ++ ST E+E+
Sbjct: 1169 SKPVKPTNKLVVISDASYGN-QPYYKSQIGNIYLLNGKVIGGKSTKASLTCTSTTEAEIH 1227
Query: 1077 ALAAASEEASWLRCLLSEIPLWERPLPAVLIHCDSTAAIAKIENRYYNGKRRQIRRKHST 1136
A++ + + L L+ E L ++P+ L+ + I N + R K
Sbjct: 1228 AISESVPLLNNLSHLVQE--LNKKPITKGLLTDSKSTISIIISNNEEKFRNRFFGTKAMR 1285
Query: 1137 IREYLSNGTVRVDFVRTNENLADPLTKGL 1165
+R+ +S + V ++ T +N+AD +TK L
Sbjct: 1286 LRDEVSGNHLHVCYIETKKNIADVMTKPL 1314
Score = 106 bits (264), Expect = 4e-22
Identities = 110/442 (24%), Positives = 182/442 (40%), Gaps = 41/442 (9%)
Query: 198 VDTGASRHVCYDRDMFKTYTACDDQKVLLGDSHSTDVVGIGDIELKFTSEKTLILKDVLH 257
+D+GASR + + ++ D V+ + + IGD++ F +K VLH
Sbjct: 33 LDSGASRTLIRSAHHIHSASSNPDINVVDAQKRNIPINAIGDLQFHFQDNTKTSIK-VLH 91
Query: 258 TPKIRKNLVSGFLLNK-------AGFTQSI-----GADLYTITKNGIF--VGKGYATDGM 303
TP I +L+S LN+ A FT+++ G L I K G F V K Y
Sbjct: 92 TPNIAYDLLS---LNELAAVDITACFTKNVLERSDGTVLAPIVKYGDFYWVSKKYLLPSN 148
Query: 304 FKLNIDMNKISSSAYMLCDFNIWHSRLCHVNKRIISNMSGLGLIPKISLNDFE------- 356
+ N +S + + H L H N + I I + +D +
Sbjct: 149 ISVPTINNVHTSESTRKYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDWSSAIDY 208
Query: 357 KCQFCSQAKINKESHKSVTRIT-----EPFELIHSDLCELDGNLTRNGKRYFITFIDDCS 411
+C C K K H +R+ EPF+ +H+D+ NL ++ YFI+F D+ +
Sbjct: 209 QCPDCLIGKSTKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISFTDETT 268
Query: 412 DYTHVYLMRNKNE--ALDIFKQYVKEIENQFNIRIKRFRSDRGTEYGSHIFNEYYKELGI 469
+ VY + ++ E LD+F + I+NQF + + DRG+EY + +++ ++ GI
Sbjct: 269 KFRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFLEKNGI 328
Query: 470 IHETTAPYSPEMNGKAERKNRTFTELVVATMLNSGAAPHWWGEILLTVCYVLNRVPKTKN 529
T +G AER NRT + + SG H W + V N + K+
Sbjct: 329 TPCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFSTIVRNSLASPKS 388
Query: 530 KISPYEILKKRQPNLSYFRTWGCLAYVRKPDPKRVKLASRAYECAFIGYAL----NSKAY 585
K S + ++S +G V +P S+ + GYAL NS Y
Sbjct: 389 KKSARQHAGLAGLDISTLLPFGQPVIVNDHNPN-----SKIHPRGIPGYALHPSRNSYGY 443
Query: 586 RFYDLKSKTIIESNDVDFYENK 607
Y K +++ + + K
Sbjct: 444 IIYLPSLKKTVDTTNYVILQGK 465
>YJZ9_YEAST (P47100) Transposon Ty1 protein B
Length = 1755
Score = 128 bits (321), Expect = 1e-28
Identities = 126/504 (25%), Positives = 229/504 (45%), Gaps = 41/504 (8%)
Query: 689 EAINDEMDSLMSNETWHLTDLPPGCKTIGCKWILKKKL----KPDGSIDKYKARLVAKGF 744
EA + E++ L+ +TW TD K I K ++ K DG+ +KAR VA+G
Sbjct: 1252 EAYHKEVNQLLKMKTWD-TDEYYDRKEIDPKRVINSMFIFNKKRDGT---HKARFVARGD 1307
Query: 745 RQRENVDFFDTYSPVTRITSIRVLISLAAIHNLIVHQMDVKTAFLNGELEEEIYMDQPEG 804
Q + S ++ +SLA +N + Q+D+ +A+L +++EE+Y+ P
Sbjct: 1308 IQHPDTYDSGMQSNTVHHYALMTSLSLALDNNYYITQLDISSAYLYADIKEELYIRPPPH 1367
Query: 805 FVIHGQENKVCKLDKSLYGLKQAPKQWHEKFDNLMIEN-EFKVNESDKCIYSKYENNTCT 863
G +K+ +L KSLYGLKQ+ W+E + +I+ + C++ N+
Sbjct: 1368 L---GMNDKLIRLKKSLYGLKQSGANWYETIKSYLIQQCGMEEVRGWSCVF----KNSQV 1420
Query: 864 IICLYVDDLLIFGSNLNAIKDVKSLLCHNFDMK--DLGKADV-----ILGIKIT-RTDNG 915
ICL+VDD+++F NLN+ K + L +D K +LG++D ILG++I +
Sbjct: 1421 TICLFVDDMVLFSKNLNSNKRIIEKLKMQYDTKIINLGESDEEIQYDILGLEIKYQRGKY 1480
Query: 916 ISLNQSHYVEKILRKYNYFY---CKPASTPCDPSVKL------FKNTGDSVRQTEYASII 966
+ L + + + + K N + S P P + + + ++ E +I
Sbjct: 1481 MKLGMENSLTEKIPKLNVPLNPKGRKLSAPGQPGLYIDQQELELEEDDYKMKVHEMQKLI 1540
Query: 967 GSLRYATDCTRPDISYAVGLLCKFTSRPSMEHWQAIERVMRYLKKTMTLGLHYQRYPAV- 1025
G Y R D+ Y + L + PS + +++++ T L + + V
Sbjct: 1541 GLASYVGYKFRFDLLYYINTLAQHILFPSKQVLDMTYELIQFIWNTRDKQLIWHKSKPVK 1600
Query: 1026 ----LEGYSDADWNNLSDDSKATSGYIFSIAGGAVSWKSKKQTILAQSTMESEMIALAAA 1081
L SDA + N K+ G I+ + G + KS K ++ ST E+E+ A++ +
Sbjct: 1601 PTNKLVVISDASYGN-QPYYKSQIGNIYLLNGKVIGGKSTKASLTCTSTTEAEIHAISES 1659
Query: 1082 SEEASWLRCLLSEIPLWERPLPAVLIHCDSTAAIAKIENRYYNGKRRQIRRKHSTIREYL 1141
+ L L+ E+ ++P+ L+ + I N + R K +R+ +
Sbjct: 1660 VPLLNNLSYLIQELD--KKPITKGLLTDSKSTISIIISNNEEKFRNRFFGTKAMRLRDEV 1717
Query: 1142 SNGTVRVDFVRTNENLADPLTKGL 1165
S + V ++ T +N+AD +TK L
Sbjct: 1718 SGNHLHVCYIETKKNIADVMTKPL 1741
Score = 107 bits (268), Expect = 1e-22
Identities = 128/546 (23%), Positives = 219/546 (39%), Gaps = 53/546 (9%)
Query: 105 KPFKNQNRPMNKNSNRNKTGNNSRPQI----QQPPKNDAAPPFNCYNCGQADHMARKCRN 160
KP +N KN +R+ T N ++P++ Q N + +N +++ +
Sbjct: 357 KPNYRRNPSDEKNDSRSYT-NTTKPKVIARNPQKTNNSKSKTARAHNVSTSNNSPSTDND 415
Query: 161 RTNRPAQAHMATDAAPDEPYVAMITE--INMIAGSDG-----WWVDTGASRHVCYDRDMF 213
++ + + D +TE +N SD +D+GASR +
Sbjct: 416 SISKSTTEPIQLNNKHDLILGQKLTESTVNHTNHSDDELPGHLLLDSGASRTLIRSAHHI 475
Query: 214 KTYTACDDQKVLLGDSHSTDVVGIGDIELKFTSEKTLILKDVLHTPKIRKNLVSGFLLNK 273
+ ++ D V+ + + IGD++ F +K VLHTP I +L+S LN+
Sbjct: 476 HSASSNPDINVVDAQKRNIPINAIGDLQFHFQDNTKTSIK-VLHTPNIAYDLLS---LNE 531
Query: 274 -------AGFTQSI-----GADLYTITKNGIF--VGKGYATDGMFKLNIDMNKISSSAYM 319
A FT+++ G L I K G F V K Y + N +S +
Sbjct: 532 LAAVDITACFTKNVLERSDGTVLAPIVKYGDFYWVSKKYLLPSNISVPTINNVHTSESTR 591
Query: 320 LCDFNIWHSRLCHVNKRIISNMSGLGLIPKISLNDFE-------KCQFCSQAKINKESHK 372
+ H L H N + I I + +D + +C C K K H
Sbjct: 592 KYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDWSSAIDYQCPDCLIGKSTKHRHI 651
Query: 373 SVTRIT-----EPFELIHSDLCELDGNLTRNGKRYFITFIDDCSDYTHVYLMRNKNE--A 425
+R+ EPF+ +H+D+ NL ++ YFI+F D+ + + VY + ++ E
Sbjct: 652 KGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISFTDETTKFRWVYPLHDRREDSI 711
Query: 426 LDIFKQYVKEIENQFNIRIKRFRSDRGTEYGSHIFNEYYKELGIIHETTAPYSPEMNGKA 485
LD+F + I+NQF + + DRG+EY + +++ ++ GI T +G A
Sbjct: 712 LDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFLEKNGITPCYTTTADSRAHGVA 771
Query: 486 ERKNRTFTELVVATMLNSGAAPHWWGEILLTVCYVLNRVPKTKNKISPYEILKKRQPNLS 545
ER NRT + + SG H W + V N + K+K S + ++S
Sbjct: 772 ERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFSTIVRNSLASPKSKKSARQHAGLAGLDIS 831
Query: 546 YFRTWGCLAYVRKPDPKRVKLASRAYECAFIGYAL----NSKAYRFYDLKSKTIIESNDV 601
+G V +P S+ + GYAL NS Y Y K +++ +
Sbjct: 832 TLLPFGQPVIVNDHNPN-----SKIHPRGIPGYALHPSRNSYGYIIYLPSLKKTVDTTNY 886
Query: 602 DFYENK 607
+ K
Sbjct: 887 VILQGK 892
>YME4_YEAST (Q04711) Transposon Ty1 protein B
Length = 1328
Score = 125 bits (315), Expect = 5e-28
Identities = 127/504 (25%), Positives = 230/504 (45%), Gaps = 41/504 (8%)
Query: 689 EAINDEMDSLMSNETWHLTDLPPGCKTIGCKWILKKKL----KPDGSIDKYKARLVAKGF 744
EA + E++ L+ TW TD K I K ++ K DG+ +KAR VA+G
Sbjct: 825 EAYHKEVNQLLKMNTWD-TDKYYDRKEIDPKRVINSMFIFNRKRDGT---HKARFVARGD 880
Query: 745 RQRENVDFFDTYSPVTRITSIRVLISLAAIHNLIVHQMDVKTAFLNGELEEEIYMDQPEG 804
Q + S ++ +SLA +N + Q+D+ +A+L +++EE+Y+ P
Sbjct: 881 IQHPDTYDSGMQSNTVHHYALMTSLSLALDNNYYITQLDISSAYLYADIKEELYIRPPPH 940
Query: 805 FVIHGQENKVCKLDKSLYGLKQAPKQWHEKFDNLMIEN-EFKVNESDKCIYSKYENNTCT 863
G +K+ +L KSLYGLKQ+ W+E + +I+ + C++ N+
Sbjct: 941 L---GMNDKLIRLKKSLYGLKQSGANWYETIKSYLIKQCGMEEVRGWSCVF----KNSQV 993
Query: 864 IICLYVDDLLIFGSNLNAIKDVKSLLCHNFDMK--DLGKAD-----VILGIKIT-RTDNG 915
ICL+VDD+++F +LNA K + + L +D K +LG++D ILG++I +
Sbjct: 994 TICLFVDDMILFSKDLNANKKIITTLKKQYDTKIINLGESDNEIQYDILGLEIKYQRGKY 1053
Query: 916 ISLNQSHYVEKILRKYNYFY---CKPASTPCDPSVKLFKN----TGDSVRQT--EYASII 966
+ L + + + + K N + S P P + + ++ D ++ E +I
Sbjct: 1054 MKLGMENSLTEKIPKLNVPLNPKGRKLSAPGQPGLYIDQDELEIDEDEYKEKVHEMQKLI 1113
Query: 967 GSLRYATDCTRPDISYAVGLLCKFTSRPSMEHWQAIERVMRYLKKTMTLGLHYQRYPAV- 1025
G Y R D+ Y + L + PS + +++++ T L + +
Sbjct: 1114 GLASYVGYKFRFDLLYYINTLAQHILFPSRQVLDMTYELIQFMWDTRDKQLIWHKNKPTE 1173
Query: 1026 ----LEGYSDADWNNLSDDSKATSGYIFSIAGGAVSWKSKKQTILAQSTMESEMIALAAA 1081
L SDA + N K+ G I+ + G + KS K ++ ST E+E+ A++ +
Sbjct: 1174 PDNKLVAISDASYGN-QPYYKSQIGNIYLLNGKVIGGKSTKASLTCTSTTEAEIHAISES 1232
Query: 1082 SEEASWLRCLLSEIPLWERPLPAVLIHCDSTAAIAKIENRYYNGKRRQIRRKHSTIREYL 1141
+ L L+ E L ++P+ L+ + I N + R K +R+ +
Sbjct: 1233 VPLLNNLSHLVQE--LNKKPITKGLLTDSKSTISIIISNNEEKFRNRFFGTKAMRLRDEV 1290
Query: 1142 SNGTVRVDFVRTNENLADPLTKGL 1165
S + V ++ T +N+AD +TK L
Sbjct: 1291 SGNHLHVCYIETKKNIADVMTKPL 1314
Score = 106 bits (264), Expect = 4e-22
Identities = 110/442 (24%), Positives = 182/442 (40%), Gaps = 41/442 (9%)
Query: 198 VDTGASRHVCYDRDMFKTYTACDDQKVLLGDSHSTDVVGIGDIELKFTSEKTLILKDVLH 257
+D+GASR + + ++ D V+ + + IGD++ F +K VLH
Sbjct: 33 LDSGASRTLIRSAHHIHSASSNPDINVVDAQKRNIPINAIGDLQFHFQDNTKTSIK-VLH 91
Query: 258 TPKIRKNLVSGFLLNK-------AGFTQSI-----GADLYTITKNGIF--VGKGYATDGM 303
TP I +L+S LN+ A FT+++ G L I K G F V K Y
Sbjct: 92 TPNIAYDLLS---LNELAAVDITACFTKNVLERSDGTVLAPIVKYGDFYWVSKKYLLPSN 148
Query: 304 FKLNIDMNKISSSAYMLCDFNIWHSRLCHVNKRIISNMSGLGLIPKISLNDFE------- 356
+ N +S + + H L H N + I I + +D +
Sbjct: 149 ISVPTINNVHTSESTRKYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDWSSAIDY 208
Query: 357 KCQFCSQAKINKESHKSVTRIT-----EPFELIHSDLCELDGNLTRNGKRYFITFIDDCS 411
+C C K K H +R+ EPF+ +H+D+ NL ++ YFI+F D+ +
Sbjct: 209 QCPDCLIGKSTKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISFTDETT 268
Query: 412 DYTHVYLMRNKNE--ALDIFKQYVKEIENQFNIRIKRFRSDRGTEYGSHIFNEYYKELGI 469
+ VY + ++ E LD+F + I+NQF + + DRG+EY + +++ ++ GI
Sbjct: 269 KFRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFLEKNGI 328
Query: 470 IHETTAPYSPEMNGKAERKNRTFTELVVATMLNSGAAPHWWGEILLTVCYVLNRVPKTKN 529
T +G AER NRT + + SG H W + V N + K+
Sbjct: 329 TPCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFSTIVRNSLASPKS 388
Query: 530 KISPYEILKKRQPNLSYFRTWGCLAYVRKPDPKRVKLASRAYECAFIGYAL----NSKAY 585
K S + ++S +G V +P S+ + GYAL NS Y
Sbjct: 389 KKSARQHAGLAGLDISTLLPFGQPVIVNDHNPN-----SKIHPRGIPGYALHPSRNSYGY 443
Query: 586 RFYDLKSKTIIESNDVDFYENK 607
Y K +++ + + K
Sbjct: 444 IIYLPSLKKTVDTTNYVILQGK 465
>YMD9_YEAST (Q03434) Transposon Ty1 protein B
Length = 1328
Score = 125 bits (314), Expect = 7e-28
Identities = 130/505 (25%), Positives = 234/505 (45%), Gaps = 43/505 (8%)
Query: 689 EAINDEMDSLMSNETWHLTDLPPGCKTIGCKWILKKKL----KPDGSIDKYKARLVAKGF 744
EA + E++ L+ +TW TD K I K ++ K DG+ +KAR VA+G
Sbjct: 825 EAYHKEVNQLLKMKTWD-TDEYYDRKEIDPKRVINSMFIFNKKRDGT---HKARFVARGD 880
Query: 745 RQRENVDFFDTYSPVTRITSIRVLISLAAIHNLIVHQMDVKTAFLNGELEEEIYMDQPEG 804
Q + S ++ +SLA +N + Q+D+ +A+L +++EE+Y+ P
Sbjct: 881 IQHPDTYDSGMQSNTVHHYALMTSLSLALDNNYYITQLDISSAYLYADIKEELYIRPPPH 940
Query: 805 FVIHGQENKVCKLDKSLYGLKQAPKQWHEKFDNLMIEN-EFKVNESDKCIYSKYENNTCT 863
G +K+ +L KSLYGLKQ+ W+E + +I+ + C++ N+
Sbjct: 941 L---GMNDKLIRLKKSLYGLKQSGANWYETIKSYLIKQCGMEEVRGWSCVF----KNSQV 993
Query: 864 IICLYVDDLLIFGSNLNAIKDVKSLLCHNFDMK--DLGKAD-----VILGIKIT-RTDNG 915
ICL+VDD+++F +LNA K + + L +D K +LG++D ILG++I +
Sbjct: 994 TICLFVDDMILFSKDLNANKKIITTLKKQYDTKIINLGESDNEIQYDILGLEIKYQRGKY 1053
Query: 916 ISLNQSHYVEKILRKYNYFY---CKPASTPCDPSVKLFKN----TGDSVRQT--EYASII 966
+ L + + + + K N + S P P + + ++ D ++ E +I
Sbjct: 1054 MKLGMENSLTEKIPKLNVPLNPKGRKLSAPGQPGLYIDQDELEIDEDEYKEKVHEMQKLI 1113
Query: 967 GSLRYATDCTRPDISYAVGLLCKFTSRPSMEHWQAIERVMRYLKKTMTLGLHYQRYPAV- 1025
G Y R D+ Y + L + PS + +++++ T L + +
Sbjct: 1114 GLASYVGYKFRFDLLYYINTLAQHILFPSRQVLDMTYELIQFMWDTRDKQLIWHKNKPTE 1173
Query: 1026 ----LEGYSDADWNNLSDDSKATSGYIFSIAGGAVSWKSKKQTILAQSTMESEMIALAAA 1081
L SDA + N K+ G I+ + G + KS K ++ ST E+E+ A++ +
Sbjct: 1174 PDNKLVAISDASYGN-QPYYKSQIGNIYLLNGKVIGGKSTKASLTCTSTTEAEIHAISES 1232
Query: 1082 SEEASWLRCLLSEIPLWERP-LPAVLIHCDSTAAIAKIENRYYNGKRRQIRRKHSTIREY 1140
+ L L+ E L ++P + +L ST +I K N + R K +R+
Sbjct: 1233 VPLLNNLSYLIQE--LNKKPIIKGLLTDSRSTISIIKSTNE-EKFRNRFFGTKAMRLRDE 1289
Query: 1141 LSNGTVRVDFVRTNENLADPLTKGL 1165
+S + V ++ T +N+AD +TK L
Sbjct: 1290 VSGNNLYVYYIETKKNIADVMTKPL 1314
Score = 106 bits (265), Expect = 3e-22
Identities = 110/442 (24%), Positives = 182/442 (40%), Gaps = 41/442 (9%)
Query: 198 VDTGASRHVCYDRDMFKTYTACDDQKVLLGDSHSTDVVGIGDIELKFTSEKTLILKDVLH 257
+D+GASR + + ++ D V+ + + IGD++ F +K VLH
Sbjct: 33 LDSGASRTLIRSAHHIHSASSNPDINVVDAQKRNIPINAIGDLQFHFQDNTKTSIK-VLH 91
Query: 258 TPKIRKNLVSGFLLNK-------AGFTQSI-----GADLYTITKNGIF--VGKGYATDGM 303
TP I +L+S LN+ A FT+++ G L I K G F V K Y
Sbjct: 92 TPNIAYDLLS---LNELAAVDITACFTKNVLERSDGTVLAPIVKYGDFYWVSKKYLLPSN 148
Query: 304 FKLNIDMNKISSSAYMLCDFNIWHSRLCHVNKRIISNMSGLGLIPKISLNDFEK------ 357
+ N +S + + H L H N + I I + +D ++
Sbjct: 149 ISVPTINNVHTSESTRKYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDRSSAIDY 208
Query: 358 -CQFCSQAKINKESHKSVTRIT-----EPFELIHSDLCELDGNLTRNGKRYFITFIDDCS 411
C C K K H +R+ EPF+ +H+D+ NL ++ YFI+F D+ +
Sbjct: 209 QCPDCLIGKSTKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISFTDETT 268
Query: 412 DYTHVYLMRNKNE--ALDIFKQYVKEIENQFNIRIKRFRSDRGTEYGSHIFNEYYKELGI 469
+ VY + ++ E LD+F + I+NQF + + DRG+EY + +++ ++ GI
Sbjct: 269 KFRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFLEKNGI 328
Query: 470 IHETTAPYSPEMNGKAERKNRTFTELVVATMLNSGAAPHWWGEILLTVCYVLNRVPKTKN 529
T +G AER NRT + + SG H W + V N + K+
Sbjct: 329 TPCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFSTIVRNSLASPKS 388
Query: 530 KISPYEILKKRQPNLSYFRTWGCLAYVRKPDPKRVKLASRAYECAFIGYAL----NSKAY 585
K S + ++S +G V +P S+ + GYAL NS Y
Sbjct: 389 KKSARQHAGLAGLDISTLLPFGQPVIVNDHNPN-----SKIHPRGIPGYALHPSRNSYGY 443
Query: 586 RFYDLKSKTIIESNDVDFYENK 607
Y K +++ + + K
Sbjct: 444 IIYLPSLKKTVDTTNYVILQGK 465
>YJZ7_YEAST (P47098) Transposon Ty1 protein B
Length = 1755
Score = 124 bits (311), Expect = 2e-27
Identities = 132/507 (26%), Positives = 237/507 (46%), Gaps = 47/507 (9%)
Query: 689 EAINDEMDSLMSNETWHLTDLPPGCKTIGCKWILKKKL----KPDGSIDKYKARLVAKGF 744
EA + E++ L+ +TW TD K I K ++ K DG+ +KAR VA+G
Sbjct: 1252 EAYHKEVNQLLKMKTWD-TDEYYDRKEIDPKRVINSMFIFNKKRDGT---HKARFVARG- 1306
Query: 745 RQRENVDFFDT--YSPVTRITSIRVLISLAAIHNLIVHQMDVKTAFLNGELEEEIYMDQP 802
++ D +DT S ++ +SLA +N + Q+D+ +A+L +++EE+Y+ P
Sbjct: 1307 -DIQHPDTYDTGMQSNTVHHYALMTSLSLALDNNYYITQLDISSAYLYADIKEELYIRPP 1365
Query: 803 EGFVIHGQENKVCKLDKSLYGLKQAPKQWHEKFDNLMIEN-EFKVNESDKCIYSKYENNT 861
G +K+ +L KS YGLKQ+ W+E + +I+ + C++ N+
Sbjct: 1366 PHL---GMNDKLIRLKKSHYGLKQSGANWYETIKSYLIKQCGMEEVRGWSCVF----KNS 1418
Query: 862 CTIICLYVDDLLIFGSNLNAIKDVKSLLCHNFDMK--DLGKAD-----VILGIKIT-RTD 913
ICL+VDD+++F +LNA K + + L +D K +LG++D ILG++I +
Sbjct: 1419 QVTICLFVDDMILFSKDLNANKKIITTLKKQYDTKIINLGESDNEIQYDILGLEIKYQRG 1478
Query: 914 NGISLNQSHYVEKILRKYNYFY---CKPASTPCDPSVKLFKN----TGDSVRQT--EYAS 964
+ L + + + + K N + S P P + + ++ D ++ E
Sbjct: 1479 KYMKLGMENSLTEKIPKLNVPLNPKGRKLSAPGQPGLYIDQDELEIDEDEYKEKVHEMQK 1538
Query: 965 IIGSLRYATDCTRPDISYAVGLLCKFTSRPSMEHWQAIERVMRYLKKTMTLGLHYQRYPA 1024
+IG Y R D+ Y + L + PS + +++++ T L + +
Sbjct: 1539 LIGLASYVGYKFRFDLLYYINTLAQHILFPSRQVLDMTYELIQFMWDTRDKQLIWHKNKP 1598
Query: 1025 V-----LEGYSDADWNNLSDDSKATSGYIFSIAGGAVSWKSKKQTILAQSTMESEMIALA 1079
L SDA + N K+ G IF + G + KS K ++ ST E+E+ A++
Sbjct: 1599 TEPDNKLVAISDASYGN-QPYYKSQIGNIFLLNGKVIGGKSTKASLTCTSTTEAEIHAIS 1657
Query: 1080 AASEEASWLRCLLSEIPLWERP-LPAVLIHCDSTAAIAKIENRYYNGKRRQIRRKHSTIR 1138
+ + L L+ E L ++P + +L ST +I K N + R K +R
Sbjct: 1658 ESVPLLNNLSYLIQE--LNKKPIIKGLLTDSRSTISIIKSTNE-EKFRNRFFGTKAMRLR 1714
Query: 1139 EYLSNGTVRVDFVRTNENLADPLTKGL 1165
+ +S + V ++ T +N+AD +TK L
Sbjct: 1715 DEVSGNNLYVYYIETKKNIADVMTKPL 1741
Score = 106 bits (265), Expect = 3e-22
Identities = 127/546 (23%), Positives = 219/546 (39%), Gaps = 53/546 (9%)
Query: 105 KPFKNQNRPMNKNSNRNKTGNNSRPQI----QQPPKNDAAPPFNCYNCGQADHMARKCRN 160
KP +N KN +R+ T N ++P++ Q N + +N +++ +
Sbjct: 357 KPNYRRNLSDEKNDSRSYT-NTTKPKVIARNPQKTNNSKSKTARAHNVSTSNNSPSTDND 415
Query: 161 RTNRPAQAHMATDAAPDEPYVAMITE--INMIAGSDG-----WWVDTGASRHVCYDRDMF 213
++ + + D +TE +N SD +D+GASR +
Sbjct: 416 SISKSTTEPIQLNNKHDLTLGQELTESTVNHTNHSDDELPGHLLLDSGASRTLIRSAHHI 475
Query: 214 KTYTACDDQKVLLGDSHSTDVVGIGDIELKFTSEKTLILKDVLHTPKIRKNLVSGFLLNK 273
+ ++ D V+ + + IGD++ F +K VLHTP I +L+S LN+
Sbjct: 476 HSASSNPDINVVDAQKRNIPINAIGDLQFHFQDNTKTSIK-VLHTPNIAYDLLS---LNE 531
Query: 274 -------AGFTQSI-----GADLYTITKNGIF--VGKGYATDGMFKLNIDMNKISSSAYM 319
A FT+++ G L I + G F V K Y + N +S +
Sbjct: 532 LAAVDITACFTKNVLERSDGTVLAPIVQYGDFYWVSKRYLLPSNISVPTINNVHTSESTR 591
Query: 320 LCDFNIWHSRLCHVNKRIISNMSGLGLIPKISLNDFE-------KCQFCSQAKINKESHK 372
+ H L H N + I I + +D + +C C K K H
Sbjct: 592 KYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDWSSAIDYQCPDCLIGKSTKHRHI 651
Query: 373 SVTRIT-----EPFELIHSDLCELDGNLTRNGKRYFITFIDDCSDYTHVYLMRNKNE--A 425
+R+ EPF+ +H+D+ NL ++ YFI+F D+ + + VY + ++ E
Sbjct: 652 KGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISFTDETTKFRWVYPLHDRREDSI 711
Query: 426 LDIFKQYVKEIENQFNIRIKRFRSDRGTEYGSHIFNEYYKELGIIHETTAPYSPEMNGKA 485
LD+F + I+NQF + + DRG+EY + +++ ++ GI T +G A
Sbjct: 712 LDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFLEKNGITPCYTTTADSRAHGVA 771
Query: 486 ERKNRTFTELVVATMLNSGAAPHWWGEILLTVCYVLNRVPKTKNKISPYEILKKRQPNLS 545
ER NRT + + SG H W + V N + K+K S + ++S
Sbjct: 772 ERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFSTIVRNSLASPKSKKSARQHAGLAGLDIS 831
Query: 546 YFRTWGCLAYVRKPDPKRVKLASRAYECAFIGYAL----NSKAYRFYDLKSKTIIESNDV 601
+G V +P S+ + GYAL NS Y Y K +++ +
Sbjct: 832 TLLPFGQPVIVNDHNPN-----SKIHPRGIPGYALHPSRNSYGYIIYLPSLKKTVDTTNY 886
Query: 602 DFYENK 607
+ K
Sbjct: 887 VILQGK 892
>M820_ARATH (P92520) Hypothetical mitochondrial protein AtMg00820
(ORF170)
Length = 170
Score = 99.8 bits (247), Expect = 4e-20
Identities = 49/115 (42%), Positives = 73/115 (62%), Gaps = 6/115 (5%)
Query: 661 PEY---VAYTIEEDPSSIKEALSSIDADLWQEAINDEMDSLMSNETWHLTDLPPGCKTIG 717
P+Y + TI+++P S+ AL W +A+ +E+D+L N+TW L P +G
Sbjct: 14 PKYSLTITTTIKKEPKSVIFALKDPG---WCQAMQEELDALSRNKTWILVPPPVNQNILG 70
Query: 718 CKWILKKKLKPDGSIDKYKARLVAKGFRQRENVDFFDTYSPVTRITSIRVLISLA 772
CKW+ K KL DG++D+ KARLVAKGF Q E + F +TYSPV R +IR ++++A
Sbjct: 71 CKWVFKTKLHSDGTLDRLKARLVAKGFHQEEGIYFVETYSPVVRTATIRTILNVA 125
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 61.2 bits (147), Expect = 2e-08
Identities = 52/193 (26%), Positives = 79/193 (39%), Gaps = 13/193 (6%)
Query: 357 KCQFCSQAKINKESHKSVTRITEPFELIHSDLCELDGNLTR--NGKRYFITFIDDCSDYT 414
KCQ C +AK K + +T P + + G L + NG Y +T I D + Y
Sbjct: 938 KCQKCQKAKTTKHTKTPMTITETPEHAFDRVVVDTIGPLPKSENGNEYAVTLICDLTKYL 997
Query: 415 HVYLMRNKNEALDIFKQYVKEIENQFNIR---IKRFRSDRGTEYGSHIFNEYYKELGIIH 471
+ NK+ K K I F ++ +K F +D GTEY + I + K L I +
Sbjct: 998 VAIPIANKSA-----KTVAKAIFESFILKYGPMKTFITDMGTEYKNSIITDLCKYLKIKN 1052
Query: 472 ETTAPYSPEMNGKAERKNRTFTELVVATMLNSGAAPHWWGEILLTVCYVLNRVPKTKNKI 531
T+ + + G ER +RT E + + + W L Y N +
Sbjct: 1053 ITSTAHHHQTVGVVERSHRTLNEYIRSYISTDKTD---WDVWLQYFVYCFNTTQSMVHNY 1109
Query: 532 SPYEILKKRQPNL 544
PYE++ R NL
Sbjct: 1110 CPYELVFGRTSNL 1122
>POL_MLVFF (P26809) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1204
Score = 60.8 bits (146), Expect = 2e-08
Identities = 54/211 (25%), Positives = 89/211 (41%), Gaps = 16/211 (7%)
Query: 356 EKCQFCSQAKINKESHKSVTRITEPFELIHSDLCELDGNLTRNGKRYFITFIDDCSDYTH 415
E CQ C+Q +K + K TR+ H ++ + G +Y + FID S +
Sbjct: 888 ETCQACAQVNASKSAVKQGTRVRGHRPGTHWEIDFTEVKPGLYGYKYLLVFIDTFSGWVE 947
Query: 416 VYLMRNKNEALDIFKQYVKEIENQFNIRIKRFRSDRGTEYGSHIFNEYYKELGIIHETTA 475
+ + K A + K+ ++EI +F + + +D G + S + LG+ +
Sbjct: 948 AFPTK-KETAKVVTKKLLEEIFPRFGMP-QVLGTDNGPAFVSKVSQTVADLLGVDWKLHC 1005
Query: 476 PYSPEMNGKAERKNRTFTELVVATMLNSGAAPHWWGEILLTVCYVLNRVPKTKNKISPYE 535
Y P+ +G+ ER NRT E + L +G+ W +L Y P + ++PYE
Sbjct: 1006 AYRPQSSGQVERMNRTIKETLTKLTLATGSRD--WVLLLPLALYRARNTP-GPHGLTPYE 1062
Query: 536 ILKKRQPNLSYFRTWGCLAYVRKPDPKRVKL 566
IL P L F PDP K+
Sbjct: 1063 ILYGAPPPLVNF-----------PDPDMAKV 1082
>POL_MLVMO (P03355) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1199
Score = 59.3 bits (142), Expect = 6e-08
Identities = 54/197 (27%), Positives = 86/197 (43%), Gaps = 9/197 (4%)
Query: 353 NDFEKCQFCSQAKINKESHKSVTRIT--EPFELIHSDLCELDGNLTRNGKRYFITFIDDC 410
N E C+ C+Q +K + K TR+ P D E+ L G +Y + FID
Sbjct: 880 NITETCKACAQVNASKSAVKQGTRVRGHRPGTHWEIDFTEIKPGLY--GYKYLLVFIDTF 937
Query: 411 SDYTHVYLMRNKNEALDIFKQYVKEIENQFNIRIKRFRSDRGTEYGSHIFNEYYKELGII 470
S + + + K A + K+ ++EI +F + + +D G + S + LGI
Sbjct: 938 SGWIEAFPTK-KETAKVVTKKLLEEIFPRFGMP-QVLGTDNGPAFVSKVSQTVADLLGID 995
Query: 471 HETTAPYSPEMNGKAERKNRTFTELVVATMLNSGAAPHWWGEILLTVCYVLNRVPKTKNK 530
+ Y P+ +G+ ER NRT E + L +G+ W +L Y P +
Sbjct: 996 WKLHCAYRPQSSGQVERMNRTIKETLTKLTLATGSRD--WVLLLPLALYRARNTP-GPHG 1052
Query: 531 ISPYEILKKRQPNLSYF 547
++PYEIL P L F
Sbjct: 1053 LTPYEILYGAPPPLVNF 1069
>POL3_MOUSE (P11367) Retrovirus-related Pol polyprotein
(Endonuclease) (Fragment)
Length = 390
Score = 59.3 bits (142), Expect = 6e-08
Identities = 48/192 (25%), Positives = 84/192 (43%), Gaps = 5/192 (2%)
Query: 356 EKCQFCSQAKINKESHKSVTRITEPFELIHSDLCELDGNLTRNGKRYFITFIDDCSDYTH 415
E CQ C Q +K ++ TR+ H ++ + G +Y + F+D S +
Sbjct: 92 ESCQACVQVNASKTKIRAGTRVRGHRLGTHWEIDFTEVKPGLYGYKYLLVFVDTFSGWVE 151
Query: 416 VYLMRNKNEALDIFKQYVKEIENQFNIRIKRFRSDRGTEYGSHIFNEYYKELGIIHETTA 475
+ +++ + + K+ ++EI +F + + +D G + S + K LGI +
Sbjct: 152 AFPTKHETAKI-VTKKLLEEIFPRFGMP-QVLGTDNGPAFVSQVSQSVAKLLGIDWKLHC 209
Query: 476 PYSPEMNGKAERKNRTFTELVVATMLNSGAAPHWWGEILLTVCYVLNRVPKTKNKISPYE 535
Y P+ +G+ ER NRT E + L +G W +L Y P + ++PYE
Sbjct: 210 AYRPQSSGQVERMNRTIKETLTKLTLATGTRD--WVLLLPLALYRARNTP-GPHGLTPYE 266
Query: 536 ILKKRQPNLSYF 547
IL P L F
Sbjct: 267 ILYGAPPPLVNF 278
>POL_MLVFP (P26808) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1204
Score = 58.9 bits (141), Expect = 8e-08
Identities = 52/211 (24%), Positives = 89/211 (41%), Gaps = 16/211 (7%)
Query: 356 EKCQFCSQAKINKESHKSVTRITEPFELIHSDLCELDGNLTRNGKRYFITFIDDCSDYTH 415
E C+ C+Q +K + K TR+ H ++ + G +Y + F+D S +
Sbjct: 888 ETCKACAQVNASKSAVKQGTRVRGHRPGTHWEIDFTEVKPGLYGYKYLLVFVDTFSGWVE 947
Query: 416 VYLMRNKNEALDIFKQYVKEIENQFNIRIKRFRSDRGTEYGSHIFNEYYKELGIIHETTA 475
+ + K A + K+ ++EI +F + + +D G + S + LG+ +
Sbjct: 948 AFPTK-KETAKVVTKKLLEEIFPRFGMP-QVLGTDNGPAFVSKVSQTVADLLGVDWKLHC 1005
Query: 476 PYSPEMNGKAERKNRTFTELVVATMLNSGAAPHWWGEILLTVCYVLNRVPKTKNKISPYE 535
Y P+ +G+ ER NRT E + L +G+ W +L Y P + ++PYE
Sbjct: 1006 AYRPQSSGQVERMNRTIKETLTKLTLATGSRD--WVLLLPLALYRARNTP-GPHGLTPYE 1062
Query: 536 ILKKRQPNLSYFRTWGCLAYVRKPDPKRVKL 566
IL P L F PDP K+
Sbjct: 1063 ILYGAPPPLVNF-----------PDPDMAKV 1082
>POL_MLVF5 (P26810) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1204
Score = 58.9 bits (141), Expect = 8e-08
Identities = 52/211 (24%), Positives = 89/211 (41%), Gaps = 16/211 (7%)
Query: 356 EKCQFCSQAKINKESHKSVTRITEPFELIHSDLCELDGNLTRNGKRYFITFIDDCSDYTH 415
E C+ C+Q +K + K TR+ H ++ + G +Y + F+D S +
Sbjct: 888 ETCKACAQVNASKSAVKQGTRVRGHRPGTHWEIDFTEVKPGLYGYKYLLVFVDTFSGWVE 947
Query: 416 VYLMRNKNEALDIFKQYVKEIENQFNIRIKRFRSDRGTEYGSHIFNEYYKELGIIHETTA 475
+ + K A + K+ ++EI +F + + +D G + S + LG+ +
Sbjct: 948 AFPTK-KETAKVVTKKLLEEIFPRFGMP-QVLGTDNGPAFVSKVSQTVADLLGVDWKLHC 1005
Query: 476 PYSPEMNGKAERKNRTFTELVVATMLNSGAAPHWWGEILLTVCYVLNRVPKTKNKISPYE 535
Y P+ +G+ ER NRT E + L +G+ W +L Y P + ++PYE
Sbjct: 1006 AYRPQSSGQVERMNRTIKETLTKLTLATGSRD--WVLLLPLALYRARNTP-GPHGLTPYE 1062
Query: 536 ILKKRQPNLSYFRTWGCLAYVRKPDPKRVKL 566
IL P L F PDP K+
Sbjct: 1063 ILYGAPPPLVNF-----------PDPDMAKV 1082
>POL_FENV1 (P31792) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)] (Fragment)
Length = 1046
Score = 58.2 bits (139), Expect = 1e-07
Identities = 53/192 (27%), Positives = 82/192 (42%), Gaps = 13/192 (6%)
Query: 358 CQFCSQ--AKINKESHKSVTRITEPFELIHSDLCELDGNLTRNGKRYFITFIDDCSDYTH 415
C+ C Q A + TR P D E+ + G +Y + F+D S +
Sbjct: 737 CKVCQQVNAGATRVPEGKRTRGNRPGVYWEIDFTEVKPHYA--GYKYLLVFVDTFSGWVE 794
Query: 416 VYLMRNKNEALDIFKQYVKEIENQFNIRIKRFRSDRGTEYGSHIFNEYYKELGIIHETTA 475
Y R + + + K+ ++EI +F + K SD G + S + + LGI +
Sbjct: 795 AYPTRQETAHM-VAKKILEEIFPRFGLP-KVIGSDNGPAFVSQVSQGLARTLGINWKLHC 852
Query: 476 PYSPEMNGKAERKNRTFTELVVATMLNSGAAPHWWGEILLTVCYVLNRVPKTKNK--ISP 533
Y P+ +G+ ER NRT E + L +G W +L L R T N+ ++P
Sbjct: 853 AYRPQSSGQVERMNRTIKETLTKLTLETGLKD--WRRLL---SLALLRARNTPNRFGLTP 907
Query: 534 YEILKKRQPNLS 545
YEIL P LS
Sbjct: 908 YEILYGGPPPLS 919
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.318 0.134 0.400
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 143,125,711
Number of Sequences: 164201
Number of extensions: 6371422
Number of successful extensions: 17607
Number of sequences better than 10.0: 94
Number of HSP's better than 10.0 without gapping: 62
Number of HSP's successfully gapped in prelim test: 32
Number of HSP's that attempted gapping in prelim test: 17417
Number of HSP's gapped (non-prelim): 139
length of query: 1185
length of database: 59,974,054
effective HSP length: 121
effective length of query: 1064
effective length of database: 40,105,733
effective search space: 42672499912
effective search space used: 42672499912
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 72 (32.3 bits)
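
Note on the statistics above: the effective lengths, the bit values shown in parentheses, and the Expect values can be cross-checked with the standard Karlin-Altschul relations. The sketch below is a minimal illustration, not the BLAST implementation itself. It assumes the effective database length is the total letter count minus the effective HSP length for each sequence, that bits = (Lambda * raw - ln K) / ln 2, and that E = K * (effective search space) * exp(-Lambda * raw). The function and constant names are hypothetical. Because the report prints Lambda and K rounded to three significant digits, small deviations from the reported values are to be expected.

```python
import math

# Values copied from the statistics block of this report.
QUERY_LEN = 1185
DB_LETTERS = 59_974_054
DB_SEQS = 164_201
EFF_HSP_LEN = 121
UNGAPPED = (0.318, 0.134)    # (Lambda, K) as printed, ungapped
GAPPED = (0.267, 0.0410)     # (Lambda, K) as printed, gapped


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space).

    Assumption: the HSP length correction is subtracted once from the
    query and once per database sequence.
    """
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw, lam, k):
    """Convert a raw alignment score to bits (Karlin-Altschul)."""
    return (lam * raw - math.log(k)) / math.log(2)


def expect(raw, lam, k, space):
    """Expected number of chance hits scoring at least `raw`."""
    return k * space * math.exp(-lam * raw)


if __name__ == "__main__":
    eff_q, eff_db, space = effective_search_space(
        QUERY_LEN, DB_LETTERS, DB_SEQS, EFF_HSP_LEN
    )
    print("effective length of query:   ", eff_q)
    print("effective length of database:", eff_db)
    print("effective search space:      ", space)

    # S1 uses the ungapped parameters, S2 the gapped ones.
    print("S1 bits:", round(bit_score(41, *UNGAPPED), 1))
    print("S2 bits:", round(bit_score(72, *GAPPED), 1))

    # M820_ARATH hit: raw score 247, gapped parameters.
    print("M820_ARATH bits:  ", round(bit_score(247, *GAPPED), 1))
    print("M820_ARATH Expect: %.0e" % expect(247, *GAPPED, space))
```

With the printed parameters this reproduces the effective lengths and search space listed above exactly, and gives bit scores and an Expect value for the M820_ARATH hit (raw score 247) that agree with the report to the precision shown.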