
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC144760.12 + phase: 0 /pseudo
(1302 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
POLX_TOBAC (P10978) Retrovirus-related Pol polyprotein from tran... 1226 0.0
COPI_DROME (P04146) Copia protein (Gag-int-pol protein) [Contain... 350 2e-95
YCH4_YEAST (P25600) Transposon Ty5-1 34.5 kDa hypothetical protein 171 2e-41
YMU0_YEAST (Q04670) Transposon Ty1 protein B 143 3e-33
YJL3_YEAST (P47024) Transposon Ty4 207.7 kDa hypothetical protein 142 6e-33
YMD9_YEAST (Q03434) Transposon Ty1 protein B 139 5e-32
YJZ7_YEAST (P47098) Transposon Ty1 protein B 139 5e-32
YCB9_YEAST (P25384) Transposon Ty2 protein B (Ty1-17 protein B) 139 5e-32
YME4_YEAST (Q04711) Transposon Ty1 protein B 139 7e-32
YMT5_YEAST (Q04214) Transposon Ty1 protein B 138 1e-31
YJZ9_YEAST (P47100) Transposon Ty1 protein B 138 1e-31
M810_ARATH (P92519) Hypothetical mitochondrial protein AtMg00810... 137 2e-31
M300_ARATH (P93293) Hypothetical mitochondrial protein AtMg00300... 91 3e-17
M710_ARATH (P92512) Hypothetical mitochondrial protein AtMg00710... 84 3e-15
M820_ARATH (P92520) Hypothetical mitochondrial protein AtMg00820... 79 6e-14
M240_ARATH (P93290) Hypothetical mitochondrial protein AtMg00240... 51 2e-05
POL_HV1RH (P05959) Pol polyprotein [Contains: Protease (Retropep... 46 6e-04
POL_HV1BR (P03367) Pol polyprotein [Contains: Protease (Retropep... 46 6e-04
POL_HV1OY (P20892) Pol polyprotein [Contains: Protease (Retropep... 45 0.001
POL_HV1N5 (P12497) Pol polyprotein [Contains: Protease (Retropep... 45 0.001
>POLX_TOBAC (P10978) Retrovirus-related Pol polyprotein from
transposon TNT 1-94 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1328
Score = 1226 bits (3171), Expect = 0.0
Identities = 646/1329 (48%), Positives = 884/1329 (65%), Gaps = 51/1329 (3%)
Query: 7 KIEKFDGAD-FGFWKMQIEDYLYQKKLHQPLT--EKKPDSMKDDEWSLLDRQALGVVRLS 63
++ KF+G + F W+ ++ D L Q+ LH+ L KKPD+MK ++W+ LD +A +RL
Sbjct: 7 EVAKFNGDNGFSTWQRRMRDLLIQQGLHKVLDVDSKKPDTMKAEDWADLDERAASAIRLH 66
Query: 64 LSRNVAFNIAKEKTTAGLMKALSSMYEKPPSSNKVHLMRRLFTLRMAEGMSVAQHINELN 123
LS +V NI E T G+ L S+Y +NK++L ++L+ L M+EG + H+N N
Sbjct: 67 LSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYALHMSEGTNFLSHLNVFN 126
Query: 124 IVTTQLSSVGIEFDDEVRALILLSSLPDSWSAIVTAVSSSSGSKKMKFDDVRDLVLSEEI 183
+ TQL+++G++ ++E +A++LL+SLP S+ + T + + ++K D L+L+E++
Sbjct: 127 GLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTILHGKTTIELK-DVTSALLLNEKM 185
Query: 184 RRRELGESSSSSVLHTESRGRNSTRGNGRGKSKARRSKSKNHRSSHNSKSIECWNCGKTG 243
R++ + L TE RGR+ R + R KSKN S + C+NC + G
Sbjct: 186 RKKP---ENQGQALITEGRGRSYQRSSNNYGRSGARGKSKNRSKS---RVRNCYNCNQPG 239
Query: 244 HFKNQCRLPTKNQEE----KDEANVASTSGGGDALICSLESKEE---------SWVLDSG 290
HFK C P K + E K++ N A+ D ++ + +EE WV+D+
Sbjct: 240 HFKRDCPNPRKGKGETSGQKNDDNTAAMVQNNDNVVLFINEEEECMHLSGPESEWVVDTA 299
Query: 291 ASFHASSQKEFFKNYVPGNLGKVYLGNEQSCKVVGKGEVKIKLN-GSVWELKNVRHIPNL 349
AS HA+ ++ F YV G+ G V +GN K+ G G++ IK N G LK+VRH+P+L
Sbjct: 300 ASHHATPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLKDVRHVPDL 359
Query: 350 TKNLISVGQLADEGYTTVFHGDDWKISKGAMTIARGRKSGTLYKT-AGACH--LIAVATN 406
NLIS L +GY + F W+++KG++ IA+G GTLY+T A C L A
Sbjct: 360 RMNLISGIALDRDGYESYFANQKWRLTKGSLVIAKGVARGTLYRTNAEICQGELNAAQDE 419
Query: 407 ENPNLWHKRLGHMSEKGMKVMHSKGKLPSLRSIEIDICEDCILGKQKRVSFQTSGRTPKK 466
+ +LWHKR+GHMSEKG++++ K + + + C+ C+ GKQ RVSFQTS K
Sbjct: 420 ISVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQTSSER-KL 478
Query: 467 EKLELVHSDVWGPTTVPSIGGKHYFVTFIDDHSRKVWVYFLKHKSEVFEAFKRWKAMVEN 526
L+LV+SDV GP + S+GG YFVTFIDD SRK+WVY LK K +VF+ F+++ A+VE
Sbjct: 479 NILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFHALVER 538
Query: 527 ETDLKIKKLRTDNGGEYEDTKFKKFCYEHGIRMERTVPGTPQHNGVAERMNRTLTERARS 586
ET K+K+LR+DNGGEY +F+++C HGIR E+TVPGTPQHNGVAERMNRT+ E+ RS
Sbjct: 539 ETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVRS 598
Query: 587 LRVQSGLPKKFWAEAVNTSAYLINRGPSVPLEHKIPEEVWSGKEVKLSHLRVFGCVAYVH 646
+ + LPK FW EAV T+ YLINR PSVPL +IPE VW+ KEV SHL+VFGC A+ H
Sbjct: 599 MLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRAFAH 658
Query: 647 ISDQGRNKLDPKSKKCIFIGYGEDEFGYRLWDDENKKMVRSKDVIFNERVMYKDKHNTTT 706
+ + R KLD KS CIFIGYG++EFGYRLWD KK++RS+DV+F E + +
Sbjct: 659 VPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRESEVRTAADMSEK 718
Query: 707 NDSGLSEPVYVEMDDVPGSPTDKSPQSGELAESSIRQPSDTLV-------------HPTP 753
+G+ P +V + +PT + E++E QP + + HPT
Sbjct: 719 VKNGII-PNFVTIPSTSNNPTSAESTTDEVSEQG-EQPGEVIEQGEQLDEGVEEVEHPTQ 776
Query: 754 VPV----LRRSSRPHAPNRRY--IDYMLLTDGGEPEDYDEACQTTDASKWELAMKEEMKS 807
LRRS RP +RRY +Y+L++D EPE E + ++ AM+EEM+S
Sbjct: 777 GEEQHQPLRRSERPRVESRRYPSTEYVLISDDREPESLKEVLSHPEKNQLMKAMQEEMES 836
Query: 808 LISNQTWELAKLPIGKKALHNKWVYRVKEDHDGSK-RYKARLVVKGFRQKEGIDYTEIFA 866
L N T++L +LP GK+ L KWV+++K+D D RYKARLVVKGF QK+GID+ EIF+
Sbjct: 837 LQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGIDFDEIFS 896
Query: 867 PVVKLNTIRSVLSIVASENLYLEQLDVKTAFLHGDLVEEIYMHQPEGFLEEGKENMVCML 926
PVVK+ +IR++LS+ AS +L +EQLDVKTAFLHGDL EEIYM QPEGF GK++MVC L
Sbjct: 897 PVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGKKHMVCKL 956
Query: 927 KKSLYGLKQAPRQWYMKFESFMHKEGFQKCNADHCCFFKRY-KSSYIILLLYVDDMLVAG 985
KSLYGLKQAPRQWYMKF+SFM + + K +D C +FKR+ ++++IILLLYVDDML+ G
Sbjct: 957 NKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYVDDMLIVG 1016
Query: 986 SNIDEIKNLKIQLSKEFDMKDLGPAKKILGMQITRDKQKGVLQLS*AEYINRVLQRFNMG 1045
+ I LK LSK FDMKDLGPA++ILGM+I R++ L LS +YI RVL+RFNM
Sbjct: 1017 KDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERVLERFNMK 1076
Query: 1046 DAKLVSTPLASHFRLSQEQSPQTEEEKELMAKIPYASAIGSLMYAMVCTRPDIGHAVGVV 1105
+AK VSTPLA H +LS++ P T EEK MAK+PY+SA+GSLMYAMVCTRPDI HAVGVV
Sbjct: 1077 NAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRPDIAHAVGVV 1136
Query: 1106 SRFMSNPGKAHWEAVKWILRYLRGTTEKCLYFGKGEIKVEGYVDADFAGEVDHRRSTTGY 1165
SRF+ NPGK HWEAVKWILRYLRGTT CL FG + ++GY DAD AG++D+R+S+TGY
Sbjct: 1137 SRFLENPGKEHWEAVKWILRYLRGTTGDCLCFGGSDPILKGYTDADMAGDIDNRKSSTGY 1196
Query: 1166 IFTVGTRSVSWMSRIQKIVALSTTEVEYVAVTEASKELIWLQGLLTELGFMQEKSALYSD 1225
+FT ++SW S++QK VALSTTE EY+A TE KE+IWL+ L ELG Q++ +Y D
Sbjct: 1197 LFTFSGGAISWQSKLQKCVALSTTEAEYIAATETGKEMIWLKRFLQELGLHQKEYVVYCD 1256
Query: 1226 SQSAIHLAKNSAFHSRTKHIGLRYHFIRSLLEDEVLTLIKIQGSKNPADMLTKVVTIDKL 1285
SQSAI L+KNS +H+RTKHI +RYH+IR +++DE L ++KI ++NPADMLTKVV +K
Sbjct: 1257 SQSAIDLSKNSMYHARTKHIDVRYHWIREMVDDESLKVLKISTNENPADMLTKVVPRNKF 1316
Query: 1286 KLCSTLVGL 1294
+LC LVG+
Sbjct: 1317 ELCKELVGM 1325
>COPI_DROME (P04146) Copia protein (Gag-int-pol protein) [Contains:
Copia VLP protein; Copia protease (EC 3.4.23.-)]
Length = 1409
Score = 350 bits (897), Expect = 2e-95
Identities = 202/523 (38%), Positives = 301/523 (56%), Gaps = 19/523 (3%)
Query: 782 PEDYDEACQTTDASKWELAMKEEMKSLISNQTWELAKLPIGKKALHNKWVYRVKEDHDGS 841
P +DE D S WE A+ E+ + N TW + K P K + ++WV+ VK + G+
Sbjct: 891 PNSFDEIQYRDDKSSWEEAINTELNAHKINNTWTITKRPENKNIVDSRWVFSVKYNELGN 950
Query: 842 K-RYKARLVVKGFRQKEGIDYTEIFAPVVKLNTIRSVLSIVASENLYLEQLDVKTAFLHG 900
RYKARLV +GF QK IDY E FAPV ++++ R +LS+V NL + Q+DVKTAFL+G
Sbjct: 951 PIRYKARLVARGFTQKYQIDYEETFAPVARISSFRFILSLVIQYNLKVHQMDVKTAFLNG 1010
Query: 901 DLVEEIYMHQPEGFLEEGKENMVCMLKKSLYGLKQAPRQWYMKFESFMHKEGFQKCNADH 960
L EEIYM P+G + VC L K++YGLKQA R W+ FE + + F + D
Sbjct: 1011 TLKEEIYMRLPQGI--SCNSDNVCKLNKAIYGLKQAARCWFEVFEQALKECEFVNSSVDR 1068
Query: 961 CCFF--KRYKSSYIILLLYVDDMLVAGSNIDEIKNLKIQLSKEFDMKDLGPAKKILGMQI 1018
C + K + I +LLYVDD+++A ++ + N K L ++F M DL K +G++I
Sbjct: 1069 CIYILDKGNINENIYVLLYVDDVVIATGDMTRMNNFKRYLMEKFRMTDLNEIKHFIGIRI 1128
Query: 1019 TRDKQKGVLQLS*AEYINRVLQRFNMGDAKLVSTPLASHFRLSQEQSPQTEEEKELMAKI 1078
+ Q+ + LS + Y+ ++L +FNM + VSTPL S +++ E E+
Sbjct: 1129 --EMQEDKIYLSQSAYVKKILSKFNMENCNAVSTPLPS--KINYELLNSDED-----CNT 1179
Query: 1079 PYASAIGSLMYAMVCTRPDIGHAVGVVSRFMSNPGKAHWEAVKWILRYLRGTTEKCLYFG 1138
P S IG LMY M+CTRPD+ AV ++SR+ S W+ +K +LRYL+GT + L F
Sbjct: 1180 PCRSLIGCLMYIMLCTRPDLTTAVNILSRYSSKNNSELWQNLKRVLRYLKGTIDMKLIFK 1239
Query: 1139 KG---EIKVEGYVDADFAGEVDHRRSTTGYIFTV-GTRSVSWMSRIQKIVALSTTEVEYV 1194
K E K+ GYVD+D+AG R+STTGY+F + + W ++ Q VA S+TE EY+
Sbjct: 1240 KNLAFENKIIGYVDSDWAGSEIDRKSTTGYLFKMFDFNLICWNTKRQNSVAASSTEAEYM 1299
Query: 1195 AVTEASKELIWLQGLLTELGF-MQEKSALYSDSQSAIHLAKNSAFHSRTKHIGLRYHFIR 1253
A+ EA +E +WL+ LLT + ++ +Y D+Q I +A N + H R KHI ++YHF R
Sbjct: 1300 ALFEAVREALWLKFLLTSINIKLENPIKIYEDNQGCISIANNPSCHKRAKHIDIKYHFAR 1359
Query: 1254 SLLEDEVLTLIKIQGSKNPADMLTKVVTIDKLKLCSTLVGLLE 1296
+++ V+ L I AD+ TK + + +GLL+
Sbjct: 1360 EQVQNNVICLEYIPTENQLADIFTKPLPAARFVELRDKLGLLQ 1402
Score = 326 bits (835), Expect = 3e-88
Identities = 223/724 (30%), Positives = 371/724 (50%), Gaps = 37/724 (5%)
Query: 2 DEGEVKIEKFDGADFGFWKMQIEDYLYQKKLHQPLTEKKPDSMKDDEWSLLDRQALGVVR 61
D+ + I+ FDG + WK +I L ++ + + + P+ + DD W +R A +
Sbjct: 2 DKAKRNIKPFDGEKYAIWKFRIRALLAEQDVLKVVDGLMPNEV-DDSWKKAERCAKSTII 60
Query: 62 LSLSRNVAFNIAKEKTTAGLMKALSSMYEKPPSSNKVHLMRRLFTLRMAEGMSVAQHINE 121
LS + + T +++ L ++YE+ ++++ L +RL +L+++ MS+ H +
Sbjct: 61 EYLSDSFLNFATSDITARQILENLDAVYERKSLASQLALRKRLLSLKLSSEMSLLSHFHI 120
Query: 122 LNIVTTQLSSVGIEFDDEVRALILLSSLPDSWSAIVTAVSSSSGSKKMKFDDVRDLVLSE 181
+ + ++L + G + ++ + LL +LP + I+TA+ + S + + V++ +L +
Sbjct: 121 FDELISELLAAGAKIEEMDKISHLLITLPSCYDGIITAIETLS-EENLTLAFVKNRLLDQ 179
Query: 182 EIRRRELGESSSSSVLHTESRGRNSTRGNGRGKSKARRSKSKNHRSSHNSKSIECWNCGK 241
EI+ + +S V++ N+T N K++ +K K ++ ++C +CG+
Sbjct: 180 EIKIKNDHNDTSKKVMNAIVHNNNNTYKNNLFKNRV--TKPKKIFKGNSKYKVKCHHCGR 237
Query: 242 TGHFKNQCR-----LPTKNQEEKDEANVASTSGGGDALICSLESKEE----SWVLDSGAS 292
GH K C L KN+E + + A TS G ++ + + +VLDSGAS
Sbjct: 238 EGHIKKDCFHYKRILNNKNKENEKQVQTA-TSHGIAFMVKEVNNTSVMDNCGFVLDSGAS 296
Query: 293 FHASSQKEFFKNYV----PGNLGKVYLGNEQSCKVVGKGEVKIKLNGSVWELKNVRHIPN 348
H + + + + V P + G + +G V+++ + + L++V
Sbjct: 297 DHLINDESLYTDSVEVVPPLKIAVAKQG--EFIYATKRGIVRLRNDHEI-TLEDVLFCKE 353
Query: 349 LTKNLISVGQLADEGYTTVFHGDDWKISKGAMTIARGRKSGTLYKTAGA---CHLIAVAT 405
NL+SV +L + G + F ISK + + + SG L + I
Sbjct: 354 AAGNLMSVKRLQEAGMSIEFDKSGVTISKNGLMVVKN--SGMLNNVPVINFQAYSINAKH 411
Query: 406 NENPNLWHKRLGHMSEKGM-----KVMHSKGKLPSLRSIEIDICEDCILGKQKRVSF-QT 459
N LWH+R GH+S+ + K M S L + + +ICE C+ GKQ R+ F Q
Sbjct: 412 KNNFRLWHERFGHISDGKLLEIKRKNMFSDQSLLNNLELSCEICEPCLNGKQARLPFKQL 471
Query: 460 SGRTPKKEKLELVHSDVWGPTTVPSIGGKHYFVTFIDDHSRKVWVYFLKHKSEVFEAFKR 519
+T K L +VHSDV GP T ++ K+YFV F+D + Y +K+KS+VF F+
Sbjct: 472 KDKTHIKRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYCVTYLIKYKSDVFSMFQD 531
Query: 520 WKAMVENETDLKIKKLRTDNGGEYEDTKFKKFCYEHGIRMERTVPGTPQHNGVAERMNRT 579
+ A E +LK+ L DNG EY + ++FC + GI TVP TPQ NGV+ERM RT
Sbjct: 532 FVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLTVPHTPQLNGVSERMIRT 591
Query: 580 LTERARSLRVQSGLPKKFWAEAVNTSAYLINRGPSVPL--EHKIPEEVWSGKEVKLSHLR 637
+TE+AR++ + L K FW EAV T+ YLINR PS L K P E+W K+ L HLR
Sbjct: 592 ITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKTPYEMWHNKKPYLKHLR 651
Query: 638 VFGCVAYVHISDQGRNKLDPKSKKCIFIGYGEDEFGYRLWDDENKKMVRSKDVIFNERVM 697
VFG YVHI ++ + K D KS K IF+GY + G++LWD N+K + ++DV+ +E M
Sbjct: 652 VFGATVYVHIKNK-QGKFDDKSFKSIFVGY--EPNGFKLWDAVNEKFIVARDVVVDETNM 708
Query: 698 YKDK 701
+
Sbjct: 709 VNSR 712
>YCH4_YEAST (P25600) Transposon Ty5-1 34.5 kDa hypothetical protein
Length = 308
Score = 171 bits (432), Expect = 2e-41
Identities = 103/315 (32%), Positives = 160/315 (50%), Gaps = 9/315 (2%)
Query: 891 LDVKTAFLHGDLVEEIYMHQPEGFLEEGKENMVCMLKKSLYGLKQAPRQWYMKFESFMHK 950
+DV TAFL+ + E IY+ QP GF+ E + V L +YGLKQAP W + + K
Sbjct: 1 MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60
Query: 951 EGFQKCNADHCCFFKRYKSSYIILLLYVDDMLVAGSNIDEIKNLKIQLSKEFDMKDLGPA 1010
GF + +H +F+ I + +YVDD+LVA + +K +L+K + MKDLG
Sbjct: 61 IGFCRHEGEHGLYFRSTSDGPIYIGVYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGKV 120
Query: 1011 KKILGMQITRDKQKGVLQLS*AEYINRVLQRFNMGDAKLVSTPLASHFRLSQEQSPQTEE 1070
K LG+ I G + LS +YI + + KL TPL + L + SP ++
Sbjct: 121 DKFLGLNI-HQSTNGDITLSLQDYIAKAASESEINTFKLTQTPLCNSKPLFETTSPHLKD 179
Query: 1071 EKELMAKIPYASAIGSLMYAMVCTRPDIGHAVGVVSRFMSNPGKAHWEAVKWILRYLRGT 1130
PY S +G L++ RPDI + V ++SRF+ P H E+ + +LRYL T
Sbjct: 180 ------ITPYQSIVGQLLFCANTGRPDISYPVSLLSRFLREPRAIHLESARRVLRYLYTT 233
Query: 1131 TEKCLYFGKG-EIKVEGYVDADFAGEVDHRRSTTGYIFTVGTRSVSWMS-RIQKIVALST 1188
CL + G ++ + Y DA D ST GY+ + V+W S +++ ++ + +
Sbjct: 234 RSMCLKYRSGSQVALTVYCDASHGAIHDLPHSTGGYVTLLAGAPVTWSSKKLKGVIPVPS 293
Query: 1189 TEVEYVAVTEASKEL 1203
TE EY+ +E E+
Sbjct: 294 TEAEYITASETVMEI 308
>YMU0_YEAST (Q04670) Transposon Ty1 protein B
Length = 1328
Score = 143 bits (360), Expect = 3e-33
Identities = 138/534 (25%), Positives = 246/534 (45%), Gaps = 50/534 (9%)
Query: 785 YDEAC----QTTDASKWELAMKEEMKSLISNQTWEL-----AKLPIGKKALHNKWVYRVK 835
YDEA + K+ A +E+ L+ +TW+ K K+ +++ +++ K
Sbjct: 807 YDEAITYNKDIKEKEKYIQAYHKEVNQLLKMKTWDTDRYYDRKEIDPKRVINSMFIFNRK 866
Query: 836 EDHDGSKRYKARLVVKGFRQKEGIDYTEIFAPVVKLNTIR-----SVLSIVASENLYLEQ 890
D +KAR V +G I + + + P ++ NT+ + LS+ N Y+ Q
Sbjct: 867 RDGT----HKARFVARG-----DIQHPDTYDPGMQSNTVHHYALMTSLSLALDNNYYITQ 917
Query: 891 LDVKTAFLHGDLVEEIYMHQPEGFLEEGKENMVCMLKKSLYGLKQAPRQWYMKFESFMHK 950
LD+ +A+L+ D+ EE+Y+ P G + + LKKSLYGLKQ+ WY +S++ K
Sbjct: 918 LDISSAYLYADIKEELYIRPPPHL---GMNDKLIRLKKSLYGLKQSGANWYETIKSYLIK 974
Query: 951 E-GFQKCNADHCCFFKRYKSSYIILLLYVDDMLVAGSNIDEIKNLKIQLSKEFDMK--DL 1007
+ G ++ C F K+S + + L+VDDM++ +++ K + L K++D K +L
Sbjct: 975 QCGMEEVRGWSCVF----KNSQVTICLFVDDMILFSKDLNANKKIITTLKKQYDTKIINL 1030
Query: 1008 GPAKK-----ILGMQITRDKQKGV---LQLS*AEYINRVLQRFNMGDAKLVSTPLASHFR 1059
G + ILG++I + K + ++ S E I ++ N KL S P
Sbjct: 1031 GESDNEIQYDILGLEIKYQRGKYMKLGMENSLTEKIPKLNVPLNPKGRKL-SAPGQPGLY 1089
Query: 1060 LSQEQSPQTEEEKELMAKIPYASAIGSLMYAMVCTRPDIGHAVGVVSRFMSNPGKAHWEA 1119
+ Q Q + EE+ M IG Y R D+ + + +++ + P K +
Sbjct: 1090 IDQ-QELELEEDDYKMKVHEMQKLIGLASYVGYKFRFDLLYYINTLAQHILFPSKQVLDM 1148
Query: 1120 VKWILRYLRGTTEKCLYFGKGE-----IKVEGYVDADFAGEVDHRRSTTGYIFTVGTRSV 1174
+++++ T +K L + K + K+ DA + G + +S G I+ + + +
Sbjct: 1149 TYELIQFIWNTRDKQLIWHKSKPVKPTNKLVVISDASY-GNQPYYKSQIGNIYLLNGKVI 1207
Query: 1175 SWMSRIQKIVALSTTEVEYVAVTEASKELIWLQGLLTELGFMQEKSALYSDSQSAIH-LA 1233
S + STTE E A++E+ L L L+ EL L +DS+S I +
Sbjct: 1208 GGKSTKASLTCTSTTEAEIHAISESVPLLNNLSHLVQELNKKPITKGLLTDSKSTISIII 1267
Query: 1234 KNSAFHSRTKHIGLRYHFIRSLLEDEVLTLIKIQGSKNPADMLTKVVTIDKLKL 1287
N+ R + G + +R + L + I+ KN AD++TK + I KL
Sbjct: 1268 SNNEEKFRNRFFGTKAMRLRDEVSGNHLHVCYIETKKNIADVMTKPLPIKTFKL 1321
Score = 99.8 bits (247), Expect = 4e-20
Identities = 110/483 (22%), Positives = 194/483 (39%), Gaps = 62/483 (12%)
Query: 343 VRHIPNLTKNLISVGQLADEGYTTVFHGDDWKISKGAMTIARGRKSGTLYKTAGACHL-- 400
V H PN+ +L+S+ +LA T F + + S G + +A K G Y + L
Sbjct: 89 VLHTPNIAYDLLSLNELAAVDITACFTKNVLERSDGTV-LAPIVKYGDFYWVSKKYLLPS 147
Query: 401 -IAVATNENPN-----------LWHKRLGHMSEKGMKVMHSKGKLPSLRSIEIDI----- 443
I+V T N + H+ L H + + ++ + ++D
Sbjct: 148 NISVPTINNVHTSESTRKYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDWSSAID 207
Query: 444 --CEDCILGKQ--------KRVSFQTSGRTPKKEKLELVHSDVWGPTTVPSIGGKHYFVT 493
C DC++GK R+ +Q S E + +H+D++GP YF++
Sbjct: 208 YQCPDCLIGKSTKHRHIKGSRLKYQNS-----YEPFQYLHTDIFGPVHNLPKSAPSYFIS 262
Query: 494 FIDDHSRKVWVYFLKHKSE--VFEAFKRWKAMVENETDLKIKKLRTDNGGEYEDTKFKKF 551
F D+ ++ WVY L + E + + F A ++N+ + ++ D G EY + KF
Sbjct: 263 FTDETTKFRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKF 322
Query: 552 CYEHGIRMERTVPGTPQHNGVAERMNRTLTERARSLRVQSGLPKKFWAEAVNTSAYLINR 611
++GI T + +GVAER+NRTL + R+ SGLP W A+ S ++
Sbjct: 323 LEKNGITPCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFST-IVRN 381
Query: 612 GPSVPLEHKIPEEVWSGKEVKLSHLRVFGCVAYVHISDQGRN-KLDPKSKKCIFIGYGED 670
+ P K + + +S L FG V ++D N K+ P+ + +
Sbjct: 382 SLASPKSKKSARQHAGLAGLDISTLLPFG--QPVIVNDHNPNSKIHPRGIPGYALHPSRN 439
Query: 671 EFGYRLWDDENKKMVRSKD-VIFNERVMYKDKHN--------------------TTTNDS 709
+GY ++ KK V + + VI + D+ N +N+
Sbjct: 440 SYGYIIYLPSLKKTVDTTNYVILQGKESRLDQFNYDALTFDEDLNRLTASYQSFIASNEI 499
Query: 710 GLSEPVYVEMDDVPGSPTDKSPQSGELAESSIRQPSDTLVHPTPVPVLRRSSRPHAPNRR 769
S+ + +E D S + P+ S P+D+ T +R S+ + R
Sbjct: 500 QQSDDLNIESDHDFQSDIELHPEQPRNVLSKAVSPTDSTPPSTHTEDSKRVSKTNIRAPR 559
Query: 770 YID 772
+D
Sbjct: 560 EVD 562
>YJL3_YEAST (P47024) Transposon Ty4 207.7 kDa hypothetical protein
Length = 1803
Score = 142 bits (358), Expect = 6e-33
Identities = 120/456 (26%), Positives = 211/456 (45%), Gaps = 23/456 (5%)
Query: 844 YKARLVVKGFRQKEGIDYTEIFAPVVKLNTIRSVLSIVASENLYLEQLDVKTAFLHGDLV 903
YKAR+V +G Q Y+ I + N I+ L I + N++++ LD+ AFL+ L
Sbjct: 1337 YKARIVCRGDTQSPDT-YSVITTESLNHNHIKIFLMIANNRNMFMKTLDINHAFLYAKLE 1395
Query: 904 EEIYMHQPEGFLEEGKENMVCMLKKSLYGLKQAPRQWYMKFESFMHKEGFQKCNADHCCF 963
EEIY+ P V L K+LYGLKQ+P++W +++ G + + +
Sbjct: 1396 EEIYIPHPH------DRRCVVKLNKALYGLKQSPKEWNDHLRQYLNGIGLK--DNSYTPG 1447
Query: 964 FKRYKSSYIILLLYVDDMLVAGSNIDEIKNLKIQLSKEFDMKDLGPA------KKILGMQ 1017
+ + +++ +YVDD ++A SN + +L F++K G ILGM
Sbjct: 1448 LYQTEDKNLMIAVYVDDCVIAASNEQRLDEFINKLKSNFELKITGTLIDDVLDTDILGMD 1507
Query: 1018 ITRDKQKGVLQLS*AEYINRVLQRFN--MGDAKLVSTPLASHFRLSQEQSP-QTEEEKEL 1074
+ +K+ G + L+ +INR+ +++N + + S P S +++ ++ Q EE+
Sbjct: 1508 LVYNKRLGTIDLTLKSFINRMDKKYNEELKKIRKSSIPHMSTYKIDPKKDVLQMSEEEFR 1567
Query: 1075 MAKIPYASAIGSLMYAMVCTRPDIGHAVGVVSRFMSNPGKAHWEAVKWILRYLRGTTEKC 1134
+ +G L Y R DI AV V+R ++ P + + + I++YL +
Sbjct: 1568 QGVLKLQQLLGELNYVRHKCRYDIEFAVKKVARLVNYPHERVFYMIYKIIQYLVRYKDIG 1627
Query: 1135 LYFGKG---EIKVEGYVDADFAGEVDHRRSTTGYIFTVGTRSVSWMSRIQKIVALSTTEV 1191
+++ + + KV DA E D +S G I G + S +S+TE
Sbjct: 1628 IHYDRDCNKDKKVIAITDASVGSEYD-AQSRIGVILWYGMNIFNVYSNKSTNRCVSSTEA 1686
Query: 1192 EYVAVTEASKELIWLQGLLTELGFMQEKS-ALYSDSQSAIHLAKNSAFHSRTKHIGLRYH 1250
E A+ E + L+ L ELG + +DS+ AI S + K ++
Sbjct: 1687 ELHAIYEGYADSETLKVTLKELGEGDNNDIVMITDSKPAIQGLNRSYQQPKEKFTWIKTE 1746
Query: 1251 FIRSLLEDEVLTLIKIQGSKNPADMLTKVVTIDKLK 1286
I+ ++++ + L+KI G N AD+LTK V+ K
Sbjct: 1747 IIKEKIKEKSIKLLKITGKGNIADLLTKPVSASDFK 1782
Score = 86.7 bits (213), Expect = 4e-16
Identities = 101/458 (22%), Positives = 181/458 (39%), Gaps = 48/458 (10%)
Query: 286 VLDSGASFHASSQKEFFKNYVPGNLGKVY--LGNEQSCKVVGKGEVKIKLNGSVWELKNV 343
++D+G+ + ++ K NY N + +G S V G G +KIK + + K +
Sbjct: 413 IIDTGSGVNITNDKTLLHNYEDSNRSTRFFGIGKNSSVSVKGYGYIKIKNGHNNTDNKCL 472
Query: 344 R--HIPNLTKNLISVGQLADEG-------YTTVFHGDDWKISKGAMTIARG----RKSGT 390
++P +IS LA + YT + + KI K I G + +
Sbjct: 473 LTYYVPEEESTIISCYDLAKKTKMVLSRKYTRLGN----KIIKIKTKIVNGVIHVKMNEL 528
Query: 391 LYKTAGACHLIAVATNENPNLW-----------HKRLGHMS----EKGMKVMHSKGKLPS 435
+ + + + A+ +P HKR+GH E +K H + L
Sbjct: 529 IERPSDDSKINAIKPTSSPGFKLNKRSITLEDAHKRMGHTGIQQIENSIKHNHYEESLDL 588
Query: 436 LRSIEIDICEDCILGKQKRVSFQTSGRTPKKEKLELVHS---DVWGPTTVPSIGGKHYFV 492
++ C+ C + K + + T E S D++GP + + K Y +
Sbjct: 589 IKEPNEFWCQTCKISKATKRNHYTGSMNNHSTDHEPGSSWCMDIFGPVSSSNADTKRYML 648
Query: 493 TFIDDHSRKVWV--YFLKHKSEVFEAFKRWKAMVENETDLKIKKLRTDNGGEYEDTKFKK 550
+D+++R +F K+ + ++ VE + D K++++ +D G E+ + + ++
Sbjct: 649 IMVDNNTRYCMTSTHFNKNAETILAQVRKNIQYVETQFDRKVREINSDRGTEFTNDQIEE 708
Query: 551 FCYEHGIRMERTVPGTPQHNGVAERMNRTLTERARSLRVQSGLPKKFWAEAVNTSAYLIN 610
+ GI T NG AER RT+ A +L QS L KFW AV ++ + N
Sbjct: 709 YFISKGIHHILTSTQDHAANGRAERYIRTIITDATTLLRQSNLRVKFWEYAVTSATNIRN 768
Query: 611 RGPSVPLEH----KIPEEVWSGKEVKLSHLRVFGCVAYVHISDQGRNKLDPKSKKCIFIG 666
LEH K+P + S + V + + I + KL P I +
Sbjct: 769 Y-----LEHKSTGKLPLKAISRQPVTVRLMSFLPFGEKGIIWNHNHKKLKPSGLPSIILC 823
Query: 667 YGEDEFGYRLWDDENKKMVRSKDVIFNERVMYKDKHNT 704
+ +GY+ + K+V S + M NT
Sbjct: 824 KDPNSYGYKFFIPSKNKIVTSDNYTIPNYTMDGRVRNT 861
>YMD9_YEAST (Q03434) Transposon Ty1 protein B
Length = 1328
Score = 139 bits (350), Expect = 5e-32
Identities = 135/530 (25%), Positives = 247/530 (46%), Gaps = 42/530 (7%)
Query: 785 YDEAC----QTTDASKWELAMKEEMKSLISNQTWEL-----AKLPIGKKALHNKWVYRVK 835
YDEA + K+ A +E+ L+ +TW+ K K+ +++ +++ K
Sbjct: 807 YDEAITYNKDIKEKEKYIEAYHKEVNQLLKMKTWDTDEYYDRKEIDPKRVINSMFIFNKK 866
Query: 836 EDHDGSKRYKARLVVKGFRQKEGIDYTEIFAPVVKLNTIRSVLSIVASENLYLEQLDVKT 895
D +KAR V +G Q + + + V + + LS+ N Y+ QLD+ +
Sbjct: 867 RDGT----HKARFVARGDIQHPDTYDSGMQSNTVHHYALMTSLSLALDNNYYITQLDISS 922
Query: 896 AFLHGDLVEEIYMHQPEGFLEEGKENMVCMLKKSLYGLKQAPRQWYMKFESFMHKE-GFQ 954
A+L+ D+ EE+Y+ P G + + LKKSLYGLKQ+ WY +S++ K+ G +
Sbjct: 923 AYLYADIKEELYIRPPPHL---GMNDKLIRLKKSLYGLKQSGANWYETIKSYLIKQCGME 979
Query: 955 KCNADHCCFFKRYKSSYIILLLYVDDMLVAGSNIDEIKNLKIQLSKEFDMK--DLGPAKK 1012
+ C F K+S + + L+VDDM++ +++ K + L K++D K +LG +
Sbjct: 980 EVRGWSCVF----KNSQVTICLFVDDMILFSKDLNANKKIITTLKKQYDTKIINLGESDN 1035
Query: 1013 -----ILGMQITRDKQKGV---LQLS*AEYINRVLQRFNMGDAKLVSTPLASHFRLSQEQ 1064
ILG++I + K + ++ S E I ++ N KL S P + Q++
Sbjct: 1036 EIQYDILGLEIKYQRGKYMKLGMENSLTEKIPKLNVPLNPKGRKL-SAPGQPGLYIDQDE 1094
Query: 1065 SPQTEEE-KELMAKIPYASAIGSLMYAMVCTRPDIGHAVGVVSRFMSNPGKAHWEAVKWI 1123
E+E KE + ++ IG Y R D+ + + +++ + P + + +
Sbjct: 1095 LEIDEDEYKEKVHEM--QKLIGLASYVGYKFRFDLLYYINTLAQHILFPSRQVLDMTYEL 1152
Query: 1124 LRYLRGTTEKCLYFGKG-----EIKVEGYVDADFAGEVDHRRSTTGYIFTVGTRSVSWMS 1178
++++ T +K L + K + K+ DA + G + +S G I+ + + + S
Sbjct: 1153 IQFMWDTRDKQLIWHKNKPTEPDNKLVAISDASY-GNQPYYKSQIGNIYLLNGKVIGGKS 1211
Query: 1179 RIQKIVALSTTEVEYVAVTEASKELIWLQGLLTELGFMQEKSALYSDSQSAIHLAKNSAF 1238
+ STTE E A++E+ L L L+ EL L +DS+S I + K++
Sbjct: 1212 TKASLTCTSTTEAEIHAISESVPLLNNLSYLIQELNKKPIIKGLLTDSRSTISIIKSTNE 1271
Query: 1239 HS-RTKHIGLRYHFIRSLLEDEVLTLIKIQGSKNPADMLTKVVTIDKLKL 1287
R + G + +R + L + I+ KN AD++TK + I KL
Sbjct: 1272 EKFRNRFFGTKAMRLRDEVSGNNLYVYYIETKKNIADVMTKPLPIKTFKL 1321
Score = 100 bits (249), Expect = 3e-20
Identities = 125/569 (21%), Positives = 222/569 (38%), Gaps = 69/569 (12%)
Query: 257 EEKDEANVASTSGGGDALICSLESKEESWVLDSGASFHASSQKEFFKNYVPGNLGKVYLG 316
+E E+ V T+ D L L +LDSGAS + V
Sbjct: 10 QELTESTVNHTNHSDDELPGHL-------LLDSGASRTLIRSAHHIHSASSNPDINVVDA 62
Query: 317 NEQSCKVVGKGEVKIKLNGSVWELKNVRHIPNLTKNLISVGQLADEGYTTVFHGDDWKIS 376
+++ + G+++ + V H PN+ +L+S+ +LA T F + + S
Sbjct: 63 QKRNIPINAIGDLQFHFQDNTKTSIKVLHTPNIAYDLLSLNELAAVDITACFTKNVLERS 122
Query: 377 KGAMTIARGRKSGTLYKTAGACHL---IAVATNENPN-----------LWHKRLGHMSEK 422
G + +A K G Y + L I+V T N + H+ L H + +
Sbjct: 123 DGTV-LAPIVKYGDFYWVSKKYLLPSNISVPTINNVHTSESTRKYPYPFIHRMLAHANAQ 181
Query: 423 GMKVMHSKGKLPSLRSIEIDI-------CEDCILGKQ--------KRVSFQTSGRTPKKE 467
++ + ++D C DC++GK R+ +Q S E
Sbjct: 182 TIRYSLKNNTITYFNESDVDRSSAIDYQCPDCLIGKSTKHRHIKGSRLKYQNS-----YE 236
Query: 468 KLELVHSDVWGPTTVPSIGGKHYFVTFIDDHSRKVWVYFLKHKSE--VFEAFKRWKAMVE 525
+ +H+D++GP YF++F D+ ++ WVY L + E + + F A ++
Sbjct: 237 PFQYLHTDIFGPVHNLPKSAPSYFISFTDETTKFRWVYPLHDRREDSILDVFTTILAFIK 296
Query: 526 NETDLKIKKLRTDNGGEYEDTKFKKFCYEHGIRMERTVPGTPQHNGVAERMNRTLTERAR 585
N+ + ++ D G EY + KF ++GI T + +GVAER+NRTL + R
Sbjct: 297 NQFQASVLVIQMDRGSEYTNRTLHKFLEKNGITPCYTTTADSRAHGVAERLNRTLLDDCR 356
Query: 586 SLRVQSGLPKKFWAEAVNTSAYLINRGPSVPLEHKIPEEVWSGKEVKLSHLRVFGCVAYV 645
+ SGLP W A+ S ++ + P K + + +S L FG V
Sbjct: 357 TQLQCSGLPNHLWFSAIEFST-IVRNSLASPKSKKSARQHAGLAGLDISTLLPFG--QPV 413
Query: 646 HISDQGRN-KLDPKSKKCIFIGYGEDEFGYRLWDDENKKMVRSKD-VIFNERVMYKDKHN 703
++D N K+ P+ + + +GY ++ KK V + + VI + D+ N
Sbjct: 414 IVNDHNPNSKIHPRGIPGYALHPSRNSYGYIIYLPSLKKTVDTTNYVILQGKESRLDQFN 473
Query: 704 --------------------TTTNDSGLSEPVYVEMDDVPGSPTDKSPQSGELAESSIRQ 743
+N+ S+ + +E D S + P+ S
Sbjct: 474 YDALTFDEDLNRLTASYQSFIASNEIQQSDDLNIESDHDFQSDIELHPEQPRNVLSKAVS 533
Query: 744 PSDTLVHPTPVPVLRRSSRPHAPNRRYID 772
P+D+ T +R S+ + R +D
Sbjct: 534 PTDSTPPSTHTEDSKRVSKTNIRAPREVD 562
>YJZ7_YEAST (P47098) Transposon Ty1 protein B
Length = 1755
Score = 139 bits (350), Expect = 5e-32
Identities = 136/530 (25%), Positives = 246/530 (45%), Gaps = 42/530 (7%)
Query: 785 YDEAC----QTTDASKWELAMKEEMKSLISNQTWEL-----AKLPIGKKALHNKWVYRVK 835
YDEA + K+ A +E+ L+ +TW+ K K+ +++ +++ K
Sbjct: 1234 YDEAITYNKDIKEKEKYIEAYHKEVNQLLKMKTWDTDEYYDRKEIDPKRVINSMFIFNKK 1293
Query: 836 EDHDGSKRYKARLVVKGFRQKEGIDYTEIFAPVVKLNTIRSVLSIVASENLYLEQLDVKT 895
D +KAR V +G Q T + + V + + LS+ N Y+ QLD+ +
Sbjct: 1294 RDGT----HKARFVARGDIQHPDTYDTGMQSNTVHHYALMTSLSLALDNNYYITQLDISS 1349
Query: 896 AFLHGDLVEEIYMHQPEGFLEEGKENMVCMLKKSLYGLKQAPRQWYMKFESFMHKE-GFQ 954
A+L+ D+ EE+Y+ P G + + LKKS YGLKQ+ WY +S++ K+ G +
Sbjct: 1350 AYLYADIKEELYIRPPPHL---GMNDKLIRLKKSHYGLKQSGANWYETIKSYLIKQCGME 1406
Query: 955 KCNADHCCFFKRYKSSYIILLLYVDDMLVAGSNIDEIKNLKIQLSKEFDMK--DLGPAKK 1012
+ C F K+S + + L+VDDM++ +++ K + L K++D K +LG +
Sbjct: 1407 EVRGWSCVF----KNSQVTICLFVDDMILFSKDLNANKKIITTLKKQYDTKIINLGESDN 1462
Query: 1013 -----ILGMQITRDKQKGV---LQLS*AEYINRVLQRFNMGDAKLVSTPLASHFRLSQEQ 1064
ILG++I + K + ++ S E I ++ N KL S P + Q++
Sbjct: 1463 EIQYDILGLEIKYQRGKYMKLGMENSLTEKIPKLNVPLNPKGRKL-SAPGQPGLYIDQDE 1521
Query: 1065 SPQTEEE-KELMAKIPYASAIGSLMYAMVCTRPDIGHAVGVVSRFMSNPGKAHWEAVKWI 1123
E+E KE + ++ IG Y R D+ + + +++ + P + + +
Sbjct: 1522 LEIDEDEYKEKVHEM--QKLIGLASYVGYKFRFDLLYYINTLAQHILFPSRQVLDMTYEL 1579
Query: 1124 LRYLRGTTEKCLYFGKG-----EIKVEGYVDADFAGEVDHRRSTTGYIFTVGTRSVSWMS 1178
++++ T +K L + K + K+ DA + G + +S G IF + + + S
Sbjct: 1580 IQFMWDTRDKQLIWHKNKPTEPDNKLVAISDASY-GNQPYYKSQIGNIFLLNGKVIGGKS 1638
Query: 1179 RIQKIVALSTTEVEYVAVTEASKELIWLQGLLTELGFMQEKSALYSDSQSAIHLAKNSAF 1238
+ STTE E A++E+ L L L+ EL L +DS+S I + K++
Sbjct: 1639 TKASLTCTSTTEAEIHAISESVPLLNNLSYLIQELNKKPIIKGLLTDSRSTISIIKSTNE 1698
Query: 1239 HS-RTKHIGLRYHFIRSLLEDEVLTLIKIQGSKNPADMLTKVVTIDKLKL 1287
R + G + +R + L + I+ KN AD++TK + I KL
Sbjct: 1699 EKFRNRFFGTKAMRLRDEVSGNNLYVYYIETKKNIADVMTKPLPIKTFKL 1748
Score = 105 bits (263), Expect = 6e-22
Identities = 124/548 (22%), Positives = 225/548 (40%), Gaps = 62/548 (11%)
Query: 185 RRELGESSSSSVLHTESR-----GRNSTRGNGRGKSKARRSKSKNHRSSHNSKSIECWNC 239
RR L + + S +T + RN + N SK++ +++ N +S+NS S + +
Sbjct: 361 RRNLSDEKNDSRSYTNTTKPKVIARNPQKTNN---SKSKTARAHNVSTSNNSPSTDNDSI 417
Query: 240 GKTGHFKNQCRLPTKNQ----EEKDEANVASTSGGGDALICSLESKEESWVLDSGASFHA 295
K+ +L K+ +E E+ V T+ D L L +LDSGAS
Sbjct: 418 SKST--TEPIQLNNKHDLTLGQELTESTVNHTNHSDDELPGHL-------LLDSGASRTL 468
Query: 296 SSQKEFFKNYVPGNLGKVYLGNEQSCKVVGKGEVKIKLNGSVWELKNVRHIPNLTKNLIS 355
+ V +++ + G+++ + V H PN+ +L+S
Sbjct: 469 IRSAHHIHSASSNPDINVVDAQKRNIPINAIGDLQFHFQDNTKTSIKVLHTPNIAYDLLS 528
Query: 356 VGQLADEGYTTVFHGDDWKISKGAMTIARGRKSGTLYKTAGACHL---IAVATNENPN-- 410
+ +LA T F + + S G + +A + G Y + L I+V T N +
Sbjct: 529 LNELAAVDITACFTKNVLERSDGTV-LAPIVQYGDFYWVSKRYLLPSNISVPTINNVHTS 587
Query: 411 ---------LWHKRLGHMSEKGMKVMHSKGKLPSLRSIEIDI-------CEDCILGKQ-- 452
H+ L H + + ++ + ++D C DC++GK
Sbjct: 588 ESTRKYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDWSSAIDYQCPDCLIGKSTK 647
Query: 453 ------KRVSFQTSGRTPKKEKLELVHSDVWGPTTVPSIGGKHYFVTFIDDHSRKVWVYF 506
R+ +Q S E + +H+D++GP YF++F D+ ++ WVY
Sbjct: 648 HRHIKGSRLKYQNS-----YEPFQYLHTDIFGPVHNLPKSAPSYFISFTDETTKFRWVYP 702
Query: 507 LKHKSE--VFEAFKRWKAMVENETDLKIKKLRTDNGGEYEDTKFKKFCYEHGIRMERTVP 564
L + E + + F A ++N+ + ++ D G EY + KF ++GI T
Sbjct: 703 LHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFLEKNGITPCYTTT 762
Query: 565 GTPQHNGVAERMNRTLTERARSLRVQSGLPKKFWAEAVNTSAYLINRGPSVPLEHKIPEE 624
+ +GVAER+NRTL + R+ SGLP W A+ S ++ + P K +
Sbjct: 763 ADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFST-IVRNSLASPKSKKSARQ 821
Query: 625 VWSGKEVKLSHLRVFGCVAYVHISDQGRN-KLDPKSKKCIFIGYGEDEFGYRLWDDENKK 683
+ +S L FG V ++D N K+ P+ + + +GY ++ KK
Sbjct: 822 HAGLAGLDISTLLPFG--QPVIVNDHNPNSKIHPRGIPGYALHPSRNSYGYIIYLPSLKK 879
Query: 684 MVRSKDVI 691
V + + +
Sbjct: 880 TVDTTNYV 887
>YCB9_YEAST (P25384) Transposon Ty2 protein B (Ty1-17 protein B)
Length = 1770
Score = 139 bits (350), Expect = 5e-32
Identities = 133/529 (25%), Positives = 242/529 (45%), Gaps = 40/529 (7%)
Query: 785 YDEAC----QTTDASKWELAMKEEMKSLISNQTWELAKLPIG-----KKALHNKWVYRVK 835
YDEA + ++ A +E+ L+ TW+ K KK +++ +++ K
Sbjct: 1249 YDEAITYNKDNKEKDRYVEAYHKEISQLLKMNTWDTNKYYDRNDIDPKKVINSMFIFNKK 1308
Query: 836 EDHDGSKRYKARLVVKGFRQKEGIDYTEIFAPVVKLNTIRSVLSIVASENLYLEQLDVKT 895
D +KAR V +G Q +++ + V + + LSI + Y+ QLD+ +
Sbjct: 1309 RDGT----HKARFVARGDIQHPDTYDSDMQSNTVHHYALMTSLSIALDNDYYITQLDISS 1364
Query: 896 AFLHGDLVEEIYMHQPEGFLEEGKENMVCMLKKSLYGLKQAPRQWYMKFESFM-HKEGFQ 954
A+L+ D+ EE+Y+ P G + + L+KSLYGLKQ+ WY +S++ + Q
Sbjct: 1365 AYLYADIKEELYIRPPPHL---GLNDKLLRLRKSLYGLKQSGANWYETIKSYLINCCDMQ 1421
Query: 955 KCNADHCCFFKRYKSSYIILLLYVDDMLVAGSNIDEIKNLKIQLSKEFDMK--DLGPAKK 1012
+ C F K+S + + L+VDDM++ +++ K + L K++D K +LG +
Sbjct: 1422 EVRGWSCVF----KNSQVTICLFVDDMILFSKDLNANKKIITTLKKQYDTKIINLGESDN 1477
Query: 1013 -----ILGMQITRDKQKGV---LQLS*AEYINRVLQRFNMGDAKLVSTPLASHFRLSQEQ 1064
ILG++I + K + ++ S E + ++ N KL + H+ E
Sbjct: 1478 EIQYDILGLEIKYQRSKYMKLGMEKSLTEKLPKLNVPLNPKGKKLRAPGQPGHYIDQDEL 1537
Query: 1065 SPQTEEEKELMAKIPYASAIGSLMYAMVCTRPDIGHAVGVVSRFMSNPGKAHWEAVKWIL 1124
+E KE + ++ IG Y R D+ + + +++ + P + + ++
Sbjct: 1538 EIDEDEYKEKVHEM--QKLIGLASYVGYKFRFDLLYYINTLAQHILFPSRQVLDMTYELI 1595
Query: 1125 RYLRGTTEKCLYFGKG-----EIKVEGYVDADFAGEVDHRRSTTGYIFTVGTRSVSWMSR 1179
+++ T +K L + K + K+ DA + G + +S G IF + + + S
Sbjct: 1596 QFMWDTRDKQLIWHKNKPTKPDNKLVAISDASY-GNQPYYKSQIGNIFLLNGKVIGGKST 1654
Query: 1180 IQKIVALSTTEVEYVAVTEASKELIWLQGLLTELGFMQEKSALYSDSQSAIHLAKNSAFH 1239
+ STTE E AV+EA L L L+ EL L +DS+S I + K++
Sbjct: 1655 KASLTCTSTTEAEIHAVSEAIPLLNNLSHLVQELNKKPIIKGLLTDSRSTISIIKSTNEE 1714
Query: 1240 S-RTKHIGLRYHFIRSLLEDEVLTLIKIQGSKNPADMLTKVVTIDKLKL 1287
R + G + +R + L + I+ KN AD++TK + I KL
Sbjct: 1715 KFRNRFFGTKAMRLRDEVSGNNLYVYYIETKKNIADVMTKPLPIKTFKL 1763
Score = 123 bits (309), Expect = 3e-27
Identities = 145/601 (24%), Positives = 250/601 (41%), Gaps = 62/601 (10%)
Query: 192 SSSSVLHTESRGRNSTRGNGRGKSKARRSKSKNHRSS------HNSKSIECWNCGKTGHF 245
+S + +T+ RN R N SK R +K+ N +S +N E +
Sbjct: 369 TSPNTTNTKVTTRNYQRTNS---SKPRAAKAHNIATSSKFSRVNNDHINESTVSSQYLSD 425
Query: 246 KNQCRLPTKNQEEKDEANVASTSGGGDALICSLESKEESWVLDSGASFHASSQKEFFKNY 305
N+ L + +E K + S D L+ +DSGAS + +
Sbjct: 426 DNELSLGQQQKESKPTHTIDSNDELPDHLL-----------IDSGASQTLVRSAHYLHHA 474
Query: 306 VPGNLGKVYLGNEQSCKVVGKGEVKIKL-NGSVWELKNVRHIPNLTKNLISVGQLADEGY 364
P + + +Q + G + NG+ +K + H PN+ +L+S+ +LA++
Sbjct: 475 TPNSEINIVDAQKQDIPINAIGNLHFNFQNGTKTSIKAL-HTPNIAYDLLSLSELANQNI 533
Query: 365 TTVFHGDDWKISKGAMTIARGRKSGTLY----KTAGACHLIAVATNENPN---------- 410
T F + + S G + +A K G Y K H+ + N N N
Sbjct: 534 TACFTRNTLERSDGTV-LAPIVKHGDFYWLSKKYLIPSHISKLTIN-NVNKSKSVNKYPY 591
Query: 411 -LWHKRLGHMSEKGMKVMHSKGKLPSLRSIEIDI-------CEDCILGKQKRVSFQTSGR 462
L H+ LGH + + ++ K + L+ +I+ C DC++GK + R
Sbjct: 592 PLIHRMLGHANFRSIQKSLKKNAVTYLKESDIEWSNASTYQCPDCLIGKSTKHRHVKGSR 651
Query: 463 TPKKEKLE---LVHSDVWGPTTVPSIGGKHYFVTFIDDHSRKVWVYFLKHKSE--VFEAF 517
+E E +H+D++GP YF++F D+ +R WVY L + E + F
Sbjct: 652 LKYQESYEPFQYLHTDIFGPVHHLPKSAPSYFISFTDEKTRFQWVYPLHDRREESILNVF 711
Query: 518 KRWKAMVENETDLKIKKLRTDNGGEYEDTKFKKFCYEHGIRMERTVPGTPQHNGVAERMN 577
A ++N+ + ++ ++ D G EY + KF GI T + +GVAER+N
Sbjct: 712 TSILAFIKNQFNARVLVIQMDRGSEYTNKTLHKFFTNRGITACYTTTADSRAHGVAERLN 771
Query: 578 RTLTERARSLRVQSGLPKKFWAEAVNTSAYLINRGPSVPLEHKIPEEVWSGKEVKLSHLR 637
RTL R+L SGLP W AV S + N S P K + + ++ +
Sbjct: 772 RTLLNDCRTLLHCSGLPNHLWFSAVEFSTIIRNSLVS-PKNDKSARQHAGLAGLDITTIL 830
Query: 638 VFGCVAYVHISDQGRNKLDPKSKKCIFIGYGEDEFGYRLWDDENKKMVRSKD-VIFNERV 696
FG V+ + +K+ P+ + + +GY ++ KK V + + VI ++
Sbjct: 831 PFGQPVIVN-NHNPDSKIHPRGIPGYALHPSRNSYGYIIYLPSLKKTVDTTNYVILQDKQ 889
Query: 697 MYKDKHN--TTTNDSGLS-----EPVYVEMDDVPGSPTDKSPQSGELAESSIRQPSDTLV 749
D+ N T T D L+ ++E ++ S D++ +S +S I SD LV
Sbjct: 890 SKLDQFNYDTLTFDDDLNRLTAHNQSFIEQNETEQS-YDQNTESDHDYQSEIEINSDPLV 948
Query: 750 H 750
+
Sbjct: 949 N 949
>YME4_YEAST (Q04711) Transposon Ty1 protein B
Length = 1328
Score = 139 bits (349), Expect = 7e-32
Identities = 135/530 (25%), Positives = 245/530 (45%), Gaps = 42/530 (7%)
Query: 785 YDEAC----QTTDASKWELAMKEEMKSLISNQTWELAKLPIGK-----KALHNKWVYRVK 835
YDEA + K+ A +E+ L+ TW+ K K + +++ +++ K
Sbjct: 807 YDEAITYNKDIKEKEKYIEAYHKEVNQLLKMNTWDTDKYYDRKEIDPKRVINSMFIFNRK 866
Query: 836 EDHDGSKRYKARLVVKGFRQKEGIDYTEIFAPVVKLNTIRSVLSIVASENLYLEQLDVKT 895
D +KAR V +G Q + + + V + + LS+ N Y+ QLD+ +
Sbjct: 867 RDGT----HKARFVARGDIQHPDTYDSGMQSNTVHHYALMTSLSLALDNNYYITQLDISS 922
Query: 896 AFLHGDLVEEIYMHQPEGFLEEGKENMVCMLKKSLYGLKQAPRQWYMKFESFMHKE-GFQ 954
A+L+ D+ EE+Y+ P G + + LKKSLYGLKQ+ WY +S++ K+ G +
Sbjct: 923 AYLYADIKEELYIRPPPHL---GMNDKLIRLKKSLYGLKQSGANWYETIKSYLIKQCGME 979
Query: 955 KCNADHCCFFKRYKSSYIILLLYVDDMLVAGSNIDEIKNLKIQLSKEFDMK--DLGPAKK 1012
+ C F K+S + + L+VDDM++ +++ K + L K++D K +LG +
Sbjct: 980 EVRGWSCVF----KNSQVTICLFVDDMILFSKDLNANKKIITTLKKQYDTKIINLGESDN 1035
Query: 1013 -----ILGMQITRDKQKGV---LQLS*AEYINRVLQRFNMGDAKLVSTPLASHFRLSQEQ 1064
ILG++I + K + ++ S E I ++ N KL S P + Q++
Sbjct: 1036 EIQYDILGLEIKYQRGKYMKLGMENSLTEKIPKLNVPLNPKGRKL-SAPGQPGLYIDQDE 1094
Query: 1065 SPQTEEE-KELMAKIPYASAIGSLMYAMVCTRPDIGHAVGVVSRFMSNPGKAHWEAVKWI 1123
E+E KE + ++ IG Y R D+ + + +++ + P + + +
Sbjct: 1095 LEIDEDEYKEKVHEM--QKLIGLASYVGYKFRFDLLYYINTLAQHILFPSRQVLDMTYEL 1152
Query: 1124 LRYLRGTTEKCLYFGKG-----EIKVEGYVDADFAGEVDHRRSTTGYIFTVGTRSVSWMS 1178
++++ T +K L + K + K+ DA + G + +S G I+ + + + S
Sbjct: 1153 IQFMWDTRDKQLIWHKNKPTEPDNKLVAISDASY-GNQPYYKSQIGNIYLLNGKVIGGKS 1211
Query: 1179 RIQKIVALSTTEVEYVAVTEASKELIWLQGLLTELGFMQEKSALYSDSQSAIH-LAKNSA 1237
+ STTE E A++E+ L L L+ EL L +DS+S I + N+
Sbjct: 1212 TKASLTCTSTTEAEIHAISESVPLLNNLSHLVQELNKKPITKGLLTDSKSTISIIISNNE 1271
Query: 1238 FHSRTKHIGLRYHFIRSLLEDEVLTLIKIQGSKNPADMLTKVVTIDKLKL 1287
R + G + +R + L + I+ KN AD++TK + I KL
Sbjct: 1272 EKFRNRFFGTKAMRLRDEVSGNHLHVCYIETKKNIADVMTKPLPIKTFKL 1321
Score = 99.8 bits (247), Expect = 4e-20
Identities = 110/483 (22%), Positives = 194/483 (39%), Gaps = 62/483 (12%)
Query: 343 VRHIPNLTKNLISVGQLADEGYTTVFHGDDWKISKGAMTIARGRKSGTLYKTAGACHL-- 400
V H PN+ +L+S+ +LA T F + + S G + +A K G Y + L
Sbjct: 89 VLHTPNIAYDLLSLNELAAVDITACFTKNVLERSDGTV-LAPIVKYGDFYWVSKKYLLPS 147
Query: 401 -IAVATNENPN-----------LWHKRLGHMSEKGMKVMHSKGKLPSLRSIEIDI----- 443
I+V T N + H+ L H + + ++ + ++D
Sbjct: 148 NISVPTINNVHTSESTRKYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDWSSAID 207
Query: 444 --CEDCILGKQ--------KRVSFQTSGRTPKKEKLELVHSDVWGPTTVPSIGGKHYFVT 493
C DC++GK R+ +Q S E + +H+D++GP YF++
Sbjct: 208 YQCPDCLIGKSTKHRHIKGSRLKYQNS-----YEPFQYLHTDIFGPVHNLPKSAPSYFIS 262
Query: 494 FIDDHSRKVWVYFLKHKSE--VFEAFKRWKAMVENETDLKIKKLRTDNGGEYEDTKFKKF 551
F D+ ++ WVY L + E + + F A ++N+ + ++ D G EY + KF
Sbjct: 263 FTDETTKFRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKF 322
Query: 552 CYEHGIRMERTVPGTPQHNGVAERMNRTLTERARSLRVQSGLPKKFWAEAVNTSAYLINR 611
++GI T + +GVAER+NRTL + R+ SGLP W A+ S ++
Sbjct: 323 LEKNGITPCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFST-IVRN 381
Query: 612 GPSVPLEHKIPEEVWSGKEVKLSHLRVFGCVAYVHISDQGRN-KLDPKSKKCIFIGYGED 670
+ P K + + +S L FG V ++D N K+ P+ + +
Sbjct: 382 SLASPKSKKSARQHAGLAGLDISTLLPFG--QPVIVNDHNPNSKIHPRGIPGYALHPSRN 439
Query: 671 EFGYRLWDDENKKMVRSKD-VIFNERVMYKDKHN--------------------TTTNDS 709
+GY ++ KK V + + VI + D+ N +N+
Sbjct: 440 SYGYIIYLPSLKKTVDTTNYVILQGKESRLDQFNYDALTFDEDLNRLTASYQSFIASNEI 499
Query: 710 GLSEPVYVEMDDVPGSPTDKSPQSGELAESSIRQPSDTLVHPTPVPVLRRSSRPHAPNRR 769
S+ + +E D S + P+ S P+D+ T +R S+ + R
Sbjct: 500 QQSDDLNIESDHDFQSDIELHPEQPRNVLSKAVSPTDSTPPSTHTEDSKRVSKTNIRAPR 559
Query: 770 YID 772
+D
Sbjct: 560 EVD 562
>YMT5_YEAST (Q04214) Transposon Ty1 protein B
Length = 1328
Score = 138 bits (347), Expect = 1e-31
Identities = 135/529 (25%), Positives = 243/529 (45%), Gaps = 40/529 (7%)
Query: 785 YDEAC----QTTDASKWELAMKEEMKSLISNQTWELAKLPIGK-----KALHNKWVYRVK 835
YDEA + K+ A +E+ L+ +TW+ K K + +++ +++ K
Sbjct: 807 YDEAITYNKDIKEKEKYIEAYHKEVNQLLKMKTWDTDKYYDRKEIDPKRVINSMFIFNRK 866
Query: 836 EDHDGSKRYKARLVVKGFRQKEGIDYTEIFAPVVKLNTIRSVLSIVASENLYLEQLDVKT 895
D +KAR V +G Q + + + V + + LS+ N Y+ QLD+ +
Sbjct: 867 RDGT----HKARFVARGDIQHPDTYDSGMQSNTVHHYALMTSLSLALDNNYYITQLDISS 922
Query: 896 AFLHGDLVEEIYMHQPEGFLEEGKENMVCMLKKSLYGLKQAPRQWYMKFESFMHKE-GFQ 954
A+L+ D+ EE+Y+ P G + + LKKSLYGLKQ+ WY +S++ K+ G +
Sbjct: 923 AYLYADIKEELYIRPPPHL---GMNDKLIRLKKSLYGLKQSGANWYETIKSYLIKQCGME 979
Query: 955 KCNADHCCFFKRYKSSYIILLLYVDDMLVAGSNIDEIKNLKIQLSKEFDMK--DLGPAKK 1012
+ C F ++S + + L+VDDM++ N++ K + +L ++D K +LG + +
Sbjct: 980 EVRGWSCVF----ENSQVTICLFVDDMVLFSKNLNSNKRIIDKLKMQYDTKIINLGESDE 1035
Query: 1013 -----ILGMQITRDKQKGV---LQLS*AEYINRVLQRFNMGDAKLVSTPLASHFRLSQEQ 1064
ILG++I + K + ++ S E I ++ N KL S P + Q Q
Sbjct: 1036 EIQYDILGLEIKYQRGKYMKLGMENSLTEKIPKLNVPLNPKGRKL-SAPGQPGLYIDQ-Q 1093
Query: 1065 SPQTEEEKELMAKIPYASAIGSLMYAMVCTRPDIGHAVGVVSRFMSNPGKAHWEAVKWIL 1124
+ EE+ M IG Y R D+ + + +++ + P K + ++
Sbjct: 1094 ELELEEDDYKMKVHEMQKLIGLASYVGYKFRFDLLYYINTLAQHILFPSKQVLDMTYELI 1153
Query: 1125 RYLRGTTEKCLYFGKGE-----IKVEGYVDADFAGEVDHRRSTTGYIFTVGTRSVSWMSR 1179
+++ T +K L + K + K+ DA + G + +S G I+ + + + S
Sbjct: 1154 QFIWNTRDKQLIWHKSKPVKPTNKLVVISDASY-GNQPYYKSQIGNIYLLNGKVIGGKST 1212
Query: 1180 IQKIVALSTTEVEYVAVTEASKELIWLQGLLTELGFMQEKSALYSDSQSAIH-LAKNSAF 1238
+ STTE E A++E+ L L L+ EL L +DS+S I + N+
Sbjct: 1213 KASLTCTSTTEAEIHAISESVPLLNNLSYLIQELDKKPITKGLLTDSKSTISIIISNNEE 1272
Query: 1239 HSRTKHIGLRYHFIRSLLEDEVLTLIKIQGSKNPADMLTKVVTIDKLKL 1287
R + G + +R + L + I+ KN AD++TK + I KL
Sbjct: 1273 KFRNRFFGTKAMRLRDEVSGNHLHVCYIETKKNIADVMTKPLPIKTFKL 1321
Score = 100 bits (248), Expect = 3e-20
Identities = 107/467 (22%), Positives = 190/467 (39%), Gaps = 48/467 (10%)
Query: 257 EEKDEANVASTSGGGDALICSLESKEESWVLDSGASFHASSQKEFFKNYVPGNLGKVYLG 316
+E E+ V T+ D L L +LDSGAS + V
Sbjct: 10 QELTESTVNHTNHSDDKLPGHL-------LLDSGASRTLIRSAHHIHSASSNPDINVVDA 62
Query: 317 NEQSCKVVGKGEVKIKLNGSVWELKNVRHIPNLTKNLISVGQLADEGYTTVFHGDDWKIS 376
+++ + G+++ + V H PN+ +L+S+ +LA T F + + S
Sbjct: 63 QKRNIPINAIGDLQFHFQDNTKTSIKVLHTPNIAYDLLSLNELAAVDITACFTKNVLERS 122
Query: 377 KGAMTIARGRKSGTLYKTAGACHL---IAVATNENPN-----------LWHKRLGHMSEK 422
G + +A K G Y + L I+V T N + H+ L H + +
Sbjct: 123 DGTV-LAPIVKYGDFYWVSKKYLLPSNISVPTINNVHTSESTRKYPYPFIHRMLAHANAQ 181
Query: 423 GMKVMHSKGKLPSLRSIEIDI-------CEDCILGKQ--------KRVSFQTSGRTPKKE 467
++ + ++D C DC++GK R+ +Q S E
Sbjct: 182 TIRYSLKNNTITYFNESDVDWSSAIDYQCPDCLIGKSTKHRHIKGSRLKYQNS-----YE 236
Query: 468 KLELVHSDVWGPTTVPSIGGKHYFVTFIDDHSRKVWVYFLKHKSE--VFEAFKRWKAMVE 525
+ +H+D++GP YF++F D+ ++ WVY L + E + + F A ++
Sbjct: 237 PFQYLHTDIFGPVHNLPKSAPSYFISFTDETTKFRWVYPLHDRREDSILDVFTTILAFIK 296
Query: 526 NETDLKIKKLRTDNGGEYEDTKFKKFCYEHGIRMERTVPGTPQHNGVAERMNRTLTERAR 585
N+ + ++ D G EY + KF ++GI T + +GVAER+NRTL + R
Sbjct: 297 NQFQASVLVIQMDRGSEYTNRTLHKFLEKNGITPCYTTTADSRAHGVAERLNRTLLDDCR 356
Query: 586 SLRVQSGLPKKFWAEAVNTSAYLINRGPSVPLEHKIPEEVWSGKEVKLSHLRVFGCVAYV 645
+ SGLP W A+ S ++ + P K + + +S L FG V
Sbjct: 357 TQLQCSGLPNHLWFSAIEFST-IVRNSLASPKSKKSARQHAGLAGLDISTLLPFG--QPV 413
Query: 646 HISDQGRN-KLDPKSKKCIFIGYGEDEFGYRLWDDENKKMVRSKDVI 691
++D N K+ P+ + + +GY ++ KK V + + +
Sbjct: 414 IVNDHNPNSKIHPRGIPGYALHPSRNSYGYIIYLPSLKKTVDTTNYV 460
>YJZ9_YEAST (P47100) Transposon Ty1 protein B
Length = 1755
Score = 138 bits (347), Expect = 1e-31
Identities = 135/529 (25%), Positives = 243/529 (45%), Gaps = 40/529 (7%)
Query: 785 YDEAC----QTTDASKWELAMKEEMKSLISNQTWEL-----AKLPIGKKALHNKWVYRVK 835
YDEA + K+ A +E+ L+ +TW+ K K+ +++ +++ K
Sbjct: 1234 YDEAITYNKDIKEKEKYIEAYHKEVNQLLKMKTWDTDEYYDRKEIDPKRVINSMFIFNKK 1293
Query: 836 EDHDGSKRYKARLVVKGFRQKEGIDYTEIFAPVVKLNTIRSVLSIVASENLYLEQLDVKT 895
D +KAR V +G Q + + + V + + LS+ N Y+ QLD+ +
Sbjct: 1294 RDGT----HKARFVARGDIQHPDTYDSGMQSNTVHHYALMTSLSLALDNNYYITQLDISS 1349
Query: 896 AFLHGDLVEEIYMHQPEGFLEEGKENMVCMLKKSLYGLKQAPRQWYMKFESFMHKE-GFQ 954
A+L+ D+ EE+Y+ P G + + LKKSLYGLKQ+ WY +S++ ++ G +
Sbjct: 1350 AYLYADIKEELYIRPPPHL---GMNDKLIRLKKSLYGLKQSGANWYETIKSYLIQQCGME 1406
Query: 955 KCNADHCCFFKRYKSSYIILLLYVDDMLVAGSNIDEIKNLKIQLSKEFDMK--DLGPAKK 1012
+ C F K+S + + L+VDDM++ N++ K + +L ++D K +LG + +
Sbjct: 1407 EVRGWSCVF----KNSQVTICLFVDDMVLFSKNLNSNKRIIEKLKMQYDTKIINLGESDE 1462
Query: 1013 -----ILGMQITRDKQKGV---LQLS*AEYINRVLQRFNMGDAKLVSTPLASHFRLSQEQ 1064
ILG++I + K + ++ S E I ++ N KL S P + Q Q
Sbjct: 1463 EIQYDILGLEIKYQRGKYMKLGMENSLTEKIPKLNVPLNPKGRKL-SAPGQPGLYIDQ-Q 1520
Query: 1065 SPQTEEEKELMAKIPYASAIGSLMYAMVCTRPDIGHAVGVVSRFMSNPGKAHWEAVKWIL 1124
+ EE+ M IG Y R D+ + + +++ + P K + ++
Sbjct: 1521 ELELEEDDYKMKVHEMQKLIGLASYVGYKFRFDLLYYINTLAQHILFPSKQVLDMTYELI 1580
Query: 1125 RYLRGTTEKCLYFGKGE-----IKVEGYVDADFAGEVDHRRSTTGYIFTVGTRSVSWMSR 1179
+++ T +K L + K + K+ DA + G + +S G I+ + + + S
Sbjct: 1581 QFIWNTRDKQLIWHKSKPVKPTNKLVVISDASY-GNQPYYKSQIGNIYLLNGKVIGGKST 1639
Query: 1180 IQKIVALSTTEVEYVAVTEASKELIWLQGLLTELGFMQEKSALYSDSQSAIH-LAKNSAF 1238
+ STTE E A++E+ L L L+ EL L +DS+S I + N+
Sbjct: 1640 KASLTCTSTTEAEIHAISESVPLLNNLSYLIQELDKKPITKGLLTDSKSTISIIISNNEE 1699
Query: 1239 HSRTKHIGLRYHFIRSLLEDEVLTLIKIQGSKNPADMLTKVVTIDKLKL 1287
R + G + +R + L + I+ KN AD++TK + I KL
Sbjct: 1700 KFRNRFFGTKAMRLRDEVSGNHLHVCYIETKKNIADVMTKPLPIKTFKL 1748
Score = 103 bits (257), Expect = 3e-21
Identities = 137/626 (21%), Positives = 248/626 (38%), Gaps = 78/626 (12%)
Query: 204 RNSTRGNGRGKSKARRSKSKNHRSSHNSKSIECWNCGKTGHFKNQCRLPTKNQ----EEK 259
RN + N SK++ +++ N +S+NS S + + K+ +L K+ ++
Sbjct: 385 RNPQKTNN---SKSKTARAHNVSTSNNSPSTDNDSISKST--TEPIQLNNKHDLILGQKL 439
Query: 260 DEANVASTSGGGDALICSLESKEESWVLDSGASFHASSQKEFFKNYVPGNLGKVYLGNEQ 319
E+ V T+ D L L +LDSGAS + V ++
Sbjct: 440 TESTVNHTNHSDDELPGHL-------LLDSGASRTLIRSAHHIHSASSNPDINVVDAQKR 492
Query: 320 SCKVVGKGEVKIKLNGSVWELKNVRHIPNLTKNLISVGQLADEGYTTVFHGDDWKISKGA 379
+ + G+++ + V H PN+ +L+S+ +LA T F + + S G
Sbjct: 493 NIPINAIGDLQFHFQDNTKTSIKVLHTPNIAYDLLSLNELAAVDITACFTKNVLERSDGT 552
Query: 380 MTIARGRKSGTLYKTAGACHL---IAVATNENPN-----------LWHKRLGHMSEKGMK 425
+ +A K G Y + L I+V T N + H+ L H + + ++
Sbjct: 553 V-LAPIVKYGDFYWVSKKYLLPSNISVPTINNVHTSESTRKYPYPFIHRMLAHANAQTIR 611
Query: 426 VMHSKGKLPSLRSIEIDI-------CEDCILGKQ--------KRVSFQTSGRTPKKEKLE 470
+ ++D C DC++GK R+ +Q S E +
Sbjct: 612 YSLKNNTITYFNESDVDWSSAIDYQCPDCLIGKSTKHRHIKGSRLKYQNS-----YEPFQ 666
Query: 471 LVHSDVWGPTTVPSIGGKHYFVTFIDDHSRKVWVYFLKHKSE--VFEAFKRWKAMVENET 528
+H+D++GP YF++F D+ ++ WVY L + E + + F A ++N+
Sbjct: 667 YLHTDIFGPVHNLPKSAPSYFISFTDETTKFRWVYPLHDRREDSILDVFTTILAFIKNQF 726
Query: 529 DLKIKKLRTDNGGEYEDTKFKKFCYEHGIRMERTVPGTPQHNGVAERMNRTLTERARSLR 588
+ ++ D G EY + KF ++GI T + +GVAER+NRTL + R+
Sbjct: 727 QASVLVIQMDRGSEYTNRTLHKFLEKNGITPCYTTTADSRAHGVAERLNRTLLDDCRTQL 786
Query: 589 VQSGLPKKFWAEAVNTSAYLINRGPSVPLEHKIPEEVWSGKEVKLSHLRVFGCVAYVHIS 648
SGLP W A+ S ++ + P K + + +S L FG V ++
Sbjct: 787 QCSGLPNHLWFSAIEFST-IVRNSLASPKSKKSARQHAGLAGLDISTLLPFG--QPVIVN 843
Query: 649 DQGRN-KLDPKSKKCIFIGYGEDEFGYRLWDDENKKMVRSKD-VIFNERVMYKDKHN--- 703
D N K+ P+ + + +GY ++ KK V + + VI + D+ N
Sbjct: 844 DHNPNSKIHPRGIPGYALHPSRNSYGYIIYLPSLKKTVDTTNYVILQGKESRLDQFNYDA 903
Query: 704 -----------------TTTNDSGLSEPVYVEMDDVPGSPTDKSPQSGELAESSIRQPSD 746
+N+ S+ + +E D S + P+ S P+D
Sbjct: 904 LTFDEDLNRLTASYQSFIASNEIQQSDDLNIESDHDFQSDIELHPEQPRNVLSKAVSPTD 963
Query: 747 TLVHPTPVPVLRRSSRPHAPNRRYID 772
+ T +R S+ + R +D
Sbjct: 964 STPPSTHTEDSKRVSKTNIRAPREVD 989
>M810_ARATH (P92519) Hypothetical mitochondrial protein AtMg00810
(ORF240b)
Length = 240
Score = 137 bits (345), Expect = 2e-31
Identities = 87/233 (37%), Positives = 126/233 (53%), Gaps = 11/233 (4%)
Query: 974 LLLYVDDMLVAGSNIDEIKNLKIQLSKEFDMKDLGPAKKILGMQITRDKQKGVLQLS*AE 1033
LLLYVDD+L+ GS+ + L QLS F MKDLGP LG+QI L LS +
Sbjct: 3 LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSG--LFLSQTK 60
Query: 1034 YINRVLQRFNMGDAKLVSTPLASHFRLSQEQSPQTEEEKELMAKIPYASAIGSLMYAMVC 1093
Y ++L M D K +STPL S + + + S +G+L Y +
Sbjct: 61 YAEQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYPDPSD-------FRSIVGALQY-LTL 112
Query: 1094 TRPDIGHAVGVVSRFMSNPGKAHWEAVKWILRYLRGTTEKCLYFGKG-EIKVEGYVDADF 1152
TRPDI +AV +V + M P A ++ +K +LRY++GT LY K ++ V+ + D+D+
Sbjct: 113 TRPDISYAVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDW 172
Query: 1153 AGEVDHRRSTTGYIFTVGTRSVSWMSRIQKIVALSTTEVEYVAVTEASKELIW 1205
AG RRSTTG+ +G +SW ++ Q V+ S+TE EY A+ + EL W
Sbjct: 173 AGCTSTRRSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225
>M300_ARATH (P93293) Hypothetical mitochondrial protein AtMg00300
(ORF145a) (ORF1451)
Length = 145
Score = 90.5 bits (223), Expect = 3e-17
Identities = 46/114 (40%), Positives = 65/114 (56%), Gaps = 5/114 (4%)
Query: 374 KISKGAMTIARGRKSGTLYKTAGACHL----IAVATNENPNLWHKRLGHMSEKGMKVMHS 429
K+ KG TI +G + +LY G+ +A + LWH RL HMS++GM+++
Sbjct: 30 KVLKGCRTILKGNRHDSLYILQGSVETGESNLAETAKDETRLWHSRLAHMSQRGMELLVK 89
Query: 430 KGKLPSLRSIEIDICEDCILGKQKRVSFQTSGRTPKKEKLELVHSDVWGPTTVP 483
KG L S + + CEDCI GK RV+F T G+ K L+ VHSD+WG +VP
Sbjct: 90 KGFLDSSKVSSLKFCEDCIYGKTHRVNFST-GQHTTKNPLDYVHSDLWGAPSVP 142
>M710_ARATH (P92512) Hypothetical mitochondrial protein AtMg00710
(ORF120)
Length = 120
Score = 83.6 bits (205), Expect = 3e-15
Identities = 44/99 (44%), Positives = 63/99 (63%), Gaps = 9/99 (9%)
Query: 576 MNRTLTERARSLRVQSGLPKKFWAEAVNTSAYLINRGPSVPLEHKIPEEVWSGKEVKLSH 635
MNRT+ E+ RS+ + GLPK F A+A NT+ ++IN+ PS + +P+EVW S+
Sbjct: 1 MNRTIIEKVRSMLCECGLPKTFRADAANTAVHIINKYPSTAINFHVPDEVWFQSVPTYSY 60
Query: 636 LRVFGCVAYVHISDQGRNKLDPKSKKCIFIGYGEDEFGY 674
LR FGCVAY+H D+G KL P++KK GE++ Y
Sbjct: 61 LRRFGCVAYIH-CDEG--KLKPRAKK------GEEKGSY 90
>M820_ARATH (P92520) Hypothetical mitochondrial protein AtMg00820
(ORF170)
Length = 170
Score = 79.3 bits (194), Expect = 6e-14
Identities = 39/85 (45%), Positives = 57/85 (66%), Gaps = 1/85 (1%)
Query: 797 WELAMKEEMKSLISNQTWELAKLPIGKKALHNKWVYRVKEDHDGS-KRYKARLVVKGFRQ 855
W AM+EE+ +L N+TW L P+ + L KWV++ K DG+ R KARLV KGF Q
Sbjct: 40 WCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQ 99
Query: 856 KEGIDYTEIFAPVVKLNTIRSVLSI 880
+EGI + E ++PVV+ TIR++L++
Sbjct: 100 EEGIYFVETYSPVVRTATIRTILNV 124
>M240_ARATH (P93290) Hypothetical mitochondrial protein AtMg00240
(ORF111a)
Length = 111
Score = 50.8 bits (120), Expect = 2e-05
Identities = 26/76 (34%), Positives = 46/76 (60%), Gaps = 1/76 (1%)
Query: 1091 MVCTRPDIGHAVGVVSRFMSNPGKAHWEAVKWILRYLRGTTEKCLYF-GKGEIKVEGYVD 1149
+ TRPD+ AV +S+F S A +AV +L Y++GT + L++ +++++ + D
Sbjct: 3 LTITRPDLTFAVNRLSQFSSASRTAQMQAVYKVLHYVKGTVGQGLFYSATSDLQLKAFAD 62
Query: 1150 ADFAGEVDHRRSTTGY 1165
+D+A D RRS TG+
Sbjct: 63 SDWASCPDTRRSVTGF 78
>POL_HV1RH (P05959) Pol polyprotein [Contains: Protease
(Retropepsin) (EC 3.4.23.16); Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)]
Length = 1002
Score = 46.2 bits (108), Expect = 6e-04
Identities = 22/60 (36%), Positives = 32/60 (52%)
Query: 532 IKKLRTDNGGEYEDTKFKKFCYEHGIRMERTVPGTPQHNGVAERMNRTLTERARSLRVQS 591
+K + TDNG + T K C+ GI+ E +P PQ GV E MN+ L + +R Q+
Sbjct: 824 VKVIHTDNGSNFTSTTVKAACWWAGIKQEFGIPYNPQSQGVVESMNKQLKQIIGQVRDQA 883
>POL_HV1BR (P03367) Pol polyprotein [Contains: Protease
(Retropepsin) (EC 3.4.23.16); Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)]
Length = 1015
Score = 46.2 bits (108), Expect = 6e-04
Identities = 22/60 (36%), Positives = 32/60 (52%)
Query: 532 IKKLRTDNGGEYEDTKFKKFCYEHGIRMERTVPGTPQHNGVAERMNRTLTERARSLRVQS 591
+K + TDNG + T K C+ GI+ E +P PQ GV E MN+ L + +R Q+
Sbjct: 837 VKTIHTDNGSNFTSTTVKAACWWAGIKQEFGIPYNPQSQGVVESMNKELKKIIGQVRDQA 896
>POL_HV1OY (P20892) Pol polyprotein [Contains: Protease
(Retropepsin) (EC 3.4.23.16); Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)]
Length = 1003
Score = 45.4 bits (106), Expect = 0.001
Identities = 22/60 (36%), Positives = 31/60 (51%)
Query: 532 IKKLRTDNGGEYEDTKFKKFCYEHGIRMERTVPGTPQHNGVAERMNRTLTERARSLRVQS 591
+K + TDNG + T K C+ GI+ E +P PQ GV E MN L + +R Q+
Sbjct: 825 VKTIHTDNGSNFTSTTVKAACWWAGIKQEFGIPYNPQSQGVVESMNNELKKIIGQVRDQA 884
>POL_HV1N5 (P12497) Pol polyprotein [Contains: Protease
(Retropepsin) (EC 3.4.23.16); Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)]
Length = 1003
Score = 45.4 bits (106), Expect = 0.001
Identities = 22/60 (36%), Positives = 32/60 (52%)
Query: 532 IKKLRTDNGGEYEDTKFKKFCYEHGIRMERTVPGTPQHNGVAERMNRTLTERARSLRVQS 591
+K + TDNG + T K C+ GI+ E +P PQ GV E MN+ L + +R Q+
Sbjct: 825 VKTVHTDNGSNFTSTTVKAACWWAGIKQEFGIPYNPQSQGVIESMNKELKKIIGQVRDQA 884
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.317 0.133 0.392
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 155,427,810
Number of Sequences: 164201
Number of extensions: 6816323
Number of successful extensions: 19088
Number of sequences better than 10.0: 208
Number of HSP's better than 10.0 without gapping: 147
Number of HSP's successfully gapped in prelim test: 62
Number of HSP's that attempted gapping in prelim test: 18504
Number of HSP's gapped (non-prelim): 459
length of query: 1302
length of database: 59,974,054
effective HSP length: 122
effective length of query: 1180
effective length of database: 39,941,532
effective search space: 47131007760
effective search space used: 47131007760
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 72 (32.3 bits)
Medicago: description of AC144760.12