
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0399.4
(1400 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
POLX_TOBAC (P10978) Retrovirus-related Pol polyprotein from tran... 710 0.0
COPI_DROME (P04146) Copia protein (Gag-int-pol protein) [Contain... 360 2e-98
YCH4_YEAST (P25600) Transposon Ty5-1 34.5 kDa hypothetical protein 167 2e-40
M810_ARATH (P92519) Hypothetical mitochondrial protein AtMg00810... 154 2e-36
YJL3_YEAST (P47024) Transposon Ty4 207.7 kDa hypothetical protein 148 1e-34
YMD9_YEAST (Q03434) Transposon Ty1 protein B 139 4e-32
YME4_YEAST (Q04711) Transposon Ty1 protein B 137 2e-31
YJZ7_YEAST (P47098) Transposon Ty1 protein B 137 3e-31
YCB9_YEAST (P25384) Transposon Ty2 protein B (Ty1-17 protein B) 135 6e-31
YJZ9_YEAST (P47100) Transposon Ty1 protein B 132 5e-30
YMU0_YEAST (Q04670) Transposon Ty1 protein B 131 1e-29
YMT5_YEAST (Q04214) Transposon Ty1 protein B 130 2e-29
M820_ARATH (P92520) Hypothetical mitochondrial protein AtMg00820... 91 2e-17
M710_ARATH (P92512) Hypothetical mitochondrial protein AtMg00710... 62 8e-09
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 62 1e-08
POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23... 60 4e-08
POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse transcript... 56 6e-07
POL_SFV3L (P27401) Pol polyprotein [Contains: Protease (EC 3.4.2... 55 1e-06
POL_JSRV (P31623) Pol polyprotein [Contains: Reverse transcripta... 53 5e-06
M240_ARATH (P93290) Hypothetical mitochondrial protein AtMg00240... 50 4e-05
>POLX_TOBAC (P10978) Retrovirus-related Pol polyprotein from
transposon TNT 1-94 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1328
Score = 710 bits (1833), Expect = 0.0
Identities = 447/1341 (33%), Positives = 697/1341 (51%), Gaps = 118/1341 (8%)
Query: 85 SSELYRGMEDPEEEQ*RCSRAEKLKKVKLQT----MRRQFELLQMETNESIADFFNRIIS 140
S ++ + D + + +R E L K T +++Q L M + N
Sbjct: 68 SDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYALHMSEGTNFLSHLNVFNG 127
Query: 141 LTNLMKACGEKMTDQAIVEKVLRTLTPKFDHVVVAIEESKKLENLKIEELQGSLEAHEQR 200
L + G K+ ++ +L +L +D++ I K ++++++ +L +E
Sbjct: 128 LITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTILHGKT--TIELKDVTSALLLNE-- 183
Query: 201 IVERSS*KSKSTDQALQAQTSKKGGFSGKGGYRGKGKSKDYKNSS*KQSQFQNQEDHDQP 260
+ K ++ QAL + G+ + Y+ SS
Sbjct: 184 ---KMRKKPENQGQALITE----------------GRGRSYQRSS--------------- 209
Query: 261 ESSSRRGGGSSNYKGGKRKFDRKKIR-CFNCNKIGHFSSECKAPSGGDTRGRTSDEANLA 319
G S +G + + ++R C+NCN+ GHF +C P G +G TS + N
Sbjct: 210 -----NNYGRSGARGKSKNRSKSRVRNCYNCNQPGHFKRDCPNPRKG--KGETSGQKNDD 262
Query: 320 KEGSITNEEPVTLMMVTEEGESNPATCNITDEEHVTLMMVTKEGGNYPGTWYLDSGCSNH 379
++ ++ + EE EE + L E W +D+ S+H
Sbjct: 263 NTAAMVQNNDNVVLFINEE------------EECMHLSGPESE-------WVVDTAASHH 303
Query: 380 MTGNKEWLINLDENKKSRVRFADDRFISAEGIGDVLVKREDGKDVVISEVLYVPGMKTNL 439
T ++ V+ + + GIGD+ +K G +V+ +V +VP ++ NL
Sbjct: 304 ATPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLKDVRHVPDLRMNL 363
Query: 440 ISMGQLLEKDFSMSMKKRHLEVFDPNERKIMKVPLTPNRTFQVKLTAIDSQCLTAELEDN 499
IS G L++D S I K + ++ + A+ E +
Sbjct: 364 IS-GIALDRDGYESYFANQKWRLTKGSLVIAK-GVARGTLYRTNAEICQGELNAAQDEIS 421
Query: 500 SWLWHQRFGHLNFKDLSSLKSKDMVHGLSQIKLPSKVCENCLVSKQPRAPFSSFTPTRST 559
LWH+R GH++ K L L K ++ + K C+ CL KQ R F + + R
Sbjct: 422 VDLWHKRMGHMSEKGLQILAKKSLISYAKGTTV--KPCDYCLFGKQHRVSFQT-SSERKL 478
Query: 560 AVLDVIYSDVCGPFETASIGGNKYFASFIDEYSRKMWVYLLKTKSEVFSVFKVFKTMAKK 619
+LD++YSDVCGP E S+GGNKYF +FID+ SRK+WVY+LKTK +VF VF+ F + ++
Sbjct: 479 NILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFHALVER 538
Query: 620 QSGRSIKVLRTDEGGEYCSNEMSNFCEENGILHEVTAPYTPQHNGVAERRNRTVLNMVRS 679
++GR +K LR+D GGEY S E +C +GI HE T P TPQHNGVAER NRT++ VRS
Sbjct: 539 ETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVRS 598
Query: 680 MLKGKSLPHRFCGEAVMTAVYVLNLCPTKSVDSQVPEAVWSGRKPSVKHLRIFGCLCHKH 739
ML+ LP F GEAV TA Y++N P+ + ++PE VW+ ++ S HL++FGC H
Sbjct: 599 MLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRAFAH 658
Query: 740 IPDQRRRKLDDKSETMIFIGYSSTG-AYKLYNPRTSQVEFSRDVVFEEHS-------AWK 791
+P ++R KLDDKS IFIGY Y+L++P +V SRDVVF E + K
Sbjct: 659 VPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRESEVRTAADMSEK 718
Query: 792 GKETMVVN--------DSMQRVNLDLDHDDSEGIESAVVDVPGTSQNQNQIQVHNPRPIR 843
K ++ N ++ D +G + V G ++ +V +P
Sbjct: 719 VKNGIIPNFVTIPSTSNNPTSAESTTDEVSEQGEQPGEVIEQGEQLDEGVEEVEHPTQGE 778
Query: 844 TKTLPARFSDYQLIAETEFNSDGDMIHMALLADANPVKFEEAI----KNKTWRLAMKEEL 899
+ P R S+ + + S ++ + D P +E + KN+ + AM+EE+
Sbjct: 779 EQHQPLRRSERPRVESRRYPSTE---YVLISDDREPESLKEVLSHPEKNQLMK-AMQEEM 834
Query: 900 ASIERNKTWDLVDLPANKTPISVKWVFKVKLNPDGSISKHKARLVVRGFMQRGGLDYSEV 959
S+++N T+ LV+LP K P+ KWVFK+K + D + ++KARLVV+GF Q+ G+D+ E+
Sbjct: 835 ESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGIDFDEI 894
Query: 960 FAPVARLETVRMIVALASWKNWDLWQLDVKSAFLNGPLEEEVYITQPPGFEIKGSEHKVL 1019
F+PV ++ ++R I++LA+ + ++ QLDVK+AFL+G LEEE+Y+ QP GFE+ G +H V
Sbjct: 895 FSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGKKHMVC 954
Query: 1020 KLRKALYGLKQAPRAWNKRIDTFLSQTGFHKCSVEHGVYVKSCDSGGVLLLCLYVDDLLI 1079
KL K+LYGLKQAPR W + D+F+ + K + VY K ++L LYVDD+LI
Sbjct: 955 KLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYVDDMLI 1014
Query: 1080 TGSSYSKIQAVKRSLNNEFEMTDLGKLSYFLGIEFV--QTGEGILMHQRKYILEVLKRFN 1137
G I +K L+ F+M DLG LG++ V +T + + Q KYI VL+RFN
Sbjct: 1015 VGKDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERVLERFN 1074
Query: 1138 LLSCNPAETPVEGNLKLG--LC----EEEAEVDSTMFRQLVGCLRF--ICHSRLEISYGV 1189
+ + P TP+ G+LKL +C EE+ + + VG L + +C +R +I++ V
Sbjct: 1075 MKNAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVPYSSAVGSLMYAMVC-TRPDIAHAV 1133
Query: 1190 GLVSRFMSCPRQSHLAAAKRILRYLKGTPNHGVFFPKHLPSQKEDGNLHLVAYTDSDWCG 1249
G+VSRF+ P + H A K ILRYL+GT + F P K YTD+D G
Sbjct: 1134 GVVSRFLENPGKEHWEAVKWILRYLRGTTGDCLCFGGSDPILK--------GYTDADMAG 1185
Query: 1250 DQVDRRSTMGYVFFFGKAPISWSSKKQAAVALSTCEAEYIAACSAACQGLWIQALLEELG 1309
D +R+S+ GY+F F ISW SK Q VALST EAEYIAA + +W++ L+ELG
Sbjct: 1186 DIDNRKSSTGYLFTFSGGAISWQSKLQKCVALSTTEAEYIAATETGKEMIWLKRFLQELG 1245
Query: 1310 LKTDEAVQLMVDNKSAIDLAKNPVSHGRSKHIETKYHFLRDQVSKEKIKLQHCGTDLQIA 1369
L E V + D++SAIDL+KN + H R+KHI+ +YH++R+ V E +K+ T+ A
Sbjct: 1246 LHQKEYV-VYCDSQSAIDLSKNSMYHARTKHIDVRYHWIREMVDDESLKVLKISTNENPA 1304
Query: 1370 DVFTKPLKADRFKTLKKMMNV 1390
D+ TK + ++F+ K+++ +
Sbjct: 1305 DMLTKVVPRNKFELCKELVGM 1325
>COPI_DROME (P04146) Copia protein (Gag-int-pol protein) [Contains:
Copia VLP protein; Copia protease (EC 3.4.23.-)]
Length = 1409
Score = 360 bits (923), Expect = 2e-98
Identities = 212/595 (35%), Positives = 332/595 (55%), Gaps = 28/595 (4%)
Query: 806 NLDLDHDDSEGIESAVVDVPGTSQNQNQIQVHNPRPIRTKTLPARFSDYQLIAETEFNSD 865
N + + +E ++ +D P + I++ N R R KT P Q+ E NS
Sbjct: 826 NESRESETAEHLKEIGIDNPTKNDG---IEIINRRSERLKTKP------QISYNEEDNSL 876
Query: 866 GDMIHMA-LLADANPVKFEEAI---KNKTWRLAMKEELASIERNKTWDLVDLPANKTPIS 921
++ A + + P F+E +W A+ EL + + N TW + P NK +
Sbjct: 877 NKVVLNAHTIFNDVPNSFDEIQYRDDKSSWEEAINTELNAHKINNTWTITKRPENKNIVD 936
Query: 922 VKWVFKVKLNPDGSISKHKARLVVRGFMQRGGLDYSEVFAPVARLETVRMIVALASWKNW 981
+WVF VK N G+ ++KARLV RGF Q+ +DY E FAPVAR+ + R I++L N
Sbjct: 937 SRWVFSVKYNELGNPIRYKARLVARGFTQKYQIDYEETFAPVARISSFRFILSLVIQYNL 996
Query: 982 DLWQLDVKSAFLNGPLEEEVYITQPPGFEIKGSEHKVLKLRKALYGLKQAPRAWNKRIDT 1041
+ Q+DVK+AFLNG L+EE+Y+ P G I + V KL KA+YGLKQA R W + +
Sbjct: 997 KVHQMDVKTAFLNGTLKEEIYMRLPQG--ISCNSDNVCKLNKAIYGLKQAARCWFEVFEQ 1054
Query: 1042 FLSQTGFHKCSVEHGVYVKSCDSGGV---LLLCLYVDDLLITGSSYSKIQAVKRSLNNEF 1098
L + F SV+ +Y+ D G + + + LYVDD++I +++ KR L +F
Sbjct: 1055 ALKECEFVNSSVDRCIYI--LDKGNINENIYVLLYVDDVVIATGDMTRMNNFKRYLMEKF 1112
Query: 1099 EMTDLGKLSYFLGIEFVQTGEGILMHQRKYILEVLKRFNLLSCNPAETPVEGNLKLGLCE 1158
MTDL ++ +F+GI + I + Q Y+ ++L +FN+ +CN TP+ + L
Sbjct: 1113 RMTDLNEIKHFIGIRIEMQEDKIYLSQSAYVKKILSKFNMENCNAVSTPLPSKINYELLN 1172
Query: 1159 EEAEVDSTMFRQLVGCLRFICH-SRLEISYGVGLVSRFMSCPRQSHLAAAKRILRYLKGT 1217
+ + + T R L+GCL +I +R +++ V ++SR+ S KR+LRYLKGT
Sbjct: 1173 SDEDCN-TPCRSLIGCLMYIMLCTRPDLTTAVNILSRYSSKNNSELWQNLKRVLRYLKGT 1231
Query: 1218 PNHGVFFPKHLPSQKEDGNLHLVAYTDSDWCGDQVDRRSTMGYVF-FFGKAPISWSSKKQ 1276
+ + F K+L + + ++ Y DSDW G ++DR+ST GY+F F I W++K+Q
Sbjct: 1232 IDMKLIFKKNLAFENK-----IIGYVDSDWAGSEIDRKSTTGYLFKMFDFNLICWNTKRQ 1286
Query: 1277 AAVALSTCEAEYIAACSAACQGLWIQALLEELGLKTDEAVQLMVDNKSAIDLAKNPVSHG 1336
+VA S+ EAEY+A A + LW++ LL + +K + +++ DN+ I +A NP H
Sbjct: 1287 NSVAASSTEAEYMALFEAVREALWLKFLLTSINIKLENPIKIYEDNQGCISIANNPSCHK 1346
Query: 1337 RSKHIETKYHFLRDQVSKEKIKLQHCGTDLQIADVFTKPLKADRFKTLKKMMNVL 1391
R+KHI+ KYHF R+QV I L++ T+ Q+AD+FTKPL A RF L+ + +L
Sbjct: 1347 RAKHIDIKYHFAREQVQNNVICLEYIPTENQLADIFTKPLPAARFVELRDKLGLL 1401
Score = 255 bits (651), Expect = 7e-67
Identities = 200/763 (26%), Positives = 354/763 (46%), Gaps = 91/763 (11%)
Query: 116 MRRQFELLQMETNESIADFFNRIISLTNLMKACGEKMTDQAIVEKVLRTLTPKFDHVVVA 175
+R++ L++ + S+ F+ L + + A G K+ + + +L TL +D ++ A
Sbjct: 99 LRKRLLSLKLSSEMSLLSHFHIFDELISELLAAGAKIEEMDKISHLLITLPSCYDGIITA 158
Query: 176 IEESKKLENLKIEELQGSLEAHEQRIVERSS*KSKSTDQALQAQTSKKGGFSGKGGYRGK 235
IE + ENL + ++ L E +I + SK A+
Sbjct: 159 IETLSE-ENLTLAFVKNRLLDQEIKIKNDHNDTSKKVMNAIVHNN--------------- 202
Query: 236 GKSKDYKNSS*KQSQFQNQEDHDQPESSSRRGGGSSNYKGGKRKFDRKKIRCFNCNKIGH 295
N++ K + F+N+ +P+ + G+S YK ++C +C + GH
Sbjct: 203 -------NNTYKNNLFKNRVT--KPKKIFK---GNSKYK----------VKCHHCGREGH 240
Query: 296 FSSECKAPSGGDTRGRTSDEANLAKEGSITNEEPVTLMMVTEEGESNPATCNITDEEHVT 355
+C R + N E + + + +E + N
Sbjct: 241 IKKDCFHYK------RILNNKNKENEKQVQTATSHGIAFMVKEVNNTSVMDNCG------ 288
Query: 356 LMMVTKEGGNYPGTWYLDSGCSNHMTGNKE-WLINLDENKKSRVRFADD-RFISAEGIGD 413
+ LDSG S+H+ ++ + +++ ++ A FI A G
Sbjct: 289 --------------FVLDSGASDHLINDESLYTDSVEVVPPLKIAVAKQGEFIYATKRG- 333
Query: 414 VLVKREDGKDVVISEVLYVPGMKTNLISMGQLLEKDFSMSMKKRHLEVFDPNERKIMKVP 473
+V+ + ++ + +VL+ NL+S+ +L E S+ K + + + + V
Sbjct: 334 -IVRLRNDHEITLEDVLFCKEAAGNLMSVKRLQEAGMSIEFDKSGVTI----SKNGLMVV 388
Query: 474 LTPNRTFQVKLTAIDSQCLTAELEDNSWLWHQRFGHLNFKDLSSLKSKDMVHG---LSQI 530
V + + + A+ ++N LWH+RFGH++ L +K K+M L+ +
Sbjct: 389 KNSGMLNNVPVINFQAYSINAKHKNNFRLWHERFGHISDGKLLEIKRKNMFSDQSLLNNL 448
Query: 531 KLPSKVCENCLVSKQPRAPFSSFTP-TRSTAVLDVIYSDVCGPFETASIGGNKYFASFID 589
+L ++CE CL KQ R PF T L V++SDVCGP ++ YF F+D
Sbjct: 449 ELSCEICEPCLNGKQARLPFKQLKDKTHIKRPLFVVHSDVCGPITPVTLDDKNYFVIFVD 508
Query: 590 EYSRKMWVYLLKTKSEVFSVFKVFKTMAKKQSGRSIKVLRTDEGGEYCSNEMSNFCEENG 649
+++ YL+K KS+VFS+F+ F ++ + L D G EY SNEM FC + G
Sbjct: 509 QFTHYCVTYLIKYKSDVFSMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKG 568
Query: 650 ILHEVTAPYTPQHNGVAERRNRTVLNMVRSMLKGKSLPHRFCGEAVMTAVYVLNLCPTKS 709
I + +T P+TPQ NGV+ER RT+ R+M+ G L F GEAV+TA Y++N P+++
Sbjct: 569 ISYHLTVPHTPQLNGVSERMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRA 628
Query: 710 V--DSQVPEAVWSGRKPSVKHLRIFGCLCHKHIPDQRRRKLDDKSETMIFIGYSSTGAYK 767
+ S+ P +W +KP +KHLR+FG + HI + ++ K DDKS IF+GY G +K
Sbjct: 629 LVDSSKTPYEMWHNKKPYLKHLRVFGATVYVHIKN-KQGKFDDKSFKSIFVGYEPNG-FK 686
Query: 768 LYNPRTSQVEFSRDVVFEEHSAWKGK----ETMVVNDSMQRVNLDLDHDDSEGIESAVVD 823
L++ + +RDVV +E + + ET+ + DS + N + +D + I++ +
Sbjct: 687 LWDAVNEKFIVARDVVVDETNMVNSRAVKFETVFLKDSKESENKNFPNDSRKIIQT---E 743
Query: 824 VPGTSQNQNQIQ-VHNPRPIRTKTLPARFSDYQLIAETEFNSD 865
P S+ + IQ + + + K P +D + I +TEF ++
Sbjct: 744 FPNESKECDNIQFLKDSKESENKNFP---NDSRKIIQTEFPNE 783
>YCH4_YEAST (P25600) Transposon Ty5-1 34.5 kDa hypothetical protein
Length = 308
Score = 167 bits (423), Expect = 2e-40
Identities = 108/309 (34%), Positives = 164/309 (52%), Gaps = 11/309 (3%)
Query: 986 LDVKSAFLNGPLEEEVYITQPPGFEIKGSEHKVLKLRKALYGLKQAPRAWNKRIDTFLSQ 1045
+DV +AFLN ++E +Y+ QPPGF + + V +L +YGLKQAP WN+ I+ L +
Sbjct: 1 MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60
Query: 1046 TGFHKCSVEHGVYVKSCDSGGVLLLCLYVDDLLITGSSYSKIQAVKRSLNNEFEMTDLGK 1105
GF + EHG+Y +S S G + + +YVDDLL+ S VK+ L + M DLGK
Sbjct: 61 IGFCRHEGEHGLYFRS-TSDGPIYIGVYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGK 119
Query: 1106 LSYFLGIEFVQTGEG-ILMHQRKYILEVLKRFNLLSCNPAETPVEGNLKLGLCEEEAEVD 1164
+ FLG+ Q+ G I + + YI + + + +TP+ + L D
Sbjct: 120 VDKFLGLNIHQSTNGDITLSLQDYIAKAASESEINTFKLTQTPLCNSKPLFETTSPHLKD 179
Query: 1165 STMFRQLVGCLRFICHS-RLEISYGVGLVSRFMSCPRQSHLAAAKRILRYLKGTPNHGVF 1223
T ++ +VG L F ++ R +ISY V L+SRF+ PR HL +A+R+LRYL T + +
Sbjct: 180 ITPYQSIVGQLLFCANTGRPDISYPVSLLSRFLREPRAIHLESARRVLRYLYTTRSMCLK 239
Query: 1224 FPKHLPSQKEDGNLHLVAYTDSDWCGDQVDRRSTMGYVFFFGKAPISWSSKK-QAAVALS 1282
+ + + L Y D+ ST GYV AP++WSSKK + + +
Sbjct: 240 Y-------RSGSQVALTVYCDASHGAIHDLPHSTGGYVTLLAGAPVTWSSKKLKGVIPVP 292
Query: 1283 TCEAEYIAA 1291
+ EAEYI A
Sbjct: 293 STEAEYITA 301
>M810_ARATH (P92519) Hypothetical mitochondrial protein AtMg00810
(ORF240b)
Length = 240
Score = 154 bits (388), Expect = 2e-36
Identities = 90/237 (37%), Positives = 131/237 (54%), Gaps = 10/237 (4%)
Query: 1068 LLLCLYVDDLLITGSSYSKIQAVKRSLNNEFEMTDLGKLSYFLGIEFVQTGEGILMHQRK 1127
+ L LYVDD+L+TGSS + + + L++ F M DLG + YFLGI+ G+ + Q K
Sbjct: 1 MYLLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTK 60
Query: 1128 YILEVLKRFNLLSCNPAETPVEGNLKLGLCEEEAEV-DSTMFRQLVGCLRFICHSRLEIS 1186
Y ++L +L C P TP+ LKL A+ D + FR +VG L+++ +R +IS
Sbjct: 61 YAEQILNNAGMLDCKPMSTPLP--LKLNSSVSTAKYPDPSDFRSIVGALQYLTLTRPDIS 118
Query: 1187 YGVGLVSRFMSCPRQSHLAAAKRILRYLKGTPNHGVFFPKHLPSQKEDGNLHLVAYTDSD 1246
Y V +V + M P + KR+LRY+KGT HG++ K + L++ A+ DSD
Sbjct: 119 YAVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHK-------NSKLNVQAFCDSD 171
Query: 1247 WCGDQVDRRSTMGYVFFFGKAPISWSSKKQAAVALSTCEAEYIAACSAACQGLWIQA 1303
W G RRST G+ F G ISWS+K+Q V+ S+ E EY A A + W A
Sbjct: 172 WAGCTSTRRSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTWSSA 228
>YJL3_YEAST (P47024) Transposon Ty4 207.7 kDa hypothetical protein
Length = 1803
Score = 148 bits (373), Expect = 1e-34
Identities = 132/516 (25%), Positives = 233/516 (44%), Gaps = 46/516 (8%)
Query: 891 WRLAMKEELASIERNKTWDLVDLPANKTPIS------VKWVFKVKLNPDGSISKHKARLV 944
++ A +EL +++ K +D VD+ +++ I +F K N +KAR+V
Sbjct: 1289 YKQAYHKELQNLKDMKVFD-VDVKYSRSEIPDNLIVPTNTIFTKKRN-----GIYKARIV 1342
Query: 945 VRGFMQRGGLDYSEVFAPVARLETVRMIVALASWKNWDLWQLDVKSAFLNGPLEEEVYIT 1004
RG Q YS + +++ + +A+ +N + LD+ AFL LEEE+YI
Sbjct: 1343 CRGDTQSPDT-YSVITTESLNHNHIKIFLMIANNRNMFMKTLDINHAFLYAKLEEEIYIP 1401
Query: 1005 QPPGFEIKGSEHKVLKLRKALYGLKQAPRAWNKRIDTFLSQTGFHKCSVEHGVYVKSCDS 1064
P V+KL KALYGLKQ+P+ WN + +L+ G S G+Y +
Sbjct: 1402 HPH------DRRCVVKLNKALYGLKQSPKEWNDHLRQYLNGIGLKDNSYTPGLYQTEDKN 1455
Query: 1065 GGVLLLCLYVDDLLITGSSYSKIQAVKRSLNNEFEMTDLGKL------SYFLGIEFVQTG 1118
L++ +YVDD +I S+ ++ L + FE+ G L + LG++ V
Sbjct: 1456 ---LMIAVYVDDCVIAASNEQRLDEFINKLKSNFELKITGTLIDDVLDTDILGMDLVYNK 1512
Query: 1119 E--GILMHQRKYILEVLKRFN--LLSCNPAETPVEGNLK-------LGLCEEEAEVDSTM 1167
I + + +I + K++N L + P K L + EEE
Sbjct: 1513 RLGTIDLTLKSFINRMDKKYNEELKKIRKSSIPHMSTYKIDPKKDVLQMSEEEFRQGVLK 1572
Query: 1168 FRQLVGCLRFICHS-RLEISYGVGLVSRFMSCPRQSHLAAAKRILRYLKGTPNHGVFFPK 1226
+QL+G L ++ H R +I + V V+R ++ P + +I++YL + G+ + +
Sbjct: 1573 LQQLLGELNYVRHKCRYDIEFAVKKVARLVNYPHERVFYMIYKIIQYLVRYKDIGIHYDR 1632
Query: 1227 HLPSQKEDGNLHLVAYTDSDWCGDQVDRRSTMGYVFFFGKAPISWSSKKQAAVALSTCEA 1286
K+ ++A TD+ G + D +S +G + ++G + S K +S+ EA
Sbjct: 1633 DCNKDKK-----VIAITDAS-VGSEYDAQSRIGVILWYGMNIFNVYSNKSTNRCVSSTEA 1686
Query: 1287 EYIAACSAACQGLWIQALLEELGLKTDEAVQLMVDNKSAIDLAKNPVSHGRSKHIETKYH 1346
E A ++ L+ELG + + ++ D+K AI + K K
Sbjct: 1687 ELHAIYEGYADSETLKVTLKELGEGDNNDIVMITDSKPAIQGLNRSYQQPKEKFTWIKTE 1746
Query: 1347 FLRDQVSKEKIKLQHCGTDLQIADVFTKPLKADRFK 1382
+++++ ++ IKL IAD+ TKP+ A FK
Sbjct: 1747 IIKEKIKEKSIKLLKITGKGNIADLLTKPVSASDFK 1782
Score = 75.5 bits (184), Expect = 1e-12
Identities = 101/479 (21%), Positives = 203/479 (42%), Gaps = 49/479 (10%)
Query: 372 LDSGCSNHMTGNKEWLINLDENKKSRVRFA--DDRFISAEGIGDVLVKR-EDGKDVVISE 428
+D+G ++T +K L N +++ +S F + +S +G G + +K + D
Sbjct: 414 IDTGSGVNITNDKTLLHNYEDSNRSTRFFGIGKNSSVSVKGYGYIKIKNGHNNTDNKCLL 473
Query: 429 VLYVPGMKTNLISMGQLLEKDFSMSMKKRHLE---------------VFDPNERKIMKVP 473
YVP ++ +IS L +K M + +++ V ++++ P
Sbjct: 474 TYYVPEEESTIISCYDLAKKT-KMVLSRKYTRLGNKIIKIKTKIVNGVIHVKMNELIERP 532
Query: 474 LTPNRTFQVKLTA-----IDSQCLTAELEDNSWLWHQRFGHLNFKDL-SSLKSKDMVHGL 527
++ +K T+ ++ + +T LED H+R GH + + +S+K L
Sbjct: 533 SDDSKINAIKPTSSPGFKLNKRSIT--LEDA----HKRMGHTGIQQIENSIKHNHYEESL 586
Query: 528 SQIKLPSKV-CENCLVSKQPRAPFSSFTPTRSTAVLD-----VIYSDVCGPFETASIGGN 581
IK P++ C+ C +SK + + +T + + D D+ GP +++
Sbjct: 587 DLIKEPNEFWCQTCKISKATKR--NHYTGSMNNHSTDHEPGSSWCMDIFGPVSSSNADTK 644
Query: 582 KYFASFIDEYSRKMWVYLLKTKSEVFSVFKVFKTM--AKKQSGRSIKVLRTDEGGEYCSN 639
+Y +D +R K+ + +V K + + Q R ++ + +D G E+ ++
Sbjct: 645 RYMLIMVDNNTRYCMTSTHFNKNAETILAQVRKNIQYVETQFDRKVREINSDRGTEFTND 704
Query: 640 EMSNFCEENGILHEVTAPYTPQHNGVAERRNRTVLNMVRSMLKGKSLPHRFCGEAVMTAV 699
++ + GI H +T+ NG AER RT++ ++L+ +L +F AV +A
Sbjct: 705 QIEEYFISKGIHHILTSTQDHAANGRAERYIRTIITDATTLLRQSNLRVKFWEYAVTSAT 764
Query: 700 YVLNLCPTKSVDSQVPEAVWSGRKPSVKHLRIFGCLCHKHIP-DQRRRKLDDKS-ETMIF 757
+ N KS +A+ R+P L F K I + +KL ++I
Sbjct: 765 NIRNYLEHKSTGKLPLKAI--SRQPVTVRLMSFLPFGEKGIIWNHNHKKLKPSGLPSIIL 822
Query: 758 IGYSSTGAYKLYNPRTSQVEFSRDVVFEEHSA-WKGKETMVVNDSMQRVNLDLDHDDSE 815
++ YK + P +++ S + ++ + + T +N S Q D+DD E
Sbjct: 823 CKDPNSYGYKFFIPSKNKIVTSDNYTIPNYTMDGRVRNTQNINKSHQ---FSSDNDDEE 878
>YMD9_YEAST (Q03434) Transposon Ty1 protein B
Length = 1328
Score = 139 bits (351), Expect = 4e-32
Identities = 143/583 (24%), Positives = 271/583 (45%), Gaps = 46/583 (7%)
Query: 830 NQNQIQVH----NPRPIRTKTLPARFSDYQLIAETEFNSDGDMIHMALLADANPVKFEEA 885
N+ +I+V N + +R+ P LIA + I L D + + +
Sbjct: 758 NETEIKVSRDTWNTKNMRSLEPPRSKKRIHLIAAVKAVKSIKPIRTTLRYD-EAITYNKD 816
Query: 886 IKNKTWRL-AMKEELASIERNKTWDLVDLPANKT-----PISVKWVFKVKLNPDGSISKH 939
IK K + A +E+ + + KTWD + K I+ ++F K DG+ H
Sbjct: 817 IKEKEKYIEAYHKEVNQLLKMKTWDTDEYYDRKEIDPKRVINSMFIFNKKR--DGT---H 871
Query: 940 KARLVVRGFMQRGGLDYSEVFAPVARLETVRMIVALASWKNWDLWQLDVKSAFLNGPLEE 999
KAR V RG +Q S + + + ++LA N+ + QLD+ SA+L ++E
Sbjct: 872 KARFVARGDIQHPDTYDSGMQSNTVHHYALMTSLSLALDNNYYITQLDISSAYLYADIKE 931
Query: 1000 EVYITQPPGFEIKGSEHKVLKLRKALYGLKQAPRAWNKRIDTFLSQTGFHKCSVEHGVYV 1059
E+YI PP G K+++L+K+LYGLKQ+ W + I ++L + +C +E
Sbjct: 932 ELYIRPPPHL---GMNDKLIRLKKSLYGLKQSGANWYETIKSYLIK----QCGMEEVRGW 984
Query: 1060 KSCDSGGVLLLCLYVDDLLITGSSYSKIQAVKRSLNNEFE--MTDLGK----LSY-FLGI 1112
+ +CL+VDD+++ + + + +L +++ + +LG+ + Y LG+
Sbjct: 985 SCVFKNSQVTICLFVDDMILFSKDLNANKKIITTLKKQYDTKIINLGESDNEIQYDILGL 1044
Query: 1113 EF-VQTGEGILMHQRKYILEVLKRFNL-LSCNPAETPVEGNLKLGLCEEEAEVDSTMFRQ 1170
E Q G+ + + + E + + N+ L+ + G L + ++E E+D +++
Sbjct: 1045 EIKYQRGKYMKLGMENSLTEKIPKLNVPLNPKGRKLSAPGQPGLYIDQDELEIDEDEYKE 1104
Query: 1171 -------LVGCLRFICHS-RLEISYGVGLVSRFMSCPRQSHLAAAKRILRYLKGTPNHGV 1222
L+G ++ + R ++ Y + +++ + P + L +++++ T + +
Sbjct: 1105 KVHEMQKLIGLASYVGYKFRFDLLYYINTLAQHILFPSRQVLDMTYELIQFMWDTRDKQL 1164
Query: 1223 FFPKHLPSQKEDGNLHLVAYTDSDWCGDQVDRRSTMGYVFFFGKAPISWSSKKQAAVALS 1282
+ K+ P++ ++ LVA +D+ + G+Q +S +G ++ I S K + S
Sbjct: 1165 IWHKNKPTEPDN---KLVAISDASY-GNQPYYKSQIGNIYLLNGKVIGGKSTKASLTCTS 1220
Query: 1283 TCEAEYIAACSAACQGLWIQALLEELGLKTDEAVQLMVDNKSAIDLAKNPVSHG-RSKHI 1341
T EAE A + + L++EL K L+ D++S I + K+ R++
Sbjct: 1221 TTEAEIHAISESVPLLNNLSYLIQELN-KKPIIKGLLTDSRSTISIIKSTNEEKFRNRFF 1279
Query: 1342 ETKYHFLRDQVSKEKIKLQHCGTDLQIADVFTKPLKADRFKTL 1384
TK LRD+VS + + + T IADV TKPL FK L
Sbjct: 1280 GTKAMRLRDEVSGNNLYVYYIETKKNIADVMTKPLPIKTFKLL 1322
Score = 100 bits (250), Expect = 2e-20
Identities = 98/377 (25%), Positives = 156/377 (40%), Gaps = 26/377 (6%)
Query: 367 PGTWYLDSGCSNHMTGNKEWLINLDENKKSRVRFADDRFISAEGIGDVLVKREDGKDVVI 426
PG LDSG S + + + + N V A R I IGD+ +D I
Sbjct: 28 PGHLLLDSGASRTLIRSAHHIHSASSNPDINVVDAQKRNIPINAIGDLQFHFQDNTKTSI 87
Query: 427 SEVLYVPGMKTNLISMGQLLEKDFSMSMKKRHLEVFDPNERK----------IMKVPLTP 476
+VL+ P + +L+S+ +L D + K LE D + K L P
Sbjct: 88 -KVLHTPNIAYDLLSLNELAAVDITACFTKNVLERSDGTVLAPIVKYGDFYWVSKKYLLP 146
Query: 477 NRTFQVKLTAIDSQCLTAELEDNSWLW-HQRFGHLNFKDLS-SLKSKDMVH-GLSQIKLP 533
+ + + I++ + + + H+ H N + + SLK+ + + S +
Sbjct: 147 SN---ISVPTINNVHTSESTRKYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDRS 203
Query: 534 SKV---CENCLVSKQPRAPF---SSFTPTRSTAVLDVIYSDVCGPFETASIGGNKYFASF 587
S + C +CL+ K + S S +++D+ GP YF SF
Sbjct: 204 SAIDYQCPDCLIGKSTKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISF 263
Query: 588 IDEYSRKMWVYLLKTKSE--VFSVFKVFKTMAKKQSGRSIKVLRTDEGGEYCSNEMSNFC 645
DE ++ WVY L + E + VF K Q S+ V++ D G EY + + F
Sbjct: 264 TDETTKFRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFL 323
Query: 646 EENGILHEVTAPYTPQHNGVAERRNRTVLNMVRSMLKGKSLPHRFCGEAVMTAVYVLN-L 704
E+NGI T + +GVAER NRT+L+ R+ L+ LP+ A+ + V N L
Sbjct: 324 EKNGITPCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFSTIVRNSL 383
Query: 705 CPTKSVDSQVPEAVWSG 721
KS S A +G
Sbjct: 384 ASPKSKKSARQHAGLAG 400
>YME4_YEAST (Q04711) Transposon Ty1 protein B
Length = 1328
Score = 137 bits (346), Expect = 2e-31
Identities = 144/582 (24%), Positives = 268/582 (45%), Gaps = 44/582 (7%)
Query: 830 NQNQIQVH----NPRPIRTKTLPARFSDYQLIAETEFNSDGDMIHMALLADANPVKFEEA 885
N+ +I+V N + +R+ P LIA + I L D + + +
Sbjct: 758 NETEIKVSRDTWNTKNMRSLEPPRSKKRIHLIAAVKAVKSIKPIRTTLRYD-EAITYNKD 816
Query: 886 IKNKTWRL-AMKEELASIERNKTWDLVDLPANKTPISVKWV----FKVKLNPDGSISKHK 940
IK K + A +E+ + + TWD D ++ I K V F DG+ HK
Sbjct: 817 IKEKEKYIEAYHKEVNQLLKMNTWD-TDKYYDRKEIDPKRVINSMFIFNRKRDGT---HK 872
Query: 941 ARLVVRGFMQRGGLDYSEVFAPVARLETVRMIVALASWKNWDLWQLDVKSAFLNGPLEEE 1000
AR V RG +Q S + + + ++LA N+ + QLD+ SA+L ++EE
Sbjct: 873 ARFVARGDIQHPDTYDSGMQSNTVHHYALMTSLSLALDNNYYITQLDISSAYLYADIKEE 932
Query: 1001 VYITQPPGFEIKGSEHKVLKLRKALYGLKQAPRAWNKRIDTFLSQTGFHKCSVEHGVYVK 1060
+YI PP G K+++L+K+LYGLKQ+ W + I ++L + +C +E
Sbjct: 933 LYIRPPPHL---GMNDKLIRLKKSLYGLKQSGANWYETIKSYLIK----QCGMEEVRGWS 985
Query: 1061 SCDSGGVLLLCLYVDDLLITGSSYSKIQAVKRSLNNEFE--MTDLGK----LSY-FLGIE 1113
+ +CL+VDD+++ + + + +L +++ + +LG+ + Y LG+E
Sbjct: 986 CVFKNSQVTICLFVDDMILFSKDLNANKKIITTLKKQYDTKIINLGESDNEIQYDILGLE 1045
Query: 1114 F-VQTGEGILMHQRKYILEVLKRFNL-LSCNPAETPVEGNLKLGLCEEEAEVDSTMFRQ- 1170
Q G+ + + + E + + N+ L+ + G L + ++E E+D +++
Sbjct: 1046 IKYQRGKYMKLGMENSLTEKIPKLNVPLNPKGRKLSAPGQPGLYIDQDELEIDEDEYKEK 1105
Query: 1171 ------LVGCLRFICHS-RLEISYGVGLVSRFMSCPRQSHLAAAKRILRYLKGTPNHGVF 1223
L+G ++ + R ++ Y + +++ + P + L +++++ T + +
Sbjct: 1106 VHEMQKLIGLASYVGYKFRFDLLYYINTLAQHILFPSRQVLDMTYELIQFMWDTRDKQLI 1165
Query: 1224 FPKHLPSQKEDGNLHLVAYTDSDWCGDQVDRRSTMGYVFFFGKAPISWSSKKQAAVALST 1283
+ K+ P++ ++ LVA +D+ + G+Q +S +G ++ I S K + ST
Sbjct: 1166 WHKNKPTEPDN---KLVAISDASY-GNQPYYKSQIGNIYLLNGKVIGGKSTKASLTCTST 1221
Query: 1284 CEAEYIAACSAACQGLWIQALLEELGLKTDEAVQLMVDNKSAIDLA-KNPVSHGRSKHIE 1342
EAE A + + L++EL K L+ D+KS I + N R++
Sbjct: 1222 TEAEIHAISESVPLLNNLSHLVQELN-KKPITKGLLTDSKSTISIIISNNEEKFRNRFFG 1280
Query: 1343 TKYHFLRDQVSKEKIKLQHCGTDLQIADVFTKPLKADRFKTL 1384
TK LRD+VS + + + T IADV TKPL FK L
Sbjct: 1281 TKAMRLRDEVSGNHLHVCYIETKKNIADVMTKPLPIKTFKLL 1322
Score = 100 bits (250), Expect = 2e-20
Identities = 98/377 (25%), Positives = 156/377 (40%), Gaps = 26/377 (6%)
Query: 367 PGTWYLDSGCSNHMTGNKEWLINLDENKKSRVRFADDRFISAEGIGDVLVKREDGKDVVI 426
PG LDSG S + + + + N V A R I IGD+ +D I
Sbjct: 28 PGHLLLDSGASRTLIRSAHHIHSASSNPDINVVDAQKRNIPINAIGDLQFHFQDNTKTSI 87
Query: 427 SEVLYVPGMKTNLISMGQLLEKDFSMSMKKRHLEVFDPNERK----------IMKVPLTP 476
+VL+ P + +L+S+ +L D + K LE D + K L P
Sbjct: 88 -KVLHTPNIAYDLLSLNELAAVDITACFTKNVLERSDGTVLAPIVKYGDFYWVSKKYLLP 146
Query: 477 NRTFQVKLTAIDSQCLTAELEDNSWLW-HQRFGHLNFKDLS-SLKSKDMVH-GLSQIKLP 533
+ + + I++ + + + H+ H N + + SLK+ + + S +
Sbjct: 147 SN---ISVPTINNVHTSESTRKYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDWS 203
Query: 534 SKV---CENCLVSKQPRAPF---SSFTPTRSTAVLDVIYSDVCGPFETASIGGNKYFASF 587
S + C +CL+ K + S S +++D+ GP YF SF
Sbjct: 204 SAIDYQCPDCLIGKSTKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISF 263
Query: 588 IDEYSRKMWVYLLKTKSE--VFSVFKVFKTMAKKQSGRSIKVLRTDEGGEYCSNEMSNFC 645
DE ++ WVY L + E + VF K Q S+ V++ D G EY + + F
Sbjct: 264 TDETTKFRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFL 323
Query: 646 EENGILHEVTAPYTPQHNGVAERRNRTVLNMVRSMLKGKSLPHRFCGEAVMTAVYVLN-L 704
E+NGI T + +GVAER NRT+L+ R+ L+ LP+ A+ + V N L
Sbjct: 324 EKNGITPCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFSTIVRNSL 383
Query: 705 CPTKSVDSQVPEAVWSG 721
KS S A +G
Sbjct: 384 ASPKSKKSARQHAGLAG 400
>YJZ7_YEAST (P47098) Transposon Ty1 protein B
Length = 1755
Score = 137 bits (344), Expect = 3e-31
Identities = 142/583 (24%), Positives = 270/583 (45%), Gaps = 46/583 (7%)
Query: 830 NQNQIQVH----NPRPIRTKTLPARFSDYQLIAETEFNSDGDMIHMALLADANPVKFEEA 885
N+ +I+V N + +R+ P LIA + I L D + + +
Sbjct: 1185 NETEIKVSRDTWNTKNMRSLEPPRSKKRIHLIAAVKAVKSIKPIRTTLRYD-EAITYNKD 1243
Query: 886 IKNKTWRL-AMKEELASIERNKTWDLVDLPANKT-----PISVKWVFKVKLNPDGSISKH 939
IK K + A +E+ + + KTWD + K I+ ++F K DG+ H
Sbjct: 1244 IKEKEKYIEAYHKEVNQLLKMKTWDTDEYYDRKEIDPKRVINSMFIFNKKR--DGT---H 1298
Query: 940 KARLVVRGFMQRGGLDYSEVFAPVARLETVRMIVALASWKNWDLWQLDVKSAFLNGPLEE 999
KAR V RG +Q + + + + ++LA N+ + QLD+ SA+L ++E
Sbjct: 1299 KARFVARGDIQHPDTYDTGMQSNTVHHYALMTSLSLALDNNYYITQLDISSAYLYADIKE 1358
Query: 1000 EVYITQPPGFEIKGSEHKVLKLRKALYGLKQAPRAWNKRIDTFLSQTGFHKCSVEHGVYV 1059
E+YI PP G K+++L+K+ YGLKQ+ W + I ++L + +C +E
Sbjct: 1359 ELYIRPPPHL---GMNDKLIRLKKSHYGLKQSGANWYETIKSYLIK----QCGMEEVRGW 1411
Query: 1060 KSCDSGGVLLLCLYVDDLLITGSSYSKIQAVKRSLNNEFE--MTDLGK----LSY-FLGI 1112
+ +CL+VDD+++ + + + +L +++ + +LG+ + Y LG+
Sbjct: 1412 SCVFKNSQVTICLFVDDMILFSKDLNANKKIITTLKKQYDTKIINLGESDNEIQYDILGL 1471
Query: 1113 EF-VQTGEGILMHQRKYILEVLKRFNL-LSCNPAETPVEGNLKLGLCEEEAEVDSTMFRQ 1170
E Q G+ + + + E + + N+ L+ + G L + ++E E+D +++
Sbjct: 1472 EIKYQRGKYMKLGMENSLTEKIPKLNVPLNPKGRKLSAPGQPGLYIDQDELEIDEDEYKE 1531
Query: 1171 -------LVGCLRFICHS-RLEISYGVGLVSRFMSCPRQSHLAAAKRILRYLKGTPNHGV 1222
L+G ++ + R ++ Y + +++ + P + L +++++ T + +
Sbjct: 1532 KVHEMQKLIGLASYVGYKFRFDLLYYINTLAQHILFPSRQVLDMTYELIQFMWDTRDKQL 1591
Query: 1223 FFPKHLPSQKEDGNLHLVAYTDSDWCGDQVDRRSTMGYVFFFGKAPISWSSKKQAAVALS 1282
+ K+ P++ ++ LVA +D+ + G+Q +S +G +F I S K + S
Sbjct: 1592 IWHKNKPTEPDN---KLVAISDASY-GNQPYYKSQIGNIFLLNGKVIGGKSTKASLTCTS 1647
Query: 1283 TCEAEYIAACSAACQGLWIQALLEELGLKTDEAVQLMVDNKSAIDLAKNPVSHG-RSKHI 1341
T EAE A + + L++EL K L+ D++S I + K+ R++
Sbjct: 1648 TTEAEIHAISESVPLLNNLSYLIQELN-KKPIIKGLLTDSRSTISIIKSTNEEKFRNRFF 1706
Query: 1342 ETKYHFLRDQVSKEKIKLQHCGTDLQIADVFTKPLKADRFKTL 1384
TK LRD+VS + + + T IADV TKPL FK L
Sbjct: 1707 GTKAMRLRDEVSGNNLYVYYIETKKNIADVMTKPLPIKTFKLL 1749
Score = 100 bits (249), Expect = 3e-20
Identities = 98/377 (25%), Positives = 156/377 (40%), Gaps = 26/377 (6%)
Query: 367 PGTWYLDSGCSNHMTGNKEWLINLDENKKSRVRFADDRFISAEGIGDVLVKREDGKDVVI 426
PG LDSG S + + + + N V A R I IGD+ +D I
Sbjct: 455 PGHLLLDSGASRTLIRSAHHIHSASSNPDINVVDAQKRNIPINAIGDLQFHFQDNTKTSI 514
Query: 427 SEVLYVPGMKTNLISMGQLLEKDFSMSMKKRHLEVFDPNERK----------IMKVPLTP 476
+VL+ P + +L+S+ +L D + K LE D + K L P
Sbjct: 515 -KVLHTPNIAYDLLSLNELAAVDITACFTKNVLERSDGTVLAPIVQYGDFYWVSKRYLLP 573
Query: 477 NRTFQVKLTAIDSQCLTAELEDNSWLW-HQRFGHLNFKDLS-SLKSKDMVH-GLSQIKLP 533
+ + + I++ + + + H+ H N + + SLK+ + + S +
Sbjct: 574 SN---ISVPTINNVHTSESTRKYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDWS 630
Query: 534 SKV---CENCLVSKQPRAPF---SSFTPTRSTAVLDVIYSDVCGPFETASIGGNKYFASF 587
S + C +CL+ K + S S +++D+ GP YF SF
Sbjct: 631 SAIDYQCPDCLIGKSTKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISF 690
Query: 588 IDEYSRKMWVYLLKTKSE--VFSVFKVFKTMAKKQSGRSIKVLRTDEGGEYCSNEMSNFC 645
DE ++ WVY L + E + VF K Q S+ V++ D G EY + + F
Sbjct: 691 TDETTKFRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFL 750
Query: 646 EENGILHEVTAPYTPQHNGVAERRNRTVLNMVRSMLKGKSLPHRFCGEAVMTAVYVLN-L 704
E+NGI T + +GVAER NRT+L+ R+ L+ LP+ A+ + V N L
Sbjct: 751 EKNGITPCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFSTIVRNSL 810
Query: 705 CPTKSVDSQVPEAVWSG 721
KS S A +G
Sbjct: 811 ASPKSKKSARQHAGLAG 827
>YCB9_YEAST (P25384) Transposon Ty2 protein B (Ty1-17 protein B)
Length = 1770
Score = 135 bits (341), Expect = 6e-31
Identities = 131/523 (25%), Positives = 247/523 (47%), Gaps = 49/523 (9%)
Query: 881 KFEEAIKNKTWRLAMKEELASIERNKTWDLVDLPANKTPISVKWVFKVKLNPDGSISKHK 940
+ + +K TW + NK +D D+ K I+ ++F K DG+ HK
Sbjct: 1272 EISQLLKMNTW-----------DTNKYYDRNDIDPKKV-INSMFIFNKKR--DGT---HK 1314
Query: 941 ARLVVRGFMQRGGLDYSEVFAPVARLETVRMIVALASWKNWDLWQLDVKSAFLNGPLEEE 1000
AR V RG +Q S++ + + +++A ++ + QLD+ SA+L ++EE
Sbjct: 1315 ARFVARGDIQHPDTYDSDMQSNTVHHYALMTSLSIALDNDYYITQLDISSAYLYADIKEE 1374
Query: 1001 VYITQPPGFEIKGSEHKVLKLRKALYGLKQAPRAWNKRIDTFLSQTGFHKCSVEHGVYVK 1060
+YI PP G K+L+LRK+LYGLKQ+ W + I ++L + C ++
Sbjct: 1375 LYIRPPPHL---GLNDKLLRLRKSLYGLKQSGANWYETIKSYL----INCCDMQEVRGWS 1427
Query: 1061 SCDSGGVLLLCLYVDDLLITGSSYSKIQAVKRSLNNEFE--MTDLG----KLSY-FLGIE 1113
+ +CL+VDD+++ + + + +L +++ + +LG ++ Y LG+E
Sbjct: 1428 CVFKNSQVTICLFVDDMILFSKDLNANKKIITTLKKQYDTKIINLGESDNEIQYDILGLE 1487
Query: 1114 F-VQTGEGILMHQRKYILEVLKRFNLLSCNPAETPVEGNLKLG--LCEEEAEVDSTMFRQ 1170
Q + + + K + E L + N + NP + + G + ++E E+D +++
Sbjct: 1488 IKYQRSKYMKLGMEKSLTEKLPKLN-VPLNPKGKKLRAPGQPGHYIDQDELEIDEDEYKE 1546
Query: 1171 -------LVGCLRFICHS-RLEISYGVGLVSRFMSCPRQSHLAAAKRILRYLKGTPNHGV 1222
L+G ++ + R ++ Y + +++ + P + L +++++ T + +
Sbjct: 1547 KVHEMQKLIGLASYVGYKFRFDLLYYINTLAQHILFPSRQVLDMTYELIQFMWDTRDKQL 1606
Query: 1223 FFPKHLPSQKEDGNLHLVAYTDSDWCGDQVDRRSTMGYVFFFGKAPISWSSKKQAAVALS 1282
+ K+ P++ ++ LVA +D+ + G+Q +S +G +F I S K + S
Sbjct: 1607 IWHKNKPTKPDN---KLVAISDASY-GNQPYYKSQIGNIFLLNGKVIGGKSTKASLTCTS 1662
Query: 1283 TCEAEYIAACSAACQGLWIQALLEELGLKTDEAVQLMVDNKSAIDLAKNPVSHG-RSKHI 1341
T EAE A A + L++EL K L+ D++S I + K+ R++
Sbjct: 1663 TTEAEIHAVSEAIPLLNNLSHLVQELN-KKPIIKGLLTDSRSTISIIKSTNEEKFRNRFF 1721
Query: 1342 ETKYHFLRDQVSKEKIKLQHCGTDLQIADVFTKPLKADRFKTL 1384
TK LRD+VS + + + T IADV TKPL FK L
Sbjct: 1722 GTKAMRLRDEVSGNNLYVYYIETKKNIADVMTKPLPIKTFKLL 1764
Score = 100 bits (250), Expect = 2e-20
Identities = 94/420 (22%), Positives = 167/420 (39%), Gaps = 32/420 (7%)
Query: 313 SDEANLAKEGSITNEEPVTLMMVTEEGESNPATCNITDEEHVTLMMVTKEG--------- 363
S + AK +I + + ES ++ ++D+ ++L KE
Sbjct: 388 SSKPRAAKAHNIATSSKFSRVNNDHINESTVSSQYLSDDNELSLGQQQKESKPTHTIDSN 447
Query: 364 GNYPGTWYLDSGCSNHMTGNKEWLINLDENKKSRVRFADDRFISAEGIGDVLVKREDGKD 423
P +DSG S + + +L + N + + A + I IG++ ++G
Sbjct: 448 DELPDHLLIDSGASQTLVRSAHYLHHATPNSEINIVDAQKQDIPINAIGNLHFNFQNGTK 507
Query: 424 VVISEVLYVPGMKTNLISMGQLLEKDFSMSMKKRHLEVFDPNERK----------IMKVP 473
I + L+ P + +L+S+ +L ++ + + LE D + K
Sbjct: 508 TSI-KALHTPNIAYDLLSLSELANQNITACFTRNTLERSDGTVLAPIVKHGDFYWLSKKY 566
Query: 474 LTPNRTFQVKLTAIDSQCLTAELEDNSWLWHQRFGHLNFKDLSSLKSKDMVHGLSQIKLP 533
L P+ ++ + ++ + L H+ GH NF+ + K+ V L + +
Sbjct: 567 LIPSHISKLTINNVNKSKSVNKYPYP--LIHRMLGHANFRSIQKSLKKNAVTYLKESDIE 624
Query: 534 -----SKVCENCLVSKQPR---APFSSFTPTRSTAVLDVIYSDVCGPFETASIGGNKYFA 585
+ C +CL+ K + S S +++D+ GP YF
Sbjct: 625 WSNASTYQCPDCLIGKSTKHRHVKGSRLKYQESYEPFQYLHTDIFGPVHHLPKSAPSYFI 684
Query: 586 SFIDEYSRKMWVYLLKTKSE--VFSVFKVFKTMAKKQSGRSIKVLRTDEGGEYCSNEMSN 643
SF DE +R WVY L + E + +VF K Q + V++ D G EY + +
Sbjct: 685 SFTDEKTRFQWVYPLHDRREESILNVFTSILAFIKNQFNARVLVIQMDRGSEYTNKTLHK 744
Query: 644 FCEENGILHEVTAPYTPQHNGVAERRNRTVLNMVRSMLKGKSLPHRFCGEAVMTAVYVLN 703
F GI T + +GVAER NRT+LN R++L LP+ AV + + N
Sbjct: 745 FFTNRGITACYTTTADSRAHGVAERLNRTLLNDCRTLLHCSGLPNHLWFSAVEFSTIIRN 804
>YJZ9_YEAST (P47100) Transposon Ty1 protein B
Length = 1755
Score = 132 bits (333), Expect = 5e-30
Identities = 143/583 (24%), Positives = 266/583 (45%), Gaps = 46/583 (7%)
Query: 830 NQNQIQVH----NPRPIRTKTLPARFSDYQLIAETEFNSDGDMIHMALLADANPVKFEEA 885
N+ +I+V N + +R+ P LIA + I L D + + +
Sbjct: 1185 NETEIKVSRDTWNTKNMRSLEPPRSKKRIHLIAAVKAVKSIKPIRTTLRYD-EAITYNKD 1243
Query: 886 IKNKTWRL-AMKEELASIERNKTWDLVDLPANKT-----PISVKWVFKVKLNPDGSISKH 939
IK K + A +E+ + + KTWD + K I+ ++F K DG+ H
Sbjct: 1244 IKEKEKYIEAYHKEVNQLLKMKTWDTDEYYDRKEIDPKRVINSMFIFNKKR--DGT---H 1298
Query: 940 KARLVVRGFMQRGGLDYSEVFAPVARLETVRMIVALASWKNWDLWQLDVKSAFLNGPLEE 999
KAR V RG +Q S + + + ++LA N+ + QLD+ SA+L ++E
Sbjct: 1299 KARFVARGDIQHPDTYDSGMQSNTVHHYALMTSLSLALDNNYYITQLDISSAYLYADIKE 1358
Query: 1000 EVYITQPPGFEIKGSEHKVLKLRKALYGLKQAPRAWNKRIDTFLSQTGFHKCSVEHGVYV 1059
E+YI PP G K+++L+K+LYGLKQ+ W + I ++L Q +C +E
Sbjct: 1359 ELYIRPPPHL---GMNDKLIRLKKSLYGLKQSGANWYETIKSYLIQ----QCGMEEVRGW 1411
Query: 1060 KSCDSGGVLLLCLYVDDLLITGSSYSKIQAVKRSLNNEFE--MTDLGK----LSY-FLGI 1112
+ +CL+VDD+++ + + + + L +++ + +LG+ + Y LG+
Sbjct: 1412 SCVFKNSQVTICLFVDDMVLFSKNLNSNKRIIEKLKMQYDTKIINLGESDEEIQYDILGL 1471
Query: 1113 EF-VQTGEGILMHQRKYILEVLKRFNL-LSCNPAETPVEGNLKLGLCEEEAEVDSTMFR- 1169
E Q G+ + + + E + + N+ L+ + G L + ++E E++ ++
Sbjct: 1472 EIKYQRGKYMKLGMENSLTEKIPKLNVPLNPKGRKLSAPGQPGLYIDQQELELEEDDYKM 1531
Query: 1170 ------QLVGCLRFICHS-RLEISYGVGLVSRFMSCPRQSHLAAAKRILRYLKGTPNHGV 1222
+L+G ++ + R ++ Y + +++ + P + L +++++ T + +
Sbjct: 1532 KVHEMQKLIGLASYVGYKFRFDLLYYINTLAQHILFPSKQVLDMTYELIQFIWNTRDKQL 1591
Query: 1223 FFPKHLPSQKEDGNLHLVAYTDSDWCGDQVDRRSTMGYVFFFGKAPISWSSKKQAAVALS 1282
+ K P + + LV +D+ + G+Q +S +G ++ I S K + S
Sbjct: 1592 IWHKSKPVKPTN---KLVVISDASY-GNQPYYKSQIGNIYLLNGKVIGGKSTKASLTCTS 1647
Query: 1283 TCEAEYIAACSAACQGLWIQALLEELGLKTDEAVQLMVDNKSAIDLA-KNPVSHGRSKHI 1341
T EAE A + + L++EL K L+ D+KS I + N R++
Sbjct: 1648 TTEAEIHAISESVPLLNNLSYLIQELD-KKPITKGLLTDSKSTISIIISNNEEKFRNRFF 1706
Query: 1342 ETKYHFLRDQVSKEKIKLQHCGTDLQIADVFTKPLKADRFKTL 1384
TK LRD+VS + + + T IADV TKPL FK L
Sbjct: 1707 GTKAMRLRDEVSGNHLHVCYIETKKNIADVMTKPLPIKTFKLL 1749
Score = 100 bits (250), Expect = 2e-20
Identities = 98/377 (25%), Positives = 156/377 (40%), Gaps = 26/377 (6%)
Query: 367 PGTWYLDSGCSNHMTGNKEWLINLDENKKSRVRFADDRFISAEGIGDVLVKREDGKDVVI 426
PG LDSG S + + + + N V A R I IGD+ +D I
Sbjct: 455 PGHLLLDSGASRTLIRSAHHIHSASSNPDINVVDAQKRNIPINAIGDLQFHFQDNTKTSI 514
Query: 427 SEVLYVPGMKTNLISMGQLLEKDFSMSMKKRHLEVFDPNERK----------IMKVPLTP 476
+VL+ P + +L+S+ +L D + K LE D + K L P
Sbjct: 515 -KVLHTPNIAYDLLSLNELAAVDITACFTKNVLERSDGTVLAPIVKYGDFYWVSKKYLLP 573
Query: 477 NRTFQVKLTAIDSQCLTAELEDNSWLW-HQRFGHLNFKDLS-SLKSKDMVH-GLSQIKLP 533
+ + + I++ + + + H+ H N + + SLK+ + + S +
Sbjct: 574 SN---ISVPTINNVHTSESTRKYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDWS 630
Query: 534 SKV---CENCLVSKQPRAPF---SSFTPTRSTAVLDVIYSDVCGPFETASIGGNKYFASF 587
S + C +CL+ K + S S +++D+ GP YF SF
Sbjct: 631 SAIDYQCPDCLIGKSTKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISF 690
Query: 588 IDEYSRKMWVYLLKTKSE--VFSVFKVFKTMAKKQSGRSIKVLRTDEGGEYCSNEMSNFC 645
DE ++ WVY L + E + VF K Q S+ V++ D G EY + + F
Sbjct: 691 TDETTKFRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFL 750
Query: 646 EENGILHEVTAPYTPQHNGVAERRNRTVLNMVRSMLKGKSLPHRFCGEAVMTAVYVLN-L 704
E+NGI T + +GVAER NRT+L+ R+ L+ LP+ A+ + V N L
Sbjct: 751 EKNGITPCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFSTIVRNSL 810
Query: 705 CPTKSVDSQVPEAVWSG 721
KS S A +G
Sbjct: 811 ASPKSKKSARQHAGLAG 827
>YMU0_YEAST (Q04670) Transposon Ty1 protein B
Length = 1328
Score = 131 bits (330), Expect = 1e-29
Identities = 145/587 (24%), Positives = 268/587 (44%), Gaps = 54/587 (9%)
Query: 830 NQNQIQVH----NPRPIRTKTLPARFSDYQLIAETEFNSDGDMIHMALLADANPVKFEEA 885
N+ +I+V N + +R+ P LIA + I L D + + +
Sbjct: 758 NETEIKVSRDTWNTKNMRSLEPPRSKKRIHLIAAVKAVKSIKPIRTTLRYD-EAITYNKD 816
Query: 886 IKNKTWRL-AMKEELASIERNKTWDLVDLPANKTPISVKWV----FKVKLNPDGSISKHK 940
IK K + A +E+ + + KTWD D ++ I K V F DG+ HK
Sbjct: 817 IKEKEKYIQAYHKEVNQLLKMKTWD-TDRYYDRKEIDPKRVINSMFIFNRKRDGT---HK 872
Query: 941 ARLVVRGFMQRGGLDYSEVFAPVARLETVRMI-----VALASWKNWDLWQLDVKSAFLNG 995
AR V RG +Q + + + P + TV ++LA N+ + QLD+ SA+L
Sbjct: 873 ARFVARGDIQ-----HPDTYDPGMQSNTVHHYALMTSLSLALDNNYYITQLDISSAYLYA 927
Query: 996 PLEEEVYITQPPGFEIKGSEHKVLKLRKALYGLKQAPRAWNKRIDTFLSQTGFHKCSVEH 1055
++EE+YI PP G K+++L+K+LYGLKQ+ W + I ++L + +C +E
Sbjct: 928 DIKEELYIRPPPHL---GMNDKLIRLKKSLYGLKQSGANWYETIKSYLIK----QCGMEE 980
Query: 1056 GVYVKSCDSGGVLLLCLYVDDLLITGSSYSKIQAVKRSLNNEFE--MTDLGK----LSY- 1108
+ +CL+VDD+++ + + + +L +++ + +LG+ + Y
Sbjct: 981 VRGWSCVFKNSQVTICLFVDDMILFSKDLNANKKIITTLKKQYDTKIINLGESDNEIQYD 1040
Query: 1109 FLGIEF-VQTGEGILMHQRKYILEVLKRFNL-LSCNPAETPVEGNLKLGLCEEEAEVDST 1166
LG+E Q G+ + + + E + + N+ L+ + G L + ++E E++
Sbjct: 1041 ILGLEIKYQRGKYMKLGMENSLTEKIPKLNVPLNPKGRKLSAPGQPGLYIDQQELELEED 1100
Query: 1167 MFR-------QLVGCLRFICHS-RLEISYGVGLVSRFMSCPRQSHLAAAKRILRYLKGTP 1218
++ +L+G ++ + R ++ Y + +++ + P + L +++++ T
Sbjct: 1101 DYKMKVHEMQKLIGLASYVGYKFRFDLLYYINTLAQHILFPSKQVLDMTYELIQFIWNTR 1160
Query: 1219 NHGVFFPKHLPSQKEDGNLHLVAYTDSDWCGDQVDRRSTMGYVFFFGKAPISWSSKKQAA 1278
+ + + K P + + LV +D+ + G+Q +S +G ++ I S K +
Sbjct: 1161 DKQLIWHKSKPVKPTN---KLVVISDASY-GNQPYYKSQIGNIYLLNGKVIGGKSTKASL 1216
Query: 1279 VALSTCEAEYIAACSAACQGLWIQALLEELGLKTDEAVQLMVDNKSAIDLA-KNPVSHGR 1337
ST EAE A + + L++EL K L+ D+KS I + N R
Sbjct: 1217 TCTSTTEAEIHAISESVPLLNNLSHLVQELN-KKPITKGLLTDSKSTISIIISNNEEKFR 1275
Query: 1338 SKHIETKYHFLRDQVSKEKIKLQHCGTDLQIADVFTKPLKADRFKTL 1384
++ TK LRD+VS + + + T IADV TKPL FK L
Sbjct: 1276 NRFFGTKAMRLRDEVSGNHLHVCYIETKKNIADVMTKPLPIKTFKLL 1322
Score = 100 bits (250), Expect = 2e-20
Identities = 98/377 (25%), Positives = 156/377 (40%), Gaps = 26/377 (6%)
Query: 367 PGTWYLDSGCSNHMTGNKEWLINLDENKKSRVRFADDRFISAEGIGDVLVKREDGKDVVI 426
PG LDSG S + + + + N V A R I IGD+ +D I
Sbjct: 28 PGHLLLDSGASRTLIRSAHHIHSASSNPDINVVDAQKRNIPINAIGDLQFHFQDNTKTSI 87
Query: 427 SEVLYVPGMKTNLISMGQLLEKDFSMSMKKRHLEVFDPNERK----------IMKVPLTP 476
+VL+ P + +L+S+ +L D + K LE D + K L P
Sbjct: 88 -KVLHTPNIAYDLLSLNELAAVDITACFTKNVLERSDGTVLAPIVKYGDFYWVSKKYLLP 146
Query: 477 NRTFQVKLTAIDSQCLTAELEDNSWLW-HQRFGHLNFKDLS-SLKSKDMVH-GLSQIKLP 533
+ + + I++ + + + H+ H N + + SLK+ + + S +
Sbjct: 147 SN---ISVPTINNVHTSESTRKYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDWS 203
Query: 534 SKV---CENCLVSKQPRAPF---SSFTPTRSTAVLDVIYSDVCGPFETASIGGNKYFASF 587
S + C +CL+ K + S S +++D+ GP YF SF
Sbjct: 204 SAIDYQCPDCLIGKSTKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISF 263
Query: 588 IDEYSRKMWVYLLKTKSE--VFSVFKVFKTMAKKQSGRSIKVLRTDEGGEYCSNEMSNFC 645
DE ++ WVY L + E + VF K Q S+ V++ D G EY + + F
Sbjct: 264 TDETTKFRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFL 323
Query: 646 EENGILHEVTAPYTPQHNGVAERRNRTVLNMVRSMLKGKSLPHRFCGEAVMTAVYVLN-L 704
E+NGI T + +GVAER NRT+L+ R+ L+ LP+ A+ + V N L
Sbjct: 324 EKNGITPCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFSTIVRNSL 383
Query: 705 CPTKSVDSQVPEAVWSG 721
KS S A +G
Sbjct: 384 ASPKSKKSARQHAGLAG 400
>YMT5_YEAST (Q04214) Transposon Ty1 protein B
Length = 1328
Score = 130 bits (328), Expect = 2e-29
Identities = 143/582 (24%), Positives = 265/582 (44%), Gaps = 44/582 (7%)
Query: 830 NQNQIQVH----NPRPIRTKTLPARFSDYQLIAETEFNSDGDMIHMALLADANPVKFEEA 885
N+ +I+V N + +R+ P LIA + I L D + + +
Sbjct: 758 NETEIKVSRDTWNTKNMRSLEPPRSKKRIHLIAAVKAVKSIKPIRTTLRYD-EAITYNKD 816
Query: 886 IKNKTWRL-AMKEELASIERNKTWDLVDLPANKTPISVKWV----FKVKLNPDGSISKHK 940
IK K + A +E+ + + KTWD D ++ I K V F DG+ HK
Sbjct: 817 IKEKEKYIEAYHKEVNQLLKMKTWD-TDKYYDRKEIDPKRVINSMFIFNRKRDGT---HK 872
Query: 941 ARLVVRGFMQRGGLDYSEVFAPVARLETVRMIVALASWKNWDLWQLDVKSAFLNGPLEEE 1000
AR V RG +Q S + + + ++LA N+ + QLD+ SA+L ++EE
Sbjct: 873 ARFVARGDIQHPDTYDSGMQSNTVHHYALMTSLSLALDNNYYITQLDISSAYLYADIKEE 932
Query: 1001 VYITQPPGFEIKGSEHKVLKLRKALYGLKQAPRAWNKRIDTFLSQTGFHKCSVEHGVYVK 1060
+YI PP G K+++L+K+LYGLKQ+ W + I ++L + +C +E
Sbjct: 933 LYIRPPPHL---GMNDKLIRLKKSLYGLKQSGANWYETIKSYLIK----QCGMEEVRGWS 985
Query: 1061 SCDSGGVLLLCLYVDDLLITGSSYSKIQAVKRSLNNEFE--MTDLGK----LSY-FLGIE 1113
+ +CL+VDD+++ + + + + L +++ + +LG+ + Y LG+E
Sbjct: 986 CVFENSQVTICLFVDDMVLFSKNLNSNKRIIDKLKMQYDTKIINLGESDEEIQYDILGLE 1045
Query: 1114 F-VQTGEGILMHQRKYILEVLKRFNL-LSCNPAETPVEGNLKLGLCEEEAEVDSTMFR-- 1169
Q G+ + + + E + + N+ L+ + G L + ++E E++ ++
Sbjct: 1046 IKYQRGKYMKLGMENSLTEKIPKLNVPLNPKGRKLSAPGQPGLYIDQQELELEEDDYKMK 1105
Query: 1170 -----QLVGCLRFICHS-RLEISYGVGLVSRFMSCPRQSHLAAAKRILRYLKGTPNHGVF 1223
+L+G ++ + R ++ Y + +++ + P + L +++++ T + +
Sbjct: 1106 VHEMQKLIGLASYVGYKFRFDLLYYINTLAQHILFPSKQVLDMTYELIQFIWNTRDKQLI 1165
Query: 1224 FPKHLPSQKEDGNLHLVAYTDSDWCGDQVDRRSTMGYVFFFGKAPISWSSKKQAAVALST 1283
+ K P + + LV +D+ + G+Q +S +G ++ I S K + ST
Sbjct: 1166 WHKSKPVKPTN---KLVVISDASY-GNQPYYKSQIGNIYLLNGKVIGGKSTKASLTCTST 1221
Query: 1284 CEAEYIAACSAACQGLWIQALLEELGLKTDEAVQLMVDNKSAIDLA-KNPVSHGRSKHIE 1342
EAE A + + L++EL K L+ D+KS I + N R++
Sbjct: 1222 TEAEIHAISESVPLLNNLSYLIQELD-KKPITKGLLTDSKSTISIIISNNEEKFRNRFFG 1280
Query: 1343 TKYHFLRDQVSKEKIKLQHCGTDLQIADVFTKPLKADRFKTL 1384
TK LRD+VS + + + T IADV TKPL FK L
Sbjct: 1281 TKAMRLRDEVSGNHLHVCYIETKKNIADVMTKPLPIKTFKLL 1322
Score = 100 bits (250), Expect = 2e-20
Identities = 98/377 (25%), Positives = 156/377 (40%), Gaps = 26/377 (6%)
Query: 367 PGTWYLDSGCSNHMTGNKEWLINLDENKKSRVRFADDRFISAEGIGDVLVKREDGKDVVI 426
PG LDSG S + + + + N V A R I IGD+ +D I
Sbjct: 28 PGHLLLDSGASRTLIRSAHHIHSASSNPDINVVDAQKRNIPINAIGDLQFHFQDNTKTSI 87
Query: 427 SEVLYVPGMKTNLISMGQLLEKDFSMSMKKRHLEVFDPNERK----------IMKVPLTP 476
+VL+ P + +L+S+ +L D + K LE D + K L P
Sbjct: 88 -KVLHTPNIAYDLLSLNELAAVDITACFTKNVLERSDGTVLAPIVKYGDFYWVSKKYLLP 146
Query: 477 NRTFQVKLTAIDSQCLTAELEDNSWLW-HQRFGHLNFKDLS-SLKSKDMVH-GLSQIKLP 533
+ + + I++ + + + H+ H N + + SLK+ + + S +
Sbjct: 147 SN---ISVPTINNVHTSESTRKYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDWS 203
Query: 534 SKV---CENCLVSKQPRAPF---SSFTPTRSTAVLDVIYSDVCGPFETASIGGNKYFASF 587
S + C +CL+ K + S S +++D+ GP YF SF
Sbjct: 204 SAIDYQCPDCLIGKSTKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISF 263
Query: 588 IDEYSRKMWVYLLKTKSE--VFSVFKVFKTMAKKQSGRSIKVLRTDEGGEYCSNEMSNFC 645
DE ++ WVY L + E + VF K Q S+ V++ D G EY + + F
Sbjct: 264 TDETTKFRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFL 323
Query: 646 EENGILHEVTAPYTPQHNGVAERRNRTVLNMVRSMLKGKSLPHRFCGEAVMTAVYVLN-L 704
E+NGI T + +GVAER NRT+L+ R+ L+ LP+ A+ + V N L
Sbjct: 324 EKNGITPCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFSTIVRNSL 383
Query: 705 CPTKSVDSQVPEAVWSG 721
KS S A +G
Sbjct: 384 ASPKSKKSARQHAGLAG 400
>M820_ARATH (P92520) Hypothetical mitochondrial protein AtMg00820
(ORF170)
Length = 170
Score = 91.3 bits (225), Expect = 2e-17
Identities = 43/92 (46%), Positives = 62/92 (66%)
Query: 885 AIKNKTWRLAMKEELASIERNKTWDLVDLPANKTPISVKWVFKVKLNPDGSISKHKARLV 944
A+K+ W AM+EEL ++ RNKTW LV P N+ + KWVFK KL+ DG++ + KARLV
Sbjct: 34 ALKDPGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLDRLKARLV 93
Query: 945 VRGFMQRGGLDYSEVFAPVARLETVRMIVALA 976
+GF Q G+ + E ++PV R T+R I+ +A
Sbjct: 94 AKGFHQEEGIYFVETYSPVVRTATIRTILNVA 125
>M710_ARATH (P92512) Hypothetical mitochondrial protein AtMg00710
(ORF120)
Length = 120
Score = 62.4 bits (150), Expect = 8e-09
Identities = 30/84 (35%), Positives = 48/84 (56%)
Query: 670 NRTVLNMVRSMLKGKSLPHRFCGEAVMTAVYVLNLCPTKSVDSQVPEAVWSGRKPSVKHL 729
NRT++ VRSML LP F +A TAV+++N P+ +++ VP+ VW P+ +L
Sbjct: 2 NRTIIEKVRSMLCECGLPKTFRADAANTAVHIINKYPSTAINFHVPDEVWFQSVPTYSYL 61
Query: 730 RIFGCLCHKHIPDQRRRKLDDKSE 753
R FGC+ + H + + + K E
Sbjct: 62 RRFGCVAYIHCDEGKLKPRAKKGE 85
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 62.0 bits (149), Expect = 1e-08
Identities = 51/188 (27%), Positives = 85/188 (45%), Gaps = 8/188 (4%)
Query: 535 KVCENCLVSKQPRAPFSSFTPTRSTAVLDVIYSDVCGPFETASIGGNKYFASFIDEYSRK 594
+ C CL + SS TP R T L+++ D+ S+ GN+Y + ID +++
Sbjct: 1508 RTCAKCLCANDHSKLTSSLTPYRMTFPLEIVACDLMDV--GLSVQGNRYILTIIDLFTKY 1565
Query: 595 MWVYLLKTKSEVFSVFKVFKTMAKKQSGRSIKVLRTDEGGEYCSNEMSNFCEENGILHEV 654
+ K + +V K F GR L TD+G E+ + + F I H
Sbjct: 1566 GTAVPIPDK-KAETVLKAFVERWAIGEGRIPLKLLTDQGKEFVNGLFAQFTHMLKIEHIT 1624
Query: 655 TAPYTPQHNGVAERRNRTVLNMVRSMLKGKSLPHRFCGEAVMTAVYVLNLCPTKSVDSQV 714
T Y + NG ER N+T++++ M K ++P + + V+ AVY N C ++ +
Sbjct: 1625 TKGYNSRANGAVERFNKTIMHI---MKKKTAVPMEW-DDQVVYAVYAYNNCVHENT-GET 1679
Query: 715 PEAVWSGR 722
P + GR
Sbjct: 1680 PMFLMHGR 1687
Score = 32.7 bits (73), Expect = 7.1
Identities = 28/117 (23%), Positives = 45/117 (37%), Gaps = 13/117 (11%)
Query: 280 FDRKKIRCFNCNKIGHFSSECKAPSGGDTRGRTSDEANLAKEGSITNEEPVTLMMVTEEG 339
F K CF CN++GH + C + + EA +AK +I +++ +
Sbjct: 584 FKLKNRACFRCNEMGHIAWNCPKKN----ENTSEKEAPVAKVETIEGVRMKDCLLMVKSE 639
Query: 340 ESNPATCNITDEEHVTLMMVTKEGGNYPGTWYLDSGCSNHMTGNKEWLINLDENKKS 396
+S E VT + + G LDSG S + W ++ N KS
Sbjct: 640 KS---------ESEVTRSLEKGQIGKANVEILLDSGASISLMSKNTWEKIVEVNGKS 687
>POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1161
Score = 60.1 bits (144), Expect = 4e-08
Identities = 43/165 (26%), Positives = 73/165 (44%), Gaps = 14/165 (8%)
Query: 521 KDMVHGLSQIKLPSKVCENCLVSKQPRAPFSSFT-PTRSTAVLDVIYSDVCGPFETASIG 579
KD+V + Q C+ CLV+ P + D Y D GP ++
Sbjct: 849 KDVVKSIRQ-------CKQCLVTNATNLTSPPILRPVKPLKPFDKFYIDYIGPLPPSN-- 899
Query: 580 GNKYFASFIDEYSRKMWVYLLKTKSEVFSVFKVFKTMAKKQSGRSIKVLRTDEGGEYCSN 639
G + +D + +W+Y K S +V K + S KVL +D+G + S+
Sbjct: 900 GYLHVLVVVDSMTGFVWLYPTKAPSTSATV----KALNMLTSIAIPKVLHSDQGAAFTSS 955
Query: 640 EMSNFCEENGILHEVTAPYTPQHNGVAERRNRTVLNMVRSMLKGK 684
+++ +E GI E + PY PQ +G ER+N + ++ +L G+
Sbjct: 956 TFADWAKEKGIQLEFSTPYHPQSSGKVERKNSDIKRLLTKLLIGR 1000
>POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)]
Length = 886
Score = 56.2 bits (134), Expect = 6e-07
Identities = 38/149 (25%), Positives = 65/149 (43%), Gaps = 7/149 (4%)
Query: 537 CENCLVSKQP-RAPFSSFTPTRSTAVLDVIYSDVCGPFETASIGGNKYFASFIDEYSRKM 595
C+ CL++ +A P R D + D GP + G Y +D +
Sbjct: 650 CQQCLITNASNKASGPILRPDRPQKPFDKFFIDYIGPLPPSQ--GYLYVLVVVDGMTGFT 707
Query: 596 WVYLLKTKSEVFSVFKVFKTMAKKQSGRSIKVLRTDEGGEYCSNEMSNFCEENGILHEVT 655
W+Y K S +V K++ S KV+ +D+G + S+ + + +E GI E +
Sbjct: 708 WLYPTKAPSTSATV----KSLNVLTSIAIPKVIHSDQGAAFTSSTFAEWAKERGIHLEFS 763
Query: 656 APYTPQHNGVAERRNRTVLNMVRSMLKGK 684
PY PQ ER+N + ++ +L G+
Sbjct: 764 TPYHPQSGSKVERKNSDIKRLLTKLLVGR 792
>POL_SFV3L (P27401) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1157
Score = 55.5 bits (132), Expect = 1e-06
Identities = 38/149 (25%), Positives = 66/149 (43%), Gaps = 7/149 (4%)
Query: 537 CENCLVSKQPR-APFSSFTPTRSTAVLDVIYSDVCGPFETASIGGNKYFASFIDEYSRKM 595
C+ CLV+ A P R D + D GP ++ G + +D + +
Sbjct: 860 CKQCLVTNAATLAAPPILRPERPVKPFDKFFIDYIGPLPPSN--GYLHVLVVVDSMTGFV 917
Query: 596 WVYLLKTKSEVFSVFKVFKTMAKKQSGRSIKVLRTDEGGEYCSNEMSNFCEENGILHEVT 655
W+Y K S +V K + S KV+ +D+G + S +++ + GI E +
Sbjct: 918 WLYPTKAPSTSATV----KALNMLTSIAVPKVIHSDQGAAFTSATFADWAKNKGIQLEFS 973
Query: 656 APYTPQHNGVAERRNRTVLNMVRSMLKGK 684
PY PQ +G ER+N + ++ +L G+
Sbjct: 974 TPYHPQSSGKVERKNSDIKRLLTKLLVGR 1002
>POL_JSRV (P31623) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)]
Length = 870
Score = 53.1 bits (126), Expect = 5e-06
Identities = 30/104 (28%), Positives = 44/104 (41%)
Query: 626 KVLRTDEGGEYCSNEMSNFCEENGILHEVTAPYTPQHNGVAERRNRTVLNMVRSMLKGKS 685
+ L+TD G Y S FC I H+ PY PQ G+ ER ++ + + + KG
Sbjct: 707 QTLKTDNGPGYTSRSFQRFCLSFQIHHKTGIPYNPQGQGIVERAHQRIKHQLLKQKKGNE 766
Query: 686 LPHRFCGEAVMTAVYVLNLCPTKSVDSQVPEAVWSGRKPSVKHL 729
L A+ A+YVLN + + + W R K L
Sbjct: 767 LYSPSPHNALNHALYVLNFLTLDTEGNSAAQRFWGERSSCKKPL 810
>M240_ARATH (P93290) Hypothetical mitochondrial protein AtMg00240
(ORF111a)
Length = 111
Score = 50.1 bits (118), Expect = 4e-05
Identities = 25/84 (29%), Positives = 44/84 (51%), Gaps = 7/84 (8%)
Query: 1177 FICHSRLEISYGVGLVSRFMSCPRQSHLAAAKRILRYLKGTPNHGVFFPKHLPSQKEDGN 1236
++ +R ++++ V +S+F S R + + A ++L Y+KGT G+F+ +
Sbjct: 2 YLTITRPDLTFAVNRLSQFSSASRTAQMQAVYKVLHYVKGTVGQGLFY-------SATSD 54
Query: 1237 LHLVAYTDSDWCGDQVDRRSTMGY 1260
L L A+ DSDW RRS G+
Sbjct: 55 LQLKAFADSDWASCPDTRRSVTGF 78
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.320 0.135 0.404
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 164,537,269
Number of Sequences: 164201
Number of extensions: 7082483
Number of successful extensions: 17678
Number of sequences better than 10.0: 197
Number of HSP's better than 10.0 without gapping: 147
Number of HSP's successfully gapped in prelim test: 50
Number of HSP's that attempted gapping in prelim test: 17278
Number of HSP's gapped (non-prelim): 358
length of query: 1400
length of database: 59,974,054
effective HSP length: 123
effective length of query: 1277
effective length of database: 39,777,331
effective search space: 50795651687
effective search space used: 50795651687
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 72 (32.3 bits)
Lotus: description of TM0399.4