
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0226.2
(583 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
BF637357 similar to GP|13365578|db contains ESTs AU088719(S4716)... 263 1e-70
BF646565 weakly similar to GP|20259245|gb putative N-terminal ac... 263 2e-70
TC79901 weakly similar to GP|20259245|gb|AAM14358.1 putative N-t... 218 7e-57
TC89875 similar to GP|20259245|gb|AAM14358.1 putative N-terminal... 97 2e-20
BQ146979 weakly similar to GP|20259245|gb| putative N-terminal a... 50 3e-08
TC84723 similar to GP|16945387|emb|CAD11800. conserved hypotheti... 35 0.061
BE942366 similar to GP|7939524|dbj| contains similarity to unkno... 30 3.4
BE248078 similar to GP|14334774|gb| unknown protein {Arabidopsis... 30 3.4
AW694919 similar to GP|21554403|gb transcription factor TINY pu... 30 3.4
TC88705 similar to GP|10177834|dbj|BAB11263. gene_id:MCD7.9~unkn... 29 4.4
CA920908 homologue to PIR|T06377|T06 SAR DNA-binding protein-1 -... 29 5.7
BM779470 similar to GP|17225592|gb serine acetyltransferase {Ara... 28 7.5
TC85442 similar to GP|13543783|gb|AAH06040.1 Unknown (protein fo... 28 9.8
>BF637357 similar to GP|13365578|db contains ESTs AU088719(S4716)
C98685(E1389) AU066116(S4716)~hypothetical
protein~similar to, partial (21%)
Length = 582
Score = 263 bits (672), Expect = 1e-70
Identities = 134/173 (77%), Positives = 143/173 (82%)
Frame = +2
Query: 257 LEMLKFQDQLHSHSYFHKAAAGAIRCYIKLHDSPPKSTAEEDEEMSKLLPAQKKKMRQKQ 316
L L F+ SH YFHKAAAGAIRCYIKLHD PPKST EEDE MS LLP+QKKK+RQKQ
Sbjct: 62 LTCLNFKTNCTSHPYFHKAAAGAIRCYIKLHDFPPKSTTEEDEHMSNLLPSQKKKLRQKQ 241
Query: 317 RKAEARAKKGAEEKNEELSASGVSKSGKRHVKPVDPDPHGEKLLQVDDPLSEAIKYLKLL 376
RKAEARAKK AEEKNEEL++S VSKSGKR VKPVDPDPHGEKLLQV+DPLSEA+KYLKLL
Sbjct: 242 RKAEARAKKEAEEKNEELNSSVVSKSGKRPVKPVDPDPHGEKLLQVEDPLSEAVKYLKLL 421
Query: 377 QKNSPDSLETHLLSFELYTRKQKVLLTFQAVKQLLRLDAEHPDSHRCLVCQLH 429
QKNSPDSLETHLLSFELYTRK K+LL FQAV PDSHRCL+ H
Sbjct: 422 QKNSPDSLETHLLSFELYTRKXKILLAFQAVSSY*GWMLTTPDSHRCLIKFFH 580
Score = 63.9 bits (154), Expect = 2e-10
Identities = 28/34 (82%), Positives = 31/34 (90%)
Frame = +1
Query: 239 DQFDFHSYCLRKMTLRTYLEMLKFQDQLHSHSYF 272
DQFDFHSYCLRKMTLR+Y++MLKFQDQLH S F
Sbjct: 7 DQFDFHSYCLRKMTLRSYVDMLKFQDQLHLTSLF 108
>BF646565 weakly similar to GP|20259245|gb putative N-terminal
acetyltransferase {Arabidopsis thaliana}, partial (19%)
Length = 614
Score = 263 bits (671), Expect = 2e-70
Identities = 138/206 (66%), Positives = 151/206 (72%), Gaps = 31/206 (15%)
Frame = +1
Query: 409 QLLRLDAEHPDSHRCLV-------------------------------CQLHEKTLFEAN 437
QLLRLDA+HPDSHRCL+ QLHEK+LFEAN
Sbjct: 1 QLLRLDADHPDSHRCLIKFFHQLGSMSTPVTESEKLIWSVLEAERSTISQLHEKSLFEAN 180
Query: 438 NSFLEKHKDSLMHRAAFAETLYILDPNRKSEAVKLIEESTNNIVPRNGALGPIREWKLKD 497
N+F + HKDSLMHRAAFAE LYILD NRKSEAVKLIE+S NN VPRNGA+GPI EWKL+D
Sbjct: 181 NAFHDNHKDSLMHRAAFAEILYILDSNRKSEAVKLIEDSVNNTVPRNGAIGPIGEWKLED 360
Query: 498 CIAVHKLLGTVLLDQDAALRWKVRCAEYFPYSRYFEGSRSSASSNTALKQLSKNSENETL 557
CIAVHKLLGTVL+DQDAALRWKVRCAEYFPYS YFEG SSAS N+A QL KNSEN+
Sbjct: 361 CIAVHKLLGTVLVDQDAALRWKVRCAEYFPYSTYFEGRHSSASPNSAFSQLRKNSENDGP 540
Query: 558 NHSVCTQNVGSITSNGKLEAFKQLAI 583
NHSV QNVGS TSNG+ AF+ L I
Sbjct: 541 NHSVDNQNVGSTTSNGR--AFENLTI 612
>TC79901 weakly similar to GP|20259245|gb|AAM14358.1 putative N-terminal
acetyltransferase {Arabidopsis thaliana}, partial (11%)
Length = 965
Score = 218 bits (554), Expect = 7e-57
Identities = 109/155 (70%), Positives = 126/155 (80%)
Frame = +1
Query: 429 HEKTLFEANNSFLEKHKDSLMHRAAFAETLYILDPNRKSEAVKLIEESTNNIVPRNGALG 488
H K+L EAN+ FLEKH+ S+MHRAAF E +YILDPNR++EAVKLIE STNN V NGALG
Sbjct: 34 HGKSLLEANSLFLEKHEGSMMHRAAFGEMMYILDPNRRAEAVKLIEGSTNNPVSSNGALG 213
Query: 489 PIREWKLKDCIAVHKLLGTVLLDQDAALRWKVRCAEYFPYSRYFEGSRSSASSNTALKQL 548
PIREW LKDCIAVHKLLG+VL DQDAALRWKVRCAE+FPYS YFEGS+SSAS N+AL Q+
Sbjct: 214 PIREWTLKDCIAVHKLLGSVLDDQDAALRWKVRCAEFFPYSTYFEGSQSSASPNSALNQI 393
Query: 549 SKNSENETLNHSVCTQNVGSITSNGKLEAFKQLAI 583
K + N + +HS V S+TSNGKL +FK L I
Sbjct: 394 CKTTINGSSSHSPGDNIVESVTSNGKLASFKDLTI 498
>TC89875 similar to GP|20259245|gb|AAM14358.1 putative N-terminal
acetyltransferase {Arabidopsis thaliana}, partial (38%)
Length = 1170
Score = 96.7 bits (239), Expect = 2e-20
Identities = 47/54 (87%), Positives = 49/54 (90%)
Frame = +1
Query: 15 QRIPLDFLQGDKFREAADNYIRPLLTKGVPSLFSDLSSLYNHPGKADILEQLIL 68
+RIPLDFLQGDKFREAA+NYIRPLLTKGVPSLFSDLSSLY H GKADILE L
Sbjct: 1009 KRIPLDFLQGDKFREAAENYIRPLLTKGVPSLFSDLSSLYTHSGKADILEHSFL 1170
>BQ146979 weakly similar to GP|20259245|gb| putative N-terminal
acetyltransferase {Arabidopsis thaliana}, partial (3%)
Length = 644
Score = 49.7 bits (117), Expect(2) = 3e-08
Identities = 23/26 (88%), Positives = 24/26 (91%)
Frame = +3
Query: 491 REWKLKDCIAVHKLLGTVLLDQDAAL 516
REW LKDCIAVHKLLG+VL DQDAAL
Sbjct: 66 REWTLKDCIAVHKLLGSVLDDQDAAL 143
Score = 26.2 bits (56), Expect(2) = 3e-08
Identities = 13/17 (76%), Positives = 13/17 (76%)
Frame = +1
Query: 470 VKLIEESTNNIVPRNGA 486
VKLIE STNN V NGA
Sbjct: 1 VKLIEGSTNNPVSSNGA 51
>TC84723 similar to GP|16945387|emb|CAD11800. conserved hypothetical protein
{Neurospora crassa}, partial (5%)
Length = 999
Score = 35.4 bits (80), Expect = 0.061
Identities = 30/131 (22%), Positives = 57/131 (42%), Gaps = 3/131 (2%)
Frame = +3
Query: 206 ELASGESFFRQGDLGRALKKFLGVEKHYADINEDQFDFHSYCLRKMTLRTYLEMLKFQDQ 265
EL + + + +L RA + + A I+E++ L++ +E+ K+Q++
Sbjct: 561 ELRVQKEYREEAELQRAKRDLDEIRNREARISEEKH-----------LKSQVELQKYQEE 707
Query: 266 LH---SHSYFHKAAAGAIRCYIKLHDSPPKSTAEEDEEMSKLLPAQKKKMRQKQRKAEAR 322
+ K AA A+ + K +E +E K + ++ RQK+ A+
Sbjct: 708 MRLAEKAKQREKEAADAVERFQKKEQERLAKQEKEKQEREKEFQRRLQEERQKELDRIAK 887
Query: 323 AKKGAEEKNEE 333
AKK EE +E
Sbjct: 888 AKKEKEENEKE 920
>BE942366 similar to GP|7939524|dbj| contains similarity to unknown
protein~gb|AAD25764.1~gene_id:K14B15.5 {Arabidopsis
thaliana}, partial (12%)
Length = 391
Score = 29.6 bits (65), Expect = 3.4
Identities = 21/75 (28%), Positives = 33/75 (44%), Gaps = 4/75 (5%)
Frame = +3
Query: 456 ETLYILDPNRKSEAVKLIEESTNNIV----PRNGALGPIREWKLKDCIAVHKLLGTVLLD 511
+TLY D + EA+ + + N I +N LG + L+DC+ G LLD
Sbjct: 39 QTLYFADKIKTEEAICELLKGLNYICRYEQQQNALLGCASSFNLEDCMKWKLQCGDSLLD 218
Query: 512 QDAALRWKVRCAEYF 526
+ +C E+F
Sbjct: 219 *THSRYCNPQCGEHF 263
>BE248078 similar to GP|14334774|gb| unknown protein {Arabidopsis thaliana},
partial (24%)
Length = 392
Score = 29.6 bits (65), Expect = 3.4
Identities = 22/72 (30%), Positives = 34/72 (46%), Gaps = 6/72 (8%)
Frame = +3
Query: 285 KLHDSPPKSTAEEDEEMSKLLPAQKKKMRQKQRKAEA------RAKKGAEEKNEELSASG 338
K+HD K E++ K+ QKK++R+ +R + +AK A K +ELS S
Sbjct: 120 KIHDLNSKL-----EDVQKINEEQKKQIRKTERALKVAEEEMLKAKLEATTKAKELSESV 284
Query: 339 VSKSGKRHVKPV 350
H KP+
Sbjct: 285 AESHWNEHGKPL 320
>AW694919 similar to GP|21554403|gb transcription factor TINY putative
{Arabidopsis thaliana}, partial (53%)
Length = 570
Score = 29.6 bits (65), Expect = 3.4
Identities = 23/96 (23%), Positives = 37/96 (37%)
Frame = -2
Query: 267 HSHSYFHKAAAGAIRCYIKLHDSPPKSTAEEDEEMSKLLPAQKKKMRQKQRKAEARAKKG 326
H H ++H+ CY P + + L P Q + Q + AE G
Sbjct: 347 HHHHHYHQN-----HCYY------PPAKQQP*PSYELL*PPQLEYP*QYTKTAEEAPADG 201
Query: 327 AEEKNEELSASGVSKSGKRHVKPVDPDPHGEKLLQV 362
A + +S + ++H KP+ P P EK L +
Sbjct: 200 ALHQENSAVLYNLSNNMQQHHKPLQPSPAPEKTLAI 93
>TC88705 similar to GP|10177834|dbj|BAB11263. gene_id:MCD7.9~unknown protein
{Arabidopsis thaliana}, partial (32%)
Length = 1685
Score = 29.3 bits (64), Expect = 4.4
Identities = 23/87 (26%), Positives = 39/87 (44%), Gaps = 19/87 (21%)
Frame = +1
Query: 292 KSTAEEDEEMSKLLPAQKKKMRQKQRKAEARAKKGAEEKNEELSASGVSKSGKRHV---- 347
K ++ EE +L QK+K +++++AE +A + + NEE +G+ ++H
Sbjct: 34 KEQIDKAEEKERL---QKEKEEKQKKEAEEKANEKQVKTNEE--DTGIENEAEKHSDIED 198
Query: 348 ---------------KPVDPDPHGEKL 359
PVD D GEKL
Sbjct: 199 NFAASIHEKIEVKEDSPVDQDEAGEKL 279
>CA920908 homologue to PIR|T06377|T06 SAR DNA-binding protein-1 - garden pea,
partial (37%)
Length = 748
Score = 28.9 bits (63), Expect = 5.7
Identities = 12/40 (30%), Positives = 25/40 (62%)
Frame = -2
Query: 292 KSTAEEDEEMSKLLPAQKKKMRQKQRKAEARAKKGAEEKN 331
KS + DE+ +L A KKK +++++K E + ++ ++ N
Sbjct: 198 KSDSAMDEDTEELSAADKKKEKKEKKKKEKKEEEDTQKSN 79
>BM779470 similar to GP|17225592|gb serine acetyltransferase {Arabidopsis
thaliana}, partial (45%)
Length = 810
Score = 28.5 bits (62), Expect = 7.5
Identities = 19/60 (31%), Positives = 32/60 (52%)
Frame = +1
Query: 352 PDPHGEKLLQVDDPLSEAIKYLKLLQKNSPDSLETHLLSFELYTRKQKVLLTFQAVKQLL 411
PDP E +L V DP+ EA+K Q+ ++ + +LS LY +L + ++Q+L
Sbjct: 286 PDPDPESILDVSDPIWEAVK-----QEAKHEAEKEPVLSSFLYAS----VLAHECLEQVL 438
>TC85442 similar to GP|13543783|gb|AAH06040.1 Unknown (protein for MGC:7642)
{Mus musculus}, partial (62%)
Length = 585
Score = 28.1 bits (61), Expect = 9.8
Identities = 11/25 (44%), Positives = 18/25 (72%)
Frame = +1
Query: 308 QKKKMRQKQRKAEARAKKGAEEKNE 332
+KKK R+K+++ + R KKG KN+
Sbjct: 100 RKKKKRKKKQRRKKRRKKGKRRKNQ 174
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.318 0.133 0.383
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 17,101,244
Number of Sequences: 36976
Number of extensions: 233493
Number of successful extensions: 1265
Number of sequences better than 10.0: 26
Number of HSP's better than 10.0 without gapping: 1247
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1263
length of query: 583
length of database: 9,014,727
effective HSP length: 101
effective length of query: 482
effective length of database: 5,280,151
effective search space: 2545032782
effective search space used: 2545032782
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 61 (28.1 bits)
Lotus: description of TM0226.2