
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0253.3
(594 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
BF637357 similar to GP|13365578|db contains ESTs AU088719(S4716)... 260 2e-81
BF646565 weakly similar to GP|20259245|gb putative N-terminal ac... 280 1e-75
TC79901 weakly similar to GP|20259245|gb|AAM14358.1 putative N-t... 223 4e-58
TC89875 similar to GP|20259245|gb|AAM14358.1 putative N-terminal... 90 3e-18
BQ146979 weakly similar to GP|20259245|gb| putative N-terminal a... 41 4e-08
TC81541 similar to GP|22597168|gb|AAN03471.1 unknown protein {Gl... 32 0.53
TC84723 similar to GP|16945387|emb|CAD11800. conserved hypotheti... 32 0.69
TC83485 weakly similar to GP|10176757|dbj|BAB09988. ATP-dependen... 32 0.69
TC77763 similar to GP|15215674|gb|AAK91382.1 AT4g27500/F27G19_10... 31 1.5
TC91327 similar to GP|22597168|gb|AAN03471.1 unknown protein {Gl... 31 1.5
TC83583 similar to PIR|T06379|T06379 SAR DNA-binding protein 2 -... 31 1.5
TC93600 30 3.4
CA990769 weakly similar to GP|21594005|gb| unknown {Arabidopsis ... 30 3.4
CA920908 homologue to PIR|T06377|T06 SAR DNA-binding protein-1 -... 30 3.4
TC86288 similar to GP|2407790|gb|AAB70660.1| grr1 {Glycine max},... 30 3.4
TC87578 similar to PIR|T12113|T12113 transcription factor - fava... 30 3.4
TC89535 weakly similar to PIR|A96808|A96808 hypothetical protein... 29 4.5
BQ751814 similar to GP|18043248|gb| RIKEN cDNA 4833422F24 gene {... 29 4.5
TC91772 weakly similar to GP|17380998|gb|AAL36311.1 unknown prot... 29 4.5
TC80066 similar to PIR|T50518|T50518 ABC transporter-like protei... 29 5.8
>BF637357 similar to GP|13365578|db contains ESTs AU088719(S4716)
C98685(E1389) AU066116(S4716)~hypothetical
protein~similar to, partial (21%)
Length = 582
Score = 260 bits (664), Expect(2) = 2e-81
Identities = 133/170 (78%), Positives = 140/170 (82%)
Frame = +2
Query: 245 LKFQDRLHSHVYFHKAAVGAIRCYIKLHDSPPKSTAEEDDDMSKLLPSQKKKLRQKQRKA 304
L F+ SH YFHKAA GAIRCYIKLHD PPKST EED+ MS LLPSQKKKLRQKQRKA
Sbjct: 71 LNFKTNCTSHPYFHKAAAGAIRCYIKLHDFPPKSTTEEDEHMSNLLPSQKKKLRQKQRKA 250
Query: 305 EARAKKEAEEKNEESSASGISKSGKRQAKPVDPDPRGEKLLQVEDPLLEATKYLKLLQKN 364
EARAKKEAEEKNEE ++S +SKSGKR KPVDPDP GEKLLQVEDPL EA KYLKLLQKN
Sbjct: 251 EARAKKEAEEKNEELNSSVVSKSGKRPVKPVDPDPHGEKLLQVEDPLSEAVKYLKLLQKN 430
Query: 365 SPDSLETHLLSFELHMRRQKILLAFQAVKQLLRLDAEHPDSHRCLIKFFH 414
SPDSLETHLLSFEL+ R+ KILLAFQAV PDSHRCLIKFFH
Sbjct: 431 SPDSLETHLLSFELYTRKXKILLAFQAVSSY*GWMLTTPDSHRCLIKFFH 580
Score = 61.6 bits (148), Expect(2) = 2e-81
Identities = 26/29 (89%), Positives = 29/29 (99%)
Frame = +1
Query: 224 DQFDFHSYCLRKMTLRTYVEMLKFQDRLH 252
DQFDFHSYCLRKMTLR+YV+MLKFQD+LH
Sbjct: 7 DQFDFHSYCLRKMTLRSYVDMLKFQDQLH 93
>BF646565 weakly similar to GP|20259245|gb putative N-terminal
acetyltransferase {Arabidopsis thaliana}, partial (19%)
Length = 614
Score = 280 bits (716), Expect = 1e-75
Identities = 140/206 (67%), Positives = 168/206 (80%), Gaps = 5/206 (2%)
Frame = +1
Query: 394 QLLRLDAEHPDSHRCLIKFFHKVGSLNTPVNDSEKLVWSVVEAERQTISQLQGKSLFETN 453
QLLRLDA+HPDSHRCLIKFFH++GS++TPV +SEKL+WSV+EAER TISQL KSLFE N
Sbjct: 1 QLLRLDADHPDSHRCLIKFFHQLGSMSTPVTESEKLIWSVLEAERSTISQLHEKSLFEAN 180
Query: 454 KSFLEKYEDSLMHRAAFGEMMYVLDPSRRSEAVKIIEGSTNSPVSRNGT-----NWKLED 508
+F + ++DSLMHRAAF E++Y+LD +R+SEAVK+IE S N+ V RNG WKLED
Sbjct: 181 NAFHDNHKDSLMHRAAFAEILYILDSNRKSEAVKLIEDSVNNTVPRNGAIGPIGEWKLED 360
Query: 509 CIEVHKLLGTVLVDKDAALRWKVRCAEIFPYSVYFEGRHSSASPNSALNQICKKSFENGS 568
CI VHKLLGTVLVD+DAALRWKVRCAE FPYS YFEGRHSSASPNSA +Q+ +K+ EN
Sbjct: 361 CIAVHKLLGTVLVDQDAALRWKVRCAEYFPYSTYFEGRHSSASPNSAFSQL-RKNSENDG 537
Query: 569 SSHSVGNCKVESITSNGKLDTFKDLT 594
+HSV N V S TSNG+ F++LT
Sbjct: 538 PNHSVDNQNVGSTTSNGR--AFENLT 609
>TC79901 weakly similar to GP|20259245|gb|AAM14358.1 putative N-terminal
acetyltransferase {Arabidopsis thaliana}, partial (11%)
Length = 965
Score = 223 bits (567), Expect(2) = 4e-58
Identities = 113/154 (73%), Positives = 127/154 (82%), Gaps = 5/154 (3%)
Frame = +1
Query: 446 GKSLFETNKSFLEKYEDSLMHRAAFGEMMYVLDPSRRSEAVKIIEGSTNSPVSRNGT--- 502
GKSL E N FLEK+E S+MHRAAFGEMMY+LDP+RR+EAVK+IEGSTN+PVS NG
Sbjct: 37 GKSLLEANSLFLEKHEGSMMHRAAFGEMMYILDPNRRAEAVKLIEGSTNNPVSSNGALGP 216
Query: 503 --NWKLEDCIEVHKLLGTVLVDKDAALRWKVRCAEIFPYSVYFEGRHSSASPNSALNQIC 560
W L+DCI VHKLLG+VL D+DAALRWKVRCAE FPYS YFEG SSASPNSALNQIC
Sbjct: 217 IREWTLKDCIAVHKLLGSVLDDQDAALRWKVRCAEFFPYSTYFEGSQSSASPNSALNQIC 396
Query: 561 KKSFENGSSSHSVGNCKVESITSNGKLDTFKDLT 594
K + NGSSSHS G+ VES+TSNGKL +FKDLT
Sbjct: 397 KTTI-NGSSSHSPGDNIVESVTSNGKLASFKDLT 495
Score = 20.8 bits (42), Expect(2) = 4e-58
Identities = 9/10 (90%), Positives = 9/10 (90%)
Frame = +3
Query: 435 EAERQTISQL 444
E ERQTISQL
Sbjct: 3 EVERQTISQL 32
>TC89875 similar to GP|20259245|gb|AAM14358.1 putative N-terminal
acetyltransferase {Arabidopsis thaliana}, partial (38%)
Length = 1170
Score = 89.7 bits (221), Expect = 3e-18
Identities = 43/53 (81%), Positives = 48/53 (90%)
Frame = +1
Query: 1 RIPLDFLQGDRFREAADSYMRPLLTKGVPSLFSDLSSLYDHPGKANILEQLIL 53
RIPLDFLQGD+FREAA++Y+RPLLTKGVPSLFSDLSSLY H GKA+ILE L
Sbjct: 1012 RIPLDFLQGDKFREAAENYIRPLLTKGVPSLFSDLSSLYTHSGKADILEHSFL 1170
>BQ146979 weakly similar to GP|20259245|gb| putative N-terminal
acetyltransferase {Arabidopsis thaliana}, partial (3%)
Length = 644
Score = 41.2 bits (95), Expect(2) = 4e-08
Identities = 18/24 (75%), Positives = 21/24 (87%)
Frame = +3
Query: 504 WKLEDCIEVHKLLGTVLVDKDAAL 527
W L+DCI VHKLLG+VL D+DAAL
Sbjct: 72 WTLKDCIAVHKLLGSVLDDQDAAL 143
Score = 34.3 bits (77), Expect(2) = 4e-08
Identities = 14/19 (73%), Positives = 16/19 (83%)
Frame = +1
Query: 486 VKIIEGSTNSPVSRNGTNW 504
VK+IEGSTN+PVS NG W
Sbjct: 1 VKLIEGSTNNPVSSNGAPW 57
>TC81541 similar to GP|22597168|gb|AAN03471.1 unknown protein {Glycine max},
partial (36%)
Length = 927
Score = 32.3 bits (72), Expect = 0.53
Identities = 22/63 (34%), Positives = 26/63 (40%)
Frame = +3
Query: 277 KSTAEEDDDMSKLLPSQKKKLRQKQRKAEARAKKEAEEKNEESSASGISKSGKRQAKPVD 336
KS D K +KKK +K + AE KKE E+K E K K KP
Sbjct: 525 KSIEIVDPPKPKAAEPEKKKEAEKPKSAEPEKKKEPEKKKEGDK----PKPAKEAEKPKA 692
Query: 337 PDP 339
DP
Sbjct: 693 ADP 701
>TC84723 similar to GP|16945387|emb|CAD11800. conserved hypothetical protein
{Neurospora crassa}, partial (5%)
Length = 999
Score = 32.0 bits (71), Expect = 0.69
Identities = 29/131 (22%), Positives = 53/131 (40%), Gaps = 3/131 (2%)
Frame = +3
Query: 191 ELASAESYFRQGDLGLALKKFLAVEKHYTDITEDQFDFHSYCLRKMTLRTYVEMLKFQDR 250
EL + Y + +L A + + I+E++ L++ VE+ K+Q+
Sbjct: 561 ELRVQKEYREEAELQRAKRDLDEIRNREARISEEKH-----------LKSQVELQKYQEE 707
Query: 251 LH---SHVYFHKAAVGAIRCYIKLHDSPPKSTAEEDDDMSKLLPSQKKKLRQKQRKAEAR 307
+ K A A+ + K +E + K + ++ RQK+ A+
Sbjct: 708 MRLAEKAKQREKEAADAVERFQKKEQERLAKQEKEKQEREKEFQRRLQEERQKELDRIAK 887
Query: 308 AKKEAEEKNEE 318
AKKE EE +E
Sbjct: 888 AKKEKEENEKE 920
>TC83485 weakly similar to GP|10176757|dbj|BAB09988. ATP-dependent RNA
helicase-like protein {Arabidopsis thaliana}, partial
(10%)
Length = 375
Score = 32.0 bits (71), Expect = 0.69
Identities = 22/78 (28%), Positives = 37/78 (47%), Gaps = 1/78 (1%)
Frame = +1
Query: 294 KKKLRQKQRKAEARAKKEAEEKN-EESSASGISKSGKRQAKPVDPDPRGEKLLQVEDPLL 352
+ K R+KQRK + KKEA+EK + + K R ++ ++ E+ L
Sbjct: 130 RDKSREKQRKKNLQVKKEAKEKEPKPKKPKKTPEVPTVMRKQTAKQRRAKQTVEDEEELT 309
Query: 353 EATKYLKLLQKNSPDSLE 370
+ + LK L+K + D E
Sbjct: 310 QEYRLLKKLKKGTIDEDE 363
>TC77763 similar to GP|15215674|gb|AAK91382.1 AT4g27500/F27G19_100
{Arabidopsis thaliana}, partial (38%)
Length = 2297
Score = 30.8 bits (68), Expect = 1.5
Identities = 19/49 (38%), Positives = 28/49 (56%), Gaps = 1/49 (2%)
Frame = +1
Query: 281 EEDDDMSKLLPSQKKKLRQKQR-KAEARAKKEAEEKNEESSASGISKSG 328
EE+ +KL +KKK+ +K KA +A+KEAE+K + KSG
Sbjct: 1612 EEEIAKAKLAAERKKKMAEKAAAKAALKAQKEAEKKLKVLEKKAKKKSG 1758
>TC91327 similar to GP|22597168|gb|AAN03471.1 unknown protein {Glycine max},
partial (70%)
Length = 738
Score = 30.8 bits (68), Expect = 1.5
Identities = 21/93 (22%), Positives = 35/93 (37%), Gaps = 5/93 (5%)
Frame = +1
Query: 263 GAIRCYIKLHDSPPKSTAEEDDDMSKLLPSQKKKLRQKQRKAEARAKKEAEEKNEESSAS 322
GAI+ + PP E + + P + +K ++ ++ KE E+ E
Sbjct: 250 GAIKSIEIVEPPPPPKPKEPEKPKEPVKPKEPEKPKEPEKPKNPEKPKEPEKPKEPEKPK 429
Query: 323 GISK-----SGKRQAKPVDPDPRGEKLLQVEDP 350
K K + P P+P+ E Q E P
Sbjct: 430 EPEKPKEPEKPKEKPAPPPPEPKPEPPKQPEKP 528
>TC83583 similar to PIR|T06379|T06379 SAR DNA-binding protein 2 - garden
pea, partial (33%)
Length = 852
Score = 30.8 bits (68), Expect = 1.5
Identities = 15/58 (25%), Positives = 32/58 (54%)
Frame = +2
Query: 276 PKSTAEEDDDMSKLLPSQKKKLRQKQRKAEARAKKEAEEKNEESSASGISKSGKRQAK 333
P S + D+D + + KKK ++++++ + + KKE E ++ E + K K++ K
Sbjct: 347 PMSNSAMDEDTPEPPVTDKKKEKKEKKEKKKKEKKEEEVEDVEEPEEEVVKKEKKKKK 520
>TC93600
Length = 703
Score = 29.6 bits (65), Expect = 3.4
Identities = 16/31 (51%), Positives = 21/31 (67%), Gaps = 1/31 (3%)
Frame = +3
Query: 294 KKKLRQKQRKAEARAKKEAEEKN-EESSASG 323
+KK RQ +R+A+A AKKE K EES +G
Sbjct: 21 RKKKRQSEREADAAAKKEHIHKRVEESDTNG 113
>CA990769 weakly similar to GP|21594005|gb| unknown {Arabidopsis thaliana},
partial (20%)
Length = 488
Score = 29.6 bits (65), Expect = 3.4
Identities = 19/67 (28%), Positives = 32/67 (47%), Gaps = 5/67 (7%)
Frame = +1
Query: 276 PKSTAEEDDDMSKLLPSQKKKLRQKQRKAEARAKKEAE-----EKNEESSASGISKSGKR 330
PKS D +K+ K++ + + K E + +++AE EK EES + K
Sbjct: 136 PKSELLLDTSYTKMGEEAKQEQPKVEEKQEEKKEEKAEGGKEEEKKEESKEEKPEEEKKE 315
Query: 331 QAKPVDP 337
+ KP +P
Sbjct: 316 EPKPPEP 336
>CA920908 homologue to PIR|T06377|T06 SAR DNA-binding protein-1 - garden pea,
partial (37%)
Length = 748
Score = 29.6 bits (65), Expect = 3.4
Identities = 16/62 (25%), Positives = 34/62 (54%), Gaps = 7/62 (11%)
Frame = -2
Query: 277 KSTAEEDDDMSKLLPSQKKKLRQKQRKAEARAKKEAEEKN-------EESSASGISKSGK 329
KS + D+D +L + KKK +++++K E + +++ ++ N +E SA+ K K
Sbjct: 198 KSDSAMDEDTEELSAADKKKEKKEKKKKEKKEEEDTQKSNTAMDEDTQEPSAADKKKEKK 19
Query: 330 RQ 331
+
Sbjct: 18 EK 13
>TC86288 similar to GP|2407790|gb|AAB70660.1| grr1 {Glycine max}, partial
(84%)
Length = 2900
Score = 29.6 bits (65), Expect = 3.4
Identities = 15/43 (34%), Positives = 24/43 (54%)
Frame = -1
Query: 277 KSTAEEDDDMSKLLPSQKKKLRQKQRKAEARAKKEAEEKNEES 319
K +E+ + L +KKK R+++R RAKK+ E+ ES
Sbjct: 140 KRNLKEEGKRRRTLEREKKKERERERTL*DRAKKKKGERKNES 12
>TC87578 similar to PIR|T12113|T12113 transcription factor - fava bean,
partial (49%)
Length = 1021
Score = 29.6 bits (65), Expect = 3.4
Identities = 22/74 (29%), Positives = 39/74 (51%), Gaps = 3/74 (4%)
Frame = +2
Query: 280 AEEDDDMSKLLPS--QKKKLRQKQRKAEARAKKEAEEK-NEESSASGISKSGKRQAKPVD 336
AE+DD+ S S Q + E AKKE ++ + ++SAS + + K+++K D
Sbjct: 560 AEKDDEGSPTDDSGADDSDASQSGDEKEIPAKKEPKKDLSSKASASTSTSTSKKKSKDAD 739
Query: 337 PDPRGEKLLQVEDP 350
D + +K + +DP
Sbjct: 740 EDGKKKKQKKKKDP 781
>TC89535 weakly similar to PIR|A96808|A96808 hypothetical protein T32E8.13
[imported] - Arabidopsis thaliana, partial (9%)
Length = 1729
Score = 29.3 bits (64), Expect = 4.5
Identities = 14/43 (32%), Positives = 25/43 (57%)
Frame = -1
Query: 319 SSASGISKSGKRQAKPVDPDPRGEKLLQVEDPLLEATKYLKLL 361
++ + S SG R+ + D D R ++++E+P LE + KLL
Sbjct: 325 AAVASFSGSGHRRRQ*EDADERKHSIIEIENPNLERERESKLL 197
>BQ751814 similar to GP|18043248|gb| RIKEN cDNA 4833422F24 gene {Mus
musculus}, partial (4%)
Length = 531
Score = 29.3 bits (64), Expect = 4.5
Identities = 25/66 (37%), Positives = 33/66 (49%), Gaps = 5/66 (7%)
Frame = -1
Query: 285 DMSKLLPSQKKKLRQKQRKAEARAKKEAEE-----KNEESSASGISKSGKRQAKPVDPDP 339
++ K L S KK+L QK E +KEA+ KNE+ + G K G + KPV
Sbjct: 216 ELKKTLKSHKKQL-QKAATEETAEQKEAKAQREADKNEKGAKKG-KKGGVKSEKPV--GN 49
Query: 340 RGEKLL 345
G KLL
Sbjct: 48 LGSKLL 31
>TC91772 weakly similar to GP|17380998|gb|AAL36311.1 unknown protein
{Arabidopsis thaliana}, partial (17%)
Length = 1237
Score = 29.3 bits (64), Expect = 4.5
Identities = 21/72 (29%), Positives = 33/72 (45%), Gaps = 6/72 (8%)
Frame = +2
Query: 269 IKLHDSPPKSTAEEDDDMSKLLPSQ------KKKLRQKQRKAEARAKKEAEEKNEESSAS 322
IK S PK+ E D PS+ + K R + +K ++ AK+ A E + +
Sbjct: 353 IKAKISSPKAGGLESDAAGSPSPSESNHDENRSKKRVRTKKNDSSAKEVAAEDISKKVSE 532
Query: 323 GISKSGKRQAKP 334
G S S + A+P
Sbjct: 533 GTSDSKVKPARP 568
>TC80066 similar to PIR|T50518|T50518 ABC transporter-like protein -
Arabidopsis thaliana, partial (14%)
Length = 992
Score = 28.9 bits (63), Expect = 5.8
Identities = 17/48 (35%), Positives = 22/48 (45%)
Frame = +3
Query: 457 LEKYEDSLMHRAAFGEMMYVLDPSRRSEAVKIIEGSTNSPVSRNGTNW 504
LE+Y DS E+ LD + V+ E +SPV NG NW
Sbjct: 21 LEQYSDS--------EVWEALDKCQLGHLVRAKEEKLDSPVVENGDNW 140
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.317 0.132 0.378
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 17,460,402
Number of Sequences: 36976
Number of extensions: 237840
Number of successful extensions: 1328
Number of sequences better than 10.0: 48
Number of HSP's better than 10.0 without gapping: 1306
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1320
length of query: 594
length of database: 9,014,727
effective HSP length: 102
effective length of query: 492
effective length of database: 5,243,175
effective search space: 2579642100
effective search space used: 2579642100
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 61 (28.1 bits)
Lotus: description of TM0253.3