
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0223.4
(336 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
CB895143 65 4e-11
AL369112 weakly similar to GP|13540393|gb| histone H1 {Pisum sat... 32 0.26
TC82640 similar to GP|10177002|dbj|BAB10252. gene_id:K9E15.9~unk... 32 0.45
TC88261 similar to PIR|S22697|S22697 extensin - Volvox carteri (... 31 0.77
TC88259 similar to GP|6729037|gb|AAF27033.1| unknown protein {Ar... 30 1.0
TC79086 similar to GP|23499033|emb|CAD51113. ubiquitin-protein l... 30 1.0
AW584244 similar to GP|8886934|gb F2D10.25 {Arabidopsis thaliana... 30 1.3
TC93452 similar to PIR|D84902|D84902 probable WRKY-type DNA bind... 27 2.2
TC78476 homologue to SP|O22437|CHLD_PEA Magnesium-chelatase subu... 29 2.9
TC92524 similar to GP|19698943|gb|AAL91207.1 unknown protein {Ar... 29 2.9
TC79681 similar to GP|20453116|gb|AAM19800.1 AT5g22640/MDJ22_6 {... 29 2.9
TC89852 similar to GP|15620063|gb|AAL03490.1 unknown {Rickettsia... 28 3.8
CA920710 similar to PIR|S64314|S643 probable membrane protein YG... 28 5.0
TC92719 similar to PIR|B81013|B81013 PTS system IIAB component ... 28 5.0
BQ141666 28 6.5
TC84505 similar to PIR|T48442|T48442 hypothetical protein T32M21... 28 6.5
TC80581 similar to GP|19683021|gb|AAL92644.1 hypothetical protei... 27 8.5
BQ135850 27 8.5
TC86789 similar to PIR|A84594|A84594 hypothetical protein At2g20... 27 8.5
>CB895143
Length = 770
Score = 65.1 bits (157), Expect = 4e-11
Identities = 39/177 (22%), Positives = 86/177 (48%)
Frame = +2
Query: 56 TRWRVKDDLLLVQSWLNISKDPTVGTDQTAAKFWDRIRDQFDEYRDFDTPPRTGKMLKCR 115
+ W + +L+L+ W+ + VG +Q + +I D F+E+ F+ PP +
Sbjct: 203 SNWNIDQNLILISGWIKYGTNSVVGRNQKCDSY*GKIVDYFNEHCSFN-PPCDAAPCRNH 379
Query: 116 FGKLSKDIQFFTGCYNKVTTPWKSGHSEKDIMAEAHALFQVDHKKDFTHENVWRMVKDEP 175
+ ++K + + Y+ +SG S+ D++A+AH L+ + F + W +V+D+P
Sbjct: 380 YNYMNKILNKWIDAYDGAKRMQQSG*SKIDVLAKAHELYSSGKSEHFILISGWFVVRDQP 559
Query: 176 KWKGQSMKTNSRGQKKSGAGADGTSTDPSASIDCDEYEATPPTTRPKGKKAEKRKAK 232
++ +S+ +G+G+ G+ + D + + ++ P G+ A K++ K
Sbjct: 560 RY-------DSQVGGNTGSGSSGSKR--AHKSDASDSNSVGSSSCPMGRDAAKKR*K 703
>AL369112 weakly similar to GP|13540393|gb| histone H1 {Pisum sativum},
partial (20%)
Length = 371
Score = 32.3 bits (72), Expect = 0.26
Identities = 23/77 (29%), Positives = 30/77 (38%)
Frame = +2
Query: 178 KGQSMKTNSRGQKKSGAGADGTSTDPSASIDCDEYEATPPTTRPKGKKAEKRKAKTTDTA 237
K + K + KK+ AGA D A+ E A P + K AEK AK D
Sbjct: 83 KAAAKKKAAAAAKKADAGAKDAKADTKAAEKPAEKAAEKPAEKAAEKPAEKAAAKKEDKK 262
Query: 238 SSTLSFVPHPDVLAMGK 254
+ + P D GK
Sbjct: 263 DAKPAAKPKADDKKKGK 313
>TC82640 similar to GP|10177002|dbj|BAB10252. gene_id:K9E15.9~unknown
protein {Arabidopsis thaliana}, partial (22%)
Length = 894
Score = 31.6 bits (70), Expect = 0.45
Identities = 22/76 (28%), Positives = 34/76 (43%)
Frame = +1
Query: 136 PWKSGHSEKDIMAEAHALFQVDHKKDFTHENVWRMVKDEPKWKGQSMKTNSRGQKKSGAG 195
PWKSG+SE +V K H+++W +DE K + + +K GQK + A
Sbjct: 244 PWKSGNSEN----------EVSLKDLLMHKDMW---EDEDKTRIELLKLLKTGQKSAPAN 384
Query: 196 ADGTSTDPSASIDCDE 211
+ S I+ E
Sbjct: 385 SAAKPEPISKDIEVSE 432
>TC88261 similar to PIR|S22697|S22697 extensin - Volvox carteri (fragment),
partial (7%)
Length = 1516
Score = 30.8 bits (68), Expect = 0.77
Identities = 28/76 (36%), Positives = 37/76 (47%), Gaps = 7/76 (9%)
Frame = +2
Query: 10 SQAPPTGSTGSKVSDTQCEATPDDTQHEGLDDID--LEDEDQSSGKKRTR-----WRVKD 62
S PT G +DT TP+D +H L+D+D LE ED S K + V+
Sbjct: 65 SPVDPTPKFGGSETDT---VTPNDKRHCILEDVDGELEMEDVSGHPKDEKPVXLDSSVET 235
Query: 63 DLLLVQSWLNISKDPT 78
D+LL S N + DPT
Sbjct: 236 DMLLQSS--NRNLDPT 277
>TC88259 similar to GP|6729037|gb|AAF27033.1| unknown protein {Arabidopsis
thaliana}, partial (35%)
Length = 1757
Score = 30.4 bits (67), Expect = 1.0
Identities = 17/44 (38%), Positives = 24/44 (53%)
Frame = +3
Query: 255 AKMEMMANFREIRNRELDLQQADQQLKQSELQLRQEELKFKKAE 298
+K E A+ R R L+ + + Q ELQ R EEL+F+ AE
Sbjct: 786 SKAEAAASIRSDLQRRLEAVEKENSSLQLELQSRLEELEFRIAE 917
>TC79086 similar to GP|23499033|emb|CAD51113. ubiquitin-protein ligase 1
putative {Plasmodium falciparum 3D7}, partial (0%)
Length = 1418
Score = 30.4 bits (67), Expect = 1.0
Identities = 18/70 (25%), Positives = 30/70 (42%), Gaps = 4/70 (5%)
Frame = +2
Query: 171 VKDEPKWKGQSMKTNSRGQKKSGAGADGTSTDPSASIDCDEYEAT----PPTTRPKGKKA 226
VKD+P + ++ + +++ + A +S + E EA PP T PK
Sbjct: 74 VKDDPPPQEDKIEQHDENDEETSSEAGSSSEGGEEEEESSEEEAPKTTPPPKTNPKNNSV 253
Query: 227 EKRKAKTTDT 236
E + TDT
Sbjct: 254 EDEDEEETDT 283
>AW584244 similar to GP|8886934|gb F2D10.25 {Arabidopsis thaliana}, partial
(12%)
Length = 591
Score = 30.0 bits (66), Expect = 1.3
Identities = 17/55 (30%), Positives = 33/55 (59%), Gaps = 5/55 (9%)
Frame = +1
Query: 261 ANFREIRNRELDLQQADQQLKQSE-----LQLRQEELKFKKAENFRAYMDILNKN 310
A FR+I+ R+++LQQA +++Q LQ+R + ++ + FRA+ + K+
Sbjct: 412 AKFRDIQERKVELQQAIVKMEQGGSADGILQVRADRIQSDLEQLFRAFDERCKKH 576
>TC93452 similar to PIR|D84902|D84902 probable WRKY-type DNA binding protein
[imported] - Arabidopsis thaliana, partial (10%)
Length = 488
Score = 26.6 bits (57), Expect(2) = 2.2
Identities = 17/59 (28%), Positives = 26/59 (43%), Gaps = 7/59 (11%)
Frame = +1
Query: 130 YNKVTTPWKSGHSEKDIMAEAHALFQVDHKK-------DFTHENVWRMVKDEPKWKGQS 181
Y K T G + M ++H+ F + K +F H +++ K PKWK QS
Sbjct: 241 YQKALTMLNVGDGK---MKDSHSSFTNESPKSEVIIDQEFNHNALFKKRKSMPKWKEQS 408
Score = 21.2 bits (43), Expect(2) = 2.2
Identities = 10/23 (43%), Positives = 13/23 (56%)
Frame = +3
Query: 116 FGKLSKDIQFFTGCYNKVTTPWK 138
F K +D Q T YN+ +PWK
Sbjct: 87 FTKNGRDYQEKT**YNQRISPWK 155
>TC78476 homologue to SP|O22437|CHLD_PEA Magnesium-chelatase subunit chlD
chloroplast precursor (Mg- protoporphyrin IX chelatase),
partial (68%)
Length = 1596
Score = 28.9 bits (63), Expect = 2.9
Identities = 15/47 (31%), Positives = 22/47 (45%)
Frame = +2
Query: 3 PPQYQFQSQAPPTGSTGSKVSDTQCEATPDDTQHEGLDDIDLEDEDQ 49
PP+ Q Q PP + ++ Q E + + E DD D E E+Q
Sbjct: 476 PPEQQNQPPPPPPPPQNQESNEEQNEEEDKEEEEEEEDDKDEEKEEQ 616
>TC92524 similar to GP|19698943|gb|AAL91207.1 unknown protein {Arabidopsis
thaliana}, partial (38%)
Length = 721
Score = 28.9 bits (63), Expect = 2.9
Identities = 21/57 (36%), Positives = 27/57 (46%)
Frame = +3
Query: 270 ELDLQQADQQLKQSELQLRQEELKFKKAENFRAYMDILNKNTSGMNDEELRTHNALR 326
EL QQ Q+LKQ L+L+ ELK K+ E A K + + L T LR
Sbjct: 15 ELSEQQYAQKLKQKSLELQIAELKIKQHEERSAQEQSQMKLYAEQVSQLLATEKNLR 185
>TC79681 similar to GP|20453116|gb|AAM19800.1 AT5g22640/MDJ22_6 {Arabidopsis
thaliana}, partial (42%)
Length = 1682
Score = 28.9 bits (63), Expect = 2.9
Identities = 23/101 (22%), Positives = 40/101 (38%), Gaps = 1/101 (0%)
Frame = +1
Query: 7 QFQSQAPPTGSTGSKVSDTQCEATPDDTQHEGLDDIDLEDEDQSSGKKRTRWRVKDDLLL 66
+FQ PP S SD+ +++ D +D+D+E S + T V D
Sbjct: 40 RFQKPPPPPTSDSKNPSDSDSDSSSDAESETPQEDLDIESYLNSFIAQNTEAPVPDSEFD 219
Query: 67 VQSWLNISKDPTVGTDQTAAKFWD-RIRDQFDEYRDFDTPP 106
+ N + K+++ +IR + + FD PP
Sbjct: 220 PEQ--NFKGVIEAYHSEKVEKYYEKKIRRKVQHHNPFDFPP 336
>TC89852 similar to GP|15620063|gb|AAL03490.1 unknown {Rickettsia conorii},
partial (22%)
Length = 1001
Score = 28.5 bits (62), Expect = 3.8
Identities = 19/63 (30%), Positives = 35/63 (55%)
Frame = +3
Query: 18 TGSKVSDTQCEATPDDTQHEGLDDIDLEDEDQSSGKKRTRWRVKDDLLLVQSWLNISKDP 77
TGSK ++T +D + +G+D + DE++S GKK+ + V + ++ + N + P
Sbjct: 306 TGSK----DAKSTINDQKAQGVDSKN--DEEKSKGKKKKKSNVVSESVVDNAEDNQLESP 467
Query: 78 TVG 80
T G
Sbjct: 468 TCG 476
>CA920710 similar to PIR|S64314|S643 probable membrane protein YGR023w -
yeast (Saccharomyces cerevisiae), partial (7%)
Length = 774
Score = 28.1 bits (61), Expect = 5.0
Identities = 15/47 (31%), Positives = 25/47 (52%), Gaps = 3/47 (6%)
Frame = -2
Query: 118 KLSKDIQFFTGC---YNKVTTPWKSGHSEKDIMAEAHALFQVDHKKD 161
++S +I +F+ + V TPWKSG+ E + AE + D +D
Sbjct: 467 RISNEIHWFSAMDYVESLVPTPWKSGYIEAEAEAEEEEEEEKDEDED 327
>TC92719 similar to PIR|B81013|B81013 PTS system IIAB component NMB2046
[imported] - Neisseria meningitidis (strain MC58
serogroup B), partial (9%)
Length = 629
Score = 28.1 bits (61), Expect = 5.0
Identities = 20/66 (30%), Positives = 30/66 (45%), Gaps = 5/66 (7%)
Frame = +2
Query: 188 GQKKSGAGADGTSTDP--SASIDCDEYEATPPTTRPKGKKAEKRKA---KTTDTASSTLS 242
G KK D +P S + D+ ATP +T+P + +K KA + S + S
Sbjct: 104 GSKKRNNAIDSQEKEPLNKKSKNNDDSTATPSSTKPTMENHKKSKAFDKQRRSAKSKSKS 283
Query: 243 FVPHPD 248
+P PD
Sbjct: 284 ELPAPD 301
>BQ141666
Length = 1072
Score = 27.7 bits (60), Expect = 6.5
Identities = 16/68 (23%), Positives = 27/68 (39%)
Frame = +2
Query: 180 QSMKTNSRGQKKSGAGADGTSTDPSASIDCDEYEATPPTTRPKGKKAEKRKAKTTDTASS 239
+ ++T + + SG + + P + P TRP KA RK K +
Sbjct: 203 RDIRTKKKAKATSGGSSHKSPVPPPHQL---------PHTRPDKAKARDRKQKNVGEEQT 355
Query: 240 TLSFVPHP 247
T + +P P
Sbjct: 356 TNALIPFP 379
>TC84505 similar to PIR|T48442|T48442 hypothetical protein T32M21.60 -
Arabidopsis thaliana, partial (16%)
Length = 1130
Score = 27.7 bits (60), Expect = 6.5
Identities = 31/176 (17%), Positives = 64/176 (35%), Gaps = 9/176 (5%)
Frame = +2
Query: 157 DHKKDFTHENVWRMVKDEPKW-KGQSMKTNSRGQKKSGAGADGTSTDPSASIDCDEYEAT 215
+H+ ++ E V+ +W + S + SRG ++ + D + I D E
Sbjct: 185 NHRSEWLGETERERVRIVREWVQMTSQQRGSRGSRRDAQVSQSAPADRTRDIAADHDERQ 364
Query: 216 PPTTRPKGKKAEKRKAKTT-------DTASSTLSFVPHPDVLAMG-KAKMEMMANFREIR 267
P R + R+A + + H V + +++ + R +R
Sbjct: 365 PEHVRRDMLRLRGRQALVDLLVRVERERQRELEGLLEHRAVSDFAHRNRIQSLLRGRFLR 544
Query: 268 NRELDLQQADQQLKQSELQLRQEELKFKKAENFRAYMDILNKNTSGMNDEELRTHN 323
N ++ ++ +QLRQ E FR+ ++ + + S N + N
Sbjct: 545 NETVEDERPPSTAASELVQLRQRHTVSGIREGFRSRLENIVRGQSSTNPDATSNSN 712
>TC80581 similar to GP|19683021|gb|AAL92644.1 hypothetical protein
{Dictyostelium discoideum}, partial (5%)
Length = 850
Score = 27.3 bits (59), Expect = 8.5
Identities = 16/43 (37%), Positives = 26/43 (60%)
Frame = +1
Query: 273 LQQADQQLKQSELQLRQEELKFKKAENFRAYMDILNKNTSGMN 315
LQQ QQL Q +LQL+Q+ +++ + A L +N+S +N
Sbjct: 487 LQQQQQQLLQQQLQLQQQ---YQRLQQQLASSAQLQQNSSTLN 606
>BQ135850
Length = 804
Score = 27.3 bits (59), Expect = 8.5
Identities = 13/30 (43%), Positives = 16/30 (53%)
Frame = -1
Query: 92 IRDQFDEYRDFDTPPRTGKMLKCRFGKLSK 121
IR FD YR+ T T +LKC LS+
Sbjct: 318 IRSLFDRYREISTFILTNNILKCTVKNLSR 229
>TC86789 similar to PIR|A84594|A84594 hypothetical protein At2g20840
[imported] - Arabidopsis thaliana, partial (93%)
Length = 1421
Score = 27.3 bits (59), Expect = 8.5
Identities = 21/75 (28%), Positives = 42/75 (56%)
Frame = +3
Query: 236 TASSTLSFVPHPDVLAMGKAKMEMMANFREIRNRELDLQQADQQLKQSELQLRQEELKFK 295
+A+S LS +PH + + ++ ++++ +E +LQ ++SEL+ R++ELK +
Sbjct: 312 SAASRLSPLPHEPYDRNATVDIPLDSS-KDVKAKEKELQA-----RESELKKREQELKRR 473
Query: 296 KAENFRAYMDILNKN 310
+ RA + I KN
Sbjct: 474 EDAIARAGIVIEEKN 518
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.313 0.129 0.378
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,573,293
Number of Sequences: 36976
Number of extensions: 119538
Number of successful extensions: 609
Number of sequences better than 10.0: 41
Number of HSP's better than 10.0 without gapping: 582
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 607
length of query: 336
length of database: 9,014,727
effective HSP length: 97
effective length of query: 239
effective length of database: 5,428,055
effective search space: 1297305145
effective search space used: 1297305145
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 59 (27.3 bits)
Lotus: description of TM0223.4