
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0081b.1
(360 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC79471 similar to GP|21536629|gb|AAM60961.1 seed maturation-lik... 483 e-137
TC77629 similar to GP|6648962|gb|AAF21309.1| seed maturation pro... 182 2e-46
TC89581 similar to PIR|B96661|B96661 unknown protein 83181-8510... 55 3e-08
TC76514 homologue to PIR|T09648|T09648 nucleolin homolog nuM1 - ... 37 0.012
BG582075 similar to PIR|T05242|T052 hypothetical protein F18A5.1... 33 0.17
BQ144197 homologue to GP|1841930|emb| ROX protein {Mus musculus}... 28 0.30
TC78402 weakly similar to PIR|B84853|B84853 hypothetical protein... 31 0.85
BQ144498 similar to GP|6330472|dbj KIAA1205 protein {Homo sapien... 30 1.9
TC81739 similar to SP|Q42569|C901_ARATH Cytochrome P450 90A1 (EC... 30 1.9
AL369307 weakly similar to GP|21105736|gb| nam-like protein 4 {P... 30 1.9
TC81517 similar to GP|16323198|gb|AAL15333.1 At1g15980/T24D18_8 ... 29 2.5
AJ498464 similar to GP|19386754|dbj hypothetical protein {Oryza ... 29 2.5
TC79880 29 2.5
BF647972 similar to GP|22532805|gb| C. elegans PQN-75 protein (c... 29 3.2
CB892868 similar to PIR|D86467|D864 protein F23M19.3 [imported] ... 29 3.2
TC91301 similar to GP|19310721|gb|AAL85091.1 unknown protein {Ar... 29 3.2
TC81020 weakly similar to GP|15912327|gb|AAL08297.1 At1g25320/F4... 29 3.2
BM813189 weakly similar to GP|17104669|gb| unknown protein {Arab... 28 4.2
TC91439 similar to PIR|D96683|D96683 hypothetical protein F12P19... 28 5.5
BQ151301 similar to PIR|S34666|S346 glycine-rich protein - commo... 28 5.5
>TC79471 similar to GP|21536629|gb|AAM60961.1 seed maturation-like protein
{Arabidopsis thaliana}, partial (50%)
Length = 1464
Score = 483 bits (1242), Expect = e-137
Identities = 265/366 (72%), Positives = 297/366 (80%), Gaps = 7/366 (1%)
Frame = +3
Query: 2 AALSWRPFILSRLTDLSPNP-LHPPKPPPLFLRR---RRCFLTSCYADGFSSSSSSSSSD 57
+ +S +PFILSR+T+ +P LH P P FLRR RR FL F++S D
Sbjct: 93 STISSQPFILSRITNNPSHPSLHHPLPSLFFLRRPPHRRPFL-------FATSCHHVDGD 251
Query: 58 DVVSTRKSTFDRGFTVIANMLRRIEPLDNSVISKGVSDAARDSMKQTISTMLGLLPSDHF 117
DVVSTRKSTF+RGFTVI+ MLRRI+PLDNSVISKGVS+A++DSMKQTISTMLGLLPSDHF
Sbjct: 252 DVVSTRKSTFNRGFTVISKMLRRIKPLDNSVISKGVSEASKDSMKQTISTMLGLLPSDHF 431
Query: 118 SVTVTVSKHPLHRLLVSSIITGYTLWNAEYRMSLTRNLDIASPCGARDSDCEKRSEILEV 177
VTV+ PLHRLLVSSIITGYTLWNAEYRMSLTRNL+++ + +DCE E LE+
Sbjct: 432 DVTVSFEIQPLHRLLVSSIITGYTLWNAEYRMSLTRNLEMSH--ADQGADCETPLESLEL 605
Query: 178 KGGGEDGGEIE-VASDLGLKDLENCSSSPR--VFGDLPPQALNYIQQLQSELTNVKEELN 234
KGG E+ GE E V SDLGL + E CSSS VFGDLPPQAL YIQQLQSELTN+KEELN
Sbjct: 606 KGGEEEHGETEKVVSDLGLANSEICSSSTGAGVFGDLPPQALKYIQQLQSELTNMKEELN 785
Query: 235 ARKQEMMQLEYDRGIRNNLLEYLRSLDPEMVTELSRPSSVEVEDIIHQLVQNILSRFFVD 294
A+KQEMMQLE+DRGIRNNLLEYLRS DP+MVTE+SRPSS EVEDIIHQLVQNIL RF VD
Sbjct: 786 AQKQEMMQLEHDRGIRNNLLEYLRSFDPDMVTEMSRPSSEEVEDIIHQLVQNILRRFLVD 965
Query: 295 DASGSFMEQSVEGNIDNHPDNGDEFSDTVGTSRDYLAKLLFWCMLLGHHLRGLENRLQLS 354
+AS +FMEQSVEGNI D+GDEFSD + TSRDYLAKLLFWCMLLGHHLRGLENRL LS
Sbjct: 966 EASSNFMEQSVEGNI----DDGDEFSDKIATSRDYLAKLLFWCMLLGHHLRGLENRLHLS 1133
Query: 355 CVVGLL 360
CVVGLL
Sbjct: 1134CVVGLL 1151
>TC77629 similar to GP|6648962|gb|AAF21309.1| seed maturation protein PM23
{Glycine max}, partial (76%)
Length = 1741
Score = 182 bits (462), Expect = 2e-46
Identities = 135/384 (35%), Positives = 201/384 (52%), Gaps = 32/384 (8%)
Frame = +3
Query: 8 PFILSRLTDLSPN-PLHPPKPPPLF-LRRRRCFLTSCYADGFSSSSSSSSSDDVVSTRKS 65
P ILS + S + PLH +P P F R T + +SSS S+D S + +
Sbjct: 102 PSILSFSSSSSSSLPLHRFRPSPSFSFSPLRSNPTKSPSPFLVLASSSHDSNDFTSKKSA 281
Query: 66 TFDRGFTVIANMLRRIEPLDNSVISKGVSDAARDSMKQTISTMLGLLPSDHFSVTVTVSK 125
++ +++ IEPLD S I K V D+MK+TIS MLGLLPSD F+V V
Sbjct: 282 --------LSELIQEIEPLDVSNIQKDVPPTTADAMKRTISGMLGLLPSDQFNVVVEALW 437
Query: 126 HPLHRLLVSSIITGYTLWNAEYRMSLTRNLDIASPCGARDSDCEK--------------R 171
PL +LL+SS++TGYTL NAEYR+ L + LDI D D EK R
Sbjct: 438 EPLSKLLISSMMTGYTLRNAEYRLCLEKTLDIC------DRDLEKPKAESTKFDLQDFLR 599
Query: 172 SEILEVKGGGEDGGEIEVA---SDLGLKDLENCSSSPRVFGDLPPQALNYIQQLQSELTN 228
+ + G + +V D+ ++DL G + +A YI LQS L++
Sbjct: 600 DSVNVIDFGRNNNLSSKVEKPHEDVNIQDL----------GQISAEAQEYISSLQSRLSS 749
Query: 229 VKEELNARKQEMMQLEYDRGI---RNNLLEYLRSLDPEMVTELSRPSSVEVEDIIHQLVQ 285
+K+EL K++ L+ + + +N+LL+YLRSL PE V +LS +S E++DII +V
Sbjct: 750 IKKELREVKRKSAALQMQQFVGEEKNDLLDYLRSLQPEQVAQLSEFTSPELKDIILSVVH 929
Query: 286 NILSRFF--VDDASGSFMEQSVEGNIDNHPDNGDE--------FSDTVGTSRDYLAKLLF 335
+L+ + + E + G+ + ++ E F + +RDYLA+LLF
Sbjct: 930 GLLATLSPKMHSKPSTTSENATVGSANAGNEDCAEVVENSSLKFQPFISLTRDYLARLLF 1109
Query: 336 WCMLLGHHLRGLENRLQLSCVVGL 359
WCMLLGH+LRGLE R++L+ ++ L
Sbjct: 1110WCMLLGHYLRGLEYRMELAELLSL 1181
Score = 28.1 bits (61), Expect = 5.5
Identities = 15/28 (53%), Positives = 18/28 (63%)
Frame = +1
Query: 5 SWRPFILSRLTDLSPNPLHPPKPPPLFL 32
+W+PF LS SP+P P PPPLFL
Sbjct: 88 TWQPFPLS-----SPSP---PPPPPLFL 147
>TC89581 similar to PIR|B96661|B96661 unknown protein 83181-85105
[imported] - Arabidopsis thaliana, partial (35%)
Length = 716
Score = 55.5 bits (132), Expect = 3e-08
Identities = 25/84 (29%), Positives = 47/84 (55%)
Frame = +3
Query: 73 VIANMLRRIEPLDNSVISKGVSDAARDSMKQTISTMLGLLPSDHFSVTVTVSKHPLHRLL 132
++ ++ ++P + K D+M+QT++ M+G LP FS+T+ L +L+
Sbjct: 339 ILLEYVKTVQPEFMELFVKRAPPPVVDAMRQTVTNMIGTLPPQFFSITIATVAENLAQLM 518
Query: 133 VSSIITGYTLWNAEYRMSLTRNLD 156
S ++TGY NA+YR+ L +L+
Sbjct: 519 YSVMMTGYMFRNAQYRLELQESLE 590
>TC76514 homologue to PIR|T09648|T09648 nucleolin homolog nuM1 - alfalfa,
partial (61%)
Length = 1288
Score = 37.0 bits (84), Expect = 0.012
Identities = 37/145 (25%), Positives = 61/145 (41%), Gaps = 10/145 (6%)
Frame = -1
Query: 41 SCYADGFSSSSSSSSSDDVVSTRKSTFDRGFTVIANMLRRIEPLDNSVISKGVSDAARDS 100
+ +A GFSSSSSSS V S+ ++ F G + LD + ++ G ++ DS
Sbjct: 697 TAFAAGFSSSSSSSEDSSVSSSSEAAFFAG---------ALPFLDGTALAAGFLSSSEDS 545
Query: 101 MKQTISTM----LGLLP------SDHFSVTVTVSKHPLHRLLVSSIITGYTLWNAEYRMS 150
+ + S + G LP D T + S+ L S +++G+ + +
Sbjct: 544 SESSSSEVSTFFAGALPFLDGTALDTGFFTSSSSEEELSESDSSEVVSGWAFFTFPFLAG 365
Query: 151 LTRNLDIASPCGARDSDCEKRSEIL 175
+ A G SD E SE+L
Sbjct: 364 VFFEGAGAFTTGCSSSDSEDSSELL 290
>BG582075 similar to PIR|T05242|T052 hypothetical protein F18A5.120 -
Arabidopsis thaliana, partial (21%)
Length = 784
Score = 33.1 bits (74), Expect = 0.17
Identities = 28/102 (27%), Positives = 51/102 (49%), Gaps = 7/102 (6%)
Frame = +1
Query: 21 PLHPPKPPPLFLRRR----RCFLTSCYADGFSSSSSSSSSDDVVSTRKSTFDRGFTVIAN 76
P PP PPP+ + +T+ SSS+S+SS+DD+ +R + ++A
Sbjct: 145 PSEPPSPPPVTVVEDPPSIHNAITTSPPSSSSSSASTSSTDDMSISRLAQSQS--QLLAE 318
Query: 77 MLRRIEPLD--NSVISKGVSDAARDSMKQTI-STMLGLLPSD 115
+ R++ + + S+G+ D+ ++ TI +LG LP D
Sbjct: 319 LSRKVIDMRELRKIASQGIPDS--PGLRSTIWKLLLGYLPPD 438
>BQ144197 homologue to GP|1841930|emb| ROX protein {Mus musculus}, partial
(4%)
Length = 1170
Score = 28.1 bits (61), Expect(2) = 0.30
Identities = 17/38 (44%), Positives = 20/38 (51%), Gaps = 4/38 (10%)
Frame = -2
Query: 21 PLHPPKPPPLFLRRRR----CFLTSCYADGFSSSSSSS 54
PL PP+PPP RRRR C +C G +S SS
Sbjct: 209 PLSPPRPPP--CRRRRGMSACSREACPLGGVPASVPSS 102
Score = 22.7 bits (47), Expect(2) = 0.30
Identities = 10/20 (50%), Positives = 12/20 (60%)
Frame = -1
Query: 9 FILSRLTDLSPNPLHPPKPP 28
FI S + SP+ L PP PP
Sbjct: 243 FIFSPFSLPSPSTLSPPPPP 184
>TC78402 weakly similar to PIR|B84853|B84853 hypothetical protein At2g42370
[imported] - Arabidopsis thaliana, partial (14%)
Length = 1982
Score = 30.8 bits (68), Expect = 0.85
Identities = 14/25 (56%), Positives = 16/25 (64%), Gaps = 2/25 (8%)
Frame = -1
Query: 18 SPNPL--HPPKPPPLFLRRRRCFLT 40
SP+PL HPP PPP+F R C T
Sbjct: 461 SPHPLLLHPPPPPPVFPRTTPCAAT 387
>BQ144498 similar to GP|6330472|dbj KIAA1205 protein {Homo sapiens}, partial
(3%)
Length = 1284
Score = 29.6 bits (65), Expect = 1.9
Identities = 16/34 (47%), Positives = 18/34 (52%)
Frame = -2
Query: 10 ILSRLTDLSPNPLHPPKPPPLFLRRRRCFLTSCY 43
I R P+P PP PPP L R R FL SC+
Sbjct: 281 IAPRTPTAPPSPT-PPIPPPFSLLRCRFFLFSCF 183
>TC81739 similar to SP|Q42569|C901_ARATH Cytochrome P450 90A1 (EC 1.14.-.-).
[Mouse-ear cress] {Arabidopsis thaliana}, partial (35%)
Length = 656
Score = 29.6 bits (65), Expect = 1.9
Identities = 20/43 (46%), Positives = 27/43 (62%)
Frame = +2
Query: 31 FLRRRRCFLTSCYADGFSSSSSSSSSDDVVSTRKSTFDRGFTV 73
F R +L S + SSSSSSSSS++ V TR S++ RG +V
Sbjct: 89 FFRSIFQWLISSSSSSSSSSSSSSSSENSV-TRDSSYHRGVSV 214
>AL369307 weakly similar to GP|21105736|gb| nam-like protein 4 {Petunia x
hybrida}, partial (7%)
Length = 504
Score = 29.6 bits (65), Expect = 1.9
Identities = 21/81 (25%), Positives = 34/81 (41%)
Frame = +3
Query: 191 SDLGLKDLENCSSSPRVFGDLPPQALNYIQQLQSELTNVKEELNARKQEMMQLEYDRGIR 250
+D L +LE+C +PP YI+ LQSE + E + E+ + I
Sbjct: 102 ADFDLANLESC---------VPPSVAKYIKYLQSETQKLGVEKETMRFELTSAQTMINIL 254
Query: 251 NNLLEYLRSLDPEMVTELSRP 271
+ +E L + EM + P
Sbjct: 255 QSRVENLSKENEEMKMMMRNP 317
>TC81517 similar to GP|16323198|gb|AAL15333.1 At1g15980/T24D18_8
{Arabidopsis thaliana}, partial (26%)
Length = 688
Score = 29.3 bits (64), Expect = 2.5
Identities = 16/50 (32%), Positives = 25/50 (50%)
Frame = -1
Query: 233 LNARKQEMMQLEYDRGIRNNLLEYLRSLDPEMVTELSRPSSVEVEDIIHQ 282
L Q QLE R I + L+Y +LDP +V L + + D++H+
Sbjct: 406 LKELSQPYNQLETSR-IHKDFLDYRAALDPRLVYRLKTTNHIRYLDLLHR 260
>AJ498464 similar to GP|19386754|dbj hypothetical protein {Oryza sativa
(japonica cultivar-group)}, partial (25%)
Length = 659
Score = 29.3 bits (64), Expect = 2.5
Identities = 21/54 (38%), Positives = 28/54 (50%)
Frame = +3
Query: 12 SRLTDLSPNPLHPPKPPPLFLRRRRCFLTSCYADGFSSSSSSSSSDDVVSTRKS 65
S L LSP+P PP P P + L + FSSSSSSSS +S++ +
Sbjct: 54 SPLPTLSPSPSLPPLPTPQQQPQPSSLLAIQF--HFSSSSSSSSFSKYLSSQST 209
>TC79880
Length = 1180
Score = 29.3 bits (64), Expect = 2.5
Identities = 16/35 (45%), Positives = 20/35 (56%)
Frame = +1
Query: 306 EGNIDNHPDNGDEFSDTVGTSRDYLAKLLFWCMLL 340
+ NID +GDE+SDT Y KLL WC L+
Sbjct: 274 KSNID*SLLHGDEYSDTYVYYHGY-KKLLLWCSLM 375
>BF647972 similar to GP|22532805|gb| C. elegans PQN-75 protein (corresponding
sequence W03D2.1b) {Caenorhabditis elegans}, partial
(3%)
Length = 397
Score = 28.9 bits (63), Expect = 3.2
Identities = 11/20 (55%), Positives = 15/20 (75%)
Frame = +2
Query: 10 ILSRLTDLSPNPLHPPKPPP 29
I S +TDL+P+ PP+PPP
Sbjct: 242 ITSTITDLTPSERPPPEPPP 301
>CB892868 similar to PIR|D86467|D864 protein F23M19.3 [imported] -
Arabidopsis thaliana, partial (13%)
Length = 770
Score = 28.9 bits (63), Expect = 3.2
Identities = 25/83 (30%), Positives = 38/83 (45%), Gaps = 15/83 (18%)
Frame = -3
Query: 11 LSRLTDLSPNPLHPPKP-----------PPLFLRRRRCFLTSCYADGFSSSSSSSSSDDV 59
L+ +LSP L P PP+ + F + + F+SSSSSSS++
Sbjct: 342 LAECLELSPGTLSTATPSTVAFHDLEQMPPILIM----FNYTKLCNNFTSSSSSSSNEKE 175
Query: 60 VSTRKSTF----DRGFTVIANML 78
V+ +K+ F DR V A +L
Sbjct: 174 VTAKKNKFFFSIDRNVVVNAQIL 106
>TC91301 similar to GP|19310721|gb|AAL85091.1 unknown protein {Arabidopsis
thaliana}, partial (66%)
Length = 693
Score = 28.9 bits (63), Expect = 3.2
Identities = 14/26 (53%), Positives = 17/26 (64%), Gaps = 1/26 (3%)
Frame = +3
Query: 8 PFILSRLTDL-SPNPLHPPKPPPLFL 32
PF L+ L+ SP+P PP PPPL L
Sbjct: 309 PFFLTLLSPSPSPSPTPPPSPPPLSL 386
>TC81020 weakly similar to GP|15912327|gb|AAL08297.1 At1g25320/F4F7_17
{Arabidopsis thaliana}, partial (36%)
Length = 1024
Score = 28.9 bits (63), Expect = 3.2
Identities = 39/148 (26%), Positives = 60/148 (40%), Gaps = 16/148 (10%)
Frame = +1
Query: 5 SWRPFILSRLTDLSPNPLHPPKPPPLFLRRRRCFLTSCYADGFSSSSSSSSSDDVVSTRK 64
S +PF+L LS PLH P LF FL ++ SSS S S + R
Sbjct: 13 STQPFLLFSTISLSSTPLHCFVSPLLFFFFSFFFL*QNHSSLHFSSSQRSHS---LRRRI 183
Query: 65 STFDRGFTVIANML--RRIEPLDNSVISKG-----VSDAARD---------SMKQTISTM 108
F F + +L + PL NS+ +G + + +D S Q +
Sbjct: 184 KMFPFSFHFLFFLLFCNTLSPLANSLNPEGYVLLTLKQSLKDPQGSMNNWNSSDQNPCSW 363
Query: 109 LGLLPSDHFSVTVTVSKHPLHRLLVSSI 136
G+ D V++++ K L+ L SS+
Sbjct: 364 NGITCKDKTVVSISIPKRKLNGSLPSSL 447
>BM813189 weakly similar to GP|17104669|gb| unknown protein {Arabidopsis
thaliana}, partial (19%)
Length = 454
Score = 28.5 bits (62), Expect = 4.2
Identities = 20/45 (44%), Positives = 24/45 (52%)
Frame = +1
Query: 12 SRLTDLSPNPLHPPKPPPLFLRRRRCFLTSCYADGFSSSSSSSSS 56
S L+ LSP L PKPP R + ++ SSSSSSSSS
Sbjct: 109 SSLSPLSPYHLPSPKPP----RASESISRASFSTTPSSSSSSSSS 231
>TC91439 similar to PIR|D96683|D96683 hypothetical protein F12P19.8
[imported] - Arabidopsis thaliana, partial (28%)
Length = 1154
Score = 28.1 bits (61), Expect = 5.5
Identities = 20/63 (31%), Positives = 32/63 (50%), Gaps = 4/63 (6%)
Frame = +3
Query: 281 HQLVQNILSRFFVDDASGSFMEQSVEGNIDNHPDN----GDEFSDTVGTSRDYLAKLLFW 336
+ ++Q ILS + AS M QS +H DN D+F+ VGT+ +++ + W
Sbjct: 921 NDIMQGILS---LTHASQGLMNQSSYSQALDHNDNYAPYEDDFTFMVGTNYNHINDMSSW 1091
Query: 337 CML 339
ML
Sbjct: 1092DML 1100
>BQ151301 similar to PIR|S34666|S346 glycine-rich protein - common tobacco,
partial (22%)
Length = 530
Score = 28.1 bits (61), Expect = 5.5
Identities = 10/13 (76%), Positives = 10/13 (76%)
Frame = +3
Query: 19 PNPLHPPKPPPLF 31
P P HPP PPPLF
Sbjct: 213 PPPHHPPTPPPLF 251
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.318 0.135 0.390
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 11,533,099
Number of Sequences: 36976
Number of extensions: 181335
Number of successful extensions: 2990
Number of sequences better than 10.0: 54
Number of HSP's better than 10.0 without gapping: 2022
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 2638
length of query: 360
length of database: 9,014,727
effective HSP length: 97
effective length of query: 263
effective length of database: 5,428,055
effective search space: 1427578465
effective search space used: 1427578465
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (27.3 bits)
Lotus: description of TM0081b.1