
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0103.5
(275 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AL366725 55 4e-08
TC88387 similar to GP|13357253|gb|AAK20050.1 putative zinc finge... 50 9e-07
BG645355 similar to PIR|G96590|G965 hypothetical protein T24C10.... 47 8e-06
TC78961 similar to GP|18252179|gb|AAL61922.1 unknown protein {Ar... 39 0.003
TC81207 similar to GP|21322752|dbj|BAB78536. cold shock protein-... 36 0.018
TC82733 similar to GP|10177404|dbj|BAB10535. gene_id:K24M7.12~pi... 33 0.091
TC87868 similar to PIR|T05112|T05112 splicing factor 9G8-like SR... 33 0.12
TC84935 similar to PIR|G96631|G96631 probable RNA-binding protei... 32 0.20
TC89725 similar to PIR|T05494|T05494 glycine-rich protein T19K4.... 32 0.20
TC83119 similar to GP|13357253|gb|AAK20050.1 putative zinc finge... 31 0.45
TC90680 similar to GP|12836314|dbj|BAB23601. data source:SPTR s... 31 0.59
TC89153 similar to GP|18855061|gb|AAL79753.1 putative RNA helica... 29 1.7
BG450974 similar to PIR|T05112|T05 splicing factor 9G8-like SR p... 29 2.2
AJ388976 similar to PIR|E84638|E84 probable RSZp22 splicing fact... 28 2.9
BQ146262 28 2.9
TC91075 similar to PIR|T48025|T48025 hypothetical protein T12C14... 28 3.8
BE997611 28 5.0
TC76963 GP|3716262|emb|CAA03625.1 unnamed protein product {unide... 28 5.0
TC93533 similar to GP|17065464|gb|AAL32886.1 Unknown protein {Ar... 28 5.0
AW560988 similar to PIR|JE0113|JE01 zinc-finger protein S3574 [i... 27 6.5
>AL366725
Length = 485
Score = 54.7 bits (130), Expect = 4e-08
Identities = 18/47 (38%), Positives = 29/47 (61%)
Frame = +2
Query: 199 GHFANKCSVTGWRCFKCNREGHLAVNCKTKESLCFNCNQPGHFSQDC 245
GH +N C +C +C+++GH+ +CK + +CFNCN+ GH C
Sbjct: 329 GHKSNVCPKEIKKCVRCDKKGHIVADCKRNDIVCFNCNEEGHIGSQC 469
Score = 33.5 bits (75), Expect = 0.091
Identities = 13/30 (43%), Positives = 14/30 (46%)
Frame = +2
Query: 197 KFGHFANKCSVTGWRCFKCNREGHLAVNCK 226
K GH C CF CN EGH+ CK
Sbjct: 383 KKGHIVADCKRNDIVCFNCNEEGHIGSQCK 472
Score = 30.8 bits (68), Expect = 0.59
Identities = 11/34 (32%), Positives = 16/34 (46%)
Frame = +2
Query: 212 CFKCNREGHLAVNCKTKESLCFNCNQPGHFSQDC 245
CF +GH + C + C C++ GH DC
Sbjct: 308 CFNYGEKGHKSNVCPKEIKKCVRCDKKGHIVADC 409
>TC88387 similar to GP|13357253|gb|AAK20050.1 putative zinc finger protein
{Oryza sativa (japonica cultivar-group)}, partial (96%)
Length = 1286
Score = 50.1 bits (118), Expect = 9e-07
Identities = 22/63 (34%), Positives = 34/63 (53%)
Frame = +1
Query: 183 QRGMNRTGAGGPMRKFGHFANKCSVTGWRCFKCNREGHLAVNCKTKESLCFNCNQPGHFS 242
+RG ++ ++ GH+ +C C C+ GH+A C TK SLC+NC +PGH +
Sbjct: 397 RRGFSQDNLCKNCKRPGHYVRECPNVAV-CHNCSLPGHIASECSTK-SLCWNCKEPGHMA 570
Query: 243 QDC 245
C
Sbjct: 571 SSC 579
Score = 45.8 bits (107), Expect = 2e-05
Identities = 22/55 (40%), Positives = 30/55 (54%), Gaps = 6/55 (10%)
Frame = +1
Query: 197 KFGHFANKCSVTGWR------CFKCNREGHLAVNCKTKESLCFNCNQPGHFSQDC 245
K GH A +C+V C C ++GH+AV C T E C NC + GH ++DC
Sbjct: 610 KAGHRARECTVPQKPPGDLRLCNNCYKQGHIAVEC-TNEKACNNCRKTGHLARDC 771
Score = 34.7 bits (78), Expect = 0.041
Identities = 22/73 (30%), Positives = 28/73 (38%), Gaps = 22/73 (30%)
Frame = +1
Query: 196 RKFGHFANKCSVTGWRCFKCNREGHLAVNCKT----------------------KESLCF 233
RK GH A C C CN GH+A C ++ +C
Sbjct: 742 RKTGHLARDCP-NDPICNLCNISGHVARQCPKSNVIGDRGGGGSLRGGYRDGGFRDVVCR 918
Query: 234 NCNQPGHFSQDCM 246
+C Q GH S+DCM
Sbjct: 919 SCQQFGHMSRDCM 957
>BG645355 similar to PIR|G96590|G965 hypothetical protein T24C10.5 [imported]
- Arabidopsis thaliana, partial (5%)
Length = 627
Score = 47.0 bits (110), Expect = 8e-06
Identities = 25/80 (31%), Positives = 39/80 (48%), Gaps = 18/80 (22%)
Frame = -1
Query: 199 GHFANKCSVTGW---RCFKCNREGHLAVNCKTKESL-------------CFNCNQPGHFS 242
G + N S +G +C+KC + GH A NC + + C+ CNQPGH++
Sbjct: 549 GAYVNTVSGSGGASGKCYKCQQPGHWASNCPSMSAANRVSGGSGGASGNCYKCNQPGHWA 370
Query: 243 QDC--MALRGESSGNVGKGK 260
+C M+ +S GN G+
Sbjct: 369 NNCPNMSAAPQSHGNSNTGQ 310
>TC78961 similar to GP|18252179|gb|AAL61922.1 unknown protein {Arabidopsis
thaliana}, partial (71%)
Length = 974
Score = 38.5 bits (88), Expect = 0.003
Identities = 14/37 (37%), Positives = 22/37 (58%), Gaps = 2/37 (5%)
Frame = +2
Query: 211 RCFKCNREGHLAVNCKTKE--SLCFNCNQPGHFSQDC 245
RCF C +GH A +CK + + C+ C + GH ++C
Sbjct: 356 RCFNCGIDGHWARDCKAGDWKNKCYRCGERGHIEKNC 466
Score = 35.8 bits (81), Expect = 0.018
Identities = 18/79 (22%), Positives = 31/79 (38%), Gaps = 2/79 (2%)
Frame = +2
Query: 199 GHFANKCSVTGWR--CFKCNREGHLAVNCKTKESLCFNCNQPGHFSQDCMALRGESSGNV 256
GH+A C W+ C++C GH+ NCK N P S+ ++ +
Sbjct: 380 GHWARDCKAGDWKNKCYRCGERGHIEKNCK---------NSPKKLSRHARSVSRSPGRSR 532
Query: 257 GKGKQLAPEERIHAKEGKS 275
+ +P + G+S
Sbjct: 533 SPARSRSPARSRSPRRGRS 589
Score = 28.1 bits (61), Expect = 3.8
Identities = 9/16 (56%), Positives = 12/16 (74%)
Frame = +2
Query: 232 CFNCNQPGHFSQDCMA 247
CFNC GH+++DC A
Sbjct: 359 CFNCGIDGHWARDCKA 406
>TC81207 similar to GP|21322752|dbj|BAB78536. cold shock protein-1 {Triticum
aestivum}, partial (39%)
Length = 630
Score = 35.8 bits (81), Expect = 0.018
Identities = 26/95 (27%), Positives = 36/95 (37%), Gaps = 17/95 (17%)
Frame = +3
Query: 182 QQRGMNRTGAGGPMRKFGHFANKCSVTGWRCFKCNREGHLAVNCKT-------------- 227
Q R N G GG G + G C+ C GH+A +C
Sbjct: 270 QVRQDNHGGGGGGR---GFRGGERRNGGGGCYTCGDTGHIARDCDRSDRNDRNDRSGGGG 440
Query: 228 ---KESLCFNCNQPGHFSQDCMALRGESSGNVGKG 259
++ C+ C HF++DCM RG + N G G
Sbjct: 441 GGDRDRACYTCGSFEHFARDCM--RGGGNNNNGGG 539
>TC82733 similar to GP|10177404|dbj|BAB10535.
gene_id:K24M7.12~pir||S42136~similar to unknown protein
{Arabidopsis thaliana}, partial (57%)
Length = 710
Score = 33.5 bits (75), Expect = 0.091
Identities = 19/62 (30%), Positives = 26/62 (41%), Gaps = 12/62 (19%)
Frame = +3
Query: 196 RKFGHFANKCSVTGWRC-----FKCNREGHLAVNCK-------TKESLCFNCNQPGHFSQ 243
R+ GH A C G + + C GH NC T + CF C + GH S+
Sbjct: 369 RRRGHRAQNCPDGGSKEDFKY*YNCGDNGHSLANCPHPLQEGGTMFAQCFVCKEQGHLSK 548
Query: 244 DC 245
+C
Sbjct: 549 NC 554
Score = 31.2 bits (69), Expect = 0.45
Identities = 15/39 (38%), Positives = 19/39 (48%), Gaps = 5/39 (12%)
Frame = +3
Query: 212 CFKCNREGHLAVNCK---TKESLC--FNCNQPGHFSQDC 245
C +C R GH A NC +KE +NC GH +C
Sbjct: 357 CLRCRRRGHRAQNCPDGGSKEDFKY*YNCGDNGHSLANC 473
>TC87868 similar to PIR|T05112|T05112 splicing factor 9G8-like SR protein
RSZp22 [validated] - Arabidopsis thaliana, partial (91%)
Length = 860
Score = 33.1 bits (74), Expect = 0.12
Identities = 11/33 (33%), Positives = 19/33 (57%)
Frame = +3
Query: 232 CFNCNQPGHFSQDCMALRGESSGNVGKGKQLAP 264
C+ C +PGHF+++C R G G+ + +P
Sbjct: 336 CYECGEPGHFAREC---RNRGGGGAGRRRSRSP 425
>TC84935 similar to PIR|G96631|G96631 probable RNA-binding protein F8A5.17
[imported] - Arabidopsis thaliana, partial (41%)
Length = 552
Score = 32.3 bits (72), Expect = 0.20
Identities = 14/47 (29%), Positives = 22/47 (46%)
Frame = +2
Query: 213 FKCNREGHLAVNCKTKESLCFNCNQPGHFSQDCMALRGESSGNVGKG 259
F G + + CF C +PGH+++DC G+ G G+G
Sbjct: 416 FSSGGRGSYGAGDRVGQDDCFKCGRPGHWARDCPLAGGD--GGRGRG 550
Score = 26.9 bits (58), Expect = 8.5
Identities = 22/75 (29%), Positives = 30/75 (39%)
Frame = +2
Query: 151 GRSMGYRMVDLNRMAEATREHHQRLMATAMYQQRGMNRTGAGGPMRKFGHFANKCSVTGW 210
GR +G R++ +N+ Q M QR +GG G + V
Sbjct: 326 GREIGDRIISVNKA--------QPRMGGDDADQRYRGGFSSGGR----GSYGAGDRVGQD 469
Query: 211 RCFKCNREGHLAVNC 225
CFKC R GH A +C
Sbjct: 470 DCFKCGRPGHWARDC 514
>TC89725 similar to PIR|T05494|T05494 glycine-rich protein T19K4.150 -
Arabidopsis thaliana, partial (17%)
Length = 378
Score = 32.3 bits (72), Expect = 0.20
Identities = 14/57 (24%), Positives = 22/57 (38%), Gaps = 20/57 (35%)
Frame = +1
Query: 209 GWRCFKCNREGHLAVNCKTKES--------------------LCFNCNQPGHFSQDC 245
G C+ C GH+A C + C++C + GHF++DC
Sbjct: 7 GGGCYNCGESGHMARECTSGGGGGGGRYGGGGGGGGGGGGGGSCYSCGESGHFARDC 177
Score = 30.8 bits (68), Expect = 0.59
Identities = 10/28 (35%), Positives = 16/28 (56%)
Frame = +1
Query: 232 CFNCNQPGHFSQDCMALRGESSGNVGKG 259
C+NC + GH +++C + G G G G
Sbjct: 16 CYNCGESGHMARECTSGGGGGGGRYGGG 99
>TC83119 similar to GP|13357253|gb|AAK20050.1 putative zinc finger protein
{Oryza sativa (japonica cultivar-group)}, partial (16%)
Length = 421
Score = 31.2 bits (69), Expect = 0.45
Identities = 8/19 (42%), Positives = 17/19 (89%)
Frame = +3
Query: 227 TKESLCFNCNQPGHFSQDC 245
++++LC NC +PGH++++C
Sbjct: 333 SRDNLCKNCKRPGHYAREC 389
>TC90680 similar to GP|12836314|dbj|BAB23601. data source:SPTR source
key:Q9H0W3 evidence:ISS~homolog to HYPOTHETICAL 64.6
KDA PROTEIN~putative, partial (5%)
Length = 1280
Score = 30.8 bits (68), Expect = 0.59
Identities = 11/25 (44%), Positives = 17/25 (68%)
Frame = +2
Query: 204 KCSVTGWRCFKCNREGHLAVNCKTK 228
+ S++ + C+KC R GHLA +C K
Sbjct: 953 RSSLSTYECWKCQRPGHLAEDCLVK 1027
Score = 30.0 bits (66), Expect = 1.0
Identities = 11/25 (44%), Positives = 17/25 (68%), Gaps = 4/25 (16%)
Frame = +2
Query: 226 KTKESL----CFNCNQPGHFSQDCM 246
KT+ SL C+ C +PGH ++DC+
Sbjct: 947 KTRSSLSTYECWKCQRPGHLAEDCL 1021
>TC89153 similar to GP|18855061|gb|AAL79753.1 putative RNA helicase {Oryza
sativa}, partial (3%)
Length = 737
Score = 29.3 bits (64), Expect = 1.7
Identities = 9/14 (64%), Positives = 11/14 (78%)
Frame = +3
Query: 232 CFNCNQPGHFSQDC 245
CF+C QPGH + DC
Sbjct: 147 CFSCGQPGHRASDC 188
>BG450974 similar to PIR|T05112|T05 splicing factor 9G8-like SR protein
RSZp22 [validated] - Arabidopsis thaliana, partial (54%)
Length = 364
Score = 28.9 bits (63), Expect = 2.2
Identities = 7/14 (50%), Positives = 12/14 (85%)
Frame = +1
Query: 232 CFNCNQPGHFSQDC 245
C+ C +PGHF+++C
Sbjct: 319 CYECGEPGHFAREC 360
>AJ388976 similar to PIR|E84638|E84 probable RSZp22 splicing factor
[imported] - Arabidopsis thaliana, partial (62%)
Length = 508
Score = 28.5 bits (62), Expect = 2.9
Identities = 9/21 (42%), Positives = 14/21 (65%)
Frame = +2
Query: 232 CFNCNQPGHFSQDCMALRGES 252
C+ C +PGHF++ C + G S
Sbjct: 344 CYXCGEPGHFARXCNSSPGGS 406
>BQ146262
Length = 687
Score = 28.5 bits (62), Expect = 2.9
Identities = 17/51 (33%), Positives = 22/51 (42%), Gaps = 3/51 (5%)
Frame = -3
Query: 217 REGHL---AVNCKTKESLCFNCNQPGHFSQDCMALRGESSGNVGKGKQLAP 264
REG + AV + K S C+NC GH Q C + K +Q P
Sbjct: 511 REGFVFPVAVEYERKPSFCYNCKLLGHSIQLCNRINNIQHHAAPKNQQKKP 359
>TC91075 similar to PIR|T48025|T48025 hypothetical protein T12C14.30 -
Arabidopsis thaliana, partial (29%)
Length = 686
Score = 28.1 bits (61), Expect = 3.8
Identities = 10/28 (35%), Positives = 17/28 (60%)
Frame = +2
Query: 226 KTKESLCFNCNQPGHFSQDCMALRGESS 253
K +E +C C + GHF+Q C + G ++
Sbjct: 371 KEEEMICKLCGESGHFTQGCPSTLGANN 454
>BE997611
Length = 547
Score = 27.7 bits (60), Expect = 5.0
Identities = 17/49 (34%), Positives = 24/49 (48%)
Frame = +2
Query: 106 SVLGCVTLQRVKIDCLQSDFRSNCLRYYSIGLSGLSSATVWLVLIGRSM 154
SV+G TL KI C +F Y+ + GLSSAT ++ S+
Sbjct: 368 SVIGMFTLYPFKIWCRALNFGGTSFSSYASSV*GLSSATSTALICSSSI 514
>TC76963 GP|3716262|emb|CAA03625.1 unnamed protein product {unidentified},
complete
Length = 1468
Score = 27.7 bits (60), Expect = 5.0
Identities = 14/32 (43%), Positives = 18/32 (55%)
Frame = +3
Query: 96 VFWESLGNRSSVLGCVTLQRVKIDCLQSDFRS 127
+ WES G RSS +GC +R C +S F S
Sbjct: 642 LLWESSG-RSSSMGCSKRERCGFGCSESSFGS 734
>TC93533 similar to GP|17065464|gb|AAL32886.1 Unknown protein {Arabidopsis
thaliana}, partial (38%)
Length = 660
Score = 27.7 bits (60), Expect = 5.0
Identities = 10/27 (37%), Positives = 14/27 (51%)
Frame = +3
Query: 219 GHLAVNCKTKESLCFNCNQPGHFSQDC 245
G+ V+C S C+NC + H DC
Sbjct: 249 GNYDVSCLCSYSFCWNCTEEAHRPVDC 329
>AW560988 similar to PIR|JE0113|JE01 zinc-finger protein S3574 [imported] -
rice, partial (20%)
Length = 756
Score = 27.3 bits (59), Expect = 6.5
Identities = 17/50 (34%), Positives = 23/50 (46%)
Frame = -2
Query: 193 GPMRKFGHFANKCSVTGWRCFKCNREGHLAVNCKTKESLCFNCNQPGHFS 242
GP R FGH + F C R H+++ C+T S F C H+S
Sbjct: 446 GPSRAFGH----------QPFPCER--HISLFCQTCVSFSFPCIA*QHYS 333
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.324 0.136 0.427
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,751,525
Number of Sequences: 36976
Number of extensions: 144637
Number of successful extensions: 1005
Number of sequences better than 10.0: 42
Number of HSP's better than 10.0 without gapping: 958
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 996
length of query: 275
length of database: 9,014,727
effective HSP length: 95
effective length of query: 180
effective length of database: 5,502,007
effective search space: 990361260
effective search space used: 990361260
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 57 (26.6 bits)
Lotus: description of TM0103.5