KMC004821A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC004821A_C01 KMC004821A_c01
AAAGACAAGCAAAGCATTAAACTGTACATTATATAGTACACAAAACGTGTACTAATAATA
ATATACAATATAAACATAATCCAACTTTCTACTGCTAAAACCAAAACTCACTATATACAC
AAGGAGAATTCTCTCAACAACCAGTGCCTTGACACAGCTTAATTATCCTTAAGCTTATAT
AACATGTCAATGTAATCTTCTGTCTCCCACAGGAAAGATCCAAAAGCAACAGCTTCCAAG
ACTAGCCTCTTGAGGCTAGAAAATGAAGTCAAAATCTCATCATCCTTCTCAACTGAACCA
GAACCCTCATGGTTAAAAAGGGCAAAACTATAACTCTCAATCAAATTCACAGCCTCCTTA
GATCTCAACCTGGCACATCTCTGCAATGACCCAGGATGAAAGCTCATAACATAACATTTT
AAGTCTTCAAGTTCCTCTTCTTGCCTGATCAACCCTTTCCCTACGGAAGATGGCATATTG
CTCAAGCGGCCAAAGATGGTATCTTTCAATCCATTACACATATCATTATAAGAGAGGGAA
GTCTTATGACCATGGTTATCCAGAGATAGATTTCGTT


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC004821A_C01 KMC004821A_c01
         (577 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_566302.1| expressed protein; protein id: At3g07310.1, sup...   103  1e-21
ref|NP_199670.1| putative protein; protein id: At5g48590.1, supp...    95  7e-19
ref|NP_566588.1| expressed protein; protein id: At3g17800.1, sup...    83  2e-15
gb|AAL32564.1| Unknown protein [Arabidopsis thaliana] gi|2025985...    83  2e-15
gb|AAK30572.1|AF346660_1 unknown [Brassica napus]                      83  2e-15

>ref|NP_566302.1| expressed protein; protein id: At3g07310.1, supported by cDNA:
           37200., supported by cDNA: gi_14596186 [Arabidopsis
           thaliana] gi|6041833|gb|AAF02142.1|AC009853_2 unknown
           protein [Arabidopsis thaliana]
           gi|6642634|gb|AAF20215.1|AC012395_2 unknown protein
           [Arabidopsis thaliana] gi|14596187|gb|AAK68821.1|
           Unknown protein [Arabidopsis thaliana]
           gi|21593217|gb|AAM65166.1| unknown [Arabidopsis
           thaliana] gi|22136074|gb|AAM91115.1| unknown protein
           [Arabidopsis thaliana]
          Length = 368

 Score =  103 bits (258), Expect = 1e-21
 Identities = 63/131 (48%), Positives = 82/131 (62%), Gaps = 2/131 (1%)
 Frame = -3

Query: 548 HKTSLSYNDMCNG--LKDTIFGRLSNMPSSVGKGLIRQEEELEDLKCYVMSFHPGSLQRC 375
           H+   S +D+     LK  IFG       + G   I  +++L     Y+  F P +LQRC
Sbjct: 238 HQLECSLSDIHGSGYLKSPIFG----CSFTTGTAQISNKQQLRH---YISDFDPETLQRC 290

Query: 374 ARLRSKEAVNLIESYSFALFNHEGSGSVEKDDEILTSFSSLKRLVLEAVAFGSFLWETED 195
           A+ R++EA NLIE  S ALF     G+ E D+ I+TSFSSLKRLVLEAVAFG+FLW+TE 
Sbjct: 291 AKPRTEEARNLIEKQSLALF-----GTEESDETIVTSFSSLKRLVLEAVAFGTFLWDTEL 345

Query: 194 YIDMLYKLKDN 162
           Y+D  YKLK+N
Sbjct: 346 YVDGAYKLKEN 356

>ref|NP_199670.1| putative protein; protein id: At5g48590.1, supported by cDNA: 7891.
           [Arabidopsis thaliana] gi|10177349|dbj|BAB10692.1|
           gb|AAF02142.1~gene_id:K15N18.6~strong similarity to
           unknown protein [Arabidopsis thaliana]
           gi|28392972|gb|AAO41921.1| unknown protein [Arabidopsis
           thaliana] gi|28973189|gb|AAO63919.1| unknown protein
           [Arabidopsis thaliana]
          Length = 344

 Score = 94.7 bits (234), Expect = 7e-19
 Identities = 49/89 (55%), Positives = 64/89 (71%)
 Frame = -3

Query: 428 EDLKCYVMSFHPGSLQRCARLRSKEAVNLIESYSFALFNHEGSGSVEKDDEILTSFSSLK 249
           + L+ Y+  F P  L+RCA+ RS EA +LIE  S ALF  E S      + I+TSFSSLK
Sbjct: 249 KQLRHYISEFDPKILRRCAKPRSHEAKSLIEKQSLALFGPEESSK----ESIVTSFSSLK 304

Query: 248 RLVLEAVAFGSFLWETEDYIDMLYKLKDN 162
           RL+LEAVAFG+FLW+TE+Y+D  +KLK+N
Sbjct: 305 RLLLEAVAFGTFLWDTEEYVDGAFKLKEN 333

>ref|NP_566588.1| expressed protein; protein id: At3g17800.1, supported by cDNA:
           15577., supported by cDNA: gi_17064819, supported by
           cDNA: gi_20259855, supported by cDNA: gi_20466176
           [Arabidopsis thaliana] gi|9294484|dbj|BAB02703.1|
           gb|AAF02142.1~gene_id:MEB5.2~similar to unknown protein
           [Arabidopsis thaliana] gi|20466177|gb|AAM20406.1|
           unknown protein [Arabidopsis thaliana]
          Length = 421

 Score = 83.2 bits (204), Expect = 2e-15
 Identities = 44/89 (49%), Positives = 58/89 (64%), Gaps = 7/89 (7%)
 Frame = -3

Query: 422 LKCYVMSFHPGSLQRCARLRSKEAVNLIESYSFALFNH-------EGSGSVEKDDEILTS 264
           L+ YVMSF   +LQR A +RS+EAV +IE ++ ALF         EG+    KD++I  S
Sbjct: 328 LRSYVMSFDAETLQRYATIRSREAVGIIEKHTEALFGKPEIVITPEGTVDSSKDEQIKIS 387

Query: 263 FSSLKRLVLEAVAFGSFLWETEDYIDMLY 177
           F  +KRLVLEAV FGSFLW+ E ++D  Y
Sbjct: 388 FGGMKRLVLEAVTFGSFLWDVESHVDARY 416

>gb|AAL32564.1| Unknown protein [Arabidopsis thaliana] gi|20259856|gb|AAM13275.1|
           unknown protein [Arabidopsis thaliana]
          Length = 421

 Score = 83.2 bits (204), Expect = 2e-15
 Identities = 44/89 (49%), Positives = 58/89 (64%), Gaps = 7/89 (7%)
 Frame = -3

Query: 422 LKCYVMSFHPGSLQRCARLRSKEAVNLIESYSFALFNH-------EGSGSVEKDDEILTS 264
           L+ YVMSF   +LQR A +RS+EAV +IE ++ ALF         EG+    KD++I  S
Sbjct: 328 LRSYVMSFDAETLQRYATIRSREAVGIIEKHTEALFGKPEIVITPEGTVDSSKDEQIKIS 387

Query: 263 FSSLKRLVLEAVAFGSFLWETEDYIDMLY 177
           F  +KRLVLEAV FGSFLW+ E ++D  Y
Sbjct: 388 FGGMKRLVLEAVTFGSFLWDVESHVDARY 416

>gb|AAK30572.1|AF346660_1 unknown [Brassica napus]
          Length = 256

 Score = 83.2 bits (204), Expect = 2e-15
 Identities = 44/89 (49%), Positives = 58/89 (64%), Gaps = 7/89 (7%)
 Frame = -3

Query: 422 LKCYVMSFHPGSLQRCARLRSKEAVNLIESYSFALFNH-------EGSGSVEKDDEILTS 264
           L+ YVMSF   +LQR A +RS+EAV +IE ++ ALF         EG+    KD++I  S
Sbjct: 163 LRTYVMSFDSETLQRYATIRSREAVGIIEKHTEALFGKPEIVITPEGTVDSSKDEQIKIS 222

Query: 263 FSSLKRLVLEAVAFGSFLWETEDYIDMLY 177
           F  +KRLVLEAV FGSFLW+ E ++D  Y
Sbjct: 223 FGGMKRLVLEAVTFGSFLWDVESHVDARY 251

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 462,913,453
Number of Sequences: 1393205
Number of extensions: 9327410
Number of successful extensions: 22489
Number of sequences better than 10.0: 19
Number of HSP's better than 10.0 without gapping: 21590
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 22466
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 21530810025
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MFB098f03_f BP041148 1 579
2 MWL076d03_f AV769946 1 430
3 SPDL004g03_f BP052248 5 442
4 MRL005e04_f BP083963 14 372




Lotus japonicus
Kazusa DNA Research Institute