KMC011468A_c01

Fasta Sequence
>KMC011468A_C01 KMC011468A_c01
agagaatttcaagcgagcatttatCAACTCAATTTTCATCATAATTACATTACATAAGAC
ATTTTAAGTGATAACAAAAGCTGAAAAAATGGCGTAGATAAATGCATGCTCATATGTGCC
ACAAAAAACTCCAACGAAAAATAAATCAGAAGTTTCAAAAAAATTAGAGAAACGTAAAAA
AAAATATTAAAAAAAAAACACTTTCCATTTTTTTACAACCGGTAAACAATAAGAAAACAG
GGCAATACCGCACGCGGGTCAAAAACAAACAACTCGCCGTGGTCACCACTCACTGAGTCA
AACCCGACTCGCTTATCCAACAGCGAGTCCAGAAACCCGATCGGCTTCGAGACCCGACCC
GCTATGACCCGACACACCAGCATCGCCCTCCTCCCCCTTCCACCGCCAGTACTCTCATGC
GCGTCGCCGCATACAGCAAACGTGCAGATCGCCGACCCGTTACGCTCGGGAAACGACCAC
GAACATCCACCATCAAGGCCGCCGTAGGCACCCAAAAAGTGAAACCGCATCACTTCAT


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC011468A_C01 KMC011468A_c01
         (538 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_192982.1| putative protein; protein id: At4g12450.1 [Arab...   116  1e-25
ref|NP_176441.1| hypothetical protein; protein id: At1g62520.1 [...   110  1e-23
ref|NP_193987.1| putative protein; protein id: At4g22560.1 [Arab...   103  2e-21
emb|CAB53477.1| CAA30374.1 protein [Oryza sativa]                      64  2e-09
ref|NP_567769.1| expressed protein; protein id: At4g27240.1, sup...    63  3e-09

>ref|NP_192982.1| putative protein; protein id: At4g12450.1 [Arabidopsis thaliana]
           gi|7487232|pir||T07637 hypothetical protein T1P17.40 -
           Arabidopsis thaliana gi|4725944|emb|CAB41715.1| putative
           protein [Arabidopsis thaliana]
           gi|7267947|emb|CAB78288.1| putative protein [Arabidopsis
           thaliana]
          Length = 277

 Score =  116 bits (291), Expect = 1e-25
 Identities = 64/108 (59%), Positives = 76/108 (70%), Gaps = 1/108 (0%)
 Frame = -3

Query: 536 EVMRFHFLGAY-GGLDGGCSWSFPERNGSAICTFAVCGDAHESTGGGRGRRAMLVCRVIA 360
           E+MRF  LG   GG++GG +W FP   G+A+CTF+  G+AH STGGG GRRAML+CRVIA
Sbjct: 180 EMMRFFPLGPIPGGINGG-AWGFPGGKGAAVCTFSGSGEAHASTGGGGGRRAMLICRVIA 238

Query: 359 GRVSKPIGFLDSLLDKRVGFDSVSGDHGELFVFDPRAVLPCFLIVYRL 216
           GRV+K   F         G DSV+G  GEL VFD RAVLPCFLI +RL
Sbjct: 239 GRVAKKGEF---------GSDSVAGRAGELIVFDARAVLPCFLIFFRL 277

>ref|NP_176441.1| hypothetical protein; protein id: At1g62520.1 [Arabidopsis
           thaliana] gi|5454194|gb|AAD43609.1|AC005698_8 T3P18.8
           [Arabidopsis thaliana] gi|28393500|gb|AAO42171.1|
           unknown protein [Arabidopsis thaliana]
           gi|28973499|gb|AAO64074.1| unknown protein [Arabidopsis
           thaliana]
          Length = 280

 Score =  110 bits (275), Expect = 1e-23
 Identities = 65/110 (59%), Positives = 76/110 (69%), Gaps = 3/110 (2%)
 Frame = -3

Query: 536 EVMRFHFLG-AYGGLDGGCSWSF--PERNGSAICTFAVCGDAHESTGGGRGRRAMLVCRV 366
           E MRF+ LG +YGG  GG +W     +  G++I TFA    A+E  GGG+GR+AMLVCRV
Sbjct: 174 ETMRFYCLGPSYGG--GGSAWGILGGKGGGASIYTFAGSSTANEKAGGGKGRKAMLVCRV 231

Query: 365 IAGRVSKPIGFLDSLLDKRVGFDSVSGDHGELFVFDPRAVLPCFLIVYRL 216
           IAGRV+K    L    D R  FDSVSGD GEL VFD RAVLPCFLI+YRL
Sbjct: 232 IAGRVTKQ-NELKYDSDLRSRFDSVSGDDGELLVFDTRAVLPCFLIIYRL 280

>ref|NP_193987.1| putative protein; protein id: At4g22560.1 [Arabidopsis thaliana]
           gi|7486606|pir||T05450 hypothetical protein F7K2.140 -
           Arabidopsis thaliana gi|3892711|emb|CAA22161.1| putative
           protein [Arabidopsis thaliana]
           gi|7269102|emb|CAB79211.1| putative protein [Arabidopsis
           thaliana]
          Length = 264

 Score =  103 bits (256), Expect = 2e-21
 Identities = 57/107 (53%), Positives = 71/107 (66%)
 Frame = -3

Query: 536 EVMRFHFLGAYGGLDGGCSWSFPERNGSAICTFAVCGDAHESTGGGRGRRAMLVCRVIAG 357
           E+MRF+      G +GG    F    G A+CTF+  G+A+ S+GGG GR+AM++CRVIAG
Sbjct: 170 EMMRFY--PVLDGFNGGAC-VFAGGKGQAVCTFSGSGEAYVSSGGGGGRKAMMICRVIAG 226

Query: 356 RVSKPIGFLDSLLDKRVGFDSVSGDHGELFVFDPRAVLPCFLIVYRL 216
           RV   IGF         G DSV+G  GELFVFD RAVLPCFLI++RL
Sbjct: 227 RVDDVIGF---------GSDSVAGRDGELFVFDTRAVLPCFLIIFRL 264

>emb|CAB53477.1| CAA30374.1 protein [Oryza sativa]
          Length = 603

 Score = 63.5 bits (153), Expect = 2e-09
 Identities = 41/97 (42%), Positives = 55/97 (56%), Gaps = 13/97 (13%)
 Frame = -3

Query: 470 PERNGSAICTFAVCGDAHE---STGGGRGRRAMLVCRVIAGRVSK---PIGFLDSLLDKR 309
           P   G+ I T A  G AH+   S+G    RRAMLVCRVIAGRV +        +   ++ 
Sbjct: 506 PPAPGAGIRTMATSGRAHDAVVSSGSEGDRRAMLVCRVIAGRVRREEAAAAAAEEEEEEE 565

Query: 308 VGFDSVSG-------DHGELFVFDPRAVLPCFLIVYR 219
             +DSV+G       +  EL VF+P A+LPCF++VYR
Sbjct: 566 EEYDSVAGTTPGLYSNLDELDVFNPTAILPCFVVVYR 602

>ref|NP_567769.1| expressed protein; protein id: At4g27240.1, supported by cDNA:
           2944. [Arabidopsis thaliana] gi|7486806|pir||T05748
           hypothetical protein M4I22.50 - Arabidopsis thaliana
           gi|3269285|emb|CAA19718.1| hypothetical protein
           [Arabidopsis thaliana] gi|7269577|emb|CAB79579.1|
           hypothetical protein [Arabidopsis thaliana]
          Length = 431

 Score = 62.8 bits (151), Expect = 3e-09
 Identities = 35/89 (39%), Positives = 54/89 (60%), Gaps = 10/89 (11%)
 Frame = -3

Query: 461 NGSAICTFAVCGDAHEST----GGGRGRRAMLVCRVIAGRVSKPIGFLDSLLDKRVGFDS 294
           NG  + T +    A ES     GGG  R+A++VCRVIAGRV +P+  ++ +     GFDS
Sbjct: 340 NGIGVFTASTSERAFESIVIGDGGGGDRKALIVCRVIAGRVHRPVENVEEMGGLLSGFDS 399

Query: 293 VSGDHG------ELFVFDPRAVLPCFLIV 225
           ++G  G      EL++ + RA+LPCF+++
Sbjct: 400 LAGKVGLYTNVEELYLLNSRALLPCFVLI 428

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 478,985,912
Number of Sequences: 1393205
Number of extensions: 11481150
Number of successful extensions: 58336
Number of sequences better than 10.0: 116
Number of HSP's better than 10.0 without gapping: 51164
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 57695
length of database: 448,689,247
effective HSP length: 115
effective length of database: 288,470,672
effective search space used: 18173652336
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
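
The bit scores and E-values listed above can be approximately re-derived from the raw scores (the numbers in parentheses after "bits") using the standard Karlin-Altschul relations: bit score S' = (Lambda * S - ln K) / ln 2, and E = N * 2^(-S'), where N is the effective search space. The sketch below is a minimal illustration, assuming the gapped Lambda and K values and the "effective search space used" printed in this report; the function and constant names are hypothetical and not part of BLAST. Because the printed parameters are rounded, and the program may apply further adjustments, the results should be close to the reported values but need not match them exactly.

```python
import math

# Gapped Karlin-Altschul parameters copied from the report above.
LAMBDA = 0.267
K = 0.0410
# "effective search space used" from the report above.
SEARCH_SPACE = 18173652336


def bit_score(raw_score):
    """Convert a raw alignment score to a normalized bit score."""
    return (LAMBDA * raw_score - math.log(K)) / math.log(2)


def expect_value(raw_score, search_space=SEARCH_SPACE):
    """Estimate the E-value for a raw score over the given search space."""
    return search_space * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    # Raw scores of the five hits listed in this report.
    for raw in (291, 275, 256, 153, 151):
        print(f"raw={raw:3d}  bits={bit_score(raw):6.1f}  E={expect_value(raw):.1e}")
```

Running the script prints one line per hit, which can be compared with the Score and Expect lines of each alignment.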


EST assemble image


no.  clone        accession  position (start  end)
1    SPD054g07_f  BP048335   1    363
2    MPD021h05_f  AV771476   25   538




Lotus japonicus
Kazusa DNA Research Institute