KMC002995A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC002995A_C01 KMC002995A_c01
AGAAAAAAATAATGTAATTGCAGCTTTGCATTTTAAGAGGCAAATGCAAGATGGATCCAA
GGTACAAACTAATTAAAAAAAATAACCACAAAATTAACATTCGATATTGATAAATATGAT
GGTCTAAATATAATATATGATGAATCTTCCGAATGGGGTGATGATGGAACTTTCCCATGA
CCAGATCCAAGATCACTCACCTCCGAGCAAATCTGTGCCCTGCTCCCAAAACCAATGCTC
GCCCTACTGCACCACCATGAACACCCAAAACTAGGGCAAAAGTACCACCTACAGCTACCC
ACTTCCAGTCGATGCCATTACCCTTATGTGCGGTATTACTGACGGTCCGCACATTCTCTT
CCTCACAAGATGCAGGCTCTGAGGAATGCATGATTCTATCAGAAGGCATCTTGGTTGTTT
CACTTGCCATCAGTGCACATCTTGACAAAGAAGCGTCTGCTTTCCTAGCATTCTGGTATG
CTCTCATACCAGAGTGCAGTTTCTTGACAGCTCCCCACATCCCATGGCGGACTCCTAGCT
TTGCAACATCTTTA


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC002995A_C01 KMC002995A_c01
         (554 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAL07016.1| unknown protein [Arabidopsis thaliana]                 129  3e-29
ref|NP_566722.1| expressed protein; protein id: At3g23080.1, sup...   129  3e-29
pir||C71407 hypothetical protein - Arabidopsis thaliana gi|22448...   127  1e-28
ref|NP_567433.1| expressed protein; protein id: At4g14500.1, sup...   127  1e-28
dbj|BAC24890.1| OJ1046_F10.28 [Oryza sativa (japonica cultivar-g...   101  5e-21

>gb|AAL07016.1| unknown protein [Arabidopsis thaliana]
          Length = 419

 Score =  129 bits (323), Expect = 3e-29
 Identities = 72/121 (59%), Positives = 85/121 (69%), Gaps = 3/121 (2%)
 Frame = -2

Query: 553 KDVAKLGVRHGMWGAVKKLHSGMRAYQNARKADASLSRCALMASETTKMPSDRIMHSSEP 374
           KDVAKLGVRHGMWGAVKKL+SG+RAYQ+ARK   SLSR A MAS TTK+  D +  S   
Sbjct: 304 KDVAKLGVRHGMWGAVKKLNSGLRAYQSARKPGTSLSRSAQMASITTKLNMDLVETSGAE 363

Query: 373 ASCEEENVRTVSNTAHKGN---GIDWKWVAVGGTFALVLGVHGGAVGRALVLGAGHRFAR 203
              +EE  R V N A K N   G+DWKW+ VGG  AL  G+H  A+G+AL++GAG R AR
Sbjct: 364 ---DEERGRAVEN-ARKQNDQFGVDWKWIVVGGV-ALACGLHSSAIGKALMVGAGQRLAR 418

Query: 202 R 200
           R
Sbjct: 419 R 419

>ref|NP_566722.1| expressed protein; protein id: At3g23080.1, supported by cDNA:
           gi_15810256 [Arabidopsis thaliana]
           gi|9294197|dbj|BAB02099.1| membrane related protein-like
           [Arabidopsis thaliana] gi|23297764|gb|AAN13021.1|
           unknown protein [Arabidopsis thaliana]
          Length = 419

 Score =  129 bits (323), Expect = 3e-29
 Identities = 72/121 (59%), Positives = 85/121 (69%), Gaps = 3/121 (2%)
 Frame = -2

Query: 553 KDVAKLGVRHGMWGAVKKLHSGMRAYQNARKADASLSRCALMASETTKMPSDRIMHSSEP 374
           KDVAKLGVRHGMWGAVKKL+SG+RAYQ+ARK   SLSR A MAS TTK+  D +  S   
Sbjct: 304 KDVAKLGVRHGMWGAVKKLNSGLRAYQSARKPGTSLSRSAQMASITTKLNMDLVETSGAE 363

Query: 373 ASCEEENVRTVSNTAHKGN---GIDWKWVAVGGTFALVLGVHGGAVGRALVLGAGHRFAR 203
              +EE  R V N A K N   G+DWKW+ VGG  AL  G+H  A+G+AL++GAG R AR
Sbjct: 364 ---DEERGRAVEN-ARKQNDQFGVDWKWIVVGGV-ALACGLHSSAIGKALMVGAGQRLAR 418

Query: 202 R 200
           R
Sbjct: 419 R 419

>pir||C71407 hypothetical protein - Arabidopsis thaliana
           gi|2244806|emb|CAB10229.1| hypothetical protein
           [Arabidopsis thaliana] gi|7268156|emb|CAB78492.1|
           hypothetical protein [Arabidopsis thaliana]
          Length = 420

 Score =  127 bits (318), Expect = 1e-28
 Identities = 68/119 (57%), Positives = 86/119 (72%), Gaps = 1/119 (0%)
 Frame = -2

Query: 553 KDVAKLGVRHGMWGAVKKLHSGMRAYQNARKADASLSRCALMASETTKMPSDRIMHSSEP 374
           KDVAKLGVRHGMWGAVKKL+SG+RAYQ+ARK+D+SLSR A MA  TTK+  D    S+E 
Sbjct: 307 KDVAKLGVRHGMWGAVKKLNSGLRAYQSARKSDSSLSRIAQMARITTKLNMD----SAES 362

Query: 373 ASCEEENVRTVSNTAHKGN-GIDWKWVAVGGTFALVLGVHGGAVGRALVLGAGHRFARR 200
           +S +E+  R +     + +  +DWKWV VGG  AL  G+H G +G+AL+ GAG R ARR
Sbjct: 363 SSRDEDRSRAMEYARQRDHLRMDWKWVVVGGV-ALACGLHSGIIGKALLAGAGQRLARR 420

>ref|NP_567433.1| expressed protein; protein id: At4g14500.1, supported by cDNA:
           gi_16226250 [Arabidopsis thaliana]
           gi|16226251|gb|AAL16115.1|AF428283_1 AT4g14500/dl3290w
           [Arabidopsis thaliana] gi|22531042|gb|AAM97025.1|
           expressed protein [Arabidopsis thaliana]
          Length = 433

 Score =  127 bits (318), Expect = 1e-28
 Identities = 68/119 (57%), Positives = 86/119 (72%), Gaps = 1/119 (0%)
 Frame = -2

Query: 553 KDVAKLGVRHGMWGAVKKLHSGMRAYQNARKADASLSRCALMASETTKMPSDRIMHSSEP 374
           KDVAKLGVRHGMWGAVKKL+SG+RAYQ+ARK+D+SLSR A MA  TTK+  D    S+E 
Sbjct: 320 KDVAKLGVRHGMWGAVKKLNSGLRAYQSARKSDSSLSRIAQMARITTKLNMD----SAES 375

Query: 373 ASCEEENVRTVSNTAHKGN-GIDWKWVAVGGTFALVLGVHGGAVGRALVLGAGHRFARR 200
           +S +E+  R +     + +  +DWKWV VGG  AL  G+H G +G+AL+ GAG R ARR
Sbjct: 376 SSRDEDRSRAMEYARQRDHLRMDWKWVVVGGV-ALACGLHSGIIGKALLAGAGQRLARR 433

>dbj|BAC24890.1| OJ1046_F10.28 [Oryza sativa (japonica cultivar-group)]
          Length = 442

 Score =  101 bits (252), Expect = 5e-21
 Identities = 58/121 (47%), Positives = 70/121 (56%), Gaps = 7/121 (5%)
 Frame = -2

Query: 553 KDVAKLGVRHGMWGAVKKLHSGMRAYQNARKADASLSRCALMASETTK-------MPSDR 395
           KDVAK+GVRHGMWGAVKK  SG RAYQ  R  + +LSR A+MA  TTK        P D+
Sbjct: 323 KDVAKVGVRHGMWGAVKKFQSGFRAYQQMRDTENTLSRSAIMARVTTKTSIASSSCPLDQ 382

Query: 394 IMHSSEPASCEEENVRTVSNTAHKGNGIDWKWVAVGGTFALVLGVHGGAVGRALVLGAGH 215
              ++     E EN R V        G DWKWV  GG  A V  ++ G VG+AL++GA  
Sbjct: 383 EPSNAAKTIDESENSRAVQ------PGFDWKWVVFGGAVAAVCVLNTGLVGKALLIGAAS 436

Query: 214 R 212
           R
Sbjct: 437 R 437

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 515,209,904
Number of Sequences: 1393205
Number of extensions: 11980088
Number of successful extensions: 43366
Number of sequences better than 10.0: 42
Number of HSP's better than 10.0 without gapping: 39515
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 43179
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 19521267756
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 GNf099h05 BP074736 1 342
2 MF017h01_f BP029167 1 553
3 MWL047c01_f AV769369 2 556
4 MFBL003h01_f BP041440 11 500
5 GNf011b03 BP068139 36 503
6 GNf032e03 BP069700 71 542
7 MWM164c02_f AV767270 85 374




Lotus japonicus
Kazusa DNA Research Institute