KMC002237A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC002237A_C01 KMC002237A_c01
CAATAAGCTCCCACAAGAGGAGAGGGTAGAGATTCAATTAAGCACACCATTAATTCAGTA
CAAACAATCATCATAACTCTTCTTCCTCAAGTAGTTCATATCATCAGAACCCATTACTAA
GCACAGAGCTAACTCAGGGAACTTAACTTCCACTATCTACTTCCCCCAACTCCTAAACAA
GTCATCACAACAATTTCAACAACATCCATAACAAAACATCAAAGATAGAAATTAAGAAAC
ATAACCATATACACATTATATCATCATTCATATATATACCTGCAGTTAATTATAATTATT
TTACTTAATCTTCAGGTAATTAACGCTTCACTAGTGCGAGGCACCTGCGACAAAAGGACT
ACTAATGGCTATCTGTGGCTGTCGGCGGTGGATACTCTGGCGCTCGCCAAAAAATTAACC
TTCACTAAGTAGAAGCAGTTGCAGCGGTGGTTGTTATCATCTTGGGCTTAGGCCTCTTCC
TCTTCTTCTCATAGCTAACACCTCTCGCCTTCGTCTGGAAGTCACGAACATCCCGGAGGT
AAATCCTCACCGCGCGAGCACCGAACGGGTTCGTCTCCGGCCTACCTCCGTTCTCCTCAT
AGGCGGCGCGTAGACGTCCGATGAGCGCGTCGAGACTTCCCCATGCCTGCCGGAGCGGGC
AGGGGCAGGGAGCAGGCGGGTTCGGAAGCCCGAAGAACGGGCAAGGATGGTTGTGAACCT
TCGTCTTCCCGAACTGGT


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC002237A_C01 KMC002237A_c01
         (738 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_187101.1| unknown protein; protein id: At3g04510.1 [Arabi...   172  6e-42
ref|NP_565716.1| expressed protein; protein id: At2g31160.1, sup...   170  2e-41
ref|NP_198201.1| putative protein; protein id: At5g28490.1 [Arab...   169  4e-41
gb|AAG13584.1|AC037425_15 hypothetical protein [Oryza sativa]         164  9e-40
ref|NP_563780.1| expressed protein; protein id: At1g07090.1, sup...   154  2e-36

>ref|NP_187101.1| unknown protein; protein id: At3g04510.1 [Arabidopsis thaliana]
           gi|7547110|gb|AAF63782.1| unknown protein [Arabidopsis
           thaliana]
          Length = 201

 Score =  172 bits (435), Expect = 6e-42
 Identities = 79/95 (83%), Positives = 86/95 (90%)
 Frame = -3

Query: 736 QFGKTKVHNHPCPFFGLPNPPAPCPCPLRQAWGSLDALIGRLRAAYEENGGRPETNPFGA 557
           QFGKTKVH+  C FFGLPNPPAPCPCPLRQAWGSLDALIGRLRAAYEENGG PET+PFG+
Sbjct: 75  QFGKTKVHHQNCAFFGLPNPPAPCPCPLRQAWGSLDALIGRLRAAYEENGGAPETSPFGS 134

Query: 556 RAVRIYLRDVRDFQTKARGVSYEKKRKRPKPKMIT 452
           R+VRI+LR+VRDFQ K+RGVSYEKKRKR   K IT
Sbjct: 135 RSVRIFLREVRDFQAKSRGVSYEKKRKRVNNKQIT 169

>ref|NP_565716.1| expressed protein; protein id: At2g31160.1, supported by cDNA:
           gi_15912210, supported by cDNA: gi_19547988 [Arabidopsis
           thaliana] gi|25408134|pir||C84717 hypothetical protein
           At2g31160 [imported] - Arabidopsis thaliana
           gi|3746060|gb|AAC63835.1| expressed protein [Arabidopsis
           thaliana] gi|15912211|gb|AAL08239.1| At2g31160/T16B12.3
           [Arabidopsis thaliana] gi|19547989|gb|AAL87358.1|
           At2g31160/T16B12.3 [Arabidopsis thaliana]
          Length = 219

 Score =  170 bits (431), Expect = 2e-41
 Identities = 79/104 (75%), Positives = 92/104 (87%), Gaps = 1/104 (0%)
 Frame = -3

Query: 736 QFGKTKVHNHPCPFFGLPNPPAPCPCPLRQAWGSLDALIGRLRAAYEENGGRPETNPFGA 557
           QFGKTKVH + C F+G PNPPAPCPCPLRQAWGSLDALIGRLRAA+EENGG+PETNPFGA
Sbjct: 96  QFGKTKVHTNICHFYGHPNPPAPCPCPLRQAWGSLDALIGRLRAAFEENGGKPETNPFGA 155

Query: 556 RAVRIYLRDVRDFQTKARGVSYE-KKRKRPKPKMITTTAATAST 428
           RAVR+YLR+VRD Q+KARGVSYE KKRKRP P   T++++  ++
Sbjct: 156 RAVRLYLREVRDMQSKARGVSYEKKKRKRPLPSSSTSSSSAVAS 199

>ref|NP_198201.1| putative protein; protein id: At5g28490.1 [Arabidopsis thaliana]
          Length = 190

 Score =  169 bits (428), Expect = 4e-41
 Identities = 77/88 (87%), Positives = 82/88 (92%)
 Frame = -3

Query: 736 QFGKTKVHNHPCPFFGLPNPPAPCPCPLRQAWGSLDALIGRLRAAYEENGGRPETNPFGA 557
           QFGKTKVH+  C FFGLPNPPAPCPCPLRQAWGSLDALIGRLRAAYEENGG PE NPFG+
Sbjct: 67  QFGKTKVHHQNCAFFGLPNPPAPCPCPLRQAWGSLDALIGRLRAAYEENGGPPEANPFGS 126

Query: 556 RAVRIYLRDVRDFQTKARGVSYEKKRKR 473
           RAVR++LR+VRDFQ KARGVSYEKKRKR
Sbjct: 127 RAVRLFLREVRDFQAKARGVSYEKKRKR 154

>gb|AAG13584.1|AC037425_15 hypothetical protein [Oryza sativa]
          Length = 204

 Score =  164 bits (416), Expect = 9e-40
 Identities = 75/99 (75%), Positives = 84/99 (84%)
 Frame = -3

Query: 736 QFGKTKVHNHPCPFFGLPNPPAPCPCPLRQAWGSLDALIGRLRAAYEENGGRPETNPFGA 557
           QFGKTKVH   CPFFG P PPAPCPCPLRQAWGSLDAL+GRLRAAYEENGGRPE NPFGA
Sbjct: 82  QFGKTKVHAPACPFFGHPAPPAPCPCPLRQAWGSLDALVGRLRAAYEENGGRPENNPFGA 141

Query: 556 RAVRIYLRDVRDFQTKARGVSYEKKRKRPKPKMITTTAA 440
           RAVR+YLR+VR+ Q +ARGVSYEKK+++  P   +  AA
Sbjct: 142 RAVRLYLREVREHQARARGVSYEKKKRKKPPHPSSAAAA 180

>ref|NP_563780.1| expressed protein; protein id: At1g07090.1, supported by cDNA:
           28780. [Arabidopsis thaliana] gi|25341953|pir||G86205
           hypothetical protein [imported] - Arabidopsis thaliana
           gi|8954037|gb|AAF82211.1|AC067971_19 Strong similarity
           to an unknown protein At2g31160 gi|3746060 from
           Arabidopsis thaliana BAC F7F1 gb|AC005311.  EST
           gb|AI998165 comes from this gene
           gi|21555695|gb|AAM63916.1| unknown [Arabidopsis
           thaliana] gi|28392914|gb|AAO41893.1| unknown protein
           [Arabidopsis thaliana]
          Length = 196

 Score =  154 bits (388), Expect = 2e-36
 Identities = 70/103 (67%), Positives = 83/103 (79%)
 Frame = -3

Query: 736 QFGKTKVHNHPCPFFGLPNPPAPCPCPLRQAWGSLDALIGRLRAAYEENGGRPETNPFGA 557
           QFGKTKVH   CP+FG   PP+PC CPL+QAWGSLDALIGRLRAAYEENGGRP++NPF A
Sbjct: 73  QFGKTKVHVAACPYFGHQQPPSPCSCPLKQAWGSLDALIGRLRAAYEENGGRPDSNPFAA 132

Query: 556 RAVRIYLRDVRDFQTKARGVSYEKKRKRPKPKMITTTAATAST 428
           RAVRIYLR+VR+ Q KARG+ YEKK+++  P + T     AS+
Sbjct: 133 RAVRIYLREVRESQAKARGIPYEKKKRKRPPTVTTVRVDVASS 175

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 646,317,714
Number of Sequences: 1393205
Number of extensions: 15080397
Number of successful extensions: 58046
Number of sequences better than 10.0: 83
Number of HSP's better than 10.0 without gapping: 47835
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 56842
length of database: 448,689,247
effective HSP length: 120
effective length of database: 281,504,647
effective search space used: 35188080875
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 GNf012g04 BP068262 1 503
2 GNf004b06 BP067646 1 427
3 GENf047g06 BP060351 1 452
4 GNf011a09 BP068135 26 297
5 GNf053g12 BP071344 261 741




Lotus japonicus
Kazusa DNA Research Institute