KMC017276A_c01

Fasta Sequence
>KMC017276A_C01 KMC017276A_c01
tgtccagacacaTTAAGGTACAGCAATACCAGTTTAGCAATAATTAATTGTTCTAAGCAT
AATGGGTATTCTATGGACATTAATAGACTAACCCAACTTTCAAGTCATACCTGAGTGACA
CATTCAGGTTCAATAATTATACATAACATATGGCCCCAGAACATAAACATAGTAATATTA
CTGTAGTTTATTTGCTTCTACATGAGTGCACATGCAGTTCCAGACGCCAATACTAGTTGC
CCTTAAGAGGCAGCAATACTGAGCTGAAATCCCACCCTTGTTCGTTAACTGGGGTCATCT
CAGCACATAAGTnGTTCTCTTnGTTATCAGCTCCTCTACCTGTGGTGGTGGTATAAGGAT
TTTTGCTACCCAACCTCATTCTTCGCTTCTCAAGAAACCTTTGAAGTGACTGTCTGCGAG
CTATTGGAAATTCTTGCAGCCTGCAAATGGAACTCTTCTCTGCAGGAAAGCTGACTGATG
GTGGAGAAGCCAAACTGTTAGAGGTTCCTTGTGGAGAAGAAGGGCTTGTGGGAATGAGTG
ACATGAAGGGGATCCCACTCTTCATTTCAGCAGACTTGGCAGCAGCAGCAATCAACATTA
TTTCATGCACCTTTTCAGCAGGGATTCCATCATAGACATGCATACTCCCATTATAGTAGA
TGGCGAACTGAGTCCTACCAGGGGTTACTACATTCAGCCCAGATGCTGGCATTGGCCTGT
TGTTGTTAGAGAAATGCTTCACCGACCCTTCATCACTGAGGTTGTTCTGAACCtccatgc
tgtcattccttccattactatcagcaagagattgaggagaagctgcaaaatctctgcctc
gtggttcacacag
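
Every BLASTX alignment in the Nr search section reports Frame = -1, meaning the query was translated from the first reading frame of its reverse complement. The Python sketch below illustrates that step on the last 22 bases of the contig. It assumes the standard genetic code; the helper names (reverse_complement, translate) are illustrative and are not part of any BLAST tool.

```python
# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (N stays N)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def translate(seq):
    """Translate complete codons from position 0; unknown codons become X."""
    seq = seq.upper()
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


# Last 22 bases of KMC017276A_c01 (positions 832-853).
tail = "ctctgcctcgtggttcacacag"
frame_minus_1 = translate(reverse_complement(tail))
print(frame_minus_1)  # LCEPRGR
```

The residues after the leading L (CEPRGR) are the ones that open the Query line at position 850 in the AAM64554.1 alignment.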


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC017276A_C01 KMC017276A_c01
         (853 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_197590.1| putative protein; protein id: At5g20900.1, supp...    92  1e-17
ref|NP_189930.1| putative protein; protein id: At3g43440.1 [Arab...    75  2e-12
dbj|BAC16504.1| contains ESTs C26074(C11585),AU092275(C11585)~un...    68  2e-10
gb|AAM64554.1| unknown [Arabidopsis thaliana]                          57  5e-07
ref|NP_565096.1| expressed protein; protein id: At1g74950.1, sup...    57  5e-07

>ref|NP_197590.1| putative protein; protein id: At5g20900.1, supported by cDNA:
           gi_13430543, supported by cDNA: gi_15293158 [Arabidopsis
           thaliana] gi|13430544|gb|AAK25894.1|AF360184_1 unknown
           protein [Arabidopsis thaliana]
           gi|15293159|gb|AAK93690.1| unknown protein [Arabidopsis
           thaliana]
          Length = 187

 Score = 92.0 bits (227), Expect = 1e-17
 Identities = 56/142 (39%), Positives = 80/142 (55%), Gaps = 1/142 (0%)
 Frame = -1

Query: 748 GSVKHFSNNNRPMPASGLNVVTPGRTQFAIYYNGSMHVYDGIPAEKVHEIMLIAAAAKSA 569
           GSV+   N  R           P   Q  I++ GS+ V+DG+P+EKV EI+ IAA A   
Sbjct: 32  GSVEKSINEVRSTEIQTAEPTVPPN-QLTIFFGGSVTVFDGLPSEKVQEILRIAAKAMET 90

Query: 568 EMKSGIPFMSLIPTSPSSPQGTSNSLASPPSVSFPAEKSSICR-LQEFPIARRQSLQRFL 392
           +  + I  +S    + +    +++++ASP +  FP +  S CR   + PIARR SLQRFL
Sbjct: 91  KNSTSISPVSSPALNRAPSFSSTSNVASPAAQPFPIQPISFCRSTADLPIARRHSLQRFL 150

Query: 391 EKRRMRLGSKNPYTTTTGRGAD 326
           EKRR RL +KNPY T+  +  D
Sbjct: 151 EKRRDRLVNKNPYPTSDFKKTD 172

>ref|NP_189930.1| putative protein; protein id: At3g43440.1 [Arabidopsis thaliana]
           gi|11283485|pir||T47386 hypothetical protein T18D12.10 -
           Arabidopsis thaliana gi|7288022|emb|CAB81784.1| putative
           protein [Arabidopsis thaliana]
          Length = 238

 Score = 74.7 bits (182), Expect = 2e-12
 Identities = 49/117 (41%), Positives = 63/117 (52%)
 Frame = -1

Query: 673 TQFAIYYNGSMHVYDGIPAEKVHEIMLIAAAAKSAEMKSGIPFMSLIPTSPSSPQGTSNS 494
           +Q  I + GS  V+DGIPAEKV EI+ IAAAAK+ E       ++L   +P+  +  S S
Sbjct: 130 SQLTIIFGGSFSVFDGIPAEKVQEILHIAAAAKATET------INLTSINPALKRAISFS 183

Query: 493 LASPPSVSFPAEKSSICRLQEFPIARRQSLQRFLEKRRMRLGSKNPYTTTTGRGADN 323
            AS  +    A         + PIARR+SLQRF EKRR R     PY+ TT     N
Sbjct: 184 NASTVACVSTA---------DVPIARRRSLQRFFEKRRHRFVHTKPYSATTSEADKN 231

 Score = 57.0 bits (136), Expect = 3e-07
 Identities = 38/100 (38%), Positives = 52/100 (52%)
 Frame = -1

Query: 673 TQFAIYYNGSMHVYDGIPAEKVHEIMLIAAAAKSAEMKSGIPFMSLIPTSPSSPQGTSNS 494
           TQ  I + GS  V++G+PA+KV EI+ IA A K  +  +GI        +P+  +  S S
Sbjct: 44  TQLTIIFGGSCRVFNGVPAQKVQEIIRIAFAGKQTKNVTGI--------NPALNRALSFS 95

Query: 493 LASPPSVSFPAEKSSICRLQEFPIARRQSLQRFLEKRRMR 374
             +                 + PIARR+SLQRFLEKRR R
Sbjct: 96  TVA-----------------DLPIARRRSLQRFLEKRRDR 118

>dbj|BAC16504.1| contains ESTs C26074(C11585),AU092275(C11585)~unknown protein
           [Oryza sativa (japonica cultivar-group)]
          Length = 244

 Score = 67.8 bits (164), Expect = 2e-10
 Identities = 35/111 (31%), Positives = 58/111 (51%), Gaps = 1/111 (0%)
 Frame = -1

Query: 682 PGRTQFAIYYNGSMHVYDGIPAEKVHEIMLIAAAAKS-AEMKSGIPFMSLIPTSPSSPQG 506
           P + Q  I+Y G + V++  PA+K   +M +A+     A   +  P  + +  +  +P  
Sbjct: 99  PEKRQLTIFYGGKVLVFNDFPADKAKGLMQLASKGSPVAPQNAAAPAPAAVTDNTKAPMA 158

Query: 505 TSNSLASPPSVSFPAEKSSICRLQEFPIARRQSLQRFLEKRRMRLGSKNPY 353
               ++S P+    A+K +     + PIAR+ SL RFLEKR+ RL +K PY
Sbjct: 159 VPAPVSSLPTAQADAQKPARANASDMPIARKASLHRFLEKRKDRLNAKTPY 209

>gb|AAM64554.1| unknown [Arabidopsis thaliana]
          Length = 249

 Score = 56.6 bits (135), Expect = 5e-07
 Identities = 55/180 (30%), Positives = 78/180 (42%), Gaps = 14/180 (7%)
 Frame = -1

Query: 850 CEPRGRDFAAS-----PQSLADSNGRNDSMEVQNNLSDEGSVKHFSNNNRPMPASGLNVV 686
           CE  G D +A      P+++        S        D   +K  + + +P   S     
Sbjct: 63  CEASGMDSSAGQEDIKPKTMFPRQSSFSSSSSSGTKEDVQMIKETTKSVKPESQSA---- 118

Query: 685 TPGRTQFAIYYNGSMHVYDGIPAEKVHEIMLIA--AAAKS-----AEMKSGIPFMSL--I 533
                   I+Y G + V+D   AEK  E++ +A   +AKS     AE+ +     S   I
Sbjct: 119 -----PLTIFYGGRVMVFDDFSAEKAKEVIDLANKGSAKSFTCFTAEVNNNHSAYSQKEI 173

Query: 532 PTSPSSPQGTSNSLASPPSVSFPAEKSSICRLQEFPIARRQSLQRFLEKRRMRLGSKNPY 353
            +SP+     + + A  P    PA  S  C   E PIARR SL RFLEKR+ R+ SK PY
Sbjct: 174 ASSPNPVCSPAKTAAQEPIQPKPA--SLAC---ELPIARRASLHRFLEKRKDRITSKAPY 228

>ref|NP_565096.1| expressed protein; protein id: At1g74950.1, supported by cDNA:
           3024., supported by cDNA: gi_12744980, supported by
           cDNA: gi_14423443 [Arabidopsis thaliana]
           gi|25406399|pir||C96779 unknown protein F9E10.20
           [imported] - Arabidopsis thaliana
           gi|5882728|gb|AAD55281.1|AC008263_12 ESTs gb|T75898,
           gb|R65457, gb|AA597517 and gb|AA597420 come from this
           gene. [Arabidopsis thaliana]
           gi|12323902|gb|AAG51928.1|AC013258_22 unknown protein;
           53109-54448 [Arabidopsis thaliana]
           gi|12744981|gb|AAK06870.1|AF344319_1 unknown protein
           [Arabidopsis thaliana]
           gi|14423444|gb|AAK62404.1|AF386959_1 Unknown protein
           [Arabidopsis thaliana]
          Length = 249

 Score = 56.6 bits (135), Expect = 5e-07
 Identities = 55/180 (30%), Positives = 78/180 (42%), Gaps = 14/180 (7%)
 Frame = -1

Query: 850 CEPRGRDFAAS-----PQSLADSNGRNDSMEVQNNLSDEGSVKHFSNNNRPMPASGLNVV 686
           CE  G D +A      P+++        S        D   +K  + + +P   S     
Sbjct: 63  CEASGMDSSAGQEDIKPKTMFPRQSSFSSSSSSGTKEDVQMIKETTKSVKPESQSA---- 118

Query: 685 TPGRTQFAIYYNGSMHVYDGIPAEKVHEIMLIA--AAAKS-----AEMKSGIPFMSL--I 533
                   I+Y G + V+D   AEK  E++ +A   +AKS     AE+ +     S   I
Sbjct: 119 -----PLTIFYGGRVMVFDDFSAEKAKEVIDLANKGSAKSFTCFTAEVNNNHSAYSQKEI 173

Query: 532 PTSPSSPQGTSNSLASPPSVSFPAEKSSICRLQEFPIARRQSLQRFLEKRRMRLGSKNPY 353
            +SP+     + + A  P    PA  S  C   E PIARR SL RFLEKR+ R+ SK PY
Sbjct: 174 ASSPNPVCSPAKTAAQEPIQPNPA--SLAC---ELPIARRASLHRFLEKRKDRITSKAPY 228

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 733,621,091
Number of Sequences: 1393205
Number of extensions: 16210234
Number of successful extensions: 48380
Number of sequences better than 10.0: 54
Number of HSP's better than 10.0 without gapping: 43799
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 48079
length of database: 448,689,247
effective HSP length: 122
effective length of database: 278,718,237
effective search space used: 44873636157
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
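
The bit scores in the alignments can be reproduced from the raw scores (shown in parentheses) and the gapped Lambda and K values listed above, using the Karlin-Altschul normalization S' = (Lambda * S - ln K) / ln 2. An approximate E-value then follows as (effective search space) * 2^(-S'). The Python sketch below assumes these standard formulas apply unchanged to this report; the function names are illustrative. Because Lambda and K are printed to only three significant digits, the E-values are reproduced only approximately.

```python
import math

# Values copied from the report's statistics block.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
SEARCH_SPACE = 44873636157  # "effective search space used"


def bit_score(raw, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw - math.log(k)) / math.log(2)


def e_value(raw, space=SEARCH_SPACE):
    """Approximate E-value for a raw score over the given search space."""
    return space * 2.0 ** (-bit_score(raw))


# Raw scores of the three best hits in the report.
for raw in (227, 182, 164):
    print(raw, round(bit_score(raw), 1), f"{e_value(raw):.0e}")
```

For raw scores 227, 182 and 164 this gives bit scores of 92.0, 74.7 and 67.8, matching the report. The computed E-values land within roughly a factor of two of the listed ones.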


EST assemble image


No.  Clone        Accession  Position
1    SPD002d11_f  BP044162   1-554
2    SPD087d09_f  BP050946   423-921




Lotus japonicus
Kazusa DNA Research Institute