KMC002113A_c01

Fasta Sequence
>KMC002113A_C01 KMC002113A_c01
accaatcacttttcattagcaattttaatcatgggattcaaaaatcaagggccGTGATGC
TGCAAGAGAGGGGTTGCTTATAAAAGTGCAGGAGGGGATCCATTTTCAAGAAGCTCCAAC
CTACACCACCTTCCATTTCTGGTTTTCATTTGTTACATGCTCACATCTCTTCCATACATA
CACAACCTAATAAACAAAGCCTAAAGCACACTATGCATAGCATTCTCCCACTTATTGCAA
AGAGAAGCCCTATTTCAGACCTGAACACCAATCCTTTCTGCTACCTCCTTAAAGGAGTGA
ACATCGCTTCCCCAAAGCCAAGCATCACAGCCAGCATCTCTGGCACCCCATATATCATTT
CTACGATCATCGCCAACATGAACAGCATCCTCAGGTTTTACGCCCAATAGTTCACATGCT
TTAAGAAATATAGTTGGATTTGGCTTCTCGGCTGCAACCTCAGCTGAAACAGCAACTGCA
TCAAACCAGTTATCACATTTCAATGCCCTCAGAAGAGGTCTTAACCGAGTATCAAAATTT
GACACAACAGCCAATTTTACACCTGATTTTCTAAGAGCTTTGAAAACTTCTTCTGCATCA
GGATCACAGAGGTGCCATGCCTTGTCTGTCATATAGTAGTTATAAAG
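
Every BLASTX hit in the Nr search section of this record is reported with Frame = -1, meaning the matching protein is read from the reverse complement of the sequence as printed above. The following is a minimal Python sketch of that translation, not the code BLASTX itself runs. The function names and the `TAIL` constant (the last 20 bases of the contig, copied from the sequence above) are illustrative, and the standard genetic code is assumed.

```python
# Standard genetic code in TCAG order. Assumption: the record does not
# say which translation table was used; the standard one is taken here.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA string (case-insensitive)."""
    return dna.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_frame_minus1(dna: str) -> str:
    """Translate frame -1: start at the first base of the reverse complement."""
    rc = reverse_complement(dna)
    codons = (rc[i:i + 3] for i in range(0, len(rc) - 2, 3))
    # Codons containing anything other than A/C/G/T become 'X'.
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Last 20 bases of the contig (illustrative constant, copied from above).
TAIL = "GTCATATAGTAGTTATAAAG"
print(translate_frame_minus1(TAIL))  # LYNYYM
```

The six residues LYNYYM are the start of the query line at position 647 in the alignments, which is consistent with the reported frame.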


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC002113A_C01 KMC002113A_c01
         (647 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_181658.1| hypothetical protein; protein id: At2g41250.1, ...   230  1e-59
dbj|BAC15484.1| contains ESTs AU091627(C0596),D15410(C0596A)~sim...   227  1e-58
pir||C86277 F14L17.7 protein - Arabidopsis thaliana gi|7262672|g...   120  2e-26
gb|AAO42097.1| unknown protein [Arabidopsis thaliana]                 120  2e-26
ref|NP_172883.1| unknown protein; protein id: At1g14310.1 [Arabi...   112  6e-24

>ref|NP_181658.1| hypothetical protein; protein id: At2g41250.1, supported by cDNA:
           gi_17979160 [Arabidopsis thaliana]
           gi|25408736|pir||F84839 hypothetical protein At2g41250
           [imported] - Arabidopsis thaliana
           gi|3894197|gb|AAC78546.1| hypothetical protein
           [Arabidopsis thaliana] gi|17979161|gb|AAL49776.1|
           unknown protein [Arabidopsis thaliana]
           gi|22136692|gb|AAM91665.1| unknown protein [Arabidopsis
           thaliana]
          Length = 290

 Score =  230 bits (587), Expect = 1e-59
 Identities = 104/130 (80%), Positives = 123/130 (94%)
 Frame = -1

Query: 647 LYNYYMTDKAWHLCDPDAEEVFKALRKSGVKLAVVSNFDTRLRPLLRALKCDNWFDAVAV 468
           LY+Y+ T++AW LCDPDA +VFKA++++GVK+A+VSNFDTRLRPLLRAL+C++WFDAVAV
Sbjct: 161 LYSYFTTEQAWKLCDPDAGKVFKAIKEAGVKVAIVSNFDTRLRPLLRALRCEDWFDAVAV 220

Query: 467 SAEVAAEKPNPTIFLKACELLGVKPEDAVHVGDDRRNDIWGARDAGCDAWLWGSDVHSFK 288
           SAEV AEKPNPTIFLKACELL V PEDAVHVGDDRRND+WGARDAGCDAWLWGS+V SFK
Sbjct: 221 SAEVEAEKPNPTIFLKACELLEVNPEDAVHVGDDRRNDVWGARDAGCDAWLWGSEVTSFK 280

Query: 287 EVAERIGVQV 258
           +VA+RIGV+V
Sbjct: 281 QVAQRIGVKV 290

>dbj|BAC15484.1| contains ESTs AU091627(C0596),D15410(C0596A)~similar to Arabidopsis
           thaliana chromosome 2, At2g41250~unknown protein [Oryza
           sativa (japonica cultivar-group)]
           gi|22831190|dbj|BAC16048.1| P0496C02.7 [Oryza sativa
           (japonica cultivar-group)]
          Length = 304

 Score =  227 bits (578), Expect = 1e-58
 Identities = 104/121 (85%), Positives = 113/121 (92%)
 Frame = -1

Query: 647 LYNYYMTDKAWHLCDPDAEEVFKALRKSGVKLAVVSNFDTRLRPLLRALKCDNWFDAVAV 468
           LY YY T KAW LCDPDA+ VF+ALRK+GVK AVVSNFDTRLRPLL+AL CD+WFDAVAV
Sbjct: 153 LYQYYTTAKAWQLCDPDAKYVFEALRKAGVKTAVVSNFDTRLRPLLQALNCDHWFDAVAV 212

Query: 467 SAEVAAEKPNPTIFLKACELLGVKPEDAVHVGDDRRNDIWGARDAGCDAWLWGSDVHSFK 288
           SAEVAAEKPNPTIFLKACE LGVKPE+AVH+GDDRRND+WGARDAGCDAWLWGSDV+SFK
Sbjct: 213 SAEVAAEKPNPTIFLKACEFLGVKPEEAVHIGDDRRNDLWGARDAGCDAWLWGSDVYSFK 272

Query: 287 E 285
           E
Sbjct: 273 E 273

>pir||C86277 F14L17.7 protein - Arabidopsis thaliana
           gi|7262672|gb|AAF43930.1|AC012188_7 Contains similarity
           to a hypothetical protein from Arabidopsis thaliana
           gb|AC005662.2 and contains a haloacid dehalogenase-like
           hydrolase PF|00702 domain.  EST gb|F15167 comes from
           this gene
          Length = 254

 Score =  120 bits (300), Expect = 2e-26
 Identities = 64/126 (50%), Positives = 78/126 (61%)
 Frame = -1

Query: 647 LYNYYMTDKAWHLCDPDAEEVFKALRKSGVKLAVVSNFDTRLRPLLRALKCDNWFDAVAV 468
           +Y YY   +AWHL +  A E    L+ +GVK+AVVSNFDTRLR LL+ L   + FDAV V
Sbjct: 125 VYQYYANGEAWHLPE-GAYETMSLLKDAGVKMAVVSNFDTRLRKLLKDLNVIDMFDAVIV 183

Query: 467 SAEVAAEKPNPTIFLKACELLGVKPEDAVHVGDDRRNDIWGARDAGCDAWLWGSDVHSFK 288
           SAEV  EKP+  IF  A E + V    AVHVGDD   D  GA   G   WLWG DV +F 
Sbjct: 184 SAEVGYEKPDERIFKSALEQISVDVNRAVHVGDDEGADKGGANAIGIACWLWGEDVQTFS 243

Query: 287 EVAERI 270
           ++ +RI
Sbjct: 244 DIQKRI 249

>gb|AAO42097.1| unknown protein [Arabidopsis thaliana]
          Length = 250

 Score =  120 bits (300), Expect = 2e-26
 Identities = 64/126 (50%), Positives = 78/126 (61%)
 Frame = -1

Query: 647 LYNYYMTDKAWHLCDPDAEEVFKALRKSGVKLAVVSNFDTRLRPLLRALKCDNWFDAVAV 468
           +Y YY   +AWHL +  A E    L+ +GVK+AVVSNFDTRLR LL+ L   + FDAV V
Sbjct: 121 VYQYYANGEAWHLPE-GAYETMSLLKDAGVKMAVVSNFDTRLRKLLKDLNVIDMFDAVIV 179

Query: 467 SAEVAAEKPNPTIFLKACELLGVKPEDAVHVGDDRRNDIWGARDAGCDAWLWGSDVHSFK 288
           SAEV  EKP+  IF  A E + V    AVHVGDD   D  GA   G   WLWG DV +F 
Sbjct: 180 SAEVGYEKPDERIFKSALEQISVDVNRAVHVGDDEGADKGGANAIGIACWLWGEDVQTFS 239

Query: 287 EVAERI 270
           ++ +RI
Sbjct: 240 DIQKRI 245

>ref|NP_172883.1| unknown protein; protein id: At1g14310.1 [Arabidopsis thaliana]
          Length = 265

 Score =  112 bits (279), Expect = 6e-24
 Identities = 64/137 (46%), Positives = 78/137 (56%), Gaps = 11/137 (8%)
 Frame = -1

Query: 647 LYNYYMTDKAWHLCDPDAEEVFKALRKSGVKLAVVSNFDTRLRPLLRALKCDNWFDAVAV 468
           +Y YY   +AWHL +  A E    L+ +GVK+AVVSNFDTRLR LL+ L   + FDAV V
Sbjct: 125 VYQYYANGEAWHLPE-GAYETMSLLKDAGVKMAVVSNFDTRLRKLLKDLNVIDMFDAVIV 183

Query: 467 SAEVAAEKPNPTIFLKA-----------CELLGVKPEDAVHVGDDRRNDIWGARDAGCDA 321
           SAEV  EKP+  IF  A            E + V    AVHVGDD   D  GA   G   
Sbjct: 184 SAEVGYEKPDERIFKSALGIGILHSKLNAEQISVDVNRAVHVGDDEGADKGGANAIGIAC 243

Query: 320 WLWGSDVHSFKEVAERI 270
           WLWG DV +F ++ +RI
Sbjct: 244 WLWGEDVQTFSDIQKRI 260

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 565,034,818
Number of Sequences: 1393205
Number of extensions: 12553278
Number of successful extensions: 44014
Number of sequences better than 10.0: 619
Number of HSP's better than 10.0 without gapping: 41853
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 43678
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 27576232529
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
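
The bit scores and E-values in the hit list can be checked against the gapped Lambda and K values and the effective search space printed above. The sketch below uses the standard Karlin-Altschul relations, bit score = (Lambda * raw score - ln K) / ln 2 and E = search space * 2^(-bit score). The function and constant names are illustrative; the numbers are copied from this report.

```python
import math

# Values copied from the statistics block of this report.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
SEARCH_SPACE = 27576232529  # "effective search space used"


def bit_score(raw_score: float) -> float:
    """Convert a raw alignment score to a normalized bit score."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def expect(bits: float) -> float:
    """Expected number of chance hits with at least this bit score."""
    return SEARCH_SPACE * 2.0 ** (-bits)


# Raw scores are the values shown in parentheses after each bit score.
for raw in (587, 578, 300, 279):
    bits = bit_score(raw)
    print(f"raw {raw}: {bits:.1f} bits, E ~ {expect(bits):.1e}")
```

For the raw scores 587, 578, 300 and 279 this gives about 230.7, 227.3, 120.2 and 112.1 bits, with E-values of the same order as those printed. Exact agreement is not expected, because Lambda and K are printed to only three significant figures.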


EST assembly


no.  clone        accession  start  end
  1  GENf037e02   BP059941       1  191
  2  GNf066b02    BP072241      51  194
  3  GNf015c06    BP068441      58  572
  4  MF031b10_f   BP029898      59  545
  5  MR089e09_f   BP082850      78  596
  6  MFB014g10_f  BP034978      83  623
  7  MFB091e03_f  BP040652      84  626
  8  SPD038a07_f  BP046987      86  658
  9  SPD090e09_f  BP051207      89  598
 10  MF083a11_f   BP032660      92  662
 11  MR089h08_f   BP082879     127  523
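
The clone table above can be summarized as a coverage depth per position. The sketch below assumes that the two numbers in each row are inclusive start and end coordinates of the EST within the assembly; the record does not state this explicitly. The `ESTS` list and the function names are illustrative.

```python
# (clone, accession, start, end) rows copied from the table above.
# Assumption: start and end are inclusive assembly coordinates.
ESTS = [
    ("GENf037e02", "BP059941", 1, 191),
    ("GNf066b02", "BP072241", 51, 194),
    ("GNf015c06", "BP068441", 58, 572),
    ("MF031b10_f", "BP029898", 59, 545),
    ("MR089e09_f", "BP082850", 78, 596),
    ("MFB014g10_f", "BP034978", 83, 623),
    ("MFB091e03_f", "BP040652", 84, 626),
    ("SPD038a07_f", "BP046987", 86, 658),
    ("SPD090e09_f", "BP051207", 89, 598),
    ("MF083a11_f", "BP032660", 92, 662),
    ("MR089h08_f", "BP082879", 127, 523),
]


def assembly_span(ests):
    """Return (lowest start, highest end) over all ESTs."""
    return min(e[2] for e in ests), max(e[3] for e in ests)


def depth_at(ests, position):
    """Count the ESTs whose interval contains the given position."""
    return sum(start <= position <= end for _, _, start, end in ests)


print(assembly_span(ESTS))   # (1, 662)
print(depth_at(ESTS, 150))   # 11
print(depth_at(ESTS, 650))   # 2
```

Under that assumption the coordinates run to 662, beyond the 647 letters of the contig sequence, so they presumably refer to the gapped assembly rather than to the consensus.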




Lotus japonicus
Kazusa DNA Research Institute