KMC009355A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC009355A_C01 KMC009355A_c01
attcattcatttcctctcttgttccttccttccttccatttccgggttcttCTTGTTGGT
AATCCACTGAAGAGAAGAAGAAGAAACGCGAGCAGTTATGGCGTCTCAGGAGCACCTCGA
GAAGATGCAACTCCGCCAGAATTACCGGAATCTCTGGCACACCGATCTCATGAGAACCAT
TCAGGTCGACACTCCATATTGCTGCTTCGCGTTGTGGTGTGCTCCATGTGCATCATACTT
GCTTCGCAAGCGAGCTCTTTACAATGATATGTCAAGATACACATGTTGTGCTGGTTACAT
GCCCTGCAGCGGCAGGTGTGGAGAAAGTAAATGTCCTGAATTTTGCCTTTGCACTGAGGT
TTTCCTCTGCTTTGGAAATTCAGTGGCCTCAACTCGCTTTATGTTGCAAGATGAATTCAA
CATCCAAACTACGCAATGTGATAATTGCATTATTGGTTTCATGTTTTGCCTTCAACAAAT
TGCTTGCATATTCTCAATTGTTGCCTTAATTGTGGGAAGTGATGAAATTCAACAGGCTTC
TCAGTTGCTATCTTGTTTGGCTGACTTT


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC009355A_C01 KMC009355A_c01
         (568 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_176568.1| unknown protein; protein id: At1g63830.1, suppo...   282  2e-75
ref|NP_198955.1| putative protein; protein id: At5g41390.1 [Arab...   279  2e-74
ref|NP_194078.2| expressed protein; protein id: At4g23470.1, sup...   271  3e-72
gb|AAM63607.1| unknown [Arabidopsis thaliana]                         266  1e-70
gb|AAL58123.1|AC092697_11 hypothetical protein [Oryza sativa (ja...   266  1e-70

>ref|NP_176568.1| unknown protein; protein id: At1g63830.1, supported by cDNA:
           gi_19424092 [Arabidopsis thaliana]
           gi|25404446|pir||D96663 unknown protein, 55304-53614
           [imported] - Arabidopsis thaliana
           gi|12325014|gb|AAG52456.1|AC010852_13 unknown protein;
           55304-53614 [Arabidopsis thaliana]
           gi|19424093|gb|AAL87329.1| unknown protein [Arabidopsis
           thaliana] gi|21436183|gb|AAM51379.1| unknown protein
           [Arabidopsis thaliana]
          Length = 232

 Score =  282 bits (722), Expect = 2e-75
 Identities = 125/155 (80%), Positives = 140/155 (89%)
 Frame = +2

Query: 101 ASQEHLEKMQLRQNYRNLWHTDLMRTIQVDTPYCCFALWCAPCASYLLRKRALYNDMSRY 280
           ASQ+ L+KM+LRQ+YRNLWH+DLM T+  DTPYCC +  C PC SY+LR+RALYNDMSRY
Sbjct: 3   ASQDKLDKMKLRQDYRNLWHSDLMGTVTADTPYCCISCLCGPCVSYMLRRRALYNDMSRY 62

Query: 281 TCCAGYMPCSGRCGESKCPEFCLCTEVFLCFGNSVASTRFMLQDEFNIQTTQCDNCIIGF 460
           TCCAGYMPCSGRCGESKCP+ CL TEVFLCFGNSVASTRF+LQDEFNIQTTQCDNCIIGF
Sbjct: 63  TCCAGYMPCSGRCGESKCPQLCLATEVFLCFGNSVASTRFLLQDEFNIQTTQCDNCIIGF 122

Query: 461 MFCLQQIACIFSIVALIVGSDEIQQASQLLSCLAD 565
           MFCL Q+ACIFSIVA IVGSDE+ +ASQ+LSC AD
Sbjct: 123 MFCLSQVACIFSIVACIVGSDELSEASQILSCCAD 157

>ref|NP_198955.1| putative protein; protein id: At5g41390.1 [Arabidopsis thaliana]
           gi|9758048|dbj|BAB08511.1|
           gb|AAF07369.1~gene_id:MYC6.10~strong similarity to
           unknown protein [Arabidopsis thaliana]
          Length = 297

 Score =  279 bits (714), Expect = 2e-74
 Identities = 126/154 (81%), Positives = 138/154 (88%)
 Frame = +2

Query: 104 SQEHLEKMQLRQNYRNLWHTDLMRTIQVDTPYCCFALWCAPCASYLLRKRALYNDMSRYT 283
           SQ+ L+KMQLRQ+YRNLWH+DLM T+  DTPYC F+  C PC SYLLRKRALYNDMSRYT
Sbjct: 4   SQDKLDKMQLRQSYRNLWHSDLMGTVSADTPYCFFSCLCGPCVSYLLRKRALYNDMSRYT 63

Query: 284 CCAGYMPCSGRCGESKCPEFCLCTEVFLCFGNSVASTRFMLQDEFNIQTTQCDNCIIGFM 463
           CC GYMPCSG+CGESKCP+FCL TEV LCFGNSVASTRFMLQDEFNI TT+CDNCIIGFM
Sbjct: 64  CCGGYMPCSGKCGESKCPQFCLATEVCLCFGNSVASTRFMLQDEFNIHTTKCDNCIIGFM 123

Query: 464 FCLQQIACIFSIVALIVGSDEIQQASQLLSCLAD 565
           FCL QIACIFS+VA IVGSDE+ +ASQLLSCLAD
Sbjct: 124 FCLNQIACIFSLVACIVGSDELSEASQLLSCLAD 157

>ref|NP_194078.2| expressed protein; protein id: At4g23470.1, supported by cDNA:
           25694., supported by cDNA: gi_17065515, supported by
           cDNA: gi_20148522 [Arabidopsis thaliana]
           gi|17065516|gb|AAL32912.1| Unknown protein [Arabidopsis
           thaliana] gi|20148523|gb|AAM10152.1| unknown protein
           [Arabidopsis thaliana]
          Length = 255

 Score =  271 bits (694), Expect = 3e-72
 Identities = 118/153 (77%), Positives = 137/153 (89%)
 Frame = +2

Query: 107 QEHLEKMQLRQNYRNLWHTDLMRTIQVDTPYCCFALWCAPCASYLLRKRALYNDMSRYTC 286
           ++ +EKM+LR+N+RN+WHTDL  +IQ DTPYCCFALWCAPCASYLLRKRALY+DMSRY C
Sbjct: 3   KQDMEKMELRKNFRNVWHTDLTHSIQNDTPYCCFALWCAPCASYLLRKRALYDDMSRYVC 62

Query: 287 CAGYMPCSGRCGESKCPEFCLCTEVFLCFGNSVASTRFMLQDEFNIQTTQCDNCIIGFMF 466
           CAGYMPCSGRCGE+KCP+ CL TEVF CF NSVASTRF+LQDEF IQTT+CDNCIIGFM 
Sbjct: 63  CAGYMPCSGRCGEAKCPQLCLATEVFCCFANSVASTRFLLQDEFQIQTTKCDNCIIGFMV 122

Query: 467 CLQQIACIFSIVALIVGSDEIQQASQLLSCLAD 565
           CL Q+ACIFSIVA IVG DE+ +ASQ+L+C +D
Sbjct: 123 CLSQVACIFSIVACIVGMDELSEASQILTCCSD 155

>gb|AAM63607.1| unknown [Arabidopsis thaliana]
          Length = 247

 Score =  266 bits (680), Expect = 1e-70
 Identities = 116/147 (78%), Positives = 132/147 (88%)
 Frame = +2

Query: 125 MQLRQNYRNLWHTDLMRTIQVDTPYCCFALWCAPCASYLLRKRALYNDMSRYTCCAGYMP 304
           M+LR+N+RN+WHTDL  +IQ DTPYCCFALWCAPCASYLLRKRALY+DMSRY CCAGYMP
Sbjct: 1   MELRKNFRNVWHTDLTHSIQNDTPYCCFALWCAPCASYLLRKRALYDDMSRYVCCAGYMP 60

Query: 305 CSGRCGESKCPEFCLCTEVFLCFGNSVASTRFMLQDEFNIQTTQCDNCIIGFMFCLQQIA 484
           CSGRCGE+KCP+ CL TEVF CF NSVASTRF+LQDEF IQTT+CDNCIIGFM CL Q+A
Sbjct: 61  CSGRCGEAKCPQLCLATEVFCCFANSVASTRFLLQDEFQIQTTKCDNCIIGFMVCLSQVA 120

Query: 485 CIFSIVALIVGSDEIQQASQLLSCLAD 565
           CIFSIVA IVG DE+ +ASQ+L+C +D
Sbjct: 121 CIFSIVACIVGMDELSEASQILTCCSD 147

>gb|AAL58123.1|AC092697_11 hypothetical protein [Oryza sativa (japonica cultivar-group)]
          Length = 216

 Score =  266 bits (680), Expect = 1e-70
 Identities = 118/155 (76%), Positives = 133/155 (85%)
 Frame = +2

Query: 101 ASQEHLEKMQLRQNYRNLWHTDLMRTIQVDTPYCCFALWCAPCASYLLRKRALYNDMSRY 280
           ASQ +L+K QLRQ+YRN+WHTDL   I  D   CC +LWC PC SY+LRKRALYNDMSRY
Sbjct: 3   ASQAYLDKAQLRQSYRNVWHTDLTNAITADFTCCCLSLWCGPCVSYMLRKRALYNDMSRY 62

Query: 281 TCCAGYMPCSGRCGESKCPEFCLCTEVFLCFGNSVASTRFMLQDEFNIQTTQCDNCIIGF 460
            CCAGYMPCSGRCGES CPE CL TEVF CFGNSVASTRF+LQDEFNIQTTQCDNCIIGF
Sbjct: 63  VCCAGYMPCSGRCGESNCPEVCLATEVFCCFGNSVASTRFLLQDEFNIQTTQCDNCIIGF 122

Query: 461 MFCLQQIACIFSIVALIVGSDEIQQASQLLSCLAD 565
           MFCLQQ ACI S+VA IVGS+E+ +ASQL+SC+++
Sbjct: 123 MFCLQQFACICSLVACIVGSEELSEASQLISCISN 157

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 507,074,655
Number of Sequences: 1393205
Number of extensions: 11096463
Number of successful extensions: 34515
Number of sequences better than 10.0: 33
Number of HSP's better than 10.0 without gapping: 32869
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 34460
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 20669577624
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MF028d01_f BP029733 1 326
2 GNf091a07 BP074054 30 568




Lotus japonicus
Kazusa DNA Research Institute