KMC004371A_c01

Fasta Sequence
>KMC004371A_C01 KMC004371A_c01
attcataatttgATTCAAACTATCTTTCATCACACTACTAGTGCTTAACTTTAGTGCAGA
AAATGTCCAAAGCACTATAAAGTCAACATCATGGGTTATAACAAAAAAATAATGAGATTC
ACAAGTTCATAGCATGAACTTTCTCTCTGTTGGCAAAAAGATACAGTCTCCAATAATGTC
ATACCGCTGTAGGATTAACCACTTCTGTGACAAACTAAATTTCTAATGTTGGAACTGTAC
AAATTATAAATAAATAAGACCAGGTTAAATGAAAAAGCATAGCTTTGGGGATTTTGAGAG
CAAGAAGGAAGAATTTACACAAAAAAGAAATTTATTTTTCATGATTTTCGTCAGTGGGAT
CTATCTTGATTATGAGAGGATGGCTGAGATCCGAAGATGTTTCAGCCACAATCGGTAACT
CCGGCGAGGAAGTTAACCCATCAATATCTTCATTTTCGGCCAAGGCTTTATCCAAAGCAG
ATTTGGCAACTTTAGTCACACATACCATAAGAACCACCGATACTACGAGGCCAAAAACGA
TAAACGCCCAGCGAGTCTTTGAGAACTCACTCCAACCATGTGTCAC
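
The BLASTX hits reported for this contig are all in Frame = -1, meaning the protein is read from the reverse complement of the sequence above, starting at its last base (query position 586). The following is a minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1); the function names are illustrative and are not part of BLAST. It is applied here only to the final line of the FASTA record.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string (case-insensitive)."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def translate_frame_minus1(dna):
    """Translate frame -1: reverse-complement, then read codons from base 1."""
    rc = reverse_complement(dna)
    codons = (rc[i:i + 3] for i in range(0, len(rc) - 2, 3))
    # Codons containing anything other than A, C, G, T are reported as X.
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Last line of the FASTA record (query positions 541-586).
tail = "TAAACGCCCAGCGAGTCTTTGAGAACTCACTCCAACCATGTGTCAC"
print(translate_frame_minus1(tail))  # VTHGWSEFSKTRWAF
```

The printed peptide matches the start of the Query line at position 586 in the alignments (VTHGWSEFSKTRWAF...).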


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC004371A_C01 KMC004371A_c01
         (586 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

dbj|BAC20797.1| contains EST AU173803(R3795)~integral membrane p...   100  2e-20
ref|NP_197408.1| putative protein; protein id: At5g19070.1 [Arab...    91  1e-17
ref|NP_171825.1| hypothetical protein; protein id: At1g03260.1 [...    80  2e-14
gb|AAC38377.1| EscC [Escherichia coli]                                 37  0.18
pir||I80311 sepC protein - Escherichia coli gi|886477|emb|CAA902...    37  0.18

>dbj|BAC20797.1| contains EST AU173803(R3795)~integral membrane protein-like [Oryza
           sativa (japonica cultivar-group)]
          Length = 269

 Score =  100 bits (248), Expect = 2e-20
 Identities = 49/84 (58%), Positives = 65/84 (77%)
 Frame = -1

Query: 586 VTHGWSEFSKTRWAFIVFGLVVSVVLMVCVTKVAKSALDKALAENEDIDGLTSSPELPIV 407
           VTHGWSE S TRW  I+ G ++SVVL+VCVT++AKS+L+KALAEN D       P+LP+V
Sbjct: 191 VTHGWSEISTTRWILIISGFILSVVLIVCVTRIAKSSLEKALAENGD----AGIPQLPVV 246

Query: 406 AETSSDLSHPLIIKIDPTDENHEK 335
           A + SDL  PL+I+ID ++E+HEK
Sbjct: 247 A-SPSDLQQPLVIRIDTSNEDHEK 269

>ref|NP_197408.1| putative protein; protein id: At5g19070.1 [Arabidopsis thaliana]
          Length = 262

 Score = 90.9 bits (224), Expect = 1e-17
 Identities = 49/88 (55%), Positives = 64/88 (72%), Gaps = 6/88 (6%)
 Frame = -1

Query: 586 VTHGWSEFSKTRWAFIVFGLVVSVVLMVCVTKVAKSALDKALAEN--EDIDGLTSSPELP 413
           VTH WSEFS  RWAF++  LV+SV+LMVCVTKVAK AL KALAE+  +  + + + PEL 
Sbjct: 173 VTHKWSEFSPGRWAFLISSLVISVILMVCVTKVAKDALRKALAEHGGDMNEAVAALPELT 232

Query: 412 IVAETSSDLSHPLIIKID---PTDE-NH 341
           +  + S+DL+ PL+IKID   P D+ NH
Sbjct: 233 VTDDASTDLNEPLLIKIDAQQPQDQVNH 260

>ref|NP_171825.1| hypothetical protein; protein id: At1g03260.1 [Arabidopsis
           thaliana] gi|25406732|pir||A86164 protein F15K9.14
           [imported] - Arabidopsis thaliana
           gi|3850582|gb|AAC72122.1| F15K9.14 [Arabidopsis
           thaliana]
          Length = 269

 Score = 80.1 bits (196), Expect = 2e-14
 Identities = 40/82 (48%), Positives = 60/82 (72%), Gaps = 3/82 (3%)
 Frame = -1

Query: 586 VTHGWSEFSKTRWAFIVFGLVVSVVLMVCVTKVAKSALDKALAEN-EDIDGLTS--SPEL 416
           +THGW E S  RW  ++ G+ ++V+L++C+T+VAKS+LDKALAEN  ++DG  +  +  L
Sbjct: 188 ITHGWHEVSVFRWVIMMVGVALAVILIICITRVAKSSLDKALAENGTELDGKKNDDASVL 247

Query: 415 PIVAETSSDLSHPLIIKIDPTD 350
           PI AE   DL  PL+I+IDP++
Sbjct: 248 PI-AEPPPDLQEPLVIRIDPSN 268

>gb|AAC38377.1| EscC [Escherichia coli]
          Length = 512

 Score = 37.0 bits (84), Expect = 0.18
 Identities = 27/95 (28%), Positives = 43/95 (44%), Gaps = 5/95 (5%)
 Frame = -1

Query: 523 VSVVLMVCVTKVAKSALDKALAENEDIDGLTSSPELPIVAETSSDLSHPLIIKIDPTDE- 347
           +   L  C  + A S+L+K L +NE      SSP   I+ + +++ S P+ I     D+ 
Sbjct: 8   IFTALFCCSAQAAPSSLEKRLGKNEYFIITKSSPVRAILNDFAANYSIPVFISSSVNDDF 67

Query: 346 ----NHEK*ISFLCKFFLLALKIPKAMLFHLTWSY 254
                +EK +  L K          + L+HLTW Y
Sbjct: 68  SGEIKNEKPVKVLEKL---------SKLYHLTWYY 93

>pir||I80311 sepC protein - Escherichia coli gi|886477|emb|CAA90274.1| SepC
           [Escherichia coli]
          Length = 512

 Score = 37.0 bits (84), Expect = 0.18
 Identities = 27/95 (28%), Positives = 43/95 (44%), Gaps = 5/95 (5%)
 Frame = -1

Query: 523 VSVVLMVCVTKVAKSALDKALAENEDIDGLTSSPELPIVAETSSDLSHPLIIKIDPTDE- 347
           +   L  C  + A S+L+K L +NE      SSP   I+ + +++ S P+ I     D+ 
Sbjct: 8   IFTALFCCSAQAAPSSLEKRLGKNEYFIITKSSPVRAILNDFAANYSIPVFISSSVNDDF 67

Query: 346 ----NHEK*ISFLCKFFLLALKIPKAMLFHLTWSY 254
                +EK +  L K          + L+HLTW Y
Sbjct: 68  SGEIKNEKPVKVLEKL---------SKLYHLTWYY 93

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 468,168,708
Number of Sequences: 1393205
Number of extensions: 9470173
Number of successful extensions: 21030
Number of sequences better than 10.0: 23
Number of HSP's better than 10.0 without gapping: 20541
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 21019
length of database: 448,689,247
effective HSP length: 117
effective length of database: 285,684,262
effective search space used: 21997688174
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
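
The scores in the hit list can be related to the statistics above using the usual Karlin-Altschul formulas: bit score S' = (lambda * S - ln K) / ln 2 and E = N * 2^(-S'), where S is the raw score in parentheses and N is the effective search space. The sketch below assumes the gapped Lambda and K values and the "effective search space used" figure from this report; because those values are printed to only a few digits, the results agree with the report only approximately.

```python
import math

# Values copied from the statistics block of this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
SEARCH_SPACE = 21_997_688_174  # "effective search space used"


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_from_bits(bits, search_space=SEARCH_SPACE):
    """Expected number of chance hits with at least this bit score."""
    return search_space * 2.0 ** (-bits)


# Raw scores of the reported alignments (the numbers in parentheses).
for raw in (248, 224, 196, 84):
    bits = bit_score(raw)
    print(f"raw {raw:>3}  bits {bits:6.1f}  E {expect_from_bits(bits):.1e}")
```

For the raw scores 248, 224, 196 and 84 this gives bit scores close to the reported 100, 90.9, 80.1 and 37.0, and E-values of the same order as the reported 2e-20, 1e-17, 2e-14 and 0.18.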


EST assembly image


No.  Clone        Accession  Start  End
1    MR031f07_f   BP078403       1  370
2    SPD047a02_f  BP047707      13  571
3    MR075h01_f   BP081802      19  480
4    SPD028f02_f  BP046231      58  589
5    SPDL040b03_f BP054498      59  177
6    MF045f12_f   BP030683      98  553
7    MF029e12_f   BP029801     114  286
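
The listing above can be used to estimate how many ESTs support each part of the contig. The sketch below assumes that the two position values are 1-based, inclusive start and end coordinates on the assembly; the record itself does not state this, and the helper names are illustrative.

```python
# (clone, accession, start, end) copied from the EST listing above.
ESTS = [
    ("MR031f07_f", "BP078403", 1, 370),
    ("SPD047a02_f", "BP047707", 13, 571),
    ("MR075h01_f", "BP081802", 19, 480),
    ("SPD028f02_f", "BP046231", 58, 589),
    ("SPDL040b03_f", "BP054498", 59, 177),
    ("MF045f12_f", "BP030683", 98, 553),
    ("MF029e12_f", "BP029801", 114, 286),
]


def depth_at(position, ests=ESTS):
    """Number of ESTs whose interval covers the given assembly position."""
    return sum(start <= position <= end for _, _, start, end in ests)


def assembly_span(ests=ESTS):
    """Smallest start and largest end over all ESTs."""
    return min(e[2] for e in ests), max(e[3] for e in ests)


print(assembly_span())                     # (1, 589)
print([depth_at(p) for p in (1, 150, 580)])  # [1, 7, 1]
```

Under that assumption, the middle of the contig is covered by all seven ESTs, while both ends rest on a single read.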




Lotus japonicus
Kazusa DNA Research Institute