KMC004067A_c01

Fasta Sequence
>KMC004067A_C01 KMC004067A_c01
agcatgcgtgaagacacatgatgatttATATAACTTCTTCTAGGCCACATAAAAGTAAGA
TATAAGGTTTCGGATGTGCACATGATCAGGTTTTCAAAAAAAAATCTCATTCCTTTCTGT
TTTTCTTTTCTTAGTTGATCTCTACTCATCTATAGAAAGCATACATACAGCACGTTAGCA
CTAGTTCTCACAATTCAAAACATCAGATTTTACATAAAAAAACAGAATGTGTTGTATTGT
GCCTACCAATGAAGATTGAAACTAGGCTGACTGCATAAGCGGGTGTGCAAAGTAAGCTCC
TAATGTTAAGAAAGTGATTGTGAGGTATGGTAATCGAATAAACTCCTTGTAGAAATCTTT
AGGTAACCTTTGTCTGCCATCGAGGATTGCTGCAAAAGGGACAATACTTGTTCGTTCCTT
AACTAAATCAAAATCTTCCCCATATCTTTTTGCTAGACGTCTATCTCCATTCCAAGCACC
AAATAGGTGGTGTCCAATTAAGCCAACTGAAGCTGCAACAGCCACAGAGTTTCCAATCCA
AATTGTATGGGCAAGACACCAGATAACCTGACCAACCAACTGTGGATGCCTGGTTATTCT
CATGATGCCAGTTTCCCAGAGATGTAATTTAGGCTTGTCAACTGCTGCAACCTCTAAAAG
ATTGAAGGTTGAAGGATATAAGAA


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC004067A_C01 KMC004067A_c01
         (684 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_563879.1| expressed protein; protein id: At1g10830.1, sup...   258  7e-68
gb|AAM63372.1| unknown [Arabidopsis thaliana]                         256  2e-67
ref|NP_487994.1| hypothetical protein [Nostoc sp. PCC 7120] gi|2...   191  1e-47
gb|ZP_00107264.1| hypothetical protein [Nostoc punctiforme]           184  8e-46
ref|ZP_00073663.1| hypothetical protein [Trichodesmium erythraeu...   182  3e-45

>ref|NP_563879.1| expressed protein; protein id: At1g10830.1, supported by cDNA:
           21943., supported by cDNA: gi_13194773 [Arabidopsis
           thaliana] gi|25402622|pir||A86242 hypothetical protein
           [imported] - Arabidopsis thaliana
           gi|4874265|gb|AAD31330.1|AC007354_3 EST gb|F13926 comes
           from this gene. [Arabidopsis thaliana]
           gi|13194774|gb|AAK15549.1|AF348578_1 unknown protein
           [Arabidopsis thaliana]
          Length = 367

 Score =  258 bits (658), Expect = 7e-68
 Identities = 115/143 (80%), Positives = 134/143 (93%)
 Frame = -1

Query: 684 FLYPSTFNLLEVAAVDKPKLHLWETGIMRITRHPQLVGQVIWCLAHTIWIGNSVAVAASV 505
           FLYPSTFNLLEVAAVDKPK+HLWETGIMRITRHPQ+VGQ++WCLAHT+WIGN+VA +AS+
Sbjct: 222 FLYPSTFNLLEVAAVDKPKMHLWETGIMRITRHPQMVGQIVWCLAHTLWIGNTVAASASL 281

Query: 504 GLIGHHLFGAWNGDRRLAKRYGEDFDLVKERTSIVPFAAILDGRQRLPKDFYKEFIRLPY 325
           GLI HHLFGAWNGDRRLAKRYGEDF+ +K+RTS++PFAAI +GRQ LP+D+YKEF+RLPY
Sbjct: 282 GLIAHHLFGAWNGDRRLAKRYGEDFESIKKRTSVIPFAAIFEGRQVLPEDYYKEFVRLPY 341

Query: 324 LTITFLTLGAYFAHPLMQSA*FQ 256
           L IT LT+GAYFAHPLMQ A F+
Sbjct: 342 LAITALTVGAYFAHPLMQGASFR 364
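
"Frame = -1" in the alignment above means the query was translated from its reverse complement, starting at the last base of the contig (query position 684) and reading backwards. The following is a minimal sketch of that translation, assuming the standard genetic code; the function names are illustrative and are not part of BLAST.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA string (case-insensitive)."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def translate_frame_minus1(dna: str) -> str:
    """Translate frame -1: codons read from the 3' end of the reverse strand."""
    rc = reverse_complement(dna)
    usable = len(rc) - len(rc) % 3  # drop a trailing partial codon, if any
    # Codons containing anything other than A/C/G/T are reported as 'X'.
    return "".join(CODON_TABLE.get(rc[i:i + 3], "X") for i in range(0, usable, 3))


# Last 24 bases of the contig (query positions 661-684).
tail = "ATTGAAGGTTGAAGGATATAAGAA"
print(translate_frame_minus1(tail))
```

Run on the last 24 bases of the contig, this prints FLYPSTFN, the first eight residues of the query line that starts at position 684.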

>gb|AAM63372.1| unknown [Arabidopsis thaliana]
          Length = 367

 Score =  256 bits (654), Expect = 2e-67
 Identities = 114/143 (79%), Positives = 134/143 (92%)
 Frame = -1

Query: 684 FLYPSTFNLLEVAAVDKPKLHLWETGIMRITRHPQLVGQVIWCLAHTIWIGNSVAVAASV 505
           FLYPSTFNLLEVAAVDKPK+HLWETGIMRITRHPQ+VGQ++WCLAHT+WIGN+VA +AS+
Sbjct: 222 FLYPSTFNLLEVAAVDKPKMHLWETGIMRITRHPQMVGQIVWCLAHTLWIGNTVAASASL 281

Query: 504 GLIGHHLFGAWNGDRRLAKRYGEDFDLVKERTSIVPFAAILDGRQRLPKDFYKEFIRLPY 325
           GLI HHLFGAWNGDRRLAKRYG+DF+ +K+RTS++PFAAI +GRQ LP+D+YKEF+RLPY
Sbjct: 282 GLIAHHLFGAWNGDRRLAKRYGKDFESIKKRTSVIPFAAIFEGRQVLPEDYYKEFVRLPY 341

Query: 324 LTITFLTLGAYFAHPLMQSA*FQ 256
           L IT LT+GAYFAHPLMQ A F+
Sbjct: 342 LAITALTVGAYFAHPLMQGASFR 364

>ref|NP_487994.1| hypothetical protein [Nostoc sp. PCC 7120] gi|25359499|pir||AC2300
           hypothetical protein alr3954 [imported] - Nostoc sp.
           (strain PCC 7120) gi|17133088|dbj|BAB75653.1|
           ORF_ID:alr3954~hypothetical protein [Nostoc sp. PCC
           7120]
          Length = 238

 Score =  191 bits (484), Expect = 1e-47
 Identities = 85/137 (62%), Positives = 112/137 (81%)
 Frame = -1

Query: 684 FLYPSTFNLLEVAAVDKPKLHLWETGIMRITRHPQLVGQVIWCLAHTIWIGNSVAVAASV 505
           FLYP+TFNLLE+AA+ KP++HL+ETGI+RITRHPQ+VGQVIWC+AHT+W+G S  +  S 
Sbjct: 96  FLYPATFNLLEIAAIQKPQVHLYETGIIRITRHPQMVGQVIWCIAHTLWLGTSFTLVTSF 155

Query: 504 GLIGHHLFGAWNGDRRLAKRYGEDFDLVKERTSIVPFAAILDGRQRLPKDFYKEFIRLPY 325
           GLI HHLFG W+GDRR++KRYGE F++VK+RTSI+PFAAI+DGRQ +    ++EFIR  Y
Sbjct: 156 GLILHHLFGVWHGDRRMSKRYGEAFEIVKQRTSIIPFAAIIDGRQSIK---WEEFIRPAY 212

Query: 324 LTITFLTLGAYFAHPLM 274
           L +       ++AHPL+
Sbjct: 213 LGVAIFVALLWWAHPLL 229

>gb|ZP_00107264.1| hypothetical protein [Nostoc punctiforme]
          Length = 240

 Score =  184 bits (468), Expect = 8e-46
 Identities = 79/140 (56%), Positives = 111/140 (78%)
 Frame = -1

Query: 684 FLYPSTFNLLEVAAVDKPKLHLWETGIMRITRHPQLVGQVIWCLAHTIWIGNSVAVAASV 505
           FLYP+TFNLLE+AA+ KP++HL+ETGI+RITRHPQ+VGQ+IWC+AHT+W+G +  +  S+
Sbjct: 96  FLYPATFNLLEIAAIQKPQVHLYETGIIRITRHPQMVGQIIWCVAHTLWLGTTFTLVTSI 155

Query: 504 GLIGHHLFGAWNGDRRLAKRYGEDFDLVKERTSIVPFAAILDGRQRLPKDFYKEFIRLPY 325
           GL+ HHLFG W+GDRRL+ RYGE F++ K+RTSI+PF AI+DGRQ +    ++EF+R  Y
Sbjct: 156 GLVLHHLFGVWHGDRRLSDRYGEAFEIAKQRTSIIPFKAIIDGRQSI---LWQEFLRPSY 212

Query: 324 LTITFLTLGAYFAHPLMQSA 265
           L +       +++HPL+  A
Sbjct: 213 LGVAIFIALLWWSHPLLMEA 232

>ref|ZP_00073663.1| hypothetical protein [Trichodesmium erythraeum IMS101]
          Length = 237

 Score =  182 bits (463), Expect = 3e-45
 Identities = 86/137 (62%), Positives = 105/137 (75%)
 Frame = -1

Query: 684 FLYPSTFNLLEVAAVDKPKLHLWETGIMRITRHPQLVGQVIWCLAHTIWIGNSVAVAASV 505
           FLYPSTFNLLE+AA+ KP++HL ETGI+RITRHPQ+VGQVIWC+AHT+W+G S     SV
Sbjct: 95  FLYPSTFNLLEIAAIQKPQVHLHETGIIRITRHPQMVGQVIWCIAHTLWLGTSFTFVTSV 154

Query: 504 GLIGHHLFGAWNGDRRLAKRYGEDFDLVKERTSIVPFAAILDGRQRLPKDFYKEFIRLPY 325
           GLI HHLFG W+GDRRL KR+GE +D +K RTS++PF AI+ GRQ L     +EFIR  Y
Sbjct: 155 GLILHHLFGVWHGDRRLQKRFGESYDQLKSRTSVIPFLAIIQGRQTL---HLQEFIRWAY 211

Query: 324 LTITFLTLGAYFAHPLM 274
           L I    L  + AHPL+
Sbjct: 212 LGIGLFVLLFWQAHPLL 228

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 562,442,439
Number of Sequences: 1393205
Number of extensions: 11895852
Number of successful extensions: 25965
Number of sequences better than 10.0: 25
Number of HSP's better than 10.0 without gapping: 25290
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 25956
length of database: 448,689,247
effective HSP length: 119
effective length of database: 282,897,852
effective search space used: 30552968016
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
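
The bit scores and E-values in the hit list can be approximately reproduced from the raw scores (the numbers in parentheses after "bits") and the gapped Lambda and K printed above, using the usual Karlin-Altschul relations: bit score S' = (Lambda * S - ln K) / ln 2, and E = search space * 2^(-S'). The sketch below is only an illustration. It assumes that the "effective search space used" value applies unchanged to every hit, so the E-values it prints are rough estimates and need not match the report exactly.

```python
import math

# Gapped statistical parameters and search space as printed in the report.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
EFFECTIVE_SEARCH_SPACE = 30552968016


def bit_score(raw_score: int) -> float:
    """Convert a raw alignment score to a normalized bit score."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def expect_value(raw_score: int, search_space: int = EFFECTIVE_SEARCH_SPACE) -> float:
    """Estimate the E-value for a raw score over the given search space."""
    return search_space * 2.0 ** (-bit_score(raw_score))


# Raw scores of the five reported hits.
for raw in (658, 654, 484, 468, 463):
    print(raw, int(bit_score(raw)), f"{expect_value(raw):.1e}")
```

For these five hits the integer part of the computed bit score equals the value in the hit list (258, 256, 191, 184, 182), and the estimated E-values fall in the same order of magnitude as the reported ones.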


EST assemble image


no.  clone        accession  position
1    MF070g10_f   BP032029     1  252
2    MF034e01_f   BP030093    28  538
3    MPD058c10_f  AV773879    94  547
4    MR002e02_f   BP076079    95  218
5    MR072c10_f   BP081531    95  184
6    MFB087d05_f  BP040357    98  693
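
The clone table above can be summarized as per-position EST depth. The sketch below assumes that the two numbers in the position column are 1-based, inclusive start and end coordinates on the assembly. That reading is an assumption, not something the record states, and the helper names are illustrative.

```python
# (clone, accession, start, end) copied from the table; start and end are
# ASSUMED to be 1-based inclusive coordinates on the assembly.
ESTS = [
    ("MF070g10_f", "BP032029", 1, 252),
    ("MF034e01_f", "BP030093", 28, 538),
    ("MPD058c10_f", "AV773879", 94, 547),
    ("MR002e02_f", "BP076079", 95, 218),
    ("MR072c10_f", "BP081531", 95, 184),
    ("MFB087d05_f", "BP040357", 98, 693),
]


def depth_at(position: int) -> int:
    """Number of ESTs whose assumed interval covers the given position."""
    return sum(1 for _, _, start, end in ESTS if start <= position <= end)


def extent() -> tuple:
    """Smallest start and largest end over all ESTs."""
    return (min(e[2] for e in ESTS), max(e[3] for e in ESTS))


print(extent())
print([depth_at(p) for p in (10, 100, 300, 600)])
```

Under that assumption, all six clones overlap around position 100, while only MFB087d05_f covers the region past position 547.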




Lotus japonicus
Kazusa DNA Research Institute