KMC004436A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC004436A_C01 KMC004436A_c01
accaaaaaaaaacTGACAAAAGTAATCAATTTTCTAAAATAAACCAATTACATTTCGCAC
ATAGTCTGACAAAAATGTGCTGATTTATACAATTTACTATTCTTAACAATAAATTAGTCT
GGTATTAAGTACTGCCATATGAAATGGATGAAAGATACATACATGGGACTAATATGAACA
ACGTATCCTGAAGAACTCAGCTGAAGGTTCTAGGATTTATTACACTGAAATCAGGAATTA
ACAACAATCAGAGGCTACCTTGCAAATCTTTGAAACTTATTCTGGGGAAGTAAGACGAAG
ATGCTTTACATGTTTTGCAAGGGTAGAATCCAACTTGTCTCCTATGAATGTAATCCATGC
CTGATCAGAAGTAGTAAAACCTCGTATTGTGTCACCACCAATAATTGCAATTCCTTTATC
TCCAAGTGGTTGTAAAATTACTGCCTGAGTATTTGAAGGTAGAAATGACAGCTCAGATTT
CCCAGGATAAAGAGATAGATTAGCCAAGTAACTCTGGGCTCCAGATTTCATTACACCCTG
GTATACTGATCCTTGGATCAATTTATTAATGTCCACACTTA


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC004436A_C01 KMC004436A_c01
         (581 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_176193.1| hypothetical protein; protein id: At1g59840.1 [...   136  2e-31
ref|ZP_00071380.1| hypothetical protein [Trichodesmium erythraeu...    78  7e-14
ref|NP_441729.1| unknown protein [Synechocystis sp. PCC 6803] gi...    78  9e-14
gb|ZP_00109579.1| hypothetical protein [Nostoc punctiforme]            74  2e-12
ref|NP_487861.1| hypothetical protein [Nostoc sp. PCC 7120] gi|2...    73  3e-12

>ref|NP_176193.1| hypothetical protein; protein id: At1g59840.1 [Arabidopsis
           thaliana] gi|25404238|pir||E96622 hypothetical protein
           F23H11.16 [imported] - Arabidopsis thaliana
           gi|5080827|gb|AAD39336.1|AC007258_25 Hypothetical
           protein [Arabidopsis thaliana]
          Length = 305

 Score =  136 bits (343), Expect = 2e-31
 Identities = 69/97 (71%), Positives = 83/97 (85%), Gaps = 8/97 (8%)
 Frame = -3

Query: 576 VDINKLIQGSVYQGVMKSGA--------QSYLANLSLYPGKSELSFLPSNTQAVILQPLG 421
           V  +KL+QGSVY+GVMKS A        +SYLANLSLYPG+SEL FLP+NTQAVILQPLG
Sbjct: 197 VKTDKLMQGSVYRGVMKSKARKNNDCLSESYLANLSLYPGRSELPFLPANTQAVILQPLG 256

Query: 420 DKGIAIIGGDTIRGFTTSDQAWITFIGDKLDSTLAKH 310
           DKGIA+IGG+TIRGFT+SDQAWI+ IG+KLD+TL ++
Sbjct: 257 DKGIAVIGGNTIRGFTSSDQAWISSIGEKLDATLGRY 293

>ref|ZP_00071380.1| hypothetical protein [Trichodesmium erythraeum IMS101]
          Length = 214

 Score = 78.2 bits (191), Expect = 7e-14
 Identities = 35/80 (43%), Positives = 52/80 (64%)
 Frame = -3

Query: 552 GSVYQGVMKSGAQSYLANLSLYPGKSELSFLPSNTQAVILQPLGDKGIAIIGGDTIRGFT 373
           G++ + V+      YL +L +YPG+ E ++LP NTQ VI QPLG+ G+ I+G +  R +T
Sbjct: 133 GTILKQVLGKQKPIYLVDLKVYPGRIEFNYLPENTQGVICQPLGNNGVMILGANAPRSYT 192

Query: 372 TSDQAWITFIGDKLDSTLAK 313
             D+ WI+ I DKL  TL+K
Sbjct: 193 KQDENWISGIADKLTDTLSK 212

>ref|NP_441729.1| unknown protein [Synechocystis sp. PCC 6803] gi|7469449|pir||S76150
           hypothetical protein - Synechocystis sp. (strain PCC
           6803) gi|1653496|dbj|BAA18409.1| ORF_ID:slr0948~unknown
           protein [Synechocystis sp. PCC 6803]
          Length = 215

 Score = 77.8 bits (190), Expect = 9e-14
 Identities = 34/79 (43%), Positives = 52/79 (65%)
 Frame = -3

Query: 552 GSVYQGVMKSGAQSYLANLSLYPGKSELSFLPSNTQAVILQPLGDKGIAIIGGDTIRGFT 373
           G + + VM++    YL NL LYPG+ E  +LP+NTQ +I QPL D+G+ I+G +  R +T
Sbjct: 132 GPIVERVMQTQKPVYLVNLPLYPGRVEFDYLPANTQGLICQPLDDRGVLILGANIPRSYT 191

Query: 372 TSDQAWITFIGDKLDSTLA 316
             D+ W+T I DK+  +L+
Sbjct: 192 KQDENWVTGIADKIAHSLS 210

>gb|ZP_00109579.1| hypothetical protein [Nostoc punctiforme]
          Length = 212

 Score = 73.6 bits (179), Expect = 2e-12
 Identities = 33/84 (39%), Positives = 52/84 (61%)
 Frame = -3

Query: 567 NKLIQGSVYQGVMKSGAQSYLANLSLYPGKSELSFLPSNTQAVILQPLGDKGIAIIGGDT 388
           ++++ G + + V++     YL  L +YPG+ E  +LP NTQ VI QP+G++G  I+G + 
Sbjct: 128 SEVVPGVILKQVLEKQKPIYLVALKIYPGRLEFDYLPENTQGVICQPIGNQGALILGANA 187

Query: 387 IRGFTTSDQAWITFIGDKLDSTLA 316
            R +T  D+ WI  I DKL  TL+
Sbjct: 188 PRSYTKQDENWIAGIADKLAVTLS 211

>ref|NP_487861.1| hypothetical protein [Nostoc sp. PCC 7120] gi|25359381|pir||AF2283
           hypothetical protein all3821 [imported] - Nostoc sp.
           (strain PCC 7120) gi|17132955|dbj|BAB75520.1|
           ORF_ID:all3821~hypothetical protein [Nostoc sp. PCC
           7120]
          Length = 213

 Score = 72.8 bits (177), Expect = 3e-12
 Identities = 33/78 (42%), Positives = 49/78 (62%)
 Frame = -3

Query: 552 GSVYQGVMKSGAQSYLANLSLYPGKSELSFLPSNTQAVILQPLGDKGIAIIGGDTIRGFT 373
           G++ + V++     YL  L +YPG+ E  +LP NTQ VI QP+G++G+ I+G +  R +T
Sbjct: 133 GAIVKRVLEKQQPVYLVALYVYPGRIEFDYLPENTQGVICQPIGNEGVLILGANAPRSYT 192

Query: 372 TSDQAWITFIGDKLDSTL 319
             D+ WI  I DKL  TL
Sbjct: 193 KQDENWIAGIADKLAVTL 210

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 472,727,540
Number of Sequences: 1393205
Number of extensions: 9626083
Number of successful extensions: 20350
Number of sequences better than 10.0: 81
Number of HSP's better than 10.0 without gapping: 19788
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 20348
length of database: 448,689,247
effective HSP length: 117
effective length of database: 285,684,262
effective search space used: 21712003912
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MR038h02_f BP078975 1 380
2 MPD099e02_f AV776462 11 512
3 SPD005b02_f BP044367 34 585
4 MWM198g09_f AV767775 35 494
5 SPD034e10_f BP046713 51 117
6 MF035c01_f BP030135 69 585




Lotus japonicus
Kazusa DNA Research Institute