KMC007550A_c01

Fasta Sequence
>KMC007550A_C01 KMC007550A_c01
aaagcaagacttatcatcctgatttAGGTAGATTTTCAACCCTTCATAGAATATTCAGCC
TGAAATTATACATAAATTTGGTGTAAAAGAAGCACATGTAAATGTTTCTTTTTCACCTGA
ATCAATTCCAACATCCTGAAGCTATTCACAGAAGCTTCTCTTAAAAATTGATTTTGACTT
TAGAAATGATTGCAGAAGTATTTCCAACATCCACTAAATGATACAAGTACAGCGACACAA
TTTAACCTTGGTGCTATATACACTACAAAGGACTTCAATCATCACCTTGGGACTCTGTTT
CAGTAGCTAATATCATATCTCATTATTTTTTATCCAAGCAATATAAATCTTTCCTTGCAC
TTCAATCTTCAAAACGTCTGAACCAGTCGCTAATGGTTCTGGCAAGGATTCTAGTATCAC
TCAAAGAAAGAGATTTAAGCACTTGAGCAACAGCATCTGCTGGAGTATACAATTTACCCA
CTTGCCATCTTGGTTCCTGAATGCAAGCTGTAATGTGGTTACCATTAAGTAGTACTTTTT
CCAATGTCCCACCAAAAGACTCCACACGAGGTTTAAGTGTTTCTTCAAGAATGTCTGTCT
CATCAATTGCGTCAGAATTGAATTTCACCAATAATGTATTTTCAACATTGTATGAACATT
TGAAGCAATCACGGTTCTCAGATGGTGTTGGCTTGAACTCGGATACTCCTTGTGTAATCT
CATTCATTACTGATGGTAACTGATCAACAAATTTAGTTA


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC007550A_C01 KMC007550A_c01
         (759 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_189940.1| putative protein; protein id: At3g43540.1, supp...   213  2e-54
ref|ZP_00071353.1| hypothetical protein [Trichodesmium erythraeu...    74  2e-12
ref|NP_683265.1| ORF_ID:tll2476~hypothetical protein [Thermosyne...    69  6e-11
ref|NP_485128.1| hypothetical protein [Nostoc sp. PCC 7120] gi|2...    68  2e-10
gb|ZP_00111626.1| hypothetical protein [Nostoc punctiforme]            64  2e-09

>ref|NP_189940.1| putative protein; protein id: At3g43540.1, supported by cDNA:
           gi_14532795, supported by cDNA: gi_19310740 [Arabidopsis
           thaliana] gi|11358080|pir||T47396 hypothetical protein
           T18D12.110 - Arabidopsis thaliana
           gi|7288032|emb|CAB81794.1| putative protein [Arabidopsis
           thaliana] gi|14532796|gb|AAK64179.1| unknown protein
           [Arabidopsis thaliana] gi|19310741|gb|AAL85101.1|
           unknown protein [Arabidopsis thaliana]
          Length = 373

 Score =  213 bits (543), Expect = 2e-54
 Identities = 97/131 (74%), Positives = 118/131 (90%)
 Frame = -3

Query: 757 TKFVDQLPSVMNEITQGVSEFKPTPSENRDCFKCSYNVENTLLVKFNSDAIDETDILEET 578
           TK VDQLPSV  E+ QGVSEF+P+P ENR+CFKCSY+V +TLLV+FNSDAIDETD+LEET
Sbjct: 243 TKLVDQLPSVFGEVGQGVSEFRPSPLENRNCFKCSYSVPHTLLVQFNSDAIDETDLLEET 302

Query: 577 LKPRVESFGGTLEKVLLNGNHITACIQEPRWQVGKLYTPADAVAQVLKSLSLSDTRILAR 398
           L+PR+ES GGTLEKV LNGNH+T CIQ+P+WQ+G +YTPADAVAQ LK++ LS+TR+L+R
Sbjct: 303 LRPRIESIGGTLEKVRLNGNHLTPCIQDPKWQIGTVYTPADAVAQALKTIPLSETRVLSR 362

Query: 397 TISDWFRRFED 365
           TI DWFRRFE+
Sbjct: 363 TIVDWFRRFEN 373
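
The "Frame = -3" line and the descending query coordinates (757 down to 365) indicate that the protein match lies on the reverse strand of the contig. The following is a minimal sketch of how such a frame can be reproduced, assuming the usual BLASTX convention that frame -3 starts at the third base of the reverse complement and that the standard genetic code applies. The function names are illustrative and are not part of BLAST.

```python
# Standard genetic code, with codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (case-insensitive)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def translate_frame(seq, frame):
    """Translate seq in frame +1..+3 or -1..-3; unknown codons become 'X'."""
    if frame not in (1, 2, 3, -1, -2, -3):
        raise ValueError("frame must be one of +1, +2, +3, -1, -2, -3")
    dna = seq.upper() if frame > 0 else reverse_complement(seq)
    offset = abs(frame) - 1  # assumed convention: frame -3 skips two bases
    codons = (dna[i:i + 3] for i in range(offset, len(dna) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Last line of the FASTA record above (39 bases at the 3' end of the contig).
tail = "CATTCATTACTGATGGTAACTGATCAACAAATTTAGTTA"
print(translate_frame(tail, -3))
```

Applied to the last line of the FASTA record, this yields TKFVDQLPSVMN, which agrees with the first twelve residues of the query line starting at position 757.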

>ref|ZP_00071353.1| hypothetical protein [Trichodesmium erythraeum IMS101]
          Length = 256

 Score = 74.3 bits (181), Expect = 2e-12
 Identities = 46/122 (37%), Positives = 65/122 (52%)
 Frame = -3

Query: 748 VDQLPSVMNEITQGVSEFKPTPSENRDCFKCSYNVENTLLVKFNSDAIDETDILEETLKP 569
           V+Q+ SV+        EF P+P       +  Y +   LL+KFN+D ID+T  L + L+ 
Sbjct: 136 VEQISSVVKV------EFTPSPKITNSIIQKRYQIRRNLLIKFNNDNIDQTLRLSDILRL 189

Query: 568 RVESFGGTLEKVLLNGNHITACIQEPRWQVGKLYTPADAVAQVLKSLSLSDTRILARTIS 389
           R  S   T+++  LNGNH+T   Q+  W VG++YTP DA+ Q LK     D   L R I 
Sbjct: 190 RFPSMV-TVQR--LNGNHLTPLGQDLSWSVGQVYTPVDAIGQWLKQEIYKDFNKLQREIL 246

Query: 388 DW 383
            W
Sbjct: 247 LW 248

>ref|NP_683265.1| ORF_ID:tll2476~hypothetical protein [Thermosynechococcus elongatus
           BP-1] gi|22296203|dbj|BAC10027.1|
           ORF_ID:tll2476~hypothetical protein [Thermosynechococcus
           elongatus BP-1]
          Length = 258

 Score = 69.3 bits (168), Expect = 6e-11
 Identities = 36/115 (31%), Positives = 59/115 (51%)
 Frame = -3

Query: 727 MNEITQGVSEFKPTPSENRDCFKCSYNVENTLLVKFNSDAIDETDILEETLKPRVESFGG 548
           M+ +     EF P+P+E     +  Y V   LL++F  D ID+T  L   L+ +   FG 
Sbjct: 135 MDNLGPASVEFTPSPTETEHFIQKRYPVRRNLLIRFQDDDIDQTARLRSLLRAK---FGD 191

Query: 547 TLEKVLLNGNHITACIQEPRWQVGKLYTPADAVAQVLKSLSLSDTRILARTISDW 383
            +  + L GNH+T   Q+ +WQVG  ++P DA+ Q +K     +  +L   + +W
Sbjct: 192 MVTALKLPGNHLTPLSQDLKWQVGAEFSPLDALGQWIKQSLFPEMPVLEACLLEW 246

>ref|NP_485128.1| hypothetical protein [Nostoc sp. PCC 7120] gi|25341276|pir||AB1942
           hypothetical protein alr1085 [imported] - Nostoc sp.
           (strain PCC 7120) gi|17130431|dbj|BAB73042.1|
           ORF_ID:alr1085~hypothetical protein [Nostoc sp. PCC
           7120]
          Length = 255

 Score = 67.8 bits (164), Expect = 2e-10
 Identities = 42/121 (34%), Positives = 58/121 (47%)
 Frame = -3

Query: 745 DQLPSVMNEITQGVSEFKPTPSENRDCFKCSYNVENTLLVKFNSDAIDETDILEETLKPR 566
           D +P V    T    EF P+P E     + SYNV   LL+KFN+D +D++  L + L+ R
Sbjct: 131 DAIPLVEQFNTTLAIEFTPSPLETNKLVQESYNVRRNLLIKFNNDNLDQSAALTKILQVR 190

Query: 565 VESFGGTLEKVLLNGNHITACIQEPRWQVGKLYTPADAVAQVLKSLSLSDTRILARTISD 386
              F   +    L G H T   Q+ +WQ G  +TP DA+ Q  K     D   L R +  
Sbjct: 191 ---FPEMVTAQTLPGTHTTPLGQDVKWQTGSSFTPFDALGQWFKQEVYRDLNQLNRAMLL 247

Query: 385 W 383
           W
Sbjct: 248 W 248

>gb|ZP_00111626.1| hypothetical protein [Nostoc punctiforme]
          Length = 255

 Score = 63.9 bits (154), Expect = 2e-09
 Identities = 36/106 (33%), Positives = 53/106 (49%)
 Frame = -3

Query: 700 EFKPTPSENRDCFKCSYNVENTLLVKFNSDAIDETDILEETLKPRVESFGGTLEKVLLNG 521
           EF P+P E     +  YN+   LL+KF++D ID++  L + L+   E F   +    L G
Sbjct: 146 EFTPSPLETNKLVQERYNIRRNLLIKFSNDTIDQSAALTKILQ---ERFDDMVTAQTLPG 202

Query: 520 NHITACIQEPRWQVGKLYTPADAVAQVLKSLSLSDTRILARTISDW 383
            H T   Q+ +WQ G  +TP DA+ Q  K  +  D   L  +I  W
Sbjct: 203 THTTPLGQDIKWQTGTSFTPFDALGQWFKQEAYRDLNQLKSSILLW 248

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 581,489,716
Number of Sequences: 1393205
Number of extensions: 11805309
Number of successful extensions: 28689
Number of sequences better than 10.0: 24
Number of HSP's better than 10.0 without gapping: 27890
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 28683
length of database: 448,689,247
effective HSP length: 120
effective length of database: 281,504,647
effective search space used: 37158613404
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
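
The bit scores and E-values reported for each hit can be related to the parameters listed above. The sketch below assumes the standard Karlin-Altschul relations, S' = (lambda * S - ln K) / ln 2 and E = (search space) * 2^(-S'), together with the usual length correction (database letters minus number of sequences times the effective HSP length). The constants are copied from this report as printed; because they are rounded, the results are only approximate, and the helper names are illustrative.

```python
import math

# Values copied from the report (gapped Lambda and K, database size, search space).
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 120
SEARCH_SPACE = 37_158_613_404


def bit_score(raw_score):
    """Convert a raw alignment score to a bit score."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def expect(raw_score):
    """Approximate E-value for a raw score over the reported search space."""
    return SEARCH_SPACE * 2.0 ** (-bit_score(raw_score))


def effective_db_length():
    """Database length after subtracting the effective HSP length per sequence."""
    return DB_LETTERS - DB_SEQUENCES * EFFECTIVE_HSP_LENGTH


# Raw scores are the numbers in parentheses on each "Score =" line.
for raw in (543, 181, 168, 164, 154):
    print(raw, round(bit_score(raw), 1), f"{expect(raw):.1e}")
print(effective_db_length(), SEARCH_SPACE // effective_db_length())
```

With these inputs the effective database length comes out at 281,504,647, matching the report, and the search space divided by it gives an effective query length of 132 residues. The computed E-values land close to the reported ones; small differences are expected from the rounded Lambda and K.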


EST assembly


No.  Clone        Accession  Position (start-end)
1    GENf087c11   BP062031   1-355
2    MF078a02_f   BP032403   26-492
3    SPD068c06_f  BP049417   201-760
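
Since the assembly image itself is not available here, the clone positions can be summarised numerically instead. The sketch below computes how many ESTs cover each stretch of the contig, assuming the two position numbers are 1-based, inclusive start and end coordinates. The function names are illustrative.

```python
# (clone, start, end) as listed above; coordinates assumed 1-based and inclusive.
CLONES = [
    ("GENf087c11", 1, 355),
    ("MF078a02_f", 26, 492),
    ("SPD068c06_f", 201, 760),
]


def coverage_depth(clones):
    """Return a list where depth[pos] is the number of clones covering pos."""
    last = max(end for _, _, end in clones)
    depth = [0] * (last + 1)  # index 0 is unused so positions stay 1-based
    for _, start, end in clones:
        for pos in range(start, end + 1):
            depth[pos] += 1
    return depth


def depth_runs(depth):
    """Collapse per-position depth into (start, end, depth) runs."""
    runs = []
    for pos in range(1, len(depth)):
        if runs and runs[-1][2] == depth[pos]:
            runs[-1][1] = pos  # extend the current run
        else:
            runs.append([pos, pos, depth[pos]])
    return [tuple(run) for run in runs]


for start, end, n in depth_runs(coverage_depth(CLONES)):
    print(f"{start:>3}-{end:<3} covered by {n} EST(s)")
```

Under that assumption, positions 201-355 are covered by all three ESTs, while the two ends of the contig (1-25 and 493-760) rest on a single read each.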




Lotus japonicus
Kazusa DNA Research Institute