KMC003172A_c01

Fasta Sequence
>KMC003172A_C01 KMC003172A_c01
gAAAAAGAAACAAATACCCAGATGGAGATGCATTATACAGTTCTATTATACAATCAATAC
AAAGTGGAGTTTTGAAATTTCTCCAGTTCACACTTGCTGTTGACAAGCTATGAAAGAAAC
AACTGATAACGTTAGAATCCAAACATAAACTGTAGAGTCTACGCAGTGCAGTAAACTCAT
GCAATTGTGATCATCTGCCTAAGGATCTTGAATTGTTGAGGCGCTACCATTCTTATTCAC
AATATTGTATAGGTTATTTTCAACTTGGAAATACATAATGTCAATCGCCTCAGAACAGGG
CCTTACAATGCGGTATCAAGGCTGAACACAGTTCATAAAGAAAACAACCACTATGAATGA
GGGTACAATTTTGTGTGTGATTCCATCCTACATGTGTTGAGGCACGCCCTTACGCAAACT
TAGGAAATGGAAGAGCTTCTGAGGGAGGTAATGTGACAGATCTCAATAATTCCAGTGAGA
AAACAAGTCCAAGGAAATGTGAAAGAATAGTATTCGCTGAAGCCTGCACCAAAAATACAT
CCAATGCGAGAACAGGGCTAGAACCAGGAGCAATTCCCTGATAATATGGATTGGCAGCGG
AAGTGAGAGCCTTTGCAACTAACAGTCCAACCGTGGCTTGCATCCCAAGGATAGCAGCAC
CCATCCCCAGAAGGTTCACTACAATACCATTTTGCAAGCTTTTCACCACATCAGCGCGAG
GGGGTGCCTTGGTAGGTTGATTTGCAGTTTTTCTAAGCTTCTCAGAGAGACGAATATAGC
CAAAGGACCAG


Nr Search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC003172A_C01 KMC003172A_c01
         (791 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_565372.1| expressed protein; protein id: At2g15290.1, sup...   193  3e-48
ref|NP_440150.1| unknown protein [Synechocystis sp. PCC 6803] gi...    74  2e-12
ref|NP_488153.1| hypothetical protein [Nostoc sp. PCC 7120] gi|2...    67  4e-10
gb|ZP_00105923.1| hypothetical protein [Nostoc punctiforme]            64  3e-09
gb|ZP_00111932.1| hypothetical protein [Nostoc punctiforme]            62  1e-08

>ref|NP_565372.1| expressed protein; protein id: At2g15290.1, supported by cDNA:
           gi_15982806, supported by cDNA: gi_17528961 [Arabidopsis
           thaliana] gi|25411626|pir||B84527 hypothetical protein
           At2g15290 [imported] - Arabidopsis thaliana
           gi|4662632|gb|AAD26904.1| expressed protein [Arabidopsis
           thaliana] gi|15982807|gb|AAL09751.1| At2g15290/F27O10.6
           [Arabidopsis thaliana] gi|17528962|gb|AAL38691.1|
           unknown protein [Arabidopsis thaliana]
           gi|21689685|gb|AAM67464.1| unknown protein [Arabidopsis
           thaliana]
          Length = 296

 Score =  193 bits (490), Expect = 3e-48
 Identities = 96/126 (76%), Positives = 114/126 (90%)
 Frame = -2

Query: 790 WSFGYIRLSEKLRKTANQPTKAPPRADVVKSLQNGIVVNLLGMGAAILGMQATVGLLVAK 611
           WSFGYIRLSE+LR+T+  P KAPPRADVVK L++GI+VN+LGMG+A+LGMQATVG LVAK
Sbjct: 171 WSFGYIRLSERLRRTSIDPAKAPPRADVVKGLRSGIMVNILGMGSALLGMQATVGFLVAK 230

Query: 610 ALTSAANPYYQGIAPGSSPVLALDVFLVQASANTILSHFLGLVFSLELLRSVTLPPSEAL 431
           ALT++ANP+YQG++ G SPVLALDVFLVQASANT+LSHFLGLV SLELLRSVT+P SE++
Sbjct: 231 ALTTSANPFYQGVSQGYSPVLALDVFLVQASANTLLSHFLGLVCSLELLRSVTVPNSESV 290

Query: 430 PFPKFA 413
             PK A
Sbjct: 291 VVPKVA 296
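
The `Frame = -2` line means the query was translated from its reverse strand, starting one base in from the 3' end (query position 790). As a minimal sketch of that convention, and not the BLAST implementation itself, the snippet below reverse-complements the last 31 nucleotides of the query, skips one base, and translates with the standard genetic code. The function names and the `QUERY_TAIL` constant are illustrative assumptions; only the nucleotides themselves come from the Fasta record above.

```python
# Standard genetic code, with codon positions ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA string."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def translate(seq):
    """Translate complete codons; a trailing partial codon is ignored.

    Ambiguity codes such as N are not handled in this sketch.
    """
    protein = []
    for i in range(0, len(seq) - 2, 3):
        a, b, c = (BASES.index(x) for x in seq[i:i + 3])
        protein.append(AMINO[16 * a + 4 * b + c])
    return "".join(protein)


def translate_minus_frame(seq, frame):
    """Translate a reverse-strand frame (-1, -2 or -3).

    Frame -k skips k - 1 bases from the start of the reverse complement.
    """
    offset = abs(frame) - 1
    return translate(reverse_complement(seq)[offset:])


# Last 31 nucleotides of the 791-letter query (positions 761-791).
QUERY_TAIL = "CTCAGAGAGACGAATATAGCCAAAGGACCAG"

print(translate_minus_frame(QUERY_TAIL, -2))  # WSFGYIRLSE
```

The ten residues printed match the start of the `Query: 790` line in the alignment above.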

>ref|NP_440150.1| unknown protein [Synechocystis sp. PCC 6803] gi|7459399|pir||S74679
           hypothetical protein sll1656 - Synechocystis sp. (strain
           PCC 6803) gi|1651904|dbj|BAA16830.1|
           ORF_ID:sll1656~unknown protein [Synechocystis sp. PCC
           6803]
          Length = 191

 Score = 73.9 bits (180), Expect = 2e-12
 Identities = 42/109 (38%), Positives = 67/109 (60%)
 Frame = -2

Query: 790 WSFGYIRLSEKLRKTANQPTKAPPRADVVKSLQNGIVVNLLGMGAAILGMQATVGLLVAK 611
           +S  Y RL   L+ +  + TK P RAD++K ++ G+++NL+GMG AI+G  A  G+++ K
Sbjct: 81  FSTRYRRLGRNLKDS--EATKHPRRADILKLIRIGLIINLVGMGTAIIGGAAMSGIVLGK 138

Query: 610 ALTSAANPYYQGIAPGSSPVLALDVFLVQASANTILSHFLGLVFSLELL 464
           AL+     +     P    V + D+  +QA+ NTI++HF GL+ SL LL
Sbjct: 139 ALSVPPGTFASAADPNRF-VQSTDLLAIQANINTIIAHFTGLLSSLWLL 186

>ref|NP_488153.1| hypothetical protein [Nostoc sp. PCC 7120] gi|25335473|pir||AB2320
           hypothetical protein all4113 [imported] - Nostoc sp.
           (strain PCC 7120) gi|17133248|dbj|BAB75812.1|
           ORF_ID:all4113~hypothetical protein [Nostoc sp. PCC
           7120]
          Length = 194

 Score = 66.6 bits (161), Expect = 4e-10
 Identities = 38/112 (33%), Positives = 64/112 (56%)
 Frame = -2

Query: 790 WSFGYIRLSEKLRKTANQPTKAPPRADVVKSLQNGIVVNLLGMGAAILGMQATVGLLVAK 611
           W F Y R+ + L      P   P +AD  ++++ GI+V+L+G+   ++G  AT+G+L+AK
Sbjct: 85  WDFRYTRIGKHLANP--NPALHPSKADTTQAIRLGIIVSLVGILLTLIGAGATLGVLIAK 142

Query: 610 ALTSAANPYYQGIAPGSSPVLALDVFLVQASANTILSHFLGLVFSLELLRSV 455
              S + P    I   +  + ALDVF++  + N I +HF+G + S+ LL  V
Sbjct: 143 ---SISQPPGVAITDPNKIIRALDVFVMVGNINGIAAHFVGAIASIWLLERV 191

>gb|ZP_00105923.1| hypothetical protein [Nostoc punctiforme]
          Length = 204

 Score = 63.9 bits (154), Expect = 3e-09
 Identities = 40/114 (35%), Positives = 68/114 (59%)
 Frame = -2

Query: 787 SFGYIRLSEKLRKTANQPTKAPPRADVVKSLQNGIVVNLLGMGAAILGMQATVGLLVAKA 608
           ++ Y R+ ++L   ++ P+  P +++ V+ L+ G+ VNL G    +LG QA VG LVA++
Sbjct: 95  AYRYTRIGKQLE--SSNPSNRPRKSETVQVLRLGLWVNLGGTLVTLLGAQAIVGTLVARS 152

Query: 607 LTSAANPYYQGIAPGSSPVLALDVFLVQASANTILSHFLGLVFSLELLRSVTLP 446
           ++  A    Q   P +  +  LD+ +VQA+ NT+ +HF GL+ SL LL  +  P
Sbjct: 153 ISPQAIT-TQFFDP-TRIISGLDMLVVQANTNTVSAHFAGLIASLWLLNRINRP 204

>gb|ZP_00111932.1| hypothetical protein [Nostoc punctiforme]
          Length = 189

 Score = 62.0 bits (149), Expect = 1e-08
 Identities = 39/117 (33%), Positives = 63/117 (53%), Gaps = 5/117 (4%)
 Frame = -2

Query: 790 WSFGYIRLSEKLRKTANQPTKAPPRADVVKSLQNGIVVNLLGMGAAILGMQATVGLLVAK 611
           W + Y R+ + L  +   P   P +AD  + L+ GIV+ L+GM   +LG  +TV +L+AK
Sbjct: 80  WDYRYTRIGKALENS--NPALHPSKADTTRILRLGIVIGLVGMLLTLLGAGSTVLVLIAK 137

Query: 610 ALT-----SAANPYYQGIAPGSSPVLALDVFLVQASANTILSHFLGLVFSLELLRSV 455
           +++     +  NPY        + + A+DVF+  A    I +H++G V SL LL  V
Sbjct: 138 SISQPPGVAITNPY--------NIIRAMDVFVAVADITGIAAHYVGTVASLWLLERV 186

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 671,870,994
Number of Sequences: 1393205
Number of extensions: 14600606
Number of successful extensions: 40241
Number of sequences better than 10.0: 21
Number of HSP's better than 10.0 without gapping: 38780
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 40219
length of database: 448,689,247
effective HSP length: 121
effective length of database: 280,111,442
effective search space used: 39775824764
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
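
The parameters above are enough to cross-check the reported scores. The sketch below assumes the usual Karlin-Altschul relations: bit score S' = (lambda * S - ln K) / ln 2 and E = (effective search space) * 2^(-S'). It also assumes the BLASTX query length in residues is the nucleotide length divided by 3, rounded down. The function and constant names are illustrative; the numeric values are copied from this report. Because lambda and K are printed to only three significant figures, the results agree with the report to within rounding, not exactly.

```python
import math

# Values copied from the report (gapped Lambda and K).
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
QUERY_NT = 791
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 121


def effective_search_space():
    """Effective query length times effective database length."""
    # Assumption: the translated query is QUERY_NT // 3 residues long.
    eff_query = QUERY_NT // 3 - EFFECTIVE_HSP_LENGTH
    eff_db = DB_LETTERS - DB_SEQUENCES * EFFECTIVE_HSP_LENGTH
    return eff_query * eff_db


def bit_score(raw_score):
    """Convert a raw alignment score to a bit score."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


space = effective_search_space()
print(space)                    # 39775824764
print(bit_score(490))           # close to the reported 193 bits
print(expect_value(193, space)) # close to the reported 3e-48
```

The computed search space reproduces the `effective search space used` line exactly, which suggests the assumed effective query length (142 residues) is the one used here.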


EST assembly


No.  Clone          Accession  Start  End
1    MFB052f01_f    BP037780       1  491
2    SPDL030e11_f   BP053881       2  516
3    MF012d08_f     BP028866       2  468
4    MR010f06_f     BP076721       2  388
5    MFB021h08_f    BP035536      18  556
6    MFB083f10_f    BP040088      25  479
7    MFB080b02_f    BP039825      47  500
8    MF095h06_f     BP033279     218  680
9    GNf024c05      BP069092     231  435
10   MWM145c08_f    AV766985     235  798




Lotus japonicus
Kazusa DNA Research Institute