KMC001074A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC001074A_C01 KMC001074A_c01
gTATTCATCTCATATTTTCTCAAGTCGATCAGATATAAGAGTTTTTGCAGAGCCAACTAC
ATATTCATGCTTTAAAAAATTTCTAACCGATGGGAAAATGAAAGGGGACAGAGCCACATT
CTCAGTGCCCCTAAAATCAATTAAATACAATCAATGAAAAAGATCATAGAATGTGCAAAA
TACAACTCAATTGAACTGCATTAGCTTAGGAGCAAATCTGAGCAATCTTTCCAAAGCATG
AGTAGTGAACTTCCTGGCAAAAAGGTGACAAATATTGGTGACCATTCCATGGTACTCGCA
AGTGGTTCCATTTCCATAGCTCAACCTCTCCAAAAACTCAACAGTAACATCAGTCCTAAT
ATACTTGGAAGGATGAGGTCCCCCCTTTGTCCAATCAACCCAAGTCAGAGTCCTATTAGA
ATTCCTCTTCCAAAACTTAATGTTGACCAATGTAGGCAAGTAATGCTCATCACTGTAACA
TTGAGGATTGCAGTACTTCTCGAACACGGGGAAATAACGGCGGTCGGAGACTACTTGGAG
CGCGA


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC001074A_C01 KMC001074A_c01
         (545 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_177006.1| hypothetical protein; protein id: At1g68390.1 [...   149  3e-35
ref|NP_177522.1| hypothetical protein; protein id: At1g73810.1, ...   128  5e-29
gb|AAL38356.1| unknown protein [Arabidopsis thaliana] gi|2805899...   128  5e-29
ref|NP_196734.1| putative protein; protein id: At5g11730.1, supp...   125  3e-28
ref|NP_197121.1| putative protein; protein id: At5g16170.1 [Arab...   125  4e-28

>ref|NP_177006.1| hypothetical protein; protein id: At1g68390.1 [Arabidopsis
           thaliana] gi|25349271|pir||G96707 hypothetical protein
           T2E12.6 [imported] - Arabidopsis thaliana
           gi|6714352|gb|AAF26043.1|AC015986_6 hypothetical
           protein; 24280-21634 [Arabidopsis thaliana]
          Length = 408

 Score =  149 bits (375), Expect = 3e-35
 Identities = 69/119 (57%), Positives = 92/119 (76%), Gaps = 2/119 (1%)
 Frame = -3

Query: 543 ALQVVSDRRYFPVFEKYCNPQCYSDEHYLPTLVNIK--FWKRNSNRTLTWVDWTKGGPHP 370
           AL+++SDR Y+P+F  YC+  CY+DEHY+PTL+NIK    +RNSNRTLTWVDW+KGGPHP
Sbjct: 292 ALEIISDRIYWPLFYSYCHHGCYADEHYIPTLLNIKSSLKRRNSNRTLTWVDWSKGGPHP 351

Query: 369 SKYIRTDVTVEFLERLSYGNGTTCEYHGMVTNICHLFARKFTTHALERLLRFAPKLMQF 193
           +++IR +VT EF+E L   +G  C Y+G  TNIC+LFARKF   AL+RLLR +  ++ F
Sbjct: 352 NRFIRHEVTAEFMENLR--SGGECLYNGEETNICYLFARKFLPTALDRLLRLSRTVLHF 408

>ref|NP_177522.1| hypothetical protein; protein id: At1g73810.1, supported by cDNA:
           gi_17473868 [Arabidopsis thaliana]
           gi|25349273|pir||E96765 hypothetical protein F25P22.23
           [imported] - Arabidopsis thaliana
           gi|12324202|gb|AAG52068.1|AC012679_6 hypothetical
           protein; 83152-80450 [Arabidopsis thaliana]
          Length = 418

 Score =  128 bits (321), Expect = 5e-29
 Identities = 63/117 (53%), Positives = 77/117 (64%)
 Frame = -3

Query: 543 ALQVVSDRRYFPVFEKYCNPQCYSDEHYLPTLVNIKFWKRNSNRTLTWVDWTKGGPHPSK 364
           AL VVSD  YFPVFEKYC   CY+DEHYL T V+  F  +N+NR+LTW DW++ GPHP K
Sbjct: 303 ALAVVSDTTYFPVFEKYCLWNCYADEHYLSTFVHAMFPGKNANRSLTWTDWSRRGPHPRK 362

Query: 363 YIRTDVTVEFLERLSYGNGTTCEYHGMVTNICHLFARKFTTHALERLLRFAPKLMQF 193
           Y R  VT EFL R+       C Y+G  +  C+LFARKF    L++LL FA  +M F
Sbjct: 363 YTRRSVTGEFLRRVR-NREQGCVYNGKKSEKCYLFARKFDGSTLDKLLYFAHSVMGF 418

>gb|AAL38356.1| unknown protein [Arabidopsis thaliana] gi|28058990|gb|AAO29975.1|
           unknown protein [Arabidopsis thaliana]
          Length = 418

 Score =  128 bits (321), Expect = 5e-29
 Identities = 63/117 (53%), Positives = 77/117 (64%)
 Frame = -3

Query: 543 ALQVVSDRRYFPVFEKYCNPQCYSDEHYLPTLVNIKFWKRNSNRTLTWVDWTKGGPHPSK 364
           AL VVSD  YFPVFEKYC   CY+DEHYL T V+  F  +N+NR+LTW DW++ GPHP K
Sbjct: 303 ALAVVSDTTYFPVFEKYCLWNCYADEHYLSTFVHAMFPGKNANRSLTWTDWSRRGPHPRK 362

Query: 363 YIRTDVTVEFLERLSYGNGTTCEYHGMVTNICHLFARKFTTHALERLLRFAPKLMQF 193
           Y R  VT EFL R+       C Y+G  +  C+LFARKF    L++LL FA  +M F
Sbjct: 363 YTRRSVTGEFLRRVR-NREQGCVYNGKKSEKCYLFARKFDGSTLDKLLYFAHSVMGF 418

>ref|NP_196734.1| putative protein; protein id: At5g11730.1, supported by cDNA:
           gi_20260569 [Arabidopsis thaliana]
           gi|11281886|pir||T48532 hypothetical protein T22P22.120
           - Arabidopsis thaliana gi|7573387|emb|CAB87691.1|
           putative protein [Arabidopsis thaliana]
           gi|20260570|gb|AAM13183.1| putative protein [Arabidopsis
           thaliana]
          Length = 386

 Score =  125 bits (314), Expect = 3e-28
 Identities = 55/117 (47%), Positives = 80/117 (68%)
 Frame = -3

Query: 543 ALQVVSDRRYFPVFEKYCNPQCYSDEHYLPTLVNIKFWKRNSNRTLTWVDWTKGGPHPSK 364
           A  +V D  Y+P F+++C P CY DEHY PT++ I+     +NR+LTWVDW++GGPHP+ 
Sbjct: 272 AATIVKDTLYYPKFKEFCRPACYVDEHYFPTMLTIEKPTVLANRSLTWVDWSRGGPHPAT 331

Query: 363 YIRTDVTVEFLERLSYGNGTTCEYHGMVTNICHLFARKFTTHALERLLRFAPKLMQF 193
           + R+D+T  F  ++   +G  C Y+G  T++C+LFARKF   ALE LL  APK++ F
Sbjct: 332 FGRSDITENFFGKIF--DGRNCSYNGRNTSMCYLFARKFAPSALEPLLHIAPKILGF 386

>ref|NP_197121.1| putative protein; protein id: At5g16170.1 [Arabidopsis thaliana]
           gi|11358179|pir||T51487 hypothetical protein T21H19_90 -
           Arabidopsis thaliana gi|9755827|emb|CAC01858.1| putative
           protein [Arabidopsis thaliana]
          Length = 329

 Score =  125 bits (313), Expect = 4e-28
 Identities = 57/122 (46%), Positives = 81/122 (65%), Gaps = 5/122 (4%)
 Frame = -3

Query: 543 ALQVVSDRRYFPVFEKYCNPQCYSDEHYLPTLVNIKFWKRNSNRTLTWVDWTKGGPHPSK 364
           AL ++ D  Y+ +F+++C P CY DEHY+PTLV++   + ++NRTLTWVDW+K GPHP +
Sbjct: 208 ALHIIEDTVYYKIFDQHCKPPCYMDEHYIPTLVHMLHGEMSANRTLTWVDWSKAGPHPGR 267

Query: 363 YIRTDVTVEFLERLSYGNGTTCEYHG-----MVTNICHLFARKFTTHALERLLRFAPKLM 199
           +I  D+T EFL R+ +     C Y G     + T+ C LFARKFT   LE LLR +P ++
Sbjct: 268 FIWPDITDEFLNRIRFKE--ECVYFGRGGENVTTSKCFLFARKFTAETLEPLLRISPIVL 325

Query: 198 QF 193
            F
Sbjct: 326 GF 327

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 455,391,296
Number of Sequences: 1393205
Number of extensions: 9533583
Number of successful extensions: 24185
Number of sequences better than 10.0: 60
Number of HSP's better than 10.0 without gapping: 23685
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 24138
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 18660035355
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 GENf089h12 BP062126 1 321
2 GENLf056c08 BP065330 2 367
3 GNf054b12 BP071377 69 556




Lotus japonicus
Kazusa DNA Research Institute