KMC014766A_c01

Fasta Sequence
>KMC014766A_C01 KMC014766A_c01
aagaaaaatgttaaattgaattctagtatgtatgccataagtccaTGGTAAGTCAAATTT
CTTTACACCTTAGCAGAAAAAATAACAATGATGATAACAATGCTGACCAACACTCGAATA
AGCAAACTTTTATGCCTCGACACAAGAAAATAAAAATCATATGGAAATGAAAAGATTATA
TAGATCTAATATAAGTGAGAAAAAGGAAGGAATGGAAGCAACTACTTTCCAAATAGTTTC
CTGCGCCTAACCGATGGCTTGAGCCGCATACTAGCGGCTTCTCCAATCACAATCTCACCT
ACTAAGTCCTTAAATATGAGTCTCTCAACATCCAAAACAAACCCAGGTATTTCACCATTG
AAGTTTTTCCAACTTTCTGAACCTTCATGCATCACATCTTCCCACAGAATACTTTTCAGA
CCATCATCCTCTTCACCTTCCAATTCTAAGCACCACCCTGGCTTCTTGGCTTGAACTTTT
TCTATCTCAAAGCATAGTTCTTTAAGAAGCTTTTGAGCACTGAGCGATTTCTTCGTGAGT
CTGT


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC014766A_C01 KMC014766A_c01
         (544 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_177556.1| unknown protein; protein id: At1g74160.1 [Arabi...    91  7e-18
ref|NP_173297.1| unknown protein; protein id: At1g18620.1, suppo...    85  6e-16
ref|NP_197062.1| putative protein; protein id: At5g15580.1 [Arab...    78  8e-14
gb|AAM64269.1| unknown [Arabidopsis thaliana]                          63  3e-09
ref|NP_566165.2| expressed protein; protein id: At3g02170.1, sup...    63  3e-09

>ref|NP_177556.1| unknown protein; protein id: At1g74160.1 [Arabidopsis thaliana]
            gi|25406317|pir||G96769 unknown protein F9E11.1
            [imported] - Arabidopsis thaliana
            gi|12323815|gb|AAG51874.1|AC079678_4 unknown protein;
            20090-16103 [Arabidopsis thaliana]
          Length = 1028

 Score = 91.3 bits (225), Expect = 7e-18
 Identities = 60/106 (56%), Positives = 74/106 (69%), Gaps = 2/106 (1%)
 Frame = -3

Query: 542  RLTKKSLSAQKLLKELCFEIE--KVQAKKPGWCLELEGEEDDGLKSILWEDVMHEGSESW 369
            ++TKK++SAQ+LLKELC  IE  + QA K      LE EEDD LKSIL EDV    S +W
Sbjct: 923  KVTKKAVSAQQLLKELCSAIETQQKQATKRSENFLLE-EEDDFLKSILAEDVTIR-SGNW 980

Query: 368  KNFNGEIPGFVLDVERLIFKDLVGEIVIGEAASMRLKPSVRRRKLF 231
             +F+GE+ G VLDVERL+FKDLV EIV  E + ++ K S RRR LF
Sbjct: 981  ADFSGEMSGLVLDVERLVFKDLVNEIVHAETSRLQAK-SGRRRTLF 1025
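
The "Frame = -3" line and the descending query coordinates (542 down to 231) mean the aligned protein is read from the reverse-complement strand, starting two bases in from the 3' end of the contig. The following is a minimal sketch of that translation, assuming the standard genetic code. The function names and the `CONTIG_TAIL` constant (the last 64 bases of the FASTA record above) are illustrative only and are not part of the BLAST report.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_frame(seq, frame):
    """Translate seq in a BLAST-style frame: +1..+3 or -1..-3.

    Negative frames are read on the reverse-complement strand, so frame -3
    starts at the third base counted from the 3' end of the input.
    """
    if frame not in (1, 2, 3, -1, -2, -3):
        raise ValueError("frame must be one of +1..+3 or -1..-3")
    strand = seq.upper() if frame > 0 else reverse_complement(seq)
    offset = abs(frame) - 1
    codons = (strand[i:i + 3] for i in range(offset, len(strand) - 2, 3))
    # Codons containing anything other than A/C/G/T are reported as X.
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Hypothetical constant: the last 64 bases of contig KMC014766A_c01.
CONTIG_TAIL = (
    "TCTATCTCAAAGCATAGTTCTTTAAGAAGCTTTTGAGCACTGAGCGATTTCTTCGTGAGT"
    "CTGT"
)

print(translate_frame(CONTIG_TAIL, -3))  # RLTKKSLSAQKLLKELCFEI
```

The printed peptide matches the start of the query line in the alignment above (query position 542 onward). Because negative frames are anchored at the 3' end, translating only the tail of the contig gives the same leading residues as translating the full 544-base sequence.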

>ref|NP_173297.1| unknown protein; protein id: At1g18620.1, supported by cDNA:
            gi_20856608 [Arabidopsis thaliana]
            gi|25518757|pir||H86319 hypothetical protein F26I16.4 -
            Arabidopsis thaliana gi|9795593|gb|AAF98411.1|AC026238_3
            Unknown protein [Arabidopsis thaliana]
            gi|20856609|gb|AAM26675.1| At1g18620/F25I16_13
            [Arabidopsis thaliana]
          Length = 978

 Score = 84.7 bits (208), Expect = 6e-16
 Identities = 59/105 (56%), Positives = 70/105 (66%), Gaps = 6/105 (5%)
 Frame = -3

Query: 533  KKSLSAQKLLKELCFEIE--KVQAKKPGWCL----ELEGEEDDGLKSILWEDVMHEGSES 372
            KK LSAQ LLKELC EIE  + QAKK    L    E E EE+D LK IL ED+  + SE 
Sbjct: 871  KKVLSAQNLLKELCSEIEILQKQAKKRSENLLLLEEEEEEEEDFLKCILDEDMAIQ-SEK 929

Query: 371  WKNFNGEIPGFVLDVERLIFKDLVGEIVIGEAASMRLKPSVRRRK 237
            W +F+  IPG VLD+ERL+FKDLV EIV GE    RL+ + RR+K
Sbjct: 930  WTDFDDAIPGLVLDMERLLFKDLVKEIVHGEID--RLQGNSRRQK 972

>ref|NP_197062.1| putative protein; protein id: At5g15580.1 [Arabidopsis thaliana]
            gi|11358126|pir||T51536 hypothetical protein T20K14_190 -
            Arabidopsis thaliana gi|9755813|emb|CAC01757.1| putative
            protein [Arabidopsis thaliana]
          Length = 927

 Score = 77.8 bits (190), Expect = 8e-14
 Identities = 44/104 (42%), Positives = 63/104 (60%)
 Frame = -3

Query: 542  RLTKKSLSAQKLLKELCFEIEKVQAKKPGWCLELEGEEDDGLKSILWEDVMHEGSESWKN 363
            R  +KS   ++LL+ LC EI+++Q      C+  E +ED     ++WED+   G  +WK 
Sbjct: 830  RTHEKSSRGEELLQTLCSEIDRLQDNSK--CILDEDDED-----LIWEDLQSHGM-NWKE 881

Query: 362  FNGEIPGFVLDVERLIFKDLVGEIVIGEAASMRLKPSVRRRKLF 231
              GE PG VLD+ERLIFKDL+GE+V  E A+     S + R+LF
Sbjct: 882  IEGETPGLVLDIERLIFKDLIGEVVTSEFAAFPRMLSGQPRQLF 925

>gb|AAM64269.1| unknown [Arabidopsis thaliana]
          Length = 442

 Score = 62.8 bits (151), Expect = 3e-09
 Identities = 35/81 (43%), Positives = 51/81 (62%)
 Frame = -3

Query: 533 KKSLSAQKLLKELCFEIEKVQAKKPGWCLELEGEEDDGLKSILWEDVMHEGSESWKNFNG 354
           +K    ++LL+ LC EI+++Q       LE + EED     I+WED+  + S + K F G
Sbjct: 366 EKISKEEQLLQTLCSEIDRLQQNNSNCILE-DDEED-----IIWEDLQSQ-SMNLKEFEG 418

Query: 353 EIPGFVLDVERLIFKDLVGEI 291
           E PG VLD+ER+IF+DLV E+
Sbjct: 419 ETPGIVLDIERMIFRDLVNEV 439

>ref|NP_566165.2| expressed protein; protein id: At3g02170.1, supported by cDNA:
            gi_15810132 [Arabidopsis thaliana]
            gi|6041800|gb|AAF02120.1|AC009755_13 unknown protein
            [Arabidopsis thaliana]
            gi|6513917|gb|AAF14821.1|AC011664_3 unknown protein
            [Arabidopsis thaliana] gi|23297751|gb|AAN13017.1| unknown
            protein [Arabidopsis thaliana]
          Length = 905

 Score = 62.8 bits (151), Expect = 3e-09
 Identities = 35/81 (43%), Positives = 51/81 (62%)
 Frame = -3

Query: 533  KKSLSAQKLLKELCFEIEKVQAKKPGWCLELEGEEDDGLKSILWEDVMHEGSESWKNFNG 354
            +K    ++LL+ LC EI+++Q       LE + EED     I+WED+  + S + K F G
Sbjct: 829  EKISKEEQLLQTLCSEIDRLQQNNSNCILE-DDEED-----IIWEDLQSQ-SMNLKEFEG 881

Query: 353  EIPGFVLDVERLIFKDLVGEI 291
            E PG VLD+ER+IF+DLV E+
Sbjct: 882  ETPGIVLDIERMIFRDLVNEV 902
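
The percentages on the "Identities" lines of this report are consistent with integer truncation rather than rounding: 70/105 is about 66.7% but is printed as 66%. The sketch below checks that reading against one summary line copied from the report. The regular expression and the function name are assumptions made for illustration and are not taken from BLAST itself.

```python
import re

# Matches fields such as "Identities = 59/105 (56%)".
FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+)%\)")


def check_truncated_percentages(summary_line):
    """Return {field: (count, total, reported_pct, floor_pct)} for one line."""
    result = {}
    for name, count, total, reported in FIELD.findall(summary_line):
        count, total, reported = int(count), int(total), int(reported)
        # Integer division drops the fractional part of the percentage.
        result[name] = (count, total, reported, 100 * count // total)
    return result


line = " Identities = 59/105 (56%), Positives = 70/105 (66%), Gaps = 6/105 (5%)"
for name, (count, total, reported, floored) in check_truncated_percentages(line).items():
    print(name, f"{count}/{total}", "reported", reported, "floor", floored)
```

For every summary line in this report the floored value equals the printed percentage, so a figure such as 56% should be read as "at least 56% and below 57%".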

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 444,971,003
Number of Sequences: 1393205
Number of extensions: 9668699
Number of successful extensions: 46236
Number of sequences better than 10.0: 41
Number of HSP's better than 10.0 without gapping: 32650
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 44063
length of database: 448,689,247
effective HSP length: 115
effective length of database: 288,470,672
effective search space used: 18750593680
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
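
The figures in this statistics block can be cross-checked with a little arithmetic. The sketch below assumes the usual Karlin-Altschul relation, E = search space x 2^(-bit score), and an effective database length equal to the total letters minus one effective HSP length per sequence. The effective query length of 65 residues is not printed in the report; it is inferred here by dividing the search space by the effective database length. All names are illustrative.

```python
# Values copied from the report above.
DB_LETTERS = 448_689_247
N_SEQUENCES = 1_393_205
HSP_LENGTH = 115
SEARCH_SPACE = 18_750_593_680


def effective_db_length(db_letters, n_sequences, hsp_length):
    """Database length after removing one effective HSP length per sequence."""
    return db_letters - n_sequences * hsp_length


def expect_from_bits(bit_score, search_space):
    """Approximate E-value from a bit score: E = search_space * 2**(-bits)."""
    return search_space * 2.0 ** (-bit_score)


eff_db = effective_db_length(DB_LETTERS, N_SEQUENCES, HSP_LENGTH)
eff_query = SEARCH_SPACE // eff_db  # inferred, not printed in the report

print(eff_db)              # 288470672, the "effective length of database"
print(eff_query)           # 65
print(eff_db * eff_query)  # 18750593680, the "effective search space used"

# Bit scores of the five reported hits.
for bits in (91.3, 84.7, 77.8, 62.8, 62.8):
    print(bits, f"{expect_from_bits(bits, SEARCH_SPACE):.1e}")
```

The E-values computed this way land within a small factor of the reported ones (about 6e-18 against the reported 7e-18 for the top hit). The bit scores in the report are rounded to one decimal place, so exact agreement is not expected.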


EST assemble image


No.  Clone        Accession  Position (start  end)
1    MWL073a12_f  AV769888   1                544
2    MF057f03_f   BP031306   46               510




Lotus japonicus
Kazusa DNA Research Institute