KMC004901A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC004901A_C01 KMC004901A_c01
aatagaaaTGATTGAGTAAACACTGAACATCTCGTTATATCCAAAGGAAAAAAATGAACA
GCTCATGCTAATTAATCAGACATAACCAAGCCTAATGGTATTGGTTAAAACTAAGTAGAC
TTCATATATTATCTTACTATATACAAAAGGCTTGGAAACTGAAAAAAAGCAGAATATAAC
TAGATTGCAAAGATCTAGACACATACATACATATATAGATTGCAAAGATCTAACAGCCCC
CATGGTTGAGGAGGCAATCTAGTACCATTCTTCTCCTACATTGCCAGTTGTATGGTTCTA
TGGCGATAATATAAGTTGAAAAAGGGCCAAAAACAGGAGGTTGTGGAAACAACTGACCAA
GAAAGATGGCAAAATGAATTAATTAATTGTCAAAGTGAAAGACATTGTACCCTGTCTGGC
AGATTTTTAGGCATTGGAACTGCAGTGACATATTTCTGGGGGTATGCTTTCTGAGCAGTT
GTATCAGAGGATGATGAGGTCTCTGCCCCATCCTTTTGCCCTCGTCCATGTCCATGGAAC
TTGGCAAACCGATTGATGACAGAAAATCTTTCTAAATCTTGTAACTCAACTCTCAAGTCC
AAAATTGAGGCTCTGCTATCCAATCTCAATATATCATTTTCCAGTTTCCTCGCCCTACCA
ACAAAGTCTTCCACTTTCGAAATGCACTGGTCGATTTTCTCAGATGGCTTAAGCGTATCC
GGCAGGTGGTTTTGGCTGCCAGGGGGCACTAGTGTGTCACTTCCGCACAATGAGACTGAA
CTGCAAGCATCTCCTAACACTAATCTAGCAACAGAGTAAACCACACTATCATGATGCAAC
TTGACATCCGCAGAGAGAACTGCAGCTGGTGGAGGGTTAAGTAGTTGTTGCA


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC004901A_C01 KMC004901A_c01
         (892 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_196415.1| unknown protein; protein id: At5g07980.1 [Arabi...   146  4e-34
ref|NP_196411.1| unknown protein; protein id: At5g07940.1 [Arabi...   143  3e-33
pir||T45619 hypothetical protein F13G24.140 - Arabidopsis thalia...   127  2e-28
ref|NP_196414.1| unknown protein; protein id: At5g07970.1 [Arabi...   125  9e-28
ref|NP_683606.1| hypothetical protein; protein id: At3g29385.1 [...    84  2e-15

>ref|NP_196415.1| unknown protein; protein id: At5g07980.1 [Arabidopsis thaliana]
            gi|11357396|pir||T45623 hypothetical protein F13G24.180 -
            Arabidopsis thaliana gi|6562312|emb|CAB62610.1| putative
            protein [Arabidopsis thaliana]
            gi|10176732|dbj|BAB09962.1| gene_id:MXM12.22~unknown
            protein [Arabidopsis thaliana]
          Length = 1501

 Score =  146 bits (369), Expect = 4e-34
 Identities = 82/167 (49%), Positives = 103/167 (61%), Gaps = 1/167 (0%)
 Frame = -3

Query: 890  QQLLNPPPAAVLSADVKLHHDSVVYSVARLVLGDACSSVSLCGSDTLVPPGSQNHLPDTL 711
            QQL +PPPA V+S     +++ V Y+ AR  LGDACSS S   S+   PP   N L +  
Sbjct: 1345 QQLCSPPPARVISLVASSNYEFVAYTAARGALGDACSSSSTDRSEGFWPPNISNPLSERT 1404

Query: 710  KPSEKIDQCISKV-EDFVGRARKLENDILRLDSRASILDLRVELQDLERFSVINRFAKFH 534
            K  +  DQ ISK  EDF+ R RKLE D  RL++  +I DLRVE+QDLE+F+VINRFAKFH
Sbjct: 1405 KTEKISDQYISKAAEDFISRTRKLETDFARLENGTTIPDLRVEVQDLEKFAVINRFAKFH 1464

Query: 533  GHGRGQKDGAETSSSSDTTAQKAYPQKYVTAVPMPKNLPDRVQCLSL 393
                        S      + +  PQ+YVT  PMP+N+PDRVQCLSL
Sbjct: 1465 ----------PPSMDRTLNSVRINPQRYVTVAPMPQNIPDRVQCLSL 1501

>ref|NP_196411.1| unknown protein; protein id: At5g07940.1 [Arabidopsis thaliana]
            gi|10176728|dbj|BAB09958.1| gene_id:MXM12.18~unknown
            protein [Arabidopsis thaliana]
          Length = 1526

 Score =  143 bits (361), Expect = 3e-33
 Identities = 80/167 (47%), Positives = 103/167 (60%), Gaps = 1/167 (0%)
 Frame = -3

Query: 890  QQLLNPPPAAVLSADVKLHHDSVVYSVARLVLGDACSSVSLCGSDTLVPPGSQNHLPDTL 711
            QQL +PPPA V+S     ++D V Y+  R  LGDACSS S   S+   PP + N L +  
Sbjct: 1368 QQLFSPPPARVISLVASSNYDVVAYTAGRAALGDACSSSSTDRSEGFSPPNNSNPLSERT 1427

Query: 710  KPSEKIDQCISKV-EDFVGRARKLENDILRLDSRASILDLRVELQDLERFSVINRFAKFH 534
            +  +  DQ ISK  EDF+ R RKLE D   L++  +I DLRVE+QDLE+F+VINRFAKFH
Sbjct: 1428 ENEKISDQYISKAAEDFISRTRKLETDFAGLENGTTIPDLRVEVQDLEKFAVINRFAKFH 1487

Query: 533  GHGRGQKDGAETSSSSDTTAQKAYPQKYVTAVPMPKNLPDRVQCLSL 393
                       +S +    + K   Q+YVT  PMP+N+PDRVQCLSL
Sbjct: 1488 --------PPSSSMNRTVNSLKLNLQRYVTIAPMPQNIPDRVQCLSL 1526

>pir||T45619 hypothetical protein F13G24.140 - Arabidopsis thaliana
            gi|6562308|emb|CAB62606.1| hypothetical protein
            [Arabidopsis thaliana]
          Length = 1540

 Score =  127 bits (319), Expect = 2e-28
 Identities = 76/163 (46%), Positives = 98/163 (59%), Gaps = 2/163 (1%)
 Frame = -3

Query: 890  QQLLNPPPAAVLSADVKLHHDSVVYSVARLVLGDACSSVSLCGSDTLVPPGSQNHLPDTL 711
            QQL +PPPA V+S     ++D V Y+  R  LGDACSS S   S+   PP + N   +  
Sbjct: 1368 QQLFSPPPARVISLVASSNYDVVAYTAGRAALGDACSSSSTDRSEGFSPPNNSNPTEN-- 1425

Query: 710  KPSEKI-DQCISKV-EDFVGRARKLENDILRLDSRASILDLRVELQDLERFSVINRFAKF 537
               EKI DQ ISK  EDF+ R RKLE D   L++  +I DLRVE+QDLE+F+VINRFAKF
Sbjct: 1426 ---EKISDQYISKAAEDFISRTRKLETDFAGLENGTTIPDLRVEVQDLEKFAVINRFAKF 1482

Query: 536  HGHGRGQKDGAETSSSSDTTAQKAYPQKYVTAVPMPKNLPDRV 408
            H           +S +    + K   Q+YVT  PMP+N+PDR+
Sbjct: 1483 H--------PPSSSMNRTVNSLKLNLQRYVTIAPMPQNIPDRL 1517

>ref|NP_196414.1| unknown protein; protein id: At5g07970.1 [Arabidopsis thaliana]
            gi|11357395|pir||T45622 hypothetical protein F13G24.170 -
            Arabidopsis thaliana gi|6562311|emb|CAB62609.1| putative
            protein [Arabidopsis thaliana]
            gi|10176731|dbj|BAB09961.1| gene_id:MXM12.21~unknown
            protein [Arabidopsis thaliana]
          Length = 1097

 Score =  125 bits (314), Expect = 9e-28
 Identities = 75/168 (44%), Positives = 100/168 (58%), Gaps = 3/168 (1%)
 Frame = -3

Query: 890  QQLLNPPPAAVLSADVKLHHDSVVYSVARLVLGDACSSVSLCGSDTLVPPGSQNHLPDTL 711
            QQL  P P  V S  +   ++ V YS AR  LGDACSS S    +  +   + N L +  
Sbjct: 941  QQLFRPLPGRVKS--LVTSYEFVAYSAARAALGDACSSTSTDRIEGFLLQNNLNPLSERT 998

Query: 710  KPSEKIDQCISKV-EDFVGRARKLENDILRLDSRASILDLRVELQDLERFSVINRFAKFH 534
            +  +  DQ ISK  EDF+ R +KLE D   L+   +I DLRVE+QDLERF+VINRFA FH
Sbjct: 999  ETEKMSDQYISKAAEDFISRTKKLETDFAGLEKGTTITDLRVEVQDLERFAVINRFASFH 1058

Query: 533  GHGRGQKDGAETSSSSD--TTAQKAYPQKYVTAVPMPKNLPDRVQCLS 396
                      ++SSS D   ++ +  PQ+YVT  P+P+++PDRVQCLS
Sbjct: 1059 ----------QSSSSMDRSVSSLRLNPQRYVTVAPVPRHIPDRVQCLS 1096

>ref|NP_683606.1| hypothetical protein; protein id: At3g29385.1 [Arabidopsis
           thaliana]
          Length = 218

 Score = 84.3 bits (207), Expect = 2e-15
 Identities = 61/168 (36%), Positives = 84/168 (49%), Gaps = 2/168 (1%)
 Frame = -3

Query: 890 QQLLNPPPAAVLSAD-VKLHHDSVVYSVARLVLGDACSSVSLCGSDTLVPPGSQNHLPDT 714
           QQLL P P  V   D   L+++ V+Y V+R+ L ++CS    C SD       Q     T
Sbjct: 72  QQLLQPAPTFVFLGDNAALNYEIVLYYVSRINLANSCSLK--CRSDLDKSINRQ-----T 124

Query: 713 LKPSEKIDQCISK-VEDFVGRARKLENDILRLDSRASILDLRVELQDLERFSVINRFAKF 537
            K +   DQ  S  V  F  + +KLE++   L+   SILD+  E+QDLERFS+IN   KF
Sbjct: 125 SKTASNQDQQHSLLVNAFNEKIQKLESNFQSLERTTSILDIIFEIQDLERFSMINHLGKF 184

Query: 536 HGHGRGQKDGAETSSSSDTTAQKAYPQKYVTAVPMPKNLPDRVQCLSL 393
           H   +              T ++  P KY  A+ MP NLP+ + CL L
Sbjct: 185 HNRAK--------------TFKRLIPHKYAVAIQMPMNLPEPLHCLPL 218

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 774,424,657
Number of Sequences: 1393205
Number of extensions: 16961175
Number of successful extensions: 44579
Number of sequences better than 10.0: 21
Number of HSP's better than 10.0 without gapping: 42631
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 44550
length of database: 448,689,247
effective HSP length: 122
effective length of database: 278,718,237
effective search space used: 48496973238
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MWL060b06_f AV769616 1 593
2 MPDL040a10_f AV778509 6 505
3 MWL078e08_f AV769992 14 261
4 MRL020c11_f BP084737 112 575
5 SPDL064d05_f BP055975 167 735
6 MPDL082a09_f AV780748 168 692
7 SPDL060h02_f BP055773 169 696
8 MPDL086c05_f AV780977 366 905




Lotus japonicus
Kazusa DNA Research Institute