KMC000763A_c01

Fasta Sequence
>KMC000763A_C01 KMC000763A_c01
atgaaaaactaaaatatttcatatcacgagtataacctgtggcgccatgaagcatgaatc
acccaactagtaacaaaataAAAATATGGAATGAGCATAGCATGCAGCGCAGGCGACCAC
GCAATTAACTAGACCAAAAATGTGGATAACATGTCAAATACTTCACCCGCTGAAGCACAT
TCTTAAGCCAAATGAAAATGTGCAAGACATTAATCTCTCTACCTCTCTACACTACACTTT
TTCTTCTTTTTCTTCTTTTTCTTTCCTTTCGGTTTCTTCACTTCTACAGAGTTAAGATGG
TCATTAGCAGAGTACGTGTTTGGACAAGATAAGATGACAGAACAGACCTCCATATCATCT
TTGGGAATTTTTTGGAGCATAAGGTGGGCAGATCTATGATCTCCTGATGCTAAAAGTGCC
TGATACATCCTCAAGTAACTTAATACATGGTAAGGAAATCCAGCAACCTCATATTCTCTC
CAAATCAAACGAGCATTATTCAAATCTCCAGCATGGGCACAAGCACTTAGAAGAAAATCA
AGACTCTGCCTTGAAGGAATAAGGCCAATCTCATCCTTGATGGCCCAAAGCAAGTCGAGG
CCAAACGGTAGATGTGTGGACTGGGACGACGCAATTACAGAAAATACAGAGTCAAATAGA
ACTTCCAATGGTATTTCATCACTTTCGAACTTATCC


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC000763A_C01 KMC000763A_c01
         (696 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAD03456.1| contains similarity to Ipomoea nil leaf protein (...   128  9e-29
ref|NP_192388.1| hypothetical protein; protein id: At4g04790.1 [...   128  9e-29
ref|NP_193919.1| putative protein; protein id: At4g21880.1 [Arab...    84  1e-15
ref|NP_171976.1| hypothetical protein; protein id: At1g04840.1 [...    40  0.024
sp|Q9H7B2|U170_HUMAN Hypothetical protein FLJ21087 gi|10437105|d...    38  0.15

>gb|AAD03456.1| contains similarity to Ipomoea nil leaf protein (GB: D85101)
            [Arabidopsis thaliana]
          Length = 760

 Score =  128 bits (321), Expect = 9e-29
 Identities = 66/143 (46%), Positives = 91/143 (63%)
 Frame = -2

Query: 674  IPLEVLFDSVFSVIASSQSTHLPFGLDLLWAIKDEIGLIPSRQSLDFLLSACAHAGDLNN 495
            IP+E  FD VF  IA ++ + +  G+DLL  +KDE+G +PSR+ LDFLL AC +A DL +
Sbjct: 603  IPVEAHFDEVFWAIAETEPSKVHLGMDLLRFMKDELGFVPSRKCLDFLLHACVNAKDLEH 662

Query: 494  ARLIWREYEVAGFPYHVLSYLRMYQALLASGDHRSAHLMLQKIPKDDMEVCSVILSCPNT 315
              L+W+EY+ A FP +VLS+LRMYQ LLA+GD   A  ++ KIPKDD +V  +I    + 
Sbjct: 663  GLLVWKEYQSAAFPCNVLSFLRMYQVLLAAGDSEGAKALVSKIPKDDKDVQHIIEESQSA 722

Query: 314  YSANDHLNSVEVKKPKGKKKKKK 246
            +S          + P  KK KKK
Sbjct: 723  FS----------QAPNKKKPKKK 735

>ref|NP_192388.1| hypothetical protein; protein id: At4g04790.1 [Arabidopsis
           thaliana] gi|25407292|pir||C85060 hypothetical protein
           AT4g04790 [imported] - Arabidopsis thaliana
           gi|7267237|emb|CAB80844.1| hypothetical protein
           [Arabidopsis thaliana]
          Length = 731

 Score =  128 bits (321), Expect = 9e-29
 Identities = 66/143 (46%), Positives = 91/143 (63%)
 Frame = -2

Query: 674 IPLEVLFDSVFSVIASSQSTHLPFGLDLLWAIKDEIGLIPSRQSLDFLLSACAHAGDLNN 495
           IP+E  FD VF  IA ++ + +  G+DLL  +KDE+G +PSR+ LDFLL AC +A DL +
Sbjct: 574 IPVEAHFDEVFWAIAETEPSKVHLGMDLLRFMKDELGFVPSRKCLDFLLHACVNAKDLEH 633

Query: 494 ARLIWREYEVAGFPYHVLSYLRMYQALLASGDHRSAHLMLQKIPKDDMEVCSVILSCPNT 315
             L+W+EY+ A FP +VLS+LRMYQ LLA+GD   A  ++ KIPKDD +V  +I    + 
Sbjct: 634 GLLVWKEYQSAAFPCNVLSFLRMYQVLLAAGDSEGAKALVSKIPKDDKDVQHIIEESQSA 693

Query: 314 YSANDHLNSVEVKKPKGKKKKKK 246
           +S          + P  KK KKK
Sbjct: 694 FS----------QAPNKKKPKKK 706

>ref|NP_193919.1| putative protein; protein id: At4g21880.1 [Arabidopsis thaliana]
           gi|7487835|pir||T05470 hypothetical protein T8O5.90 -
           Arabidopsis thaliana gi|2894566|emb|CAA17155.1| putative
           protein [Arabidopsis thaliana]
           gi|7269033|emb|CAB79143.1| putative protein [Arabidopsis
           thaliana]
          Length = 859

 Score = 84.3 bits (207), Expect = 1e-15
 Identities = 39/84 (46%), Positives = 59/84 (69%)
 Frame = -2

Query: 680 DEIPLEVLFDSVFSVIASSQSTHLPFGLDLLWAIKDEIGLIPSRQSLDFLLSACAHAGDL 501
           D++ +E  F+ VF  IA ++S+ +  GLDL+  +K+E+ L PSR+ LDFLL AC +A D 
Sbjct: 683 DDVGVEYWFEEVFKSIAETESSDVKVGLDLVSFMKEELELCPSRKCLDFLLHACVNAKDK 742

Query: 500 NNARLIWREYEVAGFPYHVLSYLR 429
            +A L+W EY+ A  PY+V++YLR
Sbjct: 743 QSALLVWEEYQCAELPYNVINYLR 766

>ref|NP_171976.1| hypothetical protein; protein id: At1g04840.1 [Arabidopsis
           thaliana] gi|25346323|pir||F86181 protein F13M7.17
           [imported] - Arabidopsis thaliana
           gi|7211995|gb|AAF40466.1|AC004809_24 F13M7.17
           [Arabidopsis thaliana]
          Length = 665

 Score = 40.4 bits (93), Expect = 0.024
 Identities = 33/149 (22%), Positives = 71/149 (47%), Gaps = 2/149 (1%)
 Frame = -2

Query: 683 SDEIPLEVLFDSVFSVIASSQSTHLPFGLDLLWAIKDEIGLIPSRQSLDFLLSACAHAGD 504
           S E P EV+F +V +   +S    L  GL+   +++ +  + P+ +    ++     AG 
Sbjct: 388 SGEKPDEVVFLAVLTACLNSSEVDL--GLNFFDSMRLDYAIEPTLKHYVLVVDLLGRAGK 445

Query: 503 LNNARLIWREYEVAGFPYH--VLSYLRMYQALLASGDHRSAHLMLQKIPKDDMEVCSVIL 330
           LN A  +     V   P +  + ++  +Y+A  A   +R A  + Q + + D E+C   +
Sbjct: 446 LNEAHEL-----VENMPINPDLTTWAALYRACKAHKGYRRAESVSQNLLELDPELCGSYI 500

Query: 329 SCPNTYSANDHLNSVEVKKPKGKKKKKKK 243
               T+++  ++  VE ++   +K+ K++
Sbjct: 501 FLDKTHASKGNIQDVEKRRLSLQKRIKER 529

>sp|Q9H7B2|U170_HUMAN Hypothetical protein FLJ21087 gi|10437105|dbj|BAB14983.1| unnamed
           protein product [Homo sapiens]
          Length = 253

 Score = 37.7 bits (86), Expect = 0.15
 Identities = 31/112 (27%), Positives = 49/112 (43%)
 Frame = -2

Query: 575 DEIGLIPSRQSLDFLLSACAHAGDLNNARLIWREYEVAGFPYHVLSYLRMYQALLASGDH 396
           D+  +    + L  LL        ++N RL   EY +     +   Y R Y+ LL     
Sbjct: 148 DDFDVTEDYRRLKSLLIDFFRGPTVSNIRLAGLEYVLHFTALNGKIYFRSYKLLLKKSGC 207

Query: 395 RSAHLMLQKIPKDDMEVCSVILSCPNTYSANDHLNSVEVKKPKGKKKKKKKK 240
           R+  + L+++        S+ L    T+ A+D L  + +K PK  K KKKKK
Sbjct: 208 RTPRIELEEMGP------SLDLVLRRTHLASDDLYKLSMKMPKALKPKKKKK 253
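
Every alignment above reports Frame = -2 with query coordinates that run from high to low, meaning the protein match lies on the reverse complement of the 696-letter query. The sketch below shows one way to check that the printed coordinates are consistent with that frame and with the alignment lengths. It assumes the usual BLASTX convention that frames -1, -2 and -3 are counted from the last base of the query; the helper names `blastx_frame` and `aligned_codons` are hypothetical and not part of BLAST.

```python
QUERY_LEN = 696  # "(696 letters)" from the report header


def blastx_frame(q_start, q_end, query_len):
    """Return the reading frame implied by 1-based query coordinates.

    Assumed convention: plus-strand frames are +1..+3 counted from base 1,
    minus-strand frames are -1..-3 counted from the last base of the query.
    """
    if q_start <= q_end:
        return (q_start - 1) % 3 + 1
    return -((query_len - q_start) % 3 + 1)


def aligned_codons(q_start, q_end):
    """Number of query codons spanned by the alignment (gaps not counted)."""
    return (abs(q_start - q_end) + 1) // 3


# (query start, query end) of the first and last "Query:" lines of each hit.
hsps = {
    "AAD03456.1": (674, 246),
    "NP_192388.1": (674, 246),
    "NP_193919.1": (680, 429),
    "NP_171976.1": (683, 243),
    "Q9H7B2": (575, 240),
}

for name, (start, end) in hsps.items():
    print(name, blastx_frame(start, end, QUERY_LEN), aligned_codons(start, end))
```

For the ungapped query rows the codon count equals the denominator in the Identities line (for example 143 for the first hit). The NP_171976.1 alignment is two columns longer than its codon count because the query row contains two gap characters.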

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 595,188,368
Number of Sequences: 1393205
Number of extensions: 12672482
Number of successful extensions: 52027
Number of sequences better than 10.0: 78
Number of HSP's better than 10.0 without gapping: 40083
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 48162
length of database: 448,689,247
effective HSP length: 119
effective length of database: 282,897,852
effective search space used: 31684559424
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
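
The bit scores and E-values in the hit list can be roughly reproduced from the parameters printed above. The sketch below applies the standard Karlin-Altschul relations, S' = (lambda * S - ln K) / ln 2 and E = N * 2^(-S'), using the gapped lambda and K and the "effective search space used" value from this report. The function names are hypothetical, and because the printed parameters are rounded, the E-values should be expected to agree with the report only approximately.

```python
import math

# Values copied from the statistics block of the report.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
SEARCH_SPACE = 31684559424  # "effective search space used"


def bit_score(raw_score):
    """Convert a raw alignment score to a normalized bit score."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def expect_value(raw_score):
    """Expected number of chance hits with at least this score."""
    return SEARCH_SPACE * 2.0 ** (-bit_score(raw_score))


# Raw scores are the numbers in parentheses on each "Score =" line.
for raw in (321, 207, 93, 86):
    print(raw, round(bit_score(raw), 1), f"{expect_value(raw):.1e}")
```

The computed bit scores match the report (128, 84.3, 40.4 and 37.7 bits), and the E-values land close to the printed 9e-29, 1e-15, 0.024 and 0.15.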


EST assemble image


No.  Clone          Accession  Start  End
1    GENLf082c10    BP066804       1  447
2    MPDL032b01_f   AV778068      99  584
3    GENLf035a12    BP064161     270  696
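
The three ESTs listed above tile the contig. As a rough check, the sketch below merges their positions and reports the overlap between neighbouring clones. It assumes the two numbers on each row are 1-based, inclusive start and end coordinates on the 696-letter contig, which the page itself does not state; the function names are hypothetical.

```python
# (clone, accession, start, end) as listed in the table.
ests = [
    ("GENLf082c10", "BP066804", 1, 447),
    ("MPDL032b01_f", "AV778068", 99, 584),
    ("GENLf035a12", "BP064161", 270, 696),
]


def merge_intervals(intervals):
    """Merge overlapping or adjacent 1-based inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def neighbour_overlaps(intervals):
    """Overlap length between each pair of consecutive intervals."""
    ordered = sorted(intervals)
    return [
        max(0, min(a_end, b_end) - b_start + 1)
        for (_, a_end), (b_start, b_end) in zip(ordered, ordered[1:])
    ]


positions = [(start, end) for _, _, start, end in ests]
print(merge_intervals(positions))
print(neighbour_overlaps(positions))
```

Under that assumption the merged span is a single interval covering positions 1 to 696, which agrees with the query length given in the BLASTX header.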




Lotus japonicus
Kazusa DNA Research Institute