KMC007982A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC007982A_C01 KMC007982A_c01
catttttccttttatacaaATAAAGAATAGTAAAATAAATTTTGTTAACAACAATACAAC
TGAAGATCTAACATTACATTACATATTATAGAGATATATGCCAATCATAGATTTTGACTT
CAAGATGTTTCTCATCTTGTAGCCTCTGTCTGAGGTTCCTTGGCTCGCTGTTTGTCTCTG
GCAAGTAAGAGGGCAAAGCCTCCGCCACCAGCACCAACCAGTTTGTAACCACAGCAGTAG
GGACTAGCAAAAGAAAATAGCCTGTCAACAGACTCATTACTGCAGAAAGGGTCAAGTTCC
TGATGCAATCTCCAAGCTTCCAACATTATCTCCCCAAATTCATCAATCTCACAGTTCATC
AAAGCTTCTCTCCCAATCTTTGCCAGCTCAACAAGGCGCTTGATACTTGATACCAAAAGA
TTATCCCGCCGAATATATCTAGTTACCACCTTTTGCAGCACCTTCTTTGCAAGCCGAACT
TGACCGGTAAATACCACAAGAAGCCGTTGCTGCAATTCAGAAATCAACTCTGGTGAAGCC
AACAGGGGGACCACTTGAAGCCGCAATGGAATTCCAGGAAAGCTTGAAGTACAT


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC007982A_C01 KMC007982A_c01
         (594 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAO17031.1| Hypothetical protein [Oryza sativa (japonica cult...   239  4e-64
ref|NP_563620.1| hypothetical protein; protein id: At1g01220.1 [...   238  4e-62
pir||D86142 hypothetical protein F6F3.3 - Arabidopsis thaliana g...   238  4e-62
dbj|BAC33572.1| unnamed protein product [Mus musculus]                 80  2e-14
ref|XP_226508.1| hypothetical protein XP_226508 [Rattus norvegicus]    80  3e-14

>gb|AAO17031.1| Hypothetical protein [Oryza sativa (japonica cultivar-group)]
          Length = 1072

 Score =  239 bits (611), Expect(2) = 4e-64
 Identities = 119/146 (81%), Positives = 129/146 (87%)
 Frame = -2

Query: 593  CTSSFPGIPLRLQVVPLLASPELISELQQRLLVVFTGQVRLAKKVLQKVVTRYIRRDNLL 414
            CT SFPG PLRL VVPLLASP+LI ELQQRLLVVFTGQVRLA +VLQKVVTRY+RRD+LL
Sbjct: 903  CTQSFPGQPLRLHVVPLLASPQLIQELQQRLLVVFTGQVRLAHRVLQKVVTRYLRRDSLL 962

Query: 413  VSSIKRLVELAKIGREALMNCEIDEFGEIMLEAWRLHQELDPFCSNESVDRLFSFASPYC 234
            +SSIKRL ELAKIGREALMN EIDE G IM EAWRLHQELDPFCSN+ VD LF+FA PYC
Sbjct: 963  ISSIKRLAELAKIGREALMNGEIDELGGIMSEAWRLHQELDPFCSNKLVDELFAFADPYC 1022

Query: 233  CGYKLVGAGGGGFALLLARDKQRAKE 156
            CGYKLVGAGGGGFAL+L ++   AKE
Sbjct: 1023 CGYKLVGAGGGGFALMLGKNLNSAKE 1048

 Score = 27.3 bits (59), Expect(2) = 4e-64
 Identities = 8/24 (33%), Positives = 18/24 (74%)
 Frame = -3

Query: 160  RNLRQRLQDEKHLEVKIYDWHISL 89
            + LRQ L++    +VK+Y+W++++
Sbjct: 1047 KELRQALENSATFDVKVYNWNVAM 1070

>ref|NP_563620.1| hypothetical protein; protein id: At1g01220.1 [Arabidopsis thaliana]
          Length = 1055

 Score =  238 bits (607), Expect = 4e-62
 Identities = 115/145 (79%), Positives = 133/145 (91%)
 Frame = -2

Query: 590  TSSFPGIPLRLQVVPLLASPELISELQQRLLVVFTGQVRLAKKVLQKVVTRYIRRDNLLV 411
            TSSFPGIP+RLQVVPLLASP+LISEL+QRLLVVFTGQVRLA +VL KVVTRY++RDNLL+
Sbjct: 889  TSSFPGIPMRLQVVPLLASPQLISELEQRLLVVFTGQVRLAHQVLHKVVTRYLQRDNLLI 948

Query: 410  SSIKRLVELAKIGREALMNCEIDEFGEIMLEAWRLHQELDPFCSNESVDRLFSFASPYCC 231
            SSIKRL ELAK GREALMNCE+DE G+IM EAWRLHQELDP+CSNE VD+LF F+ PY  
Sbjct: 949  SSIKRLTELAKSGREALMNCEVDEVGDIMSEAWRLHQELDPYCSNEFVDKLFEFSQPYSS 1008

Query: 230  GYKLVGAGGGGFALLLARDKQRAKE 156
            G+KLVGAGGGGF+L+LA+D ++AKE
Sbjct: 1009 GFKLVGAGGGGFSLILAKDAEKAKE 1033

>pir||D86142 hypothetical protein F6F3.3 - Arabidopsis thaliana
            gi|9665149|gb|AAF97333.1|AC023628_14 Hypothetical protein
            [Arabidopsis thaliana]
          Length = 1113

 Score =  238 bits (607), Expect = 4e-62
 Identities = 115/145 (79%), Positives = 133/145 (91%)
 Frame = -2

Query: 590  TSSFPGIPLRLQVVPLLASPELISELQQRLLVVFTGQVRLAKKVLQKVVTRYIRRDNLLV 411
            TSSFPGIP+RLQVVPLLASP+LISEL+QRLLVVFTGQVRLA +VL KVVTRY++RDNLL+
Sbjct: 947  TSSFPGIPMRLQVVPLLASPQLISELEQRLLVVFTGQVRLAHQVLHKVVTRYLQRDNLLI 1006

Query: 410  SSIKRLVELAKIGREALMNCEIDEFGEIMLEAWRLHQELDPFCSNESVDRLFSFASPYCC 231
            SSIKRL ELAK GREALMNCE+DE G+IM EAWRLHQELDP+CSNE VD+LF F+ PY  
Sbjct: 1007 SSIKRLTELAKSGREALMNCEVDEVGDIMSEAWRLHQELDPYCSNEFVDKLFEFSQPYSS 1066

Query: 230  GYKLVGAGGGGFALLLARDKQRAKE 156
            G+KLVGAGGGGF+L+LA+D ++AKE
Sbjct: 1067 GFKLVGAGGGGFSLILAKDAEKAKE 1091

>dbj|BAC33572.1| unnamed protein product [Mus musculus]
          Length = 729

 Score = 80.1 bits (196), Expect = 2e-14
 Identities = 40/136 (29%), Positives = 72/136 (52%)
 Frame = -2

Query: 572 IPLRLQVVPLLASPELISELQQRLLVVFTGQVRLAKKVLQKVVTRYIRRDNLLVSSIKRL 393
           +PL+++V  +      + ++   LL+V+TG+ RLA+ +LQ V+  +  R  ++V + +RL
Sbjct: 543 LPLKVEVEEITVPEGFVQKINDHLLLVYTGKTRLARNLLQDVLRNWYARLPVVVQNARRL 602

Query: 392 VELAKIGREALMNCEIDEFGEIMLEAWRLHQELDPFCSNESVDRLFSFASPYCCGYKLVG 213
           V   +   EA     +   G+ +   W   + + P C   +V R+    +PY  G  L G
Sbjct: 603 VRQTEKCAEAFRQGNLPLLGQYLTSYWEQKKLMAPGCEPLAVQRMMDVLAPYAYGQSLAG 662

Query: 212 AGGGGFALLLARDKQR 165
           AGGGGF  LL ++ ++
Sbjct: 663 AGGGGFLYLLTKEPRQ 678

>ref|XP_226508.1| hypothetical protein XP_226508 [Rattus norvegicus]
          Length = 1133

 Score = 79.7 bits (195), Expect = 3e-14
 Identities = 40/136 (29%), Positives = 71/136 (51%)
 Frame = -2

Query: 572  IPLRLQVVPLLASPELISELQQRLLVVFTGQVRLAKKVLQKVVTRYIRRDNLLVSSIKRL 393
            +PL+++V  +      +  +   LL+V+TG+ RLA+ +LQ V+  +  R  ++V + +RL
Sbjct: 947  LPLKVEVEEITVPENFVQRINDHLLLVYTGKTRLARNLLQDVLRNWYARLPVVVQNARRL 1006

Query: 392  VELAKIGREALMNCEIDEFGEIMLEAWRLHQELDPFCSNESVDRLFSFASPYCCGYKLVG 213
            V   +   EA     +   G+ +   W   + + P C   +V R+    +PY  G  L G
Sbjct: 1007 VRQTEKCAEAFRQGNLALLGQYLTSYWEQKKLMAPGCEPLAVHRMMDVLAPYAFGQSLAG 1066

Query: 212  AGGGGFALLLARDKQR 165
            AGGGGF  LL ++ ++
Sbjct: 1067 AGGGGFLYLLTKEPRQ 1082

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 504,976,802
Number of Sequences: 1393205
Number of extensions: 10940852
Number of successful extensions: 36835
Number of sequences better than 10.0: 64
Number of HSP's better than 10.0 without gapping: 34651
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 36760
length of database: 448,689,247
effective HSP length: 117
effective length of database: 285,684,262
effective search space used: 22854740960
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 SPDL071d07_f BP056395 1 596
2 SPDL090g12_f BP057670 2 526
3 GNLf018b06 BP075823 21 451




Lotus japonicus
Kazusa DNA Research Institute