KMC004472A_c01

Fasta Sequence
>KMC004472A_C01 KMC004472A_c01
TCTCGAAATTTATCAGAATTTTTCTTCGTTTCTCTTCTGCGCACGGATCTGCATAGATCT
GGTGATGGCGTTTTCGATCAGATCGTTCCAGAATCCGAATCTTGCAACCTTCGATCTACA
ACGCAGAGGTGCGAGGCTGAAGTTCGTGCTGCCTTTGGAATCTTAATTGATAGATAGTAG
ATTGGTCTTGCATATGTGAAGGTTTTGGAGGTATTTTTTGCAGGTTTGGAGAAGGATCGA
TGAGGAAAACGTTGATCCAGTGAAAAATTATGCCGCTGCTGCTTGTAATCGATGCCTCTA
CCTCAGATCTGTAATCCGTGACCTCATATGTCGCTATTATTCGATATTGTAATTGTAGGT
TATCTTTGGGAGGGGAAAATTTGTGGATGTGGAAATTATCAGGATGAATTACCTCAATAC
TTATTGAATTGAGTGTTTGTGGTATGAATCCTGATGGATCTTGTCATCCGACCTTTGCCA
TGTCAGGTGCGCTTGCTTGGCAGGTCACAATATCTTAATAAAAAAGTTTTGTTGGTTTTG
GGGGAGATCGCACGGGATTGCCCGGGATCTCTCCCTGTGTGGCGTGTGCTCTGTTCCTTG
TTCCTTTATTATTATTTCTCTGTCGATCGACTCTCCTCAGGTATCATCAGATTT


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC004472A_C01 KMC004472A_c01
         (654 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|ZP_00093643.1| hypothetical protein [Novosphingobium aromatic...    33  4.4
ref|NP_795939.1| RIKEN cDNA 4930563A03 gene [Mus musculus] gi|26...    32  7.5
gb|AAL68238.1| LD43328p [Drosophila melanogaster]                      32  9.8
ref|NP_610435.2| CG8213-PA [Drosophila melanogaster] gi|21645579...    32  9.8
pir||T03930 gene GUT15 protein - common tobacco gi|2275256|gb|AA...    32  9.8

>gb|ZP_00093643.1| hypothetical protein [Novosphingobium aromaticivorans]
          Length = 538

 Score = 32.7 bits (73), Expect = 4.4
 Identities = 12/32 (37%), Positives = 21/32 (65%)
 Frame = -3

Query: 301 VEASITSSSGIIFHWINVFLIDPSPNLQKIPP 206
           V+ +  ++S ++  W+NVFL++PS N    PP
Sbjct: 457 VDCTSAATSQVVKKWVNVFLVEPSLNRGSAPP 488
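
The Frame and Query coordinates reported for each hit can be cross-checked against one another. The sketch below assumes the usual BLASTX convention: query coordinates are 1-based nucleotide positions, and minus-strand frames are counted from the 3' end of the query. The helper names `hsp_frame` and `hsp_residues` are hypothetical and not part of BLAST.

```python
QUERY_LEN = 654  # "(654 letters)" in the BLASTX header


def hsp_frame(q_start, q_end, query_len=QUERY_LEN):
    """Infer the translation frame from 1-based query coordinates.

    Assumption: plus-strand frames are counted from query position 1,
    minus-strand frames from the last query position.
    """
    if q_start <= q_end:
        return (q_start - 1) % 3 + 1
    return -((query_len - q_start) % 3 + 1)


def hsp_residues(q_start, q_end):
    """Number of translated query residues spanned by the HSP."""
    span = abs(q_end - q_start) + 1
    if span % 3 != 0:
        raise ValueError("query span is not a whole number of codons")
    return span // 3


# (hit, first query coordinate, last query coordinate) as printed in the report
hsps = [
    ("ZP_00093643.1", 301, 206),
    ("NP_795939.1", 394, 131),
    ("AAL68238.1", 439, 176),
    ("T03930", 480, 650),
]

for name, start, end in hsps:
    print(name, hsp_frame(start, end), hsp_residues(start, end))
```

The residue count excludes gap characters in the query row, so it can be smaller than the alignment length shown in the Identities line (for example 57 residues against 59 alignment columns for the tobacco hit).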

>ref|NP_795939.1| RIKEN cDNA 4930563A03 gene [Mus musculus] gi|26325728|dbj|BAC26618.1|
            unnamed protein product [Mus musculus]
          Length = 1267

 Score = 32.0 bits (71), Expect = 7.5
 Identities = 23/97 (23%), Positives = 44/97 (44%), Gaps = 9/97 (9%)
 Frame = -3

Query: 394  FHIHKF--------SPPKDNLQLQYRIIATYEVTDYRSEVEASITSSSGIIFHWINVFLI 239
            FHI+K+         P   N+++   ++ +Y     R+EV         +++HWIN+ L 
Sbjct: 1155 FHINKYLVEEICVLDPTASNVEVNVELVTSYIQAHSRTEVWNFRNIVIELLYHWINICLT 1214

Query: 238  DPSPNL-QKIPPKPSHMQDQSTIYQLRFQRQHELQPR 131
                N+ Q +   P+  Q++  +Y     R  ++ PR
Sbjct: 1215 LIELNMRQDVSIIPAIAQEECHLYLCHILR--KINPR 1249

>gb|AAL68238.1| LD43328p [Drosophila melanogaster]
          Length = 1674

 Score = 31.6 bits (70), Expect = 9.8
 Identities = 29/109 (26%), Positives = 41/109 (37%), Gaps = 21/109 (19%)
 Frame = -3

Query: 439 QTLNSISIEVIHPDNFHIHKFS---------PPKDNLQLQ------------YRIIATYE 323
           QT   I+ + + P NFH+HK S         P  D+  +Q            + I+ T E
Sbjct: 285 QTTQKINKQPVQPPNFHVHKHSVTINSPSSPPQNDDFVMQVLSTLPPEHADDHHIVFTTE 344

Query: 322 VTDYRSEVEASITSSSGIIFHWINVFLIDPSPNLQKIPPKPSHMQDQST 176
           V    +      TSS    F  ++          QK  PKP+ M  Q T
Sbjct: 345 VPTKITSGLQDQTSSESNSFEEVS----STPAATQKPKPKPTQMPTQKT 389

>ref|NP_610435.2| CG8213-PA [Drosophila melanogaster] gi|21645579|gb|AAF59009.2|
           CG8213-PA [Drosophila melanogaster]
          Length = 1599

 Score = 31.6 bits (70), Expect = 9.8
 Identities = 29/109 (26%), Positives = 41/109 (37%), Gaps = 21/109 (19%)
 Frame = -3

Query: 439 QTLNSISIEVIHPDNFHIHKFS---------PPKDNLQLQ------------YRIIATYE 323
           QT   I+ + + P NFH+HK S         P  D+  +Q            + I+ T E
Sbjct: 255 QTTQKINKQPVQPPNFHVHKHSVTINSPSSPPQNDDFVMQVLSTLPPEHADDHHIVFTTE 314

Query: 322 VTDYRSEVEASITSSSGIIFHWINVFLIDPSPNLQKIPPKPSHMQDQST 176
           V    +      TSS    F  ++          QK  PKP+ M  Q T
Sbjct: 315 VPTKITSGLQDQTSSESNSFEEVS----STPAATQKPKPKPTQMPTQKT 359

>pir||T03930 gene GUT15 protein - common tobacco gi|2275256|gb|AAD09831.1|
           unknown [Nicotiana tabacum]
          Length = 78

 Score = 31.6 bits (70), Expect = 9.8
 Identities = 28/59 (47%), Positives = 30/59 (50%), Gaps = 2/59 (3%)
 Frame = +3

Query: 480 MSGALAWQVTIS**KSFVGFG-GDRTG-LPGISPCVACALFLVPLLLFLCRSTLLRYHQ 650
           M GALAWQVTI    S   FG GD TG LPG    V+  L      LF    T  +YHQ
Sbjct: 1   MMGALAWQVTIPNKVSIFCFGRGDGTGILPGAPLFVSSRLLFSS--LFPRYYTQDQYHQ 57

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 588,961,520
Number of Sequences: 1393205
Number of extensions: 13225111
Number of successful extensions: 36742
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 35487
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 36730
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 28144814643
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
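
The bit scores in the report can be reproduced from the raw scores in parentheses with the standard Karlin-Altschul conversion, S' = (lambda * S - ln K) / ln 2. The sketch below uses the Lambda and K values printed above (gapped values for the alignment scores, ungapped values for S1 and X1); treating X2 and X3 as gapped drop-offs is an assumption that happens to match the printed bit values. The function names are hypothetical.

```python
import math

# (Lambda, K) pairs copied from the statistics block of the report
UNGAPPED = (0.318, 0.135)
GAPPED = (0.267, 0.0410)


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def dropoff_bits(raw_dropoff, lam):
    """Convert a raw X drop-off value to bits (no K term involved)."""
    return lam * raw_dropoff / math.log(2)


# Raw scores of the reported hits: 73, 71 and 70
for raw in (73, 71, 70):
    print(raw, round(bit_score(raw, *GAPPED), 1))

# S1 is reported with the ungapped parameters
print("S1", round(bit_score(41, *UNGAPPED), 1))

# X1 uses the ungapped Lambda, X2 and X3 the gapped Lambda (assumption)
print("X1", round(dropoff_bits(16, UNGAPPED[0]), 1))
print("X2", round(dropoff_bits(38, GAPPED[0]), 1))
print("X3", round(dropoff_bits(64, GAPPED[0]), 1))
```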


EST assemble image


no. clone accession position (start end)
1 SPDL097g12_f BP058123 1 422
2 MR043h03_f BP079362 70 468
3 MF017b01_f BP029128 78 574
4 MFB061b04_f BP038408 91 266
5 SPD027f07_f BP046155 93 654
6 MPD039e08_f AV772668 105 205
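
A short sketch of how the table above can be used to check how well the contig is supported. It assumes the two position columns are 1-based, inclusive start and end coordinates of each EST on the 654-letter contig; this reading is an assumption, although the largest end coordinate does match the contig length.

```python
CONTIG_LEN = 654  # length of KMC004472A_c01 from the BLASTX header

# (clone, accession, start, end) copied from the assembly table
members = [
    ("SPDL097g12_f", "BP058123", 1, 422),
    ("MR043h03_f", "BP079362", 70, 468),
    ("MF017b01_f", "BP029128", 78, 574),
    ("MFB061b04_f", "BP038408", 91, 266),
    ("SPD027f07_f", "BP046155", 93, 654),
    ("MPD039e08_f", "AV772668", 105, 205),
]

# depth[p] = number of ESTs covering contig position p (index 0 is unused)
depth = [0] * (CONTIG_LEN + 1)
for _clone, _accession, start, end in members:
    for pos in range(start, end + 1):
        depth[pos] += 1

covered = sum(1 for d in depth[1:] if d > 0)
print("covered positions:", covered, "of", CONTIG_LEN)
print("max depth:", max(depth[1:]))
print("min depth:", min(depth[1:]))
```

Under that assumption every position of the contig is covered by at least one EST, with all six clones overlapping between positions 105 and 205.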




Lotus japonicus
Kazusa DNA Research Institute