KMC001597A_c01

Fasta Sequence
>KMC001597A_C01 KMC001597A_c01
gGGTTTAAATCTATCTTTTTGCTATTATAAAATGGATTCAATTTATGTACACCGAAAATG
TTTATATAATAGCGTGGTACATCTGCAAAACTTTATTTTACTACCATGAACTAACAGTTA
GTTATCATGGACTAAGTCTCTCCAACCAATTACTTACATGCTGGGAAGAAGAGTGTTGGA
CCTTACCACCTCCAATCAATCAGCAATGGCTAGGGTGTTTTCTGTTTTTGGTGGTAATAT
TTGTCAAGCAGCAACTCTGAGTACTCTCCAAAAGTAAGAGCGGCATTTGATGGAATAAGC
TCTTGATGAATGTGAACTCCCTCGGGGAATACTGAGCTTTCCATCCCAGGTCAGGTTGCA
TAGAGAATGCAAATGTCGAGCGCTCAATACCAGAAGACTCCTCCCCCTTAGGAG
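
As a sanity check on the record, the contig can be translated on the reverse strand and compared with the query line of a Frame = -3 alignment reported in the Nr search section (query coordinates 412 down to 350). The following is a minimal sketch in plain Python, assuming the standard genetic code; the helper names (reverse_complement, translate, translate_minus_strand) are illustrative and not part of any BLAST tool.

```python
# Contig sequence copied from the FASTA record (upper-cased below).
SEQ = (
    "gGGTTTAAATCTATCTTTTTGCTATTATAAAATGGATTCAATTTATGTACACCGAAAATG"
    "TTTATATAATAGCGTGGTACATCTGCAAAACTTTATTTTACTACCATGAACTAACAGTTA"
    "GTTATCATGGACTAAGTCTCTCCAACCAATTACTTACATGCTGGGAAGAAGAGTGTTGGA"
    "CCTTACCACCTCCAATCAATCAGCAATGGCTAGGGTGTTTTCTGTTTTTGGTGGTAATAT"
    "TTGTCAAGCAGCAACTCTGAGTACTCTCCAAAAGTAAGAGCGGCATTTGATGGAATAAGC"
    "TCTTGATGAATGTGAACTCCCTCGGGGAATACTGAGCTTTCCATCCCAGGTCAGGTTGCA"
    "TAGAGAATGCAAATGTCGAGCGCTCAATACCAGAAGACTCCTCCCCCTTAGGAG"
).upper()

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of an upper-case DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def translate(dna):
    """Translate complete codons from the first base; unknown codons become X."""
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


def translate_minus_strand(seq, high, low):
    """Translate 1-based coordinates high..low, read on the reverse strand."""
    return translate(reverse_complement(seq[low - 1:high]))


print(len(SEQ))                               # 414
print(translate_minus_strand(SEQ, 412, 350))  # PKGEESSGIERSTFAFSMQPD
```

The printed peptide should agree with the "Query: 412 ... 350" line of the Frame = -3 alignments in the report.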


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC001597A_C01 KMC001597A_c01
         (414 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_193076.1| putative protein; protein id: At4g13400.1 [Arab...    40  8e-08
gb|AAC32122.1| hypothetical protein [Picea mariana]                    37  2e-05
gb|AAM97033.1| putative protein [Arabidopsis thaliana] gi|231979...    45  2e-04
dbj|BAC40711.1| unnamed protein product [Mus musculus]                 33  0.84
ref|XP_236916.1| similar to unc-5 homolog C; homolog of C. elega...    32  2.4

>ref|NP_193076.1| putative protein; protein id: At4g13400.1 [Arabidopsis thaliana]
           gi|7487863|pir||T06297 hypothetical protein T9E8.140 -
           Arabidopsis thaliana gi|4584545|emb|CAB40775.1| putative
           protein [Arabidopsis thaliana]
           gi|7268043|emb|CAB78382.1| putative protein [Arabidopsis
           thaliana]
          Length = 306

 Score = 40.0 bits (92), Expect(2) = 8e-08
 Identities = 18/31 (58%), Positives = 23/31 (74%)
 Frame = -2

Query: 317 VHIHQELIPSNAALTFGEYSELLLDKYYHQK 225
           + IH+EL  S+  LTFGEY+E LL+KYY  K
Sbjct: 275 ITIHEELCLSDEVLTFGEYTEKLLNKYYDTK 305

 Score = 36.2 bits (82), Expect(2) = 8e-08
 Identities = 15/21 (71%), Positives = 18/21 (85%)
 Frame = -3

Query: 412 PKGEESSGIERSTFAFSMQPD 350
           P+GEE+ G+ERSTFA  MQPD
Sbjct: 246 PQGEEARGLERSTFALFMQPD 266

>gb|AAC32122.1| hypothetical protein [Picea mariana]
          Length = 189

 Score = 36.6 bits (83), Expect(2) = 2e-05
 Identities = 15/26 (57%), Positives = 21/26 (80%)
 Frame = -2

Query: 311 IHQELIPSNAALTFGEYSELLLDKYY 234
           I  +L  SN ++TFG+YSEL+L+KYY
Sbjct: 160 IELQLTKSNGSMTFGDYSELVLNKYY 185

 Score = 31.2 bits (69), Expect(2) = 2e-05
 Identities = 11/21 (52%), Positives = 18/21 (85%)
 Frame = -3

Query: 412 PKGEESSGIERSTFAFSMQPD 350
           P+G+++ G+ER+TFA  MQP+
Sbjct: 122 PRGDKAHGVERNTFALFMQPN 142

>gb|AAM97033.1| putative protein [Arabidopsis thaliana] gi|23197950|gb|AAN15502.1|
           putative protein [Arabidopsis thaliana]
          Length = 203

 Score = 45.1 bits (105), Expect = 2e-04
 Identities = 21/35 (60%), Positives = 26/35 (74%)
 Frame = -2

Query: 329 FPEGVHIHQELIPSNAALTFGEYSELLLDKYYHQK 225
           FP+ V IH+EL  S+  LTFGEY+E LL+KYY  K
Sbjct: 168 FPKEVTIHEELCLSDEVLTFGEYTEKLLNKYYDTK 202

 Score = 39.7 bits (91), Expect = 0.009
 Identities = 20/39 (51%), Positives = 26/39 (66%)
 Frame = -3

Query: 412 PKGEESSGIERSTFAFSMQPDLGWKAQYSPREFTFIKSL 296
           P+GEE+ G+ERSTFA  MQPD   K  + P+E T  + L
Sbjct: 141 PQGEEARGLERSTFALFMQPDWDQKLTF-PKEVTIHEEL 178

>dbj|BAC40711.1| unnamed protein product [Mus musculus]
          Length = 164

 Score = 33.1 bits (74), Expect = 0.84
 Identities = 20/65 (30%), Positives = 32/65 (48%)
 Frame = +3

Query: 132 LSLSNQLLTCWEEECWTLPPPINQQWLGCFLFLVVIFVKQQL*VLSKSKSGI*WNKLLMN 311
           LS S +     EEE W +PPP++Q  L C      +F + Q+ +   S +G  W +L  +
Sbjct: 24  LSASGERFQASEEETWAVPPPVSQPPL-CNRLPPELFEQLQMLLEPNSVTGNDWRRLASH 82

Query: 312 VNSLG 326
           +   G
Sbjct: 83  LGLCG 87

>ref|XP_236916.1| similar to unc-5 homolog C; homolog of C. elegans transmembrane
           receptor Unc5; unc-5 homolog C (C. elegans) [Homo
           sapiens] [Rattus norvegicus]
          Length = 434

 Score = 31.6 bits (70), Expect = 2.4
 Identities = 18/54 (33%), Positives = 28/54 (51%)
 Frame = +3

Query: 165 EEECWTLPPPINQQWLGCFLFLVVIFVKQQL*VLSKSKSGI*WNKLLMNVNSLG 326
           EEE W +PPP++Q  L C      +F + Q+ +   S +G  W KL  ++   G
Sbjct: 305 EEETWAVPPPVSQPPL-CNRLPPELFEQLQMLLEPSSVTGNDWRKLASHLGLCG 357

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 363,954,430
Number of Sequences: 1393205
Number of extensions: 7468233
Number of successful extensions: 17082
Number of sequences better than 10.0: 21
Number of HSP's better than 10.0 without gapping: 16828
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 17081
length of database: 448,689,247
effective HSP length: 113
effective length of database: 291,257,082
effective search space used: 6990169968
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
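
The bit scores and effective lengths in the report above can be cross-checked from the printed statistics. The sketch below assumes the usual Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2, applied with the gapped Lambda and K, and assumes the effective database length is the database length minus the effective HSP length for every sequence. The printed parameters are rounded, so agreement is only expected to about one decimal place. The function names are illustrative.

```python
import math

# Parameters as printed in the report.
GAPPED_LAMBDA, GAPPED_K = 0.267, 0.0410
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 113
SEARCH_SPACE = 6_990_169_968


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits (Karlin-Altschul normalisation)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def effective_db_length(letters, sequences, hsp_length):
    """Database length after removing hsp_length letters from every sequence."""
    return letters - sequences * hsp_length


# (raw score, bits reported) pairs taken from the alignments in the report.
for raw, reported in [(92, 40.0), (82, 36.2), (105, 45.1), (74, 33.1)]:
    print(raw, reported, round(bit_score(raw), 2))

eff_db = effective_db_length(DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH)
print(eff_db)                  # 291257082, as printed in the report
print(SEARCH_SPACE // eff_db)  # 24, the implied effective query length
```

Under these assumptions the search space divides exactly by the effective database length, which suggests an effective query length of 24 residues for the translated 414-letter query.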


EST assemble image


No.  Clone       Accession  Start  End
1    GENf062h02  BP061012       1  359
2    GENf076h02  BP061638       2  379
3    GNf043g01   BP070553       2  317
4    GENf005h01  BP058555       5  358
5    GNf001a01   BP067425      42  432
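
The member table can be summarised programmatically. The sketch below assumes the two trailing numbers on each row are 1-based, inclusive start and end positions in the assembly; the names members, span and depth are illustrative.

```python
# Member ESTs from the table: clone, accession, start, end.
ROWS = """\
GENf062h02 BP061012 1 359
GENf076h02 BP061638 2 379
GNf043g01 BP070553 2 317
GENf005h01 BP058555 5 358
GNf001a01 BP067425 42 432
"""

members = []
for line in ROWS.splitlines():
    clone, accession, start, end = line.split()
    members.append((clone, accession, int(start), int(end)))


def span(member):
    """Number of assembly positions covered by one EST (inclusive ends)."""
    _, _, start, end = member
    return end - start + 1


def depth(position):
    """Number of ESTs covering a given assembly position."""
    return sum(start <= position <= end for _, _, start, end in members)


for member in members:
    print(member[0], member[1], span(member))

print(depth(100))  # 5: every member covers position 100
print(depth(400))  # 1: only GNf001a01 extends this far
```

Positions run up to 432 while the contig is 414 letters long, so the coordinates may refer to the gapped assembly rather than to the final consensus; that reading is an assumption, not something the record states.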




Lotus japonicus
Kazusa DNA Research Institute