KMC000720A_c01

Fasta Sequence
>KMC000720A_C01 KMC000720A_c01
ggGACTCCAAAATTATATACATTCTCACAAAAGGTCACAAAACATTCATGGATCCACTAT
GTTGGCAAGTTTTCAACTAAATTACGCCTCATATCAAGTAAAGAAAAACTAACGAGATGA
TGAAAGTTTGGGACGCTTTACTATACACATAAGTTAATCAAAAGCACCAAAAAGGGGGGT
CGGTTGGTAAGACTAAACGATCAACAGAGGGAATGTCATGCTGCTACCTTTGAGGAGCAC
CAGAAAGGATTGGATCATTGATTGCTGATCTATGTTTTCCTAATGCAGGAATGCTAAGTG
AACGTTTTCTTAATTTGGCTCTGCCAGCTGGTGTTAGTTTCCTCTCACTCTTTTCCATGA
GCAAAGAAGAATAAGAGCAAGGAGGCTCTGGAGACGCAACCCCCATGGTCATCGGCTCGC
TTCTAGTTCTGAGGAACAGATCCAGGTTTTCACGATACTTGTTCCCATGAATTTCACCGG
CAACAGCAGCGAGCCT


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC000720A_C01 KMC000720A_c01
         (496 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_200884.1| putative protein; protein id: At5g60760.1 [Arab...    59  4e-08
dbj|BAB10097.1| emb|CAB72147.1~gene_id:MAE1.1~strong similarity ...    59  4e-08
pir||T47449 hypothetical protein T14D3.30 - Arabidopsis thaliana...    56  2e-07
ref|NP_566873.1| putative protein; protein id: At3g45090.1, supp...    56  2e-07
gb|EAA31881.1| hypothetical protein ( (AL513466) conserved hypot...    34  0.78

>ref|NP_200884.1| putative protein; protein id: At5g60760.1 [Arabidopsis thaliana]
          Length = 749

 Score = 58.5 bits (140), Expect = 4e-08
 Identities = 35/77 (45%), Positives = 47/77 (60%)
 Frame = -1

Query: 463 NKYRENLDLFLRTRSEPMTMGVASPEPPCSYSSLLMEKSERKLTPAGRAKLRKRSLSIPA 284
           +KY +NLDLFLRT ++ +       EP    +SLL  ++       G+ K+RKRSLSI A
Sbjct: 679 DKYIQNLDLFLRTANQQLV------EPLQLCASLLTCENGNTRLWLGKEKMRKRSLSISA 732

Query: 283 LGKHRSAINDPILSGAP 233
           +GKH S + D IL GAP
Sbjct: 733 IGKHGSGLGDAILLGAP 749

>dbj|BAB10097.1| emb|CAB72147.1~gene_id:MAE1.1~strong similarity to unknown protein
           [Arabidopsis thaliana] gi|27754621|gb|AAO22756.1|
           unknown protein [Arabidopsis thaliana]
           gi|28393929|gb|AAO42372.1| unknown protein [Arabidopsis
           thaliana]
          Length = 738

 Score = 58.5 bits (140), Expect = 4e-08
 Identities = 35/77 (45%), Positives = 47/77 (60%)
 Frame = -1

Query: 463 NKYRENLDLFLRTRSEPMTMGVASPEPPCSYSSLLMEKSERKLTPAGRAKLRKRSLSIPA 284
           +KY +NLDLFLRT ++ +       EP    +SLL  ++       G+ K+RKRSLSI A
Sbjct: 668 DKYIQNLDLFLRTANQQLV------EPLQLCASLLTCENGNTRLWLGKEKMRKRSLSISA 721

Query: 283 LGKHRSAINDPILSGAP 233
           +GKH S + D IL GAP
Sbjct: 722 IGKHGSGLGDAILLGAP 738

>pir||T47449 hypothetical protein T14D3.30 - Arabidopsis thaliana
           gi|6911847|emb|CAB72147.1| putative protein [Arabidopsis
           thaliana]
          Length = 716

 Score = 55.8 bits (133), Expect = 2e-07
 Identities = 32/74 (43%), Positives = 47/74 (63%)
 Frame = -1

Query: 463 NKYRENLDLFLRTRSEPMTMGVASPEPPCSYSSLLMEKSERKLTPAGRAKLRKRSLSIPA 284
           +KY +NLDLFL+T ++P+T    S E    Y      ++   +  + +AK+RKRSLSIP 
Sbjct: 645 DKYSQNLDLFLKTTNQPLT---ESLELTSEY------RNRMGVAASDKAKMRKRSLSIPP 695

Query: 283 LGKHRSAINDPILS 242
           +GKH S I+D IL+
Sbjct: 696 VGKHGSIIDDQILA 709

>ref|NP_566873.1| putative protein; protein id: At3g45090.1, supported by cDNA:
           gi_15810518, supported by cDNA: gi_20465686 [Arabidopsis
           thaliana] gi|15810519|gb|AAL07147.1| unknown protein
           [Arabidopsis thaliana] gi|20465687|gb|AAM20312.1|
           unknown protein [Arabidopsis thaliana]
          Length = 717

 Score = 55.8 bits (133), Expect = 2e-07
 Identities = 32/74 (43%), Positives = 47/74 (63%)
 Frame = -1

Query: 463 NKYRENLDLFLRTRSEPMTMGVASPEPPCSYSSLLMEKSERKLTPAGRAKLRKRSLSIPA 284
           +KY +NLDLFL+T ++P+T    S E    Y      ++   +  + +AK+RKRSLSIP 
Sbjct: 646 DKYSQNLDLFLKTTNQPLT---ESLELTSEY------RNRMGVAASDKAKMRKRSLSIPP 696

Query: 283 LGKHRSAINDPILS 242
           +GKH S I+D IL+
Sbjct: 697 VGKHGSIIDDQILA 710

>gb|EAA31881.1| hypothetical protein ( (AL513466) conserved hypothetical protein
           [Neurospora crassa] )
          Length = 1080

 Score = 34.3 bits (77), Expect = 0.78
 Identities = 24/69 (34%), Positives = 32/69 (45%), Gaps = 7/69 (10%)
 Frame = -1

Query: 418 EPMTMGVASPEPPCSYSSLLMEKSERKLTPAGRAKLRKRS-------LSIPALGKHRSAI 260
           E +T+G  +P+P C  SS L+E SER L  A    L +R         S PA      A 
Sbjct: 209 EDITIGTRTPKPYCRQSSELLEPSERLLHRAATVALSQRQPSPTSAHNSNPASDSGTEAD 268

Query: 259 NDPILSGAP 233
           ++  L G P
Sbjct: 269 DEHFLKGLP 277
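
Every alignment above is reported with Frame = -1 and descending query coordinates (for example 463 down to 233), which means the protein match lies on the reverse-complement strand of the contig. The following is a minimal Python sketch of that translation, assuming the standard genetic code and the 1-based query numbering shown in the report. The helper names (reverse_complement, translate_minus_strand) are illustrative and are not part of BLAST.

```python
# Contig sequence copied from the FASTA record above (496 letters).
CONTIG = (
    "ggGACTCCAAAATTATATACATTCTCACAAAAGGTCACAAAACATTCATGGATCCACTAT"
    "GTTGGCAAGTTTTCAACTAAATTACGCCTCATATCAAGTAAAGAAAAACTAACGAGATGA"
    "TGAAAGTTTGGGACGCTTTACTATACACATAAGTTAATCAAAAGCACCAAAAAGGGGGGT"
    "CGGTTGGTAAGACTAAACGATCAACAGAGGGAATGTCATGCTGCTACCTTTGAGGAGCAC"
    "CAGAAAGGATTGGATCATTGATTGCTGATCTATGTTTTCCTAATGCAGGAATGCTAAGTG"
    "AACGTTTTCTTAATTTGGCTCTGCCAGCTGGTGTTAGTTTCCTCTCACTCTTTTCCATGA"
    "GCAAAGAAGAATAAGAGCAAGGAGGCTCTGGAGACGCAACCCCCATGGTCATCGGCTCGC"
    "TTCTAGTTCTGAGGAACAGATCCAGGTTTTCACGATACTTGTTCCCATGAATTTCACCGG"
    "CAACAGCAGCGAGCCT"
).upper()

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_minus_strand(seq, start, end):
    """Translate query positions start..end (1-based, start > end) on the minus strand."""
    rc = reverse_complement(seq)
    n = len(seq)
    lo = n - start        # 0-based index in rc of query position `start`
    hi = n - end + 1      # exclusive end index in rc of query position `end`
    sub = rc[lo:hi]
    # Unknown codons (e.g. containing N) are shown as X.
    return "".join(CODON_TABLE.get(sub[i:i + 3], "X") for i in range(0, len(sub) - 2, 3))


peptide = translate_minus_strand(CONTIG, 463, 233)
print(peptide)
```

The printed peptide can be compared residue by residue with the Query lines of the first alignment (positions 463 to 233, 77 residues).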

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 410,632,293
Number of Sequences: 1393205
Number of extensions: 7948180
Number of successful extensions: 16739
Number of sequences better than 10.0: 20
Number of HSP's better than 10.0 without gapping: 16377
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 16728
length of database: 448,689,247
effective HSP length: 114
effective length of database: 289,863,877
effective search space used: 14493193850
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
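
The statistics above are enough to re-derive the bit scores and the effective database length in the report. The sketch below assumes the usual Karlin-Altschul conversion, bits = (Lambda * raw - ln K) / ln 2, and assumes that the effective database length is the number of letters minus the effective HSP length once per sequence. The variable names are illustrative. How BLAST arrived at the effective query length is not stated in the report, so it is only inferred here from the reported search space.

```python
import math

# Parameters copied from the report above.
UNGAPPED_LAMBDA, UNGAPPED_K = 0.318, 0.135
GAPPED_LAMBDA, GAPPED_K = 0.267, 0.0410


def bit_score(raw, lam, k):
    """Convert a raw alignment score to bits (Karlin-Altschul normalisation)."""
    return (lam * raw - math.log(k)) / math.log(2)


# Raw scores (in parentheses in the report) and the bit scores printed next to them.
reported_gapped = {140: 58.5, 133: 55.8, 77: 34.3}
for raw, reported in reported_gapped.items():
    print(raw, round(bit_score(raw, GAPPED_LAMBDA, GAPPED_K), 1), reported)

# S1 is an ungapped cutoff, so it uses the ungapped parameters.
s1_bits = bit_score(41, UNGAPPED_LAMBDA, UNGAPPED_K)
print("S1", round(s1_bits, 1))

# Effective database length: remove one effective HSP length per sequence.
db_letters = 448_689_247
db_sequences = 1_393_205
effective_hsp_length = 114
effective_db_length = db_letters - db_sequences * effective_hsp_length
print(effective_db_length)

# The reported search space divided by the effective database length gives
# the effective query length that BLAST must have used (inferred, not stated).
search_space = 14_493_193_850
implied_query_length, remainder = divmod(search_space, effective_db_length)
print(implied_query_length, remainder)
```

Under these assumptions the computed effective database length equals the reported 289,863,877, and the reported search space is an exact multiple of it.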


EST assemble image


No.  Clone        Accession  Start  End
1    GENLf032f02  BP064031   1      496
2    GNf092d06    BP074164   3      390




Lotus japonicus
Kazusa DNA Research Institute