KMC015624A_c01

Fasta Sequence
>KMC015624A_C01 KMC015624A_c01
tgggtcgggccccctcgagaggtcttctgtccatgcttcctccaggggcgcctgctcagt
tcaggaaattatttccaccaACTAAATGGGCTGCAGAGTTTAATGCTGCATGACAGTTCC
TTTCTTCCACTGGTTAGTTGGGCCTTCTGAGGTTGTGGAAGTAGAGATAAATGGGGTGAA
GCAAAAGAGTGGTGTCCATATAAAGAAGTGCAGCGGCTGTGTTGGAATGTGTGTCAATAT
GTGCAAGACTCCTACCCAAGATTTTTTCACCAATGAATTTGGGCTTCCACTGACAATGAT
TCCTAATTTTGAAGACATGAGTTGTGAAATGGTGTATGGCCAAGCCCCACCTCCATTTGA
AGAGGACCCAGTCTCCAAACAAACTTGCTATGCTAAGATATGCTCTGTAGTACCACAACC
AAGCACCAGTGTGTGTCCTAAACTACAAGGATGAGTAGACACTTTTTAGGATTTACATTT
CCATGACATAATTTGAGCCGGTTTGACAACAAATCATACAACACTTATTGGAACTTGGTT
CACATTTTAGTGAAGAAACTCTAATAAGTGCTTTTTGATCCGTATTCAAACGTGGTCTTG
GTGTCTGGGTATAGCTTTCCCTCATGCCGTCTAGCTTCGGAGGAGGTTTTTTACTATTCC
GCATGGAATTATACTGCCTTTGAAGGCAaggatgttaaagatttaagtaaataatgtgca
acacatttattgattattatatcgaaaaattaataaatatgctattgt


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC015624A_C01 KMC015624A_c01
         (768 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_564838.1| expressed protein; protein id: At1g64680.1, sup...   191  6e-60
ref|NP_563673.1| Expressed protein; protein id: At1g03055.1, sup...   114  2e-24
ref|NP_682732.1| ORF_ID:tll1942~hypothetical protein [Thermosyne...    83  5e-18
ref|NP_680560.1| unknown protein; protein id: At4g01995.1, suppo...    91  2e-17
gb|AAN65325.1| Hypothetical protein F10G7.9a [Caenorhabditis ele...    32  7.7

>ref|NP_564838.1| expressed protein; protein id: At1g64680.1, supported by cDNA:
           101924. [Arabidopsis thaliana] gi|25373196|pir||H96669
           protein F1N19.25 [imported] - Arabidopsis thaliana
           gi|6633822|gb|AAF19681.1|AC009519_15 F1N19.25
           [Arabidopsis thaliana]
          Length = 250

 Score =  191 bits (485), Expect(2) = 6e-60
 Identities = 88/118 (74%), Positives = 100/118 (84%), Gaps = 5/118 (4%)
 Frame = +2

Query: 110 MTVPFFHWLVGPSEVVEVEINGVKQKSGVHIKKC-----SGCVGMCVNMCKTPTQDFFTN 274
           +TVPFFHWLVGPS+V+EVE+NGVKQ+SGV IKKC     SGCVGMCVNMCK PTQDFFTN
Sbjct: 133 LTVPFFHWLVGPSQVIEVEVNGVKQRSGVRIKKCRYLENSGCVGMCVNMCKIPTQDFFTN 192

Query: 275 EFGLPLTMIPNFEDMSCEMVYGQAPPPFEEDPVSKQTCYAKICSVVPQPSTSVCPKLQ 448
           EFGLPLTM PN+EDMSCEM+YGQAPP FEED  +KQ C A ICS +  PS+ +CPKL+
Sbjct: 193 EFGLPLTMNPNYEDMSCEMIYGQAPPAFEEDVATKQPCLADICS-MSNPSSPICPKLE 249

 Score = 62.4 bits (150), Expect(2) = 6e-60
 Identities = 28/29 (96%), Positives = 28/29 (96%)
 Frame = +3

Query: 24  LLSMLPPGAPAQFRKLFPPTKWAAEFNAA 110
           LLSMLPPGAP QFRKLFPPTKWAAEFNAA
Sbjct: 104 LLSMLPPGAPEQFRKLFPPTKWAAEFNAA 132

>ref|NP_563673.1| Expressed protein; protein id: At1g03055.1, supported by cDNA:
           gi_14488101 [Arabidopsis thaliana]
          Length = 264

 Score =  114 bits (284), Expect = 2e-24
 Identities = 50/94 (53%), Positives = 67/94 (71%), Gaps = 5/94 (5%)
 Frame = +2

Query: 125 FHWLVGPSEVVEVEINGVKQKSGVHIKKC-----SGCVGMCVNMCKTPTQDFFTNEFGLP 289
           F WLVGPSEV E E+NG K+KS V+I+KC     S CVGMC ++CK P+Q F  N  G+P
Sbjct: 158 FAWLVGPSEVRETEVNGRKEKSVVYIEKCRFLEQSNCVGMCTHICKIPSQIFIKNSLGMP 217

Query: 290 LTMIPNFEDMSCEMVYGQAPPPFEEDPVSKQTCY 391
           + M P+F D+SC+M++G+ PP  E+DP  KQ C+
Sbjct: 218 IYMEPDFNDLSCKMMFGREPPEIEDDPAMKQPCF 251

>ref|NP_682732.1| ORF_ID:tll1942~hypothetical protein [Thermosynechococcus elongatus
           BP-1] gi|22295668|dbj|BAC09494.1|
           ORF_ID:tll1942~hypothetical protein [Thermosynechococcus
           elongatus BP-1]
          Length = 218

 Score = 83.2 bits (204), Expect(2) = 5e-18
 Identities = 46/113 (40%), Positives = 61/113 (53%), Gaps = 10/113 (8%)
 Frame = +2

Query: 131 WLVGPSEVVEVEINGVKQ-----KSGVHIKKC-----SGCVGMCVNMCKTPTQDFFTNEF 280
           WLVG S+   VE+    Q      SGV I+KC     S C+ +C+N+CK PT+ FF    
Sbjct: 113 WLVGASDRYWVEVIPPNQLPQWQHSGVRIQKCRYLAESQCMALCMNLCKKPTEQFFRQRL 172

Query: 281 GLPLTMIPNFEDMSCEMVYGQAPPPFEEDPVSKQTCYAKICSVVPQPSTSVCP 439
           G+PLTM PNF+D SCEMV+G    P  + P+    C+         PS + CP
Sbjct: 173 GIPLTMTPNFKDYSCEMVFGTPAQPIPQPPL--LPCW-------QDPSQTPCP 216

 Score = 30.0 bits (66), Expect(2) = 5e-18
 Identities = 11/25 (44%), Positives = 16/25 (64%)
 Frame = +3

Query: 33  MLPPGAPAQFRKLFPPTKWAAEFNA 107
           ++PP      RKLF P++W  E+NA
Sbjct: 80  LIPPMMSTLIRKLFRPSRWVCEWNA 104

>ref|NP_680560.1| unknown protein; protein id: At4g01995.1, supported by cDNA:
           gi_17065173, supported by cDNA: gi_20259951 [Arabidopsis
           thaliana] gi|17065174|gb|AAL32741.1| Unknown protein
           [Arabidopsis thaliana] gi|20259952|gb|AAM13323.1|
           unknown protein [Arabidopsis thaliana]
          Length = 258

 Score = 90.5 bits (223), Expect = 2e-17
 Identities = 47/106 (44%), Positives = 67/106 (62%), Gaps = 6/106 (5%)
 Frame = +2

Query: 110 MTVPFFHWLVGPSEVVEVEI-NGVKQKSGVHIKKC-----SGCVGMCVNMCKTPTQDFFT 271
           +TV    WL+GPS+V  +++ NG    SGV ++KC     S CVG+C+N CK PTQ FF 
Sbjct: 142 VTVLTCQWLMGPSKVNIIDLPNGESWDSGVFVEKCQYLEESKCVGVCINTCKLPTQTFFK 201

Query: 272 NEFGLPLTMIPNFEDMSCEMVYGQAPPPFEEDPVSKQTCYAKICSV 409
           +  G+PL M PNF+D SC+  +G APP  E+D    + C+ + CS+
Sbjct: 202 DYMGVPLVMEPNFKDYSCQFKFGVAPP--EDDGNVNEPCF-ETCSI 244

>gb|AAN65325.1| Hypothetical protein F10G7.9a [Caenorhabditis elegans]
          Length = 727

 Score = 32.3 bits (72), Expect = 7.7
 Identities = 17/49 (34%), Positives = 25/49 (50%)
 Frame = +2

Query: 284 LPLTMIPNFEDMSCEMVYGQAPPPFEEDPVSKQTCYAKICSVVPQPSTS 430
           LP +   N++++S E+V    P P   DP  K      + +VV  PSTS
Sbjct: 190 LPASFHDNYDEVSMEVVSPDEPQPSPNDPFIKPPIQIPLEAVVSLPSTS 238

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 721,023,822
Number of Sequences: 1393205
Number of extensions: 16827736
Number of successful extensions: 41919
Number of sequences better than 10.0: 12
Number of HSP's better than 10.0 without gapping: 40310
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 41902
length of database: 448,689,247
effective HSP length: 121
effective length of database: 280,111,442
effective search space used: 37534933228
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
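
The bit scores reported above can be checked against the raw scores (the parenthesised values on the "Score =" lines) using the standard Karlin-Altschul conversion, bits = (lambda * S - ln K) / ln 2, with the gapped Lambda and K printed in this report. The sketch below is illustrative only; the function names are hypothetical. The effective database length is assumed to be the database letters minus (number of sequences * effective HSP length), which is consistent with the figures printed above. Because Lambda and K are rounded in the report, the results agree only to about one decimal place.

```python
import math

# Gapped Karlin-Altschul parameters as printed in the statistics block.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def effective_db_length(db_letters, n_sequences, hsp_length):
    """Assumed relation: subtract one effective HSP length per database sequence."""
    return db_letters - n_sequences * hsp_length


if __name__ == "__main__":
    # Raw scores taken from the "Score =" lines of the report.
    for raw in (485, 150, 284, 204, 66, 223, 72):
        print(f"raw {raw:>3} -> {bit_score(raw):.1f} bits")

    # Values from the database footer and "effective HSP length: 121".
    print(effective_db_length(448_689_247, 1_393_205, 121))
```

BLAST prints scores of 100 bits or more as integers, so a computed value near 191.4 corresponds to the reported 191.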


EST assemble image


No.  Clone        Accession  Start  End
1    MFB005f11_f  BP034281       1  464
2    MWM126b07_f  AV766743     224  768
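
The clone positions above can be used to see how the two ESTs tile the 768-letter contig. The following is a minimal sketch, assuming the two position values are 1-based inclusive start and end coordinates on the contig (an assumption; the record does not state the convention). The names `clones`, `overlap`, and `coverage_depth` are hypothetical.

```python
# (clone, accession, start, end); coordinates assumed 1-based and inclusive.
clones = [
    ("MFB005f11_f", "BP034281", 1, 464),
    ("MWM126b07_f", "AV766743", 224, 768),
]
CONTIG_LENGTH = 768  # query length reported in the BLASTX header


def overlap(a, b):
    """Return the (start, end) interval shared by two clones, or None."""
    start = max(a[2], b[2])
    end = min(a[3], b[3])
    return (start, end) if start <= end else None


def coverage_depth(records, length):
    """Count how many clones cover each contig position."""
    depth = [0] * length
    for _, _, start, end in records:
        for pos in range(start - 1, end):
            depth[pos] += 1
    return depth


if __name__ == "__main__":
    shared = overlap(clones[0], clones[1])
    print("overlap:", shared)
    if shared:
        print("overlap length:", shared[1] - shared[0] + 1)
    depth = coverage_depth(clones, CONTIG_LENGTH)
    print("uncovered positions:", depth.count(0))
    print("positions covered by both clones:", depth.count(2))
```

Under that coordinate assumption, the two clones share a single interval in the middle of the contig and together cover it end to end.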




Lotus japonicus
Kazusa DNA Research Institute