KMC019605A_c01

Fasta Sequence
>KMC019605A_C01 KMC019605A_c01
attgccccaccagatgcaGGGTGCTGTTGCACTCAGAAGATAGATCTGGTGGTGGATGAA
GCCTCACATGATCCCTATTATTGTTAGAGAAGTCCAGCATGTCGCTACTAAAACTTGTCA
CACAGGATTTTGGAGACCAACAAGACTTTGCTGAGTGAAGTTCTTCGTTTCCATGCCCAT
ACACATAGCTGTTTCCCGAGTTTTCTTGTTTAACATCAATAAAGGAAGCATTTGGAGCCT
GAGTTAGCATTTGATTCTGAAACTGGGCCATAGCACCCTTATCTTCTTCAGCCACAACTC
CACTCATGAGTAGCTGGCTCCATGACTCAGGAACCTCTTGATTTTCTGGCCAAGAAGGAA
AGGGAAGAGAAGAAGAAGAAGAAGGGGGATATGGGGTGAGAAAGTTGGAAGGAGTGGGAA
GGAAAGGAGGAGCTTGTGGTTGTGGAGATGGTGGCCTCATGCTATTGATATTGATATTCC
ACCAGTTAGGGTTCCCAGCCATCATTTGTTGCACTGGGGAGCTTTGAAGAACACCTCTAT
TCATGCCTTGTTGTTCTTCTTCTTCGCTTTGAG
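
The BLASTX alignments reported for this contig are all in frame -3, meaning the protein is read from the reverse-complement strand starting at its third base. The following is a minimal Python sketch of that translation, assuming the standard genetic code. The helper names (`reverse_complement`, `translate`) and the `TAIL` constant, which holds only the last 33 bases of the sequence above, are illustrative and not part of the original record.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (case-insensitive)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq, frame):
    """Translate seq in a BLAST-style frame (+1..+3 or -1..-3).

    Codons that are not in the table (e.g. containing N) become 'X'.
    """
    strand = seq.upper() if frame > 0 else reverse_complement(seq)
    start = abs(frame) - 1
    codons = (strand[i:i + 3] for i in range(start, len(strand) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Last 33 bases of the contig (positions 541-573), used here as a small input.
TAIL = "TCATGCCTTGTTGTTCTTCTTCTTCGCTTTGAG"
print(translate(TAIL, -3))
```

Run on `TAIL`, this prints `QSEEEEQQGM`, which matches the residues shown at query position 571 in the Dictyostelium alignment and ends at the `M` that begins the `MNRGVL...` stretch at position 544. Passing the full 573-letter sequence instead of `TAIL` would give the whole frame -3 translation.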


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC019605A_C01 KMC019605A_c01
         (573 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_194639.1| putative protein; protein id: At4g29100.1, supp...   103  2e-21
emb|CAC14433.1| putative protein [Brassica napus]                      98  6e-20
ref|NP_179599.1| unknown protein; protein id: At2g20090.1 [Arabi...    76  3e-13
gb|AAL92370.2| similar to Dictyostelium discoideum (Slime mold)....    40  0.021
ref|XP_289489.1| hypothetical protein XP_289489 [Mus musculus]         40  0.027

>ref|NP_194639.1| putative protein; protein id: At4g29100.1, supported by cDNA:
           gi_19698938 [Arabidopsis thaliana]
           gi|7485742|pir||T08965 hypothetical protein F19B15.130 -
           Arabidopsis thaliana gi|4972056|emb|CAB43924.1| putative
           protein [Arabidopsis thaliana]
           gi|7269808|emb|CAB79668.1| putative protein [Arabidopsis
           thaliana] gi|19698939|gb|AAL91205.1| putative protein
           [Arabidopsis thaliana]
           gi|22711852|gb|AAM10966.2|AF488634_1 putative bHLH
           transcription factor [Arabidopsis thaliana]
           gi|23197826|gb|AAN15440.1| putative protein [Arabidopsis
           thaliana]
          Length = 407

 Score =  103 bits (257), Expect = 2e-21
 Identities = 89/242 (36%), Positives = 118/242 (47%), Gaps = 68/242 (28%)
 Frame = -3

Query: 544 MNRGVLQSSPVQQMM-AGNPNWWNININSMRPPSP-----QPQAPPFLPTPSNFL----- 398
           MNRGVL+SSPVQQ+M AGNPNWWN++   MRPP P     Q   PP +   +N+L     
Sbjct: 1   MNRGVLESSPVQQLMAAGNPNWWNVS-GGMRPPPPLMGHQQAPLPPHMTPNNNYLRPRMM 59

Query: 397 -TPYP-------PSSSSSLPFPSWPENQEV----------PESW--SQLLMSGVVAEED- 281
            TP+P        SSSSS   PS P N  +          PESW  SQLL+ G++  E+ 
Sbjct: 60  PTPFPHFLPSPATSSSSSSSSPSLPNNPNLSSWLESNDLPPESWSLSQLLLGGLMMGEEE 119

Query: 280 ---------------------KGAMAQFQNQMLT-QAPNASFIDVKQE---NSGNSYVYG 176
                                K  +  ++ Q+L+ Q  +   +D+KQE   N+ N YV  
Sbjct: 120 RLEMMNHHNHHDEQQHHGFQGKIRLENWEEQVLSHQQASMVAVDIKQEGNINNNNGYVIS 179

Query: 175 HGNEELHSAKSCWSPKSCVTSFSSD--------MLDF-SNNNRDHVR--LHPPPDLSSEC 29
             N   +  KSC +  +  +  S+D        MLDF SN+N  H+    H PPD SSEC
Sbjct: 180 SPNSPPN--KSCVTTTTTTSLNSNDDNINNNNNMLDFSSNHNGLHLSEGRHTPPDRSSEC 237

Query: 28  NS 23
           NS
Sbjct: 238 NS 239

>emb|CAC14433.1| putative protein [Brassica napus]
          Length = 389

 Score = 98.2 bits (243), Expect = 6e-20
 Identities = 83/226 (36%), Positives = 113/226 (49%), Gaps = 52/226 (23%)
 Frame = -3

Query: 544 MNRGVLQSSPVQQMM-AGNPNWWNININSMRPPSP-----QPQAPPFLPTPSNFLTP--Y 389
           MNRG L+SSPVQQ+M AGNPNWWN++  S RPP P     Q   PP +   +N+L P   
Sbjct: 1   MNRGALESSPVQQLMVAGNPNWWNVS-GSTRPPPPLMGHQQGPLPPQMTPNNNYLRPRMM 59

Query: 388 PPSSSSSL----PFPSWPENQEV-PESW--SQLLMSGVVAEED----------------- 281
             SSS SL       SW E+ ++ PESW  SQLL+ G++  E+                 
Sbjct: 60  MTSSSPSLLDNPSLSSWLESNDLPPESWSLSQLLLGGLMMGEEERLEIMNHHSHHDEQHH 119

Query: 280 -----KGAMAQFQNQMLT-QAPNASFIDVKQE---NSGNSYVYGHGNEELHSAKSCWSPK 128
                K  +  ++ Q+L  Q  +   +D+KQE   N+ N Y+    N   +  KSC +  
Sbjct: 120 HSFQGKMRLENWEEQVLRHQQASMGVVDIKQESNINNNNGYLISSPNSPPN--KSCVTTT 177

Query: 127 SCVTSFSSD-------MLDFSNN----NRDHVRLHPPPDLSSECNS 23
           +  +  S+D       ML FS+N    N   +R H PPD SSECNS
Sbjct: 178 TTTSLNSNDNTNNNNNMLGFSSNHNGLNLSEIR-HTPPDRSSECNS 222

>ref|NP_179599.1| unknown protein; protein id: At2g20090.1 [Arabidopsis thaliana]
           gi|25411964|pir||H84584 hypothetical protein At2g20090
           [imported] - Arabidopsis thaliana
           gi|4580464|gb|AAD24388.1| unknown protein [Arabidopsis
           thaliana]
          Length = 219

 Score = 75.9 bits (185), Expect = 3e-13
 Identities = 70/197 (35%), Positives = 96/197 (48%), Gaps = 34/197 (17%)
 Frame = -3

Query: 544 MNRGVLQSSPVQQMMA-GNPNWWNININSMRPPSP-----QPQAPPFLPT--PSNFLTPY 389
           MNRGVL+SSPVQ + A GNPNWWN     +RPP+P      P    F+P+  P+ F +P 
Sbjct: 1   MNRGVLESSPVQHLTAAGNPNWWNNVSRGLRPPTPLMSHEPPSTTAFIPSLLPNFFSSPT 60

Query: 388 PPSSSS-SLP-------FPSWPENQEVP--ESW--SQLLMSGVV--AEEDKGAMAQFQNQ 251
             SSSS S P       F SW E  ++P  + W  SQLL+ G++   EE    M    +Q
Sbjct: 61  SSSSSSPSFPPPNSNPNFSSWLEMSDLPLDQPWSLSQLLLGGLMMGEEEKMEMMNHHHHQ 120

Query: 250 MLTQAPNASFI------------DVKQENSGNSYVYGHGNEELHSAKSCWSPKSCVTSFS 107
              Q+  A  I             +KQE+S N+  YG     + S+ +    KSC T  +
Sbjct: 121 NQHQSYQAKRIQNWEEQVLRHQASMKQESSNNN-SYG-----IMSSPNSPPNKSCATIIN 174

Query: 106 SDMLDFSNNNRDHVRLH 56
           ++     NNN  H  L+
Sbjct: 175 TNE---DNNNNIHSGLN 188

>gb|AAL92370.2| similar to Dictyostelium discoideum (Slime mold). TRFA
          Length = 673

 Score = 40.0 bits (92), Expect = 0.021
 Identities = 39/160 (24%), Positives = 55/160 (34%), Gaps = 25/160 (15%)
 Frame = -3

Query: 571 QSEEEEQQGMNRGVLQSSPVQQMMAGNPNW----------------WNININSMRPPSPQ 440
           Q +++++Q   +   +  P QQ    NPN+                   N N +    PQ
Sbjct: 202 QQQDQQKQHDQQQHQEQQPQQQQFNNNPNFNGNTTNNSNQFMNGQNIQFNNNDIHQSQPQ 261

Query: 439 PQAPPF-LPTPSNFLTPYPPSSSSSLPFPSWPENQEVPESWSQLLMS----GVVAEEDKG 275
           PQ+ P   P P     P P       P P  P+ Q  P+   QL  S    G     +  
Sbjct: 262 PQSQPQPQPQPQPQPQPQPQPQPQPQPQPQQPQPQPQPQQQQQLQFSSNNNGTFNNTNNY 321

Query: 274 AMAQF----QNQMLTQAPNASFIDVKQENSGNSYVYGHGN 167
               F     N       N+ FI+    N+ NSY    GN
Sbjct: 322 NNGSFNNNNNNNNNNNNNNSGFINSSNGNNFNSYNNNSGN 361

>ref|XP_289489.1| hypothetical protein XP_289489 [Mus musculus]
          Length = 330

 Score = 39.7 bits (91), Expect = 0.027
 Identities = 23/59 (38%), Positives = 29/59 (48%), Gaps = 1/59 (1%)
 Frame = -3

Query: 490 PNWWNININSMRPPSPQPQAP-PFLPTPSNFLTPYPPSSSSSLPFPSWPENQEVPESWS 317
           P  W++     RPP P PQ P P LP  S     + P+  + LP  SW   Q  PE+WS
Sbjct: 272 PQPWDV-----RPPQPLPQPPSPLLPRTSAL--DWSPNPPAPLPSLSWVVTQSSPEAWS 323

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 555,271,291
Number of Sequences: 1393205
Number of extensions: 13627375
Number of successful extensions: 86195
Number of sequences better than 10.0: 563
Number of HSP's better than 10.0 without gapping: 60073
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 79970
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 21243732558
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
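
The bit scores and Expect values in the hit list can be approximately recomputed from the footer statistics using the standard Karlin-Altschul relations: bit score = (lambda * raw score - ln K) / ln 2, and E = effective search space * 2^(-bit score). The sketch below assumes the gapped lambda and K values and the "effective search space used" printed above; the function names are illustrative.

```python
import math

# Gapped Karlin-Altschul parameters and search space as printed in the report footer.
LAMBDA = 0.267
K = 0.0410
SEARCH_SPACE = 21243732558


def bit_score(raw_score, lam=LAMBDA, k=K):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect(bits, search_space=SEARCH_SPACE):
    """Expected number of chance alignments scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


# Raw scores are the values in parentheses on each "Score =" line.
for raw in (257, 243, 185, 92, 91):
    bits = bit_score(raw)
    print(f"raw={raw:3d}  bits={bits:6.1f}  E={expect(bits):.1e}")
```

Because lambda and K are printed to only three significant figures, the recomputed values agree with the report only approximately. For example, a raw score of 243 gives about 98.2 bits, and a bit score of 103 gives an Expect value of about 2e-21.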


EST assemble image


no.  clone        accession  start  end
1    MFB034g01_f  BP036517   1      532
2    MFB077g01_f  BP039642   19     573




Lotus japonicus
Kazusa DNA Research Institute