KMC020483A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC020483A_C01 KMC020483A_c01
AGTTTTCATGCTTTTCCTAAGCCAGGAGAGTTTTCAGGATCATTTCCCTGTTCTCAACAG
ATGGAAGGCCAACCATGATCCTGCGCTCGAACCGTCTAATAATTGCTTCATCGAGGTCAA
ATGGCCTATTGGTTGCAGCAAGAACGAGAATTTGCTCACCAGGTGCAGTTAATAGCCCAT
CCCAGTGTGTCATGAATTCATTTTTAATCTTCCTCATGGCCTCATGCTCTCCAACTCTAG
TCCTCTGACCAAGCATGCTGTCAACCTCATCGACAAAGATTATAGTAGGAGCAACCTTTG
CTGCTAGCGAGAACAGAGCGCGGACATTCTTTTCGTCTTCTCCAAACCATTTTGAAGTGA
TGGTGGACATTGAAACATTGATGAAACTTGCTCCAGCTTCATTTGCAATTGCTTnGGCAA
GCATTGTTTTCCCGGTTCCCGGAGGCCCGAAAAGTAATATACCTCTACATGGCTTTAGAA
GACCACCTTTGAAGAGGTCTGGTCTTCTAAGGGGAAGCATTACCAACTCCTGAAGTGATT
CTTTAATCTCATCCATTGCACCAATGTCTCCAAATGTAACCCCTATCTCATTTGCAGGGA
TAACTTCAGGTCTTATGCGCTTCTCAAATTCGTTATCAGGAACTTCAGCTTTTGCAGGTG
TTGGATTTTCACCATCTTTTTTTGTTACAGGGATGGCTTTCTCTGTnTCATTTTTGTTTT
CAGGTGCTTGGTTGTCACACCGCACATCATTCTTTGCACCAGTAATGTCTT


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC020483A_C01 KMC020483A_c01
         (771 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAL32670.1| similar to homeobox protein [Arabidopsis thaliana]     390  e-107
ref|NP_564824.1| expressed protein; protein id: At1g64110.1, sup...   390  e-107
pir||G96665 protein F22C12.12 [imported] - Arabidopsis thaliana ...   390  e-107
gb|AAN46221.1| unknown protein [Arabidopsis lyrata] gi|28188593|...   387  e-106
gb|AAN46211.1| unknown protein [Arabidopsis thaliana]                 387  e-106

>gb|AAL32670.1| similar to homeobox protein [Arabidopsis thaliana]
          Length = 752

 Score =  390 bits (1002), Expect = e-107
 Identities = 205/257 (79%), Positives = 221/257 (85%), Gaps = 8/257 (3%)
 Frame = -3

Query: 757  AKNDVRCDNQAPENKNXTEKAIPVTKKD-------GENPTPAKAEV-PDNEFEKRIRPEV 602
            +  +V+ ++  PE K  TE    V+ K+        E  TP   EV PDNEFEKRIRPEV
Sbjct: 453  SSKEVKAESIKPETK--TESVTTVSSKEEPEKEAKAEKVTPKAPEVAPDNEFEKRIRPEV 510

Query: 601  IPANEIGVTFGDIGAMDEIKESLQELVMLPLRRPDLFKGGLLKPCRGILLFGPPGTGKTM 422
            IPA EI VTF DIGA+DEIKESLQELVMLPLRRPDLF GGLLKPCRGILLFGPPGTGKTM
Sbjct: 511  IPAEEINVTFKDIGALDEIKESLQELVMLPLRRPDLFTGGLLKPCRGILLFGPPGTGKTM 570

Query: 421  LAXAIANEAGASFINVSMSTITSKWFGEDEKNVRALFSLAAKVAPTIIFVDEVDSMLGQR 242
            LA AIA EAGASFINVSMSTITSKWFGEDEKNVRALF+LA+KV+PTIIFVDEVDSMLGQR
Sbjct: 571  LAKAIAKEAGASFINVSMSTITSKWFGEDEKNVRALFTLASKVSPTIIFVDEVDSMLGQR 630

Query: 241  TRVGEHEAMRKIKNEFMTHWDGLLTAPGEQILVLAATNRPFDLDEAIIRRFERRIMVGLP 62
            TRVGEHEAMRKIKNEFM+HWDGL+T PGE+ILVLAATNRPFDLDEAIIRRFERRIMVGLP
Sbjct: 631  TRVGEHEAMRKIKNEFMSHWDGLMTKPGERILVLAATNRPFDLDEAIIRRFERRIMVGLP 690

Query: 61   SVENREMILKTLLA*EK 11
            +VENRE IL+TLLA EK
Sbjct: 691  AVENREKILRTLLAKEK 707

>ref|NP_564824.1| expressed protein; protein id: At1g64110.1, supported by cDNA:
            gi_15810166 [Arabidopsis thaliana]
            gi|15810167|gb|AAL06985.1| At1g64110/F22C12_22
            [Arabidopsis thaliana]
          Length = 824

 Score =  390 bits (1002), Expect = e-107
 Identities = 205/257 (79%), Positives = 221/257 (85%), Gaps = 8/257 (3%)
 Frame = -3

Query: 757  AKNDVRCDNQAPENKNXTEKAIPVTKKD-------GENPTPAKAEV-PDNEFEKRIRPEV 602
            +  +V+ ++  PE K  TE    V+ K+        E  TP   EV PDNEFEKRIRPEV
Sbjct: 448  SSKEVKAESIKPETK--TESVTTVSSKEEPEKEAKAEKVTPKAPEVAPDNEFEKRIRPEV 505

Query: 601  IPANEIGVTFGDIGAMDEIKESLQELVMLPLRRPDLFKGGLLKPCRGILLFGPPGTGKTM 422
            IPA EI VTF DIGA+DEIKESLQELVMLPLRRPDLF GGLLKPCRGILLFGPPGTGKTM
Sbjct: 506  IPAEEINVTFKDIGALDEIKESLQELVMLPLRRPDLFTGGLLKPCRGILLFGPPGTGKTM 565

Query: 421  LAXAIANEAGASFINVSMSTITSKWFGEDEKNVRALFSLAAKVAPTIIFVDEVDSMLGQR 242
            LA AIA EAGASFINVSMSTITSKWFGEDEKNVRALF+LA+KV+PTIIFVDEVDSMLGQR
Sbjct: 566  LAKAIAKEAGASFINVSMSTITSKWFGEDEKNVRALFTLASKVSPTIIFVDEVDSMLGQR 625

Query: 241  TRVGEHEAMRKIKNEFMTHWDGLLTAPGEQILVLAATNRPFDLDEAIIRRFERRIMVGLP 62
            TRVGEHEAMRKIKNEFM+HWDGL+T PGE+ILVLAATNRPFDLDEAIIRRFERRIMVGLP
Sbjct: 626  TRVGEHEAMRKIKNEFMSHWDGLMTKPGERILVLAATNRPFDLDEAIIRRFERRIMVGLP 685

Query: 61   SVENREMILKTLLA*EK 11
            +VENRE IL+TLLA EK
Sbjct: 686  AVENREKILRTLLAKEK 702

>pir||G96665 protein F22C12.12 [imported] - Arabidopsis thaliana
            gi|6692099|gb|AAF24564.1|AC007764_6 F22C12.12
            [Arabidopsis thaliana]
          Length = 825

 Score =  390 bits (1002), Expect = e-107
 Identities = 205/257 (79%), Positives = 221/257 (85%), Gaps = 8/257 (3%)
 Frame = -3

Query: 757  AKNDVRCDNQAPENKNXTEKAIPVTKKD-------GENPTPAKAEV-PDNEFEKRIRPEV 602
            +  +V+ ++  PE K  TE    V+ K+        E  TP   EV PDNEFEKRIRPEV
Sbjct: 426  SSKEVKAESIKPETK--TESVTTVSSKEEPEKEAKAEKVTPKAPEVAPDNEFEKRIRPEV 483

Query: 601  IPANEIGVTFGDIGAMDEIKESLQELVMLPLRRPDLFKGGLLKPCRGILLFGPPGTGKTM 422
            IPA EI VTF DIGA+DEIKESLQELVMLPLRRPDLF GGLLKPCRGILLFGPPGTGKTM
Sbjct: 484  IPAEEINVTFKDIGALDEIKESLQELVMLPLRRPDLFTGGLLKPCRGILLFGPPGTGKTM 543

Query: 421  LAXAIANEAGASFINVSMSTITSKWFGEDEKNVRALFSLAAKVAPTIIFVDEVDSMLGQR 242
            LA AIA EAGASFINVSMSTITSKWFGEDEKNVRALF+LA+KV+PTIIFVDEVDSMLGQR
Sbjct: 544  LAKAIAKEAGASFINVSMSTITSKWFGEDEKNVRALFTLASKVSPTIIFVDEVDSMLGQR 603

Query: 241  TRVGEHEAMRKIKNEFMTHWDGLLTAPGEQILVLAATNRPFDLDEAIIRRFERRIMVGLP 62
            TRVGEHEAMRKIKNEFM+HWDGL+T PGE+ILVLAATNRPFDLDEAIIRRFERRIMVGLP
Sbjct: 604  TRVGEHEAMRKIKNEFMSHWDGLMTKPGERILVLAATNRPFDLDEAIIRRFERRIMVGLP 663

Query: 61   SVENREMILKTLLA*EK 11
            +VENRE IL+TLLA EK
Sbjct: 664  AVENREKILRTLLAKEK 680

>gb|AAN46221.1| unknown protein [Arabidopsis lyrata] gi|28188593|gb|AAN46222.1|
           unknown protein [Arabidopsis lyrata]
          Length = 316

 Score =  387 bits (993), Expect = e-106
 Identities = 194/218 (88%), Positives = 207/218 (93%)
 Frame = -3

Query: 655 AKAEVPDNEFEKRIRPEVIPANEIGVTFGDIGAMDEIKESLQELVMLPLRRPDLFKGGLL 476
           +K   PDNEFEKRIRPEVIPANEIGVTF DIG++DE KESLQELVMLPLRRPDLFKGGLL
Sbjct: 1   SKEVAPDNEFEKRIRPEVIPANEIGVTFADIGSLDETKESLQELVMLPLRRPDLFKGGLL 60

Query: 475 KPCRGILLFGPPGTGKTMLAXAIANEAGASFINVSMSTITSKWFGEDEKNVRALFSLAAK 296
           KPCRGILLFGPPGTGKTM+A AIANEAGASFINVSMSTITSKWFGEDEKNVRALF+LAAK
Sbjct: 61  KPCRGILLFGPPGTGKTMMAKAIANEAGASFINVSMSTITSKWFGEDEKNVRALFTLAAK 120

Query: 295 VAPTIIFVDEVDSMLGQRTRVGEHEAMRKIKNEFMTHWDGLLTAPGEQILVLAATNRPFD 116
           V+PTIIFVDEVDSMLGQRTRVGEHEAMRKIKNEFMTHWDGL++  G++ILVLAATNRPFD
Sbjct: 121 VSPTIIFVDEVDSMLGQRTRVGEHEAMRKIKNEFMTHWDGLMSNAGDRILVLAATNRPFD 180

Query: 115 LDEAIIRRFERRIMVGLPSVENREMILKTLLA*EKHEN 2
           LDEAIIRRFERRIMVGLPSVE+RE IL+TLL+ EK EN
Sbjct: 181 LDEAIIRRFERRIMVGLPSVESREKILRTLLSKEKTEN 218

>gb|AAN46211.1| unknown protein [Arabidopsis thaliana]
          Length = 316

 Score =  387 bits (993), Expect = e-106
 Identities = 194/218 (88%), Positives = 207/218 (93%)
 Frame = -3

Query: 655 AKAEVPDNEFEKRIRPEVIPANEIGVTFGDIGAMDEIKESLQELVMLPLRRPDLFKGGLL 476
           +K   PDNEFEKRIRPEVIPANEIGVTF DIG++DE KESLQELVMLPLRRPDLFKGGLL
Sbjct: 1   SKEVAPDNEFEKRIRPEVIPANEIGVTFADIGSLDETKESLQELVMLPLRRPDLFKGGLL 60

Query: 475 KPCRGILLFGPPGTGKTMLAXAIANEAGASFINVSMSTITSKWFGEDEKNVRALFSLAAK 296
           KPCRGILLFGPPGTGKTM+A AIANEAGASFINVSMSTITSKWFGEDEKNVRALF+LAAK
Sbjct: 61  KPCRGILLFGPPGTGKTMMAKAIANEAGASFINVSMSTITSKWFGEDEKNVRALFTLAAK 120

Query: 295 VAPTIIFVDEVDSMLGQRTRVGEHEAMRKIKNEFMTHWDGLLTAPGEQILVLAATNRPFD 116
           V+PTIIFVDEVDSMLGQRTRVGEHEAMRKIKNEFMTHWDGL++  G++ILVLAATNRPFD
Sbjct: 121 VSPTIIFVDEVDSMLGQRTRVGEHEAMRKIKNEFMTHWDGLMSNAGDRILVLAATNRPFD 180

Query: 115 LDEAIIRRFERRIMVGLPSVENREMILKTLLA*EKHEN 2
           LDEAIIRRFERRIMVGLPSVE+RE IL+TLL+ EK EN
Sbjct: 181 LDEAIIRRFERRIMVGLPSVESREKILRTLLSKEKTEN 218

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 671,977,079
Number of Sequences: 1393205
Number of extensions: 14843067
Number of successful extensions: 55156
Number of sequences better than 10.0: 2575
Number of HSP's better than 10.0 without gapping: 50137
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 53368
length of database: 448,689,247
effective HSP length: 121
effective length of database: 280,111,442
effective search space used: 37815044670
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MFBL033f02_f BP042924 1 397
2 MFBL026e05_f BP042571 309 771




Lotus japonicus
Kazusa DNA Research Institute