KMC000939A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC000939A_C01 KMC000939A_c01
ataaaaaaaacttttctaattaaatcccaaaattaaaacttaagatcaatttccaaataa
taaatagatcatattacaagTAAGATTATTTTTTCTGAAAAATAAGCCCAAGGCCCGTAT
GGAATAATAATCTCCACATTCTTAATTGACCAAAAAAAAAAAAAAAATCTCCACATTCTG
GCATGCCTAGTCGCTTGGTAGAATACAAGGGGAGCATTGCAGAAACAGGAAAAAATGATT
TAGATGATGAAACACTGCTTATGACAATTTCTATAAATCAGAAATATGCATTTTTCATTA
TGACAGGATTTTCTTTTGTGTTAATGTAATGAGGGGAGGGGTTGTGATGAGATAGGCACT
GAACTCCCTCATAAGAGTATTTACAGTGAATTTGCAGGAATGGCACATGGAATGAAGGAG
GAGAGTGCGACATGGAAAAGGAACCAGAAAAGGACCATACAAAACTTGCAACTGAACCAT
ATTATAACACATTTATATCTGATGTTGTCGAACAAATGCAATATGGGAGCTGGAAAGTCC
ACTTTTTGAACATCACATATCTTTCAGAATTGAGGAAAGATGGTCACCCTTCAAAATATC
GGGAACCGGGCACTCCACCTGATGCTCCTCAGGATTGTAGCCACTGGTGTTTGCCCGGCG
TTCCGGACACGTGGAATGAACTTCTATATGCCCAACTTCTATCTAAGAAATTCGGCACTG
ATGACAAATTTCCAGAAAGTGGAGAACAAAGCCAATCCAATTTTGATCAGGCCAGAAAGA
ATGTTAAGTTTTGATAGCTATACATAAATCTTCAATGCTCCTGATCATTTAACTTCATCC
CCTTCCTCTTCAGCGTTCATATCACTGGCTGTCATTTGCTTCATTTTCTCAAGCTGCCTC
AACAGATCTAACCTCTCTGGTGCATACGTGGACGGAATCAGAGCTGGTACATATCTCTGA
ATAGCAGCCAACATAGCTGCATGACCAACAGCAGTAACCGGAAGAAGCATTAAAACACCT
ATAGGAACAACTGAAGCCATGTCCGTCAAGGTTCTTTTTAGCGTTTTTnTCTCCTTCTCC
GTCAATTCATCCCCTATTAGGGCCCTTCTAACAAAACCTGTGGCAGCACCAACATCAATA
GCAAGAAGTTGAGTTCCTTGCCAGACATCCGTTCCCGTTTCTTTTAGTTTGTCTAGAGAT
TTCTGTAGCATACTTTCTTTCTTCTGAACCCTGACAATCTGAACACCTCTAGCATCATTG
TTGTAACGGCCACTATCATCGCTAAccatcaaatcctcattactttgtgattgattggcg
cttctttgaacccgtttctcaagttctagtagctcatttctcaga


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC000939A_C01 KMC000939A_c01
         (1365 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAN72009.1| Unknown protein [Arabidopsis thaliana]                 219  1e-55
gb|AAG51444.1|AC008153_17 unknown protein; 82436-88041 [Arabidop...   219  1e-55
dbj|BAB09687.1| gb|AAF00669.1~gene_id:MBL20.10~strong similarity...   218  2e-55
ref|NP_196240.1| putative protein; protein id: At5g06220.1 [Arab...   215  2e-54
ref|NP_187764.1| hypothetical protein; protein id: At3g11570.1 [...   166  1e-39

>gb|AAN72009.1| Unknown protein [Arabidopsis thaliana]
          Length = 872

 Score =  219 bits (557), Expect = 1e-55
 Identities = 111/178 (62%), Positives = 144/178 (80%)
 Frame = -2

Query: 1364 LRNELLELEKRVQRSANQSQSNEDLMVSDDSGRYNNDARGVQIVRVQKKESMLQKSLDKL 1185
            LRNEL+ELEKRV+RS +QS   E+L +S+D+ + ++    VQ+V+  KKE+M++K+L KL
Sbjct: 694  LRNELIELEKRVKRSTDQSVDEEEL-ISEDTPQSSSRTESVQLVQTPKKENMMEKTLQKL 752

Query: 1184 KETGTDVWQGTQLLAIDVGAATGFVRRALIGDELTEKEXKTLKRTLTDMASVVPIGVLML 1005
            +E  TDVWQGTQLLAID  AA   +RR+LIGDELT KE K L+RT+TD+ASV+PIG+LML
Sbjct: 753  REATTDVWQGTQLLAIDSAAAVQLLRRSLIGDELTGKEKKALRRTMTDLASVIPIGILML 812

Query: 1004 LPVTAVGHAAMLAAIQRYVPALIPSTYAPERLDLLRQLEKMKQMTASDMNAEEEGDEV 831
            LPVTAVGHAAMLA IQRYVP LIPSTY  ERL+LLRQLEK+K++  ++  +EE  +E+
Sbjct: 813  LPVTAVGHAAMLAGIQRYVPGLIPSTYGSERLNLLRQLEKIKELQTNETESEEGVEEI 870

>gb|AAG51444.1|AC008153_17 unknown protein; 82436-88041 [Arabidopsis thaliana]
          Length = 797

 Score =  219 bits (557), Expect = 1e-55
 Identities = 111/178 (62%), Positives = 144/178 (80%)
 Frame = -2

Query: 1364 LRNELLELEKRVQRSANQSQSNEDLMVSDDSGRYNNDARGVQIVRVQKKESMLQKSLDKL 1185
            LRNEL+ELEKRV+RS +QS   E+L +S+D+ + ++    VQ+V+  KKE+M++K+L KL
Sbjct: 619  LRNELIELEKRVKRSTDQSVDEEEL-ISEDTPQSSSRTESVQLVQTPKKENMMEKTLQKL 677

Query: 1184 KETGTDVWQGTQLLAIDVGAATGFVRRALIGDELTEKEXKTLKRTLTDMASVVPIGVLML 1005
            +E  TDVWQGTQLLAID  AA   +RR+LIGDELT KE K L+RT+TD+ASV+PIG+LML
Sbjct: 678  REATTDVWQGTQLLAIDSAAAVQLLRRSLIGDELTGKEKKALRRTMTDLASVIPIGILML 737

Query: 1004 LPVTAVGHAAMLAAIQRYVPALIPSTYAPERLDLLRQLEKMKQMTASDMNAEEEGDEV 831
            LPVTAVGHAAMLA IQRYVP LIPSTY  ERL+LLRQLEK+K++  ++  +EE  +E+
Sbjct: 738  LPVTAVGHAAMLAGIQRYVPGLIPSTYGSERLNLLRQLEKIKELQTNETESEEGVEEI 795

>dbj|BAB09687.1| gb|AAF00669.1~gene_id:MBL20.10~strong similarity to unknown protein
            [Arabidopsis thaliana]
          Length = 806

 Score =  218 bits (554), Expect = 2e-55
 Identities = 113/177 (63%), Positives = 142/177 (79%)
 Frame = -2

Query: 1364 LRNELLELEKRVQRSANQSQSNEDLMVSDDSGRYNNDARGVQIVRVQKKESMLQKSLDKL 1185
            LRNEL+ELEKRVQ S ++S S +    S+D  + ++  +GVQ+V+  KKE++++K+LD+L
Sbjct: 628  LRNELIELEKRVQGSTDESVSKQG-RTSEDLPKSSSSTKGVQLVQSSKKENVIEKTLDQL 686

Query: 1184 KETGTDVWQGTQLLAIDVGAATGFVRRALIGDELTEKEXKTLKRTLTDMASVVPIGVLML 1005
            K+  TDVWQGTQLLA D  AA   +RR+++GDELTEKE K L+RT+TD+ASVVPIGVLML
Sbjct: 687  KDATTDVWQGTQLLAFDSAAAMELLRRSVVGDELTEKEKKALRRTMTDLASVVPIGVLML 746

Query: 1004 LPVTAVGHAAMLAAIQRYVPALIPSTYAPERLDLLRQLEKMKQMTASDMNAEEEGDE 834
            LPVTAVGHAAMLAAIQRYVP LIPSTY  ERL+LLRQLEK+KQM  ++   EE  DE
Sbjct: 747  LPVTAVGHAAMLAAIQRYVPGLIPSTYGAERLNLLRQLEKVKQMQTNETEPEEGIDE 803

>ref|NP_196240.1| putative protein; protein id: At5g06220.1 [Arabidopsis thaliana]
          Length = 813

 Score =  215 bits (547), Expect = 2e-54
 Identities = 115/178 (64%), Positives = 142/178 (79%), Gaps = 1/178 (0%)
 Frame = -2

Query: 1364 LRNELLELEKRVQRSANQS-QSNEDLMVSDDSGRYNNDARGVQIVRVQKKESMLQKSLDK 1188
            LRNEL+ELEKRVQ S ++S +++EDL  S  S       +GVQ+V+  KKE++++K+LD+
Sbjct: 639  LRNELIELEKRVQGSTDESGRTSEDLPKSSSS------TKGVQLVQSSKKENVIEKTLDQ 692

Query: 1187 LKETGTDVWQGTQLLAIDVGAATGFVRRALIGDELTEKEXKTLKRTLTDMASVVPIGVLM 1008
            LK+  TDVWQGTQLLA D  AA   +RR+++GDELTEKE K L+RT+TD+ASVVPIGVLM
Sbjct: 693  LKDATTDVWQGTQLLAFDSAAAMELLRRSVVGDELTEKEKKALRRTMTDLASVVPIGVLM 752

Query: 1007 LLPVTAVGHAAMLAAIQRYVPALIPSTYAPERLDLLRQLEKMKQMTASDMNAEEEGDE 834
            LLPVTAVGHAAMLAAIQRYVP LIPSTY  ERL+LLRQLEK+KQM  ++   EE  DE
Sbjct: 753  LLPVTAVGHAAMLAAIQRYVPGLIPSTYGAERLNLLRQLEKVKQMQTNETEPEEGIDE 810

>ref|NP_187764.1| hypothetical protein; protein id: At3g11570.1 [Arabidopsis
           thaliana] gi|12322909|gb|AAG51447.1|AC008153_20
           hypothetical protein; 89863-88075 [Arabidopsis thaliana]
          Length = 427

 Score =  166 bits (419), Expect = 1e-39
 Identities = 71/108 (65%), Positives = 85/108 (77%)
 Frame = +3

Query: 396 RNGTWNEGGECDMEKEPEKDHTKLATEPYYNTFISDVVEQMQYGSWKVHFLNITYLSELR 575
           RNGTWN GG CD + EPE D  K+  +P +N +IS  +++M+Y   KV FLNITYL+E R
Sbjct: 319 RNGTWNLGGLCDADTEPETDMKKMEPDPIHNNYISQAIQEMRYEHSKVKFLNITYLTEFR 378

Query: 576 KDGHPSKYREPGTPPDAPQDCSHWCLPGVPDTWNELLYAQLLSKKFGT 719
           KD HPS+YREPGTP DAPQDCSHWCLPGVPDTWNE+LYAQLL+  + T
Sbjct: 379 KDAHPSRYREPGTPEDAPQDCSHWCLPGVPDTWNEILYAQLLAMNYRT 426

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,156,288,137
Number of Sequences: 1393205
Number of extensions: 27167284
Number of successful extensions: 67732
Number of sequences better than 10.0: 125
Number of HSP's better than 10.0 without gapping: 63011
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 67573
length of database: 448,689,247
effective HSP length: 127
effective length of database: 271,752,212
effective search space used: 88862973324
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MPDL088g04_f AV781119 1 519
2 MPDL032a04_f AV778065 245 785
3 MPDL022c01_f AV777601 246 685
4 MFL006b10_f BP033638 260 607
5 MFB017f09_f BP035201 271 800
6 MFBL036g07_f BP043087 276 566
7 SPDL053d07_f BP055344 282 850
8 GENLf046b05 BP064761 499 964
9 SPD050b02_f BP047961 851 1366




Lotus japonicus
Kazusa DNA Research Institute