KMC018210A_c01

Fasta Sequence
>KMC018210A_C01 KMC018210A_c01
tgtcgtgctctgttccttccttccttccttccaattccatggcagcactctcatggcgac
ccttcatcctctcacgcctcACCGACCTCTCACCCAATCCTCTCCACCCACCCAAGCCAC
CACCTCTCTTCCTCCGCCGCCGCCGCTGCTTCCTCACCTCCTGCTACGCCGATGGGTTCT
CGTCGTCCTCTTCTTCCTCATCCTCTGACGATGTCGTCTCCACCCGGAAATCCACCTTCG
ACCGCGGCTTCACCGTCATCGCCAACATGCTCCGCCGCATCGAGCCCCTCGATAACTCCG
TCATCTCAAAGGGTGTCTCCGACGCCGCCAGAGACTCCATGAAGCAGACCATCTCCACCA
TGCTCGGTTTGCTCCCCTCCGATCACTTCTCTGTCACTGTCACTGTTTCCAAGCACCCCC
TCCATCGTCTTCTCGTCTCTTCCATCATCACCGGGTACACTCTGTGGAATGCGGAGTACA
GGATGTCCCTGACGAGGAATCTGGATATAGCATCGCCCTGTGGTGCAAGAGATTCGGATT
GTGAGAAACGTTCAGAGATCTTGGAGGTTAAGGGTGGAGGAGAGGATGGTGGGGAAATTG
AGGTTGCTTCTGATTTGGGGCTCAAGGATTTGGAAAATTGCAGTAGTAGCCCGAGAGTGT
TTGGAGATTTGCCTCCTCAGGCTCTCAATTACATTCAGCAGTTGCAGTCTG
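
The BLASTX report below aligns this contig in frame +3, starting at query nucleotide 39, and in frame +1 for the Thermobifida fusca hit. As a minimal sketch for checking those frames by hand, the snippet below translates the first 80 nucleotides of the sequence above with the standard genetic code. The helper name `translate` and the constant `QUERY_PREFIX` are illustrative assumptions, not part of the original record, and only forward frames are handled.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# First 80 nucleotides of KMC018210A_c01, copied from the FASTA record above.
QUERY_PREFIX = (
    "tgtcgtgctctgttccttccttccttccttccaattccatggcagcactctcatggcgac"
    "ccttcatcctctcacgcctc"
)


def translate(seq, frame=1):
    """Translate a nucleotide string in forward frame 1, 2 or 3.

    Frame n starts at 1-based nucleotide n. Codons containing characters
    other than A, C, G, T are reported as 'X'. A trailing partial codon
    is ignored.
    """
    if frame not in (1, 2, 3):
        raise ValueError("only forward frames 1..3 are handled in this sketch")
    seq = seq.upper()
    start = frame - 1
    codons = (seq[i:i + 3] for i in range(start, len(seq) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


if __name__ == "__main__":
    frame3 = translate(QUERY_PREFIX, frame=3)
    # Nucleotide 39 is the first base of codon index (39 - 3) // 3 = 12 in frame +3.
    print(frame3[12:])
    print(translate(QUERY_PREFIX, frame=1))
```

The peptide printed for frame +3 from codon index 12 onward should line up with the start of the `Query: 39` alignment rows, gaps aside.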


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC018210A_C01 KMC018210A_c01
         (711 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAM60961.1| seed maturation-like protein [Arabidopsis thaliana]    202  3e-51
ref|NP_197001.1| seed maturation -like protein; protein id: At5g...   202  3e-51
ref|NP_179097.1| unknown protein; protein id: At2g14910.1, suppo...   107  1e-22
gb|AAF21309.1| seed maturation protein PM23 [Glycine max]              94  2e-18
gb|ZP_00056820.1| hypothetical protein [Thermobifida fusca]            69  5e-11

>gb|AAM60961.1| seed maturation-like protein [Arabidopsis thaliana]
          Length = 355

 Score =  202 bits (515), Expect = 3e-51
 Identities = 126/230 (54%), Positives = 154/230 (66%), Gaps = 6/230 (2%)
 Frame = +3

Query: 39  MAALSWRPF-ILSRLTDLSPNPLHPPKPPPLFLRRRRCF-----LTSCYADGFSSSSSSS 200
           MAA S R F +LSR+TDLS   L   +PPP     R  +     ++S       S    S
Sbjct: 1   MAAASARAFFMLSRVTDLSKKKLILHQPPPSSSPHRLPYAPNRAVSSSAVISCLSGGGVS 60

Query: 201 SSDDVVSTRKSTFDRGFTVIANMLRRIEPLDNSVISKGVSDAARDSMKQTISTMLGLLPS 380
           S D  VSTR+S  DRGF VIAN++ RI+PLD SVISKG+SD+A+DSMKQTIS+MLGLLPS
Sbjct: 61  SDDSYVSTRRSKLDRGFAVIANLVNRIQPLDTSVISKGLSDSAKDSMKQTISSMLGLLPS 120

Query: 381 DHFSVTVTVSKHPLHRLLVSSIITGYTLWNAEYRMSLTRNLDIASPCGARDSDCEKRSEI 560
           D FSV+VT+S+ PL+RLL+SSIITGYTLWNAEYR+SL RN DI  P   R  + E +S  
Sbjct: 121 DQFSVSVTISEQPLYRLLISSIITGYTLWNAEYRVSLRRNFDI--PIDPRKEE-EDQSSK 177

Query: 561 LEVKGGGEDGGEIEVASDLGLKDLENCSSSPRVFGDLPPQALNYIQQLQS 710
             V+ G E G    ++ DLG    E    SP+VFGDL P+AL+YIQ LQS
Sbjct: 178 DNVRFGSEKG----MSEDLGNCVEEFERLSPQVFGDLSPEALSYIQLLQS 223

>ref|NP_197001.1| seed maturation -like protein; protein id: At5g14970.1, supported
           by cDNA: 106301. [Arabidopsis thaliana]
           gi|11346180|pir||T51442 seed maturation-like protein -
           Arabidopsis thaliana gi|9755664|emb|CAC01816.1| seed
           maturation-like protein [Arabidopsis thaliana]
           gi|22655278|gb|AAM98229.1| seed maturation-like protein
           [Arabidopsis thaliana]
          Length = 355

 Score =  202 bits (515), Expect = 3e-51
 Identities = 126/230 (54%), Positives = 154/230 (66%), Gaps = 6/230 (2%)
 Frame = +3

Query: 39  MAALSWRPF-ILSRLTDLSPNPLHPPKPPPLFLRRRRCF-----LTSCYADGFSSSSSSS 200
           MAA S R F +LSR+TDLS   L   +PPP     R  +     ++S       S    S
Sbjct: 1   MAAASARAFFMLSRVTDLSKKKLILHQPPPSSSPHRLPYAPNRAVSSSAVISCLSGGGVS 60

Query: 201 SSDDVVSTRKSTFDRGFTVIANMLRRIEPLDNSVISKGVSDAARDSMKQTISTMLGLLPS 380
           S D  VSTR+S  DRGF VIAN++ RI+PLD SVISKG+SD+A+DSMKQTIS+MLGLLPS
Sbjct: 61  SDDSYVSTRRSKLDRGFAVIANLVNRIQPLDTSVISKGLSDSAKDSMKQTISSMLGLLPS 120

Query: 381 DHFSVTVTVSKHPLHRLLVSSIITGYTLWNAEYRMSLTRNLDIASPCGARDSDCEKRSEI 560
           D FSV+VT+S+ PL+RLL+SSIITGYTLWNAEYR+SL RN DI  P   R  + E +S  
Sbjct: 121 DQFSVSVTISEQPLYRLLISSIITGYTLWNAEYRVSLRRNFDI--PIDPRKEE-EDQSSK 177

Query: 561 LEVKGGGEDGGEIEVASDLGLKDLENCSSSPRVFGDLPPQALNYIQQLQS 710
             V+ G E G    ++ DLG    E    SP+VFGDL P+AL+YIQ LQS
Sbjct: 178 DNVRFGSEKG----MSEDLGNCVEEFERLSPQVFGDLSPEALSYIQLLQS 223

>ref|NP_179097.1| unknown protein; protein id: At2g14910.1, supported by cDNA:
           gi_17380883, supported by cDNA: gi_19698842, supported
           by cDNA: gi_20465934 [Arabidopsis thaliana]
           gi|25368604|pir||H84522 hypothetical protein At2g14910
           [imported] - Arabidopsis thaliana
           gi|3650033|gb|AAC61288.1| unknown protein [Arabidopsis
           thaliana] gi|17380884|gb|AAL36254.1| unknown protein
           [Arabidopsis thaliana] gi|19698843|gb|AAL91157.1|
           unknown protein [Arabidopsis thaliana]
           gi|20465935|gb|AAM20153.1| unknown protein [Arabidopsis
           thaliana]
          Length = 386

 Score =  107 bits (268), Expect = 1e-22
 Identities = 79/218 (36%), Positives = 113/218 (51%), Gaps = 12/218 (5%)
 Frame = +3

Query: 93  PNPLHPPKPP-------PLFLRRRRCFLTSCYADGFSSSSSSSSSDDVVSTRKSTFDRGF 251
           P  LH P  P       P F RR R    +  +   S++ SS+  DD  S    T     
Sbjct: 14  PQLLHKPTKPLPFLFLLPRFNRRFRSLTITSSSTTSSNNFSSNCGDDGFSLDDFTLHSDS 73

Query: 252 T-----VIANMLRRIEPLDNSVISKGVSDAARDSMKQTISTMLGLLPSDHFSVTVTVSKH 416
                 V++++++ IEPLD S+I K V     D+MK+TIS MLGLLPSD F V +     
Sbjct: 74  RSPKKCVLSDLIQEIEPLDVSLIQKDVPVTTLDAMKRTISGMLGLLPSDRFQVHIESLWE 133

Query: 417 PLHRLLVSSIITGYTLWNAEYRMSLTRNLDIASPCGARDSDCEKRSEILEVKGGGEDGGE 596
           PL +LLVSS++TGYTL NAEYR+ L +NLD++   G  DS   + +E  +++G   D   
Sbjct: 134 PLSKLLVSSMMTGYTLRNAEYRLFLEKNLDMSG--GGLDSHASENTE-YDMEGTFPDEDH 190

Query: 597 IEVASDLGLKDLENCSSSPRVFGDLPPQALNYIQQLQS 710
           +    D   ++L   +      G +  +A  YI +LQS
Sbjct: 191 VSSKRDSRTQNLSE-TIDEEGLGRVSSEAQEYILRLQS 227

>gb|AAF21309.1| seed maturation protein PM23 [Glycine max]
          Length = 404

 Score = 94.0 bits (232), Expect = 2e-18
 Identities = 67/179 (37%), Positives = 96/179 (53%), Gaps = 7/179 (3%)
 Frame = +3

Query: 195 SSSSDDVVSTRKSTFDRGFTVIANMLRRIEPLDNSVISKGVSDAARDSMKQTISTMLGLL 374
           ++SS D  S  K +      V+  +++ IEPLD S I K V     D+MK+TIS MLGLL
Sbjct: 75  AASSHDFASNSKKS------VLTELIQEIEPLDVSHIQKDVPPTTADAMKRTISGMLGLL 128

Query: 375 PSDHFSVTVTVSKHPLHRLLVSSIITGYTLWNAEYRMSLTRNLDIASPCGARDSDCEK-R 551
           PSD F V +     PL +LL+SS++TGYTL N EYR+ L +NLD+       + D EK +
Sbjct: 129 PSDQFHVVIEALWEPLSKLLISSMMTGYTLRNVEYRLCLEKNLDMF------EGDIEKPK 182

Query: 552 SEILEVKGGG---EDGGEIEVASDLGLKD-LENCSSSPRV--FGDLPPQALNYIQQLQS 710
           +E ++V   G   +    IE   +  L   +E       +   G++  +A  YI  LQS
Sbjct: 183 AESMKVDLQGLMHDSVNAIEFGKNKNLSSKVEKLHEEVDIQELGEISAEAQQYIFNLQS 241

>gb|ZP_00056820.1| hypothetical protein [Thermobifida fusca]
          Length = 754

 Score = 69.3 bits (168), Expect = 5e-11
 Identities = 54/140 (38%), Positives = 72/140 (50%), Gaps = 9/140 (6%)
 Frame = +1

Query: 61  PSSSHASPTSHPILSTHPSHHLSSSA-----AAAASSPPATPMGSRRPLLPHP---LTMS 216
           PS+  AS    P+ S  PS + ++S      A AASS   T   +  P  P P    T S
Sbjct: 178 PSTQPASSPPEPLRSASPSKNSTNSPSPHPPAPAASSSCPTSTANAPPTCPTPPAASTAS 237

Query: 217 SPPGNPPSTAASPSSPTCSAASSPSITPSSQRVSP-TPPETP*SRPSPPCSVCSPPITSL 393
             P +PP T  +PSS  CSAAS    TPSS +VSP T   +  + P+PP S  SPP +S 
Sbjct: 238 PEPTSPPPTWPAPSSKACSAASLTPSTPSSTKVSPSTVSSSSAAAPAPPQSAPSPPKSSA 297

Query: 394 SLSLFPSTPSIVFSSLPSSP 453
           + S  PS P    ++ P++P
Sbjct: 298 APS--PSQPKA--NTSPTAP 313

 Score = 60.8 bits (146), Expect = 2e-08
 Identities = 48/141 (34%), Positives = 69/141 (48%), Gaps = 7/141 (4%)
 Frame = +1

Query: 52  HG-DPSSSHASPTS-HPILSTHPSHHLSSSAAAAASSPPATPMGSRRPLLPHPLTMSSPP 225
           HG  PS+   SP S  P  +  P+ + ++     ASS P T   +  P  P+P   ++ P
Sbjct: 5   HGAKPSAQSPSPASPSPNSAGSPNTNPTTPPEPKASSCPTTGSPNALPKPPNPSPTAATP 64

Query: 226 GNPPSTAA--SPSSPTCSAASSPSITPSSQRVSPTPPETP*SRP---SPPCSVCSPPITS 390
             P +T    +P+ PTCS   SPS   S+ R SP PP++  + P   SPP +  +P   +
Sbjct: 65  PAPATTTPPPTPTGPTCSP--SPSAANSAPRESPPPPKSSAASPPPNSPPSTSSAPAPAT 122

Query: 391 LSLSLFPSTPSIVFSSLPSSP 453
                 PSTP    SS PS+P
Sbjct: 123 TWPPHSPSTPDPATSSSPSAP 143

 Score = 55.8 bits (133), Expect = 6e-07
 Identities = 50/142 (35%), Positives = 65/142 (45%), Gaps = 1/142 (0%)
 Frame = +1

Query: 31  PIPWQHSHGDPSSSHASPTSHPI-LSTHPSHHLSSSAAAAASSPPATPMGSRRPLLPHPL 207
           P P   +   P +S  S +S P   +T P H  S+   A +SSP A P     P  P P 
Sbjct: 97  PPPKSSAASPPPNSPPSTSSAPAPATTWPPHSPSTPDPATSSSPSAPP----EPPSPSP- 151

Query: 208 TMSSPPGNPPSTAASPSSPTCSAASSPSITPSSQRVSPTPPETP*SRPSPPCSVCSPPIT 387
              +PP  PP  A SP SPT   ASSPS  PS+Q  S +PPE   S      S  SP   
Sbjct: 152 --KNPPPTPP--APSPDSPTPPDASSPSSAPSTQPAS-SPPEPLRSASPSKNSTNSPSPH 206

Query: 388 SLSLSLFPSTPSIVFSSLPSSP 453
             + +   S P+   ++ P+ P
Sbjct: 207 PPAPAASSSCPTSTANAPPTCP 228

 Score = 55.5 bits (132), Expect = 7e-07
 Identities = 54/186 (29%), Positives = 80/186 (42%), Gaps = 23/186 (12%)
 Frame = +1

Query: 1   CRALFLPSFLPIPWQHSHGDPSSSHASPTSHPILSTHPSHHLSSSAAAAASSPPATPMGS 180
           C     P+ LP P      +PS + A+P + P  +T P      + + + S+  + P  S
Sbjct: 42  CPTTGSPNALPKP-----PNPSPTAATPPA-PATTTPPPTPTGPTCSPSPSAANSAPRES 95

Query: 181 RRPLLPHPLTMSSPPGNPPSTAA-----------SPSSPTCSAASSPSI-----TPSSQR 312
             P  P     S PP +PPST++           SPS+P  + +SSPS      +PS + 
Sbjct: 96  PPP--PKSSAASPPPNSPPSTSSAPAPATTWPPHSPSTPDPATSSSPSAPPEPPSPSPKN 153

Query: 313 VSPTPPETP*SRPSPPCSVCSPPITSLSLSLFPSTPSIVFSSLPS-------SPGTLCGM 471
             PTPP      P+PP +  S P ++ S     S P  + S+ PS       SP      
Sbjct: 154 PPPTPPAPSPDSPTPPDA--SSPSSAPSTQPASSPPEPLRSASPSKNSTNSPSPHPPAPA 211

Query: 472 RSTGCP 489
            S+ CP
Sbjct: 212 ASSSCP 217

 Score = 43.1 bits (100), Expect = 0.004
 Identities = 35/101 (34%), Positives = 49/101 (47%), Gaps = 1/101 (0%)
 Frame = +1

Query: 61  PSSSHASPTSHPILSTHPSHHLSSSAAAAASSPPATPMGSRRPLLPHPLTMSSPPGNPPS 240
           P++S ASP       T P+   SS A +AAS  P+TP  ++      P T+SS     P+
Sbjct: 231 PAASTASPEPTSPPPTWPAP--SSKACSAASLTPSTPSSTK----VSPSTVSSSSAAAPA 284

Query: 241 TAAS-PSSPTCSAASSPSITPSSQRVSPTPPETP*SRPSPP 360
              S PS P  SAA SPS   ++   +  P + P   P+ P
Sbjct: 285 PPQSAPSPPKSSAAPSPSQPKANTSPTAPPAKQPGHSPANP 325

 Score = 40.4 bits (93), Expect = 0.024
 Identities = 40/134 (29%), Positives = 54/134 (39%), Gaps = 9/134 (6%)
 Frame = +1

Query: 61  PSSSHASPTSHPILSTHPSHHLSSSAAAAASSPPATPMGSRRPLLPHPLTMSSPPGNPPS 240
           PSS+  SP++           +SSS+AAA + P + P   +    P P   S P  N   
Sbjct: 265 PSSTKVSPST-----------VSSSSAAAPAPPQSAPSPPKSSAAPSP---SQPKANTSP 310

Query: 241 TAASPSSPTCSAASSP---SITPSSQRVSPTPPE-----TP*SRPSPPCSVCSPPIT-SL 393
           TA     P  S A+ P     TP      PTPP      TP  +P    +  + P T   
Sbjct: 311 TAPPAKQPGHSPANPPHPNGKTPEPSSAKPTPPRRYANATP--KPETDSNRKTKPTTKEA 368

Query: 394 SLSLFPSTPSIVFS 435
            +S +  TP   FS
Sbjct: 369 HMSNYQPTPEDRFS 382

 Score = 33.1 bits (74), Expect = 3.9
 Identities = 28/97 (28%), Positives = 41/97 (41%)
 Frame = +1

Query: 163 ATPMGSRRPLLPHPLTMSSPPGNPPSTAASPSSPTCSAASSPSITPSSQRVSPTPPETP* 342
           A P G++ P    P   S  P +  S   +P++P    ASS   T S   +   P  +P 
Sbjct: 2   APPHGAK-PSAQSPSPASPSPNSAGSPNTNPTTPPEPKASSCPTTGSPNALPKPPNPSPT 60

Query: 343 SRPSPPCSVCSPPITSLSLSLFPSTPSIVFSSLPSSP 453
           +   P  +  +PP T    +  PS PS   S+   SP
Sbjct: 61  AATPPAPATTTPPPTPTGPTCSPS-PSAANSAPRESP 96

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 762,837,119
Number of Sequences: 1393205
Number of extensions: 21681379
Number of successful extensions: 294074
Number of sequences better than 10.0: 7497
Number of HSP's better than 10.0 without gapping: 128444
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 225390
length of database: 448,689,247
effective HSP length: 120
effective length of database: 281,504,647
effective search space used: 32654539052
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
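
The bit scores and E-values in the alignments above can be approximately recomputed from the raw scores (the numbers in parentheses after "bits") and the parameters in this footer. The sketch below uses the standard Karlin-Altschul relations, bits = (lambda * S - ln K) / ln 2 and E = search_space * 2^(-bits), with the gapped lambda and K and the "effective search space used" value. The function names are illustrative assumptions. Because lambda and K are printed to only a few digits, the results agree with the report only approximately.

```python
import math

# Values copied from the report footer (gapped statistics).
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
SEARCH_SPACE = 32654539052  # "effective search space used"


def bit_score(raw_score):
    """Convert a raw alignment score to a normalized bit score."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def expect_value(raw_score):
    """Expected number of chance hits scoring at least raw_score."""
    return SEARCH_SPACE * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    # Raw scores of the top HSP for each reported hit.
    for raw in (515, 268, 232, 168):
        print(f"raw={raw}  bits={bit_score(raw):.1f}  E={expect_value(raw):.1e}")
```

For the two strongest hits the recomputed bit scores come out slightly above the printed 202 and 107, which would be consistent with this BLAST version truncating rather than rounding scores of 100 or more. That reading is an inference from the numbers, not something the report states.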


EST assemble image


No.  Clone        Accession  Start  End
1    MFB012g04_f  BP034815       1  486
2    SPD030c09_f  BP046370     181  711
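
As a small sketch of how the two EST clones tile the contig, the snippet below computes their overlap and the total span they cover. It assumes the positions listed above are 1-based, inclusive coordinates on the 711-letter contig. The record does not say so explicitly, although the second clone ending at 711 fits that reading. The names `CLONES`, `overlap_length` and `covered_length` are illustrative.

```python
# Clone name -> (start, end) on the contig, assumed 1-based and inclusive.
CLONES = {
    "MFB012g04_f": (1, 486),
    "SPD030c09_f": (181, 711),
}


def overlap_length(a, b):
    """Number of contig positions shared by two inclusive intervals."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return max(0, end - start + 1)


def covered_length(intervals):
    """Number of contig positions covered by at least one interval."""
    total = 0
    current_end = 0  # last position already counted
    for start, end in sorted(intervals):
        if end <= current_end:
            continue  # interval lies entirely inside what was counted
        total += end - max(start, current_end + 1) + 1
        current_end = end
    return total


if __name__ == "__main__":
    a, b = CLONES["MFB012g04_f"], CLONES["SPD030c09_f"]
    print("overlap:", overlap_length(a, b))
    print("covered:", covered_length(CLONES.values()))
```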




Lotus japonicus
Kazusa DNA Research Institute