KMC015242A_c01

Fasta Sequence
>KMC015242A_C01 KMC015242A_c01
ctctcattgcagactccgacgcgcacccgcacccgcacccaaactctctttctcttgccg
cctccttcgatttccccaaaGCTCAAATCCCCAAAAACCCAATCATCACTATCTCCGTCT
ACAAATCCCCAACAACCCCCTCCTGCATCTTCACCTCCGCCAAACTCCTCGGCAAGGTCA
CGCTGCCTCTCGATCTCACCATGGCGGACTCGCGACCCTGCATGTTCCAAAACGGATGGC
TTCCTCTCGGCCGCAAAACACACAACACTCAGTTGTTGCACTTAACACTCCGGGCCGAAC
CGGACCCGAGGTTCGTTTTCCGCTTCGACGGTGAACCGGAGTGCAGCCCGCAGGTTTTTC
AAGTCAAAGGAGATGTTAAGCAGCCGGTTTTCACTTGCAAGTTCAGTTTCAGAGATAGAA
ACCCGGTTCAGTTCCCCTCCTCGACCACCGCGAACGAGCGGAAAGGATGGTCCATCACGG
TGCACGATTTATCGGGTTCTCCGGTTGCGGCGGCGTCTATGGCCACTCCGTTCGTTCCCT
CACCCGGTTCACAGCGGGTCAGCAAGTCCAACCCGGGAGCCTGGCTCATCATCCGACCCG
ACGGTGATTGGCACTTGAAGCCTTGGGGCCGCCTCGAGAGCGTGGGTGAAnccaacaact
ccaacgccgtcggg
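
The BLASTX hits reported for this contig are all in frame +3. As a minimal sketch of what that means, the snippet below translates the first 60 bases of the contig in frame +3 using the standard genetic code. The function name `translate` and the hand-built codon table are illustrative assumptions, not part of the original record; the residues from nucleotide 9 onward can be compared against the "Query: 9" alignment lines.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna, frame=1):
    """Translate a forward frame (1, 2 or 3); ambiguous codons become 'X'."""
    seq = dna.upper()
    start = frame - 1  # frame +3 starts at the third nucleotide
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X")
        for i in range(start, len(seq) - 2, 3)
    )


# First 60 nucleotides of KMC015242A_c01, copied from the FASTA record.
first_line = "ctctcattgcagactccgacgcgcacccgcacccgcacccaaactctctttctcttgccg"
print(translate(first_line, frame=3))
```

The first two codons of frame +3 cover nucleotides 3 to 8, so the third residue onward corresponds to query position 9 in the alignments.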


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC015242A_C01 KMC015242A_c01
         (674 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_175426.1| hypothetical protein; protein id: At1g50040.1, ...   218  6e-56
ref|NP_172473.1| unknown protein; protein id: At1g10020.1 [Arabi...   213  3e-54
gb|AAL60032.1| unknown protein [Arabidopsis thaliana]                 203  2e-51
ref|NP_188602.1| unknown protein; protein id: At3g19680.1, suppo...   203  2e-51
ref|NP_194660.1| putative protein; protein id: At4g29310.1, supp...   195  4e-49
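
For downstream filtering, the one-line hit summaries can be split into identifier, description, bit score and E-value. The sketch below assumes the column layout seen in this report (identifier, free-text description, integer score, E-value at the end of the line); the names `HIT_LINE` and `parse_hit` are hypothetical.

```python
import re

# Identifier, lazy description, integer bit score, E-value at end of line.
HIT_LINE = re.compile(r"^(\S+)\s+(.*?)\s+(\d+)\s+([\d.eE+-]+)\s*$")


def parse_hit(line):
    """Return (identifier, description, bit_score, e_value) or None."""
    match = HIT_LINE.match(line)
    if match is None:
        return None
    ident, desc, score, evalue = match.groups()
    return ident, desc, int(score), float(evalue)


# Two summary lines copied from the report.
lines = [
    "ref|NP_175426.1| hypothetical protein; protein id: At1g50040.1, ...   218  6e-56",
    "gb|AAL60032.1| unknown protein [Arabidopsis thaliana]                 203  2e-51",
]
for line in lines:
    print(parse_hit(line))
```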

>ref|NP_175426.1| hypothetical protein; protein id: At1g50040.1, supported by cDNA:
           gi_18389271, supported by cDNA: gi_20258934 [Arabidopsis
           thaliana] gi|25354742|pir||H96536 hypothetical protein
           F2J10.8 [imported] - Arabidopsis thaliana
           gi|8569096|gb|AAF76441.1|AC015445_8 ESTs gb|AI994059,
           gb|T43740 come from this gene. [Arabidopsis thaliana]
           gi|18389272|gb|AAL67079.1| unknown protein [Arabidopsis
           thaliana] gi|20258935|gb|AAM14183.1| unknown protein
           [Arabidopsis thaliana]
          Length = 460

 Score =  218 bits (555), Expect = 6e-56
 Identities = 123/235 (52%), Positives = 157/235 (66%), Gaps = 24/235 (10%)
 Frame = +3

Query: 42  NSLSLAASFDFPKAQIP------KNPIITISVYKSPTTPSCIFTSA---KLLGKVTLPLD 194
           N  ++AA F   K+QI       K  ++++ VY S  + SC F +A   KL+G+  + LD
Sbjct: 78  NVSTVAACFSLSKSQIETSLKKAKWSVLSVEVY-SRRSASCGFVAASGEKLIGRFQVTLD 136

Query: 195 LTMADSRPCMFQNGWLPLGRKTHNTQL------LHLTLRAEPDPRFVFRFDGEPECSPQV 356
           L  A+S+ C+  NGW+ LG K+ N +       LH+++R EPD RFVF+FDGEPECSPQV
Sbjct: 137 LKAAESKTCLAHNGWVDLGTKSKNNKKSGSDPELHVSVRVEPDTRFVFQFDGEPECSPQV 196

Query: 357 FQVKGDVKQPVFTCKFSFR---DRNPVQFPSSTTAN------ERKGWSITVHDLSGSPVA 509
           FQV+G+ KQ VFTCKF FR   DRN     SS T+       ERKGWSIT+HDLSGSPVA
Sbjct: 197 FQVQGNAKQAVFTCKFGFRNSGDRNLSLSLSSVTSGKEQFSKERKGWSITIHDLSGSPVA 256

Query: 510 AASMATPFVPSPGSQRVSKSNPGAWLIIRPDGDWHLKPWGRLESVGEXNNSNAVG 674
            ASM TPFVPSPGS RVS+S+PGAWLI+RPDG +  KPW RL++  E   S+ +G
Sbjct: 257 MASMVTPFVPSPGSNRVSRSSPGAWLILRPDG-YTWKPWVRLQAWREPGVSDVLG 310

>ref|NP_172473.1| unknown protein; protein id: At1g10020.1 [Arabidopsis thaliana]
           gi|7487490|pir||T00621 hypothetical protein T27I1.4 -
           Arabidopsis thaliana gi|3540181|gb|AAC34331.1| Unknown
           protein [Arabidopsis thaliana]
          Length = 461

 Score =  213 bits (541), Expect = 3e-54
 Identities = 111/234 (47%), Positives = 149/234 (63%), Gaps = 33/234 (14%)
 Frame = +3

Query: 39  PNSLSLAASFDFPKAQIPK---------NPIITISVYKSPTTPSCIFTSAKLLGKVTLPL 191
           P   +LAA+F    + I +          P + I +Y      +C   S +LL KV++PL
Sbjct: 63  PEIQTLAATFHLSSSDIQRLASRSIFTSKPCLKILIYTGRAGAACGVHSGRLLAKVSVPL 122

Query: 192 DLTMADSRPCMFQNGWLPLGR---KTHNTQLLHLTLRAEPDPRFVFRFDGEPECSPQVFQ 362
           DL+   S+PC+F NGW+ +G+   K+ ++   HL ++AEPDPRFVF+FDGEPECSPQV Q
Sbjct: 123 DLSGTQSKPCVFHNGWISVGKGAGKSSSSAQFHLNVKAEPDPRFVFQFDGEPECSPQVVQ 182

Query: 363 VKGDVKQPVFTCKFSFRD-----RNPVQFPSSTTAN----------------ERKGWSIT 479
           ++G+++QPVFTCKFS R      +     P+ T+ +                ERKGWSIT
Sbjct: 183 IQGNIRQPVFTCKFSCRHTGDRTQRSRSLPTETSVSRSWLNSFGSERERPGKERKGWSIT 242

Query: 480 VHDLSGSPVAAASMATPFVPSPGSQRVSKSNPGAWLIIRPDGDWHLKPWGRLES 641
           VHDLSGSPVA AS+ TPFV SPG+ RVS+SNPG+WLI+RP GD   +PWGRLE+
Sbjct: 243 VHDLSGSPVAMASIVTPFVASPGTDRVSRSNPGSWLILRP-GDCTWRPWGRLEA 295

>gb|AAL60032.1| unknown protein [Arabidopsis thaliana]
          Length = 491

 Score =  203 bits (517), Expect = 2e-51
 Identities = 122/268 (45%), Positives = 157/268 (58%), Gaps = 46/268 (17%)
 Frame = +3

Query: 9   ADSDAHPHPHPNSLSLAASFDFPKAQI------PKNPIITISVYKSPTTP--------SC 146
           ++S+       N  ++AA F   KAQI      PK  ++++  Y    +         SC
Sbjct: 67  SESETRCSSSGNVSTVAACFSLSKAQIEASLKKPKFSVLSVEAYSRGNSDGDDGVSGASC 126

Query: 147 IFTSA--KLLGKVTLPLDLTMADSRPCMFQNGWLPLGRK-----THNTQLLHLTLRAEPD 305
              +A  KLLG+  + LDL  A+++  +  NGW+ L  K     T +   LH+++R EPD
Sbjct: 127 GLATAGEKLLGRFEVSLDLKSAETKSFLAHNGWVALPSKKTKSKTGSDPELHVSVRVEPD 186

Query: 306 PRFVFRFDGEPECSPQVFQVKGDVKQPVFTCKFSFR-----DRNPVQFPSSTT------- 449
           PRFVF+FDGEPECSPQVFQV+G+ KQ VFTCKF  R     DRN +   S  +       
Sbjct: 187 PRFVFQFDGEPECSPQVFQVQGNTKQAVFTCKFGSRNSNSGDRNLLHSSSMMSEISSTRS 246

Query: 450 ------------ANERKGWSITVHDLSGSPVAAASMATPFVPSPGSQRVSKSNPGAWLII 593
                       + ERKGWSITVHDLSGSPVA ASM TPFVPSPGS RV++S+PGAWLI+
Sbjct: 247 CISSMNSEKEQPSKERKGWSITVHDLSGSPVAMASMVTPFVPSPGSNRVTRSSPGAWLIL 306

Query: 594 RPDG-DWHLKPWGRLESVGEXNNSNAVG 674
           RPDG  W  KPWGRLE+  E   S+ +G
Sbjct: 307 RPDGCTW--KPWGRLEAWREAGYSDTLG 332

>ref|NP_188602.1| unknown protein; protein id: At3g19680.1, supported by cDNA:
           gi_18176371 [Arabidopsis thaliana]
           gi|25354743|pir||T52398 hypothetical protein MMB12.17
           [imported] - Arabidopsis thaliana
           gi|9294435|dbj|BAB02555.1|
           gb|AAC34331.1~gene_id:MMB12.17~similar to unknown
           protein [Arabidopsis thaliana]
           gi|23297407|gb|AAN12962.1| unknown protein [Arabidopsis
           thaliana]
          Length = 491

 Score =  203 bits (517), Expect = 2e-51
 Identities = 122/268 (45%), Positives = 157/268 (58%), Gaps = 46/268 (17%)
 Frame = +3

Query: 9   ADSDAHPHPHPNSLSLAASFDFPKAQI------PKNPIITISVYKSPTTP--------SC 146
           ++S+       N  ++AA F   KAQI      PK  ++++  Y    +         SC
Sbjct: 67  SESETRCSSSGNVSTVAACFSLSKAQIEASLKKPKFSVLSVEAYSRGNSDGDDGVSGASC 126

Query: 147 IFTSA--KLLGKVTLPLDLTMADSRPCMFQNGWLPLGRK-----THNTQLLHLTLRAEPD 305
              +A  KLLG+  + LDL  A+++  +  NGW+ L  K     T +   LH+++R EPD
Sbjct: 127 GLATAGEKLLGRFEVSLDLKSAETKSFLAHNGWVALPSKKTKSKTGSDPELHVSVRVEPD 186

Query: 306 PRFVFRFDGEPECSPQVFQVKGDVKQPVFTCKFSFR-----DRNPVQFPSSTT------- 449
           PRFVF+FDGEPECSPQVFQV+G+ KQ VFTCKF  R     DRN +   S  +       
Sbjct: 187 PRFVFQFDGEPECSPQVFQVQGNTKQAVFTCKFGSRNSNSGDRNLLHSSSMMSEISSTRS 246

Query: 450 ------------ANERKGWSITVHDLSGSPVAAASMATPFVPSPGSQRVSKSNPGAWLII 593
                       + ERKGWSITVHDLSGSPVA ASM TPFVPSPGS RV++S+PGAWLI+
Sbjct: 247 CISSMNSEKEQPSKERKGWSITVHDLSGSPVAMASMVTPFVPSPGSNRVTRSSPGAWLIL 306

Query: 594 RPDG-DWHLKPWGRLESVGEXNNSNAVG 674
           RPDG  W  KPWGRLE+  E   S+ +G
Sbjct: 307 RPDGCTW--KPWGRLEAWREAGYSDTLG 332

>ref|NP_194660.1| putative protein; protein id: At4g29310.1, supported by cDNA:
           gi_20260629 [Arabidopsis thaliana]
           gi|7487111|pir||T13451 hypothetical protein T17A13.130 -
           Arabidopsis thaliana gi|7269829|emb|CAB79689.1| putative
           protein [Arabidopsis thaliana]
           gi|20260630|gb|AAM13213.1| unknown protein [Arabidopsis
           thaliana] gi|28059515|gb|AAO30065.1| unknown protein
           [Arabidopsis thaliana]
          Length = 424

 Score =  195 bits (496), Expect = 4e-49
 Identities = 113/217 (52%), Positives = 133/217 (61%), Gaps = 27/217 (12%)
 Frame = +3

Query: 105 ITISVYKSPTTPSCIFTSAKLLGKVTLPLDLTMADSRPCMFQNGWLPLGRKTHNTQL-LH 281
           + +SVY   T  +C   S KLLGKV + +DL  A SR   F NGW  LG         LH
Sbjct: 92  LRVSVYAGRTGHTCGVASGKLLGKVEVAVDLAAALSRTVAFHNGWKKLGGDGDKPSARLH 151

Query: 282 LTLRAEPDPRFVFRFDGEPECSPQVFQVKGDVKQPVFTCKFSFRDRN------PVQFPSS 443
           L + AEPDPRFVF+F GEPECSP V+Q++ ++KQPVF+CKFS  DRN      P  F  S
Sbjct: 152 LLVCAEPDPRFVFQFGGEPECSPVVYQIQDNLKQPVFSCKFS-SDRNGRSRSLPSGFTYS 210

Query: 444 TT----------------ANERKGWSITVHDLSGSPVAAASMATPFVPSPGSQRVSKSNP 575
           +                 A ERKGW IT+HDLSGSPVAAASM TPFV SPGS RVS+SNP
Sbjct: 211 SRGWITRTLSGDQWEKKQARERKGWMITIHDLSGSPVAAASMITPFVASPGSDRVSRSNP 270

Query: 576 GAWLIIRPDG----DWHLKPWGRLESVGEXNNSNAVG 674
           GAWLI+RP G     W  KPWGRLE+  E    + +G
Sbjct: 271 GAWLILRPHGTCVSSW--KPWGRLEAWRERGAIDGLG 305

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 692,303,925
Number of Sequences: 1393205
Number of extensions: 18283580
Number of successful extensions: 137899
Number of sequences better than 10.0: 896
Number of HSP's better than 10.0 without gapping: 84538
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 124197
length of database: 448,689,247
effective HSP length: 119
effective length of database: 282,897,852
effective search space used: 29704274460
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
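
The statistics above can be cross-checked with a few lines of arithmetic. The sketch below assumes the usual Karlin-Altschul relations (bit score S' = (lambda*S - ln K) / ln 2 and E = K*m*n*exp(-lambda*S)) and assumes that the effective lengths are obtained by subtracting the effective HSP length from the translated query and from every database sequence. Those assumptions reproduce the search space printed in the report; the function names are illustrative. Because lambda and K are printed in rounded form, the E-value agrees with the report only approximately.

```python
import math

# Gapped Karlin-Altschul parameters as printed in the report (rounded).
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410


def bit_score(raw_score):
    """Convert a raw alignment score to bits."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def effective_search_space(query_nt, db_letters, db_seqs, hsp_len):
    """Effective query length times effective database length (assumed)."""
    query_aa = query_nt // 3                      # 674 nt -> 224 residues
    eff_query = query_aa - hsp_len                # 224 - 119 = 105
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query * eff_db


def expect_value(raw_score, search_space):
    """E = K * m * n * exp(-lambda * S)."""
    return K_GAPPED * search_space * math.exp(-LAMBDA_GAPPED * raw_score)


space = effective_search_space(674, 448_689_247, 1_393_205, 119)
print(space)                      # compare with "effective search space used"
print(bit_score(555))             # top hit: reported as 218 bits (raw 555)
print(expect_value(555, space))   # top hit: reported Expect = 6e-56
```

With these inputs the computed search space is 29704274460, matching the value in the report, and truncating the computed bit scores for the raw scores 555, 541, 517 and 496 gives the listed 218, 213, 203 and 195.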


EST assemble image


No.  Clone         Accession  Position (start-end)
1    MWM063g02_f   AV765712   1-344
2    SPDL009b08_f  BP052513   181-674




Lotus japonicus
Kazusa DNA Research Institute