KMC009490A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC009490A_C01 KMC009490A_c01
gaactctgctcagaatgcttctctcctctatactcggaatcttcttcttcactcttcttc
caccaccgccgcaccaccacCACCACCACCGTCGCCACCTCACTCGCCGCCGTCACAAAT
CGTTTCCGGCCCAGGCTCTTCTCCTCCGATGAAACCCCCAAAAGCCCCGTTCCTCAAGAA
CCCGAAGAAATCAAGGACGTCGATAACAAAGAGTTCAAAAAGATGATTTGATCAATACCT
CAAGGGCGACGAGGAGGTTCTTCCGTTGATACAGGAGGCGATTCTGATACGGAGGTTATC
GGGGAAGCATGATGACACTGATGACGAGATGATGGATGAGCTCCGCATGGGGCCTCTGGA
TGATGTCAGTGACAGGGAGTTCGAGGAGGATTTCGAGGAGGCGCATCAGACCGATGAGGA
GATCGATGATCTGTATAATGCTAGGGATGTGGTGCAGAAGAGGATGGTTCATGATGAGTA
CTTCAACATGGATCCCAAGAAGTGGGATGAAATGGTTCAGGATGGGGTCAACCATGGTTT
TCTTAAGGATACCAAGTAGTGTGAGGAGA


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC009490A_C01 KMC009490A_c01
         (569 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_566603.1| expressed protein; protein id: At3g18240.1, sup...   138  4e-32
ref|NP_567628.1| putative protein; protein id: At4g21460.1, supp...   134  8e-31
pir||T05154 hypothetical protein F18E5.80 - Arabidopsis thaliana...   130  2e-29
pir||S23737 proline-rich protein precursor - kidney bean gi|2104...    63  2e-09
ref|NP_177518.1| hypothetical protein; protein id: At1g73770.1 [...    57  1e-07

>ref|NP_566603.1| expressed protein; protein id: At3g18240.1, supported by cDNA:
           gi_11908119, supported by cDNA: gi_13194809, supported
           by cDNA: gi_13265398 [Arabidopsis thaliana]
           gi|9279658|dbj|BAB01174.1|
           emb|CAA18710.1~gene_id:MIE15.3~strong similarity to
           unknown protein [Arabidopsis thaliana]
           gi|11692818|gb|AAG40012.1|AF324661_1 AT3g18240
           [Arabidopsis thaliana]
           gi|11908120|gb|AAG41489.1|AF326907_1 unknown protein
           [Arabidopsis thaliana]
           gi|13194810|gb|AAK15567.1|AF349520_1 unknown protein
           [Arabidopsis thaliana] gi|21536541|gb|AAM60873.1|
           unknown [Arabidopsis thaliana]
          Length = 419

 Score =  138 bits (348), Expect = 4e-32
 Identities = 77/195 (39%), Positives = 115/195 (58%), Gaps = 9/195 (4%)
 Frame = +2

Query: 11  QNASLLYTRNLLLHSSSTTAAPPPP---PPSPPHSPPSQIVSGPGSSPPMKPPKAPFLKN 181
           +NASL   R +L    +   +P  P   P + P +P  +  S          P++    +
Sbjct: 7   RNASLCARRIILSPRITHQISPNVPFLAPIAAPAAPKFRFFSSESGENSTTAPESSPTDS 66

Query: 182 PKKSRTSI----TKSSKR*FDQYL-KGDEEVLPLIQEAILIRRLSGKHDDTDDEMMDELR 346
           P+K    +     K  K   ++Y  +G+E+ LP + EA+L RRL  KH +TDDE+++++ 
Sbjct: 67  PEKKDLVVEDVSNKELKSRIEKYFNEGNEDALPGVIEALLQRRLVDKHAETDDELLEKIE 126

Query: 347 MGPL-DDVSDREFEEDFEEAHQTDEEIDDLYNARDVVQKRMVHDEYFNMDPKKWDEMVQD 523
             P  DDV D +FE DFEEAH TDEE++DLYN+ + V ++M  +E+FNMD KKWD M+++
Sbjct: 127 SLPFKDDVKDEDFESDFEEAHSTDEELEDLYNSPEYVAEKMRKNEFFNMDDKKWDHMIRE 186

Query: 524 GVNHGFLKDTK*CEE 568
           G+ HG L DTK CEE
Sbjct: 187 GIQHGCLTDTKECEE 201

>ref|NP_567628.1| putative protein; protein id: At4g21460.1, supported by cDNA:
           10221. [Arabidopsis thaliana]
          Length = 415

 Score =  134 bits (337), Expect = 8e-31
 Identities = 78/192 (40%), Positives = 111/192 (57%), Gaps = 6/192 (3%)
 Frame = +2

Query: 11  QNASLLYTRNLLLHSSSTTAAPPPPPPSPPHSPPSQIVSGP-GSSPPMKPPKAPFLKNPK 187
           +NASL   R ++L S  +   P   P + P  P  +  S   G +       +P   + K
Sbjct: 7   RNASLC-ARRIILSSRISPNVPFLTPIAAPAPPKFRFFSSESGENSTTATESSPTDSSDK 65

Query: 188 KSRTSITKSSK----R*FDQYLKGDEEVLPLIQEAILIRRLSGKHDDTDDEMMDELRMGP 355
           K       S+K    R    + +G+E+ LP + EA+L RRL  KH +TDDE+M+++   P
Sbjct: 66  KDLVVKDVSNKELKSRIDKSFNEGNEDALPGVIEALLQRRLVDKHAETDDELMEKIESLP 125

Query: 356 L-DDVSDREFEEDFEEAHQTDEEIDDLYNARDVVQKRMVHDEYFNMDPKKWDEMVQDGVN 532
             DDV D +FE DFEEAH TDEE++DLYN+ + V ++M   E+FNMD  KWD M+++G+ 
Sbjct: 126 FKDDVKDEDFESDFEEAHSTDEELEDLYNSPEYVAEKMRKKEFFNMDDNKWDHMIREGIQ 185

Query: 533 HGFLKDTK*CEE 568
           HG L DTK CEE
Sbjct: 186 HGCLTDTKQCEE 197

>pir||T05154 hypothetical protein F18E5.80 - Arabidopsis thaliana
           gi|3080390|emb|CAA18710.1| putative protein [Arabidopsis
           thaliana] gi|7268943|emb|CAB81253.1| putative protein
           [Arabidopsis thaliana]
          Length = 420

 Score =  130 bits (326), Expect = 2e-29
 Identities = 73/173 (42%), Positives = 102/173 (58%), Gaps = 6/173 (3%)
 Frame = +2

Query: 68  AAPPPPP-----PSPPHSPPSQIVSGPGSSPPMKPPKAPFLKNPKKSRTSITKSSKR*FD 232
           AAP PP           +  +   S P  S   K      + N K+ ++ I KS      
Sbjct: 38  AAPAPPKFRFFSSESGENSTTATESSPTDSSDKKDLVVKDVSN-KELKSRIDKS------ 90

Query: 233 QYLKGDEEVLPLIQEAILIRRLSGKHDDTDDEMMDELRMGPL-DDVSDREFEEDFEEAHQ 409
            + +G+E+ LP + EA+L RRL  KH +TDDE+M+++   P  DDV D +FE DFEEAH 
Sbjct: 91  -FNEGNEDALPGVIEALLQRRLVDKHAETDDELMEKIESLPFKDDVKDEDFESDFEEAHS 149

Query: 410 TDEEIDDLYNARDVVQKRMVHDEYFNMDPKKWDEMVQDGVNHGFLKDTK*CEE 568
           TDEE++DLYN+ + V ++M   E+FNMD  KWD M+++G+ HG L DTK CEE
Sbjct: 150 TDEELEDLYNSPEYVAEKMRKKEFFNMDDNKWDHMIREGIQHGCLTDTKQCEE 202

>pir||S23737 proline-rich protein precursor - kidney bean
           gi|21046|emb|CAA42942.1| proline-rich protein [Phaseolus
           vulgaris]
          Length = 297

 Score = 63.2 bits (152), Expect = 2e-09
 Identities = 33/91 (36%), Positives = 43/91 (46%)
 Frame = -1

Query: 515 PFHPTSWDPC*STHHEPSSSAPHP*HYTDHRSPHRSDAPPRNPPRTPCH*HHPEAPCGAH 336
           P HP ++ P  + HH      PHP H+  H  P      P +PP  P H H+P AP  AH
Sbjct: 28  PLHPPTYPPANAPHH------PHPHHH--HHHPPAPAPAPLHPPSPPSHPHYPPAP--AH 77

Query: 335 PSSRHQCHHASPITSVSESPPVSTEEPPRRP 243
           P + H  HH S       +PP+    PP +P
Sbjct: 78  PPTHHHHHHPSAPVHPPLNPPLVPVHPPLKP 108

 Score = 38.9 bits (89), Expect = 0.046
 Identities = 29/95 (30%), Positives = 36/95 (37%)
 Frame = -1

Query: 527 PHPEPFHPTSWDPC*STHHEPSSSAPHP*HYTDHRSPHRSDAPPRNPPRTPCH*HHPEAP 348
           P P P HP S  P    H+ P+ + P   H+  H  P     PP NPP  P H    + P
Sbjct: 55  PAPAPLHPPS--PPSHPHYPPAPAHPPTHHH--HHHPSAPVHPPLNPPLVPVH-PPLKPP 109

Query: 347 CGAHPSSRHQCHHASPITSVSESPPVSTEEPPRRP 243
              HP          P+      PPV    P + P
Sbjct: 110 VPIHPPLNPPVPVHPPV-----KPPVPVHPPVKPP 139

 Score = 28.1 bits (61), Expect(2) = 0.40
 Identities = 8/9 (88%), Positives = 8/9 (88%)
 Frame = +3

Query: 63  PPPHHHHHH 89
           PP HHHHHH
Sbjct: 78  PPTHHHHHH 86

 Score = 26.6 bits (57), Expect(2) = 0.40
 Identities = 12/28 (42%), Positives = 15/28 (52%)
 Frame = +2

Query: 77  PPPPPSPPHSPPSQIVSGPGSSPPMKPP 160
           PP P  PP +PP  +       PP+KPP
Sbjct: 108 PPVPIHPPLNPPVPV------HPPVKPP 129

>ref|NP_177518.1| hypothetical protein; protein id: At1g73770.1 [Arabidopsis
           thaliana] gi|25406293|pir||A96765 hypothetical protein
           F25P22.19 [imported] - Arabidopsis thaliana
           gi|12324213|gb|AAG52079.1|AC012679_17 hypothetical
           protein; 70159-70900 [Arabidopsis thaliana]
          Length = 191

 Score = 57.4 bits (137), Expect = 1e-07
 Identities = 30/69 (43%), Positives = 46/69 (66%), Gaps = 6/69 (8%)
 Frame = +2

Query: 245 GDEEVLPLIQEAILIRRLSGKHDDTDDEMMDELRMGPLDDVS-----DREFEED-FEEAH 406
           G+E+ +P + EA++IR+LSGKHDD+DDE+MD +R  P++D       D + E D   ++ 
Sbjct: 80  GNEDAIPDLFEALMIRKLSGKHDDSDDEVMDVVRKYPVNDAHKVDDIDSDIESDGHGDSS 139

Query: 407 QTDEEIDDL 433
            +D E DDL
Sbjct: 140 DSDIESDDL 148

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 567,978,702
Number of Sequences: 1393205
Number of extensions: 15345968
Number of successful extensions: 314016
Number of sequences better than 10.0: 3853
Number of HSP's better than 10.0 without gapping: 101309
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 223379
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 20956655091
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MF068a07_f BP031899 1 430
2 MR001a08_f BP075973 143 569




Lotus japonicus
Kazusa DNA Research Institute