KMC019095A_c01

Fasta Sequence
>KMC019095A_C01 KMC019095A_c01
gaaACAACTGAAGGTCTGATGCATCTCTATGGCTCAAAAATGGTAAAAGTTTATACACTC
AATTCTTGGTCATATAAAACTAAGATTAAACAACAGAGACTCTATTCAAATTCTCAGTGG
AAATACTATACCAATGTCATGAAAGTAAAACTAATGTTCCAGAAATAAAAAAATAAAAAA
AGAGGAAATTAACTAGCATAGGGTTGTCTAAGGCCAGATGCTTTCTTCATCATAGCAGCA
TGCATTTTTCCTGCAATATGATAATTATAAACTGTCTCACTATTACACACCACATTACAT
ACAGCACAGGTTCTAACTGCATTAGCTGCAGCCCCTCCTTCAAGAACCTTCTTTTTCTTA
ATCTCCAAATCCTCTGGAGTTTCAATTTTCCTCTTACTCTTAAGCTTACTGATAGATTTT
CCCTTGTCGTTCTGTTCTCGTGGTCCTATAACAGGATTAGACGACCCAGATGGATGAACT
TGGGCTGGTTTTAATGATTCTCTTAGTTTCTCCAAGTTCTTTTTGTGCCTCTTTCCTAAT
TTATGCTGGTCCAGTACATCCTTGCTGGTGCACTCTGTCTTGCAAACTTCACAATATGCA
GGTTGCACAGTCTTCATTTTAGGTTTCTTTGGGTATTTCTTCCATTTTCCAT


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC019095A_C01 KMC019095A_c01
         (652 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAK92570.1|AC074354_4 Hypothetical protein [Oryza sativa]          123  2e-27
ref|NP_200927.1| putative protein; protein id: At5g61190.1 [Arab...    78  1e-13
ref|NP_189579.1| hypothetical protein; protein id: At3g29330.1 [...    75  1e-12
dbj|BAB10381.1| emb|CAB71880.1~gene_id:MAF19.19~similar to unkno...    67  2e-10
ref|NP_179981.1| hypothetical protein; protein id: At2g24030.1 [...    47  3e-04
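The one-line summaries above share a fixed layout (identifier, truncated description, bit score, E value), so they can be pulled into records with a short script. The following is a minimal Python sketch that assumes exactly that layout; `parse_summary_line` is a hypothetical helper written for this record, not part of BLAST.

```python
import re

# Layout assumed from the summary lines above:
# identifier, free-text description, integer bit score, E value.
SUMMARY_RE = re.compile(r"^(\S+)\s+(.*?)\s+(\d+)\s+(\S+)$")


def parse_summary_line(line):
    """Split one hit-summary line into (identifier, description, bits, evalue)."""
    match = SUMMARY_RE.match(line.strip())
    if match is None:
        raise ValueError(f"not a summary line: {line!r}")
    identifier, description, bits, evalue = match.groups()
    # Assumption: very small E values may be printed without a leading digit
    # (for example "e-105"), so pad them before converting to float.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return identifier, description, int(bits), float(evalue)


hits = [parse_summary_line(s) for s in (
    "gb|AAK92570.1|AC074354_4 Hypothetical protein [Oryza sativa]          123  2e-27",
    "ref|NP_179981.1| hypothetical protein; protein id: At2g24030.1 [...    47  3e-04",
)]
```

Descriptions ending in "..." are truncated in the summary itself; the full text is only available in the per-hit header lines further down the report.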

>gb|AAK92570.1|AC074354_4 Hypothetical protein [Oryza sativa]
          Length = 421

 Score =  123 bits (309), Expect = 2e-27
 Identities = 64/145 (44%), Positives = 89/145 (61%), Gaps = 10/145 (6%)
 Frame = -3

Query: 626 KPKMKTVQPAYCEVCKTECTSKDVLDQHKLGKRHKKNLEKLRESLKPAQVHPSGSSNPVI 447
           K K K VQP  CEVCK +C + +VL  HK GK+HKKNLE+L++S+ P  V P  + N V 
Sbjct: 269 KKKPKVVQPLTCEVCKIQCDTPEVLRIHKTGKKHKKNLERLQDSITPKPVKPPSTPNTVA 328

Query: 446 -------GPREQNDKGKSISKLKSKRK---IETPEDLEIKKKKVLEGGAAANAVRTCAVC 297
                   P   +     I   ++K+K     TPE+LE+K+++VL+ GAA   V+ C VC
Sbjct: 329 LAANMAPDPVTTSVTTSVIPAAQTKKKKSAAATPEELEVKRRRVLDAGAAQGEVKICTVC 388

Query: 296 NVVCNSETVYNYHIAGKMHAAMMKK 222
           NVV NS+ VY +HI G+ H AM++K
Sbjct: 389 NVVVNSQKVYEFHIIGQKHKAMVQK 413
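The alignment above is reported in Frame = -3 with query coordinates running from 626 down to 222, meaning the matching peptide is read from the reverse complement of the nucleotide query. The Python sketch below reproduces that translation under two assumptions: the standard genetic code, and the usual BLAST convention that frame -1 starts at the last base of the query, -2 one base earlier, and -3 two bases earlier. The function names are illustrative only.

```python
# Query sequence copied from the Fasta Sequence section (case is ignored).
QUERY = (
    "gaaACAACTGAAGGTCTGATGCATCTCTATGGCTCAAAAATGGTAAAAGTTTATACACTC"
    "AATTCTTGGTCATATAAAACTAAGATTAAACAACAGAGACTCTATTCAAATTCTCAGTGG"
    "AAATACTATACCAATGTCATGAAAGTAAAACTAATGTTCCAGAAATAAAAAAATAAAAAA"
    "AGAGGAAATTAACTAGCATAGGGTTGTCTAAGGCCAGATGCTTTCTTCATCATAGCAGCA"
    "TGCATTTTTCCTGCAATATGATAATTATAAACTGTCTCACTATTACACACCACATTACAT"
    "ACAGCACAGGTTCTAACTGCATTAGCTGCAGCCCCTCCTTCAAGAACCTTCTTTTTCTTA"
    "ATCTCCAAATCCTCTGGAGTTTCAATTTTCCTCTTACTCTTAAGCTTACTGATAGATTTT"
    "CCCTTGTCGTTCTGTTCTCGTGGTCCTATAACAGGATTAGACGACCCAGATGGATGAACT"
    "TGGGCTGGTTTTAATGATTCTCTTAGTTTCTCCAAGTTCTTTTTGTGCCTCTTTCCTAAT"
    "TTATGCTGGTCCAGTACATCCTTGCTGGTGCACTCTGTCTTGCAAACTTCACAATATGCA"
    "GGTTGCACAGTCTTCATTTTAGGTTTCTTTGGGTATTTCTTCCATTTTCCAT"
).upper()

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq):
    """Translate complete codons only; unknown codons become 'X'."""
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


def translate_frame(seq, frame):
    """Translate in frame +1..+3 or -1..-3 (sign selects the strand)."""
    strand = seq if frame > 0 else reverse_complement(seq)
    return translate(strand[abs(frame) - 1:])


def minus_strand_peptide(seq, high, low):
    """Translate query positions high..low (1-based, inclusive) on the minus strand."""
    return translate(reverse_complement(seq[low - 1:high]))


frame_minus3 = translate_frame(QUERY, -3)
first_row = minus_strand_peptide(QUERY, 626, 447)
print(first_row)
```

`first_row` corresponds to the first Query row of the alignment (positions 626 to 447, 60 residues), and `frame_minus3` is the whole frame, including any stop codons shown as `*`.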

>ref|NP_200927.1| putative protein; protein id: At5g61190.1 [Arabidopsis thaliana]
          Length = 976

 Score = 77.8 bits (190), Expect = 1e-13
 Identities = 43/130 (33%), Positives = 63/130 (48%)
 Frame = -3

Query: 608 VQPAYCEVCKTECTSKDVLDQHKLGKRHKKNLEKLRESLKPAQVHPSGSSNPVIGPREQN 429
           V+P  C+VC+   T+ D    H  GK+H+ NLE          +    S N ++GP E +
Sbjct: 294 VEPLLCKVCQISFTNNDTYKNHTYGKKHRNNLE----------LQSGKSKNILVGPAEPS 343

Query: 428 DKGKSISKLKSKRKIETPEDLEIKKKKVLEGGAAANAVRTCAVCNVVCNSETVYNYHIAG 249
                          E  E   + KK ++E  A ANA   C +CNVVC S+ V+N H+ G
Sbjct: 344 K--------------EVLEKHNMNKKVMIESRAQANAEFVCLMCNVVCQSQIVFNSHLRG 389

Query: 248 KMHAAMMKKA 219
           K HA M+ ++
Sbjct: 390 KKHANMLSQS 399

 Score = 42.4 bits (98), Expect = 0.005
 Identities = 22/67 (32%), Positives = 37/67 (54%)
 Frame = -3

Query: 608 VQPAYCEVCKTECTSKDVLDQHKLGKRHKKNLEKLRESLKPAQVHPSGSSNPVIGPREQN 429
           +QP +C+VC+  C SK     H  GK+H++NLE   +S K    + + S+ P    ++  
Sbjct: 679 LQPVWCQVCQISCNSKVAFASHTYGKKHRQNLES--QSAK----NETMSTGPGKLSKDYG 732

Query: 428 DKGKSIS 408
           +K K +S
Sbjct: 733 EKTKKVS 739

 Score = 37.0 bits (84), Expect = 0.23
 Identities = 23/96 (23%), Positives = 43/96 (43%), Gaps = 2/96 (2%)
 Frame = -3

Query: 521 KNLEKLRESLKPAQVHPSGSSNPVIGPREQNDKGKSISKLKSKRKIETPEDLEIKKKKVL 342
           K + KL  S +     P+GS++     ++   + K + +            L+  +K +L
Sbjct: 182 KGVSKLPVSEQAQPSQPTGSTSNAGDTKDHKTREKHVPR----------GSLQENRKNML 231

Query: 341 E--GGAAANAVRTCAVCNVVCNSETVYNYHIAGKMH 240
           +   GA   +  TC +CNVVC+S   +  H++   H
Sbjct: 232 QHSSGATGESATTCRICNVVCDSFEKFTAHLSDIRH 267

 Score = 34.7 bits (78), Expect = 1.1
 Identities = 27/114 (23%), Positives = 48/114 (41%), Gaps = 14/114 (12%)
 Frame = -3

Query: 593 CEVCKTECTSKDVLDQHKLGKRHKKNLEKLRESLKPAQVHPSGSSNPVIGPREQNDKGKS 414
           C +C   C S+ V + H  GK+H   L +   +L  A +  +      +G +EQ  +  +
Sbjct: 370 CLMCNVVCQSQIVFNSHLRGKKHANMLSQSEATLDQALIVSTKLQEKGVGEKEQPSETVA 429

Query: 413 ISKLKSKRKIETPEDLE-IKKKKVLEGG-------------AAANAVRTCAVCN 294
             +L+S++  E    +  +  KK+ E G               A+A   C +CN
Sbjct: 430 ELQLQSQKAQEKQVPMVLVDSKKLPEKGDEVKGQPKEMTALRNASAKYICRMCN 483

>ref|NP_189579.1| hypothetical protein; protein id: At3g29330.1 [Arabidopsis
           thaliana]
          Length = 232

 Score = 74.7 bits (182), Expect = 1e-12
 Identities = 36/79 (45%), Positives = 51/79 (63%)
 Frame = -3

Query: 455 PVIGPREQNDKGKSISKLKSKRKIETPEDLEIKKKKVLEGGAAANAVRTCAVCNVVCNSE 276
           P+IGP+E        SK + +    T EDLE K+++V+E G +  ++R C +CNVVCNS+
Sbjct: 151 PLIGPQEN----PCTSKARKRGADSTTEDLESKRRRVVECGVSNESIRLCRICNVVCNSD 206

Query: 275 TVYNYHIAGKMHAAMMKKA 219
            VYN H+AG+ HAA   KA
Sbjct: 207 IVYNDHLAGQKHAAKAAKA 225

>dbj|BAB10381.1| emb|CAB71880.1~gene_id:MAF19.19~similar to unknown protein
           [Arabidopsis thaliana]
          Length = 996

 Score = 67.0 bits (162), Expect = 2e-10
 Identities = 38/118 (32%), Positives = 55/118 (46%)
 Frame = -3

Query: 608 VQPAYCEVCKTECTSKDVLDQHKLGKRHKKNLEKLRESLKPAQVHPSGSSNPVIGPREQN 429
           V+P  C+VC+   T+ D    H  GK+H+ NLE          +    S N ++GP E +
Sbjct: 294 VEPLLCKVCQISFTNNDTYKNHTYGKKHRNNLE----------LQSGKSKNILVGPAEPS 343

Query: 428 DKGKSISKLKSKRKIETPEDLEIKKKKVLEGGAAANAVRTCAVCNVVCNSETVYNYHI 255
                          E  E   + KK ++E  A ANA   C +CNVVC S+ V+N H+
Sbjct: 344 K--------------EVLEKHNMNKKVMIESRAQANAEFVCLMCNVVCQSQIVFNSHL 387

 Score = 42.4 bits (98), Expect = 0.005
 Identities = 22/67 (32%), Positives = 37/67 (54%)
 Frame = -3

Query: 608 VQPAYCEVCKTECTSKDVLDQHKLGKRHKKNLEKLRESLKPAQVHPSGSSNPVIGPREQN 429
           +QP +C+VC+  C SK     H  GK+H++NLE   +S K    + + S+ P    ++  
Sbjct: 699 LQPVWCQVCQISCNSKVAFASHTYGKKHRQNLES--QSAK----NETMSTGPGKLSKDYG 752

Query: 428 DKGKSIS 408
           +K K +S
Sbjct: 753 EKTKKVS 759

 Score = 39.7 bits (91), Expect = 0.036
 Identities = 31/130 (23%), Positives = 62/130 (46%), Gaps = 2/130 (1%)
 Frame = -3

Query: 593 CEVCKTECTSKDVLDQHKLGKRHKKNLE-KLRESLKPAQVHPSGSSNPV-IGPREQNDKG 420
           C +C   C S+ V + H         +  KL+E     +  PS +   + +  ++  +K 
Sbjct: 370 CLMCNVVCQSQIVFNSHLRALDQALIVSTKLQEKGVGEKEQPSETVAELQLQSQKAQEKQ 429

Query: 419 KSISKLKSKRKIETPEDLEIKKKKVLEGGAAANAVRTCAVCNVVCNSETVYNYHIAGKMH 240
             +  + SK+  E  ++++ + K+ +     A+A   C +CNV C+S  V+  H+ G+ H
Sbjct: 430 VPMVLVDSKKLPEKGDEVKGQPKE-MTALRNASAKYICRMCNVGCHSPIVFETHLRGQKH 488

Query: 239 AAMMKKASGL 210
           AA + ++  L
Sbjct: 489 AANLNQSKAL 498

 Score = 37.7 bits (86), Expect = 0.14
 Identities = 26/95 (27%), Positives = 40/95 (41%), Gaps = 19/95 (20%)
 Frame = -3

Query: 443 PREQNDKGKSISKLKSKRKIETPEDLEIKKKKV------LEGGAAANAVRT--------- 309
           PRE ++    I K  +  K ET    E+KKKK+      +    + ++V T         
Sbjct: 640 PREASECFDGIVKPVNLSKGETKHSWEVKKKKIEVTAAFVASNGSQSSVSTNPLKEPEGL 699

Query: 308 ----CAVCNVVCNSETVYNYHIAGKMHAAMMKKAS 216
               C VC + CNS+  +  H  GK H   ++  S
Sbjct: 700 QPVWCQVCQISCNSKVAFASHTYGKKHRQNLESQS 734

 Score = 37.0 bits (84), Expect = 0.23
 Identities = 23/96 (23%), Positives = 43/96 (43%), Gaps = 2/96 (2%)
 Frame = -3

Query: 521 KNLEKLRESLKPAQVHPSGSSNPVIGPREQNDKGKSISKLKSKRKIETPEDLEIKKKKVL 342
           K + KL  S +     P+GS++     ++   + K + +            L+  +K +L
Sbjct: 182 KGVSKLPVSEQAQPSQPTGSTSNAGDTKDHKTREKHVPR----------GSLQENRKNML 231

Query: 341 E--GGAAANAVRTCAVCNVVCNSETVYNYHIAGKMH 240
           +   GA   +  TC +CNVVC+S   +  H++   H
Sbjct: 232 QHSSGATGESATTCRICNVVCDSFEKFTAHLSDIRH 267

>ref|NP_179981.1| hypothetical protein; protein id: At2g24030.1 [Arabidopsis
           thaliana] gi|25412186|pir||G84631 hypothetical protein
           At2g24030 [imported] - Arabidopsis thaliana
           gi|3738330|gb|AAC63671.1| hypothetical protein
           [Arabidopsis thaliana]
          Length = 440

 Score = 46.6 bits (109), Expect = 3e-04
 Identities = 33/131 (25%), Positives = 55/131 (41%), Gaps = 6/131 (4%)
 Frame = -3

Query: 596 YCEVCKTECTSKDVLDQHKLGKRHKKNLEKLRESLKPAQVHPSGSSNPVIGPR------E 435
           YC++         V+  H+LGK+HK  + +  E+ + A    S +S     P        
Sbjct: 289 YCQI---------VMRDHELGKKHKAAVTQQNETPEAASTSLSPASVTAPQPEAIRVCEN 339

Query: 434 QNDKGKSISKLKSKRKIETPEDLEIKKKKVLEGGAAANAVRTCAVCNVVCNSETVYNYHI 255
            N +G+ + ++ +K         E KKK+ +           C  CN+  NSE     H 
Sbjct: 340 ANPQGQKVDEIAAKETTGKKTKGEKKKKETV----------WCKTCNIQTNSEQTMRNHT 389

Query: 254 AGKMHAAMMKK 222
            GK H A+++K
Sbjct: 390 LGKKHMALLEK 400

 Score = 34.7 bits (78), Expect = 1.1
 Identities = 18/53 (33%), Positives = 26/53 (48%)
 Frame = -3

Query: 650 GKWKKYPKKPKMKTVQPAYCEVCKTECTSKDVLDQHKLGKRHKKNLEKLRESL 492
           GK  K  KK K    +  +C+ C  +  S+  +  H LGK+H   LEK +  L
Sbjct: 357 GKKTKGEKKKK----ETVWCKTCNIQTNSEQTMRNHTLGKKHMALLEKQQNKL 405

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 540,574,469
Number of Sequences: 1393205
Number of extensions: 11288095
Number of successful extensions: 37292
Number of sequences better than 10.0: 121
Number of HSP's better than 10.0 without gapping: 35122
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 37167
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 27860523586
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
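
The figures in this footer can be cross-checked against the scores reported for each hit. The Python sketch below uses the standard Karlin-Altschul relations, bit score S' = (lambda * S - ln K) / ln 2 and E = N * 2^(-S'), with the gapped lambda and K from the footer and N taken to be the "effective search space used". Treating the effective database length as total letters minus (number of sequences x effective HSP length) is an assumption, although it reproduces the reported value exactly.

```python
import math

# Values copied from the report footer.
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 118
SEARCH_SPACE = 27_860_523_586
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def effective_db_length(letters, sequences, hsp_length):
    """Assumed edge correction: remove one HSP length per database sequence."""
    return letters - sequences * hsp_length


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space=SEARCH_SPACE):
    """Expected number of chance hits scoring at least raw_score."""
    return search_space * 2 ** (-bit_score(raw_score))


eff_db = effective_db_length(DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH)
print(eff_db)                      # compare with "effective length of database"
print(SEARCH_SPACE / eff_db)       # implied effective query length in residues
for raw in (309, 190, 182):        # raw scores of the three best HSPs
    print(raw, round(bit_score(raw), 1), f"{expect_value(raw):.1e}")
```

Because lambda and K are printed to only three significant figures, the recomputed values should agree with the report to within rounding rather than digit for digit.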


EST assembly


no.  clone        accession  start  end
1    MFB060f01_f  BP038366       1  504
2    MFB012b10_f  BP034776       4  541
3    MFB007f10_f  BP034432      94  652
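
The positions listed for the three ESTs show how the 652-letter contig is tiled. The Python sketch below computes per-base coverage depth, assuming the two numbers in each row are 1-based, inclusive start and end coordinates on the contig (the table itself does not state this).

```python
CONTIG_LENGTH = 652  # "(652 letters)" in the BLASTX query header

# (clone, accession, start, end) copied from the table above.
ESTS = [
    ("MFB060f01_f", "BP038366", 1, 504),
    ("MFB012b10_f", "BP034776", 4, 541),
    ("MFB007f10_f", "BP034432", 94, 652),
]


def depth_profile(ests, length):
    """Return a list with the number of ESTs covering each contig position."""
    depth = [0] * length
    for _clone, _accession, start, end in ests:
        for pos in range(start - 1, end):  # convert to 0-based indices
            depth[pos] += 1
    return depth


depth = depth_profile(ESTS, CONTIG_LENGTH)
print("uncovered positions:", depth.count(0))
for n in (1, 2, 3):
    print(f"positions covered by {n} EST(s):", depth.count(n))
```

Under that assumption every contig position is covered by at least one EST, and the middle of the contig is supported by all three.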




Lotus japonicus
Kazusa DNA Research Institute