KMC011158A_c01

Fasta Sequence
>KMC011158A_C01 KMC011158A_c01
cagaatatgctttaggtttattagtaaccaatttgaagtttggtctaccatccctaacat
aaaggatcagcctgttgtgtAGCTGTCTTGTTTCAAACCCCTTCAAAACTATGGATCACG
AATCAACTATCACACATCCAATTGACGCTTTCGGTGACCAATTACGCGTTTGGATTAACT
ACTTTTGTATCGGCCTGAAATTCAAAAGTTACTCACATTAGCTTCTCCTGAGAATTTATC
CTAGCTTCAGACTTAAAATAAAACCTTCCTATTCAATTCAGGTCATCAAGCTAAAAAAAC
GTATGCACATGGACTACAAGGGCAAGTTATAATAAGAGCACCTAAGACACTGCCTCCACT
AACACAAGCCATCACTGAACATCCCCATATACAACAAACTTGGACAGTGAAATCAGTAAT
TAAGCAATGAATATTTCAGTCTCAAAACAATCTTCCCTTTTTCAACACCAAGCTTTCCAC
CAGTTACTAGCCAATGGCCAGGAGGATCTTGCGGTCCTTTGCTCATTTCAGACAAGTCAA
CATACTTAACCAATTCGTTCCCTGGGGTGGTGTTTTCTCTAAAACTAATGCTCGATTCAT
CTGGGTTGGAAGTGTTCCCCACACTCACCGTTGGGAGTGGCTGGTTGGGGATGTGATCCC
ACAGTGATCTTCGGATGGTGCAGCCAGGTAACCTTGAGTACAAGAGCTTCATGTACAGTA
CATTCCGTGATCCGAAATCCCAAACTCCAAGCTGCGCTCCCGTGACTACGCAGACAC


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC011158A_C01 KMC011158A_c01
         (777 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_564333.1| expressed protein; protein id: At1g29690.1, sup...   152  5e-36
dbj|BAB56041.1| P0481E12.26 [Oryza sativa (japonica cultivar-gro...   150  2e-35
pir||T09892 hypothetical protein T22A6.120 - Arabidopsis thalian...    88  2e-16
pir||H86281 protein F10B6.18 [imported] - Arabidopsis thaliana g...    87  2e-16
ref|NP_172931.1| hypothetical protein; protein id: At1g14780.1, ...    87  2e-16

>ref|NP_564333.1| expressed protein; protein id: At1g29690.1, supported by cDNA:
           gi_15809819, supported by cDNA: gi_18650617 [Arabidopsis
           thaliana] gi|25333974|pir||C86420 unknown protein,
           124288-121737 [imported] - Arabidopsis thaliana
           gi|12323548|gb|AAG51760.1|AC068667_39 unknown protein;
           124288-121737 [Arabidopsis thaliana]
           gi|18650618|gb|AAL75908.1| At1g29690/F15D2_24
           [Arabidopsis thaliana]
          Length = 561

 Score =  152 bits (384), Expect = 5e-36
 Identities = 77/120 (64%), Positives = 90/120 (74%)
 Frame = -3

Query: 775 VCVVTGAQLGVWDFGSRNVLYMKLLYSRLPGCTIRRSLWDHIPNQPLPTVSVGNTSNPDE 596
           V +VTGAQLGVWDFGS+NVL++KLL+S++PGCTIRRS+WDH P      +  G  S    
Sbjct: 446 VHIVTGAQLGVWDFGSKNVLHLKLLFSKVPGCTIRRSVWDHTPVASSGRLEPGGPS---- 501

Query: 595 SSISFRENTTPGNELVKYVDLSEMSKGPQDPPGHWLVTGGKLGVEKGKIVLRLKYSLLNY 416
           +S S  E +    +L K VD SEM KGPQD PGHWLVTG KLGVEKGKIVLR+KYSLLNY
Sbjct: 502 TSSSTEEVSGQSGKLAKIVDSSEMLKGPQDLPGHWLVTGAKLGVEKGKIVLRVKYSLLNY 561
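
"Frame = -3" means the aligned query residues come from translating the reverse complement of the nucleotide query, starting at its third base (query position 775, counting down). The sketch below is one way to reproduce the first residues of the alignment above in plain Python. The function and variable names are illustrative and not part of BLAST, and QUERY_TAIL is simply the last line of the FASTA record; using only the tail works here because it ends at the 3' end of the query.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def translate_frame(dna, frame):
    """Translate dna in a BLAST-style frame (+1..+3 or -1..-3).

    Negative frames are read from the reverse complement. Codons that
    contain characters other than A, C, G, T are translated as 'X'.
    """
    if frame not in (1, 2, 3, -1, -2, -3):
        raise ValueError("frame must be one of +1, +2, +3, -1, -2, -3")
    seq = dna.upper() if frame > 0 else reverse_complement(dna)
    start = abs(frame) - 1
    codons = (seq[i:i + 3] for i in range(start, len(seq) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Last line of the FASTA record above (3' end of the 777-letter query).
QUERY_TAIL = "CATTCCGTGATCCGAAATCCCAAACTCCAAGCTGCGCTCCCGTGACTACGCAGACAC"
peptide = translate_frame(QUERY_TAIL, -3)
print(peptide)
```

The printed peptide can be compared with the start of the "Query: 775" line in the first alignment.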

>dbj|BAB56041.1| P0481E12.26 [Oryza sativa (japonica cultivar-group)]
          Length = 553

 Score =  150 bits (379), Expect = 2e-35
 Identities = 75/120 (62%), Positives = 91/120 (75%)
 Frame = -3

Query: 775 VCVVTGAQLGVWDFGSRNVLYMKLLYSRLPGCTIRRSLWDHIPNQPLPTVSVGNTSNPDE 596
           V +VTGAQLGVWDFG+++VL++KLL+SR+PGCTIRRS+WDH P+  L           DE
Sbjct: 445 VYIVTGAQLGVWDFGAKSVLHLKLLFSRVPGCTIRRSVWDHSPSSSL-------VHRTDE 497

Query: 595 SSISFRENTTPGNELVKYVDLSEMSKGPQDPPGHWLVTGGKLGVEKGKIVLRLKYSLLNY 416
           +S S  +N     +LVK VD++E  KGPQD PGHWLVTG KLGVEKGKIV+R KYSLLNY
Sbjct: 498 ASSSSSDNA----KLVKIVDMTETLKGPQDAPGHWLVTGAKLGVEKGKIVVRAKYSLLNY 553

>pir||T09892 hypothetical protein T22A6.120 - Arabidopsis thaliana
           gi|5051771|emb|CAB45064.1| putative protein [Arabidopsis
           thaliana] gi|7269279|emb|CAB79339.1| putative protein
           [Arabidopsis thaliana]
          Length = 606

 Score = 87.8 bits (216), Expect = 2e-16
 Identities = 52/134 (38%), Positives = 78/134 (57%), Gaps = 19/134 (14%)
 Frame = -3

Query: 769 VVTGAQLGVWDFGSRNVLYMKLLYSRLPGCT-IRRSLWDHI----PNQPLPTVSVGNTSN 605
           VVTGAQL V   G +NVL+++L +SR+ G T ++ S WD      P   L +  + +   
Sbjct: 457 VVTGAQLHVESHGFKNVLFLRLCFSRVVGATLVKNSEWDEAVGFAPKSGLISTLISHHFT 516

Query: 604 ------PDESSISFRENTTPGN--------ELVKYVDLSEMSKGPQDPPGHWLVTGGKLG 467
                 P  + ++      PG         +L+K+VD SEM++GPQ+ PG+W+V+G +L 
Sbjct: 517 AAQKPPPRPADVNINSAIYPGGPPVPTQAPKLLKFVDTSEMTRGPQESPGYWVVSGARLL 576

Query: 466 VEKGKIVLRLKYSL 425
           VEKGKI L++KYSL
Sbjct: 577 VEKGKISLKVKYSL 590

>pir||H86281 protein F10B6.18 [imported] - Arabidopsis thaliana
           gi|8778214|gb|AAF79223.1|AC006917_8 F10B6.18
           [Arabidopsis thaliana]
          Length = 645

 Score = 87.4 bits (215), Expect = 2e-16
 Identities = 50/138 (36%), Positives = 81/138 (58%), Gaps = 21/138 (15%)
 Frame = -3

Query: 769 VVTGAQLGVWDFGSRNVLYMKLLYSRLPGCTIRRSLWDHIP-------------NQPLPT 629
           +VTGAQL V   GS++VL+++L Y+++    + ++ W H P             + PL +
Sbjct: 500 IVTGAQLEVKKHGSKSVLHLRLRYTKVSDHYVVQNSWVHGPIGTSQKSGIFSSMSMPLTS 559

Query: 628 VSVG-NTSNPDESSISFRENTTPG-------NELVKYVDLSEMSKGPQDPPGHWLVTGGK 473
            SV  N    D++ +       PG       N++VK+VDLS++ +GPQ  PGHWLVTG +
Sbjct: 560 GSVHHNMIQKDKNEVVLDSGVFPGGPPVPANNKIVKFVDLSQLCRGPQHSPGHWLVTGVR 619

Query: 472 LGVEKGKIVLRLKYSLLN 419
           L ++KGK+ L +K++LL+
Sbjct: 620 LYLDKGKLCLHVKFALLH 637

>ref|NP_172931.1| hypothetical protein; protein id: At1g14780.1, supported by cDNA:
           gi_20466443 [Arabidopsis thaliana]
           gi|20466444|gb|AAM20539.1| unknown protein [Arabidopsis
           thaliana] gi|27311987|gb|AAO00959.1| unknown protein
           [Arabidopsis thaliana]
          Length = 627

 Score = 87.4 bits (215), Expect = 2e-16
 Identities = 50/138 (36%), Positives = 81/138 (58%), Gaps = 21/138 (15%)
 Frame = -3

Query: 769 VVTGAQLGVWDFGSRNVLYMKLLYSRLPGCTIRRSLWDHIP-------------NQPLPT 629
           +VTGAQL V   GS++VL+++L Y+++    + ++ W H P             + PL +
Sbjct: 482 IVTGAQLEVKKHGSKSVLHLRLRYTKVSDHYVVQNSWVHGPIGTSQKSGIFSSMSMPLTS 541

Query: 628 VSVG-NTSNPDESSISFRENTTPG-------NELVKYVDLSEMSKGPQDPPGHWLVTGGK 473
            SV  N    D++ +       PG       N++VK+VDLS++ +GPQ  PGHWLVTG +
Sbjct: 542 GSVHHNMIQKDKNEVVLDSGVFPGGPPVPANNKIVKFVDLSQLCRGPQHSPGHWLVTGVR 601

Query: 472 LGVEKGKIVLRLKYSLLN 419
           L ++KGK+ L +K++LL+
Sbjct: 602 LYLDKGKLCLHVKFALLH 619

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 674,692,572
Number of Sequences: 1393205
Number of extensions: 14706547
Number of successful extensions: 38938
Number of sequences better than 10.0: 24
Number of HSP's better than 10.0 without gapping: 37280
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 38914
length of database: 448,689,247
effective HSP length: 121
effective length of database: 280,111,442
effective search space used: 38375267554
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
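
The bit scores and E values reported for each hit can be related to the statistical parameters listed above through the standard Karlin-Altschul formulas: bits = (Lambda * S - ln K) / ln 2 and E = (effective search space) * 2^(-bits), where S is the raw score shown in parentheses. The sketch below is a minimal illustration in Python, not BLAST's own code. It assumes the gapped Lambda and K apply to these alignments, and the function names are made up for the example.

```python
import math

# Values copied from the report footer above.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
EFFECTIVE_SEARCH_SPACE = 38375267554


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space=EFFECTIVE_SEARCH_SPACE):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


# Top hit: "Score = 152 bits (384), Expect = 5e-36".
top_bits = bit_score(384)
top_evalue = expect_value(top_bits)
print(f"{top_bits:.1f} bits, E = {top_evalue:.0e}")
```

Because Lambda and K are printed to only three significant figures, the recomputed values agree with the report only approximately.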


EST assemble image


no.  clone          accession  start  end
1    MPDL057a02_f   AV779366       1  551
2    MF098a04_f     BP033380     243  759
3    MPD004g01_f    AV770281     249  781




Lotus japonicus
Kazusa DNA Research Institute