KMC004724A_c01

Fasta Sequence
>KMC004724A_C01 KMC004724A_c01
AGCAAATAATAATGATGCTTGAAGCCATTTTATTTTAATATTTCAATTTTACATTTCTAA
CTATTAGTCTGTTACAAGAAGGTGATGGCTATTTAGGCATGCATTTTCACTTGGATGGTT
TGCTGTGGAGATCCTTTCCTTACACTTAGATTCAGTGTTTTGGAGATTTGCTTTTGAGGA
TGTGTAACAATAACATCATAATCTCCATGATGTAGTGAAATATCAACAATTCCTCTGCTA
TCTGCTTTGGCTTCATGAGGCCCAGTTCCCCACTCACGAATCAGCATGTCTACAACATCT
CCAATTGGAGTATTTTTGAAGTTTTCATCTGCTAAGGGTGCCTTCTTGAAACCCGCTTGC
ACTGGACCCATGAACATTATAATCCCTTGAACAGCAGGATGAGCATAACCCTCTCTTAGA
ATCCAGTCAACATAATCTTCCTGATGT
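
Every BLASTX hit reported for this contig is in frame -2, so the aligned protein is read from the reverse-complement strand, starting one base in from its 5' end. The following is a minimal sketch of that translation, assuming the standard genetic code. The helper names `reverse_complement` and `translate_frame` are illustrative and are not part of the Kazusa pipeline or of BLAST.

```python
# Contig sequence copied from the FASTA record above (447 nt).
SEQ = (
    "AGCAAATAATAATGATGCTTGAAGCCATTTTATTTTAATATTTCAATTTTACATTTCTAA"
    "CTATTAGTCTGTTACAAGAAGGTGATGGCTATTTAGGCATGCATTTTCACTTGGATGGTT"
    "TGCTGTGGAGATCCTTTCCTTACACTTAGATTCAGTGTTTTGGAGATTTGCTTTTGAGGA"
    "TGTGTAACAATAACATCATAATCTCCATGATGTAGTGAAATATCAACAATTCCTCTGCTA"
    "TCTGCTTTGGCTTCATGAGGCCCAGTTCCCCACTCACGAATCAGCATGTCTACAACATCT"
    "CCAATTGGAGTATTTTTGAAGTTTTCATCTGCTAAGGGTGCCTTCTTGAAACCCGCTTGC"
    "ACTGGACCCATGAACATTATAATCCCTTGAACAGCAGGATGAGCATAACCCTCTCTTAGA"
    "ATCCAGTCAACATAATCTTCCTGATGT"
)

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def translate_frame(seq, frame):
    """Translate seq in frame 1..3 (forward) or -1..-3 (reverse strand).

    Codons containing anything other than A/C/G/T are rendered as 'X'.
    """
    if frame not in (1, 2, 3, -1, -2, -3):
        raise ValueError("frame must be one of 1, 2, 3, -1, -2, -3")
    strand = seq if frame > 0 else reverse_complement(seq)
    offset = abs(frame) - 1
    codons = (strand[i:i + 3] for i in range(offset, len(strand) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


if __name__ == "__main__":
    # First residues of the frame -2 translation.
    print(translate_frame(SEQ, -2)[:10])
```

The first ten residues printed should be `HQEDYVDWIL`, which matches the start of the Query line (position 446 downward) in the first alignment of the Nr search.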


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC004724A_C01 KMC004724A_c01
         (447 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_195107.1| glycosyl hydrolase family 10; protein id: At4g3...    94  5e-19
pir||B85398 hypothetical protein AT4g33820 [imported] - Arabidop...    91  2e-18
ref|NP_680761.1| glycosyl hydrolase family 10; protein id: At4g3...    91  2e-18
ref|NP_179076.1| glycosyl hydrolase family 10; protein id: At2g1...    87  5e-17
pir||T05212 hypothetical protein F17I5.30 - Arabidopsis thaliana...    87  6e-17

>ref|NP_195107.1| glycosyl hydrolase family 10; protein id: At4g33810.1 [Arabidopsis
           thaliana] gi|7487091|pir||T04998 hypothetical protein
           T16L1.300 - Arabidopsis thaliana
           gi|3549683|emb|CAA20594.1| beta-xylan endohydrolase-like
           protein [Arabidopsis thaliana]
           gi|7270330|emb|CAB80098.1| beta-xylan endohydrolase-like
           protein [Arabidopsis thaliana]
          Length = 536

 Score = 93.6 bits (231), Expect = 5e-19
 Identities = 55/123 (44%), Positives = 77/123 (61%), Gaps = 6/123 (4%)
 Frame = -2

Query: 446 HQEDYVDWILREGYAHPAVQGIIMFMGPVQAGFKKAPLADENFKNTPIGDVVDMLIREWG 267
           +QE Y++ ILRE Y+HPAV+GII+F GP  +GF K  LAD+ F NT  GDV+D L++EW 
Sbjct: 414 NQEVYIEEILREAYSHPAVKGIIIFAGPEVSGFDKLTLADKYFNNTATGDVIDKLLKEWQ 473

Query: 266 TG---PHEAKADSRG-IVDISLHHGDYDVIVTHP-QKQISKTLNLSVRKGSPQ-QTIQVK 105
                P     DS     ++SL HG Y+V V+HP  K +S + +L V K   Q Q ++V 
Sbjct: 474 QSSEIPKIFMTDSENDEEEVSLLHGHYNVNVSHPWMKNMSTSFSLEVTKEMGQRQVVRVV 533

Query: 104 MHA 96
           ++A
Sbjct: 534 INA 536

>pir||B85398 hypothetical protein AT4g33820 [imported] - Arabidopsis thaliana
           gi|7270331|emb|CAB80099.1| putative protein [Arabidopsis
           thaliana]
          Length = 546

 Score = 91.3 bits (225), Expect = 2e-18
 Identities = 54/124 (43%), Positives = 75/124 (59%), Gaps = 7/124 (5%)
 Frame = -2

Query: 446 HQEDYVDWILREGYAHPAVQGIIMFMGPVQAGFKKAPLADENFKNTPIGDVVDMLIREWG 267
           +Q  YV+ ILRE Y+HPAV+GII+F GP  +GF K  LAD++F NT  GDV+D L++EW 
Sbjct: 423 NQAQYVEDILREAYSHPAVKGIIIFGGPEVSGFDKLTLADKDFNNTQTGDVIDKLLKEWQ 482

Query: 266 TGPHEAKADSRGIVD-----ISLHHGDYDVIVTHPQ-KQISKTLNLSVRKGSPQ-QTIQV 108
               E + +     D     +SL HG Y+V V+HP    +S + +L V K   Q Q I+V
Sbjct: 483 QKSSEIQTNFTADSDNEEEEVSLLHGHYNVNVSHPWIANLSTSFSLEVTKEMDQDQVIRV 542

Query: 107 KMHA 96
            + A
Sbjct: 543 VISA 546

>ref|NP_680761.1| glycosyl hydrolase family 10; protein id: At4g33820.1 [Arabidopsis
           thaliana] gi|27754330|gb|AAO22618.1| putative glycosyl
           hydrolase family 10 protein [Arabidopsis thaliana]
          Length = 570

 Score = 91.3 bits (225), Expect = 2e-18
 Identities = 54/124 (43%), Positives = 75/124 (59%), Gaps = 7/124 (5%)
 Frame = -2

Query: 446 HQEDYVDWILREGYAHPAVQGIIMFMGPVQAGFKKAPLADENFKNTPIGDVVDMLIREWG 267
           +Q  YV+ ILRE Y+HPAV+GII+F GP  +GF K  LAD++F NT  GDV+D L++EW 
Sbjct: 447 NQAQYVEDILREAYSHPAVKGIIIFGGPEVSGFDKLTLADKDFNNTQTGDVIDKLLKEWQ 506

Query: 266 TGPHEAKADSRGIVD-----ISLHHGDYDVIVTHPQ-KQISKTLNLSVRKGSPQ-QTIQV 108
               E + +     D     +SL HG Y+V V+HP    +S + +L V K   Q Q I+V
Sbjct: 507 QKSSEIQTNFTADSDNEEEEVSLLHGHYNVNVSHPWIANLSTSFSLEVTKEMDQDQVIRV 566

Query: 107 KMHA 96
            + A
Sbjct: 567 VISA 570

>ref|NP_179076.1| glycosyl hydrolase family 10; protein id: At2g14690.1 [Arabidopsis
           thaliana] gi|25411580|pir||C84520 1,4-beta-xylan
           endohydrolase [imported] - Arabidopsis thaliana
           gi|3810591|gb|AAC69373.1| 1,4-beta-xylan endohydrolase
           [Arabidopsis thaliana]
          Length = 552

 Score = 87.0 bits (214), Expect = 5e-17
 Identities = 53/130 (40%), Positives = 76/130 (57%), Gaps = 14/130 (10%)
 Frame = -2

Query: 443 QEDYVDWILREGYAHPAVQGIIMFMGPVQAGFKKAPLADENFKNTPIGDVVDMLIREWGT 264
           Q  Y++ ILRE Y+HPAV+ II++ GP  +GF K  LAD++FKNT  GD++D L++EW  
Sbjct: 423 QVKYMEDILREAYSHPAVKAIILYGGPEVSGFDKLTLADKDFKNTQAGDLIDKLLQEWKQ 482

Query: 263 GP-------HEAKADSRGIV-----DISLHHGDYDVIVTHP-QKQISKTLNLSVRKGSPQ 123
            P       HE   +  G +     +ISL HG Y V VT+P  K +S   ++ V K S  
Sbjct: 483 EPVEIPIQHHEHNDEEGGRIIGFSPEISLLHGHYRVTVTNPSMKNLSTRFSVEVTKESGH 542

Query: 122 -QTIQVKMHA 96
            Q +Q+ + A
Sbjct: 543 LQEVQLVIDA 552

>pir||T05212 hypothetical protein F17I5.30 - Arabidopsis thaliana
           gi|3297808|emb|CAA19866.1| putative protein [Arabidopsis
           thaliana] gi|7270333|emb|CAB80101.1| putative protein
           [Arabidopsis thaliana]
          Length = 669

 Score = 86.7 bits (213), Expect = 6e-17
 Identities = 41/101 (40%), Positives = 60/101 (58%), Gaps = 2/101 (1%)
 Frame = -2

Query: 437 DYVDWILREGYAHPAVQGIIMFMGPVQAGFKKAPLADENFKNTPIGDVVDMLIREWG--T 264
           +Y + +LREG+AHP V G++M+ G   +G  +  L D NFKN P GDVVD L+REWG   
Sbjct: 551 NYFEQVLREGHAHPKVNGMVMWTGYSPSGCYRMCLTDGNFKNLPTGDVVDKLLREWGGLR 610

Query: 263 GPHEAKADSRGIVDISLHHGDYDVIVTHPQKQISKTLNLSV 141
                  D+ G+ +  L HGDYD+ ++HP      + N ++
Sbjct: 611 SQTTGVTDANGLFEAPLFHGDYDLRISHPLTNSKASYNFTL 651

 Score = 48.1 bits (113), Expect = 2e-05
 Identities = 21/50 (42%), Positives = 33/50 (66%)
 Frame = -2

Query: 443 QEDYVDWILREGYAHPAVQGIIMFMGPVQAGFKKAPLADENFKNTPIGDV 294
           Q  Y + +LR+G+AHP V+G++++ G   +G  +  L D NF+N P GDV
Sbjct: 60  QAKYFEQVLRDGHAHPQVKGMVVWGGYSPSGCYRMCLTDGNFRNLPTGDV 109

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 375,685,478
Number of Sequences: 1393205
Number of extensions: 7805549
Number of successful extensions: 16530
Number of sequences better than 10.0: 47
Number of HSP's better than 10.0 without gapping: 16189
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 16515
length of database: 448,689,247
effective HSP length: 124
effective length of database: 275,931,827
effective search space used: 6622363848
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
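
The statistics above can be cross-checked against the alignment headers. The sketch below assumes the standard Karlin-Altschul normalization, bit score = (lambda * raw score - ln K) / ln 2, with the gapped lambda and K values from this report. It also assumes the effective database length is the database length minus one effective HSP length per sequence. The function names are illustrative and are not BLAST API calls.

```python
import math

# Gapped Karlin-Altschul parameters as printed in this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def effective_db_length(db_letters, n_sequences, hsp_length):
    """Database length after subtracting one HSP length per sequence."""
    return db_letters - n_sequences * hsp_length


if __name__ == "__main__":
    # Raw scores shown in parentheses in the alignment headers.
    for raw in (231, 225, 214, 213, 113):
        print(raw, round(bit_score(raw), 1))

    eff_db = effective_db_length(448_689_247, 1_393_205, 124)
    print(eff_db)
    # Ratio of the reported search space to the effective database length.
    print(6_622_363_848 // eff_db)
```

With the values from this report, the raw score 231 converts to about 93.6 bits and the effective database length comes out at 275,931,827, both as printed. The reported effective search space is exactly 24 times that length, which suggests an effective query length of 24 residues. That last figure is inferred from the numbers here, not something the report states.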


EST assemble image


No.  Clone          Accession  Start  End
1    MPD067h08_f    AV774487   1      449
2    MR087a01_f     BP082657   1      358
3    MPDL020c04_f   AV777504   1      408




Lotus japonicus
Kazusa DNA Research Institute