KMC007827A_c01

Fasta Sequence
>KMC007827A_C01 KMC007827A_c01
ggtcagaatcgaattcaatgaaatggtctatggggtacaagtgaagtcttacacctttat
tgaatggacatgaaattaatAGTTGAATGAAATGGTCTATGTGAATCACTGTACTAAATC
CACAACATTTTCTGGTTGCTTGGAACCTTGTACTCACCATGGAAAGACACTTAATTTCAG
CTAGATGAATCAGAAGATAACTGTTCATGGAAGGCAATTGGATACTGTGGACCTGATCCA
GAATTCAAGGCTCAGTTAAAGAACCATTTTTTACAAGGAGGTCAAGAGAACAATAAGGAA
GAGGATGATGCAGAACAGGTCATAGGGCCTCTTTACAAAGGTCTAGTCCACCTGAACAAC
CCAAGAGATATCACTGCAGATGCTCTCAGACAATTCGGATTCCCGGGTATCTGTCATTCG
CCGGAAGCCGAAAGCCTCCCCTAATATATCTTCTCCCTCTTTACTCATCCAATGTGATGC
AGTGGGTGGTGGTTCCGGCTCAGGAGGTGGTCTGATAGCTGGAATTCTTGCAAAAGCTGG
TTACAAAGTGCTGGTAAAGGAGAAAGGAGGCTACACTGCTCGGAACAATCTTTCGCTTCT
TGAAGCACCGAGCATGGATCAAATGTACCTCTCTGGGGGTTAGGTTGCAACTGATGACAT
GAGGGTATTCATACTAGCAGGGTCCACTGTAGGTGGAGGCTCTGCAATCAACTGGTCTGC
AT


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC007827A_C01 KMC007827A_c01
         (722 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_193673.1| putative protein; protein id: At4g19380.1 [Arab...   128  9e-39
gb|AAK64154.1| unknown protein [Arabidopsis thaliana]                  96  2e-22
ref|NP_194586.1| putative protein; protein id: At4g28570.1, supp...    96  2e-22
gb|AAM63097.1| unknown [Arabidopsis thaliana]                          96  3e-22
gb|AAL31024.1|AC078948_8 putative alcohol oxidase [Oryza sativa]       98  5e-22

>ref|NP_193673.1| putative protein; protein id: At4g19380.1 [Arabidopsis thaliana]
           gi|7487715|pir||T05821 hypothetical protein T5K18.160 -
           Arabidopsis thaliana gi|3080368|emb|CAA18625.1| putative
           protein [Arabidopsis thaliana]
           gi|7268733|emb|CAB78940.1| putative protein [Arabidopsis
           thaliana]
          Length = 678

 Score =  128 bits (321), Expect(2) = 9e-39
 Identities = 66/93 (70%), Positives = 73/93 (77%)
 Frame = +2

Query: 443 NISSPSLLIQCDAVGGGSGSGGGLIAGILAKAGYKVLVKEKGGYTARNNLSLLEAPSMDQ 622
           +IS P + IQCDAV  GSGSGGG+ AG+LAKAGYKVLV E G Y AR+ LSLLE  +MD 
Sbjct: 166 SISDPVMKIQCDAVVVGSGSGGGVAAGVLAKAGYKVLVIESGNYYARSKLSLLEGQAMDD 225

Query: 623 MYLSGG*VATDDMRVFILAGSTVGGGSAINWSA 721
           MYLSGG +AT D  V ILAGSTVGGGS INWSA
Sbjct: 226 MYLSGGLLATSDTNVVILAGSTVGGGSTINWSA 258

 Score = 54.7 bits (130), Expect(2) = 9e-39
 Identities = 28/75 (37%), Positives = 43/75 (57%)
 Frame = +1

Query: 178 QLDESEDNCSWKAIGYCGPDPEFKAQLKNHFLQGGQENNKEEDDAEQVIGPLYKGLVHLN 357
           ++DE   N +WKAIGY GP P+      +H ++  +E  K++   E++ GPLY G+V L 
Sbjct: 85  RVDEKGRNLAWKAIGYNGPSPDHS----DHEVELNEEKKKKKP--EEIFGPLYNGIVDLK 138

Query: 358 NPRDITADALRQFGF 402
           +PR+     L   GF
Sbjct: 139 SPREAVEKKLAGRGF 153
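
The Frame and Query coordinates in the alignments above can be checked by hand. The sketch below is a minimal illustration, not part of the BLASTX output: it assumes the standard genetic code, and the helper names (translate, forward_frame) are hypothetical. It translates the 36 nucleotides starting at query position 443 and derives the forward reading frame from a 1-based start position.

```python
# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(nt: str) -> str:
    """Translate a nucleotide string codon by codon; '*' marks a stop codon."""
    nt = nt.upper()
    return "".join(
        CODON_TABLE.get(nt[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        for i in range(0, len(nt) - 2, 3)
    )


def forward_frame(start: int) -> int:
    """Forward reading frame (1, 2 or 3) for a 1-based nucleotide start position."""
    return (start - 1) % 3 + 1


# Nucleotides 443-478 of the query, copied from the Fasta Sequence section.
fragment = "AATATATCTTCTCCCTCTTTACTCATCCAATGTGAT"
print(translate(fragment))   # first 12 residues of the Frame = +2 query line
print(forward_frame(443))    # frame of the HSP starting at query position 443
print(forward_frame(178))    # frame of the HSP starting at query position 178
```

The asterisk in the query line "MYLSGG*VATDD..." is a translated stop codon (TAG at nucleotides 641-643), which the same function reproduces.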

>gb|AAK64154.1| unknown protein [Arabidopsis thaliana]
          Length = 748

 Score = 96.3 bits (238), Expect(2) = 2e-22
 Identities = 50/85 (58%), Positives = 61/85 (70%)
 Frame = +2

Query: 467 IQCDAVGGGSGSGGGLIAGILAKAGYKVLVKEKGGYTARNNLSLLEAPSMDQMYLSGG*V 646
           I+CDAV  GSGSGGG+ A  LAKAG KVLV EKG Y   ++ S LE PSM ++Y  GG +
Sbjct: 236 IRCDAVVVGSGSGGGVAAANLAKAGLKVLVLEKGNYFTAHDYSGLEVPSMLELYEKGGLL 295

Query: 647 ATDDMRVFILAGSTVGGGSAINWSA 721
            T D +  +LAGS VGGG+A+NWSA
Sbjct: 296 TTVDGKFMLLAGSAVGGGTAVNWSA 320

 Score = 31.6 bits (70), Expect(2) = 2e-22
 Identities = 17/66 (25%), Positives = 33/66 (49%)
 Frame = +1

Query: 178 QLDESEDNCSWKAIGYCGPDPEFKAQLKNHFLQGGQENNKEEDDAEQVIGPLYKGLVHLN 357
           Q DE+  N + +AIGYC              + G + ++ ++ +A++   PL KG++   
Sbjct: 162 QTDENLKNPALEAIGYC--------------IDGTERSSNKKSEADEKRRPLEKGIIETM 207

Query: 358 NPRDIT 375
           +  D+T
Sbjct: 208 HESDVT 213

>ref|NP_194586.1| putative protein; protein id: At4g28570.1, supported by cDNA:
           19314., supported by cDNA: gi_14532705 [Arabidopsis
           thaliana] gi|7487681|pir||T10651 hypothetical protein
           T5F17.20 - Arabidopsis thaliana
           gi|7269712|emb|CAB81445.1| putative protein [Arabidopsis
           thaliana] gi|22798798|emb|CAC87644.1| alcohol oxidase
           [Arabidopsis thaliana] gi|25054929|gb|AAN71941.1|
           unknown protein [Arabidopsis thaliana]
          Length = 748

 Score = 96.3 bits (238), Expect(2) = 2e-22
 Identities = 50/85 (58%), Positives = 61/85 (70%)
 Frame = +2

Query: 467 IQCDAVGGGSGSGGGLIAGILAKAGYKVLVKEKGGYTARNNLSLLEAPSMDQMYLSGG*V 646
           I+CDAV  GSGSGGG+ A  LAKAG KVLV EKG Y   ++ S LE PSM ++Y  GG +
Sbjct: 236 IRCDAVVVGSGSGGGVAAANLAKAGLKVLVLEKGNYFTAHDYSGLEVPSMLELYEKGGLL 295

Query: 647 ATDDMRVFILAGSTVGGGSAINWSA 721
            T D +  +LAGS VGGG+A+NWSA
Sbjct: 296 TTVDGKFMLLAGSAVGGGTAVNWSA 320

 Score = 31.6 bits (70), Expect(2) = 2e-22
 Identities = 17/66 (25%), Positives = 33/66 (49%)
 Frame = +1

Query: 178 QLDESEDNCSWKAIGYCGPDPEFKAQLKNHFLQGGQENNKEEDDAEQVIGPLYKGLVHLN 357
           Q DE+  N + +AIGYC              + G + ++ ++ +A++   PL KG++   
Sbjct: 162 QTDENLKNPALEAIGYC--------------IDGTERSSNKKSEADEKRRPLEKGIIETM 207

Query: 358 NPRDIT 375
           +  D+T
Sbjct: 208 HESDVT 213

>gb|AAM63097.1| unknown [Arabidopsis thaliana]
          Length = 748

 Score = 96.3 bits (238), Expect(2) = 3e-22
 Identities = 50/85 (58%), Positives = 61/85 (70%)
 Frame = +2

Query: 467 IQCDAVGGGSGSGGGLIAGILAKAGYKVLVKEKGGYTARNNLSLLEAPSMDQMYLSGG*V 646
           I+CDAV  GSGSGGG+ A  LAKAG KVLV EKG Y   ++ S LE PSM ++Y  GG +
Sbjct: 236 IRCDAVVVGSGSGGGVAAANLAKAGLKVLVLEKGNYFTAHDYSGLEVPSMLELYEKGGLL 295

Query: 647 ATDDMRVFILAGSTVGGGSAINWSA 721
            T D +  +LAGS VGGG+A+NWSA
Sbjct: 296 TTVDGKFMLLAGSAVGGGTAVNWSA 320

 Score = 31.2 bits (69), Expect(2) = 3e-22
 Identities = 17/66 (25%), Positives = 33/66 (49%)
 Frame = +1

Query: 178 QLDESEDNCSWKAIGYCGPDPEFKAQLKNHFLQGGQENNKEEDDAEQVIGPLYKGLVHLN 357
           Q DE+  N + +AIGYC              + G + ++ ++ +A++   PL KG++   
Sbjct: 162 QTDENLKNPALEAIGYC--------------IDGTERSSNKKSEADEKRRPLEKGIIETM 207

Query: 358 NPRDIT 375
           +  D+T
Sbjct: 208 HESDLT 213

>gb|AAL31024.1|AC078948_8 putative alcohol oxidase [Oryza sativa]
          Length = 756

 Score = 97.8 bits (242), Expect(2) = 5e-22
 Identities = 54/121 (44%), Positives = 76/121 (62%), Gaps = 3/121 (2%)
 Frame = +2

Query: 368 ISLQMLSDNSDSRVSVIRRKPKASPNISSPSL---LIQCDAVGGGSGSGGGLIAGILAKA 538
           +  + L DN+   +S+  +        SSPS     + CDAV  GSG GGG+ A +LA A
Sbjct: 196 VETKQLDDNA-LLMSLAEKGLALKTGASSPSAHHHTVLCDAVVVGSGCGGGVAAAVLASA 254

Query: 539 GYKVLVKEKGGYTARNNLSLLEAPSMDQMYLSGG*VATDDMRVFILAGSTVGGGSAINWS 718
           GYKV+V EKG Y A  + S LE P+M+++Y +GG  +T ++   + AG+TVGGGSA+NWS
Sbjct: 255 GYKVVVVEKGDYFAARDYSSLEGPTMERLYENGGVFSTANVTTTMFAGATVGGGSAVNWS 314

Query: 719 A 721
           A
Sbjct: 315 A 315

 Score = 28.9 bits (63), Expect(2) = 5e-22
 Identities = 13/25 (52%), Positives = 17/25 (68%)
 Frame = +1

Query: 181 LDESEDNCSWKAIGYCGPDPEFKAQ 255
           +DE+ +N SWKAIGY  P  E + Q
Sbjct: 145 VDENLENPSWKAIGYSVPAAEEEPQ 169

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 
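
The bit scores in the alignments follow from the raw scores in parentheses and the gapped Lambda and K values above, using the usual Karlin-Altschul normalization S' = (Lambda * S - ln K) / ln 2. The sketch below is only a cross-check under that assumption; the function name bit_score is hypothetical, and small deviations are expected because Lambda and K are printed rounded.

```python
import math


def bit_score(raw_score: float, lam: float, k: float) -> float:
    """Convert a raw alignment score to a bit score: (lam * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Gapped parameters as printed in this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410

# (raw score, bit score as printed in the report)
reported = [(321, 128.0), (130, 54.7), (238, 96.3), (242, 97.8),
            (70, 31.6), (69, 31.2), (63, 28.9)]

for raw, printed in reported:
    computed = bit_score(raw, GAPPED_LAMBDA, GAPPED_K)
    print(f"raw {raw:>3}: computed {computed:6.1f} bits, report shows {printed}")
```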

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 669,131,000
Number of Sequences: 1393205
Number of extensions: 15725074
Number of successful extensions: 91284
Number of sequences better than 10.0: 117
Number of HSP's better than 10.0 without gapping: 58518
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 85051
length of database: 448,689,247
effective HSP length: 120
effective length of database: 281,504,647
effective search space used: 33780557640
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
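
The database and search-space figures above are mutually consistent, which the following sketch verifies. It assumes that the effective database length is the raw length minus one effective HSP length per sequence, and that the translated query length is taken as 722 // 3 residues; both assumptions reproduce the printed numbers but are not stated in the report itself.

```python
# Values copied from the statistics block of this report.
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 120
QUERY_NUCLEOTIDES = 722

# Assumption: each database sequence loses one effective HSP length.
effective_db_length = DB_LETTERS - DB_SEQUENCES * EFFECTIVE_HSP_LENGTH

# Assumption: BLASTX works with the translated query length (nucleotides // 3).
effective_query_length = QUERY_NUCLEOTIDES // 3 - EFFECTIVE_HSP_LENGTH

effective_search_space = effective_db_length * effective_query_length

print(effective_db_length)     # compare with "effective length of database"
print(effective_search_space)  # compare with "effective search space used"
```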


EST assemble image


     clone         accession  position
1    SPDL084c06_f  BP057248     1  529
2    GNLf010a03    BP075369   228  723
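
The two clone positions above imply a shared region in the assembly. The sketch below computes it, assuming the positions are 1-based, inclusive start and end coordinates on the assembled contig; the Clone type and overlap function are hypothetical names used only for illustration.

```python
from typing import NamedTuple, Optional, Tuple


class Clone(NamedTuple):
    name: str
    accession: str
    start: int  # assumed 1-based, inclusive
    end: int    # assumed 1-based, inclusive


def overlap(a: Clone, b: Clone) -> Optional[Tuple[int, int]]:
    """Return the (start, end) of the region covered by both clones, or None."""
    start = max(a.start, b.start)
    end = min(a.end, b.end)
    return (start, end) if start <= end else None


clones = [
    Clone("SPDL084c06_f", "BP057248", 1, 529),
    Clone("GNLf010a03", "BP075369", 228, 723),
]

shared = overlap(clones[0], clones[1])
if shared is not None:
    print(f"overlap {shared[0]}-{shared[1]}, {shared[1] - shared[0] + 1} positions")
```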




Lotus japonicus
Kazusa DNA Research Institute