KMC012695A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC012695A_C01 KMC012695A_c01
gTGTAGCTGTATGTTTAGTTATTAAACTCAATTACAATTACATAACTAGGAAGTACATCA
CTAAACACATTCAATTATTTCACAAGACAAATTAAATGGCTACACTGCAATGGGTGTTCA
GGGACGAGTTCTCATTTAATTTTCGATACAAAAGTATCTTTAAATTTTTCACCCTGAGTT
TTAAGGCTTAACCTAGGTCTCAAGCTGTACCTGCGCAGGCGAGGCTCAAACTTTGTACTC
CTCAAGATGCATCCACTTCGTTGAAGACCATTTATTTCCTCTAATTACTGGGCAACCACC
ATGTAAACTTGATGGATCTAGTGTGGCATCTGGCCTTATGCTCCAAAACAGCAGAGCATC
ACCCCTTTTTGGTTTCACTGATAGACCCTTTTTAGCACATACAGATAAATCATTATACCA
TGGCACAGAACTAAAGTTTGCCTTGGCAGCAGGAAATATGGTCTCACCACCTTCTTCAAC
ATCTGAAAGGTACATAAGAACTGTAGCAATACGTTGGCCTCCATTTTTAGTGTTGAACTC
ATCAAGGAAGTAATCATAGTGAGGTTCATATTTCTGCCCAACTTCATAGTGAAGAACCTG
AAGTCCTTCTCCATTCTCTAC


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC012695A_C01 KMC012695A_c01
         (621 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_564109.1| expressed protein; protein id: At1g20270.1, sup...   250  1e-65
gb|AAM61711.1| putative prolyl 4-hydroxylase, alpha subunit [Ara...   248  3e-65
ref|NP_179363.1| putative prolyl 4-hydroxylase, alpha subunit; p...   241  4e-63
gb|AAM65040.1| putative prolyl 4-hydroxylase, alpha subunit [Ara...   240  9e-63
ref|NP_201407.1| prolyl 4-hydroxylase, alpha subunit-like protei...   202  4e-51

>ref|NP_564109.1| expressed protein; protein id: At1g20270.1, supported by cDNA:
           13404. [Arabidopsis thaliana] gi|25513547|pir||D86336
           F14O10.12 protein - Arabidopsis thaliana
           gi|9558598|gb|AAF88161.1|AC026234_12 Contains similarity
           to a prolyl 4-hydroxylase alpha subunit protein from
           Gallus gallus gi|212530. [Arabidopsis thaliana]
          Length = 287

 Score =  250 bits (638), Expect = 1e-65
 Identities = 111/130 (85%), Positives = 123/130 (94%)
 Frame = -1

Query: 618 ENGEGLQVLHYEVGQKYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVEEGGETIFPAAKA 439
           ++GEGLQVLHYE GQKYEPHYDYF+DEFNTKNGGQR+AT+LMYLSDVEEGGET+FPAA  
Sbjct: 158 DHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQRMATMLMYLSDVEEGGETVFPAANM 217

Query: 438 NFSSVPWYNDLSVCAKKGLSVKPKRGDALLFWSIRPDATLDPSSLHGGCPVIRGNKWSST 259
           NFSSVPWYN+LS C KKGLSVKP+ GDALLFWS+RPDATLDP+SLHGGCPVIRGNKWSST
Sbjct: 218 NFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDATLDPTSLHGGCPVIRGNKWSST 277

Query: 258 KWMHLEEYKV 229
           KWMH+ EYK+
Sbjct: 278 KWMHVGEYKI 287

>gb|AAM61711.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
          Length = 287

 Score =  248 bits (634), Expect = 3e-65
 Identities = 110/130 (84%), Positives = 123/130 (94%)
 Frame = -1

Query: 618 ENGEGLQVLHYEVGQKYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVEEGGETIFPAAKA 439
           ++GEGLQVLHYE GQKYEPHYDYF+DEFNTKNGGQR+AT+LMYLSDVEEGGET+FPAA  
Sbjct: 158 DHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQRMATMLMYLSDVEEGGETVFPAANM 217

Query: 438 NFSSVPWYNDLSVCAKKGLSVKPKRGDALLFWSIRPDATLDPSSLHGGCPVIRGNKWSST 259
           NFSSVPWYN+LS C KKGLSVKP+ GDALLFWS+RPDATLDP+SLHGGCPVIRGNKWSST
Sbjct: 218 NFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDATLDPTSLHGGCPVIRGNKWSST 277

Query: 258 KWMHLEEYKV 229
           KW+H+ EYK+
Sbjct: 278 KWIHVGEYKI 287

>ref|NP_179363.1| putative prolyl 4-hydroxylase, alpha subunit; protein id:
           At2g17720.1, supported by cDNA: 36054. [Arabidopsis
           thaliana] gi|25411813|pir||F84555 similar to prolyl
           4-hydroxylase alpha subunit [imported] - Arabidopsis
           thaliana
          Length = 291

 Score =  241 bits (616), Expect = 4e-63
 Identities = 106/131 (80%), Positives = 123/131 (92%)
 Frame = -1

Query: 621 VENGEGLQVLHYEVGQKYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVEEGGETIFPAAK 442
           VENGEGLQVLHY+VGQKYEPHYDYFLDEFNTKNGGQRIATVLMYLSDV++GGET+FPAA+
Sbjct: 161 VENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVDDGGETVFPAAR 220

Query: 441 ANFSSVPWYNDLSVCAKKGLSVKPKRGDALLFWSIRPDATLDPSSLHGGCPVIRGNKWSS 262
            N S+VPW+N+LS C K+GLSV PK+ DALLFW++RPDA+LDPSSLHGGCPV++GNKWSS
Sbjct: 221 GNISAVPWWNELSKCGKEGLSVLPKKRDALLFWNMRPDASLDPSSLHGGCPVVKGNKWSS 280

Query: 261 TKWMHLEEYKV 229
           TKW H+ E+KV
Sbjct: 281 TKWFHVHEFKV 291

>gb|AAM65040.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
          Length = 291

 Score =  240 bits (613), Expect = 9e-63
 Identities = 106/131 (80%), Positives = 122/131 (92%)
 Frame = -1

Query: 621 VENGEGLQVLHYEVGQKYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVEEGGETIFPAAK 442
           VENGEGLQVLHY+VGQKYEPHYDYFLDEFNTKNGGQRIATVLMYLSDV++GGET+FPAA+
Sbjct: 161 VENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVDDGGETVFPAAR 220

Query: 441 ANFSSVPWYNDLSVCAKKGLSVKPKRGDALLFWSIRPDATLDPSSLHGGCPVIRGNKWSS 262
            N S+VPW+N+LS C K+GLSV PK  DALLFW++RPDA+LDPSSLHGGCPV++GNKWSS
Sbjct: 221 GNISAVPWWNELSKCGKEGLSVLPKXRDALLFWNMRPDASLDPSSLHGGCPVVKGNKWSS 280

Query: 261 TKWMHLEEYKV 229
           TKW H+ E+KV
Sbjct: 281 TKWFHVHEFKV 291

>ref|NP_201407.1| prolyl 4-hydroxylase, alpha subunit-like protein; protein id:
           At5g66060.1 [Arabidopsis thaliana]
           gi|10177121|dbj|BAB10411.1| prolyl 4-hydroxylase, alpha
           subunit-like protein [Arabidopsis thaliana]
          Length = 267

 Score =  202 bits (513), Expect = 4e-51
 Identities = 91/108 (84%), Positives = 102/108 (94%)
 Frame = -1

Query: 621 VENGEGLQVLHYEVGQKYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVEEGGETIFPAAK 442
           VE+GEGLQVLHYE+GQKYEPHYDYF+DE+NT+NGGQRIATVLMYLSDVEEGGET+FPAAK
Sbjct: 159 VEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGGQRIATVLMYLSDVEEGGETVFPAAK 218

Query: 441 ANFSSVPWYNDLSVCAKKGLSVKPKRGDALLFWSIRPDATLDPSSLHG 298
            N+S+VPW+N+LS C K GLSVKPK GDALLFWS+ PDATLDPSSLHG
Sbjct: 219 GNYSAVPWWNELSECGKGGLSVKPKMGDALLFWSMTPDATLDPSSLHG 266

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 507,113,714
Number of Sequences: 1393205
Number of extensions: 10533090
Number of successful extensions: 24997
Number of sequences better than 10.0: 134
Number of HSP's better than 10.0 without gapping: 24177
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 24829
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 25017613016
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 SPD025h09_f BP046015 1 604
2 SPD036b08_f BP046842 13 541
3 MPD091a03_f AV775952 14 473
4 SPD092b04_f BP051322 16 512
5 SPD002a03_f BP044119 17 636
6 MF069f01_f BP031973 29 546
7 MWM115f05_f AV766573 55 395




Lotus japonicus
Kazusa DNA Research Institute