KMC004726A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC004726A_C01 KMC004726A_c01
aaatatgaaatatatctatccaatagaaaattaaaataaggcaataaacttactagaaaa
agaaacatcgtgttgaAGGAGGCAAAGTGAAAGCAAAGTACCCTATAGCTATATAATTCC
AATAGTGAACACTAATCTAAGCTAGTGTTTCTATTAATCTAATAACTGAAGCTATATATA
TAACTAAACTTAGATAATTAAACAGACCAAAAAAAGAAAGGAAAAAAAAAGCAAATAAAT
AGAATAATAATATAATTTCACAATACAGAGTTAGCAAAAGAAAAACAGAAACTACCAATG
TGAATGTGATTTCTTCAGTACGGTGGTGGCGGTGGCCTCGCCGTTGGAGCCCAAATCACA
TCCGAAGCCAACTGACAGCTGTACATTGACATCCCACACGACGCCGCCTGATCATCACCG
CCACCGGACACAGGCGGCGACTGTACATCCCCTGCTGAGGCCGAGTTCCCAGCCTCTTCC
TCCGGCGGGGGAAGCCTGTGGTACGACGGACTGTTGAACGAAGCAGCCACCACGAACACC
GTCCCGGCAGCCATCAATCCGCCGGCGACAATCCCTCCGACGATCTGCCCTTGCGGTCCA
GCGAGCGAGATGGAGAATCCGCCGGAAAGCGCCGGCTGTTGCTGAGGAAGAAACGTAGCG
TTGATGGAGAGGATGTCGAAGCGGCCGTGAAAAGTAACCGTCGCTCCGGGAGTGGTTGAA
GGTTGCCGGAGAGTGACGTTCGCGACGGTTCCAGAACCGGTGAGGATGGACAGCCCCATG
TTCTTGCGGCGGCTGAAAAGCGCGATGGCTTGAACGACATCGTTTCCGCCGGCGACTTCG
AGAATGT


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC004726A_C01 KMC004726A_c01
         (847 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_199781.1| putative protein; protein id: At5g49700.1 [Arab...   216  3e-55
ref|NP_172901.1| hypothetical protein; protein id: At1g14490.1 [...   186  3e-46
ref|NP_566232.1| expressed protein; protein id: At3g04570.1, sup...   141  1e-32
ref|NP_192942.1| putative DNA-binding protein; protein id: At4g1...   138  1e-31
ref|NP_177776.1| unknown protein; protein id: At1g76500.1 [Arabi...   135  8e-31

>ref|NP_199781.1| putative protein; protein id: At5g49700.1 [Arabidopsis thaliana]
           gi|8978267|dbj|BAA98158.1| contains similarity to
           AT-hook DNA-binding protein~gene_id:K2I5.6 [Arabidopsis
           thaliana]
          Length = 276

 Score =  216 bits (550), Expect = 3e-55
 Identities = 116/190 (61%), Positives = 144/190 (75%), Gaps = 16/190 (8%)
 Frame = -3

Query: 845 ILEVAGGNDVVQAIALFSRRKNMGLSILTGSGTVANVTLRQPSTTP-GATVTFHGRFDIL 669
           ILEV  GNDVV+AI  F RRK++G+ +L+GSG+VANVTLRQPS    G+T+TFHG+FD+L
Sbjct: 87  ILEVPSGNDVVEAINRFCRRKSIGVCVLSGSGSVANVTLRQPSPAALGSTITFHGKFDLL 146

Query: 668 SINATFLPQQ-----QPALSGGFSISLAGPQGQIVGGIVAGGLMAAGTVFVVAASFNSPS 504
           S++ATFLP        P +S  F++SLAGPQGQI+GG VAG L++AGTV+V+AASFN+PS
Sbjct: 147 SVSATFLPPPPRTSLSPPVSNFFTVSLAGPQGQIIGGFVAGPLISAGTVYVIAASFNNPS 206

Query: 503 YHRLPPPEEEAGNSASAG--DVQSPPVSGGGDDQ-------AASCGMSMYSCQL-ASDVI 354
           YHRL P EEE  +SA  G  + QSPPVSGGG++          SCG+SMYSC +  SDVI
Sbjct: 207 YHRL-PAEEEQKHSAGTGEREGQSPPVSGGGEESGQMAGSGGESCGVSMYSCHMGGSDVI 265

Query: 353 WAPTARPPPP 324
           WAPTAR PPP
Sbjct: 266 WAPTARAPPP 275

>ref|NP_172901.1| hypothetical protein; protein id: At1g14490.1 [Arabidopsis
           thaliana] gi|25511630|pir||G86279 F14L17.27 protein -
           Arabidopsis thaliana
           gi|7262692|gb|AAF43950.1|AC012188_27 Contains similarity
           to an AT-hook protein 2 from Arabidopsis thaliana
           gb|AJ224119.1
          Length = 206

 Score =  186 bits (473), Expect = 3e-46
 Identities = 103/181 (56%), Positives = 127/181 (69%), Gaps = 7/181 (3%)
 Frame = -3

Query: 845 ILEVAGGNDVVQAIALFSRRKNMGLSILTGSGTVANVTLRQPS-TTPGATVTFHGRFDIL 669
           ILEV  GNDVV+A+  F R K +G  +L+GSG+VA+VTLRQPS   PG+T+TFHG+FD+L
Sbjct: 34  ILEVPSGNDVVEALNRFCRGKAIGFCVLSGSGSVADVTLRQPSPAAPGSTITFHGKFDLL 93

Query: 668 SINATFLP-----QQQPALSGGFSISLAGPQGQIVGGIVAGGLMAAGTVFVVAASFNSPS 504
           S++ATFLP        P +S  F++SLAGPQG+++GG VAG L+AAGTV+ VA SF +PS
Sbjct: 94  SVSATFLPPLPPTSLSPPVSNFFTVSLAGPQGKVIGGFVAGPLVAAGTVYFVATSFKNPS 153

Query: 503 YHRLPPPEEEAGNSASAGDV-QSPPVSGGGDDQAASCGMSMYSCQLASDVIWAPTARPPP 327
           YHRLP  EEE  NSA   +  QSPPVSGGG       G SMY     SDVIW P A+ P 
Sbjct: 154 YHRLPATEEEQRNSAEGEEEGQSPPVSGGG-------GESMYVG--GSDVIWDPNAKAPS 204

Query: 326 P 324
           P
Sbjct: 205 P 205

>ref|NP_566232.1| expressed protein; protein id: At3g04570.1, supported by cDNA:
           15781. [Arabidopsis thaliana]
           gi|6175162|gb|AAF04888.1|AC011437_3 hypothetical protein
           [Arabidopsis thaliana] gi|21553701|gb|AAM62794.1|
           putative DNA-binding protein [Arabidopsis thaliana]
           gi|29028876|gb|AAO64817.1| At3g04570 [Arabidopsis
           thaliana]
          Length = 315

 Score =  141 bits (355), Expect = 1e-32
 Identities = 77/161 (47%), Positives = 104/161 (63%), Gaps = 10/161 (6%)
 Frame = -3

Query: 845 ILEVAGGNDVVQAIALFSRRKNMGLSILTGSGTVANVTLRQPSTT-----PG--ATVTFH 687
           ++E+A G DV++ +A F+RR+  G+ IL+G+GTVANVTLRQPST      PG  A +   
Sbjct: 111 VMEIASGTDVIETLATFARRRQRGICILSGNGTVANVTLRQPSTAAVAAAPGGAAVLALQ 170

Query: 686 GRFDILSINATFLPQQQPALSGGFSISLAGPQGQIVGGIVAGGLMAAGTVFVVAASFNSP 507
           GRF+ILS+  +FLP   P  S G +I LAG QGQ+VGG V G LMAAG V ++AA+F++ 
Sbjct: 171 GRFEILSLTGSFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLMAAGPVMLIAATFSNA 230

Query: 506 SYHRLPPPEEEA---GNSASAGDVQSPPVSGGGDDQAASCG 393
           +Y RLP  EEEA   G    +G V    + GGG   ++  G
Sbjct: 231 TYERLPLEEEEAAERGGGGGSGGVVPGQLGGGGSPLSSGAG 271

>ref|NP_192942.1| putative DNA-binding protein; protein id: At4g12050.1 [Arabidopsis
           thaliana] gi|7485568|pir||T06612 hypothetical protein
           F16J13.120 - Arabidopsis thaliana
           gi|4586110|emb|CAB40946.1| putative DNA-binding protein
           [Arabidopsis thaliana] gi|7267906|emb|CAB78248.1|
           putative DNA-binding protein [Arabidopsis thaliana]
          Length = 339

 Score =  138 bits (347), Expect = 1e-31
 Identities = 74/190 (38%), Positives = 107/190 (55%), Gaps = 18/190 (9%)
 Frame = -3

Query: 845 ILEVAGGNDVVQAIALFSRRKNMGLSILTGSGTVANVTLRQPSTTPGATVTFHGRFDILS 666
           ++E+  G D+V  +A F+RR+  G+ +++G+G+V NVT+RQP + PG+ V+ HGRF+ILS
Sbjct: 149 VMEIGDGCDIVDCMATFARRRQRGVCVMSGTGSVTNVTIRQPGSPPGSVVSLHGRFEILS 208

Query: 665 INATFLPQQQPALSGGFSISLAGPQGQIVGGIVAGGLMAAGTVFVVAASFNSPSYHRLPP 486
           ++ +FLP   P  + G S+ LAG QGQ+VGG V G L+ +G V V+AASF++ +Y RLP 
Sbjct: 209 LSGSFLPPPAPPAATGLSVYLAGGQGQVVGGSVVGPLLCSGPVVVMAASFSNAAYERLPL 268

Query: 485 PEEE--------AGNSASAGDVQSPPVSGGGDDQAASCGMSMYSCQLASDVIWAP----- 345
            E+E         G     G + SPP+ G     AA          L   V   P     
Sbjct: 269 EEDEMQTPVQGGGGGGGGGGGMGSPPMMGQQQAMAAMAAAQGLPPNLLGSVQLPPPQQND 328

Query: 344 -----TARPP 330
                T RPP
Sbjct: 329 QQYWSTGRPP 338

>ref|NP_177776.1| unknown protein; protein id: At1g76500.1 [Arabidopsis thaliana]
           gi|25348243|pir||H96792 unknown protein F14G6.10
           [imported] - Arabidopsis thaliana
           gi|12323978|gb|AAG51949.1|AC015450_10 unknown protein;
           41834-42742 [Arabidopsis thaliana]
          Length = 302

 Score =  135 bits (340), Expect = 8e-31
 Identities = 83/199 (41%), Positives = 113/199 (56%), Gaps = 25/199 (12%)
 Frame = -3

Query: 845 ILEVAGGNDVVQAIALFSRRKNMGLSILTGSGTVANVTLRQPSTTP--------GATVTF 690
           +LEV+ G D+V+++  ++RR+  G+SIL+G+GTVANV+LRQP+TT         G  V  
Sbjct: 103 VLEVSSGADIVESVTTYARRRGRGVSILSGNGTVANVSLRQPATTAAHGANGGTGGVVAL 162

Query: 689 HGRFDILSINATFLPQQQPALSGGFSISLAGPQGQIVGGIVAGGLMAAGTVFVVAASFNS 510
           HGRF+ILS+  T LP   P  SGG SI L+G QGQ++GG V   L+A+G V ++AASF++
Sbjct: 163 HGRFEILSLTGTVLPPPAPPGSGGLSIFLSGVQGQVIGGNVVAPLVASGPVILMAASFSN 222

Query: 509 PSYHRLPPPEEEAGNSASAGDV--------------QSPPVSGGGDDQAASCGMSMYSCQ 372
            ++ RL P E+E G     G+V               S P SG G  Q     MS Y  Q
Sbjct: 223 ATFERL-PLEDEGGEGGEGGEVGEGGGGEGGPPPATSSSPPSGAGQGQLRG-NMSGYD-Q 279

Query: 371 LASD---VIWAPTARPPPP 324
            A D   + W   A   PP
Sbjct: 280 FAGDPHLLGWGAAAAAAPP 298

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 789,587,996
Number of Sequences: 1393205
Number of extensions: 20427856
Number of successful extensions: 101912
Number of sequences better than 10.0: 246
Number of HSP's better than 10.0 without gapping: 82862
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 100176
length of database: 448,689,247
effective HSP length: 122
effective length of database: 278,718,237
effective search space used: 44316199683
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MR087c10_f BP082677 1 378
2 MPD006h12_f AV770434 74 483
3 MFB037g06_f BP036733 93 553
4 MWM218f12_f AV768078 216 323
5 SPD053b12_f BP048215 281 848




Lotus japonicus
Kazusa DNA Research Institute