KMC000851A_c01

Fasta Sequence
>KMC000851A_C01 KMC000851A_c01
gtGATATTAATTCGAGATGCAATTTTCCATTTACGAGTAATAGATATTCAGTTGGGCTGG
TCACGTGAAAACTCGATGCACAGAAGCACAATGGCTAACATATAAACTCGTAGGTGACAC
TTGATCATTTTTTTACATACCGCTCGTGGTAATTCCTCTCATTTTGGAGAACAAAAAATT
GAAGCCAGATTAGACTTCACAATTCCAAGATCACCAGCATGGTGGCATGGATATCCATTT
TCCTGGTCCACAAATGCCTCATTCCTTTTCAGTAATCACCACAAAAACATCCTTCAGTGA
CCACTTCCGTCTGCCAGTTTTTGCTGGGGGGGTTAATGACTGCCCTCTCGGCATTCGCTA
AACGGTAGCCTATCAAAATCTCTCTTCTCTGCCGAGCACGCAACATTATTTCATAGAAAC
TCATCTCCTCACCTTCACGGAGGTATATATCTGCTTGCCTTATATGCATCTCATTTCCCT
CCTCCGCAAAGAGCTCCTCCAATACATCATTTATCTGCCGATCTTCTGCAACCATAGCCA
AAGCCATGCTGACAAGTTCATTGGATAAAACATAATCACTAATCTTTGACATAGATAAAA
GATTTTTGGTnctagggtccaaaatttcac


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC000851A_C01 KMC000851A_c01
         (630 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

dbj|BAB64102.1| P0039A07.8 [Oryza sativa (japonica cultivar-grou...   150  2e-38
ref|NP_199807.1| putative protein; protein id: At5g49960.1 [Arab...   145  4e-34
gb|AAL32724.1| Unknown protein [Arabidopsis thaliana] gi|2025981...    68  9e-11
ref|NP_568628.1| Expressed protein; protein id: At5g43745.1, sup...    68  9e-11
gb|AAL36360.1| unknown protein [Arabidopsis thaliana]                  67  2e-10

>dbj|BAB64102.1| P0039A07.8 [Oryza sativa (japonica cultivar-group)]
            gi|20160875|dbj|BAB89814.1| P0677H08.29 [Oryza sativa
            (japonica cultivar-group)]
          Length = 927

 Score =  150 bits (380), Expect(2) = 2e-38
 Identities = 76/104 (73%), Positives = 92/104 (88%)
 Frame = -3

Query: 628  EILDPXTKNLLSMSKISDYVLSNELVSMALAMVAEDRQINDVLEELFAEEGNEMHIRQAD 449
            EILD  T+NL+S+SKISDYVLSNELVSMALAMVAED+QIN VLEELFAEEGNEM IR A+
Sbjct: 806  EILDSRTRNLVSVSKISDYVLSNELVSMALAMVAEDKQINRVLEELFAEEGNEMCIRSAE 865

Query: 448  IYLREGEEMSFYEIMLRARQRREILIGYRLANAERAVINPPSKN 317
             YL E EE+SF++IM+RAR+R E++IGYRLAN ++A+INP  K+
Sbjct: 866  FYLYEQEELSFFDIMVRARERDEVVIGYRLANDDQAIINPEQKS 909

 Score = 30.0 bits (66), Expect(2) = 2e-38
 Identities = 13/23 (56%), Positives = 17/23 (73%)
 Frame = -1

Query: 336 LTPPAKTGRRKWSLKDVFVVITE 268
           + P  K+  RKWSL DVFVVI++
Sbjct: 903 INPEQKSEIRKWSLDDVFVVISK 925

>ref|NP_199807.1| putative protein; protein id: At5g49960.1 [Arabidopsis thaliana]
            gi|8777427|dbj|BAA97017.1|
            emb|CAB86048.1~gene_id:K9P8.10~similar to unknown protein
            [Arabidopsis thaliana]
          Length = 824

 Score =  145 bits (366), Expect = 4e-34
 Identities = 75/108 (69%), Positives = 92/108 (84%)
 Frame = -3

Query: 628  EILDPXTKNLLSMSKISDYVLSNELVSMALAMVAEDRQINDVLEELFAEEGNEMHIRQAD 449
            EILD  TKNL+S+S+ISDYVLSNELVSMALAMVAED+QIN VL+ELFAE+GNE+ IR A+
Sbjct: 703  EILDSRTKNLVSVSRISDYVLSNELVSMALAMVAEDKQINRVLKELFAEKGNELCIRPAE 762

Query: 448  IYLREGEEMSFYEIMLRARQRREILIGYRLANAERAVINPPSKNWQTE 305
             Y+ + EE+ FY+IM RARQR+EI+IGYRLA  E+AVINP  K+  T+
Sbjct: 763  FYIYDQEEVCFYDIMRRARQRQEIIIGYRLAGMEQAVINPTDKSKLTK 810

>gb|AAL32724.1| Unknown protein [Arabidopsis thaliana] gi|20259818|gb|AAM13256.1|
           unknown protein [Arabidopsis thaliana]
          Length = 470

 Score = 68.2 bits (165), Expect = 9e-11
 Identities = 34/105 (32%), Positives = 64/105 (60%), Gaps = 1/105 (0%)
 Frame = -3

Query: 628 EILDPXT-KNLLSMSKISDYVLSNELVSMALAMVAEDRQINDVLEELFAEEGNEMHIRQA 452
           EI+D    K +  +     ++ + E++S+  A VAE+ ++N+V +++   +G+E++++  
Sbjct: 343 EIVDSKLGKQITGLKPSLTFIAAEEVMSLVTAQVAENSELNEVWKDILDADGDEIYVKDV 402

Query: 451 DIYLREGEEMSFYEIMLRARQRREILIGYRLANAERAVINPPSKN 317
           ++Y++EGE  SF E+  RA  RRE+ IGY      + +INP  KN
Sbjct: 403 ELYMKEGENPSFTELSERAWLRREVAIGY--IKGGKKMINPVPKN 445

>ref|NP_568628.1| Expressed protein; protein id: At5g43745.1, supported by cDNA:
           gi_15450497, supported by cDNA: gi_16974322 [Arabidopsis
           thaliana] gi|15450498|gb|AAK96542.1| AT5g02940/F9G14_250
           [Arabidopsis thaliana] gi|16974323|gb|AAL31146.1|
           AT5g02940/F9G14_250 [Arabidopsis thaliana]
          Length = 817

 Score = 68.2 bits (165), Expect = 9e-11
 Identities = 34/105 (32%), Positives = 64/105 (60%), Gaps = 1/105 (0%)
 Frame = -3

Query: 628 EILDPXT-KNLLSMSKISDYVLSNELVSMALAMVAEDRQINDVLEELFAEEGNEMHIRQA 452
           EI+D    K +  +     ++ + E++S+  A VAE+ ++N+V +++   +G+E++++  
Sbjct: 690 EIVDSKLGKQITGLKPSLTFIAAEEVMSLVTAQVAENSELNEVWKDILDADGDEIYVKDV 749

Query: 451 DIYLREGEEMSFYEIMLRARQRREILIGYRLANAERAVINPPSKN 317
           ++Y++EGE  SF E+  RA  RRE+ IGY      + +INP  KN
Sbjct: 750 ELYMKEGENPSFTELSERAWLRREVAIGY--IKGGKKMINPVPKN 792

>gb|AAL36360.1| unknown protein [Arabidopsis thaliana]
          Length = 813

 Score = 67.0 bits (162), Expect = 2e-10
 Identities = 35/104 (33%), Positives = 64/104 (60%), Gaps = 1/104 (0%)
 Frame = -3

Query: 628 EILDPXTKNLLSMSKIS-DYVLSNELVSMALAMVAEDRQINDVLEELFAEEGNEMHIRQA 452
           EI+D      ++  K S  ++ + E++S+  A VAE+ ++N+V +++   EG+E++++  
Sbjct: 686 EIVDTKLGKQITRLKPSLTFIAAEEVMSLVTAQVAENSELNEVWKDILDAEGDEIYVKDI 745

Query: 451 DIYLREGEEMSFYEIMLRARQRREILIGYRLANAERAVINPPSK 320
           ++Y++EGE  SF E+  RA  RRE+ IGY      + +INP  K
Sbjct: 746 ELYMKEGENPSFTELSERAWLRREVAIGY--IKGGKKIINPVPK 787

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 559,127,262
Number of Sequences: 1393205
Number of extensions: 12508293
Number of successful extensions: 30050
Number of sequences better than 10.0: 21
Number of HSP's better than 10.0 without gapping: 28877
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 29995
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 25870486187
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


no. clone accession start end
1 SPDL025h05_f BP053576 1 457
2 MRL036g04_f BP085503 6 385
3 GENLf054d02 BP065226 7 324
4 GENLf060g02 BP065575 8 532
5 GENLf080d10 BP066693 15 530
6 SPDL088b09_f BP057508 19 529
7 MPDL038c03_f AV778414 32 595
8 SPDL004a09_f BP052209 84 520
9 GENLf092b06 BP067357 85 620
10 SPDL024e03_f BP053490 90 599
11 SPDL081h05_f BP057086 90 609
12 SPDL016b11_f BP052977 100 609
13 SPDL098h09_f BP058192 100 501
14 GENLf040e06 BP064461 105 646
15 SPDL100h07_f BP058325 112 611




Lotus japonicus
Kazusa DNA Research Institute