KMC002852A_c01

Fasta Sequence
>KMC002852A_C01 KMC002852A_c01
AGAAAGAAAAACTAAGCAATTTTATAAAAACAACCTTAACATATTAATATTTTTAGCTTA
CAACAGCCACTGACTGAATGACTACAAGGAATCCAACACTGCAGCCAATAAAATCCAGGC
GCCTGTAGCAAAAAATTAACACTACATCATGAAGCTAAACCCATGAGTATGAATATCAAA
CCAGGCCTAGAAACAACCCCAAAACAACACTTGGTATATGGTGGTAACAGAAAGAGGCTC
CTGTATTTAAAAATACTCAAGTGGAGCTTTTTTTTGTTCTTTCTCTTTCACAAAAGTACA
CAAATCCTTGAACACTAAGCTAACTTCCCCTTCAATAGACCTTTGGGAATCTTACTCAAG
ACCTTACTCAAGACCTTGGCATCAAACACCACATACTGCTTCTTAATCTCATGGGCTGCC
TTCTCAGCCAGAGGGTCTATCTTATCCTCATACTTGTCATACAAGAAAGGAATTGTGAGC
AACAGAACAAAATTTATGTAGAACAGAGTCAAGAAATTGCACCAACTCCCCACAATGGAT
AGAATCCACAAGCCAACAATAGCAATGAGGAATTTCTTCACATCTTTTCCAGTACCAATA
TCACGCAATGCAGCAAACCCGTGATTGATCTCAATTCTCAATGCAGACGCAATTTCCAGA
AACGGCTCTTCAGGAAGATGAAACACAGGAATGCGAGGCGGAGC
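
The Nr search hits for this contig are all reported with Frame = -1 and query coordinates that count down from 704, i.e. the protein is read from the reverse complement of the sequence above. A minimal sketch of that reading, assuming the standard genetic code and using only the final line of the sequence (positions 661-704); the helper names are illustrative and not part of BLAST or any Kazusa tool:

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq):
    """Translate from the first base, ignoring a trailing partial codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Last line of the contig (positions 661-704).
tail = "AACGGCTCTTCAGGAAGATGAAACACAGGAATGCGAGGCGGAGC"
peptide = translate(reverse_complement(tail))
print(peptide)
```

The printed peptide can be compared with the start of the Query lines in the alignments, which begin at position 704.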


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC002852A_C01 KMC002852A_c01
         (704 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_176592.1| hypothetical protein; protein id: At1g64090.1, ...   168  8e-41
ref|NP_194094.1| putative protein; protein id: At4g23630.1, supp...   166  3e-40
ref|NP_198975.1| putative protein; protein id: At5g41600.1, supp...   164  1e-39
ref|NP_566065.1| expressed protein; protein id: At2g46170.1, sup...   158  8e-38
ref|NP_192861.1| putative protein; protein id: At4g11220.1, supp...   157  1e-37

>ref|NP_176592.1| hypothetical protein; protein id: At1g64090.1, supported by cDNA:
           gi_17381217 [Arabidopsis thaliana]
           gi|6692111|gb|AAF24576.1|AC007764_18 F22C12.15
           [Arabidopsis thaliana] gi|17381218|gb|AAL36421.1|
           unknown protein [Arabidopsis thaliana]
           gi|21436435|gb|AAM51418.1| unknown protein [Arabidopsis
           thaliana]
          Length = 255

 Score =  168 bits (425), Expect = 8e-41
 Identities = 82/122 (67%), Positives = 95/122 (77%)
 Frame = -1

Query: 704 APPRIPVFHLPEEPFLEIASALRIEINHGFAALRDIGTGKDVKKFLIAIVGLWILSIVGS 525
           +P  IP  H+PE+  L++AS LRIEIN GF  LRDI +G+D+KKFL+ I GLW+LS VGS
Sbjct: 125 SPLHIPEVHIPEDVVLQLASGLRIEINRGFTVLRDIASGRDLKKFLLVIAGLWVLSKVGS 184

Query: 524 WCNFLTLFYINFVLLLTIPFLYDKYEDKIDPLAEKAAHEIKKQYVVFDAKVLSKVLSKIP 345
            CNFLTL YI  VLL TIP LY+KYEDK+D   EKA  EIKKQYV FD KVLSKV+SKIP
Sbjct: 185 SCNFLTLIYIATVLLFTIPVLYEKYEDKVDDFGEKAMREIKKQYVEFDVKVLSKVMSKIP 244

Query: 344 KG 339
           KG
Sbjct: 245 KG 246

>ref|NP_194094.1| putative protein; protein id: At4g23630.1, supported by cDNA:
           39185., supported by cDNA: gi_15215585 [Arabidopsis
           thaliana] gi|7486733|pir||T05595 hypothetical protein
           F9D16.100 - Arabidopsis thaliana
           gi|4454032|emb|CAA23029.1| putative protein [Arabidopsis
           thaliana] gi|7269211|emb|CAB79318.1| putative protein
           [Arabidopsis thaliana] gi|15215586|gb|AAK91338.1|
           AT4g23630/F9D16_100 [Arabidopsis thaliana]
           gi|21593466|gb|AAM65433.1| unknown [Arabidopsis
           thaliana] gi|22137240|gb|AAM91465.1| AT4g23630/F9D16_100
           [Arabidopsis thaliana]
          Length = 275

 Score =  166 bits (420), Expect = 3e-40
 Identities = 82/127 (64%), Positives = 98/127 (76%)
 Frame = -1

Query: 704 APPRIPVFHLPEEPFLEIASALRIEINHGFAALRDIGTGKDVKKFLIAIVGLWILSIVGS 525
           +PP+IP  H+PEEP L++AS LRIEIN GF++LR+I +G+D+KKFLIAI GLW+LSI+G 
Sbjct: 150 SPPKIPEVHIPEEPILQLASGLRIEINRGFSSLREIASGRDLKKFLIAIAGLWVLSILGG 209

Query: 524 WCNFLTLFYINFVLLLTIPFLYDKYEDKIDPLAEKAAHEIKKQYVVFDAKVLSKVLSKIP 345
             NFLTL YI  VLL T+P  YDKYEDK+DPL EKA  E+KKQY V D     KVLSKIP
Sbjct: 210 CFNFLTLAYIALVLLFTVPLAYDKYEDKVDPLGEKAMIELKKQYAVLD----EKVLSKIP 265

Query: 344 KGLLKGK 324
            G LK K
Sbjct: 266 LGPLKNK 272

>ref|NP_198975.1| putative protein; protein id: At5g41600.1, supported by cDNA:
           gi_14334529, supported by cDNA: gi_16323068 [Arabidopsis
           thaliana] gi|10178014|dbj|BAB11466.1| contains
           similarity to 24 kDa seed maturation
           protein~gene_id:MBK23.13 [Arabidopsis thaliana]
           gi|14334530|gb|AAK59673.1| unknown protein [Arabidopsis
           thaliana] gi|16323069|gb|AAL15269.1| AT5g41600/MBK23_13
           [Arabidopsis thaliana] gi|23297540|gb|AAN12890.1|
           unknown protein [Arabidopsis thaliana]
          Length = 257

 Score =  164 bits (414), Expect = 1e-39
 Identities = 79/125 (63%), Positives = 93/125 (74%)
 Frame = -1

Query: 698 PRIPVFHLPEEPFLEIASALRIEINHGFAALRDIGTGKDVKKFLIAIVGLWILSIVGSWC 519
           P IP  H+PE+P L++ S LRIEIN G   LR+I +GKDVKKF++ I GLW+LSI+GS  
Sbjct: 131 PHIPEVHIPEDPILQLVSGLRIEINRGLTLLRNIASGKDVKKFILVIAGLWVLSIIGSCY 190

Query: 518 NFLTLFYINFVLLLTIPFLYDKYEDKIDPLAEKAAHEIKKQYVVFDAKVLSKVLSKIPKG 339
           NFLTLFY   VLL TIP LY+KYEDK+D   EKA  EIKKQY V D KVL KV+SKIP+G
Sbjct: 191 NFLTLFYTATVLLFTIPVLYEKYEDKVDAYGEKAMREIKKQYAVLDEKVLRKVISKIPRG 250

Query: 338 LLKGK 324
            L  K
Sbjct: 251 ALNKK 255

>ref|NP_566065.1| expressed protein; protein id: At2g46170.1, supported by cDNA:
           gi_15450758, supported by cDNA: gi_18491130 [Arabidopsis
           thaliana] gi|25408969|pir||E84899 hypothetical protein
           At2g46170 [imported] - Arabidopsis thaliana
           gi|3702332|gb|AAC62889.1| expressed protein [Arabidopsis
           thaliana] gi|15450759|gb|AAK96651.1| At2g46170/T3F17.18
           [Arabidopsis thaliana] gi|18491131|gb|AAL69534.1|
           At2g46170/T3F17.18 [Arabidopsis thaliana]
          Length = 255

 Score =  158 bits (399), Expect = 8e-38
 Identities = 77/125 (61%), Positives = 96/125 (76%)
 Frame = -1

Query: 698 PRIPVFHLPEEPFLEIASALRIEINHGFAALRDIGTGKDVKKFLIAIVGLWILSIVGSWC 519
           P+IP  H+PEE FL +AS+LR E+N  F  LR I  G+D+KKFL+ +VGLWI+S+VG+W 
Sbjct: 131 PQIPEIHVPEEAFLVVASSLRNELNQAFVILRSIALGRDLKKFLMVVVGLWIISVVGNWF 190

Query: 518 NFLTLFYINFVLLLTIPFLYDKYEDKIDPLAEKAAHEIKKQYVVFDAKVLSKVLSKIPKG 339
           NFLTL YI FV+L T+P LY+K+EDK+DPLAEKA  E++KQYVVFD     KVLSKIP  
Sbjct: 191 NFLTLVYICFVILHTVPMLYEKHEDKVDPLAEKAMKELQKQYVVFD----EKVLSKIPIA 246

Query: 338 LLKGK 324
            LK K
Sbjct: 247 SLKAK 251

>ref|NP_192861.1| putative protein; protein id: At4g11220.1, supported by cDNA:
           23536., supported by cDNA: gi_14334419, supported by
           cDNA: gi_15081760, supported by cDNA: gi_16209686
           [Arabidopsis thaliana] gi|7486694|pir||T13013
           hypothetical protein F8L21.10 - Arabidopsis thaliana
           gi|5596468|emb|CAB51406.1| putative protein [Arabidopsis
           thaliana] gi|7267821|emb|CAB81223.1| putative protein
           [Arabidopsis thaliana] gi|14334420|gb|AAK59408.1|
           unknown protein [Arabidopsis thaliana]
           gi|15081761|gb|AAK82535.1| AT4g11220/F8L21_10
           [Arabidopsis thaliana] gi|16209687|gb|AAL14401.1|
           AT4g11220/F8L21_10 [Arabidopsis thaliana]
           gi|21592415|gb|AAM64366.1| unknown [Arabidopsis
           thaliana] gi|26983898|gb|AAN86201.1| unknown protein
           [Arabidopsis thaliana]
          Length = 271

 Score =  157 bits (397), Expect = 1e-37
 Identities = 77/127 (60%), Positives = 95/127 (74%)
 Frame = -1

Query: 704 APPRIPVFHLPEEPFLEIASALRIEINHGFAALRDIGTGKDVKKFLIAIVGLWILSIVGS 525
           +PP+IP  H+PEEP L++AS LRIEIN G ++LR+I +G+D+KKFL AI GLW+LSI+G 
Sbjct: 146 SPPKIPEVHIPEEPLLQLASGLRIEINRGISSLREIASGRDIKKFLSAIAGLWVLSILGG 205

Query: 524 WCNFLTLFYINFVLLLTIPFLYDKYEDKIDPLAEKAAHEIKKQYVVFDAKVLSKVLSKIP 345
             +FLTL YI  VLL T+P  YDKYEDK+D   EKA  E+KKQY V DA    KV SKIP
Sbjct: 206 CYSFLTLAYIALVLLFTVPLFYDKYEDKVDSYGEKAMAELKKQYAVLDA----KVFSKIP 261

Query: 344 KGLLKGK 324
           +G LK K
Sbjct: 262 RGPLKDK 268

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 593,839,344
Number of Sequences: 1393205
Number of extensions: 13241151
Number of successful extensions: 35455
Number of sequences better than 10.0: 92
Number of HSP's better than 10.0 without gapping: 34081
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 35410
length of database: 448,689,247
effective HSP length: 120
effective length of database: 281,504,647
effective search space used: 32091529758
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
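
The figures in this statistics block are related by the usual Karlin-Altschul formulas. A rough sketch of that arithmetic, assuming the query length in amino acids is the integer part of 704/3 and that the gapped Lambda and K apply to the reported scores; because the report rounds Lambda, K and the bit scores, the results only approximate the printed values:

```python
import math

# Values copied from the report above.
QUERY_NT = 704
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LEN = 120
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def effective_search_space():
    """Effective query length times effective database length."""
    query_aa = QUERY_NT // 3  # assumption: translated query length
    eff_query = query_aa - EFFECTIVE_HSP_LEN
    eff_db = DB_LETTERS - EFFECTIVE_HSP_LEN * DB_SEQUENCES
    return eff_query * eff_db


def bit_score(raw_score):
    """Convert a raw alignment score to bits: (lambda*S - ln K) / ln 2."""
    return (GAPPED_LAMBDA * raw_score - math.log(GAPPED_K)) / math.log(2)


def expect(bits, search_space):
    """Expected number of chance hits: search space * 2^(-bits)."""
    return search_space * 2.0 ** (-bits)


space = effective_search_space()
print(space)                        # compare with "effective search space used"
print(bit_score(425))               # compare with "Score = 168 bits (425)"
print(f"{expect(168, space):.1e}")  # compare with "Expect = 8e-41"
```

Under these assumptions the search space reproduces the reported 32091529758 exactly, and the top hit's bit score and E-value land close to the printed 168 bits and 8e-41.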


EST assemble image


no.  clone        accession  start  end
1    GNf010h04    BP068121       1  476
2    MR077e10_f   BP081942      39  433
3    GNf047e01    BP070843      39  441
4    GNf001g04    BP067468      40  540
5    MPD068g11_f  AV774534      74  564
6    MFB080d10_f  BP039853     181  754
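
With the assembly image unavailable, the layout can still be recovered from the table. A small sketch that computes how many ESTs cover each position, assuming the two position numbers are 1-based inclusive start and end coordinates in the assembly layout (the record does not state this explicitly):

```python
# Rows copied from the table above: (clone, accession, start, end).
ESTS = [
    ("GNf010h04", "BP068121", 1, 476),
    ("MR077e10_f", "BP081942", 39, 433),
    ("GNf047e01", "BP070843", 39, 441),
    ("GNf001g04", "BP067468", 40, 540),
    ("MPD068g11_f", "AV774534", 74, 564),
    ("MFB080d10_f", "BP039853", 181, 754),
]


def depth_at(pos):
    """Number of ESTs whose interval contains the given position."""
    return sum(start <= pos <= end for _, _, start, end in ESTS)


span_start = min(start for _, _, start, _ in ESTS)
span_end = max(end for _, _, _, end in ESTS)
max_depth = max(depth_at(p) for p in range(span_start, span_end + 1))

print(f"layout spans {span_start}-{span_end}, maximum depth {max_depth}")
```

Positions covered by a single EST (for example, the far end supported only by MFB080d10_f) are the parts of the contig with the least read support.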




Lotus japonicus
Kazusa DNA Research Institute