KMC000525A_c01

Fasta Sequence
>KMC000525A_C01 KMC000525A_c01
atgtaaataaataaTAATTCGGCAGCAAAGTATAAACTACACTGAAATATGTTGTGCATT
TACATATATCATATCTCAATGATTAATTTACCAGTTTGGAAATTTTGGTGTTAACAATTA
AGTACATGCAAATTCCCTAGTTTACTTGTTCCATGGTGATGTTTTCTAGAATCACTTCTT
AGCTTCTGAGTATCTTGAAACTAAACATGCACTAACTAACGAAAGAGCTCTGGGAAAGAC
CACATGATGAACCTGACAAATATGTCACACTCCAAGAATTCAATATGAGCGGCGCGTCCA
ATGAGAGTGTTGAATCCCCTTCCATAGGAAGGAGGTTTCGAAGTTAATGTCACAGCGCAA
GACCACACGATGATCAGAGTGAGTCCGCAATTGGTCCAGGCAATTATTTAACATATCCAG
GAACACTTTCCCTTGTTTAGAAAAATCTGAAGAAGCTGCTGGACACAGTTCTATTCTGGC
TGAATGGTACGGAACATAACCATCCTGAGGAGAGGATAAAAGAATCACATTCTTGAAGTT
TGCCAGTGTCTTCATCTTCGAAAGATT
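
The BLASTX report for this contig gives its alignments in reverse reading frames (Frame = -1 and -2), with query coordinates counting down from 564. As a minimal sketch of how those coordinates map back to the nucleotide sequence, the Python below reverse-complements the last 27 bases of the contig (positions 541-567) and translates them with the standard genetic code. The function names and the `TAIL_START` offset are illustrative assumptions, not part of the Kazusa or BLAST tooling.

```python
# Standard genetic code, with codon positions ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Last line of the FASTA record above: contig positions 541..567 (1-based).
TAIL = "TGCCAGTGTCTTCATCTTCGAAAGATT"
TAIL_START = 541  # assumed 1-based coordinate of TAIL[0]


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA string."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_minus_strand(seq, seq_start, high_pos, low_pos):
    """Translate the minus strand from high_pos down to low_pos (inclusive)."""
    segment = seq[low_pos - seq_start : high_pos - seq_start + 1]
    rc = reverse_complement(segment)
    codons = (rc[i : i + 3] for i in range(0, len(rc) - len(rc) % 3, 3))
    return "".join(CODON_TABLE[c] for c in codons)


# Query coordinates 564 down to 541 cover eight codons on the minus strand.
peptide = translate_minus_strand(TAIL, TAIL_START, 564, 541)
print(peptide)
```

The printed peptide can be compared with the first residues of the `Query: 564` lines in the Frame = -1 alignments.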


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC000525A_C01 KMC000525A_c01
         (567 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAL86333.1| unknown protein [Arabidopsis thaliana] gi|2168982...   108  1e-35
ref|NP_564732.1| expressed protein; protein id: At1g58350.1, sup...   107  2e-35
ref|NP_172469.1| hypothetical protein; protein id: At1g09980.1 [...   105  9e-35
pir||T00618 hypothetical protein T27I1.1 - Arabidopsis thaliana ...    63  5e-22
ref|NP_728588.1| CG32333-PA [Drosophila melanogaster] gi|2309274...    61  6e-12

>gb|AAL86333.1| unknown protein [Arabidopsis thaliana] gi|21689821|gb|AAM67554.1|
           unknown protein [Arabidopsis thaliana]
          Length = 802

 Score =  108 bits (271), Expect(2) = 1e-35
 Identities = 53/89 (59%), Positives = 65/89 (72%), Gaps = 5/89 (5%)
 Frame = -1

Query: 564 LSKMKTLANFKNVILLSSPQDGYVPYHSARIELCPAASSDFSKQGKVFLDMLNNCLDQLR 385
           L K KTL +FKN+ILLSSPQDGYVPYHSARIE C  AS D SK+G  FL+MLNNC+DQ+R
Sbjct: 683 LCKQKTLCSFKNIILLSSPQDGYVPYHSARIESCQPASFDNSKRGVAFLEMLNNCMDQIR 742

Query: 384 -----THSDHRVVLRCDINFETSFLWKGI 313
                T    RV +RCD+NF+T+   + +
Sbjct: 743 GPSPETPHHQRVFMRCDVNFDTTLYGRNL 771

 Score = 62.8 bits (151), Expect(2) = 1e-35
 Identities = 27/36 (75%), Positives = 30/36 (83%)
 Frame = -2

Query: 326 YGRGFNTLIGRAAHIEFLECDIFVRFIMWSFPELFR 219
           YGR  N+ IGRAAHIEFLE D+F RFIMWSF +LFR
Sbjct: 767 YGRNLNSFIGRAAHIEFLESDVFARFIMWSFQDLFR 802

>ref|NP_564732.1| expressed protein; protein id: At1g58350.1, supported by cDNA:
           gi_6520166 [Arabidopsis thaliana]
           gi|25333053|pir||T52441 hypothetical protein ZW18
           [imported] - Arabidopsis thaliana
           gi|6520167|dbj|BAA87940.1| ZW18 [Arabidopsis thaliana]
           gi|8979939|gb|AAF82253.1|AC008051_4 Identical to gene
           ZW18 from Arabidopsis thaliana gb|AB028199
          Length = 794

 Score =  107 bits (267), Expect(2) = 2e-35
 Identities = 53/81 (65%), Positives = 60/81 (73%), Gaps = 5/81 (6%)
 Frame = -1

Query: 564 LSKMKTLANFKNVILLSSPQDGYVPYHSARIELCPAASSDFSKQGKVFLDMLNNCLDQLR 385
           L K KTL NFKN+ILLSSPQDGYVPYHSARIE C  AS D SK+G  FL+MLNNCLDQ+R
Sbjct: 675 LCKQKTLENFKNIILLSSPQDGYVPYHSARIESCQPASFDSSKRGVAFLEMLNNCLDQIR 734

Query: 384 -----THSDHRVVLRCDINFE 337
                     RV +RCD+NF+
Sbjct: 735 GPVPEAPHQQRVFMRCDVNFD 755

 Score = 63.2 bits (152), Expect(2) = 2e-35
 Identities = 28/36 (77%), Positives = 30/36 (82%)
 Frame = -2

Query: 326 YGRGFNTLIGRAAHIEFLECDIFVRFIMWSFPELFR 219
           YGR  N+ IGRAAHIEFLE DIF RFIMWSF +LFR
Sbjct: 759 YGRNLNSFIGRAAHIEFLESDIFARFIMWSFQDLFR 794

>ref|NP_172469.1| hypothetical protein; protein id: At1g09980.1 [Arabidopsis
           thaliana] gi|2160190|gb|AAB60753.1| F21M12.37 gene
           product [Arabidopsis thaliana]
          Length = 553

 Score =  105 bits (263), Expect(2) = 9e-35
 Identities = 51/85 (60%), Positives = 63/85 (74%), Gaps = 5/85 (5%)
 Frame = -1

Query: 552 KTLANFKNVILLSSPQDGYVPYHSARIELCPAASSDFSKQGKVFLDMLNNCLDQLR---- 385
           KTL +FKN+ILLSSPQDGYVPYHSARIE C  AS D SK+G  FL+MLNNC+DQ+R    
Sbjct: 438 KTLCSFKNIILLSSPQDGYVPYHSARIESCQPASFDNSKRGVAFLEMLNNCMDQIRGPSP 497

Query: 384 -THSDHRVVLRCDINFETSFLWKGI 313
            T    RV +RCD+NF+T+   + +
Sbjct: 498 ETPHHQRVFMRCDVNFDTTLYGRNL 522

 Score = 62.8 bits (151), Expect(2) = 9e-35
 Identities = 27/36 (75%), Positives = 30/36 (83%)
 Frame = -2

Query: 326 YGRGFNTLIGRAAHIEFLECDIFVRFIMWSFPELFR 219
           YGR  N+ IGRAAHIEFLE D+F RFIMWSF +LFR
Sbjct: 518 YGRNLNSFIGRAAHIEFLESDVFARFIMWSFQDLFR 553

>pir||T00618 hypothetical protein T27I1.1 - Arabidopsis thaliana
           gi|3540180|gb|AAC34330.1| Unknown protein [Arabidopsis
           thaliana]
          Length = 837

 Score = 63.2 bits (152), Expect(2) = 5e-22
 Identities = 31/64 (48%), Positives = 42/64 (65%), Gaps = 5/64 (7%)
 Frame = -1

Query: 489 YHSARIELCPAASSDFSKQGKVFLDMLNNCLDQLR-----THSDHRVVLRCDINFETSFL 325
           + SARIE C  AS D SK+G  FL+MLNNC+DQ+R     T    RV +RCD+NF+T+  
Sbjct: 743 FASARIESCQPASFDNSKRGVAFLEMLNNCMDQIRGPSPETPHHQRVFMRCDVNFDTTLY 802

Query: 324 WKGI 313
            + +
Sbjct: 803 GRNL 806

 Score = 62.8 bits (151), Expect(2) = 5e-22
 Identities = 27/36 (75%), Positives = 30/36 (83%)
 Frame = -2

Query: 326 YGRGFNTLIGRAAHIEFLECDIFVRFIMWSFPELFR 219
           YGR  N+ IGRAAHIEFLE D+F RFIMWSF +LFR
Sbjct: 802 YGRNLNSFIGRAAHIEFLESDVFARFIMWSFQDLFR 837

>ref|NP_728588.1| CG32333-PA [Drosophila melanogaster] gi|23092748|gb|AAF47465.2|
            CG32333-PA [Drosophila melanogaster]
          Length = 1489

 Score = 60.8 bits (146), Expect(2) = 6e-12
 Identities = 28/56 (50%), Positives = 39/56 (69%)
 Frame = -1

Query: 564  LSKMKTLANFKNVILLSSPQDGYVPYHSARIELCPAASSDFSKQGKVFLDMLNNCL 397
            LS+  TL +FKN++L  S QD YVP HSAR+ELC AA  D S  G ++ +M++N +
Sbjct: 1378 LSQRSTLHHFKNILLCGSSQDRYVPAHSARLELCKAAMRDSSSLGTIYREMVHNII 1433

 Score = 30.8 bits (68), Expect(2) = 6e-12
 Identities = 13/23 (56%), Positives = 17/23 (73%)
 Frame = -2

Query: 311  NTLIGRAAHIEFLECDIFVRFIM 243
            NTLIGRAAHI  L+ ++F+   M
Sbjct: 1458 NTLIGRAAHIAVLDSELFIEKFM 1480

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 462,377,451
Number of Sequences: 1393205
Number of extensions: 9255648
Number of successful extensions: 20184
Number of sequences better than 10.0: 28
Number of HSP's better than 10.0 without gapping: 19787
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 20178
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 20669577624
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
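
The length figures in the statistics above are related by simple arithmetic, which can be checked with a short Python sketch. The effective database length is the raw length minus one effective HSP length per database sequence. Reading the ratio of the search space to that length as the effective query length (in residues) is an assumption made here for illustration; the report itself does not print that value.

```python
# Values copied from the BLASTX statistics block.
db_letters = 448_689_247
db_sequences = 1_393_205
effective_hsp_length = 116
reported_effective_db_length = 287_077_467
reported_search_space = 20_669_577_624

# Each database sequence is shortened by the effective HSP length.
effective_db_length = db_letters - db_sequences * effective_hsp_length
print(effective_db_length == reported_effective_db_length)

# Assumption: search space = effective query length * effective database length.
implied_query_length, remainder = divmod(reported_search_space, effective_db_length)
print(implied_query_length, remainder)
```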


EST assemble image


no.  clone          accession  start  end
1    GENLf021d08    BP063457       1  419
2    MPDL077f05_f   AV780482      15  528
3    SPDL040f12_f   BP054537      15  568
4    SPDL042a09_f   BP054622      19  430
5    SPDL026e01_f   BP053610      39  566
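
As a rough sketch of how the clone positions above could be summarized, the Python below computes each clone's span and the per-position coverage depth. It assumes the two position numbers are 1-based, inclusive start and end coordinates in the assembly; the variable and function names are illustrative only.

```python
# Clone name, accession, start, end, copied from the table above.
ROWS = """\
GENLf021d08 BP063457 1 419
MPDL077f05_f AV780482 15 528
SPDL040f12_f BP054537 15 568
SPDL042a09_f BP054622 19 430
SPDL026e01_f BP053610 39 566
"""


def parse_rows(text):
    """Parse whitespace-separated rows into (clone, accession, start, end)."""
    records = []
    for line in text.splitlines():
        clone, accession, start, end = line.split()
        records.append((clone, accession, int(start), int(end)))
    return records


records = parse_rows(ROWS)

# Span of each clone, assuming inclusive coordinates.
lengths = [end - start + 1 for _, _, start, end in records]

# Overall extent of the assembly and coverage depth at each position.
first = min(start for _, _, start, _ in records)
last = max(end for _, _, _, end in records)
depth = [
    sum(start <= pos <= end for _, _, start, end in records)
    for pos in range(first, last + 1)
]

print(lengths)
print(first, last, max(depth), min(depth))
```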




Lotus japonicus
Kazusa DNA Research Institute