KMC001255A_c01

Fasta Sequence
>KMC001255A_C01 KMC001255A_c01
ggtacgggcccccctcgagttttttttttttttttttTGATCAAATTCAAACAAATTACA
ATTTCGGAGAAGGAATACAATCATTTCAATACAATCAATTTCCAAGGGGAGAGGTAGCTA
AAATTCCAATCTCTGAACTAGAGGCTAAACTACATCTCAAATGCCTAAAACTATCAACAA
ATACCAAAGGGTTCCAAATATCCATAGAAACGTAAAGGAGCAAAAAACCTAATGAAATTC
AATCACCCCCTCCTAAGTAACCCCTCGTGGTTATAACTCAGAAATCTAAGTCCAGTAATT
AATACAAGATAAACTTCTGACAACATCCCTAAAGTTGATGAGGTCAAGAGACAAATCAAG
AAGCTGGTGAAGTGCTGTTGGTTATGTTGTAAAAAAGGTCCAAAGCTTCTTCCCTATAGA
AGTTTCACCGGGACGAGGGTAATCTTCATCATCCTCTTCATCAACACCACCATGAGATTC
ACTCCGGTACTCAGAACCATGCTCTACTATATCATCTGCTGTTCCATTCACCTCCTCACT
CAAGGTCCTTGCGGTATCACCATAACCATCCTGGGTACCCACCAGTGAATGAGACTCCTC
AAGATCTATGCTTCCA


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC001255A_C01 KMC001255A_c01
         (616 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_563924.1| putative nuclear matrix constituent protein; al...    47  2e-04
gb|AAG00257.1|AC002130_22 F1N21.5 [Arabidopsis thaliana]               45  8e-04
ref|NP_176892.1| nuclear matrix constituent protein 1, putative;...    45  8e-04
dbj|BAC41822.1| unknown protein [Arabidopsis thaliana]                 45  8e-04
pir||T14321 nuclear matrix constituent protein 1 - carrot gi|219...    43  0.003

>ref|NP_563924.1| putative nuclear matrix constituent protein; alternative splicing
            isoform supported by protein homology and gene
            predictions [Arabidopsis thaliana]
            gi|22329530|ref|NP_683302.1| putative nuclear matrix
            constituent protein; protein id: At1g13220.2 [Arabidopsis
            thaliana] gi|25518674|pir||G86266 hypothetical protein
            F3F19.25 - Arabidopsis thaliana
            gi|4850405|gb|AAD31075.1|AC007357_24 Similar to gb|D64087
            nuclear matrix constituent protein 1 (NMCP1) from Daucus
            carota. [Arabidopsis thaliana]
          Length = 1128

 Score = 47.0 bits (110), Expect = 2e-04
 Identities = 24/57 (42%), Positives = 34/57 (59%), Gaps = 3/57 (5%)
 Frame = -2

Query: 546  TLSEEVNGTADDIVEHGSEYRSESHGGVDEEDDED---YPRPGETSIGKKLWTFFTT 385
            T++E+ N   D+  +   +  +E +   D++DD D    PRPGE SI KKLWTF TT
Sbjct: 1072 TVNEDTNEDGDEEEDEAQDDDNEENQDDDDDDDGDDDGSPRPGEGSIRKKLWTFLTT 1128

>gb|AAG00257.1|AC002130_22 F1N21.5 [Arabidopsis thaliana]
          Length = 1166

 Score = 45.1 bits (105), Expect = 8e-04
 Identities = 25/58 (43%), Positives = 33/58 (56%)
 Frame = -2

Query: 558  DTARTLSEEVNGTADDIVEHGSEYRSESHGGVDEEDDEDYPRPGETSIGKKLWTFFTT 385
            D +  +SE+VN T           R++S G   E+D+ D   PG+ SIGKKLWTF TT
Sbjct: 1121 DESEAMSEDVNKTP---------LRADSDG---EDDESDAEHPGKVSIGKKLWTFLTT 1166

>ref|NP_176892.1| nuclear matrix constituent protein 1, putative; protein id:
            At1g67230.1 [Arabidopsis thaliana]
          Length = 1132

 Score = 45.1 bits (105), Expect = 8e-04
 Identities = 25/58 (43%), Positives = 33/58 (56%)
 Frame = -2

Query: 558  DTARTLSEEVNGTADDIVEHGSEYRSESHGGVDEEDDEDYPRPGETSIGKKLWTFFTT 385
            D +  +SE+VN T           R++S G   E+D+ D   PG+ SIGKKLWTF TT
Sbjct: 1087 DESEAMSEDVNKTP---------LRADSDG---EDDESDAEHPGKVSIGKKLWTFLTT 1132

>dbj|BAC41822.1| unknown protein [Arabidopsis thaliana]
          Length = 471

 Score = 45.1 bits (105), Expect = 8e-04
 Identities = 25/58 (43%), Positives = 33/58 (56%)
 Frame = -2

Query: 558 DTARTLSEEVNGTADDIVEHGSEYRSESHGGVDEEDDEDYPRPGETSIGKKLWTFFTT 385
           D +  +SE+VN T           R++S G   E+D+ D   PG+ SIGKKLWTF TT
Sbjct: 426 DESEAMSEDVNKTP---------LRADSDG---EDDESDAEHPGKVSIGKKLWTFLTT 471

>pir||T14321 nuclear matrix constituent protein 1 - carrot
            gi|2190187|dbj|BAA20407.1| nuclear matrix constituent
            protein 1 (NMCP1) [Daucus carota]
          Length = 1119

 Score = 43.1 bits (100), Expect = 0.003
 Identities = 23/54 (42%), Positives = 32/54 (58%), Gaps = 1/54 (1%)
 Frame = -2

Query: 543  LSEEVNGTADDIVEHGSEYRSESHGGVDEEDDED-YPRPGETSIGKKLWTFFTT 385
            LSEEVNGT +     G + + ++ G   E++D D    PGE S+ KK+W F TT
Sbjct: 1068 LSEEVNGTPEQ--SRGYQNQGDTSGAEGEDEDGDEVEHPGEVSMRKKVWKFLTT 1119

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 529,287,033
Number of Sequences: 1393205
Number of extensions: 11464813
Number of successful extensions: 47698
Number of sequences better than 10.0: 171
Number of HSP's better than 10.0 without gapping: 39083
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 44804
length of database: 448,689,247
effective HSP length: 117
effective length of database: 285,684,262
effective search space used: 24854530794
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
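
The bit scores and E-values in the report can be roughly reproduced from the gapped statistics listed above. The sketch below assumes the standard Karlin-Altschul relations (bit score = (Lambda * raw score - ln K) / ln 2, and E = search space * 2^(-bit score)). The function names are illustrative and not part of BLAST. Because Lambda and K are printed rounded, and BLAST applies further length corrections, the results are only expected to agree with the report approximately.

```python
import math

# Values copied from the report above (gapped Lambda/K and effective search space).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
SEARCH_SPACE = 24854530794


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a bit score (Karlin-Altschul normalization)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect(bits, search_space=SEARCH_SPACE):
    """Approximate E-value for a given bit score and effective search space."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    # Raw scores shown in parentheses in the report: 110, 105, 100.
    for raw in (110, 105, 100):
        bits = bit_score(raw)
        print(f"raw={raw}  bits={bits:.1f}  E~{expect(bits):.1e}")
```

Running it on the raw scores 110, 105 and 100 gives bit scores close to the reported 47.0, 45.1 and 43.1; the E-values land in the same range as the report but need not match digit for digit.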


EST assemble image


No.  Clone         Accession  Start  End
1    SPDL038d03_f  BP054388       1  544
2    MRL037e04_f   BP085545       7  503
3    MPDL034d05_f  AV778181     103  618
4    GENLf069g05   BP066090     120  611
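
As a rough illustration of how the four clones tile the contig, the following sketch computes per-position coverage depth from the positions listed above. It assumes the two position values are 1-based, inclusive start and end coordinates in the assembly; the function names are hypothetical.

```python
from collections import Counter

# (clone, accession, start, end) copied from the listing above.
# Assumption: coordinates are 1-based and inclusive.
CLONES = [
    ("SPDL038d03_f", "BP054388", 1, 544),
    ("MRL037e04_f", "BP085545", 7, 503),
    ("MPDL034d05_f", "AV778181", 103, 618),
    ("GENLf069g05", "BP066090", 120, 611),
]


def coverage_depth(clones):
    """Return a Counter mapping each assembly position to the number of clones covering it."""
    depth = Counter()
    for _name, _accession, start, end in clones:
        for pos in range(start, end + 1):
            depth[pos] += 1
    return depth


def full_depth_region(clones):
    """Return the (start, end) interval covered by every clone, or None if there is none."""
    start = max(c[2] for c in clones)
    end = min(c[3] for c in clones)
    return (start, end) if start <= end else None


if __name__ == "__main__":
    depth = coverage_depth(CLONES)
    print("assembly span:", min(depth), "-", max(depth))
    print("maximum depth:", max(depth.values()))
    print("covered by all clones:", full_depth_region(CLONES))
```

Under that assumption, the region shared by all four clones is the overlap of the latest start and the earliest end in the listing.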




Lotus japonicus
Kazusa DNA Research Institute