KMC005960A_c01

Fasta Sequence
>KMC005960A_C01 KMC005960A_c01
cgagaatgatAATTCAAGATTTAATAATTTGAACTGATAATAAAATTAGGTCCAATGGGG
CTCACCCACCAAAAGTTCATGTTGGACCAAATGTGAATTACAAAAGAATTGTTCTATTTG
TAAACTATCTGTAAATGACTAAATGTATCTTAATTTTACAAGACAAATACATGTACAAAA
CAACCCCAGTTAGATCCAAATGACTACTGTCATATGCGGTGGTATGGCCTCTTCATTGGA
GATTTTGGCCCTAATGAGGATGGTGGGAATGATCCACCCCAAGGATTTGGAGCTAGTGCT
TCTTGATGTAATGACTCAACACCACCCCGGAAACTTTGAAATGAAACAATACCAGATTGT
ACAGAAGGAATAAAAGAACCGCCAGAAGCATTTGGTTGTAAGGTGAAAGGAGAAGGAACT
ACCCCTGGACCATGATGCACCTCAATATTTGACCTAGGAGCAGGAAGAGGAGGTTGATAG
GAAACTGAGGGAGGGAAAGAAGCATGTGCATTATGTGCAACTTTACCAAGAG


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC005960A_C01 KMC005960A_c01
         (532 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

dbj|BAC65569.1| mKIAA0458 protein [Mus musculus]                       37  0.11
gb|EAA33893.1| hypothetical protein [Neurospora crassa]                37  0.19
ref|XP_208528.1| hypothetical protein XP_208528 [Homo sapiens] g...    37  0.19
ref|NP_446337.1| arginine-glutamic acid dipeptide (RE) repeats; ...    37  0.19
ref|NP_036234.2| arginine-glutamic acid dipeptide (RE) repeats; ...    36  0.32

>dbj|BAC65569.1| mKIAA0458 protein [Mus musculus]
          Length = 882

 Score = 37.4 bits (85), Expect = 0.11
 Identities = 29/98 (29%), Positives = 34/98 (34%), Gaps = 5/98 (5%)
 Frame = -3

Query: 515 HNAHASFPPSVSYQ---PPLPA--PRSNIEVHHGPGVVPSPFTLQPNASGGSFIPSVQSG 351
           H  H S P   S     PP PA  P S++  HH P   P P  L P +      P+   G
Sbjct: 267 HPPHLSGPSPFSLNANLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQPLPSSPAQPPG 326

Query: 350 IVSFQSFRGGVESLHQEALAPNPWGGSFPPSSLGPKSP 237
           +   QS      S     L   P    FP     P  P
Sbjct: 327 LTQSQSLPPPAASHPTTGLHQVPSQSPFPQHPFVPGGP 364
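The "Frame = -3" line and the descending query coordinates (515 down to 237) indicate that BLASTX translated the reverse complement of the nucleotide query. The following is a minimal sketch of such a six-frame translation, assuming the standard genetic code; the names `reverse_complement` and `translate_frame` are illustrative and are not part of BLAST.

```python
# Standard genetic code, indexed in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def translate_frame(seq, frame):
    """Translate seq in frame +1..+3 or -1..-3 (hypothetical helper).

    Negative frames are read from the 3' end of the reverse complement,
    so frame -1 starts at the last base of the original sequence.
    Codons containing anything other than A, C, G, T become 'X'.
    """
    if frame not in (1, 2, 3, -1, -2, -3):
        raise ValueError("frame must be one of +/-1, +/-2, +/-3")
    strand = seq.upper() if frame > 0 else reverse_complement(seq).upper()
    offset = abs(frame) - 1
    codons = (strand[i:i + 3] for i in range(offset, len(strand) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Bases 510-515 of the contig above read ATTATG on the forward strand.
print(translate_frame("ATTATG", -1))
```

Read this way, those six bases give the residues HN, which are the first two query residues shown at position 515 in the alignment.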

>gb|EAA33893.1| hypothetical protein [Neurospora crassa]
          Length = 1551

 Score = 36.6 bits (83), Expect = 0.19
 Identities = 22/56 (39%), Positives = 26/56 (46%)
 Frame = -3

Query: 491 PSVSYQPPLPAPRSNIEVHHGPGVVPSPFTLQPNASGGSFIPSVQSGIVSFQSFRG 324
           P  +   P+P P SN  V   P   PSP    P  SGG   PS  S  ++F SF G
Sbjct: 132 PHAAGSVPIPIPGSNARVP-SPAASPSPIPQVPQQSGGQRAPSNISAPMTFGSFPG 186

>ref|XP_208528.1| hypothetical protein XP_208528 [Homo sapiens]
           gi|27734821|ref|NP_775849.1| hypothetical protein
           FLJ90834 [Homo sapiens] gi|22761324|dbj|BAC11541.1|
           unnamed protein product [Homo sapiens]
          Length = 251

 Score = 36.6 bits (83), Expect = 0.19
 Identities = 26/82 (31%), Positives = 37/82 (44%), Gaps = 1/82 (1%)
 Frame = -3

Query: 479 YQPPLPAPRSNIEVHHGPGVVPSPFTLQPNA-SGGSFIPSVQSGIVSFQSFRGGVESLHQ 303
           YQPP PA R   + +   G++P P    P   SG  ++P++   ++      G    +  
Sbjct: 69  YQPPQPASRPQAKRYQ--GLLPVPLAPHPLCLSGQLYLPNIPCTVID-----GCGPVISH 121

Query: 302 EALAPNPWGGSFPPSSLGPKSP 237
             L   PWG   PPS LG  SP
Sbjct: 122 LKLTMYPWG--LPPSHLGSSSP 141

>ref|NP_446337.1| arginine-glutamic acid dipeptide (RE) repeats; atrophin-1 related
           protein [Rattus norvegicus] gi|11360394|pir||T42731
           atrophin-1 related protein - rat
           gi|1209103|gb|AAA98970.1| atrophin-1 related protein
           [Rattus norvegicus]
          Length = 1006

 Score = 36.6 bits (83), Expect = 0.19
 Identities = 30/108 (27%), Positives = 41/108 (37%), Gaps = 15/108 (13%)
 Frame = -3

Query: 515 HNAHASFPPSVSYQ---PPLPA--PRSNIEVHHGPGVVPSPFTLQPNASGGSFIPSVQSG 351
           H  H S P   S     PP PA  P S++  HH P   P P  L P +      P+   G
Sbjct: 385 HPPHLSGPSPFSMNANLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQPLPSSPAQPPG 444

Query: 350 IVSFQSF---------RGGVESLHQEALAP-NPWGGSFPPSSLGPKSP 237
           +   QS           GG+  +  ++  P +P+    PP    P  P
Sbjct: 445 LTQSQSLPPPAASHPTTGGLHQVPSQSPFPQHPFVPGGPPPITPPSCP 492

>ref|NP_036234.2| arginine-glutamic acid dipeptide (RE) repeats; atrophin 1-like;
            arginine glutamic acid dipeptide RE repeats [Homo
            sapiens] gi|8096340|dbj|BAA95898.1| RERE [Homo sapiens]
          Length = 1566

 Score = 35.8 bits (81), Expect = 0.32
 Identities = 32/107 (29%), Positives = 40/107 (36%), Gaps = 14/107 (13%)
 Frame = -3

Query: 515  HNAHASFPPSVSYQ---PPLPA--PRSNIEVHHGPGVVPSPFTLQPNASGGSFIPSVQSG 351
            H  H S P   S     PP PA  P S++  HH P   P P  L P +      P+   G
Sbjct: 947  HPPHLSGPSPFSMNANLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQPLPSSPAQPPG 1006

Query: 350  IVSFQSFRGGVES-----LHQEA----LAPNPWGGSFPPSSLGPKSP 237
            +   Q+      S     LHQ A     A +P+    PP    P  P
Sbjct: 1007 LTQSQNLPPPPASHPPTGLHQVAPQPPFAQHPFVPGGPPPITPPTCP 1053

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 506,665,412
Number of Sequences: 1393205
Number of extensions: 12295614
Number of successful extensions: 35383
Number of sequences better than 10.0: 71
Number of HSP's better than 10.0 without gapping: 32022
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 35169
length of database: 448,689,247
effective HSP length: 115
effective length of database: 288,470,672
effective search space used: 17596710992
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
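
The statistics above are enough to reproduce the reported scores approximately. The sketch below assumes the usual Karlin-Altschul relations (bit score S' = (lambda*S - ln K) / ln 2 and E = K * N * exp(-lambda*S), with N the effective search space) and uses the gapped Lambda and K printed in this report. Because those parameters are printed in rounded form, the results only approximate the reported values. The function names are illustrative.

```python
import math

# Values copied from the report above.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 115
SEARCH_SPACE = 17_596_710_992


def effective_db_length(letters, sequences, hsp_length):
    """Database length after subtracting the HSP length from every sequence."""
    return letters - sequences * hsp_length


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space=SEARCH_SPACE,
                 lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Expected number of chance alignments scoring at least raw_score."""
    return k * search_space * math.exp(-lam * raw_score)


if __name__ == "__main__":
    print(effective_db_length(DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH))
    for raw in (85, 83, 81):
        print(raw, round(bit_score(raw), 1), round(expect_value(raw), 2))
```

For the top hit (raw score 85) this gives a bit score close to the reported 37.4 and an E-value of about 0.10 against the reported 0.11. The effective database length comes out at exactly the 288,470,672 shown above.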


EST assemble image


No.  Clone        Accession  Start  End
1    GENLf016e12  BP063217       1  486
2    SPD019a03_f  BP045459      11  402
3    SPD088d10_f  BP051031      26  532
4    MFB039f07_f  BP036871      34  447




Lotus japonicus
Kazusa DNA Research Institute