KMC000608A_c01

Fasta Sequence
>KMC000608A_C01 KMC000608A_c01
ggttccacaggtcatttattaactatatataaagggaaaaggataggcacaaaataatga
cacatgaaagGGTCTTAACTCTTTAGTAGAGCATTCTCTACTCTTGCTTCATATTATCAC
TCAATTACACATATGCAGCAGCTAACTGCACAACAACAATCATCGATCTCTTGGAGTACA
TATTGTATTACTATAATTTCTTATGACTTTAACTTTTCAATGAAAGAATCTGTAAAGACT
TTCGATAAAAATTGGGAACGTAATGAACAGAGGTATTAGTATTACCACTATAGAACAGGG
ACAGAAATGGACATTGGTACTGAAACATCACCCGCATAGCTTGTATCTCTAGTAGGAGCT
GAATTGCTAGCTAATGCATACCCTACACCTAATCCACCATAATGCAAAGATGCATTTTCA
CATCTCTGAAACACTCTAGCAAGTGAAGCAGCTTTCCGTCTCGCCCGCTTTGTCCCGGAG
AAAAGCAAGGTTTGAACGTAATCCAGCCAAAGCTGGAGCCTTAACCACCCTTTCAGTTGC
AGCCGCACCTCCACCCCTGCACAACTCGAGTAAAGCTGCAACTGCATTCTCTTT
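
The BLASTX hits reported below align to the query in frames -1 and -2, that is, to translations of the reverse complement. The following is a minimal sketch of how such a translation could be reproduced for the 3' end of the contig. It assumes the standard genetic code and assumes that negative frames are counted from the last base of the query, which is consistent with the query coordinates in the report. The names TAIL, reverse_complement and translate_minus_frame are illustrative and not part of any tool.

```python
# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Positions 421-594 of KMC000608A_c01 (the last three sequence lines above).
TAIL = (
    "CATCTCTGAAACACTCTAGCAAGTGAAGCAGCTTTCCGTCTCGCCCGCTTTGTCCCGGAG"
    "AAAAGCAAGGTTTGAACGTAATCCAGCCAAAGCTGGAGCCTTAACCACCCTTTCAGTTGC"
    "AGCCGCACCTCCACCCCTGCACAACTCGAGTAAAGCTGCAACTGCATTCTCTTT"
)


def reverse_complement(dna):
    """Return the reverse complement of an A/C/G/T string."""
    return dna.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_minus_frame(dna, frame):
    """Translate reverse frame -1, -2 or -3 (assumed to count from the 3' end)."""
    rc = reverse_complement(dna)
    start = -frame - 1  # frame -1 starts at the last base, -2 one base in
    codons = (rc[i:i + 3] for i in range(start, len(rc) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


frame_minus_1 = translate_minus_frame(TAIL, -1)
frame_minus_2 = translate_minus_frame(TAIL, -2)
print(frame_minus_1[:18])              # KENAVAALLELCRGGGAA
print("VQTLLFSGTKR" in frame_minus_2)  # True
```

The first peptide matches the start of the frame -1 query line (position 594) and the second matches the start of the frame -2 query line (position 497) in the alignments below.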


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC000608A_C01 KMC000608A_c01
         (594 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAL36401.1| putative arm repeat-containing protein [Arabidops...    72  4e-12
ref|NP_174228.1| arm repeat-containing protein, putative; protei...    72  4e-12
pir||T08872 hypothetical protein ARC1 - rape gi|2558938|gb|AAB97...    45  3e-07
emb|CAD20348.1| ARC1 protein [Brassica oleracea]                       44  2e-06
dbj|BAB93187.1| putative arm repeat containing protein [Oryza sa...    35  0.001

>gb|AAL36401.1| putative arm repeat-containing protein [Arabidopsis thaliana]
          Length = 729

 Score = 72.4 bits (176), Expect = 4e-12
 Identities = 40/68 (58%), Positives = 48/68 (69%)
 Frame = -2

Query: 497 VQTLLFSGTKRARRKAASLARVFQRCENASLHYGGLGVGYALASNSAPTRDTSYAGDVSV 318
           +QTLLF+GTKRARRKAASLARVFQR ENA++  G     Y    N+   RD  +  DVSV
Sbjct: 663 LQTLLFTGTKRARRKAASLARVFQRRENAAMRSG----VYGFVGNTNGNRDGGFTTDVSV 718

Query: 317 PMSISVPV 294
           P+SIS+ V
Sbjct: 719 PISISISV 726

 Score = 57.0 bits (136), Expect = 2e-07
 Identities = 28/42 (66%), Positives = 32/42 (75%)
 Frame = -1

Query: 594 KENAVAALLELCRGGGAAATERVVKAPALAGLRSNLAFLRDK 469
           KENAVAALLELCR GGAA  E+V++APA+AGL   L F   K
Sbjct: 631 KENAVAALLELCRSGGAAVAEKVLRAPAIAGLLQTLLFTGTK 672

>ref|NP_174228.1| arm repeat-containing protein, putative; protein id: At1g29340.1,
           supported by cDNA: gi_17381177 [Arabidopsis thaliana]
           gi|25354710|pir||A86416 probable arm repeat-containing
           protein - Arabidopsis thaliana
           gi|12323514|gb|AAG51726.1|AC068667_5 arm
           repeat-containing protein, putative; 6839-9028
           [Arabidopsis thaliana] gi|23297797|gb|AAN13028.1|
           putative arm repeat-containing protein [Arabidopsis
           thaliana]
          Length = 729

 Score = 72.4 bits (176), Expect = 4e-12
 Identities = 40/68 (58%), Positives = 48/68 (69%)
 Frame = -2

Query: 497 VQTLLFSGTKRARRKAASLARVFQRCENASLHYGGLGVGYALASNSAPTRDTSYAGDVSV 318
           +QTLLF+GTKRARRKAASLARVFQR ENA++  G     Y    N+   RD  +  DVSV
Sbjct: 663 LQTLLFTGTKRARRKAASLARVFQRRENAAMRSG----VYGFVGNTNGNRDGGFTTDVSV 718

Query: 317 PMSISVPV 294
           P+SIS+ V
Sbjct: 719 PISISISV 726

 Score = 57.0 bits (136), Expect = 2e-07
 Identities = 28/42 (66%), Positives = 32/42 (75%)
 Frame = -1

Query: 594 KENAVAALLELCRGGGAAATERVVKAPALAGLRSNLAFLRDK 469
           KENAVAALLELCR GGAA  E+V++APA+AGL   L F   K
Sbjct: 631 KENAVAALLELCRSGGAAVAEKVLRAPAIAGLLQTLLFTGTK 672

>pir||T08872 hypothetical protein ARC1 - rape gi|2558938|gb|AAB97738.1| arm
           repeat containing protein [Brassica napus]
          Length = 661

 Score = 45.4 bits (106), Expect(2) = 3e-07
 Identities = 22/36 (61%), Positives = 25/36 (69%)
 Frame = -1

Query: 594 KENAVAALLELCRGGGAAATERVVKAPALAGLRSNL 487
           KE A+A LL+LC  GGA  TE+VVK PALA L   L
Sbjct: 598 KEKAIATLLQLCTAGGAVVTEKVVKTPALAVLTRKL 633

 Score = 30.4 bits (67), Expect(2) = 3e-07
 Identities = 13/24 (54%), Positives = 19/24 (79%)
 Frame = -2

Query: 488 LLFSGTKRARRKAASLARVFQRCE 417
           LL +GT RA+RKA SL++V + C+
Sbjct: 633 LLLTGTDRAKRKAVSLSKVCKGCD 656

>emb|CAD20348.1| ARC1 protein [Brassica oleracea]
          Length = 285

 Score = 43.9 bits (102), Expect(2) = 2e-06
 Identities = 22/36 (61%), Positives = 25/36 (69%)
 Frame = -1

Query: 594 KENAVAALLELCRGGGAAATERVVKAPALAGLRSNL 487
           KE A+A LL+LC  GGA  TE+VVK PALA L   L
Sbjct: 228 KEKAIATLLQLCTLGGAVVTEKVVKTPALAVLTRKL 263

 Score = 29.6 bits (65), Expect(2) = 2e-06
 Identities = 13/23 (56%), Positives = 18/23 (77%)
 Frame = -2

Query: 488 LLFSGTKRARRKAASLARVFQRC 420
           LL +GT RA+RKA SL++V + C
Sbjct: 263 LLLTGTDRAKRKAVSLSKVCKGC 285

>dbj|BAB93187.1| putative arm repeat containing protein [Oryza sativa (japonica
           cultivar-group)] gi|29367589|gb|AAO72656.1| arm repeat
           protein [Oryza sativa (japonica cultivar-group)]
          Length = 680

 Score = 34.7 bits (78), Expect(2) = 0.001
 Identities = 21/45 (46%), Positives = 27/45 (59%)
 Frame = -2

Query: 488 LLFSGTKRARRKAASLARVFQRCENASLHYGGLGVGYALASNSAP 354
           L+  GT+RARRKAASL R+ +R   AS   G  G G  +A+   P
Sbjct: 631 LMSIGTERARRKAASLGRICRRWAAASAADGERGGGCPVATVVPP 675

 Score = 29.3 bits (64), Expect(2) = 0.001
 Identities = 14/29 (48%), Positives = 19/29 (65%)
 Frame = -1

Query: 594 KENAVAALLELCRGGGAAATERVVKAPAL 508
           +ENA AAL+ LCR  GA A  +V+  P +
Sbjct: 596 RENATAALVLLCRRLGAPAVTQVMAVPGV 624

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 481,763,401
Number of Sequences: 1393205
Number of extensions: 9568104
Number of successful extensions: 24225
Number of sequences better than 10.0: 19
Number of HSP's better than 10.0 without gapping: 23421
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 24188
length of database: 448,689,247
effective HSP length: 117
effective length of database: 285,684,262
effective search space used: 22854740960
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
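
The statistics above can be cross-checked against the scores in the hit list. The sketch below assumes the usual Karlin-Altschul relations: the effective database length is the letter count minus one effective HSP length per sequence, the bit score is (lambda * S - ln K) / ln 2 using the gapped lambda and K, and the expect value is the search space times 2 to the power of minus the bit score. Constant and function names are illustrative. The Expect(2) values for the lower-ranked hits use sum statistics and are not covered here.

```python
import math

# Values copied from the report above.
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 117
SEARCH_SPACE = 22_854_740_960
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def effective_db_length(letters, sequences, hsp_length):
    """Database letters minus one effective HSP length per sequence."""
    return letters - sequences * hsp_length


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect(bits, search_space=SEARCH_SPACE):
    """Expected number of chance HSPs scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


eff_db = effective_db_length(DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH)
print(eff_db)                            # 285684262
print(SEARCH_SPACE // eff_db)            # 80
print(round(bit_score(176), 1))          # 72.4
print(f"{expect(bit_score(176)):.0e}")   # 4e-12
print(f"{expect(bit_score(136)):.0e}")   # 2e-07
```

The computed database length agrees with the reported effective length of database, and the search space is exactly 80 times that value, which would correspond to an effective query length of 80 residues. The two converted raw scores, 176 and 136, agree with the reported 72.4 bits (4e-12) and 57.0 bits (2e-07).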


EST assembly


no.  clone        accession  start  end
1    GENLf026e11  BP063714       1  254
2    GNLf019d06   BP075887      64  594
3    MWL067d06_f  AV769768      75  577
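
The clone table above can be summarized as per-base coverage of the contig. This is a small sketch that reads the two numbers after each accession as 1-based, inclusive start and end coordinates on the 594-base contig. That reading is an assumption, although it is consistent with the query length reported by BLASTX. The function name coverage_depth is illustrative.

```python
CONTIG_LENGTH = 594

# (clone, start, end) as listed in the table; coordinates assumed 1-based inclusive.
CLONES = [
    ("GENLf026e11", 1, 254),
    ("GNLf019d06", 64, 594),
    ("MWL067d06_f", 75, 577),
]


def coverage_depth(clones, length):
    """Return a list with the number of clones covering each contig position."""
    depth = [0] * length
    for _name, start, end in clones:
        for pos in range(start, end + 1):
            depth[pos - 1] += 1
    return depth


depth = coverage_depth(CLONES, CONTIG_LENGTH)
print(min(depth), max(depth), depth.count(3))  # 1 3 180
```

Under that reading every position is covered by at least one clone, and positions 75 to 254 (180 bases) are covered by all three.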




Lotus japonicus
Kazusa DNA Research Institute