KMC005337A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC005337A_C01 KMC005337A_c01
tttggctgctttgctgttgttgttgttttctcgaaagatggcgacCATGAACCCTTTGGA
TGGGTTGGGTGATGACGCTGAGGATCCTTCTCAGCTCATCGCTGCCGAACAGCTCAAGGC
TGCGGCCGCGGCGGCGACTGCTCCTCCCAAGAAAGCGGCGGGGGCTCAGGACCAGGGCAA
GCAGGCCAAACCGGCTCAGATGCCTTCCAAGCCACTTCCTCCTGGCTCAGGCTGTGAGGG
ATGCCAGAAATGAACCATCACGAGGAGGCCGTGGAGGTGCAAGAGGTGCTGGTCGTGGAT
TTGGACGTGGTCGTGGTTTCAATCGTGACTTCTCCAATGATGAGAACTCATTCCCTGCCT
CTGGAGCCCCTGATAATCTGGGTCCTTTTGAAGGTGATTCTGAGAAGGCTTCAGAAAGGC
GTGGTTATGGTGCACCACGAGGTCCTTATCGTGGTGGTGGTGGTGGTCGACGTGGAGGCT
TCAGCAATGGTGAAGCTGATGAAGAAGGACGACCTCGAAGAGCATTTGAACGCCACAGTG
GAACTGGCCGAGGAAGTGGATTCAAACGTGAAGGCGCTG


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC005337A_C01 KMC005337A_c01
         (579 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

pir||T12180 probable transcription factor - fava bean gi|2104681...   172  2e-42
ref|NP_193416.1| nuclear antigen homolog; protein id: At4g16830....   104  1e-29
gb|AAM61393.1| nuclear antigen homolog [Arabidopsis thaliana]          97  5e-27
ref|NP_199532.1| putative protein; protein id: At5g47210.1, supp...    89  8e-27
gb|AAM63072.1| nuclear RNA binding protein A-like protein [Arabi...    89  8e-27

>pir||T12180 probable transcription factor - fava bean
           gi|2104681|emb|CAA66481.1| transcription factor [Vicia
           faba]
          Length = 370

 Score =  172 bits (437), Expect = 2e-42
 Identities = 98/148 (66%), Positives = 109/148 (73%), Gaps = 6/148 (4%)
 Frame = +3

Query: 153 KRRGLRTRASRPNRLRCLPSHFLLAQAVRDARNEPSRGGRGGARGAGRGFG-----RGRG 317
           K +G R + ++P +L   P+    AQAVR++RNE  RG RGG RG GRGFG     RGRG
Sbjct: 39  KDQGKRAQTNKPAQLPSKPAP--PAQAVRESRNEGGRGSRGG-RGGGRGFGGERGGRGRG 95

Query: 318 FNRDFS-NDENSFPASGAPDNLGPFEGDSEKASERRGYGAPRGPYRGGGGGRRGGFSNGE 494
           F RDFS NDENSFPAS APD+ G  EG  EK SERRG+G PR PYRGG   RRGGFSNGE
Sbjct: 96  FGRDFSSNDENSFPASRAPDSQGAVEG--EKFSERRGFGGPRPPYRGG---RRGGFSNGE 150

Query: 495 ADEEGRPRRAFERHSGTGRGSGFKREGA 578
             EEGRPRRAFERHSGTGRGS FKR+GA
Sbjct: 151 GGEEGRPRRAFERHSGTGRGSEFKRDGA 178

 Score = 77.4 bits (189), Expect = 1e-13
 Identities = 43/65 (66%), Positives = 46/65 (70%), Gaps = 3/65 (4%)
 Frame = +2

Query: 38  MATMNPLDGLGDDAEDPSQLIAAEQLKAAAAAATAPPKKAAGAQDQGKQA---KPAQMPS 208
           MAT NP D LGDD EDPSQLI  EQLKAAAA     P K A  +DQGK+A   KPAQ+PS
Sbjct: 1   MATTNPFDLLGDDVEDPSQLIITEQLKAAAA-----PTKKAAEKDQGKRAQTNKPAQLPS 55

Query: 209 KPLPP 223
           KP PP
Sbjct: 56  KPAPP 60

>ref|NP_193416.1| nuclear antigen homolog; protein id: At4g16830.1, supported by
           cDNA: 118826., supported by cDNA: gi_17380879, supported
           by cDNA: gi_20465928 [Arabidopsis thaliana]
           gi|7488137|pir||F71435 probable nuclear antigen -
           Arabidopsis thaliana gi|2245037|emb|CAB10456.1| nuclear
           antigen homolog [Arabidopsis thaliana]
           gi|6492264|gb|AAF14243.1|AF110227_1 nuclear RNA binding
           protein [Arabidopsis thaliana]
           gi|7268434|emb|CAB80954.1| nuclear antigen homolog
           [Arabidopsis thaliana] gi|17380880|gb|AAL36252.1|
           putative nuclear antigen homolog [Arabidopsis thaliana]
           gi|20465929|gb|AAM20150.1| putative nuclear antigen-like
           protein [Arabidopsis thaliana]
           gi|22022571|gb|AAM83242.1| AT4g16830/dl4440w
           [Arabidopsis thaliana] gi|23308317|gb|AAN18128.1|
           At4g16830/dl4440w [Arabidopsis thaliana]
          Length = 355

 Score =  104 bits (260), Expect(2) = 1e-29
 Identities = 73/125 (58%), Positives = 83/125 (66%), Gaps = 7/125 (5%)
 Frame = +3

Query: 225 AQAVRDARNEPSRGGRGGARGAGRGFGRGRG-FNRDFSNDENSFPASGAPDNLGPFEGDS 401
           AQAVR+AR++  RGG  G RG   GF RGRG +NRD   D N+  + G     G  EGD 
Sbjct: 55  AQAVREARSDAPRGG--GGRG---GFNRGRGGYNRD---DGNNGYSGGYTKPSG--EGDV 104

Query: 402 EKAS-ERRGYG-APRGPYRGGGGG----RRGGFSNGEADEEGRPRRAFERHSGTGRGSGF 563
            K+S ERRG G APRG +RG GGG    RRGGFSN   D E RPRRAFER SGTGRGS F
Sbjct: 105 SKSSYERRGGGGAPRGSFRGEGGGPGGGRRGGFSNEGGDGE-RPRRAFERRSGTGRGSDF 163

Query: 564 KREGA 578
           KR+G+
Sbjct: 164 KRDGS 168

 Score = 47.0 bits (110), Expect(2) = 1e-29
 Identities = 28/66 (42%), Positives = 36/66 (54%), Gaps = 4/66 (6%)
 Frame = +2

Query: 38  MATMNPLDGLGDDAEDPSQLIAA----EQLKAAAAAATAPPKKAAGAQDQGKQAKPAQMP 205
           MAT+NP D L DDAEDPSQL  A    ++ K +   ++ P K A             ++P
Sbjct: 1   MATLNPFDLLDDDAEDPSQLAVAIEKIDKSKKSGQVSSLPAKSA------------PKLP 48

Query: 206 SKPLPP 223
           SKPLPP
Sbjct: 49  SKPLPP 54

>gb|AAM61393.1| nuclear antigen homolog [Arabidopsis thaliana]
          Length = 354

 Score = 97.1 bits (240), Expect(2) = 5e-27
 Identities = 69/125 (55%), Positives = 80/125 (63%), Gaps = 7/125 (5%)
 Frame = +3

Query: 225 AQAVRDARNEPSRGGRGGARGAGRGFGRGRG-FNRDFSNDENSFPASGAPDNLGPFEGDS 401
           AQAVR+AR++  RG     RG G GF RG G +NRD   D N+  + G     G  EGD 
Sbjct: 55  AQAVREARSDAPRG-----RGRG-GFSRGHGGYNRD---DGNNGYSGGYTKPSG--EGDV 103

Query: 402 EKAS-ERRGYG-APRGPYRGGGGG----RRGGFSNGEADEEGRPRRAFERHSGTGRGSGF 563
            K+S ERRG G APRG +RG GGG    RRGGFSN E  E  RPRR +ER SGTGRGS F
Sbjct: 104 SKSSYERRGGGGAPRGSFRGEGGGPGGGRRGGFSN-EGGEGERPRRTYERRSGTGRGSDF 162

Query: 564 KREGA 578
           KR+G+
Sbjct: 163 KRDGS 167

 Score = 45.8 bits (107), Expect(2) = 5e-27
 Identities = 27/66 (40%), Positives = 36/66 (53%), Gaps = 4/66 (6%)
 Frame = +2

Query: 38  MATMNPLDGLGDDAEDPSQLIAA----EQLKAAAAAATAPPKKAAGAQDQGKQAKPAQMP 205
           MAT+NP D L DDAEDPSQL  +    ++ K +   ++ P K A             ++P
Sbjct: 1   MATLNPFDLLDDDAEDPSQLAVSIEKIDKSKKSGPVSSLPAKSA------------PKLP 48

Query: 206 SKPLPP 223
           SKPLPP
Sbjct: 49  SKPLPP 54

>ref|NP_199532.1| putative protein; protein id: At5g47210.1, supported by cDNA:
           19104. [Arabidopsis thaliana] gi|8809603|dbj|BAA97154.1|
           gene_id:MQL5.6~pir||G71444~similar to unknown protein
           [Arabidopsis thaliana] gi|22655182|gb|AAM98181.1|
           putative protein [Arabidopsis thaliana]
          Length = 357

 Score = 89.0 bits (219), Expect(2) = 8e-27
 Identities = 60/127 (47%), Positives = 74/127 (58%), Gaps = 10/127 (7%)
 Frame = +3

Query: 225 AQAVRDARNEPSRGGRGGARGAGRGFGRGRG---FNRDFSNDENSFPASGAPDNLGPFEG 395
           +QAVR++RN P +GGRGG  G G GF RGRG   +NRD  N++       AP N   F G
Sbjct: 51  SQAVRESRNAP-QGGRGGTGGRG-GFSRGRGNGGYNRDNRNND-------APGNENGFSG 101

Query: 396 DSEKASERRGYGAPRGPYRGG---GGGR----RGGFSNGEADEEGRPRRAFERHSGTGRG 554
              + SE    GA RG   GG   GGGR    RGG +NGE+ +  RP R ++RHS TG G
Sbjct: 102 GYRRPSEDAD-GASRGGSVGGYRVGGGREGPRRGGVANGESGDVERPPRNYDRHSRTGHG 160

Query: 555 SGFKREG 575
           +G KR G
Sbjct: 161 TGMKRNG 167

 Score = 53.1 bits (126), Expect(2) = 8e-27
 Identities = 30/62 (48%), Positives = 34/62 (54%)
 Frame = +2

Query: 38  MATMNPLDGLGDDAEDPSQLIAAEQLKAAAAAATAPPKKAAGAQDQGKQAKPAQMPSKPL 217
           MA++NP D LGDDAEDPSQL  A   K   AAA   P KA            A+ P+KP 
Sbjct: 1   MASLNPFDLLGDDAEDPSQLAVALSQKVEKAAAAVQPPKA------------AKFPTKPA 48

Query: 218 PP 223
           PP
Sbjct: 49  PP 50

>gb|AAM63072.1| nuclear RNA binding protein A-like protein [Arabidopsis thaliana]
          Length = 357

 Score = 89.0 bits (219), Expect(2) = 8e-27
 Identities = 60/127 (47%), Positives = 74/127 (58%), Gaps = 10/127 (7%)
 Frame = +3

Query: 225 AQAVRDARNEPSRGGRGGARGAGRGFGRGRG---FNRDFSNDENSFPASGAPDNLGPFEG 395
           +QAVR++RN P +GGRGG  G G GF RGRG   +NRD  N++       AP N   F G
Sbjct: 51  SQAVRESRNAP-QGGRGGTGGRG-GFSRGRGNGGYNRDNRNND-------APGNENGFSG 101

Query: 396 DSEKASERRGYGAPRGPYRGG---GGG----RRGGFSNGEADEEGRPRRAFERHSGTGRG 554
              + SE    GA RG   GG   GGG    RRGG +NGE+ +  RP R ++RHS TG G
Sbjct: 102 GYRRPSEDAD-GASRGGSVGGYRVGGGLEGPRRGGVANGESGDVERPPRNYDRHSRTGHG 160

Query: 555 SGFKREG 575
           +G KR G
Sbjct: 161 TGMKRNG 167

 Score = 53.1 bits (126), Expect(2) = 8e-27
 Identities = 30/62 (48%), Positives = 34/62 (54%)
 Frame = +2

Query: 38  MATMNPLDGLGDDAEDPSQLIAAEQLKAAAAAATAPPKKAAGAQDQGKQAKPAQMPSKPL 217
           MA++NP D LGDDAEDPSQL  A   K   AAA   P KA            A+ P+KP 
Sbjct: 1   MASLNPFDLLGDDAEDPSQLAVALSQKVEKAAAAVQPPKA------------AKFPTKPA 48

Query: 218 PP 223
           PP
Sbjct: 49  PP 50

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 611,367,225
Number of Sequences: 1393205
Number of extensions: 18112776
Number of successful extensions: 277100
Number of sequences better than 10.0: 5310
Number of HSP's better than 10.0 without gapping: 106430
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 210279
length of database: 448,689,247
effective HSP length: 117
effective length of database: 285,684,262
effective search space used: 21426319650
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 SPD084d12_f BP050710 1 547
2 MPD060g09_f AV774045 45 510
3 MPD087f03_f AV775729 69 581




Lotus japonicus
Kazusa DNA Research Institute