KMC017641A_c01
Fasta Sequence
>KMC017641A_C01 KMC017641A_c01
ctttggtagattggtaaattggggtcagcttagtggtttcatcagtacgaaaccaaggga
AGGAGTGTGACTGTGTGAGTGACACAGAGAGATTGAATTCATCCATGGCTTGGCAACGGC
ATCCACCCATACCACCAACAGTGGCCTCCCGCAGCGGCCGTTCCTCCTCCAACCCCACCG
GCTGCCGCTCCTCCCACCCTCTCTCCCGACGAGGTTCGAACGATATTCATCACTGGCCTC
CCCGACGACGTGAAAGAGAGAGAGCTACAGAACCTGCTTCGATGGTTGCCTGGCTTCGAA
GCTTCTCAGCTCAATTTCAAAGCCGATAAACCCATGGGTTTTGCTCTCTTCTCCAGTCCT
CACCAAGCAATCGCCGCCAAAGATATTCTTCAGGACATGCTCTTCGATCCTGAAGCCAAG
TCCGTCCTCCACACTGAAATGGCCAAGAAAAATCTCTTCATCAAAAGAGGAATAGGGGCT
GATGCAGCTGCTTTTGATCAGAGTAAACGGTTAAGGACTGCTGGGGATTATACACACACT
GGTTATGTAACCCCATCTCCTTATCATCCTCCCCCGCCTC


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC017641A_C01 KMC017641A_c01
         (580 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

emb|CAD33925.1| proline rich protein 3 [Cicer arietinum]              244  4e-64
gb|AAK32800.1|AF361632_1 At3g21211 [Arabidopsis thaliana] gi|235...   226  2e-58
dbj|BAB01713.1| gene_id:MXL8.7~unknown protein [Arabidopsis thal...   226  2e-58
ref|NP_683582.1| similar to RRM-containing protein; protein id: ...   211  4e-54
gb|AAO37215.1| hypothetical protein [Arabidopsis thaliana]             89  4e-17

>emb|CAD33925.1| proline rich protein 3 [Cicer arietinum]
          Length = 284

 Score =  244 bits (624), Expect = 4e-64
 Identities = 115/124 (92%), Positives = 123/124 (98%)
 Frame = +1

Query: 208 DEVRTIFITGLPDDVKERELQNLLRWLPGFEASQLNFKADKPMGFALFSSPHQAIAAKDI 387
           +EVRTIFITGLP+DVKERE+QNLLRWLPGFEASQLNFKA+KPMGFALFSSPHQAIAAKDI
Sbjct: 2   EEVRTIFITGLPEDVKEREIQNLLRWLPGFEASQLNFKAEKPMGFALFSSPHQAIAAKDI 61

Query: 388 LQDMLFDPEAKSVLHTEMAKKNLFIKRGIGADAAAFDQSKRLRTAGDYTHTGYVTPSPYH 567
           LQDMLFDP++KSVLHTEMAKKNLF+KRGIGADA AFDQSKRLRTAGDYTHTGYVTPSP+H
Sbjct: 62  LQDMLFDPDSKSVLHTEMAKKNLFVKRGIGADAVAFDQSKRLRTAGDYTHTGYVTPSPFH 121

Query: 568 PPPP 579
           PPPP
Sbjct: 122 PPPP 125

 Score = 37.0 bits (84), Expect = 0.18
 Identities = 33/118 (27%), Positives = 48/118 (39%), Gaps = 8/118 (6%)
 Frame = +1

Query: 148 PAAAVPPPTPPAAAPP--------TLSPDEVRTIFITGLPDDVKERELQNLLRWLPGFEA 303
           P A VP P P + A P        T       T+FI  L +++ E E++ L    PGF+ 
Sbjct: 149 PVAPVPMPAPVSIAAPSSYVPVQNTKDNPPCNTLFIGNLGENINEEEVRGLFSVQPGFKQ 208

Query: 304 SQLNFKADKPMGFALFSSPHQAIAAKDILQDMLFDPEAKSVLHTEMAKKNLFIKRGIG 477
            ++  +    + F  F   + A      LQ  +  P + SV       KN F KR  G
Sbjct: 209 MKILRQERHTVCFIEFEDVNSATNVHHNLQGAVI-PSSGSVGMRIQYSKNPFGKRKDG 265

>gb|AAK32800.1|AF361632_1 At3g21211 [Arabidopsis thaliana] gi|23505943|gb|AAN28831.1|
           At3g21211/At3g21211 [Arabidopsis thaliana]
           gi|26451397|dbj|BAC42798.1| unknown protein [Arabidopsis
           thaliana]
          Length = 339

 Score =  226 bits (575), Expect = 2e-58
 Identities = 113/177 (63%), Positives = 130/177 (72%), Gaps = 21/177 (11%)
 Frame = +1

Query: 112 GNGIHPYHQQWPPAAAVPPPTPPAAAPPTLSP---------------------DEVRTIF 228
           G GIHPYHQQWPPA A PPP   ++A P   P                     DE+RTIF
Sbjct: 3   GAGIHPYHQQWPPAGAPPPPAAVSSAAPPHPPPIHHHPPPPPVLVDNHNRPPYDELRTIF 62

Query: 229 ITGLPDDVKERELQNLLRWLPGFEASQLNFKADKPMGFALFSSPHQAIAAKDILQDMLFD 408
           I GLPDDVKEREL NLLRWLPG+EASQ+NFK +KPMGFALFS+   A+AAKD LQ M+FD
Sbjct: 63  IAGLPDDVKERELLNLLRWLPGYEASQVNFKGEKPMGFALFSTAQFAMAAKDTLQHMVFD 122

Query: 409 PEAKSVLHTEMAKKNLFIKRGIGADAAAFDQSKRLRTAGDYTHTGYVTPSPYHPPPP 579
            E+KSV+HTEMAKKNLF+KRGI  D+ A+DQSKRLRT GD TH+ Y +PSP+HPPPP
Sbjct: 123 AESKSVIHTEMAKKNLFVKRGIVGDSNAYDQSKRLRTGGDCTHSVY-SPSPFHPPPP 178

 Score = 41.6 bits (96), Expect = 0.007
 Identities = 38/144 (26%), Positives = 59/144 (40%), Gaps = 7/144 (4%)
 Frame = +1

Query: 127 PYHQQWPPAAAVPPPTPPAAAPPTLSPDE-------VRTIFITGLPDDVKERELQNLLRW 285
           PY     P   +P P PP AAP +  P +         T+FI  L +++ E EL++LL  
Sbjct: 197 PYAGYHAPPVPMPTP-PPIAAPSSYVPVQNIKDNPPCNTLFIGNLGENINEEELRSLLSA 255

Query: 286 LPGFEASQLNFKADKPMGFALFSSPHQAIAAKDILQDMLFDPEAKSVLHTEMAKKNLFIK 465
            PGF+  ++  +    + F  F   + A      LQ  +  P + S+       KN + K
Sbjct: 256 QPGFKQMKILRQERHTVCFIEFEDVNSATNVHHNLQGAVI-PSSGSIGMRIQYSKNPYGK 314

Query: 466 RGIGADAAAFDQSKRLRTAGDYTH 537
           R  G   + F         G  T+
Sbjct: 315 RKEGGGYSFFPSPSANGAQGALTY 338

>dbj|BAB01713.1| gene_id:MXL8.7~unknown protein [Arabidopsis thaliana]
          Length = 317

 Score =  226 bits (575), Expect = 2e-58
 Identities = 113/177 (63%), Positives = 130/177 (72%), Gaps = 21/177 (11%)
 Frame = +1

Query: 112 GNGIHPYHQQWPPAAAVPPPTPPAAAPPTLSP---------------------DEVRTIF 228
           G GIHPYHQQWPPA A PPP   ++A P   P                     DE+RTIF
Sbjct: 3   GAGIHPYHQQWPPAGAPPPPAAVSSAAPPHPPPIHHHPPPPPVLVDNHNRPPYDELRTIF 62

Query: 229 ITGLPDDVKERELQNLLRWLPGFEASQLNFKADKPMGFALFSSPHQAIAAKDILQDMLFD 408
           I GLPDDVKEREL NLLRWLPG+EASQ+NFK +KPMGFALFS+   A+AAKD LQ M+FD
Sbjct: 63  IAGLPDDVKERELLNLLRWLPGYEASQVNFKGEKPMGFALFSTAQFAMAAKDTLQHMVFD 122

Query: 409 PEAKSVLHTEMAKKNLFIKRGIGADAAAFDQSKRLRTAGDYTHTGYVTPSPYHPPPP 579
            E+KSV+HTEMAKKNLF+KRGI  D+ A+DQSKRLRT GD TH+ Y +PSP+HPPPP
Sbjct: 123 AESKSVIHTEMAKKNLFVKRGIVGDSNAYDQSKRLRTGGDCTHSVY-SPSPFHPPPP 178

 Score = 38.1 bits (87), Expect = 0.080
 Identities = 23/69 (33%), Positives = 35/69 (50%), Gaps = 7/69 (10%)
 Frame = +1

Query: 127 PYHQQWPPAAAVPPPTPPAAAPPTLSPDE-------VRTIFITGLPDDVKERELQNLLRW 285
           PY     P   +P P PP AAP +  P +         T+FI  L +++ E EL++LL  
Sbjct: 197 PYAGYHAPPVPMPTP-PPIAAPSSYVPVQNIKDNPPCNTLFIGNLGENINEEELRSLLSA 255

Query: 286 LPGFEASQL 312
            PGF+  ++
Sbjct: 256 QPGFKQMKI 264

>ref|NP_683582.1| similar to RRM-containing protein; protein id: At3g21215.1
           [Arabidopsis thaliana]
          Length = 285

 Score =  211 bits (538), Expect = 4e-54
 Identities = 112/207 (54%), Positives = 129/207 (62%), Gaps = 51/207 (24%)
 Frame = +1

Query: 112 GNGIHPYHQQWPPAAAVPPPTPPAAAPPTLSPD--------------------------- 210
           G GIHPYHQQWPPA A PPP   ++A P   P                            
Sbjct: 3   GAGIHPYHQQWPPAGAPPPPAAVSSAAPPHPPPIHHHPPPPPVLVDNHNRPPYDEVQLFL 62

Query: 211 ------------------------EVRTIFITGLPDDVKERELQNLLRWLPGFEASQLNF 318
                                   E+RTIFI GLPDDVKEREL NLLRWLPG+EASQ+NF
Sbjct: 63  FLSIHIGDSCAVLSRWACLIFVYVELRTIFIAGLPDDVKERELLNLLRWLPGYEASQVNF 122

Query: 319 KADKPMGFALFSSPHQAIAAKDILQDMLFDPEAKSVLHTEMAKKNLFIKRGIGADAAAFD 498
           K +KPMGFALFS+   A+AAKD LQ M+FD E+KSV+HTEMAKKNLF+KRGI  D+ A+D
Sbjct: 123 KGEKPMGFALFSTAQFAMAAKDTLQHMVFDAESKSVIHTEMAKKNLFVKRGIVGDSNAYD 182

Query: 499 QSKRLRTAGDYTHTGYVTPSPYHPPPP 579
           QSKRLRT GD TH+ Y +PSP+HPPPP
Sbjct: 183 QSKRLRTGGDCTHSVY-SPSPFHPPPP 208

 Score = 31.6 bits (70), Expect = 7.5
 Identities = 20/51 (39%), Positives = 28/51 (54%), Gaps = 8/51 (15%)
 Frame = +1

Query: 151 AAAVPPPTPP-AAAPPTLSPDE-------VRTIFITGLPDDVKERELQNLL 279
           A  VP PTPP  AAP +  P +         T+FI  L +++ E EL++LL
Sbjct: 233 APPVPMPTPPPIAAPSSYVPVQNIKDNPPCNTLFIGNLGENINEEELRSLL 283

>gb|AAO37215.1| hypothetical protein [Arabidopsis thaliana]
          Length = 277

 Score = 89.0 bits (219), Expect = 4e-17
 Identities = 55/144 (38%), Positives = 74/144 (51%), Gaps = 15/144 (10%)
 Frame = +1

Query: 160 VPPPTPPAAAPPTLSP--------------DEVRTIFITGLPDDVKERELQNLLRWLPGF 297
           VPPP P  +  P  S               DEVRT+F+ GLP+DVK RE+ NL R  PG+
Sbjct: 2   VPPPPPGVSPIPITSAHSVYLPTHVSIGARDEVRTLFVAGLPEDVKPREIYNLFREFPGY 61

Query: 298 EASQL-NFKADKPMGFALFSSPHQAIAAKDILQDMLFDPEAKSVLHTEMAKKNLFIKRGI 474
           E S L +    KP  FA+FS    A+A    L  M+FD E  S LH ++AK N   KR  
Sbjct: 62  ETSHLRSSDGAKPFAFAVFSDLQSAVAVMHALNGMVFDLEKHSTLHIDLAKSNPKSKRSR 121

Query: 475 GADAAAFDQSKRLRTAGDYTHTGY 546
             D   ++  K+L++    T +G+
Sbjct: 122 TDD--GWESLKKLKSWNTTTESGF 143

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 581,827,914
Number of Sequences: 1393205
Number of extensions: 14663436
Number of successful extensions: 142383
Number of sequences better than 10.0: 429
Number of HSP's better than 10.0 without gapping: 76688
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 126889
length of database: 448,689,247
effective HSP length: 117
effective length of database: 285,684,262
effective search space used: 21426319650
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
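
The statistics above can be cross-checked against the hit list. The following is a minimal sketch, not part of the BLAST output. It assumes the standard Karlin-Altschul relations for gapped alignments: bit score S' = (Lambda*S - ln K) / ln 2 and expectation E = K * N * exp(-Lambda*S), where S is the raw score in parentheses and N is the effective search space. The function names are hypothetical, and the inputs are the rounded values printed in this report, so the results agree with the reported figures only approximately.

```python
import math

# Values copied from this report (rounded as printed).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 117
EFFECTIVE_SEARCH_SPACE = 21_426_319_650


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space=EFFECTIVE_SEARCH_SPACE,
                 lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Expected number of chance alignments scoring at least raw_score."""
    return k * search_space * math.exp(-lam * raw_score)


def effective_db_length(letters=DB_LETTERS, sequences=DB_SEQUENCES,
                        hsp_length=EFFECTIVE_HSP_LENGTH):
    """Database length after subtracting the HSP length once per sequence."""
    return letters - sequences * hsp_length


if __name__ == "__main__":
    # Raw scores of the best HSP for each of the five listed hits.
    for raw in (624, 575, 538, 219):
        print(raw, round(bit_score(raw), 1), f"{expect_value(raw):.1e}")
    print("effective database length:", effective_db_length())
```

Under these assumptions the effective database length works out to the 285,684,262 printed above, and the expectation computed for the raw score 624 falls close to the reported 4e-64.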


EST assemble image


no.  clone         accession  position (start end)
1    SPDL077a07_f  BP056751   1 432
2    SPD088c12_f   BP051023   61 580
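
The two position pairs describe how the ESTs tile the contig. The sketch below is an illustration only. It assumes the positions are 1-based, inclusive coordinates on the 580-letter contig (an interpretation of the table, not something this record states) and computes the per-base depth, the overlap between the two clones, and any uncovered bases. The function name `depth_per_base` is hypothetical.

```python
# (clone, accession, start, end) copied from the table above.
# Assumption: coordinates are 1-based and inclusive on the contig.
ESTS = [
    ("SPDL077a07_f", "BP056751", 1, 432),
    ("SPD088c12_f", "BP051023", 61, 580),
]
CONTIG_LENGTH = 580  # from "(580 letters)" in the BLASTX report


def depth_per_base(ests, length):
    """Return a list with the number of ESTs covering each contig base."""
    depth = [0] * length
    for _clone, _accession, start, end in ests:
        for pos in range(start - 1, end):  # convert to 0-based indices
            depth[pos] += 1
    return depth


depth = depth_per_base(ESTS, CONTIG_LENGTH)
uncovered = depth.count(0)                    # bases covered by no EST
single = sum(1 for d in depth if d == 1)      # bases covered by one EST
overlap = sum(1 for d in depth if d == 2)     # bases covered by both ESTs

if __name__ == "__main__":
    print("uncovered:", uncovered)
    print("single coverage:", single)
    print("overlap:", overlap)
```

With these coordinates the two clones overlap across positions 61 to 432, and every base of the contig is covered by at least one EST.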




Lotus japonicus
Kazusa DNA Research Institute