KMC017690A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC017690A_C01 KMC017690A_c01
atcgaaagttgatttcttaagcgccgccgttttctccgccctgcgcgtctgtccgccgcc
gtcatttgtccttcctctagGTCGTCTCTGTCTCTTGTCTCACATCTTCACGTCTTTCTC
CTCCTTGTTCTCGCGTGACCACCGTCGTTTCTTCTTCCTCCAACTGCGTCCACCATCATT
TTCTCCTTCCTCCTGGCACTAACCTTTGAAGAAGAAAGGTCACAATCATGGTCCAATGGC
AGAGACATATTTTCCCAATTCTTCGTCACATTCATAAGGGAGTGGACCATGTCTATCATT
CAGCTCCAAAGCTTTCAATTTCTCACTTAAGCTCTTCTTTTCCACAAGGTCAGTTTCAGG
GAGCATGGACAACAACTTCGCCCAGTATCTCGAGGCCTATGTATCACTGTTTCCAGCATC
AGGGAATTTCAAGTTCTACTTGGTTGCTTGCAAATTCACCTGAGGAAACACCTGTTTCGT
CACCACTAGCCCCACTTTCATTATTGGGTAGTTCAAAAGGTGAAGACCAGAATCAGAAAG
CTGTTTCTAAGCCAGAAAATGTTCAAGCTGTATTAAAGGGGATAAAGCAGAGTCCTAAGA
AGGTCAATTTGGTTGCTGCTTTGGTTCGTGGTATGCTCGTTAAAGACGCATTGTTGCAGT
TGCAATTGACAGTAAAGCGAGCTGCAAAAACTGTATATCAGGTTATTCATTCAGCCAGAG
CAAATGCCTCTCATAATCATGGGTTGGATTCAGACCGTCTCATTATTGCTGAAGCATATG
TCGGAAAGGGATTTTATAAAAAGAGAATATCCATTCACGGGAAATGTAGACATGGGATCA
TGCACAGACCAGAGTGCAGGCTAACTGTTGTAGTAAGAGAGATAACCCCTGAAGAAGAGG
CAAAGATTGCAAGGCTGAAGGTCCACAACTTTAAGAAGCTCTCTAAGAGGGAGAGACGAC
TTGTGCCGCATAAGCTTATTGAGACCACTCCTGTTTGGGGCCGCAAAAACAAGCCTACCA
GTCAAAACTCGGGTGCTGCAGTTGCATGATTTGGTTCATGGAACAAATTTTCAGCATTAT
GCCCCTTTTACAAGAACTGGTAATAAGTGACGAAATTTTTTttctatttaaaaatatgaa
taactaatatacagtggcatctgttatcattttaagtgattgagtacgaacttttttgag
g


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC017690A_C01 KMC017690A_c01
         (1201 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_567805.1| Expressed protein; protein id: At4g28360.1, sup...   291  2e-77
ref|NP_564605.2| chloroplast 50S ribosomal protein L22, putative...   283  5e-75
pir||T04605 hypothetical protein F20O9.30 - Arabidopsis thaliana      263  5e-69
pir||H96563 hypothetical protein F19K6.5 [imported] - Arabidopsi...   257  2e-67
dbj|BAB39116.1| P0686E09.8 [Oryza sativa (japonica cultivar-group)]   233  3e-60

>ref|NP_567805.1| Expressed protein; protein id: At4g28360.1, supported by cDNA: 18360.
            [Arabidopsis thaliana] gi|21553938|gb|AAM63019.1| unknown
            [Arabidopsis thaliana]
          Length = 271

 Score =  291 bits (744), Expect = 2e-77
 Identities = 159/275 (57%), Positives = 202/275 (72%), Gaps = 3/275 (1%)
 Frame = +3

Query: 228  MVQWQRHIFPILRHIHKGVDHVYHSAPKLSIS--HLSSSFPQGQFQGAWTTTSPSIS-RP 398
            MV W+R++  ++R + + V + + S    S S  +L S F QG  Q   +   PS S RP
Sbjct: 1    MVGWKRNLQTVIRQVGRRVKNSHISTANYSSSTRNLESPFSQGYLQ---SLLRPSYSSRP 57

Query: 399  MYHCFQHQGISSSTWLLANSPEETPVSSPLAPLSLLGSSKGEDQNQKAVSKPENVQAVLK 578
            +YH  Q  GIS+S  L A+   E PVSSPL+  +LLGS  G+++ QK + K + VQAVLK
Sbjct: 58   LYHHLQQLGISTSRQLQAS---EEPVSSPLSSPALLGS--GKEEEQKIIPKRQKVQAVLK 112

Query: 579  GIKQSPKKVNLVAALVRGMLVKDALLQLQLTVKRAAKTVYQVIHSARANASHNHGLDSDR 758
             IKQSPKKVNLVAALVRGM V+DAL+QLQ+TVKRAA+TVY+VIH+ARANA+HNHGLD DR
Sbjct: 113  SIKQSPKKVNLVAALVRGMRVEDALIQLQVTVKRAAQTVYRVIHAARANATHNHGLDPDR 172

Query: 759  LIIAEAYVGKGFYKKRISIHGKCRHGIMHRPECRLTVVVREITPEEEAKIARLKVHNFKK 938
            L++AEA+VGKG + K+++ H K R GI+  P CRLTV+VRE TPEEEA+IARLKVHNFKK
Sbjct: 173  LLVAEAFVGKGLFGKKVAYHAKGRSGIISIPRCRLTVIVRETTPEEEAEIARLKVHNFKK 232

Query: 939  LSKRERRLVPHKLIETTPVWGRKNKPTSQNSGAAV 1043
             SKRER+LVPHKLIET+P+W R+    +  S   V
Sbjct: 233  KSKRERQLVPHKLIETSPIWNRRGTKANHRSSELV 267

>ref|NP_564605.2| chloroplast 50S ribosomal protein L22, putative; protein id:
            At1g52370.1 [Arabidopsis thaliana]
          Length = 269

 Score =  283 bits (723), Expect = 5e-75
 Identities = 154/273 (56%), Positives = 198/273 (72%), Gaps = 1/273 (0%)
 Frame = +3

Query: 228  MVQWQRHIFPILRHIHKGV-DHVYHSAPKLSISHLSSSFPQGQFQGAWTTTSPSISRPMY 404
            M  WQ+++  ++R + K V D    +A   S  +L S F QG  Q    +T  S  RP+Y
Sbjct: 1    MAGWQKNLQIVIRQVGKRVKDSHISTANYSSTRNLESPFSQGYLQSLLRSTYSS--RPLY 58

Query: 405  HCFQHQGISSSTWLLANSPEETPVSSPLAPLSLLGSSKGEDQNQKAVSKPENVQAVLKGI 584
            +  Q  GIS+S  L A    E PVSSPL+  +LLGS  G+++ QK + K + VQAVLK I
Sbjct: 59   YHLQQLGISTSRQLQAG---EEPVSSPLSSPALLGS--GKEEEQKIIPKRQKVQAVLKSI 113

Query: 585  KQSPKKVNLVAALVRGMLVKDALLQLQLTVKRAAKTVYQVIHSARANASHNHGLDSDRLI 764
            KQSPKKVNLVAALVRGM V+DAL+QLQ+TVKRA++TVY+VIH+ARANA+HNHGLD DRL+
Sbjct: 114  KQSPKKVNLVAALVRGMRVEDALMQLQVTVKRASQTVYRVIHAARANATHNHGLDPDRLL 173

Query: 765  IAEAYVGKGFYKKRISIHGKCRHGIMHRPECRLTVVVREITPEEEAKIARLKVHNFKKLS 944
            +AEA+VGKG + K+++ H K R GI+  P CRLTV+VRE T EEEA+IARLKVHNFKKL+
Sbjct: 174  VAEAFVGKGLFGKKVAYHAKGRSGIISIPRCRLTVIVRETTAEEEAEIARLKVHNFKKLN 233

Query: 945  KRERRLVPHKLIETTPVWGRKNKPTSQNSGAAV 1043
            KR+R+LVPHKLIET+P+W R+    +  S   V
Sbjct: 234  KRQRQLVPHKLIETSPIWNRRGTKGNHRSSELV 266

>pir||T04605 hypothetical protein F20O9.30 - Arabidopsis thaliana
          Length = 508

 Score =  263 bits (671), Expect = 5e-69
 Identities = 136/207 (65%), Positives = 167/207 (79%)
 Frame = +3

Query: 423  GISSSTWLLANSPEETPVSSPLAPLSLLGSSKGEDQNQKAVSKPENVQAVLKGIKQSPKK 602
            GIS+S  L A+   E PVSSPL+  +LLGS  G+++ QK + K + VQAVLK IKQSPKK
Sbjct: 217  GISTSRQLQAS---EEPVSSPLSSPALLGS--GKEEEQKIIPKRQKVQAVLKSIKQSPKK 271

Query: 603  VNLVAALVRGMLVKDALLQLQLTVKRAAKTVYQVIHSARANASHNHGLDSDRLIIAEAYV 782
            VNLVAALVRGM V+DAL+QLQ+TVKRAA+TVY+VIH+ARANA+HNHGLD DRL++AEA+V
Sbjct: 272  VNLVAALVRGMRVEDALIQLQVTVKRAAQTVYRVIHAARANATHNHGLDPDRLLVAEAFV 331

Query: 783  GKGFYKKRISIHGKCRHGIMHRPECRLTVVVREITPEEEAKIARLKVHNFKKLSKRERRL 962
            GKG + K+++ H K R GI+  P CRLTV+VRE TPEEEA+IARLKVHNFKK SKRER+L
Sbjct: 332  GKGLFGKKVAYHAKGRSGIISIPRCRLTVIVRETTPEEEAEIARLKVHNFKKKSKRERQL 391

Query: 963  VPHKLIETTPVWGRKNKPTSQNSGAAV 1043
            VPHKLIET+P+W R+    +  S   V
Sbjct: 392  VPHKLIETSPIWNRRGTKANHRSSELV 418

>pir||H96563 hypothetical protein F19K6.5 [imported] - Arabidopsis thaliana
            gi|12323131|gb|AAG51551.1|AC037424_16 chloroplast 50S
            ribosomal protein L22, putative; 25606-24374 [Arabidopsis
            thaliana]
          Length = 226

 Score =  257 bits (657), Expect = 2e-67
 Identities = 133/207 (64%), Positives = 166/207 (79%)
 Frame = +3

Query: 423  GISSSTWLLANSPEETPVSSPLAPLSLLGSSKGEDQNQKAVSKPENVQAVLKGIKQSPKK 602
            GIS+S  L A    E PVSSPL+  +LLGS  G+++ QK + K + VQAVLK IKQSPKK
Sbjct: 22   GISTSRQLQAG---EEPVSSPLSSPALLGS--GKEEEQKIIPKRQKVQAVLKSIKQSPKK 76

Query: 603  VNLVAALVRGMLVKDALLQLQLTVKRAAKTVYQVIHSARANASHNHGLDSDRLIIAEAYV 782
            VNLVAALVRGM V+DAL+QLQ+TVKRA++TVY+VIH+ARANA+HNHGLD DRL++AEA+V
Sbjct: 77   VNLVAALVRGMRVEDALMQLQVTVKRASQTVYRVIHAARANATHNHGLDPDRLLVAEAFV 136

Query: 783  GKGFYKKRISIHGKCRHGIMHRPECRLTVVVREITPEEEAKIARLKVHNFKKLSKRERRL 962
            GKG + K+++ H K R GI+  P CRLTV+VRE T EEEA+IARLKVHNFKKL+KR+R+L
Sbjct: 137  GKGLFGKKVAYHAKGRSGIISIPRCRLTVIVRETTAEEEAEIARLKVHNFKKLNKRQRQL 196

Query: 963  VPHKLIETTPVWGRKNKPTSQNSGAAV 1043
            VPHKLIET+P+W R+    +  S   V
Sbjct: 197  VPHKLIETSPIWNRRGTKGNHRSSELV 223

>dbj|BAB39116.1| P0686E09.8 [Oryza sativa (japonica cultivar-group)]
          Length = 285

 Score =  233 bits (595), Expect = 3e-60
 Identities = 133/241 (55%), Positives = 170/241 (70%), Gaps = 3/241 (1%)
 Frame = +3

Query: 327  LSSSFPQGQFQG---AWTTTSPSISRPMYHCFQHQGISSSTWLLANSPEETPVSSPLAPL 497
            +++ F  GQ+     A    S  ++   +   ++ GIS++  LLA      PVSSPL P 
Sbjct: 43   VNAPFGLGQYANLLRAQAFASRGVALNFHQLIRNAGISTTRNLLAADDAMVPVSSPLTPP 102

Query: 498  SLLGSSKGEDQNQKAVSKPENVQAVLKGIKQSPKKVNLVAALVRGMLVKDALLQLQLTVK 677
              LG  +  D+ + A+ K   VQA+ K IKQSPKKVNLVA LVRGM V+DALLQLQ+TVK
Sbjct: 103  --LGDGEQTDK-KGAIVKRLKVQAIKKDIKQSPKKVNLVAKLVRGMRVEDALLQLQVTVK 159

Query: 678  RAAKTVYQVIHSARANASHNHGLDSDRLIIAEAYVGKGFYKKRISIHGKCRHGIMHRPEC 857
            RAAKTVY   HSARANA+HNHGLD D+LI+ EA+VGKG Y KR+S H K R G+M RP C
Sbjct: 160  RAAKTVY---HSARANAAHNHGLDPDKLIVEEAFVGKGLYLKRLSYHAKGRCGVMVRPRC 216

Query: 858  RLTVVVREITPEEEAKIARLKVHNFKKLSKRERRLVPHKLIETTPVWGRKNKPTSQNSGA 1037
            RLTVVVRE T EEEAKIA+L+V N+KKL+++E++L+PH+LIE +P W RK K   + +GA
Sbjct: 217  RLTVVVREATAEEEAKIAKLRVSNYKKLTRKEKQLMPHRLIEVSPRWARKRK---EEAGA 273

Query: 1038 A 1040
            A
Sbjct: 274  A 274

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,042,184,310
Number of Sequences: 1393205
Number of extensions: 23972457
Number of successful extensions: 69451
Number of sequences better than 10.0: 195
Number of HSP's better than 10.0 without gapping: 63139
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 69004
length of database: 448,689,247
effective HSP length: 125
effective length of database: 274,538,622
effective search space used: 75223582428
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MFB066c08_f BP038774 1 530
2 SPD032c01_f BP046523 211 791
3 SPDL080h10_f BP057016 635 1201




Lotus japonicus
Kazusa DNA Research Institute