KMC012177A_c01

Fasta Sequence
>KMC012177A_C01 KMC012177A_c01
AGCAAATAGCAATCACATTATATAATATAAGATAGCAATCAGCGTCACCTTAGATATTCT
CTAAAGAATGATTACATTCTCAAGTACAGGGTTATTCAGCTTTTACATTTCTCTAACGTA
GAACGATTGAAATCTGATAATGCATTGTTGCAGCACTCTAGATGATTCTGGTACTCAATT
CTTTAGACAGGTAGCTCATGGAGCAGAATTCTCATCTTTCAGCAAATCATTAGCCTTATT
TAACCAAACTTCATAAATATCACGCAAGAACAGAACAAAGTGGACCTCTTTGAAGTCATT
TTGAAACTCCTTGATGGTAGAAATTGCTACTGTAGCAGCCTCGTCATAAGGATATCCATA
GACACCACATGATATGGCAGGGAACGCTATATACTGAATGTTTTTCTCCTTTGCAACCCT
CAAACTATTCCTGTAAGCACTAGCCAGAGAGGTAGCAGGGTCACTATTAGAATGGTAGAT
TGGTCCGACAGTATGAACGACATGAGAAACAGGCAATCTAAAACCCGGTCGTGATCCTCG
CTTCCCCTACGGGGCAGCGAACGCCACGCCTCACTTCCGGAACACTGTAGCATGCCTGAA
GAAGTTCTGGACCAGCAGCTCTATGTATAGCTCCGTCGGCGCCGCCACCTCCAAGCATTC
TCTCATTTGCAGGATTTACTATGGCGTCGGAGGAAGAATCGATGGACCATTGGCTGATGT
CACCTTCCTGAATGATCAATGCGGTGGTCGGAGACAGAGGGAAGCGAACTGCACCGTTGG
ATGAAGCGGaagccctcgccggagcgtccattcactgaccacgacggtggtgtgtggtga
gaaggggtagtgaaatttggggggcccgt


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC012177A_C01 KMC012177A_c01
         (869 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAN41297.1| unknown protein [Arabidopsis thaliana]                 160  3e-38
gb|AAK93649.2| unknown protein [Arabidopsis thaliana]                 160  3e-38
ref|NP_030605.1| expressed protein; protein id: At2g40600.1, sup...   160  3e-38
ref|NP_799613.1| hypothetical protein [Vibrio parahaemolyticus R...    84  1e-33
ref|NP_518455.1| CONSERVED HYPOTHETICAL PROTEIN [Ralstonia solan...    89  3e-31

>gb|AAN41297.1| unknown protein [Arabidopsis thaliana]
          Length = 257

 Score =  160 bits (404), Expect = 3e-38
 Identities = 73/112 (65%), Positives = 90/112 (80%)
 Frame = -3

Query: 552 P*GKRGSRPGFRLPVSHVVHTVGPIYHSNSDPATSLASAYRNSLRVAKEKNIQYIAFPAI 373
           P G+    PGF LP S V+HTVGPIY S+ +P  SL ++Y+NSLRVAKE NI+YIAFPAI
Sbjct: 144 PTGEARITPGFNLPASRVIHTVGPIYDSDVNPQESLTNSYKNSLRVAKENNIKYIAFPAI 203

Query: 372 SCGVYGYPYDEAATVAISTIKEFQNDFKEVHFVLFLRDIYEVWLNKANDLLK 217
           SCG+YGYP+DEAA + ISTIK+F  DFKEVHFVLF  DI+ VW+NKA ++L+
Sbjct: 204 SCGIYGYPFDEAAAIGISTIKQFSTDFKEVHFVLFADDIFSVWVNKAKEVLQ 255

 Score =  132 bits (333), Expect = 5e-30
 Identities = 65/89 (73%), Positives = 73/89 (81%)
 Frame = -2

Query: 787 ASSNGAVRFPLSPTTALIIQEGDISQWSIDSSSDAIVNPANERMLGGGGADGAIHRAAGP 608
           AS +    F LS ++ L I +GDI++WS+DSSSDAIVNPANERMLGGGGADGAIHRAAGP
Sbjct: 66  ASGDEGAVFNLSDSSLLKILKGDITKWSVDSSSDAIVNPANERMLGGGGADGAIHRAAGP 125

Query: 607 ELLQACYSVPEVRRGVRCPVGEARITTGF 521
           +L  ACY VPEVR GVRCP GEARIT GF
Sbjct: 126 QLRAACYEVPEVRPGVRCPTGEARITPGF 154

>gb|AAK93649.2| unknown protein [Arabidopsis thaliana]
          Length = 239

 Score =  160 bits (404), Expect = 3e-38
 Identities = 73/112 (65%), Positives = 90/112 (80%)
 Frame = -3

Query: 552 P*GKRGSRPGFRLPVSHVVHTVGPIYHSNSDPATSLASAYRNSLRVAKEKNIQYIAFPAI 373
           P G+    PGF LP S V+HTVGPIY S+ +P  SL ++Y+NSLRVAKE NI+YIAFPAI
Sbjct: 126 PTGEARITPGFNLPASRVIHTVGPIYDSDVNPQESLTNSYKNSLRVAKENNIKYIAFPAI 185

Query: 372 SCGVYGYPYDEAATVAISTIKEFQNDFKEVHFVLFLRDIYEVWLNKANDLLK 217
           SCG+YGYP+DEAA + ISTIK+F  DFKEVHFVLF  DI+ VW+NKA ++L+
Sbjct: 186 SCGIYGYPFDEAAAIGISTIKQFSTDFKEVHFVLFADDIFSVWVNKAKEVLQ 237

 Score =  129 bits (325), Expect = 4e-29
 Identities = 64/89 (71%), Positives = 72/89 (79%)
 Frame = -2

Query: 787 ASSNGAVRFPLSPTTALIIQEGDISQWSIDSSSDAIVNPANERMLGGGGADGAIHRAAGP 608
           AS +    F LS ++ L I +GDI++WS+DSSSDAIVNPANERMLGGGGADGAIHRAAGP
Sbjct: 48  ASGDEGAVFNLSDSSLLKILKGDITKWSVDSSSDAIVNPANERMLGGGGADGAIHRAAGP 107

Query: 607 ELLQACYSVPEVRRGVRCPVGEARITTGF 521
           +L  ACY VPEVR  VRCP GEARIT GF
Sbjct: 108 QLRAACYEVPEVRPRVRCPTGEARITPGF 136

>ref|NP_030605.1| expressed protein; protein id: At2g40600.1, supported by cDNA:
           gi_15293076 [Arabidopsis thaliana]
           gi|20196872|gb|AAB87596.2| expressed protein
           [Arabidopsis thaliana]
          Length = 193

 Score =  160 bits (404), Expect = 3e-38
 Identities = 73/112 (65%), Positives = 90/112 (80%)
 Frame = -3

Query: 552 P*GKRGSRPGFRLPVSHVVHTVGPIYHSNSDPATSLASAYRNSLRVAKEKNIQYIAFPAI 373
           P G+    PGF LP S V+HTVGPIY S+ +P  SL ++Y+NSLRVAKE NI+YIAFPAI
Sbjct: 80  PTGEARITPGFNLPASRVIHTVGPIYDSDVNPQESLTNSYKNSLRVAKENNIKYIAFPAI 139

Query: 372 SCGVYGYPYDEAATVAISTIKEFQNDFKEVHFVLFLRDIYEVWLNKANDLLK 217
           SCG+YGYP+DEAA + ISTIK+F  DFKEVHFVLF  DI+ VW+NKA ++L+
Sbjct: 140 SCGIYGYPFDEAAAIGISTIKQFSTDFKEVHFVLFADDIFSVWVNKAKEVLQ 191

 Score =  132 bits (333), Expect = 5e-30
 Identities = 65/89 (73%), Positives = 73/89 (81%)
 Frame = -2

Query: 787 ASSNGAVRFPLSPTTALIIQEGDISQWSIDSSSDAIVNPANERMLGGGGADGAIHRAAGP 608
           AS +    F LS ++ L I +GDI++WS+DSSSDAIVNPANERMLGGGGADGAIHRAAGP
Sbjct: 2   ASGDEGAVFNLSDSSLLKILKGDITKWSVDSSSDAIVNPANERMLGGGGADGAIHRAAGP 61

Query: 607 ELLQACYSVPEVRRGVRCPVGEARITTGF 521
           +L  ACY VPEVR GVRCP GEARIT GF
Sbjct: 62  QLRAACYEVPEVRPGVRCPTGEARITPGF 90

>ref|NP_799613.1| hypothetical protein [Vibrio parahaemolyticus RIMD 2210633]
           gi|28808241|dbj|BAC61446.1| hypothetical protein [Vibrio
           parahaemolyticus]
          Length = 170

 Score = 84.0 bits (206), Expect(2) = 1e-33
 Identities = 43/71 (60%), Positives = 53/71 (74%)
 Frame = -2

Query: 742 ALIIQEGDISQWSIDSSSDAIVNPANERMLGGGGADGAIHRAAGPELLQACYSVPEVRRG 563
           A+ + +GDI+   +D    AIVN AN RMLGGGG DGAIHRAAGP L+ ACY+V +V  G
Sbjct: 3   AISLVQGDITTAHVD----AIVNAANPRMLGGGGVDGAIHRAAGPALINACYAVDDV-DG 57

Query: 562 VRCPVGEARIT 530
           +RCP G+ARIT
Sbjct: 58  IRCPFGDARIT 68

 Score = 82.0 bits (201), Expect(2) = 1e-33
 Identities = 40/91 (43%), Positives = 56/91 (60%)
 Frame = -3

Query: 516 LPVSHVVHTVGPIYHSNSDPATSLASAYRNSLRVAKEKNIQYIAFPAISCGVYGYPYDEA 337
           L   +V+H VGPIY   +DP T L SAY+ SL +A   + Q +A PAISCGVYGYP  EA
Sbjct: 73  LNARYVIHAVGPIYDKFADPKTVLESAYQRSLDLALANHCQSVALPAISCGVYGYPPQEA 132

Query: 336 ATVAISTIKEFQNDFKEVHFVLFLRDIYEVW 244
           A VA++  +  +    ++ F LF  ++  +W
Sbjct: 133 AEVAMAVCQRPEYAALDMRFYLFSEEMLSIW 163

>ref|NP_518455.1| CONSERVED HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
           gi|20178146|sp|Q8Y2K1|Y334_RALSO Hypothetical protein
           RSc0334 gi|17427343|emb|CAD13862.1| CONSERVED
           HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
          Length = 171

 Score = 89.4 bits (220), Expect(2) = 3e-31
 Identities = 47/109 (43%), Positives = 67/109 (61%), Gaps = 4/109 (3%)
 Frame = -3

Query: 546 GKRGSRPGFRLPVSHVVHTVGPIYHSN-SDPATSLASAYRNSLRVAKEKNIQYIAFPAIS 370
           G+    PGF LP  +++HTVGPI+     D A  LA+ YRNSL +AK+ +++ IAFP IS
Sbjct: 62  GQAKITPGFLLPARYIIHTVGPIWRGGRQDEAALLAACYRNSLALAKQHDVRTIAFPCIS 121

Query: 369 CGVYGYPYDEAATVAISTIKEFQNDFKEVHFVLFLR---DIYEVWLNKA 232
            GVYG+P   AA +A+ T++E   D  ++ F  F      +YE  LN+A
Sbjct: 122 TGVYGFPPQLAAPIAVRTVREHGADLDDIVFCCFSAADLALYETALNEA 170

 Score = 68.9 bits (167), Expect(2) = 3e-31
 Identities = 39/77 (50%), Positives = 47/77 (60%)
 Frame = -2

Query: 751 PTTALIIQEGDISQWSIDSSSDAIVNPANERMLGGGGADGAIHRAAGPELLQACYSVPEV 572
           PT  L     DI+  + D    AIVN AN  +LGGGG DGAIHRAAGPELL+AC ++   
Sbjct: 4   PTVTLRALRADITTLACD----AIVNAANSALLGGGGVDGAIHRAAGPELLEACRALH-- 57

Query: 571 RRGVRCPVGEARITTGF 521
                C  G+A+IT GF
Sbjct: 58  ----GCRTGQAKITPGF 70
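
The "Frame" values reported for the alignments above can be cross-checked against the Query coordinates. The sketch below assumes the usual BLASTX convention for minus-strand hits, where the frame follows from the distance between the end of the query and the first (larger) Query coordinate of the HSP. The function name and the table of HSP starts are illustrative, with coordinates and frames copied from the report.

```python
QUERY_LENGTH = 869  # "(869 letters)" in the report header

def minus_strand_frame(first_query_coord, query_length=QUERY_LENGTH):
    """Return the reading frame (-1, -2 or -3) of a minus-strand HSP.

    first_query_coord is the coordinate printed on the first "Query:" line
    of the HSP, i.e. the larger of its two query coordinates.
    Assumption: frame = -((query_length - first_query_coord) % 3 + 1).
    """
    return -((query_length - first_query_coord) % 3 + 1)

# First Query coordinate of each HSP -> frame printed in the report.
REPORTED = {552: -3, 787: -2, 742: -2, 516: -3, 546: -3, 751: -2}

for start, frame in REPORTED.items():
    print(start, minus_strand_frame(start), frame)
```

Under that assumption, every HSP listed here yields the frame printed in the report, which suggests that the coordinates and frames are mutually consistent.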

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 751,889,874
Number of Sequences: 1393205
Number of extensions: 16866118
Number of successful extensions: 47393
Number of sequences better than 10.0: 220
Number of HSP's better than 10.0 without gapping: 44472
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 47187
length of database: 448,689,247
effective HSP length: 122
effective length of database: 278,718,237
effective search space used: 46545945579
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
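
The bit scores and single-HSP Expect values in the report can be reproduced from the gapped statistics listed above. This is a minimal sketch assuming the standard Karlin-Altschul relations, bit score S' = (Lambda * S - ln K) / ln 2 and E = (search space) * 2^(-S'), with Lambda, K and the effective search space copied from the report. The function names are illustrative. The "Expect(2)" values for the bacterial hits combine two HSPs and are not covered by this formula.

```python
import math

# Gapped statistics and search space as printed in the report.
LAMBDA = 0.267
K = 0.0410
SEARCH_SPACE = 46545945579  # "effective search space used"

def bit_score(raw_score, lam=LAMBDA, k=K):
    """Convert a raw alignment score (the number in parentheses) to bits."""
    return (lam * raw_score - math.log(k)) / math.log(2)

def expect(raw_score, space=SEARCH_SPACE):
    """Expected number of chance HSPs scoring at least raw_score."""
    return space * 2.0 ** (-bit_score(raw_score))

# Raw scores of the three Arabidopsis HSP types in the report.
for raw in (404, 333, 325):
    print(raw, f"{bit_score(raw):.2f}", f"{expect(raw):.0e}")
```

With these inputs the computed bit scores fall just above the reported 160, 132 and 129, which is consistent with the report truncating scores above 100 to integers, and the Expect values round to the reported 3e-38, 5e-30 and 4e-29. Small differences are expected because Lambda and K are printed to only three significant digits.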


EST assemble image


No.  Clone        Accession  Start  End
1    MF096d05_f   BP033300       1  551
2    MFB075f01_f  BP039477       1  483
3    MPD063d07_f  AV774199     379  869
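
The assembly table can be summarised as per-base EST coverage of the contig. The sketch below assumes that the two position values are 1-based, inclusive start and end coordinates on the 869-letter contig, which fits the last clone ending at 869. The variable and function names are illustrative.

```python
from collections import Counter

CONTIG_LENGTH = 869  # length of KMC012177A_c01 from the BLASTX header

# (clone, accession, start, end) copied from the table above.
ESTS = [
    ("MF096d05_f", "BP033300", 1, 551),
    ("MFB075f01_f", "BP039477", 1, 483),
    ("MPD063d07_f", "AV774199", 379, 869),
]

def depth_profile(ests, length):
    """Return a list whose item i is the number of ESTs covering position i + 1."""
    depth = [0] * length
    for _clone, _accession, start, end in ests:
        for pos in range(start, end + 1):  # inclusive, 1-based coordinates
            depth[pos - 1] += 1
    return depth

depth = depth_profile(ESTS, CONTIG_LENGTH)
print("min depth:", min(depth), "max depth:", max(depth))
print("positions per depth:", dict(Counter(depth)))
```

Under these assumptions every position is covered by at least one EST, and the region from 552 to 869 is supported by the single clone MPD063d07_f.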




Lotus japonicus
Kazusa DNA Research Institute