KMC019043A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC019043A_C01 KMC019043A_c01
ttcactgcgtgcgagccctaaaagcgatgtccgcttcaccctacctggaacctgacgcgg
aggtggaattttgttccgaaATCGCCGGCCAGCGCTCCTGGTCCATCGGCAAGATCGTCA
GCAGGCCCAGCTTCTCCCCCGACCATGTCTTGGTAGAGTACGACTACGAGCAAGACACCA
ATCCCAAGACACAATCCGTGAGCATTGACAAGGTCCGGCCACGCCCTCCGCCGGAGACCC
ACCATGACTTCAAGATCGGCGACAAGGTGGACGCCTACGACAAGGGCAGCTGGAGGGAGG
GACACCTAGTCAAAGAATTAGAAGATGGCAAGTTCGCTGTGGATTTCAATCTTCCCAAGC
AGTTAAACGAGTTTCCCAAAGAGAATCTCAGGACCCACCGCGAATGGATCGATGATCATT
GGGAGCCACCAATCCAACAACAAAAGCAGGAGCTGTTCAGAATAGGCGACTTGGTTGAGG
TTTCCAGTAAGGTGAAGGGTTATCGAGGAACTTGGTTTCTTGCCGAAGTTGTTGAGCTAA
AGGTACAAGGAAAGTTCCTTGTTGAGCACAAGCATCGGCTGCATGATGTAACTGGAAAGC
TTTTGAAAGAGAAGATTGATGACGATCACATAAGGCCTCTTCCGCCCAA


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC019043A_C01 KMC019043A_c01
         (649 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_172403.1| unknown protein; protein id: At1g09320.1 [Arabi...   134  1e-30
pir||C86226 protein T31J12.4 [imported] - Arabidopsis thaliana g...   100  2e-20
ref|NP_187304.1| hypothetical protein; protein id: At3g06520.1 [...    98  8e-20
ref|NP_182245.1| unknown protein; protein id: At2g47230.1 [Arabi...    83  3e-15
ref|NP_171829.1| unknown protein; protein id: At1g03300.1 [Arabi...    77  2e-13

>ref|NP_172403.1| unknown protein; protein id: At1g09320.1 [Arabidopsis thaliana]
          Length = 491

 Score =  134 bits (336), Expect = 1e-30
 Identities = 84/247 (34%), Positives = 129/247 (52%), Gaps = 45/247 (18%)
 Frame = +3

Query: 42  YLEPDAEVEFCSEIAGQR-SWSIGKIVSRPSFSP-DHVLVEYDY------EQDTNPKTQS 197
           YL+P + VE  S+  G R SW +GK+++ PS S  D V  + +Y      ++ T P  + 
Sbjct: 13  YLKPGSAVEISSDEIGFRGSWYMGKVITIPSSSDKDSVKCQVEYTTLFFDKEGTKPLKEV 72

Query: 198 VSIDKVRPRPPPETHHDFK----IGDKVDAYDKGSWREGHLVKELEDGKFAVDFNLPKQL 365
           V + ++RP  PP +  + K    +G++VDA+    W EG + + L+DGKF+V F   K+ 
Sbjct: 73  VDMSQLRPPAPPMSEIEKKKKIVVGEEVDAFYNDGWWEGDVTEVLDDGKFSVFFRSSKEQ 132

Query: 366 NEFPKENLRTHREWIDDHWEPPIQQQKQE------------------------------- 452
             F K+ LR HREW+D  W+PP+++ ++E                               
Sbjct: 133 IRFRKDELRFHREWVDGAWKPPLEETEEEEDESEEDKLDDSEDEEDILARVDLETTRAIA 192

Query: 453 --LFRIGDLVEVSSKVKGYRGTWFLAEVVELKVQGKFLVEHKHRLHDVTGKLLKEKIDDD 626
             +F  G +VEVSS  +G++G WF A+VVE   + KFLVE++        + LKE+ D  
Sbjct: 193 KQMFSSGTVVEVSSDEEGFQGCWFAAKVVEPVGEDKFLVEYRDLREKDGIEPLKEETDFL 252

Query: 627 HIRPLPP 647
           HIRP PP
Sbjct: 253 HIRPPPP 259

 Score = 98.6 bits (244), Expect = 6e-20
 Identities = 62/218 (28%), Positives = 107/218 (48%), Gaps = 24/218 (11%)
 Frame = +3

Query: 63  VEFCSEIAG-QRSWSIGKIVSRPSFSPDHVLVEYD--YEQD-TNPKTQSVSIDKVRPRPP 230
           VE  S+  G Q  W   K+V       D  LVEY    E+D   P  +      +RP PP
Sbjct: 202 VEVSSDEEGFQGCWFAAKVVE--PVGEDKFLVEYRDLREKDGIEPLKEETDFLHIRPPPP 259

Query: 231 PETHHDFKIGDKVDAYDKGSWREGHLVKELEDGKFAVDFNLPKQLNEFPKENLRTHREWI 410
            +   DF +GDK++A+    W  G ++  ++ G   + F   ++   F ++ LR H++W+
Sbjct: 260 RDEDIDFAVGDKINAFYNDGWWVGVVIDGMKHGTVGIYFRQSQEKMRFGRQGLRLHKDWV 319

Query: 411 DDHWEPPIQQQK--------------------QELFRIGDLVEVSSKVKGYRGTWFLAEV 530
           D  W+ P++  K                    ++ F IG  +EVS + +G+  +WFLA++
Sbjct: 320 DGTWQLPLKGGKIKREKTVSCNRNVRPKKATEKQAFSIGTPIEVSPEEEGFEDSWFLAKL 379

Query: 531 VELKVQGKFLVEHKHRLHDVTGKLLKEKIDDDHIRPLP 644
           +E + + K LVE+ +   +   + L+E+++   IRPLP
Sbjct: 380 IEYRGKDKCLVEYDNLKAEDGKEPLREEVNVSRIRPLP 417

 Score = 56.2 bits (134), Expect = 4e-07
 Identities = 36/113 (31%), Positives = 52/113 (45%), Gaps = 4/113 (3%)
 Frame = +3

Query: 96  SWSIGKIVSRPSFSPDHVLVEYDY---EQDTNPKTQSVSIDKVRPRPPPETH-HDFKIGD 263
           SW + K++       D  LVEYD    E    P  + V++ ++RP P        F+  D
Sbjct: 373 SWFLAKLIEYRG--KDKCLVEYDNLKAEDGKEPLREEVNVSRIRPLPLESVMVSPFERHD 430

Query: 264 KVDAYDKGSWREGHLVKELEDGKFAVDFNLPKQLNEFPKENLRTHREWIDDHW 422
           KV+A     W  G + K L    + V F   ++L +F    LR H+EWID  W
Sbjct: 431 KVNALYNDGWWVGVIRKVLAKSSYLVLFKNTQELLKFHHSQLRLHQEWIDGKW 483

>pir||C86226 protein T31J12.4 [imported] - Arabidopsis thaliana
           gi|4337176|gb|AAD18097.1| T31J12.4 [Arabidopsis
           thaliana]
          Length = 514

 Score =  100 bits (249), Expect = 2e-20
 Identities = 54/149 (36%), Positives = 88/149 (58%), Gaps = 12/149 (8%)
 Frame = +3

Query: 42  YLEPDAEVEFCSEIAGQR-SWSIGKIVSRPSFSP-DHVLVEYDY------EQDTNPKTQS 197
           YL+P + VE  S+  G R SW +GK+++ PS S  D V  + +Y      ++ T P  + 
Sbjct: 13  YLKPGSAVEISSDEIGFRGSWYMGKVITIPSSSDKDSVKCQVEYTTLFFDKEGTKPLKEV 72

Query: 198 VSIDKVRPRPPPETHHDFK----IGDKVDAYDKGSWREGHLVKELEDGKFAVDFNLPKQL 365
           V + ++RP  PP +  + K    +G++VDA+    W EG + + L+DGKF+V F   K+ 
Sbjct: 73  VDMSQLRPPAPPMSEIEKKKKIVVGEEVDAFYNDGWWEGDVTEVLDDGKFSVFFRSSKEQ 132

Query: 366 NEFPKENLRTHREWIDDHWEPPIQQQKQE 452
             F K+ LR HREW+D  W+PP+++ ++E
Sbjct: 133 IRFRKDELRFHREWVDGAWKPPLEETEEE 161

 Score = 98.6 bits (244), Expect = 6e-20
 Identities = 62/218 (28%), Positives = 107/218 (48%), Gaps = 24/218 (11%)
 Frame = +3

Query: 63  VEFCSEIAG-QRSWSIGKIVSRPSFSPDHVLVEYD--YEQD-TNPKTQSVSIDKVRPRPP 230
           VE  S+  G Q  W   K+V       D  LVEY    E+D   P  +      +RP PP
Sbjct: 225 VEVSSDEEGFQGCWFAAKVVE--PVGEDKFLVEYRDLREKDGIEPLKEETDFLHIRPPPP 282

Query: 231 PETHHDFKIGDKVDAYDKGSWREGHLVKELEDGKFAVDFNLPKQLNEFPKENLRTHREWI 410
            +   DF +GDK++A+    W  G ++  ++ G   + F   ++   F ++ LR H++W+
Sbjct: 283 RDEDIDFAVGDKINAFYNDGWWVGVVIDGMKHGTVGIYFRQSQEKMRFGRQGLRLHKDWV 342

Query: 411 DDHWEPPIQQQK--------------------QELFRIGDLVEVSSKVKGYRGTWFLAEV 530
           D  W+ P++  K                    ++ F IG  +EVS + +G+  +WFLA++
Sbjct: 343 DGTWQLPLKGGKIKREKTVSCNRNVRPKKATEKQAFSIGTPIEVSPEEEGFEDSWFLAKL 402

Query: 531 VELKVQGKFLVEHKHRLHDVTGKLLKEKIDDDHIRPLP 644
           +E + + K LVE+ +   +   + L+E+++   IRPLP
Sbjct: 403 IEYRGKDKCLVEYDNLKAEDGKEPLREEVNVSRIRPLP 440

 Score = 56.2 bits (134), Expect = 4e-07
 Identities = 36/113 (31%), Positives = 52/113 (45%), Gaps = 4/113 (3%)
 Frame = +3

Query: 96  SWSIGKIVSRPSFSPDHVLVEYDY---EQDTNPKTQSVSIDKVRPRPPPETH-HDFKIGD 263
           SW + K++       D  LVEYD    E    P  + V++ ++RP P        F+  D
Sbjct: 396 SWFLAKLIEYRG--KDKCLVEYDNLKAEDGKEPLREEVNVSRIRPLPLESVMVSPFERHD 453

Query: 264 KVDAYDKGSWREGHLVKELEDGKFAVDFNLPKQLNEFPKENLRTHREWIDDHW 422
           KV+A     W  G + K L    + V F   ++L +F    LR H+EWID  W
Sbjct: 454 KVNALYNDGWWVGVIRKVLAKSSYLVLFKNTQELLKFHHSQLRLHQEWIDGKW 506

 Score = 38.5 bits (88), Expect = 0.079
 Identities = 24/67 (35%), Positives = 35/67 (51%), Gaps = 6/67 (8%)
 Frame = +3

Query: 465 GDLVEVSSKVKGYRGTWFLAEVVEL-----KVQGKFLVEHKHRLHDVTG-KLLKEKIDDD 626
           G  VE+SS   G+RG+W++ +V+ +     K   K  VE+     D  G K LKE +D  
Sbjct: 17  GSAVEISSDEIGFRGSWYMGKVITIPSSSDKDSVKCQVEYTTLFFDKEGTKPLKEVVDMS 76

Query: 627 HIRPLPP 647
            +RP  P
Sbjct: 77  QLRPPAP 83

>ref|NP_187304.1| hypothetical protein; protein id: At3g06520.1 [Arabidopsis
           thaliana] gi|12322681|gb|AAG51333.1|AC020580_13
           hypothetical protein; 66083-64412 [Arabidopsis thaliana]
          Length = 466

 Score = 98.2 bits (243), Expect = 8e-20
 Identities = 69/236 (29%), Positives = 117/236 (49%), Gaps = 29/236 (12%)
 Frame = +3

Query: 27  MSASPYLEPDAEVEFCSEIAGQRSWSIGKIVSRPSFSPDHVLVEYDYEQDTNPKT----Q 194
           M++  +      VE    ++G  ++    +VS PS     V VE++        +    +
Sbjct: 1   MTSDRFWRGGDRVEVERLVSGATAYFPASVVSAPSVRKKLVWVEHESLTVGGSVSVRMKE 60

Query: 195 SVSIDKVRPRPPPETHHDFKIGDKVDAY--DKGSWREGHLVKELEDGKFAVDF---NLPK 359
            V+  ++RP PP E +  FK  D+VD +   +G W  G++   LED ++ V+F   N P+
Sbjct: 61  YVTPTRLRPSPPRELNRRFKADDEVDVFRDSEGCWVRGNVTTVLEDSRYIVEFKGENRPE 120

Query: 360 -QLNEFPKENLRTHREWIDDHWEPPIQQQ------------------KQELFRIGDLVEV 482
            ++++F   NLR HREW+D  W P + QQ                  +++ +  G LVEV
Sbjct: 121 IEVDQF---NLRLHREWLDGGWVPSLLQQSNFSESTAQRIKLKIKIKRRDQYEKGALVEV 177

Query: 483 SSKVKGYRGTWFLAEVVELKVQGKFLVEH-KHRLHDVTGKLLKEKIDDDHIRPLPP 647
            S+ K Y+G+W+ A ++ L    K++VEH K    D     L++ ++   IRP+PP
Sbjct: 178 RSEEKAYKGSWYCARILCLLGDDKYIVEHLKFSRDDGESIPLRDVVEAKDIRPVPP 233

 Score = 67.4 bits (163), Expect = 2e-10
 Identities = 64/229 (27%), Positives = 99/229 (42%), Gaps = 29/229 (12%)
 Frame = +3

Query: 48  EPDAEVEFCSEIAGQR-SWSIGKIVSRPSFSPDHVLVEY-DYEQDTN---PKTQSVSIDK 212
           E  A VE  SE    + SW   +I+       D  +VE+  + +D     P    V    
Sbjct: 170 EKGALVEVRSEEKAYKGSWYCARILCL--LGDDKYIVEHLKFSRDDGESIPLRDVVEAKD 227

Query: 213 VRPRPPPETHHD--FKIGDKVDAYDKGSWREGHLVKELEDG--KFAVDFNLPKQLNEFPK 380
           +RP PP E      ++ G  VDA+    W    + K L  G  K++V      +      
Sbjct: 228 IRPVPPSELSPVVCYEPGVIVDAWFNKRWWTSRVSKVLGGGSNKYSVFIISTGEETTILN 287

Query: 381 ENLRTHREWIDDHW---------------EPPIQQQK-----QELFRIGDLVEVSSKVKG 500
            NLR H++WI+  W               +PP+++ K     +++F  G  VEV S   G
Sbjct: 288 FNLRPHKDWINGQWVIPSKVLTDVPEECYKPPLKKLKSCERAEKVFNNGMEVEVRSDEPG 347

Query: 501 YRGTWFLAEVVELKVQGKFLVEHKHRLHDVTGKLLKEKIDDDHIRPLPP 647
           Y  +WF A++V    + ++ VE++    D   +LLKE+     IRP PP
Sbjct: 348 YEASWFSAKIVSYLGENRYTVEYQTLKTDDERELLKEEARGSDIRPPPP 396

>ref|NP_182245.1| unknown protein; protein id: At2g47230.1 [Arabidopsis thaliana]
           gi|25364476|pir||F84912 hypothetical protein At2g47230
           [imported] - Arabidopsis thaliana
           gi|2275201|gb|AAB63823.1| unknown protein [Arabidopsis
           thaliana]
          Length = 701

 Score = 83.2 bits (204), Expect = 3e-15
 Identities = 53/164 (32%), Positives = 82/164 (49%), Gaps = 8/164 (4%)
 Frame = +3

Query: 180 NPKTQSVSIDKVRPRPPPETHHDFKI--GDKVDAYDKGSWREGHLVKELEDGKFAVDFNL 353
           +P  +++    +RP PP   ++   +  G  VDA  K  W  G ++K+LE+GKF V ++ 
Sbjct: 55  SPLIENIEPRFIRPVPPENEYNGIVLEEGTVVDADHKDGWWTGVIIKKLENGKFWVYYDS 114

Query: 354 PKQLNEFPKENLRTHREWIDDHW-EPPIQQQKQELFRIGDLVEVSSKVKGYRGTWFLAEV 530
           P  + EF +  LR H  W    W  P IQ+  + +F  G + EVS+ V      WF A +
Sbjct: 115 PPDIIEFERNQLRPHLRWSGWKWLRPDIQELDKSMFSSGTMAEVSTIVDKAEVAWFPAMI 174

Query: 531 V-ELKVQG--KFLVE--HKHRLHDVTGKLLKEKIDDDHIRPLPP 647
           + E++V G  KF+V+  +KH             ID   +RP PP
Sbjct: 175 IKEIEVDGEKKFIVKDCNKHLSFSGDEARTNSTIDSSRVRPTPP 218

 Score = 41.2 bits (95), Expect = 0.012
 Identities = 28/116 (24%), Positives = 51/116 (43%)
 Frame = +3

Query: 78  EIAGQRSWSIGKIVSRPSFSPDHVLVEYDYEQDTNPKTQSVSIDKVRPRPPPETHHDFKI 257
           E+ G++ + +       SFS D        E  TN    ++   +VRP PPP     +++
Sbjct: 179 EVDGEKKFIVKDCNKHLSFSGD--------EARTN---STIDSSRVRPTPPPFPVEKYEL 227

Query: 258 GDKVDAYDKGSWREGHLVKELEDGKFAVDFNLPKQLNEFPKENLRTHREWIDDHWE 425
            D+V+ +    WR+G +   L+   + V   + K+       +LR  + W D  W+
Sbjct: 228 MDRVEVFRGSVWRQGLVRGVLDHNCYMVCLVVTKEEPVVKHSDLRPCKVWEDGVWQ 283

 Score = 35.0 bits (79), Expect = 0.87
 Identities = 25/70 (35%), Positives = 33/70 (46%), Gaps = 3/70 (4%)
 Frame = +3

Query: 447 QELFRIGDLVEVSSKVKGYRGTWF---LAEVVELKVQGKFLVEHKHRLHDVTGKLLKEKI 617
           +E  R G  VEVSS  +G+   WF   L E      + K  V +   L+D     L E I
Sbjct: 2   EETIRKGSEVEVSSTEEGFADAWFRGILQENPTKSGRKKLRVRYLTLLNDDALSPLIENI 61

Query: 618 DDDHIRPLPP 647
           +   IRP+PP
Sbjct: 62  EPRFIRPVPP 71

>ref|NP_171829.1| unknown protein; protein id: At1g03300.1 [Arabidopsis thaliana]
           gi|25364480|pir||E86164 F15K9.10 protein - Arabidopsis
           thaliana gi|3850574|gb|AAC72114.1| Strong similarity to
           T08I13.7 gi|2275201 unknown protein from Arabidopsis
           thaliana BAC gb|AC002337.  EST gb|Z17450 comes from this
           gene
          Length = 670

 Score = 77.0 bits (188), Expect = 2e-13
 Identities = 61/215 (28%), Positives = 101/215 (46%), Gaps = 17/215 (7%)
 Frame = +3

Query: 54  DAEVEFCSEIAGQRSWSIGKIVSRPSFSP---------DHVLVEYDYEQDTNPKTQSVSI 206
           D EVE  SE  G R+     I+     +P          ++    + E  ++P T  V  
Sbjct: 6   DCEVEIFSEEDGFRNAWYRAILEETPTNPTSESKKLRFSYMTKSLNKEGSSSPPT--VEQ 63

Query: 207 DKVRPRPPPETHHD--FKIGDKVDAYDKGSWREGHLVKELEDGKFAVDFNLPKQLNEFPK 380
             +RP PP   ++   F+ G  VDA  K  WR G ++ ++E+  + V F+ P  + +F  
Sbjct: 64  RFIRPVPPENLYNGVVFEEGTMVDADYKHRWRTGVVINKMENDSYLVLFDCPPDIIQFET 123

Query: 381 ENLRTHREWIDDHW-EPPIQQQKQELFRIGDLVEVSSKVKGYRGTWFLAEVV-ELKVQG- 551
           ++LR H +W    W +P +++  + +F  G LVEVS  +     +W  A +V E++  G 
Sbjct: 124 KHLRAHLDWTGSEWVQPEVRELSKSMFSPGTLVEVSCVIDKVEVSWVTAMIVKEIEESGE 183

Query: 552 -KFLVE--HKHRLHDVTGKLLKEKIDDDHIRPLPP 647
            KF+V+  +KH    V        +D   +RP PP
Sbjct: 184 KKFIVKVCNKHLSCRVDEAKPNMTVDSCCVRPRPP 218

 Score = 37.0 bits (84), Expect = 0.23
 Identities = 21/77 (27%), Positives = 34/77 (43%)
 Frame = +3

Query: 213 VRPRPPPETHHDFKIGDKVDAYDKGSWREGHLVKELEDGKFAVDFNLPKQLNEFPKENLR 392
           VRPRPP     ++ + D V+ +   SWR+G +     + ++ V     K        +LR
Sbjct: 213 VRPRPPLFFVEEYDLRDCVEVFHGSSWRQGVVKGVHIEKQYTVTLEATKDKLVVKHSDLR 272

Query: 393 THREWIDDHWEPPIQQQ 443
             + W D  W    QQ+
Sbjct: 273 PFKVWEDGVWHNGPQQK 289

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 629,571,714
Number of Sequences: 1393205
Number of extensions: 15513848
Number of successful extensions: 65487
Number of sequences better than 10.0: 109
Number of HSP's better than 10.0 without gapping: 57637
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 64901
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 27576232529
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MFB011h10_f BP034754 1 506
2 SPD096b03_f BP051646 260 649




Lotus japonicus
Kazusa DNA Research Institute