KMC001858A_c02

Fasta Sequence
>KMC001858A_C02 KMC001858A_c02
ctttttttttttttcttttttcttctttttttTGGAGTGGAACAAATAACTTTTATTAGA
CTAAACTTAATCACAATTATGTCTGTCTTAAACTAAATTTATTCCATTTACCAGACATGA
TTGACCCGTGGCAAAAGCACAAGGGCAACAACAAAACTAAAACAAGAAGTGGAAGAAGAG
AAGAACTCTATGCTTTTTTGCAGTGGCTGCAAAGGAGCACCACACAAACAATCATTCCCC
ACAAAAGCACTAGCTGGAAACTTAGTCTTGGGAATCTCACCACAAAGATGATTGTAGCTC
ACATTCAACTTCTCAAGCCCAGCAACACTCTTAGGAACTTTCCCAAACACCAAATTATGA
GACAAATCCAAATACTTAAACCTCTCCCCAAATTTCAATCTCTCAAAATCAAACTTCAAC
TTATTCCCAGAACCCCAAAACCCCACCAAATACTCTGTACTATTCACCAAACCAACCGCA
CTCCCTGAAATCTCATTCCCAGAAAGATCAATGAAGTCATAGAAGTACGTCTCTGCAGGC
TTCCAATCATCGAGCTTCATCTTTATCCCACACCGTGCGAGCTTCAACGAGAAGATGATC
GGAGATGAAGTGACCCACTTCGGGATTTGATTCAAATGGAACATGTTGTTCGACAAATCC
AATGATTCGATGCCTTTCACGTTCATTTCGGGAAATGGGTCTACGAGGAGATCGTnggcg
aggttgaga
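
The alignments in the Nr search section are all reported in Frame = -2, with the query starting at nucleotide 728. The sketch below shows one way to reproduce that translation from the sequence above. It assumes the standard genetic code, treats any codon containing an ambiguous base as X, and takes BLAST frame -k to mean the reverse complement read from offset k-1. The function names are illustrative and are not part of any BLAST tool.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq):
    """Return the reverse complement, upper-cased (soft-masked bases included)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def translate_minus_frame(seq, frame):
    """Translate the minus strand; frame=1, 2, 3 stands for BLAST frames -1, -2, -3."""
    rc = reverse_complement(seq)
    start = frame - 1
    codons = (rc[i:i + 3] for i in range(start, len(rc) - 2, 3))
    # Codons with an ambiguous base (e.g. N) are not in the table and become X.
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Last 20 nucleotides of the contig (positions 710-729 of the sequence above).
tail = "GATCGTnggcgaggttgaga"
print(translate_minus_frame(tail, 2))  # first residues of the Query line
```

On the last 20 nucleotides this gives LNLAXD, which matches the start of the Query line at position 728 in the alignments.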


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC001858A_C02 KMC001858A_c02
         (729 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_564426.1| disease resistance protein-related (LRR); prote...   264  1e-69
pir||G86459 Hypothetical 55.6 kDa protein - Arabidopsis thaliana...   264  1e-69
gb|AAM60932.1| putative disease resistance protein [Arabidopsis ...   263  1e-69
ref|NP_174624.1| leucine rich repeat protein family; protein id:...   219  3e-56
ref|NP_174625.1| leucine rich repeat protein family; protein id:...   216  2e-55

>ref|NP_564426.1| disease resistance protein-related (LRR); protein id: At1g33590.1,
           supported by cDNA: 105146. [Arabidopsis thaliana]
          Length = 477

 Score =  264 bits (674), Expect = 1e-69
 Identities = 125/175 (71%), Positives = 145/175 (82%)
 Frame = -2

Query: 728 LNLAXDLLVDPFPEMNVKGIESLDLSNNMFHLNQIPKWVTSSPIIFSLKLARCGIKMKLD 549
           L+L+ +LL DPFP +NVKGIESLDLS N FHLN IPKWVTSSPIIFSLKLA+CGIKM LD
Sbjct: 300 LDLSHNLLTDPFPVLNVKGIESLDLSYNQFHLNTIPKWVTSSPIIFSLKLAKCGIKMSLD 359

Query: 548 DWKPAETYFYDFIDLSGNEISGSAVGLVNSTEYLVGFWGSGNKLKFDFERLKFGERFKYL 369
           DWKPA+T++YDFIDLS NEI+GS    +N TEYLV F  +GNKL+FD  +L F +    L
Sbjct: 360 DWKPAQTFYYDFIDLSENEITGSPARFLNQTEYLVEFKAAGNKLRFDMGKLTFAKTLTTL 419

Query: 368 DLSHNLVFGKVPKSVAGLEKLNVSYNHLCGEIPKTKFPASAFVGNDCLCGAPLQP 204
           D+S NLVFGKVP  VAGL+ LNVS+NHLCG++P TKFPASAFVGNDCLCG+PL P
Sbjct: 420 DISRNLVFGKVPAMVAGLKTLNVSHNHLCGKLPVTKFPASAFVGNDCLCGSPLSP 474

 Score = 37.0 bits (84), Expect = 0.28
 Identities = 43/158 (27%), Positives = 63/158 (39%), Gaps = 5/158 (3%)
 Frame = -2

Query: 728 LNLAXDLLVDPFP--EMNVKGIESLDLSNNMFHLNQIPKWVTSSPIIFSLKLARCGIKMK 555
           L L  +LL    P    N+K +  L+L  N      IP    S P + SL L+R G    
Sbjct: 179 LKLGNNLLTGTIPLGVANLKLMSYLNLGGNRL-TGTIPDIFKSMPELRSLTLSRNGFSGN 237

Query: 554 LDDWKPAETYFYDFIDLSGNEISGSAVGLVNSTEYLVGFWGSGNKLKFDFERLKFGERFK 375
           L     +      F++L  N++SG+    +++ + L                        
Sbjct: 238 LPPSIASLAPILRFLELGHNKLSGTIPNFLSNFKAL-----------------------D 274

Query: 374 YLDLSHNLVFGKVPKSVAGLEK---LNVSYNHLCGEIP 270
            LDLS N   G +PKS A L K   L++S+N L    P
Sbjct: 275 TLDLSKNRFSGVIPKSFANLTKIFNLDLSHNLLTDPFP 312

>pir||G86459 Hypothetical 55.6 kDa protein - Arabidopsis thaliana
           gi|10998936|gb|AAG26075.1|AC069299_1 hypothetical
           protein [Arabidopsis thaliana]
          Length = 512

 Score =  264 bits (674), Expect = 1e-69
 Identities = 125/175 (71%), Positives = 145/175 (82%)
 Frame = -2

Query: 728 LNLAXDLLVDPFPEMNVKGIESLDLSNNMFHLNQIPKWVTSSPIIFSLKLARCGIKMKLD 549
           L+L+ +LL DPFP +NVKGIESLDLS N FHLN IPKWVTSSPIIFSLKLA+CGIKM LD
Sbjct: 335 LDLSHNLLTDPFPVLNVKGIESLDLSYNQFHLNTIPKWVTSSPIIFSLKLAKCGIKMSLD 394

Query: 548 DWKPAETYFYDFIDLSGNEISGSAVGLVNSTEYLVGFWGSGNKLKFDFERLKFGERFKYL 369
           DWKPA+T++YDFIDLS NEI+GS    +N TEYLV F  +GNKL+FD  +L F +    L
Sbjct: 395 DWKPAQTFYYDFIDLSENEITGSPARFLNQTEYLVEFKAAGNKLRFDMGKLTFAKTLTTL 454

Query: 368 DLSHNLVFGKVPKSVAGLEKLNVSYNHLCGEIPKTKFPASAFVGNDCLCGAPLQP 204
           D+S NLVFGKVP  VAGL+ LNVS+NHLCG++P TKFPASAFVGNDCLCG+PL P
Sbjct: 455 DISRNLVFGKVPAMVAGLKTLNVSHNHLCGKLPVTKFPASAFVGNDCLCGSPLSP 509

 Score = 37.0 bits (84), Expect = 0.28
 Identities = 43/158 (27%), Positives = 63/158 (39%), Gaps = 5/158 (3%)
 Frame = -2

Query: 728 LNLAXDLLVDPFP--EMNVKGIESLDLSNNMFHLNQIPKWVTSSPIIFSLKLARCGIKMK 555
           L L  +LL    P    N+K +  L+L  N      IP    S P + SL L+R G    
Sbjct: 214 LKLGNNLLTGTIPLGVANLKLMSYLNLGGNRL-TGTIPDIFKSMPELRSLTLSRNGFSGN 272

Query: 554 LDDWKPAETYFYDFIDLSGNEISGSAVGLVNSTEYLVGFWGSGNKLKFDFERLKFGERFK 375
           L     +      F++L  N++SG+    +++ + L                        
Sbjct: 273 LPPSIASLAPILRFLELGHNKLSGTIPNFLSNFKAL-----------------------D 309

Query: 374 YLDLSHNLVFGKVPKSVAGLEK---LNVSYNHLCGEIP 270
            LDLS N   G +PKS A L K   L++S+N L    P
Sbjct: 310 TLDLSKNRFSGVIPKSFANLTKIFNLDLSHNLLTDPFP 347

>gb|AAM60932.1| putative disease resistance protein [Arabidopsis thaliana]
          Length = 477

 Score =  263 bits (673), Expect = 1e-69
 Identities = 125/175 (71%), Positives = 145/175 (82%)
 Frame = -2

Query: 728 LNLAXDLLVDPFPEMNVKGIESLDLSNNMFHLNQIPKWVTSSPIIFSLKLARCGIKMKLD 549
           L+L+ +LL DPFP +NVKGIESLDLS N FHLN IPKWVTSSPIIFSLKLA+CGIKM LD
Sbjct: 300 LDLSHNLLTDPFPVLNVKGIESLDLSYNKFHLNTIPKWVTSSPIIFSLKLAKCGIKMSLD 359

Query: 548 DWKPAETYFYDFIDLSGNEISGSAVGLVNSTEYLVGFWGSGNKLKFDFERLKFGERFKYL 369
           DWKPA+T++YDFIDLS NEI+GS    +N TEYLV F  +GNKL+FD  +L F +    L
Sbjct: 360 DWKPAQTFYYDFIDLSENEITGSPARFLNQTEYLVEFKAAGNKLRFDMGKLTFAKTLTTL 419

Query: 368 DLSHNLVFGKVPKSVAGLEKLNVSYNHLCGEIPKTKFPASAFVGNDCLCGAPLQP 204
           D+S NLVFGKVP  VAGL+ LNVS+NHLCG++P TKFPASAFVGNDCLCG+PL P
Sbjct: 420 DISRNLVFGKVPAMVAGLKTLNVSHNHLCGKLPVTKFPASAFVGNDCLCGSPLSP 474

 Score = 37.0 bits (84), Expect = 0.28
 Identities = 43/158 (27%), Positives = 63/158 (39%), Gaps = 5/158 (3%)
 Frame = -2

Query: 728 LNLAXDLLVDPFP--EMNVKGIESLDLSNNMFHLNQIPKWVTSSPIIFSLKLARCGIKMK 555
           L L  +LL    P    N+K +  L+L  N      IP    S P + SL L+R G    
Sbjct: 179 LKLGNNLLTGTIPLGVANLKLMSYLNLGGNRL-TGTIPDIFKSMPELRSLTLSRNGFSGN 237

Query: 554 LDDWKPAETYFYDFIDLSGNEISGSAVGLVNSTEYLVGFWGSGNKLKFDFERLKFGERFK 375
           L     +      F++L  N++SG+    +++ + L                        
Sbjct: 238 LPPSIASLAPILRFLELGHNKLSGTIPNFLSNFKAL-----------------------D 274

Query: 374 YLDLSHNLVFGKVPKSVAGLEK---LNVSYNHLCGEIP 270
            LDLS N   G +PKS A L K   L++S+N L    P
Sbjct: 275 TLDLSKNRFSGVIPKSFANLTKIFNLDLSHNLLTDPFP 312

>ref|NP_174624.1| leucine rich repeat protein family; protein id: At1g33600.1
           [Arabidopsis thaliana] gi|25518789|pir||H86459
           hypothetical protein T1E4.2 - Arabidopsis thaliana
           gi|10998942|gb|AAG26081.1|AC069299_7 hypothetical
           protein [Arabidopsis thaliana]
          Length = 478

 Score =  219 bits (558), Expect = 3e-56
 Identities = 107/176 (60%), Positives = 127/176 (71%), Gaps = 1/176 (0%)
 Frame = -2

Query: 728 LNLAXDLLVDPFPEM-NVKGIESLDLSNNMFHLNQIPKWVTSSPIIFSLKLARCGIKMKL 552
           LNL+ + L  P P M NV G+ +LDLS N FHL  IPKWVTSSP ++SLKL +CGI M L
Sbjct: 300 LNLSHNFLTGPLPAMKNVDGLATLDLSYNQFHLKTIPKWVTSSPSMYSLKLVKCGINMSL 359

Query: 551 DDWKPAETYFYDFIDLSGNEISGSAVGLVNSTEYLVGFWGSGNKLKFDFERLKFGERFKY 372
           D+WKP     Y +IDLS NEISGS     N    L  F  SGNKL+FD  +L   ER + 
Sbjct: 360 DNWKPVRPNIYFYIDLSENEISGSLTWFFNLAHNLYEFQASGNKLRFDMGKLNLSERLES 419

Query: 371 LDLSHNLVFGKVPKSVAGLEKLNVSYNHLCGEIPKTKFPASAFVGNDCLCGAPLQP 204
           LDLS NL+FGKVP +VA L+KLN+S+NHLCG++P TKFPASAFVGNDCLCG+PL P
Sbjct: 420 LDLSRNLIFGKVPMTVAKLQKLNLSHNHLCGKLPVTKFPASAFVGNDCLCGSPLSP 475

 Score = 45.1 bits (105), Expect = 0.001
 Identities = 47/161 (29%), Positives = 68/161 (42%), Gaps = 5/161 (3%)
 Frame = -2

Query: 728 LNLAXDLLVDPFPE--MNVKGIESLDLSNNMFHLNQIPKWVTSSPIIFSLKLARCGIKMK 555
           LNL  +LL    P    N+K + SL+  NN      IP    S   + SL L+R      
Sbjct: 179 LNLGDNLLTGTIPLGLANLKILLSLNFGNNRLS-ETIPDIFKSMQKLQSLTLSRNKFSGN 237

Query: 554 LDDWKPAETYFYDFIDLSGNEISGSAVGLVNSTEYLVGFWGSGNKLKFDFERLKFGERFK 375
           L     +     +++DLS N +SG+    +++ + L                        
Sbjct: 238 LPPSIASLKPILNYLDLSQNNLSGTIPTFLSNFKVLDS---------------------- 275

Query: 374 YLDLSHNLVFGKVPKSVAGLEK---LNVSYNHLCGEIPKTK 261
            LDLS N   G VPKS+A + K   LN+S+N L G +P  K
Sbjct: 276 -LDLSRNRFSGVVPKSLANMPKLFHLNLSHNFLTGPLPAMK 315

>ref|NP_174625.1| leucine rich repeat protein family; protein id: At1g33610.1
            [Arabidopsis thaliana] gi|25511729|pir||A86460 99.9K
            hypothetical protein T1E4.10 - Arabidopsis thaliana
            gi|10998940|gb|AAG26079.1|AC069299_5 hypothetical protein
            [Arabidopsis thaliana]
          Length = 907

 Score =  216 bits (551), Expect = 2e-55
 Identities = 104/176 (59%), Positives = 132/176 (74%), Gaps = 1/176 (0%)
 Frame = -2

Query: 728  LNLAXDLLVDPFPEM-NVKGIESLDLSNNMFHLNQIPKWVTSSPIIFSLKLARCGIKMKL 552
            L+L+ +LL  PFP + ++ GIESLDLS N FHL  IPKW+ SSP I+SLKLA+CG+K+ L
Sbjct: 729  LDLSHNLLTGPFPVLKSINGIESLDLSYNKFHLKTIPKWMISSPSIYSLKLAKCGLKISL 788

Query: 551  DDWKPAETYFYDFIDLSGNEISGSAVGLVNSTEYLVGFWGSGNKLKFDFERLKFGERFKY 372
            DDWK A TY+YD IDLS NEISGS    ++  +YL+ F  +GNKL+FD  +L F    + 
Sbjct: 789  DDWKLAGTYYYDSIDLSENEISGSPAKFLSQXKYLMEFRAAGNKLRFDLGKLTFVRTLET 848

Query: 371  LDLSHNLVFGKVPKSVAGLEKLNVSYNHLCGEIPKTKFPASAFVGNDCLCGAPLQP 204
            LDLS NL+FG+V  + AGL+ +NVS NHLCG++P TKFPAS F GNDCLCG+PL P
Sbjct: 849  LDLSRNLIFGRVLATFAGLKTMNVSQNHLCGKLPVTKFPASXFAGNDCLCGSPLSP 904

 Score =  199 bits (507), Expect = 3e-50
 Identities = 99/164 (60%), Positives = 118/164 (71%)
 Frame = -2

Query: 728 LNLAXDLLVDPFPEMNVKGIESLDLSNNMFHLNQIPKWVTSSPIIFSLKLARCGIKMKLD 549
           L+L+ +LL   FP++ V  IE LDLS N F L  IP+WVT  P +F LKLA+CGIKM LD
Sbjct: 301 LDLSHNLLTGQFPDLTVNTIEYLDLSYNQFQLETIPQWVTLLPSVFLLKLAKCGIKMSLD 360

Query: 548 DWKPAETYFYDFIDLSGNEISGSAVGLVNSTEYLVGFWGSGNKLKFDFERLKFGERFKYL 369
           DWKPAE  +Y +IDLS NEISGS    +N T YL+ F  + NKL+FD   L F    K L
Sbjct: 361 DWKPAEPLYYHYIDLSKNEISGSLERFLNETRYLLEFRAAENKLRFDMGNLTFPRTLKTL 420

Query: 368 DLSHNLVFGKVPKSVAGLEKLNVSYNHLCGEIPKTKFPASAFVG 237
           DLS NLVFGKVP +VAGL++LN+S NHLCGE+P TKFPASAF G
Sbjct: 421 DLSRNLVFGKVPVTVAGLQRLNLSQNHLCGELPTTKFPASAFAG 464

 Score = 36.6 bits (83), Expect = 0.37
 Identities = 23/50 (46%), Positives = 29/50 (58%), Gaps = 4/50 (8%)
 Frame = -2

Query: 407 FERLKFGERFKYLDLSHNLVFGKVPKSVAGLE----KLNVSYNHLCGEIP 270
           FE +K     K+LDLS N  +GK+P S+A L      L VS N+L G IP
Sbjct: 219 FESMKL---LKFLDLSSNEFYGKLPLSIATLAPTLLALQVSQNNLSGAIP 265

 Score = 33.1 bits (74), Expect = 4.1
 Identities = 25/84 (29%), Positives = 38/84 (44%), Gaps = 5/84 (5%)
 Frame = -2

Query: 506 LSGNEISGSAVGLVNSTEYLVGFWGSGNKLKFDFERL-KFGERFKYLDLSHNLVFGKVPK 330
           + GN  +G     + +   L       N+L      + K  +    LDLS N  FG++P 
Sbjct: 610 IDGNMFTGHIPSSIANLTRLTWLNLGNNRLSGTIPNIFKSMKELNSLDLSRNGFFGRLPP 669

Query: 329 SVAGLEK----LNVSYNHLCGEIP 270
           S+A L      L++S N+L G IP
Sbjct: 670 SIASLAPTLYYLDLSQNNLSGTIP 693

 Score = 32.0 bits (71), Expect = 9.2
 Identities = 26/87 (29%), Positives = 38/87 (42%), Gaps = 5/87 (5%)
 Frame = -2

Query: 515 FIDLSGNEISGSA-VGLVNSTEYLVGFWGSGNKLKFDFER-LKFGERFKYLDLSHNLVFG 342
           F+DLS NE  G   + +      L+    S N L       +    + + LDLS N   G
Sbjct: 227 FLDLSSNEFYGKLPLSIATLAPTLLALQVSQNNLSGAIPNYISRFNKLEKLDLSKNRFSG 286

Query: 341 KVPKSVAGLEKLN---VSYNHLCGEIP 270
            VP+    L  +N   +S+N L G+ P
Sbjct: 287 VVPQGFVNLTNINNLDLSHNLLTGQFP 313

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 656,336,238
Number of Sequences: 1393205
Number of extensions: 15119698
Number of successful extensions: 69850
Number of sequences better than 10.0: 730
Number of HSP's better than 10.0 without gapping: 51880
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 64447
length of database: 448,689,247
effective HSP length: 120
effective length of database: 281,504,647
effective search space used: 34343566934
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
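
The figures in this report can be cross-checked against each other with the usual Karlin-Altschul relations: bit score S' = (lambda * S - ln K) / ln 2 and E = (search space) * 2^(-S'). The sketch below is a rough consistency check, not a reimplementation of BLAST. It uses the gapped lambda and K printed above, and the printed values are rounded, so small deviations from the reported E values are expected. The effective query length of 122 is inferred here by division and is not stated in the report.

```python
import math

# Values copied from the report above.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 120
SEARCH_SPACE = 34_343_566_934


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect(bits, search_space=SEARCH_SPACE):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


# Effective database length: every sequence is shortened by the HSP length.
effective_db = DB_LETTERS - DB_SEQUENCES * EFFECTIVE_HSP_LENGTH

# Inferred (assumption): search space / effective database length.
effective_query = SEARCH_SPACE // effective_db

print(effective_db, effective_query)
for raw in (674, 84):
    bits = bit_score(raw)
    print(raw, round(bits, 1), f"{expect(bits):.1e}")
```

The effective database length comes out at 281,504,647, as reported, and raw scores 674 and 84 map to about 264 and 37.0 bits, in line with the top hit's two HSPs.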


EST assemble image


no. clone accession start end
1 GNf042d04 BP070456 1 461
2 GENf097c02 BP062310 20 569
3 GNf014e03 BP068386 29 134
4 SPD070d09_f BP049600 32 618
5 MF050f01_f BP030938 34 567
6 GENf020b05 BP059203 35 295
7 SPD085e08_f BP050793 35 636
8 MR034d03_f BP078618 35 345
9 MPDL012d10_f AV777134 35 450
10 GENf044e03 BP060210 35 415
11 GENf081h07 BP061838 35 575
12 GENf049d10 BP060420 35 576
13 SPD073b11_f BP049816 36 237
14 SPD046d05_f BP047660 36 588
15 SPD038a05_f BP046985 37 589
16 SPDL012d02_f BP052724 37 454
17 GNf059h02 BP071783 39 457
18 MR048d12_f BP079713 40 446
19 MWM173c02_f AV767391 40 627
20 GENf089h01 BP062122 40 422
21 SPD010c10_f BP044783 41 628
22 GNf045g08 BP070713 43 629
23 MWM023g05_f AV764999 61 558
24 GNf044h12 BP070648 70 506
25 MPD060d09_f AV774021 72 584
26 MR020b01_f BP077491 83 637
27 MR051g11_f BP079971 85 474
28 MR048g07_f BP079737 85 636
29 MR057g10_f BP080409 87 209
30 MF004g04_f BP028461 88 581
31 MR090e09_f BP082934 103 629
32 MR086a10_f BP082587 172 578
33 MF041e07_f BP030449 192 755
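
The table above can be read programmatically to see how many ESTs cover a given position of the assembly. The sketch below parses a short excerpt of the rows and assumes that the last two columns are 1-based, inclusive start and end coordinates, which the page does not state explicitly. The names EstRow, parse_rows and depth_at are illustrative.

```python
from typing import NamedTuple


class EstRow(NamedTuple):
    index: int
    clone: str
    accession: str
    start: int
    end: int


# Excerpt of the table above (rows 1, 2, 3 and 33), copied verbatim.
EXCERPT = """\
1 GNf042d04 BP070456 1 461
2 GENf097c02 BP062310 20 569
3 GNf014e03 BP068386 29 134
33 MF041e07_f BP030449 192 755
"""


def parse_rows(text):
    """Parse whitespace-separated rows: index, clone, accession, start, end."""
    rows = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 5 or not fields[0].isdigit():
            continue  # skip headers and blank lines
        rows.append(EstRow(int(fields[0]), fields[1], fields[2],
                           int(fields[3]), int(fields[4])))
    return rows


def depth_at(rows, position):
    """Number of ESTs whose [start, end] interval contains `position`."""
    return sum(1 for r in rows if r.start <= position <= r.end)


rows = parse_rows(EXCERPT)
span = (min(r.start for r in rows), max(r.end for r in rows))
print(span, [depth_at(rows, p) for p in (100, 200, 600)])
```

Feeding the full 33-row table to parse_rows in place of the excerpt gives the coverage profile of the whole assembly.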




Lotus japonicus
Kazusa DNA Research Institute