KMC005331A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC005331A_C01 KMC005331A_c01
cttcctatgactACATGTTTATGTGTTTGGGTTGATATCCAATATACAGAGAGAAATGCA
AGCCATAAGTTATAATAATAGTTCAGTTACATGCTATAACTCAAATTTCTGAAACTCTAA
CTTATCTCTCTCTATACTATGATAAAAATGAATGAGTATGTATCCTTTCAAGAAGAAAGG
ACAATTAAGAGTTAAAGGAAGAAGGAAACAGAAAGAAAAAAAAAATAGTAAGAGGGTTAA
AAAAACTAAGATAAACAAGTTTCCTTATGAAGTCTTACAGGCATAGATCAACAAGGAGAA
TTCAACTCTGTCTTTATCTTTCTGTGCAACTCTTCTCCTCCAACCACAACCTGCTATCCA
ATCCACTTCTAGTCCTGAACATGAAAACTGCATCCCCAGAAGCTGGATTGAAGAACCAGT
CATGAACATCCCAAAGCAAATCCACCAGCAACCCATCAACAAAAATTGTCTGATTCCCCC
TGAAATTCCACTGCAACCTCTTCACCCGGATCACTGTCTTCTTGTCAATGCAGACTGACA
AAACAGGGGACTTGAACAACCCTTCACTCTCTTCCACACTGCATCTGATCAAAACATCAT
GCCATGTTCCACTGTCACAGAATTGAGCCTTGGTTGTGTAAAGAGAATTTCCAGAACAAT
GCTCTCTCCGTGACAATAGTGAAACCTTAGCCACAGGGCTGTTGATTTTGAACTTCTTGG
ACACTGTTTCCCCTGCCATGTCACCAAGAACAAGACCAATTTCTGAAtcaaccagaatca
aaacgtaaaacccatcaacaggttcaggtccagaatcataattcgcatttgacagatccc
agaagat


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC005331A_C01 KMC005331A_c01
         (847 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_180345.1| hypothetical protein; protein id: At2g27770.1 [...   220  8e-59
gb|AAM76760.1| hypothetical protein [Arabidopsis thaliana]            213  3e-55
ref|NP_198167.1| putative protein; protein id: At5g28150.1, supp...   147  2e-34
ref|NP_565847.1| expressed protein; protein id: At2g36470.1, sup...   144  2e-33
ref|NP_566240.1| expressed protein; protein id: At3g04860.1, sup...   144  2e-33

>ref|NP_180345.1| hypothetical protein; protein id: At2g27770.1 [Arabidopsis
           thaliana] gi|25407920|pir||G84676 hypothetical protein
           At2g27770 [imported] - Arabidopsis thaliana
           gi|3860255|gb|AAC73023.1| hypothetical protein
           [Arabidopsis thaliana] gi|26449502|dbj|BAC41877.1|
           unknown protein [Arabidopsis thaliana]
           gi|28950899|gb|AAO63373.1| At2g27770 [Arabidopsis
           thaliana]
          Length = 320

 Score =  220 bits (561), Expect(2) = 8e-59
 Identities = 115/182 (63%), Positives = 146/182 (80%), Gaps = 10/182 (5%)
 Frame = -1

Query: 847 IFWDLSNANYDS---GPEPVDGFYVLILVDSEIGLVLGDMAGETVSKKFKINSPVAKVSL 677
           +FWDLS+A YDS   GPEP++GFYV++LVD ++GL+LGD + ET+ KK          SL
Sbjct: 119 VFWDLSSAKYDSNLCGPEPINGFYVIVLVDGQMGLLLGDSSEETLRKKGFSGDIGFDFSL 178

Query: 676 LSRREHCSGNS-LYTTKAQFCDSGTWHDVLIRCSVEESEGLFKS---PVLSVCIDKKTVI 509
           +SR+EH +GN+  Y+TK +F ++G  H+++IRC+ +E+EGL +S   PVLSVCIDKKTVI
Sbjct: 179 VSRQEHFTGNNTFYSTKVRFVETGDSHEIVIRCN-KETEGLKQSNHYPVLSVCIDKKTVI 237

Query: 508 RVKRLQWNFRGNQTIFVDGLLVDLLWDVHDWFFN--PASGDAVFMFRTRSGLD-SRLWLE 338
           +VKRLQWNFRGNQTIF+DGLLVDL+WDVHDWFF+   A G AVFMFRTR+GLD SRLWLE
Sbjct: 238 KVKRLQWNFRGNQTIFLDGLLVDLMWDVHDWFFSNQGACGRAVFMFRTRNGLDSSRLWLE 297

Query: 337 EK 332
           EK
Sbjct: 298 EK 299

 Score = 29.6 bits (65), Expect(2) = 8e-59
 Identities = 12/22 (54%), Positives = 19/22 (85%), Gaps = 1/22 (4%)
 Frame = -2

Query: 333 RVAQKDK-DRVEFSLLIYACKT 271
           ++ +KD+ D+++FSL IYACKT
Sbjct: 299 KIVKKDQQDKLDFSLFIYACKT 320

>gb|AAM76760.1| hypothetical protein [Arabidopsis thaliana]
          Length = 320

 Score =  213 bits (542), Expect(2) = 3e-55
 Identities = 113/182 (62%), Positives = 142/182 (77%), Gaps = 10/182 (5%)
 Frame = -1

Query: 847 IFWDLSNANYDS---GPEPVDGFYVLILVDSEIGLVLGDMAGETVSKKFKINSPVAKVSL 677
           +FWDLS+A YDS   GP  ++GFYV++LVD  +GL+LGD + ET+ KK          SL
Sbjct: 119 VFWDLSSAKYDSNLCGPGTINGFYVIVLVDGSMGLLLGDSSEETLRKKGFSGDIGFDFSL 178

Query: 676 LSRREHCSGNS-LYTTKAQFCDSGTWHDVLIRCSVEESEGLFKS---PVLSVCIDKKTVI 509
            SR+EH +GN+  Y+TK +F ++G  H+++IRC+ +E+EGL +S   PVLSVCIDKKTVI
Sbjct: 179 XSRQEHFTGNNTFYSTKVRFVETGDSHEIVIRCN-KETEGLKQSNHYPVLSVCIDKKTVI 237

Query: 508 RVKRLQWNFRGNQTIFVDGLLVDLLWDVHDWFFN--PASGDAVFMFRTRSGLD-SRLWLE 338
           +VKRLQWNFRGNQTIF+DGLLVDL+WDVHDWFF+   A G AVFMFRTR+GLD SRLWLE
Sbjct: 238 KVKRLQWNFRGNQTIFLDGLLVDLMWDVHDWFFSNQGACGRAVFMFRTRNGLDSSRLWLE 297

Query: 337 EK 332
           EK
Sbjct: 298 EK 299

 Score = 25.0 bits (53), Expect(2) = 3e-55
 Identities = 11/22 (50%), Positives = 18/22 (81%), Gaps = 1/22 (4%)
 Frame = -2

Query: 333 RVAQKDK-DRVEFSLLIYACKT 271
           ++ +KD+ D+++FSL IYA KT
Sbjct: 299 KIVKKDQQDKLDFSLFIYARKT 320

>ref|NP_198167.1| putative protein; protein id: At5g28150.1, supported by cDNA:
           gi_19699076 [Arabidopsis thaliana]
           gi|19699077|gb|AAL90906.1| AT5g28150/T24G3_80
           [Arabidopsis thaliana] gi|23308357|gb|AAN18148.1|
           At5g28150/T24G3_80 [Arabidopsis thaliana]
          Length = 289

 Score =  147 bits (371), Expect = 2e-34
 Identities = 75/168 (44%), Positives = 106/168 (62%), Gaps = 1/168 (0%)
 Frame = -1

Query: 847 IFWDLSNANYDSGPEPVDGFYVLILVDSEIGLVLGDMAGETVSKKFKINSPVAKVSLLSR 668
           +FWDLS+A + SGPE + GFYV ++VD E+ L+LGDM  E   K     S +  V  +++
Sbjct: 100 VFWDLSSAKFGSGPEALGGFYVGVVVDKEMVLLLGDMKKEAFKKTNASPSSLGAV-FIAK 158

Query: 667 REHCSGNSLYTTKAQFCDSGTWHDVLIRCSVEESEGLFKSPVLSVCIDKKTVIRVKRLQW 488
           +EH  G  ++ TKAQ    G +HD+LI C    ++     P L V +D KT+++VKRL+W
Sbjct: 159 KEHVFGKRVFATKAQLFADGKFHDLLIECDTNVTD-----PCLVVRVDGKTLLQVKRLKW 213

Query: 487 NFRGNQTIFVDGLLVDLLWDVHDWFFN-PASGDAVFMFRTRSGLDSRL 347
            FRGN TI V+ + V++LWDVH W F  P +G+AVFMFRT    +  L
Sbjct: 214 KFRGNDTIVVNKMTVEVLWDVHSWLFGLPTTGNAVFMFRTCQSTEKSL 261

>ref|NP_565847.1| expressed protein; protein id: At2g36470.1, supported by cDNA:
           109103. [Arabidopsis thaliana] gi|25408477|pir||B84781
           hypothetical protein At2g36470 [imported] - Arabidopsis
           thaliana gi|4581147|gb|AAD24631.1| expressed protein
           [Arabidopsis thaliana]
          Length = 327

 Score =  144 bits (363), Expect = 2e-33
 Identities = 83/205 (40%), Positives = 115/205 (55%), Gaps = 34/205 (16%)
 Frame = -1

Query: 847 IFWDLSNANYDS-GPEPVDGFYVLILVDSEIGLVLGDMAGETVSKKFKINSPVAKVSLLS 671
           I WDLS A Y++ GPEP+  F+V+++V+SEI L +GD+  E        ++  +    +S
Sbjct: 101 ILWDLSEAEYENNGPEPIRRFFVVVVVNSEITLGVGDVDHER-------DTSSSSSWRVS 153

Query: 670 RREHCSGNSLYTTKAQFCDSGTWHDVLIRCSVEESEG-------LFKSP-VLSVCIDKKT 515
           + E  SG    TTKAQF D G  H++ I+C      G         KSP  +SV +DK+ 
Sbjct: 154 KTERFSGTCWLTTKAQFSDVGRKHEIQIQCGGGGGGGGEEGYLWKVKSPETMSVYVDKRK 213

Query: 514 VIRVKRLQWNFRGNQTIFVDGLLVDLLWDVHDWFFNPASGD------------------- 392
           V  VK+L+WNFRGNQT+F DG+L+D++WD+HDWF+                         
Sbjct: 214 VFSVKKLKWNFRGNQTMFFDGMLIDMMWDLHDWFYKETLSSVSTSSSSKTASSSSSSSTS 273

Query: 391 ------AVFMFRTRSGLDSRLWLEE 335
                 AVFMFR RSGLDSRLW++E
Sbjct: 274 SSTPPCAVFMFRRRSGLDSRLWIDE 298

>ref|NP_566240.1| expressed protein; protein id: At3g04860.1, supported by cDNA:
           5170., supported by cDNA: gi_16612260 [Arabidopsis
           thaliana] gi|12322843|gb|AAG51405.1|AC009465_5 unknown
           protein; 64727-65596 [Arabidopsis thaliana]
           gi|16612261|gb|AAL27500.1|AF439828_1 AT3g04860/T9J14_19
           [Arabidopsis thaliana] gi|21928087|gb|AAM78072.1|
           AT3g04860/T9J14_19 [Arabidopsis thaliana]
          Length = 289

 Score =  144 bits (362), Expect = 2e-33
 Identities = 74/161 (45%), Positives = 101/161 (61%), Gaps = 2/161 (1%)
 Frame = -1

Query: 847 IFWDLSNANYDSGPEPVDGFYVLILVDSEIGLVLGDMAGETVSKKFKINSPVAKVSLLSR 668
           +FWDLS+A + S PEP+ GFYV ++VD E+ L+LGDM  E   K     S       +++
Sbjct: 100 VFWDLSSAKFGSSPEPLGGFYVGVVVDKEMVLLLGDMKKEAFKKTNAAPSSSLGAVFIAK 159

Query: 667 REHCSGNSLYTTKAQFCDSGTWHDVLIRCSVEESEGLFKSPVLSVCIDKKTVIRVKRLQW 488
           +EH  G   + TKAQF   G  HD++I C    S+     P L V +D K +++V+RL W
Sbjct: 160 KEHVFGKRTFATKAQFSGDGKTHDLVIECDTSLSD-----PCLIVRVDGKILMQVQRLHW 214

Query: 487 NFRGNQTIFVDGLLVDLLWDVHDWFFN-PAS-GDAVFMFRT 371
            FRGN TI V+ + V++LWDVH WFF  P+S G+AVFMFRT
Sbjct: 215 KFRGNDTIIVNRISVEVLWDVHSWFFGLPSSPGNAVFMFRT 255

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 735,208,925
Number of Sequences: 1393205
Number of extensions: 16272144
Number of successful extensions: 51376
Number of sequences better than 10.0: 43
Number of HSP's better than 10.0 without gapping: 46888
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 51100
length of database: 448,689,247
effective HSP length: 122
effective length of database: 278,718,237
effective search space used: 44316199683
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 SPD080g12_f BP050417 1 474
2 MPD093a06_f AV776079 13 469
3 MPD059c01_f AV773937 46 596
4 MFB081d05_f BP039927 46 486
5 MPD080e02_f AV775258 49 468
6 SPD092c03_f BP051329 112 450
7 MF071g04_f BP032082 404 870




Lotus japonicus
Kazusa DNA Research Institute