KMC003527A_c01

Fasta Sequence
>KMC003527A_C01 KMC003527A_c01
atccattttatctaaaattgatatacttataaAATTAATTAATAATGACAAAATTACAAA
AACAAGCATAAAGAATGTAAAATTAACAGCAATGTATTATCAATGTATACATGAAAATCT
ACATAATATCTATGCTAATAATACAAAAAATTTATAAAGAATGTATTAATTAATTAATCT
AAACAGTTTCAATTTTCATCACCAAAAATCCAATCCATAACTACAATCCTTATGAACAAT
CTTCGACTTCGCCGTTAACTGATCCACCGCCACGTCACAATCCACCTTAACGGTAATCTT
CCACGTCTTAACGGACCCCACTTTAATCTTCACCGGCGCTCTCAGCTTCATCGTCAACGG
CACGCTCCGTTTCGCCACCGCGTTCACCAACGCCGTCCGATCCGACCCGGCGAGTTCGAC
GCCGTCGCCCTTCAACACCGCCTTAAACACCGTCACATTGTTCGACGCCTGGTAAAACGC
CGGCAACGCGCCGCCGCAGAGCCGCGAGTCTCTATAGAACATTTCCACCGAGCTTCCTTT
CAAATAGTAGATCCCGATCTTGTCGTTGCCGTTGTTGGCTCTCACCGCGACGTCGAACTC
CGGCGAGATTGCCGCCGCGGATGACGGCGATGGCGAGGTGAGGTTCATTCCTTTGACGGA
GACGGTTTGGATTGAGTAGTTAGGCGCTTTGGGACGGAAAACGAGGTAGAAGACGCCGGC
GGCGATGCCGAGGAGGATGAGGAAGATGAAGAGGAGGCCGATGAGCCAGCAGAGGCAGCA
GCAGAAGCCGCATCGGCGAGTTTTGCGGCGGTTGTAGTTGGCGTAGCGGCGCGCATTCTC
CGGCGGAGGGACGCGGTAGACTTGGTCCTTGGGGATCTGGATGACGTAGGTTCCGGTAGG
CGAGGCTGGTTTCTCCGAGGGAGGTGGAGCGGGTTGTGACTCGGCGGAAGCAGGGGGTga
ggtgtgggggtgaactcggtcagccattgtggtgagtcggtgactcagtgagtgaggagc
gagttgcggcgagttaga


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC003527A_C01 KMC003527A_c01
         (1038 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_565634.1| expressed protein; protein id: At2g27080.1, sup...   275  8e-73
ref|NP_197612.1| putative protein; protein id: At5g21130.1 [Arab...   214  2e-54
dbj|BAB85244.1| P0425G02.19 [Oryza sativa (japonica cultivar-gro...   197  2e-49
ref|NP_175856.1| hypothetical protein; protein id: At1g54540.1 [...   132  1e-29
ref|NP_198513.1| putative protein; protein id: At5g36970.1 [Arab...   131  2e-29

>ref|NP_565634.1| expressed protein; protein id: At2g27080.1, supported by cDNA:
           gi_13877664, supported by cDNA: gi_15450380, supported
           by cDNA: gi_16974488 [Arabidopsis thaliana]
           gi|25407856|pir||E84668 hypothetical protein At2g27080
           [imported] - Arabidopsis thaliana
           gi|3885338|gb|AAC77866.1| expressed protein [Arabidopsis
           thaliana] gi|13877665|gb|AAK43910.1|AF370591_1 Unknown
           protein [Arabidopsis thaliana]
           gi|15450381|gb|AAK96484.1| At2g27080/T20P8.13
           [Arabidopsis thaliana] gi|16974489|gb|AAL31248.1|
           At2g27080/T20P8.13 [Arabidopsis thaliana]
          Length = 260

 Score =  275 bits (703), Expect = 8e-73
 Identities = 134/265 (50%), Positives = 177/265 (66%), Gaps = 3/265 (1%)
 Frame = -1

Query: 987 MADRVHPHTSPPASAESQPAPPPSE---KPASPTGTYVIQIPKDQVYRVPPPENARRYAN 817
           MA+RV+P  SPP S +        E   KPA P  TYVIQ+PKDQ+YR+PPPENA R+  
Sbjct: 1   MAERVYPADSPPQSGQFSGNFSSGEFPKKPAPPPSTYVIQVPKDQIYRIPPPENAHRFEQ 60

Query: 816 YNRRKTRRCGFCCCLCWLIGLLFIFLILLGIAAGVFYLVFRPKAPNYSIQTVSVKGMNLT 637
            +R+KT R    CC C  +  +FI ++L GI+  V YL++RP+AP YSI+  SV G+NL 
Sbjct: 61  LSRKKTNRSNCRCCFCSFLAAVFILIVLAGISFAVLYLIYRPEAPKYSIEGFSVSGINLN 120

Query: 636 SPSPSSAAAISPEFDVAVRANNGNDKIGIYYLKGSSVEMFYRDSRLCGGALPAFYQASNN 457
           S SP     ISP F+V VR+ NGN KIG+YY K SSV+++Y D  +  G +P FYQ + N
Sbjct: 121 STSP-----ISPSFNVTVRSRNGNGKIGVYYEKESSVDVYYNDVDISNGVMPVFYQPAKN 175

Query: 456 VTVFKAVLKGDGVELAGSDRTALVNAVAKRSVPLTMKLRAPVKIKVGSVKTWKITVKVDC 277
           VTV K VL G  ++L    R  + N V+K++VP  +K++APVKIK GSVKTW + V VDC
Sbjct: 176 VTVVKLVLSGSKIQLTSGMRKEMRNEVSKKTVPFKLKIKAPVKIKFGSVKTWTMIVNVDC 235

Query: 276 DVAVDQLTAKSKIVHKDCSYGLDFW 202
           DV VD+LTA S+IV + CS+ +D W
Sbjct: 236 DVTVDKLTAPSRIVSRKCSHDVDLW 260

>ref|NP_197612.1| putative protein; protein id: At5g21130.1 [Arabidopsis thaliana]
           gi|29294053|gb|AAO73890.1| hypothetical protein
           [Arabidopsis thaliana]
          Length = 281

 Score =  214 bits (544), Expect = 2e-54
 Identities = 111/248 (44%), Positives = 153/248 (60%), Gaps = 2/248 (0%)
 Frame = -1

Query: 948 SAESQPAPPPSEKPASPTGTYVIQIPKDQVYRVPPPENARRYANYNRRKTRRCGFCC--C 775
           S +S+  PPP        GTYVI++PKDQ+YRVPPPENA RY   +RRKT +   CC  C
Sbjct: 46  SQKSRIGPPP--------GTYVIKLPKDQIYRVPPPENAHRYEYLSRRKTNKS--CCRRC 95

Query: 774 LCWLIGLLFIFLILLGIAAGVFYLVFRPKAPNYSIQTVSVKGMNLTSPSPSSAAAISPEF 595
           LC+ +  L I ++L  IA G FYLV++P  P +S+  VSV G+NLTS SP      SP  
Sbjct: 96  LCYSLSALLIIIVLAAIAFGFFYLVYQPHKPQFSVSGVSVTGINLTSSSP-----FSPVI 150

Query: 594 DVAVRANNGNDKIGIYYLKGSSVEMFYRDSRLCGGALPAFYQASNNVTVFKAVLKGDGVE 415
            + +R+ N   K+G+ Y KG+  ++F+  ++L  G   AF Q + NVTV   VLKG  V+
Sbjct: 151 RIKLRSQNVKGKLGLIYEKGNEADVFFNGTKLGNGEFTAFKQPAGNVTVIVTVLKGSSVK 210

Query: 414 LAGSDRTALVNAVAKRSVPLTMKLRAPVKIKVGSVKTWKITVKVDCDVAVDQLTAKSKIV 235
           L  S R  L  +  K  VP  ++++APVK KVGSV TW +T+ VDC + VD+LTA + + 
Sbjct: 211 LKSSSRKELTESQKKGKVPFGLRIKAPVKFKVGSVTTWTMTITVDCKITVDKLTASATVK 270

Query: 234 HKDCSYGL 211
            ++C  GL
Sbjct: 271 TENCETGL 278

>dbj|BAB85244.1| P0425G02.19 [Oryza sativa (japonica cultivar-group)]
           gi|20161461|dbj|BAB90385.1| P0432B10.3 [Oryza sativa
           (japonica cultivar-group)]
          Length = 312

 Score =  197 bits (501), Expect = 2e-49
 Identities = 112/271 (41%), Positives = 158/271 (57%), Gaps = 22/271 (8%)
 Frame = -1

Query: 969 PHTSPPASAESQP-------APPPSEK-------PASPTGTYVIQIPKDQVYRVPPPENA 832
           P +SPP S   +        APPP++        PA   GTYV+Q+PKD+V+RVPPPENA
Sbjct: 31  PSSSPPYSFHFEKPLPPTAAAPPPADARRYQQLLPAPQPGTYVVQMPKDKVFRVPPPENA 90

Query: 831 RRYANYNRRKTRRCGFCCC-LC-WLIGLLFIFLILLGIAAGVFYLVFRPKAPNYSIQTVS 658
           R + +Y RR  RR    C  +C WL+  L +    L  +A V YLVF+P+ P+Y++ +++
Sbjct: 91  RLFQHYTRRARRRARCSCARVCSWLLLALVLLAAALAASAAVVYLVFKPRQPDYTLLSLA 150

Query: 657 VKGM-----NLTSPSPSSAAAISPEFDVAVRANNGNDKIGIYYLKGSS-VEMFYRDSRLC 496
           V G+     N +S +  +  + SPEFD  VRA+N N KIG++Y  G S V + Y   RL 
Sbjct: 151 VSGLGGILGNASSTAAPAPVSFSPEFDATVRADNPNGKIGVHYEGGGSHVAVSYGGVRLA 210

Query: 495 GGALPAFYQASNNVTVFKAVLKGDGVELAGSDRTALVNAVAKRSVPLTMKLRAPVKIKVG 316
            GA PAFYQ   NVTV  A  KG G+  +      +  A   RSVP  + ++ PV+++VG
Sbjct: 211 DGAWPAFYQGPRNVTVLVATAKGLGIRFSERLLGDIAAAGRLRSVPFDVDVKVPVRLQVG 270

Query: 315 SVKTWKITVKVDCDVAVDQLTAKSKIVHKDC 223
            V+TW + V+V C V VD+L A +K+V K C
Sbjct: 271 GVRTWAVPVRVRCAVVVDRLAADAKVVSKSC 301

>ref|NP_175856.1| hypothetical protein; protein id: At1g54540.1 [Arabidopsis
           thaliana] gi|25405746|pir||D96587 hypothetical protein
           F20D21.35 [imported] - Arabidopsis thaliana
           gi|4585996|gb|AAD25632.1|AC005287_34 Hypothetical
           protein [Arabidopsis thaliana]
          Length = 239

 Score =  132 bits (331), Expect = 1e-29
 Identities = 85/257 (33%), Positives = 123/257 (47%), Gaps = 3/257 (1%)
 Frame = -1

Query: 978 RVHPHTSPPASAESQPAPPPSEKPASPTGTYVIQIPKDQVYRVPPPENARRYANYNRRKT 799
           ++HP     A+      P P +    P     +Q P      +PPP    +  N      
Sbjct: 6   KIHPVLQMEANKTKTTTPAPGKTVLLP-----VQRP------IPPPVIPSKNRN------ 48

Query: 798 RRCGFCCCL-CWLIGLLFIFLILLGIAAGVFYLVFRPKAPNYSIQTVSVKGMNLTSPSPS 622
                CC + CW++ LL I LI L IA  V Y VF PK P+Y + ++ V  + +      
Sbjct: 49  ----MCCKIFCWVLSLLVIALIALAIAVAVVYFVFHPKLPSYEVNSLRVTNLGINLD--- 101

Query: 621 SAAAISPEFDVAVRANNGNDKIGIYYLKGSSVEMFYRDSRLCGGALPAFYQASNNVTVFK 442
              ++S EF V + A N N+KIGIYY KG  + ++Y  ++LC G +P FYQ   NVT   
Sbjct: 102 --LSLSAEFKVEITARNPNEKIGIYYEKGGHIGVWYDKTKLCEGPIPRFYQGHRNVTKLN 159

Query: 441 AVLKGDGVELAGSDRTALVNAVAKRSVPLTMKLRAPVKIKVGSVKTWKITVKVDCDVAVD 262
             L G   +   +   AL        VPL +K+ APV IK+G++K  KI +   C + VD
Sbjct: 160 VALTG-RAQYGNTVLAALQQQQQTGRVPLDLKVNAPVAIKLGNLKMKKIRILGSCKLVVD 218

Query: 261 QLTAKSKIVHK--DCSY 217
            L+  + I  K  DCS+
Sbjct: 219 SLSTNNNINIKASDCSF 235

>ref|NP_198513.1| putative protein; protein id: At5g36970.1 [Arabidopsis thaliana]
          Length = 248

 Score =  131 bits (329), Expect = 2e-29
 Identities = 78/260 (30%), Positives = 124/260 (47%), Gaps = 6/260 (2%)
 Frame = -1

Query: 978 RVHPHTSPPASAESQPAPPPSEKPASPTGTYVIQ----IPKDQVYRVPPPENARRYANYN 811
           ++HP + P A       PP    P  P G+   +        Q   + PP          
Sbjct: 6   KIHPVSDPEA-------PPHPTAPLVPRGSSRSEHGDPTKTQQAAPLDPPRE-------- 50

Query: 810 RRKTRRCGFCCCLCWLIGLLFIFLILLGIAAGVFYLVFRPKAPNYSIQTVSVKGMNLTSP 631
            +K  R  +C C+C+ + +LF+ ++++G   G+ YLVFRPK P+Y+I  + +    L   
Sbjct: 51  -KKGSRSCWCRCVCYTLLVLFLLIVIVGAIVGILYLVFRPKFPDYNIDRLQLTRFQLNQD 109

Query: 630 SPSSAAAISPEFDVAVRANNGNDKIGIYYLKGSSVEMFYRDSRLCGGALPAFYQASNNVT 451
                 ++S  F+V + A N N+KIGIYY  GS + + Y  +R+  G+LP FYQ   N T
Sbjct: 110 -----LSLSTAFNVTITAKNPNEKIGIYYEDGSKISVLYMQTRISNGSLPKFYQGHENTT 164

Query: 450 VFKAVLKGDGVELAGSDRTALVNAVAKRSVPLTMKLRAPVKIKVGSVKTWKITVKVDCDV 271
           +    + G          T         S+PL +++  PV+IK+G +K  K+   V C V
Sbjct: 165 IILVEMTGFTQNATSLMTTLQEQQRLTGSIPLRIRVTQPVRIKLGKLKLMKVRFLVRCGV 224

Query: 270 AVDQLTAKS--KIVHKDCSY 217
           +VD L A S  ++   +C Y
Sbjct: 225 SVDSLAANSVIRVRSSNCKY 244

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 936,371,615
Number of Sequences: 1393205
Number of extensions: 25287484
Number of successful extensions: 347829
Number of sequences better than 10.0: 4293
Number of HSP's better than 10.0 without gapping: 141461
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 271987
length of database: 448,689,247
effective HSP length: 124
effective length of database: 275,931,827
effective search space used: 60980933767
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assembly


no. clone accession start end
1 MFB019c02_f BP035323 1 575
2 MR018a12_f BP077330 33 506
3 GNf048c05 BP070900 70 468
4 MF005h03_f BP028514 70 274
5 SPD002f09_f BP044182 74 537
6 GNf098b12 BP074613 74 477
7 SPD072h11_f BP049799 78 585
8 SPD018c07_f BP045405 79 620
9 MFB025c02_f BP035791 79 508
10 MWM054d04_f AV765552 79 190
11 SPD016g08_f BP045289 82 268
12 MFB002e01_f BP034042 97 243
13 MFB094h08_f BP040884 146 738
14 MFB054h10_f BP037958 150 684
15 MWM104h01_f AV766422 157 498
16 GNf054e07 BP071401 169 569
17 MF096a10_f BP033285 500 1048




Lotus japonicus
Kazusa DNA Research Institute