KMC005319A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC005319A_C01 KMC005319A_c01
AGAGAAACTTTATTAATTTATCAAACTGTAACGAAATAACATGCATCTTAGCATTTAATC
CCCTCTAAATAACCTTTTATTATTTCACCAACTCATAGTCATTCCTTTTATCTTTCCTAT
GAAACATATGATATGAACCTTAAAAACAAATACAAAGAAATAGTTACTTGAAGAACATGA
TTGGAGTTGAGAGAATGTGCAATGGCGTGGATGTGGTGATGATCACATAATTGAGCACCC
TGTTGGATTTTCTTCACTGAAATACTGATCACACCGATTGTTCACATAATAGGGTCTGCT
TCCACCATAACTATCATAAACGGGCCTCGCATAATACCCGTCATAACATGGCATCGGCCC
ACCATACTCATAGAAGCCCGGCCCGGCAGGCCTGCCCTCGTAACAAGGAGTTGGAGCATA
CATCATCCCAACTGGCACTGCCATTGGCGGTGGCCCCACTGGGTGGACCGGTATCGGACC
GGGTGCGGGTTTGTCCTTGGGCTTCTCATCTCCCTTCTTTGGCCCATCTGACTTGGGCTT
TTCCTTGTCCCCATCTTTCTTTTTCTCAGGCTCCGCCGCCTTGGGCTTCTCGGAATCCTT
CTTCTTCTCAGGCTCAGCAGCAGCCTTGGGCTTTTCCCCGTCTTTCTTCTTCTCATGCTC
AGCCGCCTTCGGCTTGTCGGCCTCCTTCTTCTTCTCAGGCTCCAGGCGGTCTGGGCTTCG
GCAGTTCCACTATCTCAATGCTCTTGATGGTACCCCCACCCTCGCCACATAGCTTGTCCC
TAATCTTCTCAGGGCTACAGCATACCACGGTAATGAACACAATCTTGTCCTTCTCGTCGT
ACTTCTGGTCTCGAATTTGAGGGAACTTGCAGAGAATTTTCTTGACCTTTTTGTAGCATT
TCTCACATTCAAGATCAACCTTGAGTTTCATGAGGgtta


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC005319A_C01 KMC005319A_c01
         (939 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

pir||T06396 isoprenylated protein - soybean (fragment) gi|532705...   219  5e-56
gb|AAL38281.1| unknown protein [Arabidopsis thaliana] gi|2014872...   129  6e-55
pir||T06394 isoprenylated protein - soybean (fragment) gi|532703...   193  3e-48
gb|AAN03471.1| unknown protein [Glycine max]                          132  3e-43
emb|CAA07370.1| proline-rich protein [Capsicum annuum]                124  3e-27

>pir||T06396 isoprenylated protein - soybean (fragment) gi|532705|gb|AAA65013.1|
           unknown
          Length = 154

 Score =  219 bits (558), Expect = 5e-56
 Identities = 118/159 (74%), Positives = 120/159 (75%), Gaps = 9/159 (5%)
 Frame = -1

Query: 675 KPKAAEHEKKKDGEKPKAAAEPEKKKDS--EKPKAAEPEKKKDGDKEKPKSDGPKK---- 514
           KPKA E EKKKDG   K  AEPEKKKD   EKPKA EPEKKKDG  EKPK D PKK    
Sbjct: 1   KPKA-EPEKKKDGGGEKPKAEPEKKKDGGGEKPKA-EPEKKKDGG-EKPKGDAPKKEAEK 57

Query: 513 ---GDEKPKDKPAPGPIPVHPVGPPPMAVPVGMMYAPTPCYEGRPAGPGFYEYGGPMPCY 343
              G EKPKDKPAP P+PV P    PMAVPVGM+YAP PCYEGRP GPG YEYGGPM CY
Sbjct: 58  PKPGPEKPKDKPAPAPLPVQPHMAAPMAVPVGMLYAPPPCYEGRPVGPG-YEYGGPMFCY 116

Query: 342 DGYYARPVYDSYGGSRPYYVNNRCDQYFSEENPTGCSIM 226
           DGYYARPVYDSYGG RP YV NR DQYFSEENP GC IM
Sbjct: 117 DGYYARPVYDSYGGGRPCYV-NRGDQYFSEENPQGCIIM 154

>gb|AAL38281.1| unknown protein [Arabidopsis thaliana] gi|20148729|gb|AAM10255.1|
           unknown protein [Arabidopsis thaliana]
          Length = 254

 Score =  129 bits (325), Expect(2) = 6e-55
 Identities = 59/78 (75%), Positives = 67/78 (85%)
 Frame = -3

Query: 937 TLMKLKVDLECEKCYKKVKKILCKFPQIRDQKYDEKDKIVFITVVCCSPEKIRDKLCGEG 758
           T+MKLKVDL+C KCYKKVKK+LCKFPQIRDQ +DEK  IV I VVCCSPE+I DKLC +G
Sbjct: 10  TMMKLKVDLDCAKCYKKVKKVLCKFPQIRDQLFDEKSNIVIIKVVCCSPERIMDKLCSKG 69

Query: 757 GGTIKSIEIVELPKPRPP 704
           GG+IK+IEIVE PKP  P
Sbjct: 70  GGSIKTIEIVEPPKPPQP 87

 Score =  107 bits (268), Expect(2) = 6e-55
 Identities = 76/179 (42%), Positives = 93/179 (51%), Gaps = 16/179 (8%)
 Frame = -1

Query: 714 PDRLEPEKKKEADKPKAAEHEKKKDGEKPKAAAEPEKKKDSEKPKAAEPEKKKDGDKEKP 535
           P   +P +K +  +PKA E  K K+ EKPK   +PEK K+ EKPK  +PEK K+ +K K 
Sbjct: 87  PQPQQPPQKPKDAQPKAPE--KPKEPEKPK---QPEKPKEPEKPK--QPEKPKEPEKTKQ 139

Query: 534 KSDGP-----KKGDEKPKDKPAPGPIPVHPVGPPPMAVPVGMMYAPTPC----YEGRPAG 382
            +  P           P   PAP P P  P GPPP A+P+     P  C    Y+G   G
Sbjct: 140 PAPAPAPAPAPAAKPAPAPAPAPAPAPKQP-GPPPQAIPMMPQGQPAMCCGPYYDGY-GG 197

Query: 381 PGFYEYGGPMPCYDGYYARPVYDSYGGS----RPYYVN---NRCDQYFSEENPTGCSIM 226
           P F  YG P   Y+  Y RPVY+S+GG      P Y      RCD YFSEENP  CSIM
Sbjct: 198 PAFNGYGMPPQPYE-CYGRPVYESWGGGCPPPPPAYRQCHVTRCD-YFSEENPQSCSIM 254

>pir||T06394 isoprenylated protein - soybean (fragment) gi|532703|gb|AAA65012.1|
           unknown
          Length = 130

 Score =  193 bits (491), Expect = 3e-48
 Identities = 99/139 (71%), Positives = 104/139 (74%), Gaps = 2/139 (1%)
 Frame = -1

Query: 636 EKPKAAAEPEKKKDS--EKPKAAEPEKKKDGDKEKPKSDGPKKGDEKPKDKPAPGPIPVH 463
           EKPK   EPEKKKD   EKPK  EPEKKKDG ++      PK G EKPKDKP P P+PV 
Sbjct: 3   EKPKT--EPEKKKDGGGEKPKE-EPEKKKDGGEK------PKPGPEKPKDKPTPAPLPVQ 53

Query: 462 PVGPPPMAVPVGMMYAPTPCYEGRPAGPGFYEYGGPMPCYDGYYARPVYDSYGGSRPYYV 283
           P    PMAVPVGM+YAP PCY GRP GPG YEYGGPM CYDGYYARPVYDSY G RP Y 
Sbjct: 54  PHIAAPMAVPVGMLYAPPPCYGGRPVGPG-YEYGGPMLCYDGYYARPVYDSYSGGRPCY- 111

Query: 282 NNRCDQYFSEENPTGCSIM 226
            NRCDQYFSEENP GC+IM
Sbjct: 112 GNRCDQYFSEENPQGCTIM 130

>gb|AAN03471.1| unknown protein [Glycine max]
          Length = 240

 Score =  132 bits (333), Expect(2) = 3e-43
 Identities = 60/78 (76%), Positives = 70/78 (88%)
 Frame = -3

Query: 937 TLMKLKVDLECEKCYKKVKKILCKFPQIRDQKYDEKDKIVFITVVCCSPEKIRDKLCGEG 758
           T+M+LKVDL+C KCYKKVKKILCKFPQIRDQ YDEK+ IV I VVCC+PE++RDK+C +G
Sbjct: 6   TIMRLKVDLQCHKCYKKVKKILCKFPQIRDQVYDEKNNIVTIAVVCCNPEELRDKICCKG 65

Query: 757 GGTIKSIEIVELPKPRPP 704
            GTIKSIEIVE PKP+PP
Sbjct: 66  CGTIKSIEIVEPPKPKPP 83

 Score = 65.5 bits (158), Expect(2) = 3e-43
 Identities = 41/74 (55%), Positives = 46/74 (61%)
 Frame = -1

Query: 702 EPEKKKEADKPKAAEHEKKKDGEKPKAAAEPEKKKDSEKPKAAEPEKKKDGDKEKPKSDG 523
           EPEK KE +KPK  E EK K+ EKPK   EP K K+ EKPK  EPEK K  + EKPK   
Sbjct: 91  EPEKSKEPEKPK--EPEKPKEPEKPK---EPPKPKEPEKPK--EPEKPK--EPEKPKEPE 141

Query: 522 PKKGDEKPKDKPAP 481
             K  EKPKD  +P
Sbjct: 142 KPKEPEKPKDPRSP 155

 Score = 54.7 bits (130), Expect = 2e-06
 Identities = 35/68 (51%), Positives = 40/68 (58%)
 Frame = -1

Query: 702 EPEKKKEADKPKAAEHEKKKDGEKPKAAAEPEKKKDSEKPKAAEPEKKKDGDKEKPKSDG 523
           EPEK KE +KPK  E  K K+ EKPK   EPEK K+ EKPK  EPEK K  + EKPK   
Sbjct: 103 EPEKPKEPEKPK--EPPKPKEPEKPK---EPEKPKEPEKPK--EPEKPK--EPEKPKDPR 153

Query: 522 PKKGDEKP 499
             +    P
Sbjct: 154 SPRSPRSP 161

 Score = 48.5 bits (114), Expect = 1e-04
 Identities = 38/92 (41%), Positives = 45/92 (48%), Gaps = 2/92 (2%)
 Frame = -1

Query: 720 RSPDRLEPEKKKEADKPKAAEHEKKKDGEKPKAAAEPEKKKDSEKPKAAEPEKKKDGDKE 541
           +S + +EP K K   +P           EKPK   EPEK K+ EKPK  EPEK K  + E
Sbjct: 70  KSIEIVEPPKPKPPPQP-----------EKPK---EPEKSKEPEKPK--EPEKPK--EPE 111

Query: 540 KPKSDGPKKGDEKPK--DKPAPGPIPVHPVGP 451
           KPK     K  EKPK  +KP     P  P  P
Sbjct: 112 KPKEPPKPKEPEKPKEPEKPKEPEKPKEPEKP 143

 Score = 35.0 bits (79), Expect = 1.6
 Identities = 36/115 (31%), Positives = 51/115 (44%), Gaps = 16/115 (13%)
 Frame = -2

Query: 593 PRSPRRRSLRKRKMGTRKSPSQMGQRREMRSPRTNPHP----------VRYRSTQWGHRQ 444
           P+ P +    ++     K P +  + ++ RSPR+   P          VR+R     H Q
Sbjct: 125 PKEPEKPKEPEKPKEPEK-PKEPEKPKDPRSPRSPRSPSLNALLRCRFVRHRCQCVRHSQ 183

Query: 443 -----WQCQLG*CMLQLLVTRAGLPGRASM-SMVGRCHVMTGIMRGPFMIVMVEA 297
                W  +LG  +   +  + G  G A M + VG  HVMT IM G FMIV V A
Sbjct: 184 HLRHRWWFRLGCAVFHAM--KVGPVGHALMGTAVGPHHVMTLIMEGLFMIVTVGA 236

>emb|CAA07370.1| proline-rich protein [Capsicum annuum]
          Length = 238

 Score =  124 bits (310), Expect = 3e-27
 Identities = 57/76 (75%), Positives = 64/76 (84%)
 Frame = -3

Query: 931 MKLKVDLECEKCYKKVKKILCKFPQIRDQKYDEKDKIVFITVVCCSPEKIRDKLCGEGGG 752
           M LKVDL+C  CYKKVKKILCKFPQIRDQ YDEK   V ITV+CC+PEK+RDKLC +G G
Sbjct: 1   MVLKVDLQCCSCYKKVKKILCKFPQIRDQIYDEKGNKVTITVICCNPEKLRDKLCSKGCG 60

Query: 751 TIKSIEIVELPKPRPP 704
            IKSIEI+E PKP+PP
Sbjct: 61  VIKSIEIIEPPKPKPP 76

 Score =  112 bits (280), Expect = 8e-24
 Identities = 84/190 (44%), Positives = 98/190 (51%), Gaps = 25/190 (13%)
 Frame = -1

Query: 720 RSPDRLEPEKKKEADKPKAAEHEKKKDGEKPKAAAEPEKKKDSEKPKAAEPEKKKDGDK- 544
           +S + +EP K K  +KPK  E EK K  EKPK   EPEK K  EKPK  EPEK K  +K 
Sbjct: 63  KSIEIIEPPKPKPPEKPK--EPEKPKQPEKPK---EPEKPKQPEKPK--EPEKPKQPEKP 115

Query: 543 ---EKPKSDGPKKGDEKPKD----KPAPGPIPVHPVGPPPMAVPVGMMYAPTP------- 406
              EKPK+    K  EKPK+    K AP P PV P  P P +VPV +   P P       
Sbjct: 116 KEPEKPKAPEKPKEPEKPKEPEKPKEAPKPPPVAP-PPAPESVPVPVQEYPPPPSGYCCG 174

Query: 405 -CYEGRPAGPGFYEYGGPMPCYDGYYARPVYDSYG---GSRPYYVN-----NRCDQYF-S 256
            CY G   GP +  YG P+P        P Y +YG   G  PY  N     +RCD Y  S
Sbjct: 175 QCYAGHTGGPCYQWYGRPVP------PPPCYGNYGYSYGPGPYCYNRGCYVSRCDPYLCS 228

Query: 255 EENPTGCSIM 226
           +EN TGCSIM
Sbjct: 229 DENATGCSIM 238

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 910,243,152
Number of Sequences: 1393205
Number of extensions: 26324079
Number of successful extensions: 245358
Number of sequences better than 10.0: 5235
Number of HSP's better than 10.0 without gapping: 120909
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 201269
length of database: 448,689,247
effective HSP length: 123
effective length of database: 277,325,032
effective search space used: 52414431048
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 SPD091h09_f BP051308 1 568
2 MFB059b12_f BP038256 1 408
3 MFB039c02_f BP036844 1 486
4 SPD078g03_f BP050262 2 554
5 MPDL067h02_f AV779941 62 562
6 MWM245d10_f AV768482 416 940
7 MPD057e06_f AV773821 466 965




Lotus japonicus
Kazusa DNA Research Institute