KMC004335A_c01

Fasta Sequence
>KMC004335A_C01 KMC004335A_c01
AAAATATGAGGAAGACATTCATGCTGGTGGTGACATTCATGCCGTTGTTATTCCACGAAC
CAATTCCACAAGTTAAAAAAAAAAAAAGGATTGTGATTTGAGAGCCCTTGGCCAGAATAA
ACAAAAAAATAAGACAGAAATTTCATCATGTACACATTTTCTTCCTATGTAGCACTCTTC
ATGTCATGACTAGTTTGATTTCATTCCTTTCTAGTAGTTCGATTCATCCTGGTGCGGTTA
GACTCGTGAGAGAAACCTGCCACACTCTTATCTCTCCATCAAGAGAACCACTGAAAATCG
AAACAACGCCGTCAGGGGAATTCTGATCACTCTCCGCAACCGCCGCCAAGGACTTCACCG
GTTTCCGGTGACCGTCGAGGACAGCGAGGCAGCAAAACGGTCCATCGAAACCCCGCTTCC
AAACCCTGACCGTCCGATCAGCGGACCCACTCAGGAGAAAATCAGAGACGTTGATCAAAC
ACAGTATCGCCTTCTCGTGGCCCCTCAGGGCCCCACTCACCACCATGTGGTTCGCACTGT
CCTCCCTCTCCCACACCAAAATCGAACGATCGCACGCGCCGGAAAACAGAACCGAACCG


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC004335A_C01 KMC004335A_c01
         (599 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_564219.1| G-protein beta family; protein id: At1g24530.1,...   169  3e-41
ref|NP_173823.1| G-protein beta family; protein id: At1g24130.1 ...   134  1e-30
ref|NP_199823.1| G-protein beta family; protein id: At5g50120.1 ...   111  6e-24
emb|CAD41096.1| OSJNBb0011N17.13 [Oryza sativa (japonica cultiva...   108  4e-23
ref|NP_175369.1| En/Spm-like transposon protein, putative; prote...    97  2e-19

>ref|NP_564219.1| G-protein beta family; protein id: At1g24530.1, supported by cDNA:
           gi_19310648 [Arabidopsis thaliana]
           gi|9743339|gb|AAF97963.1|AC000103_13 F21J9.19
           [Arabidopsis thaliana] gi|15028341|gb|AAK76647.1|
           unknown protein [Arabidopsis thaliana]
           gi|19310649|gb|AAL85055.1| unknown protein [Arabidopsis
           thaliana]
          Length = 418

 Score =  169 bits (427), Expect = 3e-41
 Identities = 84/118 (71%), Positives = 97/118 (82%)
 Frame = -2

Query: 598 GSVLFSGACDRSILVWEREDSANHMVVSGALRGHEKAILCLINVSDFLLSGSADRTVRVW 419
           GSVLFSG+CDRSILVWERED++N+M V GALRGH+KAIL L NVSD LLSGSADRTVR+W
Sbjct: 292 GSVLFSGSCDRSILVWEREDTSNYMAVRGALRGHDKAILSLFNVSDLLLSGSADRTVRIW 351

Query: 418 KRGFDGPFCCLAVLDGHRKPVKSLAAVAESDQNSPDGVVSIFSGSLDGEIRVWQVSLT 245
           +RG D  + CL VL GH KPVKSLAAV E +    D VVSI SGSLDGE++ W+VS+T
Sbjct: 352 RRGPDSSYSCLEVLSGHTKPVKSLAAVREKEL---DDVVSIISGSLDGEVKCWKVSVT 406

 Score = 48.1 bits (113), Expect = 8e-05
 Identities = 34/118 (28%), Positives = 62/118 (51%), Gaps = 2/118 (1%)
 Frame = -2

Query: 589 LFSGACDRSILVWEREDSANHMVVSGALRGHEKAILCL-INVSDFLLSGSADRTVRVW-K 416
           ++S + D+++ +W   D    +    +++ H+ A+  + ++ +  + +GSADR +RVW K
Sbjct: 208 IYSVSWDKTLKIWRASD----LRCKESIKAHDDAVNAIAVSTNGTVYTGSADRRIRVWAK 263

Query: 415 RGFDGPFCCLAVLDGHRKPVKSLAAVAESDQNSPDGVVSIFSGSLDGEIRVWQVSLTS 242
              +     +A L+ H+  V +LA        + DG V +FSGS D  I VW+   TS
Sbjct: 264 PTGEKRHTLVATLEKHKSAVNALAL-------NDDGSV-LFSGSCDRSILVWEREDTS 313

 Score = 33.5 bits (75), Expect = 2.1
 Identities = 22/80 (27%), Positives = 37/80 (45%)
 Frame = -2

Query: 499 HEKAILCLINVSDFLLSGSADRTVRVWKRGFDGPFCCLAVLDGHRKPVKSLAAVAESDQN 320
           H  A+  L     F+ S S D+T+++W+        C   +  H   V ++A        
Sbjct: 194 HADAVTALAVSDGFIYSVSWDKTLKIWRA---SDLRCKESIKAHDDAVNAIAV------- 243

Query: 319 SPDGVVSIFSGSLDGEIRVW 260
           S +G  ++++GS D  IRVW
Sbjct: 244 STNG--TVYTGSADRRIRVW 261

>ref|NP_173823.1| G-protein beta family; protein id: At1g24130.1 [Arabidopsis
           thaliana] gi|7486376|pir||T00642 hypothetical protein
           F3I6.5 - Arabidopsis thaliana gi|2829890|gb|AAC00598.1|
           Hypothetical protein [Arabidopsis thaliana]
          Length = 415

 Score =  134 bits (336), Expect = 1e-30
 Identities = 72/127 (56%), Positives = 93/127 (72%), Gaps = 7/127 (5%)
 Frame = -2

Query: 598 GSVLFSGACDRSILVWER----EDSANHMVVSGALRGHEKAILCLINVSDFLLSGSADRT 431
           G VL+SGACDRSILVWER    +D   HM V GALRGH KAI+CL   SD +LSGSAD++
Sbjct: 290 GKVLYSGACDRSILVWERLINGDDEELHMSVVGALRGHRKAIMCLAVASDLVLSGSADKS 349

Query: 430 VRVWKRGF--DGPFCCLAVLDGHRKPVKSLA-AVAESDQNSPDGVVSIFSGSLDGEIRVW 260
           +RVW+RG      + CLAVL+GH KPVKSLA +V++SD NS D    ++SGSLD  ++VW
Sbjct: 350 LRVWRRGLMEKEGYSCLAVLEGHTKPVKSLAVSVSDSDSNS-DYSCMVYSGSLDLSLKVW 408

Query: 259 QVSLTSL 239
            + ++S+
Sbjct: 409 NLRVSSI 415

 Score = 53.5 bits (127), Expect = 2e-06
 Identities = 37/115 (32%), Positives = 58/115 (50%), Gaps = 1/115 (0%)
 Frame = -2

Query: 598 GSVLFSGACDRSILVWEREDSANHMVVSGALRGHEKAILCLINVSD-FLLSGSADRTVRV 422
           GS+L+S + DRS  +W   D      +    + H+ AI  ++   D F+ +GSAD+ ++V
Sbjct: 204 GSLLYSASWDRSFKIWRTSD---FKCLDSIEKAHDDAINAIVVSKDGFVYTGSADKKIKV 260

Query: 421 WKRGFDGPFCCLAVLDGHRKPVKSLAAVAESDQNSPDGVVSIFSGSLDGEIRVWQ 257
           W +  D     +A L  H   V +LA        S DG V ++SG+ D  I VW+
Sbjct: 261 WNKK-DKKHSLVATLTKHLSAVNALAI-------SEDGKV-LYSGACDRSILVWE 306

>ref|NP_199823.1| G-protein beta family; protein id: At5g50120.1 [Arabidopsis
           thaliana] gi|10177223|dbj|BAB10298.1| contains
           similarity to GTP-binding regulatory protein and
           WD-repeat protein~gene_id:MPF21.14 [Arabidopsis
           thaliana]
          Length = 388

 Score =  111 bits (278), Expect = 6e-24
 Identities = 57/115 (49%), Positives = 76/115 (65%)
 Frame = -2

Query: 598 GSVLFSGACDRSILVWEREDSANHMVVSGALRGHEKAILCLINVSDFLLSGSADRTVRVW 419
           GS+L SG  D SILVWER+D  + +VV G LRGH +++LCL  VSD L SGSAD+TVR+W
Sbjct: 272 GSLLHSGGSDGSILVWERDDGGD-IVVVGMLRGHTESVLCLAVVSDILCSGSADKTVRLW 330

Query: 418 KRGFDGPFCCLAVLDGHRKPVKSLAAVAESDQNSPDGVVSIFSGSLDGEIRVWQV 254
           K      + CLA+L+GH  PVK L       + + +    I+SG LD +++VWQV
Sbjct: 331 KCSAK-DYSCLAMLEGHLGPVKCLTGAFRDSRKADEASYHIYSGGLDSQVKVWQV 384

 Score = 39.7 bits (91), Expect = 0.030
 Identities = 27/122 (22%), Positives = 56/122 (45%), Gaps = 8/122 (6%)
 Frame = -2

Query: 598 GSVLFSGACDRSILVWEREDSANHMVVSGALRGHEKAILCL-INVSDFLLSGSADRTVRV 422
           G++L+S + DR++ +W   D      +      H+ AI  + ++ +  + +GS+D+ ++V
Sbjct: 177 GTLLYSVSWDRTLKIWRTTD---FKCLESFTNAHDDAINAVALSENGDIYTGSSDQRIKV 233

Query: 421 WKRGFD-------GPFCCLAVLDGHRKPVKSLAAVAESDQNSPDGVVSIFSGSLDGEIRV 263
           W++  +            +A+L  H   + +LA    +          + SG  DG I V
Sbjct: 234 WRKNINEENVKKKRKHSLVAILSEHNSGINALALSGTNGS-------LLHSGGSDGSILV 286

Query: 262 WQ 257
           W+
Sbjct: 287 WE 288

 Score = 33.5 bits (75), Expect = 2.1
 Identities = 21/68 (30%), Positives = 35/68 (50%), Gaps = 7/68 (10%)
 Frame = -2

Query: 598 GSVLFSGACDRSILVWER-------EDSANHMVVSGALRGHEKAILCLINVSDFLLSGSA 440
           G  L++G+ D  + +W         E S+N  V++G  RG   A+  L+ ++D L +   
Sbjct: 48  GKRLYTGSNDGVVRLWNANTLETLAEASSNGDVITGE-RGGGGAVKSLVILADKLFTAHQ 106

Query: 439 DRTVRVWK 416
           D  +RVWK
Sbjct: 107 DHKIRVWK 114

>emb|CAD41096.1| OSJNBb0011N17.13 [Oryza sativa (japonica cultivar-group)]
          Length = 470

 Score =  108 bits (271), Expect = 4e-23
 Identities = 57/133 (42%), Positives = 83/133 (61%), Gaps = 14/133 (10%)
 Frame = -2

Query: 598 GSVLFSGACDRSILVWEREDSANHMVVSGALRGHEKAILCLINVSD------FLLSGSAD 437
           G VL+SG  DR ++VWEREDSA+HMV  GALRGH +A+L +   +        ++SG+AD
Sbjct: 336 GQVLYSGGNDRCVVVWEREDSASHMVAVGALRGHRRAVLSVACAAGDAADGALVVSGAAD 395

Query: 436 RTVRVWKRGFDG-PFCCLAVLDGHRKPVKSLAAV-------AESDQNSPDGVVSIFSGSL 281
           +TVR W+RG DG  + C+AV+DGH   V+S+AA          +D +  D    + S S 
Sbjct: 396 QTVRAWRRGADGRGYYCVAVIDGHGSAVRSVAAALVTAQKKRRADDDGGDEEWRVCSASF 455

Query: 280 DGEIRVWQVSLTS 242
           DGE+R+W + + +
Sbjct: 456 DGEVRLWSLRVAA 468

>ref|NP_175369.1| En/Spm-like transposon protein, putative; protein id: At1g49450.1,
           supported by cDNA: gi_17979258 [Arabidopsis thaliana]
           gi|25405310|pir||B96531 hypothetical protein F13F21.11
           [imported] - Arabidopsis thaliana
           gi|5430755|gb|AAD43155.1|AC007504_10 Hypothetical
           Protein [Arabidopsis thaliana]
          Length = 471

 Score = 97.1 bits (240), Expect = 2e-19
 Identities = 49/124 (39%), Positives = 71/124 (56%), Gaps = 9/124 (7%)
 Frame = -2

Query: 595 SVLFSGACDRSILVWEREDSANHMVVSGALRGHEKAILCLINVSDFLLSGSADRTVRVWK 416
           +V++ G+ D ++  WER+    H    G + GH  A+LCL      LLSG AD+ + VWK
Sbjct: 347 AVVYCGSSDGTVNFWERQKYLTH---KGTIHGHRMAVLCLATAGSLLLSGGADKNICVWK 403

Query: 415 RGFDGPFCCLAVLDGHRKPVKSLAAVAESDQNSPDGVVS---------IFSGSLDGEIRV 263
           R  DG   CL+VL  H  PVK LAAV E++++  DG            ++SGSLD  ++V
Sbjct: 404 RNGDGSHTCLSVLMDHEGPVKCLAAVEEAEEDHNDGDDGGEKGDQRWIVYSGSLDNSVKV 463

Query: 262 WQVS 251
           W+V+
Sbjct: 464 WRVT 467

 Score = 50.1 bits (118), Expect = 2e-05
 Identities = 33/113 (29%), Positives = 58/113 (51%), Gaps = 1/113 (0%)
 Frame = -2

Query: 592 VLFSGACDRSILVWEREDSANHMVVSGALRGHEKAILCLIN-VSDFLLSGSADRTVRVWK 416
           +L+SG+ D+++ VW   DS        ++  H+ A+  +++   D + +GSAD T++VWK
Sbjct: 259 LLYSGSWDKTLKVWRLSDSK----CLESIEAHDDAVNTVVSGFDDLVFTGSADGTLKVWK 314

Query: 415 RGFDGPFCCLAVLDGHRKPVKSLAAVAESDQNSPDGVVSIFSGSLDGEIRVWQ 257
           R   G      ++    K   ++ A+A    N  D VV  + GS DG +  W+
Sbjct: 315 REVQGKEMKHVLVQVLMKQENAVTALA---VNLTDAVV--YCGSSDGTVNFWE 362

 Score = 34.7 bits (78), Expect = 0.96
 Identities = 26/98 (26%), Positives = 43/98 (43%)
 Frame = -2

Query: 544 EDSANHMVVSGALRGHEKAILCLINVSDFLLSGSADRTVRVWKRGFDGPFCCLAVLDGHR 365
           ED  ++ ++   +R  E  +  L    D L +GS  + +RVWK         L    G +
Sbjct: 119 EDDPDNGLIGTVVR-QEGHVYSLAASGDLLFTGSDSKNIRVWKD--------LKDFSGFK 169

Query: 364 KPVKSLAAVAESDQNSPDGVVSIFSGSLDGEIRVWQVS 251
                + A+  +  N       +F+G  DG+IRVW+ S
Sbjct: 170 STSGFVKAIVVTRDN------RVFTGHQDGKIRVWRGS 201

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 553,867,184
Number of Sequences: 1393205
Number of extensions: 13114672
Number of successful extensions: 72956
Number of sequences better than 10.0: 1251
Number of HSP's better than 10.0 without gapping: 57386
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 69777
length of database: 448,689,247
effective HSP length: 117
effective length of database: 285,684,262
effective search space used: 23426109484
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


no.  clone       accession  position
1    MR052h07_f  BP080044     1  355
2    MR043e11_f  BP079336     1  457
3    MR028c07_f  BP078147   119  603




Lotus japonicus
Kazusa DNA Research Institute