Medicago
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC146759.10 + phase: 1 /partial
         (178 letters)

Database: ara_mips 
           26,719 sequences; 11,318,596 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

At5g35530 40S ribosomal protein S3                                    303  3e-83
At3g53870 ribosomal protein S3a homolog                               298  1e-81
At2g31610 40S ribosomal protein; contains C-terminal domain           293  3e-80
rps3 -chloroplast genome- ribosomal protein S3                         40  9e-04
At3g55420 unknown protein (At3g55420)                                  29  1.2
At5g06710 homeobox protein                                             27  4.4
At1g13170 putative oxysterol-binding protein                           27  4.4
At4g29520 unknown protein                                              27  5.8
At4g30150 hypothetical protein                                         27  7.6
At3g47630 unknown protein                                              27  7.6
At1g17740 unknown protein                                              27  7.6
At2g06025 unknown protein                                              26  9.9

>At5g35530 40S ribosomal protein S3
          Length = 248

 Score =  303 bits (776), Expect = 3e-83
 Identities = 150/173 (86%), Positives = 160/173 (91%)

Query: 1   EKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACY 60
           EKGRRIRELTS+VQKRFKFP++SVELYAEKV NRGLCAIAQAESLRYKLLGGLAVRRACY
Sbjct: 61  EKGRRIRELTSLVQKRFKFPQDSVELYAEKVANRGLCAIAQAESLRYKLLGGLAVRRACY 120

Query: 61  GVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKDYIDSAVRHVLLRQG 120
           GVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP K+YID+AVRHVLLRQG
Sbjct: 121 GVLRFVMESGAKGCEVIVSGKLRAARAKSMKFKDGYMVSSGQPTKEYIDAAVRHVLLRQG 180

Query: 121 VLGIKVKIMLDWDPKGKQGPKTPLPDIVTIHTPKEEEEYIRPAAVVANDIEVP 173
           VLG+KVKIMLDWDPKGKQGP TPLPD+V IHTPKE++ YI PA VV     VP
Sbjct: 181 VLGLKVKIMLDWDPKGKQGPMTPLPDVVIIHTPKEDDVYIAPAQVVTQAAFVP 233


>At3g53870 ribosomal protein S3a homolog
          Length = 249

 Score =  298 bits (762), Expect = 1e-81
 Identities = 148/167 (88%), Positives = 155/167 (92%)

Query: 1   EKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACY 60
           EKGRRIRELTS+VQKRFKFP +SVELYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACY
Sbjct: 61  EKGRRIRELTSLVQKRFKFPVDSVELYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACY 120

Query: 61  GVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKDYIDSAVRHVLLRQG 120
           GVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP K+YIDSAVRHVLLRQG
Sbjct: 121 GVLRFVMESGAKGCEVIVSGKLRAARAKSMKFKDGYMVSSGQPTKEYIDSAVRHVLLRQG 180

Query: 121 VLGIKVKIMLDWDPKGKQGPKTPLPDIVTIHTPKEEEEYIRPAAVVA 167
           VLGIKVK+MLDWDPKG  GPKTPLPD+V IH+PKEEE    PA V A
Sbjct: 181 VLGIKVKVMLDWDPKGISGPKTPLPDVVIIHSPKEEEAIYAPAQVAA 227


>At2g31610 40S ribosomal protein; contains C-terminal domain
          Length = 250

 Score =  293 bits (750), Expect = 3e-80
 Identities = 145/167 (86%), Positives = 153/167 (90%)

Query: 1   EKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACY 60
           EKGRRIRELTS+VQKRFKFP +SVELYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACY
Sbjct: 61  EKGRRIRELTSLVQKRFKFPVDSVELYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACY 120

Query: 61  GVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKDYIDSAVRHVLLRQG 120
           GVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP K+YID+AVRHVLLRQG
Sbjct: 121 GVLRFVMESGAKGCEVIVSGKLRAARAKSMKFKDGYMVSSGQPTKEYIDAAVRHVLLRQG 180

Query: 121 VLGIKVKIMLDWDPKGKQGPKTPLPDIVTIHTPKEEEEYIRPAAVVA 167
           VLGIKVKIMLDWDP GK GPKTPLPD+V IH PK++  Y  PA   A
Sbjct: 181 VLGIKVKIMLDWDPTGKSGPKTPLPDVVIIHAPKDDVVYSAPAQAAA 227


>rps3 -chloroplast genome- ribosomal protein S3
          Length = 218

 Score = 39.7 bits (91), Expect = 9e-04
 Identities = 31/133 (23%), Positives = 64/133 (47%), Gaps = 5/133 (3%)

Query: 1   EKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACY 60
           +K RR+ EL   VQK        + +   +++N        AE +  +L   ++ R+A  
Sbjct: 87  DKPRRVEELQMNVQKELNCVNRKLNIAITRISNPYGDPNILAEFIAGQLKNRVSFRKAMK 146

Query: 61  GVLRFVMESGAKGCEVIVSGKLRAQRAKSMKF-KDGYM-ISSGQPVKDYIDSAVRHVLLR 118
             +    ++  KG +V ++G++  +    +++ ++G + + + +   DY    VR +   
Sbjct: 147 KAIELTEQANTKGIQVQIAGRIDGKEIARVEWIREGRVPLQTIEAKIDYCSYTVRTIY-- 204

Query: 119 QGVLGIKVKIMLD 131
            GVLGIK+ I +D
Sbjct: 205 -GVLGIKIWIFVD 216


>At3g55420 unknown protein (At3g55420)
          Length = 216

 Score = 29.3 bits (64), Expect = 1.2
 Identities = 17/53 (32%), Positives = 26/53 (48%)

Query: 4   RRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLLGGLAVR 56
           + IR+L +    RFKF +  +  Y   V N G    A AE L+  L   +A++
Sbjct: 135 KMIRKLKNWEMARFKFRKGCITFYVYAVRNAGNEGFAAAEDLKVILQAVVALK 187


>At5g06710 homeobox protein
          Length = 336

 Score = 27.3 bits (59), Expect = 4.4
 Identities = 38/156 (24%), Positives = 62/156 (39%), Gaps = 28/156 (17%)

Query: 14  QKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACYGVLRFVMESGAKG 73
           +K+F F E +      K+NN  + + + +E    K LG        +G  R +  S +  
Sbjct: 12  KKQFSFMEKN-----SKINNPSVSSTSTSE----KDLGFCMALDVAFGGHRSLSSSSSPS 62

Query: 74  CEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKDYIDSAVRHVLLRQGVLGIKVKIMLDWD 133
            E     K  A RAK     D + +SS       +D  ++        L +     L  +
Sbjct: 63  VED--EKKKPAPRAKK---SDEFRVSSS------VDPPLQ--------LQLHFPNWLPEN 103

Query: 134 PKGKQGPKTPLPDIVTIHTPKEEEEYIRPAAVVAND 169
            KG+QG + PL     +   +EEEE +   +V   D
Sbjct: 104 SKGRQGGRMPLGAATVVEEEEEEEEAVPSMSVSPPD 139


>At1g13170 putative oxysterol-binding protein
          Length = 816

 Score = 27.3 bits (59), Expect = 4.4
 Identities = 41/206 (19%), Positives = 78/206 (36%), Gaps = 37/206 (17%)

Query: 2   KGRRIRELTSVVQKRFK--FPENSVELYAEKVNNRGLCAIAQAESLRYKLLG-------- 51
           +GR+ +    ++ + ++  +P+  +  ++EKV++  +      E   +   G        
Sbjct: 506 EGRQCKPFNPLLGETYEADYPDKGLRFFSEKVSHHPMIVACHCEGQGWNFWGDSNIKGKF 565

Query: 52  -GLAVRRACYGVLRFVMESG--------AKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQ 102
            G +++    GVL    + G              I+ GKL      +M+ K G   S   
Sbjct: 566 WGRSIQLDPVGVLTLKFDDGEIYQWSKVTTSIYNIILGKLYCDHYGTMRIKGGSNYSCRL 625

Query: 103 PVKD--YIDSAVR--HVLLRQGVLGIKVKIML-DW---------DPKGKQGPKTPLPDIV 148
             K+   ID   R  H  ++    G KV I++  W         DP  K     P+ + V
Sbjct: 626 KFKEQSVIDRNPRQVHGFVQDNRTGEKVAILIGKWDEAMYYVLGDPTTKPKGYDPMTEAV 685

Query: 149 TI----HTPKEEEEYIRPAAVVANDI 170
            +     +P +    + P A+  N+I
Sbjct: 686 LLWERDKSPTKTRYNLSPFAISLNEI 711


>At4g29520 unknown protein
          Length = 306

 Score = 26.9 bits (58), Expect = 5.8
 Identities = 12/34 (35%), Positives = 20/34 (58%)

Query: 1   EKGRRIRELTSVVQKRFKFPENSVELYAEKVNNR 34
           EK  +  EL   + K FK    +++ +A+KV+NR
Sbjct: 250 EKESKTEELKKTITKEFKKKGEALKRHAQKVSNR 283


>At4g30150 hypothetical protein
          Length = 1966

 Score = 26.6 bits (57), Expect = 7.6
 Identities = 19/68 (27%), Positives = 32/68 (46%), Gaps = 4/68 (5%)

Query: 39  IAQAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMI 98
           + ++E    KLL   A+R A +  ++ + E  A GC       L A  +K+MK+      
Sbjct: 697 LERSEKSVEKLLSSQALRLAIHKAIKVIPEGQASGC----IKSLTADVSKTMKWIKQVCC 752

Query: 99  SSGQPVKD 106
           S+G   +D
Sbjct: 753 STGATEQD 760


>At3g47630 unknown protein
          Length = 320

 Score = 26.6 bits (57), Expect = 7.6
 Identities = 23/112 (20%), Positives = 47/112 (41%), Gaps = 17/112 (15%)

Query: 8   ELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLL--GGLAVRRACYGVL-- 63
           ++  +V+ +F   ++  + + E+   + L   + AE+   KL+    L+  R+    L  
Sbjct: 179 KVNKIVKGQFDLFQSMYKPFLEECETKNLLRFSSAEASHTKLVQDSSLSATRSLVSSLPA 238

Query: 64  -------------RFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQ 102
                        +FV E+G    EV +S +  A +      +   M+SSG+
Sbjct: 239 SVRSQMGKSLGEKKFVSETGRVMGEVCISSREEAAKCMEKVMRRRVMVSSGR 290


>At1g17740 unknown protein
          Length = 624

 Score = 26.6 bits (57), Expect = 7.6
 Identities = 36/140 (25%), Positives = 58/140 (40%), Gaps = 16/140 (11%)

Query: 1   EKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLLGGLAVR-RAC 59
           +KG RI E   VV    ++P +S+++    V +    A++ A        G +++  +  
Sbjct: 479 QKGLRISEERMVVDSSPEYPVDSIQVQILNVESNFAGAVSDA--------GDISIEGKVK 530

Query: 60  YGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYM--ISSGQPVKDYIDSAVRHVLL 117
           YGV       G+ G +V + G L   R        G +  I   Q V     S  R VL 
Sbjct: 531 YGVPHLTC-VGSFGVDVSLEGNLILCRQVDQPGMIGQVGNILGEQNVNVNFMSVGRTVLR 589

Query: 118 RQGVLGIKVKIMLDWDPKGK 137
           +Q ++ I V    D +P  K
Sbjct: 590 KQAIMAIGV----DEEPDNK 605


>At2g06025 unknown protein
          Length = 288

 Score = 26.2 bits (56), Expect = 9.9
 Identities = 18/48 (37%), Positives = 25/48 (51%), Gaps = 8/48 (16%)

Query: 29  EKVNNRGLCAIAQAESLRYKLLGGLAVRR-------ACYGVLRFVMES 69
           EKV ++  C+I Q  S RY  +  L V +       AC  +LRF +ES
Sbjct: 191 EKVKSQLFCSINQEGSNRYGYIANLCVAKSARRQGIAC-NMLRFAVES 237


  Database: ara_mips
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 2,978,382
  Number of sequences in database:  6832
  
  Database: /data/blast2/ara_mips_chr2
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 1,737,135
  Number of sequences in database:  4184
  
  Database: /data/blast2/ara_mips_chr3
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 2,236,886
  Number of sequences in database:  5377
  
  Database: /data/blast2/ara_mips_chr4
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 1,748,816
  Number of sequences in database:  4030
  
  Database: /data/blast2/ara_mips_chr5
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 2,569,679
  Number of sequences in database:  6098
  
  Database: /data/blast2/ara_mips_chl
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 25,951
  Number of sequences in database:  85
  
  Database: /data/blast2/ara_mips_mit
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 21,747
  Number of sequences in database:  113
  
Lambda     K      H
   0.319    0.138    0.398 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,883,811
Number of Sequences: 26719
Number of extensions: 159418
Number of successful extensions: 363
Number of sequences better than 10.0: 12
Number of HSP's better than 10.0 without gapping: 5
Number of HSP's successfully gapped in prelim test: 7
Number of HSP's that attempted gapping in prelim test: 356
Number of HSP's gapped (non-prelim): 12
length of query: 178
length of database: 11,318,596
effective HSP length: 93
effective length of query: 85
effective length of database: 8,833,729
effective search space: 750866965
effective search space used: 750866965
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 56 (26.2 bits)


Medicago: description of AC146759.10