Medicago
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC149547.6 + phase: 0 
         (160 letters)

Database: ara_mips 
           26,719 sequences; 11,318,596 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

At1g20270 unknown protein                                             243  4e-65
At2g17720 similar to prolyl 4-hydroxylase alpha subunit               228  1e-60
At5g66060 prolyl 4-hydroxylase, alpha subunit-like protein            207  2e-54
At4g35810 putative protein                                            179  4e-46
At3g28480 prolyl 4-hydroxylase like protein                           170  3e-43
At5g18900 unknown protein                                             167  2e-42
At3g06300 unknown protein                                             162  6e-41
At2g43080 unknown protein                                             151  1e-37
At4g35820 putative protein                                            138  1e-33
At4g33910 unknown protein                                             120  3e-28
At3g28490 prolyl 4-hydroxylase, putative                              103  3e-23
At4g25600 unknown protein                                              60  4e-10
At2g23100 hypothetical protein                                         41  3e-04
At1g06290 acyl-CoA oxidase, putative, 3' partial                       28  2.1
At1g48930 putative glucanase gb|AAB91971.1; similar to ESTs gb|A...    28  2.8
At5g39400 PTEN -like protein                                           27  4.7
At2g34260 unknown protein                                              27  4.7
At2g13870 En/Spm-like transposon protein                               27  4.7
At1g52850 hypothetical protein                                         27  4.7
At1g63020 RNA polymerase IIA largest subunit, putative                 27  6.2

>At1g20270 unknown protein
          Length = 287

 Score =  243 bits (619), Expect = 4e-65
 Identities = 114/180 (63%), Positives = 137/180 (75%), Gaps = 22/180 (12%)

Query: 1   MHKSTV-DDETGKSVDNSARTSSGTFINRGHDKILRNIEQRIADFTFIPVENGESVNILH 59
           M KSTV D ETGKS D+  RTSSGTF+ RG DKI++ IE+RIAD+TFIP ++GE + +LH
Sbjct: 108 MVKSTVVDSETGKSKDSRVRTSSGTFLRRGRDKIIKTIEKRIADYTFIPADHGEGLQVLH 167

Query: 60  YEVGQKYEPHPDFFTDEINTKNGGERVATMLMYL---------------------PWWNE 98
           YE GQKYEPH D+F DE NTKNGG+R+ATMLMYL                     PW+NE
Sbjct: 168 YEAGQKYEPHYDYFVDEFNTKNGGQRMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNE 227

Query: 99  LSDCGKKGLSIKPKMGDALLFWSMKPDGTLDPLSMHGACPVIKGDKWSCTKWMRVGKWSI 158
           LS+CGKKGLS+KP+MGDALLFWSM+PD TLDP S+HG CPVI+G+KWS TKWM VG++ I
Sbjct: 228 LSECGKKGLSVKPRMGDALLFWSMRPDATLDPTSLHGGCPVIRGNKWSSTKWMHVGEYKI 287


>At2g17720 similar to prolyl 4-hydroxylase alpha subunit
          Length = 291

 Score =  228 bits (580), Expect = 1e-60
 Identities = 106/180 (58%), Positives = 133/180 (73%), Gaps = 22/180 (12%)

Query: 1   MHKSTV-DDETGKSVDNSARTSSGTFINRGHDKILRNIEQRIADFTFIPVENGESVNILH 59
           M KSTV D++TG S D+  RTSSGTF+ RGHD+++  IE+RI+DFTFIPVENGE + +LH
Sbjct: 112 MVKSTVVDEKTGGSKDSRVRTSSGTFLRRGHDEVVEVIEKRISDFTFIPVENGEGLQVLH 171

Query: 60  YEVGQKYEPHPDFFTDEINTKNGGERVATMLMYL---------------------PWWNE 98
           Y+VGQKYEPH D+F DE NTKNGG+R+AT+LMYL                     PWWNE
Sbjct: 172 YQVGQKYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVDDGGETVFPAARGNISAVPWWNE 231

Query: 99  LSDCGKKGLSIKPKMGDALLFWSMKPDGTLDPLSMHGACPVIKGDKWSCTKWMRVGKWSI 158
           LS CGK+GLS+ PK  DALLFW+M+PD +LDP S+HG CPV+KG+KWS TKW  V ++ +
Sbjct: 232 LSKCGKEGLSVLPKKRDALLFWNMRPDASLDPSSLHGGCPVVKGNKWSSTKWFHVHEFKV 291


>At5g66060 prolyl 4-hydroxylase, alpha subunit-like protein
          Length = 267

 Score =  207 bits (527), Expect = 2e-54
 Identities = 99/157 (63%), Positives = 118/157 (75%), Gaps = 22/157 (14%)

Query: 1   MHKSTV-DDETGKSVDNSARTSSGTFINRGHDKILRNIEQRIADFTFIPVENGESVNILH 59
           M KSTV D++TGKS D+  RTSSGTF+ RG DK +R IE+RI+DFTFIPVE+GE + +LH
Sbjct: 110 MEKSTVVDEKTGKSTDSRVRTSSGTFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVLH 169

Query: 60  YEVGQKYEPHPDFFTDEINTKNGGERVATMLMYL---------------------PWWNE 98
           YE+GQKYEPH D+F DE NT+NGG+R+AT+LMYL                     PWWNE
Sbjct: 170 YEIGQKYEPHYDYFMDEYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNE 229

Query: 99  LSDCGKKGLSIKPKMGDALLFWSMKPDGTLDPLSMHG 135
           LS+CGK GLS+KPKMGDALLFWSM PD TLDP S+HG
Sbjct: 230 LSECGKGGLSVKPKMGDALLFWSMTPDATLDPSSLHG 266


>At4g35810 putative protein
          Length = 307

 Score =  179 bits (455), Expect = 4e-46
 Identities = 83/138 (60%), Positives = 100/138 (72%), Gaps = 21/138 (15%)

Query: 19  RTSSGTFINRGHDKILRNIEQRIADFTFIPVENGESVNILHYEVGQKYEPHPDFFTDEIN 78
           RTSSGTF+NRGHD+I+  IE RI+DFTFIP ENGE + +LHYEVGQ+YEPH D+F DE N
Sbjct: 162 RTSSGTFLNRGHDEIVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEPHHDYFFDEFN 221

Query: 79  TKNGGERVATMLMYL---------------------PWWNELSDCGKKGLSIKPKMGDAL 117
            + GG+R+AT+LMYL                     PWW+ELS CGK+GLS+ PK  DAL
Sbjct: 222 VRKGGQRIATVLMYLSDVDEGGETVFPAAKGNVSDVPWWDELSQCGKEGLSVLPKKRDAL 281

Query: 118 LFWSMKPDGTLDPLSMHG 135
           LFWSMKPD +LDP S+HG
Sbjct: 282 LFWSMKPDASLDPSSLHG 299


>At3g28480 prolyl 4-hydroxylase like protein
          Length = 316

 Score =  170 bits (430), Expect = 3e-43
 Identities = 80/174 (45%), Positives = 120/174 (67%), Gaps = 21/174 (12%)

Query: 1   MHKSTV-DDETGKSVDNSARTSSGTFINRGHDKILRNIEQRIADFTFIPVENGESVNILH 59
           + KS V D+++G+SV++  RTSSG F+++  D I+ N+E ++A +TF+P ENGES+ ILH
Sbjct: 88  LEKSMVADNDSGESVESEVRTSSGMFLSKRQDDIVSNVEAKLAAWTFLPEENGESMQILH 147

Query: 60  YEVGQKYEPHPDFFTDEINTKNGGERVATMLMYL-----------PWW---------NEL 99
           YE GQKYEPH D+F D+ N + GG R+AT+LMYL           P W         +  
Sbjct: 148 YENGQKYEPHFDYFHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSW 207

Query: 100 SDCGKKGLSIKPKMGDALLFWSMKPDGTLDPLSMHGACPVIKGDKWSCTKWMRV 153
           ++C K+G ++KP+ GDALLF+++ P+ T D  S+HG+CPV++G+KWS T+W+ V
Sbjct: 208 TECAKQGYAVKPRKGDALLFFNLHPNATTDSNSLHGSCPVVEGEKWSATRWIHV 261


>At5g18900 unknown protein
          Length = 298

 Score =  167 bits (424), Expect = 2e-42
 Identities = 79/173 (45%), Positives = 113/173 (64%), Gaps = 23/173 (13%)

Query: 4   STVDDETGKSVDNSARTSSGTFINRGHDKILRNIEQRIADFTFIPVENGESVNILHYEVG 63
           +  D+++G+S  +  RTSSGTFI++G D I+  IE +I+ +TF+P ENGE + +L YE G
Sbjct: 73  AVADNDSGESKFSEVRTSSGTFISKGKDPIVSGIEDKISTWTFLPKENGEDIQVLRYEHG 132

Query: 64  QKYEPHPDFFTDEINTKNGGERVATMLMYL-----------------------PWWNELS 100
           QKY+ H D+F D++N   GG R+AT+LMYL                           +LS
Sbjct: 133 QKYDAHFDYFHDKVNIVRGGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSENKEDLS 192

Query: 101 DCGKKGLSIKPKMGDALLFWSMKPDGTLDPLSMHGACPVIKGDKWSCTKWMRV 153
           DC K+G+++KP+ GDALLF+++ PD   DPLS+HG CPVI+G+KWS TKW+ V
Sbjct: 193 DCAKRGIAVKPRKGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKWSATKWIHV 245


>At3g06300 unknown protein
          Length = 299

 Score =  162 bits (411), Expect = 6e-41
 Identities = 76/173 (43%), Positives = 112/173 (63%), Gaps = 23/173 (13%)

Query: 4   STVDDETGKSVDNSARTSSGTFINRGHDKILRNIEQRIADFTFIPVENGESVNILHYEVG 63
           +  D++ G+S  +  RTSSGTFI++G D I+  IE +++ +TF+P ENGE + +L YE G
Sbjct: 74  AVADNDNGESQVSDVRTSSGTFISKGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHG 133

Query: 64  QKYEPHPDFFTDEINTKNGGERVATMLMYL-----------------------PWWNELS 100
           QKY+ H D+F D++N   GG R+AT+L+YL                          ++LS
Sbjct: 134 QKYDAHFDYFHDKVNIARGGHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLS 193

Query: 101 DCGKKGLSIKPKMGDALLFWSMKPDGTLDPLSMHGACPVIKGDKWSCTKWMRV 153
           DC KKG+++KPK G+ALLF++++ D   DP S+HG CPVI+G+KWS TKW+ V
Sbjct: 194 DCAKKGIAVKPKKGNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIHV 246


>At2g43080 unknown protein
          Length = 283

 Score =  151 bits (382), Expect = 1e-37
 Identities = 83/167 (49%), Positives = 104/167 (61%), Gaps = 19/167 (11%)

Query: 4   STVDDETGKSVDNSARTSSGTFINRGHDK--ILRNIEQRIADFTFIPVENGESVNILHYE 61
           + VD +TGK V +  RTSSG F+        I++ IE+RIA F+ +P ENGE + +L YE
Sbjct: 113 TVVDVKTGKGVKSDVRTSSGMFLTHVERSYPIIQAIEKRIAVFSQVPAENGELIQVLRYE 172

Query: 62  VGQKYEPHPDFFTDEINTKNGGERVATMLMYL-----------PWWNELSDC---GK--K 105
             Q Y+PH D+F D  N K GG+RVATMLMYL           P   +  DC   GK  K
Sbjct: 173 PQQFYKPHHDYFADTFNLKRGGQRVATMLMYLTDDVEGGETYFPLAGD-GDCTCGGKIMK 231

Query: 106 GLSIKPKMGDALLFWSMKPDGTLDPLSMHGACPVIKGDKWSCTKWMR 152
           G+S+KP  GDA+LFWSM  DG  DP S+HG C V+ G+KWS TKWMR
Sbjct: 232 GISVKPTKGDAVLFWSMGLDGQSDPRSIHGGCEVLSGEKWSATKWMR 278


>At4g35820 putative protein
          Length = 272

 Score =  138 bits (348), Expect = 1e-33
 Identities = 70/135 (51%), Positives = 95/135 (69%), Gaps = 19/135 (14%)

Query: 10  TGKSVDNSARTSSGTFINRGHDKILRNIEQRIADFTFIPVENGESVNILHYEVGQKYEPH 69
           TG   ++S+RTSSGTFI  GHDKI++ IE+RI++FTFIP ENGE++ +++YEVGQK+EPH
Sbjct: 138 TGLGEESSSRTSSGTFIRSGHDKIVKEIEKRISEFTFIPQENGETLQVINYEVGQKFEPH 197

Query: 70  PDFFTDEINTKNGGERVATMLMYLPWWNELSDC---------GKKGLSIKPKMGDALLFW 120
            D          G +R+AT+LMYL   ++  +           KKG+S++PK GDALLFW
Sbjct: 198 FD----------GFQRIATVLMYLSDVDKGGETVFPEAKGIKSKKGVSVRPKKGDALLFW 247

Query: 121 SMKPDGTLDPLSMHG 135
           SM+PDG+ DP S HG
Sbjct: 248 SMRPDGSRDPSSKHG 262


>At4g33910 unknown protein
          Length = 288

 Score =  120 bits (301), Expect = 3e-28
 Identities = 66/166 (39%), Positives = 97/166 (57%), Gaps = 27/166 (16%)

Query: 11  GKSVDNS--ARTSSGTFINRGHDKI--LRNIEQRIADFTFIPVENGESVNILHYEVGQKY 66
           G++ +N+   RTSSGTFI+   +    L  +E++IA  T IP  +GES NIL YE+GQKY
Sbjct: 120 GETAENTKGTRTSSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKY 179

Query: 67  EPHPDFFTDEINTKNGGERVATMLMYLPWWNELSDCGKK--------------------G 106
           + H D F          +R+A+ L+YL   +++ + G+                     G
Sbjct: 180 DSHYDVFNPTEYGPQSSQRIASFLLYL---SDVEEGGETMFPFENGSNMGIGYDYKQCIG 236

Query: 107 LSIKPKMGDALLFWSMKPDGTLDPLSMHGACPVIKGDKWSCTKWMR 152
           L +KP+ GD LLF+S+ P+GT+D  S+HG+CPV KG+KW  TKW+R
Sbjct: 237 LKVKPRKGDGLLFYSVFPNGTIDQTSLHGSCPVTKGEKWVATKWIR 282


>At3g28490 prolyl 4-hydroxylase, putative
          Length = 213

 Score =  103 bits (258), Expect = 3e-23
 Identities = 63/158 (39%), Positives = 88/158 (54%), Gaps = 29/158 (18%)

Query: 1   MHKSTV--DDETGKSVDNSARTSSGTFINRGHDKILRNIEQRIADFTFIPVENGESVNIL 58
           + KS V  D ++G+S D+  RTSSG F+ +  D I+ N+E ++A +TF+P ENGE++ IL
Sbjct: 64  LEKSMVVADVDSGESEDSEVRTSSGMFLTKRQDDIVANVEAKLAAWTFLPEENGEALQIL 123

Query: 59  HYEVGQKYEPHPDFFTDEINTKNGGERVATMLMYLPWWNELSDCGKKGLSIKPKMGDALL 118
           HYE GQKY+PH D+F D+   + GG R+AT+LMY      LS+  K G ++ P       
Sbjct: 124 HYENGQKYDPHFDYFYDKKALELGGHRIATVLMY------LSNVTKGGETVFPN------ 171

Query: 119 FWSMKPDGTLDPLSMHGACPVIKGDKWS-CTKWMRVGK 155
            W              G  P +K D WS C K    GK
Sbjct: 172 -WK-------------GKTPQLKDDSWSKCAKQGYAGK 195


>At4g25600 unknown protein
          Length = 291

 Score = 60.5 bits (145), Expect = 4e-10
 Identities = 37/132 (28%), Positives = 69/132 (52%), Gaps = 14/132 (10%)

Query: 31  DKILRNIEQRIADFTFIPVENGESVNILHYEVGQKYEPHPDFFTDEINTKNGGERVATML 90
           D ++  IE++++ +TF+P ENG S+ +  Y   +K     D+F +E ++      +AT++
Sbjct: 105 DPVVAGIEEKVSAWTFLPGENGGSIKVRSY-TSEKSGKKLDYFGEEPSSVLHESLLATVV 163

Query: 91  MYLPWWNE-------------LSDCGKKGLSIKPKMGDALLFWSMKPDGTLDPLSMHGAC 137
           +YL    +              + C + G  ++P  G+A+LF++   + +LD  S H  C
Sbjct: 164 LYLSNTTQGGELLFPNSEMKPKNSCLEGGNILRPVKGNAILFFTRLLNASLDGKSTHLRC 223

Query: 138 PVIKGDKWSCTK 149
           PV+KG+    TK
Sbjct: 224 PVVKGELLVATK 235


>At2g23100 hypothetical protein
          Length = 1036

 Score = 40.8 bits (94), Expect = 3e-04
 Identities = 20/41 (48%), Positives = 27/41 (65%)

Query: 33  ILRNIEQRIADFTFIPVENGESVNILHYEVGQKYEPHPDFF 73
           +L  IE++IA  T  P +  ES NIL Y++GQKY+ H D F
Sbjct: 861 VLAAIEEKIALATRFPKDYYESFNILRYQLGQKYDSHYDAF 901


>At1g06290 acyl-CoA oxidase, putative, 3' partial
          Length = 289

 Score = 28.1 bits (61), Expect = 2.1
 Identities = 17/52 (32%), Positives = 23/52 (43%), Gaps = 2/52 (3%)

Query: 28  RGHDKILRNIEQRIADFTFIPVENGESVNILHYEVGQKYEPHPDFFTDEINT 79
           R H+K L+N E  +    F   E G   N+   E    Y+P  + F   INT
Sbjct: 168 RHHEKWLKNTEDYVVKGCFAMTELGHGSNVRGIETVTTYDPKTEEFV--INT 217


>At1g48930 putative glucanase gb|AAB91971.1; similar to ESTs
           gb|AI995074.1, gb|AA651396.1
          Length = 627

 Score = 27.7 bits (60), Expect = 2.8
 Identities = 14/39 (35%), Positives = 20/39 (50%), Gaps = 3/39 (7%)

Query: 64  QKYEPHPDFFTDEINTKNGGERVATM---LMYLPWWNEL 99
           ++Y+   D+F      KNGG  + T    LMY+  WN L
Sbjct: 308 KQYQTKADYFACACLKKNGGYNIQTTPGGLMYVREWNNL 346


>At5g39400 PTEN -like protein
          Length = 412

 Score = 26.9 bits (58), Expect = 4.7
 Identities = 11/44 (25%), Positives = 19/44 (43%)

Query: 63  GQKYEPHPDFFTDEINTKNGGERVATMLMYLPWWNELSDCGKKG 106
           G   E   + +     T N G  + +   Y+ +W++L    KKG
Sbjct: 171 GMSAEEALEMYASRRTTNNNGVSIPSQRRYVKYWSDLLSFSKKG 214


>At2g34260 unknown protein
          Length = 353

 Score = 26.9 bits (58), Expect = 4.7
 Identities = 19/63 (30%), Positives = 29/63 (45%), Gaps = 11/63 (17%)

Query: 65  KYEPHPDFFTDEINT----KNG-----GERVATMLMYLPWWNELSDCGKKGLSIKPKMGD 115
           K +   +F  DE+ +    KNG     G +  T+L+Y   W    DC  + + + P   D
Sbjct: 168 KVQSQSEFSEDELLSVVIMKNGRKVICGTQNGTLLLYS--WGFFKDCSDRFVDLAPNSVD 225

Query: 116 ALL 118
           ALL
Sbjct: 226 ALL 228


>At2g13870 En/Spm-like transposon protein
          Length = 441

 Score = 26.9 bits (58), Expect = 4.7
 Identities = 10/30 (33%), Positives = 19/30 (63%)

Query: 14  VDNSARTSSGTFINRGHDKILRNIEQRIAD 43
           +D + R S GTFI+R  +++ + +  RI +
Sbjct: 266 LDETHRKSDGTFIDRKSEEVYKEVSSRIRE 295


>At1g52850 hypothetical protein
          Length = 447

 Score = 26.9 bits (58), Expect = 4.7
 Identities = 10/30 (33%), Positives = 19/30 (63%)

Query: 14  VDNSARTSSGTFINRGHDKILRNIEQRIAD 43
           +D + R S GTFI+R  +++ + +  RI +
Sbjct: 266 LDETHRKSDGTFIDRKSEEVYKEVSSRIQE 295


>At1g63020 RNA polymerase IIA largest subunit, putative
          Length = 1453

 Score = 26.6 bits (57), Expect = 6.2
 Identities = 18/52 (34%), Positives = 26/52 (49%), Gaps = 5/52 (9%)

Query: 103  GKKGLSI-KPKMGDALLFWSMKPDGTLDPLSMH----GACPVIKGDKWSCTK 149
            G KG+ + K K GD+  F  ++ DGT +  S H    GA  +I   K +  K
Sbjct: 1384 GVKGIRVAKSKHGDSCCFEVVRIDGTFEDFSYHKCVLGATKIIAPKKMNFYK 1435


  Database: ara_mips
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 2,978,382
  Number of sequences in database:  6832
  
  Database: /data/blast2/ara_mips_chr2
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 1,737,135
  Number of sequences in database:  4184
  
  Database: /data/blast2/ara_mips_chr3
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 2,236,886
  Number of sequences in database:  5377
  
  Database: /data/blast2/ara_mips_chr4
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 1,748,816
  Number of sequences in database:  4030
  
  Database: /data/blast2/ara_mips_chr5
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 2,569,679
  Number of sequences in database:  6098
  
  Database: /data/blast2/ara_mips_chl
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 25,951
  Number of sequences in database:  85
  
  Database: /data/blast2/ara_mips_mit
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 21,747
  Number of sequences in database:  113
  
Lambda     K      H
   0.317    0.136    0.438 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,156,296
Number of Sequences: 26719
Number of extensions: 176592
Number of successful extensions: 370
Number of sequences better than 10.0: 22
Number of HSP's better than 10.0 without gapping: 18
Number of HSP's successfully gapped in prelim test: 5
Number of HSP's that attempted gapping in prelim test: 337
Number of HSP's gapped (non-prelim): 24
length of query: 160
length of database: 11,318,596
effective HSP length: 91
effective length of query: 69
effective length of database: 8,887,167
effective search space: 613214523
effective search space used: 613214523
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 56 (26.2 bits)
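
The figures in the trailer above can be cross-checked against the hit list. The sketch below is not part of the BLAST output. It assumes the standard Karlin-Altschul relations (bit score = (Lambda*S - ln K)/ln 2, E = search space * 2^-bits) and assumes the gapped Lambda and K apply to the gapped alignments reported here. The function names are illustrative only, and because the printed parameters are rounded, the results agree with the report only to its displayed precision.

```python
import math

# Values copied from the report trailer above.
QUERY_LEN = 160            # "length of query"
DB_LETTERS = 11_318_596    # "length of database"
DB_SEQS = 26_719           # "Number of Sequences"
EFF_HSP_LEN = 91           # "effective HSP length"
GAPPED_LAMBDA = 0.267      # gapped Lambda
GAPPED_K = 0.0410          # gapped K


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Effective query length times effective database length."""
    eff_query = query_len - hsp_len            # 160 - 91 = 69
    eff_db = db_letters - db_seqs * hsp_len    # one HSP length removed per sequence
    return eff_query * eff_db


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Expected number of chance alignments scoring at least raw_score."""
    return search_space * 2.0 ** (-bit_score(raw_score, lam, k))


if __name__ == "__main__":
    space = effective_search_space(QUERY_LEN, DB_LETTERS, DB_SEQS, EFF_HSP_LEN)
    print("effective search space:", space)
    # Raw scores are the parenthesized values in the alignment headers:
    # At1g20270 (619), At2g17720 (580), At1g06290 (61).
    for raw in (619, 580, 61):
        print(raw, f"{bit_score(raw):.1f} bits", f"E = {expect_value(raw, space):.1e}")
```

Run as a script, this prints the search space and, for each of the three raw scores, a bit score and E-value that can be compared with the corresponding "Score" and "Expect" lines in the alignments.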

