
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC149547.9 + phase: 0
(180 letters)
Database: ara_mips
26,719 sequences; 11,318,596 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
At1g20270 unknown protein 291 1e-79
At2g17720 similar to prolyl 4-hydroxylase alpha subunit 278 1e-75
At5g66060 prolyl 4-hydroxylase, alpha subunit-like protein 265 1e-71
At4g35810 putative protein 220 3e-58
At3g28480 prolyl 4-hydroxylase like protein 196 7e-51
At5g18900 unknown protein 189 9e-49
At3g06300 unknown protein 187 2e-48
At4g35820 putative protein 162 7e-41
At2g43080 unknown protein 161 2e-40
At4g33910 unknown protein 150 3e-37
At3g28490 prolyl 4-hydroxylase, putative 125 1e-29
At4g25600 unknown protein 89 2e-18
At2g23100 hypothetical protein 46 1e-05
At3g20810 unknown protein 30 0.91
At4g10320 isoleucine-tRNA ligase - like protein 28 2.7
At3g57430 putative protein 28 2.7
At3g04550 unknown protein 28 3.5
At1g48660 hypothetical protein 28 3.5
>At1g20270 unknown protein
Length = 287
Score = 291 bits (746), Expect = 1e-79
Identities = 132/180 (73%), Positives = 153/180 (84%)
Query: 1 MHKSAVIDEETGNGVDSRERTSSGAFLKRGSDRIVKNIERRIADFTFIPVEHGENFNVLH 60
M KS V+D ETG DSR RTSSG FL+RG D+I+K IE+RIAD+TFIP +HGE VLH
Sbjct: 108 MVKSTVVDSETGKSKDSRVRTSSGTFLRRGRDKIIKTIEKRIADYTFIPADHGEGLQVLH 167
Query: 61 YEVGQKYEPHYDYFMDTFSTTYAGQRIATMLMYLSDVEEGGETVFPNAKGNFSSVPWWNE 120
YE GQKYEPHYDYF+D F+T GQR+ATMLMYLSDVEEGGETVFP A NFSSVPW+NE
Sbjct: 168 YEAGQKYEPHYDYFVDEFNTKNGGQRMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNE 227
Query: 121 LSDCGKGGLSIKPKMGNAILFWSMKPDATLDPSSLHGACPVIKGDKWLCAKWMHVGEFKI 180
LS+CGK GLS+KP+MG+A+LFWSM+PDATLDP+SLHG CPVI+G+KW KWMHVGE+KI
Sbjct: 228 LSECGKKGLSVKPRMGDALLFWSMRPDATLDPTSLHGGCPVIRGNKWSSTKWMHVGEYKI 287
>At2g17720 similar to prolyl 4-hydroxylase alpha subunit
Length = 291
Score = 278 bits (710), Expect = 1e-75
Identities = 125/180 (69%), Positives = 150/180 (82%)
Query: 1 MHKSAVIDEETGNGVDSRERTSSGAFLKRGSDRIVKNIERRIADFTFIPVEHGENFNVLH 60
M KS V+DE+TG DSR RTSSG FL+RG D +V+ IE+RI+DFTFIPVE+GE VLH
Sbjct: 112 MVKSTVVDEKTGGSKDSRVRTSSGTFLRRGHDEVVEVIEKRISDFTFIPVENGEGLQVLH 171
Query: 61 YEVGQKYEPHYDYFMDTFSTTYAGQRIATMLMYLSDVEEGGETVFPNAKGNFSSVPWWNE 120
Y+VGQKYEPHYDYF+D F+T GQRIAT+LMYLSDV++GGETVFP A+GN S+VPWWNE
Sbjct: 172 YQVGQKYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVDDGGETVFPAARGNISAVPWWNE 231
Query: 121 LSDCGKGGLSIKPKMGNAILFWSMKPDATLDPSSLHGACPVIKGDKWLCAKWMHVGEFKI 180
LS CGK GLS+ PK +A+LFW+M+PDA+LDPSSLHG CPV+KG+KW KW HV EFK+
Sbjct: 232 LSKCGKEGLSVLPKKRDALLFWNMRPDASLDPSSLHGGCPVVKGNKWSSTKWFHVHEFKV 291
>At5g66060 prolyl 4-hydroxylase, alpha subunit-like protein
Length = 267
Score = 265 bits (677), Expect = 1e-71
Identities = 120/157 (76%), Positives = 137/157 (86%)
Query: 1 MHKSAVIDEETGNGVDSRERTSSGAFLKRGSDRIVKNIERRIADFTFIPVEHGENFNVLH 60
M KS V+DE+TG DSR RTSSG FL RG D+ ++ IE+RI+DFTFIPVEHGE VLH
Sbjct: 110 MEKSTVVDEKTGKSTDSRVRTSSGTFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVLH 169
Query: 61 YEVGQKYEPHYDYFMDTFSTTYAGQRIATMLMYLSDVEEGGETVFPNAKGNFSSVPWWNE 120
YE+GQKYEPHYDYFMD ++T GQRIAT+LMYLSDVEEGGETVFP AKGN+S+VPWWNE
Sbjct: 170 YEIGQKYEPHYDYFMDEYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNE 229
Query: 121 LSDCGKGGLSIKPKMGNAILFWSMKPDATLDPSSLHG 157
LS+CGKGGLS+KPKMG+A+LFWSM PDATLDPSSLHG
Sbjct: 230 LSECGKGGLSVKPKMGDALLFWSMTPDATLDPSSLHG 266
>At4g35810 putative protein
Length = 307
Score = 220 bits (561), Expect = 3e-58
Identities = 111/188 (59%), Positives = 127/188 (67%), Gaps = 31/188 (16%)
Query: 1 MHKSAVIDEETGNGVDSR-------------------------------ERTSSGAFLKR 29
M KS V+D +TG +DSR RTSSG FL R
Sbjct: 112 MMKSKVVDVKTGKSIDSRFCTLTSVVVFTFQLNLERFENSKFANPSLCRVRTSSGTFLNR 171
Query: 30 GSDRIVKNIERRIADFTFIPVEHGENFNVLHYEVGQKYEPHYDYFMDTFSTTYAGQRIAT 89
G D IV+ IE RI+DFTFIP E+GE VLHYEVGQ+YEPH+DYF D F+ GQRIAT
Sbjct: 172 GHDEIVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEPHHDYFFDEFNVRKGGQRIAT 231
Query: 90 MLMYLSDVEEGGETVFPNAKGNFSSVPWWNELSDCGKGGLSIKPKMGNAILFWSMKPDAT 149
+LMYLSDV+EGGETVFP AKGN S VPWW+ELS CGK GLS+ PK +A+LFWSMKPDA+
Sbjct: 232 VLMYLSDVDEGGETVFPAAKGNVSDVPWWDELSQCGKEGLSVLPKKRDALLFWSMKPDAS 291
Query: 150 LDPSSLHG 157
LDPSSLHG
Sbjct: 292 LDPSSLHG 299
>At3g28480 prolyl 4-hydroxylase like protein
Length = 316
Score = 196 bits (497), Expect = 7e-51
Identities = 89/179 (49%), Positives = 129/179 (71%), Gaps = 1/179 (0%)
Query: 1 MHKSAVIDEETGNGVDSRERTSSGAFLKRGSDRIVKNIERRIADFTFIPVEHGENFNVLH 60
+ KS V D ++G V+S RTSSG FL + D IV N+E ++A +TF+P E+GE+ +LH
Sbjct: 88 LEKSMVADNDSGESVESEVRTSSGMFLSKRQDDIVSNVEAKLAAWTFLPEENGESMQILH 147
Query: 61 YEVGQKYEPHYDYFMDTFSTTYAGQRIATMLMYLSDVEEGGETVFPNAKGNFSSVPWWNE 120
YE GQKYEPH+DYF D + G RIAT+LMYLS+VE+GGETVFP KG + + +
Sbjct: 148 YENGQKYEPHFDYFHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWKGKATQLK-DDS 206
Query: 121 LSDCGKGGLSIKPKMGNAILFWSMKPDATLDPSSLHGACPVIKGDKWLCAKWMHVGEFK 179
++C K G ++KP+ G+A+LF+++ P+AT D +SLHG+CPV++G+KW +W+HV F+
Sbjct: 207 WTECAKQGYAVKPRKGDALLFFNLHPNATTDSNSLHGSCPVVEGEKWSATRWIHVKSFE 265
>At5g18900 unknown protein
Length = 298
Score = 189 bits (479), Expect = 9e-49
Identities = 91/180 (50%), Positives = 125/180 (68%), Gaps = 2/180 (1%)
Query: 1 MHKSAVIDEETGNGVDSRERTSSGAFLKRGSDRIVKNIERRIADFTFIPVEHGENFNVLH 60
+ +SAV D ++G S RTSSG F+ +G D IV IE +I+ +TF+P E+GE+ VL
Sbjct: 69 LKRSAVADNDSGESKFSEVRTSSGTFISKGKDPIVSGIEDKISTWTFLPKENGEDIQVLR 128
Query: 61 YEVGQKYEPHYDYFMDTFSTTYAGQRIATMLMYLSDVEEGGETVFPNAKGNFSSVPWWN- 119
YE GQKY+ H+DYF D + G R+AT+LMYLS+V +GGETVFP+A+ V N
Sbjct: 129 YEHGQKYDAHFDYFHDKVNIVRGGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSENK 188
Query: 120 -ELSDCGKGGLSIKPKMGNAILFWSMKPDATLDPSSLHGACPVIKGDKWLCAKWMHVGEF 178
+LSDC K G+++KP+ G+A+LF+++ PDA DP SLHG CPVI+G+KW KW+HV F
Sbjct: 189 EDLSDCAKRGIAVKPRKGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKWSATKWIHVDSF 248
>At3g06300 unknown protein
Length = 299
Score = 187 bits (476), Expect = 2e-48
Identities = 90/180 (50%), Positives = 124/180 (68%), Gaps = 2/180 (1%)
Query: 1 MHKSAVIDEETGNGVDSRERTSSGAFLKRGSDRIVKNIERRIADFTFIPVEHGENFNVLH 60
+ +SAV D + G S RTSSG F+ +G D IV IE +++ +TF+P E+GE+ VL
Sbjct: 70 LQRSAVADNDNGESQVSDVRTSSGTFISKGKDPIVSGIEDKLSTWTFLPKENGEDLQVLR 129
Query: 61 YEVGQKYEPHYDYFMDTFSTTYAGQRIATMLMYLSDVEEGGETVFPNAK--GNFSSVPWW 118
YE GQKY+ H+DYF D + G RIAT+L+YLS+V +GGETVFP+A+ S
Sbjct: 130 YEHGQKYDAHFDYFHDKVNIARGGHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENK 189
Query: 119 NELSDCGKGGLSIKPKMGNAILFWSMKPDATLDPSSLHGACPVIKGDKWLCAKWMHVGEF 178
++LSDC K G+++KPK GNA+LF++++ DA DP SLHG CPVI+G+KW KW+HV F
Sbjct: 190 DDLSDCAKKGIAVKPKKGNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIHVDSF 249
>At4g35820 putative protein
Length = 272
Score = 162 bits (411), Expect = 7e-41
Identities = 85/157 (54%), Positives = 109/157 (69%), Gaps = 22/157 (14%)
Query: 1 MHKSAVIDEETGNGVDSRERTSSGAFLKRGSDRIVKNIERRIADFTFIPVEHGENFNVLH 60
M +S V + TG G +S RTSSG F++ G D+IVK IE+RI++FTFIP E+GE V++
Sbjct: 128 MARSKVRNALTGLGEESSSRTSSGTFIRSGHDKIVKEIEKRISEFTFIPQENGETLQVIN 187
Query: 61 YEVGQKYEPHYDYFMDTFSTTYAGQRIATMLMYLSDVEEGGETVFPNAKGNFSSVPWWNE 120
YEVGQK+EPH+D F QRIAT+LMYLSDV++GGETVFP AKG S
Sbjct: 188 YEVGQKFEPHFDGF----------QRIATVLMYLSDVDKGGETVFPEAKGIKS------- 230
Query: 121 LSDCGKGGLSIKPKMGNAILFWSMKPDATLDPSSLHG 157
K G+S++PK G+A+LFWSM+PD + DPSS HG
Sbjct: 231 -----KKGVSVRPKKGDALLFWSMRPDGSRDPSSKHG 262
>At2g43080 unknown protein
Length = 283
Score = 161 bits (408), Expect = 2e-40
Identities = 88/177 (49%), Positives = 108/177 (60%), Gaps = 18/177 (10%)
Query: 4 SAVIDEETGNGVDSRERTSSGAFLKR--GSDRIVKNIERRIADFTFIPVEHGENFNVLHY 61
S V+D +TG GV S RTSSG FL S I++ IE+RIA F+ +P E+GE VL Y
Sbjct: 112 STVVDVKTGKGVKSDVRTSSGMFLTHVERSYPIIQAIEKRIAVFSQVPAENGELIQVLRY 171
Query: 62 EVGQKYEPHYDYFMDTFSTTYAGQRIATMLMYLSDVEEGGETVFPNAKGNFSSVPWWNEL 121
E Q Y+PH+DYF DTF+ GQR+ATMLMYL+D EGGET FP A
Sbjct: 172 EPQQFYKPHHDYFADTFNLKRGGQRVATMLMYLTDDVEGGETYFPLAGD----------- 220
Query: 122 SDCGKG-----GLSIKPKMGNAILFWSMKPDATLDPSSLHGACPVIKGDKWLCAKWM 173
DC G G+S+KP G+A+LFWSM D DP S+HG C V+ G+KW KWM
Sbjct: 221 GDCTCGGKIMKGISVKPTKGDAVLFWSMGLDGQSDPRSIHGGCEVLSGEKWSATKWM 277
>At4g33910 unknown protein
Length = 288
Score = 150 bits (380), Expect = 3e-37
Identities = 74/156 (47%), Positives = 104/156 (66%), Gaps = 6/156 (3%)
Query: 20 RTSSGAFLKRGSDRI--VKNIERRIADFTFIPVEHGENFNVLHYEVGQKYEPHYDYFMDT 77
RTSSG F+ + + +ER+IA T IP HGE+FN+L YE+GQKY+ HYD F T
Sbjct: 130 RTSSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPT 189
Query: 78 FSTTYAGQRIATMLMYLSDVEEGGETVFPNAKGNFSSVPWWNELSDCGKGGLSIKPKMGN 137
+ QRIA+ L+YLSDVEEGGET+FP G+ + + + C GL +KP+ G+
Sbjct: 190 EYGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGIGY--DYKQC--IGLKVKPRKGD 245
Query: 138 AILFWSMKPDATLDPSSLHGACPVIKGDKWLCAKWM 173
+LF+S+ P+ T+D +SLHG+CPV KG+KW+ KW+
Sbjct: 246 GLLFYSVFPNGTIDQTSLHGSCPVTKGEKWVATKWI 281
>At3g28490 prolyl 4-hydroxylase, putative
Length = 213
Score = 125 bits (314), Expect = 1e-29
Identities = 66/133 (49%), Positives = 87/133 (64%), Gaps = 2/133 (1%)
Query: 1 MHKSAVI-DEETGNGVDSRERTSSGAFLKRGSDRIVKNIERRIADFTFIPVEHGENFNVL 59
+ KS V+ D ++G DS RTSSG FL + D IV N+E ++A +TF+P E+GE +L
Sbjct: 64 LEKSMVVADVDSGESEDSEVRTSSGMFLTKRQDDIVANVEAKLAAWTFLPEENGEALQIL 123
Query: 60 HYEVGQKYEPHYDYFMDTFSTTYAGQRIATMLMYLSDVEEGGETVFPNAKGNFSSVPWWN 119
HYE GQKY+PH+DYF D + G RIAT+LMYLS+V +GGETVFPN KG + +
Sbjct: 124 HYENGQKYDPHFDYFYDKKALELGGHRIATVLMYLSNVTKGGETVFPNWKGKTPQLK-DD 182
Query: 120 ELSDCGKGGLSIK 132
S C K G + K
Sbjct: 183 SWSKCAKQGYAGK 195
>At4g25600 unknown protein
Length = 291
Score = 88.6 bits (218), Expect = 2e-18
Identities = 55/178 (30%), Positives = 96/178 (53%), Gaps = 13/178 (7%)
Query: 1 MHKSAVIDEETGNGVDSRERT----SSGAFLKRGSDRIVKNIERRIADFTFIPVEHGENF 56
+++ + +EE + + R+ T S A K D +V IE +++ +TF+P E+G +
Sbjct: 70 LYRGFLSEEECDHLISLRKETTEVYSVDADGKTQLDPVVAGIEEKVSAWTFLPGENGGSI 129
Query: 57 NVLHYEVGQKYEPHYDYFMDTFSTTYAGQRIATMLMYLSDVEEGGETVFPNAKGNFSSVP 116
V Y +K DYF + S+ +AT+++YLS+ +GGE +FPN++
Sbjct: 130 KVRSY-TSEKSGKKLDYFGEEPSSVLHESLLATVVLYLSNTTQGGELLFPNSE------- 181
Query: 117 WWNELSDCGKGGLSIKPKMGNAILFWSMKPDATLDPSSLHGACPVIKGDKWLCAKWMH 174
+ C +GG ++P GNAILF++ +A+LD S H CPV+KG+ + K ++
Sbjct: 182 -MKPKNSCLEGGNILRPVKGNAILFFTRLLNASLDGKSTHLRCPVVKGELLVATKLIY 238
>At2g23100 hypothetical protein
Length = 1036
Score = 45.8 bits (107), Expect = 1e-05
Identities = 21/54 (38%), Positives = 32/54 (58%)
Query: 34 IVKNIERRIADFTFIPVEHGENFNVLHYEVGQKYEPHYDYFMDTFSTTYAGQRI 87
++ IE +IA T P ++ E+FN+L Y++GQKY+ HYD F QR+
Sbjct: 861 VLAAIEEKIALATRFPKDYYESFNILRYQLGQKYDSHYDAFHSAEYGPLISQRV 914
>At3g20810 unknown protein
Length = 418
Score = 29.6 bits (65), Expect = 0.91
Identities = 26/89 (29%), Positives = 37/89 (41%), Gaps = 19/89 (21%)
Query: 46 TFIPVEHGENFNVLHYEVGQKYEPHYDYFMDTFSTTYAGQRIATMLMYLS--DVEEGGET 103
T P+ H + N+L VG+KY Y F+ Y+ TML S D++ ET
Sbjct: 310 TVTPLHHDPHHNILAQVVGKKYIRLYPSFLQDELYPYS----ETMLCNSSQVDLDNIDET 365
Query: 104 VFPNA-----------KGNFSSVP--WWN 119
FP A +G +P WW+
Sbjct: 366 EFPKAMELEFMDCILEEGEMLYIPPKWWH 394
>At4g10320 isoleucine-tRNA ligase - like protein
Length = 1190
Score = 28.1 bits (61), Expect = 2.7
Identities = 11/20 (55%), Positives = 14/20 (70%)
Query: 63 VGQKYEPHYDYFMDTFSTTY 82
VG+KYEP +DYF D S +
Sbjct: 316 VGKKYEPLFDYFSDFSSEAF 335
>At3g57430 putative protein
Length = 803
Score = 28.1 bits (61), Expect = 2.7
Identities = 12/22 (54%), Positives = 16/22 (72%)
Query: 140 LFWSMKPDATLDPSSLHGACPV 161
+F+ MKPD ++PSS H AC V
Sbjct: 553 IFYVMKPDYGVEPSSDHYACVV 574
>At3g04550 unknown protein
Length = 449
Score = 27.7 bits (60), Expect = 3.5
Identities = 11/36 (30%), Positives = 20/36 (55%)
Query: 115 VPWWNELSDCGKGGLSIKPKMGNAILFWSMKPDATL 150
+P WN ++ GKGG+++ + +L W K + L
Sbjct: 347 LPSWNPVAAIGKGGVAVSFRDDRKVLPWDGKEEPLL 382
>At1g48660 hypothetical protein
Length = 573
Score = 27.7 bits (60), Expect = 3.5
Identities = 15/43 (34%), Positives = 24/43 (54%), Gaps = 4/43 (9%)
Query: 42 IADFTFIPVEHGENFNVLHYEVGQKYEPHYDYFMDTFSTTYAG 84
I+ F F+P++H E+ N + VG K +Y +T T+Y G
Sbjct: 347 ISYFEFLPIDHEEDMNTIVDLVGVKLGCYY----ETVVTSYFG 385
Database: ara_mips
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,978,382
Number of sequences in database: 6832
Database: /data/blast2/ara_mips_chr2
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,737,135
Number of sequences in database: 4184
Database: /data/blast2/ara_mips_chr3
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,236,886
Number of sequences in database: 5377
Database: /data/blast2/ara_mips_chr4
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,748,816
Number of sequences in database: 4030
Database: /data/blast2/ara_mips_chr5
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,569,679
Number of sequences in database: 6098
Database: /data/blast2/ara_mips_chl
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 25,951
Number of sequences in database: 85
Database: /data/blast2/ara_mips_mit
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 21,747
Number of sequences in database: 113
Lambda K H
0.318 0.137 0.431
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,601,398
Number of Sequences: 26719
Number of extensions: 202053
Number of successful extensions: 407
Number of sequences better than 10.0: 18
Number of HSP's better than 10.0 without gapping: 17
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 379
Number of HSP's gapped (non-prelim): 19
length of query: 180
length of database: 11,318,596
effective HSP length: 93
effective length of query: 87
effective length of database: 8,833,729
effective search space: 768534423
effective search space used: 768534423
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 57 (26.6 bits)
Medicago: description of AC149547.9