
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC146553.10 - phase: 0
(301 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
TC78028 similar to GP|22136524|gb|AAM91340.1 unknown protein {Ar... 616 e-177
TC78029 similar to GP|21593296|gb|AAM65245.1 prolyl 4-hydroxylas... 491 e-139
TC86903 similar to GP|21617881|gb|AAM66931.1 prolyl 4-hydroxylas... 346 5e-96
TC86651 similar to PIR|F84555|F84555 similar to prolyl 4-hydroxy... 248 2e-66
TC79830 similar to PIR|G84861|G84861 hypothetical protein At2g43... 179 2e-45
BG448236 similar to GP|21537370|gb putative prolyl 4-hydroxylase... 170 5e-43
TC83824 similar to GP|17381226|gb|AAL36425.1 unknown protein {Ar... 106 1e-23
AJ388831 weakly similar to GP|10177121|dbj prolyl 4-hydroxylase ... 91 4e-19
BF518749 81 4e-16
BG447864 weakly similar to GP|10177121|db prolyl 4-hydroxylase ... 77 1e-14
BF521324 similar to GP|18086437|gb| AT3g28480/MFJ20_16 {Arabidop... 71 4e-13
BQ144147 67 1e-11
TC84014 similar to GP|21593296|gb|AAM65245.1 prolyl 4-hydroxylas... 42 4e-04
TC84326 weakly similar to PIR|F85225|F85225 hypothetical protein... 29 2.5
TC88383 similar to GP|21592515|gb|AAM64465.1 nuclear receptor bi... 27 9.6
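The summary lines above can be read programmatically. The following is a minimal sketch, not part of the BLAST output, assuming each line consists of a sequence identifier, an optional (possibly truncated) description, a bit score, and an E-value separated by whitespace. The names `SUMMARY_RE` and `parse_summary_line` are hypothetical helpers introduced for illustration. E-values printed without a mantissa, such as `e-177`, are read as `1e-177`.

```python
import re

# Hypothetical pattern: identifier, optional description, bit score, E-value.
# Assumes the last two whitespace-separated fields are the score and the E-value.
SUMMARY_RE = re.compile(r"^(\S+)\s+(.*?)\s*(\d+(?:\.\d+)?)\s+(\S+)$")


def parse_summary_line(line):
    """Split one hit-summary line into (id, description, bits, evalue).

    Returns None when the line does not look like a summary line.
    """
    match = SUMMARY_RE.match(line.strip())
    if match is None:
        return None
    seq_id, description, bits, evalue = match.groups()
    # This BLAST version prints very small E-values without a mantissa.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return seq_id, description, float(bits), float(evalue)


if __name__ == "__main__":
    for text in (
        "TC86903 similar to GP|21617881|gb|AAM66931.1 prolyl 4-hydroxylas... 346 5e-96",
        "BF518749 81 4e-16",
    ):
        print(parse_summary_line(text))
```

Hits without an annotation (for example BF518749) yield an empty description string.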
>TC78028 similar to GP|22136524|gb|AAM91340.1 unknown protein {Arabidopsis
thaliana}, partial (86%)
Length = 1217
Score = 616 bits (1588), Expect = e-177
Identities = 299/303 (98%), Positives = 299/303 (98%), Gaps = 2/303 (0%)
Frame = +1
Query: 1 MSVICRVWCSIIVPLLLICKIHFALGSYAGTSAIIDPTKVKQVSWKPRAFVYKGFLTDLE 60
MSVICRVWCSIIVPLLLICKIHFALGSYAGTSAIIDPTKVKQVSWKPRAFVYKGFLTDLE
Sbjct: 10 MSVICRVWCSIIVPLLLICKIHFALGSYAGTSAIIDPTKVKQVSWKPRAFVYKGFLTDLE 189
Query: 61 CDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGMFISKNKDAIVSGIEDKISSWTFLP 120
CDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGMFISKNKDAIVSGIEDKISSWTFLP
Sbjct: 190 CDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGMFISKNKDAIVSGIEDKISSWTFLP 369
Query: 121 KENGEDIQVLRYEHGQKYDPHYDYFADKVNIARGGHRVATVLMYLTNVTKGGETVFPNAE 180
KENGEDIQVLRYEHGQKYDPHYDYFADKVNIARGGHRVATVLMYLTNVTKGGETVFPNAE
Sbjct: 370 KENGEDIQVLRYEHGQKYDPHYDYFADKVNIARGGHRVATVLMYLTNVTKGGETVFPNAE 549
Query: 181 --ESPRHKLSETDEDLSECGKKGVAVNPRRGDALLFCSLHPNAIPDTLSLHAGCPVIEGE 238
ESPRHKLSETDEDLSECGKKGVAV PRRGDALLF SLHPNAIPDTLSLHAGCPVIEGE
Sbjct: 550 LQESPRHKLSETDEDLSECGKKGVAVKPRRGDALLFFSLHPNAIPDTLSLHAGCPVIEGE 729
Query: 239 KWSATKWIHVDSFDKTVGAGGDCTDQHESCERWAALGECTKNPEYMVGTSGLPGYCRKSC 298
KWSATKWIHVDSFDKTVGAGGDCTDQHESCERWAALGECTKNPEYMVGTSGLPGYCRKSC
Sbjct: 730 KWSATKWIHVDSFDKTVGAGGDCTDQHESCERWAALGECTKNPEYMVGTSGLPGYCRKSC 909
Query: 299 KTC 301
KTC
Sbjct: 910 KTC 918
>TC78029 similar to GP|21593296|gb|AAM65245.1 prolyl 4-hydroxylase alpha
subunit-like protein {Arabidopsis thaliana}, partial
(89%)
Length = 1126
Score = 491 bits (1265), Expect = e-139
Identities = 238/301 (79%), Positives = 265/301 (87%), Gaps = 1/301 (0%)
Frame = +3
Query: 2 SVICRVWCSIIVPLLLICKIHFALGSYAGT-SAIIDPTKVKQVSWKPRAFVYKGFLTDLE 60
S + RV C ++ L+LI + SYAG+ S+II+P+KVKQ+SW PRAFVY+GFLTDLE
Sbjct: 105 SEMSRVLCFLLF-LVLIIQTDEVQSSYAGSASSIINPSKVKQISWIPRAFVYQGFLTDLE 281
Query: 61 CDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGMFISKNKDAIVSGIEDKISSWTFLP 120
CDHLIS+AKSELKRSAVADNLSG+S+LS+VRTSSGMFISKNKD IVSGIED+IS+WTFLP
Sbjct: 282 CDHLISLAKSELKRSAVADNLSGDSQLSDVRTSSGMFISKNKDPIVSGIEDRISAWTFLP 461
Query: 121 KENGEDIQVLRYEHGQKYDPHYDYFADKVNIARGGHRVATVLMYLTNVTKGGETVFPNAE 180
KENGEDIQVLRYEHGQKYDPHYDYFADKVNI +GGHR+ATVLMYLTNVTKGGETVFP AE
Sbjct: 462 KENGEDIQVLRYEHGQKYDPHYDYFADKVNIVQGGHRLATVLMYLTNVTKGGETVFPEAE 641
Query: 181 ESPRHKLSETDEDLSECGKKGVAVNPRRGDALLFCSLHPNAIPDTLSLHAGCPVIEGEKW 240
E PR + S+ DLSEC KKG+AV PRRGDALLF SL NAIPDT SLHAGCPV+EGEKW
Sbjct: 642 EPPRRRGSKKSSDLSECAKKGIAVKPRRGDALLFFSLDTNAIPDTNSLHAGCPVLEGEKW 821
Query: 241 SATKWIHVDSFDKTVGAGGDCTDQHESCERWAALGECTKNPEYMVGTSGLPGYCRKSCKT 300
SATKWIHVDSFDK VGAGG C+DQH+SCERWA+LGECT NP YMVG+S LPGYCRKSCK
Sbjct: 822 SATKWIHVDSFDKIVGAGGGCSDQHDSCERWASLGECTNNPVYMVGSSNLPGYCRKSCKA 1001
Query: 301 C 301
C
Sbjct: 1002 C 1004
>TC86903 similar to GP|21617881|gb|AAM66931.1 prolyl 4-hydroxylase putative
{Arabidopsis thaliana}, partial (68%)
Length = 1310
Score = 346 bits (888), Expect = 5e-96
Identities = 173/280 (61%), Positives = 207/280 (73%), Gaps = 4/280 (1%)
Frame = +3
Query: 26 GSYAGTSAIIDPTKVKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGES 85
GS G DPT+V Q+SW PRAF+YK FLTD ECDHLI ++K +L++S VADN SG+S
Sbjct: 264 GSVFGAKVKFDPTRVTQLSWSPRAFLYKNFLTDEECDHLIELSKDKLEKSMVADNESGKS 443
Query: 86 KLSEVRTSSGMFISKNKDAIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYF 145
SEVRTSSGMF++K +D IVSGIE +I++WTFLP ENGE +QVL Y +G+KY+PH+D+F
Sbjct: 444 IQSEVRTSSGMFLNKQQDEIVSGIEARIAAWTFLPVENGESMQVLHYMNGEKYEPHFDFF 623
Query: 146 ADKVNIARGGHRVATVLMYLTNVTKGGETVFPNAEESPRHKLSE-TDEDLSECGKKGVAV 204
DK N GGHRVATVLMYL+NV KGGET+FP+AE KLS+ DE SEC KG AV
Sbjct: 624 HDKANQRLGGHRVATVLMYLSNVEKGGETIFPHAE----GKLSQPKDESWSECAHKGYAV 791
Query: 205 NPRRGDALLFCSLHPNAIPDTLSLHAGCPVIEGEKWSATKWIHVDSFDKTVGAGGD---C 261
PR+GDALLF SLH +A D+ SLH CPVIEGEKWSATKWIHV F+K V + C
Sbjct: 792 KPRKGDALLFFSLHLDATTDSKSLHGSCPVIEGEKWSATKWIHVADFEKPVRQALEDRVC 971
Query: 262 TDQHESCERWAALGECTKNPEYMVGTSGLPGYCRKSCKTC 301
D++E+C RWA +GEC KNP YMVG G G C KSC C
Sbjct: 972 ADENENCARWAKVGECEKNPLYMVGKGG-NGKCMKSCNVC 1088
>TC86651 similar to PIR|F84555|F84555 similar to prolyl 4-hydroxylase alpha
subunit [imported] - Arabidopsis thaliana, partial (85%)
Length = 1329
Score = 248 bits (633), Expect = 2e-66
Identities = 120/212 (56%), Positives = 157/212 (73%)
Frame = +2
Query: 40 VKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGMFIS 99
V+ VSW+PRAFVY FLT EC++LI IAK + +S V D+ +G+SK S VRTSSG F++
Sbjct: 350 VEVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVVDSETGKSKDSRVRTSSGTFLA 529
Query: 100 KNKDAIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIARGGHRVA 159
+ +D IV IE KI+ +TF+P E+GE +QVL YE GQKY+PHYDYF D+ N GG R+A
Sbjct: 530 RGRDKIVRNIEKKIADFTFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDEFNTKNGGQRIA 709
Query: 160 TVLMYLTNVTKGGETVFPNAEESPRHKLSETDEDLSECGKKGVAVNPRRGDALLFCSLHP 219
TVLMYLT+V +GGETVFP A+ + +LS+CGKKG+++ P+RGDALLF S+ P
Sbjct: 710 TVLMYLTDVEEGGETVFPAAKGN--FSNVPWYNELSDCGKKGLSIKPKRGDALLFWSMKP 883
Query: 220 NAIPDTLSLHAGCPVIEGEKWSATKWIHVDSF 251
+A D SLH GCPVI+G KWS+TKWI V+ +
Sbjct: 884 DATLDASSLHGGCPVIKGNKWSSTKWIRVNEY 979
>TC79830 similar to PIR|G84861|G84861 hypothetical protein At2g43080
[imported] - Arabidopsis thaliana, partial (87%)
Length = 1130
Score = 179 bits (453), Expect = 2e-45
Identities = 98/213 (46%), Positives = 133/213 (62%), Gaps = 5/213 (2%)
Frame = +1
Query: 39 KVKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGMFI 98
K + +SW PR + FL+ ECD+L +A LK S V D +G+ S+VRTSSGMF+
Sbjct: 313 KPEVLSWSPRIILLHNFLSYEECDYLRGVALPRLKISTVVDANTGKGIKSDVRTSSGMFL 492
Query: 99 S--KNKDAIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIARGGH 156
S + K ++ IE +IS ++ +P ENGE +QVLRYE Q Y PH+DYF+D N+ RGG
Sbjct: 493 SHEERKYPMIHAIEKRISVYSQIPIENGELMQVLRYEKNQYYRPHHDYFSDTFNLKRGGQ 672
Query: 157 RVATVLMYLTNVTKGGETVFPNAEESPRHKLSETDEDLSECG---KKGVAVNPRRGDALL 213
R+AT+LMYL + +GGET FP+A D CG KG+ V P +G+A+L
Sbjct: 673 RIATMLMYLGDNVEGGETHFPSA-----------GSDECSCGGKLTKGLCVKPVKGNAVL 819
Query: 214 FCSLHPNAIPDTLSLHAGCPVIEGEKWSATKWI 246
F S+ + D S+H GCPV+ GEKWSATKW+
Sbjct: 820 FWSMGLDGQSDPDSVHGGCPVLAGEKWSATKWM 918
>BG448236 similar to GP|21537370|gb putative prolyl 4-hydroxylase alpha
subunit {Arabidopsis thaliana}, partial (52%)
Length = 682
Score = 170 bits (431), Expect = 5e-43
Identities = 80/137 (58%), Positives = 105/137 (76%)
Frame = +3
Query: 43 VSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGMFISKNK 102
+SW+PRAFVY FL+ EC+HLI++AK L +S+V D+ +G+S S VRTSSGMF+ + K
Sbjct: 210 LSWEPRAFVYHNFLSKEECEHLINLAKPFLAKSSVVDSKTGKSTESRVRTSSGMFLKRGK 389
Query: 103 DAIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIARGGHRVATVL 162
D I+ IE +I+ +TF+P ENGE +QVL Y G+KY+PHYDYF D+ N GG RVATVL
Sbjct: 390 DKIIQNIERRIADFTFIPVENGEGLQVLHYGVGEKYEPHYDYFLDEFNTKNGGQRVATVL 569
Query: 163 MYLTNVTKGGETVFPNA 179
MYL++V +GGETVFP A
Sbjct: 570 MYLSDVEEGGETVFPAA 620
>TC83824 similar to GP|17381226|gb|AAL36425.1 unknown protein {Arabidopsis
thaliana}, partial (43%)
Length = 680
Score = 106 bits (265), Expect = 1e-23
Identities = 60/124 (48%), Positives = 74/124 (59%)
Frame = +3
Query: 123 NGEDIQVLRYEHGQKYDPHYDYFADKVNIARGGHRVATVLMYLTNVTKGGETVFPNAEES 182
+GE +LRYE GQ+Y+ HYD F + RVA+ L+YLT+V +GGET+FP E
Sbjct: 3 HGEAFNILRYEVGQRYNSHYDAFNPDEYGPQKSQRVASFLLYLTDVEEGGETMFP-FENG 179
Query: 183 PRHKLSETDEDLSECGKKGVAVNPRRGDALLFCSLHPNAIPDTLSLHAGCPVIEGEKWSA 242
+ ED G+ V PR+GD LLF SL PN D SLH CPVI+GEKW A
Sbjct: 180 LNMDGTYGYEDCV-----GLRVKPRQGDGLLFYSLLPNGTIDQTSLHGSCPVIKGEKWVA 344
Query: 243 TKWI 246
TKWI
Sbjct: 345 TKWI 356
>AJ388831 weakly similar to GP|10177121|dbj prolyl 4-hydroxylase alpha
subunit-like protein {Arabidopsis thaliana}, partial
(33%)
Length = 505
Score = 91.3 bits (225), Expect = 4e-19
Identities = 44/91 (48%), Positives = 61/91 (66%)
Frame = +3
Query: 40 VKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGMFIS 99
V+ +SW+PRAF+Y FLT EC+HLI+IAK + +SAV D +G S RTSSG F+
Sbjct: 195 VQIISWEPRAFLYHNFLTKEECEHLINIAKPSMHKSAVIDEETGNGVDSSERTSSGAFLK 374
Query: 100 KNKDAIVSGIEDKISSWTFLPKENGEDIQVL 130
+ D IV IE +I+ +TF+P E+GE+ L
Sbjct: 375 RGSDRIVKNIERRIADFTFIPXEHGENFNGL 467
>BF518749
Length = 416
Score = 81.3 bits (199), Expect = 4e-16
Identities = 45/96 (46%), Positives = 62/96 (63%)
Frame = +1
Query: 35 IDPTKVKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESKLSEVRTSS 94
IDP++V Q+SW+PR F+YKGFL+D ECD+LIS+A+ + + G SK E TS
Sbjct: 163 IDPSRVVQISWQPRVFLYKGFLSDKECDYLISLAQEK------SSGNGGYSKKEE--TSL 318
Query: 95 GMFISKNKDAIVSGIEDKISSWTFLPKENGEDIQVL 130
M D IV IE+++S WTFL KEN + +QV+
Sbjct: 319 DM-----DDDIVKRIEERLSVWTFLSKENSKPLQVM 411
>BG447864 weakly similar to GP|10177121|db prolyl 4-hydroxylase alpha
subunit-like protein {Arabidopsis thaliana}, partial
(50%)
Length = 639
Score = 76.6 bits (187), Expect = 1e-14
Identities = 37/78 (47%), Positives = 52/78 (66%)
Frame = +3
Query: 40 VKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGMFIS 99
V+ VSW+P AFVY FLT EC++LI I K + +S V D+ +G+SK S VRT SG F++
Sbjct: 405 VEVVSWEPXAFVYHNFLTKEECEYLIDIXKPSMHKSTVVDSETGKSKDSXVRTXSGTFLA 584
Query: 100 KNKDAIVSGIEDKISSWT 117
+ +D I I KI+ +T
Sbjct: 585 RGRDXIXRNIXKKIADFT 638
>BF521324 similar to GP|18086437|gb| AT3g28480/MFJ20_16 {Arabidopsis
thaliana}, partial (15%)
Length = 285
Score = 71.2 bits (173), Expect = 4e-13
Identities = 32/57 (56%), Positives = 39/57 (68%)
Frame = +1
Query: 26 GSYAGTSAIIDPTKVKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLS 82
GS G DPT+V Q+SW PRAF+Y FLTD ECDHLI ++K L++S ADN S
Sbjct: 115 GSVFGAKVKFDPTRVTQLSWSPRAFLYNNFLTDEECDHLIELSKDNLEKSMAADNES 285
>BQ144147
Length = 729
Score = 66.6 bits (161), Expect = 1e-11
Identities = 29/60 (48%), Positives = 41/60 (68%)
Frame = +2
Query: 117 TFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIARGGHRVATVLMYLTNVTKGGETVF 176
T LP ENG+++ + RY+HG ++P+ +Y K+NI +GGHR TVLM +TN TKG F
Sbjct: 92 TSLPIENGDNLHIWRYKHGHNHNPNDNYSTHKINIVQGGHRPPTVLMLITNETKGNRN*F 271
>TC84014 similar to GP|21593296|gb|AAM65245.1 prolyl 4-hydroxylase alpha
subunit-like protein {Arabidopsis thaliana}, partial
(9%)
Length = 653
Score = 41.6 bits (96), Expect = 4e-04
Identities = 20/20 (100%), Positives = 20/20 (100%)
Frame = +2
Query: 83 GESKLSEVRTSSGMFISKNK 102
GESKLSEVRTSSGMFISKNK
Sbjct: 23 GESKLSEVRTSSGMFISKNK 82
>TC84326 weakly similar to PIR|F85225|F85225 hypothetical protein AT4g19900
[imported] - Arabidopsis thaliana, partial (18%)
Length = 756
Score = 28.9 bits (63), Expect = 2.5
Identities = 12/43 (27%), Positives = 22/43 (50%)
Frame = -3
Query: 4 ICRVWCSIIVPLLLICKIHFALGSYAGTSAIIDPTKVKQVSWK 46
+C+ WCS+++P L + S A ++ P +VSW+
Sbjct: 112 LCKCWCSVLIPAL-------SFDSVASLLGLVCPLYNTRVSWQ 5
>TC88383 similar to GP|21592515|gb|AAM64465.1 nuclear receptor binding
factor-like protein {Arabidopsis thaliana}, partial
(90%)
Length = 1313
Score = 26.9 bits (58), Expect = 9.6
Identities = 12/26 (46%), Positives = 17/26 (65%)
Frame = -3
Query: 210 DALLFCSLHPNAIPDTLSLHAGCPVI 235
+A FC L+ DTL+ HAGCP++
Sbjct: 624 NATRFCKLN-----DTLTNHAGCPIL 562
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.318 0.135 0.422
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 11,306,288
Number of Sequences: 36976
Number of extensions: 170929
Number of successful extensions: 666
Number of sequences better than 10.0: 30
Number of HSP's better than 10.0 without gapping: 658
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 658
length of query: 301
length of database: 9,014,727
effective HSP length: 96
effective length of query: 205
effective length of database: 5,465,031
effective search space: 1120331355
effective search space used: 1120331355
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 58 (26.9 bits)
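The statistics above can be cross-checked with a short calculation. This is a sketch under stated assumptions, not part of the BLAST output: it uses the standard Karlin-Altschul conversion from raw score S to bit score, S' = (Lambda * S - ln K) / ln 2, and assumes that the effective search space is the product of the query and database lengths after subtracting the effective HSP length (once from the query, once per database sequence). The function names are hypothetical. The E-values are not reproduced here, because the report's values may include further length adjustments.

```python
import math


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def effective_search_space(query_len, db_len, n_seqs, hsp_len):
    """Product of effective query length and effective database length.

    Assumes the effective HSP length is subtracted once from the query
    and once from every database sequence.
    """
    return (query_len - hsp_len) * (db_len - n_seqs * hsp_len)


if __name__ == "__main__":
    # Gapped parameters from the report: Lambda = 0.267, K = 0.0410.
    print(round(bit_score(1588, 0.267, 0.0410)))   # top hit, reported as 616 bits
    # Ungapped parameters from the report: Lambda = 0.318, K = 0.135.
    print(bit_score(41, 0.318, 0.135))             # S1, reported as 21.7 bits
    # Lengths from the report; 9,014,727 is the database length in residues.
    print(effective_search_space(301, 9014727, 36976, 96))
```

With the values listed in the report, the last call gives 1120331355, matching the "effective search space" line. The database length of 9,014,727 equals the 27,044,181 nucleotide letters divided by three, which is consistent with a translated (TBLASTN) search.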