
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC149471.21 - phase: 0
(281 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC79830 similar to PIR|G84861|G84861 hypothetical protein At2g43... 579 e-166
TC86651 similar to PIR|F84555|F84555 similar to prolyl 4-hydroxy... 217 5e-57
TC86903 similar to GP|21617881|gb|AAM66931.1 prolyl 4-hydroxylas... 191 2e-49
TC78029 similar to GP|21593296|gb|AAM65245.1 prolyl 4-hydroxylas... 189 1e-48
TC78028 similar to GP|22136524|gb|AAM91340.1 unknown protein {Ar... 181 3e-46
BG448236 similar to GP|21537370|gb putative prolyl 4-hydroxylase... 158 3e-39
TC83824 similar to GP|17381226|gb|AAL36425.1 unknown protein {Ar... 98 3e-21
AJ388831 weakly similar to GP|10177121|dbj prolyl 4-hydroxylase ... 79 1e-15
BG447864 weakly similar to GP|10177121|db prolyl 4-hydroxylase ... 67 1e-11
BQ144147 46 2e-05
BF518749 44 5e-05
BF521324 similar to GP|18086437|gb| AT3g28480/MFJ20_16 {Arabidop... 39 0.003
BE941277 29 1.8
TC79352 weakly similar to PIR|T01829|T01829 hypothetical protein... 28 3.9
TC89186 similar to PIR|G96563|G96563 probable coatomer complex s... 28 5.1
TC85105 similar to GP|20160725|dbj|BAB89667. putative transaldol... 27 6.7
>TC79830 similar to PIR|G84861|G84861 hypothetical protein At2g43080
[imported] - Arabidopsis thaliana, partial (87%)
Length = 1130
Score = 579 bits (1492), Expect = e-166
Identities = 281/281 (100%), Positives = 281/281 (100%)
Frame = +1
Query: 1 MSPPAMKIVFGLLTFVTIGMIIGALSQLAFIRRLELEEPFTTTTTTRSLLPRGYTYWNNN 60
MSPPAMKIVFGLLTFVTIGMIIGALSQLAFIRRLELEEPFTTTTTTRSLLPRGYTYWNNN
Sbjct: 94 MSPPAMKIVFGLLTFVTIGMIIGALSQLAFIRRLELEEPFTTTTTTRSLLPRGYTYWNNN 273
Query: 61 NDKEAQILRLGYVKPEVLSWSPRIILLHNFLSYEECDYLRGVALPRLKISTVVDANTGKG 120
NDKEAQILRLGYVKPEVLSWSPRIILLHNFLSYEECDYLRGVALPRLKISTVVDANTGKG
Sbjct: 274 NDKEAQILRLGYVKPEVLSWSPRIILLHNFLSYEECDYLRGVALPRLKISTVVDANTGKG 453
Query: 121 IKSDVRTSSGMFLSHEERKYPMIHAIEKRISVYSQIPIENGELMQVLRYEKNQYYRPHHD 180
IKSDVRTSSGMFLSHEERKYPMIHAIEKRISVYSQIPIENGELMQVLRYEKNQYYRPHHD
Sbjct: 454 IKSDVRTSSGMFLSHEERKYPMIHAIEKRISVYSQIPIENGELMQVLRYEKNQYYRPHHD 633
Query: 181 YFSDTFNLKRGGQRIATMLMYLGDNVEGGETHFPSAGSDECSCGGKLTKGLCVKPVKGNA 240
YFSDTFNLKRGGQRIATMLMYLGDNVEGGETHFPSAGSDECSCGGKLTKGLCVKPVKGNA
Sbjct: 634 YFSDTFNLKRGGQRIATMLMYLGDNVEGGETHFPSAGSDECSCGGKLTKGLCVKPVKGNA 813
Query: 241 VLFWSMGLDGQSDPDSVHGGCPVLAGEKWSATKWMRQSVHV 281
VLFWSMGLDGQSDPDSVHGGCPVLAGEKWSATKWMRQSVHV
Sbjct: 814 VLFWSMGLDGQSDPDSVHGGCPVLAGEKWSATKWMRQSVHV 936
>TC86651 similar to PIR|F84555|F84555 similar to prolyl 4-hydroxylase alpha
subunit [imported] - Arabidopsis thaliana, partial (85%)
Length = 1329
Score = 217 bits (552), Expect = 5e-57
Identities = 117/227 (51%), Positives = 150/227 (65%), Gaps = 8/227 (3%)
Frame = +2
Query: 58 NNNNDKEAQILRLGYVKPEVLSWSPRIILLHNFLSYEECDYLRGVALPRLKISTVVDANT 117
+ N+D+E + G EV+SW PR + HNFL+ EEC+YL +A P + STVVD+ T
Sbjct: 311 DRNDDEEGK----GEQWVEVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVVDSET 478
Query: 118 GKGIKSDVRTSSGMFLSHEERKYPMIHAIEKRISVYSQIPIENGELMQVLRYEKNQYYRP 177
GK S VRTSSG FL+ K ++ IEK+I+ ++ IP+E+GE +QVL YE Q Y P
Sbjct: 479 GKSKDSRVRTSSGTFLARGRDK--IVRNIEKKIADFTFIPVEHGEGLQVLHYEVGQKYEP 652
Query: 178 HHDYFSDTFNLKRGGQRIATMLMYLGDNVEGGETHFPSAGS--------DECSCGGKLTK 229
H+DYF D FN K GGQRIAT+LMYL D EGGET FP+A +E S GK K
Sbjct: 653 HYDYFLDEFNTKNGGQRIATVLMYLTDVEEGGETVFPAAKGNFSNVPWYNELSDCGK--K 826
Query: 230 GLCVKPVKGNAVLFWSMGLDGQSDPDSVHGGCPVLAGEKWSATKWMR 276
GL +KP +G+A+LFWSM D D S+HGGCPV+ G KWS+TKW+R
Sbjct: 827 GLSIKPKRGDALLFWSMKPDATLDASSLHGGCPVIKGNKWSSTKWIR 967
>TC86903 similar to GP|21617881|gb|AAM66931.1 prolyl 4-hydroxylase putative
{Arabidopsis thaliana}, partial (68%)
Length = 1310
Score = 191 bits (486), Expect = 2e-49
Identities = 101/204 (49%), Positives = 136/204 (66%), Gaps = 6/204 (2%)
Frame = +3
Query: 78 LSWSPRIILLHNFLSYEECDYLRGVALPRLKISTVVDANTGKGIKSDVRTSSGMFLSHEE 137
LSWSPR L NFL+ EECD+L ++ +L+ S V D +GK I+S+VRTSSGMFL+ ++
Sbjct: 315 LSWSPRAFLYKNFLTDEECDHLIELSKDKLEKSMVADNESGKSIQSEVRTSSGMFLNKQQ 494
Query: 138 RKYPMIHAIEKRISVYSQIPIENGELMQVLRYEKNQYYRPHHDYFSDTFNLKRGGQRIAT 197
+ ++ IE RI+ ++ +P+ENGE MQVL Y + Y PH D+F D N + GG R+AT
Sbjct: 495 DE--IVSGIEARIAAWTFLPVENGESMQVLHYMNGEKYEPHFDFFHDKANQRLGGHRVAT 668
Query: 198 MLMYLGDNVEGGETHFP------SAGSDECSCGGKLTKGLCVKPVKGNAVLFWSMGLDGQ 251
+LMYL + +GGET FP S DE S KG VKP KG+A+LF+S+ LD
Sbjct: 669 VLMYLSNVEKGGETIFPHAEGKLSQPKDE-SWSECAHKGYAVKPRKGDALLFFSLHLDAT 845
Query: 252 SDPDSVHGGCPVLAGEKWSATKWM 275
+D S+HG CPV+ GEKWSATKW+
Sbjct: 846 TDSKSLHGSCPVIEGEKWSATKWI 917
>TC78029 similar to GP|21593296|gb|AAM65245.1 prolyl 4-hydroxylase alpha
subunit-like protein {Arabidopsis thaliana}, partial
(89%)
Length = 1126
Score = 189 bits (479), Expect = 1e-48
Identities = 101/210 (48%), Positives = 135/210 (64%), Gaps = 8/210 (3%)
Frame = +3
Query: 74 KPEVLSWSPRIILLHNFLSYEECDYLRGVALPRLKISTVVDANTGKGIKSDVRTSSGMFL 133
K + +SW PR + FL+ ECD+L +A LK S V D +G SDVRTSSGMF+
Sbjct: 216 KVKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSDVRTSSGMFI 395
Query: 134 SHEERKYPMIHAIEKRISVYSQIPIENGELMQVLRYEKNQYYRPHHDYFSDTFNLKRGGQ 193
S + K P++ IE RIS ++ +P ENGE +QVLRYE Q Y PH+DYF+D N+ +GG
Sbjct: 396 S--KNKDPIVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIVQGGH 569
Query: 194 RIATMLMYLGDNVEGGETHFPSAGSDECSCGGKLT--------KGLCVKPVKGNAVLFWS 245
R+AT+LMYL + +GGET FP A G K + KG+ VKP +G+A+LF+S
Sbjct: 570 RLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECAKKGIAVKPRRGDALLFFS 749
Query: 246 MGLDGQSDPDSVHGGCPVLAGEKWSATKWM 275
+ + D +S+H GCPVL GEKWSATKW+
Sbjct: 750 LDTNAIPDTNSLHAGCPVLEGEKWSATKWI 839
>TC78028 similar to GP|22136524|gb|AAM91340.1 unknown protein {Arabidopsis
thaliana}, partial (86%)
Length = 1217
Score = 181 bits (459), Expect = 3e-46
Identities = 99/215 (46%), Positives = 135/215 (62%), Gaps = 13/215 (6%)
Frame = +1
Query: 74 KPEVLSWSPRIILLHNFLSYEECDYLRGVALPRLKISTVVDANTGKGIKSDVRTSSGMFL 133
K + +SW PR + FL+ ECD+L +A LK S V D +G+ S+VRTSSGMF+
Sbjct: 124 KVKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGMFI 303
Query: 134 SHEERKYPMIHAIEKRISVYSQIPIENGELMQVLRYEKNQYYRPHHDYFSDTFNLKRGGQ 193
S + K ++ IE +IS ++ +P ENGE +QVLRYE Q Y PH+DYF+D N+ RGG
Sbjct: 304 S--KNKDAIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIARGGH 477
Query: 194 RIATMLMYLGDNVEGGETHFPSA-------------GSDECSCGGKLTKGLCVKPVKGNA 240
R+AT+LMYL + +GGET FP+A D CG KG+ VKP +G+A
Sbjct: 478 RVATVLMYLTNVTKGGETVFPNAELQESPRHKLSETDEDLSECG---KKGVAVKPRRGDA 648
Query: 241 VLFWSMGLDGQSDPDSVHGGCPVLAGEKWSATKWM 275
+LF+S+ + D S+H GCPV+ GEKWSATKW+
Sbjct: 649 LLFFSLHPNAIPDTLSLHAGCPVIEGEKWSATKWI 753
>BG448236 similar to GP|21537370|gb putative prolyl 4-hydroxylase alpha
subunit {Arabidopsis thaliana}, partial (52%)
Length = 682
Score = 158 bits (399), Expect = 3e-39
Identities = 92/208 (44%), Positives = 125/208 (59%)
Frame = +3
Query: 12 LLTFVTIGMIIGALSQLAFIRRLELEEPFTTTTTTRSLLPRGYTYWNNNNDKEAQILRLG 71
LLT + ++I L L + + TT SL+ + + +K+ Q
Sbjct: 57 LLTLFMLTLVIIVLLALGIL--------YLPNTTDDSLITDRRKIYESLAEKKEQWT--- 203
Query: 72 YVKPEVLSWSPRIILLHNFLSYEECDYLRGVALPRLKISTVVDANTGKGIKSDVRTSSGM 131
E+LSW PR + HNFLS EEC++L +A P L S+VVD+ TGK +S VRTSSGM
Sbjct: 204 ----EILSWEPRAFVYHNFLSKEECEHLINLAKPFLAKSSVVDSKTGKSTESRVRTSSGM 371
Query: 132 FLSHEERKYPMIHAIEKRISVYSQIPIENGELMQVLRYEKNQYYRPHHDYFSDTFNLKRG 191
FL + K +I IE+RI+ ++ IP+ENGE +QVL Y + Y PH+DYF D FN K G
Sbjct: 372 FLKRGKDK--IIQNIERRIADFTFIPVENGEGLQVLHYGVGEKYEPHYDYFLDEFNTKNG 545
Query: 192 GQRIATMLMYLGDNVEGGETHFPSAGSD 219
GQR+AT+LMYL D EGGET FP+A ++
Sbjct: 546 GQRVATVLMYLSDVEEGGETVFPAAXAN 629
>TC83824 similar to GP|17381226|gb|AAL36425.1 unknown protein {Arabidopsis
thaliana}, partial (43%)
Length = 680
Score = 98.2 bits (243), Expect = 3e-21
Identities = 52/119 (43%), Positives = 73/119 (60%), Gaps = 2/119 (1%)
Frame = +3
Query: 160 NGELMQVLRYEKNQYYRPHHDYFSDTFNLKRGGQRIATMLMYLGDNVEGGETHFPSAG-- 217
+GE +LRYE Q Y H+D F+ + QR+A+ L+YL D EGGET FP
Sbjct: 3 HGEAFNILRYEVGQRYNSHYDAFNPDEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGL 182
Query: 218 SDECSCGGKLTKGLCVKPVKGNAVLFWSMGLDGQSDPDSVHGGCPVLAGEKWSATKWMR 276
+ + + G + GL VKP +G+ +LF+S+ +G D S+HG CPV+ GEKW ATKW+R
Sbjct: 183 NMDGTYGYEDCVGLRVKPRQGDGLLFYSLLPNGTIDQTSLHGSCPVIKGEKWVATKWIR 359
>AJ388831 weakly similar to GP|10177121|dbj prolyl 4-hydroxylase alpha
subunit-like protein {Arabidopsis thaliana}, partial
(33%)
Length = 505
Score = 79.3 bits (194), Expect = 1e-15
Identities = 38/87 (43%), Positives = 56/87 (63%)
Frame = +3
Query: 76 EVLSWSPRIILLHNFLSYEECDYLRGVALPRLKISTVVDANTGKGIKSDVRTSSGMFLSH 135
+++SW PR L HNFL+ EEC++L +A P + S V+D TG G+ S RTSSG FL
Sbjct: 198 QIISWEPRAFLYHNFLTKEECEHLINIAKPSMHKSAVIDEETGNGVDSSERTSSGAFLKR 377
Query: 136 EERKYPMIHAIEKRISVYSQIPIENGE 162
+ ++ IE+RI+ ++ IP E+GE
Sbjct: 378 GSDR--IVKNIERRIADFTFIPXEHGE 452
>BG447864 weakly similar to GP|10177121|db prolyl 4-hydroxylase alpha
subunit-like protein {Arabidopsis thaliana}, partial
(50%)
Length = 639
Score = 66.6 bits (161), Expect = 1e-11
Identities = 35/75 (46%), Positives = 46/75 (60%)
Frame = +3
Query: 60 NNDKEAQILRLGYVKPEVLSWSPRIILLHNFLSYEECDYLRGVALPRLKISTVVDANTGK 119
N+D+E + G EV+SW P + HNFL+ EEC+YL + P + STVVD+ TGK
Sbjct: 372 NDDEEGK----GEQWVEVVSWEPXAFVYHNFLTKEECEYLIDIXKPSMHKSTVVDSETGK 539
Query: 120 GIKSDVRTSSGMFLS 134
S VRT SG FL+
Sbjct: 540 SKDSXVRTXSGTFLA 584
>BQ144147
Length = 729
Score = 45.8 bits (107), Expect = 2e-05
Identities = 19/64 (29%), Positives = 36/64 (55%)
Frame = +2
Query: 154 SQIPIENGELMQVLRYEKNQYYRPHHDYFSDTFNLKRGGQRIATMLMYLGDNVEGGETHF 213
+ +PIENG+ + + RY+ + P+ +Y + N+ +GG R T+LM + + +G F
Sbjct: 92 TSLPIENGDNLHIWRYKHGHNHNPNDNYSTHKINIVQGGHRPPTVLMLITNETKGNRN*F 271
Query: 214 PSAG 217
+G
Sbjct: 272 SQSG 283
>BF518749
Length = 416
Score = 44.3 bits (103), Expect = 5e-05
Identities = 43/160 (26%), Positives = 71/160 (43%)
Frame = +1
Query: 8 IVFGLLTFVTIGMIIGALSQLAFIRRLELEEPFTTTTTTRSLLPRGYTYWNNNNDKEAQI 67
+ F L TF T+ ++ + S R EL + L R Y++N D +
Sbjct: 28 LTFLLFTFFTLSLLTTSFSD----SRKELRNK--NENVLKQL--RNSVYYSNRIDPSRVV 183
Query: 68 LRLGYVKPEVLSWSPRIILLHNFLSYEECDYLRGVALPRLKISTVVDANTGKGIKSDVRT 127
+SW PR+ L FLS +ECDYL IS + ++G G S
Sbjct: 184 Q---------ISWQPRVFLYKGFLSDKECDYL---------ISLAQEKSSGNGGYSKKEE 309
Query: 128 SSGMFLSHEERKYPMIHAIEKRISVYSQIPIENGELMQVL 167
+S L ++ ++ IE+R+SV++ + EN + +QV+
Sbjct: 310 TS---LDMDD---DIVKRIEERLSVWTFLSKENSKPLQVM 411
>BF521324 similar to GP|18086437|gb| AT3g28480/MFJ20_16 {Arabidopsis
thaliana}, partial (15%)
Length = 285
Score = 38.5 bits (88), Expect = 0.003
Identities = 18/37 (48%), Positives = 24/37 (64%)
Frame = +1
Query: 78 LSWSPRIILLHNFLSYEECDYLRGVALPRLKISTVVD 114
LSWSPR L +NFL+ EECD+L ++ L+ S D
Sbjct: 166 LSWSPRAFLYNNFLTDEECDHLIELSKDNLEKSMAAD 276
>BE941277
Length = 361
Score = 29.3 bits (64), Expect = 1.8
Identities = 24/74 (32%), Positives = 34/74 (45%), Gaps = 1/74 (1%)
Frame = +1
Query: 82 PRIILLHNFLSYEECDYLRGVALPRLKISTVVDANTGKGIKSDV-RTSSGMFLSHEERKY 140
P + LL N SY C++ RG K+ D G+GIK ++ + F S E Y
Sbjct: 70 PSVYLLPNMWSYTTCEF-RGA-----KLLGSADQGGGEGIKIELNQLKPYYFASDEGNAY 231
Query: 141 PMIHAIEKRISVYS 154
I + K I+V S
Sbjct: 232 DCIAGLTKFIAVPS 273
>TC79352 weakly similar to PIR|T01829|T01829 hypothetical protein T15F16.10 -
Arabidopsis thaliana, partial (31%)
Length = 1700
Score = 28.1 bits (61), Expect = 3.9
Identities = 18/64 (28%), Positives = 35/64 (54%), Gaps = 2/64 (3%)
Frame = +1
Query: 140 YPMIHAIEKRISVYSQIPIENGELMQVLRY--EKNQYYRPHHDYFSDTFNLKRGGQRIAT 197
+ M + +E +S+ + + +E+ LM+ L + K++Y RPHH F + GQ +
Sbjct: 1246 FSMKYVLEA*VSLKTWMNLES--LMRRLTFLLPKHRYLRPHHRLFLERMIFPLVGQLMNW 1419
Query: 198 MLMY 201
+L+Y
Sbjct: 1420 LLLY 1431
>TC89186 similar to PIR|G96563|G96563 probable coatomer complex subunit
33791-27676 [imported] - Arabidopsis thaliana, partial
(32%)
Length = 1389
Score = 27.7 bits (60), Expect = 5.1
Identities = 12/68 (17%), Positives = 36/68 (52%)
Frame = +1
Query: 132 FLSHEERKYPMIHAIEKRISVYSQIPIENGELMQVLRYEKNQYYRPHHDYFSDTFNLKRG 191
+++H ++ + + + + + + P+ENG+ L E+N+ + +++++ N + G
Sbjct: 718 YINHADKSHVTLVEAFRNMQIEEEEPLENGDSNHELT-EQNEEHYTEEEHYTEEQNGEEG 894
Query: 192 GQRIATML 199
Q A ++
Sbjct: 895 SQEEAVVV 918
>TC85105 similar to GP|20160725|dbj|BAB89667. putative transaldolase {Oryza
sativa (japonica cultivar-group)}, partial (32%)
Length = 731
Score = 27.3 bits (59), Expect = 6.7
Identities = 15/55 (27%), Positives = 30/55 (54%)
Frame = +2
Query: 131 MFLSHEERKYPMIHAIEKRISVYSQIPIENGELMQVLRYEKNQYYRPHHDYFSDT 185
+F+ +++R MI +++ + + +E+ L+ +LRY K QY HH + T
Sbjct: 221 IFMRNKDRVLTMIIYVDQFQIWFHSLLLESEVLLPILRYLKRQY---HHQMLTIT 376
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.320 0.138 0.420
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,446,535
Number of Sequences: 36976
Number of extensions: 133858
Number of successful extensions: 631
Number of sequences better than 10.0: 32
Number of HSP's better than 10.0 without gapping: 617
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 619
length of query: 281
length of database: 9,014,727
effective HSP length: 95
effective length of query: 186
effective length of database: 5,502,007
effective search space: 1023373302
effective search space used: 1023373302
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 58 (26.9 bits)
Medicago: description of AC149471.21