Miyakogusa Predicted Gene
- Lj5g3v1601890.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v1601890.1 tr|G7IEH8|G7IEH8_MEDTR
Heparan-alpha-glucosaminide N-acetyltransferase OS=Medicago truncatula
GN=MTR,80.16,0,FAMILY NOT NAMED,NULL; seg,NULL,CUFF.55593.1
(484 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G47900.1 | Symbols: | Protein of unknown function (DUF1624) ... 405 e-113
AT5G47900.4 | Symbols: | Protein of unknown function (DUF1624) ... 347 1e-95
AT5G27730.1 | Symbols: | Protein of unknown function (DUF1624) ... 347 1e-95
AT5G47900.7 | Symbols: | Protein of unknown function (DUF1624) ... 340 1e-93
AT5G47900.2 | Symbols: | Protein of unknown function (DUF1624) ... 314 8e-86
AT5G47900.6 | Symbols: | Protein of unknown function (DUF1624) ... 311 5e-85
AT5G47900.5 | Symbols: | Protein of unknown function (DUF1624) ... 231 1e-60
AT5G47900.3 | Symbols: | Protein of unknown function (DUF1624) ... 230 1e-60
>AT5G47900.1 | Symbols: | Protein of unknown function (DUF1624) |
chr5:19392408-19394963 FORWARD LENGTH=440
Length = 440
Score = 405 bits (1042), Expect = e-113, Method: Compositional matrix adjust.
Identities = 201/399 (50%), Positives = 275/399 (68%), Gaps = 11/399 (2%)
Query: 51 SPSLSVPSQRLASLDVFRGLTVALMILVDNVGRAFPSLNHSPWFGLTLADFVMPFFLFGV 110
S SL +RL SLDVFRGLTVA MILVD+VG PS+NHSPW G+TLADFVMPFFLF V
Sbjct: 36 SSSLPPDKERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIV 95
Query: 111 GVSIALVFKKVSSKPNATKKVISRTIKXXXXXXXXXXXYFHGRGHLTYGVDLSKIRWLGV 170
GVS+A +K +S + AT+K + R++K + HG +LTYG+D+ KIR +G+
Sbjct: 96 GVSLAFAYKNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGI 155
Query: 171 LQRIAIGYFFASTSEIWLVNNNILVDSPAAYARKYSIQWMFSILLCSVYLCLLYALYIPD 230
LQRIAI Y + EIWL N+ V S + +KY W+ + ++ ++YL LLY LY+PD
Sbjct: 156 LQRIAIAYLVVALCEIWLKGNHN-VSSELSMIKKYRFHWVVAFVITTIYLSLLYGLYVPD 214
Query: 231 WKFE---------HSNFLR-NVHCGVRGSLEPPCNAVGFIDRLILGENHIYQRPVYRRTK 280
W+++ + FL V CGVRG P CNAVG +DR+ LG H+Y++PVY RTK
Sbjct: 215 WEYQILKEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTK 274
Query: 281 ECSVNSPDYGPLPPDSPGWCLAPFDPEGILSSLMAAITCFMGLQFGHILVIFQGHKQRIX 340
+CS+N P+ GPLPPD+P WC APFDPEG+LSSLMA +TC +GL +GHI++ F+ HK+R+
Sbjct: 275 QCSINYPNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFKDHKKRLN 334
Query: 341 XXXXXXXXXXXIGYVLEILGIPLSKALYTLSYMCVTAGASGLVLTAIYYIVDIGHFRKPT 400
+G L + G+ L+K LYTLSYMCVT+GASG +L+AIY +VD+ +++ +
Sbjct: 335 QWILRSFCLLMLGLALNLFGMHLNKPLYTLSYMCVTSGASGFLLSAIYLMVDVYGYKRAS 394
Query: 401 ALLQWMGMNALIVYALAACDIFPLVIQGFYWHSPENNLV 439
+L+WMG++AL +Y L AC++ L+I GFYW +P NNL+
Sbjct: 395 LVLEWMGIHALPIYVLIACNLVFLIIHGFYWKNPINNLL 433
>AT5G47900.4 | Symbols: | Protein of unknown function (DUF1624) |
chr5:19392426-19394963 FORWARD LENGTH=435
Length = 435
Score = 347 bits (890), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 180/408 (44%), Positives = 252/408 (61%), Gaps = 28/408 (6%)
Query: 51 SPSLSVPSQRLASLDVFRGLTVALMILVDNVGRAFPSLNHSPWFGLTLADFVMPFFLFGV 110
S SL +RL SLDVFRGLTVA MILVD+VG PS+NHSPW G+TLADFVMPFFLF V
Sbjct: 30 SSSLPPDKERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIV 89
Query: 111 GVSIALVFKKVSSKPNATKKVISRTIKXXXXXXXXXXXYFHGRGHLTYGVDLSKIRWLGV 170
GVS+A +K +S + AT+K + R++K + HG +LTYG+D+ KIR +G+
Sbjct: 90 GVSLAFAYKNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGI 149
Query: 171 LQRIAIGYFFASTSEIWLVNNNILVDSPAAYARKYSIQWMFSILLCSVYLCLLYALYIPD 230
LQRIAI Y + EIWL N+ V S + +KY W+ + ++ ++YL LLY LY+PD
Sbjct: 150 LQRIAIAYLVVALCEIWLKGNHN-VSSELSMIKKYRFHWVVAFVITTIYLSLLYGLYVPD 208
Query: 231 WKFE---------HSNFLR-NVHCGVRGSLEPPCNAVGFIDRLILGENHIYQRPVYRRTK 280
W+++ + FL V CGVRG P CNAVG +DR+ LG H+Y++PVY RTK
Sbjct: 209 WEYQILKEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTK 268
Query: 281 ECSVNSPDYGPLPPDSPGWCLAPFDPEGILSSLMAAITCFMGLQFGHILVIFQGHKQRIX 340
+CS+N P+ GPLPPD+P WC APFDPEG+LSSLMA +TC +GL +GHI++ F+ + +
Sbjct: 269 QCSINYPNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFKRNGSKGQ 328
Query: 341 XXXXXXXXXXXIGYVLEILGIPL---------SKALYTLSYMCVTAGASGLVLTAIYYIV 391
E + ++ L+ L + +GL V
Sbjct: 329 VYNEPSISIRRSQKAFESMDFTFFLSSDVRSRTEPLWGLGIFVIRDIPNGL--------V 380
Query: 392 DIGHFRKPTALLQWMGMNALIVYALAACDIFPLVIQGFYWHSPENNLV 439
D+ +++ + +L+WMG++AL +Y L AC++ L+I GFYW +P NNL+
Sbjct: 381 DVYGYKRASLVLEWMGIHALPIYVLIACNLVFLIIHGFYWKNPINNLL 428
>AT5G27730.1 | Symbols: | Protein of unknown function (DUF1624) |
chr5:9820311-9822868 FORWARD LENGTH=472
Length = 472
Score = 347 bits (889), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 190/444 (42%), Positives = 271/444 (61%), Gaps = 12/444 (2%)
Query: 53 SLSVPSQRLASLDVFRGLTVALMILVDNVGRAFPSLNHSPWFGLTLADFVMPFFLFGVGV 112
SL+ RLASLD+FRGLTVALMILVD+ G +P + H+PW G LADFVMPFFLF VGV
Sbjct: 29 SLAGNRPRLASLDIFRGLTVALMILVDDAGGDWPMIAHAPWNGCNLADFVMPFFLFIVGV 88
Query: 113 SIALVFKKVSSKPNATKKVISRTIKXXXXXXXXXXXYFHGRGHLTYGVDLSKIRWLGVLQ 172
SIAL K++S+K A KKV RT K + H LTYGVD++ +R+ G+LQ
Sbjct: 89 SIALSLKRISNKFEACKKVGFRTCKLLFWGLLLQGGFSHAPDELTYGVDVTMMRFCGILQ 148
Query: 173 RIAIGYFFASTSEIWLVNNNILVDSPAAYA--RKYSIQWMFSILLCSVYLCLLYALYIPD 230
RIA+ Y + EI+ +++ S ++ + Y W+ + + +YL LY Y+PD
Sbjct: 149 RIALSYLVVALVEIFTKDSHEENLSTGRFSIFKSYYWHWIVAASVLVIYLATLYGTYVPD 208
Query: 231 WKF---EHSNFLR----NVHCGVRGSLEPPCNAVGFIDRLILGENHIYQRPVYRRTKECS 283
W+F + + L +V CGVRG L PPCNAVG++DR +LG NH+Y P +RR+K C+
Sbjct: 209 WEFVVYDKDSVLYGKILSVSCGVRGKLNPPCNAVGYVDRQVLGINHMYHHPAWRRSKACT 268
Query: 284 VNSPDYGPLPPDSPGWCLAPFDPEGILSSLMAAITCFMGLQFGHILVIFQGHKQRIXXXX 343
+SP G + D+P WC APF+PEGILSS+ A ++ +G+ FGHI++ +GH R+
Sbjct: 269 DDSPYEGAIRQDAPSWCRAPFEPEGILSSISAILSTIIGVHFGHIILHLKGHSARLKHWI 328
Query: 344 XXXXXXXXIGYVLEILGI-PLSKALYTLSYMCVTAGASGLVLTAIYYIVDIGHFRKPTAL 402
+G L + PL+K LY+ SY+CVT+GA+ LV +++Y +VDI ++
Sbjct: 329 STGLVLLALGLTLHFTHLMPLNKQLYSFSYICVTSGAAALVFSSLYSLVDILEWKHMFLP 388
Query: 403 LQWMGMNALIVYALAACDIFPLVIQGFYWHSPENNLVD-ASETLLQNVFHSEKWGTLAFV 461
L+W+GMNA++VY + A I G+Y+ P N L++ E + V+HS + G L +V
Sbjct: 389 LKWIGMNAMLVYVMGAEGILAAFFNGWYYRHPHNTLINWIREHVFIRVWHSRRVGVLMYV 448
Query: 462 IVG-ILFWCLFAGFLHKKGIYIKM 484
I ILFW L G H+ IY K+
Sbjct: 449 IFAEILFWGLVTGVFHRFKIYWKL 472
>AT5G47900.7 | Symbols: | Protein of unknown function (DUF1624) |
chr5:19392408-19394753 FORWARD LENGTH=435
Length = 435
Score = 340 bits (873), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 181/388 (46%), Positives = 239/388 (61%), Gaps = 48/388 (12%)
Query: 51 SPSLSVPSQRLASLDVFRGLTVALMILVDNVGRAFPSLNHSPWFGLTLADFVMPFFLFGV 110
S SL +RL SLDVFRGLTVA MILVD+VG PS+NHSPW G+TLADFVMPFFLF V
Sbjct: 36 SSSLPPDKERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIV 95
Query: 111 GVSIALVFKKVSSKPNATKKVISRTIKXXXXXXXXXXXYFHGRGHLTYGVDLSKIRWLGV 170
GVS+A +K +S + AT+K + R++K + HG +LTYG+D+ KIR +G+
Sbjct: 96 GVSLAFAYKNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGI 155
Query: 171 LQRIAIGYFFASTSEIWLVNNNILVDSPAAYARKYSIQWMFSILLCSVYLCLLYALYIPD 230
LQRIAI Y + EIWL N+ V S + +KY W+ + ++ ++YL LLY LY+PD
Sbjct: 156 LQRIAIAYLVVALCEIWLKGNHN-VSSELSMIKKYRFHWVVAFVITTIYLSLLYGLYVPD 214
Query: 231 WKFE---------HSNFLR-NVHCGVRGSLEPPCNAVGFIDRLILGENHIYQRPVYRRTK 280
W+++ + FL V CGVRG P CNAVG +DR+ LG H+Y++PVY RTK
Sbjct: 215 WEYQILKEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTK 274
Query: 281 ECSVNSPDYGPLPPDSPGWCLAPFDPEGILSSLMAAITCFMGLQFGHILVIF-------- 332
+CS+N P+ GPLPPD+P WC APFDPEG+LSSLMA +TC +GL +GHI++ F
Sbjct: 275 QCSINYPNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFKRNGSKGQ 334
Query: 333 -----------------------------QGHKQRIXXXXXXXXXXXXIGYVLEILGIPL 363
Q HK+R+ +G L + G+ L
Sbjct: 335 VYNEPSISIRPFFFILSETYLLLYVINFLQDHKKRLNQWILRSFCLLMLGLALNLFGMHL 394
Query: 364 SKALYTLSYMCVTAGASGLVLTAIYYIV 391
+K LYTLSYMCVT+GASG +L+AIY +V
Sbjct: 395 NKPLYTLSYMCVTSGASGFLLSAIYLMV 422
>AT5G47900.2 | Symbols: | Protein of unknown function (DUF1624) |
chr5:19392408-19394161 FORWARD LENGTH=359
Length = 359
Score = 314 bits (805), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 153/293 (52%), Positives = 202/293 (68%), Gaps = 11/293 (3%)
Query: 51 SPSLSVPSQRLASLDVFRGLTVALMILVDNVGRAFPSLNHSPWFGLTLADFVMPFFLFGV 110
S SL +RL SLDVFRGLTVA MILVD+VG PS+NHSPW G+TLADFVMPFFLF V
Sbjct: 36 SSSLPPDKERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIV 95
Query: 111 GVSIALVFKKVSSKPNATKKVISRTIKXXXXXXXXXXXYFHGRGHLTYGVDLSKIRWLGV 170
GVS+A +K +S + AT+K + R++K + HG +LTYG+D+ KIR +G+
Sbjct: 96 GVSLAFAYKNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGI 155
Query: 171 LQRIAIGYFFASTSEIWLVNNNILVDSPAAYARKYSIQWMFSILLCSVYLCLLYALYIPD 230
LQRIAI Y + EIWL N+ V S + +KY W+ + ++ ++YL LLY LY+PD
Sbjct: 156 LQRIAIAYLVVALCEIWLKGNHN-VSSELSMIKKYRFHWVVAFVITTIYLSLLYGLYVPD 214
Query: 231 WKFE---------HSNFLR-NVHCGVRGSLEPPCNAVGFIDRLILGENHIYQRPVYRRTK 280
W+++ + FL V CGVRG P CNAVG +DR+ LG H+Y++PVY RTK
Sbjct: 215 WEYQILKEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTK 274
Query: 281 ECSVNSPDYGPLPPDSPGWCLAPFDPEGILSSLMAAITCFMGLQFGHILVIFQ 333
+CS+N P+ GPLPPD+P WC APFDPEG+LSSLMA +TC +GL +GHI++ F+
Sbjct: 275 QCSINYPNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFK 327
>AT5G47900.6 | Symbols: | Protein of unknown function (DUF1624) |
chr5:19392556-19394963 FORWARD LENGTH=340
Length = 340
Score = 311 bits (798), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 153/332 (46%), Positives = 221/332 (66%), Gaps = 11/332 (3%)
Query: 118 FKKVSSKPNATKKVISRTIKXXXXXXXXXXXYFHGRGHLTYGVDLSKIRWLGVLQRIAIG 177
F + S+ AT+K + R++K + HG +LTYG+D+ KIR +G+LQRIAI
Sbjct: 3 FAVLPSQFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIAIA 62
Query: 178 YFFASTSEIWLVNNNILVDSPAAYARKYSIQWMFSILLCSVYLCLLYALYIPDWKFE--- 234
Y + EIWL N+ V S + +KY W+ + ++ ++YL LLY LY+PDW+++
Sbjct: 63 YLVVALCEIWLKGNHN-VSSELSMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQILK 121
Query: 235 ------HSNFLR-NVHCGVRGSLEPPCNAVGFIDRLILGENHIYQRPVYRRTKECSVNSP 287
+ FL V CGVRG P CNAVG +DR+ LG H+Y++PVY RTK+CS+N P
Sbjct: 122 EDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCSINYP 181
Query: 288 DYGPLPPDSPGWCLAPFDPEGILSSLMAAITCFMGLQFGHILVIFQGHKQRIXXXXXXXX 347
+ GPLPPD+P WC APFDPEG+LSSLMA +TC +GL +GHI++ F+ HK+R+
Sbjct: 182 NNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFKDHKKRLNQWILRSF 241
Query: 348 XXXXIGYVLEILGIPLSKALYTLSYMCVTAGASGLVLTAIYYIVDIGHFRKPTALLQWMG 407
+G L + G+ L+K LYTLSYMCVT+GASG +L+AIY +VD+ +++ + +L+WMG
Sbjct: 242 CLLMLGLALNLFGMHLNKPLYTLSYMCVTSGASGFLLSAIYLMVDVYGYKRASLVLEWMG 301
Query: 408 MNALIVYALAACDIFPLVIQGFYWHSPENNLV 439
++AL +Y L AC++ L+I GFYW +P NNL+
Sbjct: 302 IHALPIYVLIACNLVFLIIHGFYWKNPINNLL 333
>AT5G47900.5 | Symbols: | Protein of unknown function (DUF1624) |
chr5:19392408-19393885 FORWARD LENGTH=295
Length = 295
Score = 231 bits (588), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 119/240 (49%), Positives = 157/240 (65%), Gaps = 11/240 (4%)
Query: 51 SPSLSVPSQRLASLDVFRGLTVALMILVDNVGRAFPSLNHSPWFGLTLADFVMPFFLFGV 110
S SL +RL SLDVFRGLTVA MILVD+VG PS+NHSPW G+TLADFVMPFFLF V
Sbjct: 36 SSSLPPDKERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIV 95
Query: 111 GVSIALVFKKVSSKPNATKKVISRTIKXXXXXXXXXXXYFHGRGHLTYGVDLSKIRWLGV 170
GVS+A +K +S + AT+K + R++K + HG +LTYG+D+ KIR +G+
Sbjct: 96 GVSLAFAYKNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGI 155
Query: 171 LQRIAIGYFFASTSEIWLVNNNILVDSPAAYARKYSIQWMFSILLCSVYLCLLYALYIPD 230
LQRIAI Y + EIWL N+ V S + +KY W+ + ++ ++YL LLY LY+PD
Sbjct: 156 LQRIAIAYLVVALCEIWLKGNHN-VSSELSMIKKYRFHWVVAFVITTIYLSLLYGLYVPD 214
Query: 231 WKFE---------HSNFLR-NVHCGVRGSLEPPCNAVGFIDRLILGENHIYQRPVYRRTK 280
W+++ + FL V CGVRG P CNAVG +DR+ LG H+Y++PVY RTK
Sbjct: 215 WEYQILKEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTK 274
>AT5G47900.3 | Symbols: | Protein of unknown function (DUF1624) |
chr5:19392408-19393885 FORWARD LENGTH=292
Length = 292
Score = 230 bits (587), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 119/239 (49%), Positives = 157/239 (65%), Gaps = 12/239 (5%)
Query: 51 SPSLSVPSQRLASLDVFRGLTVALMILVDNVGRAFPSLNHSPWFGLTLADFVMPFFLFGV 110
S SL +RL SLDVFRGLTVA MILVD+VG PS+NHSPW G+TLADFVMPFFLF V
Sbjct: 36 SSSLPPDKERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIV 95
Query: 111 GVSIALVFKKVSSKPNATKKVISRTIKXXXXXXXXXXXYFHGRGHLTYGVDLSKIRWLGV 170
GVS+A +K +S + AT+K + R++K + HG +LTYG+D+ KIR +G+
Sbjct: 96 GVSLAFAYKNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGI 155
Query: 171 LQRIAIGYFFASTSEIWLVNNNILVDSPAAYARKYSIQWMFSILLCSVYLCLLYALYIPD 230
LQRIAI Y + EIWL N+ V S + +KY W+ + ++ ++YL LLY LY+PD
Sbjct: 156 LQRIAIAYLVVALCEIWLKGNHN-VSSELSMIKKYRFHWVVAFVITTIYLSLLYGLYVPD 214
Query: 231 WKFE---------HSNFLRNVHCGVRGSLEPPCNAVGFIDRLILGENHIYQRPVYRRTK 280
W+++ + FL V CGVRG P CNAVG +DR+ LG H+Y++PVY RTK
Sbjct: 215 WEYQILKEDQGSTLTTFL--VKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTK 271