Miyakogusa Predicted Gene
- Lj2g3v1349500.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj2g3v1349500.1 Non Chatacterized Hit- tr|I1JBT7|I1JBT7_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.56680 PE,75.86,0,no
description,Concanavalin A-like lectin/glucanase, subgroup; seg,NULL;
Galactoside-binding lectin,,CUFF.36813.1
(339 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G62620.2 | Symbols: | Galactosyltransferase family protein |... 346 1e-95
AT5G62620.1 | Symbols: | Galactosyltransferase family protein |... 344 5e-95
AT1G74800.1 | Symbols: | Galactosyltransferase family protein |... 323 1e-88
AT1G27120.1 | Symbols: | Galactosyltransferase family protein |... 290 9e-79
AT4G21060.1 | Symbols: | Galactosyltransferase family protein |... 228 3e-60
AT4G21060.2 | Symbols: | Galactosyltransferase family protein |... 228 5e-60
AT1G26810.1 | Symbols: GALT1 | galactosyltransferase1 | chr1:928... 67 2e-11
>AT5G62620.2 | Symbols: | Galactosyltransferase family protein |
chr5:25137136-25139433 FORWARD LENGTH=596
Length = 596
Score = 346 bits (888), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 201/375 (53%), Positives = 245/375 (65%), Gaps = 45/375 (12%)
Query: 1 MKRGIKLDPFGLPNRLTLVQIXXXXXXXXXXXXTFEIPLAFRAGLAS-----------EN 49
++R K D F ++ VQI TFEIP F+ GL+S N
Sbjct: 9 LERLEKFDIFVSLSKQRSVQILMAVGLLYMLLITFEIPFVFKTGLSSLSQDPLTRPEKHN 68
Query: 50 SALGLLTDALPVTVPL--LLEENHQIEA---------SVVSTLSFNG-TFS-----GDSE 92
S L P T PL LL + Q E+ ++S+L F+ TF+ G E
Sbjct: 69 SQRELQERRAP-TRPLKSLLYQESQSESPAQGLRRRTRILSSLRFDPETFNPSSKDGSVE 127
Query: 93 LHKVASHAWLAGKKLWSEVESGK----VEMFSSKLKSENGSDSCLNSVTLSGFEFREKFK 148
LHK A AW G+K+W E+ESGK +E K E+G++SC SV+L+G + ++
Sbjct: 128 LHKSAKVAWEVGRKIWEELESGKTLKALEKEKKKKIEEHGTNSCSLSVSLTGSDLLKR-G 186
Query: 149 GVMVLPCGLTLWSHVTVVGTPRWAHAERDPKIAVVRDGDEAVMVSQFMLELQGLKAVDKE 208
+M LPCGLTL SH+TVVG PR AH+E+DPKI+++++GDEAV VSQF LELQGLKAV+ E
Sbjct: 187 NIMELPCGLTLGSHITVVGKPRAAHSEKDPKISMLKEGDEAVKVSQFKLELQGLKAVEGE 246
Query: 209 EPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWGSALRCDGWKSRADEETVDGQVKCEKWI 268
EPPRILH NPRLKGDWSGKPVIEQNTCYRMQWGSA RC+GW+SR DEETVDGQVKCEKW
Sbjct: 247 EPPRILHLNPRLKGDWSGKPVIEQNTCYRMQWGSAQRCEGWRSRDDEETVDGQVKCEKWA 306
Query: 269 RDDDNHSEEWK----ATWWLNRLIGRKKKVTVDWPYPFAEGKLFVLTISAGLEGYHVSV- 323
RDD S+E + A+WWL+RLIGR KKVTV+WP+PF KLFVLT+SAGLEGYHVSV
Sbjct: 307 RDDSITSKEEESSKAASWWLSRLIGRSKKVTVEWPFPFTVDKLFVLTLSAGLEGYHVSVD 366
Query: 324 ------FPYRTVGTL 332
FPYRT TL
Sbjct: 367 GKHVTSFPYRTGFTL 381
>AT5G62620.1 | Symbols: | Galactosyltransferase family protein |
chr5:25137136-25139764 FORWARD LENGTH=681
Length = 681
Score = 344 bits (882), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 201/375 (53%), Positives = 245/375 (65%), Gaps = 45/375 (12%)
Query: 1 MKRGIKLDPFGLPNRLTLVQIXXXXXXXXXXXXTFEIPLAFRAGLAS-----------EN 49
++R K D F ++ VQI TFEIP F+ GL+S N
Sbjct: 9 LERLEKFDIFVSLSKQRSVQILMAVGLLYMLLITFEIPFVFKTGLSSLSQDPLTRPEKHN 68
Query: 50 SALGLLTDALPVTVPL--LLEENHQIEA---------SVVSTLSFNG-TFS-----GDSE 92
S L P T PL LL + Q E+ ++S+L F+ TF+ G E
Sbjct: 69 SQRELQERRAP-TRPLKSLLYQESQSESPAQGLRRRTRILSSLRFDPETFNPSSKDGSVE 127
Query: 93 LHKVASHAWLAGKKLWSEVESGK----VEMFSSKLKSENGSDSCLNSVTLSGFEFREKFK 148
LHK A AW G+K+W E+ESGK +E K E+G++SC SV+L+G + ++
Sbjct: 128 LHKSAKVAWEVGRKIWEELESGKTLKALEKEKKKKIEEHGTNSCSLSVSLTGSDLLKR-G 186
Query: 149 GVMVLPCGLTLWSHVTVVGTPRWAHAERDPKIAVVRDGDEAVMVSQFMLELQGLKAVDKE 208
+M LPCGLTL SH+TVVG PR AH+E+DPKI+++++GDEAV VSQF LELQGLKAV+ E
Sbjct: 187 NIMELPCGLTLGSHITVVGKPRAAHSEKDPKISMLKEGDEAVKVSQFKLELQGLKAVEGE 246
Query: 209 EPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWGSALRCDGWKSRADEETVDGQVKCEKWI 268
EPPRILH NPRLKGDWSGKPVIEQNTCYRMQWGSA RC+GW+SR DEETVDGQVKCEKW
Sbjct: 247 EPPRILHLNPRLKGDWSGKPVIEQNTCYRMQWGSAQRCEGWRSRDDEETVDGQVKCEKWA 306
Query: 269 RDDDNHSEEWK----ATWWLNRLIGRKKKVTVDWPYPFAEGKLFVLTISAGLEGYHVSV- 323
RDD S+E + A+WWL+RLIGR KKVTV+WP+PF KLFVLT+SAGLEGYHVSV
Sbjct: 307 RDDSITSKEEESSKAASWWLSRLIGRSKKVTVEWPFPFTVDKLFVLTLSAGLEGYHVSVD 366
Query: 324 ------FPYRTVGTL 332
FPYRT TL
Sbjct: 367 GKHVTSFPYRTGFTL 381
>AT1G74800.1 | Symbols: | Galactosyltransferase family protein |
chr1:28102221-28104993 REVERSE LENGTH=672
Length = 672
Score = 323 bits (828), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 165/279 (59%), Positives = 202/279 (72%), Gaps = 26/279 (9%)
Query: 67 LEENHQIEASVVSTLSFNG-TFS-----GDSELHKVASHAWLAGKKLWSEVESGKVEMFS 120
+ E+H+ V+S+L F+ TF G ELHK A AW G+KLW E+ESG++E
Sbjct: 107 VREHHR---GVLSSLRFDSETFDPSSKDGSVELHKSAKEAWQLGRKLWKELESGRLEKLV 163
Query: 121 SKLKSENGSDSCLNSVTLSGFEFREKFKGVMVLPCGLTLWSHVTVVGTPRWAHAERDPKI 180
K +N DSC +SV+L+G EF + +M LPCGLTL SH+T+VG PR AH
Sbjct: 164 EK-PEKNKPDSCPHSVSLTGSEFMNRENKLMELPCGLTLGSHITLVGRPRKAHP------ 216
Query: 181 AVVRDGDEAVMVSQFMLELQGLKAVDKEEPPRILHFNPRLKGDWSGKPVIEQNTCYRMQW 240
++GD + +VSQF++ELQGLK V+ E+PPRILHFNPRLKGDWS KPVIEQN+CYRMQW
Sbjct: 217 ---KEGDWSKLVSQFVIELQGLKTVEGEDPPRILHFNPRLKGDWSKKPVIEQNSCYRMQW 273
Query: 241 GSALRCDGWKSRADEETVDGQVKCEKWIRDDDNHSEEWKATWWLNRLIGRKKKVTVDWPY 300
G A RC+GWKSR DEETVD VKCEKWIRDDDN+SE +A WWLNRLIGR+K+V V+WP+
Sbjct: 274 GPAQRCEGWKSRDDEETVDSHVKCEKWIRDDDNYSEGSRARWWLNRLIGRRKRVKVEWPF 333
Query: 301 PFAEGKLFVLTISAGLEGYHVSV-------FPYRTVGTL 332
PF E KLFVLT+SAGLEGYH++V FPYRT TL
Sbjct: 334 PFVEEKLFVLTLSAGLEGYHINVDGKHVTSFPYRTGFTL 372
>AT1G27120.1 | Symbols: | Galactosyltransferase family protein |
chr1:9421389-9423910 FORWARD LENGTH=673
Length = 673
Score = 290 bits (742), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 159/290 (54%), Positives = 198/290 (68%), Gaps = 32/290 (11%)
Query: 60 PVTVPLLLEENHQIEASVVSTL----SF--NGTFSGD-SELHKVASHAWLAGKKLWSEVE 112
P V L L E E VS + SF NG FS + S HK A HA G+K+W ++
Sbjct: 92 PGRVQLRLPERKMREFKSVSEIFVNESFFDNGGFSDEFSIFHKTAKHAISMGRKMWDGLD 151
Query: 113 SGKVEMFSSKLKSENGSDSCLNSVTLSGFEFREKFKGVMVLPCGLTLWSHVTVVGTPRWA 172
SG ++ + +K+ + C + V++S EF + + ++VLPCGLTL SH+TVV TP WA
Sbjct: 152 SGLIKPDKAPVKTR--IEKCPDMVSVSESEFVNRSR-ILVLPCGLTLGSHITVVATPHWA 208
Query: 173 HAERDPKIAVVRDGDEAVMVSQFMLELQGLKAVDKEEPPRILHFNPRLKGDWSGKPVIEQ 232
H E+D GD+ MVSQFM+ELQGLKAVD E+PPRILHFNPR+KGDWSG+PVIEQ
Sbjct: 209 HVEKD--------GDKTAMVSQFMMELQGLKAVDGEDPPRILHFNPRIKGDWSGRPVIEQ 260
Query: 233 NTCYRMQWGSALRCDGWKSRADEETVDGQVKCEKWIR------DDDNHSEEWKATWWLNR 286
NTCYRMQWGS LRCDG +S DEE VDG+VKCE+W R ++ + +E K TWWLNR
Sbjct: 261 NTCYRMQWGSGLRCDGRESSDDEEYVDGEVKCERWKRDDDDGGNNGDDFDESKKTWWLNR 320
Query: 287 LIGRKKK-VTVDWPYPFAEGKLFVLTISAGLEGYHVSV-------FPYRT 328
L+GR+KK +T DW YPFAEGKLFVLT+ AG+EGYH+SV FPYRT
Sbjct: 321 LMGRRKKMITHDWDYPFAEGKLFVLTLRAGMEGYHISVNGRHITSFPYRT 370
>AT4G21060.1 | Symbols: | Galactosyltransferase family protein |
chr4:11240730-11244860 FORWARD LENGTH=741
Length = 741
Score = 228 bits (582), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 114/249 (45%), Positives = 161/249 (64%), Gaps = 13/249 (5%)
Query: 91 SELHKVASHAWLAGKKLWSEVESGKVEMFSSKLKSENGS-DSCLNSVTLSGFEFREKFKG 149
S ++A AW+ G K W +V+ +V+ + G +SC + ++++G + K
Sbjct: 190 SPFERMADEAWILGSKAWEDVDKFEVDKINESASIFEGKVESCPSQISMNGDDL-NKANR 248
Query: 150 VMVLPCGLTLWSHVTVVGTPRWAHAERDPKIAVVRDGDEAVMVSQFMLELQGLKAVDKEE 209
+M+LPCGL S +T++GTP++AH E P+ + + V+VSQFM+ELQGLK D E
Sbjct: 249 IMLLPCGLAAGSSITILGTPQYAHKESVPQRSRLTRSYGMVLVSQFMVELQGLKTGDGEY 308
Query: 210 PPRILHFNPRLKGDWSGKPVIEQNTCYRMQWGSALRCDGWKSRADEET-VDGQVKCEKWI 268
PP+ILH NPR+KGDW+ +PVIE NTCYRMQWG A RCDG S+ D + VDG +CEKW
Sbjct: 309 PPKILHLNPRIKGDWNHRPVIEHNTCYRMQWGVAQRCDGTPSKKDADVLVDGFRRCEKWT 368
Query: 269 RD---DDNHSEEWKATWWLNRLIGRKKKVTVDWPYPFAEGKLFVLTISAGLEGYHVSV-- 323
++ D S+E K T W R IGR++K V W +PFAEGK+FVLT+ AG++G+H++V
Sbjct: 369 QNDIIDMVDSKESKTTSWFKRFIGREQKPEVTWSFPFAEGKVFVLTLRAGIDGFHINVGG 428
Query: 324 -----FPYR 327
FPYR
Sbjct: 429 RHVSSFPYR 437
>AT4G21060.2 | Symbols: | Galactosyltransferase family protein |
chr4:11242003-11244860 FORWARD LENGTH=684
Length = 684
Score = 228 bits (581), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 114/249 (45%), Positives = 161/249 (64%), Gaps = 13/249 (5%)
Query: 91 SELHKVASHAWLAGKKLWSEVESGKVEMFSSKLKSENGS-DSCLNSVTLSGFEFREKFKG 149
S ++A AW+ G K W +V+ +V+ + G +SC + ++++G + K
Sbjct: 133 SPFERMADEAWILGSKAWEDVDKFEVDKINESASIFEGKVESCPSQISMNGDDL-NKANR 191
Query: 150 VMVLPCGLTLWSHVTVVGTPRWAHAERDPKIAVVRDGDEAVMVSQFMLELQGLKAVDKEE 209
+M+LPCGL S +T++GTP++AH E P+ + + V+VSQFM+ELQGLK D E
Sbjct: 192 IMLLPCGLAAGSSITILGTPQYAHKESVPQRSRLTRSYGMVLVSQFMVELQGLKTGDGEY 251
Query: 210 PPRILHFNPRLKGDWSGKPVIEQNTCYRMQWGSALRCDGWKSRADEET-VDGQVKCEKWI 268
PP+ILH NPR+KGDW+ +PVIE NTCYRMQWG A RCDG S+ D + VDG +CEKW
Sbjct: 252 PPKILHLNPRIKGDWNHRPVIEHNTCYRMQWGVAQRCDGTPSKKDADVLVDGFRRCEKWT 311
Query: 269 RD---DDNHSEEWKATWWLNRLIGRKKKVTVDWPYPFAEGKLFVLTISAGLEGYHVSV-- 323
++ D S+E K T W R IGR++K V W +PFAEGK+FVLT+ AG++G+H++V
Sbjct: 312 QNDIIDMVDSKESKTTSWFKRFIGREQKPEVTWSFPFAEGKVFVLTLRAGIDGFHINVGG 371
Query: 324 -----FPYR 327
FPYR
Sbjct: 372 RHVSSFPYR 380
>AT1G26810.1 | Symbols: GALT1 | galactosyltransferase1 |
chr1:9286862-9289327 REVERSE LENGTH=643
Length = 643
Score = 67.0 bits (162), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 55/185 (29%), Positives = 86/185 (46%), Gaps = 30/185 (16%)
Query: 153 LPCGLTLWSHVTVVGTPRWAHAERDPKIAVVRDGDEAVMVSQFMLELQGLKAVDKEEPPR 212
+PCGLT S +TV+G P DG +V F ++L G + +PP
Sbjct: 175 IPCGLTQGSSITVIGIP---------------DG----LVGSFRIDLTGQPLPGEPDPPI 215
Query: 213 ILHFNPRLKGDWSGK-PVIEQNTCYRMQ-WGSALRCDGWKSRADEETVDGQVKCEKWIRD 270
I+H+N RL GD S + PVI QN+ Q WG+ RC + +++ VD +C K +
Sbjct: 216 IVHYNVRLLGDKSTEDPVIVQNSWTASQDWGAEERCPKFDPDMNKK-VDDLDECNKMVGG 274
Query: 271 DDNHSEEWKATWWLNRLIGRKKKVTVDWPY-PFAEGKLFVLTISAGLEGY-------HVS 322
+ N + +R + ++ + Y PF +G L V T+ G EG H++
Sbjct: 275 EINRTSSTSLQSNTSRGVPVAREASKHEKYFPFKQGFLSVATLRVGTEGMQMTVDGKHIT 334
Query: 323 VFPYR 327
F +R
Sbjct: 335 SFAFR 339