Miyakogusa Predicted Gene
- Lj5g3v2239730.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v2239730.1 CUFF.56984.1
(458 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G75420.1 | Symbols: | UDP-Glycosyltransferase superfamily pr... 660 0.0
AT1G19710.1 | Symbols: | UDP-Glycosyltransferase superfamily pr... 635 0.0
AT1G52420.1 | Symbols: | UDP-Glycosyltransferase superfamily pr... 160 1e-39
AT3G15940.2 | Symbols: | UDP-Glycosyltransferase superfamily pr... 148 7e-36
AT3G15940.1 | Symbols: | UDP-Glycosyltransferase superfamily pr... 148 7e-36
AT1G78800.1 | Symbols: | UDP-Glycosyltransferase superfamily pr... 49 1e-05
>AT1G75420.1 | Symbols: | UDP-Glycosyltransferase superfamily
protein | chr1:28305469-28307317 FORWARD LENGTH=463
Length = 463
Score = 660 bits (1704), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/452 (75%), Positives = 394/452 (87%), Gaps = 4/452 (0%)
Query: 7 KKRWPLVLLAFLSVSTVTVLFMRPNNADSCNTKNFEQQRSQIRSPVQDRPGPSPLDFMKS 66
+KRW L++L FLSVSTV ++ +R + + F ++++ S + + +PLDFMKS
Sbjct: 9 RKRWALMVLLFLSVSTVCMILVRSSFETCSISSQFVEEKNGESSAAKFQS--NPLDFMKS 66
Query: 67 KLVLMVSHELSLSGGPLLLMELAFLLRGVGSDVVWITNQNPVENDQVIYSLESKMLDRGV 126
KLVL+VSHELSLSGGPLLLMELAFLLRGVG+DVVWITNQ P+E+D+V+YSLE KMLDRGV
Sbjct: 67 KLVLLVSHELSLSGGPLLLMELAFLLRGVGADVVWITNQKPLEDDEVVYSLEHKMLDRGV 126
Query: 127 QVLPAKGEKAIDTALKADMVILNTAVAGKWLDAVLKEKVSRVLPKVLWWIHEMRGHYFKE 186
QV+ AKG+KA+DT+LKAD+++LNTAVAGKWLDAVLKE V +VLPK+LWWIHEMRGHYF
Sbjct: 127 QVISAKGQKAVDTSLKADLIVLNTAVAGKWLDAVLKENVVKVLPKILWWIHEMRGHYFNA 186
Query: 187 EYVKHLPFVAGAMIDSHTTAEYWKNRTRERLRIKMPETYVVHLGNSKELMEVADDSVAKR 246
+ VKHLPFVAGAMIDSH TA YWKNRT+ RL IKMP+TYVVHLGNSKELMEVA+DSVAKR
Sbjct: 187 DLVKHLPFVAGAMIDSHATAGYWKNRTQARLGIKMPKTYVVHLGNSKELMEVAEDSVAKR 246
Query: 247 VLREHVRESLGVRSDDLLFAIINSVSRGKGQDLFLRSFYESLQFIQEKKLQLPSLHAVVV 306
VLREHVRESLGVR++DLLF IINSVSRGKGQDLFLR+F+ESL+ I+EKKLQ+P++HAVVV
Sbjct: 247 VLREHVRESLGVRNEDLLFGIINSVSRGKGQDLFLRAFHESLERIKEKKLQVPTMHAVVV 306
Query: 307 GSDMNAQTKFEMELRKFVIDKKIQDRVHFVNKTLAVAPYLASIDVLVQNSQARGECFGRI 366
GSDM+ QTKFE ELR FV +KK+++ VHFVNKTL VAPY+A+IDVLVQNSQARGECFGRI
Sbjct: 307 GSDMSKQTKFETELRNFVREKKLENFVHFVNKTLTVAPYIAAIDVLVQNSQARGECFGRI 366
Query: 367 TIEAMAFRLPVLGTAAGGTMEIVVNGSTGLLHPVGKGGVTPLANNIVKLATHVEKRLTMG 426
TIEAMAF+LPVLGTAAGGTMEIVVNG+TGLLH GK GV PLA NIVKLAT VE RL MG
Sbjct: 367 TIEAMAFKLPVLGTAAGGTMEIVVNGTTGLLHSAGKEGVIPLAKNIVKLATQVELRLRMG 426
Query: 427 KKGYERVKERFMEKHMSDRIALVLKDVLARQH 458
K GYERVKE F+E HMS RIA VLK+VL QH
Sbjct: 427 KNGYERVKEMFLEHHMSHRIASVLKEVL--QH 456
>AT1G19710.1 | Symbols: | UDP-Glycosyltransferase superfamily
protein | chr1:6814920-6816716 FORWARD LENGTH=479
Length = 479
Score = 635 bits (1637), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/457 (74%), Positives = 389/457 (85%), Gaps = 8/457 (1%)
Query: 7 KKRWPLVLLAFLSVSTVTVLFMRPNNADSCNTKNFEQQRSQIRSP---VQDRPGP-SPLD 62
KKRWPL++L LSVSTV ++ +R + DSC+ R + + +Q G +PL+
Sbjct: 14 KKRWPLMILLVLSVSTVGMILVR-STFDSCSVSGKRCSREKEDNSDIKIQSVSGSLNPLE 72
Query: 63 FMKSKLVLMVSHELSLSGGPLLLMELAFLLRGVGSDVVWITNQNPVENDQVIYSLESKML 122
FMKSKLVL+VSHELSLSGGPLLLMELAFLLRGV S+VVWITNQ PVE D+VI LE KML
Sbjct: 73 FMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVESEVVWITNQKPVEEDEVIKVLEHKML 132
Query: 123 DRGVQVLPAKGEKAIDTALKADMVILNTAVAGKWLDAVLKEKVSRVLPKVLWWIHEMRGH 182
DRGVQV+ AK +KAIDTALK+D+V+LNTAVAGKWLDAVLK+ V +VLPKVLWWIHEMRGH
Sbjct: 133 DRGVQVISAKSQKAIDTALKSDLVVLNTAVAGKWLDAVLKDNVPKVLPKVLWWIHEMRGH 192
Query: 183 YFKEEYVKHLPFVAGAMIDSHTTAEYWKNRTRERLRIKMPETYVVHLGNSKELMEVADDS 242
YFK + VKHLPFVAGAMIDSH TAEYWKNRT +RL IKMP+TYVVHLGNSKELMEVA+DS
Sbjct: 193 YFKPDLVKHLPFVAGAMIDSHATAEYWKNRTHDRLGIKMPKTYVVHLGNSKELMEVAEDS 252
Query: 243 VAKRVLREHVRESLGVRSDDLLFAIINSVSRGKGQDLFLRSFYESLQFIQE-KKLQLPSL 301
AK VLRE VRESLGVR++D+LF IINSVSRGKGQDLFLR+F+ESL+ I+E KKL++P++
Sbjct: 253 FAKNVLREQVRESLGVRNEDILFGIINSVSRGKGQDLFLRAFHESLKVIKETKKLEVPTM 312
Query: 302 HAVVVGSDMNAQTKFEMELRKFVIDKKIQDRVHFVNKTLAVAPYLASIDVLVQNSQARGE 361
HAVVVGSDM+AQTKFE ELR FV + K+Q VHFVNKT+ VAPYLA+IDVLVQNSQARGE
Sbjct: 313 HAVVVGSDMSAQTKFETELRNFVQEMKLQKIVHFVNKTMKVAPYLAAIDVLVQNSQARGE 372
Query: 362 CFGRITIEAMAFRLPVLGTAAGGTMEIVVNGSTGLLHPVGKGGVTPLANNIVKLATHVEK 421
CFGRITIEAMAF+LPVLGTAAGGTMEIVVN +TGLLH GK GV PLA NIVKLAT+V+
Sbjct: 373 CFGRITIEAMAFKLPVLGTAAGGTMEIVVNRTTGLLHNTGKDGVLPLAKNIVKLATNVKM 432
Query: 422 RLTMGKKGYERVKERFMEKHMSDRIALVLKDVLARQH 458
R TMGKKGYERVKE F+E HMS RIA VL++VL QH
Sbjct: 433 RNTMGKKGYERVKEMFLEHHMSHRIASVLREVL--QH 467
>AT1G52420.1 | Symbols: | UDP-Glycosyltransferase superfamily
protein | chr1:19528667-19531035 FORWARD LENGTH=670
Length = 670
Score = 160 bits (406), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 141/472 (29%), Positives = 220/472 (46%), Gaps = 94/472 (19%)
Query: 49 RSPVQDRPGPSPLDFMK---SKLVLMVSHELSLSGGPLLLMELAFLLRGVGSDVVWITNQ 105
RS DR DF + S+ +++ HELS++G P+ +MELA L G+ V
Sbjct: 217 RSGTCDRKS----DFKRLVWSRRFVLLFHELSMTGAPISMMELASELLSCGATV------ 266
Query: 106 NPVENDQVIYSLESKMLD----RGVQVLPAKGEKAIDTALKADMVILNTAVAGKWLDAVL 161
V+ S ++ R ++V+ KGE + TA+KAD++I +AV W+D +
Sbjct: 267 -----SAVVLSRRGGLMQELSRRRIKVVEDKGELSFKTAMKADLIIAGSAVCTSWIDQYM 321
Query: 162 KEKVSRVLPKVLWWIHEMRGHYFKE-----EYVKHLPFVAGAMIDSHTTAEYWKNRTRER 216
+ ++ WWI E R YF + VK L F+ S + + W E
Sbjct: 322 NHHPAGG-SQIAWWIMENRREYFDRAKPVLDRVKMLIFL------SESQSRQWLTWCEEE 374
Query: 217 LRIKM-PETYVVHLGNSKELMEVADD--------------SVAKRVLREHVRESLGVRSD 261
IK+ + +V L + EL VA V +++LRE VR LG+
Sbjct: 375 -HIKLRSQPVIVPLSVNDELAFVAGIPSSLNTPTLSPEKMRVKRQILRESVRTELGITDS 433
Query: 262 DLLFAIINSVSRGKGQDLFLRSFYESLQ------------FIQEKKLQLPSLHAV----- 304
D+L ++S++ KGQ L L S +L I+++K+ L S H +
Sbjct: 434 DMLVMSLSSINPTKGQLLLLESIALALSERGQESQRNHKGIIRKEKVSLSSKHRLRGSSR 493
Query: 305 -------------------------VVGSDMNAQTKFEMELRKFVIDK-KIQDRVHFVNK 338
VGS N + + E+ F+ + + V +
Sbjct: 494 QMKSVSLTLDNGLRREKQELKVLLGSVGSKSN-KVGYVKEMLSFLSNSGNLSKSVMWTPA 552
Query: 339 TLAVAPYLASIDVLVQNSQARGECFGRITIEAMAFRLPVLGTAAGGTMEIVVNGSTGLLH 398
T VA ++ DV V NSQ GE FGR+TIEAMA+ L V+GT AGGT E+V + TGLLH
Sbjct: 553 TTRVASLYSAADVYVTNSQGVGETFGRVTIEAMAYGLAVVGTDAGGTKEMVQHNMTGLLH 612
Query: 399 PVGKGGVTPLANNIVKLATHVEKRLTMGKKGYERVKERFMEKHMSDRIALVL 450
+G+ G LA+N++ L + ++RL +G +G + V++ +M++HM R VL
Sbjct: 613 SMGRSGNKELAHNLLYLLRNPDERLRLGSEGRKMVEKMYMKQHMYKRFVDVL 664
>AT3G15940.2 | Symbols: | UDP-Glycosyltransferase superfamily
protein | chr3:5393632-5396187 REVERSE LENGTH=697
Length = 697
Score = 148 bits (374), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 143/494 (28%), Positives = 213/494 (43%), Gaps = 109/494 (22%)
Query: 47 QIRSPVQDRPGPSPLDFMK---SKLVLMVSHELSLSGGPLLLMELAFLLRGVGSDVVWIT 103
Q RS DR DF + S+ +++ HELS++G P+ +MELA L G+ V
Sbjct: 217 QKRSGTCDRKS----DFKRLVWSRRFVLLFHELSMTGAPISMMELASELLSCGATVY--- 269
Query: 104 NQNPVENDQVIYSLESKMLD----RGVQVLPAKGEKAIDTALKADMVILNTAVAGKWLDA 159
V+ S +L R ++V+ KGE + TA+KAD+VI +AV W+D
Sbjct: 270 --------AVVLSRRGGLLQELTRRRIKVVEDKGELSFKTAMKADLVIAGSAVCASWIDQ 321
Query: 160 VLKEKVSRVLPKVLWWIHEMRGHYF----------------------------KEEYVK- 190
+ + ++ WW+ E R YF +E++VK
Sbjct: 322 YMDHHPAGG-SQIAWWVMENRREYFDRAKPVLDRVKLLIFLSEVQSKQWLTWCEEDHVKL 380
Query: 191 -------------HLPFVAGAMIDSHTTAEYWKNRTRERLRIKMPETYVVHLGNSKELME 237
L FVAG + S T + R K+ E+ G + + M
Sbjct: 381 RSQPVIVPLSVNDELAFVAG--VSSSLNTPTLTQETMKEKRQKLRESVRTEFGLTDKDML 438
Query: 238 VAD--------------DSVAKRVLREHVRESLGVRSDDLLFAIINSVS----------- 272
V +SVA + RE +E + R+ + +N +
Sbjct: 439 VMSLSSINPGKGQLLLLESVALALEREQTQEQVAKRNQSKIIKNLNGIRKEKISLSARHR 498
Query: 273 -RGKGQDLFLRS-----FYESLQFIQEKKLQLPS---------LHAVVVGSDMNAQTKFE 317
RG + + + S L +KL L L VGS N + +
Sbjct: 499 LRGSSRKMKITSPAVDNHPSVLSATGRRKLLLSGNVTQKQDLKLLLGSVGSKSN-KVAYV 557
Query: 318 MELRKFVIDK-KIQDRVHFVNKTLAVAPYLASIDVLVQNSQARGECFGRITIEAMAFRLP 376
E+ F+ + + + V + T VA ++ DV V NSQ GE FGR+TIEAMA+ LP
Sbjct: 558 KEMLSFLSNNGNLSNSVLWTPATTRVASLYSAADVYVTNSQGVGETFGRVTIEAMAYGLP 617
Query: 377 VLGTAAGGTMEIVVNGSTGLLHPVGKGGVTPLANNIVKLATHVEKRLTMGKKGYERVKER 436
VLGT AGGT EIV + TGLLHPVG+ G LA N++ L + RL +G +G E V++
Sbjct: 618 VLGTDAGGTKEIVEHNVTGLLHPVGRAGNKVLAQNLLFLLRNPSTRLQLGSQGREIVEKM 677
Query: 437 FMEKHMSDRIALVL 450
+M++HM R VL
Sbjct: 678 YMKQHMYKRFVDVL 691
>AT3G15940.1 | Symbols: | UDP-Glycosyltransferase superfamily
protein | chr3:5393632-5396187 REVERSE LENGTH=697
Length = 697
Score = 148 bits (374), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 143/494 (28%), Positives = 213/494 (43%), Gaps = 109/494 (22%)
Query: 47 QIRSPVQDRPGPSPLDFMK---SKLVLMVSHELSLSGGPLLLMELAFLLRGVGSDVVWIT 103
Q RS DR DF + S+ +++ HELS++G P+ +MELA L G+ V
Sbjct: 217 QKRSGTCDRKS----DFKRLVWSRRFVLLFHELSMTGAPISMMELASELLSCGATVY--- 269
Query: 104 NQNPVENDQVIYSLESKMLD----RGVQVLPAKGEKAIDTALKADMVILNTAVAGKWLDA 159
V+ S +L R ++V+ KGE + TA+KAD+VI +AV W+D
Sbjct: 270 --------AVVLSRRGGLLQELTRRRIKVVEDKGELSFKTAMKADLVIAGSAVCASWIDQ 321
Query: 160 VLKEKVSRVLPKVLWWIHEMRGHYF----------------------------KEEYVK- 190
+ + ++ WW+ E R YF +E++VK
Sbjct: 322 YMDHHPAGG-SQIAWWVMENRREYFDRAKPVLDRVKLLIFLSEVQSKQWLTWCEEDHVKL 380
Query: 191 -------------HLPFVAGAMIDSHTTAEYWKNRTRERLRIKMPETYVVHLGNSKELME 237
L FVAG + S T + R K+ E+ G + + M
Sbjct: 381 RSQPVIVPLSVNDELAFVAG--VSSSLNTPTLTQETMKEKRQKLRESVRTEFGLTDKDML 438
Query: 238 VAD--------------DSVAKRVLREHVRESLGVRSDDLLFAIINSVS----------- 272
V +SVA + RE +E + R+ + +N +
Sbjct: 439 VMSLSSINPGKGQLLLLESVALALEREQTQEQVAKRNQSKIIKNLNGIRKEKISLSARHR 498
Query: 273 -RGKGQDLFLRS-----FYESLQFIQEKKLQLPS---------LHAVVVGSDMNAQTKFE 317
RG + + + S L +KL L L VGS N + +
Sbjct: 499 LRGSSRKMKITSPAVDNHPSVLSATGRRKLLLSGNVTQKQDLKLLLGSVGSKSN-KVAYV 557
Query: 318 MELRKFVIDK-KIQDRVHFVNKTLAVAPYLASIDVLVQNSQARGECFGRITIEAMAFRLP 376
E+ F+ + + + V + T VA ++ DV V NSQ GE FGR+TIEAMA+ LP
Sbjct: 558 KEMLSFLSNNGNLSNSVLWTPATTRVASLYSAADVYVTNSQGVGETFGRVTIEAMAYGLP 617
Query: 377 VLGTAAGGTMEIVVNGSTGLLHPVGKGGVTPLANNIVKLATHVEKRLTMGKKGYERVKER 436
VLGT AGGT EIV + TGLLHPVG+ G LA N++ L + RL +G +G E V++
Sbjct: 618 VLGTDAGGTKEIVEHNVTGLLHPVGRAGNKVLAQNLLFLLRNPSTRLQLGSQGREIVEKM 677
Query: 437 FMEKHMSDRIALVL 450
+M++HM R VL
Sbjct: 678 YMKQHMYKRFVDVL 691
>AT1G78800.1 | Symbols: | UDP-Glycosyltransferase superfamily
protein | chr1:29625859-29627941 REVERSE LENGTH=403
Length = 403
Score = 48.5 bits (114), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 51/197 (25%), Positives = 78/197 (39%), Gaps = 12/197 (6%)
Query: 263 LLFAIINSVSRGKGQDLFLRSFYESLQFIQEKKLQLPSLHAVVVGS---DMNAQTKFEME 319
L F IN R K DL + +F + + K L + V G + ++ E
Sbjct: 210 LNFLSINRFERKKNIDLAVSAF----AILCKHKQNLSDVTLTVAGGYDERLKENVEYLEE 265
Query: 320 LRKFVIDKKIQDRVHFVNKTLAVAPYLASIDVLVQNSQARGECFGRITIEAMAFRLPVLG 379
LR + + DRV+F+ L E FG + +EAMA PV+
Sbjct: 266 LRSLAEKEGVSDRVNFITSCSTAERNELLSSCLCVLYTPTDEHFGIVPLEAMAAYKPVIA 325
Query: 380 TAAGGTMEIVVNGSTG-LLHPVGKGGVTPLANNIVKLATHVEKRLTMGKKGYERVKERFM 438
+GG +E V NG TG L P + + +A + + E MG + V E F
Sbjct: 326 CNSGGPVETVKNGVTGYLCEPTPEDFSSAMA----RFIENPELANRMGAEARNHVVESFS 381
Query: 439 EKHMSDRIALVLKDVLA 455
K ++ L DV++
Sbjct: 382 VKTFGQKLNQYLVDVVS 398