Miyakogusa Predicted Gene - Lj1g3v4528440.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v4528440.1 Non Characterized Hit- tr|I1JP86|I1JP86_SOYBN
Uncharacterized protein OS=Glycine max PE=3 SV=1,87.89,0,PLP-dependent
transferases,Pyridoxal phosphate-dependent transferase, major domain;
seg,NULL; AROMAT,CUFF.32622.1
(488 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                    Score     E
Sequences producing significant alignments:                         (bits) Value
AT2G20340.1 | Symbols: | Pyridoxal phosphate (PLP)-dependent tr...     755  0.0
AT4G28680.1 | Symbols: TYRDC, TYRDC1 | L-tyrosine decarboxylase ...    662  0.0
AT4G28680.2 | Symbols: TYRDC, TYRDC1 | L-tyrosine decarboxylase ...    659  0.0
AT4G28680.4 | Symbols: TYRDC | L-tyrosine decarboxylase | chr4:1...    659  0.0
AT4G28680.3 | Symbols: TYRDC | L-tyrosine decarboxylase | chr4:1...    658  0.0
AT1G43710.1 | Symbols: emb1075 | Pyridoxal phosphate (PLP)-depen...     55  1e-07
>AT2G20340.1 | Symbols: | Pyridoxal phosphate (PLP)-dependent
transferases superfamily protein | chr2:8779804-8782490
FORWARD LENGTH=490
Length = 490
Score = 755 bits (1949), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/483 (72%), Positives = 413/483 (85%)
Query: 3 GNGLRPMDAEQLREQGHKMVDFIADYYKTIENYPVLSQVEPGYLGKLLPDSAPTYPESLQ 62
G L+PMD+EQLRE GH MVDFIADYYKTIE++PVLSQV+PGYL KLLPDSAP +PE+L
Sbjct: 6 GKVLKPMDSEQLREYGHLMVDFIADYYKTIEDFPVLSQVQPGYLHKLLPDSAPDHPETLD 65
Query: 63 QVLDDVKEKILPGVTHWQSPNYFAYFPSNSSIAGFLGEMLSAGINIVGFSWITSPAATEL 122
QVLDDV+ KILPGVTHWQSP++FAY+PSNSS+AGFLGEMLSAG+ IVGFSW+TSPAATEL
Sbjct: 66 QVLDDVRAKILPGVTHWQSPSFFAYYPSNSSVAGFLGEMLSAGLGIVGFSWVTSPAATEL 125
Query: 123 ETIVLDWLAKALNLPDNFYSTGQGGGVIQGTASEXXXXXXXXXRDKILRRVGRDALPKLV 182
E IVLDW+AK LNLP+ F S G GGGVIQG+ASE RDK+LR VG++AL KLV
Sbjct: 126 EMIVLDWVAKLLNLPEQFMSKGNGGGVIQGSASEAVLVVLIAARDKVLRSVGKNALEKLV 185
Query: 183 TYASDQTHSALQKACQIAGLNPELCRLLKTDSSTNFALSPDVLSEAISRDIASGLIPFFL 242
Y+SDQTHSALQKACQIAG++PE CR+L TDSSTN+AL P+ L EA+SRD+ +GLIPFFL
Sbjct: 186 VYSSDQTHSALQKACQIAGIHPENCRVLTTDSSTNYALRPESLQEAVSRDLEAGLIPFFL 245
Query: 243 CATVGTTSSTAVDPLPALAEIAKTNTIWFHVDAAYAGSASICPEYRHHIDGVEEADSFNM 302
CA VGTTSSTAVDPL AL +IA +N IWFHVDAAYAGSA ICPEYR +IDGVE ADSFNM
Sbjct: 246 CANVGTTSSTAVDPLAALGKIANSNGIWFHVDAAYAGSACICPEYRQYIDGVETADSFNM 305
Query: 303 NAHKWFLTNFDCSLLWVKDRSALIQSLSTNPEFLKNKATQGNLVIDYKDWQIPLGRRFRS 362
NAHKWFLTNFDCSLLWVKD+ +L +LSTNPEFLKNKA+Q NLV+DYKDWQIPLGRRFRS
Sbjct: 306 NAHKWFLTNFDCSLLWVKDQDSLTLALSTNPEFLKNKASQANLVVDYKDWQIPLGRRFRS 365
Query: 363 LKLWMVLRLYGLEGLQSHIRSHIAMAAYFEELVGQDTRFKAVAPRTFSLICFRLLPPSNS 422
LKLWMVLRLYG E L+S+IR+HI +A FE+LV QD F+ V PR F+L+CFRL+P +
Sbjct: 366 LKLWMVLRLYGSETLKSYIRNHIKLAKEFEQLVSQDPNFEIVTPRIFALVCFRLVPVKDE 425
Query: 423 EDHGNKLNRDLLESVNSTGKIFITHTVLSGEYILRLAVGAPLTEKRHVTMAWQILQDKAT 482
E N NR+LL++VNS+GK+F++HT LSG+ +LR A+GAPLTE++HV AW+I+Q++A+
Sbjct: 426 EKKCNNRNRELLDAVNSSGKLFMSHTALSGKIVLRCAIGAPLTEEKHVKEAWKIIQEEAS 485
Query: 483 ALL 485
LL
Sbjct: 486 YLL 488
>AT4G28680.1 | Symbols: TYRDC, TYRDC1 | L-tyrosine decarboxylase |
chr4:14155248-14158546 FORWARD LENGTH=545
Length = 545
Score = 662 bits (1709), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 308/483 (63%), Positives = 379/483 (78%), Gaps = 4/483 (0%)
Query: 6 LRPMDAEQLREQGHKMVDFIADYYKTIEN----YPVLSQVEPGYLGKLLPDSAPTYPESL 61
++PMD+E LREQGH MVDFIADYYK +++ +PVLSQV+PGYL +LPDSAP PESL
Sbjct: 57 MKPMDSELLREQGHIMVDFIADYYKNLQDSPQDFPVLSQVQPGYLRDMLPDSAPERPESL 116
Query: 62 QQVLDDVKEKILPGVTHWQSPNYFAYFPSNSSIAGFLGEMLSAGINIVGFSWITSPAATE 121
+++LDDV +KI+PG+THWQSP+YFAY+ S++S+AGFLGEML+AG+++VGF+W+TSPAATE
Sbjct: 117 KELLDDVSKKIMPGITHWQSPSYFAYYASSTSVAGFLGEMLNAGLSVVGFTWLTSPAATE 176
Query: 122 LETIVLDWLAKALNLPDNFYSTGQGGGVIQGTASEXXXXXXXXXRDKILRRVGRDALPKL 181
LE IVLDWLAK L LPD+F STG GGGVIQGT E RD+IL++VG+ LP+L
Sbjct: 177 LEIIVLDWLAKLLQLPDHFLSTGNGGGVIQGTGCEAVLVVVLAARDRILKKVGKTLLPQL 236
Query: 182 VTYASDQTHSALQKACQIAGLNPELCRLLKTDSSTNFALSPDVLSEAISRDIASGLIPFF 241
V Y SDQTHS+ +KAC I G++ E RLLKTDSSTN+ + P+ L EAIS D+A G IPFF
Sbjct: 237 VVYGSDQTHSSFRKACLIGGIHEENIRLLKTDSSTNYGMPPESLEEAISHDLAKGFIPFF 296
Query: 242 LCATVGTTSSTAVDPLPALAEIAKTNTIWFHVDAAYAGSASICPEYRHHIDGVEEADSFN 301
+CATVGTTSS AVDPL L IAK IW HVDAAYAG+A ICPEYR IDG+E ADSFN
Sbjct: 297 ICATVGTTSSAAVDPLVPLGNIAKKYGIWLHVDAAYAGNACICPEYRKFIDGIENADSFN 356
Query: 302 MNAHKWFLTNFDCSLLWVKDRSALIQSLSTNPEFLKNKATQGNLVIDYKDWQIPLGRRFR 361
MNAHKW N CS LWVKDR +LI +L TNPE+L+ K ++ + V++YKDWQI L RRFR
Sbjct: 357 MNAHKWLFANQTCSPLWVKDRYSLIDALKTNPEYLEFKVSKKDTVVNYKDWQISLSRRFR 416
Query: 362 SLKLWMVLRLYGLEGLQSHIRSHIAMAAYFEELVGQDTRFKAVAPRTFSLICFRLLPPSN 421
SLKLWMVLRLYG E L++ IR H+ +A +FE+ V QD F+ V R FSL+CFRL P
Sbjct: 417 SLKLWMVLRLYGSENLRNFIRDHVNLAKHFEDYVAQDPSFEVVTTRYFSLVCFRLAPVDG 476
Query: 422 SEDHGNKLNRDLLESVNSTGKIFITHTVLSGEYILRLAVGAPLTEKRHVTMAWQILQDKA 481
ED N+ NR+LL +VNSTGKIFI+HT LSG+++LR AVGAPLTE++HVT AWQI+Q A
Sbjct: 477 DEDQCNERNRELLAAVNSTGKIFISHTALSGKFVLRFAVGAPLTEEKHVTEAWQIIQKHA 536
Query: 482 TAL 484
+
Sbjct: 537 SKF 539
>AT4G28680.2 | Symbols: TYRDC, TYRDC1 | L-tyrosine decarboxylase |
chr4:14155248-14158546 FORWARD LENGTH=547
Length = 547
Score = 659 bits (1701), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 309/485 (63%), Positives = 379/485 (78%), Gaps = 6/485 (1%)
Query: 6 LRPMDAEQLREQGHKMVDFIADYYKTIEN----YPVLSQVEPGYLGKLLPDSAPTYPESL 61
++PMD+E LREQGH MVDFIADYYK +++ +PVLSQV+PGYL +LPDSAP PESL
Sbjct: 57 MKPMDSELLREQGHIMVDFIADYYKNLQDSPQDFPVLSQVQPGYLRDMLPDSAPERPESL 116
Query: 62 QQVLDDVKEKILPGVTHWQSPNYFAYFPSNSSIAGFLGEMLSAGINIVGFSWITSPAATE 121
+++LDDV +KI+PG+THWQSP+YFAY+ S++S+AGFLGEML+AG+++VGF+W+TSPAATE
Sbjct: 117 KELLDDVSKKIMPGITHWQSPSYFAYYASSTSVAGFLGEMLNAGLSVVGFTWLTSPAATE 176
Query: 122 LETIVLDWLAKALNLPDNFYSTGQGGGVIQGTASEXXXXXXXXXRDKILRRVGRDALPKL 181
LE IVLDWLAK L LPD+F STG GGGVIQGT E RD+IL++VG+ LP+L
Sbjct: 177 LEIIVLDWLAKLLQLPDHFLSTGNGGGVIQGTGCEAVLVVVLAARDRILKKVGKTLLPQL 236
Query: 182 VTYASDQTHSALQKACQIAGLNPELCRLLKTDSSTNFALSPDVLSEAISRDIASGLIPFF 241
V Y SDQTHS+ +KAC I G++ E RLLKTDSSTN+ + P+ L EAIS D+A G IPFF
Sbjct: 237 VVYGSDQTHSSFRKACLIGGIHEENIRLLKTDSSTNYGMPPESLEEAISHDLAKGFIPFF 296
Query: 242 LCATVGTTSSTAVDPLPALAEIAKTNTIWFHVDAAYAGSASICPEYRHHIDGVEEADSFN 301
+CATVGTTSS AVDPL L IAK IW HVDAAYAG+A ICPEYR IDG+E ADSFN
Sbjct: 297 ICATVGTTSSAAVDPLVPLGNIAKKYGIWLHVDAAYAGNACICPEYRKFIDGIENADSFN 356
Query: 302 MNAHKWFLTNFDCSLLWVKDRSALIQSLSTNPEFL--KNKATQGNLVIDYKDWQIPLGRR 359
MNAHKW N CS LWVKDR +LI +L TNPE+L K K ++ + V++YKDWQI L RR
Sbjct: 357 MNAHKWLFANQTCSPLWVKDRYSLIDALKTNPEYLEFKVKVSKKDTVVNYKDWQISLSRR 416
Query: 360 FRSLKLWMVLRLYGLEGLQSHIRSHIAMAAYFEELVGQDTRFKAVAPRTFSLICFRLLPP 419
FRSLKLWMVLRLYG E L++ IR H+ +A +FE+ V QD F+ V R FSL+CFRL P
Sbjct: 417 FRSLKLWMVLRLYGSENLRNFIRDHVNLAKHFEDYVAQDPSFEVVTTRYFSLVCFRLAPV 476
Query: 420 SNSEDHGNKLNRDLLESVNSTGKIFITHTVLSGEYILRLAVGAPLTEKRHVTMAWQILQD 479
ED N+ NR+LL +VNSTGKIFI+HT LSG+++LR AVGAPLTE++HVT AWQI+Q
Sbjct: 477 DGDEDQCNERNRELLAAVNSTGKIFISHTALSGKFVLRFAVGAPLTEEKHVTEAWQIIQK 536
Query: 480 KATAL 484
A+
Sbjct: 537 HASKF 541
>AT4G28680.4 | Symbols: TYRDC | L-tyrosine decarboxylase |
chr4:14155184-14158546 FORWARD LENGTH=538
Length = 538
Score = 659 bits (1699), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 308/485 (63%), Positives = 380/485 (78%), Gaps = 6/485 (1%)
Query: 6 LRPMDAEQLREQGHKMVDFIADYYKTIEN----YPVLSQVEPGYLGKLLPDSAPTYPESL 61
++PMD+E LREQGH MVDFIADYYK +++ +PVLSQV+PGYL +LPDSAP PESL
Sbjct: 48 MKPMDSELLREQGHIMVDFIADYYKNLQDSPQDFPVLSQVQPGYLRDMLPDSAPERPESL 107
Query: 62 QQVLDDVKEKILPGVTHWQSPNYFAYFPSNSSIAGFLGEMLSAGINIVGFSWITSPAATE 121
+++LDDV +KI+PG+THWQSP+YFAY+ S++S+AGFLGEML+AG+++VGF+W+TSPAATE
Sbjct: 108 KELLDDVSKKIMPGITHWQSPSYFAYYASSTSVAGFLGEMLNAGLSVVGFTWLTSPAATE 167
Query: 122 LETIVLDWLAKALNLPDNFYSTGQG--GGVIQGTASEXXXXXXXXXRDKILRRVGRDALP 179
LE IVLDWLAK L LPD+F STG+G GGVIQGT E RD+IL++VG+ LP
Sbjct: 168 LEIIVLDWLAKLLQLPDHFLSTGKGNGGGVIQGTGCEAVLVVVLAARDRILKKVGKTLLP 227
Query: 180 KLVTYASDQTHSALQKACQIAGLNPELCRLLKTDSSTNFALSPDVLSEAISRDIASGLIP 239
+LV Y SDQTHS+ +KAC I G++ E RLLKTDSSTN+ + P+ L EAIS D+A G IP
Sbjct: 228 QLVVYGSDQTHSSFRKACLIGGIHEENIRLLKTDSSTNYGMPPESLEEAISHDLAKGFIP 287
Query: 240 FFLCATVGTTSSTAVDPLPALAEIAKTNTIWFHVDAAYAGSASICPEYRHHIDGVEEADS 299
FF+CATVGTTSS AVDPL L IAK IW HVDAAYAG+A ICPEYR IDG+E ADS
Sbjct: 288 FFICATVGTTSSAAVDPLVPLGNIAKKYGIWLHVDAAYAGNACICPEYRKFIDGIENADS 347
Query: 300 FNMNAHKWFLTNFDCSLLWVKDRSALIQSLSTNPEFLKNKATQGNLVIDYKDWQIPLGRR 359
FNMNAHKW N CS LWVKDR +LI +L TNPE+L+ K ++ + V++YKDWQI L RR
Sbjct: 348 FNMNAHKWLFANQTCSPLWVKDRYSLIDALKTNPEYLEFKVSKKDTVVNYKDWQISLSRR 407
Query: 360 FRSLKLWMVLRLYGLEGLQSHIRSHIAMAAYFEELVGQDTRFKAVAPRTFSLICFRLLPP 419
FRSLKLWMVLRLYG E L++ IR H+ +A +FE+ V QD F+ V R FSL+CFRL P
Sbjct: 408 FRSLKLWMVLRLYGSENLRNFIRDHVNLAKHFEDYVAQDPSFEVVTTRYFSLVCFRLAPV 467
Query: 420 SNSEDHGNKLNRDLLESVNSTGKIFITHTVLSGEYILRLAVGAPLTEKRHVTMAWQILQD 479
ED N+ NR+LL +VNSTGKIFI+HT LSG+++LR AVGAPLTE++HVT AWQI+Q
Sbjct: 468 DGDEDQCNERNRELLAAVNSTGKIFISHTALSGKFVLRFAVGAPLTEEKHVTEAWQIIQK 527
Query: 480 KATAL 484
A+
Sbjct: 528 HASKF 532
>AT4G28680.3 | Symbols: TYRDC | L-tyrosine decarboxylase |
chr4:14155248-14158546 FORWARD LENGTH=547
Length = 547
Score = 658 bits (1698), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 308/485 (63%), Positives = 380/485 (78%), Gaps = 6/485 (1%)
Query: 6 LRPMDAEQLREQGHKMVDFIADYYKTIEN----YPVLSQVEPGYLGKLLPDSAPTYPESL 61
++PMD+E LREQGH MVDFIADYYK +++ +PVLSQV+PGYL +LPDSAP PESL
Sbjct: 57 MKPMDSELLREQGHIMVDFIADYYKNLQDSPQDFPVLSQVQPGYLRDMLPDSAPERPESL 116
Query: 62 QQVLDDVKEKILPGVTHWQSPNYFAYFPSNSSIAGFLGEMLSAGINIVGFSWITSPAATE 121
+++LDDV +KI+PG+THWQSP+YFAY+ S++S+AGFLGEML+AG+++VGF+W+TSPAATE
Sbjct: 117 KELLDDVSKKIMPGITHWQSPSYFAYYASSTSVAGFLGEMLNAGLSVVGFTWLTSPAATE 176
Query: 122 LETIVLDWLAKALNLPDNFYSTGQG--GGVIQGTASEXXXXXXXXXRDKILRRVGRDALP 179
LE IVLDWLAK L LPD+F STG+G GGVIQGT E RD+IL++VG+ LP
Sbjct: 177 LEIIVLDWLAKLLQLPDHFLSTGKGNGGGVIQGTGCEAVLVVVLAARDRILKKVGKTLLP 236
Query: 180 KLVTYASDQTHSALQKACQIAGLNPELCRLLKTDSSTNFALSPDVLSEAISRDIASGLIP 239
+LV Y SDQTHS+ +KAC I G++ E RLLKTDSSTN+ + P+ L EAIS D+A G IP
Sbjct: 237 QLVVYGSDQTHSSFRKACLIGGIHEENIRLLKTDSSTNYGMPPESLEEAISHDLAKGFIP 296
Query: 240 FFLCATVGTTSSTAVDPLPALAEIAKTNTIWFHVDAAYAGSASICPEYRHHIDGVEEADS 299
FF+CATVGTTSS AVDPL L IAK IW HVDAAYAG+A ICPEYR IDG+E ADS
Sbjct: 297 FFICATVGTTSSAAVDPLVPLGNIAKKYGIWLHVDAAYAGNACICPEYRKFIDGIENADS 356
Query: 300 FNMNAHKWFLTNFDCSLLWVKDRSALIQSLSTNPEFLKNKATQGNLVIDYKDWQIPLGRR 359
FNMNAHKW N CS LWVKDR +LI +L TNPE+L+ K ++ + V++YKDWQI L RR
Sbjct: 357 FNMNAHKWLFANQTCSPLWVKDRYSLIDALKTNPEYLEFKVSKKDTVVNYKDWQISLSRR 416
Query: 360 FRSLKLWMVLRLYGLEGLQSHIRSHIAMAAYFEELVGQDTRFKAVAPRTFSLICFRLLPP 419
FRSLKLWMVLRLYG E L++ IR H+ +A +FE+ V QD F+ V R FSL+CFRL P
Sbjct: 417 FRSLKLWMVLRLYGSENLRNFIRDHVNLAKHFEDYVAQDPSFEVVTTRYFSLVCFRLAPV 476
Query: 420 SNSEDHGNKLNRDLLESVNSTGKIFITHTVLSGEYILRLAVGAPLTEKRHVTMAWQILQD 479
ED N+ NR+LL +VNSTGKIFI+HT LSG+++LR AVGAPLTE++HVT AWQI+Q
Sbjct: 477 DGDEDQCNERNRELLAAVNSTGKIFISHTALSGKFVLRFAVGAPLTEEKHVTEAWQIIQK 536
Query: 480 KATAL 484
A+
Sbjct: 537 HASKF 541
>AT1G43710.1 | Symbols: emb1075 | Pyridoxal phosphate
(PLP)-dependent transferases superfamily protein |
chr1:16486534-16488298 REVERSE LENGTH=482
Length = 482
Score = 55.1 bits (131), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 73/323 (22%), Positives = 129/323 (39%), Gaps = 44/323 (13%)
Query: 80 QSPNYFAYFPSNSSIA-GFLGEMLSAGINIVGFSWITSPAATE---LETIVLDWLAKALN 135
++ N+ Y P N G LG++ IN +G +I S E VLDW A+
Sbjct: 102 RTKNHLGY-PYNLDFDYGALGQLQHFSINNLGDPFIESNYGVHSRPFEVGVLDWFARLWE 160
Query: 136 LPDNFYSTGQGGGVIQGTASEXXXXXXXXXRDKILRRVGRDALPKLVTYASDQTHSALQK 195
+ + Y G I +E VGR+ P + YAS ++H ++ K
Sbjct: 161 IERDDY-----WGYITNCGTEGNLHGIL---------VGREMFPDGILYASRESHYSVFK 206
Query: 196 ACQIAGLNPELCRLLKTDSSTNFALSPDVLSEAISRDIASGLIPFFLCATVGTTSSTAVD 255
A ++ + E K D+ + + D L + + +A+ P L +GTT AVD
Sbjct: 207 AARMYRMECE-----KVDTLMSGEIDCDDLRKKL---LANKDKPAILNVNIGTTVKGAVD 258
Query: 256 PLPALAEIAKT-----NTIWFHVDAAYAGSASICPEYRHHIDGVEEADSFNMNAHKWFLT 310
L + + + + + H D A G + + + S +++ HK+
Sbjct: 259 DLDLVIKTLEECGFSHDRFYIHCDGALFGLMMPFVKRAPKVTFNKPIGSVSVSGHKFVGC 318
Query: 311 NFDCSLLWVKDRSALIQSLSTNPEFLKNKATQGNLVIDYKDWQIPLGRRFRSLKLWMVLR 370
C + R I+ LS+N E+L A++ ++ ++ PL LW L
Sbjct: 319 PMPCGVQIT--RMEHIKVLSSNVEYL---ASRDATIMGSRNGHAPLF-------LWYTLN 366
Query: 371 LYGLEGLQSHIRSHIAMAAYFEE 393
G +G Q ++ + A Y ++
Sbjct: 367 RKGYKGFQKEVQKCLRNAHYLKD 389