Miyakogusa Predicted Gene
- Lj3g3v2097100.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v2097100.1 Non Characterized Hit- tr|I1KGS8|I1KGS8_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.42752
PE,85.54,0,seg,NULL; Acetyltransf_1,GNAT domain; PUTATIVE
UNCHARACTERIZED PROTEIN,NULL; N-TERMINAL ACETYLTRANSF,CUFF.43573.1
(407 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                    Score    E
Sequences producing significant alignments:                        (bits)  Value
AT4G37580.1 | Symbols: HLS1, COP3, UNS2 | Acyl-CoA N-acyltransfe...   360   1e-99
AT2G23060.1 | Symbols: | Acyl-CoA N-acyltransferases (NAT) supe...    354   6e-98
AT5G67430.1 | Symbols: | Acyl-CoA N-acyltransferases (NAT) supe...    310   1e-84
AT2G23060.2 | Symbols: | Acyl-CoA N-acyltransferases (NAT) supe...    306   1e-83
AT2G30090.1 | Symbols: | Acyl-CoA N-acyltransferases (NAT) supe...    251   6e-67
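The summary table above can be read programmatically. The following is a minimal Python sketch, assuming each hit line ends in two whitespace-separated fields (bit score, then E-value) and that the subject ID is everything before the first " | ". The name parse_hit_line is illustrative and not part of BLAST.

```python
# Illustrative parser for the hit summary lines of this report.
# Assumption: the last two whitespace-separated fields are bit score and E-value.
SUMMARY = """\
AT4G37580.1 | Symbols: HLS1, COP3, UNS2 | Acyl-CoA N-acyltransfe... 360 1e-99
AT2G23060.1 | Symbols: | Acyl-CoA N-acyltransferases (NAT) supe... 354 6e-98
AT5G67430.1 | Symbols: | Acyl-CoA N-acyltransferases (NAT) supe... 310 1e-84
AT2G23060.2 | Symbols: | Acyl-CoA N-acyltransferases (NAT) supe... 306 1e-83
AT2G30090.1 | Symbols: | Acyl-CoA N-acyltransferases (NAT) supe... 251 6e-67
"""


def parse_hit_line(line):
    """Return (subject_id, description, bit_score, e_value) for one line."""
    # Split from the right so spaces inside the description are preserved.
    body, score, evalue = line.rsplit(None, 2)
    subject_id, _, description = body.partition(" | ")
    # Legacy BLAST may print very small E-values as "e-105"; float() needs "1e-105".
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return subject_id.strip(), description.strip(), float(score), float(evalue)


hits = [parse_hit_line(line) for line in SUMMARY.splitlines() if line.strip()]

for subject_id, _, bits, evalue in hits:
    print(f"{subject_id}\t{bits:.0f}\t{evalue:.0e}")
```

Splitting from the right means the parser does not depend on how many spaces separate the columns, so it works on both collapsed and column-aligned copies of the table.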
>AT4G37580.1 | Symbols: HLS1, COP3, UNS2 | Acyl-CoA
N-acyltransferases (NAT) superfamily protein |
chr4:17658932-17660564 FORWARD LENGTH=403
Length = 403
Score = 360 bits (923), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 188/405 (46%), Positives = 255/405 (62%), Gaps = 33/405 (8%)
Query: 28 YDEERHKVAVEKLEKLCEIGQRGKPSLVTDLMGDPICRIRHFQLHVMLVAXXXXXXXXX- 86
YD R V VE +E+ CE+G GK SL TDL+GDPICRIRH ++MLVA
Sbjct: 7 YDPTRDLVGVEDVERRCEVGPSGKLSLFTDLLGDPICRIRHSPSYLMLVAEMGTEKKEIV 66
Query: 87 -XIRGCVKTVTRGDSV--------------YVKLAYVLGLRVSPQHRRLGIGTKLVENLE 131
IRGC+KTVT G + Y KLAYVLGLRVSP HRR GIG KLV+ +E
Sbjct: 67 GMIRGCIKTVTCGQKLDLNHKSQNDVVKPLYTKLAYVLGLRVSPFHRRQGIGFKLVKMME 126
Query: 132 EWCKQKGAKYTYMATDCTNEPSINLFTKKCGYSKFRSLTMLVQPVHAHYKPISRSIAVLH 191
EW +Q GA+Y+Y+AT+ N+ S+NLFT KCGYS+FR+ ++LV PV+AH +SR + V+
Sbjct: 127 EWFRQNGAEYSYIATENDNQASVNLFTGKCGYSEFRTPSILVNPVYAHRVNVSRRVTVIK 186
Query: 192 IPPSLAKSMYNHMFANSEFYPKDIDSILSNKLNLGTFMAIPKKHLSKCDLTRGIL----- 246
+ P A+++Y F+ +EF+P+DIDS+L+NKL+LGTF+A+P+
Sbjct: 187 LEPVDAETLYRIRFSTTEFFPRDIDSVLNNKLSLGTFVAVPRGSCYGSGSGSWPGSAKFL 246
Query: 247 ---PPSYAMLSVWNTKEVFKLQVKGVSPLAHACCVGTRLLDEWLPWLRLPSFPDIFRPFG 303
P S+A+LSVWN K+ F L+V+G S L TR++D+ LP+L+LPS P +F PFG
Sbjct: 247 EYPPESWAVLSVWNCKDSFLLEVRGASRLRRVVAKTTRVVDKTLPFLKLPSIPSVFEPFG 306
Query: 304 VYFLYGLHMEGKRGHHLMKGLCGFVHNMARDDGGCGAVVAELGQRDPVRDAVPHWRKFSW 363
++F+YG+ EG R ++K LC HN+A+ GGCG V AE+ DP+R +PHW+ S
Sbjct: 307 LHFMYGIGGEGPRAVKMVKSLCAHAHNLAK-AGGCGVVAAEVAGEDPLRRGIPHWKVLSC 365
Query: 364 AEDMWCIKNL-EDAKQGIVPEKCGPSGFFTSRSSSPVIFVDPRDF 407
ED+WCIK L +D G+V G +T IFVDPR+F
Sbjct: 366 DEDLWCIKRLGDDYSDGVV-------GDWTKSPPGVSIFVDPREF 403
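As a rough cross-check of the Expect value reported for this hit, the Karlin-Altschul relation E ~ m * n * 2^(-S) can be evaluated with the raw query length (407 letters) and database size (14,482,855 letters) given in this report. This is only a sketch: BLAST uses effective (edge-corrected) lengths and an unrounded bit score, so agreement with the reported 1e-99 should be expected only to within about an order of magnitude. The name approx_evalue is illustrative.

```python
def approx_evalue(bit_score, query_len, db_letters):
    """Approximate E-value from a bit score: E ~ m * n * 2**(-S).

    Assumption: raw lengths are used in place of BLAST's effective lengths,
    so the result is an upper-side estimate, not the reported value.
    """
    return query_len * db_letters * 2.0 ** (-bit_score)


# Values taken from this report: top hit score, query length, database letters.
e = approx_evalue(360, 407, 14_482_855)
print(f"approximate E-value: {e:.1e}")
```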
>AT2G23060.1 | Symbols: | Acyl-CoA N-acyltransferases (NAT)
superfamily protein | chr2:9812839-9814633 REVERSE
LENGTH=413
Length = 413
Score = 354 bits (909), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 185/412 (44%), Positives = 256/412 (62%), Gaps = 40/412 (9%)
Query: 28 YDEERHKVAVEKLEKLCEIGQRGKPSLVTDLMGDPICRIRHFQLHVMLVAX---XXXXXX 84
YD + VE +E+ CE+G GK SL TDL+GDPICR+RH ++MLVA
Sbjct: 10 YDPSKDLATVEDVERRCEVGPAGKLSLFTDLLGDPICRVRHSPSYLMLVAEIGPKEKKEL 69
Query: 85 XXXIRGCVKTVTRGDS--------------------VYVKLAYVLGLRVSPQHRRLGIGT 124
IRGC+KTVT G + +Y KLAY+LGLRVSP HRR GIG
Sbjct: 70 VGMIRGCIKTVTCGITTKRLDLTHNKSQNDVVITKPLYTKLAYILGLRVSPTHRRQGIGF 129
Query: 125 KLVENLEEWCKQKGAKYTYMATDCTNEPSINLFTKKCGYSKFRSLTMLVQPVHAHYKPIS 184
KLV+ +E+W Q GA+Y+Y AT+ N S+NLFT KCGY++FR+ ++LV PV+AH IS
Sbjct: 130 KLVKAMEDWFSQNGAEYSYFATENDNHASVNLFTGKCGYAEFRTPSILVNPVYAHRVNIS 189
Query: 185 RSIAVLHIPPSLAKSMYNHMFANSEFYPKDIDSILSNKLNLGTFMAIPK--------KHL 236
R + V+ + PS A+ +Y F+ +EF+P+DIDS+L+NKL+LGTF+A+P+ +
Sbjct: 190 RRVTVIKLEPSDAELLYRLRFSTTEFFPRDIDSVLNNKLSLGTFVAVPRGSCYGSGSRSW 249
Query: 237 SKCDLTRGILPPSYAMLSVWNTKEVFKLQVKGVSPLAHACCVGTRLLDEWLPWLRLPSFP 296
P S+A+LSVWN K+ F+L+V+G S L TR++D+ LP+L++PS P
Sbjct: 250 PGSAKFLEYPPDSWAVLSVWNCKDSFRLEVRGASRLRRVVSKATRMVDKTLPFLKIPSIP 309
Query: 297 DIFRPFGVYFLYGLHMEGKRGHHLMKGLCGFVHNMARDDGGCGAVVAELGQRDPVRDAVP 356
+FRPFG++F+YG+ EG R ++K LC HN+A+ +GGCG V AE+ +P+R +P
Sbjct: 310 AVFRPFGLHFMYGIGGEGPRAEKMVKALCDHAHNLAK-EGGCGVVAAEVAGEEPLRRGIP 368
Query: 357 HWRKFSWAEDMWCIKNL-EDAKQGIVPEKCGPSGFFTSRSSSPVIFVDPRDF 407
HW+ S AED+WCIK L ED G V G +T IFVDPR+F
Sbjct: 369 HWKVLSCAEDLWCIKRLGEDYSDGSV-------GDWTKSPPGDSIFVDPREF 413
>AT5G67430.1 | Symbols: | Acyl-CoA N-acyltransferases (NAT)
superfamily protein | chr5:26910429-26911856 FORWARD
LENGTH=386
Length = 386
Score = 310 bits (795), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 171/395 (43%), Positives = 240/395 (60%), Gaps = 36/395 (9%)
Query: 28 YDEERHKVAVEKLEKLCEIGQRGKPSLVTDLMGDPICRIRHFQLHVMLVAXXXXXXXXXX 87
YD +R +VE+LE+ CE+G SL+ DLMGDP+ RIR MLVA
Sbjct: 13 YDPKRDLTSVEELEESCEVG-----SLLVDLMGDPLARIRQSPSFHMLVAEIGNEIVGM- 66
Query: 88 IRGCVKTVTRG-------DSV-----YVKLAYVLGLRVSPQHRRLGIGTKLVENLEEWCK 135
IRG +K VTRG D V KLA+V GLRVSP +RR+GIG KLV+ LEEW
Sbjct: 67 IRGTIKMVTRGVNALRQADDVSPEINTTKLAFVSGLRVSPFYRRMGIGLKLVQRLEEWFL 126
Query: 136 QKGAKYTYMATDCTNEPSINLFTKKCGYSKFRSLTMLVQPVHAHYKPISRSIAVLHIPPS 195
+ A Y+Y+ T+ N S+ LFT+K GYSKFR+ T LV PV H +SR + ++ + PS
Sbjct: 127 RNDAVYSYVQTENDNIASVKLFTEKSGYSKFRTPTFLVNPVFNHRVTVSRRVKIIKLAPS 186
Query: 196 LAKSMYNHMFANSEFYPKDIDSILSNKLNLGTFMAIPKKHLSKCDLTRGILPP---SYAM 252
A+S+Y + F+ +EF+P DI+SIL+NKL+LGT++A+P+ D G LP S+A+
Sbjct: 187 DAESLYRNRFSTTEFFPSDINSILTNKLSLGTYLAVPRGG----DNVSGSLPDQTGSWAV 242
Query: 253 LSVWNTKEVFKLQVKGVSPLAHACCVGTRLLDEWLPWLRLPSFPDIFRPFGVYFLYGLHM 312
+S+WN+K+V++LQVKG S L TR+ D P+L++PSFP++F+ F ++F+YG+
Sbjct: 243 ISIWNSKDVYRLQVKGASRLKRMLAKSTRVFDGAFPFLKIPSFPNLFKSFAMHFMYGIGG 302
Query: 313 EGKRGHHLMKGLCGFVHNMARDDGGCGAVVAELGQRDPVRDAVPHWRKFSWAEDMWCIKN 372
EG R +++ LC HN+AR GC V AE+ +P+R +PHW+ S ED+WC+K
Sbjct: 303 EGPRAAEMVEALCSHAHNLAR-KSGCAVVAAEVASCEPLRVGIPHWKVLS-PEDLWCLKR 360
Query: 373 LEDAKQGIVPEKCGPSGFFTSRSSSPVIFVDPRDF 407
L G+ K P G IFVDPR+
Sbjct: 361 LRYDDDGVDWTKS-PPGL--------SIFVDPREI 386
>AT2G23060.2 | Symbols: | Acyl-CoA N-acyltransferases (NAT)
superfamily protein | chr2:9812839-9814199 REVERSE
LENGTH=358
Length = 358
Score = 306 bits (785), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 160/349 (45%), Positives = 223/349 (63%), Gaps = 37/349 (10%)
Query: 88 IRGCVKTVTRGDS--------------------VYVKLAYVLGLRVSPQHRRLGIGTKLV 127
IRGC+KTVT G + +Y KLAY+LGLRVSP HRR GIG KLV
Sbjct: 18 IRGCIKTVTCGITTKRLDLTHNKSQNDVVITKPLYTKLAYILGLRVSPTHRRQGIGFKLV 77
Query: 128 ENLEEWCKQKGAKYTYMATDCTNEPSINLFTKKCGYSKFRSLTMLVQPVHAHYKPISRSI 187
+ +E+W Q GA+Y+Y AT+ N S+NLFT KCGY++FR+ ++LV PV+AH ISR +
Sbjct: 78 KAMEDWFSQNGAEYSYFATENDNHASVNLFTGKCGYAEFRTPSILVNPVYAHRVNISRRV 137
Query: 188 AVLHIPPSLAKSMYNHMFANSEFYPKDIDSILSNKLNLGTFMAIPKK--HLSKCDLTRGI 245
V+ + PS A+ +Y F+ +EF+P+DIDS+L+NKL+LGTF+A+P+ + S G
Sbjct: 138 TVIKLEPSDAELLYRLRFSTTEFFPRDIDSVLNNKLSLGTFVAVPRGSCYGSGSRSWPGS 197
Query: 246 L------PPSYAMLSVWNTKEVFKLQVKGVSPLAHACCVGTRLLDEWLPWLRLPSFPDIF 299
P S+A+LSVWN K+ F+L+V+G S L TR++D+ LP+L++PS P +F
Sbjct: 198 AKFLEYPPDSWAVLSVWNCKDSFRLEVRGASRLRRVVSKATRMVDKTLPFLKIPSIPAVF 257
Query: 300 RPFGVYFLYGLHMEGKRGHHLMKGLCGFVHNMARDDGGCGAVVAELGQRDPVRDAVPHWR 359
RPFG++F+YG+ EG R ++K LC HN+A+ +GGCG V AE+ +P+R +PHW+
Sbjct: 258 RPFGLHFMYGIGGEGPRAEKMVKALCDHAHNLAK-EGGCGVVAAEVAGEEPLRRGIPHWK 316
Query: 360 KFSWAEDMWCIKNL-EDAKQGIVPEKCGPSGFFTSRSSSPVIFVDPRDF 407
S AED+WCIK L ED G V G +T IFVDPR+F
Sbjct: 317 VLSCAEDLWCIKRLGEDYSDGSV-------GDWTKSPPGDSIFVDPREF 358
>AT2G30090.1 | Symbols: | Acyl-CoA N-acyltransferases (NAT)
superfamily protein | chr2:12843583-12845597 REVERSE
LENGTH=386
Length = 386
Score = 251 bits (642), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 157/390 (40%), Positives = 223/390 (57%), Gaps = 33/390 (8%)
Query: 28 YDEERHKVAVEKLEKLCEIGQRGKPSLVTDLMGDPICRIRHFQLHVMLVAXXXXXXXXXX 87
YD+ R ++ + ++EK CEIG + L TD +GDPICRIR+ +MLVA
Sbjct: 18 YDDRRDRIQMGRMEKSCEIGHDHQTLLFTDTLGDPICRIRNSPFFIMLVAGVGNKLVGS- 76
Query: 88 IRGCVKTVTRGDSVYVKLAYVLGLRVSPQHRRLGIGTKLVENLEEWCKQKGAKYTYMATD 147
I+G VK V D V++ YVLGLRV P +RR GIG+ LV LEEW + A Y YMAT+
Sbjct: 77 IQGSVKPVEFHDK-SVRVGYVLGLRVVPSYRRRGIGSILVRKLEEWFESHNADYAYMATE 135
Query: 148 CTNEPSINLFTKKCGYSKFRSLTMLVQPVH-AHYKPISRSIAVLHIPPSLAKSMY-NHMF 205
NE S LF + GY FR+ +LV PV+ + I + + A+S+Y ++
Sbjct: 136 KDNEASHGLFIGRLGYVVFRNPAILVNPVNPGRGLKLPSDIGIRKLKVKEAESLYRRNVA 195
Query: 206 ANSEFYPKDIDSILSNKLNLGTFMAIPKKHLSKCDLTRGILPPSYAMLSVWNTKEVFKLQ 265
A +EF+P DI+ IL NKL++GT++A + + D TR S+AMLSVW++ +VFKL+
Sbjct: 196 ATTEFFPDDINKILRNKLSIGTWVA----YYNNVDNTR-----SWAMLSVWDSSKVFKLR 246
Query: 266 VKGVSPLAHACCVG-TRLLDEWLPWLRLPSFPDIFRPFGVYFLYGLHMEGKRGHHLMKGL 324
++ +PL++ ++L +L L L PD+F PFG YFLYG+H EG L++ L
Sbjct: 247 IER-APLSYLLLTKVSKLFGNFLSLLGLTVLPDLFTPFGFYFLYGVHSEGPHCGKLVRAL 305
Query: 325 CGFVHNMARDDGGCG--AVVAELGQ----RDPVRDAVPHWRKFSWAEDMWCIKNLEDAKQ 378
C VHNMA + GC VV E+ + D ++ +PHW+ S +DMWCIK
Sbjct: 306 CEHVHNMAALNDGCACKVVVVEVDKGSNGDDSLQRCIPHWKMLSCDDDMWCIK------- 358
Query: 379 GIVPEKCGPSGF-FTSRSSS-PVIFVDPRD 406
P KC + F + RS S +FVDPR+
Sbjct: 359 ---PLKCEKNKFDLSERSKSRSSLFVDPRE 385