Miyakogusa Predicted Gene
- Lj0g3v0216379.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0216379.1 tr|B7FZQ7|B7FZQ7_PHATC Predicted protein
OS=Phaeodactylum tricornutum (strain CCAP 1055/1)
GN=PHATRD,27.8,9e-16,ZF_TAZ,Zinc finger, TAZ-type; KAT11,Histone
H3-K56 acetyltransferase, RTT109; zf-TAZ,Zinc finger, TA,CUFF.13962.1
(542 letters)
Database: Glyma1.pep
75,778 sequences; 25,431,882 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Glyma19g05420.1 543 e-154
Glyma07g00230.1 481 e-135
Glyma15g00250.1 479 e-135
Glyma08g24290.1 475 e-134
Glyma01g40090.1 52 1e-06
>Glyma19g05420.1
Length = 840
Score = 543 bits (1400), Expect = e-154, Method: Compositional matrix adjust.
Identities = 307/588 (52%), Positives = 374/588 (63%), Gaps = 111/588 (18%)
Query: 1 MYVQEFGSECGNPNKRCVYISYLDSVKYFRPERETIRGEALRTLVYHEILIGYLDFCKKR 60
MYVQEFGSECG PN+R VYISYLDSVKYFRP+R T GEALRT+VYHEILIGYLDFCKKR
Sbjct: 311 MYVQEFGSECGYPNQRSVYISYLDSVKYFRPKRVTKSGEALRTIVYHEILIGYLDFCKKR 370
Query: 61 GFTTCYIWACPPTKGEDYILYCHPDTQKTPKNDKLRHWYLSMLRKAAEENIVLGLSNIYD 120
GF TCYIWACPP KGEDY+LYCHPDTQKTPK DKLRHWY SMLRKAAEENIV+G +N++D
Sbjct: 371 GFATCYIWACPPMKGEDYLLYCHPDTQKTPKKDKLRHWYHSMLRKAAEENIVVGSTNLHD 430
Query: 121 HFFVPTGKWDSKVTASRLPYFDGDYWSSRAMGLAGDIDKDSG--YEKTLKKLVKNKRALK 178
HFFV TG DSKVT +RLPYFDG +WS AM A DI+++ G Y+ K+V +KR LK
Sbjct: 431 HFFVTTGSCDSKVTTARLPYFDGGFWSGAAMDKARDIEQECGGDYKMIFDKVV-SKRCLK 489
Query: 179 ALGHVN-PSIDTAKDILVMQRLGKEILPYKEDFIIVHLQHSCVHCQEVIVSGKRGFCTEC 237
++GHVN PS TAKDILVM +LG+ ILP+KEDF++V Q+ C+HC EVI +GKR FCTEC
Sbjct: 490 SMGHVNPPSEGTAKDILVMHKLGQTILPFKEDFLVVQFQYVCMHCHEVIANGKRWFCTEC 549
Query: 238 KKFQECER-----CHTADSHTSTSG--------EKHTLCQ-----------------VLV 267
KKFQECER C++ H S S ++ T Q VL+
Sbjct: 550 KKFQECERSFFKSCYSRYLHLSISSFFPIVLWQKRKTENQNTDLNEEWGILFYLSATVLM 609
Query: 268 EDVPLDTKQNDIILDNQLFENRHNFLSFCQKNQLQFDTLRRAKHSSMMILYHLKYSSI-- 325
+DV DTK+NDIILDN LF++RHNFLSFCQ+N+ QFD+LRRAK+SSMMILY +K ++
Sbjct: 610 DDVLGDTKENDIILDNGLFDSRHNFLSFCQRNRFQFDSLRRAKYSSMMILYLVKNPTLLT 669
Query: 326 ----CSICGGERNEFQHSCRCECCPEFSAC----KERGAECHAQEXXXXXXXXXXXXGNE 377
C +C + N Q +CE CPEF+ C ERGA CHA
Sbjct: 670 VGTTCRVC-SKNNVSQRYWKCENCPEFTVCSECYNERGANCHA----------------- 711
Query: 378 ESKLKSTCHEQKLSQTHSKSLSVSGKEESKLKSTGQEHKLSQTSTSLSVSAGNGEPKKKS 437
L++ +S + S SG +E L Q S L
Sbjct: 712 ----------HTLNEAYSPAQSPSGNQE-----------LQQNSAML------------- 737
Query: 438 TLAPTQVKKLLEVLQHASQCNNSNETQPCPYTYCLKMKILLSHARECITRASGRCRPCYK 497
++LL+V++HAS C +S +TQPC Y +C ++K L +HA C R SG C+ C K
Sbjct: 738 -------QQLLDVIEHASLC-HSIKTQPCTYPHCRQIKKLFAHASRCEIRFSGGCQFCKK 789
Query: 498 ALMLLDAHSKTCEVSECRIPCCMNLKK-------RRESRDRAAVVKSK 538
L HSK C S CRIP CM+LKK + ESR RAAV++SK
Sbjct: 790 VWQGLTLHSKNCRDSACRIPRCMDLKKQVEWIATQSESRLRAAVLQSK 837
>Glyma07g00230.1
Length = 1700
Score = 481 bits (1238), Expect = e-135, Method: Compositional matrix adjust.
Identities = 263/560 (46%), Positives = 338/560 (60%), Gaps = 82/560 (14%)
Query: 1 MYVQEFGSECGNPNKRCVYISYLDSVKYFRPERETIRGEALRTLVYHEILIGYLDFCKKR 60
MYVQEFGSEC PN+R VY+SYLDSVKYFRPE + + GEALRT VYHEILIGYL++CKKR
Sbjct: 1185 MYVQEFGSECQFPNQRRVYLSYLDSVKYFRPEVKAVTGEALRTFVYHEILIGYLEYCKKR 1244
Query: 61 GFTTCYIWACPPTKGEDYILYCHPDTQKTPKNDKLRHWYLSMLRKAAEENIVLGLSNIYD 120
GFT+CYIWACPP KGEDYILYCHP+ QKTPK+DKLR WYL+MLRKAA+ENIV+ L+N+YD
Sbjct: 1245 GFTSCYIWACPPLKGEDYILYCHPEIQKTPKSDKLREWYLAMLRKAAKENIVVDLTNLYD 1304
Query: 121 HFFVPTGKWDSKVTASRLPYFDGDYWSSRAMGLAGDI----DKDSGYEKTLKKLVKNKRA 176
HFFV TG+ +KVTA+RLPYFDGDYW A L + D +K K KRA
Sbjct: 1305 HFFVSTGECRAKVTAARLPYFDGDYWPGAAEDLIYQLRQEEDGRKQNKKGTTKKTITKRA 1364
Query: 177 LKALGHVNPSIDTAKDILVMQRLGKEILPYKEDFIIVHLQHSCVHCQEVIVSGKRGFCTE 236
LKA G + S + +KD+L+M +LG+ I P KEDFI+VHLQH+C C ++VSG R C +
Sbjct: 1365 LKASGQSDLSANASKDLLLMHKLGETICPMKEDFIMVHLQHACTSCCILMVSGNRWVCNQ 1424
Query: 237 CKKFQECERCHTA-------DSHTSTSGEKHTLCQVLVEDVPLDTKQNDIILDNQLFENR 289
CK FQ C+RC+ A + H EKHTL V + DVP DTK D IL+++ F+ R
Sbjct: 1425 CKNFQICDRCYEAELKREERERHPINQREKHTLYPVEITDVPSDTKDKDDILESEFFDTR 1484
Query: 290 HNFLSFCQKNQLQFDTLRRAKHSSMMILYHL------KYSSICSICGGERNEFQHSCRCE 343
FLS CQ N Q+DTLRRAKHSSMM+LYHL + + C+IC + E RCE
Sbjct: 1485 QAFLSLCQGNHYQYDTLRRAKHSSMMVLYHLHNPTAPAFVTTCNICYLDI-ETGQGWRCE 1543
Query: 344 CCPEFSACKERGAECHAQEXXXXXXXXXXXXGNEESKLKSTCHEQKLSQTHSKSLSVSGK 403
CPE+ C + C+++ H
Sbjct: 1544 VCPEYDVC-------------------------------NACYQKDGGIDHP-------- 1564
Query: 404 EESKLKSTGQEHKLSQTSTSLSVSAGNGEPKKKSTLAPTQVKKLLEVLQHASQCNNSNET 463
HKL+ + + A N E ++ +Q++K+L++L HASQC +++
Sbjct: 1565 -----------HKLTNHPSMVDRDAQNKEARQHRV---SQLRKMLDLLVHASQCRSAH-- 1608
Query: 464 QPCPYTYCLKMKILLSHARECITRASGRCRPCYKALMLLDAHSKTCEVSECRIPCCMNLK 523
C Y C K+K L H C TRASG C C K LL H++ C+ SEC +P C +LK
Sbjct: 1609 --CQYPNCRKVKGLFRHGMHCKTRASGGCVLCKKMWYLLQLHARACKESECHVPRCRDLK 1666
Query: 524 -------KRRESRDRAAVVK 536
++ +SR RAAV++
Sbjct: 1667 EHLRRLQQQSDSRRRAAVME 1686
>Glyma15g00250.1
Length = 1628
Score = 479 bits (1232), Expect = e-135, Method: Compositional matrix adjust.
Identities = 263/560 (46%), Positives = 337/560 (60%), Gaps = 82/560 (14%)
Query: 1 MYVQEFGSECGNPNKRCVYISYLDSVKYFRPERETIRGEALRTLVYHEILIGYLDFCKKR 60
MYVQEFGSEC PN+R VY+SYLDSVKYFRPE + + GEALRT VYHEILIGYL++CKKR
Sbjct: 1113 MYVQEFGSECQFPNQRRVYLSYLDSVKYFRPEVKAVTGEALRTFVYHEILIGYLEYCKKR 1172
Query: 61 GFTTCYIWACPPTKGEDYILYCHPDTQKTPKNDKLRHWYLSMLRKAAEENIVLGLSNIYD 120
GFT+CYIWACPP KGEDYILYCHP+ QKTPK+DKLR WYLSMLRKA++ENIV+ L+N+YD
Sbjct: 1173 GFTSCYIWACPPLKGEDYILYCHPEIQKTPKSDKLREWYLSMLRKASKENIVVDLTNLYD 1232
Query: 121 HFFVPTGKWDSKVTASRLPYFDGDYWSSRAMGLAGDI----DKDSGYEKTLKKLVKNKRA 176
HFFV +G+ +KVTA+RLPYFDGDYW A L + D +K K KRA
Sbjct: 1233 HFFVSSGECRAKVTAARLPYFDGDYWPGAAEDLIYQLRQEEDGRKQNKKGTTKKTITKRA 1292
Query: 177 LKALGHVNPSIDTAKDILVMQRLGKEILPYKEDFIIVHLQHSCVHCQEVIVSGKRGFCTE 236
LKA G + S + +KD+L+M +LG+ I P KEDFI+VHLQH+C HC ++VSG R C +
Sbjct: 1293 LKASGQSDLSGNASKDLLLMHKLGETICPMKEDFIMVHLQHACTHCCILMVSGNRWVCRQ 1352
Query: 237 CKKFQECERCHTAD-------SHTSTSGEKHTLCQVLVEDVPLDTKQNDIILDNQLFENR 289
CK FQ C++C+ A+ H EKHTL V + DVP DTK D IL+++ F+ R
Sbjct: 1353 CKNFQICDKCYEAELKREEREQHPINQREKHTLYPVEITDVPADTKDKDEILESEFFDTR 1412
Query: 290 HNFLSFCQKNQLQFDTLRRAKHSSMMILYHL------KYSSICSICGGERNEFQHSCRCE 343
FLS CQ N Q+DTLRRAKHSSMMILYHL + + C+IC + E RCE
Sbjct: 1413 QAFLSLCQGNHYQYDTLRRAKHSSMMILYHLHNPTAPAFVTTCNICRLDI-ETGQGWRCE 1471
Query: 344 CCPEFSACKERGAECHAQEXXXXXXXXXXXXGNEESKLKSTCHEQKLSQTHSKSLSVSGK 403
CPE+ C + C+++ H
Sbjct: 1472 VCPEYDVC-------------------------------NACYQKDRGADHP-------- 1492
Query: 404 EESKLKSTGQEHKLSQTSTSLSVSAGNGEPKKKSTLAPTQVKKLLEVLQHASQCNNSNET 463
HKL+ + A N E ++ L Q++K+L++L HASQC +++
Sbjct: 1493 -----------HKLTNHPSMADRDAQNKEARQLRVL---QLRKMLDLLVHASQCRSAH-- 1536
Query: 464 QPCPYTYCLKMKILLSHARECITRASGRCRPCYKALMLLDAHSKTCEVSECRIPCCMNLK 523
C Y C K+K L H C RASG C C K LL H++ C+ SEC +P C +LK
Sbjct: 1537 --CQYPNCRKVKGLFRHGMHCKIRASGGCVLCKKMWYLLQLHARACKESECHVPRCRDLK 1594
Query: 524 -------KRRESRDRAAVVK 536
++ +SR RAAV++
Sbjct: 1595 EHLRRLQQQSDSRRRAAVME 1614
>Glyma08g24290.1
Length = 1694
Score = 475 bits (1223), Expect = e-134, Method: Compositional matrix adjust.
Identities = 260/560 (46%), Positives = 337/560 (60%), Gaps = 82/560 (14%)
Query: 1 MYVQEFGSECGNPNKRCVYISYLDSVKYFRPERETIRGEALRTLVYHEILIGYLDFCKKR 60
MYVQEFGSEC PN+R VY+SYLDSVKYFRPE + + GEALRT VYHEILIGYL++CKKR
Sbjct: 1179 MYVQEFGSECQFPNQRRVYLSYLDSVKYFRPEVKAVTGEALRTFVYHEILIGYLEYCKKR 1238
Query: 61 GFTTCYIWACPPTKGEDYILYCHPDTQKTPKNDKLRHWYLSMLRKAAEENIVLGLSNIYD 120
GFT+CYIWACPP KGEDYILYCHP+ QKTPK+DKLR WYL+MLRKA++EN+V+ L+N+YD
Sbjct: 1239 GFTSCYIWACPPLKGEDYILYCHPEIQKTPKSDKLREWYLAMLRKASKENVVVDLTNLYD 1298
Query: 121 HFFVPTGKWDSKVTASRLPYFDGDYWSSRAMGLAGDI----DKDSGYEKTLKKLVKNKRA 176
HFFV TG+ +KVTA+RLPYFDGDYW A L + D +K K KRA
Sbjct: 1299 HFFVSTGECRAKVTAARLPYFDGDYWPGAAEDLIYQLRQEEDGRKQNKKGTTKKTITKRA 1358
Query: 177 LKALGHVNPSIDTAKDILVMQRLGKEILPYKEDFIIVHLQHSCVHCQEVIVSGKRGFCTE 236
LKA G + S + +KD+L+M +LG+ I P KEDFI+VHLQH+C C ++VSG R C +
Sbjct: 1359 LKASGQSDLSGNASKDLLLMHKLGETISPMKEDFIMVHLQHACTSCCILMVSGNRWVCNQ 1418
Query: 237 CKKFQECERCHTA-------DSHTSTSGEKHTLCQVLVEDVPLDTKQNDIILDNQLFENR 289
CK F C+RC+ A + H EKHTL V + DVP DTK D IL+++ F+ R
Sbjct: 1419 CKNFHICDRCYEAELKREERERHPINHREKHTLYPVEITDVPSDTKDKDDILESEFFDTR 1478
Query: 290 HNFLSFCQKNQLQFDTLRRAKHSSMMILYHL------KYSSICSICGGERNEFQHSCRCE 343
FLS CQ N Q+DTLRRAKHSSMM+LYHL + + C+IC + E RCE
Sbjct: 1479 QAFLSLCQGNHYQYDTLRRAKHSSMMVLYHLHNPTAPAFVTTCNICYLDI-ETGQGWRCE 1537
Query: 344 CCPEFSACKERGAECHAQEXXXXXXXXXXXXGNEESKLKSTCHEQKLSQTHSKSLSVSGK 403
CPE+ C + C+++ H
Sbjct: 1538 VCPEYDVC-------------------------------NACYQKDGGIDHP-------- 1558
Query: 404 EESKLKSTGQEHKLSQTSTSLSVSAGNGEPKKKSTLAPTQVKKLLEVLQHASQCNNSNET 463
HKL+ + + A N E ++ + Q++K+L++L HASQC +++
Sbjct: 1559 -----------HKLTNHPSMVDRDAQNTEAREVRVV---QLRKMLDLLVHASQCRSAH-- 1602
Query: 464 QPCPYTYCLKMKILLSHARECITRASGRCRPCYKALMLLDAHSKTCEVSECRIPCCMNLK 523
C Y C K+K L H C TRASG C C K LL H++ C+ SEC +P C +LK
Sbjct: 1603 --CQYPNCRKVKGLFRHGMHCKTRASGGCVLCKKMWYLLQLHARACKESECHVPRCRDLK 1660
Query: 524 -------KRRESRDRAAVVK 536
++ +SR RAAV++
Sbjct: 1661 EHLRRLQQQSDSRRRAAVME 1680
>Glyma01g40090.1
Length = 379
Score = 52.4 bits (124), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 69/172 (40%), Gaps = 11/172 (6%)
Query: 377 EESKLKSTCHEQKLSQTHSKSLSVSGKEESKLKSTGQEHKLSQTSTSLSVSAGNGEPKKK 436
+ +L CH + L S G + KL E ++ ++ + K K
Sbjct: 188 DAPRLSLICHRKVLKNFKVVS-ETEGWKTMKLSHPALEKEIVESMVDEDIIKKEKIRKNK 246
Query: 437 STLAPTQVKKLLEVLQH---------ASQCNNSNETQPCPYTYCLKMKILLSHARECITR 487
Q+ + +E L H + QPC YT C +++L+ H C R
Sbjct: 247 EREIYLQLYEAMEALVHICRDGCRTIGPHDKDFKANQPCKYTACNGLELLVRHFAGCKLR 306
Query: 488 ASGRCRPCYKALMLLDAHSKTC-EVSECRIPCCMNLKKRRESRDRAAVVKSK 538
G C C + LL+ HS+ C + CR+P C N K+R +++ ++ K
Sbjct: 307 VRGGCVHCNRMWQLLELHSRLCVDPDYCRVPLCRNFKERISKQNKKDEIRWK 358