Miyakogusa Predicted Gene
- Lj2g3v1890940.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj2g3v1890940.1 Non Chatacterized Hit- tr|I1LHC7|I1LHC7_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.49371 PE,85.27,0,SET,SET
domain; SAM_MT43_EZ,Histone-lysine N-methyltransferase, EZ; CXC,CXC
domain; SET (Su(var)3-9,,CUFF.38024.1
(842 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT2G23380.1 | Symbols: CLF, ICU1, SDG1, SET1 | SET domain-contai... 956 0.0
AT4G02020.1 | Symbols: EZA1, SWN, SDG10 | SET domain-containing ... 485 e-137
AT1G02580.1 | Symbols: MEA, EMB173, FIS1, SDG5 | SET domain-cont... 338 9e-93
AT5G42400.1 | Symbols: ATXR7, SDG25 | SET domain protein 25 | ch... 97 5e-20
AT4G27910.1 | Symbols: ATX4, SDG16 | SET domain protein 16 | chr... 95 2e-19
AT5G53430.1 | Symbols: SDG29, SET29, ATX5 | SET domain group 29 ... 95 2e-19
AT2G31650.1 | Symbols: ATX1, SDG27 | homologue of trithorax | ch... 92 1e-18
AT1G05830.2 | Symbols: ATX2 | trithorax-like protein 2 | chr1:17... 89 2e-17
AT1G05830.1 | Symbols: ATX2, SDG30 | trithorax-like protein 2 | ... 89 2e-17
AT3G61740.2 | Symbols: SDG14, ATX3 | SET domain protein 14 | chr... 89 2e-17
AT1G76710.2 | Symbols: ASHH1 | SET domain group 26 | chr1:287898... 88 2e-17
AT1G76710.1 | Symbols: SDG26 | SET domain group 26 | chr1:287898... 88 2e-17
AT3G61740.1 | Symbols: SDG14, ATX3 | SET domain protein 14 | chr... 88 2e-17
AT4G30860.1 | Symbols: ASHR3, SDG4 | SET domain group 4 | chr4:1... 84 5e-16
AT3G59960.1 | Symbols: ASHH4, SDG24 | histone-lysine N-methyltra... 72 1e-12
AT1G77300.2 | Symbols: EFS, SDG8, ASHH2, LAZ2 | histone methyltr... 70 9e-12
AT1G77300.1 | Symbols: EFS, SDG8, CCR1, ASHH2, LAZ2 | histone me... 69 1e-11
AT2G44150.1 | Symbols: ASHH3, SDG7 | histone-lysine N-methyltran... 66 1e-10
AT2G23740.2 | Symbols: | nucleic acid binding;sequence-specific... 60 9e-09
AT2G23740.1 | Symbols: | nucleic acid binding;sequence-specific... 60 9e-09
AT2G35160.1 | Symbols: SUVH5, SGD9 | SU(VAR)3-9 homolog 5 | chr2... 59 1e-08
AT3G03750.2 | Symbols: SUVR3, SDG20 | SET domain protein 20 | ch... 57 4e-08
AT2G22740.1 | Symbols: SUVH6, SDG23 | SU(VAR)3-9 homolog 6 | chr... 53 9e-07
AT2G22740.2 | Symbols: SUVH6 | SU(VAR)3-9 homolog 6 | chr2:96642... 53 9e-07
>AT2G23380.1 | Symbols: CLF, ICU1, SDG1, SET1 | SET
domain-containing protein | chr2:9955570-9960117 FORWARD
LENGTH=902
Length = 902
Score = 956 bits (2471), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 511/897 (56%), Positives = 590/897 (65%), Gaps = 86/897 (9%)
Query: 11 PAVNDVLLVIDSLKKQVAAERVVSVKKKMEENRLKLVGVTDHLYKMSMERRTGGIADMNG 70
PA +V VI+SLKK++AA+R +S+KK+++EN+ L +T + SMER G + +G
Sbjct: 27 PASKEVSEVIESLKKKLAADRCISIKKRIDENKKNLFAITQSFMRSSMER---GGSCKDG 83
Query: 71 SLDLLSKRQKGATDMHNGVNASNGDRESNGYHEDGHGSTAVLLGYNVTTKNAVRPIKLPE 130
S DLL KRQ+ + M +G++ SN +R Y EDG S+ ++ G +V K ++RPIK+P+
Sbjct: 84 S-DLLVKRQRDSPGMKSGIDESNNNR----YVEDGPASSGMVQGSSVPVKISLRPIKMPD 138
Query: 131 VKRLPPYTTWIFLDRNQRMTEDQSVVGRRRIYYDQNGGEALICXXXXXXXXXXXXXKREF 190
+KRL PYTTW+FLDRNQRMTEDQSVVGRRRIYYDQ GGEALIC KR+F
Sbjct: 139 IKRLSPYTTWVFLDRNQRMTEDQSVVGRRRIYYDQTGGEALICSDSEEEAIDDEEEKRDF 198
Query: 191 VESEDFILRMTIRELGISDIVLESLAQCFSRNSSEIKARYETLTXXXXXXXXXXXXXXXX 250
+E ED+I+RMT+ +LG+SD VL LA SR++SEIKAR+ L
Sbjct: 199 LEPEDYIIRMTLEQLGLSDSVLAELASFLSRSTSEIKARHGVLMKEKEVSESGDN----- 253
Query: 251 XXQIDSSFVEKDLEAALDSFDNLFCRRCLVFDCRLHGCSQDLVFPAEKQPSWNPPDTENA 310
Q +SS + KD+E ALDSFDNLFCRRCLVFDCRLHGCSQDL+FPAEK W PP EN
Sbjct: 254 --QAESSLLNKDMEGALDSFDNLFCRRCLVFDCRLHGCSQDLIFPAEKPAPWCPPVDENL 311
Query: 311 PCGPNCFRSVLKSERFAKV--------TSSTQADVEDXXXXXXXXXXXXXXXXXXXXXQS 362
CG NC++++LKS RF TSS A + S
Sbjct: 312 TCGANCYKTLLKSGRFPGYGTIEGKTGTSSDGAGTK----TTPTKFSSKLNGRKPKTFPS 367
Query: 363 ESASSNAKNISESSDSENGPGRDXXXXXXXXXXXXXXXGKSGIGKRNSKRVAERVLVGMQ 422
ESASSN K E+SDSENG +D G+ KRN RVAERV Q
Sbjct: 368 ESASSNEKCALETSDSENGLQQDTNSDKVSSSPKVKGSGRRVGRKRNKNRVAERVPRKTQ 427
Query: 423 KRQKKPVASDSDSIS----------------------------------------EAVGG 442
KRQKK ASDSDSI+ E
Sbjct: 428 KRQKKTEASDSDSIASGSCSPSDAKHKDNEDATSSSQKHVKSGNSGKSRKNGTPAEVSNN 487
Query: 443 SLNDMVTVPH---------AMRGEDSLRKEETVDENTFKQDLTDNGSWKALEKGLLEKGM 493
S+ D V V A ++SLRKEE + E + L N W+ LEK L +KG+
Sbjct: 488 SVKDDVPVCQSNEVASELDAPGSDESLRKEEFMGETVSRGRLATNKLWRPLEKSLFDKGV 547
Query: 494 EIFGRNSCLLARNLLSGLKTCWDVFQYINCEEGKMSGPSGDATNSLVEGYSKG------- 546
EIFG NSCL+ARNLLSG K+CW+VFQY+ C E K S GD N +G SK
Sbjct: 548 EIFGMNSCLIARNLLSGFKSCWEVFQYMTCSENKASFFGGDGLNP--DGSSKFDINGNMV 605
Query: 547 NNEVXXXXXXXXXXXXXXXLKYTWKSAAYHSIRKRITERKDQPCRQYNPCGCQSACGKQC 606
NN+V LKYTWKSAAYHSIRKRITE+KDQPCRQ+NPC C+ ACGK+C
Sbjct: 606 NNQVRRRSRFLRRRGKVRRLKYTWKSAAYHSIRKRITEKKDQPCRQFNPCNCKIACGKEC 665
Query: 607 PCLLNGTCCEKYCGCPKSCKNRFRGCHCAKSQCRSRQCPCFAADRECDPDVCRNCWVGCG 666
PCLLNGTCCEKYCGCPKSCKNRFRGCHCAKSQCRSRQCPCFAADRECDPDVCRNCWV G
Sbjct: 666 PCLLNGTCCEKYCGCPKSCKNRFRGCHCAKSQCRSRQCPCFAADRECDPDVCRNCWVIGG 725
Query: 667 DGTLGNPSQRGDNYECRNMXXXXXXXXRVLLGRSDISGWGAFLKNGVGKHEYLGEYTGEL 726
DG+LG PSQRGDNYECRNM RVLLG SD+SGWGAFLKN V KHEYLGEYTGEL
Sbjct: 726 DGSLGVPSQRGDNYECRNMKLLLKQQQRVLLGISDVSGWGAFLKNSVSKHEYLGEYTGEL 785
Query: 727 ISHREADKRGKIYDRENSSFLFNLNDQFVLDAYRKGDKLKFANHAPDPNCYAKVIMVAGD 786
ISH+EADKRGKIYDREN SFLFNLNDQFVLDAYRKGDKLKFANH+P+PNCYAKVIMVAGD
Sbjct: 786 ISHKEADKRGKIYDRENCSFLFNLNDQFVLDAYRKGDKLKFANHSPEPNCYAKVIMVAGD 845
Query: 787 HRVGIFAKERIGAGEELFYDYRYEPDRAPAWARKPEASGSKK-EDGAPSSGRAKKLA 842
HRVGIFAKERI AGEELFYDYRYEPDRAPAWA+KPEA GSKK E+ PS GR KKLA
Sbjct: 846 HRVGIFAKERILAGEELFYDYRYEPDRAPAWAKKPEAPGSKKDENVTPSVGRPKKLA 902
>AT4G02020.1 | Symbols: EZA1, SWN, SDG10 | SET domain-containing
protein | chr4:886693-891743 FORWARD LENGTH=856
Length = 856
Score = 485 bits (1249), Expect = e-137, Method: Compositional matrix adjust.
Identities = 239/363 (65%), Positives = 269/363 (74%), Gaps = 6/363 (1%)
Query: 481 WKALEKGLLEKGMEIFGRNSCLLARNLLSGLKTCWDVFQYINCEEGKMSGPSGDATNSLV 540
W +EK L KG+EIFGRNSCL+ARNLLSGLKTC DV Y+ E + S L
Sbjct: 493 WNPIEKDLYLKGVEIFGRNSCLIARNLLSGLKTCLDVSNYMRENEVSVFRRSSTPNLLLD 552
Query: 541 EGYS---KGNNEVXXXXXXXXXXXXXXXLKYTWKSAAYHSIRKRITERKDQPCRQYNPCG 597
+G + N+EV LKY+ KSA + S+ KRI K+Q C+QY PCG
Sbjct: 553 DGRTDPGNDNDEVPPRTRLFRRKGKTRKLKYSTKSAGHPSVWKRIAGGKNQSCKQYTPCG 612
Query: 598 CQSACGKQCPCLLNGTCCEKYCGCPKSCKNRFRGCHCAKSQCRSRQCPCFAADRECDPDV 657
C S CGK CPCL N TCCEKYCGC KSCKNRFRGCHCAKSQCRSRQCPCFAA RECDPDV
Sbjct: 613 CLSMCGKDCPCLTNETCCEKYCGCSKSCKNRFRGCHCAKSQCRSRQCPCFAAGRECDPDV 672
Query: 658 CRNCWVGCGDGTLGNPSQRGDNYECRNMXXXXXXXXRVLLGRSDISGWGAFLKNGVGKHE 717
CRNCWV CGDG+LG +RG+ +C NM R+LLG+SD++GWGAFLKN V K+E
Sbjct: 673 CRNCWVSCGDGSLGEAPRRGEG-QCGNMRLLLRQQQRILLGKSDVAGWGAFLKNSVSKNE 731
Query: 718 YLGEYTGELISHREADKRGKIYDRENSSFLFNLNDQFVLDAYRKGDKLKFANHAPDPNCY 777
YLGEYTGELISH EADKRGKIYDR NSSFLF+LNDQ+VLDA RKGDKLKFANH+ PNCY
Sbjct: 732 YLGEYTGELISHHEADKRGKIYDRANSSFLFDLNDQYVLDAQRKGDKLKFANHSAKPNCY 791
Query: 778 AKVIMVAGDHRVGIFAKERIGAGEELFYDYRYEPDRAPAWARKPEASGSKKEDGAPSSGR 837
AKV+ VAGDHRVGIFA ERI A EELFYDYRY PD+AP WARKPE GSKK+D A + R
Sbjct: 792 AKVMFVAGDHRVGIFANERIEASEELFYDYRYGPDQAPVWARKPE--GSKKDDSAITHRR 849
Query: 838 AKK 840
A+K
Sbjct: 850 ARK 852
Score = 208 bits (530), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 126/304 (41%), Positives = 168/304 (55%), Gaps = 37/304 (12%)
Query: 20 IDSLKKQVAAERVVSVKKKMEENRLKLVGVTDHLYKMSMERRTGGIADMNGSLDLLSKRQ 79
+ LK+++ ERV S+K+K E NR K V H+ S + A+ NG+ ++LS R
Sbjct: 36 LSELKRKIQGERVRSIKEKFEANRKK---VDAHVSPFSSAASSRATAEDNGNSNMLSSRM 92
Query: 80 KGATDMHNGVNASNGDRESNGYHEDGHGSTAVLLGYNVTTKNAVRP-IKLPEVKRLPPYT 138
+ NG + GDR+ V TK+ + +KLP +R+PPYT
Sbjct: 93 RMPLCKLNGFSHGVGDRDY------------------VPTKDVISASVKLPIAERIPPYT 134
Query: 139 TWIFLDRNQRMTEDQSVVGRRRIYYDQNGGEALICXXXXXXXXXXXXXKREFVESEDFIL 198
TWIFLDRNQRM EDQSVVGRR+IYY+Q+GGE LIC KREF E ED I+
Sbjct: 135 TWIFLDRNQRMAEDQSVVGRRQIYYEQHGGETLICSDSEEEPEPEEE-KREFSEGEDSII 193
Query: 199 RMTIRELGISDIVLESLAQCFSRNSSEIKARYETLTXXXXXXXXXXXXXXXXXXQIDSSF 258
+ +E G+ + V ++L Q S ++S+I RY L +S F
Sbjct: 194 WLIGQEYGMGEEVQDALCQLLSVDASDILERYNELKLKDKQNTEEFS---------NSGF 244
Query: 259 -----VEKDLEAALDSFDNLFCRRCLVFDCRLHGCSQDLVFPAEKQPSWNPPDTENAPCG 313
+EK L AALDSFDNLFCRRCLVFDCRLHGCSQ L+ +EKQP W+ + + PC
Sbjct: 245 KLGISLEKGLGAALDSFDNLFCRRCLVFDCRLHGCSQPLISASEKQPYWSDYEGDRKPCS 304
Query: 314 PNCF 317
+C+
Sbjct: 305 KHCY 308
>AT1G02580.1 | Symbols: MEA, EMB173, FIS1, SDG5 | SET
domain-containing protein | chr1:544796-548994 FORWARD
LENGTH=689
Length = 689
Score = 338 bits (867), Expect = 9e-93, Method: Compositional matrix adjust.
Identities = 176/401 (43%), Positives = 244/401 (60%), Gaps = 25/401 (6%)
Query: 432 DSDSISEAVGGSLNDMVTVPHAMRGEDSLRKEETVDENTFKQDLTDNGSWKALEKGLLEK 491
D SE + + H M ++S+ + V + +N W +EK L K
Sbjct: 301 DRQPCSEHCYLKVRSVTEADHVMDNDNSISNKIVVSD-------PNNTMWTPVEKDLYLK 353
Query: 492 GMEIFGRNSCLLARNLLSGLKTCWDVFQYINCEEGKMSGPSGDATNSL-VEGYSKGNNEV 550
G+EIFGRNSC +A N+L GLKTC +++ Y+ ++ T SL + ++ +N+V
Sbjct: 354 GIEIFGRNSCDVALNILRGLKTCLEIYNYMREQD--------QCTMSLDLNKTTQRHNQV 405
Query: 551 XXXXXXXXXXXXXXXLKYTWKSAAYHSIRKRITERKDQPCRQYNPCGCQSACGKQCPCLL 610
+ K A Y K+ T + + + Y PC C+S CG+QCPCL
Sbjct: 406 TKKVSRKSSRSVRKKSRLR-KYARYPPALKKTTSGEAKFYKHYTPCTCKSKCGQQCPCLT 464
Query: 611 NGTCCEKYCGCPKSCKNRFRGCHCAKSQCRSRQCPCFAADRECDPDVCRNCWVGCGDGTL 670
+ CCEKYCGC K C NRF GC+CA QC +RQCPCFAA+RECDPD+CR+C + CGDGTL
Sbjct: 465 HENCCEKYCGCSKDCNNRFGGCNCAIGQCTNRQCPCFAANRECDPDLCRSCPLSCGDGTL 524
Query: 671 G-NPSQRGDNYECRNMXXXXXXXXRVLLGRSDISGWGAFLKNGVGKHEYLGEYTGELISH 729
G P Q +C+NM ++L+G+SD+ GWGAF + + K+EYLGEYTGELI+H
Sbjct: 525 GETPVQ----IQCKNMQFLLQTNKKILIGKSDVHGWGAFTWDSLKKNEYLGEYTGELITH 580
Query: 730 READKRGKIYDRENSSFLFNLNDQFVLDAYRKGDKLKFANHAPDPNCYAKVIMVAGDHRV 789
EA++RG+I DR SS+LF LNDQ +DA RKG++ KF NH+ PNCYAK+++V GD R+
Sbjct: 581 DEANERGRIEDRIGSSYLFTLNDQLEIDARRKGNEFKFLNHSARPNCYAKLMIVRGDQRI 640
Query: 790 GIFAKERIGAGEELFYDYRYEPDRAPAWA--RKPEASGSKK 828
G+FA+ I GEELF+DY Y P+ A W+ R+P +G+ K
Sbjct: 641 GLFAERAIEEGEELFFDYCYGPEHAD-WSRGREPRKTGASK 680
Score = 82.0 bits (201), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 88/333 (26%), Positives = 135/333 (40%), Gaps = 45/333 (13%)
Query: 20 IDSLKKQVAAERVVSVKKKMEENRLKLVGVTDHLYKMSMERRTGGIADMNG--SLDLLSK 77
++ +K+Q+ ER + +K+K E + V T + S + D NG + LLS+
Sbjct: 17 LNQIKEQIEKERFLHIKRKFELRYIPSVA-THASHHQSFDLNQPAAEDDNGGDNKSLLSR 75
Query: 78 RQKGATDMHNGVNASNGDRESNGYHEDGHGSTAV------LLGYNVTTKNAVRPIKLPEV 131
Q H ++ E GY D A+ L +V + +KLP V
Sbjct: 76 MQNPLR--HFSASSDYNSYEDQGYVLDEDQDYALEEDVPLFLDEDVPL---LPSVKLPIV 130
Query: 132 KRLPPYTTWIFLDRNQRMTEDQSVVGRRRIYYDQNGGEALICXX------XXXXXXXXXX 185
++LP TW+F +Q M E SV+G+R+IYY GEAL
Sbjct: 131 EKLPRSITWVFTKSSQLMAESDSVIGKRQIYY--LNGEALELSSEEDEEDEEEDEEEIKK 188
Query: 186 XKREFVESEDFILRMTIRELGISDIVL-ESLAQCFSRNSSEIKARYETLTXXXXXXXXXX 244
K EF E D + ++ G+ D+V+ +LA+ + S+I RY L
Sbjct: 189 EKCEFSEDVDRFIWTVGQDYGLDDLVVRRALAKYLEVDVSDILERYNELKLKNDGTAGEA 248
Query: 245 XXXXXXXXQIDSSFVEKDLEAALDSF-DNLFCRRCLVFDCRLHGCSQDLVFPAEKQPSWN 303
S K + A F D CRRC++FDC +H + +E + S
Sbjct: 249 -----------SDLTSKTITTAFQDFADRRHCRRCMIFDCHMHEKYEPESRSSEDKSSLF 297
Query: 304 PPDTENAPCGPNCFRSVLKSERFAKVTSSTQAD 336
D + PC +C + KV S T+AD
Sbjct: 298 -EDEDRQPCSEHC---------YLKVRSVTEAD 320
>AT5G42400.1 | Symbols: ATXR7, SDG25 | SET domain protein 25 |
chr5:16954469-16960671 REVERSE LENGTH=1423
Length = 1423
Score = 96.7 bits (239), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 51/150 (34%), Positives = 80/150 (53%), Gaps = 4/150 (2%)
Query: 670 LGNPSQRGDNYECRNMXXXXXXXXRVLLGRSDISGWGAFLKNGVGKHEYLGEYTGELISH 729
L N D + M + +S I WG + +++ EY GELI
Sbjct: 1243 LRNLLAAADGADVLKMSQLKARKKHLRFQQSKIHDWGLVALEPIEAEDFVIEYVGELIRS 1302
Query: 730 READKRGKIYDREN--SSFLFNLNDQFVLDAYRKGDKLKFANHAPDPNCYAKVIMVAGDH 787
++ R + Y++ SS+LF L+D +VLDA ++G +F NH+ +PNCY K+I V G
Sbjct: 1303 SISEIRERQYEKMGIGSSYLFRLDDGYVLDATKRGGIARFINHSCEPNCYTKIISVEGKK 1362
Query: 788 RVGIFAKERIGAGEELFYDYRY--EPDRAP 815
++ I+AK I AGEE+ Y+Y++ E D+ P
Sbjct: 1363 KIFIYAKRHIDAGEEISYNYKFPLEDDKIP 1392
>AT4G27910.1 | Symbols: ATX4, SDG16 | SET domain protein 16 |
chr4:13894694-13900256 FORWARD LENGTH=1027
Length = 1027
Score = 95.1 bits (235), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 51/124 (41%), Positives = 71/124 (57%), Gaps = 4/124 (3%)
Query: 694 RVLLGRSDISGWGAFLKNGVGKHEYLGEYTGELISHREADKRGKIYDRENSS-FLFNLND 752
RV GRS I GWG F + + + E + EY GE + AD R Y R +LF +++
Sbjct: 886 RVCFGRSGIHGWGLFARRNIQEGEMVLEYRGEQVRGSIADLREARYRRVGKDCYLFKISE 945
Query: 753 QFVLDAYRKGDKLKFANHAPDPNCYAKVIMVAGDH--RVGIFAKERIGAGEELFYDYRYE 810
+ V+DA KG+ + NH+ PNCYA+ IM GD R+ + AK + GEEL YDY ++
Sbjct: 946 EVVVDATDKGNIARLINHSCTPNCYAR-IMSVGDEESRIVLIAKANVAVGEELTYDYLFD 1004
Query: 811 PDRA 814
PD A
Sbjct: 1005 PDEA 1008
>AT5G53430.1 | Symbols: SDG29, SET29, ATX5 | SET domain group 29 |
chr5:21677623-21683166 FORWARD LENGTH=1043
Length = 1043
Score = 94.7 bits (234), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/121 (39%), Positives = 71/121 (58%), Gaps = 2/121 (1%)
Query: 694 RVLLGRSDISGWGAFLKNGVGKHEYLGEYTGELISHREADKRGKIYDRENSS-FLFNLND 752
RV GRS I GWG F + + + E + EY GE + AD R Y RE +LF +++
Sbjct: 902 RVCFGRSGIHGWGLFARRNIQEGEMVLEYRGEQVRGIIADLREARYRREGKDCYLFKISE 961
Query: 753 QFVLDAYRKGDKLKFANHAPDPNCYAKVIMVAGDH-RVGIFAKERIGAGEELFYDYRYEP 811
+ V+DA KG+ + NH+ PNCYA+++ V D R+ + AK + + EEL YDY ++P
Sbjct: 962 EVVVDATEKGNIARLINHSCMPNCYARIMSVGDDESRIVLIAKTTVASCEELTYDYLFDP 1021
Query: 812 D 812
D
Sbjct: 1022 D 1022
>AT2G31650.1 | Symbols: ATX1, SDG27 | homologue of trithorax |
chr2:13455448-13462181 REVERSE LENGTH=1062
Length = 1062
Score = 92.0 bits (227), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 50/119 (42%), Positives = 70/119 (58%), Gaps = 3/119 (2%)
Query: 694 RVLLGRSDISGWGAFLKNGVGKHEYLGEYTGELISHREADKRGK-IYDRE--NSSFLFNL 750
R+ G+S I G+G F K + + EYTGEL+ ADKR + IY+ +++F +
Sbjct: 899 RLAFGKSGIHGFGIFAKLPHRAGDMMIEYTGELVRPSIADKREQLIYNSMVGAGTYMFRI 958
Query: 751 NDQFVLDAYRKGDKLKFANHAPDPNCYAKVIMVAGDHRVGIFAKERIGAGEELFYDYRY 809
+D+ V+DA R G NH+ PNCY++VI V GD + IFAK I EEL YDYR+
Sbjct: 959 DDERVIDATRTGSIAHLINHSCVPNCYSRVITVNGDEHIIIFAKRHIPKWEELTYDYRF 1017
>AT1G05830.2 | Symbols: ATX2 | trithorax-like protein 2 |
chr1:1754452-1761225 FORWARD LENGTH=1083
Length = 1083
Score = 88.6 bits (218), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 48/119 (40%), Positives = 70/119 (58%), Gaps = 3/119 (2%)
Query: 694 RVLLGRSDISGWGAFLKNGVGKHEYLGEYTGELISHREADKRGK-IYDRE--NSSFLFNL 750
R+ G+S I G+G F K + + EYTGEL+ ADKR IY+ +++F +
Sbjct: 920 RLAFGKSGIHGFGIFAKLPHRAGDMVIEYTGELVRPPIADKREHLIYNSMVGAGTYMFRI 979
Query: 751 NDQFVLDAYRKGDKLKFANHAPDPNCYAKVIMVAGDHRVGIFAKERIGAGEELFYDYRY 809
+++ V+DA R G NH+ +PNCY++VI V GD + IFAK + EEL YDYR+
Sbjct: 980 DNERVIDATRTGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDVAKWEELTYDYRF 1038
>AT1G05830.1 | Symbols: ATX2, SDG30 | trithorax-like protein 2 |
chr1:1754452-1761225 FORWARD LENGTH=1083
Length = 1083
Score = 88.6 bits (218), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 48/119 (40%), Positives = 70/119 (58%), Gaps = 3/119 (2%)
Query: 694 RVLLGRSDISGWGAFLKNGVGKHEYLGEYTGELISHREADKRGK-IYDRE--NSSFLFNL 750
R+ G+S I G+G F K + + EYTGEL+ ADKR IY+ +++F +
Sbjct: 920 RLAFGKSGIHGFGIFAKLPHRAGDMVIEYTGELVRPPIADKREHLIYNSMVGAGTYMFRI 979
Query: 751 NDQFVLDAYRKGDKLKFANHAPDPNCYAKVIMVAGDHRVGIFAKERIGAGEELFYDYRY 809
+++ V+DA R G NH+ +PNCY++VI V GD + IFAK + EEL YDYR+
Sbjct: 980 DNERVIDATRTGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDVAKWEELTYDYRF 1038
>AT3G61740.2 | Symbols: SDG14, ATX3 | SET domain protein 14 |
chr3:22851133-22856548 REVERSE LENGTH=982
Length = 982
Score = 88.6 bits (218), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 44/124 (35%), Positives = 70/124 (56%), Gaps = 3/124 (2%)
Query: 694 RVLLGRSDISGWGAFLKNGVGKHEYLGEYTGELISHREADKRGKIYDRENSS-FLFNLND 752
RV G+S I GWG F + + + E + EY G + AD R Y + +LF +++
Sbjct: 840 RVCFGKSGIHGWGLFARKSIQEGEMIIEYRGVKVRRSVADLREANYRSQGKDCYLFKISE 899
Query: 753 QFVLDAYRKGDKLKFANHAPDPNCYAKVIMVAG--DHRVGIFAKERIGAGEELFYDYRYE 810
+ V+DA G+ + NH+ PNCYA+++ + D+R+ + AK + AGEEL YDY +E
Sbjct: 900 EIVIDATDSGNIARLINHSCMPNCYARIVSMGDGEDNRIVLIAKTNVAAGEELTYDYLFE 959
Query: 811 PDRA 814
D +
Sbjct: 960 VDES 963
>AT1G76710.2 | Symbols: ASHH1 | SET domain group 26 |
chr1:28789887-28792371 REVERSE LENGTH=492
Length = 492
Score = 88.2 bits (217), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 69/131 (52%), Gaps = 2/131 (1%)
Query: 682 CRNMXXXXXXXXRVLLGRSDISGWGAFLKNGVGKHEYLGEYTGELISHREADKRGKIYDR 741
C+N + L + + GWG + +++ EY GE+IS +EA KR + Y+
Sbjct: 78 CKNQKFQKCEYAKTKLIKCEGRGWGLVALEEIKAGQFIMEYCGEVISWKEAKKRAQTYET 137
Query: 742 E--NSSFLFNLNDQFVLDAYRKGDKLKFANHAPDPNCYAKVIMVAGDHRVGIFAKERIGA 799
+++ +LN +DA +KG +F NH+ PNC + V G+ RVGIFAKE I
Sbjct: 138 HGVKDAYIISLNASEAIDATKKGSLARFINHSCRPNCETRKWNVLGEVRVGIFAKESISP 197
Query: 800 GEELFYDYRYE 810
EL YDY +E
Sbjct: 198 RTELAYDYNFE 208
>AT1G76710.1 | Symbols: SDG26 | SET domain group 26 |
chr1:28789887-28792371 REVERSE LENGTH=492
Length = 492
Score = 88.2 bits (217), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 69/131 (52%), Gaps = 2/131 (1%)
Query: 682 CRNMXXXXXXXXRVLLGRSDISGWGAFLKNGVGKHEYLGEYTGELISHREADKRGKIYDR 741
C+N + L + + GWG + +++ EY GE+IS +EA KR + Y+
Sbjct: 78 CKNQKFQKCEYAKTKLIKCEGRGWGLVALEEIKAGQFIMEYCGEVISWKEAKKRAQTYET 137
Query: 742 E--NSSFLFNLNDQFVLDAYRKGDKLKFANHAPDPNCYAKVIMVAGDHRVGIFAKERIGA 799
+++ +LN +DA +KG +F NH+ PNC + V G+ RVGIFAKE I
Sbjct: 138 HGVKDAYIISLNASEAIDATKKGSLARFINHSCRPNCETRKWNVLGEVRVGIFAKESISP 197
Query: 800 GEELFYDYRYE 810
EL YDY +E
Sbjct: 198 RTELAYDYNFE 208
>AT3G61740.1 | Symbols: SDG14, ATX3 | SET domain protein 14 |
chr3:22851133-22856548 REVERSE LENGTH=1018
Length = 1018
Score = 88.2 bits (217), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 44/124 (35%), Positives = 70/124 (56%), Gaps = 3/124 (2%)
Query: 694 RVLLGRSDISGWGAFLKNGVGKHEYLGEYTGELISHREADKRGKIYDRENSS-FLFNLND 752
RV G+S I GWG F + + + E + EY G + AD R Y + +LF +++
Sbjct: 876 RVCFGKSGIHGWGLFARKSIQEGEMIIEYRGVKVRRSVADLREANYRSQGKDCYLFKISE 935
Query: 753 QFVLDAYRKGDKLKFANHAPDPNCYAKVIMVAG--DHRVGIFAKERIGAGEELFYDYRYE 810
+ V+DA G+ + NH+ PNCYA+++ + D+R+ + AK + AGEEL YDY +E
Sbjct: 936 EIVIDATDSGNIARLINHSCMPNCYARIVSMGDGEDNRIVLIAKTNVAAGEELTYDYLFE 995
Query: 811 PDRA 814
D +
Sbjct: 996 VDES 999
>AT4G30860.1 | Symbols: ASHR3, SDG4 | SET domain group 4 |
chr4:15024546-15027427 FORWARD LENGTH=497
Length = 497
Score = 84.0 bits (206), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 39/120 (32%), Positives = 70/120 (58%), Gaps = 6/120 (5%)
Query: 694 RVLLGRSDISGWGAFLKNGVGKHEYLGEYTGELISHREADKRGKIYDRENSS----FLFN 749
++ + +++ GWG + K +++ EY GE+IS + ++R ++D ++ ++
Sbjct: 327 KIKIVKTEHCGWGVEAAESINKEDFIVEYIGEVISDAQCEQR--LWDMKHKGMKDFYMCE 384
Query: 750 LNDQFVLDAYRKGDKLKFANHAPDPNCYAKVIMVAGDHRVGIFAKERIGAGEELFYDYRY 809
+ F +DA KG+ +F NH+ +PNC + V G+ RVG+FA +I AGE L YDYR+
Sbjct: 385 IQKDFTIDATFKGNASRFLNHSCNPNCVLEKWQVEGETRVGVFAARQIEAGEPLTYDYRF 444
>AT3G59960.1 | Symbols: ASHH4, SDG24 | histone-lysine
N-methyltransferase ASHH4 | chr3:22148334-22150386
FORWARD LENGTH=352
Length = 352
Score = 72.4 bits (176), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 68/118 (57%), Gaps = 2/118 (1%)
Query: 694 RVLLGRSDISGWGAFLKNGVGKHEYLGEYTGELISHREADKR-GKI-YDRENSSFLFNLN 751
++ L +++ G+G + E++ EY GE+I + ++R K+ + E + +L +N
Sbjct: 112 KMKLVQTEKCGYGIVADEDINSGEFIIEYVGEVIDDKICEERLWKLNHKVETNFYLCQIN 171
Query: 752 DQFVLDAYRKGDKLKFANHAPDPNCYAKVIMVAGDHRVGIFAKERIGAGEELFYDYRY 809
V+DA KG+K ++ NH+ PN + ++ G+ R+GIFA I GE+L YDY++
Sbjct: 172 WNMVIDATHKGNKSRYINHSCSPNTEMQKWIIDGETRIGIFATRFINKGEQLTYDYQF 229
>AT1G77300.2 | Symbols: EFS, SDG8, ASHH2, LAZ2 | histone
methyltransferases(H3-K4 specific);histone
methyltransferases(H3-K36 specific) |
chr1:29040936-29048810 REVERSE LENGTH=1501
Length = 1501
Score = 69.7 bits (169), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 60/108 (55%), Gaps = 2/108 (1%)
Query: 704 GWGAFLKNGVGKHEYLGEYTGELISHREADKRGKIY--DRENSSFLFNLNDQFVLDAYRK 761
G+G L V + ++L EY GE++ + + R K Y + + LN V+DA K
Sbjct: 1037 GYGLRLLEDVREGQFLIEYVGEVLDMQSYETRQKEYAFKGQKHFYFMTLNGNEVIDAGAK 1096
Query: 762 GDKLKFANHAPDPNCYAKVIMVAGDHRVGIFAKERIGAGEELFYDYRY 809
G+ +F NH+ +PNC + MV G+ VGIF+ + + G+EL +DY Y
Sbjct: 1097 GNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMQDLKKGQELTFDYNY 1144
>AT1G77300.1 | Symbols: EFS, SDG8, CCR1, ASHH2, LAZ2 | histone
methyltransferases(H3-K4 specific);histone
methyltransferases(H3-K36 specific) |
chr1:29040160-29048810 REVERSE LENGTH=1805
Length = 1805
Score = 69.3 bits (168), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 60/108 (55%), Gaps = 2/108 (1%)
Query: 704 GWGAFLKNGVGKHEYLGEYTGELISHREADKRGKIY--DRENSSFLFNLNDQFVLDAYRK 761
G+G L V + ++L EY GE++ + + R K Y + + LN V+DA K
Sbjct: 1037 GYGLRLLEDVREGQFLIEYVGEVLDMQSYETRQKEYAFKGQKHFYFMTLNGNEVIDAGAK 1096
Query: 762 GDKLKFANHAPDPNCYAKVIMVAGDHRVGIFAKERIGAGEELFYDYRY 809
G+ +F NH+ +PNC + MV G+ VGIF+ + + G+EL +DY Y
Sbjct: 1097 GNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMQDLKKGQELTFDYNY 1144
>AT2G44150.1 | Symbols: ASHH3, SDG7 | histone-lysine
N-methyltransferase ASHH3 | chr2:18258863-18261003
FORWARD LENGTH=363
Length = 363
Score = 66.2 bits (160), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 67/118 (56%), Gaps = 2/118 (1%)
Query: 694 RVLLGRSDISGWGAFLKNGVGKHEYLGEYTGELISHREADKR-GKIYDR-ENSSFLFNLN 751
++ L +++ G G + + E++ EY GE+I + ++R K+ R E + +L +
Sbjct: 117 KMKLIQTEKCGSGIVAEEEIEAGEFIIEYVGEVIDDKTCEERLWKMKHRGETNFYLCEIT 176
Query: 752 DQFVLDAYRKGDKLKFANHAPDPNCYAKVIMVAGDHRVGIFAKERIGAGEELFYDYRY 809
V+DA KG+K ++ NH+ +PN + ++ G+ R+GIFA I GE L YDY++
Sbjct: 177 RDMVIDATHKGNKSRYINHSCNPNTQMQKWIIDGETRIGIFATRGIKKGEHLTYDYQF 234
>AT2G23740.2 | Symbols: | nucleic acid binding;sequence-specific DNA
binding transcription factors;zinc ion binding |
chr2:10098213-10103229 FORWARD LENGTH=1382
Length = 1382
Score = 59.7 bits (143), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 67/147 (45%), Gaps = 17/147 (11%)
Query: 682 CRNMXXXXXXXXRVLLGRSDISGWGAFLKNGVGKHEYLGEYTGELISHREADKRGKIYDR 741
C+N ++ + R++ GWG + + ++ EY GE++ +EA+KR Y
Sbjct: 1213 CQNRVLQNGIRAKLEVFRTESKGWGLRACEHILRGTFVCEYIGEVLDQQEANKRRNQYGN 1272
Query: 742 ENSSFLF----NLND---------QFVLDAYRKGDKLKFANHAPDPNCYAKVIMVAGDH- 787
+ S++ N+ND + +DA G+ +F NH+ PN ++V
Sbjct: 1273 GDCSYILDIDANINDIGRLMEEELDYAIDATTHGNISRFINHSCSPNLVNHQVIVESMES 1332
Query: 788 ---RVGIFAKERIGAGEELFYDYRYEP 811
+G++A I AGEE+ DY P
Sbjct: 1333 PLAHIGLYASMDIAAGEEITRDYGRRP 1359
>AT2G23740.1 | Symbols: | nucleic acid binding;sequence-specific DNA
binding transcription factors;zinc ion binding |
chr2:10098213-10103229 FORWARD LENGTH=1375
Length = 1375
Score = 59.7 bits (143), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 67/147 (45%), Gaps = 17/147 (11%)
Query: 682 CRNMXXXXXXXXRVLLGRSDISGWGAFLKNGVGKHEYLGEYTGELISHREADKRGKIYDR 741
C+N ++ + R++ GWG + + ++ EY GE++ +EA+KR Y
Sbjct: 1206 CQNRVLQNGIRAKLEVFRTESKGWGLRACEHILRGTFVCEYIGEVLDQQEANKRRNQYGN 1265
Query: 742 ENSSFLF----NLND---------QFVLDAYRKGDKLKFANHAPDPNCYAKVIMVAGDH- 787
+ S++ N+ND + +DA G+ +F NH+ PN ++V
Sbjct: 1266 GDCSYILDIDANINDIGRLMEEELDYAIDATTHGNISRFINHSCSPNLVNHQVIVESMES 1325
Query: 788 ---RVGIFAKERIGAGEELFYDYRYEP 811
+G++A I AGEE+ DY P
Sbjct: 1326 PLAHIGLYASMDIAAGEEITRDYGRRP 1352
>AT2G35160.1 | Symbols: SUVH5, SGD9 | SU(VAR)3-9 homolog 5 |
chr2:14823562-14825946 FORWARD LENGTH=794
Length = 794
Score = 59.3 bits (142), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 68/131 (51%), Gaps = 18/131 (13%)
Query: 694 RVLLGRSDISGWGAFLKNGVGKHEYLGEYTGELISHREADK-RGKIYDRENSSFLFNLND 752
++ + +++ GWG + ++ EY GEL+ ++A+ GK +LF+L D
Sbjct: 648 KLEIFKTESRGWGVRSLESIPIGSFICEYAGELLEDKQAESLTGK------DEYLFDLGD 701
Query: 753 Q---FVLDAYRKGDKLKFANHAPDPNCYAKVIMVAGDH------RVGIFAKERIGAGEEL 803
+ F ++A +KG+ +F NH+ PN YA+ ++ DH + FA + I +EL
Sbjct: 702 EDDPFTINAAQKGNIGRFINHSCSPNLYAQDVLY--DHEEIRIPHIMFFALDNIPPLQEL 759
Query: 804 FYDYRYEPDRA 814
YDY Y+ D+
Sbjct: 760 SYDYNYKIDQV 770
>AT3G03750.2 | Symbols: SUVR3, SDG20 | SET domain protein 20 |
chr3:939976-941511 FORWARD LENGTH=354
Length = 354
Score = 57.4 bits (137), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 51/195 (26%), Positives = 81/195 (41%), Gaps = 34/195 (17%)
Query: 631 GCHCAKSQCRSRQCPCFA-ADRECDPDVCRNCWVGCGDGTLGNPSQRGDNYECRNMXXXX 689
GC C + C C C A A E ++ C GCG G+ +C N
Sbjct: 144 GCECER--CEEGYCKCLAFAGME---EIANECGSGCGCGS-----------DCSNRVTQK 187
Query: 690 XXXXRVLLGRSDISGWGAFLKNGVGKHEYLGEYTGELISHREADKRGKIYDRENSSFLFN 749
+ + R + GW + + + +++ EY GEL++ EA +R IYD+ S+ F
Sbjct: 188 GVSVSLKIVRDEKKGWCLYADQLIKQGQFICEYAGELLTTDEARRRQNIYDKLRSTQSFA 247
Query: 750 LNDQFV--------------LDAYRKGDKLKFANHAPDPNCYAKVIMVAGDH---RVGIF 792
V +DA R G+ +F NH+ D + V++ + R+ F
Sbjct: 248 SALLVVREHLPSGQACLRINIDATRIGNVARFINHSCDGGNLSTVLLRSSGALLPRLCFF 307
Query: 793 AKERIGAGEELFYDY 807
A + I A EEL + Y
Sbjct: 308 AAKDIIAEEELSFSY 322
>AT2G22740.1 | Symbols: SUVH6, SDG23 | SU(VAR)3-9 homolog 6 |
chr2:9664256-9666628 REVERSE LENGTH=790
Length = 790
Score = 52.8 bits (125), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 40/142 (28%), Positives = 60/142 (42%), Gaps = 33/142 (23%)
Query: 704 GWGAFLKNGVGKHEYLGEYTGELISHREADKR----------GKIYDRENSSFLFNL--- 750
GWG + ++ EY GEL+ EA++R G YD + + L
Sbjct: 627 GWGVRCLKSIPIGSFICEYVGELLEDSEAERRIGNDEYLFDIGNRYDNSLAQGMSELMLG 686
Query: 751 ------------NDQFVLDAYRKGDKLKFANHAPDPNCYAKVIMVAGDH------RVGIF 792
+ F +DA KG+ +F NH+ PN YA+ ++ DH V F
Sbjct: 687 TQAGRSMAEGDESSGFTIDAASKGNVGRFINHSCSPNLYAQNVLY--DHEDSRIPHVMFF 744
Query: 793 AKERIGAGEELFYDYRYEPDRA 814
A++ I +EL YDY Y D+
Sbjct: 745 AQDNIPPLQELCYDYNYALDQV 766
>AT2G22740.2 | Symbols: SUVH6 | SU(VAR)3-9 homolog 6 |
chr2:9664256-9666628 REVERSE LENGTH=790
Length = 790
Score = 52.8 bits (125), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 40/142 (28%), Positives = 60/142 (42%), Gaps = 33/142 (23%)
Query: 704 GWGAFLKNGVGKHEYLGEYTGELISHREADKR----------GKIYDRENSSFLFNL--- 750
GWG + ++ EY GEL+ EA++R G YD + + L
Sbjct: 627 GWGVRCLKSIPIGSFICEYVGELLEDSEAERRIGNDEYLFDIGNRYDNSLAQGMSELMLG 686
Query: 751 ------------NDQFVLDAYRKGDKLKFANHAPDPNCYAKVIMVAGDH------RVGIF 792
+ F +DA KG+ +F NH+ PN YA+ ++ DH V F
Sbjct: 687 TQAGRSMAEGDESSGFTIDAASKGNVGRFINHSCSPNLYAQNVLY--DHEDSRIPHVMFF 744
Query: 793 AKERIGAGEELFYDYRYEPDRA 814
A++ I +EL YDY Y D+
Sbjct: 745 AQDNIPPLQELCYDYNYALDQV 766