Miyakogusa Predicted Gene
- Lj1g3v1819810.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v1819810.1 Non Chatacterized Hit- tr|K4C7K0|K4C7K0_SOLLC
Uncharacterized protein OS=Solanum lycopersicum
GN=Sol,34.98,2e-18,seg,NULL; ANDROGEN INDUCED INHIBITOR OF
PROLIFERATION (AS3) / PDS5-RELATED,NULL; Tudor domain,Tudor
,CUFF.29087.1
(877 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT4G31880.2 | Symbols: | LOCATED IN: cytosol; EXPRESSED IN: 24 ... 283 3e-76
AT4G31880.1 | Symbols: | LOCATED IN: cytosol, chloroplast; EXPR... 282 7e-76
AT1G15940.1 | Symbols: | Tudor/PWWP/MBT superfamily protein | c... 157 2e-38
AT1G80810.2 | Symbols: | Tudor/PWWP/MBT superfamily protein | c... 154 4e-37
AT1G80810.1 | Symbols: | Tudor/PWWP/MBT superfamily protein | c... 154 4e-37
AT5G10950.1 | Symbols: | Tudor/PWWP/MBT superfamily protein | c... 108 2e-23
AT5G47690.2 | Symbols: | binding | chr5:19317899-19327014 FORWA... 92 2e-18
AT5G47690.1 | Symbols: | binding | chr5:19317899-19327014 FORWA... 92 2e-18
AT5G47690.3 | Symbols: | binding | chr5:19317899-19327014 FORWA... 92 2e-18
AT4G32970.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 74 3e-13
AT4G02070.2 | Symbols: MSH6 | MUTS homolog 6 | chr4:906079-91293... 60 6e-09
AT4G02070.1 | Symbols: MSH6, MSH6-1, ATMSH6 | MUTS homolog 6 | c... 60 6e-09
AT1G77600.2 | Symbols: | ARM repeat superfamily protein | chr1:... 58 3e-08
AT1G77600.1 | Symbols: | ARM repeat superfamily protein | chr1:... 58 3e-08
AT1G77600.3 | Symbols: | ARM repeat superfamily protein | chr1:... 57 5e-08
AT4G32620.2 | Symbols: | Enhancer of polycomb-like transcriptio... 56 1e-07
AT4G32620.1 | Symbols: | Enhancer of polycomb-like transcriptio... 56 1e-07
AT1G05830.2 | Symbols: ATX2 | trithorax-like protein 2 | chr1:17... 50 6e-06
AT1G05830.1 | Symbols: ATX2, SDG30 | trithorax-like protein 2 | ... 50 6e-06
>AT4G31880.2 | Symbols: | LOCATED IN: cytosol; EXPRESSED IN: 24
plant structures; EXPRESSED DURING: 14 growth stages;
BEST Arabidopsis thaliana protein match is:
Tudor/PWWP/MBT superfamily protein (TAIR:AT1G15940.1). |
chr4:15419435-15423939 REVERSE LENGTH=872
Length = 872
Score = 283 bits (725), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 239/643 (37%), Positives = 340/643 (52%), Gaps = 106/643 (16%)
Query: 1 MLETVAKVRSCVVMLDLECDHLILEMFQHFLKEIREHHPEDVFSSMETVMTLVLEESEDI 60
+LETVAKVRSCVVMLDLECD L++EMFQHFLK IR+HH +VFSSME +MTLVLEESEDI
Sbjct: 129 ILETVAKVRSCVVMLDLECDALLIEMFQHFLKAIRDHHSGNVFSSMENIMTLVLEESEDI 188
Query: 61 SMDLLCPLLATFKKDNEEVSPIALQLVERVLENCATKLQPYLVQAVKTLGISVDD-SKIL 119
++L P+L + KKD+ E+S ++ +L E+VL NCA+KL+ YL +AVK+ G+ +D S I+
Sbjct: 189 PSEMLSPILHSVKKDD-EISQVSRRLAEQVLSNCASKLKTYLTEAVKSSGVPLDKYSNIV 247
Query: 120 AKICQDASDSFEKNDVCVSSEHVEDKGKSPKQSSEETTQAAKGDATEAEHSQQDNPNGNR 179
A IC+ + +++ V V++E + +G +++ E + + + T+A +
Sbjct: 248 ASICEGTFSALQQDQV-VANEKEDSQGHIKRETEVEAAEISTPERTDAPK--------DE 298
Query: 180 SPKSVMSNGVACVREDNALADSKSNKQEDTDCSGHSESLDVSGQRELNNLDAEKVDNDGG 239
S KS +SNGVA + + DS KQ+DT + LD +LNN EK D
Sbjct: 299 SGKSGVSNGVAQQNDSSVDTDSM-KKQDDTGAKDEPQQLDNPRNTDLNNTTEEKPD---- 353
Query: 240 XXXXXXXXXXXXXXXXXXXXXXXVKAGSQGQVVAH---EKEAEKPSVVAG--SQDQVDAD 294
V H EKE E SV S+D +
Sbjct: 354 --------------------------------VEHQIEEKENESSSVKQADLSKDSDIKE 381
Query: 295 EKEAEKRLDS-EAPSSPHEGHSVEAAGPTEDDKETHADISSPEACDDESEVKASPSTSEN 353
E E + LDS + +SP SV AA +E++K I + DE+ +SPS +E
Sbjct: 382 ETEPAELLDSKDVLTSPPVDSSVTAATSSENEKNKSVQILPSKTSGDETANVSSPSMAEE 441
Query: 354 LHDQIRSKKLGKAKIKDIPAKEGATEDV----SI---KVSRGASNSEAKPARRSVKKALG 406
L +Q KK K KE +TE+V SI +VS + SE + ++S KK +
Sbjct: 442 LPEQSVPKKTANQK-----KKESSTEEVKPSASIATEEVSEEPNTSEPQVTKKSGKK-VA 495
Query: 407 RNSDVKDTTVADLVKKGSGAASDVDAKKNSAKKLGEHKSDINAKKHSAKKLDEQKGGSGS 466
+S K T V + S+ K S KK+ SD NA++
Sbjct: 496 SSSKTKPT-----VPPSKKSTSETKVAKQSEKKV--VGSD-NAQE--------------- 532
Query: 467 SSRKLENNKKSGRGKANSEAAVAKSSAIDVDKEMTVYSPRSGTKSTKS-----ENTEEIP 521
S++ E KK GRGKA E ++ SS D E S SG ++KS + EE P
Sbjct: 533 STKPKEEKKKPGRGKAIDEESLHTSSG---DNEKPAVS--SGKLASKSKKEAKQTVEESP 587
Query: 522 LTSAKRKRTPGKEKESDTKKYGENLVGLRVKVWWPEDREFYTGVVNSFDSARKKHKVLYD 581
++ KRKR+ G+ K S GE+LVG R+KVWWP D+ +Y GVV S+D+A+KKH V+YD
Sbjct: 588 NSNTKRKRSLGQGKAS-----GESLVGSRIKVWWPMDQAYYKGVVESYDAAKKKHLVIYD 642
Query: 582 DGDEETLNLREEKWGVIKKADSDADGEEGSDQAGLDGSAEMPP 624
DGD+E L L+ +KW + +++ D EE +DQ G + A P
Sbjct: 643 DGDQEILYLKNQKWSPLDESELSQD-EEAADQTGQEEDASTVP 684
>AT4G31880.1 | Symbols: | LOCATED IN: cytosol, chloroplast;
EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 14
growth stages; BEST Arabidopsis thaliana protein match
is: Tudor/PWWP/MBT superfamily protein
(TAIR:AT1G15940.1); Has 137162 Blast hits to 70781
proteins in 2973 species: Archae - 289; Bacteria -
24182; Metazoa - 56725; Fungi - 20130; Plants - 6559;
Viruses - 758; Other Eukaryotes - 28519 (source: NCBI
BLink). | chr4:15419435-15423939 REVERSE LENGTH=873
Length = 873
Score = 282 bits (722), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 242/643 (37%), Positives = 342/643 (53%), Gaps = 105/643 (16%)
Query: 1 MLETVAKVRSCVVMLDLECDHLILEMFQHFLKEIREHHPEDVFSSMETVMTLVLEESEDI 60
+LETVAKVRSCVVMLDLECD L++EMFQHFLK IR+HH +VFSSME +MTLVLEESEDI
Sbjct: 129 ILETVAKVRSCVVMLDLECDALLIEMFQHFLKAIRDHHSGNVFSSMENIMTLVLEESEDI 188
Query: 61 SMDLLCPLLATFKKDNEEVSPIALQLVERVLENCATKLQPYLVQAVKTLGISVDD-SKIL 119
++L P+L + KKD+ E+S ++ +L E+VL NCA+KL+ YL +AVK+ G+ +D S I+
Sbjct: 189 PSEMLSPILHSVKKDD-EISQVSRRLAEQVLSNCASKLKTYLTEAVKSSGVPLDKYSNIV 247
Query: 120 AKICQDASDSFEKNDVCVSSEHVEDKGKSPKQSSEETTQAAKGDATEAEHSQQDNPNGNR 179
A IC+ + +++ V V++E + +G +++ E +AA+ E + D P +
Sbjct: 248 ASICEGTFSALQQDQV-VANEKEDSQGHIKRET--EVEKAAEISTPE----RTDAPK-DE 299
Query: 180 SPKSVMSNGVACVREDNALADSKSNKQEDTDCSGHSESLDVSGQRELNNLDAEKVDNDGG 239
S KS +SNGVA + + DS KQ+DT + LD +LNN EK D
Sbjct: 300 SGKSGVSNGVAQQNDSSVDTDSM-KKQDDTGAKDEPQQLDNPRNTDLNNTTEEKPD---- 354
Query: 240 XXXXXXXXXXXXXXXXXXXXXXXVKAGSQGQVVAH---EKEAEKPSVVAG--SQDQVDAD 294
V H EKE E SV S+D +
Sbjct: 355 --------------------------------VEHQIEEKENESSSVKQADLSKDSDIKE 382
Query: 295 EKEAEKRLDS-EAPSSPHEGHSVEAAGPTEDDKETHADISSPEACDDESEVKASPSTSEN 353
E E + LDS + +SP SV AA +E++K I + DE+ +SPS +E
Sbjct: 383 ETEPAELLDSKDVLTSPPVDSSVTAATSSENEKNKSVQILPSKTSGDETANVSSPSMAEE 442
Query: 354 LHDQIRSKKLGKAKIKDIPAKEGATEDV----SI---KVSRGASNSEAKPARRSVKKALG 406
L +Q KK K KE +TE+V SI +VS + SE + ++S KK +
Sbjct: 443 LPEQSVPKKTANQK-----KKESSTEEVKPSASIATEEVSEEPNTSEPQVTKKSGKK-VA 496
Query: 407 RNSDVKDTTVADLVKKGSGAASDVDAKKNSAKKLGEHKSDINAKKHSAKKLDEQKGGSGS 466
+S K T V + S+ K S KK+ SD NA++
Sbjct: 497 SSSKTKPT-----VPPSKKSTSETKVAKQSEKKV--VGSD-NAQE--------------- 533
Query: 467 SSRKLENNKKSGRGKANSEAAVAKSSAIDVDKEMTVYSPRSGTKSTKS-----ENTEEIP 521
S++ E KK GRGKA E ++ SS D E S SG ++KS + EE P
Sbjct: 534 STKPKEEKKKPGRGKAIDEESLHTSSG---DNEKPAVS--SGKLASKSKKEAKQTVEESP 588
Query: 522 LTSAKRKRTPGKEKESDTKKYGENLVGLRVKVWWPEDREFYTGVVNSFDSARKKHKVLYD 581
++ KRKR+ G+ K S GE+LVG R+KVWWP D+ +Y GVV S+D+A+KKH V+YD
Sbjct: 589 NSNTKRKRSLGQGKAS-----GESLVGSRIKVWWPMDQAYYKGVVESYDAAKKKHLVIYD 643
Query: 582 DGDEETLNLREEKWGVIKKADSDADGEEGSDQAGLDGSAEMPP 624
DGD+E L L+ +KW + +++ D EE +DQ G + A P
Sbjct: 644 DGDQEILYLKNQKWSPLDESELSQD-EEAADQTGQEEDASTVP 685
>AT1G15940.1 | Symbols: | Tudor/PWWP/MBT superfamily protein |
chr1:5473672-5478050 FORWARD LENGTH=990
Length = 990
Score = 157 bits (398), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 74/125 (59%), Positives = 97/125 (77%), Gaps = 1/125 (0%)
Query: 1 MLETVAKVRSCVVMLDLECDHLILEMFQHFLKEIREHHPEDVFSSMETVMTLVLEESEDI 60
+LETVAKVRS +VMLDLECD L+LEMFQ FLK IR HP+ V SMET+M V++ESE++
Sbjct: 133 ILETVAKVRSSLVMLDLECDDLVLEMFQRFLKIIRPDHPQLVLVSMETIMITVIDESEEV 192
Query: 61 SMDLLCPLLATFKKDNEEVSPIALQLVERVLENCATKLQPYLVQAVKTLGISVD-DSKIL 119
MDLL LL T KKD+++VSP AL LVE+VL +C KLQP +++A+K+ G S+D S ++
Sbjct: 193 PMDLLEILLTTVKKDSQDVSPAALTLVEKVLSSCTCKLQPCIMEALKSSGTSLDMYSPVV 252
Query: 120 AKICQ 124
+ ICQ
Sbjct: 253 SSICQ 257
Score = 97.4 bits (241), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 68/188 (36%), Positives = 99/188 (52%), Gaps = 23/188 (12%)
Query: 432 AKKNSAKKLGEHKSDI---NAKKH--------SAKKLDEQKGGSGSSSRK----LENNKK 476
AKK + KK K D+ N KKH S+KK G + +S++K K
Sbjct: 438 AKKQTVKKTNPAKEDLTKSNVKKHEDGIKTGKSSKKEKADNGLAKTSAKKPLAETMMVKP 497
Query: 477 SGRGKANSEAAVAKSSAIDVDKEM------TVYSPRSGTKSTKSENTEEIPLTSAKRKRT 530
SG+ +S+A S +D + R+ T +TK +E+ P + K KR
Sbjct: 498 SGKKLVHSDAKKKNSEGASMDTPIPQSSKSKKKDSRATTPATK--KSEQAPKSHPKMKRI 555
Query: 531 PGKEKESDTKKYGENLVGLRVKVWWPEDREFYTGVVNSFDSARKKHKVLYDDGDEETLNL 590
G+E ES+T + GE LVG RV VWWP D++FY GV+ S+ +K H+V Y DGD E LNL
Sbjct: 556 AGEEVESNTNELGEELVGKRVNVWWPLDKKFYEGVIKSYCRVKKMHQVTYSDGDVEELNL 615
Query: 591 REEKWGVI 598
++E++ +I
Sbjct: 616 KKERFKII 623
>AT1G80810.2 | Symbols: | Tudor/PWWP/MBT superfamily protein |
chr1:30365575-30368898 FORWARD LENGTH=774
Length = 774
Score = 154 bits (388), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 87/225 (38%), Positives = 141/225 (62%), Gaps = 6/225 (2%)
Query: 1 MLETVAKVRSCVVMLDLECDHLILEMFQHFLKEIREHHPEDVFSSMETVMTLVLEESEDI 60
+L+ VAKV+SC+VMLDLEC LIL+MF++F K IR HP+ VFSSME +M +++E+E +
Sbjct: 79 VLDNVAKVKSCLVMLDLECYDLILQMFRNFFKFIRSDHPQLVFSSMELIMIAIIDETEQV 138
Query: 61 SMDLLCPLLATFKKDNEEVSPIALQLVERVLENCATKLQPYLVQAVKTLGISVD-DSKIL 119
S DLL LLAT KK+N+ VSP++ L E+VL CA KL+PY+++A+K+ G S+D S ++
Sbjct: 139 STDLLDSLLATVKKENQNVSPMSWSLAEKVLSRCARKLKPYIIEALKSRGTSLDMYSPVV 198
Query: 120 AKICQDASDSFEKNDVCVSSEHVE--DKGKSPKQS-SEETTQAAKGDATEAEHSQQDNPN 176
+ ICQ ++ + + + EH E D G S K++ S+ +++ T + ++ N
Sbjct: 199 SSICQSVFNTPKVHSPVNTKEHEEKLDLGHSRKENLSKSSSKRPARHETRGINEKEKVRN 258
Query: 177 GNRSPKSVMSNGVACVREDNALADSKSNKQEDTDCSGHSESLDVS 221
GN+S S++ + VR ++ A+ + + + E D+S
Sbjct: 259 GNKS--SLLKQSLKQVRSESTDAEITGKRGRKPNSLMNPEDYDIS 301
Score = 90.5 bits (223), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 44/93 (47%), Positives = 64/93 (68%), Gaps = 4/93 (4%)
Query: 514 SENTEEIPLTSAKRKRTPGKEKESDTKKYGENLVGLRVKVWWPEDREFYTGVVNSFDSAR 573
+E+ EE P + R+RT KE SD +GE+LVG RV +WWP D+ FY GV++S+ + +
Sbjct: 476 NESEEETPKSHPTRRRTVRKEV-SDG--FGEDLVGKRVNIWWPLDKTFYEGVIDSYCTRK 532
Query: 574 KKHKVLYDDGDEETLNLREEKWGVIKKADSDAD 606
K H+V+Y DGD E LNL EE+W +++ D+ AD
Sbjct: 533 KMHRVIYSDGDSEELNLTEERWELLED-DTSAD 564
>AT1G80810.1 | Symbols: | Tudor/PWWP/MBT superfamily protein |
chr1:30365575-30368898 FORWARD LENGTH=773
Length = 773
Score = 154 bits (388), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 87/225 (38%), Positives = 141/225 (62%), Gaps = 6/225 (2%)
Query: 1 MLETVAKVRSCVVMLDLECDHLILEMFQHFLKEIREHHPEDVFSSMETVMTLVLEESEDI 60
+L+ VAKV+SC+VMLDLEC LIL+MF++F K IR HP+ VFSSME +M +++E+E +
Sbjct: 79 VLDNVAKVKSCLVMLDLECYDLILQMFRNFFKFIRSDHPQLVFSSMELIMIAIIDETEQV 138
Query: 61 SMDLLCPLLATFKKDNEEVSPIALQLVERVLENCATKLQPYLVQAVKTLGISVD-DSKIL 119
S DLL LLAT KK+N+ VSP++ L E+VL CA KL+PY+++A+K+ G S+D S ++
Sbjct: 139 STDLLDSLLATVKKENQNVSPMSWSLAEKVLSRCARKLKPYIIEALKSRGTSLDMYSPVV 198
Query: 120 AKICQDASDSFEKNDVCVSSEHVE--DKGKSPKQS-SEETTQAAKGDATEAEHSQQDNPN 176
+ ICQ ++ + + + EH E D G S K++ S+ +++ T + ++ N
Sbjct: 199 SSICQSVFNTPKVHSPVNTKEHEEKLDLGHSRKENLSKSSSKRPARHETRGINEKEKVRN 258
Query: 177 GNRSPKSVMSNGVACVREDNALADSKSNKQEDTDCSGHSESLDVS 221
GN+S S++ + VR ++ A+ + + + E D+S
Sbjct: 259 GNKS--SLLKQSLKQVRSESTDAEITGKRGRKPNSLMNPEDYDIS 301
Score = 90.5 bits (223), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 44/93 (47%), Positives = 64/93 (68%), Gaps = 4/93 (4%)
Query: 514 SENTEEIPLTSAKRKRTPGKEKESDTKKYGENLVGLRVKVWWPEDREFYTGVVNSFDSAR 573
+E+ EE P + R+RT KE SD +GE+LVG RV +WWP D+ FY GV++S+ + +
Sbjct: 476 NESEEETPKSHPTRRRTVRKEV-SDG--FGEDLVGKRVNIWWPLDKTFYEGVIDSYCTRK 532
Query: 574 KKHKVLYDDGDEETLNLREEKWGVIKKADSDAD 606
K H+V+Y DGD E LNL EE+W +++ D+ AD
Sbjct: 533 KMHRVIYSDGDSEELNLTEERWELLED-DTSAD 564
>AT5G10950.1 | Symbols: | Tudor/PWWP/MBT superfamily protein |
chr5:3459557-3461632 REVERSE LENGTH=395
Length = 395
Score = 108 bits (269), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 45/85 (52%), Positives = 70/85 (82%)
Query: 529 RTPGKEKESDTKKYGENLVGLRVKVWWPEDREFYTGVVNSFDSARKKHKVLYDDGDEETL 588
R+ GK+K SD +KYGE LVG R++VWWP D +FY GVV+S+ S++KKH+V Y+DGD+ETL
Sbjct: 20 RSSGKDKVSDARKYGEALVGSRIRVWWPMDSKFYKGVVDSYVSSKKKHRVFYEDGDKETL 79
Query: 589 NLREEKWGVIKKADSDADGEEGSDQ 613
+L++E+W +I++ D++++ +E S Q
Sbjct: 80 DLKKERWELIEEDDAESESDEISLQ 104
>AT5G47690.2 | Symbols: | binding | chr5:19317899-19327014 FORWARD
LENGTH=1606
Length = 1606
Score = 91.7 bits (226), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 51/99 (51%), Positives = 70/99 (70%)
Query: 1 MLETVAKVRSCVVMLDLECDHLILEMFQHFLKEIREHHPEDVFSSMETVMTLVLEESEDI 60
+LETVAK RSCVVMLDLECD L+ E+F FL R+ HPE VFSSM+ +M ++LEESED+
Sbjct: 126 ILETVAKYRSCVVMLDLECDDLVKEVFTTFLDVARDDHPEIVFSSMQNIMIVLLEESEDV 185
Query: 61 SMDLLCPLLATFKKDNEEVSPIALQLVERVLENCATKLQ 99
LL LL+ ++ +V A +L +V+E+CA K++
Sbjct: 186 QEHLLLILLSKLGRNRSDVRDAARRLAMKVIEHCAPKVE 224
Score = 84.3 bits (207), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 35/75 (46%), Positives = 51/75 (68%), Gaps = 2/75 (2%)
Query: 526 KRKRTPGKEKES--DTKKYGENLVGLRVKVWWPEDREFYTGVVNSFDSARKKHKVLYDDG 583
KRK G K S + K + L+G R++VWWP D+ FY G V S+DS +++H +LY+DG
Sbjct: 1346 KRKNVSGLAKCSTKENKLVNDELIGCRIEVWWPMDKRFYEGTVKSYDSTKQRHVILYEDG 1405
Query: 584 DEETLNLREEKWGVI 598
D E LNL++E+W +I
Sbjct: 1406 DVEVLNLKKEQWELI 1420
>AT5G47690.1 | Symbols: | binding | chr5:19317899-19327014 FORWARD
LENGTH=1605
Length = 1605
Score = 91.7 bits (226), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 51/99 (51%), Positives = 70/99 (70%)
Query: 1 MLETVAKVRSCVVMLDLECDHLILEMFQHFLKEIREHHPEDVFSSMETVMTLVLEESEDI 60
+LETVAK RSCVVMLDLECD L+ E+F FL R+ HPE VFSSM+ +M ++LEESED+
Sbjct: 126 ILETVAKYRSCVVMLDLECDDLVKEVFTTFLDVARDDHPEIVFSSMQNIMIVLLEESEDV 185
Query: 61 SMDLLCPLLATFKKDNEEVSPIALQLVERVLENCATKLQ 99
LL LL+ ++ +V A +L +V+E+CA K++
Sbjct: 186 QEHLLLILLSKLGRNRSDVRDAARRLAMKVIEHCAPKVE 224
Score = 84.3 bits (207), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 35/75 (46%), Positives = 51/75 (68%), Gaps = 2/75 (2%)
Query: 526 KRKRTPGKEKES--DTKKYGENLVGLRVKVWWPEDREFYTGVVNSFDSARKKHKVLYDDG 583
KRK G K S + K + L+G R++VWWP D+ FY G V S+DS +++H +LY+DG
Sbjct: 1345 KRKNVSGLAKCSTKENKLVNDELIGCRIEVWWPMDKRFYEGTVKSYDSTKQRHVILYEDG 1404
Query: 584 DEETLNLREEKWGVI 598
D E LNL++E+W +I
Sbjct: 1405 DVEVLNLKKEQWELI 1419
>AT5G47690.3 | Symbols: | binding | chr5:19317899-19327014 FORWARD
LENGTH=1607
Length = 1607
Score = 91.7 bits (226), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 51/99 (51%), Positives = 70/99 (70%)
Query: 1 MLETVAKVRSCVVMLDLECDHLILEMFQHFLKEIREHHPEDVFSSMETVMTLVLEESEDI 60
+LETVAK RSCVVMLDLECD L+ E+F FL R+ HPE VFSSM+ +M ++LEESED+
Sbjct: 126 ILETVAKYRSCVVMLDLECDDLVKEVFTTFLDVARDDHPEIVFSSMQNIMIVLLEESEDV 185
Query: 61 SMDLLCPLLATFKKDNEEVSPIALQLVERVLENCATKLQ 99
LL LL+ ++ +V A +L +V+E+CA K++
Sbjct: 186 QEHLLLILLSKLGRNRSDVRDAARRLAMKVIEHCAPKVE 224
Score = 84.3 bits (207), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 35/75 (46%), Positives = 51/75 (68%), Gaps = 2/75 (2%)
Query: 526 KRKRTPGKEKES--DTKKYGENLVGLRVKVWWPEDREFYTGVVNSFDSARKKHKVLYDDG 583
KRK G K S + K + L+G R++VWWP D+ FY G V S+DS +++H +LY+DG
Sbjct: 1346 KRKNVSGLAKCSTKENKLVNDELIGCRIEVWWPMDKRFYEGTVKSYDSTKQRHVILYEDG 1405
Query: 584 DEETLNLREEKWGVI 598
D E LNL++E+W +I
Sbjct: 1406 DVEVLNLKKEQWELI 1420
>AT4G32970.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G32960.1); Has 552 Blast hits to 489 proteins
in 85 species: Archae - 4; Bacteria - 14; Metazoa - 187;
Fungi - 12; Plants - 225; Viruses - 0; Other Eukaryotes
- 110 (source: NCBI BLink). | chr4:15910671-15914300
REVERSE LENGTH=638
Length = 638
Score = 74.3 bits (181), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 41/91 (45%), Positives = 55/91 (60%), Gaps = 10/91 (10%)
Query: 489 AKSSAIDVDKEMTVYSPRSGTKSTKS--ENTEEIPLTSAKRKRTPGKEKESDTKKYGENL 546
AK A+ DK S + STK + E+ P T+ KR + GKEK SD KKY E +
Sbjct: 234 AKKPAVSCDK--------SASDSTKGAKQPLEKKPKTNTKRIHSLGKEKTSDFKKYDEKI 285
Query: 547 VGLRVKVWWPEDREFYTGVVNSFDSARKKHK 577
VG RVK+WWP DR +Y VV S+ SA+++H+
Sbjct: 286 VGSRVKIWWPLDRAYYEAVVISYCSAKERHR 316
>AT4G02070.2 | Symbols: MSH6 | MUTS homolog 6 | chr4:906079-912930
FORWARD LENGTH=1321
Length = 1321
Score = 60.5 bits (145), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 43/64 (67%), Gaps = 2/64 (3%)
Query: 536 ESDTKKYGENLVGLRVKVWWPEDREFYTGVVNSFDSARKKHKVLYDDGDEETLNLREEK- 594
+S YG+ +VG +V+V+WP D+++Y G V +D KH V Y+DG+EE+L+L +EK
Sbjct: 114 QSVVITYGDEVVGKQVRVYWPLDKKWYDGSVTFYDKGEGKHVVEYEDGEEESLDLGKEKT 173
Query: 595 -WGV 597
W V
Sbjct: 174 EWVV 177
>AT4G02070.1 | Symbols: MSH6, MSH6-1, ATMSH6 | MUTS homolog 6 |
chr4:906079-912930 FORWARD LENGTH=1324
Length = 1324
Score = 60.5 bits (145), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 43/64 (67%), Gaps = 2/64 (3%)
Query: 536 ESDTKKYGENLVGLRVKVWWPEDREFYTGVVNSFDSARKKHKVLYDDGDEETLNLREEK- 594
+S YG+ +VG +V+V+WP D+++Y G V +D KH V Y+DG+EE+L+L +EK
Sbjct: 114 QSVVITYGDEVVGKQVRVYWPLDKKWYDGSVTFYDKGEGKHVVEYEDGEEESLDLGKEKT 173
Query: 595 -WGV 597
W V
Sbjct: 174 EWVV 177
>AT1G77600.2 | Symbols: | ARM repeat superfamily protein |
chr1:29152890-29162156 REVERSE LENGTH=1410
Length = 1410
Score = 57.8 bits (138), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 35/128 (27%), Positives = 66/128 (51%), Gaps = 24/128 (18%)
Query: 1 MLETVAKVRSCVVMLDLECDHLILEMFQHFLKEIREHHPE-------------------- 40
+LETV++++ C++MLD +C L+ EMF F +REHH +
Sbjct: 134 ILETVSRLKFCLLMLDEDCQDLVHEMFNMFFSLVREHHQQSLINQKSMKTQQRKANTQQT 193
Query: 41 --DVFSSMETVMTLVLEESEDISMDLLCPLLATFKKDNEEVSPIALQLVERVLENCATKL 98
+F+++ +M+ VLEE + S ++ +L K+ E+ + A +L ++E CA +L
Sbjct: 194 QHSLFNNILAIMSDVLEEEANSSFVVV--ILENLVKEGEDTTSGADKLASSLIERCADRL 251
Query: 99 QPYLVQAV 106
+P + +
Sbjct: 252 EPLICSFL 259
Score = 57.0 bits (136), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 23/58 (39%), Positives = 34/58 (58%)
Query: 538 DTKKYGENLVGLRVKVWWPEDREFYTGVVNSFDSARKKHKVLYDDGDEETLNLREEKW 595
D +GE ++G R+K+ P D FY G V F+S HK+++D+GD E + L E W
Sbjct: 1249 DISNHGEAIIGQRIKLLSPTDGCFYPGTVEKFNSKSNSHKIIFDNGDVELVCLDSESW 1306
>AT1G77600.1 | Symbols: | ARM repeat superfamily protein |
chr1:29152890-29162156 REVERSE LENGTH=1367
Length = 1367
Score = 57.8 bits (138), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 35/128 (27%), Positives = 66/128 (51%), Gaps = 24/128 (18%)
Query: 1 MLETVAKVRSCVVMLDLECDHLILEMFQHFLKEIREHHPE-------------------- 40
+LETV++++ C++MLD +C L+ EMF F +REHH +
Sbjct: 134 ILETVSRLKFCLLMLDEDCQDLVHEMFNMFFSLVREHHQQSLINQKSMKTQQRKANTQQT 193
Query: 41 --DVFSSMETVMTLVLEESEDISMDLLCPLLATFKKDNEEVSPIALQLVERVLENCATKL 98
+F+++ +M+ VLEE + S ++ +L K+ E+ + A +L ++E CA +L
Sbjct: 194 QHSLFNNILAIMSDVLEEEANSSFVVV--ILENLVKEGEDTTSGADKLASSLIERCADRL 251
Query: 99 QPYLVQAV 106
+P + +
Sbjct: 252 EPLICSFL 259
Score = 56.6 bits (135), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 23/58 (39%), Positives = 34/58 (58%)
Query: 538 DTKKYGENLVGLRVKVWWPEDREFYTGVVNSFDSARKKHKVLYDDGDEETLNLREEKW 595
D +GE ++G R+K+ P D FY G V F+S HK+++D+GD E + L E W
Sbjct: 1213 DISNHGEAIIGQRIKLLSPTDGCFYPGTVEKFNSKSNSHKIIFDNGDVELVCLDSESW 1270
>AT1G77600.3 | Symbols: | ARM repeat superfamily protein |
chr1:29152890-29162156 REVERSE LENGTH=1424
Length = 1424
Score = 57.0 bits (136), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 35/128 (27%), Positives = 66/128 (51%), Gaps = 24/128 (18%)
Query: 1 MLETVAKVRSCVVMLDLECDHLILEMFQHFLKEIREHHPE-------------------- 40
+LETV++++ C++MLD +C L+ EMF F +REHH +
Sbjct: 134 ILETVSRLKFCLLMLDEDCQDLVHEMFNMFFSLVREHHQQSLINQKSMKTQQRKANTQQT 193
Query: 41 --DVFSSMETVMTLVLEESEDISMDLLCPLLATFKKDNEEVSPIALQLVERVLENCATKL 98
+F+++ +M+ VLEE + S ++ +L K+ E+ + A +L ++E CA +L
Sbjct: 194 QHSLFNNILAIMSDVLEEEANSSFVVV--ILENLVKEGEDTTSGADKLASSLIERCADRL 251
Query: 99 QPYLVQAV 106
+P + +
Sbjct: 252 EPLICSFL 259
Score = 57.0 bits (136), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 23/58 (39%), Positives = 34/58 (58%)
Query: 538 DTKKYGENLVGLRVKVWWPEDREFYTGVVNSFDSARKKHKVLYDDGDEETLNLREEKW 595
D +GE ++G R+K+ P D FY G V F+S HK+++D+GD E + L E W
Sbjct: 1263 DISNHGEAIIGQRIKLLSPTDGCFYPGTVEKFNSKSNSHKIIFDNGDVELVCLDSESW 1320
>AT4G32620.2 | Symbols: | Enhancer of polycomb-like transcription
factor protein | chr4:15731968-15737222 FORWARD
LENGTH=1540
Length = 1540
Score = 55.8 bits (133), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 35/53 (66%)
Query: 546 LVGLRVKVWWPEDREFYTGVVNSFDSARKKHKVLYDDGDEETLNLREEKWGVI 598
L+ ++KV+WP D +Y G V+ FD + H V YDD DEE +NL+ E++ ++
Sbjct: 355 LLNKKIKVFWPLDERWYHGFVDGFDGDKNLHHVKYDDRDEEWINLQGERFKIL 407
>AT4G32620.1 | Symbols: | Enhancer of polycomb-like transcription
factor protein | chr4:15731968-15737222 FORWARD
LENGTH=1539
Length = 1539
Score = 55.8 bits (133), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 35/53 (66%)
Query: 546 LVGLRVKVWWPEDREFYTGVVNSFDSARKKHKVLYDDGDEETLNLREEKWGVI 598
L+ ++KV+WP D +Y G V+ FD + H V YDD DEE +NL+ E++ ++
Sbjct: 355 LLNKKIKVFWPLDERWYHGFVDGFDGDKNLHHVKYDDRDEEWINLQGERFKIL 407
>AT1G05830.2 | Symbols: ATX2 | trithorax-like protein 2 |
chr1:1754452-1761225 FORWARD LENGTH=1083
Length = 1083
Score = 50.4 bits (119), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 30/50 (60%)
Query: 545 NLVGLRVKVWWPEDREFYTGVVNSFDSARKKHKVLYDDGDEETLNLREEK 594
+ +GL+ KV+WP D +Y G + ++ K H V Y DGD E L LR EK
Sbjct: 219 HFIGLQCKVFWPLDAVWYPGSIVGYNVETKHHIVKYGDGDGEELALRREK 268
>AT1G05830.1 | Symbols: ATX2, SDG30 | trithorax-like protein 2 |
chr1:1754452-1761225 FORWARD LENGTH=1083
Length = 1083
Score = 50.4 bits (119), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 30/50 (60%)
Query: 545 NLVGLRVKVWWPEDREFYTGVVNSFDSARKKHKVLYDDGDEETLNLREEK 594
+ +GL+ KV+WP D +Y G + ++ K H V Y DGD E L LR EK
Sbjct: 219 HFIGLQCKVFWPLDAVWYPGSIVGYNVETKHHIVKYGDGDGEELALRREK 268