Miyakogusa Predicted Gene

Lj5g3v0511000.2
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj5g3v0511000.2 tr|D7MP20|D7MP20_ARALL Binding protein
OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_494201
PE=4,44.72,3e-18,SUBFAMILY NOT NAMED,NULL; ANDROGEN INDUCED INHIBITOR
OF PROLIFERATION (AS3) / PDS5-RELATED,NULL; seg,CUFF.53762.2
         (796 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT4G31880.2 | Symbols:  | LOCATED IN: cytosol; EXPRESSED IN: 24 ...   484   e-136
AT4G31880.1 | Symbols:  | LOCATED IN: cytosol, chloroplast; EXPR...   483   e-136
AT1G15940.1 | Symbols:  | Tudor/PWWP/MBT superfamily protein | c...   226   4e-59
AT1G80810.2 | Symbols:  | Tudor/PWWP/MBT superfamily protein | c...   209   4e-54
AT1G80810.1 | Symbols:  | Tudor/PWWP/MBT superfamily protein | c...   209   4e-54
AT5G47690.2 | Symbols:  | binding | chr5:19317899-19327014 FORWA...   186   5e-47
AT5G47690.1 | Symbols:  | binding | chr5:19317899-19327014 FORWA...   186   6e-47
AT5G47690.3 | Symbols:  | binding | chr5:19317899-19327014 FORWA...   186   6e-47
AT1G77600.1 | Symbols:  | ARM repeat superfamily protein | chr1:...   114   3e-25
AT1G77600.2 | Symbols:  | ARM repeat superfamily protein | chr1:...   114   3e-25
AT5G10950.1 | Symbols:  | Tudor/PWWP/MBT superfamily protein | c...   114   3e-25
AT1G77600.3 | Symbols:  | ARM repeat superfamily protein | chr1:...   114   4e-25
AT4G32970.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    81   3e-15
AT4G02070.2 | Symbols: MSH6 | MUTS homolog 6 | chr4:906079-91293...    70   7e-12
AT4G02070.1 | Symbols: MSH6, MSH6-1, ATMSH6 | MUTS homolog 6 | c...    70   7e-12
AT4G32620.2 | Symbols:  | Enhancer of polycomb-like transcriptio...    59   1e-08
AT4G32620.1 | Symbols:  | Enhancer of polycomb-like transcriptio...    59   1e-08
AT1G05830.2 | Symbols: ATX2 | trithorax-like protein 2 | chr1:17...    50   6e-06
AT1G05830.1 | Symbols: ATX2, SDG30 | trithorax-like protein 2 | ...    50   6e-06

>AT4G31880.2 | Symbols:  | LOCATED IN: cytosol; EXPRESSED IN: 24
           plant structures; EXPRESSED DURING: 14 growth stages;
           BEST Arabidopsis thaliana protein match is:
           Tudor/PWWP/MBT superfamily protein (TAIR:AT1G15940.1). |
           chr4:15419435-15423939 REVERSE LENGTH=872
          Length = 872

 Score =  484 bits (1245), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 315/725 (43%), Positives = 437/725 (60%), Gaps = 64/725 (8%)

Query: 1   MASTDKELEKQLREAGNKLANPPSSVDDLLRLLDQVENCLLKVEQSPPKSMQDALSPSLK 60
           M+ +DKELE Q+ EAG KL +PPSS+D+LL  LD++   L +VEQSPP SMQ+AL+P +K
Sbjct: 1   MSDSDKELENQIIEAGEKLIDPPSSLDELLSFLDKLFVSLAEVEQSPPDSMQNALTPLMK 60

Query: 61  ALIDDKLLRHSDIDVKVAVASCISEITRITAPDAPYDDDQMKAVFRLIVSSFENLYDDSS 120
            L+  KL +HSD+DVKVAVA+CISEITRITAPDAPYDDDQMK VF+LIVSSFE+L D SS
Sbjct: 61  GLVGGKLFKHSDVDVKVAVAACISEITRITAPDAPYDDDQMKEVFKLIVSSFEDLVDKSS 120

Query: 121 RSCAKRTSILDTVAKVRSCVVMLDLECDALISEMFQHFFKTIREHHSGSVFASMETIMNX 180
           RS AKR SIL+TVAKVRSCVVMLDLECDAL+ EMFQHF K IR+HHSG+VF+SME IM  
Sbjct: 121 RSYAKRISILETVAKVRSCVVMLDLECDALLIEMFQHFLKAIRDHHSGNVFSSMENIMTL 180

Query: 181 XXXXXXXXXXXXXCPIIDSVKKDNEEVLPIARRLGEKVLESSATRLKPYLVQAVDALGLS 240
                         PI+ SVKKD+ E+  ++RRL E+VL + A++LK YL +AV + G+ 
Sbjct: 181 VLEESEDIPSEMLSPILHSVKKDD-EISQVSRRLAEQVLSNCASKLKTYLTEAVKSSGVP 239

Query: 241 LDDYDDVLASICQDTSGNLDQNDVTENSKEAEQ----------------PQQDDPVGDRS 284
           LD Y +++ASIC+ T   L Q+ V  N KE  Q                P++ D   D S
Sbjct: 240 LDKYSNIVASICEGTFSALQQDQVVANEKEDSQGHIKRETEVEAAEISTPERTDAPKDES 299

Query: 285 PKSVMSNGIAQAGEDDALVDSKSLLKPDDTDYSVQSKGDNLSGNEKPNDL-NPENVESNE 343
            KS +SNG+AQ  ++D+ VD+ S+ K DDT              ++P  L NP N + N 
Sbjct: 300 GKSGVSNGVAQ--QNDSSVDTDSMKKQDDT-----------GAKDEPQQLDNPRNTDLNN 346

Query: 344 LKPVQP-----TKKRGRKPSTSKMAEPSEGSYLADEKDAEKKIDSKSHSKDVPSSHKGDC 398
               +P      +++  + S+ K A+ S+ S + +E +  + +D    SKDV +S   D 
Sbjct: 347 TTEEKPDVEHQIEEKENESSSVKQADLSKDSDIKEETEPAELLD----SKDVLTSPPVDS 402

Query: 399 A-EAAGPSKNNKEVNAKISSPKAGDGESDAVASPSPSERTHDDEXXXXXXXXXXXXXXXA 457
           +  AA  S+N K  + +I   K    E+  V+SPS +E    ++                
Sbjct: 403 SVTAATSSENEKNKSVQILPSKTSGDETANVSSPSMAEELP-EQSVPKKTANQKKKESST 461

Query: 458 KEV--AAEDVSQKVSEGTCDSEAKPTRPSAKKGLGRGSDVKITTVVDAVKKGSGSNDLDV 515
           +EV  +A   +++VSE    SE + T+ S KK     S  K    V   KK +    +  
Sbjct: 462 EEVKPSASIATEEVSEEPNTSEPQVTKKSGKK---VASSSKTKPTVPPSKKSTSETKV-- 516

Query: 516 KKQSAKKTEERNKGSGGS--SLRLSEDKKRSGRGKGNPEKALAKSSAKDEDEEMVSLQKS 573
               AK++E++  GS  +  S +  E+KK+ GRGK   E++L  SS  D ++  VS  K 
Sbjct: 517 ----AKQSEKKVVGSDNAQESTKPKEEKKKPGRGKAIDEESLHTSSG-DNEKPAVSSGKL 571

Query: 574 AMKSTKEH---LEETPKTNLKRKRTPGKENESDNKKYGDELVGTRVKVWWPEDKEFYEGI 630
           A KS KE    +EE+P +N KRKR+ G+      K  G+ LVG+R+KVWWP D+ +Y+G+
Sbjct: 572 ASKSKKEAKQTVEESPNSNTKRKRSLGQ-----GKASGESLVGSRIKVWWPMDQAYYKGV 626

Query: 631 VDSFDSSSKKHKVLYDDGDQETLNLKKEKWSIIQDDADSDEEEARGGTSHDDASADMPPT 690
           V+S+D++ KKH V+YDDGDQE L LK +KWS + +   S +EEA   T  ++ ++ +P T
Sbjct: 627 VESYDAAKKKHLVIYDDGDQEILYLKNQKWSPLDESELSQDEEAADQTGQEEDASTVPLT 686

Query: 691 KKGKT 695
           KK KT
Sbjct: 687 KKAKT 691


>AT4G31880.1 | Symbols:  | LOCATED IN: cytosol, chloroplast;
           EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 14
           growth stages; BEST Arabidopsis thaliana protein match
           is: Tudor/PWWP/MBT superfamily protein
           (TAIR:AT1G15940.1); Has 137162 Blast hits to 70781
           proteins in 2973 species: Archae - 289; Bacteria -
           24182; Metazoa - 56725; Fungi - 20130; Plants - 6559;
           Viruses - 758; Other Eukaryotes - 28519 (source: NCBI
           BLink). | chr4:15419435-15423939 REVERSE LENGTH=873
          Length = 873

 Score =  483 bits (1244), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 315/726 (43%), Positives = 437/726 (60%), Gaps = 65/726 (8%)

Query: 1   MASTDKELEKQLREAGNKLANPPSSVDDLLRLLDQVENCLLKVEQSPPKSMQDALSPSLK 60
           M+ +DKELE Q+ EAG KL +PPSS+D+LL  LD++   L +VEQSPP SMQ+AL+P +K
Sbjct: 1   MSDSDKELENQIIEAGEKLIDPPSSLDELLSFLDKLFVSLAEVEQSPPDSMQNALTPLMK 60

Query: 61  ALIDDKLLRHSDIDVKVAVASCISEITRITAPDAPYDDDQMKAVFRLIVSSFENLYDDSS 120
            L+  KL +HSD+DVKVAVA+CISEITRITAPDAPYDDDQMK VF+LIVSSFE+L D SS
Sbjct: 61  GLVGGKLFKHSDVDVKVAVAACISEITRITAPDAPYDDDQMKEVFKLIVSSFEDLVDKSS 120

Query: 121 RSCAKRTSILDTVAKVRSCVVMLDLECDALISEMFQHFFKTIREHHSGSVFASMETIMNX 180
           RS AKR SIL+TVAKVRSCVVMLDLECDAL+ EMFQHF K IR+HHSG+VF+SME IM  
Sbjct: 121 RSYAKRISILETVAKVRSCVVMLDLECDALLIEMFQHFLKAIRDHHSGNVFSSMENIMTL 180

Query: 181 XXXXXXXXXXXXXCPIIDSVKKDNEEVLPIARRLGEKVLESSATRLKPYLVQAVDALGLS 240
                         PI+ SVKKD+ E+  ++RRL E+VL + A++LK YL +AV + G+ 
Sbjct: 181 VLEESEDIPSEMLSPILHSVKKDD-EISQVSRRLAEQVLSNCASKLKTYLTEAVKSSGVP 239

Query: 241 LDDYDDVLASICQDTSGNLDQNDVTENSKEAEQ-----------------PQQDDPVGDR 283
           LD Y +++ASIC+ T   L Q+ V  N KE  Q                 P++ D   D 
Sbjct: 240 LDKYSNIVASICEGTFSALQQDQVVANEKEDSQGHIKRETEVEKAAEISTPERTDAPKDE 299

Query: 284 SPKSVMSNGIAQAGEDDALVDSKSLLKPDDTDYSVQSKGDNLSGNEKPNDL-NPENVESN 342
           S KS +SNG+AQ  ++D+ VD+ S+ K DDT              ++P  L NP N + N
Sbjct: 300 SGKSGVSNGVAQ--QNDSSVDTDSMKKQDDT-----------GAKDEPQQLDNPRNTDLN 346

Query: 343 ELKPVQP-----TKKRGRKPSTSKMAEPSEGSYLADEKDAEKKIDSKSHSKDVPSSHKGD 397
                +P      +++  + S+ K A+ S+ S + +E +  + +D    SKDV +S   D
Sbjct: 347 NTTEEKPDVEHQIEEKENESSSVKQADLSKDSDIKEETEPAELLD----SKDVLTSPPVD 402

Query: 398 CA-EAAGPSKNNKEVNAKISSPKAGDGESDAVASPSPSERTHDDEXXXXXXXXXXXXXXX 456
            +  AA  S+N K  + +I   K    E+  V+SPS +E    ++               
Sbjct: 403 SSVTAATSSENEKNKSVQILPSKTSGDETANVSSPSMAEELP-EQSVPKKTANQKKKESS 461

Query: 457 AKEV--AAEDVSQKVSEGTCDSEAKPTRPSAKKGLGRGSDVKITTVVDAVKKGSGSNDLD 514
            +EV  +A   +++VSE    SE + T+ S KK     S  K    V   KK +    + 
Sbjct: 462 TEEVKPSASIATEEVSEEPNTSEPQVTKKSGKK---VASSSKTKPTVPPSKKSTSETKV- 517

Query: 515 VKKQSAKKTEERNKGSGGS--SLRLSEDKKRSGRGKGNPEKALAKSSAKDEDEEMVSLQK 572
                AK++E++  GS  +  S +  E+KK+ GRGK   E++L  SS  D ++  VS  K
Sbjct: 518 -----AKQSEKKVVGSDNAQESTKPKEEKKKPGRGKAIDEESLHTSSG-DNEKPAVSSGK 571

Query: 573 SAMKSTKEH---LEETPKTNLKRKRTPGKENESDNKKYGDELVGTRVKVWWPEDKEFYEG 629
            A KS KE    +EE+P +N KRKR+ G+      K  G+ LVG+R+KVWWP D+ +Y+G
Sbjct: 572 LASKSKKEAKQTVEESPNSNTKRKRSLGQ-----GKASGESLVGSRIKVWWPMDQAYYKG 626

Query: 630 IVDSFDSSSKKHKVLYDDGDQETLNLKKEKWSIIQDDADSDEEEARGGTSHDDASADMPP 689
           +V+S+D++ KKH V+YDDGDQE L LK +KWS + +   S +EEA   T  ++ ++ +P 
Sbjct: 627 VVESYDAAKKKHLVIYDDGDQEILYLKNQKWSPLDESELSQDEEAADQTGQEEDASTVPL 686

Query: 690 TKKGKT 695
           TKK KT
Sbjct: 687 TKKAKT 692


>AT1G15940.1 | Symbols:  | Tudor/PWWP/MBT superfamily protein |
           chr1:5473672-5478050 FORWARD LENGTH=990
          Length = 990

 Score =  226 bits (576), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 121/266 (45%), Positives = 165/266 (62%), Gaps = 1/266 (0%)

Query: 9   EKQLREAGNKLANPPSSVDDLLRLLDQVENCLLKVEQSPPKSMQDALSPSLKALIDDKLL 68
           E+ L +A   L  P  S D  L LL+ +E+ L  VEQ    S+Q AL P ++AL+   LL
Sbjct: 13  EQALTDAAENLLKPHFSTDATLSLLEVMESLLATVEQDLSSSVQKALHPPMRALVSADLL 72

Query: 69  RHSDIDVKVAVASCISEITRITAPDAPYDDDQMKAVFRLIVSSFENLYDDSSRSCAKRTS 128
           R+ D DV+V+V SC++EI RITAP+APY+D+QMK +F++ + +FE L D SSRS  K   
Sbjct: 73  RNPDSDVRVSVVSCLTEIMRITAPEAPYNDEQMKDIFQVTIEAFEKLADASSRSYRKAEV 132

Query: 129 ILDTVAKVRSCVVMLDLECDALISEMFQHFFKTIREHHSGSVFASMETIMNXXXXXXXXX 188
           IL+TVAKVRS +VMLDLECD L+ EMFQ F K IR  H   V  SMETIM          
Sbjct: 133 ILETVAKVRSSLVMLDLECDDLVLEMFQRFLKIIRPDHPQLVLVSMETIMITVIDESEEV 192

Query: 189 XXXXXCPIIDSVKKDNEEVLPIARRLGEKVLESSATRLKPYLVQAVDALGLSLDDYDDVL 248
                  ++ +VKKD+++V P A  L EKVL S   +L+P +++A+ + G SLD Y  V+
Sbjct: 193 PMDLLEILLTTVKKDSQDVSPAALTLVEKVLSSCTCKLQPCIMEALKSSGTSLDMYSPVV 252

Query: 249 ASICQ-DTSGNLDQNDVTENSKEAEQ 273
           +SICQ + +     NDV     EA++
Sbjct: 253 SSICQSEFATTQAHNDVKPKDNEADE 278


>AT1G80810.2 | Symbols:  | Tudor/PWWP/MBT superfamily protein |
           chr1:30365575-30368898 FORWARD LENGTH=774
          Length = 774

 Score =  209 bits (533), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 102/203 (50%), Positives = 133/203 (65%)

Query: 51  MQDALSPSLKALIDDKLLRHSDIDVKVAVASCISEITRITAPDAPYDDDQMKAVFRLIVS 110
           MQ AL PS  AL+   LL H D DV+V+V SC++EI RITAP+ PY DD MK +FRL + 
Sbjct: 1   MQSALIPSRNALVSVDLLSHPDSDVRVSVVSCLTEIVRITAPETPYSDDLMKEIFRLTIE 60

Query: 111 SFENLYDDSSRSCAKRTSILDTVAKVRSCVVMLDLECDALISEMFQHFFKTIREHHSGSV 170
           +FE L D SSRS  K   +LD VAKV+SC+VMLDLEC  LI +MF++FFK IR  H   V
Sbjct: 61  AFEKLADASSRSYKKAEFVLDNVAKVKSCLVMLDLECYDLILQMFRNFFKFIRSDHPQLV 120

Query: 171 FASMETIMNXXXXXXXXXXXXXXCPIIDSVKKDNEEVLPIARRLGEKVLESSATRLKPYL 230
           F+SME IM                 ++ +VKK+N+ V P++  L EKVL   A +LKPY+
Sbjct: 121 FSSMELIMIAIIDETEQVSTDLLDSLLATVKKENQNVSPMSWSLAEKVLSRCARKLKPYI 180

Query: 231 VQAVDALGLSLDDYDDVLASICQ 253
           ++A+ + G SLD Y  V++SICQ
Sbjct: 181 IEALKSRGTSLDMYSPVVSSICQ 203



 Score =  108 bits (270), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 66/171 (38%), Positives = 101/171 (59%), Gaps = 10/171 (5%)

Query: 505 KKGSGSNDLDVKKQSAKK---TEERNKGSGGSSLRLSEDKKRSGRGKGNPEKALAKSSAK 561
           K+    N L+  +++AKK    E +   S G  L      KR    +   +  + +SS +
Sbjct: 403 KQTRSKNGLEKSQKTAKKKPVVEAKIVNSSGKRLSARSVAKRRNLERAPLDTLVPQSSKR 462

Query: 562 DEDEEMVSLQKSAMKSTKEHLEETPKTNLKRKRTPGKENESDNKKYGDELVGTRVKVWWP 621
              ++MVS Q +A +   E  EETPK++  R+RT  KE  SD   +G++LVG RV +WWP
Sbjct: 463 ---KKMVS-QVAARQLANESEEETPKSHPTRRRTVRKEV-SDG--FGEDLVGKRVNIWWP 515

Query: 622 EDKEFYEGIVDSFDSSSKKHKVLYDDGDQETLNLKKEKWSIIQDDADSDEE 672
            DK FYEG++DS+ +  K H+V+Y DGD E LNL +E+W +++DD  +DE+
Sbjct: 516 LDKTFYEGVIDSYCTRKKMHRVIYSDGDSEELNLTEERWELLEDDTSADEQ 566


>AT1G80810.1 | Symbols:  | Tudor/PWWP/MBT superfamily protein |
           chr1:30365575-30368898 FORWARD LENGTH=773
          Length = 773

 Score =  209 bits (533), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 102/203 (50%), Positives = 133/203 (65%)

Query: 51  MQDALSPSLKALIDDKLLRHSDIDVKVAVASCISEITRITAPDAPYDDDQMKAVFRLIVS 110
           MQ AL PS  AL+   LL H D DV+V+V SC++EI RITAP+ PY DD MK +FRL + 
Sbjct: 1   MQSALIPSRNALVSVDLLSHPDSDVRVSVVSCLTEIVRITAPETPYSDDLMKEIFRLTIE 60

Query: 111 SFENLYDDSSRSCAKRTSILDTVAKVRSCVVMLDLECDALISEMFQHFFKTIREHHSGSV 170
           +FE L D SSRS  K   +LD VAKV+SC+VMLDLEC  LI +MF++FFK IR  H   V
Sbjct: 61  AFEKLADASSRSYKKAEFVLDNVAKVKSCLVMLDLECYDLILQMFRNFFKFIRSDHPQLV 120

Query: 171 FASMETIMNXXXXXXXXXXXXXXCPIIDSVKKDNEEVLPIARRLGEKVLESSATRLKPYL 230
           F+SME IM                 ++ +VKK+N+ V P++  L EKVL   A +LKPY+
Sbjct: 121 FSSMELIMIAIIDETEQVSTDLLDSLLATVKKENQNVSPMSWSLAEKVLSRCARKLKPYI 180

Query: 231 VQAVDALGLSLDDYDDVLASICQ 253
           ++A+ + G SLD Y  V++SICQ
Sbjct: 181 IEALKSRGTSLDMYSPVVSSICQ 203



 Score =  108 bits (269), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 66/170 (38%), Positives = 100/170 (58%), Gaps = 10/170 (5%)

Query: 505 KKGSGSNDLDVKKQSAKK---TEERNKGSGGSSLRLSEDKKRSGRGKGNPEKALAKSSAK 561
           K+    N L+  +++AKK    E +   S G  L      KR    +   +  + +SS +
Sbjct: 403 KQTRSKNGLEKSQKTAKKKPVVEAKIVNSSGKRLSARSVAKRRNLERAPLDTLVPQSSKR 462

Query: 562 DEDEEMVSLQKSAMKSTKEHLEETPKTNLKRKRTPGKENESDNKKYGDELVGTRVKVWWP 621
              ++MVS Q +A +   E  EETPK++  R+RT  KE  SD   +G++LVG RV +WWP
Sbjct: 463 ---KKMVS-QVAARQLANESEEETPKSHPTRRRTVRKEV-SDG--FGEDLVGKRVNIWWP 515

Query: 622 EDKEFYEGIVDSFDSSSKKHKVLYDDGDQETLNLKKEKWSIIQDDADSDE 671
            DK FYEG++DS+ +  K H+V+Y DGD E LNL +E+W +++DD  +DE
Sbjct: 516 LDKTFYEGVIDSYCTRKKMHRVIYSDGDSEELNLTEERWELLEDDTSADE 565


>AT5G47690.2 | Symbols:  | binding | chr5:19317899-19327014 FORWARD
           LENGTH=1606
          Length = 1606

 Score =  186 bits (472), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 106/250 (42%), Positives = 150/250 (60%), Gaps = 7/250 (2%)

Query: 9   EKQLREAGNKLANPPSSVDDLLRLLDQVENCLLKVEQSPPKSMQDALSPSLKALIDDKLL 68
           E+QL+E G+KL   P S D LL+LL +   CL ++EQSPP ++  ++ P L A+I  ++L
Sbjct: 6   EEQLKELGSKLDLAPVSKDSLLKLLKEAAVCLSELEQSPPPAVLKSIQPFLDAVIKPEIL 65

Query: 69  RHSDIDVKVAVASCISEITRITAPDAPYDDDQMKAVFRLIVSSFENLYDDSSRSCAKRTS 128
            H D DVK+ VASC+SEITRITAP+APY D+ MK +F+LIVS+F  L D S  S  +R  
Sbjct: 66  NHQDKDVKLLVASCVSEITRITAPEAPYSDNIMKDIFQLIVSAFAGLNDVSGPSFGRRVL 125

Query: 129 ILDTVAKVRSCVVMLDLECDALISEMFQHFFKTIREHHSGSVFASMETIMNXXXXXXXXX 188
           IL+TVAK RSCVVMLDLECD L+ E+F  F    R+ H   VF+SM+ IM          
Sbjct: 126 ILETVAKYRSCVVMLDLECDDLVKEVFTTFLDVARDDHPEIVFSSMQNIMIVLLEESEDV 185

Query: 189 XXXXXCPIIDSVKKDNEEVLPIARRLGEKVLESSATR----LKPYLVQAVDA---LGLSL 241
                  ++  + ++  +V   ARRL  KV+E  A +    +K +L+ ++        S 
Sbjct: 186 QEHLLLILLSKLGRNRSDVRDAARRLAMKVIEHCAPKVESDIKQFLISSMSGDSRFSSSQ 245

Query: 242 DDYDDVLASI 251
            DY +V+  +
Sbjct: 246 IDYHEVIYDL 255



 Score = 95.9 bits (237), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 41/75 (54%), Positives = 54/75 (72%), Gaps = 2/75 (2%)

Query: 591  KRKRTPG--KENESDNKKYGDELVGTRVKVWWPEDKEFYEGIVDSFDSSSKKHKVLYDDG 648
            KRK   G  K +  +NK   DEL+G R++VWWP DK FYEG V S+DS+ ++H +LY+DG
Sbjct: 1346 KRKNVSGLAKCSTKENKLVNDELIGCRIEVWWPMDKRFYEGTVKSYDSTKQRHVILYEDG 1405

Query: 649  DQETLNLKKEKWSII 663
            D E LNLKKE+W +I
Sbjct: 1406 DVEVLNLKKEQWELI 1420


>AT5G47690.1 | Symbols:  | binding | chr5:19317899-19327014 FORWARD
           LENGTH=1605
          Length = 1605

 Score =  186 bits (472), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 106/250 (42%), Positives = 150/250 (60%), Gaps = 7/250 (2%)

Query: 9   EKQLREAGNKLANPPSSVDDLLRLLDQVENCLLKVEQSPPKSMQDALSPSLKALIDDKLL 68
           E+QL+E G+KL   P S D LL+LL +   CL ++EQSPP ++  ++ P L A+I  ++L
Sbjct: 6   EEQLKELGSKLDLAPVSKDSLLKLLKEAAVCLSELEQSPPPAVLKSIQPFLDAVIKPEIL 65

Query: 69  RHSDIDVKVAVASCISEITRITAPDAPYDDDQMKAVFRLIVSSFENLYDDSSRSCAKRTS 128
            H D DVK+ VASC+SEITRITAP+APY D+ MK +F+LIVS+F  L D S  S  +R  
Sbjct: 66  NHQDKDVKLLVASCVSEITRITAPEAPYSDNIMKDIFQLIVSAFAGLNDVSGPSFGRRVL 125

Query: 129 ILDTVAKVRSCVVMLDLECDALISEMFQHFFKTIREHHSGSVFASMETIMNXXXXXXXXX 188
           IL+TVAK RSCVVMLDLECD L+ E+F  F    R+ H   VF+SM+ IM          
Sbjct: 126 ILETVAKYRSCVVMLDLECDDLVKEVFTTFLDVARDDHPEIVFSSMQNIMIVLLEESEDV 185

Query: 189 XXXXXCPIIDSVKKDNEEVLPIARRLGEKVLESSATR----LKPYLVQAVDA---LGLSL 241
                  ++  + ++  +V   ARRL  KV+E  A +    +K +L+ ++        S 
Sbjct: 186 QEHLLLILLSKLGRNRSDVRDAARRLAMKVIEHCAPKVESDIKQFLISSMSGDSRFSSSQ 245

Query: 242 DDYDDVLASI 251
            DY +V+  +
Sbjct: 246 IDYHEVIYDL 255



 Score = 95.9 bits (237), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 41/75 (54%), Positives = 54/75 (72%), Gaps = 2/75 (2%)

Query: 591  KRKRTPG--KENESDNKKYGDELVGTRVKVWWPEDKEFYEGIVDSFDSSSKKHKVLYDDG 648
            KRK   G  K +  +NK   DEL+G R++VWWP DK FYEG V S+DS+ ++H +LY+DG
Sbjct: 1345 KRKNVSGLAKCSTKENKLVNDELIGCRIEVWWPMDKRFYEGTVKSYDSTKQRHVILYEDG 1404

Query: 649  DQETLNLKKEKWSII 663
            D E LNLKKE+W +I
Sbjct: 1405 DVEVLNLKKEQWELI 1419


>AT5G47690.3 | Symbols:  | binding | chr5:19317899-19327014 FORWARD
           LENGTH=1607
          Length = 1607

 Score =  186 bits (472), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 106/250 (42%), Positives = 150/250 (60%), Gaps = 7/250 (2%)

Query: 9   EKQLREAGNKLANPPSSVDDLLRLLDQVENCLLKVEQSPPKSMQDALSPSLKALIDDKLL 68
           E+QL+E G+KL   P S D LL+LL +   CL ++EQSPP ++  ++ P L A+I  ++L
Sbjct: 6   EEQLKELGSKLDLAPVSKDSLLKLLKEAAVCLSELEQSPPPAVLKSIQPFLDAVIKPEIL 65

Query: 69  RHSDIDVKVAVASCISEITRITAPDAPYDDDQMKAVFRLIVSSFENLYDDSSRSCAKRTS 128
            H D DVK+ VASC+SEITRITAP+APY D+ MK +F+LIVS+F  L D S  S  +R  
Sbjct: 66  NHQDKDVKLLVASCVSEITRITAPEAPYSDNIMKDIFQLIVSAFAGLNDVSGPSFGRRVL 125

Query: 129 ILDTVAKVRSCVVMLDLECDALISEMFQHFFKTIREHHSGSVFASMETIMNXXXXXXXXX 188
           IL+TVAK RSCVVMLDLECD L+ E+F  F    R+ H   VF+SM+ IM          
Sbjct: 126 ILETVAKYRSCVVMLDLECDDLVKEVFTTFLDVARDDHPEIVFSSMQNIMIVLLEESEDV 185

Query: 189 XXXXXCPIIDSVKKDNEEVLPIARRLGEKVLESSATR----LKPYLVQAVDA---LGLSL 241
                  ++  + ++  +V   ARRL  KV+E  A +    +K +L+ ++        S 
Sbjct: 186 QEHLLLILLSKLGRNRSDVRDAARRLAMKVIEHCAPKVESDIKQFLISSMSGDSRFSSSQ 245

Query: 242 DDYDDVLASI 251
            DY +V+  +
Sbjct: 246 IDYHEVIYDL 255



 Score = 95.9 bits (237), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 41/75 (54%), Positives = 54/75 (72%), Gaps = 2/75 (2%)

Query: 591  KRKRTPG--KENESDNKKYGDELVGTRVKVWWPEDKEFYEGIVDSFDSSSKKHKVLYDDG 648
            KRK   G  K +  +NK   DEL+G R++VWWP DK FYEG V S+DS+ ++H +LY+DG
Sbjct: 1346 KRKNVSGLAKCSTKENKLVNDELIGCRIEVWWPMDKRFYEGTVKSYDSTKQRHVILYEDG 1405

Query: 649  DQETLNLKKEKWSII 663
            D E LNLKKE+W +I
Sbjct: 1406 DVEVLNLKKEQWELI 1420


>AT1G77600.1 | Symbols:  | ARM repeat superfamily protein |
           chr1:29152890-29162156 REVERSE LENGTH=1367
          Length = 1367

 Score =  114 bits (285), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 60/159 (37%), Positives = 95/159 (59%), Gaps = 9/159 (5%)

Query: 27  DDLLRLLDQVENCLLKVEQSPP-------KSMQDALSPSLKALIDDKLLRHSDIDVKVAV 79
           D L++LL +V N L K++Q          K ++  L P  K++I   LL++ D DV + V
Sbjct: 25  DSLVKLLREVANTLSKIDQPSATNKEKGLKLIEAELRPLKKSIIKHALLKNRDNDVSLLV 84

Query: 80  ASCISEITRITAPDAPYDDDQMKAVFRLIVSSFENLYDDSSRSCAKRTSILDTVAKVRSC 139
             C+SE+ RI AP  P++D+ ++ +F L ++ F  L D  S   +KR  IL+TV++++ C
Sbjct: 85  TVCVSELFRILAPHLPFEDEYLRDIFTLFIAEFSELSDTVSPYFSKRAKILETVSRLKFC 144

Query: 140 VVMLDLECDALISEMFQHFFKTIREHHSGSVF--ASMET 176
           ++MLD +C  L+ EMF  FF  +REHH  S+    SM+T
Sbjct: 145 LLMLDEDCQDLVHEMFNMFFSLVREHHQQSLINQKSMKT 183



 Score = 60.1 bits (144), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 41/70 (58%)

Query: 603  DNKKYGDELVGTRVKVWWPEDKEFYEGIVDSFDSSSKKHKVLYDDGDQETLNLKKEKWSI 662
            D   +G+ ++G R+K+  P D  FY G V+ F+S S  HK+++D+GD E + L  E W  
Sbjct: 1213 DISNHGEAIIGQRIKLLSPTDGCFYPGTVEKFNSKSNSHKIIFDNGDVELVCLDSESWET 1272

Query: 663  IQDDADSDEE 672
            +  ++   +E
Sbjct: 1273 LSHESMGQQE 1282


>AT1G77600.2 | Symbols:  | ARM repeat superfamily protein |
           chr1:29152890-29162156 REVERSE LENGTH=1410
          Length = 1410

 Score =  114 bits (284), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 57/151 (37%), Positives = 91/151 (60%), Gaps = 7/151 (4%)

Query: 27  DDLLRLLDQVENCLLKVEQSPP-------KSMQDALSPSLKALIDDKLLRHSDIDVKVAV 79
           D L++LL +V N L K++Q          K ++  L P  K++I   LL++ D DV + V
Sbjct: 25  DSLVKLLREVANTLSKIDQPSATNKEKGLKLIEAELRPLKKSIIKHALLKNRDNDVSLLV 84

Query: 80  ASCISEITRITAPDAPYDDDQMKAVFRLIVSSFENLYDDSSRSCAKRTSILDTVAKVRSC 139
             C+SE+ RI AP  P++D+ ++ +F L ++ F  L D  S   +KR  IL+TV++++ C
Sbjct: 85  TVCVSELFRILAPHLPFEDEYLRDIFTLFIAEFSELSDTVSPYFSKRAKILETVSRLKFC 144

Query: 140 VVMLDLECDALISEMFQHFFKTIREHHSGSV 170
           ++MLD +C  L+ EMF  FF  +REHH  S+
Sbjct: 145 LLMLDEDCQDLVHEMFNMFFSLVREHHQQSL 175



 Score = 60.1 bits (144), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 41/70 (58%)

Query: 603  DNKKYGDELVGTRVKVWWPEDKEFYEGIVDSFDSSSKKHKVLYDDGDQETLNLKKEKWSI 662
            D   +G+ ++G R+K+  P D  FY G V+ F+S S  HK+++D+GD E + L  E W  
Sbjct: 1249 DISNHGEAIIGQRIKLLSPTDGCFYPGTVEKFNSKSNSHKIIFDNGDVELVCLDSESWET 1308

Query: 663  IQDDADSDEE 672
            +  ++   +E
Sbjct: 1309 LSHESMGQQE 1318


>AT5G10950.1 | Symbols:  | Tudor/PWWP/MBT superfamily protein |
           chr5:3459557-3461632 REVERSE LENGTH=395
          Length = 395

 Score =  114 bits (284), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 56/110 (50%), Positives = 79/110 (71%), Gaps = 10/110 (9%)

Query: 594 RTPGKENESDNKKYGDELVGTRVKVWWPEDKEFYEGIVDSFDSSSKKHKVLYDDGDQETL 653
           R+ GK+  SD +KYG+ LVG+R++VWWP D +FY+G+VDS+ SS KKH+V Y+DGD+ETL
Sbjct: 20  RSSGKDKVSDARKYGEALVGSRIRVWWPMDSKFYKGVVDSYVSSKKKHRVFYEDGDKETL 79

Query: 654 NLKKEKWSII-QDDADSD------EEEARGGTSHDDASADMPPTKKGKTS 696
           +LKKE+W +I +DDA+S+      +EE+ G +S        PP  KGK S
Sbjct: 80  DLKKERWELIEEDDAESESDEISLQEESAGESSE---GTPEPPKGKGKAS 126


>AT1G77600.3 | Symbols:  | ARM repeat superfamily protein |
           chr1:29152890-29162156 REVERSE LENGTH=1424
          Length = 1424

 Score =  114 bits (284), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 57/151 (37%), Positives = 91/151 (60%), Gaps = 7/151 (4%)

Query: 27  DDLLRLLDQVENCLLKVEQSPP-------KSMQDALSPSLKALIDDKLLRHSDIDVKVAV 79
           D L++LL +V N L K++Q          K ++  L P  K++I   LL++ D DV + V
Sbjct: 25  DSLVKLLREVANTLSKIDQPSATNKEKGLKLIEAELRPLKKSIIKHALLKNRDNDVSLLV 84

Query: 80  ASCISEITRITAPDAPYDDDQMKAVFRLIVSSFENLYDDSSRSCAKRTSILDTVAKVRSC 139
             C+SE+ RI AP  P++D+ ++ +F L ++ F  L D  S   +KR  IL+TV++++ C
Sbjct: 85  TVCVSELFRILAPHLPFEDEYLRDIFTLFIAEFSELSDTVSPYFSKRAKILETVSRLKFC 144

Query: 140 VVMLDLECDALISEMFQHFFKTIREHHSGSV 170
           ++MLD +C  L+ EMF  FF  +REHH  S+
Sbjct: 145 LLMLDEDCQDLVHEMFNMFFSLVREHHQQSL 175



 Score = 60.1 bits (144), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 41/70 (58%)

Query: 603  DNKKYGDELVGTRVKVWWPEDKEFYEGIVDSFDSSSKKHKVLYDDGDQETLNLKKEKWSI 662
            D   +G+ ++G R+K+  P D  FY G V+ F+S S  HK+++D+GD E + L  E W  
Sbjct: 1263 DISNHGEAIIGQRIKLLSPTDGCFYPGTVEKFNSKSNSHKIIFDNGDVELVCLDSESWET 1322

Query: 663  IQDDADSDEE 672
            +  ++   +E
Sbjct: 1323 LSHESMGQQE 1332


>AT4G32970.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G32960.1); Has 552 Blast hits to 489 proteins
           in 85 species: Archae - 4; Bacteria - 14; Metazoa - 187;
           Fungi - 12; Plants - 225; Viruses - 0; Other Eukaryotes
           - 110 (source: NCBI BLink). | chr4:15910671-15914300
           REVERSE LENGTH=638
          Length = 638

 Score = 80.9 bits (198), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 39/78 (50%), Positives = 55/78 (70%), Gaps = 3/78 (3%)

Query: 568 VSLQKSAMKSTK---EHLEETPKTNLKRKRTPGKENESDNKKYGDELVGTRVKVWWPEDK 624
           VS  KSA  STK   + LE+ PKTN KR  + GKE  SD KKY +++VG+RVK+WWP D+
Sbjct: 239 VSCDKSASDSTKGAKQPLEKKPKTNTKRIHSLGKEKTSDFKKYDEKIVGSRVKIWWPLDR 298

Query: 625 EFYEGIVDSFDSSSKKHK 642
            +YE +V S+ S+ ++H+
Sbjct: 299 AYYEAVVISYCSAKERHR 316



 Score = 50.4 bits (119), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 45/79 (56%), Gaps = 1/79 (1%)

Query: 195 PIIDSVKKDNEEVLPIARRLGEKVLESSATRLKPYLVQAVDALGLSLDDYDDVLASICQD 254
           P   +VKKD+E V  I+RR  E+VL + A+++K YL +A     +SL +  DV+ SIC  
Sbjct: 355 PFPPNVKKDDE-VPQISRRQAEQVLSNCASQIKKYLTEAGKLSSVSLANPSDVVDSICGR 413

Query: 255 TSGNLDQNDVTENSKEAEQ 273
               L Q DV  N KE  Q
Sbjct: 414 AFDALKQEDVVANEKEDRQ 432


>AT4G02070.2 | Symbols: MSH6 | MUTS homolog 6 | chr4:906079-912930
           FORWARD LENGTH=1321
          Length = 1321

 Score = 69.7 bits (169), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 32/75 (42%), Positives = 49/75 (65%), Gaps = 2/75 (2%)

Query: 607 YGDELVGTRVKVWWPEDKEFYEGIVDSFDSSSKKHKVLYDDGDQETLNLKKEK--WSIIQ 664
           YGDE+VG +V+V+WP DK++Y+G V  +D    KH V Y+DG++E+L+L KEK  W + +
Sbjct: 120 YGDEVVGKQVRVYWPLDKKWYDGSVTFYDKGEGKHVVEYEDGEEESLDLGKEKTEWVVGE 179

Query: 665 DDADSDEEEARGGTS 679
              D      RG ++
Sbjct: 180 KSGDRFNRLKRGASA 194


>AT4G02070.1 | Symbols: MSH6, MSH6-1, ATMSH6 | MUTS homolog 6 |
           chr4:906079-912930 FORWARD LENGTH=1324
          Length = 1324

 Score = 69.7 bits (169), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 32/75 (42%), Positives = 49/75 (65%), Gaps = 2/75 (2%)

Query: 607 YGDELVGTRVKVWWPEDKEFYEGIVDSFDSSSKKHKVLYDDGDQETLNLKKEK--WSIIQ 664
           YGDE+VG +V+V+WP DK++Y+G V  +D    KH V Y+DG++E+L+L KEK  W + +
Sbjct: 120 YGDEVVGKQVRVYWPLDKKWYDGSVTFYDKGEGKHVVEYEDGEEESLDLGKEKTEWVVGE 179

Query: 665 DDADSDEEEARGGTS 679
              D      RG ++
Sbjct: 180 KSGDRFNRLKRGASA 194


>AT4G32620.2 | Symbols:  | Enhancer of polycomb-like transcription
           factor protein | chr4:15731968-15737222 FORWARD
           LENGTH=1540
          Length = 1540

 Score = 59.3 bits (142), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 23/53 (43%), Positives = 35/53 (66%)

Query: 611 LVGTRVKVWWPEDKEFYEGIVDSFDSSSKKHKVLYDDGDQETLNLKKEKWSII 663
           L+  ++KV+WP D+ +Y G VD FD     H V YDD D+E +NL+ E++ I+
Sbjct: 355 LLNKKIKVFWPLDERWYHGFVDGFDGDKNLHHVKYDDRDEEWINLQGERFKIL 407


>AT4G32620.1 | Symbols:  | Enhancer of polycomb-like transcription
           factor protein | chr4:15731968-15737222 FORWARD
           LENGTH=1539
          Length = 1539

 Score = 59.3 bits (142), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 23/53 (43%), Positives = 35/53 (66%)

Query: 611 LVGTRVKVWWPEDKEFYEGIVDSFDSSSKKHKVLYDDGDQETLNLKKEKWSII 663
           L+  ++KV+WP D+ +Y G VD FD     H V YDD D+E +NL+ E++ I+
Sbjct: 355 LLNKKIKVFWPLDERWYHGFVDGFDGDKNLHHVKYDDRDEEWINLQGERFKIL 407


>AT1G05830.2 | Symbols: ATX2 | trithorax-like protein 2 |
           chr1:1754452-1761225 FORWARD LENGTH=1083
          Length = 1083

 Score = 50.1 bits (118), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 23/70 (32%), Positives = 37/70 (52%)

Query: 610 ELVGTRVKVWWPEDKEFYEGIVDSFDSSSKKHKVLYDDGDQETLNLKKEKWSIIQDDADS 669
             +G + KV+WP D  +Y G +  ++  +K H V Y DGD E L L++EK   +    D 
Sbjct: 219 HFIGLQCKVFWPLDAVWYPGSIVGYNVETKHHIVKYGDGDGEELALRREKIKFLISRDDM 278

Query: 670 DEEEARGGTS 679
           +    + GT+
Sbjct: 279 ELLNMKFGTN 288


>AT1G05830.1 | Symbols: ATX2, SDG30 | trithorax-like protein 2 |
           chr1:1754452-1761225 FORWARD LENGTH=1083
          Length = 1083

 Score = 50.1 bits (118), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 23/70 (32%), Positives = 37/70 (52%)

Query: 610 ELVGTRVKVWWPEDKEFYEGIVDSFDSSSKKHKVLYDDGDQETLNLKKEKWSIIQDDADS 669
             +G + KV+WP D  +Y G +  ++  +K H V Y DGD E L L++EK   +    D 
Sbjct: 219 HFIGLQCKVFWPLDAVWYPGSIVGYNVETKHHIVKYGDGDGEELALRREKIKFLISRDDM 278

Query: 670 DEEEARGGTS 679
           +    + GT+
Sbjct: 279 ELLNMKFGTN 288