Miyakogusa Predicted Gene

chr1.CM0433.310.nd

BLASTP 2.2.18 [Mar-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= chr1.CM0433.310.nd + phase: 2 /pseudo/partial
         (475 letters)

Database: TAIR8_pep 
           32,825 sequences; 13,166,001 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G10950.1 | Symbols:  | cylicin-related | chr5:3459558-3461633...   100   3e-21
AT4G31880.1 | Symbols:  | binding | chr4:15419441-15423945 REVERSE     98   9e-21
AT1G15940.1 | Symbols:  | binding | chr1:5473666-5478044 FORWARD       89   6e-18
AT1G80810.2 | Symbols:  | binding | chr1:30370467-30373790 FORWARD     86   6e-17
AT1G80810.1 | Symbols:  | binding | chr1:30370467-30373790 FORWARD     86   7e-17
AT5G47690.3 | Symbols:  | binding | chr5:19335125-19344240 FORWARD     80   2e-15
AT5G47690.2 | Symbols:  | binding | chr5:19335125-19344240 FORWARD     80   2e-15
AT5G47690.1 | Symbols:  | binding | chr5:19335125-19344240 FORWARD     80   2e-15
AT4G32970.1 | Symbols:  | similar to unknown protein [Arabidopsi...    67   3e-11
AT4G02070.1 | Symbols: MSH6-1, ATMSH6, MSH6 | MSH6 (MUTS HOMOLOG...    58   1e-08
AT1G77600.1 | Symbols:  | binding | chr1:29157784-29167856 REVERSE     54   3e-07
AT4G32620.1 | Symbols:  | nucleic acid binding | chr4:15731974-1...    52   7e-07
AT1G05830.2 | Symbols:  | trithorax protein, putative / PHD fing...    44   2e-04
AT1G05830.1 | Symbols:  | trithorax protein, putative / PHD fing...    44   2e-04

>AT5G10950.1 | Symbols:  | cylicin-related | chr5:3459558-3461633
           REVERSE
          Length = 395

 Score = 99.8 bits (247), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 42/82 (51%), Positives = 66/82 (80%), Gaps = 4/82 (4%)

Query: 139 RTPGKEKESDTKKYGENLVGLRVKVWWPEDREIFYRFYTGVVNSFDSARKKHKVLYDDGD 198
           R+ GK+K SD +KYGE LVG R++VWWP D     +FY GVV+S+ S++KKH+V Y+DGD
Sbjct: 20  RSSGKDKVSDARKYGEALVGSRIRVWWPMD----SKFYKGVVDSYVSSKKKHRVFYEDGD 75

Query: 199 EETLNLREEKWGVIKKADSDAD 220
           +ETL+L++E+W +I++ D++++
Sbjct: 76  KETLDLKKERWELIEEDDAESE 97


>AT4G31880.1 | Symbols:  | binding | chr4:15419441-15423945 REVERSE
          Length = 873

 Score = 98.2 bits (243), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 62/149 (41%), Positives = 88/149 (59%), Gaps = 17/149 (11%)

Query: 88  GRGKANSEAAVAKSSAIDVDKEM-TVYSPRSGTKSTKS--ENTEEIPLTSAKRKRTPGKE 144
           GRGKA  E ++  SS    D E   V S +  +KS K   +  EE P ++ KRKR+ G+ 
Sbjct: 545 GRGKAIDEESLHTSSG---DNEKPAVSSGKLASKSKKEAKQTVEESPNSNTKRKRSLGQG 601

Query: 145 KESDTKKYGENLVGLRVKVWWPEDREIFYRFYTGVVNSFDSARKKHKVLYDDGDEETLNL 204
           K S     GE+LVG R+KVWWP D+     +Y GVV S+D+A+KKH V+YDDGD+E L L
Sbjct: 602 KAS-----GESLVGSRIKVWWPMDQ----AYYKGVVESYDAAKKKHLVIYDDGDQEILYL 652

Query: 205 REEKWGVIKKAD--SDADGGHQRRKENQA 231
           + +KW  + +++   D +   Q  +E  A
Sbjct: 653 KNQKWSPLDESELSQDEEAADQTGQEEDA 681


>AT1G15940.1 | Symbols:  | binding | chr1:5473666-5478044 FORWARD
          Length = 990

 Score = 89.0 bits (219), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 68/197 (34%), Positives = 96/197 (48%), Gaps = 35/197 (17%)

Query: 42  AKKNSAKKLGEHKSDI---NAKKHSAKKLDEQKGGSGSSSRKLENN-------------- 84
           AKK + KK    K D+   N KKH     D  K G  S   K +N               
Sbjct: 438 AKKQTVKKTNPAKEDLTKSNVKKHE----DGIKTGKSSKKEKADNGLAKTSAKKPLAETM 493

Query: 85  --KKSGRGKANSEAAVAKSSAIDVDKEM------TVYSPRSGTKSTKSENTEEIPLTSAK 136
             K SG+   +S+A    S    +D  +           R+ T +TK   +E+ P +  K
Sbjct: 494 MVKPSGKKLVHSDAKKKNSEGASMDTPIPQSSKSKKKDSRATTPATK--KSEQAPKSHPK 551

Query: 137 RKRTPGKEKESDTKKYGENLVGLRVKVWWPEDREIFYRFYTGVVNSFDSARKKHKVLYDD 196
            KR  G+E ES+T + GE LVG RV VWWP D+    +FY GV+ S+   +K H+V Y D
Sbjct: 552 MKRIAGEEVESNTNELGEELVGKRVNVWWPLDK----KFYEGVIKSYCRVKKMHQVTYSD 607

Query: 197 GDEETLNLREEKWGVIK 213
           GD E LNL++E++ +I+
Sbjct: 608 GDVEELNLKKERFKIIE 624


>AT1G80810.2 | Symbols:  | binding | chr1:30370467-30373790 FORWARD
          Length = 774

 Score = 85.5 bits (210), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 42/97 (43%), Positives = 63/97 (64%), Gaps = 8/97 (8%)

Query: 124 SENTEEIPLTSAKRKRTPGKEKESDTKKYGENLVGLRVKVWWPEDREIFYRFYTGVVNSF 183
           +E+ EE P +   R+RT  KE    +  +GE+LVG RV +WWP D+     FY GV++S+
Sbjct: 476 NESEEETPKSHPTRRRTVRKEV---SDGFGEDLVGKRVNIWWPLDK----TFYEGVIDSY 528

Query: 184 DSARKKHKVLYDDGDEETLNLREEKWGVIKKADSDAD 220
            + +K H+V+Y DGD E LNL EE+W +++  D+ AD
Sbjct: 529 CTRKKMHRVIYSDGDSEELNLTEERWELLED-DTSAD 564


>AT1G80810.1 | Symbols:  | binding | chr1:30370467-30373790 FORWARD
          Length = 773

 Score = 85.5 bits (210), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 42/97 (43%), Positives = 63/97 (64%), Gaps = 8/97 (8%)

Query: 124 SENTEEIPLTSAKRKRTPGKEKESDTKKYGENLVGLRVKVWWPEDREIFYRFYTGVVNSF 183
           +E+ EE P +   R+RT  KE    +  +GE+LVG RV +WWP D+     FY GV++S+
Sbjct: 476 NESEEETPKSHPTRRRTVRKEV---SDGFGEDLVGKRVNIWWPLDK----TFYEGVIDSY 528

Query: 184 DSARKKHKVLYDDGDEETLNLREEKWGVIKKADSDAD 220
            + +K H+V+Y DGD E LNL EE+W +++  D+ AD
Sbjct: 529 CTRKKMHRVIYSDGDSEELNLTEERWELLED-DTSAD 564


>AT5G47690.3 | Symbols:  | binding | chr5:19335125-19344240 FORWARD
          Length = 1607

 Score = 80.5 bits (197), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 36/79 (45%), Positives = 52/79 (65%), Gaps = 6/79 (7%)

Query: 136  KRKRTPGKEKES--DTKKYGENLVGLRVKVWWPEDREIFYRFYTGVVNSFDSARKKHKVL 193
            KRK   G  K S  + K   + L+G R++VWWP D+    RFY G V S+DS +++H +L
Sbjct: 1346 KRKNVSGLAKCSTKENKLVNDELIGCRIEVWWPMDK----RFYEGTVKSYDSTKQRHVIL 1401

Query: 194  YDDGDEETLNLREEKWGVI 212
            Y+DGD E LNL++E+W +I
Sbjct: 1402 YEDGDVEVLNLKKEQWELI 1420


>AT5G47690.2 | Symbols:  | binding | chr5:19335125-19344240 FORWARD
          Length = 1606

 Score = 80.5 bits (197), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 36/79 (45%), Positives = 52/79 (65%), Gaps = 6/79 (7%)

Query: 136  KRKRTPGKEKES--DTKKYGENLVGLRVKVWWPEDREIFYRFYTGVVNSFDSARKKHKVL 193
            KRK   G  K S  + K   + L+G R++VWWP D+    RFY G V S+DS +++H +L
Sbjct: 1346 KRKNVSGLAKCSTKENKLVNDELIGCRIEVWWPMDK----RFYEGTVKSYDSTKQRHVIL 1401

Query: 194  YDDGDEETLNLREEKWGVI 212
            Y+DGD E LNL++E+W +I
Sbjct: 1402 YEDGDVEVLNLKKEQWELI 1420


>AT5G47690.1 | Symbols:  | binding | chr5:19335125-19344240 FORWARD
          Length = 1605

 Score = 80.5 bits (197), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 36/79 (45%), Positives = 52/79 (65%), Gaps = 6/79 (7%)

Query: 136  KRKRTPGKEKES--DTKKYGENLVGLRVKVWWPEDREIFYRFYTGVVNSFDSARKKHKVL 193
            KRK   G  K S  + K   + L+G R++VWWP D+    RFY G V S+DS +++H +L
Sbjct: 1345 KRKNVSGLAKCSTKENKLVNDELIGCRIEVWWPMDK----RFYEGTVKSYDSTKQRHVIL 1400

Query: 194  YDDGDEETLNLREEKWGVI 212
            Y+DGD E LNL++E+W +I
Sbjct: 1401 YEDGDVEVLNLKKEQWELI 1419


>AT4G32970.1 | Symbols:  | similar to unknown protein [Arabidopsis
           thaliana] (TAIR:AT4G32960.1); similar to unnamed protein
           product [Vitis vinifera] (GB:CAO43777.1); contains
           domain Tudor/PWWP/MBT (SSF63748); contains domain
           ANDROGEN INDUCED INHIBITOR OF PROLIFERATION (AS3) /
           PDS5-RELATED (PTHR12663) | chr4:15910348-15914303
           REVERSE
          Length = 750

 Score = 66.6 bits (161), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 41/93 (44%), Positives = 54/93 (58%), Gaps = 10/93 (10%)

Query: 99  AKSSAIDVDKEMTVYSPRSGTKSTKSENTEEIPLTSAKRKRTPGKEKESDTKKYGENLVG 158
           AK  A+  DK     S    TK  K +  E+ P T+ KR  + GKEK SD KKY E +VG
Sbjct: 234 AKKPAVSCDK-----SASDSTKGAK-QPLEKKPKTNTKRIHSLGKEKTSDFKKYDEKIVG 287

Query: 159 LRVKVWWPEDREIFYRFYTGVVNSFDSARKKHK 191
            RVK+WWP DR     +Y  VV S+ SA+++H+
Sbjct: 288 SRVKIWWPLDRA----YYEAVVISYCSAKERHR 316


>AT4G02070.1 | Symbols: MSH6-1, ATMSH6, MSH6 | MSH6 (MUTS HOMOLOG
           6-1) | chr4:906079-912930 FORWARD
          Length = 1324

 Score = 57.8 bits (138), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 28/66 (42%), Positives = 44/66 (66%), Gaps = 6/66 (9%)

Query: 152 YGENLVGLRVKVWWPEDREIFYRFYTGVVNSFDSARKKHKVLYDDGDEETLNLREEK--W 209
           YG+ +VG +V+V+WP D+    ++Y G V  +D    KH V Y+DG+EE+L+L +EK  W
Sbjct: 120 YGDEVVGKQVRVYWPLDK----KWYDGSVTFYDKGEGKHVVEYEDGEEESLDLGKEKTEW 175

Query: 210 GVIKKA 215
            V +K+
Sbjct: 176 VVGEKS 181


>AT1G77600.1 | Symbols:  | binding | chr1:29157784-29167856 REVERSE
          Length = 1285

 Score = 53.5 bits (127), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 23/62 (37%), Positives = 34/62 (54%), Gaps = 4/62 (6%)

Query: 148  DTKKYGENLVGLRVKVWWPEDREIFYRFYTGVVNSFDSARKKHKVLYDDGDEETLNLREE 207
            D   +GE ++G R+K+  P D      FY G V  F+S    HK+++D+GD E + L  E
Sbjct: 1131 DISNHGEAIIGQRIKLLSPTDG----CFYPGTVEKFNSKSNSHKIIFDNGDVELVCLDSE 1186

Query: 208  KW 209
             W
Sbjct: 1187 SW 1188


>AT4G32620.1 | Symbols:  | nucleic acid binding |
           chr4:15731974-15737228 FORWARD
          Length = 1539

 Score = 52.4 bits (124), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 23/57 (40%), Positives = 36/57 (63%), Gaps = 4/57 (7%)

Query: 156 LVGLRVKVWWPEDREIFYRFYTGVVNSFDSARKKHKVLYDDGDEETLNLREEKWGVI 212
           L+  ++KV+WP D     R+Y G V+ FD  +  H V YDD DEE +NL+ E++ ++
Sbjct: 355 LLNKKIKVFWPLDE----RWYHGFVDGFDGDKNLHHVKYDDRDEEWINLQGERFKIL 407


>AT1G05830.2 | Symbols:  | trithorax protein, putative / PHD finger
           family protein / SET domain-containing protein |
           chr1:1754451-1761224 FORWARD
          Length = 1056

 Score = 44.3 bits (103), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 22/54 (40%), Positives = 30/54 (55%), Gaps = 4/54 (7%)

Query: 155 NLVGLRVKVWWPEDREIFYRFYTGVVNSFDSARKKHKVLYDDGDEETLNLREEK 208
           + +GL+ KV+WP D      +Y G +  ++   K H V Y DGD E L LR EK
Sbjct: 219 HFIGLQCKVFWPLDA----VWYPGSIVGYNVETKHHIVKYGDGDGEELALRREK 268


>AT1G05830.1 | Symbols:  | trithorax protein, putative / PHD finger
           family protein / SET domain-containing protein |
           chr1:1754451-1761224 FORWARD
          Length = 1056

 Score = 44.3 bits (103), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 22/54 (40%), Positives = 30/54 (55%), Gaps = 4/54 (7%)

Query: 155 NLVGLRVKVWWPEDREIFYRFYTGVVNSFDSARKKHKVLYDDGDEETLNLREEK 208
           + +GL+ KV+WP D      +Y G +  ++   K H V Y DGD E L LR EK
Sbjct: 219 HFIGLQCKVFWPLDA----VWYPGSIVGYNVETKHHIVKYGDGDGEELALRREK 268