Miyakogusa Predicted Gene

Lj1g3v3944820.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v3944820.1 Non Chatacterized Hit- tr|J3M5T7|J3M5T7_ORYBR
Uncharacterized protein OS=Oryza brachyantha
GN=OB05G1,40.4,0.000000000002,zf-C3Hc3H,Potential DNA-binding domain;
seg,NULL,CUFF.31536.1
         (232 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT2G31600.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   188   2e-48
AT3G53860.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   181   5e-46
AT1G05860.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   172   2e-43
AT2G31600.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   126   1e-29

>AT2G31600.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast;
           BEST Arabidopsis thaliana protein match is: unknown
           protein (TAIR:AT3G53860.1); Has 35333 Blast hits to
           34131 proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr2:13448541-13450083 REVERSE LENGTH=301
          Length = 301

 Score =  188 bits (478), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 94/221 (42%), Positives = 136/221 (61%), Gaps = 15/221 (6%)

Query: 14  IDGADQDAALANSTVLSRSEVIARRLRRVKQLARCYRGHYWALMEDLKSKYRDYYWTYGK 73
           I  + +D  LA S+ ++RSE++ RR   +KQLA+CYR +YWALMED+K+++RDY+W YG 
Sbjct: 63  ITMSQEDEILARSSHITRSELLKRRSHNLKQLAKCYRDNYWALMEDVKAQHRDYWWKYGI 122

Query: 74  SPFKDDDHG-------GANGTVVSE-EKVNGTADHASAAAGVDFVRCASG-------GCK 118
           S FKD+++        G  G +    + V G+ D+ +   GV   + A+        GCK
Sbjct: 123 SQFKDENNQSNKRRRLGQEGDIGDGGDAVEGSGDNVTNNDGVKSDQYANSNCGSCMYGCK 182

Query: 119 TKAMAMSRFCHAHIVFDTKQKLYVGCTTVAKNLPTGPSFCNKPVLTSVVPRSCHQHYQLG 178
            KAMA++++C  HI+ D+KQKLY GCT V K  P GP  C KP L S VP  C+ H+Q  
Sbjct: 183 AKAMALTKYCQLHILKDSKQKLYTGCTNVIKRAPAGPLLCGKPTLASTVPALCNIHFQKA 242

Query: 179 EKCLTRAIRRAGYNIPINRKPSLPLHVVIPEFVREIQNKRK 219
           +K + +A++ AG+N+    KP   LHV++  FV  IQ KRK
Sbjct: 243 QKHVAKALKDAGHNVSSTSKPPPKLHVIVAAFVHHIQAKRK 283


>AT3G53860.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast;
           BEST Arabidopsis thaliana protein match is: unknown
           protein (TAIR:AT2G31600.1); Has 70 Blast hits to 70
           proteins in 17 species: Archae - 0; Bacteria - 0;
           Metazoa - 0; Fungi - 0; Plants - 66; Viruses - 0; Other
           Eukaryotes - 4 (source: NCBI BLink). |
           chr3:19949850-19951270 REVERSE LENGTH=281
          Length = 281

 Score =  181 bits (458), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 90/205 (43%), Positives = 126/205 (61%), Gaps = 9/205 (4%)

Query: 19  QDAALANSTVLSRSEVIARRLRRVKQLARCYRGHYWALMEDLKSKYRDYYWTYGKSPFKD 78
           +D  LA+S+ L+R E++ RR   +KQLA+CY+ HYWALMEDLK+++RDY+  YG S FKD
Sbjct: 64  EDEILASSSHLTRPELLRRRADNLKQLAKCYKNHYWALMEDLKAQHRDYWCKYGVSQFKD 123

Query: 79  DDHGGANGTVVSEEKVNGTADHASAAAGVDFVRCASG----GCKTKAMAMSRFCHAHIVF 134
           + +       +  E   G+ D  +   G  +    SG    GCK KAMA++++C  HI+ 
Sbjct: 124 EQNQSNKRRRLDPE---GSGDKGND--GDQYANSNSGFCMYGCKAKAMALTKYCQLHILK 178

Query: 135 DTKQKLYVGCTTVAKNLPTGPSFCNKPVLTSVVPRSCHQHYQLGEKCLTRAIRRAGYNIP 194
           D+KQKLY GCT V    P GP  C KP L S VP  C+ HYQ  +K + +A++ AG+N+ 
Sbjct: 179 DSKQKLYTGCTNVINRSPAGPLLCGKPTLASTVPVLCNVHYQKAQKNVAKALKDAGHNVS 238

Query: 195 INRKPSLPLHVVIPEFVREIQNKRK 219
              KP   LHV++  FV  IQ +RK
Sbjct: 239 STSKPPPKLHVIVAAFVHHIQAQRK 263


>AT1G05860.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast;
           EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 11
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT2G31600.1); Has 101 Blast
           hits to 100 proteins in 32 species: Archae - 0; Bacteria
           - 0; Metazoa - 28; Fungi - 2; Plants - 66; Viruses - 0;
           Other Eukaryotes - 5 (source: NCBI BLink). |
           chr1:1769061-1770349 FORWARD LENGTH=280
          Length = 280

 Score =  172 bits (435), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 92/214 (42%), Positives = 125/214 (58%), Gaps = 10/214 (4%)

Query: 14  IDGADQDAALANSTVLSRSEVIARRLRRVKQLARCYRGHYWALMEDLKSKYRDYYWTYGK 73
           I  A +D  L NS  L+R E++ RR   +KQL+RCYR HYWALMEDLK+++R Y W YG 
Sbjct: 51  ISMAVEDQILGNSNHLTRPELLRRRSHNLKQLSRCYRDHYWALMEDLKAQHRYYSWNYGV 110

Query: 74  SPFKDDDH--------GGANGTVVSEEKVNGTADHASAAAGVDFVRCASGGCKTKAMAMS 125
           SPFKD+++         G  G  +     N   ++    AG + V C SG CK+KAMA++
Sbjct: 111 SPFKDENYHQNKRRKVEGQTGDEIEGSGDNDNNNNDGVKAG-NCVACGSG-CKSKAMALT 168

Query: 126 RFCHAHIVFDTKQKLYVGCTTVAKNLPTGPSFCNKPVLTSVVPRSCHQHYQLGEKCLTRA 185
            +C  HI+ D KQKLY  CT V K   +    C KP L S VP  C+ H+Q  +K + RA
Sbjct: 169 NYCQLHILMDKKQKLYTSCTYVNKRAQSKAITCPKPTLASTVPALCNVHFQKAQKDVARA 228

Query: 186 IRRAGYNIPINRKPSLPLHVVIPEFVREIQNKRK 219
           ++ AG+N+    +P   LH ++  FV  IQ KRK
Sbjct: 229 LKDAGHNVSSASRPPPKLHDIVAAFVHHIQAKRK 262


>AT2G31600.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast;
           BEST Arabidopsis thaliana protein match is: unknown
           protein (TAIR:AT3G53860.1); Has 35333 Blast hits to
           34131 proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr2:13449205-13450083 REVERSE LENGTH=216
          Length = 216

 Score =  126 bits (317), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 64/151 (42%), Positives = 95/151 (62%), Gaps = 15/151 (9%)

Query: 14  IDGADQDAALANSTVLSRSEVIARRLRRVKQLARCYRGHYWALMEDLKSKYRDYYWTYGK 73
           I  + +D  LA S+ ++RSE++ RR   +KQLA+CYR +YWALMED+K+++RDY+W YG 
Sbjct: 63  ITMSQEDEILARSSHITRSELLKRRSHNLKQLAKCYRDNYWALMEDVKAQHRDYWWKYGI 122

Query: 74  SPFKDDDHG-------GANGTVVS-EEKVNGTADHASAAAGVDFVRCASG-------GCK 118
           S FKD+++        G  G +    + V G+ D+ +   GV   + A+        GCK
Sbjct: 123 SQFKDENNQSNKRRRLGQEGDIGDGGDAVEGSGDNVTNNDGVKSDQYANSNCGSCMYGCK 182

Query: 119 TKAMAMSRFCHAHIVFDTKQKLYVGCTTVAK 149
            KAMA++++C  HI+ D+KQKLY GCT V K
Sbjct: 183 AKAMALTKYCQLHILKDSKQKLYTGCTNVIK 213