Miyakogusa Predicted Gene

Lj1g3v1967260.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v1967260.1 CUFF.28234.1
         (351 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G10710.1 | Symbols:  | INVOLVED IN: chromosome segregation, c...   413   e-116
AT5G10710.2 | Symbols:  | INVOLVED IN: chromosome segregation, c...   413   e-116
AT5G10710.3 | Symbols:  | INVOLVED IN: chromosome segregation, c...   370   e-103

>AT5G10710.1 | Symbols:  | INVOLVED IN: chromosome segregation, cell
           division; LOCATED IN: chromosome, centromeric region,
           nucleus; EXPRESSED IN: 23 plant structures; EXPRESSED
           DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s:
           Centromere protein Cenp-O (InterPro:IPR018464); Has
           30201 Blast hits to 17322 proteins in 780 species:
           Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi -
           3422; Plants - 5037; Viruses - 0; Other Eukaryotes -
           2996 (source: NCBI BLink). | chr5:3379619-3382648
           FORWARD LENGTH=312
          Length = 312

 Score =  413 bits (1062), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 199/314 (63%), Positives = 252/314 (80%), Gaps = 8/314 (2%)

Query: 39  MGEVNFMQDDDNSLEITRARLGSLLKRHGDLTERLSRDSDKMIFERLQKEFEAARASQTE 98
           MGE+    D D  L+ TRARL +LLKRH +L++RL+RDSDK + +RL KEFEAAR SQ++
Sbjct: 1   MGEMIVSMDQDIRLDTTRARLSNLLKRHRELSDRLTRDSDKTMLDRLNKEFEAARRSQSQ 60

Query: 99  EICLDGEQWNDGLLATIRERVHMEADRKAMPGDADI-LAC-PQEKITYKIGNKVICCLEG 156
           E+ LDGE+WNDGLLAT+RERVHMEADRKA  G+A   L C P+E+ITY++GNKVICCL+G
Sbjct: 61  EVFLDGEEWNDGLLATLRERVHMEADRKADNGNAGFSLVCHPEERITYRVGNKVICCLDG 120

Query: 157 ARIGIQYETSFAGDPCEFYHCVLESKSFLEKMTVLEHTVPFFLPIRETENDLLSSNAMKF 216
           +RIGIQ+ETS AG+  E YHCVLESKSFLEKM VLEHT+PFFLP+ + ENDLL SNA KF
Sbjct: 121 SRIGIQFETSTAGETYEVYHCVLESKSFLEKMIVLEHTIPFFLPLSDLENDLLFSNAKKF 180

Query: 217 IDHIGDLLQAYVDRREQVRLVKELYGNQISEMYHNLPHHMVEFVLDDFDCKVTVSLRYAD 276
           ID++GDLLQAYVDR+EQVRL+KEL+G+QISE+YH+LP+HM+EF +DD DCK  VSLRY D
Sbjct: 181 IDNVGDLLQAYVDRKEQVRLIKELFGHQISEIYHSLPYHMIEFSMDDCDCKFVVSLRYGD 240

Query: 277 LISVLPSRISVLAWPMLKKNTAATLNRKEDGNFGSHPAPVRLHYAEDALRTMSLPEAYAE 336
           L+  LP+++ +L WPM        L++K+  + GS   PVRL +AEDA R  SLPEAYAE
Sbjct: 241 LLCELPTKVRILVWPM------HHLSKKQCTSPGSPAIPVRLPFAEDAFRIQSLPEAYAE 294

Query: 337 IVLNLPQALQQMYH 350
           I+ N+P  ++Q++ 
Sbjct: 295 IMPNMPNEIRQLFQ 308


>AT5G10710.2 | Symbols:  | INVOLVED IN: chromosome segregation, cell
           division; LOCATED IN: chromosome, centromeric region,
           nucleus; EXPRESSED IN: 23 plant structures; EXPRESSED
           DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s:
           Centromere protein Cenp-O (InterPro:IPR018464); Has 43
           Blast hits to 43 proteins in 15 species: Archae - 0;
           Bacteria - 0; Metazoa - 11; Fungi - 0; Plants - 31;
           Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink).
           | chr5:3379619-3382648 FORWARD LENGTH=312
          Length = 312

 Score =  413 bits (1062), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 199/314 (63%), Positives = 252/314 (80%), Gaps = 8/314 (2%)

Query: 39  MGEVNFMQDDDNSLEITRARLGSLLKRHGDLTERLSRDSDKMIFERLQKEFEAARASQTE 98
           MGE+    D D  L+ TRARL +LLKRH +L++RL+RDSDK + +RL KEFEAAR SQ++
Sbjct: 1   MGEMIVSMDQDIRLDTTRARLSNLLKRHRELSDRLTRDSDKTMLDRLNKEFEAARRSQSQ 60

Query: 99  EICLDGEQWNDGLLATIRERVHMEADRKAMPGDADI-LAC-PQEKITYKIGNKVICCLEG 156
           E+ LDGE+WNDGLLAT+RERVHMEADRKA  G+A   L C P+E+ITY++GNKVICCL+G
Sbjct: 61  EVFLDGEEWNDGLLATLRERVHMEADRKADNGNAGFSLVCHPEERITYRVGNKVICCLDG 120

Query: 157 ARIGIQYETSFAGDPCEFYHCVLESKSFLEKMTVLEHTVPFFLPIRETENDLLSSNAMKF 216
           +RIGIQ+ETS AG+  E YHCVLESKSFLEKM VLEHT+PFFLP+ + ENDLL SNA KF
Sbjct: 121 SRIGIQFETSTAGETYEVYHCVLESKSFLEKMIVLEHTIPFFLPLSDLENDLLFSNAKKF 180

Query: 217 IDHIGDLLQAYVDRREQVRLVKELYGNQISEMYHNLPHHMVEFVLDDFDCKVTVSLRYAD 276
           ID++GDLLQAYVDR+EQVRL+KEL+G+QISE+YH+LP+HM+EF +DD DCK  VSLRY D
Sbjct: 181 IDNVGDLLQAYVDRKEQVRLIKELFGHQISEIYHSLPYHMIEFSMDDCDCKFVVSLRYGD 240

Query: 277 LISVLPSRISVLAWPMLKKNTAATLNRKEDGNFGSHPAPVRLHYAEDALRTMSLPEAYAE 336
           L+  LP+++ +L WPM        L++K+  + GS   PVRL +AEDA R  SLPEAYAE
Sbjct: 241 LLCELPTKVRILVWPM------HHLSKKQCTSPGSPAIPVRLPFAEDAFRIQSLPEAYAE 294

Query: 337 IVLNLPQALQQMYH 350
           I+ N+P  ++Q++ 
Sbjct: 295 IMPNMPNEIRQLFQ 308


>AT5G10710.3 | Symbols:  | INVOLVED IN: chromosome segregation, cell
           division; LOCATED IN: chromosome, centromeric region,
           nucleus; EXPRESSED IN: 23 plant structures; EXPRESSED
           DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s:
           Centromere protein Cenp-O (InterPro:IPR018464). |
           chr5:3379619-3382648 FORWARD LENGTH=298
          Length = 298

 Score =  370 bits (950), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 184/314 (58%), Positives = 236/314 (75%), Gaps = 22/314 (7%)

Query: 39  MGEVNFMQDDDNSLEITRARLGSLLKRHGDLTERLSRDSDKMIFERLQKEFEAARASQTE 98
           MGE+    D D  L+ TRARL +LLKRH +L++RL+RDSDK + +RL KEFEAAR SQ++
Sbjct: 1   MGEMIVSMDQDIRLDTTRARLSNLLKRHRELSDRLTRDSDKTMLDRLNKEFEAARRSQSQ 60

Query: 99  EICLDGEQWNDGLLATIRERVHMEADRKAMPGDADI-LAC-PQEKITYKIGNKVICCLEG 156
           E+               +  VHMEADRKA  G+A   L C P+E+ITY++GNKVICCL+G
Sbjct: 61  EV--------------FQALVHMEADRKADNGNAGFSLVCHPEERITYRVGNKVICCLDG 106

Query: 157 ARIGIQYETSFAGDPCEFYHCVLESKSFLEKMTVLEHTVPFFLPIRETENDLLSSNAMKF 216
           +RIGIQ+ETS AG+  E YHCVLESKSFLEKM VLEHT+PFFLP+ + ENDLL SNA KF
Sbjct: 107 SRIGIQFETSTAGETYEVYHCVLESKSFLEKMIVLEHTIPFFLPLSDLENDLLFSNAKKF 166

Query: 217 IDHIGDLLQAYVDRREQVRLVKELYGNQISEMYHNLPHHMVEFVLDDFDCKVTVSLRYAD 276
           ID++GDLLQAYVDR+EQVRL+KEL+G+QISE+YH+LP+HM+EF +DD DCK  VSLRY D
Sbjct: 167 IDNVGDLLQAYVDRKEQVRLIKELFGHQISEIYHSLPYHMIEFSMDDCDCKFVVSLRYGD 226

Query: 277 LISVLPSRISVLAWPMLKKNTAATLNRKEDGNFGSHPAPVRLHYAEDALRTMSLPEAYAE 336
           L+  LP+++ +L WPM        L++K+  + GS   PVRL +AEDA R  SLPEAYAE
Sbjct: 227 LLCELPTKVRILVWPM------HHLSKKQCTSPGSPAIPVRLPFAEDAFRIQSLPEAYAE 280

Query: 337 IVLNLPQALQQMYH 350
           I+ N+P  ++Q++ 
Sbjct: 281 IMPNMPNEIRQLFQ 294
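

The percentages on the "Identities" lines above can be rechecked from the raw counts. The following is a minimal sketch, not part of the BLAST output: the names `STATS_RE`, `parse_stats` and `truncated_percent` are hypothetical helpers introduced here for illustration. It assumes the legacy BLASTP 2.2.25 text layout shown in this report, and it assumes the reported percentages are truncated rather than rounded, which is consistent with the values in this report (for example, 184/314 is reported as 58%).

```python
import re

# Matches the statistics line of a legacy BLASTP alignment, for example:
#  Identities = 199/314 (63%), Positives = 252/314 (80%), Gaps = 8/314 (2%)
STATS_RE = re.compile(
    r"Identities = (\d+)/(\d+) \((\d+)%\), "
    r"Positives = (\d+)/\d+ \((\d+)%\), "
    r"Gaps = (\d+)/\d+ \((\d+)%\)"
)


def parse_stats(line):
    """Return the counts and reported percentages from one statistics line."""
    match = STATS_RE.search(line)
    if match is None:
        raise ValueError("not a BLAST statistics line: %r" % line)
    ident, length, ident_pct, pos, pos_pct, gaps, gap_pct = map(int, match.groups())
    return {
        "length": length,            # alignment length, gap columns included
        "identities": ident,
        "identities_pct": ident_pct,
        "positives": pos,
        "positives_pct": pos_pct,
        "gaps": gaps,
        "gaps_pct": gap_pct,
    }


def truncated_percent(count, length):
    """Integer percentage, truncated (assumption: matches this report)."""
    return 100 * count // length


if __name__ == "__main__":
    # Statistics lines copied from the AT5G10710.1 and AT5G10710.3 hits.
    lines = [
        " Identities = 199/314 (63%), Positives = 252/314 (80%), Gaps = 8/314 (2%)",
        " Identities = 184/314 (58%), Positives = 236/314 (75%), Gaps = 22/314 (7%)",
    ]
    for line in lines:
        stats = parse_stats(line)
        for key in ("identities", "positives", "gaps"):
            recomputed = truncated_percent(stats[key], stats["length"])
            print(key, stats[key], stats["length"], recomputed, stats[key + "_pct"])
```

The denominator is the alignment length (314 columns, gaps included), not the query length (351 letters) or the subject length (312 or 298), so these percentages describe only the aligned region, query positions 39 to 350.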