Miyakogusa Predicted Gene

Lj4g3v1683310.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj4g3v1683310.1 tr|A9THB8|A9THB8_PHYPA Predicted protein
OS=Physcomitrella patens subsp. patens
GN=PHYPADRAFT_170090,62.9,0.000000003,coiled-coil,NULL;
seg,NULL,CUFF.49597.1
         (374 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G07890.3 | Symbols:  | myosin heavy chain-related | chr5:2517...   201   9e-52
AT5G07890.1 | Symbols:  | myosin heavy chain-related | chr5:2517...   201   9e-52
AT5G61200.3 | Symbols:  | FUNCTIONS IN: molecular_function unkno...   197   1e-50
AT5G07890.2 | Symbols:  | myosin heavy chain-related | chr5:2517...   163   2e-40
AT5G61200.1 | Symbols:  | BEST Arabidopsis thaliana protein matc...   137   1e-32
AT5G61200.2 | Symbols:  | FUNCTIONS IN: molecular_function unkno...   132   4e-31

>AT5G07890.3 | Symbols:  | myosin heavy chain-related |
           chr5:2517718-2519493 REVERSE LENGTH=409
          Length = 409

 Score =  201 bits (510), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 148/383 (38%), Positives = 213/383 (55%), Gaps = 21/383 (5%)

Query: 8   DSDAPSSVEELVHLGTGFRQHRKEKDVLRSSQSQSFQPIRILERYEKSSPGALTDDKKHI 67
           D +    VE+L+ +GT  R+ RK+KD+LR SQ  S + +R LE + KS   +  +D   I
Sbjct: 21  DCENSFDVEDLLQIGTTRRELRKQKDLLRESQPHSIELVRRLELHTKSLSESRLEDTARI 80

Query: 68  QRLEKELLNCFQEIDFLQDRLNVR------------NLELKLEEMGDLQEEVFSLREELR 115
           Q +EKELLNC++EID+L+D+L  R            +LE KL E  +L+EEV SLR+EL 
Sbjct: 81  QMMEKELLNCYKEIDYLRDQLIFRSKEVNYLNEHLHDLEFKLAESRNLEEEVNSLRDELC 140

Query: 116 RSNSKQFSLTQELETKEIELEQSAFSIEKLEGSFSSIALESQFEVESMKLDMMALEQSLF 175
            S S+   L QELE+KEIEL+ S+ ++EKLE + SS+ LES  E+ESMKLD+ ALEQ+LF
Sbjct: 141 MSKSEHLLLLQELESKEIELQCSSLTLEKLEETISSLTLESLCEIESMKLDITALEQALF 200

Query: 176 EAKKTQDEALEESNRLSRLIDELQYALQDTQQTITSXXXXXXXXXXKLDAANMNTRLFSR 235
           +A K Q+E+++E ++L  +I+E Q+  Q  ++ +            K  A+  + + F +
Sbjct: 201 DAMKIQEESIQEKDQLKGIIEESQFQSQRAKENVKYIEKQNEDLREKFTASEKSIKDFFQ 260

Query: 236 KVGDWLENKNRSHLKRK---PSLSEQESKPEDIRTCGEVLGPLLGSLAMFLGPAADLKGK 292
              + LE+++   L        LS       ++R C + +   L      L    +L  K
Sbjct: 261 STKERLESEDEQPLNAMCFFAELSHVLPVSNEVRNCFDAIMKKLE-----LSQNVNLIDK 315

Query: 293 ME-MPHQIQXXXXXXXXXXXXXXXXXXXXXXXXXXXVQEMAELRYQFTGLLEEECKRRAC 351
           +E M  QI                             QEMAELRY+ T LL+EE  RR C
Sbjct: 316 VEGMGKQIHQHEDVVKQLKEELKQEKLKAKEEAEDLTQEMAELRYKMTCLLDEERNRRVC 375

Query: 352 IEHASLQRISELEAQLQREQKTP 374
           IE ASLQRISELEAQ++R+ K P
Sbjct: 376 IEQASLQRISELEAQIKRDVKKP 398


>AT5G07890.1 | Symbols:  | myosin heavy chain-related |
           chr5:2517718-2519493 REVERSE LENGTH=409
          Length = 409

 Score =  201 bits (510), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 148/383 (38%), Positives = 213/383 (55%), Gaps = 21/383 (5%)

Query: 8   DSDAPSSVEELVHLGTGFRQHRKEKDVLRSSQSQSFQPIRILERYEKSSPGALTDDKKHI 67
           D +    VE+L+ +GT  R+ RK+KD+LR SQ  S + +R LE + KS   +  +D   I
Sbjct: 21  DCENSFDVEDLLQIGTTRRELRKQKDLLRESQPHSIELVRRLELHTKSLSESRLEDTARI 80

Query: 68  QRLEKELLNCFQEIDFLQDRLNVR------------NLELKLEEMGDLQEEVFSLREELR 115
           Q +EKELLNC++EID+L+D+L  R            +LE KL E  +L+EEV SLR+EL 
Sbjct: 81  QMMEKELLNCYKEIDYLRDQLIFRSKEVNYLNEHLHDLEFKLAESRNLEEEVNSLRDELC 140

Query: 116 RSNSKQFSLTQELETKEIELEQSAFSIEKLEGSFSSIALESQFEVESMKLDMMALEQSLF 175
            S S+   L QELE+KEIEL+ S+ ++EKLE + SS+ LES  E+ESMKLD+ ALEQ+LF
Sbjct: 141 MSKSEHLLLLQELESKEIELQCSSLTLEKLEETISSLTLESLCEIESMKLDITALEQALF 200

Query: 176 EAKKTQDEALEESNRLSRLIDELQYALQDTQQTITSXXXXXXXXXXKLDAANMNTRLFSR 235
           +A K Q+E+++E ++L  +I+E Q+  Q  ++ +            K  A+  + + F +
Sbjct: 201 DAMKIQEESIQEKDQLKGIIEESQFQSQRAKENVKYIEKQNEDLREKFTASEKSIKDFFQ 260

Query: 236 KVGDWLENKNRSHLKRK---PSLSEQESKPEDIRTCGEVLGPLLGSLAMFLGPAADLKGK 292
              + LE+++   L        LS       ++R C + +   L      L    +L  K
Sbjct: 261 STKERLESEDEQPLNAMCFFAELSHVLPVSNEVRNCFDAIMKKLE-----LSQNVNLIDK 315

Query: 293 ME-MPHQIQXXXXXXXXXXXXXXXXXXXXXXXXXXXVQEMAELRYQFTGLLEEECKRRAC 351
           +E M  QI                             QEMAELRY+ T LL+EE  RR C
Sbjct: 316 VEGMGKQIHQHEDVVKQLKEELKQEKLKAKEEAEDLTQEMAELRYKMTCLLDEERNRRVC 375

Query: 352 IEHASLQRISELEAQLQREQKTP 374
           IE ASLQRISELEAQ++R+ K P
Sbjct: 376 IEQASLQRISELEAQIKRDVKKP 398


>AT5G61200.3 | Symbols:  | FUNCTIONS IN: molecular_function unknown;
           INVOLVED IN: biological_process unknown; LOCATED IN:
           chloroplast; BEST Arabidopsis thaliana protein match is:
           myosin heavy chain-related (TAIR:AT5G07890.3). |
           chr5:24620456-24622081 FORWARD LENGTH=389
          Length = 389

 Score =  197 bits (501), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 149/384 (38%), Positives = 215/384 (55%), Gaps = 41/384 (10%)

Query: 1   MSNSFRSDSDAPSSVEELVHLGTGFRQHRKEKDVLRSSQSQSFQPIRILERYEKSSPGAL 60
           + NSF +D        EL+ +G+   + R+EK++LR SQSQS + +R LE    S   + 
Sbjct: 21  VDNSFDAD--------ELLQIGSRCMELRREKEMLRESQSQSVELVRRLELNANSLSESR 72

Query: 61  TDDKKHIQRLEKELLNCFQEIDFLQDRLNVR------------NLELKLEEMGDLQEEVF 108
            +DK+ IQ LEKELLNC+QEID+L+D++N R            +LE+++ + G L+EEV 
Sbjct: 73  LEDKRRIQMLEKELLNCYQEIDYLRDQVNFRSQEMNDLSEHVLDLEVRVTKSGKLEEEVN 132

Query: 109 SLREELRRSNSKQFSLTQELETKEIELEQSAFSIEKLEGSFSSIALESQFEVESMKLDMM 168
            LREEL  S S+Q  L QELE+ E EL+ S FS+EKLE S SS+ LESQ E+ES+KLD++
Sbjct: 133 YLREELCSSKSEQLLLLQELESTETELQFSLFSVEKLEESVSSLTLESQCEIESIKLDIV 192

Query: 169 ALEQSLFEAKKTQDEALEESNRLSRLIDELQYALQDTQQTITSXXXXXXXXXXKLDAANM 228
           ALEQ+LF+A+K Q E+++E+++L  ++ EL+   ++ ++              +  A+  
Sbjct: 193 ALEQALFDAQKFQGESIQENDKLREIVKELRLNSREAEENAECLEKQNKELMERCVASER 252

Query: 229 NTRLFSRKVGDWLENKNRSHLKRKPSLSEQESKPEDIRTCGEVLGPLLGSLAMFLGPAAD 288
           N           +++  +S   R  S SE    P       +    ++  L +F      
Sbjct: 253 N-----------IKDLRQSFRGRLESESEAPVNP-------DCFHDIIKKLEVF--QDGK 292

Query: 289 LKGKME-MPHQIQXXXXXXXXXXXXXXXXXXXXXXXXXXXVQEMAELRYQFTGLLEEECK 347
           L+ KME M  QI                             QEMAELRY+ T LLEEECK
Sbjct: 293 LRDKMEDMARQILQYKDLVKQLKDELKEEKLKAKEEAEDLTQEMAELRYEMTCLLEEECK 352

Query: 348 RRACIEHASLQRISELEAQLQREQ 371
           RRACIE ASLQRI+ LEAQ++RE+
Sbjct: 353 RRACIEQASLQRIANLEAQIKREK 376


>AT5G07890.2 | Symbols:  | myosin heavy chain-related |
           chr5:2517718-2519049 REVERSE LENGTH=328
          Length = 328

 Score =  163 bits (413), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 124/321 (38%), Positives = 177/321 (55%), Gaps = 21/321 (6%)

Query: 70  LEKELLNCFQEIDFLQDRLNVR------------NLELKLEEMGDLQEEVFSLREELRRS 117
           +EKELLNC++EID+L+D+L  R            +LE KL E  +L+EEV SLR+EL  S
Sbjct: 2   MEKELLNCYKEIDYLRDQLIFRSKEVNYLNEHLHDLEFKLAESRNLEEEVNSLRDELCMS 61

Query: 118 NSKQFSLTQELETKEIELEQSAFSIEKLEGSFSSIALESQFEVESMKLDMMALEQSLFEA 177
            S+   L QELE+KEIEL+ S+ ++EKLE + SS+ LES  E+ESMKLD+ ALEQ+LF+A
Sbjct: 62  KSEHLLLLQELESKEIELQCSSLTLEKLEETISSLTLESLCEIESMKLDITALEQALFDA 121

Query: 178 KKTQDEALEESNRLSRLIDELQYALQDTQQTITSXXXXXXXXXXKLDAANMNTRLFSRKV 237
            K Q+E+++E ++L  +I+E Q+  Q  ++ +            K  A+  + + F +  
Sbjct: 122 MKIQEESIQEKDQLKGIIEESQFQSQRAKENVKYIEKQNEDLREKFTASEKSIKDFFQST 181

Query: 238 GDWLENKNRSHLKRK---PSLSEQESKPEDIRTCGEVLGPLLGSLAMFLGPAADLKGKME 294
            + LE+++   L        LS       ++R C + +   L      L    +L  K+E
Sbjct: 182 KERLESEDEQPLNAMCFFAELSHVLPVSNEVRNCFDAIMKKLE-----LSQNVNLIDKVE 236

Query: 295 -MPHQIQXXXXXXXXXXXXXXXXXXXXXXXXXXXVQEMAELRYQFTGLLEEECKRRACIE 353
            M  QI                             QEMAELRY+ T LL+EE  RR CIE
Sbjct: 237 GMGKQIHQHEDVVKQLKEELKQEKLKAKEEAEDLTQEMAELRYKMTCLLDEERNRRVCIE 296

Query: 354 HASLQRISELEAQLQREQKTP 374
            ASLQRISELEAQ++R+ K P
Sbjct: 297 QASLQRISELEAQIKRDVKKP 317


>AT5G61200.1 | Symbols:  | BEST Arabidopsis thaliana protein match
           is: myosin heavy chain-related (TAIR:AT5G07890.2); Has
           22208 Blast hits to 14344 proteins in 1121 species:
           Archae - 324; Bacteria - 1921; Metazoa - 12512; Fungi -
           1464; Plants - 1009; Viruses - 53; Other Eukaryotes -
           4925 (source: NCBI BLink). | chr5:24621035-24622081
           FORWARD LENGTH=283
          Length = 283

 Score =  137 bits (345), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 108/284 (38%), Positives = 156/284 (54%), Gaps = 21/284 (7%)

Query: 89  NVRNLELKLEEMGDLQEEVFSLREELRRSNSKQFSLTQELETKEIELEQSAFSIEKLEGS 148
           +V +LE+++ + G L+EEV  LREEL  S S+Q  L QELE+ E EL+ S FS+EKLE S
Sbjct: 7   HVLDLEVRVTKSGKLEEEVNYLREELCSSKSEQLLLLQELESTETELQFSLFSVEKLEES 66

Query: 149 FSSIALESQFEVESMKLDMMALEQSLFEAKKTQDEALEESNRLSRLIDELQYALQDTQQT 208
            SS+ LESQ E+ES+KLD++ALEQ+LF+A+K Q E+++E+++L  ++ EL+   ++ ++ 
Sbjct: 67  VSSLTLESQCEIESIKLDIVALEQALFDAQKFQGESIQENDKLREIVKELRLNSREAEEN 126

Query: 209 ITSXXXXXXXXXXKLDAANMNTRLFSRKVGDWLENKNRSHLKRKPSLSEQESKPEDIRTC 268
                        +  A+  N +   +     LE++           SE    P      
Sbjct: 127 AECLEKQNKELMERCVASERNIKDLRQSFRGRLESE-----------SEAPVNP------ 169

Query: 269 GEVLGPLLGSLAMFLGPAADLKGKME-MPHQIQXXXXXXXXXXXXXXXXXXXXXXXXXXX 327
            +    ++  L +F      L+ KME M  QI                            
Sbjct: 170 -DCFHDIIKKLEVF--QDGKLRDKMEDMARQILQYKDLVKQLKDELKEEKLKAKEEAEDL 226

Query: 328 VQEMAELRYQFTGLLEEECKRRACIEHASLQRISELEAQLQREQ 371
            QEMAELRY+ T LLEEECKRRACIE ASLQRI+ LEAQ++RE+
Sbjct: 227 TQEMAELRYEMTCLLEEECKRRACIEQASLQRIANLEAQIKREK 270


>AT5G61200.2 | Symbols:  | FUNCTIONS IN: molecular_function unknown;
           INVOLVED IN: biological_process unknown; BEST
           Arabidopsis thaliana protein match is: myosin heavy
           chain-related (TAIR:AT5G07890.2); Has 30201 Blast hits
           to 17322 proteins in 780 species: Archae - 12; Bacteria
           - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037;
           Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
           BLink). | chr5:24621035-24622023 FORWARD LENGTH=295
          Length = 295

 Score =  132 bits (332), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 107/280 (38%), Positives = 152/280 (54%), Gaps = 21/280 (7%)

Query: 89  NVRNLELKLEEMGDLQEEVFSLREELRRSNSKQFSLTQELETKEIELEQSAFSIEKLEGS 148
           +V +LE+++ + G L+EEV  LREEL  S S+Q  L QELE+ E EL+ S FS+EKLE S
Sbjct: 7   HVLDLEVRVTKSGKLEEEVNYLREELCSSKSEQLLLLQELESTETELQFSLFSVEKLEES 66

Query: 149 FSSIALESQFEVESMKLDMMALEQSLFEAKKTQDEALEESNRLSRLIDELQYALQDTQQT 208
            SS+ LESQ E+ES+KLD++ALEQ+LF+A+K Q E+++E+++L  ++ EL+   ++ ++ 
Sbjct: 67  VSSLTLESQCEIESIKLDIVALEQALFDAQKFQGESIQENDKLREIVKELRLNSREAEEN 126

Query: 209 ITSXXXXXXXXXXKLDAANMNTRLFSRKVGDWLENKNRSHLKRKPSLSEQESKPEDIRTC 268
                        +  A+  N +           +  +S   R  S SE    P      
Sbjct: 127 AECLEKQNKELMERCVASERNIK-----------DLRQSFRGRLESESEAPVNP------ 169

Query: 269 GEVLGPLLGSLAMFLGPAADLKGKME-MPHQIQXXXXXXXXXXXXXXXXXXXXXXXXXXX 327
            +    ++  L +F      L+ KME M  QI                            
Sbjct: 170 -DCFHDIIKKLEVF--QDGKLRDKMEDMARQILQYKDLVKQLKDELKEEKLKAKEEAEDL 226

Query: 328 VQEMAELRYQFTGLLEEECKRRACIEHASLQRISELEAQL 367
            QEMAELRY+ T LLEEECKRRACIE ASLQRI+ LEAQ+
Sbjct: 227 TQEMAELRYEMTCLLEEECKRRACIEQASLQRIANLEAQV 266