Miyakogusa Predicted Gene
- Lj4g3v1683310.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v1683310.1 tr|A9THB8|A9THB8_PHYPA Predicted protein
OS=Physcomitrella patens subsp. patens
GN=PHYPADRAFT_170090,62.9,0.000000003,coiled-coil,NULL;
seg,NULL,CUFF.49597.1
(374 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
                                                                     Score    E
Sequences producing significant alignments:                         (bits)  Value
AT5G07890.3 | Symbols: | myosin heavy chain-related | chr5:2517...     201  9e-52
AT5G07890.1 | Symbols: | myosin heavy chain-related | chr5:2517...     201  9e-52
AT5G61200.3 | Symbols: | FUNCTIONS IN: molecular_function unkno...     197  1e-50
AT5G07890.2 | Symbols: | myosin heavy chain-related | chr5:2517...     163  2e-40
AT5G61200.1 | Symbols: | BEST Arabidopsis thaliana protein matc...     137  1e-32
AT5G61200.2 | Symbols: | FUNCTIONS IN: molecular_function unkno...     132  4e-31
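The summary table above can be read programmatically for downstream filtering. The following is a minimal sketch in Python, assuming each hit line ends with the bit score and the E-value as its last two whitespace-separated fields. The names `Hit` and `parse_hit_line` are hypothetical helpers for illustration and are not part of BLAST.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    subject_id: str
    description: str
    bit_score: float
    evalue: float


def parse_hit_line(line: str) -> Hit:
    """Split one summary line into ID, description, bit score and E-value."""
    # The last two whitespace-separated fields are the score and the E-value.
    description, score, evalue = line.rsplit(None, 2)
    # Legacy BLAST may print very small E-values without a mantissa ("e-100").
    if evalue.startswith("e"):
        evalue = "1" + evalue
    subject_id = description.split()[0]
    return Hit(subject_id, description, float(score), float(evalue))


# Two lines copied from the summary table of this report.
lines = [
    "AT5G07890.3 | Symbols: | myosin heavy chain-related | chr5:2517... 201 9e-52",
    "AT5G61200.2 | Symbols: | FUNCTIONS IN: molecular_function unkno... 132 4e-31",
]
hits = [parse_hit_line(text) for text in lines]
for hit in hits:
    print(hit.subject_id, hit.bit_score, hit.evalue)
```

Splitting from the right keeps the free-text description intact even though it contains spaces and `|` separators.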
>AT5G07890.3 | Symbols: | myosin heavy chain-related |
chr5:2517718-2519493 REVERSE LENGTH=409
Length = 409
Score = 201 bits (510), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 148/383 (38%), Positives = 213/383 (55%), Gaps = 21/383 (5%)
Query: 8 DSDAPSSVEELVHLGTGFRQHRKEKDVLRSSQSQSFQPIRILERYEKSSPGALTDDKKHI 67
D + VE+L+ +GT R+ RK+KD+LR SQ S + +R LE + KS + +D I
Sbjct: 21 DCENSFDVEDLLQIGTTRRELRKQKDLLRESQPHSIELVRRLELHTKSLSESRLEDTARI 80
Query: 68 QRLEKELLNCFQEIDFLQDRLNVR------------NLELKLEEMGDLQEEVFSLREELR 115
Q +EKELLNC++EID+L+D+L R +LE KL E +L+EEV SLR+EL
Sbjct: 81 QMMEKELLNCYKEIDYLRDQLIFRSKEVNYLNEHLHDLEFKLAESRNLEEEVNSLRDELC 140
Query: 116 RSNSKQFSLTQELETKEIELEQSAFSIEKLEGSFSSIALESQFEVESMKLDMMALEQSLF 175
S S+ L QELE+KEIEL+ S+ ++EKLE + SS+ LES E+ESMKLD+ ALEQ+LF
Sbjct: 141 MSKSEHLLLLQELESKEIELQCSSLTLEKLEETISSLTLESLCEIESMKLDITALEQALF 200
Query: 176 EAKKTQDEALEESNRLSRLIDELQYALQDTQQTITSXXXXXXXXXXKLDAANMNTRLFSR 235
+A K Q+E+++E ++L +I+E Q+ Q ++ + K A+ + + F +
Sbjct: 201 DAMKIQEESIQEKDQLKGIIEESQFQSQRAKENVKYIEKQNEDLREKFTASEKSIKDFFQ 260
Query: 236 KVGDWLENKNRSHLKRK---PSLSEQESKPEDIRTCGEVLGPLLGSLAMFLGPAADLKGK 292
+ LE+++ L LS ++R C + + L L +L K
Sbjct: 261 STKERLESEDEQPLNAMCFFAELSHVLPVSNEVRNCFDAIMKKLE-----LSQNVNLIDK 315
Query: 293 ME-MPHQIQXXXXXXXXXXXXXXXXXXXXXXXXXXXVQEMAELRYQFTGLLEEECKRRAC 351
+E M QI QEMAELRY+ T LL+EE RR C
Sbjct: 316 VEGMGKQIHQHEDVVKQLKEELKQEKLKAKEEAEDLTQEMAELRYKMTCLLDEERNRRVC 375
Query: 352 IEHASLQRISELEAQLQREQKTP 374
IE ASLQRISELEAQ++R+ K P
Sbjct: 376 IEQASLQRISELEAQIKRDVKKP 398
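As a rough plausibility check on the statistics reported for this hit, the bit score can be related to the E-value through the standard Karlin-Altschul relation E ≈ m · n · 2^(-S'), where S' is the bit score and m and n are the query and database lengths. The sketch below is an approximation under stated assumptions: it uses the raw lengths from this report (374 letters, 14,482,855 total letters) rather than the effective, edge-corrected lengths BLAST uses internally, and the printed bit score is rounded, so the result is expected to agree with the reported value only to within a small factor. The function name `approx_evalue` is hypothetical.

```python
def approx_evalue(bit_score: float, query_len: int, db_letters: int) -> float:
    """Approximate E-value from a bit score using raw (uncorrected) lengths."""
    return query_len * db_letters * 2.0 ** (-bit_score)


# Values taken from this report: top hit score, query length, database size.
estimate = approx_evalue(201, 374, 14_482_855)
print(f"approximate E-value: {estimate:.1e}")
```

Because the raw lengths overstate the effective search space, this estimate should come out somewhat larger than the reported `9e-52`.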
>AT5G07890.1 | Symbols: | myosin heavy chain-related |
chr5:2517718-2519493 REVERSE LENGTH=409
Length = 409
Score = 201 bits (510), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 148/383 (38%), Positives = 213/383 (55%), Gaps = 21/383 (5%)
Query: 8 DSDAPSSVEELVHLGTGFRQHRKEKDVLRSSQSQSFQPIRILERYEKSSPGALTDDKKHI 67
D + VE+L+ +GT R+ RK+KD+LR SQ S + +R LE + KS + +D I
Sbjct: 21 DCENSFDVEDLLQIGTTRRELRKQKDLLRESQPHSIELVRRLELHTKSLSESRLEDTARI 80
Query: 68 QRLEKELLNCFQEIDFLQDRLNVR------------NLELKLEEMGDLQEEVFSLREELR 115
Q +EKELLNC++EID+L+D+L R +LE KL E +L+EEV SLR+EL
Sbjct: 81 QMMEKELLNCYKEIDYLRDQLIFRSKEVNYLNEHLHDLEFKLAESRNLEEEVNSLRDELC 140
Query: 116 RSNSKQFSLTQELETKEIELEQSAFSIEKLEGSFSSIALESQFEVESMKLDMMALEQSLF 175
S S+ L QELE+KEIEL+ S+ ++EKLE + SS+ LES E+ESMKLD+ ALEQ+LF
Sbjct: 141 MSKSEHLLLLQELESKEIELQCSSLTLEKLEETISSLTLESLCEIESMKLDITALEQALF 200
Query: 176 EAKKTQDEALEESNRLSRLIDELQYALQDTQQTITSXXXXXXXXXXKLDAANMNTRLFSR 235
+A K Q+E+++E ++L +I+E Q+ Q ++ + K A+ + + F +
Sbjct: 201 DAMKIQEESIQEKDQLKGIIEESQFQSQRAKENVKYIEKQNEDLREKFTASEKSIKDFFQ 260
Query: 236 KVGDWLENKNRSHLKRK---PSLSEQESKPEDIRTCGEVLGPLLGSLAMFLGPAADLKGK 292
+ LE+++ L LS ++R C + + L L +L K
Sbjct: 261 STKERLESEDEQPLNAMCFFAELSHVLPVSNEVRNCFDAIMKKLE-----LSQNVNLIDK 315
Query: 293 ME-MPHQIQXXXXXXXXXXXXXXXXXXXXXXXXXXXVQEMAELRYQFTGLLEEECKRRAC 351
+E M QI QEMAELRY+ T LL+EE RR C
Sbjct: 316 VEGMGKQIHQHEDVVKQLKEELKQEKLKAKEEAEDLTQEMAELRYKMTCLLDEERNRRVC 375
Query: 352 IEHASLQRISELEAQLQREQKTP 374
IE ASLQRISELEAQ++R+ K P
Sbjct: 376 IEQASLQRISELEAQIKRDVKKP 398
>AT5G61200.3 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
chloroplast; BEST Arabidopsis thaliana protein match is:
myosin heavy chain-related (TAIR:AT5G07890.3). |
chr5:24620456-24622081 FORWARD LENGTH=389
Length = 389
Score = 197 bits (501), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 149/384 (38%), Positives = 215/384 (55%), Gaps = 41/384 (10%)
Query: 1 MSNSFRSDSDAPSSVEELVHLGTGFRQHRKEKDVLRSSQSQSFQPIRILERYEKSSPGAL 60
+ NSF +D EL+ +G+ + R+EK++LR SQSQS + +R LE S +
Sbjct: 21 VDNSFDAD--------ELLQIGSRCMELRREKEMLRESQSQSVELVRRLELNANSLSESR 72
Query: 61 TDDKKHIQRLEKELLNCFQEIDFLQDRLNVR------------NLELKLEEMGDLQEEVF 108
+DK+ IQ LEKELLNC+QEID+L+D++N R +LE+++ + G L+EEV
Sbjct: 73 LEDKRRIQMLEKELLNCYQEIDYLRDQVNFRSQEMNDLSEHVLDLEVRVTKSGKLEEEVN 132
Query: 109 SLREELRRSNSKQFSLTQELETKEIELEQSAFSIEKLEGSFSSIALESQFEVESMKLDMM 168
LREEL S S+Q L QELE+ E EL+ S FS+EKLE S SS+ LESQ E+ES+KLD++
Sbjct: 133 YLREELCSSKSEQLLLLQELESTETELQFSLFSVEKLEESVSSLTLESQCEIESIKLDIV 192
Query: 169 ALEQSLFEAKKTQDEALEESNRLSRLIDELQYALQDTQQTITSXXXXXXXXXXKLDAANM 228
ALEQ+LF+A+K Q E+++E+++L ++ EL+ ++ ++ + A+
Sbjct: 193 ALEQALFDAQKFQGESIQENDKLREIVKELRLNSREAEENAECLEKQNKELMERCVASER 252
Query: 229 NTRLFSRKVGDWLENKNRSHLKRKPSLSEQESKPEDIRTCGEVLGPLLGSLAMFLGPAAD 288
N +++ +S R S SE P + ++ L +F
Sbjct: 253 N-----------IKDLRQSFRGRLESESEAPVNP-------DCFHDIIKKLEVF--QDGK 292
Query: 289 LKGKME-MPHQIQXXXXXXXXXXXXXXXXXXXXXXXXXXXVQEMAELRYQFTGLLEEECK 347
L+ KME M QI QEMAELRY+ T LLEEECK
Sbjct: 293 LRDKMEDMARQILQYKDLVKQLKDELKEEKLKAKEEAEDLTQEMAELRYEMTCLLEEECK 352
Query: 348 RRACIEHASLQRISELEAQLQREQ 371
RRACIE ASLQRI+ LEAQ++RE+
Sbjct: 353 RRACIEQASLQRIANLEAQIKREK 376
>AT5G07890.2 | Symbols: | myosin heavy chain-related |
chr5:2517718-2519049 REVERSE LENGTH=328
Length = 328
Score = 163 bits (413), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 124/321 (38%), Positives = 177/321 (55%), Gaps = 21/321 (6%)
Query: 70 LEKELLNCFQEIDFLQDRLNVR------------NLELKLEEMGDLQEEVFSLREELRRS 117
+EKELLNC++EID+L+D+L R +LE KL E +L+EEV SLR+EL S
Sbjct: 2 MEKELLNCYKEIDYLRDQLIFRSKEVNYLNEHLHDLEFKLAESRNLEEEVNSLRDELCMS 61
Query: 118 NSKQFSLTQELETKEIELEQSAFSIEKLEGSFSSIALESQFEVESMKLDMMALEQSLFEA 177
S+ L QELE+KEIEL+ S+ ++EKLE + SS+ LES E+ESMKLD+ ALEQ+LF+A
Sbjct: 62 KSEHLLLLQELESKEIELQCSSLTLEKLEETISSLTLESLCEIESMKLDITALEQALFDA 121
Query: 178 KKTQDEALEESNRLSRLIDELQYALQDTQQTITSXXXXXXXXXXKLDAANMNTRLFSRKV 237
K Q+E+++E ++L +I+E Q+ Q ++ + K A+ + + F +
Sbjct: 122 MKIQEESIQEKDQLKGIIEESQFQSQRAKENVKYIEKQNEDLREKFTASEKSIKDFFQST 181
Query: 238 GDWLENKNRSHLKRK---PSLSEQESKPEDIRTCGEVLGPLLGSLAMFLGPAADLKGKME 294
+ LE+++ L LS ++R C + + L L +L K+E
Sbjct: 182 KERLESEDEQPLNAMCFFAELSHVLPVSNEVRNCFDAIMKKLE-----LSQNVNLIDKVE 236
Query: 295 -MPHQIQXXXXXXXXXXXXXXXXXXXXXXXXXXXVQEMAELRYQFTGLLEEECKRRACIE 353
M QI QEMAELRY+ T LL+EE RR CIE
Sbjct: 237 GMGKQIHQHEDVVKQLKEELKQEKLKAKEEAEDLTQEMAELRYKMTCLLDEERNRRVCIE 296
Query: 354 HASLQRISELEAQLQREQKTP 374
ASLQRISELEAQ++R+ K P
Sbjct: 297 QASLQRISELEAQIKRDVKKP 317
>AT5G61200.1 | Symbols: | BEST Arabidopsis thaliana protein match
is: myosin heavy chain-related (TAIR:AT5G07890.2); Has
22208 Blast hits to 14344 proteins in 1121 species:
Archae - 324; Bacteria - 1921; Metazoa - 12512; Fungi -
1464; Plants - 1009; Viruses - 53; Other Eukaryotes -
4925 (source: NCBI BLink). | chr5:24621035-24622081
FORWARD LENGTH=283
Length = 283
Score = 137 bits (345), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 108/284 (38%), Positives = 156/284 (54%), Gaps = 21/284 (7%)
Query: 89 NVRNLELKLEEMGDLQEEVFSLREELRRSNSKQFSLTQELETKEIELEQSAFSIEKLEGS 148
+V +LE+++ + G L+EEV LREEL S S+Q L QELE+ E EL+ S FS+EKLE S
Sbjct: 7 HVLDLEVRVTKSGKLEEEVNYLREELCSSKSEQLLLLQELESTETELQFSLFSVEKLEES 66
Query: 149 FSSIALESQFEVESMKLDMMALEQSLFEAKKTQDEALEESNRLSRLIDELQYALQDTQQT 208
SS+ LESQ E+ES+KLD++ALEQ+LF+A+K Q E+++E+++L ++ EL+ ++ ++
Sbjct: 67 VSSLTLESQCEIESIKLDIVALEQALFDAQKFQGESIQENDKLREIVKELRLNSREAEEN 126
Query: 209 ITSXXXXXXXXXXKLDAANMNTRLFSRKVGDWLENKNRSHLKRKPSLSEQESKPEDIRTC 268
+ A+ N + + LE++ SE P
Sbjct: 127 AECLEKQNKELMERCVASERNIKDLRQSFRGRLESE-----------SEAPVNP------ 169
Query: 269 GEVLGPLLGSLAMFLGPAADLKGKME-MPHQIQXXXXXXXXXXXXXXXXXXXXXXXXXXX 327
+ ++ L +F L+ KME M QI
Sbjct: 170 -DCFHDIIKKLEVF--QDGKLRDKMEDMARQILQYKDLVKQLKDELKEEKLKAKEEAEDL 226
Query: 328 VQEMAELRYQFTGLLEEECKRRACIEHASLQRISELEAQLQREQ 371
QEMAELRY+ T LLEEECKRRACIE ASLQRI+ LEAQ++RE+
Sbjct: 227 TQEMAELRYEMTCLLEEECKRRACIEQASLQRIANLEAQIKREK 270
>AT5G61200.2 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; BEST
Arabidopsis thaliana protein match is: myosin heavy
chain-related (TAIR:AT5G07890.2); Has 30201 Blast hits
to 17322 proteins in 780 species: Archae - 12; Bacteria
- 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037;
Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
BLink). | chr5:24621035-24622023 FORWARD LENGTH=295
Length = 295
Score = 132 bits (332), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 107/280 (38%), Positives = 152/280 (54%), Gaps = 21/280 (7%)
Query: 89 NVRNLELKLEEMGDLQEEVFSLREELRRSNSKQFSLTQELETKEIELEQSAFSIEKLEGS 148
+V +LE+++ + G L+EEV LREEL S S+Q L QELE+ E EL+ S FS+EKLE S
Sbjct: 7 HVLDLEVRVTKSGKLEEEVNYLREELCSSKSEQLLLLQELESTETELQFSLFSVEKLEES 66
Query: 149 FSSIALESQFEVESMKLDMMALEQSLFEAKKTQDEALEESNRLSRLIDELQYALQDTQQT 208
SS+ LESQ E+ES+KLD++ALEQ+LF+A+K Q E+++E+++L ++ EL+ ++ ++
Sbjct: 67 VSSLTLESQCEIESIKLDIVALEQALFDAQKFQGESIQENDKLREIVKELRLNSREAEEN 126
Query: 209 ITSXXXXXXXXXXKLDAANMNTRLFSRKVGDWLENKNRSHLKRKPSLSEQESKPEDIRTC 268
+ A+ N + + +S R S SE P
Sbjct: 127 AECLEKQNKELMERCVASERNIK-----------DLRQSFRGRLESESEAPVNP------ 169
Query: 269 GEVLGPLLGSLAMFLGPAADLKGKME-MPHQIQXXXXXXXXXXXXXXXXXXXXXXXXXXX 327
+ ++ L +F L+ KME M QI
Sbjct: 170 -DCFHDIIKKLEVF--QDGKLRDKMEDMARQILQYKDLVKQLKDELKEEKLKAKEEAEDL 226
Query: 328 VQEMAELRYQFTGLLEEECKRRACIEHASLQRISELEAQL 367
QEMAELRY+ T LLEEECKRRACIE ASLQRI+ LEAQ+
Sbjct: 227 TQEMAELRYEMTCLLEEECKRRACIEQASLQRIANLEAQV 266
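The `Identities`, `Positives` and `Gaps` lines in the alignments above report each count over the alignment length together with a percentage. The following is a minimal sketch in Python that parses such a line and recomputes the percentages. It assumes the percentage is the integer part of count/length, which agrees with every statistics line in this report; the names `STATS` and `check_stats` are hypothetical.

```python
import re

# Matches e.g. "Identities = 148/383 (38%)" and captures name, count, length, pct.
STATS = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def check_stats(line: str) -> dict:
    """Return {field: (count, length, reported_pct, recomputed_pct)}."""
    result = {}
    for name, count, length, pct in STATS.findall(line):
        count, length, pct = int(count), int(length), int(pct)
        # Integer division truncates, e.g. 148 * 100 // 383 gives 38.
        result[name] = (count, length, pct, count * 100 // length)
    return result


# Statistics line copied from the AT5G07890.3 alignment in this report.
line = "Identities = 148/383 (38%), Positives = 213/383 (55%), Gaps = 21/383 (5%)"
stats = check_stats(line)
for name, (count, length, reported, recomputed) in stats.items():
    print(name, count, length, reported, recomputed)
```

A mismatch between the reported and recomputed percentage would point to a damaged or mis-transcribed statistics line.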