Miyakogusa Predicted Gene
- Lj1g3v4730380.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v4730380.1 Non Characterized Hit- tr|Q4P6G2|Q4P6G2_USTMA
Putative uncharacterized protein OS=Ustilago maydis (s,34.59,4e-16,no
description,Armadillo-like helical; TESTIS EXPRESSED GENE
10-RELATED,NULL; UNCHARACTERIZED,NULL,CUFF.33063.1
(163 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value
AT5G06350.1 | Symbols: | ARM repeat superfamily protein | chr5:...   234   2e-62
AT5G27010.1 | Symbols: | ARM repeat superfamily protein | chr5:...   191   2e-49
AT4G04680.1 | Symbols: | INVOLVED IN: biological_process unknow...   160   4e-40
>AT5G06350.1 | Symbols: | ARM repeat superfamily protein |
chr5:1938781-1944197 FORWARD LENGTH=877
Length = 877
Score = 234 bits (596), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 118/163 (72%), Positives = 135/163 (82%)
Query: 1 MTRPKANSKKQRGGGVDFKKIRRKIGRKLPPPKNTTNTEIKSKAIVLPEQSVAAEKTGLA 60
M R KA +KKQ+ G+DFKKI+RK+GRKLPPPKN TNTEIKSKAI+LPEQSVAAEK+GLA
Sbjct: 1 MVRSKAPAKKQQKKGIDFKKIKRKLGRKLPPPKNATNTEIKSKAIILPEQSVAAEKSGLA 60
Query: 61 VNKKGLTLKELLQQTSHHNAKVRRDALTGIKDLFNKYPAELKLHKYAAVEKLRERIGDDD 120
+KKGLTLKELL QTSHHNAKVR+DAL GIKDLF +P EL+ HKYA ++KLRERI DDD
Sbjct: 61 TSKKGLTLKELLPQTSHHNAKVRKDALYGIKDLFKNHPEELQSHKYAIIQKLRERISDDD 120
Query: 121 KVVRKSLYDLFKLVILPGCKEDNQELITSLLMAYIFNAMTHLA 163
K+VR Y LF + I P CKEDNQ L+ SLLM YIF+AM H A
Sbjct: 121 KLVRDVFYQLFDIDIFPLCKEDNQGLMVSLLMPYIFSAMAHSA 163
>AT5G27010.1 | Symbols: | ARM repeat superfamily protein |
chr5:9503315-9507569 REVERSE LENGTH=863
Length = 863
Score = 191 bits (484), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 101/161 (62%), Positives = 119/161 (73%), Gaps = 2/161 (1%)
Query: 1 MTRPKANSKKQRGGGVDFKKIRRKIGRKLPPPKNTTNTEIKSKAIVLPEQSVAAEKTGLA 60
M+R KA ++KQ+ G+DFKKI+RK+GRKLPPP N TNTEIKSKAI+L EQSVAAE+ G A
Sbjct: 1 MSRSKAPARKQQKKGIDFKKIKRKLGRKLPPPNNATNTEIKSKAIILHEQSVAAERDGFA 60
Query: 61 VNKKGLTLKELLQQTSHHNAKVRRDALTGIKDLFNKYPAELKLHKYAAVEKLRERIGDDD 120
+KKGLTL EL +T H NAKVR+DAL GIKDL +PAEL +KYA KLRE I DDD
Sbjct: 61 TSKKGLTLLELKNRTGHPNAKVRKDALHGIKDLLKHHPAELLSNKYATTHKLRELITDDD 120
Query: 121 KVVRKSLYDLFKLVILPGCKED-NQELITSLLMAYIFNAMT 160
K+VR Y L + L CKED N+ L+ S LM YIF AMT
Sbjct: 121 KLVRDDFYTLLTGIFL-ACKEDINKGLMVSSLMPYIFTAMT 160
>AT4G04680.1 | Symbols: | INVOLVED IN: biological_process unknown;
LOCATED IN: endomembrane system; BEST Arabidopsis
thaliana protein match is: ARM repeat superfamily
protein (TAIR:AT5G06350.1); Has 30201 Blast hits to
17322 proteins in 780 species: Archae - 12; Bacteria -
1396; Metazoa - 17338; Fungi - 3422; Plants - 5037;
Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
BLink). | chr4:2372266-2373327 FORWARD LENGTH=261
Length = 261
Score = 160 bits (405), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 77/120 (64%), Positives = 95/120 (79%)
Query: 42 SKAIVLPEQSVAAEKTGLAVNKKGLTLKELLQQTSHHNAKVRRDALTGIKDLFNKYPAEL 101
+ A++L EQ+VAAEK+GLA +KKGLTLK+LL QTSH NAK+R+DAL G+KDL +PAEL
Sbjct: 23 AAAMILAEQNVAAEKSGLATSKKGLTLKDLLPQTSHCNAKLRKDALNGLKDLLKNHPAEL 82
Query: 102 KLHKYAAVEKLRERIGDDDKVVRKSLYDLFKLVILPGCKEDNQELITSLLMAYIFNAMTH 161
+ HKYA ++KLRERI DDD +VR +LY LF+ VILP CK DNQ + SLLM YI AM H
Sbjct: 83 QSHKYAIIQKLRERIMDDDSLVRDALYQLFESVILPACKNDNQSPMVSLLMPYISCAMAH 142