Miyakogusa Predicted Gene
- Lj3g3v2476540.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v2476540.1 Non Chatacterized Hit- tr|C6T5Y3|C6T5Y3_SOYBN
Putative uncharacterized protein OS=Glycine max PE=2
S,50.81,5e-19,DUF688,Protein of unknown function DUF688,CUFF.44047.1
(232 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT4G27810.1 | Symbols: | unknown protein; CONTAINS InterPro DOM... 86 3e-17
AT5G53030.1 | Symbols: | unknown protein; CONTAINS InterPro DOM... 74 7e-14
AT5G53030.2 | Symbols: | unknown protein; CONTAINS InterPro DOM... 65 3e-11
AT4G00950.1 | Symbols: MEE47 | Protein of unknown function (DUF6... 64 1e-10
AT2G46535.1 | Symbols: | unknown protein; CONTAINS InterPro DOM... 50 1e-06
>AT4G27810.1 | Symbols: | unknown protein; CONTAINS InterPro
DOMAIN/s: Protein of unknown function DUF688
(InterPro:IPR007789); BEST Arabidopsis thaliana protein
match is: unknown protein (TAIR:AT5G53030.1); Has 73
Blast hits to 66 proteins in 11 species: Archae - 0;
Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 73;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
| chr4:13854641-13855671 REVERSE LENGTH=196
Length = 196
Score = 85.5 bits (210), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 65/193 (33%), Positives = 89/193 (46%), Gaps = 33/193 (17%)
Query: 34 LPLFKPPPMHSPERPGMLTPPLHTSASVPFGWEEEPGKPR------PCTDIVSFSNPMPK 87
LPLF P + + PG+ TPP++ + SVPF WEE PGKPR P +
Sbjct: 17 LPLFSIPFNRACDTPGLATPPVNIAGSVPFLWEEAPGKPRVSDENKPLASKQNEREGGGG 76
Query: 88 LTPKCLELPPRLQVDAINISKIPSPTTVLEGPYMGSRRVSDDFCGSFGAERGRLGTLVLK 147
+CLELPPRL A PSPTTVL+GPY RR L +
Sbjct: 77 GVVRCLELPPRLFFPA---DDEPSPTTVLDGPYDVPRR--------------SLSVIRRS 119
Query: 148 EKSWFGSWSENAFKVKHVFSSSADNDTDHVVGSDNNVRTRKMKPYGSFSNPFHAKSHVWE 207
E++ SE F+ +S + G V+ +++ GS N H+KS
Sbjct: 120 ERA-----SEGRFEFSRSTNSRCCDG-----GGGTTVKISRVRRKGSLLNLSHSKSQFLA 169
Query: 208 RICERWKQVVPWR 220
R+ + +KQV+PWR
Sbjct: 170 RVYQGFKQVIPWR 182
>AT5G53030.1 | Symbols: | unknown protein; CONTAINS InterPro
DOMAIN/s: Protein of unknown function DUF688
(InterPro:IPR007789); BEST Arabidopsis thaliana protein
match is: unknown protein (TAIR:AT4G27810.1); Has 1807
Blast hits to 1807 proteins in 277 species: Archae - 0;
Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385;
Viruses - 0; Other Eukaryotes - 339 (source: NCBI
BLink). | chr5:21505319-21506329 FORWARD LENGTH=245
Length = 245
Score = 74.3 bits (181), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 73/220 (33%), Positives = 96/220 (43%), Gaps = 36/220 (16%)
Query: 32 QSLPLFKPP--PMHSPERPGMLTPPLHTSASVPFGWEEEPGKPRPCTDIVSFSNPMPKLT 89
+ LPLF P + PG+ TPP++ + SVPF WEE PGKPR + K
Sbjct: 17 KQLPLFSYPMNNIAYETTPGLATPPVNIAGSVPFLWEEAPGKPRRVKKPARLNQ---KGV 73
Query: 90 PKCLELPPRLQV--DAINISKIPSPTTVLEGPYMGSRRVSDDFCGSFGAERGRLGT---- 143
+ LELPPRL + ++ +++ PSPTTVL+GPY RR S S R G
Sbjct: 74 VRSLELPPRLVLPGESTTVNE-PSPTTVLDGPY-DLRRRSLSLPRSAAVIRKLRGVPAPA 131
Query: 144 ------LVLKEKSW--FGS---WSENAFKVKHVFSSSADNDTDHV-------VGSDNNVR 185
LV W FG+ SE F D D D V+
Sbjct: 132 PEKEERLVGGSSRWGSFGNCKEVSEGIFDFSRFRDDGYDCRRDWAGGGGVGNFAGDAKVK 191
Query: 186 TRKMKPYGSFSNPFH-AKSHVW----ERICERWKQVVPWR 220
++ GSF N H KS W R+ E +KQV+PW+
Sbjct: 192 LYRIIKKGSFFNLSHTTKSDFWLKMQARVYEGFKQVIPWK 231
>AT5G53030.2 | Symbols: | unknown protein; CONTAINS InterPro
DOMAIN/s: Protein of unknown function DUF688
(InterPro:IPR007789); BEST Arabidopsis thaliana protein
match is: unknown protein (TAIR:AT4G27810.1); Has 35333
Blast hits to 34131 proteins in 2444 species: Archae -
798; Bacteria - 22429; Metazoa - 974; Fungi - 991;
Plants - 531; Viruses - 0; Other Eukaryotes - 9610
(source: NCBI BLink). | chr5:21505319-21505993 FORWARD
LENGTH=224
Length = 224
Score = 65.5 bits (158), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 42/98 (42%), Positives = 57/98 (58%), Gaps = 8/98 (8%)
Query: 32 QSLPLFKPP--PMHSPERPGMLTPPLHTSASVPFGWEEEPGKPRPCTDIVSFSNPMPKLT 89
+ LPLF P + PG+ TPP++ + SVPF WEE PGKPR + K
Sbjct: 17 KQLPLFSYPMNNIAYETTPGLATPPVNIAGSVPFLWEEAPGKPRRVKKPARLNQ---KGV 73
Query: 90 PKCLELPPRLQV--DAINISKIPSPTTVLEGPYMGSRR 125
+ LELPPRL + ++ +++ PSPTTVL+GPY RR
Sbjct: 74 VRSLELPPRLVLPGESTTVNE-PSPTTVLDGPYDLRRR 110
>AT4G00950.1 | Symbols: MEE47 | Protein of unknown function (DUF688)
| chr4:405984-407087 REVERSE LENGTH=291
Length = 291
Score = 63.5 bits (153), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 88/296 (29%), Positives = 122/296 (41%), Gaps = 107/296 (36%)
Query: 26 EAESKIQ---SLPLFKPPPMHSPERPGM----LTPPLHTS--ASVPFGWEEEPGKPRPCT 76
EAE + + +L + K P + P +P ++ P+H+S ASVPF WEEEPGKP+ +
Sbjct: 2 EAEKETEQEGNLTVMKLPVL--PTKPNTHSHSMSSPIHSSISASVPFSWEEEPGKPKQHS 59
Query: 77 DIVSFSNPMPKL---------TPKCLELPPRLQV---DAINISKIPSPTTVLEGPY--MG 122
S S+ L T K LELPPRL + D +++K+ SP TV +GPY
Sbjct: 60 TSSSSSSSSSPLTSYSSSPFETHKSLELPPRLHLLEKDGGSVTKLHSPITVFDGPYSMTT 119
Query: 123 SRRV-----------SDDFCGSFGAE----------------------------RGRLGT 143
S+R+ S D GSF ++ RGRLG
Sbjct: 120 SKRMDSPSFRMMVKGSADCYGSFRSDIDGDLEDLEVGSKQQENLSSGSLAVVKKRGRLGF 179
Query: 144 L------VLKEKSWFGSWSENAFKVKHVFSSSAD---------------------NDTDH 176
LK K+ FG S +VF SS D +DTD
Sbjct: 180 FGFRRRRALKGKTEFGRGS-------YVFPSSVDRESEYSRKEEEEEKEDKRFGYDDTDG 232
Query: 177 VVGSDN----NVRTRKMKPYGSF-----SNPFHAKSHVWERICERWKQVVPWRSGK 223
+ S + +V+ + GSF +KSH W + KQVVPW+S K
Sbjct: 233 ISCSQSSRFCDVKISSISRTGSFSTLPAPPSSSSKSHFWTNVYAGLKQVVPWKSKK 288
>AT2G46535.1 | Symbols: | unknown protein; CONTAINS InterPro
DOMAIN/s: Protein of unknown function DUF688
(InterPro:IPR007789); BEST Arabidopsis thaliana protein
match is: Protein of unknown function (DUF688)
(TAIR:AT3G61840.1); Has 48 Blast hits to 48 proteins in
8 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
- 0; Plants - 48; Viruses - 0; Other Eukaryotes - 0
(source: NCBI BLink). | chr2:19109698-19110320 FORWARD
LENGTH=175
Length = 175
Score = 50.1 bits (118), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 54/183 (29%), Positives = 85/183 (46%), Gaps = 40/183 (21%)
Query: 44 SPERPGMLTPPLHTSASVPFGWEEEPGKPRPCTDIVSFSNPMPKLTPKCLELPPRLQVDA 103
SP P + P+HT ASVPF WE++PGKP+ +S+ PKCL+LPPRL +
Sbjct: 28 SPASPRVFASPIHTLASVPFCWEDQPGKPKHPLRPLSY--------PKCLDLPPRLLLPG 79
Query: 104 INISKIPSPTTVLEGPYMGSRRVSDDFCGSFGAERGRLGTLVLKEKSWFGSWSENAFKVK 163
+++P P G R F +GR G +V++
Sbjct: 80 -EFTQMPLPER-----KHGLLR--------FLRRKGR-GDVVVRG--------------N 110
Query: 164 HVFSSSADNDTDHVVGSDNNVRTRKMKPYGSFSNPFHAK-SHVWERICERWKQVVPWRSG 222
+VF S D++ ++NN++ K GS+ K SH W +C+ K +PW++
Sbjct: 111 YVFLSENQRAGDNI--NENNMKIMKFNRSGSYHGGGSVKGSHFWGSLCKGLKLAMPWKNK 168
Query: 223 KLK 225
K++
Sbjct: 169 KMR 171