Miyakogusa Predicted Gene
- Lj3g3v0140690.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v0140690.1 Non Chatacterized Hit- tr|I1NGF9|I1NGF9_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.5134
PE=,45.38,1e-18,seg,NULL; DUF581,Protein of unknown function
DUF581,CUFF.40335.1
(376 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT2G25690.2 | Symbols: | Protein of unknown function (DUF581) |... 94 1e-19
AT2G25690.1 | Symbols: | Protein of unknown function (DUF581) |... 94 1e-19
AT5G11460.1 | Symbols: | Protein of unknown function (DUF581) |... 91 1e-18
AT3G22550.1 | Symbols: | Protein of unknown function (DUF581) |... 82 4e-16
AT3G63210.1 | Symbols: MARD1 | Protein of unknown function (DUF5... 75 9e-14
AT1G79970.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 70 2e-12
AT1G79970.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 70 3e-12
>AT2G25690.2 | Symbols: | Protein of unknown function (DUF581) |
chr2:10940530-10941649 REVERSE LENGTH=324
Length = 324
Score = 94.0 bits (232), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 66/108 (61%), Gaps = 13/108 (12%)
Query: 246 EDYTCIISHGPNPKRTHIFGDCILECHNNDFTEFSKKEETXXXXXXXXXXXXXXXXXXDN 305
EDYTCII+HGPNPK THI+GD +LECH N+ +E DN
Sbjct: 218 EDYTCIIAHGPNPKTTHIYGDRVLECHKNELKGDEDNKEK-----------FGSVFPSDN 266
Query: 306 VLSFCYSCNKKLDEGEDIDMFRGKKAFCCFKCRSEEILV-EEEMEKNC 352
L C CNKKL G+DI M+R +K+FC +CRSEE+++ EE++E+ C
Sbjct: 267 FLGICNFCNKKLGGGDDIYMYR-EKSFCSEECRSEEMMIDEEDLEEPC 313
>AT2G25690.1 | Symbols: | Protein of unknown function (DUF581) |
chr2:10940530-10941649 REVERSE LENGTH=324
Length = 324
Score = 94.0 bits (232), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 66/108 (61%), Gaps = 13/108 (12%)
Query: 246 EDYTCIISHGPNPKRTHIFGDCILECHNNDFTEFSKKEETXXXXXXXXXXXXXXXXXXDN 305
EDYTCII+HGPNPK THI+GD +LECH N+ +E DN
Sbjct: 218 EDYTCIIAHGPNPKTTHIYGDRVLECHKNELKGDEDNKEK-----------FGSVFPSDN 266
Query: 306 VLSFCYSCNKKLDEGEDIDMFRGKKAFCCFKCRSEEILV-EEEMEKNC 352
L C CNKKL G+DI M+R +K+FC +CRSEE+++ EE++E+ C
Sbjct: 267 FLGICNFCNKKLGGGDDIYMYR-EKSFCSEECRSEEMMIDEEDLEEPC 313
>AT5G11460.1 | Symbols: | Protein of unknown function (DUF581) |
chr5:3657064-3658388 REVERSE LENGTH=344
Length = 344
Score = 91.3 bits (225), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 46/114 (40%), Positives = 65/114 (57%), Gaps = 3/114 (2%)
Query: 232 GHIGSLSAKEIELSEDYTCIISHGPNPKRTHIFGDCILECHNNDFTE---FSKKEETXXX 288
G + + ++E+SEDYTC+ISHGPNPK TH +GD ++E + + ++E+
Sbjct: 194 GSLNGGTESDMEISEDYTCVISHGPNPKTTHFYGDQVMESVEREELKNRCCKNEKESIFA 253
Query: 289 XXXXXXXXXXXXXXXDNVLSFCYSCNKKLDEGEDIDMFRGKKAFCCFKCRSEEI 342
+ LSFCY C+KKL GEDI M+ G KAFC +CRS+EI
Sbjct: 254 VAPLDLTTPVDVLPPKDFLSFCYGCSKKLGMGEDIYMYSGYKAFCSSECRSKEI 307
Score = 63.9 bits (154), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 38/94 (40%), Positives = 54/94 (57%), Gaps = 10/94 (10%)
Query: 38 DSESVWSPTSPLDFKLLN-LSNPFSANSSKPFQTGHKKNLDCRKIGLGIVSSLVNETKLS 96
D ES WSPTSPLDF+L + L NPF+A+SS+ G +++ D K+GL IV SLV+
Sbjct: 42 DYESAWSPTSPLDFRLFSTLGNPFAASSSRSIWRGKQRSWDSGKVGLSIVHSLVD----- 96
Query: 97 SETHGDSKRGTIIFGPQVKKT---SLLKFSKNNH 127
+ H DS ++ P K SL++ + H
Sbjct: 97 -DHHTDSSATIVLPSPDSKNIIFGSLMRSGQKPH 129
>AT3G22550.1 | Symbols: | Protein of unknown function (DUF581) |
chr3:7991827-7992805 REVERSE LENGTH=267
Length = 267
Score = 82.4 bits (202), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 43/113 (38%), Positives = 59/113 (52%), Gaps = 9/113 (7%)
Query: 235 GSLSAKEIELSEDYTCIISHGPNPKRTHIFGDCILECHNNDFTEFSKKEETXXXXXXXXX 294
G A ++ELSEDYTC+ HGPNP+ HIF +CI+E S+
Sbjct: 160 GYFPASDMELSEDYTCVTCHGPNPRTIHIFDNCIVE---------SQPGVVFFRSSDPVN 210
Query: 295 XXXXXXXXXDNVLSFCYSCNKKLDEGEDIDMFRGKKAFCCFKCRSEEILVEEE 347
D+ LS C +C K L +DI M+RG +AFC +CRS E+++ EE
Sbjct: 211 ESDSDYSPPDSFLSCCCNCKKSLGPRDDIFMYRGDRAFCSSECRSIEMMMSEE 263
>AT3G63210.1 | Symbols: MARD1 | Protein of unknown function (DUF581)
| chr3:23354019-23354906 REVERSE LENGTH=263
Length = 263
Score = 74.7 bits (182), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 39/110 (35%), Positives = 60/110 (54%), Gaps = 14/110 (12%)
Query: 237 LSAKEIELSEDYTCIISHGPNPKRTHIFGDCILECHNNDFTEFSKKEETXXXXXXXXXXX 296
L+ EI+ +EDYT +ISHGPNP THIF N+ F E +
Sbjct: 165 LAVSEIDQTEDYTRVISHGPNPTITHIFD-------NSVFVEATP-------CSVPLPQP 210
Query: 297 XXXXXXXDNVLSFCYSCNKKLDEGEDIDMFRGKKAFCCFKCRSEEILVEE 346
++ LS C++C K LD+ +DI ++RG+K FC +CR +E+L+++
Sbjct: 211 AMETKSTESFLSRCFTCKKNLDQKQDIYIYRGEKGFCSSECRYQEMLLDQ 260
>AT1G79970.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match
is: Protein of unknown function (DUF581)
(TAIR:AT2G25690.2); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr1:30082773-30083592 FORWARD LENGTH=240
Length = 240
Score = 70.1 bits (170), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 30/41 (73%), Positives = 34/41 (82%)
Query: 236 SLSAKEIELSEDYTCIISHGPNPKRTHIFGDCILECHNNDF 276
SLS +E+ LSEDYTCIISHGPNPK T+IFGDCIL+C D
Sbjct: 144 SLSGREMALSEDYTCIISHGPNPKTTYIFGDCILDCDPKDL 184
>AT1G79970.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match
is: Protein of unknown function (DUF581)
(TAIR:AT2G25690.2); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr1:30082773-30083429 FORWARD LENGTH=218
Length = 218
Score = 69.7 bits (169), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 30/41 (73%), Positives = 34/41 (82%)
Query: 236 SLSAKEIELSEDYTCIISHGPNPKRTHIFGDCILECHNNDF 276
SLS +E+ LSEDYTCIISHGPNPK T+IFGDCIL+C D
Sbjct: 144 SLSGREMALSEDYTCIISHGPNPKTTYIFGDCILDCDPKDL 184