Miyakogusa Predicted Gene
- Lj1g3v1525910.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v1525910.1 Non Characterized Hit- tr|K3YV18|K3YV18_SETIT
Uncharacterized protein OS=Setaria italica
GN=Si018114,42.65,1e-17,seg,NULL; coiled-coil,NULL,CUFF.27434.1
(325 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value
AT4G09060.1 | Symbols: | unknown protein; LOCATED IN: membrane;...   247  9e-66
AT4G09060.2 | Symbols: | unknown protein; BEST Arabidopsis thal...   243  2e-64
AT1G14680.1 | Symbols: | unknown protein; BEST Arabidopsis thal...   205  4e-53
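The hit summary above can be read programmatically. The following is a minimal sketch, not part of the BLAST output: it assumes each summary line ends with the bit score and the E-value as its last two whitespace-separated fields, and the helper name parse_hit_summary is hypothetical.

```python
def parse_hit_summary(line):
    """Split one hit-summary line into (description, bit_score, e_value).

    Assumption: the last two whitespace-separated fields are the bit
    score and the E-value; everything before them is the description.
    """
    description, bits, evalue = line.rsplit(None, 2)
    # Legacy BLAST may print very small E-values without a leading
    # digit (e.g. "e-100"); prepend "1" so float() accepts them.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return description.rstrip(". "), float(bits), float(evalue)


# The three summary lines from this report.
lines = [
    "AT4G09060.1 | Symbols: | unknown protein; LOCATED IN: membrane;... 247 9e-66",
    "AT4G09060.2 | Symbols: | unknown protein; BEST Arabidopsis thal... 243 2e-64",
    "AT1G14680.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 205 4e-53",
]

hits = [parse_hit_summary(line) for line in lines]
for description, bits, evalue in hits:
    # The subject identifier is the text before the first " | ".
    print(description.split(" | ")[0], bits, evalue)
```

Splitting from the right keeps the description intact even though it contains spaces and "|" separators of its own.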
>AT4G09060.1 | Symbols: | unknown protein; LOCATED IN: membrane;
EXPRESSED IN: leaf; BEST Arabidopsis thaliana protein
match is: unknown protein (TAIR:AT1G14680.1); Has 8168
Blast hits to 6036 proteins in 601 species: Archae -
151; Bacteria - 570; Metazoa - 4684; Fungi - 544; Plants
- 345; Viruses - 91; Other Eukaryotes - 1783 (source:
NCBI BLink). | chr4:5797781-5800253 REVERSE LENGTH=341
Length = 341
Score = 247 bits (630), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 145/324 (44%), Positives = 210/324 (64%), Gaps = 6/324 (1%)
Query: 2 DLPPHIENYIKQTIDDTLGISVPPQTLAAKLRATQEXXXXXXXXXXXXXAKLNHKDQLVQ 61
+L +E YIK TID +LG+ + ++L KL +E ++L KD ++
Sbjct: 18 NLSQEVEEYIKDTIDHSLGLPISMESLQKKLYTAEESQRRLREQYQGLISRLKEKDHVID 77
Query: 62 RLKSESDMNARALKKFVEENNRLASECENLVAECQKWEHEIALYEHDRDALMEFGNEADE 121
R++SE+ MNA+ALKKFVEEN +LASEC NL+++C+K E E LY DRDALMEFGNE+DE
Sbjct: 78 RVRSEASMNAQALKKFVEENQKLASECGNLLSQCKKLEKECLLYHQDRDALMEFGNESDE 137
Query: 122 RARESQSRVXXXXXXXXXXXXXXXXYKHH-HESVDSSSGTPVEENLLDSVLATFISNDDD 180
RARE+++RV +K + + TP+EE+LLDSVL + IS D+
Sbjct: 138 RAREAEARVRELEDEIGRMSEEMQRFKRQIGDGEVENCTTPLEEDLLDSVLGSLISKDET 197
Query: 181 -SAYAFLVANSGNESCKQLLSMWNCIKPSTRSVLSLIAKVKSLEKDKEHLTTNLHKAEEE 239
FL AN ++SC+ LLS W+ +KPST+ VLSL++K K EK+KE + NL KAE+E
Sbjct: 198 IMGRLFLEANIHDQSCQALLSKWDHLKPSTQKVLSLVSKAKKFEKEKECIIMNLAKAEQE 257
Query: 240 VKLLFEENIILDEKNKKILKQLKEGKHP-GSGGKHTSSASAKSNKRKTSPRTTSPMERKI 298
V+L+ N LD++N+K+L+Q + P S K+ +SASAKSNKRK+ +SP+E+++
Sbjct: 258 VELVSTLNRKLDKENRKLLRQ---QQSPLCSADKNRNSASAKSNKRKSPKTMSSPVEKRL 314
Query: 299 DFGELESARQPLSPLRHNSPDCRM 322
+F E +R+PLSP+ +NSPD +M
Sbjct: 315 EFSSPEISRKPLSPVWNNSPDSKM 338
>AT4G09060.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G14680.1). | chr4:5797781-5800253 REVERSE
LENGTH=367
Length = 367
Score = 243 bits (619), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 145/349 (41%), Positives = 213/349 (61%), Gaps = 30/349 (8%)
Query: 2 DLPPHIENYIKQTIDDTLGISVPPQTLAAKLRATQEXXXXXXXXXXXXXAKLNHKDQLVQ 61
+L +E YIK TID +LG+ + ++L KL +E ++L KD ++
Sbjct: 18 NLSQEVEEYIKDTIDHSLGLPISMESLQKKLYTAEESQRRLREQYQGLISRLKEKDHVID 77
Query: 62 RLKSESDMNARALKKFVEENNRLASECENLVAECQKWEHEIALYEHDRDALMEFGNEADE 121
R++SE+ MNA+ALKKFVEEN +LASEC NL+++C+K E E LY DRDALMEFGNE+DE
Sbjct: 78 RVRSEASMNAQALKKFVEENQKLASECGNLLSQCKKLEKECLLYHQDRDALMEFGNESDE 137
Query: 122 RARESQSRVXXXXXXXXXXXXXXXXYK---------------------------HHHESV 154
RARE+++RV +K ++H ++
Sbjct: 138 RAREAEARVRELEDEIGRMSEEMQRFKRQIGDGEKLERSVRVNISKKMYLLAATYNHFNI 197
Query: 155 DSSSGTPVEENLLDSVLATFISNDDD-SAYAFLVANSGNESCKQLLSMWNCIKPSTRSVL 213
+ TP+EE+LLDSVL + IS D+ FL AN ++SC+ LLS W+ +KPST+ VL
Sbjct: 198 VENCTTPLEEDLLDSVLGSLISKDETIMGRLFLEANIHDQSCQALLSKWDHLKPSTQKVL 257
Query: 214 SLIAKVKSLEKDKEHLTTNLHKAEEEVKLLFEENIILDEKNKKILKQLKEGKHPGSGGKH 273
SL++K K EK+KE + NL KAE+EV+L+ N LD++N+K+L+Q + S K+
Sbjct: 258 SLVSKAKKFEKEKECIIMNLAKAEQEVELVSTLNRKLDKENRKLLRQ--QQSPLCSADKN 315
Query: 274 TSSASAKSNKRKTSPRTTSPMERKIDFGELESARQPLSPLRHNSPDCRM 322
+SASAKSNKRK+ +SP+E++++F E +R+PLSP+ +NSPD +M
Sbjct: 316 RNSASAKSNKRKSPKTMSSPVEKRLEFSSPEISRKPLSPVWNNSPDSKM 364
>AT1G14680.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G09060.1); Has 8132 Blast hits to 6366 proteins
in 685 species: Archae - 171; Bacteria - 671; Metazoa -
4046; Fungi - 445; Plants - 318; Viruses - 23; Other
Eukaryotes - 2458 (source: NCBI BLink). |
chr1:5040861-5041733 FORWARD LENGTH=290
Length = 290
Score = 205 bits (521), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 118/304 (38%), Positives = 183/304 (60%), Gaps = 23/304 (7%)
Query: 1 MDLPPHIENYIKQTIDDTLGISVPPQTLAAKLRATQEXXXXXXXXXXXXXAKLNHKDQLV 60
MDL ++++IK+TID +LG+ + L KL +E ++L K++++
Sbjct: 1 MDLSREVDDFIKETIDHSLGLPISMDVLQKKLYTAEESQRRLREQYLSLVSRLKEKEKVI 60
Query: 61 QRLKSESDMNARALKKFVEENNRLASECENLVAECQKWEHEIALYEHDRDALMEFGNEAD 120
++SE+ MNA++LKKFVEEN +L SE E+LV +C+KWE E LY DR++LMEFGNE D
Sbjct: 61 DLVRSEASMNAQSLKKFVEENQKLGSEREDLVNQCKKWEKECFLYHQDRESLMEFGNETD 120
Query: 121 ERARESQSRVXXXXXXXXXXXXXXXXYKHHHESVDSSSGTPVEENLLDSVLATFISNDDD 180
ERARE++SRV + + + S + E+ L+DS+LA+F+S D+
Sbjct: 121 ERAREAESRV----------RELEEEVRKMSDEIKSRIESE-EDCLVDSILASFVSKDES 169
Query: 181 SAYA--FLVANS-GNESCKQLLSMWNCIKPSTRSVLSLIAKVKSLEKDKEHLTTNLHKAE 237
+ FL ANS ++ C+ LL W+ +KP+T+ +SL++ VK +EK+KE L NL KAE
Sbjct: 170 ISLGRIFLEANSEEDKCCETLLRKWDQLKPTTQKFVSLVSMVKRIEKEKECLIMNLAKAE 229
Query: 238 EEVKLLFEENIILDEKNKKILKQLKEGKHPGSGGKHTSSASAKSNKRKTSPRTTSPMERK 297
+EV+L+ E+N LD +N+K L+Q + GS K NKRK+ +SP+E++
Sbjct: 230 QEVELVSEQNRELDRENRKFLRQCSAERSHGSN---------KFNKRKSIKMMSSPIEKR 280
Query: 298 IDFG 301
I+
Sbjct: 281 IELS 284
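Each hit above carries a "Score = ..." line and an "Identities = ..." line. The sketch below is an illustration only, with hypothetical names (SCORE_RE, IDENT_RE, parse_stats). It extracts those values and checks that the percentages printed in this report agree with integer (floor) division of the counts.

```python
import re

# Patterns written against the two statistics lines as they appear in this report.
SCORE_RE = re.compile(r"Score =\s*([\d.]+) bits \((\d+)\),\s*Expect = ([^,\s]+)")
IDENT_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(score_line, ident_line):
    """Return a dict with the bit score, raw score, E-value and count triples."""
    match = SCORE_RE.search(score_line)
    stats = {
        "bits": float(match.group(1)),
        "raw": int(match.group(2)),
        "expect": float(match.group(3)),
    }
    for name, num, den, pct in IDENT_RE.findall(ident_line):
        # Store (matching columns, alignment length, reported percent).
        stats[name.lower()] = (int(num), int(den), int(pct))
    return stats


# Statistics lines of the three hits, copied from the report.
records = [
    ("Score = 247 bits (630), Expect = 9e-66, Method: Compositional matrix adjust.",
     "Identities = 145/324 (44%), Positives = 210/324 (64%), Gaps = 6/324 (1%)"),
    ("Score = 243 bits (619), Expect = 2e-64, Method: Compositional matrix adjust.",
     "Identities = 145/349 (41%), Positives = 213/349 (61%), Gaps = 30/349 (8%)"),
    ("Score = 205 bits (521), Expect = 4e-53, Method: Compositional matrix adjust.",
     "Identities = 118/304 (38%), Positives = 183/304 (60%), Gaps = 23/304 (7%)"),
]

parsed = [parse_stats(score, ident) for score, ident in records]
for stats in parsed:
    for key in ("identities", "positives", "gaps"):
        num, den, pct = stats[key]
        # Floor division reproduces the percentage printed in the report.
        assert 100 * num // den == pct
    print(stats["bits"], stats["expect"], stats["identities"])
```

Note that AT4G09060.1 and AT4G09060.2 share the same 145 identical positions. The lower percent identity of the .2 isoform comes from its longer alignment (349 columns, 30 of them gaps).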