Miyakogusa Predicted Gene
- Lj0g3v0168269.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0168269.1 Non Chatacterized Hit- tr|B9S2Q4|B9S2Q4_RICCO
Putative uncharacterized protein OS=Ricinus communis
G,27.27,1e-18,seg,NULL,CUFF.10542.1
(331 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G44860.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 129 2e-30
AT1G31130.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 127 9e-30
AT4G19950.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 120 1e-27
AT1G69430.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 64 1e-10
AT1G26650.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 58 9e-09
>AT5G44860.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G19950.1); Has 233 Blast hits to 227 proteins
in 25 species: Archae - 0; Bacteria - 13; Metazoa - 1;
Fungi - 0; Plants - 216; Viruses - 0; Other Eukaryotes -
3 (source: NCBI BLink). | chr5:18110688-18111653 REVERSE
LENGTH=321
Length = 321
Score = 129 bits (325), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 100/327 (30%), Positives = 160/327 (48%), Gaps = 18/327 (5%)
Query: 1 MERDQEKMQFLGFFGVFKESYMIM-FSWRKXXXXXXXXXXXXXXXXXXXNMEVSQVMFGG 59
M+ E++QFL G+ +ES I FS + +S +
Sbjct: 1 MDLAAEELQFLNIQGILRESTTIPKFSPKTFYLITLTLI-----------FPLSFAILAH 49
Query: 60 ILFYSERISRTKFGTPHQKRLINTISIEWASFIIFKLIYFVFFLIFSLVSTSAVVHTIAS 119
LF +++ P + N EW +I++ IY +F FSL+ST+AVV T+AS
Sbjct: 50 SLFTQPILAQLDATPPSDQSKTNH---EWTLLLIYQFIYVIFLFAFSLLSTAAVVFTVAS 106
Query: 120 IYTSREVTLKKAMGAVAKVWKSLVITFLCTFAAFFIYNVV--AGIVLFIWALTIGENIYG 177
+YT + V+ M A+ V K L ITFL +YN V +V+ I A+ + I
Sbjct: 107 LYTGKPVSFSSTMSAIPLVLKRLFITFLWVSLMMLVYNSVFLLFLVVLIVAIDLQSVILA 166
Query: 178 MATLLVIGILYIVGFIYLAAVWQLANVVSMLEDSSGIKAMMKSNELIKGKKGLSLIITFM 237
+ +++VI +L++ +Y+ A W LA+VVS+LE GI AM KS EL+ G+ ++ + FM
Sbjct: 167 VFSMVVIFVLFLGVHVYMTAWWHLASVVSVLEPIYGIAAMKKSYELLNGRTNMACSMVFM 226
Query: 238 -LSVFFFLIQFLFSMVLLRLFHVGLVWKIASGVXXXXXXXXXXXXXXVIQTVLYFDCKSY 296
L++ +V+ GL KI G ++Q+V Y+ CKS+
Sbjct: 227 YLALCGITAGVFGGVVVHGGDDFGLFTKIVVGGFLVGILVIVNLVGLLVQSVFYYVCKSF 286
Query: 297 HHEIIGKSALSEQLEVYQGEHELVKAT 323
HH+ I KSAL + L Y G++ +K++
Sbjct: 287 HHQPIDKSALHDHLGGYLGDYVPLKSS 313
>AT1G31130.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 23 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT4G19950.1); Has 246 Blast hits to 244 proteins
in 29 species: Archae - 2; Bacteria - 16; Metazoa - 0;
Fungi - 0; Plants - 222; Viruses - 0; Other Eukaryotes -
6 (source: NCBI BLink). | chr1:11114963-11115928 REVERSE
LENGTH=321
Length = 321
Score = 127 bits (320), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 86/259 (33%), Positives = 137/259 (52%), Gaps = 13/259 (5%)
Query: 71 KFGTPHQKRLINTISIEWASFIIFKLIYFVFFLIFSLVSTSAVVHTIASIYTSREVTLKK 130
K P+ R + +W +IF+ Y +F FSL+ST+AVV T+AS+YT + V+
Sbjct: 62 KSDPPNSDRSRH----DWTVLLIFQFSYLIFLFAFSLLSTAAVVFTVASLYTGKPVSFSS 117
Query: 131 AMGAVAKVWKSLVITFLCTFAAFFIYNVVAGIVLFIWALTIGENIYGMATL--LVIGILY 188
+ A+ KV+K L ITFL F YN V + L + + + N G+A + ++I +LY
Sbjct: 118 TLSAIPKVFKRLFITFLWVALLMFAYNAVFFVFLVMLLVALDLNSLGLAIVAGVIISVLY 177
Query: 189 IVGFIYLAAVWQLANVVSMLEDSSGIKAMMKSNELIKGKKGLSLIITFMLSVFFFLIQFL 248
+Y A+W L +V+S+LE GI AM K+ EL+KGK +++ + F+ LI +
Sbjct: 178 FGVHVYFTALWHLGSVISVLEPVYGIAAMRKAYELLKGKTKMAMGLIFVYLFLCGLIGVV 237
Query: 249 FSMVLLRLFH----VGLVWKIASGVXXXXXXXXXXXXXXVIQTVLYFDCKSYHHEIIGKS 304
F V++ H G + G ++Q+V Y+ CKSYHH+ I K+
Sbjct: 238 FGAVVV---HGGGKYGTFTRTLVGGLLVGVLVMVNLVGLLVQSVFYYVCKSYHHQTIDKT 294
Query: 305 ALSEQLEVYQGEHELVKAT 323
AL +QL Y G++ +K+
Sbjct: 295 ALYDQLGGYLGDYVPLKSN 313
>AT4G19950.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G44860.1); Has 338 Blast hits to 330 proteins
in 72 species: Archae - 2; Bacteria - 94; Metazoa - 7;
Fungi - 0; Plants - 232; Viruses - 0; Other Eukaryotes -
3 (source: NCBI BLink). | chr4:10809977-10810942 FORWARD
LENGTH=321
Length = 321
Score = 120 bits (301), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 102/326 (31%), Positives = 160/326 (49%), Gaps = 16/326 (4%)
Query: 1 MERDQEKMQFLGFFGVFKESYMIMFSWRKXXXXXXXXXXXXXXXXXXXNMEVSQVMFGGI 60
M+ E++QFL G+ +ES I K + +Q + I
Sbjct: 1 MDLAPEELQFLNKRGILRESTSIPQYSLKTFYLITLTLIFPLSFAILAHSLFTQPILAQI 60
Query: 61 LFYSERISRTKFGTPHQKRLINTISIEWASFIIFKLIYFVFFLIFSLVSTSAVVHTIASI 120
Y + Q +L + EW ++F+ Y +F FSL+ST+AVV T+AS+
Sbjct: 61 DTYPQA---------DQSQLQH----EWTVLLVFQFCYIIFLFAFSLLSTAAVVFTVASL 107
Query: 121 YTSREVTLKKAMGAVAKVWKSLVITFLCTFAAFFIYNVVAGI--VLFIWALTIGENIYGM 178
YT + V+ M A+ V K L ITFL YN V I V I A+ + + +
Sbjct: 108 YTGKPVSFSSTMSAIPLVLKRLFITFLWVSLLMLAYNTVFLIFLVTLIVAVDLQNVVLAV 167
Query: 179 ATLLVIGILYIVGFIYLAAVWQLANVVSMLEDSSGIKAMMKSNELIKGKKGLSLIITFML 238
+L+VI +L++V +Y+ A+W LA+VVS+LE G+ AM KS EL+KGK ++ + F+
Sbjct: 168 FSLVVIFVLFLVVHVYMTALWHLASVVSVLEPIYGLAAMKKSYELLKGKTLMACSMVFIY 227
Query: 239 SVFFFLIQFLFSMVLLR-LFHVGLVWKIASGVXXXXXXXXXXXXXXVIQTVLYFDCKSYH 297
V I +F V++R G+ +I +G ++Q+V Y+ CKS+H
Sbjct: 228 LVHCGFIAGVFGAVVVRGGDDYGIFARIVAGGFLVGVLVIVNLIGLLVQSVFYYVCKSFH 287
Query: 298 HEIIGKSALSEQLEVYQGEHELVKAT 323
H+ I KSAL + L Y GE+ +K+
Sbjct: 288 HQEIDKSALHDHLGGYLGEYVPLKSN 313
>AT1G69430.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G26650.1); Has 216 Blast hits to 215 proteins
in 17 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 216; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr1:26098025-26099077 FORWARD
LENGTH=350
Length = 350
Score = 63.9 bits (154), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 53/210 (25%), Positives = 104/210 (49%), Gaps = 10/210 (4%)
Query: 99 FVFFLIFSLVSTSAVVHTIASIYTSREVTLKKAMGAVAKVWKSLVITFL--CT-----FA 151
F F+ SL+S +AVV+++ Y+ ++V + K + + ++WK LVIT+L CT
Sbjct: 127 FPLFITLSLLSRAAVVYSVDCTYSRKKVVVTKFVVIMQRLWKRLVITYLWICTVIVVCLT 186
Query: 152 AFFIYNVVAGIVLFIWALTIGENIYGMATLLVIGILYIVGFIYLAAVWQLANVVSMLEDS 211
+F ++ V ++ + N YG +++G+++ V F + V+S+LED
Sbjct: 187 SFCVFLVAVCSSFYVLGFSPDFNAYGA---ILVGLVFSVVFANAIIICNTTIVISILEDV 243
Query: 212 SGIKAMMKSNELIKGKKGLSLIITFMLSVFFFLIQFLFSMVLLRLFHVGLVWKIASGVXX 271
SG A++++++LIKG+ + L+I ++ ++ LF + L + ++ G
Sbjct: 244 SGPGALVRASDLIKGQTQVGLLIFLGSTIGLTFVEGLFEHRVKSLSYGDGSSRLWEGPLL 303
Query: 272 XXXXXXXXXXXXVIQTVLYFDCKSYHHEII 301
++ V YF C+SY E +
Sbjct: 304 VVMYSFVVLIDTMMSAVFYFSCRSYSMEAV 333
>AT1G26650.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G69430.1); Has 205 Blast hits to 204 proteins
in 17 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 205; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr1:9210335-9211342 FORWARD
LENGTH=335
Length = 335
Score = 57.8 bits (138), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 58/258 (22%), Positives = 120/258 (46%), Gaps = 22/258 (8%)
Query: 73 GTPHQKRLINTISIEWASFIIFKLIYFVFFLIFSLVSTSAVVHTIASIYTSREVTLKKAM 132
G P Q + ++ ++A + + F F+ SL+S +AVV+++ Y+ V + K +
Sbjct: 86 GLPLQPFVKHSCQ-KFAETAVSSAMCFPVFITVSLLSKAAVVYSVDCSYSREVVDISKFL 144
Query: 133 GAVAKVWKSLVITF--LCTFA----AFFIYNVVAGIVLF-IWALTIGENIYGMATLLVIG 185
+ K+W+ +V T+ +C FF +VA F + + N+YG +++G
Sbjct: 145 VILQKIWRRVVFTYVWICILIVGCFTFFCVLLVAICSSFSVLGFSPDFNVYGA---MLVG 201
Query: 186 ILYIVGFIYLAAVWQLANVVSMLEDSSGIKAMMKSNELIKGKKGLSLIITFMLSVFFFLI 245
+ + V F + A V+S+LED SG+ A+M++++LIKG+ + L++ ++ +
Sbjct: 202 LAFSVVFANAIIICNTAIVISVLEDVSGLGALMRASDLIKGQIQVGLLMFLGSTLGLAFV 261
Query: 246 QFLFSMVLLRLFHVGLVWKIASGVXXXXXXXXXXXXXXVIQTVLYFDCKSYHHEIIGKSA 305
+ LF + ++ + ++ G ++ V YF C+ Y+
Sbjct: 262 EGLFDHRVKKVSYGDGSSRLWEGPLLVLMYSFVTLIDSMMSAVFYFSCRVYY-------- 313
Query: 306 LSEQLEVYQGEHELVKAT 323
+E +GE + + T
Sbjct: 314 ---SMEASRGETQPIMET 328