Miyakogusa Predicted Gene
- Lj4g3v0244110.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v0244110.1 Non Chatacterized Hit- tr|D5A9U7|D5A9U7_PICSI
Putative uncharacterized protein OS=Picea sitchensis
P,37.63,2e-18,seg,NULL,CUFF.46748.1
(320 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G44860.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 379 e-105
AT4G19950.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 375 e-104
AT1G31130.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 351 4e-97
AT1G26650.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 49 6e-06
AT4G16850.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 48 7e-06
>AT5G44860.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G19950.1); Has 233 Blast hits to 227 proteins
in 25 species: Archae - 0; Bacteria - 13; Metazoa - 1;
Fungi - 0; Plants - 216; Viruses - 0; Other Eukaryotes -
3 (source: NCBI BLink). | chr5:18110688-18111653 REVERSE
LENGTH=321
Length = 321
Score = 379 bits (974), Expect = e-105, Method: Compositional matrix adjust.
Identities = 201/321 (62%), Positives = 228/321 (71%), Gaps = 1/321 (0%)
Query: 1 MDLAPEELQFXXXXXXXXXXXXXPKRSPKTFYLITLTLIFPLSFAILAHSLFTHPIISQL 60
MDLA EELQF PK SPKTFYLITLTLIFPLSFAILAHSLFT PI++QL
Sbjct: 1 MDLAAEELQFLNIQGILRESTTIPKFSPKTFYLITLTLIFPLSFAILAHSLFTQPILAQL 60
Query: 61 Q-SPFDDPSQTTHEWTLLLLIQFCYLIFLFAFSLLSTAAVVFTVASLYTSKPVSFSYTIS 119
+P D S+T HEWTLLL+ QF Y+IFLFAFSLLSTAAVVFTVASLYT KPVSFS T+S
Sbjct: 61 DATPPSDQSKTNHEWTLLLIYQFIYVIFLFAFSLLSTAAVVFTVASLYTGKPVSFSSTMS 120
Query: 120 AIPKVFKRLFITFLWVSLLMIIYNFVFVXXXXXXXXXXDTQNSVLMLFSVLAMLVLFIGV 179
AIP V KRLFITFLWVSL+M++YN VF+ D Q+ +L +FS++ + VLF+GV
Sbjct: 121 AIPLVLKRLFITFLWVSLMMLVYNSVFLLFLVVLIVAIDLQSVILAVFSMVVIFVLFLGV 180
Query: 180 HVYITALWHLASVVSVLEPLYGFAAMKKSYELLKGRVRYAAVLVCGYLVICGIIGGAFSX 239
HVY+TA WHLASVVSVLEP+YG AAMKKSYELL GR A +V YL +CGI G F
Sbjct: 181 HVYMTAWWHLASVVSVLEPIYGIAAMKKSYELLNGRTNMACSMVFMYLALCGITAGVFGG 240
Query: 240 XXXXXXXXXXXFSRIXXXXXXXXXXXXXXXXXXXXQSVFYYVCKSYHHQGIDKSALHDHL 299
F++I QSVFYYVCKS+HHQ IDKSALHDHL
Sbjct: 241 VVVHGGDDFGLFTKIVVGGFLVGILVIVNLVGLLVQSVFYYVCKSFHHQPIDKSALHDHL 300
Query: 300 GGYLGEYVPLKSSIQMENLDV 320
GGYLG+YVPLKSSIQMEN D+
Sbjct: 301 GGYLGDYVPLKSSIQMENFDI 321
>AT4G19950.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G44860.1); Has 338 Blast hits to 330 proteins
in 72 species: Archae - 2; Bacteria - 94; Metazoa - 7;
Fungi - 0; Plants - 232; Viruses - 0; Other Eukaryotes -
3 (source: NCBI BLink). | chr4:10809977-10810942 FORWARD
LENGTH=321
Length = 321
Score = 375 bits (963), Expect = e-104, Method: Compositional matrix adjust.
Identities = 204/321 (63%), Positives = 228/321 (71%), Gaps = 1/321 (0%)
Query: 1 MDLAPEELQFXXXXXXXXXXXXXPKRSPKTFYLITLTLIFPLSFAILAHSLFTHPIISQL 60
MDLAPEELQF P+ S KTFYLITLTLIFPLSFAILAHSLFT PI++Q+
Sbjct: 1 MDLAPEELQFLNKRGILRESTSIPQYSLKTFYLITLTLIFPLSFAILAHSLFTQPILAQI 60
Query: 61 QS-PFDDPSQTTHEWTLLLLIQFCYLIFLFAFSLLSTAAVVFTVASLYTSKPVSFSYTIS 119
+ P D SQ HEWT+LL+ QFCY+IFLFAFSLLSTAAVVFTVASLYT KPVSFS T+S
Sbjct: 61 DTYPQADQSQLQHEWTVLLVFQFCYIIFLFAFSLLSTAAVVFTVASLYTGKPVSFSSTMS 120
Query: 120 AIPKVFKRLFITFLWVSLLMIIYNFVFVXXXXXXXXXXDTQNSVLMLFSVLAMLVLFIGV 179
AIP V KRLFITFLWVSLLM+ YN VF+ D QN VL +FS++ + VLF+ V
Sbjct: 121 AIPLVLKRLFITFLWVSLLMLAYNTVFLIFLVTLIVAVDLQNVVLAVFSLVVIFVLFLVV 180
Query: 180 HVYITALWHLASVVSVLEPLYGFAAMKKSYELLKGRVRYAAVLVCGYLVICGIIGGAFSX 239
HVY+TALWHLASVVSVLEP+YG AAMKKSYELLKG+ A +V YLV CG I G F
Sbjct: 181 HVYMTALWHLASVVSVLEPIYGLAAMKKSYELLKGKTLMACSMVFIYLVHCGFIAGVFGA 240
Query: 240 XXXXXXXXXXXFSRIXXXXXXXXXXXXXXXXXXXXQSVFYYVCKSYHHQGIDKSALHDHL 299
F+RI QSVFYYVCKS+HHQ IDKSALHDHL
Sbjct: 241 VVVRGGDDYGIFARIVAGGFLVGVLVIVNLIGLLVQSVFYYVCKSFHHQEIDKSALHDHL 300
Query: 300 GGYLGEYVPLKSSIQMENLDV 320
GGYLGEYVPLKS+IQMEN +V
Sbjct: 301 GGYLGEYVPLKSNIQMENFEV 321
>AT1G31130.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 23 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT4G19950.1); Has 246 Blast hits to 244 proteins
in 29 species: Archae - 2; Bacteria - 16; Metazoa - 0;
Fungi - 0; Plants - 222; Viruses - 0; Other Eukaryotes -
6 (source: NCBI BLink). | chr1:11114963-11115928 REVERSE
LENGTH=321
Length = 321
Score = 351 bits (900), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 185/322 (57%), Positives = 221/322 (68%), Gaps = 3/322 (0%)
Query: 1 MDLAPEELQFXXXXXXXXXXXXXPKRSPKTFYLITLTLIFPLSFAILAHSLFTHPIISQL 60
MDL PEELQF KRSP+TFYLITL+ IFPLSFAILAHSLFT PI+++L
Sbjct: 1 MDLQPEELQFLTIPQLLQESISIKKRSPRTFYLITLSFIFPLSFAILAHSLFTQPILAKL 60
Query: 61 QSPFDDPS--QTTHEWTLLLLIQFCYLIFLFAFSLLSTAAVVFTVASLYTSKPVSFSYTI 118
D P+ ++ H+WT+LL+ QF YLIFLFAFSLLSTAAVVFTVASLYT KPVSFS T+
Sbjct: 61 DKS-DPPNSDRSRHDWTVLLIFQFSYLIFLFAFSLLSTAAVVFTVASLYTGKPVSFSSTL 119
Query: 119 SAIPKVFKRLFITFLWVSLLMIIYNFVFVXXXXXXXXXXDTQNSVLMLFSVLAMLVLFIG 178
SAIPKVFKRLFITFLWV+LLM YN VF D + L + + + + VL+ G
Sbjct: 120 SAIPKVFKRLFITFLWVALLMFAYNAVFFVFLVMLLVALDLNSLGLAIVAGVIISVLYFG 179
Query: 179 VHVYITALWHLASVVSVLEPLYGFAAMKKSYELLKGRVRYAAVLVCGYLVICGIIGGAFS 238
VHVY TALWHL SV+SVLEP+YG AAM+K+YELLKG+ + A L+ YL +CG+IG F
Sbjct: 180 VHVYFTALWHLGSVISVLEPVYGIAAMRKAYELLKGKTKMAMGLIFVYLFLCGLIGVVFG 239
Query: 239 XXXXXXXXXXXXFSRIXXXXXXXXXXXXXXXXXXXXQSVFYYVCKSYHHQGIDKSALHDH 298
F+R QSVFYYVCKSYHHQ IDK+AL+D
Sbjct: 240 AVVVHGGGKYGTFTRTLVGGLLVGVLVMVNLVGLLVQSVFYYVCKSYHHQTIDKTALYDQ 299
Query: 299 LGGYLGEYVPLKSSIQMENLDV 320
LGGYLG+YVPLKS+IQ+E+LD+
Sbjct: 300 LGGYLGDYVPLKSNIQLEDLDI 321
>AT1G26650.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G69430.1); Has 205 Blast hits to 204 proteins
in 17 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 205; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr1:9210335-9211342 FORWARD
LENGTH=335
Length = 335
Score = 48.5 bits (114), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 46/168 (27%), Positives = 83/168 (49%), Gaps = 23/168 (13%)
Query: 83 CYLIFLFAFSLLSTAAVVFTVASLYTSKPVSFSYTISAIPKVFKRLFITFLWVSLLMIIY 142
C+ +F+ SLLS AAVV++V Y+ + V S + + K+++R+ T++W+ +L I+
Sbjct: 110 CFPVFI-TVSLLSKAAVVYSVDCSYSREVVDISKFLVILQKIWRRVVFTYVWICIL-IVG 167
Query: 143 NFVFVXXXXXXXXXXDTQNSVLMLFSVL----------AMLVLFIGVHVYITA--LWHLA 190
F F ++ FSVL AMLV V+ A + + A
Sbjct: 168 CFTFFCVLLV---------AICSSFSVLGFSPDFNVYGAMLVGLAFSVVFANAIIICNTA 218
Query: 191 SVVSVLEPLYGFAAMKKSYELLKGRVRYAAVLVCGYLVICGIIGGAFS 238
V+SVLE + G A+ ++ +L+KG+++ ++ G + + G F
Sbjct: 219 IVISVLEDVSGLGALMRASDLIKGQIQVGLLMFLGSTLGLAFVEGLFD 266
>AT4G16850.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G31130.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr4:9480699-9481640 FORWARD LENGTH=313
Length = 313
Score = 48.1 bits (113), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 35/55 (63%), Gaps = 1/55 (1%)
Query: 174 VLFIGVHVYITALWHLASVVSVLEPLYGFAAMKKSYELLKGRVRYAAVLVCGYLV 228
+LF+GV VYI A+ L VVSVLE YGF A+K+ L+KGR R + + G V
Sbjct: 178 ILFLGVEVYIMAITGLGFVVSVLEERYGFDAIKEGTALMKGR-RITGLALAGVFV 231