Miyakogusa Predicted Gene

Lj4g3v0244110.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj4g3v0244110.1 Non Chatacterized Hit- tr|D5A9U7|D5A9U7_PICSI
Putative uncharacterized protein OS=Picea sitchensis
P,37.63,2e-18,seg,NULL,CUFF.46748.1
         (320 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G44860.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   379   e-105
AT4G19950.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   375   e-104
AT1G31130.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   351   4e-97
AT1G26650.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    49   6e-06
AT4G16850.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    48   7e-06

>AT5G44860.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G19950.1); Has 233 Blast hits to 227 proteins
           in 25 species: Archae - 0; Bacteria - 13; Metazoa - 1;
           Fungi - 0; Plants - 216; Viruses - 0; Other Eukaryotes -
           3 (source: NCBI BLink). | chr5:18110688-18111653 REVERSE
           LENGTH=321
          Length = 321

 Score =  379 bits (974), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 201/321 (62%), Positives = 228/321 (71%), Gaps = 1/321 (0%)

Query: 1   MDLAPEELQFXXXXXXXXXXXXXPKRSPKTFYLITLTLIFPLSFAILAHSLFTHPIISQL 60
           MDLA EELQF             PK SPKTFYLITLTLIFPLSFAILAHSLFT PI++QL
Sbjct: 1   MDLAAEELQFLNIQGILRESTTIPKFSPKTFYLITLTLIFPLSFAILAHSLFTQPILAQL 60

Query: 61  Q-SPFDDPSQTTHEWTLLLLIQFCYLIFLFAFSLLSTAAVVFTVASLYTSKPVSFSYTIS 119
             +P  D S+T HEWTLLL+ QF Y+IFLFAFSLLSTAAVVFTVASLYT KPVSFS T+S
Sbjct: 61  DATPPSDQSKTNHEWTLLLIYQFIYVIFLFAFSLLSTAAVVFTVASLYTGKPVSFSSTMS 120

Query: 120 AIPKVFKRLFITFLWVSLLMIIYNFVFVXXXXXXXXXXDTQNSVLMLFSVLAMLVLFIGV 179
           AIP V KRLFITFLWVSL+M++YN VF+          D Q+ +L +FS++ + VLF+GV
Sbjct: 121 AIPLVLKRLFITFLWVSLMMLVYNSVFLLFLVVLIVAIDLQSVILAVFSMVVIFVLFLGV 180

Query: 180 HVYITALWHLASVVSVLEPLYGFAAMKKSYELLKGRVRYAAVLVCGYLVICGIIGGAFSX 239
           HVY+TA WHLASVVSVLEP+YG AAMKKSYELL GR   A  +V  YL +CGI  G F  
Sbjct: 181 HVYMTAWWHLASVVSVLEPIYGIAAMKKSYELLNGRTNMACSMVFMYLALCGITAGVFGG 240

Query: 240 XXXXXXXXXXXFSRIXXXXXXXXXXXXXXXXXXXXQSVFYYVCKSYHHQGIDKSALHDHL 299
                      F++I                    QSVFYYVCKS+HHQ IDKSALHDHL
Sbjct: 241 VVVHGGDDFGLFTKIVVGGFLVGILVIVNLVGLLVQSVFYYVCKSFHHQPIDKSALHDHL 300

Query: 300 GGYLGEYVPLKSSIQMENLDV 320
           GGYLG+YVPLKSSIQMEN D+
Sbjct: 301 GGYLGDYVPLKSSIQMENFDI 321


>AT4G19950.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G44860.1); Has 338 Blast hits to 330 proteins
           in 72 species: Archae - 2; Bacteria - 94; Metazoa - 7;
           Fungi - 0; Plants - 232; Viruses - 0; Other Eukaryotes -
           3 (source: NCBI BLink). | chr4:10809977-10810942 FORWARD
           LENGTH=321
          Length = 321

 Score =  375 bits (963), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 204/321 (63%), Positives = 228/321 (71%), Gaps = 1/321 (0%)

Query: 1   MDLAPEELQFXXXXXXXXXXXXXPKRSPKTFYLITLTLIFPLSFAILAHSLFTHPIISQL 60
           MDLAPEELQF             P+ S KTFYLITLTLIFPLSFAILAHSLFT PI++Q+
Sbjct: 1   MDLAPEELQFLNKRGILRESTSIPQYSLKTFYLITLTLIFPLSFAILAHSLFTQPILAQI 60

Query: 61  QS-PFDDPSQTTHEWTLLLLIQFCYLIFLFAFSLLSTAAVVFTVASLYTSKPVSFSYTIS 119
            + P  D SQ  HEWT+LL+ QFCY+IFLFAFSLLSTAAVVFTVASLYT KPVSFS T+S
Sbjct: 61  DTYPQADQSQLQHEWTVLLVFQFCYIIFLFAFSLLSTAAVVFTVASLYTGKPVSFSSTMS 120

Query: 120 AIPKVFKRLFITFLWVSLLMIIYNFVFVXXXXXXXXXXDTQNSVLMLFSVLAMLVLFIGV 179
           AIP V KRLFITFLWVSLLM+ YN VF+          D QN VL +FS++ + VLF+ V
Sbjct: 121 AIPLVLKRLFITFLWVSLLMLAYNTVFLIFLVTLIVAVDLQNVVLAVFSLVVIFVLFLVV 180

Query: 180 HVYITALWHLASVVSVLEPLYGFAAMKKSYELLKGRVRYAAVLVCGYLVICGIIGGAFSX 239
           HVY+TALWHLASVVSVLEP+YG AAMKKSYELLKG+   A  +V  YLV CG I G F  
Sbjct: 181 HVYMTALWHLASVVSVLEPIYGLAAMKKSYELLKGKTLMACSMVFIYLVHCGFIAGVFGA 240

Query: 240 XXXXXXXXXXXFSRIXXXXXXXXXXXXXXXXXXXXQSVFYYVCKSYHHQGIDKSALHDHL 299
                      F+RI                    QSVFYYVCKS+HHQ IDKSALHDHL
Sbjct: 241 VVVRGGDDYGIFARIVAGGFLVGVLVIVNLIGLLVQSVFYYVCKSFHHQEIDKSALHDHL 300

Query: 300 GGYLGEYVPLKSSIQMENLDV 320
           GGYLGEYVPLKS+IQMEN +V
Sbjct: 301 GGYLGEYVPLKSNIQMENFEV 321


>AT1G31130.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 23 plant
           structures; EXPRESSED DURING: 13 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT4G19950.1); Has 246 Blast hits to 244 proteins
           in 29 species: Archae - 2; Bacteria - 16; Metazoa - 0;
           Fungi - 0; Plants - 222; Viruses - 0; Other Eukaryotes -
           6 (source: NCBI BLink). | chr1:11114963-11115928 REVERSE
           LENGTH=321
          Length = 321

 Score =  351 bits (900), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 185/322 (57%), Positives = 221/322 (68%), Gaps = 3/322 (0%)

Query: 1   MDLAPEELQFXXXXXXXXXXXXXPKRSPKTFYLITLTLIFPLSFAILAHSLFTHPIISQL 60
           MDL PEELQF              KRSP+TFYLITL+ IFPLSFAILAHSLFT PI+++L
Sbjct: 1   MDLQPEELQFLTIPQLLQESISIKKRSPRTFYLITLSFIFPLSFAILAHSLFTQPILAKL 60

Query: 61  QSPFDDPS--QTTHEWTLLLLIQFCYLIFLFAFSLLSTAAVVFTVASLYTSKPVSFSYTI 118
               D P+  ++ H+WT+LL+ QF YLIFLFAFSLLSTAAVVFTVASLYT KPVSFS T+
Sbjct: 61  DKS-DPPNSDRSRHDWTVLLIFQFSYLIFLFAFSLLSTAAVVFTVASLYTGKPVSFSSTL 119

Query: 119 SAIPKVFKRLFITFLWVSLLMIIYNFVFVXXXXXXXXXXDTQNSVLMLFSVLAMLVLFIG 178
           SAIPKVFKRLFITFLWV+LLM  YN VF           D  +  L + + + + VL+ G
Sbjct: 120 SAIPKVFKRLFITFLWVALLMFAYNAVFFVFLVMLLVALDLNSLGLAIVAGVIISVLYFG 179

Query: 179 VHVYITALWHLASVVSVLEPLYGFAAMKKSYELLKGRVRYAAVLVCGYLVICGIIGGAFS 238
           VHVY TALWHL SV+SVLEP+YG AAM+K+YELLKG+ + A  L+  YL +CG+IG  F 
Sbjct: 180 VHVYFTALWHLGSVISVLEPVYGIAAMRKAYELLKGKTKMAMGLIFVYLFLCGLIGVVFG 239

Query: 239 XXXXXXXXXXXXFSRIXXXXXXXXXXXXXXXXXXXXQSVFYYVCKSYHHQGIDKSALHDH 298
                       F+R                     QSVFYYVCKSYHHQ IDK+AL+D 
Sbjct: 240 AVVVHGGGKYGTFTRTLVGGLLVGVLVMVNLVGLLVQSVFYYVCKSYHHQTIDKTALYDQ 299

Query: 299 LGGYLGEYVPLKSSIQMENLDV 320
           LGGYLG+YVPLKS+IQ+E+LD+
Sbjct: 300 LGGYLGDYVPLKSNIQLEDLDI 321


>AT1G26650.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G69430.1); Has 205 Blast hits to 204 proteins
           in 17 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 205; Viruses - 0; Other Eukaryotes -
           0 (source: NCBI BLink). | chr1:9210335-9211342 FORWARD
           LENGTH=335
          Length = 335

 Score = 48.5 bits (114), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 46/168 (27%), Positives = 83/168 (49%), Gaps = 23/168 (13%)

Query: 83  CYLIFLFAFSLLSTAAVVFTVASLYTSKPVSFSYTISAIPKVFKRLFITFLWVSLLMIIY 142
           C+ +F+   SLLS AAVV++V   Y+ + V  S  +  + K+++R+  T++W+ +L I+ 
Sbjct: 110 CFPVFI-TVSLLSKAAVVYSVDCSYSREVVDISKFLVILQKIWRRVVFTYVWICIL-IVG 167

Query: 143 NFVFVXXXXXXXXXXDTQNSVLMLFSVL----------AMLVLFIGVHVYITA--LWHLA 190
            F F               ++   FSVL          AMLV      V+  A  + + A
Sbjct: 168 CFTFFCVLLV---------AICSSFSVLGFSPDFNVYGAMLVGLAFSVVFANAIIICNTA 218

Query: 191 SVVSVLEPLYGFAAMKKSYELLKGRVRYAAVLVCGYLVICGIIGGAFS 238
            V+SVLE + G  A+ ++ +L+KG+++   ++  G  +    + G F 
Sbjct: 219 IVISVLEDVSGLGALMRASDLIKGQIQVGLLMFLGSTLGLAFVEGLFD 266


>AT4G16850.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G31130.1); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr4:9480699-9481640 FORWARD LENGTH=313
          Length = 313

 Score = 48.1 bits (113), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 27/55 (49%), Positives = 35/55 (63%), Gaps = 1/55 (1%)

Query: 174 VLFIGVHVYITALWHLASVVSVLEPLYGFAAMKKSYELLKGRVRYAAVLVCGYLV 228
           +LF+GV VYI A+  L  VVSVLE  YGF A+K+   L+KGR R   + + G  V
Sbjct: 178 ILFLGVEVYIMAITGLGFVVSVLEERYGFDAIKEGTALMKGR-RITGLALAGVFV 231