Miyakogusa Predicted Gene

Lj0g3v0168269.1
BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0168269.1 Non Characterized Hit- tr|B9S2Q4|B9S2Q4_RICCO
Putative uncharacterized protein OS=Ricinus communis
G,27.27,1e-18,seg,NULL,CUFF.10542.1
         (331 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G44860.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   129   2e-30
AT1G31130.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   127   9e-30
AT4G19950.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   120   1e-27
AT1G69430.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    64   1e-10
AT1G26650.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    58   9e-09

>AT5G44860.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G19950.1); Has 233 Blast hits to 227 proteins
           in 25 species: Archae - 0; Bacteria - 13; Metazoa - 1;
           Fungi - 0; Plants - 216; Viruses - 0; Other Eukaryotes -
           3 (source: NCBI BLink). | chr5:18110688-18111653 REVERSE
           LENGTH=321
          Length = 321

 Score =  129 bits (325), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 100/327 (30%), Positives = 160/327 (48%), Gaps = 18/327 (5%)

Query: 1   MERDQEKMQFLGFFGVFKESYMIM-FSWRKXXXXXXXXXXXXXXXXXXXNMEVSQVMFGG 59
           M+   E++QFL   G+ +ES  I  FS +                       +S  +   
Sbjct: 1   MDLAAEELQFLNIQGILRESTTIPKFSPKTFYLITLTLI-----------FPLSFAILAH 49

Query: 60  ILFYSERISRTKFGTPHQKRLINTISIEWASFIIFKLIYFVFFLIFSLVSTSAVVHTIAS 119
            LF    +++     P  +   N    EW   +I++ IY +F   FSL+ST+AVV T+AS
Sbjct: 50  SLFTQPILAQLDATPPSDQSKTNH---EWTLLLIYQFIYVIFLFAFSLLSTAAVVFTVAS 106

Query: 120 IYTSREVTLKKAMGAVAKVWKSLVITFLCTFAAFFIYNVV--AGIVLFIWALTIGENIYG 177
           +YT + V+    M A+  V K L ITFL       +YN V    +V+ I A+ +   I  
Sbjct: 107 LYTGKPVSFSSTMSAIPLVLKRLFITFLWVSLMMLVYNSVFLLFLVVLIVAIDLQSVILA 166

Query: 178 MATLLVIGILYIVGFIYLAAVWQLANVVSMLEDSSGIKAMMKSNELIKGKKGLSLIITFM 237
           + +++VI +L++   +Y+ A W LA+VVS+LE   GI AM KS EL+ G+  ++  + FM
Sbjct: 167 VFSMVVIFVLFLGVHVYMTAWWHLASVVSVLEPIYGIAAMKKSYELLNGRTNMACSMVFM 226

Query: 238 -LSVFFFLIQFLFSMVLLRLFHVGLVWKIASGVXXXXXXXXXXXXXXVIQTVLYFDCKSY 296
            L++          +V+      GL  KI  G               ++Q+V Y+ CKS+
Sbjct: 227 YLALCGITAGVFGGVVVHGGDDFGLFTKIVVGGFLVGILVIVNLVGLLVQSVFYYVCKSF 286

Query: 297 HHEIIGKSALSEQLEVYQGEHELVKAT 323
           HH+ I KSAL + L  Y G++  +K++
Sbjct: 287 HHQPIDKSALHDHLGGYLGDYVPLKSS 313


>AT1G31130.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 23 plant
           structures; EXPRESSED DURING: 13 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT4G19950.1); Has 246 Blast hits to 244 proteins
           in 29 species: Archae - 2; Bacteria - 16; Metazoa - 0;
           Fungi - 0; Plants - 222; Viruses - 0; Other Eukaryotes -
           6 (source: NCBI BLink). | chr1:11114963-11115928 REVERSE
           LENGTH=321
          Length = 321

 Score =  127 bits (320), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 86/259 (33%), Positives = 137/259 (52%), Gaps = 13/259 (5%)

Query: 71  KFGTPHQKRLINTISIEWASFIIFKLIYFVFFLIFSLVSTSAVVHTIASIYTSREVTLKK 130
           K   P+  R  +    +W   +IF+  Y +F   FSL+ST+AVV T+AS+YT + V+   
Sbjct: 62  KSDPPNSDRSRH----DWTVLLIFQFSYLIFLFAFSLLSTAAVVFTVASLYTGKPVSFSS 117

Query: 131 AMGAVAKVWKSLVITFLCTFAAFFIYNVVAGIVLFIWALTIGENIYGMATL--LVIGILY 188
            + A+ KV+K L ITFL      F YN V  + L +  + +  N  G+A +  ++I +LY
Sbjct: 118 TLSAIPKVFKRLFITFLWVALLMFAYNAVFFVFLVMLLVALDLNSLGLAIVAGVIISVLY 177

Query: 189 IVGFIYLAAVWQLANVVSMLEDSSGIKAMMKSNELIKGKKGLSLIITFMLSVFFFLIQFL 248
               +Y  A+W L +V+S+LE   GI AM K+ EL+KGK  +++ + F+      LI  +
Sbjct: 178 FGVHVYFTALWHLGSVISVLEPVYGIAAMRKAYELLKGKTKMAMGLIFVYLFLCGLIGVV 237

Query: 249 FSMVLLRLFH----VGLVWKIASGVXXXXXXXXXXXXXXVIQTVLYFDCKSYHHEIIGKS 304
           F  V++   H     G   +   G               ++Q+V Y+ CKSYHH+ I K+
Sbjct: 238 FGAVVV---HGGGKYGTFTRTLVGGLLVGVLVMVNLVGLLVQSVFYYVCKSYHHQTIDKT 294

Query: 305 ALSEQLEVYQGEHELVKAT 323
           AL +QL  Y G++  +K+ 
Sbjct: 295 ALYDQLGGYLGDYVPLKSN 313


>AT4G19950.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G44860.1); Has 338 Blast hits to 330 proteins
           in 72 species: Archae - 2; Bacteria - 94; Metazoa - 7;
           Fungi - 0; Plants - 232; Viruses - 0; Other Eukaryotes -
           3 (source: NCBI BLink). | chr4:10809977-10810942 FORWARD
           LENGTH=321
          Length = 321

 Score =  120 bits (301), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 102/326 (31%), Positives = 160/326 (49%), Gaps = 16/326 (4%)

Query: 1   MERDQEKMQFLGFFGVFKESYMIMFSWRKXXXXXXXXXXXXXXXXXXXNMEVSQVMFGGI 60
           M+   E++QFL   G+ +ES  I     K                   +   +Q +   I
Sbjct: 1   MDLAPEELQFLNKRGILRESTSIPQYSLKTFYLITLTLIFPLSFAILAHSLFTQPILAQI 60

Query: 61  LFYSERISRTKFGTPHQKRLINTISIEWASFIIFKLIYFVFFLIFSLVSTSAVVHTIASI 120
             Y +           Q +L +    EW   ++F+  Y +F   FSL+ST+AVV T+AS+
Sbjct: 61  DTYPQA---------DQSQLQH----EWTVLLVFQFCYIIFLFAFSLLSTAAVVFTVASL 107

Query: 121 YTSREVTLKKAMGAVAKVWKSLVITFLCTFAAFFIYNVVAGI--VLFIWALTIGENIYGM 178
           YT + V+    M A+  V K L ITFL        YN V  I  V  I A+ +   +  +
Sbjct: 108 YTGKPVSFSSTMSAIPLVLKRLFITFLWVSLLMLAYNTVFLIFLVTLIVAVDLQNVVLAV 167

Query: 179 ATLLVIGILYIVGFIYLAAVWQLANVVSMLEDSSGIKAMMKSNELIKGKKGLSLIITFML 238
            +L+VI +L++V  +Y+ A+W LA+VVS+LE   G+ AM KS EL+KGK  ++  + F+ 
Sbjct: 168 FSLVVIFVLFLVVHVYMTALWHLASVVSVLEPIYGLAAMKKSYELLKGKTLMACSMVFIY 227

Query: 239 SVFFFLIQFLFSMVLLR-LFHVGLVWKIASGVXXXXXXXXXXXXXXVIQTVLYFDCKSYH 297
            V    I  +F  V++R     G+  +I +G               ++Q+V Y+ CKS+H
Sbjct: 228 LVHCGFIAGVFGAVVVRGGDDYGIFARIVAGGFLVGVLVIVNLIGLLVQSVFYYVCKSFH 287

Query: 298 HEIIGKSALSEQLEVYQGEHELVKAT 323
           H+ I KSAL + L  Y GE+  +K+ 
Sbjct: 288 HQEIDKSALHDHLGGYLGEYVPLKSN 313


>AT1G69430.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G26650.1); Has 216 Blast hits to 215 proteins
           in 17 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 216; Viruses - 0; Other Eukaryotes -
           0 (source: NCBI BLink). | chr1:26098025-26099077 FORWARD
           LENGTH=350
          Length = 350

 Score = 63.9 bits (154), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 53/210 (25%), Positives = 104/210 (49%), Gaps = 10/210 (4%)

Query: 99  FVFFLIFSLVSTSAVVHTIASIYTSREVTLKKAMGAVAKVWKSLVITFL--CT-----FA 151
           F  F+  SL+S +AVV+++   Y+ ++V + K +  + ++WK LVIT+L  CT       
Sbjct: 127 FPLFITLSLLSRAAVVYSVDCTYSRKKVVVTKFVVIMQRLWKRLVITYLWICTVIVVCLT 186

Query: 152 AFFIYNVVAGIVLFIWALTIGENIYGMATLLVIGILYIVGFIYLAAVWQLANVVSMLEDS 211
           +F ++ V      ++   +   N YG    +++G+++ V F     +     V+S+LED 
Sbjct: 187 SFCVFLVAVCSSFYVLGFSPDFNAYGA---ILVGLVFSVVFANAIIICNTTIVISILEDV 243

Query: 212 SGIKAMMKSNELIKGKKGLSLIITFMLSVFFFLIQFLFSMVLLRLFHVGLVWKIASGVXX 271
           SG  A++++++LIKG+  + L+I    ++    ++ LF   +  L +     ++  G   
Sbjct: 244 SGPGALVRASDLIKGQTQVGLLIFLGSTIGLTFVEGLFEHRVKSLSYGDGSSRLWEGPLL 303

Query: 272 XXXXXXXXXXXXVIQTVLYFDCKSYHHEII 301
                       ++  V YF C+SY  E +
Sbjct: 304 VVMYSFVVLIDTMMSAVFYFSCRSYSMEAV 333


>AT1G26650.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G69430.1); Has 205 Blast hits to 204 proteins
           in 17 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 205; Viruses - 0; Other Eukaryotes -
           0 (source: NCBI BLink). | chr1:9210335-9211342 FORWARD
           LENGTH=335
          Length = 335

 Score = 57.8 bits (138), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 58/258 (22%), Positives = 120/258 (46%), Gaps = 22/258 (8%)

Query: 73  GTPHQKRLINTISIEWASFIIFKLIYFVFFLIFSLVSTSAVVHTIASIYTSREVTLKKAM 132
           G P Q  + ++   ++A   +   + F  F+  SL+S +AVV+++   Y+   V + K +
Sbjct: 86  GLPLQPFVKHSCQ-KFAETAVSSAMCFPVFITVSLLSKAAVVYSVDCSYSREVVDISKFL 144

Query: 133 GAVAKVWKSLVITF--LCTFA----AFFIYNVVAGIVLF-IWALTIGENIYGMATLLVIG 185
             + K+W+ +V T+  +C        FF   +VA    F +   +   N+YG    +++G
Sbjct: 145 VILQKIWRRVVFTYVWICILIVGCFTFFCVLLVAICSSFSVLGFSPDFNVYGA---MLVG 201

Query: 186 ILYIVGFIYLAAVWQLANVVSMLEDSSGIKAMMKSNELIKGKKGLSLIITFMLSVFFFLI 245
           + + V F     +   A V+S+LED SG+ A+M++++LIKG+  + L++    ++    +
Sbjct: 202 LAFSVVFANAIIICNTAIVISVLEDVSGLGALMRASDLIKGQIQVGLLMFLGSTLGLAFV 261

Query: 246 QFLFSMVLLRLFHVGLVWKIASGVXXXXXXXXXXXXXXVIQTVLYFDCKSYHHEIIGKSA 305
           + LF   + ++ +     ++  G               ++  V YF C+ Y+        
Sbjct: 262 EGLFDHRVKKVSYGDGSSRLWEGPLLVLMYSFVTLIDSMMSAVFYFSCRVYY-------- 313

Query: 306 LSEQLEVYQGEHELVKAT 323
               +E  +GE + +  T
Sbjct: 314 ---SMEASRGETQPIMET 328