Miyakogusa Predicted Gene

Lj3g3v1461450.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj3g3v1461450.1 Non Characterized Hit- tr|D7SHN7|D7SHN7_VITVI
Putative uncharacterized protein OS=Vitis vinifera
GN=,29.65,0.000000001,SUBFAMILY NOT NAMED,NULL; FAMILY NOT NAMED,NULL;
seg,NULL,CUFF.42644.1
         (350 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT3G60590.3 | Symbols:  | unknown protein; BEST Arabidopsis thal...   362   e-100
AT3G60590.2 | Symbols:  | unknown protein; LOCATED IN: chloropla...   360   e-100
AT3G60590.4 | Symbols:  | unknown protein; LOCATED IN: chloropla...   315   3e-86
AT3G60590.1 | Symbols:  | unknown protein; LOCATED IN: chloropla...   315   3e-86
AT1G48460.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    52   5e-07

>AT3G60590.3 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G48460.1); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr3:22398228-22399753 FORWARD LENGTH=404
          Length = 404

 Score =  362 bits (929), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 186/343 (54%), Positives = 242/343 (70%), Gaps = 19/343 (5%)

Query: 1   MSRGMASLTQTHYPINTSTLRRAHPRSQKLLKSGKL-FQLQGLSFPSIHINQSRVCCT-K 58
           +S+GM SLT++   + T  +R  +P   + LK+G   F+ + L          RV CT K
Sbjct: 77  VSQGMVSLTKSLCTM-TPRVRLKNPNMLQKLKTGSCNFRFRNL----------RVLCTPK 125

Query: 59  LTPWEPSPVTYAPTDKQSDNFLQNSANIFETLDSSETAESPTANAEGLKETKYQPGLQLQ 118
           L+ WEPSP  +A  ++ +D  L  +AN+FE++  SE+AE    +    + T  Q    +Q
Sbjct: 126 LSQWEPSPFIHASAEEAADIVLDKTANVFESI-VSESAEEEKVDMSAQQRTNSQ----VQ 180

Query: 119 FLKWPMWLLGPSILLVTGMVPTLWLPISSIFLGPNIASLLSLIGLDCIFNLGATLFLLMA 178
            LKWP+WLLGPS+LL +GM PTLWLP+SS+FLG N+ SLLSLIGLDCIFNLGATLFLLMA
Sbjct: 181 VLKWPIWLLGPSVLLTSGMAPTLWLPLSSVFLGSNVVSLLSLIGLDCIFNLGATLFLLMA 240

Query: 179 DSCSRPKNPMQDCKSKAPFSYQFWNIVATLTGFVVPLLMMLGSQKG-VSVLQPQLPSISF 237
           DSC+RPK+P Q C SK PFSY+FWN+ + + GF+VP+L++ GSQ G ++ LQPQ+P +S 
Sbjct: 241 DSCARPKDPSQSCNSKPPFSYKFWNMFSLIIGFLVPMLLLFGSQSGLLASLQPQIPFLSS 300

Query: 238 AVXXXXXXXXXSVQILTEMLTWHWQSPVWLVTPVIYESYRVLQLMRGLKLGVELSAPAWI 297
           AV         +VQ LTE+LTWHWQSPVWLVTPV+YE+YR+LQLMRGL L  E++AP W+
Sbjct: 301 AVILFPYFILLAVQTLTEILTWHWQSPVWLVTPVVYEAYRILQLMRGLTLSAEVNAPVWV 360

Query: 298 MHSIRGLVCWWVLILGLQLMRVAWFAGLTARARKQQQSSSDTS 340
           +H +RGLV WWVLILG+QLMRVAWFAG  +R    QQ  S  S
Sbjct: 361 VHMLRGLVSWWVLILGMQLMRVAWFAGFASRTTTGQQPQSVAS 403


>AT3G60590.2 | Symbols:  | unknown protein; LOCATED IN: chloroplast,
           chloroplast inner membrane, chloroplast envelope;
           EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT1G48460.1); Has 35333 Blast
           hits to 34131 proteins in 2444 species: Archae - 798;
           Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
           531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
           BLink). | chr3:22398764-22399753 FORWARD LENGTH=329
          Length = 329

 Score =  360 bits (924), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 186/343 (54%), Positives = 242/343 (70%), Gaps = 19/343 (5%)

Query: 1   MSRGMASLTQTHYPINTSTLRRAHPRSQKLLKSGKL-FQLQGLSFPSIHINQSRVCCT-K 58
           +S+GM SLT++   + T  +R  +P   + LK+G   F+ + L          RV CT K
Sbjct: 2   VSQGMVSLTKSLCTM-TPRVRLKNPNMLQKLKTGSCNFRFRNL----------RVLCTPK 50

Query: 59  LTPWEPSPVTYAPTDKQSDNFLQNSANIFETLDSSETAESPTANAEGLKETKYQPGLQLQ 118
           L+ WEPSP  +A  ++ +D  L  +AN+FE++  SE+AE    +    + T  Q    +Q
Sbjct: 51  LSQWEPSPFIHASAEEAADIVLDKTANVFESI-VSESAEEEKVDMSAQQRTNSQ----VQ 105

Query: 119 FLKWPMWLLGPSILLVTGMVPTLWLPISSIFLGPNIASLLSLIGLDCIFNLGATLFLLMA 178
            LKWP+WLLGPS+LL +GM PTLWLP+SS+FLG N+ SLLSLIGLDCIFNLGATLFLLMA
Sbjct: 106 VLKWPIWLLGPSVLLTSGMAPTLWLPLSSVFLGSNVVSLLSLIGLDCIFNLGATLFLLMA 165

Query: 179 DSCSRPKNPMQDCKSKAPFSYQFWNIVATLTGFVVPLLMMLGSQKG-VSVLQPQLPSISF 237
           DSC+RPK+P Q C SK PFSY+FWN+ + + GF+VP+L++ GSQ G ++ LQPQ+P +S 
Sbjct: 166 DSCARPKDPSQSCNSKPPFSYKFWNMFSLIIGFLVPMLLLFGSQSGLLASLQPQIPFLSS 225

Query: 238 AVXXXXXXXXXSVQILTEMLTWHWQSPVWLVTPVIYESYRVLQLMRGLKLGVELSAPAWI 297
           AV         +VQ LTE+LTWHWQSPVWLVTPV+YE+YR+LQLMRGL L  E++AP W+
Sbjct: 226 AVILFPYFILLAVQTLTEILTWHWQSPVWLVTPVVYEAYRILQLMRGLTLSAEVNAPVWV 285

Query: 298 MHSIRGLVCWWVLILGLQLMRVAWFAGLTARARKQQQSSSDTS 340
           +H +RGLV WWVLILG+QLMRVAWFAG  +R    QQ  S  S
Sbjct: 286 VHMLRGLVSWWVLILGMQLMRVAWFAGFASRTTTGQQPQSVAS 328


>AT3G60590.4 | Symbols:  | unknown protein; LOCATED IN: chloroplast
           inner membrane; EXPRESSED IN: 23 plant structures;
           EXPRESSED DURING: 14 growth stages; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G48460.1); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr3:22399043-22399753 FORWARD LENGTH=236
          Length = 236

 Score =  315 bits (807), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 149/226 (65%), Positives = 182/226 (80%), Gaps = 1/226 (0%)

Query: 116 QLQFLKWPMWLLGPSILLVTGMVPTLWLPISSIFLGPNIASLLSLIGLDCIFNLGATLFL 175
           Q+Q LKWP+WLLGPS+LL +GM PTLWLP+SS+FLG N+ SLLSLIGLDCIFNLGATLFL
Sbjct: 10  QVQVLKWPIWLLGPSVLLTSGMAPTLWLPLSSVFLGSNVVSLLSLIGLDCIFNLGATLFL 69

Query: 176 LMADSCSRPKNPMQDCKSKAPFSYQFWNIVATLTGFVVPLLMMLGSQKG-VSVLQPQLPS 234
           LMADSC+RPK+P Q C SK PFSY+FWN+ + + GF+VP+L++ GSQ G ++ LQPQ+P 
Sbjct: 70  LMADSCARPKDPSQSCNSKPPFSYKFWNMFSLIIGFLVPMLLLFGSQSGLLASLQPQIPF 129

Query: 235 ISFAVXXXXXXXXXSVQILTEMLTWHWQSPVWLVTPVIYESYRVLQLMRGLKLGVELSAP 294
           +S AV         +VQ LTE+LTWHWQSPVWLVTPV+YE+YR+LQLMRGL L  E++AP
Sbjct: 130 LSSAVILFPYFILLAVQTLTEILTWHWQSPVWLVTPVVYEAYRILQLMRGLTLSAEVNAP 189

Query: 295 AWIMHSIRGLVCWWVLILGLQLMRVAWFAGLTARARKQQQSSSDTS 340
            W++H +RGLV WWVLILG+QLMRVAWFAG  +R    QQ  S  S
Sbjct: 190 VWVVHMLRGLVSWWVLILGMQLMRVAWFAGFASRTTTGQQPQSVAS 235


>AT3G60590.1 | Symbols:  | unknown protein; LOCATED IN: chloroplast,
           chloroplast inner membrane, chloroplast envelope;
           EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT1G48460.1); Has 81 Blast
           hits to 81 proteins in 19 species: Archae - 0; Bacteria
           - 10; Metazoa - 0; Fungi - 0; Plants - 70; Viruses - 0;
           Other Eukaryotes - 1 (source: NCBI BLink). |
           chr3:22399043-22399753 FORWARD LENGTH=236
          Length = 236

 Score =  315 bits (807), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 149/226 (65%), Positives = 182/226 (80%), Gaps = 1/226 (0%)

Query: 116 QLQFLKWPMWLLGPSILLVTGMVPTLWLPISSIFLGPNIASLLSLIGLDCIFNLGATLFL 175
           Q+Q LKWP+WLLGPS+LL +GM PTLWLP+SS+FLG N+ SLLSLIGLDCIFNLGATLFL
Sbjct: 10  QVQVLKWPIWLLGPSVLLTSGMAPTLWLPLSSVFLGSNVVSLLSLIGLDCIFNLGATLFL 69

Query: 176 LMADSCSRPKNPMQDCKSKAPFSYQFWNIVATLTGFVVPLLMMLGSQKG-VSVLQPQLPS 234
           LMADSC+RPK+P Q C SK PFSY+FWN+ + + GF+VP+L++ GSQ G ++ LQPQ+P 
Sbjct: 70  LMADSCARPKDPSQSCNSKPPFSYKFWNMFSLIIGFLVPMLLLFGSQSGLLASLQPQIPF 129

Query: 235 ISFAVXXXXXXXXXSVQILTEMLTWHWQSPVWLVTPVIYESYRVLQLMRGLKLGVELSAP 294
           +S AV         +VQ LTE+LTWHWQSPVWLVTPV+YE+YR+LQLMRGL L  E++AP
Sbjct: 130 LSSAVILFPYFILLAVQTLTEILTWHWQSPVWLVTPVVYEAYRILQLMRGLTLSAEVNAP 189

Query: 295 AWIMHSIRGLVCWWVLILGLQLMRVAWFAGLTARARKQQQSSSDTS 340
            W++H +RGLV WWVLILG+QLMRVAWFAG  +R    QQ  S  S
Sbjct: 190 VWVVHMLRGLVSWWVLILGMQLMRVAWFAGFASRTTTGQQPQSVAS 235


>AT1G48460.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast
           envelope; EXPRESSED IN: 21 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G63040.1);
           Has 60 Blast hits to 60 proteins in 14 species: Archae -
           0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 60;
           Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
           | chr1:17911469-17913149 FORWARD LENGTH=340
          Length = 340

 Score = 52.4 bits (124), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 51/243 (20%), Positives = 96/243 (39%), Gaps = 23/243 (9%)

Query: 48  HINQSRVCCTKLTPWEPSPVTYAPTDKQSDNFLQNSANIFETLDSSETAESPTANAEGLK 107
            +++ RV C+       S     P  KQSD      A +F     S +    T     ++
Sbjct: 53  QLSRVRVACSS------SQSDSRPEKKQSDKSNYARAELFRGKSGSVSFNGLTHQL--VE 104

Query: 108 ETKYQPGLQLQFLKWPMWLLGPSILLVTGMVPTLWLP--ISSIFLGPNIASLLSLIGLDC 165
           E+K       +     +W+L P +L+ + ++P  +L   I + F    +A +++    + 
Sbjct: 105 ESKLVSAPFQEEKGSFLWVLAPVVLISSLILPQFFLSGIIEATFKNDTVAEIVTSFCFET 164

Query: 166 IFNLGATLFLLMADSCSRPKNPMQDCKSKAPFSYQFWNIVATLTGFVVPLLMMLGSQKGV 225
           +F  G  +FL + D   RP            FS + W ++  L G++    + +G +  V
Sbjct: 165 VFYAGLAIFLSVTDRVQRPY---------LDFSSKRWGLITGLRGYLTSAFLTMGLKVVV 215

Query: 226 SVLQPQLP----SISFAVXXXXXXXXXSVQILTEMLTWHWQSPVWLVTPVIYESYRVLQL 281
            V    +      I   +         +VQ + E       S  W + P+++E YR+ Q+
Sbjct: 216 PVFAVYMTWPALGIDALIAVLPFLVGCAVQRVFEARLERRGSSCWPIVPIVFEVYRLYQV 275

Query: 282 MRG 284
            R 
Sbjct: 276 TRA 278