Miyakogusa Predicted Gene

Lj1g3v0375290.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v0375290.1 tr|B9MVG9|B9MVG9_POPTR Predicted protein
OS=Populus trichocarpa GN=POPTRDRAFT_677769 PE=4
SV=1,29.56,0.000000000000002,seg,NULL,CUFF.25596.1
         (373 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G63040.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   311   5e-85
AT5G63040.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   311   6e-85
AT1G48460.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   142   3e-34
AT3G60590.2 | Symbols:  | unknown protein; LOCATED IN: chloropla...    63   3e-10
AT3G60590.3 | Symbols:  | unknown protein; BEST Arabidopsis thal...    63   4e-10
AT3G60590.4 | Symbols:  | unknown protein; LOCATED IN: chloropla...    61   1e-09
AT3G60590.1 | Symbols:  | unknown protein; LOCATED IN: chloropla...    61   1e-09

>AT5G63040.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast;
           EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 15
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT1G48460.1); Has 30201 Blast
           hits to 17322 proteins in 780 species: Archae - 12;
           Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants -
           5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
           BLink). | chr5:25288504-25290326 FORWARD LENGTH=366
          Length = 366

 Score =  311 bits (797), Expect = 5e-85,   Method: Compositional matrix adjust.
 Identities = 168/254 (66%), Positives = 203/254 (79%), Gaps = 9/254 (3%)

Query: 112 SNSSYHLSGSDGKPGLISFYSRPYGRDNEVLLSNPEKSQN---SILWFMGPAVLVASFIF 168
            +S+   + +DGKPG ISFY+ P  +  ++++  P ++Q+    +LW +GPAVLV+SFI 
Sbjct: 109 DDSTIQYNRNDGKPGFISFYN-PRNKTEDIII--PPETQSPWGRLLWLIGPAVLVSSFIL 165

Query: 169 PSLYMRKLLSIIFEDSLLTDFLILFFTEAIFYCGVSLFLFLLDHLRRPVQLDIVANNGNT 228
           P +Y+R+++S +FEDSLLTDFLILFFTEA+FYCGV+ FL ++D  R+     +  N  N 
Sbjct: 166 PPVYLRRIVSAVFEDSLLTDFLILFFTEALFYCGVAAFLLIIDRSRKGSG-KVPQNRIN- 223

Query: 229 LPPQMGQRISSVATLVLSLVIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARY 288
            P Q+GQRISSVATLVLSL+IPMVTMG VWPWTGPAASATLAPYLVGIVVQFAFEQYARY
Sbjct: 224 -PSQLGQRISSVATLVLSLMIPMVTMGFVWPWTGPAASATLAPYLVGIVVQFAFEQYARY 282

Query: 289 RKSPSWCAIPFIFQVYRLHQLNRAAQLVTALSSTVRGAEMTSHNMAIISSLGTLLNVLQF 348
           R SPS   IP IFQVYRLHQLNRAAQLVTALS TV+GAE T +N+AI  SLGTLLNV+Q 
Sbjct: 283 RNSPSSPIIPIIFQVYRLHQLNRAAQLVTALSFTVKGAEATVNNLAIKKSLGTLLNVIQV 342

Query: 349 LGVICIWSLSSFLM 362
           LGVI IWS+SSFLM
Sbjct: 343 LGVISIWSISSFLM 356


>AT5G63040.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast;
           EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 15
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT1G48460.1); Has 35333 Blast
           hits to 34131 proteins in 2444 species: Archae - 798;
           Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
           531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
           BLink). | chr5:25288950-25290326 FORWARD LENGTH=366
          Length = 366

 Score =  311 bits (796), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 168/254 (66%), Positives = 203/254 (79%), Gaps = 9/254 (3%)

Query: 112 SNSSYHLSGSDGKPGLISFYSRPYGRDNEVLLSNPEKSQN---SILWFMGPAVLVASFIF 168
            +S+   + +DGKPG ISFY+ P  +  ++++  P ++Q+    +LW +GPAVLV+SFI 
Sbjct: 109 DDSTIQYNRNDGKPGFISFYN-PRNKTEDIII--PPETQSPWGRLLWLIGPAVLVSSFIL 165

Query: 169 PSLYMRKLLSIIFEDSLLTDFLILFFTEAIFYCGVSLFLFLLDHLRRPVQLDIVANNGNT 228
           P +Y+R+++S +FEDSLLTDFLILFFTEA+FYCGV+ FL ++D  R+     +  N  N 
Sbjct: 166 PPVYLRRIVSAVFEDSLLTDFLILFFTEALFYCGVAAFLLIIDRSRKGSG-KVPQNRIN- 223

Query: 229 LPPQMGQRISSVATLVLSLVIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARY 288
            P Q+GQRISSVATLVLSL+IPMVTMG VWPWTGPAASATLAPYLVGIVVQFAFEQYARY
Sbjct: 224 -PSQLGQRISSVATLVLSLMIPMVTMGFVWPWTGPAASATLAPYLVGIVVQFAFEQYARY 282

Query: 289 RKSPSWCAIPFIFQVYRLHQLNRAAQLVTALSSTVRGAEMTSHNMAIISSLGTLLNVLQF 348
           R SPS   IP IFQVYRLHQLNRAAQLVTALS TV+GAE T +N+AI  SLGTLLNV+Q 
Sbjct: 283 RNSPSSPIIPIIFQVYRLHQLNRAAQLVTALSFTVKGAEATVNNLAIKKSLGTLLNVIQV 342

Query: 349 LGVICIWSLSSFLM 362
           LGVI IWS+SSFLM
Sbjct: 343 LGVISIWSISSFLM 356


>AT1G48460.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast
           envelope; EXPRESSED IN: 21 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G63040.1);
           Has 60 Blast hits to 60 proteins in 14 species: Archae -
           0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 60;
           Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
           | chr1:17911469-17913149 FORWARD LENGTH=340
          Length = 340

 Score =  142 bits (358), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 87/286 (30%), Positives = 148/286 (51%), Gaps = 4/286 (1%)

Query: 84  DQSEHSELEIAATVPENNNNIVQNFSPASNSSYHLSGSDGKPGLISFYSRPYGRDNE-VL 142
           ++ + S + +A +  ++++   +  S  SN +       GK G +SF    +    E  L
Sbjct: 50  NKPQLSRVRVACSSSQSDSRPEKKQSDKSNYA-RAELFRGKSGSVSFNGLTHQLVEESKL 108

Query: 143 LSNP-EKSQNSILWFMGPAVLVASFIFPSLYMRKLLSIIFEDSLLTDFLILFFTEAIFYC 201
           +S P ++ + S LW + P VL++S I P  ++  ++   F++  + + +  F  E +FY 
Sbjct: 109 VSAPFQEEKGSFLWVLAPVVLISSLILPQFFLSGIIEATFKNDTVAEIVTSFCFETVFYA 168

Query: 202 GVSLFLFLLDHLRRPVQLDIVANNGNTLPPQMGQRISSVATLVLSLVIPMVTMGLVWPWT 261
           G+++FL + D ++RP  LD  +     +    G   S+  T+ L +V+P+  + + WP  
Sbjct: 169 GLAIFLSVTDRVQRP-YLDFSSKRWGLITGLRGYLTSAFLTMGLKVVVPVFAVYMTWPAL 227

Query: 262 GPAASATLAPYLVGIVVQFAFEQYARYRKSPSWCAIPFIFQVYRLHQLNRAAQLVTALSS 321
           G  A   + P+LVG  VQ  FE     R S  W  +P +F+VYRL+Q+ RAA  V  L  
Sbjct: 228 GIDALIAVLPFLVGCAVQRVFEARLERRGSSCWPIVPIVFEVYRLYQVTRAATFVQRLMF 287

Query: 322 TVRGAEMTSHNMAIISSLGTLLNVLQFLGVICIWSLSSFLMRFIPS 367
            ++ A  T+       +L  L+  LQFL V+C+WS  +FLMR  PS
Sbjct: 288 MMKDAATTAEITERGVALVGLVVTLQFLAVMCLWSFITFLMRLFPS 333


>AT3G60590.2 | Symbols:  | unknown protein; LOCATED IN: chloroplast,
           chloroplast inner membrane, chloroplast envelope;
           EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT1G48460.1); Has 35333 Blast
           hits to 34131 proteins in 2444 species: Archae - 798;
           Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
           531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
           BLink). | chr3:22398764-22399753 FORWARD LENGTH=329
          Length = 329

 Score = 62.8 bits (151), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 52/184 (28%), Positives = 91/184 (49%), Gaps = 15/184 (8%)

Query: 154 LWFMGPAVLVASFIFPSLYMRKLLSIIFEDSLLTDFLILFFTEAIFYCGVSLFLFLLDHL 213
           +W +GP+VL+ S + P+L++   LS +F  S +   L L   + IF  G +LFL + D  
Sbjct: 111 IWLLGPSVLLTSGMAPTLWLP--LSSVFLGSNVVSLLSLIGLDCIFNLGATLFLLMADSC 168

Query: 214 RRPVQLDIVANNGNTLPPQMGQRISSVATLVLSLVIPMVTM-----GLV---WPWTGPAA 265
            RP      + + N+ PP    +  ++ +L++  ++PM+ +     GL+    P     +
Sbjct: 169 ARPKD---PSQSCNSKPP-FSYKFWNMFSLIIGFLVPMLLLFGSQSGLLASLQPQIPFLS 224

Query: 266 SAT-LAPYLVGIVVQFAFEQYARYRKSPSWCAIPFIFQVYRLHQLNRAAQLVTALSSTVR 324
           SA  L PY + + VQ   E    + +SP W   P +++ YR+ QL R   L   +++ V 
Sbjct: 225 SAVILFPYFILLAVQTLTEILTWHWQSPVWLVTPVVYEAYRILQLMRGLTLSAEVNAPVW 284

Query: 325 GAEM 328
              M
Sbjct: 285 VVHM 288


>AT3G60590.3 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G48460.1); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr3:22398228-22399753 FORWARD LENGTH=404
          Length = 404

 Score = 62.8 bits (151), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 52/184 (28%), Positives = 91/184 (49%), Gaps = 15/184 (8%)

Query: 154 LWFMGPAVLVASFIFPSLYMRKLLSIIFEDSLLTDFLILFFTEAIFYCGVSLFLFLLDHL 213
           +W +GP+VL+ S + P+L++   LS +F  S +   L L   + IF  G +LFL + D  
Sbjct: 186 IWLLGPSVLLTSGMAPTLWLP--LSSVFLGSNVVSLLSLIGLDCIFNLGATLFLLMADSC 243

Query: 214 RRPVQLDIVANNGNTLPPQMGQRISSVATLVLSLVIPMVTM-----GLV---WPWTGPAA 265
            RP      + + N+ PP    +  ++ +L++  ++PM+ +     GL+    P     +
Sbjct: 244 ARPKD---PSQSCNSKPP-FSYKFWNMFSLIIGFLVPMLLLFGSQSGLLASLQPQIPFLS 299

Query: 266 SAT-LAPYLVGIVVQFAFEQYARYRKSPSWCAIPFIFQVYRLHQLNRAAQLVTALSSTVR 324
           SA  L PY + + VQ   E    + +SP W   P +++ YR+ QL R   L   +++ V 
Sbjct: 300 SAVILFPYFILLAVQTLTEILTWHWQSPVWLVTPVVYEAYRILQLMRGLTLSAEVNAPVW 359

Query: 325 GAEM 328
              M
Sbjct: 360 VVHM 363


>AT3G60590.4 | Symbols:  | unknown protein; LOCATED IN: chloroplast
           inner membrane; EXPRESSED IN: 23 plant structures;
           EXPRESSED DURING: 14 growth stages; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G48460.1); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr3:22399043-22399753 FORWARD LENGTH=236
          Length = 236

 Score = 60.8 bits (146), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 50/185 (27%), Positives = 91/185 (49%), Gaps = 17/185 (9%)

Query: 154 LWFMGPAVLVASFIFPSLYMRKLLSIIFEDSLLTDFLILFFTEAIFYCGVSLFLFLLDHL 213
           +W +GP+VL+ S + P+L++   LS +F  S +   L L   + IF  G +LFL + D  
Sbjct: 18  IWLLGPSVLLTSGMAPTLWLP--LSSVFLGSNVVSLLSLIGLDCIFNLGATLFLLMADSC 75

Query: 214 RRPVQLDIVANNGNTLPPQMGQRISSVATLVLSLVIPMVTM-----GLVWPWTGP----- 263
            RP      + + N+ PP    +  ++ +L++  ++PM+ +     GL+     P     
Sbjct: 76  ARPKD---PSQSCNSKPP-FSYKFWNMFSLIIGFLVPMLLLFGSQSGLLAS-LQPQIPFL 130

Query: 264 AASATLAPYLVGIVVQFAFEQYARYRKSPSWCAIPFIFQVYRLHQLNRAAQLVTALSSTV 323
           +++  L PY + + VQ   E    + +SP W   P +++ YR+ QL R   L   +++ V
Sbjct: 131 SSAVILFPYFILLAVQTLTEILTWHWQSPVWLVTPVVYEAYRILQLMRGLTLSAEVNAPV 190

Query: 324 RGAEM 328
               M
Sbjct: 191 WVVHM 195


>AT3G60590.1 | Symbols:  | unknown protein; LOCATED IN: chloroplast,
           chloroplast inner membrane, chloroplast envelope;
           EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT1G48460.1); Has 81 Blast
           hits to 81 proteins in 19 species: Archae - 0; Bacteria
           - 10; Metazoa - 0; Fungi - 0; Plants - 70; Viruses - 0;
           Other Eukaryotes - 1 (source: NCBI BLink). |
           chr3:22399043-22399753 FORWARD LENGTH=236
          Length = 236

 Score = 60.8 bits (146), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 50/185 (27%), Positives = 91/185 (49%), Gaps = 17/185 (9%)

Query: 154 LWFMGPAVLVASFIFPSLYMRKLLSIIFEDSLLTDFLILFFTEAIFYCGVSLFLFLLDHL 213
           +W +GP+VL+ S + P+L++   LS +F  S +   L L   + IF  G +LFL + D  
Sbjct: 18  IWLLGPSVLLTSGMAPTLWLP--LSSVFLGSNVVSLLSLIGLDCIFNLGATLFLLMADSC 75

Query: 214 RRPVQLDIVANNGNTLPPQMGQRISSVATLVLSLVIPMVTM-----GLVWPWTGP----- 263
            RP      + + N+ PP    +  ++ +L++  ++PM+ +     GL+     P     
Sbjct: 76  ARPKD---PSQSCNSKPP-FSYKFWNMFSLIIGFLVPMLLLFGSQSGLLAS-LQPQIPFL 130

Query: 264 AASATLAPYLVGIVVQFAFEQYARYRKSPSWCAIPFIFQVYRLHQLNRAAQLVTALSSTV 323
           +++  L PY + + VQ   E    + +SP W   P +++ YR+ QL R   L   +++ V
Sbjct: 131 SSAVILFPYFILLAVQTLTEILTWHWQSPVWLVTPVVYEAYRILQLMRGLTLSAEVNAPV 190

Query: 324 RGAEM 328
               M
Sbjct: 191 WVVHM 195
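

Each alignment block above carries its statistics on two summary lines ("Score = ..., Expect = ..." and "Identities = ..."). A minimal sketch of how those lines could be pulled out of a report in this legacy text layout is given below. The function name parse_summaries, the regular expressions, and the SAMPLE string are illustrative assumptions, not part of BLAST itself; the sample values are copied from the first hit in this report.

```python
import re

# Assumed patterns for the per-alignment summary lines of legacy BLASTP text output.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)
IDENT_RE = re.compile(r"Identities\s*=\s*(\d+)/(\d+)\s*\((\d+)%\)")


def parse_summaries(text):
    """Return one dict per alignment: subject id, bit score, raw score, E-value, identity."""
    hits = []
    current_id = None
    for line in text.splitlines():
        if line.startswith(">"):
            # The subject identifier is the text between '>' and the first '|'.
            current_id = line[1:].split("|")[0].strip()
            continue
        m = SCORE_RE.search(line)
        if m:
            hits.append({
                "id": current_id,
                "bits": float(m.group(1)),
                "raw_score": int(m.group(2)),
                "evalue": float(m.group(3)),
            })
            continue
        m = IDENT_RE.search(line)
        if m and hits:
            # Recompute the fraction from the counts; the printed percentage is rounded.
            hits[-1]["identity"] = int(m.group(1)) / int(m.group(2))
    return hits


# Hypothetical input: the header and summary lines of the first hit in this report.
SAMPLE = """>AT5G63040.1 | Symbols:  | unknown protein; FUNCTIONS IN:
          Length = 366

 Score =  311 bits (797), Expect = 5e-85,   Method: Compositional matrix adjust.
 Identities = 168/254 (66%), Positives = 203/254 (79%), Gaps = 9/254 (3%)
"""

hits = parse_summaries(SAMPLE)
```

The sketch only reads the summary lines and ignores the Query/Sbjct alignment rows. For anything beyond a quick check, an established parser is likely a safer choice than hand-written patterns.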