Miyakogusa Predicted Gene

Lj1g3v1605280.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v1605280.1 Non Chatacterized Hit- tr|I3S152|I3S152_LOTJA
Uncharacterized protein OS=Lotus japonicus PE=2 SV=1,37.69,3e-17,
,CUFF.27569.1
         (367 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G48460.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   341   5e-94
AT5G63040.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   131   6e-31
AT5G63040.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   131   7e-31
AT3G60590.2 | Symbols:  | unknown protein; LOCATED IN: chloropla...    49   5e-06
AT3G60590.3 | Symbols:  | unknown protein; BEST Arabidopsis thal...    49   5e-06

>AT1G48460.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast
           envelope; EXPRESSED IN: 21 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G63040.1);
           Has 60 Blast hits to 60 proteins in 14 species: Archae -
           0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 60;
           Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
           | chr1:17911469-17913149 FORWARD LENGTH=340
          Length = 340

 Score =  341 bits (874), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 160/261 (61%), Positives = 198/261 (75%)

Query: 107 YNGVEPFRGKSGSVSFCGLTHQLVEEGKLESAPFNEEESSYFWLLGPAAFLSCLILPQFF 166
           Y   E FRGKSGSVSF GLTHQLVEE KL SAPF EE+ S+ W+L P   +S LILPQFF
Sbjct: 80  YARAELFRGKSGSVSFNGLTHQLVEESKLVSAPFQEEKGSFLWVLAPVVLISSLILPQFF 139

Query: 167 VGNVVEAFFNDMILVDIVSSFTFEALFYIGLATFLHVVDRVQKPYLQFSSKRWGLITGLR 226
           +  ++EA F +  + +IV+SF FE +FY GLA FL V DRVQ+PYL FSSKRWGLITGLR
Sbjct: 140 LSGIIEATFKNDTVAEIVTSFCFETVFYAGLAIFLSVTDRVQRPYLDFSSKRWGLITGLR 199

Query: 227 GYLSSAFLTMGLKVVVPLLLLYAAWSVARLAVIVAVAPFLAGCVIQFAFEKYLDKRGSAC 286
           GYL+SAFLTMGLKVVVP+  +Y  W    +  ++AV PFL GC +Q  FE  L++RGS+C
Sbjct: 200 GYLTSAFLTMGLKVVVPVFAVYMTWPALGIDALIAVLPFLVGCAVQRVFEARLERRGSSC 259

Query: 287 WPLVPIIFEVYRLYQLTKAAHFVERLMFSLKGLPATPEILERSGALFAMIVSFQVLGIVC 346
           WP+VPI+FEVYRLYQ+T+AA FV+RLMF +K    T EI ER  AL  ++V+ Q L ++C
Sbjct: 260 WPIVPIVFEVYRLYQVTRAATFVQRLMFMMKDAATTAEITERGVALVGLVVTLQFLAVMC 319

Query: 347 LWSLMTFLVRLFPSRPVADHY 367
           LWS +TFL+RLFPSRPV ++Y
Sbjct: 320 LWSFITFLMRLFPSRPVGENY 340


>AT5G63040.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast;
           EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 15
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT1G48460.1); Has 30201 Blast
           hits to 17322 proteins in 780 species: Archae - 12;
           Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants -
           5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
           BLink). | chr5:25288504-25290326 FORWARD LENGTH=366
          Length = 366

 Score =  131 bits (330), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 92/265 (34%), Positives = 138/265 (52%), Gaps = 14/265 (5%)

Query: 99  SSGVPVDAYNGVEPFR--GKSGSVSFCGLTHQLVEEGKLESAPFNEEESS----YFWLLG 152
           +   P D  + ++  R  GK G +SF           K E      E  S      WL+G
Sbjct: 102 TEATPRDDDSTIQYNRNDGKPGFISFYN------PRNKTEDIIIPPETQSPWGRLLWLIG 155

Query: 153 PAAFLSCLILPQFFVGNVVEAFFNDMILVDIVSSFTFEALFYIGLATFLHVVDRVQKPYL 212
           PA  +S  ILP  ++  +V A F D +L D +  F  EALFY G+A FL ++DR +K   
Sbjct: 156 PAVLVSSFILPPVYLRRIVSAVFEDSLLTDFLILFFTEALFYCGVAAFLLIIDRSRKGSG 215

Query: 213 QFSSKRWGLITGLRGYLSSAFLTMGLKVVVPLLLLYAAWSVARLAVIVAVAPFLAGCVIQ 272
           +    R  +     G   S+  T+ L +++P++ +   W     A    +AP+L G V+Q
Sbjct: 216 KVPQNR--INPSQLGQRISSVATLVLSLMIPMVTMGFVWPWTGPAASATLAPYLVGIVVQ 273

Query: 273 FAFEKYLDKRGSACWPLVPIIFEVYRLYQLTKAAHFVERLMFSLKGLPATPEILERSGAL 332
           FAFE+Y   R S   P++PIIF+VYRL+QL +AA  V  L F++KG  AT   L    +L
Sbjct: 274 FAFEQYARYRNSPSSPIIPIIFQVYRLHQLNRAAQLVTALSFTVKGAEATVNNLAIKKSL 333

Query: 333 FAMIVSFQVLGIVCLWSLMTFLVRL 357
             ++   QVLG++ +WS+ +FL+ L
Sbjct: 334 GTLLNVIQVLGVISIWSISSFLMWL 358


>AT5G63040.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast;
           EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 15
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT1G48460.1); Has 35333 Blast
           hits to 34131 proteins in 2444 species: Archae - 798;
           Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
           531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
           BLink). | chr5:25288950-25290326 FORWARD LENGTH=366
          Length = 366

 Score =  131 bits (330), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 92/265 (34%), Positives = 138/265 (52%), Gaps = 14/265 (5%)

Query: 99  SSGVPVDAYNGVEPFR--GKSGSVSFCGLTHQLVEEGKLESAPFNEEESS----YFWLLG 152
           +   P D  + ++  R  GK G +SF           K E      E  S      WL+G
Sbjct: 102 TEATPRDDDSTIQYNRNDGKPGFISFYN------PRNKTEDIIIPPETQSPWGRLLWLIG 155

Query: 153 PAAFLSCLILPQFFVGNVVEAFFNDMILVDIVSSFTFEALFYIGLATFLHVVDRVQKPYL 212
           PA  +S  ILP  ++  +V A F D +L D +  F  EALFY G+A FL ++DR +K   
Sbjct: 156 PAVLVSSFILPPVYLRRIVSAVFEDSLLTDFLILFFTEALFYCGVAAFLLIIDRSRKGSG 215

Query: 213 QFSSKRWGLITGLRGYLSSAFLTMGLKVVVPLLLLYAAWSVARLAVIVAVAPFLAGCVIQ 272
           +    R  +     G   S+  T+ L +++P++ +   W     A    +AP+L G V+Q
Sbjct: 216 KVPQNR--INPSQLGQRISSVATLVLSLMIPMVTMGFVWPWTGPAASATLAPYLVGIVVQ 273

Query: 273 FAFEKYLDKRGSACWPLVPIIFEVYRLYQLTKAAHFVERLMFSLKGLPATPEILERSGAL 332
           FAFE+Y   R S   P++PIIF+VYRL+QL +AA  V  L F++KG  AT   L    +L
Sbjct: 274 FAFEQYARYRNSPSSPIIPIIFQVYRLHQLNRAAQLVTALSFTVKGAEATVNNLAIKKSL 333

Query: 333 FAMIVSFQVLGIVCLWSLMTFLVRL 357
             ++   QVLG++ +WS+ +FL+ L
Sbjct: 334 GTLLNVIQVLGVISIWSISSFLMWL 358


>AT3G60590.2 | Symbols:  | unknown protein; LOCATED IN: chloroplast,
           chloroplast inner membrane, chloroplast envelope;
           EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT1G48460.1); Has 35333 Blast
           hits to 34131 proteins in 2444 species: Archae - 798;
           Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
           531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
           BLink). | chr3:22398764-22399753 FORWARD LENGTH=329
          Length = 329

 Score = 49.3 bits (116), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 52/214 (24%), Positives = 90/214 (42%), Gaps = 41/214 (19%)

Query: 116 KSGSVSFCGLTHQLVEEGKLESAPFNEEESSY------FWLLGPAAFLSCLILPQFFVGN 169
           K+ +V F  +  +  EE K++ +      S         WLLGP+  L+  + P  ++  
Sbjct: 74  KTANV-FESIVSESAEEEKVDMSAQQRTNSQVQVLKWPIWLLGPSVLLTSGMAPTLWLP- 131

Query: 170 VVEAFFNDMILVDIVSSFTFEALFYIGLATFLHVVDRVQKPYLQ---------FSSKRWG 220
            + + F    +V ++S    + +F +G   FL + D   +P            FS K W 
Sbjct: 132 -LSSVFLGSNVVSLLSLIGLDCIFNLGATLFLLMADSCARPKDPSQSCNSKPPFSYKFWN 190

Query: 221 LITGLRGYLSSAFLTMGLKVVVPLLLLYAAWSVARLAVIVAVAPFLAGCVIQFAF----- 275
           + + + G+L            VP+LLL+ + S   LA +    PFL+  VI F +     
Sbjct: 191 MFSLIIGFL------------VPMLLLFGSQS-GLLASLQPQIPFLSSAVILFPYFILLA 237

Query: 276 -----EKYLDKRGSACWPLVPIIFEVYRLYQLTK 304
                E       S  W + P+++E YR+ QL +
Sbjct: 238 VQTLTEILTWHWQSPVWLVTPVVYEAYRILQLMR 271


>AT3G60590.3 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G48460.1); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr3:22398228-22399753 FORWARD LENGTH=404
          Length = 404

 Score = 49.3 bits (116), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 52/215 (24%), Positives = 91/215 (42%), Gaps = 41/215 (19%)

Query: 115 GKSGSVSFCGLTHQLVEEGKLESAPFNEEESSY------FWLLGPAAFLSCLILPQFFVG 168
            K+ +V F  +  +  EE K++ +      S         WLLGP+  L+  + P  ++ 
Sbjct: 148 DKTANV-FESIVSESAEEEKVDMSAQQRTNSQVQVLKWPIWLLGPSVLLTSGMAPTLWLP 206

Query: 169 NVVEAFFNDMILVDIVSSFTFEALFYIGLATFLHVVDRVQKPYLQ---------FSSKRW 219
             + + F    +V ++S    + +F +G   FL + D   +P            FS K W
Sbjct: 207 --LSSVFLGSNVVSLLSLIGLDCIFNLGATLFLLMADSCARPKDPSQSCNSKPPFSYKFW 264

Query: 220 GLITGLRGYLSSAFLTMGLKVVVPLLLLYAAWSVARLAVIVAVAPFLAGCVIQFAFEKYL 279
            + + + G+L            VP+LLL+ + S   LA +    PFL+  VI F +   L
Sbjct: 265 NMFSLIIGFL------------VPMLLLFGSQS-GLLASLQPQIPFLSSAVILFPYFILL 311

Query: 280 DKRG----------SACWPLVPIIFEVYRLYQLTK 304
             +           S  W + P+++E YR+ QL +
Sbjct: 312 AVQTLTEILTWHWQSPVWLVTPVVYEAYRILQLMR 346