Miyakogusa Predicted Gene
- Lj1g3v1605280.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v1605280.1 Non Chatacterized Hit- tr|I3S152|I3S152_LOTJA
Uncharacterized protein OS=Lotus japonicus PE=2 SV=1,37.69,3e-17,
,CUFF.27569.1
(367 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G48460.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 341 5e-94
AT5G63040.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 131 6e-31
AT5G63040.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 131 7e-31
AT3G60590.2 | Symbols: | unknown protein; LOCATED IN: chloropla... 49 5e-06
AT3G60590.3 | Symbols: | unknown protein; BEST Arabidopsis thal... 49 5e-06
>AT1G48460.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast
envelope; EXPRESSED IN: 21 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G63040.1);
Has 60 Blast hits to 60 proteins in 14 species: Archae -
0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 60;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
| chr1:17911469-17913149 FORWARD LENGTH=340
Length = 340
Score = 341 bits (874), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 160/261 (61%), Positives = 198/261 (75%)
Query: 107 YNGVEPFRGKSGSVSFCGLTHQLVEEGKLESAPFNEEESSYFWLLGPAAFLSCLILPQFF 166
Y E FRGKSGSVSF GLTHQLVEE KL SAPF EE+ S+ W+L P +S LILPQFF
Sbjct: 80 YARAELFRGKSGSVSFNGLTHQLVEESKLVSAPFQEEKGSFLWVLAPVVLISSLILPQFF 139
Query: 167 VGNVVEAFFNDMILVDIVSSFTFEALFYIGLATFLHVVDRVQKPYLQFSSKRWGLITGLR 226
+ ++EA F + + +IV+SF FE +FY GLA FL V DRVQ+PYL FSSKRWGLITGLR
Sbjct: 140 LSGIIEATFKNDTVAEIVTSFCFETVFYAGLAIFLSVTDRVQRPYLDFSSKRWGLITGLR 199
Query: 227 GYLSSAFLTMGLKVVVPLLLLYAAWSVARLAVIVAVAPFLAGCVIQFAFEKYLDKRGSAC 286
GYL+SAFLTMGLKVVVP+ +Y W + ++AV PFL GC +Q FE L++RGS+C
Sbjct: 200 GYLTSAFLTMGLKVVVPVFAVYMTWPALGIDALIAVLPFLVGCAVQRVFEARLERRGSSC 259
Query: 287 WPLVPIIFEVYRLYQLTKAAHFVERLMFSLKGLPATPEILERSGALFAMIVSFQVLGIVC 346
WP+VPI+FEVYRLYQ+T+AA FV+RLMF +K T EI ER AL ++V+ Q L ++C
Sbjct: 260 WPIVPIVFEVYRLYQVTRAATFVQRLMFMMKDAATTAEITERGVALVGLVVTLQFLAVMC 319
Query: 347 LWSLMTFLVRLFPSRPVADHY 367
LWS +TFL+RLFPSRPV ++Y
Sbjct: 320 LWSFITFLMRLFPSRPVGENY 340
>AT5G63040.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 15
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT1G48460.1); Has 30201 Blast
hits to 17322 proteins in 780 species: Archae - 12;
Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants -
5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
BLink). | chr5:25288504-25290326 FORWARD LENGTH=366
Length = 366
Score = 131 bits (330), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 92/265 (34%), Positives = 138/265 (52%), Gaps = 14/265 (5%)
Query: 99 SSGVPVDAYNGVEPFR--GKSGSVSFCGLTHQLVEEGKLESAPFNEEESS----YFWLLG 152
+ P D + ++ R GK G +SF K E E S WL+G
Sbjct: 102 TEATPRDDDSTIQYNRNDGKPGFISFYN------PRNKTEDIIIPPETQSPWGRLLWLIG 155
Query: 153 PAAFLSCLILPQFFVGNVVEAFFNDMILVDIVSSFTFEALFYIGLATFLHVVDRVQKPYL 212
PA +S ILP ++ +V A F D +L D + F EALFY G+A FL ++DR +K
Sbjct: 156 PAVLVSSFILPPVYLRRIVSAVFEDSLLTDFLILFFTEALFYCGVAAFLLIIDRSRKGSG 215
Query: 213 QFSSKRWGLITGLRGYLSSAFLTMGLKVVVPLLLLYAAWSVARLAVIVAVAPFLAGCVIQ 272
+ R + G S+ T+ L +++P++ + W A +AP+L G V+Q
Sbjct: 216 KVPQNR--INPSQLGQRISSVATLVLSLMIPMVTMGFVWPWTGPAASATLAPYLVGIVVQ 273
Query: 273 FAFEKYLDKRGSACWPLVPIIFEVYRLYQLTKAAHFVERLMFSLKGLPATPEILERSGAL 332
FAFE+Y R S P++PIIF+VYRL+QL +AA V L F++KG AT L +L
Sbjct: 274 FAFEQYARYRNSPSSPIIPIIFQVYRLHQLNRAAQLVTALSFTVKGAEATVNNLAIKKSL 333
Query: 333 FAMIVSFQVLGIVCLWSLMTFLVRL 357
++ QVLG++ +WS+ +FL+ L
Sbjct: 334 GTLLNVIQVLGVISIWSISSFLMWL 358
>AT5G63040.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 15
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT1G48460.1); Has 35333 Blast
hits to 34131 proteins in 2444 species: Archae - 798;
Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
BLink). | chr5:25288950-25290326 FORWARD LENGTH=366
Length = 366
Score = 131 bits (330), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 92/265 (34%), Positives = 138/265 (52%), Gaps = 14/265 (5%)
Query: 99 SSGVPVDAYNGVEPFR--GKSGSVSFCGLTHQLVEEGKLESAPFNEEESS----YFWLLG 152
+ P D + ++ R GK G +SF K E E S WL+G
Sbjct: 102 TEATPRDDDSTIQYNRNDGKPGFISFYN------PRNKTEDIIIPPETQSPWGRLLWLIG 155
Query: 153 PAAFLSCLILPQFFVGNVVEAFFNDMILVDIVSSFTFEALFYIGLATFLHVVDRVQKPYL 212
PA +S ILP ++ +V A F D +L D + F EALFY G+A FL ++DR +K
Sbjct: 156 PAVLVSSFILPPVYLRRIVSAVFEDSLLTDFLILFFTEALFYCGVAAFLLIIDRSRKGSG 215
Query: 213 QFSSKRWGLITGLRGYLSSAFLTMGLKVVVPLLLLYAAWSVARLAVIVAVAPFLAGCVIQ 272
+ R + G S+ T+ L +++P++ + W A +AP+L G V+Q
Sbjct: 216 KVPQNR--INPSQLGQRISSVATLVLSLMIPMVTMGFVWPWTGPAASATLAPYLVGIVVQ 273
Query: 273 FAFEKYLDKRGSACWPLVPIIFEVYRLYQLTKAAHFVERLMFSLKGLPATPEILERSGAL 332
FAFE+Y R S P++PIIF+VYRL+QL +AA V L F++KG AT L +L
Sbjct: 274 FAFEQYARYRNSPSSPIIPIIFQVYRLHQLNRAAQLVTALSFTVKGAEATVNNLAIKKSL 333
Query: 333 FAMIVSFQVLGIVCLWSLMTFLVRL 357
++ QVLG++ +WS+ +FL+ L
Sbjct: 334 GTLLNVIQVLGVISIWSISSFLMWL 358
>AT3G60590.2 | Symbols: | unknown protein; LOCATED IN: chloroplast,
chloroplast inner membrane, chloroplast envelope;
EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT1G48460.1); Has 35333 Blast
hits to 34131 proteins in 2444 species: Archae - 798;
Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
BLink). | chr3:22398764-22399753 FORWARD LENGTH=329
Length = 329
Score = 49.3 bits (116), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 52/214 (24%), Positives = 90/214 (42%), Gaps = 41/214 (19%)
Query: 116 KSGSVSFCGLTHQLVEEGKLESAPFNEEESSY------FWLLGPAAFLSCLILPQFFVGN 169
K+ +V F + + EE K++ + S WLLGP+ L+ + P ++
Sbjct: 74 KTANV-FESIVSESAEEEKVDMSAQQRTNSQVQVLKWPIWLLGPSVLLTSGMAPTLWLP- 131
Query: 170 VVEAFFNDMILVDIVSSFTFEALFYIGLATFLHVVDRVQKPYLQ---------FSSKRWG 220
+ + F +V ++S + +F +G FL + D +P FS K W
Sbjct: 132 -LSSVFLGSNVVSLLSLIGLDCIFNLGATLFLLMADSCARPKDPSQSCNSKPPFSYKFWN 190
Query: 221 LITGLRGYLSSAFLTMGLKVVVPLLLLYAAWSVARLAVIVAVAPFLAGCVIQFAF----- 275
+ + + G+L VP+LLL+ + S LA + PFL+ VI F +
Sbjct: 191 MFSLIIGFL------------VPMLLLFGSQS-GLLASLQPQIPFLSSAVILFPYFILLA 237
Query: 276 -----EKYLDKRGSACWPLVPIIFEVYRLYQLTK 304
E S W + P+++E YR+ QL +
Sbjct: 238 VQTLTEILTWHWQSPVWLVTPVVYEAYRILQLMR 271
>AT3G60590.3 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G48460.1); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr3:22398228-22399753 FORWARD LENGTH=404
Length = 404
Score = 49.3 bits (116), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 52/215 (24%), Positives = 91/215 (42%), Gaps = 41/215 (19%)
Query: 115 GKSGSVSFCGLTHQLVEEGKLESAPFNEEESSY------FWLLGPAAFLSCLILPQFFVG 168
K+ +V F + + EE K++ + S WLLGP+ L+ + P ++
Sbjct: 148 DKTANV-FESIVSESAEEEKVDMSAQQRTNSQVQVLKWPIWLLGPSVLLTSGMAPTLWLP 206
Query: 169 NVVEAFFNDMILVDIVSSFTFEALFYIGLATFLHVVDRVQKPYLQ---------FSSKRW 219
+ + F +V ++S + +F +G FL + D +P FS K W
Sbjct: 207 --LSSVFLGSNVVSLLSLIGLDCIFNLGATLFLLMADSCARPKDPSQSCNSKPPFSYKFW 264
Query: 220 GLITGLRGYLSSAFLTMGLKVVVPLLLLYAAWSVARLAVIVAVAPFLAGCVIQFAFEKYL 279
+ + + G+L VP+LLL+ + S LA + PFL+ VI F + L
Sbjct: 265 NMFSLIIGFL------------VPMLLLFGSQS-GLLASLQPQIPFLSSAVILFPYFILL 311
Query: 280 DKRG----------SACWPLVPIIFEVYRLYQLTK 304
+ S W + P+++E YR+ QL +
Sbjct: 312 AVQTLTEILTWHWQSPVWLVTPVVYEAYRILQLMR 346