Miyakogusa Predicted Gene
- Lj1g3v0375290.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v0375290.1 tr|B9MVG9|B9MVG9_POPTR Predicted protein
OS=Populus trichocarpa GN=POPTRDRAFT_677769 PE=4
SV=1,29.56,0.000000000000002,seg,NULL,CUFF.25596.1
(373 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G63040.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 311 5e-85
AT5G63040.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 311 6e-85
AT1G48460.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 142 3e-34
AT3G60590.2 | Symbols: | unknown protein; LOCATED IN: chloropla... 63 3e-10
AT3G60590.3 | Symbols: | unknown protein; BEST Arabidopsis thal... 63 4e-10
AT3G60590.4 | Symbols: | unknown protein; LOCATED IN: chloropla... 61 1e-09
AT3G60590.1 | Symbols: | unknown protein; LOCATED IN: chloropla... 61 1e-09
>AT5G63040.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 15
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT1G48460.1); Has 30201 Blast
hits to 17322 proteins in 780 species: Archae - 12;
Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants -
5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
BLink). | chr5:25288504-25290326 FORWARD LENGTH=366
Length = 366
Score = 311 bits (797), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 168/254 (66%), Positives = 203/254 (79%), Gaps = 9/254 (3%)
Query: 112 SNSSYHLSGSDGKPGLISFYSRPYGRDNEVLLSNPEKSQN---SILWFMGPAVLVASFIF 168
+S+ + +DGKPG ISFY+ P + ++++ P ++Q+ +LW +GPAVLV+SFI
Sbjct: 109 DDSTIQYNRNDGKPGFISFYN-PRNKTEDIII--PPETQSPWGRLLWLIGPAVLVSSFIL 165
Query: 169 PSLYMRKLLSIIFEDSLLTDFLILFFTEAIFYCGVSLFLFLLDHLRRPVQLDIVANNGNT 228
P +Y+R+++S +FEDSLLTDFLILFFTEA+FYCGV+ FL ++D R+ + N N
Sbjct: 166 PPVYLRRIVSAVFEDSLLTDFLILFFTEALFYCGVAAFLLIIDRSRKGSG-KVPQNRIN- 223
Query: 229 LPPQMGQRISSVATLVLSLVIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARY 288
P Q+GQRISSVATLVLSL+IPMVTMG VWPWTGPAASATLAPYLVGIVVQFAFEQYARY
Sbjct: 224 -PSQLGQRISSVATLVLSLMIPMVTMGFVWPWTGPAASATLAPYLVGIVVQFAFEQYARY 282
Query: 289 RKSPSWCAIPFIFQVYRLHQLNRAAQLVTALSSTVRGAEMTSHNMAIISSLGTLLNVLQF 348
R SPS IP IFQVYRLHQLNRAAQLVTALS TV+GAE T +N+AI SLGTLLNV+Q
Sbjct: 283 RNSPSSPIIPIIFQVYRLHQLNRAAQLVTALSFTVKGAEATVNNLAIKKSLGTLLNVIQV 342
Query: 349 LGVICIWSLSSFLM 362
LGVI IWS+SSFLM
Sbjct: 343 LGVISIWSISSFLM 356
>AT5G63040.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 15
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT1G48460.1); Has 35333 Blast
hits to 34131 proteins in 2444 species: Archae - 798;
Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
BLink). | chr5:25288950-25290326 FORWARD LENGTH=366
Length = 366
Score = 311 bits (796), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 168/254 (66%), Positives = 203/254 (79%), Gaps = 9/254 (3%)
Query: 112 SNSSYHLSGSDGKPGLISFYSRPYGRDNEVLLSNPEKSQN---SILWFMGPAVLVASFIF 168
+S+ + +DGKPG ISFY+ P + ++++ P ++Q+ +LW +GPAVLV+SFI
Sbjct: 109 DDSTIQYNRNDGKPGFISFYN-PRNKTEDIII--PPETQSPWGRLLWLIGPAVLVSSFIL 165
Query: 169 PSLYMRKLLSIIFEDSLLTDFLILFFTEAIFYCGVSLFLFLLDHLRRPVQLDIVANNGNT 228
P +Y+R+++S +FEDSLLTDFLILFFTEA+FYCGV+ FL ++D R+ + N N
Sbjct: 166 PPVYLRRIVSAVFEDSLLTDFLILFFTEALFYCGVAAFLLIIDRSRKGSG-KVPQNRIN- 223
Query: 229 LPPQMGQRISSVATLVLSLVIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARY 288
P Q+GQRISSVATLVLSL+IPMVTMG VWPWTGPAASATLAPYLVGIVVQFAFEQYARY
Sbjct: 224 -PSQLGQRISSVATLVLSLMIPMVTMGFVWPWTGPAASATLAPYLVGIVVQFAFEQYARY 282
Query: 289 RKSPSWCAIPFIFQVYRLHQLNRAAQLVTALSSTVRGAEMTSHNMAIISSLGTLLNVLQF 348
R SPS IP IFQVYRLHQLNRAAQLVTALS TV+GAE T +N+AI SLGTLLNV+Q
Sbjct: 283 RNSPSSPIIPIIFQVYRLHQLNRAAQLVTALSFTVKGAEATVNNLAIKKSLGTLLNVIQV 342
Query: 349 LGVICIWSLSSFLM 362
LGVI IWS+SSFLM
Sbjct: 343 LGVISIWSISSFLM 356
>AT1G48460.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast
envelope; EXPRESSED IN: 21 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G63040.1);
Has 60 Blast hits to 60 proteins in 14 species: Archae -
0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 60;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
| chr1:17911469-17913149 FORWARD LENGTH=340
Length = 340
Score = 142 bits (358), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 87/286 (30%), Positives = 148/286 (51%), Gaps = 4/286 (1%)
Query: 84 DQSEHSELEIAATVPENNNNIVQNFSPASNSSYHLSGSDGKPGLISFYSRPYGRDNE-VL 142
++ + S + +A + ++++ + S SN + GK G +SF + E L
Sbjct: 50 NKPQLSRVRVACSSSQSDSRPEKKQSDKSNYA-RAELFRGKSGSVSFNGLTHQLVEESKL 108
Query: 143 LSNP-EKSQNSILWFMGPAVLVASFIFPSLYMRKLLSIIFEDSLLTDFLILFFTEAIFYC 201
+S P ++ + S LW + P VL++S I P ++ ++ F++ + + + F E +FY
Sbjct: 109 VSAPFQEEKGSFLWVLAPVVLISSLILPQFFLSGIIEATFKNDTVAEIVTSFCFETVFYA 168
Query: 202 GVSLFLFLLDHLRRPVQLDIVANNGNTLPPQMGQRISSVATLVLSLVIPMVTMGLVWPWT 261
G+++FL + D ++RP LD + + G S+ T+ L +V+P+ + + WP
Sbjct: 169 GLAIFLSVTDRVQRP-YLDFSSKRWGLITGLRGYLTSAFLTMGLKVVVPVFAVYMTWPAL 227
Query: 262 GPAASATLAPYLVGIVVQFAFEQYARYRKSPSWCAIPFIFQVYRLHQLNRAAQLVTALSS 321
G A + P+LVG VQ FE R S W +P +F+VYRL+Q+ RAA V L
Sbjct: 228 GIDALIAVLPFLVGCAVQRVFEARLERRGSSCWPIVPIVFEVYRLYQVTRAATFVQRLMF 287
Query: 322 TVRGAEMTSHNMAIISSLGTLLNVLQFLGVICIWSLSSFLMRFIPS 367
++ A T+ +L L+ LQFL V+C+WS +FLMR PS
Sbjct: 288 MMKDAATTAEITERGVALVGLVVTLQFLAVMCLWSFITFLMRLFPS 333
>AT3G60590.2 | Symbols: | unknown protein; LOCATED IN: chloroplast,
chloroplast inner membrane, chloroplast envelope;
EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT1G48460.1); Has 35333 Blast
hits to 34131 proteins in 2444 species: Archae - 798;
Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
BLink). | chr3:22398764-22399753 FORWARD LENGTH=329
Length = 329
Score = 62.8 bits (151), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 52/184 (28%), Positives = 91/184 (49%), Gaps = 15/184 (8%)
Query: 154 LWFMGPAVLVASFIFPSLYMRKLLSIIFEDSLLTDFLILFFTEAIFYCGVSLFLFLLDHL 213
+W +GP+VL+ S + P+L++ LS +F S + L L + IF G +LFL + D
Sbjct: 111 IWLLGPSVLLTSGMAPTLWLP--LSSVFLGSNVVSLLSLIGLDCIFNLGATLFLLMADSC 168
Query: 214 RRPVQLDIVANNGNTLPPQMGQRISSVATLVLSLVIPMVTM-----GLV---WPWTGPAA 265
RP + + N+ PP + ++ +L++ ++PM+ + GL+ P +
Sbjct: 169 ARPKD---PSQSCNSKPP-FSYKFWNMFSLIIGFLVPMLLLFGSQSGLLASLQPQIPFLS 224
Query: 266 SAT-LAPYLVGIVVQFAFEQYARYRKSPSWCAIPFIFQVYRLHQLNRAAQLVTALSSTVR 324
SA L PY + + VQ E + +SP W P +++ YR+ QL R L +++ V
Sbjct: 225 SAVILFPYFILLAVQTLTEILTWHWQSPVWLVTPVVYEAYRILQLMRGLTLSAEVNAPVW 284
Query: 325 GAEM 328
M
Sbjct: 285 VVHM 288
>AT3G60590.3 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G48460.1); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr3:22398228-22399753 FORWARD LENGTH=404
Length = 404
Score = 62.8 bits (151), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 52/184 (28%), Positives = 91/184 (49%), Gaps = 15/184 (8%)
Query: 154 LWFMGPAVLVASFIFPSLYMRKLLSIIFEDSLLTDFLILFFTEAIFYCGVSLFLFLLDHL 213
+W +GP+VL+ S + P+L++ LS +F S + L L + IF G +LFL + D
Sbjct: 186 IWLLGPSVLLTSGMAPTLWLP--LSSVFLGSNVVSLLSLIGLDCIFNLGATLFLLMADSC 243
Query: 214 RRPVQLDIVANNGNTLPPQMGQRISSVATLVLSLVIPMVTM-----GLV---WPWTGPAA 265
RP + + N+ PP + ++ +L++ ++PM+ + GL+ P +
Sbjct: 244 ARPKD---PSQSCNSKPP-FSYKFWNMFSLIIGFLVPMLLLFGSQSGLLASLQPQIPFLS 299
Query: 266 SAT-LAPYLVGIVVQFAFEQYARYRKSPSWCAIPFIFQVYRLHQLNRAAQLVTALSSTVR 324
SA L PY + + VQ E + +SP W P +++ YR+ QL R L +++ V
Sbjct: 300 SAVILFPYFILLAVQTLTEILTWHWQSPVWLVTPVVYEAYRILQLMRGLTLSAEVNAPVW 359
Query: 325 GAEM 328
M
Sbjct: 360 VVHM 363
>AT3G60590.4 | Symbols: | unknown protein; LOCATED IN: chloroplast
inner membrane; EXPRESSED IN: 23 plant structures;
EXPRESSED DURING: 14 growth stages; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G48460.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr3:22399043-22399753 FORWARD LENGTH=236
Length = 236
Score = 60.8 bits (146), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 50/185 (27%), Positives = 91/185 (49%), Gaps = 17/185 (9%)
Query: 154 LWFMGPAVLVASFIFPSLYMRKLLSIIFEDSLLTDFLILFFTEAIFYCGVSLFLFLLDHL 213
+W +GP+VL+ S + P+L++ LS +F S + L L + IF G +LFL + D
Sbjct: 18 IWLLGPSVLLTSGMAPTLWLP--LSSVFLGSNVVSLLSLIGLDCIFNLGATLFLLMADSC 75
Query: 214 RRPVQLDIVANNGNTLPPQMGQRISSVATLVLSLVIPMVTM-----GLVWPWTGP----- 263
RP + + N+ PP + ++ +L++ ++PM+ + GL+ P
Sbjct: 76 ARPKD---PSQSCNSKPP-FSYKFWNMFSLIIGFLVPMLLLFGSQSGLLAS-LQPQIPFL 130
Query: 264 AASATLAPYLVGIVVQFAFEQYARYRKSPSWCAIPFIFQVYRLHQLNRAAQLVTALSSTV 323
+++ L PY + + VQ E + +SP W P +++ YR+ QL R L +++ V
Sbjct: 131 SSAVILFPYFILLAVQTLTEILTWHWQSPVWLVTPVVYEAYRILQLMRGLTLSAEVNAPV 190
Query: 324 RGAEM 328
M
Sbjct: 191 WVVHM 195
>AT3G60590.1 | Symbols: | unknown protein; LOCATED IN: chloroplast,
chloroplast inner membrane, chloroplast envelope;
EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT1G48460.1); Has 81 Blast
hits to 81 proteins in 19 species: Archae - 0; Bacteria
- 10; Metazoa - 0; Fungi - 0; Plants - 70; Viruses - 0;
Other Eukaryotes - 1 (source: NCBI BLink). |
chr3:22399043-22399753 FORWARD LENGTH=236
Length = 236
Score = 60.8 bits (146), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 50/185 (27%), Positives = 91/185 (49%), Gaps = 17/185 (9%)
Query: 154 LWFMGPAVLVASFIFPSLYMRKLLSIIFEDSLLTDFLILFFTEAIFYCGVSLFLFLLDHL 213
+W +GP+VL+ S + P+L++ LS +F S + L L + IF G +LFL + D
Sbjct: 18 IWLLGPSVLLTSGMAPTLWLP--LSSVFLGSNVVSLLSLIGLDCIFNLGATLFLLMADSC 75
Query: 214 RRPVQLDIVANNGNTLPPQMGQRISSVATLVLSLVIPMVTM-----GLVWPWTGP----- 263
RP + + N+ PP + ++ +L++ ++PM+ + GL+ P
Sbjct: 76 ARPKD---PSQSCNSKPP-FSYKFWNMFSLIIGFLVPMLLLFGSQSGLLAS-LQPQIPFL 130
Query: 264 AASATLAPYLVGIVVQFAFEQYARYRKSPSWCAIPFIFQVYRLHQLNRAAQLVTALSSTV 323
+++ L PY + + VQ E + +SP W P +++ YR+ QL R L +++ V
Sbjct: 131 SSAVILFPYFILLAVQTLTEILTWHWQSPVWLVTPVVYEAYRILQLMRGLTLSAEVNAPV 190
Query: 324 RGAEM 328
M
Sbjct: 191 WVVHM 195