Miyakogusa Predicted Gene
- Lj0g3v0324599.2
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0324599.2 Non Characterized Hit- tr|I0Z1V0|I0Z1V0_9CHLO
Uncharacterized protein OS=Coccomyxa subellipsoidea
C-,35.64,6e-19,seg,NULL; SUBFAMILY NOT NAMED,NULL; FAMILY NOT
NAMED,NULL,CUFF.22081.2
(202 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                   Score     E
Sequences producing significant alignments:                       (bits)   Value

AT5G13500.3 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   322   2e-88
AT5G13500.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   322   2e-88
AT5G13500.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   298   2e-81
AT5G25265.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   298   2e-81
AT2G25260.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   282   8e-77
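
The summary lines above can be read programmatically. The following is a minimal sketch, assuming each hit line starts with the TAIR identifier and ends with the bit score and E-value separated by whitespace. The names HIT_LINE and parse_summary are hypothetical helpers written for this illustration and are not part of BLAST.

```python
import re

# Hit summary lines copied from the report above.
SUMMARY = """\
AT5G13500.3 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 322 2e-88
AT5G13500.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 322 2e-88
AT5G13500.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 322 2e-88
AT5G25265.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 298 2e-81
AT2G25260.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 282 8e-77
"""

# Hypothetical pattern: identifier, free-text description, bit score, E-value.
HIT_LINE = re.compile(
    r"^(?P<id>\S+)\s+\|.*?\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\d\S*)$"
)


def parse_summary(text):
    """Return a list of (identifier, bit_score, e_value) tuples."""
    hits = []
    for line in text.splitlines():
        match = HIT_LINE.match(line.strip())
        if match:
            hits.append(
                (match["id"], float(match["bits"]), float(match["evalue"]))
            )
    return hits


hits = parse_summary(SUMMARY)

# Collapse splice variants (.1, .2, .3) to their gene locus.
loci = sorted({identifier.rsplit(".", 1)[0] for identifier, _, _ in hits})
print(len(hits), "hits at", len(loci), "loci:", loci)
```

Collapsing the splice variants shows that the five hits correspond to three Arabidopsis loci.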
>AT5G13500.3 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: 22 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G25265.1);
Has 35333 Blast hits to 34131 proteins in 2444 species:
Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi -
991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610
(source: NCBI BLink). | chr5:4338676-4340827 FORWARD
LENGTH=358
Length = 358
Score = 322 bits (824), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 156/201 (77%), Positives = 164/201 (81%)
Query: 1 MAEPDHVFVRPLPNLAYGENPAAFPFFYIRPDQNEKIIRKYYPEEKGPVTNIDPIGNSPV 60
MAEPDHVFV PLPNLA G PAAFPFFYI P++ E I+RKYYP E GPVTNIDPIGNSPV
Sbjct: 157 MAEPDHVFVNPLPNLAVGGFPAAFPFFYITPEKYENIVRKYYPAEMGPVTNIDPIGNSPV 216
Query: 61 IIKTDVIAKIAPTWMNVSLKMKEDPETDKAFGWVLEMYAYAIASALHGVRHILRKDFMLQ 120
II + + KIAPTWMNVSL MK DPETDKAFGWVLEMY YAIASA+HGVRHILRKDFMLQ
Sbjct: 217 IISKESLEKIAPTWMNVSLTMKNDPETDKAFGWVLEMYGYAIASAIHGVRHILRKDFMLQ 276
Query: 121 PPWDLETHNKYIIHYTYGCDYNLKGELTYGKIGEWRFDKRSHXXXXXXXXXXXXXXXXXE 180
PPWDL T K+IIHYTYGCDYN+KGELTYGKIGEWRFDKRSH E
Sbjct: 277 PPWDLSTKGKFIIHYTYGCDYNMKGELTYGKIGEWRFDKRSHLRGPPPRNMSLPPPGVPE 336
Query: 181 SVVTLVKMVNEATANIPNWDT 201
SVVTLVKMVNEATA IPNWDT
Sbjct: 337 SVVTLVKMVNEATATIPNWDT 357
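
As a sanity check on the Expect value of this hit, the standard relation between bit score and E-value, E = m * n * 2^(-S'), can be evaluated with the numbers printed in this report (query length 202, database size 14,482,855 letters, bit score 322). This is a rough sketch only: it assumes the raw lengths, whereas BLAST applies an effective length correction and reports a rounded bit score, so the result is expected to match the reported 2e-88 in order of magnitude rather than exactly. The function name expect_from_bits is a hypothetical helper.

```python
def expect_from_bits(bit_score, query_len, db_letters):
    """Approximate E-value from a bit score: E = m * n * 2**(-S').

    Uses raw lengths (an assumption of this sketch); BLAST itself uses
    effective lengths, so its reported value is somewhat smaller.
    """
    return query_len * db_letters * 2.0 ** (-bit_score)


# Values taken from the report: 202 letters, 14,482,855 total letters, 322 bits.
estimate = expect_from_bits(322, 202, 14_482_855)
print(f"estimated E-value: {estimate:.1e} (report: 2e-88)")
```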
>AT5G13500.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: 22 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G25265.1);
Has 228 Blast hits to 200 proteins in 21 species: Archae
- 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 213;
Viruses - 0; Other Eukaryotes - 15 (source: NCBI BLink).
| chr5:4338676-4340827 FORWARD LENGTH=358
Length = 358
Score = 322 bits (824), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 156/201 (77%), Positives = 164/201 (81%)
Query: 1 MAEPDHVFVRPLPNLAYGENPAAFPFFYIRPDQNEKIIRKYYPEEKGPVTNIDPIGNSPV 60
MAEPDHVFV PLPNLA G PAAFPFFYI P++ E I+RKYYP E GPVTNIDPIGNSPV
Sbjct: 157 MAEPDHVFVNPLPNLAVGGFPAAFPFFYITPEKYENIVRKYYPAEMGPVTNIDPIGNSPV 216
Query: 61 IIKTDVIAKIAPTWMNVSLKMKEDPETDKAFGWVLEMYAYAIASALHGVRHILRKDFMLQ 120
II + + KIAPTWMNVSL MK DPETDKAFGWVLEMY YAIASA+HGVRHILRKDFMLQ
Sbjct: 217 IISKESLEKIAPTWMNVSLTMKNDPETDKAFGWVLEMYGYAIASAIHGVRHILRKDFMLQ 276
Query: 121 PPWDLETHNKYIIHYTYGCDYNLKGELTYGKIGEWRFDKRSHXXXXXXXXXXXXXXXXXE 180
PPWDL T K+IIHYTYGCDYN+KGELTYGKIGEWRFDKRSH E
Sbjct: 277 PPWDLSTKGKFIIHYTYGCDYNMKGELTYGKIGEWRFDKRSHLRGPPPRNMSLPPPGVPE 336
Query: 181 SVVTLVKMVNEATANIPNWDT 201
SVVTLVKMVNEATA IPNWDT
Sbjct: 337 SVVTLVKMVNEATATIPNWDT 357
>AT5G13500.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: 22 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G25265.1);
Has 1807 Blast hits to 1807 proteins in 277 species:
Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347;
Plants - 385; Viruses - 0; Other Eukaryotes - 339
(source: NCBI BLink). | chr5:4338676-4340827 FORWARD
LENGTH=358
Length = 358
Score = 322 bits (824), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 156/201 (77%), Positives = 164/201 (81%)
Query: 1 MAEPDHVFVRPLPNLAYGENPAAFPFFYIRPDQNEKIIRKYYPEEKGPVTNIDPIGNSPV 60
MAEPDHVFV PLPNLA G PAAFPFFYI P++ E I+RKYYP E GPVTNIDPIGNSPV
Sbjct: 157 MAEPDHVFVNPLPNLAVGGFPAAFPFFYITPEKYENIVRKYYPAEMGPVTNIDPIGNSPV 216
Query: 61 IIKTDVIAKIAPTWMNVSLKMKEDPETDKAFGWVLEMYAYAIASALHGVRHILRKDFMLQ 120
II + + KIAPTWMNVSL MK DPETDKAFGWVLEMY YAIASA+HGVRHILRKDFMLQ
Sbjct: 217 IISKESLEKIAPTWMNVSLTMKNDPETDKAFGWVLEMYGYAIASAIHGVRHILRKDFMLQ 276
Query: 121 PPWDLETHNKYIIHYTYGCDYNLKGELTYGKIGEWRFDKRSHXXXXXXXXXXXXXXXXXE 180
PPWDL T K+IIHYTYGCDYN+KGELTYGKIGEWRFDKRSH E
Sbjct: 277 PPWDLSTKGKFIIHYTYGCDYNMKGELTYGKIGEWRFDKRSHLRGPPPRNMSLPPPGVPE 336
Query: 181 SVVTLVKMVNEATANIPNWDT 201
SVVTLVKMVNEATA IPNWDT
Sbjct: 337 SVVTLVKMVNEATATIPNWDT 357
>AT5G25265.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: plasma membrane,
membrane; EXPRESSED IN: cultured cell, leaf; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT2G25260.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr5:8754794-8756855 REVERSE LENGTH=366
Length = 366
Score = 298 bits (762), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 139/201 (69%), Positives = 163/201 (81%)
Query: 1 MAEPDHVFVRPLPNLAYGENPAAFPFFYIRPDQNEKIIRKYYPEEKGPVTNIDPIGNSPV 60
M+EPDH+ V+P+PNLA AAFPFFYI P + EK++RKYYPE +GPVTNIDPIGNSPV
Sbjct: 166 MSEPDHIIVKPIPNLAKDGLGAAFPFFYIEPKKYEKVLRKYYPEVRGPVTNIDPIGNSPV 225
Query: 61 IIKTDVIAKIAPTWMNVSLKMKEDPETDKAFGWVLEMYAYAIASALHGVRHILRKDFMLQ 120
I+ D + KIAPTWMNVSL MK+DPE DKAFGWVLEMYAYA++SALHGV +IL KDFM+Q
Sbjct: 226 IVGKDALKKIAPTWMNVSLAMKKDPEADKAFGWVLEMYAYAVSSALHGVSNILHKDFMIQ 285
Query: 121 PPWDLETHNKYIIHYTYGCDYNLKGELTYGKIGEWRFDKRSHXXXXXXXXXXXXXXXXXE 180
PPWD+E +KYIIHYTYGCDY++KG+LTYGKIGEWRFDKRS+ +
Sbjct: 286 PPWDIEVGDKYIIHYTYGCDYDMKGKLTYGKIGEWRFDKRSYDSKPPPRNLTMPPPGVSQ 345
Query: 181 SVVTLVKMVNEATANIPNWDT 201
SVVTLVKM+NEATANIPNW +
Sbjct: 346 SVVTLVKMINEATANIPNWGS 366
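
The run of X characters in the Query rows for positions 121 to 180 is most likely low-complexity masking of the query (the query annotation lists a seg feature), which is why the middle line is blank across that stretch. A minimal sketch for locating such masked runs in query coordinates follows; masked_runs is a hypothetical helper, and the segment string is copied from the alignment above.

```python
import re


def masked_runs(segment, start):
    """Return (first, last) 1-based query positions of each run of X.

    `segment` is the residue string from one Query row and `start` is the
    coordinate printed at the beginning of that row. Assumes the row
    contains no gap characters.
    """
    return [
        (start + m.start(), start + m.end() - 1)
        for m in re.finditer(r"X+", segment)
    ]


# Query row covering positions 121..180, copied from the report.
query_row = "PPWDLETHNKYIIHYTYGCDYNLKGELTYGKIGEWRFDKRSHXXXXXXXXXXXXXXXXXE"
runs = masked_runs(query_row, 121)
print(runs)
```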
>AT2G25260.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT5G25265.1); Has 35333 Blast hits
to 34131 proteins in 2444 species: Archae - 798;
Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
BLink). | chr2:10755617-10757766 REVERSE LENGTH=358
Length = 358
Score = 282 bits (722), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 129/201 (64%), Positives = 158/201 (78%)
Query: 1 MAEPDHVFVRPLPNLAYGENPAAFPFFYIRPDQNEKIIRKYYPEEKGPVTNIDPIGNSPV 60
MAEPDH+ V+P+PNLA G AAFPFFYI P + E ++RK++P+E GP++ IDPIGNSPV
Sbjct: 158 MAEPDHIIVKPIPNLARGNLAAAFPFFYIEPKKYESVLRKFFPKENGPISRIDPIGNSPV 217
Query: 61 IIKTDVIAKIAPTWMNVSLKMKEDPETDKAFGWVLEMYAYAIASALHGVRHILRKDFMLQ 120
I+ + + KIAPTWMNVSL MK DP+TDKAFGWVLEMYAYA++SALHGV +IL KDFM+Q
Sbjct: 218 IVTKNALMKIAPTWMNVSLAMKNDPQTDKAFGWVLEMYAYAVSSALHGVSNILHKDFMIQ 277
Query: 121 PPWDLETHNKYIIHYTYGCDYNLKGELTYGKIGEWRFDKRSHXXXXXXXXXXXXXXXXXE 180
PPWD ET +IIHYTYGCD+++KG++ GKIGEWRFDKRS+ E
Sbjct: 278 PPWDTETKKTFIIHYTYGCDFDMKGKMMVGKIGEWRFDKRSYGDKPPPRNLTLPPRGVPE 337
Query: 181 SVVTLVKMVNEATANIPNWDT 201
SVVTLV M+NEATANIPNW++
Sbjct: 338 SVVTLVTMINEATANIPNWES 358
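
The Identities and Positives percentages can be recomputed from the counts on each statistics line. The sketch below uses the three distinct statistics lines from this report and checks that the printed percentages are consistent with truncation rather than rounding (for example, 156/201 is 77.6% but is printed as 77%). The names STATS and check_line are hypothetical helpers for this illustration.

```python
import re

# The three distinct statistics lines from the alignments above.
LINES = [
    "Identities = 156/201 (77%), Positives = 164/201 (81%)",
    "Identities = 139/201 (69%), Positives = 163/201 (81%)",
    "Identities = 129/201 (64%), Positives = 158/201 (78%)",
]

STATS = re.compile(
    r"Identities = (\d+)/(\d+) \((\d+)%\), Positives = (\d+)/(\d+) \((\d+)%\)"
)


def check_line(line):
    """Return True if both printed percentages equal the truncated ratios."""
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = map(
        int, STATS.search(line).groups()
    )
    return (
        ident * 100 // ident_len == ident_pct
        and pos * 100 // pos_len == pos_pct
    )


results = [check_line(line) for line in LINES]
print(results)
```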