Miyakogusa Predicted Gene
- Lj0g3v0154389.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0154389.1 tr|B9P4Y2|B9P4Y2_POPTR Predicted protein
OS=Populus trichocarpa GN=POPTRDRAFT_790273 PE=4
SV=1,36.36,1e-18,seg,NULL,CUFF.9586.1
(374 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                     Score  E
Sequences producing significant alignments:                          (bits) Value
AT1G61900.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   375    e-104
AT1G61900.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   375    e-104
AT1G61900.3 | Symbols: | unknown protein; BEST Arabidopsis thal...   375    e-104
AT2G30700.1 | Symbols: | unknown protein; BEST Arabidopsis thal...   218    7e-57
>AT1G61900.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: anchored to
plasma membrane, plasma membrane, anchored to membrane;
EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT2G30700.1); Has 65 Blast
hits to 65 proteins in 12 species: Archae - 0; Bacteria
- 0; Metazoa - 0; Fungi - 0; Plants - 65; Viruses - 0;
Other Eukaryotes - 0 (source: NCBI BLink). |
chr1:22882508-22884722 REVERSE LENGTH=433
Length = 433
Score = 375 bits (963), Expect = e-104, Method: Compositional matrix adjust.
Identities = 177/292 (60%), Positives = 231/292 (79%), Gaps = 8/292 (2%)
Query: 82 NTSIPKLSGLCTLNFTTAESLLSVTAVDCWEGFAPFLANVICCPQLEATLTILIGQSSKF 141
N+++PKLSGLC+LNF+ +ESL+ T+ +CW FAP LANV+CCPQL+ATLTI++G++SK
Sbjct: 59 NSTMPKLSGLCSLNFSASESLIQTTSHNCWTVFAPLLANVMCCPQLDATLTIILGKASKE 118
Query: 142 TDALALNSTVAKHCLSDVEQILMGQGGTGDLKHVCSVHPLNLTEAACPVKNVNDFNDIVD 201
T LALN T +KHCLSD+EQIL+G+G +G L +CS+H NLT ++CPV NV++F VD
Sbjct: 119 TGLLALNRTQSKHCLSDLEQILVGKGASGQLNKICSIHSSNLTSSSCPVINVDEFESTVD 178
Query: 202 TSKLLTACEKIDPVKECCYQICQNAILEAATAIASKGSDILEMDASHVLPEHSIRVNDCR 261
T+KLL ACEKIDPVKECC + CQNAIL+AAT I+ K AS L ++S R+NDC+
Sbjct: 179 TAKLLLACEKIDPVKECCEEACQNAILDAATNISLK--------ASETLTDNSDRINDCK 230
Query: 262 NIVLRWVASKLDPSHAKKVLRGLSNCNVNKVCPLVLPDTKQVAKGCGHGISNKTACCNAM 321
N+V RW+A+KLDPS K+ LRGL+NC +N+VCPLV P K + C + +SN+T CC AM
Sbjct: 231 NVVNRWLATKLDPSRVKETLRGLANCKINRVCPLVFPHMKHIGGNCSNELSNQTGCCRAM 290
Query: 322 ESYVSHLQKQSFITNLQALDCAETLAMKLKKSNITEDVYSLCHVSLKDFSLQ 373
ESYVSHLQKQ+ ITNLQALDCA +L KL+K NIT++++S+CH+SLKDFSLQ
Sbjct: 291 ESYVSHLQKQTLITNLQALDCATSLGTKLQKLNITKNIFSVCHISLKDFSLQ 342
>AT1G61900.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: anchored to
plasma membrane, plasma membrane, anchored to membrane;
EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT2G30700.1); Has 35333 Blast
hits to 34131 proteins in 2444 species: Archae - 798;
Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
BLink). | chr1:22882561-22884722 REVERSE LENGTH=413
Length = 413
Score = 375 bits (963), Expect = e-104, Method: Compositional matrix adjust.
Identities = 177/292 (60%), Positives = 231/292 (79%), Gaps = 8/292 (2%)
Query: 82 NTSIPKLSGLCTLNFTTAESLLSVTAVDCWEGFAPFLANVICCPQLEATLTILIGQSSKF 141
N+++PKLSGLC+LNF+ +ESL+ T+ +CW FAP LANV+CCPQL+ATLTI++G++SK
Sbjct: 59 NSTMPKLSGLCSLNFSASESLIQTTSHNCWTVFAPLLANVMCCPQLDATLTIILGKASKE 118
Query: 142 TDALALNSTVAKHCLSDVEQILMGQGGTGDLKHVCSVHPLNLTEAACPVKNVNDFNDIVD 201
T LALN T +KHCLSD+EQIL+G+G +G L +CS+H NLT ++CPV NV++F VD
Sbjct: 119 TGLLALNRTQSKHCLSDLEQILVGKGASGQLNKICSIHSSNLTSSSCPVINVDEFESTVD 178
Query: 202 TSKLLTACEKIDPVKECCYQICQNAILEAATAIASKGSDILEMDASHVLPEHSIRVNDCR 261
T+KLL ACEKIDPVKECC + CQNAIL+AAT I+ K AS L ++S R+NDC+
Sbjct: 179 TAKLLLACEKIDPVKECCEEACQNAILDAATNISLK--------ASETLTDNSDRINDCK 230
Query: 262 NIVLRWVASKLDPSHAKKVLRGLSNCNVNKVCPLVLPDTKQVAKGCGHGISNKTACCNAM 321
N+V RW+A+KLDPS K+ LRGL+NC +N+VCPLV P K + C + +SN+T CC AM
Sbjct: 231 NVVNRWLATKLDPSRVKETLRGLANCKINRVCPLVFPHMKHIGGNCSNELSNQTGCCRAM 290
Query: 322 ESYVSHLQKQSFITNLQALDCAETLAMKLKKSNITEDVYSLCHVSLKDFSLQ 373
ESYVSHLQKQ+ ITNLQALDCA +L KL+K NIT++++S+CH+SLKDFSLQ
Sbjct: 291 ESYVSHLQKQTLITNLQALDCATSLGTKLQKLNITKNIFSVCHISLKDFSLQ 342
>AT1G61900.3 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G30700.1). | chr1:22882508-22884722 REVERSE
LENGTH=429
Length = 429
Score = 375 bits (963), Expect = e-104, Method: Compositional matrix adjust.
Identities = 177/292 (60%), Positives = 231/292 (79%), Gaps = 8/292 (2%)
Query: 82 NTSIPKLSGLCTLNFTTAESLLSVTAVDCWEGFAPFLANVICCPQLEATLTILIGQSSKF 141
N+++PKLSGLC+LNF+ +ESL+ T+ +CW FAP LANV+CCPQL+ATLTI++G++SK
Sbjct: 59 NSTMPKLSGLCSLNFSASESLIQTTSHNCWTVFAPLLANVMCCPQLDATLTIILGKASKE 118
Query: 142 TDALALNSTVAKHCLSDVEQILMGQGGTGDLKHVCSVHPLNLTEAACPVKNVNDFNDIVD 201
T LALN T +KHCLSD+EQIL+G+G +G L +CS+H NLT ++CPV NV++F VD
Sbjct: 119 TGLLALNRTQSKHCLSDLEQILVGKGASGQLNKICSIHSSNLTSSSCPVINVDEFESTVD 178
Query: 202 TSKLLTACEKIDPVKECCYQICQNAILEAATAIASKGSDILEMDASHVLPEHSIRVNDCR 261
T+KLL ACEKIDPVKECC + CQNAIL+AAT I+ K AS L ++S R+NDC+
Sbjct: 179 TAKLLLACEKIDPVKECCEEACQNAILDAATNISLK--------ASETLTDNSDRINDCK 230
Query: 262 NIVLRWVASKLDPSHAKKVLRGLSNCNVNKVCPLVLPDTKQVAKGCGHGISNKTACCNAM 321
N+V RW+A+KLDPS K+ LRGL+NC +N+VCPLV P K + C + +SN+T CC AM
Sbjct: 231 NVVNRWLATKLDPSRVKETLRGLANCKINRVCPLVFPHMKHIGGNCSNELSNQTGCCRAM 290
Query: 322 ESYVSHLQKQSFITNLQALDCAETLAMKLKKSNITEDVYSLCHVSLKDFSLQ 373
ESYVSHLQKQ+ ITNLQALDCA +L KL+K NIT++++S+CH+SLKDFSLQ
Sbjct: 291 ESYVSHLQKQTLITNLQALDCATSLGTKLQKLNITKNIFSVCHISLKDFSLQ 342
>AT2G30700.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G61900.1); Has 68 Blast hits to 67 proteins in
13 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi
- 0; Plants - 66; Viruses - 0; Other Eukaryotes - 0
(source: NCBI BLink). | chr2:13082033-13084384 REVERSE
LENGTH=480
Length = 480
Score = 218 bits (554), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 114/294 (38%), Positives = 169/294 (57%), Gaps = 3/294 (1%)
Query: 82 NTSIPKLSGLCTLNFTTAESLLSVTAVDCWEGFAPFLANVICCPQLEATLTILIGQSSKF 141
+T PKL+G C +F S++ A DC + FA + NVICCPQ + L I GQ +
Sbjct: 95 DTYEPKLTGKCPTDFQAISSVIDTAASDCSQPFAALVGNVICCPQFVSLLHIFQGQHNVK 154
Query: 142 TDALALNSTVAKHCLSDVEQILMGQGGTGDLKHVCSVHPLNLTEAACPVKNVNDFNDIVD 201
++ L L VA C SD+ IL+ + + +CSV NLT +CPV +V F +V+
Sbjct: 155 SNKLVLPDAVATDCFSDIVSILVSRRANMTIPALCSVTSSNLTGGSCPVTDVTTFEKVVN 214
Query: 202 TSKLLTACEKIDPVKECCYQICQNAILEAATAIASKGSDILEMDASHVLPEHSIR-VNDC 260
+SKLL AC +DP+KECC ICQ AI+EAA I+ G + D + +++ +NDC
Sbjct: 215 SSKLLDACRTVDPLKECCRPICQPAIMEAALIIS--GHQMTVGDKIPLAGSNNVNAINDC 272
Query: 261 RNIVLRWVASKLDPSHAKKVLRGLSNCNVNKVCPLVLPDTKQVAKGCGHGISNKTACCNA 320
+N+V +++ KL A R LS+C VNK CPL + +V K C + + +CC++
Sbjct: 273 KNVVFSYLSRKLPADKANAAFRILSSCKVNKACPLEFKEPTEVIKACRNVAAPSPSCCSS 332
Query: 321 MESYVSHLQKQSFITNLQALDCAETLAMKLKKSNITEDVYSLCHVSLKDFSLQG 374
+ +Y+S +Q Q ITN QA+ CA + L+K + ++Y LC V LKDFS+Q
Sbjct: 333 LNAYISGIQNQMLITNKQAIVCATVIGSMLRKGGVMTNIYELCDVDLKDFSVQA 386
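
The "Identities / Positives / Gaps" lines in the alignments above report each count over the alignment length, followed by a whole-number percentage. The following is a minimal Python sketch, not part of the BLAST output, for pulling those numbers out of such a line and recomputing the percentages. The names STATS_RE and parse_alignment_stats are hypothetical, and the use of floor division is an assumption inferred from the figures in this report (for example 177/292 is shown as 60%), not a documented statement of how BLAST rounds.

```python
import re

# Matches lines such as:
# "Identities = 177/292 (60%), Positives = 231/292 (79%), Gaps = 8/292 (2%)"
STATS_RE = re.compile(
    r"Identities = (\d+)/(\d+) \((\d+)%\), "
    r"Positives = (\d+)/(\d+) \((\d+)%\), "
    r"Gaps = (\d+)/(\d+) \((\d+)%\)"
)


def parse_alignment_stats(line):
    """Return counts and percentages from one statistics line, or None.

    For each of identities, positives and gaps the result holds the raw
    count, the alignment length, the percentage printed in the report,
    and a percentage recomputed with floor division (an assumption).
    """
    match = STATS_RE.search(line)
    if match is None:
        return None
    values = [int(v) for v in match.groups()]
    stats = {}
    for i, name in enumerate(("identities", "positives", "gaps")):
        count, length, reported = values[3 * i: 3 * i + 3]
        stats[name] = {
            "count": count,
            "length": length,
            "reported_pct": reported,
            "floored_pct": 100 * count // length,
        }
    return stats


if __name__ == "__main__":
    line = "Identities = 177/292 (60%), Positives = 231/292 (79%), Gaps = 8/292 (2%)"
    for name, fields in parse_alignment_stats(line).items():
        print(name, fields)
```

Comparing reported_pct with floored_pct for each hit is a quick consistency check when the report has been copied or reflowed, as this one has.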