Miyakogusa Predicted Gene
- Lj1g3v2095730.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v2095730.1 tr|A9TFK0|A9TFK0_PHYPA Predicted protein
OS=Physcomitrella patens subsp. patens
GN=PHYPADRAFT_144873,36.36,6e-18,seg,NULL; FAMILY NOT
NAMED,NULL,CUFF.28470.1
(313 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G11700.1 | Symbols: | LOCATED IN: vacuole; EXPRESSED IN: 24 ... 286 9e-78
AT5G11700.2 | Symbols: | BEST Arabidopsis thaliana protein matc... 286 9e-78
AT4G32920.3 | Symbols: | glycine-rich protein | chr4:15888153-1... 207 5e-54
AT4G32920.2 | Symbols: | glycine-rich protein | chr4:15888153-1... 207 5e-54
AT4G32920.1 | Symbols: | glycine-rich protein | chr4:15888153-1... 207 5e-54
AT5G47020.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 137 8e-33
>AT5G11700.1 | Symbols: | LOCATED IN: vacuole; EXPRESSED IN: 24
plant structures; EXPRESSED DURING: 13 growth stages;
BEST Arabidopsis thaliana protein match is: glycine-rich
protein (TAIR:AT4G32920.3); Has 1807 Blast hits to 1807
proteins in 277 species: Archae - 0; Bacteria - 0;
Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0;
Other Eukaryotes - 339 (source: NCBI BLink). |
chr5:3762961-3771123 REVERSE LENGTH=1419
Length = 1419
Score = 286 bits (733), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 154/271 (56%), Positives = 178/271 (65%)
Query: 41 FHQDYSXXXXXXXXXXXXSVSCVDDLGGVGTLDTTCKITEDANLTRGVYIAGEGNFNILP 100
FHQDYS SVSC +DLGGVG LDTTCKI D NLT VYIAG+GNF ILP
Sbjct: 50 FHQDYSPPAPPPPPPHGPSVSCSEDLGGVGFLDTTCKIVADLNLTHDVYIAGKGNFIILP 109
Query: 101 GVRFHCEIPGCWITVNVTGNFSLGSNASIVTGAFELESEYAVFENGSVVNTTCMAGDPPP 160
GVRFHC IPGC I +NV+GNFSLG+ ++IV G EL + A F NGS VNTT +AG PPP
Sbjct: 110 GVRFHCPIPGCSIAINVSGNFSLGAESTIVAGTLELTAGNASFANGSAVNTTGLAGSPPP 169
Query: 161 QTSXXXXXXXXXXXXXXXXXXXCLVDTKKLPEDVWGGDAYSWSSLQNPSSFGSRGASTSK 220
QTS CL DTKKLPEDVWGGDAYSWS+LQ P S+GS+G STS+
Sbjct: 170 QTSGTPQGIDGAGGGHGGRGACCLTDTKKLPEDVWGGDAYSWSTLQKPWSYGSKGGSTSR 229
Query: 221 ESEYGGLGGGVVRLTIHKIVEMNASLLAEXXXXXXXXXXXXXXSIYIKAYRMTGNGIISA 280
E +YGG GGG V++ I +++++N SLLA SIYIKAY+MTG G ISA
Sbjct: 230 EIDYGGGGGGKVKMDILQLLDVNGSLLANGGYGGAKGGGGSGGSIYIKAYKMTGIGKISA 289
Query: 281 CXXXXXXXXXXXRVSVDVFSRHDEPKISVHG 311
C RVSVD+FSRHD+PKI VHG
Sbjct: 290 CGGSGYGGGGGGRVSVDIFSRHDDPKIFVHG 320
>AT5G11700.2 | Symbols: | BEST Arabidopsis thaliana protein match
is: glycine-rich protein (TAIR:AT4G32920.3); Has 8203
Blast hits to 3102 proteins in 389 species: Archae - 3;
Bacteria - 5624; Metazoa - 852; Fungi - 139; Plants -
704; Viruses - 77; Other Eukaryotes - 804 (source: NCBI
BLink). | chr5:3762961-3771123 REVERSE LENGTH=1476
Length = 1476
Score = 286 bits (733), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 154/271 (56%), Positives = 178/271 (65%)
Query: 41 FHQDYSXXXXXXXXXXXXSVSCVDDLGGVGTLDTTCKITEDANLTRGVYIAGEGNFNILP 100
FHQDYS SVSC +DLGGVG LDTTCKI D NLT VYIAG+GNF ILP
Sbjct: 50 FHQDYSPPAPPPPPPHGPSVSCSEDLGGVGFLDTTCKIVADLNLTHDVYIAGKGNFIILP 109
Query: 101 GVRFHCEIPGCWITVNVTGNFSLGSNASIVTGAFELESEYAVFENGSVVNTTCMAGDPPP 160
GVRFHC IPGC I +NV+GNFSLG+ ++IV G EL + A F NGS VNTT +AG PPP
Sbjct: 110 GVRFHCPIPGCSIAINVSGNFSLGAESTIVAGTLELTAGNASFANGSAVNTTGLAGSPPP 169
Query: 161 QTSXXXXXXXXXXXXXXXXXXXCLVDTKKLPEDVWGGDAYSWSSLQNPSSFGSRGASTSK 220
QTS CL DTKKLPEDVWGGDAYSWS+LQ P S+GS+G STS+
Sbjct: 170 QTSGTPQGIDGAGGGHGGRGACCLTDTKKLPEDVWGGDAYSWSTLQKPWSYGSKGGSTSR 229
Query: 221 ESEYGGLGGGVVRLTIHKIVEMNASLLAEXXXXXXXXXXXXXXSIYIKAYRMTGNGIISA 280
E +YGG GGG V++ I +++++N SLLA SIYIKAY+MTG G ISA
Sbjct: 230 EIDYGGGGGGKVKMDILQLLDVNGSLLANGGYGGAKGGGGSGGSIYIKAYKMTGIGKISA 289
Query: 281 CXXXXXXXXXXXRVSVDVFSRHDEPKISVHG 311
C RVSVD+FSRHD+PKI VHG
Sbjct: 290 CGGSGYGGGGGGRVSVDIFSRHDDPKIFVHG 320
>AT4G32920.3 | Symbols: | glycine-rich protein |
chr4:15888153-15896006 REVERSE LENGTH=1432
Length = 1432
Score = 207 bits (528), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 114/247 (46%), Positives = 151/247 (61%), Gaps = 1/247 (0%)
Query: 66 LGGVGTLDTTCKITEDANLTRGVYIAGEGNFNILPGVRFHCEIPGCWITVNVTGNFSLGS 125
LGGVG+LD+TCK+ D NLTR + I G+GN ++LPGVR C+ PGC I+VN++GNFSL
Sbjct: 64 LGGVGSLDSTCKLVADLNLTRDLNITGKGNLHVLPGVRLVCQFPGCSISVNISGNFSLAE 123
Query: 126 NASIVTGAFELESEYAVFENGSVVNTTCMAGDPPPQTSXXXXXXXXXXXXXXXXXXXCLV 185
N+S++ G F L +E A F S V+TT +AG+PPP TS CL
Sbjct: 124 NSSVIAGTFRLAAENAEFGLSSAVDTTGLAGEPPPDTSGTPEGVEGAGGGYGGRGACCLS 183
Query: 186 D-TKKLPEDVWGGDAYSWSSLQNPSSFGSRGASTSKESEYGGLGGGVVRLTIHKIVEMNA 244
D T K+PEDV+GGD Y WSSL+ P +GSRG STS E +YGG GGG V + I + +N
Sbjct: 184 DTTTKIPEDVFGGDVYGWSSLEKPEIYGSRGGSTSNEVDYGGGGGGTVAIEILGYISLNG 243
Query: 245 SLLAEXXXXXXXXXXXXXXSIYIKAYRMTGNGIISACXXXXXXXXXXXRVSVDVFSRHDE 304
S+LA+ SI++ A++M GNG +SA RVSVD++SRH +
Sbjct: 244 SVLADGASGGVKGGGGSGGSIFVMAHKMAGNGRLSASGGDGYAGGGGGRVSVDIYSRHSD 303
Query: 305 PKISVHG 311
PKI +G
Sbjct: 304 PKIFFNG 310
>AT4G32920.2 | Symbols: | glycine-rich protein |
chr4:15888153-15896006 REVERSE LENGTH=1432
Length = 1432
Score = 207 bits (528), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 114/247 (46%), Positives = 151/247 (61%), Gaps = 1/247 (0%)
Query: 66 LGGVGTLDTTCKITEDANLTRGVYIAGEGNFNILPGVRFHCEIPGCWITVNVTGNFSLGS 125
LGGVG+LD+TCK+ D NLTR + I G+GN ++LPGVR C+ PGC I+VN++GNFSL
Sbjct: 64 LGGVGSLDSTCKLVADLNLTRDLNITGKGNLHVLPGVRLVCQFPGCSISVNISGNFSLAE 123
Query: 126 NASIVTGAFELESEYAVFENGSVVNTTCMAGDPPPQTSXXXXXXXXXXXXXXXXXXXCLV 185
N+S++ G F L +E A F S V+TT +AG+PPP TS CL
Sbjct: 124 NSSVIAGTFRLAAENAEFGLSSAVDTTGLAGEPPPDTSGTPEGVEGAGGGYGGRGACCLS 183
Query: 186 D-TKKLPEDVWGGDAYSWSSLQNPSSFGSRGASTSKESEYGGLGGGVVRLTIHKIVEMNA 244
D T K+PEDV+GGD Y WSSL+ P +GSRG STS E +YGG GGG V + I + +N
Sbjct: 184 DTTTKIPEDVFGGDVYGWSSLEKPEIYGSRGGSTSNEVDYGGGGGGTVAIEILGYISLNG 243
Query: 245 SLLAEXXXXXXXXXXXXXXSIYIKAYRMTGNGIISACXXXXXXXXXXXRVSVDVFSRHDE 304
S+LA+ SI++ A++M GNG +SA RVSVD++SRH +
Sbjct: 244 SVLADGASGGVKGGGGSGGSIFVMAHKMAGNGRLSASGGDGYAGGGGGRVSVDIYSRHSD 303
Query: 305 PKISVHG 311
PKI +G
Sbjct: 304 PKIFFNG 310
>AT4G32920.1 | Symbols: | glycine-rich protein |
chr4:15888153-15896006 REVERSE LENGTH=1432
Length = 1432
Score = 207 bits (528), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 114/247 (46%), Positives = 151/247 (61%), Gaps = 1/247 (0%)
Query: 66 LGGVGTLDTTCKITEDANLTRGVYIAGEGNFNILPGVRFHCEIPGCWITVNVTGNFSLGS 125
LGGVG+LD+TCK+ D NLTR + I G+GN ++LPGVR C+ PGC I+VN++GNFSL
Sbjct: 64 LGGVGSLDSTCKLVADLNLTRDLNITGKGNLHVLPGVRLVCQFPGCSISVNISGNFSLAE 123
Query: 126 NASIVTGAFELESEYAVFENGSVVNTTCMAGDPPPQTSXXXXXXXXXXXXXXXXXXXCLV 185
N+S++ G F L +E A F S V+TT +AG+PPP TS CL
Sbjct: 124 NSSVIAGTFRLAAENAEFGLSSAVDTTGLAGEPPPDTSGTPEGVEGAGGGYGGRGACCLS 183
Query: 186 D-TKKLPEDVWGGDAYSWSSLQNPSSFGSRGASTSKESEYGGLGGGVVRLTIHKIVEMNA 244
D T K+PEDV+GGD Y WSSL+ P +GSRG STS E +YGG GGG V + I + +N
Sbjct: 184 DTTTKIPEDVFGGDVYGWSSLEKPEIYGSRGGSTSNEVDYGGGGGGTVAIEILGYISLNG 243
Query: 245 SLLAEXXXXXXXXXXXXXXSIYIKAYRMTGNGIISACXXXXXXXXXXXRVSVDVFSRHDE 304
S+LA+ SI++ A++M GNG +SA RVSVD++SRH +
Sbjct: 244 SVLADGASGGVKGGGGSGGSIFVMAHKMAGNGRLSASGGDGYAGGGGGRVSVDIYSRHSD 303
Query: 305 PKISVHG 311
PKI +G
Sbjct: 304 PKIFFNG 310
>AT5G47020.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: 23 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G11700.2);
Has 1807 Blast hits to 1807 proteins in 277 species:
Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347;
Plants - 385; Viruses - 0; Other Eukaryotes - 339
(source: NCBI BLink). | chr5:19082005-19089800 FORWARD
LENGTH=1421
Length = 1421
Score = 137 bits (345), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 94/290 (32%), Positives = 134/290 (46%), Gaps = 6/290 (2%)
Query: 22 SRQCACDDEFSVTDLDWSVFHQDYSXXXXXXXXXXXXSVSCVDDLGGVGTLDTTCKITED 81
S C ++ VT+ + SV + +S SV+C DL GVG+L+TTC + +
Sbjct: 13 STPCFSLSQYGVTEFESSV--RLFSDEASGNSTSSPISVTC-QDLDGVGSLNTTCTLNSN 69
Query: 82 ANLTRGVYIAGEGNFNILPGVRFHCEIPGCWITVNVTGNFSLGSNASIVTGAFELESEYA 141
VY+ G GN NIL V C + GC IT NV+G LG +A IV G+ +
Sbjct: 70 LRFDSDVYVYGTGNLNILAHVLVDCPVEGCMITFNVSGTIHLGQSARIVAGSVVFSAINL 129
Query: 142 VFENGSVVNTTCMAGDPPPQTSXXXXXXXXXXXXXXXXXXXCLVDTKKLPEDVWGGDAYS 201
++ S + TT +AG PP QTS C+ K WGGD Y+
Sbjct: 130 TMDSNSSIYTTALAGPPPSQTSGTPYGIDGAGGGHGGRGASCVKSNKT---TYWGGDVYA 186
Query: 202 WSSLQNPSSFGSRGASTSKESEYGGLGGGVVRLTIHKIVEMNASLLAEXXXXXXXXXXXX 261
WSSL +P S+GS G G GGG V+L + V +N ++ A+
Sbjct: 187 WSSLHDPWSYGSEGGVKLSTKNIRGKGGGRVKLILTDTVHVNGTVSADGGDAGEEGGGGS 246
Query: 262 XXSIYIKAYRMTGNGIISACXXXXXXXXXXXRVSVDVFSRHDEPKISVHG 311
SI I+A ++ G G ISA R+S+D +S ++ K+ VHG
Sbjct: 247 GGSICIRAVKLKGYGKISASGGRGWGGGGGGRISLDCYSIQEDVKVFVHG 296