Miyakogusa Predicted Gene - Lj2g3v1988570.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj2g3v1988570.1 tr|A9TLF2|A9TLF2_PHYPA Predicted protein
OS=Physcomitrella patens subsp. patens
GN=PHYPADRAFT_147386,47.41,2e-18,FAMILY NOT NAMED,NULL;
seg,NULL,CUFF.38268.1
(151 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                     Score    E
Sequences producing significant alignments:                         (bits)  Value
AT5G47020.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...     163  4e-41
AT4G32920.3 | Symbols: | glycine-rich protein | chr4:15888153-1...     102  8e-23
AT4G32920.2 | Symbols: | glycine-rich protein | chr4:15888153-1...     102  8e-23
AT4G32920.1 | Symbols: | glycine-rich protein | chr4:15888153-1...     102  8e-23
AT5G11700.1 | Symbols: | LOCATED IN: vacuole; EXPRESSED IN: 24 ...      91  2e-19
AT5G11700.2 | Symbols: | BEST Arabidopsis thaliana protein matc...      91  2e-19
>AT5G47020.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: 23 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G11700.2); Has
1807 Blast hits to 1807 proteins in 277 species: Archae -
0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants -
385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI
BLink). | chr5:19082005-19089800 FORWARD LENGTH=1421
Length = 1421
Score = 163 bits (413), Expect = 4e-41, Method: Composition-based stats.
Identities = 81/151 (53%), Positives = 99/151 (65%), Gaps = 4/151 (2%)
Query: 1    MLLADLSVTLLMLLQFYWXXXXXXXXXXXXXXXXXXXXXXXGLNALFSKEPRRASLSRVY 60
            +LLADLSVTLL LLQFYW                       GLNAL SKE RRASL+R+Y
Sbjct: 1273 LLLADLSVTLLALLQFYWLALAAFLAILLILPLSLLCPFPAGLNALLSKEMRRASLTRIY 1332
Query: 61   ALWSATSLSNIGVAFICCLVHYAVSHLHPDE-ASTRSVKREDDKCWLLPVILFLFKSVQV 119
             LW+ATSL+N+ VAFIC ++H   S    DE  +  +  R+DDK W+LP  L L KS+Q
Sbjct: 1333 GLWNATSLTNVIVAFICGVIH---SGFFTDELPNIWNAIRDDDKWWVLPTFLLLLKSIQA 1389
Query: 120  RLVNWHIANLEIQDFSLFCPDPDAFWAHESG 150
            R ++WH+ANLE+ DFSL CPDPD FWA+ESG
Sbjct: 1390 RFLDWHVANLEVPDFSLLCPDPDTFWAYESG 1420
>AT4G32920.3 | Symbols: | glycine-rich protein |
chr4:15888153-15896006 REVERSE LENGTH=1432
Length = 1432
Score = 102 bits (255), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 54/112 (48%), Positives = 75/112 (66%), Gaps = 13/112 (11%)
Query: 42   GLNALFSKEPRR-ASLSRVYALWSATSLSNIGVAFICCLVHY-------AVSHLHPDEAS 93
            G++ALFS  PRR AS +RVYALW+ TSL N+ VAF+C  VHY        + +L P
Sbjct: 1324 GVSALFSHGPRRSASRTRVYALWNVTSLVNVVVAFVCGYVHYHGSSSGKKIPYLQP---- 1379
Query: 94   TRSVKREDDKCWLLPVILFLFKSVQVRLVNWHIANLEIQDFSLFCPDPDAFW 145
              ++  ++++ W+ PV LFL K +Q +LVNWH+ANLEIQD+SL+  D + FW
Sbjct: 1380 -WNISMDENEWWIFPVALFLCKVLQSQLVNWHVANLEIQDYSLYSDDSELFW 1430
>AT4G32920.2 | Symbols: | glycine-rich protein |
chr4:15888153-15896006 REVERSE LENGTH=1432
Length = 1432
Score = 102 bits (255), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 54/112 (48%), Positives = 75/112 (66%), Gaps = 13/112 (11%)
Query: 42   GLNALFSKEPRR-ASLSRVYALWSATSLSNIGVAFICCLVHY-------AVSHLHPDEAS 93
            G++ALFS  PRR AS +RVYALW+ TSL N+ VAF+C  VHY        + +L P
Sbjct: 1324 GVSALFSHGPRRSASRTRVYALWNVTSLVNVVVAFVCGYVHYHGSSSGKKIPYLQP---- 1379
Query: 94   TRSVKREDDKCWLLPVILFLFKSVQVRLVNWHIANLEIQDFSLFCPDPDAFW 145
              ++  ++++ W+ PV LFL K +Q +LVNWH+ANLEIQD+SL+  D + FW
Sbjct: 1380 -WNISMDENEWWIFPVALFLCKVLQSQLVNWHVANLEIQDYSLYSDDSELFW 1430
>AT4G32920.1 | Symbols: | glycine-rich protein |
chr4:15888153-15896006 REVERSE LENGTH=1432
Length = 1432
Score = 102 bits (255), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 54/112 (48%), Positives = 75/112 (66%), Gaps = 13/112 (11%)
Query: 42   GLNALFSKEPRR-ASLSRVYALWSATSLSNIGVAFICCLVHY-------AVSHLHPDEAS 93
            G++ALFS  PRR AS +RVYALW+ TSL N+ VAF+C  VHY        + +L P
Sbjct: 1324 GVSALFSHGPRRSASRTRVYALWNVTSLVNVVVAFVCGYVHYHGSSSGKKIPYLQP---- 1379
Query: 94   TRSVKREDDKCWLLPVILFLFKSVQVRLVNWHIANLEIQDFSLFCPDPDAFW 145
              ++  ++++ W+ PV LFL K +Q +LVNWH+ANLEIQD+SL+  D + FW
Sbjct: 1380 -WNISMDENEWWIFPVALFLCKVLQSQLVNWHVANLEIQDYSLYSDDSELFW 1430
>AT5G11700.1 | Symbols: | LOCATED IN: vacuole; EXPRESSED IN: 24 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is: glycine-rich
protein (TAIR:AT4G32920.3); Has 1807 Blast hits to 1807
proteins in 277 species: Archae - 0; Bacteria - 0;
Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0;
Other Eukaryotes - 339 (source: NCBI BLink). |
chr5:3762961-3771123 REVERSE LENGTH=1419
Length = 1419
Score = 91.3 bits (225), Expect = 2e-19, Method: Composition-based stats.
Identities = 48/112 (42%), Positives = 70/112 (62%), Gaps = 13/112 (11%)
Query: 42   GLNALFSKEPRR-ASLSRVYALWSATSLSNIGVAFICCLVHYAVSHLHPDEASTRSVKRE 100
            G+NALFS  PRR A L+RVYALW+  SL N+ VAF+C  VHY     H + ++++ +  +
Sbjct: 1311 GINALFSHGPRRSAGLARVYALWNFMSLVNVFVAFLCGYVHY-----HSESSASKKIPFQ 1365
Query: 101  -------DDKCWLLPVILFLFKSVQVRLVNWHIANLEIQDFSLFCPDPDAFW 145
                   + + W+ P  L + K +Q +L+N H+ANLEIQD SL+  D + FW
Sbjct: 1366 PWNINMGESEWWIFPAGLVVCKIMQSQLINRHVANLEIQDRSLYSKDYELFW 1417
>AT5G11700.2 | Symbols: | BEST Arabidopsis thaliana protein match is:
glycine-rich protein (TAIR:AT4G32920.3); Has 8203 Blast
hits to 3102 proteins in 389 species: Archae - 3;
Bacteria - 5624; Metazoa - 852; Fungi - 139; Plants -
704; Viruses - 77; Other Eukaryotes - 804 (source: NCBI
BLink). | chr5:3762961-3771123 REVERSE LENGTH=1476
Length = 1476
Score = 91.3 bits (225), Expect = 2e-19, Method: Composition-based stats.
Identities = 48/112 (42%), Positives = 70/112 (62%), Gaps = 13/112 (11%)
Query: 42   GLNALFSKEPRR-ASLSRVYALWSATSLSNIGVAFICCLVHYAVSHLHPDEASTRSVKRE 100
            G+NALFS  PRR A L+RVYALW+  SL N+ VAF+C  VHY     H + ++++ +  +
Sbjct: 1368 GINALFSHGPRRSAGLARVYALWNFMSLVNVFVAFLCGYVHY-----HSESSASKKIPFQ 1422
Query: 101  -------DDKCWLLPVILFLFKSVQVRLVNWHIANLEIQDFSLFCPDPDAFW 145
                   + + W+ P  L + K +Q +L+N H+ANLEIQD SL+  D + FW
Sbjct: 1423 PWNINMGESEWWIFPAGLVVCKIMQSQLINRHVANLEIQDRSLYSKDYELFW 1474