Miyakogusa Predicted Gene

Lj2g3v1988570.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj2g3v1988570.1 tr|A9TLF2|A9TLF2_PHYPA Predicted protein
OS=Physcomitrella patens subsp. patens
GN=PHYPADRAFT_147386,47.41,2e-18,FAMILY NOT NAMED,NULL;
seg,NULL,CUFF.38268.1
         (151 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G47020.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   163   4e-41
AT4G32920.3 | Symbols:  | glycine-rich protein | chr4:15888153-1...   102   8e-23
AT4G32920.2 | Symbols:  | glycine-rich protein | chr4:15888153-1...   102   8e-23
AT4G32920.1 | Symbols:  | glycine-rich protein | chr4:15888153-1...   102   8e-23
AT5G11700.1 | Symbols:  | LOCATED IN: vacuole; EXPRESSED IN: 24 ...    91   2e-19
AT5G11700.2 | Symbols:  | BEST Arabidopsis thaliana protein matc...    91   2e-19

>AT5G47020.1 | Symbols:  | unknown protein; FUNCTIONS IN:
            molecular_function unknown; INVOLVED IN:
            biological_process unknown; LOCATED IN: endomembrane
            system; EXPRESSED IN: 23 plant structures; EXPRESSED
            DURING: 13 growth stages; BEST Arabidopsis thaliana
            protein match is: unknown protein (TAIR:AT5G11700.2); Has
            1807 Blast hits to 1807 proteins in 277 species: Archae -
            0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants -
            385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI
            BLink). | chr5:19082005-19089800 FORWARD LENGTH=1421
          Length = 1421

 Score =  163 bits (413), Expect = 4e-41,   Method: Composition-based stats.
 Identities = 81/151 (53%), Positives = 99/151 (65%), Gaps = 4/151 (2%)

Query: 1    MLLADLSVTLLMLLQFYWXXXXXXXXXXXXXXXXXXXXXXXGLNALFSKEPRRASLSRVY 60
            +LLADLSVTLL LLQFYW                       GLNAL SKE RRASL+R+Y
Sbjct: 1273 LLLADLSVTLLALLQFYWLALAAFLAILLILPLSLLCPFPAGLNALLSKEMRRASLTRIY 1332

Query: 61   ALWSATSLSNIGVAFICCLVHYAVSHLHPDE-ASTRSVKREDDKCWLLPVILFLFKSVQV 119
             LW+ATSL+N+ VAFIC ++H   S    DE  +  +  R+DDK W+LP  L L KS+Q 
Sbjct: 1333 GLWNATSLTNVIVAFICGVIH---SGFFTDELPNIWNAIRDDDKWWVLPTFLLLLKSIQA 1389

Query: 120  RLVNWHIANLEIQDFSLFCPDPDAFWAHESG 150
            R ++WH+ANLE+ DFSL CPDPD FWA+ESG
Sbjct: 1390 RFLDWHVANLEVPDFSLLCPDPDTFWAYESG 1420


>AT4G32920.3 | Symbols:  | glycine-rich protein |
            chr4:15888153-15896006 REVERSE LENGTH=1432
          Length = 1432

 Score =  102 bits (255), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 54/112 (48%), Positives = 75/112 (66%), Gaps = 13/112 (11%)

Query: 42   GLNALFSKEPRR-ASLSRVYALWSATSLSNIGVAFICCLVHY-------AVSHLHPDEAS 93
            G++ALFS  PRR AS +RVYALW+ TSL N+ VAF+C  VHY        + +L P    
Sbjct: 1324 GVSALFSHGPRRSASRTRVYALWNVTSLVNVVVAFVCGYVHYHGSSSGKKIPYLQP---- 1379

Query: 94   TRSVKREDDKCWLLPVILFLFKSVQVRLVNWHIANLEIQDFSLFCPDPDAFW 145
              ++  ++++ W+ PV LFL K +Q +LVNWH+ANLEIQD+SL+  D + FW
Sbjct: 1380 -WNISMDENEWWIFPVALFLCKVLQSQLVNWHVANLEIQDYSLYSDDSELFW 1430


>AT4G32920.2 | Symbols:  | glycine-rich protein |
            chr4:15888153-15896006 REVERSE LENGTH=1432
          Length = 1432

 Score =  102 bits (255), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 54/112 (48%), Positives = 75/112 (66%), Gaps = 13/112 (11%)

Query: 42   GLNALFSKEPRR-ASLSRVYALWSATSLSNIGVAFICCLVHY-------AVSHLHPDEAS 93
            G++ALFS  PRR AS +RVYALW+ TSL N+ VAF+C  VHY        + +L P    
Sbjct: 1324 GVSALFSHGPRRSASRTRVYALWNVTSLVNVVVAFVCGYVHYHGSSSGKKIPYLQP---- 1379

Query: 94   TRSVKREDDKCWLLPVILFLFKSVQVRLVNWHIANLEIQDFSLFCPDPDAFW 145
              ++  ++++ W+ PV LFL K +Q +LVNWH+ANLEIQD+SL+  D + FW
Sbjct: 1380 -WNISMDENEWWIFPVALFLCKVLQSQLVNWHVANLEIQDYSLYSDDSELFW 1430


>AT4G32920.1 | Symbols:  | glycine-rich protein |
            chr4:15888153-15896006 REVERSE LENGTH=1432
          Length = 1432

 Score =  102 bits (255), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 54/112 (48%), Positives = 75/112 (66%), Gaps = 13/112 (11%)

Query: 42   GLNALFSKEPRR-ASLSRVYALWSATSLSNIGVAFICCLVHY-------AVSHLHPDEAS 93
            G++ALFS  PRR AS +RVYALW+ TSL N+ VAF+C  VHY        + +L P    
Sbjct: 1324 GVSALFSHGPRRSASRTRVYALWNVTSLVNVVVAFVCGYVHYHGSSSGKKIPYLQP---- 1379

Query: 94   TRSVKREDDKCWLLPVILFLFKSVQVRLVNWHIANLEIQDFSLFCPDPDAFW 145
              ++  ++++ W+ PV LFL K +Q +LVNWH+ANLEIQD+SL+  D + FW
Sbjct: 1380 -WNISMDENEWWIFPVALFLCKVLQSQLVNWHVANLEIQDYSLYSDDSELFW 1430


>AT5G11700.1 | Symbols:  | LOCATED IN: vacuole; EXPRESSED IN: 24 plant
            structures; EXPRESSED DURING: 13 growth stages; BEST
            Arabidopsis thaliana protein match is: glycine-rich
            protein (TAIR:AT4G32920.3); Has 1807 Blast hits to 1807
            proteins in 277 species: Archae - 0; Bacteria - 0;
            Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0;
            Other Eukaryotes - 339 (source: NCBI BLink). |
            chr5:3762961-3771123 REVERSE LENGTH=1419
          Length = 1419

 Score = 91.3 bits (225), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 48/112 (42%), Positives = 70/112 (62%), Gaps = 13/112 (11%)

Query: 42   GLNALFSKEPRR-ASLSRVYALWSATSLSNIGVAFICCLVHYAVSHLHPDEASTRSVKRE 100
            G+NALFS  PRR A L+RVYALW+  SL N+ VAF+C  VHY     H + ++++ +  +
Sbjct: 1311 GINALFSHGPRRSAGLARVYALWNFMSLVNVFVAFLCGYVHY-----HSESSASKKIPFQ 1365

Query: 101  -------DDKCWLLPVILFLFKSVQVRLVNWHIANLEIQDFSLFCPDPDAFW 145
                   + + W+ P  L + K +Q +L+N H+ANLEIQD SL+  D + FW
Sbjct: 1366 PWNINMGESEWWIFPAGLVVCKIMQSQLINRHVANLEIQDRSLYSKDYELFW 1417


>AT5G11700.2 | Symbols:  | BEST Arabidopsis thaliana protein match is:
            glycine-rich protein (TAIR:AT4G32920.3); Has 8203 Blast
            hits to 3102 proteins in 389 species: Archae - 3;
            Bacteria - 5624; Metazoa - 852; Fungi - 139; Plants -
            704; Viruses - 77; Other Eukaryotes - 804 (source: NCBI
            BLink). | chr5:3762961-3771123 REVERSE LENGTH=1476
          Length = 1476

 Score = 91.3 bits (225), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 48/112 (42%), Positives = 70/112 (62%), Gaps = 13/112 (11%)

Query: 42   GLNALFSKEPRR-ASLSRVYALWSATSLSNIGVAFICCLVHYAVSHLHPDEASTRSVKRE 100
            G+NALFS  PRR A L+RVYALW+  SL N+ VAF+C  VHY     H + ++++ +  +
Sbjct: 1368 GINALFSHGPRRSAGLARVYALWNFMSLVNVFVAFLCGYVHY-----HSESSASKKIPFQ 1422

Query: 101  -------DDKCWLLPVILFLFKSVQVRLVNWHIANLEIQDFSLFCPDPDAFW 145
                   + + W+ P  L + K +Q +L+N H+ANLEIQD SL+  D + FW
Sbjct: 1423 PWNINMGESEWWIFPAGLVVCKIMQSQLINRHVANLEIQDRSLYSKDYELFW 1474