Miyakogusa Predicted Gene

Lj1g3v2095730.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v2095730.1 tr|A9TFK0|A9TFK0_PHYPA Predicted protein
OS=Physcomitrella patens subsp. patens
GN=PHYPADRAFT_144873,36.36,6e-18,seg,NULL; FAMILY NOT
NAMED,NULL,CUFF.28470.1
         (313 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G11700.1 | Symbols:  | LOCATED IN: vacuole; EXPRESSED IN: 24 ...   286   9e-78
AT5G11700.2 | Symbols:  | BEST Arabidopsis thaliana protein matc...   286   9e-78
AT4G32920.3 | Symbols:  | glycine-rich protein | chr4:15888153-1...   207   5e-54
AT4G32920.2 | Symbols:  | glycine-rich protein | chr4:15888153-1...   207   5e-54
AT4G32920.1 | Symbols:  | glycine-rich protein | chr4:15888153-1...   207   5e-54
AT5G47020.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   137   8e-33
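
The summary table above can be read programmatically. The following is a minimal sketch, assuming that the last two whitespace-separated fields of each row are the bit score and the E-value, and that the first token is the subject identifier. The helper name parse_hit_line is hypothetical and is not part of BLAST.

```python
def parse_hit_line(line):
    """Split one summary-table row into (subject_id, bit_score, e_value)."""
    # The description may itself contain spaces, so split from the right.
    description, bits, evalue = line.rsplit(None, 2)
    # Legacy BLAST may print very small E-values without a leading digit, e.g. "e-100".
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return description.split()[0], float(bits), float(evalue)


# Three rows copied from the table above.
rows = [
    "AT5G11700.1 | Symbols:  | LOCATED IN: vacuole; EXPRESSED IN: 24 ...   286   9e-78",
    "AT4G32920.3 | Symbols:  | glycine-rich protein | chr4:15888153-1...   207   5e-54",
    "AT5G47020.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   137   8e-33",
]

hits = [parse_hit_line(row) for row in rows]
for subject, bits, evalue in hits:
    print(subject, bits, evalue)
```

Splitting from the right avoids having to interpret the truncated description field, which is cut off with "..." in this table.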

>AT5G11700.1 | Symbols:  | LOCATED IN: vacuole; EXPRESSED IN: 24
           plant structures; EXPRESSED DURING: 13 growth stages;
           BEST Arabidopsis thaliana protein match is: glycine-rich
           protein (TAIR:AT4G32920.3); Has 1807 Blast hits to 1807
           proteins in 277 species: Archae - 0; Bacteria - 0;
           Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0;
           Other Eukaryotes - 339 (source: NCBI BLink). |
           chr5:3762961-3771123 REVERSE LENGTH=1419
          Length = 1419

 Score =  286 bits (733), Expect = 9e-78,   Method: Compositional matrix adjust.
 Identities = 154/271 (56%), Positives = 178/271 (65%)

Query: 41  FHQDYSXXXXXXXXXXXXSVSCVDDLGGVGTLDTTCKITEDANLTRGVYIAGEGNFNILP 100
           FHQDYS            SVSC +DLGGVG LDTTCKI  D NLT  VYIAG+GNF ILP
Sbjct: 50  FHQDYSPPAPPPPPPHGPSVSCSEDLGGVGFLDTTCKIVADLNLTHDVYIAGKGNFIILP 109

Query: 101 GVRFHCEIPGCWITVNVTGNFSLGSNASIVTGAFELESEYAVFENGSVVNTTCMAGDPPP 160
           GVRFHC IPGC I +NV+GNFSLG+ ++IV G  EL +  A F NGS VNTT +AG PPP
Sbjct: 110 GVRFHCPIPGCSIAINVSGNFSLGAESTIVAGTLELTAGNASFANGSAVNTTGLAGSPPP 169

Query: 161 QTSXXXXXXXXXXXXXXXXXXXCLVDTKKLPEDVWGGDAYSWSSLQNPSSFGSRGASTSK 220
           QTS                   CL DTKKLPEDVWGGDAYSWS+LQ P S+GS+G STS+
Sbjct: 170 QTSGTPQGIDGAGGGHGGRGACCLTDTKKLPEDVWGGDAYSWSTLQKPWSYGSKGGSTSR 229

Query: 221 ESEYGGLGGGVVRLTIHKIVEMNASLLAEXXXXXXXXXXXXXXSIYIKAYRMTGNGIISA 280
           E +YGG GGG V++ I +++++N SLLA               SIYIKAY+MTG G ISA
Sbjct: 230 EIDYGGGGGGKVKMDILQLLDVNGSLLANGGYGGAKGGGGSGGSIYIKAYKMTGIGKISA 289

Query: 281 CXXXXXXXXXXXRVSVDVFSRHDEPKISVHG 311
           C           RVSVD+FSRHD+PKI VHG
Sbjct: 290 CGGSGYGGGGGGRVSVDIFSRHDDPKIFVHG 320


>AT5G11700.2 | Symbols:  | BEST Arabidopsis thaliana protein match
           is: glycine-rich protein (TAIR:AT4G32920.3); Has 8203
           Blast hits to 3102 proteins in 389 species: Archae - 3;
           Bacteria - 5624; Metazoa - 852; Fungi - 139; Plants -
           704; Viruses - 77; Other Eukaryotes - 804 (source: NCBI
           BLink). | chr5:3762961-3771123 REVERSE LENGTH=1476
          Length = 1476

 Score =  286 bits (733), Expect = 9e-78,   Method: Compositional matrix adjust.
 Identities = 154/271 (56%), Positives = 178/271 (65%)

Query: 41  FHQDYSXXXXXXXXXXXXSVSCVDDLGGVGTLDTTCKITEDANLTRGVYIAGEGNFNILP 100
           FHQDYS            SVSC +DLGGVG LDTTCKI  D NLT  VYIAG+GNF ILP
Sbjct: 50  FHQDYSPPAPPPPPPHGPSVSCSEDLGGVGFLDTTCKIVADLNLTHDVYIAGKGNFIILP 109

Query: 101 GVRFHCEIPGCWITVNVTGNFSLGSNASIVTGAFELESEYAVFENGSVVNTTCMAGDPPP 160
           GVRFHC IPGC I +NV+GNFSLG+ ++IV G  EL +  A F NGS VNTT +AG PPP
Sbjct: 110 GVRFHCPIPGCSIAINVSGNFSLGAESTIVAGTLELTAGNASFANGSAVNTTGLAGSPPP 169

Query: 161 QTSXXXXXXXXXXXXXXXXXXXCLVDTKKLPEDVWGGDAYSWSSLQNPSSFGSRGASTSK 220
           QTS                   CL DTKKLPEDVWGGDAYSWS+LQ P S+GS+G STS+
Sbjct: 170 QTSGTPQGIDGAGGGHGGRGACCLTDTKKLPEDVWGGDAYSWSTLQKPWSYGSKGGSTSR 229

Query: 221 ESEYGGLGGGVVRLTIHKIVEMNASLLAEXXXXXXXXXXXXXXSIYIKAYRMTGNGIISA 280
           E +YGG GGG V++ I +++++N SLLA               SIYIKAY+MTG G ISA
Sbjct: 230 EIDYGGGGGGKVKMDILQLLDVNGSLLANGGYGGAKGGGGSGGSIYIKAYKMTGIGKISA 289

Query: 281 CXXXXXXXXXXXRVSVDVFSRHDEPKISVHG 311
           C           RVSVD+FSRHD+PKI VHG
Sbjct: 290 CGGSGYGGGGGGRVSVDIFSRHDDPKIFVHG 320


>AT4G32920.3 | Symbols:  | glycine-rich protein |
           chr4:15888153-15896006 REVERSE LENGTH=1432
          Length = 1432

 Score =  207 bits (528), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 114/247 (46%), Positives = 151/247 (61%), Gaps = 1/247 (0%)

Query: 66  LGGVGTLDTTCKITEDANLTRGVYIAGEGNFNILPGVRFHCEIPGCWITVNVTGNFSLGS 125
           LGGVG+LD+TCK+  D NLTR + I G+GN ++LPGVR  C+ PGC I+VN++GNFSL  
Sbjct: 64  LGGVGSLDSTCKLVADLNLTRDLNITGKGNLHVLPGVRLVCQFPGCSISVNISGNFSLAE 123

Query: 126 NASIVTGAFELESEYAVFENGSVVNTTCMAGDPPPQTSXXXXXXXXXXXXXXXXXXXCLV 185
           N+S++ G F L +E A F   S V+TT +AG+PPP TS                   CL 
Sbjct: 124 NSSVIAGTFRLAAENAEFGLSSAVDTTGLAGEPPPDTSGTPEGVEGAGGGYGGRGACCLS 183

Query: 186 D-TKKLPEDVWGGDAYSWSSLQNPSSFGSRGASTSKESEYGGLGGGVVRLTIHKIVEMNA 244
           D T K+PEDV+GGD Y WSSL+ P  +GSRG STS E +YGG GGG V + I   + +N 
Sbjct: 184 DTTTKIPEDVFGGDVYGWSSLEKPEIYGSRGGSTSNEVDYGGGGGGTVAIEILGYISLNG 243

Query: 245 SLLAEXXXXXXXXXXXXXXSIYIKAYRMTGNGIISACXXXXXXXXXXXRVSVDVFSRHDE 304
           S+LA+              SI++ A++M GNG +SA            RVSVD++SRH +
Sbjct: 244 SVLADGASGGVKGGGGSGGSIFVMAHKMAGNGRLSASGGDGYAGGGGGRVSVDIYSRHSD 303

Query: 305 PKISVHG 311
           PKI  +G
Sbjct: 304 PKIFFNG 310


>AT4G32920.2 | Symbols:  | glycine-rich protein |
           chr4:15888153-15896006 REVERSE LENGTH=1432
          Length = 1432

 Score =  207 bits (528), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 114/247 (46%), Positives = 151/247 (61%), Gaps = 1/247 (0%)

Query: 66  LGGVGTLDTTCKITEDANLTRGVYIAGEGNFNILPGVRFHCEIPGCWITVNVTGNFSLGS 125
           LGGVG+LD+TCK+  D NLTR + I G+GN ++LPGVR  C+ PGC I+VN++GNFSL  
Sbjct: 64  LGGVGSLDSTCKLVADLNLTRDLNITGKGNLHVLPGVRLVCQFPGCSISVNISGNFSLAE 123

Query: 126 NASIVTGAFELESEYAVFENGSVVNTTCMAGDPPPQTSXXXXXXXXXXXXXXXXXXXCLV 185
           N+S++ G F L +E A F   S V+TT +AG+PPP TS                   CL 
Sbjct: 124 NSSVIAGTFRLAAENAEFGLSSAVDTTGLAGEPPPDTSGTPEGVEGAGGGYGGRGACCLS 183

Query: 186 D-TKKLPEDVWGGDAYSWSSLQNPSSFGSRGASTSKESEYGGLGGGVVRLTIHKIVEMNA 244
           D T K+PEDV+GGD Y WSSL+ P  +GSRG STS E +YGG GGG V + I   + +N 
Sbjct: 184 DTTTKIPEDVFGGDVYGWSSLEKPEIYGSRGGSTSNEVDYGGGGGGTVAIEILGYISLNG 243

Query: 245 SLLAEXXXXXXXXXXXXXXSIYIKAYRMTGNGIISACXXXXXXXXXXXRVSVDVFSRHDE 304
           S+LA+              SI++ A++M GNG +SA            RVSVD++SRH +
Sbjct: 244 SVLADGASGGVKGGGGSGGSIFVMAHKMAGNGRLSASGGDGYAGGGGGRVSVDIYSRHSD 303

Query: 305 PKISVHG 311
           PKI  +G
Sbjct: 304 PKIFFNG 310


>AT4G32920.1 | Symbols:  | glycine-rich protein |
           chr4:15888153-15896006 REVERSE LENGTH=1432
          Length = 1432

 Score =  207 bits (528), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 114/247 (46%), Positives = 151/247 (61%), Gaps = 1/247 (0%)

Query: 66  LGGVGTLDTTCKITEDANLTRGVYIAGEGNFNILPGVRFHCEIPGCWITVNVTGNFSLGS 125
           LGGVG+LD+TCK+  D NLTR + I G+GN ++LPGVR  C+ PGC I+VN++GNFSL  
Sbjct: 64  LGGVGSLDSTCKLVADLNLTRDLNITGKGNLHVLPGVRLVCQFPGCSISVNISGNFSLAE 123

Query: 126 NASIVTGAFELESEYAVFENGSVVNTTCMAGDPPPQTSXXXXXXXXXXXXXXXXXXXCLV 185
           N+S++ G F L +E A F   S V+TT +AG+PPP TS                   CL 
Sbjct: 124 NSSVIAGTFRLAAENAEFGLSSAVDTTGLAGEPPPDTSGTPEGVEGAGGGYGGRGACCLS 183

Query: 186 D-TKKLPEDVWGGDAYSWSSLQNPSSFGSRGASTSKESEYGGLGGGVVRLTIHKIVEMNA 244
           D T K+PEDV+GGD Y WSSL+ P  +GSRG STS E +YGG GGG V + I   + +N 
Sbjct: 184 DTTTKIPEDVFGGDVYGWSSLEKPEIYGSRGGSTSNEVDYGGGGGGTVAIEILGYISLNG 243

Query: 245 SLLAEXXXXXXXXXXXXXXSIYIKAYRMTGNGIISACXXXXXXXXXXXRVSVDVFSRHDE 304
           S+LA+              SI++ A++M GNG +SA            RVSVD++SRH +
Sbjct: 244 SVLADGASGGVKGGGGSGGSIFVMAHKMAGNGRLSASGGDGYAGGGGGRVSVDIYSRHSD 303

Query: 305 PKISVHG 311
           PKI  +G
Sbjct: 304 PKIFFNG 310


>AT5G47020.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: 23 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G11700.2);
           Has 1807 Blast hits to 1807 proteins in 277 species:
           Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347;
           Plants - 385; Viruses - 0; Other Eukaryotes - 339
           (source: NCBI BLink). | chr5:19082005-19089800 FORWARD
           LENGTH=1421
          Length = 1421

 Score =  137 bits (345), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 94/290 (32%), Positives = 134/290 (46%), Gaps = 6/290 (2%)

Query: 22  SRQCACDDEFSVTDLDWSVFHQDYSXXXXXXXXXXXXSVSCVDDLGGVGTLDTTCKITED 81
           S  C    ++ VT+ + SV  + +S            SV+C  DL GVG+L+TTC +  +
Sbjct: 13  STPCFSLSQYGVTEFESSV--RLFSDEASGNSTSSPISVTC-QDLDGVGSLNTTCTLNSN 69

Query: 82  ANLTRGVYIAGEGNFNILPGVRFHCEIPGCWITVNVTGNFSLGSNASIVTGAFELESEYA 141
                 VY+ G GN NIL  V   C + GC IT NV+G   LG +A IV G+    +   
Sbjct: 70  LRFDSDVYVYGTGNLNILAHVLVDCPVEGCMITFNVSGTIHLGQSARIVAGSVVFSAINL 129

Query: 142 VFENGSVVNTTCMAGDPPPQTSXXXXXXXXXXXXXXXXXXXCLVDTKKLPEDVWGGDAYS 201
             ++ S + TT +AG PP QTS                   C+   K      WGGD Y+
Sbjct: 130 TMDSNSSIYTTALAGPPPSQTSGTPYGIDGAGGGHGGRGASCVKSNKT---TYWGGDVYA 186

Query: 202 WSSLQNPSSFGSRGASTSKESEYGGLGGGVVRLTIHKIVEMNASLLAEXXXXXXXXXXXX 261
           WSSL +P S+GS G          G GGG V+L +   V +N ++ A+            
Sbjct: 187 WSSLHDPWSYGSEGGVKLSTKNIRGKGGGRVKLILTDTVHVNGTVSADGGDAGEEGGGGS 246

Query: 262 XXSIYIKAYRMTGNGIISACXXXXXXXXXXXRVSVDVFSRHDEPKISVHG 311
             SI I+A ++ G G ISA            R+S+D +S  ++ K+ VHG
Sbjct: 247 GGSICIRAVKLKGYGKISASGGRGWGGGGGGRISLDCYSIQEDVKVFVHG 296
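

The Identities, Positives and Gaps lines in the alignments above give a count, the aligned length and a whole-number percentage. The sketch below, which assumes the line layout seen in this report, parses those lines and checks that each reported percentage equals the count divided by the length with the fraction dropped. That truncation behaviour is an observation about the values in this report, not a documented guarantee. The names STAT and parse_stats are hypothetical.

```python
import re

# Matches e.g. "Identities = 154/271 (56%)"
STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Return {label: (count, aligned_length, reported_percent)} for one statistics line."""
    return {
        m.group(1): (int(m.group(2)), int(m.group(3)), int(m.group(4)))
        for m in STAT.finditer(line)
    }


# Statistics lines copied from the three distinct alignments in this report.
lines = [
    "Identities = 154/271 (56%), Positives = 178/271 (65%)",
    "Identities = 114/247 (46%), Positives = 151/247 (61%), Gaps = 1/247 (0%)",
    "Identities = 94/290 (32%), Positives = 134/290 (46%), Gaps = 6/290 (2%)",
]

for line in lines:
    for label, (count, length, reported) in parse_stats(line).items():
        # Integer division drops the fraction; this matches every value listed above.
        assert count * 100 // length == reported
        print(f"{label}: {count}/{length} = {100 * count / length:.1f}% (reported {reported}%)")
```

For example, 154/271 is about 56.8%, which the report shows as 56%.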