Miyakogusa Predicted Gene

Lj3g3v2666510.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj3g3v2666510.1 Non Characterized Hit- tr|K4BMX8|K4BMX8_SOLLC
Uncharacterized protein OS=Solanum lycopersicum GN=Sol,42.37,7e-19,
,CUFF.44362.1
         (125 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT3G13898.1 | Symbols:  | unknown protein; LOCATED IN: endomembr...    75   1e-14
AT2G30370.1 | Symbols: CHAL, EPFL6 | allergen-related | chr2:129...    53   4e-08
AT5G10310.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    52   1e-07
AT2G30370.2 | Symbols: CHAL, EPFL6 | allergen-related | chr2:129...    52   1e-07
AT4G37810.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    49   1e-06

>AT3G13898.1 | Symbols:  | unknown protein; LOCATED IN: endomembrane
           system; BEST Arabidopsis thaliana protein match is:
           unknown protein (TAIR:AT5G10310.1). |
           chr3:4584011-4584334 FORWARD LENGTH=107
          Length = 107

 Score = 75.1 bits (183), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 37/72 (51%), Positives = 47/72 (65%), Gaps = 3/72 (4%)

Query: 51  SKKGLMGMSEKREEAYDRGMSKIGSSPPGCEHKCYGCVPCEAIQVPSPSS--HLGIQYAN 108
           S++ ++  +E +EE   R   +IGS PP CE KCYGC PCEAIQ P+ SS  HL   YAN
Sbjct: 36  SRRRILNPNENKEEIVKR-RRRIGSKPPSCEKKCYGCEPCEAIQFPTISSIPHLSPHYAN 94

Query: 109 YELESWKCKCGP 120
           Y+ E W+C C P
Sbjct: 95  YQPEGWRCHCPP 106


>AT2G30370.1 | Symbols: CHAL, EPFL6 | allergen-related |
           chr2:12940577-12942167 REVERSE LENGTH=230
          Length = 230

 Score = 53.1 bits (126), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 41/87 (47%), Gaps = 3/87 (3%)

Query: 39  EALESSKPYTIESKKGLMGMSEKREEAYDRGMSKIGSSPPGCEHKCYGCVPCEAIQVPSP 98
           + +E  K   +  K   +G    +E    R +  +GSSPP C  KC  C PC+ + VP P
Sbjct: 147 DRVEEGKSTVVIKKTRKIG-DRSKEAELRRILRGLGSSPPRCSSKCGRCTPCKPVHVPVP 205

Query: 99  SSHLGIQYANYELESWKCKCGPSFYSP 125
                   A Y  E+W+CKCG   Y P
Sbjct: 206 PGT--PVTAEYYPEAWRCKCGNKLYMP 230


>AT5G10310.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: 17 plant structures; EXPRESSED
           DURING: 10 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT3G13898.1);
           Has 1807 Blast hits to 1807 proteins in 277 species:
           Archaea - 0; Bacteria - 0; Metazoa - 736; Fungi - 347;
           Plants - 385; Viruses - 0; Other Eukaryotes - 339
           (source: NCBI BLink). | chr5:3241666-3242127 REVERSE
           LENGTH=122
          Length = 122

 Score = 52.0 bits (123), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 29/85 (34%), Positives = 41/85 (48%), Gaps = 22/85 (25%)

Query: 63  EEAYDRGMSKIGSSPPGCEHKCYGCVPCEAIQVPS--------------------PSSHL 102
           + A     +++GS+PP C ++C  C PC AIQVP+                    PSS  
Sbjct: 38  QVALIEDKARLGSTPPSCHNRCNNCHPCMAIQVPTLPTRSRFTRVNPFSGGFVRPPSSLT 97

Query: 103 GI--QYANYELESWKCKCGPSFYSP 125
            +  QY+NY+   WKC C   FY+P
Sbjct: 98  TVLDQYSNYKPMGWKCHCNGHFYNP 122


>AT2G30370.2 | Symbols: CHAL, EPFL6 | allergen-related |
           chr2:12940577-12942167 REVERSE LENGTH=156
          Length = 156

 Score = 52.0 bits (123), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 29/78 (37%), Positives = 37/78 (47%), Gaps = 2/78 (2%)

Query: 48  TIESKKGLMGMSEKREEAYDRGMSKIGSSPPGCEHKCYGCVPCEAIQVPSPSSHLGIQYA 107
           T+  KK        +E    R +  +GSSPP C  KC  C PC+ + VP P        A
Sbjct: 81  TVVIKKTRKIGDRSKEAELRRILRGLGSSPPRCSSKCGRCTPCKPVHVPVPPGTP--VTA 138

Query: 108 NYELESWKCKCGPSFYSP 125
            Y  E+W+CKCG   Y P
Sbjct: 139 EYYPEAWRCKCGNKLYMP 156


>AT4G37810.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; BEST Arabidopsis thaliana protein match is:
           unknown protein (TAIR:AT5G10310.1); Has 149 Blast hits
           to 149 proteins in 15 species: Archaea - 0; Bacteria - 0;
           Metazoa - 0; Fungi - 0; Plants - 149; Viruses - 0; Other
           Eukaryotes - 0 (source: NCBI BLink). |
           chr4:17780970-17781544 FORWARD LENGTH=128
          Length = 128

 Score = 48.5 bits (114), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 33/81 (40%), Positives = 39/81 (48%), Gaps = 25/81 (30%)

Query: 68  RGMSKIGSSPPGCEH-KCYGCVPCEAIQVPS-PSSHL---------------------GI 104
           RG+  IGS PP CE  +C  C  CEAIQVP+ P + L                     G 
Sbjct: 50  RGL--IGSRPPRCERVRCRSCGHCEAIQVPTNPQTKLHSPLTTSSSSSSETIHLDYTRGD 107

Query: 105 QYANYELESWKCKCGPSFYSP 125
              NY+  SWKCKCG S Y+P
Sbjct: 108 DSTNYKPMSWKCKCGNSIYNP 128