Miyakogusa Predicted Gene

Lj4g3v1535080.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj4g3v1535080.1 Non Characterized Hit- tr|K4C8K9|K4C8K9_SOLLC
Uncharacterized protein OS=Solanum lycopersicum GN=Sol,48.12,3e-19,NHL
REPEAT-CONTAINING PROTEIN,NULL; FAMILY NOT NAMED,NULL,CUFF.49357.1
         (144 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G62865.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    85   2e-17
AT5G14890.1 | Symbols:  | NHL domain-containing protein | chr5:4...    84   4e-17
AT3G01430.1 | Symbols:  | BEST Arabidopsis thaliana protein matc...    78   2e-15
AT3G48020.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    77   3e-15
AT5G25240.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    54   4e-08

>AT5G62865.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G48020.1). | chr5:25234064-25234567 FORWARD
           LENGTH=167
          Length = 167

 Score = 84.7 bits (208), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 65/151 (43%), Positives = 78/151 (51%), Gaps = 23/151 (15%)

Query: 3   IELESESTTPTGHQRSNAFCFCFGPRRRS--------SWWQRVRT----SHSTVSGD--R 48
           +EL     T     R +  C CF   RRS        S W R+RT    +HS   GD  R
Sbjct: 1   MELSQSDPTRDPDTRYDQRCCCFPSFRRSRSSTAVGYSSWGRIRTVDDSNHSGDHGDEPR 60

Query: 49  WWSRGIRALKKVREWSEILAGPRWKTFIRRLSHH----RSHKRMTKYQYDPFSYALNFDE 104
           WW   IRA  K+REWSEI+AGPRWKTFIRR +      R      K+QYDP SY+LNFD+
Sbjct: 61  WW---IRASLKIREWSEIVAGPRWKTFIRRFNRDPRRGRDWDASEKFQYDPLSYSLNFDD 117

Query: 105 GQNGD--FPDDGFRNFSTRYAVAAVKPVSSP 133
               D      G R+FSTR+A   V    +P
Sbjct: 118 DDEEDEYVGLGGLRSFSTRFASVPVYSGKAP 148


>AT5G14890.1 | Symbols:  | NHL domain-containing protein |
           chr5:4818056-4821534 FORWARD LENGTH=754
          Length = 754

 Score = 83.6 bits (205), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 52/131 (39%), Positives = 73/131 (55%), Gaps = 24/131 (18%)

Query: 22  CF---CFGPRRRSS-----WWQRVRTSHSTVSGDRWWSRGIRALKKVREWSEILAGPRWK 73
           CF   C G  + S      WWQR+RT       +RWW  G     K+REWSEI+AGP+WK
Sbjct: 611 CFILPCLGSSQPSGPNGSVWWQRIRTVDKLEPDERWWVSG---WMKMREWSEIVAGPKWK 667

Query: 74  TFIRRLSHHR----------SHKRMTKYQYDPFSYALNFDEG-QNGDFPDD-GFRNFSTR 121
           TFIRR   +           +      ++YD +SY+LNFD+G Q G F D+  +R++S R
Sbjct: 668 TFIRRFGRNHCCNGGIDGGCNRPEHVSFRYDSWSYSLNFDDGKQTGHFEDEFPYRDYSMR 727

Query: 122 YAVAAVKPVSS 132
           +A  ++ PVS+
Sbjct: 728 FAAPSL-PVST 737


>AT3G01430.1 | Symbols:  | BEST Arabidopsis thaliana protein match
           is: NHL domain-containing protein (TAIR:AT5G14890.1);
           Has 98 Blast hits to 98 proteins in 12 species: Archae -
           0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 98;
           Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
           | chr3:165595-166137 REVERSE LENGTH=180
          Length = 180

 Score = 78.2 bits (191), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 50/128 (39%), Positives = 71/128 (55%), Gaps = 31/128 (24%)

Query: 31  SSWWQRVRTSHSTVSGDRWWSRGIRALKKVREWSEILAGPRWKTFIRRLSH--------- 81
           S WWQR+ T       +RWW RG R   ++REWSE++AGPRWKT+IRR            
Sbjct: 45  SVWWQRITTVDKLEPDERWWIRGWR---RMREWSELVAGPRWKTYIRRFGRSNCCGGGGG 101

Query: 82  ---------------HRSHKRMTKYQYDPFSYALNFDEG-QNGDFPDD-GFRNFSTRYAV 124
                          +RS  +  K++YD  SY+LNFD+G Q G F D+  +R++S R+A 
Sbjct: 102 RVGNSSGGCGGGAMPNRSSDQ-GKFRYDQLSYSLNFDDGNQTGHFDDEFPYRDYSMRFAA 160

Query: 125 AAVKPVSS 132
            ++ PVS+
Sbjct: 161 PSL-PVST 167


>AT3G48020.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast;
           EXPRESSED IN: 11 plant structures; EXPRESSED DURING:
           LP.04 four leaves visible, 4 anthesis; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G62865.1); Has 82 Blast hits to 82 proteins in
           12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
           - 0; Plants - 82; Viruses - 0; Other Eukaryotes - 0
           (source: NCBI BLink). | chr3:17724593-17725000 FORWARD
           LENGTH=135
          Length = 135

 Score = 77.4 bits (189), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 54/126 (42%), Positives = 69/126 (54%), Gaps = 14/126 (11%)

Query: 24  CFGPRRRSSWWQRV-RTSHSTVSGDRWWSRGIRALKKVREWSEILAGPRWKTFIRRLSHH 82
           C     +SSWWQR+ R +H      RWW   +RA  K+REWSEI+AGPRWKTFIRR +  
Sbjct: 15  CCSTTVKSSWWQRIHRNNHQE---PRWW---VRAFLKIREWSEIVAGPRWKTFIRRFNRD 68

Query: 83  ----RSHKRMTKYQYDPFSYALNFDEGQNGDFPD---DGFRNFSTRYAVAAVKPVSSPEK 135
               +      K++YDP SY L+F++    D  +    G R+FS RYA   V    SP  
Sbjct: 69  PRRGQDWDDSDKFRYDPVSYTLSFEDEDKDDDDEAGVGGVRSFSMRYASVPVASGKSPAV 128

Query: 136 GSDVAV 141
            S  AV
Sbjct: 129 ISVDAV 134


>AT5G25240.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 23 plant
           structures; EXPRESSED DURING: 13 growth stages; Has 1807
           Blast hits to 1807 proteins in 277 species: Archae - 0;
           Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385;
           Viruses - 0; Other Eukaryotes - 339 (source: NCBI
           BLink). | chr5:8746779-8747174 REVERSE LENGTH=131
          Length = 131

 Score = 53.5 bits (127), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 38/115 (33%), Positives = 57/115 (49%), Gaps = 9/115 (7%)

Query: 3   IELESESTTPTGHQRSNAFCFC-------FGPRRRSSWWQRVRTSHS-TVSGDRWWSRGI 54
           +  + E+     ++ + AFC C       F   RR     R R   S  +  +R  + G 
Sbjct: 1   MATDRENLLSDDYEETAAFCGCGYFRSFSFTRWRRGDDESRSRGGWSGCLQEERRGNWGS 60

Query: 55  RALKKVREWSEILAGPRWKTFIRRLSHHRSH-KRMTKYQYDPFSYALNFDEGQNG 108
             LK ++E SE +AGP+WK FIR  S  R   +R   + YD  +Y+LNFD+G +G
Sbjct: 61  EKLKGLKEISEKIAGPKWKNFIRSFSSGRKKMRRDVDFTYDLKNYSLNFDDGGDG 115