Miyakogusa Predicted Gene

Lj0g3v0124389.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0124389.1 Non Characterized Hit- tr|K4ADQ8|K4ADQ8_SETIT
Uncharacterized protein OS=Setaria italica
GN=Si037015,39.39,1e-18,DUF4057,Domain of unknown function DUF4057;
SUBFAMILY NOT NAMED,NULL; FAMILY NOT
NAMED,NULL,NODE_67298_length_1383_cov_53.085323.path3.1
         (232 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT4G39860.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   273   1e-73
AT4G39860.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...   266   1e-71
AT1G78150.3 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   192   1e-49
AT1G78150.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...   192   2e-49
AT1G78150.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   192   2e-49
AT1G35780.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   184   4e-47
AT2G22270.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   168   3e-42
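
The summary table above can be read programmatically. The following is a minimal sketch, not part of the BLAST distribution: it assumes the legacy pairwise layout shown here, where each hit line ends with a bit score and an E value separated by whitespace. The function name parse_hit_table is hypothetical.

```python
def parse_hit_table(lines):
    """Return (subject_id, bit_score, e_value) tuples from BLAST summary lines.

    Assumes the legacy pairwise layout: subject ID first, then a truncated
    description, then the bit score and E value as the last two columns.
    """
    hits = []
    for line in lines:
        parts = line.rsplit(None, 2)  # split off the two rightmost columns
        if len(parts) != 3:
            continue  # blank lines and short header lines
        description, bits, evalue = parts
        if evalue.startswith("e"):
            evalue = "1" + evalue  # legacy BLAST may print "e-100" for 1e-100
        try:
            hits.append((description.split()[0], float(bits), float(evalue)))
        except ValueError:
            continue  # column header lines such as "(bits) Value"
    return hits


sample = [
    "Sequences producing significant alignments:                      (bits) Value",
    "",
    "AT4G39860.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   273   1e-73",
    "AT2G22270.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   168   3e-42",
]
for subject_id, bits, evalue in parse_hit_table(sample):
    print(subject_id, bits, evalue)
```

Splitting from the right avoids having to parse the free-text description, which may itself contain numbers.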

>AT4G39860.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G22270.1); Has 152 Blast hits to 146 proteins
           in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 2; Plants - 146; Viruses - 0; Other Eukaryotes -
           4 (source: NCBI BLink). | chr4:18499909-18501472 FORWARD
           LENGTH=299
          Length = 299

 Score =  273 bits (697), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 142/232 (61%), Positives = 179/232 (77%), Gaps = 4/232 (1%)

Query: 1   MKEITGSGIFSANAEDSTSEAGSANSNGRTSRRLVQQAVNGISQISFSTEESVSPKKPAT 60
           +KE+TGSGIF+   +        A ++ +T  R  QQ +NG+SQISFS + +VSPKKP T
Sbjct: 72  LKEMTGSGIFTDKGK--VGSESDATTDPKTGLRYYQQTLNGMSQISFSADGNVSPKKPTT 129

Query: 61  IPEVAKQRELSGTLQSELDSKSNKLISNAKTKELTGNDIFGPPPEIVPRSVAAARITESK 120
           + EVAKQRELSG L +E D KSNK IS+AK +E++G+DIF PP EI PRS+ AA+  E++
Sbjct: 130 LTEVAKQRELSGNLLTEADLKSNKQISSAKIEEISGHDIFAPPSEIQPRSLVAAQ-QEAR 188

Query: 121 GSKDMGEPLPRNLRTSVKVSNPAGGQSNDLFGEAPVLKTSKKIHDHKLAELTGTNIFQGN 180
           G++DMGEP PRNLRTSVKVSNPAGGQSN LF E PV+KTSKKIH+ K  ELTG  IF+G+
Sbjct: 189 GNRDMGEPAPRNLRTSVKVSNPAGGQSNILFSEEPVVKTSKKIHNQKFQELTGNGIFKGD 248

Query: 181 NPPGSAEKPLSRAKLREMTGSNIFAADAKAETKDPIRGSRQPPGGESSIALL 232
             PGSA+K LS AKLREM+G+NIF AD K+E++D   G R+PPGGESSI+L+
Sbjct: 249 ESPGSADKQLSSAKLREMSGNNIF-ADGKSESRDYFGGVRKPPGGESSISLV 299
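
As a rough check on the statistics reported for this hit, the E value can be approximated from the bit score with the standard relation E = m * n * 2^(-S'), where S' is the bit score, m the query length, and n the database size in letters. The sketch below uses the raw lengths from this report (232 and 14,482,855). BLAST itself applies effective-length corrections, so the result is expected to agree with the reported 1e-73 only to within about an order of magnitude. The function name approx_evalue is hypothetical.

```python
def approx_evalue(bit_score, query_len, db_letters):
    """Approximate E value from a bit score, ignoring effective-length corrections."""
    return query_len * db_letters * 2.0 ** (-bit_score)


# Values taken from this report: 273 bits, 232-letter query, 14,482,855 database letters.
e = approx_evalue(273, 232, 14_482_855)
print(f"{e:.1e}")
```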


>AT4G39860.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G22270.1); Has 148 Blast hits to 144 proteins
           in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 144; Viruses - 0; Other Eukaryotes -
           4 (source: NCBI BLink). | chr4:18499909-18501472 FORWARD
           LENGTH=298
          Length = 298

 Score =  266 bits (679), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 141/232 (60%), Positives = 178/232 (76%), Gaps = 5/232 (2%)

Query: 1   MKEITGSGIFSANAEDSTSEAGSANSNGRTSRRLVQQAVNGISQISFSTEESVSPKKPAT 60
           +KE+TGSGIF+   +        A ++ +T  R  Q  +NG+SQISFS + +VSPKKP T
Sbjct: 72  LKEMTGSGIFTDKGK--VGSESDATTDPKTGLRYYQ-TLNGMSQISFSADGNVSPKKPTT 128

Query: 61  IPEVAKQRELSGTLQSELDSKSNKLISNAKTKELTGNDIFGPPPEIVPRSVAAARITESK 120
           + EVAKQRELSG L +E D KSNK IS+AK +E++G+DIF PP EI PRS+ AA+  E++
Sbjct: 129 LTEVAKQRELSGNLLTEADLKSNKQISSAKIEEISGHDIFAPPSEIQPRSLVAAQ-QEAR 187

Query: 121 GSKDMGEPLPRNLRTSVKVSNPAGGQSNDLFGEAPVLKTSKKIHDHKLAELTGTNIFQGN 180
           G++DMGEP PRNLRTSVKVSNPAGGQSN LF E PV+KTSKKIH+ K  ELTG  IF+G+
Sbjct: 188 GNRDMGEPAPRNLRTSVKVSNPAGGQSNILFSEEPVVKTSKKIHNQKFQELTGNGIFKGD 247

Query: 181 NPPGSAEKPLSRAKLREMTGSNIFAADAKAETKDPIRGSRQPPGGESSIALL 232
             PGSA+K LS AKLREM+G+NIF AD K+E++D   G R+PPGGESSI+L+
Sbjct: 248 ESPGSADKQLSSAKLREMSGNNIF-ADGKSESRDYFGGVRKPPGGESSISLV 298


>AT1G78150.3 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; EXPRESSED IN: 23 plant
           structures; EXPRESSED DURING: 13 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT1G35780.1). | chr1:29404996-29406341 FORWARD
           LENGTH=303
          Length = 303

 Score =  192 bits (489), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 115/232 (49%), Positives = 151/232 (65%), Gaps = 31/232 (13%)

Query: 1   MKEITGSGIFSANAEDSTSEAGSANSNGRTSRRLVQQAVNGISQISFSTEESVSPKKPAT 60
           MKEITGSGIFS N +D  SE             + QQAVNGISQISF  EE++SPKKPAT
Sbjct: 103 MKEITGSGIFSRNEKDDASEP----------LPVYQQAVNGISQISFGEEENLSPKKPAT 152

Query: 61  IPEVAKQRELSGTLQSELDSKSNKLISNAKTKELTGNDIFGPPPEIVPRSVAAARITESK 120
           +PEVAKQRELSGT+++E  +K  K +S+AK KE++G +IF PPPEI PRS    R    K
Sbjct: 153 VPEVAKQRELSGTMENESANKLQKQLSDAKYKEISGQNIFAPPPEIKPRS-GTNRALALK 211

Query: 121 GSKDMGEPLPRNLRTSVKVSNPAGGQSNDLFGEAPVLKTSKKIHDHKLAELTGTNIFQGN 180
            + ++G                A  Q+ +   E   +KT+KKI+D K AEL+G +IF+G+
Sbjct: 212 DNFNLG----------------AESQTAE---EDSSVKTAKKIYDKKFAELSGNDIFKGD 252

Query: 181 NPPGSAEKPLSRAKLREMTGSNIFAADAKAETKDPIRGSRQPPGGESSIALL 232
               + EK LS+AKL+E+ G+NIF AD K E +D + G R+PPGGE+SIAL+
Sbjct: 253 AASSNVEKHLSQAKLKEIGGNNIF-ADGKVEARDYLGGVRKPPGGETSIALV 303


>AT1G78150.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G35780.1); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr1:29404996-29406341 FORWARD LENGTH=274
          Length = 274

 Score =  192 bits (487), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 115/232 (49%), Positives = 151/232 (65%), Gaps = 31/232 (13%)

Query: 1   MKEITGSGIFSANAEDSTSEAGSANSNGRTSRRLVQQAVNGISQISFSTEESVSPKKPAT 60
           MKEITGSGIFS N +D  SE             + QQAVNGISQISF  EE++SPKKPAT
Sbjct: 74  MKEITGSGIFSRNEKDDASEP----------LPVYQQAVNGISQISFGEEENLSPKKPAT 123

Query: 61  IPEVAKQRELSGTLQSELDSKSNKLISNAKTKELTGNDIFGPPPEIVPRSVAAARITESK 120
           +PEVAKQRELSGT+++E  +K  K +S+AK KE++G +IF PPPEI PRS    R    K
Sbjct: 124 VPEVAKQRELSGTMENESANKLQKQLSDAKYKEISGQNIFAPPPEIKPRS-GTNRALALK 182

Query: 121 GSKDMGEPLPRNLRTSVKVSNPAGGQSNDLFGEAPVLKTSKKIHDHKLAELTGTNIFQGN 180
            + ++G                A  Q+ +   E   +KT+KKI+D K AEL+G +IF+G+
Sbjct: 183 DNFNLG----------------AESQTAE---EDSSVKTAKKIYDKKFAELSGNDIFKGD 223

Query: 181 NPPGSAEKPLSRAKLREMTGSNIFAADAKAETKDPIRGSRQPPGGESSIALL 232
               + EK LS+AKL+E+ G+NIF AD K E +D + G R+PPGGE+SIAL+
Sbjct: 224 AASSNVEKHLSQAKLKEIGGNNIF-ADGKVEARDYLGGVRKPPGGETSIALV 274


>AT1G78150.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G35780.1); Has 152 Blast hits to 146 proteins
           in 18 species: Archae - 0; Bacteria - 0; Metazoa - 1;
           Fungi - 2; Plants - 149; Viruses - 0; Other Eukaryotes -
           0 (source: NCBI BLink). | chr1:29404996-29406341 FORWARD
           LENGTH=274
          Length = 274

 Score =  192 bits (487), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 115/232 (49%), Positives = 151/232 (65%), Gaps = 31/232 (13%)

Query: 1   MKEITGSGIFSANAEDSTSEAGSANSNGRTSRRLVQQAVNGISQISFSTEESVSPKKPAT 60
           MKEITGSGIFS N +D  SE             + QQAVNGISQISF  EE++SPKKPAT
Sbjct: 74  MKEITGSGIFSRNEKDDASEP----------LPVYQQAVNGISQISFGEEENLSPKKPAT 123

Query: 61  IPEVAKQRELSGTLQSELDSKSNKLISNAKTKELTGNDIFGPPPEIVPRSVAAARITESK 120
           +PEVAKQRELSGT+++E  +K  K +S+AK KE++G +IF PPPEI PRS    R    K
Sbjct: 124 VPEVAKQRELSGTMENESANKLQKQLSDAKYKEISGQNIFAPPPEIKPRS-GTNRALALK 182

Query: 121 GSKDMGEPLPRNLRTSVKVSNPAGGQSNDLFGEAPVLKTSKKIHDHKLAELTGTNIFQGN 180
            + ++G                A  Q+ +   E   +KT+KKI+D K AEL+G +IF+G+
Sbjct: 183 DNFNLG----------------AESQTAE---EDSSVKTAKKIYDKKFAELSGNDIFKGD 223

Query: 181 NPPGSAEKPLSRAKLREMTGSNIFAADAKAETKDPIRGSRQPPGGESSIALL 232
               + EK LS+AKL+E+ G+NIF AD K E +D + G R+PPGGE+SIAL+
Sbjct: 224 AASSNVEKHLSQAKLKEIGGNNIF-ADGKVEARDYLGGVRKPPGGETSIALV 274


>AT1G35780.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G78150.2); Has 145 Blast hits to 144 proteins
           in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 145; Viruses - 0; Other Eukaryotes -
           0 (source: NCBI BLink). | chr1:13277778-13280113 REVERSE
           LENGTH=286
          Length = 286

 Score =  184 bits (468), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 118/237 (49%), Positives = 151/237 (63%), Gaps = 30/237 (12%)

Query: 1   MKEITGSGIFSANAEDSTSEAGSANS--NGRTSRRLVQQAVNGISQISFSTEESVSPKKP 58
           MKEITGSGIFS   E+  SE  SANS  NG+ SR   Q     +S ISF  EE V+PKKP
Sbjct: 75  MKEITGSGIFSVYEENDDSELASANSATNGK-SRTFQQPPAAIMSHISFGEEEIVTPKKP 133

Query: 59  ATIPEVAKQRELSGTLQSELDSKSNKLISNAKTKELTGNDIFGPPPEIVPRSVAAARITE 118
           AT+PEVAKQRELSGTL+ + D+K NK  S+AK KEL+G++IF PPPEI  R     R   
Sbjct: 134 ATVPEVAKQRELSGTLEYQSDAKLNKQFSDAKCKELSGHNIFAPPPEIKLRPT--VRALA 191

Query: 119 SKGSKDMGEPLPRNLRTSVKVSNPAGGQSNDLFGEAPVLKTSKKIHDHKLAELTGTNIFQ 178
            K + D+GE            + P G            LKT+KKI D K  +L+G N+F+
Sbjct: 192 YKDNFDLGE----------SDTKPDGE-----------LKTAKKIADRKFTDLSGNNVFK 230

Query: 179 GN-NPPGS--AEKPLSRAKLREMTGSNIFAADAKAETKDPIRGSRQPPGGESSIALL 232
            + + P S  AE+ LS AKL+E++G++IF ADAKA+++D   G R+PPGGESSIAL+
Sbjct: 231 SDVSSPSSATAERLLSTAKLKEISGNDIF-ADAKAQSRDYFGGVRKPPGGESSIALV 286


>AT2G22270.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G39860.2); Has 247 Blast hits to 231 proteins
           in 42 species: Archae - 0; Bacteria - 17; Metazoa - 14;
           Fungi - 5; Plants - 145; Viruses - 0; Other Eukaryotes -
           66 (source: NCBI BLink). | chr2:9463765-9465282 FORWARD
           LENGTH=328
          Length = 328

 Score =  168 bits (426), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 114/259 (44%), Positives = 153/259 (59%), Gaps = 30/259 (11%)

Query: 1   MKEITGSGIFSANA-------------EDSTSE---AGSANS----NGR---TSRRLVQQ 37
           +KE+TGS IFS +              +D  S+   +G  N+    NG+     +  +  
Sbjct: 73  LKEMTGSDIFSDDGKYDPNHQTRIHYHQDQLSQISFSGEENATTPMNGKDDPNHQTRIHY 132

Query: 38  AVNGISQISFSTEESVSPKKPATIPEVAKQRELSGTLQSELDSKS-NKLISNAKTKELTG 96
             +  SQISFS EE+V+PKKP T+ E AKQ+ELS T++++ DSK   K ISN K K ++G
Sbjct: 133 HQDQRSQISFSGEENVTPKKPTTLNEAAKQKELSRTVETQADSKCKKKQISNTKNKAMSG 192

Query: 97  NDIFGPPPEIVPRSVAAARITESKGSKDMGEPLPRNLRTSVKVSNPAGGQSNDLFGEAPV 156
           +DIF  P     R    A  +E KG+K+  E  PR+ R SVK SN  G  SN LF E  V
Sbjct: 193 HDIFASPESQPRRLFGGATQSEVKGNKNTEESAPRSSRASVKTSN--GQSSNRLFSEEHV 250

Query: 157 LKTSKKIHDHK--LAELTGTNIFQGNN-PPGSAEKPLSRAKLREMTGSNIFAADAKAETK 213
           +K+SKKIH+ K     LT   IF+ +  PPG +EK  S AK REM+G NIF AD K+E +
Sbjct: 251 VKSSKKIHNQKSQFQGLTSNGIFKSDKIPPGYSEKMQSSAKKREMSGHNIF-ADGKSEYR 309

Query: 214 DPIRGSRQPPGGESSIALL 232
           D   G+R+PPGGESSI+L+
Sbjct: 310 DYYGGARRPPGGESSISLV 328