Miyakogusa Predicted Gene
- Lj0g3v0124389.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0124389.1 Non Chatacterized Hit- tr|K4ADQ8|K4ADQ8_SETIT
Uncharacterized protein OS=Setaria italica
GN=Si037015,39.39,1e-18,DUF4057,Domain of unknown function DUF4057;
SUBFAMILY NOT NAMED,NULL; FAMILY NOT
NAMED,NULL,NODE_67298_length_1383_cov_53.085323.path3.1
(232 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT4G39860.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 273 1e-73
AT4G39860.2 | Symbols: | unknown protein; BEST Arabidopsis thal... 266 1e-71
AT1G78150.3 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 192 1e-49
AT1G78150.2 | Symbols: | unknown protein; BEST Arabidopsis thal... 192 2e-49
AT1G78150.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 192 2e-49
AT1G35780.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 184 4e-47
AT2G22270.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 168 3e-42
>AT4G39860.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G22270.1); Has 152 Blast hits to 146 proteins
in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 2; Plants - 146; Viruses - 0; Other Eukaryotes -
4 (source: NCBI BLink). | chr4:18499909-18501472 FORWARD
LENGTH=299
Length = 299
Score = 273 bits (697), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 142/232 (61%), Positives = 179/232 (77%), Gaps = 4/232 (1%)
Query: 1 MKEITGSGIFSANAEDSTSEAGSANSNGRTSRRLVQQAVNGISQISFSTEESVSPKKPAT 60
+KE+TGSGIF+ + A ++ +T R QQ +NG+SQISFS + +VSPKKP T
Sbjct: 72 LKEMTGSGIFTDKGK--VGSESDATTDPKTGLRYYQQTLNGMSQISFSADGNVSPKKPTT 129
Query: 61 IPEVAKQRELSGTLQSELDSKSNKLISNAKTKELTGNDIFGPPPEIVPRSVAAARITESK 120
+ EVAKQRELSG L +E D KSNK IS+AK +E++G+DIF PP EI PRS+ AA+ E++
Sbjct: 130 LTEVAKQRELSGNLLTEADLKSNKQISSAKIEEISGHDIFAPPSEIQPRSLVAAQ-QEAR 188
Query: 121 GSKDMGEPLPRNLRTSVKVSNPAGGQSNDLFGEAPVLKTSKKIHDHKLAELTGTNIFQGN 180
G++DMGEP PRNLRTSVKVSNPAGGQSN LF E PV+KTSKKIH+ K ELTG IF+G+
Sbjct: 189 GNRDMGEPAPRNLRTSVKVSNPAGGQSNILFSEEPVVKTSKKIHNQKFQELTGNGIFKGD 248
Query: 181 NPPGSAEKPLSRAKLREMTGSNIFAADAKAETKDPIRGSRQPPGGESSIALL 232
PGSA+K LS AKLREM+G+NIF AD K+E++D G R+PPGGESSI+L+
Sbjct: 249 ESPGSADKQLSSAKLREMSGNNIF-ADGKSESRDYFGGVRKPPGGESSISLV 299
>AT4G39860.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G22270.1); Has 148 Blast hits to 144 proteins
in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 144; Viruses - 0; Other Eukaryotes -
4 (source: NCBI BLink). | chr4:18499909-18501472 FORWARD
LENGTH=298
Length = 298
Score = 266 bits (679), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 141/232 (60%), Positives = 178/232 (76%), Gaps = 5/232 (2%)
Query: 1 MKEITGSGIFSANAEDSTSEAGSANSNGRTSRRLVQQAVNGISQISFSTEESVSPKKPAT 60
+KE+TGSGIF+ + A ++ +T R Q +NG+SQISFS + +VSPKKP T
Sbjct: 72 LKEMTGSGIFTDKGK--VGSESDATTDPKTGLRYYQ-TLNGMSQISFSADGNVSPKKPTT 128
Query: 61 IPEVAKQRELSGTLQSELDSKSNKLISNAKTKELTGNDIFGPPPEIVPRSVAAARITESK 120
+ EVAKQRELSG L +E D KSNK IS+AK +E++G+DIF PP EI PRS+ AA+ E++
Sbjct: 129 LTEVAKQRELSGNLLTEADLKSNKQISSAKIEEISGHDIFAPPSEIQPRSLVAAQ-QEAR 187
Query: 121 GSKDMGEPLPRNLRTSVKVSNPAGGQSNDLFGEAPVLKTSKKIHDHKLAELTGTNIFQGN 180
G++DMGEP PRNLRTSVKVSNPAGGQSN LF E PV+KTSKKIH+ K ELTG IF+G+
Sbjct: 188 GNRDMGEPAPRNLRTSVKVSNPAGGQSNILFSEEPVVKTSKKIHNQKFQELTGNGIFKGD 247
Query: 181 NPPGSAEKPLSRAKLREMTGSNIFAADAKAETKDPIRGSRQPPGGESSIALL 232
PGSA+K LS AKLREM+G+NIF AD K+E++D G R+PPGGESSI+L+
Sbjct: 248 ESPGSADKQLSSAKLREMSGNNIF-ADGKSESRDYFGGVRKPPGGESSISLV 298
>AT1G78150.3 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; EXPRESSED IN: 23 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT1G35780.1). | chr1:29404996-29406341 FORWARD
LENGTH=303
Length = 303
Score = 192 bits (489), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 115/232 (49%), Positives = 151/232 (65%), Gaps = 31/232 (13%)
Query: 1 MKEITGSGIFSANAEDSTSEAGSANSNGRTSRRLVQQAVNGISQISFSTEESVSPKKPAT 60
MKEITGSGIFS N +D SE + QQAVNGISQISF EE++SPKKPAT
Sbjct: 103 MKEITGSGIFSRNEKDDASEP----------LPVYQQAVNGISQISFGEEENLSPKKPAT 152
Query: 61 IPEVAKQRELSGTLQSELDSKSNKLISNAKTKELTGNDIFGPPPEIVPRSVAAARITESK 120
+PEVAKQRELSGT+++E +K K +S+AK KE++G +IF PPPEI PRS R K
Sbjct: 153 VPEVAKQRELSGTMENESANKLQKQLSDAKYKEISGQNIFAPPPEIKPRS-GTNRALALK 211
Query: 121 GSKDMGEPLPRNLRTSVKVSNPAGGQSNDLFGEAPVLKTSKKIHDHKLAELTGTNIFQGN 180
+ ++G A Q+ + E +KT+KKI+D K AEL+G +IF+G+
Sbjct: 212 DNFNLG----------------AESQTAE---EDSSVKTAKKIYDKKFAELSGNDIFKGD 252
Query: 181 NPPGSAEKPLSRAKLREMTGSNIFAADAKAETKDPIRGSRQPPGGESSIALL 232
+ EK LS+AKL+E+ G+NIF AD K E +D + G R+PPGGE+SIAL+
Sbjct: 253 AASSNVEKHLSQAKLKEIGGNNIF-ADGKVEARDYLGGVRKPPGGETSIALV 303
>AT1G78150.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G35780.1); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr1:29404996-29406341 FORWARD LENGTH=274
Length = 274
Score = 192 bits (487), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 115/232 (49%), Positives = 151/232 (65%), Gaps = 31/232 (13%)
Query: 1 MKEITGSGIFSANAEDSTSEAGSANSNGRTSRRLVQQAVNGISQISFSTEESVSPKKPAT 60
MKEITGSGIFS N +D SE + QQAVNGISQISF EE++SPKKPAT
Sbjct: 74 MKEITGSGIFSRNEKDDASEP----------LPVYQQAVNGISQISFGEEENLSPKKPAT 123
Query: 61 IPEVAKQRELSGTLQSELDSKSNKLISNAKTKELTGNDIFGPPPEIVPRSVAAARITESK 120
+PEVAKQRELSGT+++E +K K +S+AK KE++G +IF PPPEI PRS R K
Sbjct: 124 VPEVAKQRELSGTMENESANKLQKQLSDAKYKEISGQNIFAPPPEIKPRS-GTNRALALK 182
Query: 121 GSKDMGEPLPRNLRTSVKVSNPAGGQSNDLFGEAPVLKTSKKIHDHKLAELTGTNIFQGN 180
+ ++G A Q+ + E +KT+KKI+D K AEL+G +IF+G+
Sbjct: 183 DNFNLG----------------AESQTAE---EDSSVKTAKKIYDKKFAELSGNDIFKGD 223
Query: 181 NPPGSAEKPLSRAKLREMTGSNIFAADAKAETKDPIRGSRQPPGGESSIALL 232
+ EK LS+AKL+E+ G+NIF AD K E +D + G R+PPGGE+SIAL+
Sbjct: 224 AASSNVEKHLSQAKLKEIGGNNIF-ADGKVEARDYLGGVRKPPGGETSIALV 274
>AT1G78150.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G35780.1); Has 152 Blast hits to 146 proteins
in 18 species: Archae - 0; Bacteria - 0; Metazoa - 1;
Fungi - 2; Plants - 149; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr1:29404996-29406341 FORWARD
LENGTH=274
Length = 274
Score = 192 bits (487), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 115/232 (49%), Positives = 151/232 (65%), Gaps = 31/232 (13%)
Query: 1 MKEITGSGIFSANAEDSTSEAGSANSNGRTSRRLVQQAVNGISQISFSTEESVSPKKPAT 60
MKEITGSGIFS N +D SE + QQAVNGISQISF EE++SPKKPAT
Sbjct: 74 MKEITGSGIFSRNEKDDASEP----------LPVYQQAVNGISQISFGEEENLSPKKPAT 123
Query: 61 IPEVAKQRELSGTLQSELDSKSNKLISNAKTKELTGNDIFGPPPEIVPRSVAAARITESK 120
+PEVAKQRELSGT+++E +K K +S+AK KE++G +IF PPPEI PRS R K
Sbjct: 124 VPEVAKQRELSGTMENESANKLQKQLSDAKYKEISGQNIFAPPPEIKPRS-GTNRALALK 182
Query: 121 GSKDMGEPLPRNLRTSVKVSNPAGGQSNDLFGEAPVLKTSKKIHDHKLAELTGTNIFQGN 180
+ ++G A Q+ + E +KT+KKI+D K AEL+G +IF+G+
Sbjct: 183 DNFNLG----------------AESQTAE---EDSSVKTAKKIYDKKFAELSGNDIFKGD 223
Query: 181 NPPGSAEKPLSRAKLREMTGSNIFAADAKAETKDPIRGSRQPPGGESSIALL 232
+ EK LS+AKL+E+ G+NIF AD K E +D + G R+PPGGE+SIAL+
Sbjct: 224 AASSNVEKHLSQAKLKEIGGNNIF-ADGKVEARDYLGGVRKPPGGETSIALV 274
>AT1G35780.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G78150.2); Has 145 Blast hits to 144 proteins
in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 145; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr1:13277778-13280113 REVERSE
LENGTH=286
Length = 286
Score = 184 bits (468), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 118/237 (49%), Positives = 151/237 (63%), Gaps = 30/237 (12%)
Query: 1 MKEITGSGIFSANAEDSTSEAGSANS--NGRTSRRLVQQAVNGISQISFSTEESVSPKKP 58
MKEITGSGIFS E+ SE SANS NG+ SR Q +S ISF EE V+PKKP
Sbjct: 75 MKEITGSGIFSVYEENDDSELASANSATNGK-SRTFQQPPAAIMSHISFGEEEIVTPKKP 133
Query: 59 ATIPEVAKQRELSGTLQSELDSKSNKLISNAKTKELTGNDIFGPPPEIVPRSVAAARITE 118
AT+PEVAKQRELSGTL+ + D+K NK S+AK KEL+G++IF PPPEI R R
Sbjct: 134 ATVPEVAKQRELSGTLEYQSDAKLNKQFSDAKCKELSGHNIFAPPPEIKLRPT--VRALA 191
Query: 119 SKGSKDMGEPLPRNLRTSVKVSNPAGGQSNDLFGEAPVLKTSKKIHDHKLAELTGTNIFQ 178
K + D+GE + P G LKT+KKI D K +L+G N+F+
Sbjct: 192 YKDNFDLGE----------SDTKPDGE-----------LKTAKKIADRKFTDLSGNNVFK 230
Query: 179 GN-NPPGS--AEKPLSRAKLREMTGSNIFAADAKAETKDPIRGSRQPPGGESSIALL 232
+ + P S AE+ LS AKL+E++G++IF ADAKA+++D G R+PPGGESSIAL+
Sbjct: 231 SDVSSPSSATAERLLSTAKLKEISGNDIF-ADAKAQSRDYFGGVRKPPGGESSIALV 286
>AT2G22270.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G39860.2); Has 247 Blast hits to 231 proteins
in 42 species: Archae - 0; Bacteria - 17; Metazoa - 14;
Fungi - 5; Plants - 145; Viruses - 0; Other Eukaryotes -
66 (source: NCBI BLink). | chr2:9463765-9465282 FORWARD
LENGTH=328
Length = 328
Score = 168 bits (426), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 114/259 (44%), Positives = 153/259 (59%), Gaps = 30/259 (11%)
Query: 1 MKEITGSGIFSANA-------------EDSTSE---AGSANS----NGR---TSRRLVQQ 37
+KE+TGS IFS + +D S+ +G N+ NG+ + +
Sbjct: 73 LKEMTGSDIFSDDGKYDPNHQTRIHYHQDQLSQISFSGEENATTPMNGKDDPNHQTRIHY 132
Query: 38 AVNGISQISFSTEESVSPKKPATIPEVAKQRELSGTLQSELDSKS-NKLISNAKTKELTG 96
+ SQISFS EE+V+PKKP T+ E AKQ+ELS T++++ DSK K ISN K K ++G
Sbjct: 133 HQDQRSQISFSGEENVTPKKPTTLNEAAKQKELSRTVETQADSKCKKKQISNTKNKAMSG 192
Query: 97 NDIFGPPPEIVPRSVAAARITESKGSKDMGEPLPRNLRTSVKVSNPAGGQSNDLFGEAPV 156
+DIF P R A +E KG+K+ E PR+ R SVK SN G SN LF E V
Sbjct: 193 HDIFASPESQPRRLFGGATQSEVKGNKNTEESAPRSSRASVKTSN--GQSSNRLFSEEHV 250
Query: 157 LKTSKKIHDHK--LAELTGTNIFQGNN-PPGSAEKPLSRAKLREMTGSNIFAADAKAETK 213
+K+SKKIH+ K LT IF+ + PPG +EK S AK REM+G NIF AD K+E +
Sbjct: 251 VKSSKKIHNQKSQFQGLTSNGIFKSDKIPPGYSEKMQSSAKKREMSGHNIF-ADGKSEYR 309
Query: 214 DPIRGSRQPPGGESSIALL 232
D G+R+PPGGESSI+L+
Sbjct: 310 DYYGGARRPPGGESSISLV 328