Miyakogusa Predicted Gene
- Lj0g3v0066399.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0066399.1 Non Characterized Hit- tr|C6TI18|C6TI18_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.55406
PE,82.69,0,seg,NULL; DUF4057,Domain of unknown function DUF4057;
SUBFAMILY NOT NAMED,NULL; FAMILY NOT NAMED,NUL,CUFF.3127.1
(283 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value
AT1G78150.2 | Symbols: | unknown protein; BEST Arabidopsis thal...  342  2e-94
AT1G78150.1 | Symbols: | unknown protein; BEST Arabidopsis thal...  342  2e-94
AT1G35780.1 | Symbols: | unknown protein; BEST Arabidopsis thal...  328  2e-90
AT1G78150.3 | Symbols: | unknown protein; FUNCTIONS IN: molecul...  327  7e-90
AT4G39860.1 | Symbols: | unknown protein; BEST Arabidopsis thal...  297  6e-81
AT4G39860.2 | Symbols: | unknown protein; BEST Arabidopsis thal...  290  6e-79
AT2G22270.1 | Symbols: | unknown protein; BEST Arabidopsis thal...  183  1e-46
>AT1G78150.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G35780.1); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr1:29404996-29406341 FORWARD LENGTH=274
Length = 274
Score = 342 bits (877), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 187/286 (65%), Positives = 216/286 (75%), Gaps = 15/286 (5%)
Query: 1 MERSTPVRKPHTSTADLLTWSETXXXXXXXXXXXXXXXXXX-XDGIRKVVFGGQVTDEEV 59
MERSTPVRKPHTSTADLLTWSE DGI KVVFGGQVTDEEV
Sbjct: 1 MERSTPVRKPHTSTADLLTWSEVPPPDSPSSASRSAVRSHQPSDGISKVVFGGQVTDEEV 60
Query: 60 ESLNKRKPCSESKMKEITGSGIFVANGEDDTAEDGSANPQNKTGIRMYQQAVAGISHISF 119
ESLN+RKPCSE KMKEITGSGIF N +DD +E + +YQQAV GIS ISF
Sbjct: 61 ESLNRRKPCSEHKMKEITGSGIFSRNEKDDASEP----------LPVYQQAVNGISQISF 110
Query: 120 GEEGSLLPKKPSSLPEVAKQRELSGTLESED-HKLNKQLSDAKCKELSGHDIFAPPPEIK 178
GEE +L PKKP+++PEVAKQRELSGT+E+E +KL KQLSDAK KE+SG +IFAPPPEIK
Sbjct: 111 GEEENLSPKKPATVPEVAKQRELSGTMENESANKLQKQLSDAKYKEISGQNIFAPPPEIK 170
Query: 179 PRPITARVLELKGSIDIGELAPSQ-ADGDGTTKTAKKIYKEKFADLSGNDIFRGEVPPSS 237
PR T R L LK + ++G A SQ A+ D + KTAKKIY +KFA+LSGNDIF+G+ S+
Sbjct: 171 PRSGTNRALALKDNFNLG--AESQTAEEDSSVKTAKKIYDKKFAELSGNDIFKGDAASSN 228
Query: 238 TEKSLSGAKLREISGSNIFADGKVESRDFLGGVRKPPGGESSIALV 283
EK LS AKL+EI G+NIFADGKVE+RD+LGGVRKPPGGE+SIALV
Sbjct: 229 VEKHLSQAKLKEIGGNNIFADGKVEARDYLGGVRKPPGGETSIALV 274
>AT1G78150.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G35780.1); Has 152 Blast hits to 146 proteins
in 18 species: Archae - 0; Bacteria - 0; Metazoa - 1;
Fungi - 2; Plants - 149; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr1:29404996-29406341 FORWARD
LENGTH=274
Length = 274
Score = 342 bits (877), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 187/286 (65%), Positives = 216/286 (75%), Gaps = 15/286 (5%)
Query: 1 MERSTPVRKPHTSTADLLTWSETXXXXXXXXXXXXXXXXXX-XDGIRKVVFGGQVTDEEV 59
MERSTPVRKPHTSTADLLTWSE DGI KVVFGGQVTDEEV
Sbjct: 1 MERSTPVRKPHTSTADLLTWSEVPPPDSPSSASRSAVRSHQPSDGISKVVFGGQVTDEEV 60
Query: 60 ESLNKRKPCSESKMKEITGSGIFVANGEDDTAEDGSANPQNKTGIRMYQQAVAGISHISF 119
ESLN+RKPCSE KMKEITGSGIF N +DD +E + +YQQAV GIS ISF
Sbjct: 61 ESLNRRKPCSEHKMKEITGSGIFSRNEKDDASEP----------LPVYQQAVNGISQISF 110
Query: 120 GEEGSLLPKKPSSLPEVAKQRELSGTLESED-HKLNKQLSDAKCKELSGHDIFAPPPEIK 178
GEE +L PKKP+++PEVAKQRELSGT+E+E +KL KQLSDAK KE+SG +IFAPPPEIK
Sbjct: 111 GEEENLSPKKPATVPEVAKQRELSGTMENESANKLQKQLSDAKYKEISGQNIFAPPPEIK 170
Query: 179 PRPITARVLELKGSIDIGELAPSQ-ADGDGTTKTAKKIYKEKFADLSGNDIFRGEVPPSS 237
PR T R L LK + ++G A SQ A+ D + KTAKKIY +KFA+LSGNDIF+G+ S+
Sbjct: 171 PRSGTNRALALKDNFNLG--AESQTAEEDSSVKTAKKIYDKKFAELSGNDIFKGDAASSN 228
Query: 238 TEKSLSGAKLREISGSNIFADGKVESRDFLGGVRKPPGGESSIALV 283
EK LS AKL+EI G+NIFADGKVE+RD+LGGVRKPPGGE+SIALV
Sbjct: 229 VEKHLSQAKLKEIGGNNIFADGKVEARDYLGGVRKPPGGETSIALV 274
>AT1G35780.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G78150.2); Has 145 Blast hits to 144 proteins
in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 145; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr1:13277778-13280113 REVERSE
LENGTH=286
Length = 286
Score = 328 bits (842), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 182/290 (62%), Positives = 206/290 (71%), Gaps = 11/290 (3%)
Query: 1 MERSTPVRKPHTSTADLLTWSETX--XXXXXXXXXXXXXXXXXXDGIRKVVFGGQVTDEE 58
ME++TPVRKPH STADLLTW E DGI KVVFGGQVTDEE
Sbjct: 1 MEKNTPVRKPHMSTADLLTWPENQPFESPAAVSSRSAARSHQPSDGISKVVFGGQVTDEE 60
Query: 59 VESLNKRKPCSESKMKEITGSGIFVANGEDDTAEDGSANPQNKTGIRMYQQAVAGI-SHI 117
VESLNKRKPCS KMKEITGSGIF E+D +E SAN R +QQ A I SHI
Sbjct: 61 VESLNKRKPCSNYKMKEITGSGIFSVYEENDDSELASANSATNGKSRTFQQPPAAIMSHI 120
Query: 118 SFGEEGSLLPKKPSSLPEVAKQRELSGTLESE-DHKLNKQLSDAKCKELSGHDIFAPPPE 176
SFGEE + PKKP+++PEVAKQRELSGTLE + D KLNKQ SDAKCKELSGH+IFAPPPE
Sbjct: 121 SFGEEEIVTPKKPATVPEVAKQRELSGTLEYQSDAKLNKQFSDAKCKELSGHNIFAPPPE 180
Query: 177 IKPRPITARVLELKGSIDIGELAPSQADGDGTTKTAKKIYKEKFADLSGNDIFRGEV--P 234
IK RP T R L K + D+GE S DG KTAKKI KF DLSGN++F+ +V P
Sbjct: 181 IKLRP-TVRALAYKDNFDLGE---SDTKPDGELKTAKKIADRKFTDLSGNNVFKSDVSSP 236
Query: 235 PSST-EKSLSGAKLREISGSNIFADGKVESRDFLGGVRKPPGGESSIALV 283
S+T E+ LS AKL+EISG++IFAD K +SRD+ GGVRKPPGGESSIALV
Sbjct: 237 SSATAERLLSTAKLKEISGNDIFADAKAQSRDYFGGVRKPPGGESSIALV 286
>AT1G78150.3 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; EXPRESSED IN: 23 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT1G35780.1). | chr1:29404996-29406341 FORWARD
LENGTH=303
Length = 303
Score = 327 bits (837), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 187/315 (59%), Positives = 216/315 (68%), Gaps = 44/315 (13%)
Query: 1 MERSTPVRKPHTSTADLLTWSETXXXXXXXXXXXXXXXXXX-XDGIRKVVFGGQVTDEEV 59
MERSTPVRKPHTSTADLLTWSE DGI KVVFGGQVTDEEV
Sbjct: 1 MERSTPVRKPHTSTADLLTWSEVPPPDSPSSASRSAVRSHQPSDGISKVVFGGQVTDEEV 60
Query: 60 ESLNKR-----------------------------KPCSESKMKEITGSGIFVANGEDDT 90
ESLN+R KPCSE KMKEITGSGIF N +DD
Sbjct: 61 ESLNRRILDDAFDSFMRLVIYTNVKTCENVYDVIRKPCSEHKMKEITGSGIFSRNEKDDA 120
Query: 91 AEDGSANPQNKTGIRMYQQAVAGISHISFGEEGSLLPKKPSSLPEVAKQRELSGTLESED 150
+E + +YQQAV GIS ISFGEE +L PKKP+++PEVAKQRELSGT+E+E
Sbjct: 121 SEP----------LPVYQQAVNGISQISFGEEENLSPKKPATVPEVAKQRELSGTMENES 170
Query: 151 -HKLNKQLSDAKCKELSGHDIFAPPPEIKPRPITARVLELKGSIDIGELAPSQ-ADGDGT 208
+KL KQLSDAK KE+SG +IFAPPPEIKPR T R L LK + ++G A SQ A+ D +
Sbjct: 171 ANKLQKQLSDAKYKEISGQNIFAPPPEIKPRSGTNRALALKDNFNLG--AESQTAEEDSS 228
Query: 209 TKTAKKIYKEKFADLSGNDIFRGEVPPSSTEKSLSGAKLREISGSNIFADGKVESRDFLG 268
KTAKKIY +KFA+LSGNDIF+G+ S+ EK LS AKL+EI G+NIFADGKVE+RD+LG
Sbjct: 229 VKTAKKIYDKKFAELSGNDIFKGDAASSNVEKHLSQAKLKEIGGNNIFADGKVEARDYLG 288
Query: 269 GVRKPPGGESSIALV 283
GVRKPPGGE+SIALV
Sbjct: 289 GVRKPPGGETSIALV 303
>AT4G39860.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G22270.1); Has 152 Blast hits to 146 proteins
in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 2; Plants - 146; Viruses - 0; Other Eukaryotes -
4 (source: NCBI BLink). | chr4:18499909-18501472 FORWARD
LENGTH=299
Length = 299
Score = 297 bits (760), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 163/302 (53%), Positives = 204/302 (67%), Gaps = 22/302 (7%)
Query: 1 MERSTPVRKPHTSTADLLTWSETXXXXXXXXXXXXXXXXXXXDGIRKVVFGGQVTDEEVE 60
MER+TPVR PHTSTADLL+WSET DGI K++ GGQ+TDEE +
Sbjct: 1 MERNTPVRNPHTSTADLLSWSETPPPPHHSTPSAARSHQPS-DGISKILGGGQITDEEAQ 59
Query: 61 SLNKRKPCSESKMKEITGSGIFVANGEDDTAEDGSANPQNKTGIRMYQQAVAGISHISFG 120
SLNK K CS K+KE+TGSGIF G+ + D + +P KTG+R YQQ + G+S ISF
Sbjct: 60 SLNKLKNCSGYKLKEMTGSGIFTDKGKVGSESDATTDP--KTGLRYYQQTLNGMSQISFS 117
Query: 121 EEGSLLPKKPSSLPEVAKQRELSGTLESE-DHKLNKQLSDAKCKELSGHDIFAPPPEIKP 179
+G++ PKKP++L EVAKQRELSG L +E D K NKQ+S AK +E+SGHDIFAPP EI+P
Sbjct: 118 ADGNVSPKKPTTLTEVAKQRELSGNLLTEADLKSNKQISSAKIEEISGHDIFAPPSEIQP 177
Query: 180 RPITARVLELKGSIDIGELAPSQ----------ADGDGT--------TKTAKKIYKEKFA 221
R + A E +G+ D+GE AP A G KT+KKI+ +KF
Sbjct: 178 RSLVAAQQEARGNRDMGEPAPRNLRTSVKVSNPAGGQSNILFSEEPVVKTSKKIHNQKFQ 237
Query: 222 DLSGNDIFRGEVPPSSTEKSLSGAKLREISGSNIFADGKVESRDFLGGVRKPPGGESSIA 281
+L+GN IF+G+ P S +K LS AKLRE+SG+NIFADGK ESRD+ GGVRKPPGGESSI+
Sbjct: 238 ELTGNGIFKGDESPGSADKQLSSAKLREMSGNNIFADGKSESRDYFGGVRKPPGGESSIS 297
Query: 282 LV 283
LV
Sbjct: 298 LV 299
>AT4G39860.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G22270.1); Has 148 Blast hits to 144 proteins
in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 144; Viruses - 0; Other Eukaryotes -
4 (source: NCBI BLink). | chr4:18499909-18501472 FORWARD
LENGTH=298
Length = 298
Score = 290 bits (743), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 162/302 (53%), Positives = 203/302 (67%), Gaps = 23/302 (7%)
Query: 1 MERSTPVRKPHTSTADLLTWSETXXXXXXXXXXXXXXXXXXXDGIRKVVFGGQVTDEEVE 60
MER+TPVR PHTSTADLL+WSET DGI K++ GGQ+TDEE +
Sbjct: 1 MERNTPVRNPHTSTADLLSWSETPPPPHHSTPSAARSHQPS-DGISKILGGGQITDEEAQ 59
Query: 61 SLNKRKPCSESKMKEITGSGIFVANGEDDTAEDGSANPQNKTGIRMYQQAVAGISHISFG 120
SLNK K CS K+KE+TGSGIF G+ + D + +P KTG+R YQ + G+S ISF
Sbjct: 60 SLNKLKNCSGYKLKEMTGSGIFTDKGKVGSESDATTDP--KTGLRYYQ-TLNGMSQISFS 116
Query: 121 EEGSLLPKKPSSLPEVAKQRELSGTLESE-DHKLNKQLSDAKCKELSGHDIFAPPPEIKP 179
+G++ PKKP++L EVAKQRELSG L +E D K NKQ+S AK +E+SGHDIFAPP EI+P
Sbjct: 117 ADGNVSPKKPTTLTEVAKQRELSGNLLTEADLKSNKQISSAKIEEISGHDIFAPPSEIQP 176
Query: 180 RPITARVLELKGSIDIGELAPSQ----------ADGDGT--------TKTAKKIYKEKFA 221
R + A E +G+ D+GE AP A G KT+KKI+ +KF
Sbjct: 177 RSLVAAQQEARGNRDMGEPAPRNLRTSVKVSNPAGGQSNILFSEEPVVKTSKKIHNQKFQ 236
Query: 222 DLSGNDIFRGEVPPSSTEKSLSGAKLREISGSNIFADGKVESRDFLGGVRKPPGGESSIA 281
+L+GN IF+G+ P S +K LS AKLRE+SG+NIFADGK ESRD+ GGVRKPPGGESSI+
Sbjct: 237 ELTGNGIFKGDESPGSADKQLSSAKLREMSGNNIFADGKSESRDYFGGVRKPPGGESSIS 296
Query: 282 LV 283
LV
Sbjct: 297 LV 298
>AT2G22270.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G39860.2); Has 247 Blast hits to 231 proteins
in 42 species: Archae - 0; Bacteria - 17; Metazoa - 14;
Fungi - 5; Plants - 145; Viruses - 0; Other Eukaryotes -
66 (source: NCBI BLink). | chr2:9463765-9465282 FORWARD
LENGTH=328
Length = 328
Score = 183 bits (465), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 131/328 (39%), Positives = 176/328 (53%), Gaps = 61/328 (18%)
Query: 10 PHTSTADLLTWSETXXXXXXXXXXXXXXXXXXXDGIRKVVFGG-QVTDEEVESLN----K 64
PH STADLL+WSE DG+ V+ GG Q+T+ E +SLN
Sbjct: 8 PHHSTADLLSWSEIRRPDYSTAANRSNQPS---DGMNDVLGGGGQITNAETKSLNTNVSH 64
Query: 65 RKPCSESKMKEITGSGIFVANGEDD--------------------------TAEDGSANP 98
RK CS K+KE+TGS IF +G+ D T +G +P
Sbjct: 65 RKNCSGHKLKEMTGSDIFSDDGKYDPNHQTRIHYHQDQLSQISFSGEENATTPMNGKDDP 124
Query: 99 QNKTGIRMYQQAVAGISHISFGEEGSLLPKKPSSLPEVAKQRELSGTLESE-DHKLNK-Q 156
++T I +Q S ISF E ++ PKKP++L E AKQ+ELS T+E++ D K K Q
Sbjct: 125 NHQTRIHYHQDQR---SQISFSGEENVTPKKPTTLNEAAKQKELSRTVETQADSKCKKKQ 181
Query: 157 LSDAKCKELSGHDIFAPPPEIKPRPIT--ARVLELKGSIDIGELAPSQADGDGTT----- 209
+S+ K K +SGHDIFA P E +PR + A E+KG+ + E AP + T
Sbjct: 182 ISNTKNKAMSGHDIFASP-ESQPRRLFGGATQSEVKGNKNTEESAPRSSRASVKTSNGQS 240
Query: 210 -----------KTAKKIYKEK--FADLSGNDIFRGE-VPPSSTEKSLSGAKLREISGSNI 255
K++KKI+ +K F L+ N IF+ + +PP +EK S AK RE+SG NI
Sbjct: 241 SNRLFSEEHVVKSSKKIHNQKSQFQGLTSNGIFKSDKIPPGYSEKMQSSAKKREMSGHNI 300
Query: 256 FADGKVESRDFLGGVRKPPGGESSIALV 283
FADGK E RD+ GG R+PPGGESSI+LV
Sbjct: 301 FADGKSEYRDYYGGARRPPGGESSISLV 328
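The per-hit statistics lines above ("Score = ...", "Identities = ...") follow a fixed textual pattern, so they can be read back programmatically. The following is a minimal sketch, not part of the BLAST output itself: the function name parse_hsp_stats and the regular expressions are illustrative assumptions, and the sample text is copied from the AT1G78150.2 hit in this report. It also checks the printed percentages against the counts. In this report they agree with the integer part of the ratio rather than the rounded value (for example 216/286 is about 75.5% and is printed as 75%).

```python
import re

# Statistics lines copied verbatim from the AT1G78150.2 alignment in this report.
STATS = """\
Score = 342 bits (877), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 187/286 (65%), Positives = 216/286 (75%), Gaps = 15/286 (5%)
"""

# Hypothetical patterns written for the line layout seen in this report only.
# Assumption: the E-value has a leading digit (as in "2e-94"); other BLAST
# versions may print forms such as "e-100", which float() would reject.
SCORE_RE = re.compile(r"Score =\s*([\d.]+) bits \((\d+)\),\s*Expect = ([\d.eE+-]+)")
FRACTION_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_hsp_stats(text):
    """Return bit score, raw score, E-value and the count/length/percent triples."""
    score = SCORE_RE.search(text)
    if score is None:
        raise ValueError("no 'Score =' line found")
    stats = {
        "bits": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": float(score.group(3)),
    }
    for name, count, length, percent in FRACTION_RE.findall(text):
        stats[name.lower()] = (int(count), int(length), int(percent))
    return stats


stats = parse_hsp_stats(STATS)
for key in ("identities", "positives", "gaps"):
    count, length, reported = stats[key]
    exact = 100.0 * count / length      # exact percentage
    floored = 100 * count // length     # integer part, which matches this report
    print(f"{key}: {count}/{length} = {exact:.1f}% "
          f"(reported {reported}%, integer part {floored}%)")
```

The same function can be applied to the statistics lines of any other hit in the report by passing the two lines as one string.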