Miyakogusa Predicted Gene
- Lj4g3v1313810.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v1313810.1 Non Characterized Hit- tr|I1L3L2|I1L3L2_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.22811
PE,32.88,9e-17,DUF1442,Protein of unknown function DUF1442; no
description,NULL; seg,NULL,CUFF.48794.1
(216 letters)
Database: Medicago_aa4.0v1
62,319 sequences; 21,947,249 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Medtr4g102220.1 | DUF1442 family protein | HC | chr4:42327158-42... 219 2e-57
Medtr5g011800.1 | DUF1442 family protein | HC | chr5:3452852-345... 218 3e-57
Medtr5g011820.1 | DUF1442 family protein | HC | chr5:3462938-346... 206 1e-53
Medtr4g024370.1 | DUF1442 family protein | HC | chr4:8245188-824... 186 1e-47
Medtr4g024370.2 | DUF1442 family protein | HC | chr4:8249190-824... 186 1e-47
Medtr7g117380.1 | DUF1442 family protein | HC | chr7:48497377-48... 69 3e-12
Medtr6g060440.1 | DUF1442 family protein | HC | chr6:20762736-20... 66 3e-11
Medtr6g060390.1 | DUF1442 family protein | HC | chr6:20750761-20... 64 9e-11
Medtr1g076800.1 | DUF1442 family protein | HC | chr1:34282105-34... 63 3e-10
Medtr4g023950.1 | DUF1442 family protein | HC | chr4:8171319-817... 54 1e-07
>Medtr4g102220.1 | DUF1442 family protein | HC |
chr4:42327158-42326288 | 20130731
Length = 225
Score = 219 bits (558), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 129/226 (57%), Positives = 148/226 (65%), Gaps = 11/226 (4%)
Query: 1 MKLVWSPEAASKAYIDTVQSCP---ESGVAELVSAMAAGWKAQLIVETWSEGGPIATSRG 57
MKLVWSPE ASKAYIDTVQSC SG+AEL+SAMAAGW A++IVETWSEGG I TS G
Sbjct: 1 MKLVWSPETASKAYIDTVQSCKVLRGSGMAELISAMAAGWNAKMIVETWSEGGVIETSLG 60
Query: 58 LGIARGHTSGRHVCVVPDERSKVEYCDRMGEAGVCTXXXX-XXXXXXXXXXXXIDFMVVD 116
L IAR HT+GRHVC+VP+E SK+EY RMGE G T IDFMVVD
Sbjct: 61 LSIARKHTNGRHVCIVPNEASKLEYSKRMGEQGTSTEIIVGEAEEVMKDFIEEIDFMVVD 120
Query: 117 CKG-REFSKILRVAKLSSMGAVLVCKNANSRNVTGFKWXXXXXXX--XXXXXLVHSVFLP 173
C+G ++ K+L+VAKLS GAVL+CKN N R+ FKW +V SVFLP
Sbjct: 121 CEGIKDLMKVLKVAKLSVKGAVLICKNVNFRS-GNFKWENIVVEEGGSRSRRVVRSVFLP 179
Query: 174 VGKGLDIAHXXXXXXXXXXXXI---KGKRWIKHVDRQSGDVHVIRR 216
VGKGLDIAH KRWIKHVD++SG+VHVIRR
Sbjct: 180 VGKGLDIAHVSAVGGNLGKDGHGRGGSKRWIKHVDQRSGEVHVIRR 225
>Medtr5g011800.1 | DUF1442 family protein | HC |
chr5:3452852-3451834 | 20130731
Length = 229
Score = 218 bits (555), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 120/231 (51%), Positives = 146/231 (63%), Gaps = 17/231 (7%)
Query: 1 MKLVWSPEAASKAYIDTVQS-------CPESGVAELVSAMAAGWKAQLIVETWSEGGPIA 53
MKLVWSPE A+ +YIDTVQ+ ESG AELVS+MAAGW AQLIVETWS GG I
Sbjct: 1 MKLVWSPERATNSYIDTVQAITTINHLSSESGAAELVSSMAAGWNAQLIVETWSHGGVIP 60
Query: 54 TSRGLGIARGHTSGRHVCVVPDERSKVEYCDRMGEAGVCTXXXXXXXXXXXXXXXXIDFM 113
TS GL IA GHT GRHVC+VPDE+S+ EY MGEAG+ IDF+
Sbjct: 61 TSVGLSIASGHTGGRHVCIVPDEQSRSEYAKNMGEAGMSPEIIVGEPEEVMDGLVGIDFL 120
Query: 114 VVDCKGREFSKILRVAKLSSMGAVLVCKNAN--SRNVTGFKWXXXXXXXXXXXXLVHSVF 171
VVD + ++F+++LR+AKLS GAVL+CKNAN S+ +G+ W LV SVF
Sbjct: 121 VVDSRRKDFTRVLRLAKLSGKGAVLICKNANFISKMDSGYIW--RSVVARGSRRLVRSVF 178
Query: 172 LPVGKGLDIAHXXXXXXXXXXXXIKGK------RWIKHVDRQSGDVHVIRR 216
LPVGKG+ +AH +K K RWIKHVD++SGDVH IR+
Sbjct: 179 LPVGKGIHMAHFSAAGGDNSVAAMKHKGRAVHNRWIKHVDQRSGDVHFIRK 229
>Medtr5g011820.1 | DUF1442 family protein | HC |
chr5:3462938-3461949 | 20130731
Length = 232
Score = 206 bits (525), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 116/231 (50%), Positives = 141/231 (61%), Gaps = 17/231 (7%)
Query: 1 MKLVWSPEAASKAYIDTVQSC------PESGVAELVSAMAAGWKAQLIVETWSEGGPIAT 54
MKLVWSPE A+ +YIDTVQ+ ESGVAE VSAMAAGW AQLIVETWS GG I T
Sbjct: 4 MKLVWSPERATNSYIDTVQAVTTNHLVSESGVAEFVSAMAAGWNAQLIVETWSCGGVIPT 63
Query: 55 SRGLGIARGHTSGRHVCVVPDERSKVEYCDRMGEAGVCTXXXXXXXXXXXXXXXXIDFMV 114
S GL IA GH GRHVC+VPDE S+ EY M EAG+ IDF+V
Sbjct: 64 SVGLSIASGHNGGRHVCIVPDELSRSEYAKNMLEAGMSPEVLVGEPEEVMDGLIGIDFLV 123
Query: 115 VDCKGREFSKILRVAKLSSMGAVLVCKNAN--SRNVTGFKWXXXXXXXXXXXXLVHSVFL 172
VD + ++F+++LR+AKLS G+VL+CKNAN S+ +G+ W LV SVFL
Sbjct: 124 VDSRRKDFTRVLRLAKLSGKGSVLICKNANFISKMDSGYMW--RSVVARGSRRLVKSVFL 181
Query: 173 PVGKGLDIAHXXXX-------XXXXXXXXIKGKRWIKHVDRQSGDVHVIRR 216
PVGKG+ +AH + RWIKHVD++SGDVH IR+
Sbjct: 182 PVGKGIHMAHLSAAGGGEYSVAAVKHKARVIHNRWIKHVDQRSGDVHFIRK 232
>Medtr4g024370.1 | DUF1442 family protein | HC |
chr4:8245188-8243460 | 20130731
Length = 232
Score = 186 bits (473), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 109/234 (46%), Positives = 139/234 (59%), Gaps = 22/234 (9%)
Query: 1 MKLVWSPEAASKAYIDTVQSCP------------ESGVAELVSAMAAGWKAQLIVETWSE 48
MKLVWSPE A KAYIDTV+S ESGVAEL+S+MAAGW A+ IVE +S
Sbjct: 1 MKLVWSPETALKAYIDTVKSVSTVQPQQQCEKFKESGVAELLSSMAAGWNAKFIVECYSH 60
Query: 49 GGPIATSRGLGIARGHTSGRHVCVVPDERSKVEYCDRMGEAGVCTXXXXX--XXXXXXXX 106
GGPIA S GL +A +T RHVC+VPDE S+++Y + E GV
Sbjct: 61 GGPIAASVGLAVAARNTGARHVCIVPDEGSRLQYTKALAEMGVTPPPEIVHGEAQTVIKS 120
Query: 107 XXXIDFMVVDCKGREFSKILRVAKLSSMGAVLVCKNANSR-NVTGFKWXXXXXXXXXXXX 165
+DF+VVDC+ R+F+++L+VAK+S+ GAVL CKNA R NV+ FKW
Sbjct: 121 LDGLDFLVVDCRLRDFARVLKVAKVSTRGAVLACKNAWQRSNVSWFKW---NMVLERGTR 177
Query: 166 LVHSVFLPVGKGLDIAHXXXXXXXXXXXXIKGK----RWIKHVDRQSGDVHVIR 215
+V SVFLPVGKGLDIA+ K RWIK +D++SG+ H+ R
Sbjct: 178 VVRSVFLPVGKGLDIAYIGSRIGGGAASSSASKSTPSRWIKLIDQKSGEEHLYR 231
>Medtr4g024370.2 | DUF1442 family protein | HC |
chr4:8249190-8243459 | 20130731
Length = 232
Score = 186 bits (473), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 109/234 (46%), Positives = 139/234 (59%), Gaps = 22/234 (9%)
Query: 1 MKLVWSPEAASKAYIDTVQSCP------------ESGVAELVSAMAAGWKAQLIVETWSE 48
MKLVWSPE A KAYIDTV+S ESGVAEL+S+MAAGW A+ IVE +S
Sbjct: 1 MKLVWSPETALKAYIDTVKSVSTVQPQQQCEKFKESGVAELLSSMAAGWNAKFIVECYSH 60
Query: 49 GGPIATSRGLGIARGHTSGRHVCVVPDERSKVEYCDRMGEAGVCTXXXXX--XXXXXXXX 106
GGPIA S GL +A +T RHVC+VPDE S+++Y + E GV
Sbjct: 61 GGPIAASVGLAVAARNTGARHVCIVPDEGSRLQYTKALAEMGVTPPPEIVHGEAQTVIKS 120
Query: 107 XXXIDFMVVDCKGREFSKILRVAKLSSMGAVLVCKNANSR-NVTGFKWXXXXXXXXXXXX 165
+DF+VVDC+ R+F+++L+VAK+S+ GAVL CKNA R NV+ FKW
Sbjct: 121 LDGLDFLVVDCRLRDFARVLKVAKVSTRGAVLACKNAWQRSNVSWFKW---NMVLERGTR 177
Query: 166 LVHSVFLPVGKGLDIAHXXXXXXXXXXXXIKGK----RWIKHVDRQSGDVHVIR 215
+V SVFLPVGKGLDIA+ K RWIK +D++SG+ H+ R
Sbjct: 178 VVRSVFLPVGKGLDIAYIGSRIGGGAASSSASKSTPSRWIKLIDQKSGEEHLYR 231
>Medtr7g117380.1 | DUF1442 family protein | HC |
chr7:48497377-48495659 | 20130731
Length = 228
Score = 68.9 bits (167), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 61/231 (26%), Positives = 99/231 (42%), Gaps = 32/231 (13%)
Query: 5 WSPEAASKAYIDTVQSC--------PESGVAELVSAMAAGWKAQLIVETWSEGGPIATSR 56
WS A++AY+DT++ C E G E VSA+AAG KA+LIVE S P ++S
Sbjct: 3 WSATCAARAYLDTLRLCNNVGSLRVQELGSNEFVSALAAGMKAKLIVEVTS--SPASSST 60
Query: 57 -GLGIARGHTSGRHVCVVP----DERSKVEYCDRMGEAGVCTXXXXXXXXXXXXX--XXX 109
L A T G+ VC++P DE K + +G+
Sbjct: 61 IALAAAARQTGGKVVCILPEPVLDESKKA-----INNSGLNDQVEFKTEDPSKLLPRYKN 115
Query: 110 IDFMVVDCKGREFSKILRVAKLSSMGAVLVCKNANSRNVTGFKWXXXXXXXXXXXXLVHS 169
IDF +VDCK ++ +L + ++ + +V+V + N+ G + V S
Sbjct: 116 IDFSLVDCKDESYAMLLNLIDVNPVRSVVV-----ANNLVGDRKGLEGIRRKDEKLAVRS 170
Query: 170 VFLPVGKGLDIAHXXXXXXXXXXXXIKGK-----RWIKHVDRQSGDVHVIR 215
V +GKG+++ + +W+ D +SG+ H+ R
Sbjct: 171 VKHSIGKGMEVTMISKNDETDHKRHHHARKRNKSKWVAKFDEKSGEEHIFR 221
>Medtr6g060440.1 | DUF1442 family protein | HC |
chr6:20762736-20761390 | 20130731
Length = 221
Score = 65.9 bits (159), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 60/222 (27%), Positives = 92/222 (41%), Gaps = 26/222 (11%)
Query: 5 WSPEAASKAYIDTV---QSCPESGVAELVSAMAAGWKAQLIVETWSEGGPIATSRGLGIA 61
WS E A+KAY+ T+ Q E VAE +SA+AAG AQL+V T + G T+ L A
Sbjct: 4 WSAENATKAYLSTMKMGQKAKEPAVAEFISAIAAGNNAQLMVVTCA-GAADTTTLALVSA 62
Query: 62 RGHTSGRHVCVVPDERSKVEYCDRMGEAGVCTXXXXXXXXXXXXXXXXIDFMVVDCKGRE 121
T+G+ +C+VP + +G A DF+++DC
Sbjct: 63 ANQTNGKVICIVPTNEDLITSKKILGAAS--NQVQFMIGKEALLVLNKADFVLIDCNHMN 120
Query: 122 FSKILRVAKL---SSMGAVLVCKNANSRNVTGFKWXXXXXXXXXXXXLVHSVFLPVGKGL 178
+I++ ++ G V+V NA S + W + LP+G+GL
Sbjct: 121 HEEIVKCVQIGCCKQKGTVVVGYNAFSSKGS---WRSCGS---------KTQLLPIGQGL 168
Query: 179 DIA-----HXXXXXXXXXXXXIKGKRWIKHVDRQSGDVHVIR 215
+ + RWI VD+ +G+ HV R
Sbjct: 169 LVTRFGENNAISPKFESGMSNSPRSRWIVKVDKCTGEEHVYR 210
>Medtr6g060390.1 | DUF1442 family protein | HC |
chr6:20750761-20749382 | 20130731
Length = 223
Score = 63.9 bits (154), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 60/222 (27%), Positives = 90/222 (40%), Gaps = 24/222 (10%)
Query: 5 WSPEAASKAYIDTV---QSCPESGVAELVSAMAAGWKAQLIVETWSEGGPIATSRGLGIA 61
WS E A+KAY+ T+ Q E VAE +SA+AAG AQL+V + G T+ L A
Sbjct: 4 WSAENATKAYLSTMKMGQKAKEPAVAEFISAIAAGNNAQLMVVACA-GAADPTTLALVAA 62
Query: 62 RGHTSGRHVCVVPDERSKVEYCDRMGEAGVCTXXXXXXXXXXXXXXXXIDFMVVDC---K 118
T+G+ +C+VP + +G A DF+++DC
Sbjct: 63 ANQTNGKVICIVPTIEDLITSKKILGAASNQVQFMIGKGAQELLVLNKADFVLIDCNLIN 122
Query: 119 GREFSKILRVAKLSSMGAVLVCKNANSRNVTGFKWXXXXXXXXXXXXLVHSVFLPVGKGL 178
E K +++ G V+V NA S + W + LP+G+GL
Sbjct: 123 HEEIVKCVQIGCCKQKGTVVVGYNAFSSKGS---WRSCGS---------KTQLLPIGQGL 170
Query: 179 DIA-----HXXXXXXXXXXXXIKGKRWIKHVDRQSGDVHVIR 215
+ + RWI VD+ +G+ HV R
Sbjct: 171 LVTRFGENNAISPKFESGISNSPRSRWIVKVDKCTGEEHVYR 212
>Medtr1g076800.1 | DUF1442 family protein | HC |
chr1:34282105-34283311 | 20130731
Length = 222
Score = 62.8 bits (151), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 57/222 (25%), Positives = 90/222 (40%), Gaps = 25/222 (11%)
Query: 5 WSPEAASKAYIDTV---QSCPESGVAELVSAMAAGWKAQLIVETWSEGGPIATSRGLGIA 61
WS E A+KAY+ T+ Q E VAE +SA+AAG AQ+++ + +T+ L A
Sbjct: 4 WSAENATKAYLSTLKMGQKAKEPNVAEFISALAAGNNAQMMIVACANVAD-STTLALIAA 62
Query: 62 RGHTSGRHVCVVPDERSKVEYCDRMGEAG--VCTXXXXXXXXXXXXXXXXIDFMVVDCKG 119
T G+ +C+VP+ + + +G A V DF+++DC
Sbjct: 63 ANQTGGQVICIVPNHKDLIASKHVLGIASHQVQFMVGKAQEVLMLDQYEAADFLLIDCNI 122
Query: 120 REFSKILRVAKLSSMGAVLVCKNANSRNVTGFKWXXXXXXXXXXXXLVHSVFLPVGKGLD 179
+ +IL+ + +N N GF L LP+G+GL
Sbjct: 123 KNHEEILKTIQEG--------RNVNVVGYNGFSCKGSWLSCGSKTQL-----LPIGEGLL 169
Query: 180 IAHXXXXXXXXXXXXIK---GK---RWIKHVDRQSGDVHVIR 215
+ GK RW+ VD+ +G+ HV R
Sbjct: 170 VTRFGISENNSPRYGTSRSMGKIKSRWVVKVDKCTGEEHVFR 211
>Medtr4g023950.1 | DUF1442 family protein | HC |
chr4:8171319-8172599 | 20130731
Length = 219
Score = 53.5 bits (127), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 53/219 (24%), Positives = 85/219 (38%), Gaps = 23/219 (10%)
Query: 5 WSPEAASKAYIDTVQSCP---ESGVAELVSAMAAGWKAQLIVETWSEGGPIATSRGLGIA 61
WSPE A KAY+ ++ E VAE +SA+AAG AQL+V S +T+ L A
Sbjct: 4 WSPENAKKAYLQALKMAKRDKEPDVAEFISAIAAGKNAQLMV-VASANVASSTTLALAAA 62
Query: 62 RGHTSGRHVCVVPDERSKVEYCDRMG-EAGVCTXXXXXXXXXXXXXXXXIDFMVVDCKGR 120
T GR + + + + +G DF++VDC
Sbjct: 63 SQQTHGRVIYISSGQNELQASKEALGVHKDSVEFVVGDAKTLLLNDYKGADFVLVDCDMN 122
Query: 121 EFSKILRVA--KLSSMGAVLVCKNANSRNVTGFKWXXXXXXXXXXXXLVHSVFLPVGKGL 178
++ A + GA++V N R +W + +LP+G+GL
Sbjct: 123 NAREVFLAAFKGANKDGALVVGYNVRHR---ALRWRQ-----------FRATYLPIGEGL 168
Query: 179 DIAHXXXXXXXXXXXXIKGKR--WIKHVDRQSGDVHVIR 215
+ ++ + WI VD +G+ H+ R
Sbjct: 169 LVTKIDLNVKKDYDMVVQRNKSNWIVQVDNCTGEEHIFR 207