Miyakogusa Predicted Gene - Lj4g3v1108100.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v1108100.1 Non Characterized Hit- tr|I3S202|I3S202_LOTJA
Uncharacterized protein OS=Lotus japonicus PE=2 SV=1,98.05,0,no
description,DNA glycosylase; seg,NULL; DNA-glycosylase,DNA
glycosylase;
Adenine_glyco,Methyladeni,NODE_93385_length_1592_cov_14.918342.path2.1
(308 letters)
Database: Medicago_aa4.0v1
62,319 sequences; 21,947,249 total letters
Searching..................................................done
                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value
Medtr8g066040.1 | DNA-3-methyladenine glycosylase I | HC | chr8:...     458 e-129
Medtr0009s0230.1 | DNA-3-methyladenine glycosylase I | HC | scaf...     399 e-111
Medtr0009s0230.2 | DNA-3-methyladenine glycosylase I | HC | scaf...     392 e-109
Medtr4g130880.1 | DNA-3-methyladenine glycosylase I | HC | chr4:...     233 2e-61
Medtr4g007070.1 | DNA-3-methyladenine glycosylase I | HC | chr4:...     231 8e-61
Medtr2g063510.1 | DNA-3-methyladenine glycosylase I | HC | chr2:...     229 3e-60
Medtr8g447000.1 | DNA-3-methyladenine glycosylase I | HC | chr8:...     222 4e-58
Medtr3g111690.1 | DNA-3-methyladenine glycosylase I | HC | chr3:...     204 7e-53
Medtr4g130880.2 | DNA-3-methyladenine glycosylase I | HC | chr4:...     109 3e-24
Medtr2g063510.2 | DNA-3-methyladenine glycosylase I | HC | chr2:...     106 3e-23
Medtr8g447000.2 | DNA-3-methyladenine glycosylase I | HC | chr8:...     102 3e-22
>Medtr8g066040.1 | DNA-3-methyladenine glycosylase I | HC |
chr8:27430410-27434638 | 20130731
Length = 329
Score = 458 bits (1179), Expect = e-129, Method: Compositional matrix adjust.
Identities = 227/330 (68%), Positives = 249/330 (75%), Gaps = 23/330 (6%)
Query: 1 MSKSNVRRHALEKSMTLKDTQKILNQSFF-PKSLKKVYPVGLQKXXXXXXXXXXXXXXXX 59
MSK+NVR+ ALE+S + KDTQKILNQ+FF K KKVYP+GLQK
Sbjct: 1 MSKTNVRKQALERSTSFKDTQKILNQNFFHNKIFKKVYPIGLQKSTSSLSLSSVSLSLSQ 60
Query: 60 XXXXXXXXXXXXXPLDENISLALRLISVSPRQRREPTAAKTAQQ----LNTEPGELKRCN 115
PLDE IS ALRLIS S +RRE AKT Q + TEPGEL+RCN
Sbjct: 61 NSNDSSQADSLT-PLDERISSALRLISASSHERRETAVAKTIHQQSPLVTTEPGELRRCN 119
Query: 116 WVTKNSDKAYIEFHDECWGVPAYDDNKLFELLALSGLLMDYNWTEILRRKETLREVFAEF 175
W+TKNSDK Y+EFHDECWGVPAYDDNKLFE+LA+SGLLMDYNWTEI++R+E LREVFA F
Sbjct: 120 WITKNSDKLYVEFHDECWGVPAYDDNKLFEMLAMSGLLMDYNWTEIIKRREPLREVFAGF 179
Query: 176 DPYTVAKMEEKEIMEIASNKALSLAESRVMCIADNAK-----------------CIMKII 218
DPYTVAKMEE+EI+EI SNKALSLA+SRVMCI DN ++
Sbjct: 180 DPYTVAKMEEQEIIEITSNKALSLADSRVMCIVDNVSFGATLRLRSYGYGAGFLVNTPVV 239
Query: 219 RECGSFSSYIWGFVNHKPIINRYKYPRNVPLRSPKAEALSKDMVKRGFRFVGPVIVHSFM 278
RECGSFSSYIW FVNHKPIIN+YKYPRNVPLRSPKAEALSKDMVKRGFRFVGPVIVHSFM
Sbjct: 240 RECGSFSSYIWSFVNHKPIINKYKYPRNVPLRSPKAEALSKDMVKRGFRFVGPVIVHSFM 299
Query: 279 QAAGLTIDHLVDCYRHDECVSLAERPWRHI 308
QAAGLTIDHLVDCYRH ECVSLAERPWRHI
Sbjct: 300 QAAGLTIDHLVDCYRHSECVSLAERPWRHI 329
>Medtr0009s0230.1 | DNA-3-methyladenine glycosylase I | HC |
scaffold0009:104042-101934 | 20130731
Length = 308
Score = 399 bits (1026), Expect = e-111, Method: Compositional matrix adjust.
Identities = 199/312 (63%), Positives = 236/312 (75%), Gaps = 8/312 (2%)
Query: 1 MSKSNVRRHALEKSMTLKDTQKILNQSFFPKSLKKVYPVGLQKXXXXXXXXXXXXXXXXX 60
MSK N +RHA+EK + ++++K LNQ F K LK+VYP+GLQK
Sbjct: 1 MSKVNAKRHAMEKKSSDQESKK-LNQIIFHKHLKRVYPIGLQKSSSSSSISSFSSSLSQN 59
Query: 61 XXXXXXXXXXXXPLDENISLALRLISVSPRQRREPTAAKTAQQLN----TEPGELKRCNW 116
DE +SLAL S+SPRQRRE T +QQ E GELKRC+W
Sbjct: 60 SNDPCFTDSLTIA-DEEVSLALH--SISPRQRREHTLINISQQQQNQHAAELGELKRCSW 116
Query: 117 VTKNSDKAYIEFHDECWGVPAYDDNKLFELLALSGLLMDYNWTEILRRKETLREVFAEFD 176
+TKN DKAYIEFHDECWGVPAYDD KLFELLALSGLL+DYNWTEIL+RKE LR+VFA FD
Sbjct: 117 ITKNCDKAYIEFHDECWGVPAYDDKKLFELLALSGLLIDYNWTEILKRKEVLRQVFAGFD 176
Query: 177 PYTVAKMEEKEIMEIASNKALSLAESRVMCIADNAKCIMKIIRECGSFSSYIWGFVNHKP 236
PYTV+KMEEKE+++IAS L LAE RV CI DNAKC+MKI RE GSFSSYIW +VNHKP
Sbjct: 177 PYTVSKMEEKEVIDIASATELVLAECRVKCIVDNAKCMMKIRREFGSFSSYIWSYVNHKP 236
Query: 237 IINRYKYPRNVPLRSPKAEALSKDMVKRGFRFVGPVIVHSFMQAAGLTIDHLVDCYRHDE 296
+IN+Y+Y R+VPLR+PKA+A+SKD++KRGFR++GPVIV+SFMQ AGLTIDHLV CYRH E
Sbjct: 237 VINKYRYSRDVPLRTPKADAISKDLLKRGFRYLGPVIVYSFMQVAGLTIDHLVGCYRHKE 296
Query: 297 CVSLAERPWRHI 308
CV+LAERPW+HI
Sbjct: 297 CVNLAERPWKHI 308
>Medtr0009s0230.2 | DNA-3-methyladenine glycosylase I | HC |
scaffold0009:104053-101904 | 20130731
Length = 307
Score = 392 bits (1008), Expect = e-109, Method: Compositional matrix adjust.
Identities = 198/312 (63%), Positives = 235/312 (75%), Gaps = 9/312 (2%)
Query: 1 MSKSNVRRHALEKSMTLKDTQKILNQSFFPKSLKKVYPVGLQKXXXXXXXXXXXXXXXXX 60
MSK N +RHA+EK + ++++K LNQ F K LK+VYP+GLQK
Sbjct: 1 MSKVNAKRHAMEKKSSDQESKK-LNQIIFHKHLKRVYPIGLQKSSSSSSISSFSSSLSQN 59
Query: 61 XXXXXXXXXXXXPLDENISLALRLISVSPRQRREPTAAKTAQQLN----TEPGELKRCNW 116
DE +SLAL S+SPRQRRE T +QQ E GELKRC+W
Sbjct: 60 SNDPCFTDSLTIA-DEEVSLALH--SISPRQRREHTLINISQQQQNQHAAELGELKRCSW 116
Query: 117 VTKNSDKAYIEFHDECWGVPAYDDNKLFELLALSGLLMDYNWTEILRRKETLREVFAEFD 176
+TKN KAYIEFHDECWGVPAYDD KLFELLALSGLL+DYNWTEIL+RKE LR+VFA FD
Sbjct: 117 ITKNY-KAYIEFHDECWGVPAYDDKKLFELLALSGLLIDYNWTEILKRKEVLRQVFAGFD 175
Query: 177 PYTVAKMEEKEIMEIASNKALSLAESRVMCIADNAKCIMKIIRECGSFSSYIWGFVNHKP 236
PYTV+KMEEKE+++IAS L LAE RV CI DNAKC+MKI RE GSFSSYIW +VNHKP
Sbjct: 176 PYTVSKMEEKEVIDIASATELVLAECRVKCIVDNAKCMMKIRREFGSFSSYIWSYVNHKP 235
Query: 237 IINRYKYPRNVPLRSPKAEALSKDMVKRGFRFVGPVIVHSFMQAAGLTIDHLVDCYRHDE 296
+IN+Y+Y R+VPLR+PKA+A+SKD++KRGFR++GPVIV+SFMQ AGLTIDHLV CYRH E
Sbjct: 236 VINKYRYSRDVPLRTPKADAISKDLLKRGFRYLGPVIVYSFMQVAGLTIDHLVGCYRHKE 295
Query: 297 CVSLAERPWRHI 308
CV+LAERPW+HI
Sbjct: 296 CVNLAERPWKHI 307
>Medtr4g130880.1 | DNA-3-methyladenine glycosylase I | HC |
chr4:54548746-54544404 | 20130731
Length = 375
Score = 233 bits (594), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 98/191 (51%), Positives = 139/191 (72%)
Query: 112 KRCNWVTKNSDKAYIEFHDECWGVPAYDDNKLFELLALSGLLMDYNWTEILRRKETLREV 171
KRC W+T N++ Y FHDE WGVP +DD KLFE+L LS L + W IL ++ REV
Sbjct: 148 KRCAWITPNTEPYYATFHDEEWGVPVHDDKKLFEVLVLSSALSELTWPAILSKRHIFREV 207
Query: 172 FAEFDPYTVAKMEEKEIMEIASNKALSLAESRVMCIADNAKCIMKIIRECGSFSSYIWGF 231
FA+FDP V+K+ EK+++ + + L++ ++ I +NA+ I K+I E GSF +YIW F
Sbjct: 208 FADFDPVAVSKLNEKKVITPGTTASSLLSDQKLRGIIENARQISKVIVEFGSFDNYIWSF 267
Query: 232 VNHKPIINRYKYPRNVPLRSPKAEALSKDMVKRGFRFVGPVIVHSFMQAAGLTIDHLVDC 291
VNHKPI+++++YPR VP+++PKAE +SKD+V+RGFR VGP +++SFMQ GLT DHL+ C
Sbjct: 268 VNHKPILSKFRYPRQVPVKTPKAEVISKDLVRRGFRGVGPTVIYSFMQVVGLTNDHLISC 327
Query: 292 YRHDECVSLAE 302
+R ECV+ AE
Sbjct: 328 FRFQECVAAAE 338
>Medtr4g007070.1 | DNA-3-methyladenine glycosylase I | HC |
chr4:935811-930134 | 20130731
Length = 365
Score = 231 bits (588), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 101/187 (54%), Positives = 139/187 (74%)
Query: 112 KRCNWVTKNSDKAYIEFHDECWGVPAYDDNKLFELLALSGLLMDYNWTEILRRKETLREV 171
KRC WVT N++ YI FHDE WGVP +DD KLFELL+ SG L + +W IL +++ R+V
Sbjct: 137 KRCAWVTPNTEPCYIAFHDEEWGVPIHDDKKLFELLSFSGALAELSWPTILGKRQLFRKV 196
Query: 172 FAEFDPYTVAKMEEKEIMEIASNKALSLAESRVMCIADNAKCIMKIIRECGSFSSYIWGF 231
F +FDP V++M EK+I+ S + L+E R+ I +NA+ + K+I E GSF SYIW F
Sbjct: 197 FLDFDPCAVSRMNEKKIVAPGSPASSLLSELRLRSIIENARQMCKVIEEFGSFDSYIWNF 256
Query: 232 VNHKPIINRYKYPRNVPLRSPKAEALSKDMVKRGFRFVGPVIVHSFMQAAGLTIDHLVDC 291
VN+KPI+++++YPR VP +SPKAE +SKD+VKRGFR VGP ++++FMQ AGLT DHL+ C
Sbjct: 257 VNNKPIVSQFRYPRQVPAKSPKAEFISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGC 316
Query: 292 YRHDECV 298
+R EC+
Sbjct: 317 FRFKECI 323
>Medtr2g063510.1 | DNA-3-methyladenine glycosylase I | HC |
chr2:26805019-26800818 | 20130731
Length = 390
Score = 229 bits (583), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 102/208 (49%), Positives = 146/208 (70%), Gaps = 5/208 (2%)
Query: 94 EPTAAKTAQQLNTEPGELKRCNWVTKNSDKAYIEFHDECWGVPAYDDNKLFELLALSGLL 153
+P++A ++ N E KRC+++T NSD YI +HDE WGVP +DD LFELL LSG
Sbjct: 186 DPSSALDSKTTNQEE---KRCSFITTNSDPIYIAYHDEEWGVPVHDDKMLFELLILSGAQ 242
Query: 154 MDYNWTEILRRKETLREVFAEFDPYTVAKMEEKEIMEIASNKALSLAESRVMCIADNAKC 213
+ +WT L+++ R F+EFD VA + +K++M I+S + + S+V + DNA
Sbjct: 243 VGSDWTSTLKKRLDFRAAFSEFDAEIVANLTDKQMMSISSEYGIDI--SKVRGVVDNANQ 300
Query: 214 IMKIIRECGSFSSYIWGFVNHKPIINRYKYPRNVPLRSPKAEALSKDMVKRGFRFVGPVI 273
I+++ + GSF YIWGFVNHKPI N+YK+ +P+++ K+E++SKDM+KRGFR+VGP +
Sbjct: 301 ILQVRKGFGSFDKYIWGFVNHKPISNQYKFGHKIPVKTSKSESISKDMIKRGFRYVGPTV 360
Query: 274 VHSFMQAAGLTIDHLVDCYRHDECVSLA 301
VHSFMQAAGLT DHL+ C+RH +C LA
Sbjct: 361 VHSFMQAAGLTNDHLITCHRHLQCTLLA 388
>Medtr8g447000.1 | DNA-3-methyladenine glycosylase I | HC |
chr8:18418523-18415122 | 20130731
Length = 383
Score = 222 bits (565), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 96/196 (48%), Positives = 139/196 (70%), Gaps = 2/196 (1%)
Query: 106 TEPGELKRCNWVTKNSDKAYIEFHDECWGVPAYDDNKLFELLALSGLLMDYNWTEILRRK 165
T E KRC+++T NSD YI +HDE WGVP +DD LFELL LSG + +WT IL+++
Sbjct: 189 TSEEEEKRCSFITPNSDPIYIAYHDEEWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKR 248
Query: 166 ETLREVFAEFDPYTVAKMEEKEIMEIASNKALSLAESRVMCIADNAKCIMKIIRECGSFS 225
+ R F+EF T+A + +K+++ I+ + + SRV + DNA I+++ ++ GSF
Sbjct: 249 QDFRTAFSEFHAATLANLTDKQMLSISLEYGIDI--SRVRGVVDNANRILEVNKDFGSFD 306
Query: 226 SYIWGFVNHKPIINRYKYPRNVPLRSPKAEALSKDMVKRGFRFVGPVIVHSFMQAAGLTI 285
YIWGFVNHKPI +YK+ +P+++ K+E++SKDM++RGFRFVGP +VHSFMQAAGLT
Sbjct: 307 KYIWGFVNHKPISTQYKFGHKIPVKTSKSESISKDMIRRGFRFVGPTVVHSFMQAAGLTN 366
Query: 286 DHLVDCYRHDECVSLA 301
DHL+ C+ H +C L+
Sbjct: 367 DHLITCHSHLKCTLLS 382
>Medtr3g111690.1 | DNA-3-methyladenine glycosylase I | HC |
chr3:52235165-52232483 | 20130731
Length = 331
Score = 204 bits (520), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 91/187 (48%), Positives = 129/187 (68%), Gaps = 1/187 (0%)
Query: 112 KRCNWVTKNSDKAYIEFHDECWGVPAYDDN-KLFELLALSGLLMDYNWTEILRRKETLRE 170
KRC+W+T N+D Y FHD+ WGVP DD+ KLFELL S L ++ W IL ++ R+
Sbjct: 128 KRCDWITPNADPLYTAFHDDEWGVPVLDDDRKLFELLVFSQALAEHTWPTILNHRDIFRK 187
Query: 171 VFAEFDPYTVAKMEEKEIMEIASNKALSLAESRVMCIADNAKCIMKIIRECGSFSSYIWG 230
+F FDP ++A+ EK ++ N L+E ++ + +NAK +KI E GSFS+Y W
Sbjct: 188 LFENFDPSSIAQFTEKNLVTPKLNGNPLLSEQKLRAVVENAKQFLKIQLEFGSFSNYCWK 247
Query: 231 FVNHKPIINRYKYPRNVPLRSPKAEALSKDMVKRGFRFVGPVIVHSFMQAAGLTIDHLVD 290
FVN+KPI N ++Y R VP+++PKAE +SKDM++RGF+ VGP +V+SFMQ AGL DHL+
Sbjct: 248 FVNNKPIKNEFRYGRQVPVKNPKAELISKDMMRRGFQCVGPKVVYSFMQVAGLVNDHLIT 307
Query: 291 CYRHDEC 297
C+R+ EC
Sbjct: 308 CFRYQEC 314
>Medtr4g130880.2 | DNA-3-methyladenine glycosylase I | HC |
chr4:54548746-54544404 | 20130731
Length = 265
Score = 109 bits (273), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 49/118 (41%), Positives = 74/118 (62%), Gaps = 2/118 (1%)
Query: 112 KRCNWVTKNSDKAYIEFHDECWGVPAYDDNKLFELLALSGLLMDYNWTEILRRKETLREV 171
KRC W+T N++ Y FHDE WGVP +DD KLFE+L LS L + W IL ++ REV
Sbjct: 148 KRCAWITPNTEPYYATFHDEEWGVPVHDDKKLFEVLVLSSALSELTWPAILSKRHIFREV 207
Query: 172 FAEFDPYTVAKMEEKEIMEIASNKALSLAESRVMCIADNAKCIMKIIRECGSFSSYIW 229
FA+FDP V+K+ EK+++ + + L++ ++ I +NA+ I K++ G S +W
Sbjct: 208 FADFDPVAVSKLNEKKVITPGTTASSLLSDQKLRGIIENARQISKVLLMPG--HSRVW 263
>Medtr2g063510.2 | DNA-3-methyladenine glycosylase I | HC |
chr2:26804984-26802744 | 20130731
Length = 316
Score = 106 bits (265), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 53/134 (39%), Positives = 83/134 (61%), Gaps = 6/134 (4%)
Query: 94 EPTAAKTAQQLNTEPGELKRCNWVTKNSDKAYIEFHDECWGVPAYDDNKLFELLALSGLL 153
+P++A ++ N E KRC+++T NSD YI +HDE WGVP +DD LFELL LSG
Sbjct: 186 DPSSALDSKTTNQEE---KRCSFITTNSDPIYIAYHDEEWGVPVHDDKMLFELLILSGAQ 242
Query: 154 MDYNWTEILRRKETLREVFAEFDPYTVAKMEEKEIMEIASNKALSLAESRVMCIADNAKC 213
+ +WT L+++ R F+EFD VA + +K++M I+S + + S+V + DNA
Sbjct: 243 VGSDWTSTLKKRLDFRAAFSEFDAEIVANLTDKQMMSISSEYGIDI--SKVRGVVDNANQ 300
Query: 214 IMKIIRECGSFSSY 227
I+++ + FSS+
Sbjct: 301 ILQVNKNV-PFSSF 313
>Medtr8g447000.2 | DNA-3-methyladenine glycosylase I | HC |
chr8:18418617-18415158 | 20130731
Length = 323
Score = 102 bits (255), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 75/118 (63%), Gaps = 4/118 (3%)
Query: 106 TEPGELKRCNWVTKNSDKAYIEFHDECWGVPAYDDNKLFELLALSGLLMDYNWTEILRRK 165
T E KRC+++T NSD YI +HDE WGVP +DD LFELL LSG + +WT IL+++
Sbjct: 189 TSEEEEKRCSFITPNSDPIYIAYHDEEWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKR 248
Query: 166 ETLREVFAEFDPYTVAKMEEKEIMEIASNKALSLAESRVMCIADNAKCIMKI--IREC 221
+ R F+EF T+A + +K+++ I+ + + SRV + DNA I+++ ++ C
Sbjct: 249 QDFRTAFSEFHAATLANLTDKQMLSISLEYGIDI--SRVRGVVDNANRILEVKNVKTC 304