Miyakogusa Predicted Gene
- Lj3g3v0392790.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v0392790.1 Non Characterized Hit- tr|I1MIA5|I1MIA5_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.35449
PE,82.89,0,Adenine_glyco,Methyladenine glycosylase; SUBFAMILY NOT
NAMED,NULL; FAMILY NOT NAMED,NULL; no descrip,CUFF.40608.1
(370 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value
AT3G12710.1 | Symbols: | DNA glycosylase superfamily protein | ...   355   3e-98
AT5G44680.1 | Symbols: | DNA glycosylase superfamily protein | ...   326   1e-89
AT5G57970.2 | Symbols: | DNA glycosylase superfamily protein | ...   219   2e-57
AT5G57970.1 | Symbols: | DNA glycosylase superfamily protein | ...   219   2e-57
AT1G80850.1 | Symbols: | DNA glycosylase superfamily protein | ...   219   3e-57
AT1G75090.1 | Symbols: | DNA glycosylase superfamily protein | ...   218   4e-57
AT1G13635.2 | Symbols: | DNA glycosylase superfamily protein | ...   214   1e-55
AT1G13635.1 | Symbols: | DNA glycosylase superfamily protein | ...   214   1e-55
AT1G15970.1 | Symbols: | DNA glycosylase superfamily protein | ...   211   1e-54
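The one-line hit summaries above end with a bit score and an E-value, so they can be pulled apart with a small regular expression. The following is a minimal sketch, not part of the BLAST output: the helper name `parse_hit_line` and the pattern are assumptions based only on the lines shown here, and a maintained parser is the safer choice for real pipelines.

```python
import re

# Hypothetical pattern: first token is the subject ID, the last two
# whitespace-separated tokens are the bit score and the E-value.
HIT_LINE = re.compile(
    r"^(?P<id>\S+)\s.*\s(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)\s*$"
)

def parse_hit_line(line):
    """Return (subject_id, bit_score, e_value), or None if the line does not match."""
    m = HIT_LINE.match(line)
    if m is None:
        return None
    evalue = m.group("evalue")
    # Legacy BLAST may print very small E-values as "e-100"; make that parseable.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return m.group("id"), float(m.group("bits")), float(evalue)

lines = [
    "AT3G12710.1 | Symbols: | DNA glycosylase superfamily protein | ... 355 3e-98",
    "AT1G15970.1 | Symbols: | DNA glycosylase superfamily protein | ... 211 1e-54",
    "Sequences producing significant alignments: (bits) Value",
]
hits = [parse_hit_line(line) for line in lines]
```

The header line does not end in a numeric score, so it yields `None` and can be filtered out before further processing.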
>AT3G12710.1 | Symbols: | DNA glycosylase superfamily protein |
chr3:4040572-4041828 REVERSE LENGTH=312
Length = 312
Score = 355 bits (911), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 177/264 (67%), Positives = 204/264 (77%), Gaps = 17/264 (6%)
Query: 110 TLERKKSKSFKDTXXXXXXXXXXXXXXNLITDAPGSIAAVRREHMALQHAQRKMKISHYG 169
+LERKKSKSFK+ LIT+APGSIAAVRRE +A Q A RK+KI+HYG
Sbjct: 46 SLERKKSKSFKEGDSYSSW---------LITEAPGSIAAVRREQVAAQQALRKLKIAHYG 96
Query: 170 RSKSA-NF--ERVVPLDSSINLTSKTIEEEKRCSFITANSDPIYVAYHDEQWGVPVHDDK 226
RSKS NF +VVPL L +RCSF+T SDPIYVAYHDE+WGVPVHDDK
Sbjct: 97 RSKSTINFTSSKVVPL-----LNPNPNPHPQRCSFLTPTSDPIYVAYHDEEWGVPVHDDK 151
Query: 227 MLFELLVLSGAQVGSDWTSILKKRQDFRTAFSEFDAATVANFTDKQMVSISLEYGIDISR 286
LFELL LSGAQVGSDWTS L+KR D+R AF EF+A VA T+K+M +IS+EY I++S+
Sbjct: 152 TLFELLTLSGAQVGSDWTSTLRKRHDYRKAFMEFEAEVVAKLTEKEMNAISIEYKIEMSK 211
Query: 287 VRGVVDNANQILEIIKNFGSFDKYIWGFVNHKPISTQYKFSHKIPVKTSKSESISKDMIR 346
VRGVV+NA +I+EI K F S +KY+WGFVNHKPIST YK HKIPVKTSKSESISKDM+R
Sbjct: 212 VRGVVENAKKIVEIKKAFVSLEKYLWGFVNHKPISTNYKLGHKIPVKTSKSESISKDMVR 271
Query: 347 RGFRFVGPTVLHSFMQAAGLTNDH 370
RGFRFVGPTV+HSFMQAAGLTNDH
Sbjct: 272 RGFRFVGPTVVHSFMQAAGLTNDH 295
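The percentages on the "Identities / Positives / Gaps" line can be recomputed from the fractions printed beside them. The sketch below assumes the percentages are truncated rather than rounded, which matches every alignment in this report (for example 178/373 is printed as 47%); the function name `check_stats` is hypothetical.

```python
import re

# Matches e.g. "Identities = 177/264 (67%)".
STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")

def check_stats(line):
    """Map each statistic to (recomputed_percent, reported_percent)."""
    result = {}
    for name, num, den, pct in STAT.findall(line):
        # Integer division truncates, mirroring the values printed in this report.
        result[name] = (100 * int(num) // int(den), int(pct))
    return result

first = check_stats(
    "Identities = 177/264 (67%), Positives = 204/264 (77%), Gaps = 17/264 (6%)"
)
second = check_stats(
    "Identities = 178/373 (47%), Positives = 231/373 (61%), Gaps = 39/373 (10%)"
)
```

A mismatch between the two numbers in any tuple would suggest a damaged or hand-edited statistics line.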
>AT5G44680.1 | Symbols: | DNA glycosylase superfamily protein |
chr5:18024461-18025893 REVERSE LENGTH=353
Length = 353
Score = 326 bits (836), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 178/373 (47%), Positives = 231/373 (61%), Gaps = 39/373 (10%)
Query: 1 MCSTKAKVTVGQEATTTIPWARINGRPVLQPTCNRVPNLVEGRNXXXXXXXXXXXXXXXX 60
MCS+K K + QE + +INGRPVLQP N+VP L + RN
Sbjct: 1 MCSSKLK-NLTQENIS-----QINGRPVLQPKSNQVPTL-DRRNSLKKSPPKPLNPIASK 53
Query: 61 XXXXXXXXXXXXXXXXXXXXXAVKRGNESNVLNSSSEK---IATPKNSTRTPTLERKKSK 117
G+ +L SSS K + +P+NS
Sbjct: 54 IPSPRPISLISPPLSPNTKSLRKPAGSCKELLRSSSTKSKPVISPENS----------DG 103
Query: 118 SFKDTXXXXXXXXXXXXXXNLITDAPGSIAAVRREHMALQHAQRKMKISHYGRSKSANFE 177
+K+ ++ PGSIAA RRE +A++ +RK KISHYGR KS
Sbjct: 104 GYKEVMPMV-----------IVQKQPGSIAAARREEVAMKQEERKKKISHYGRIKSVKSN 152
Query: 178 RVVPLDSSINLTSKTIEEEKRCSFITANSDPIYVAYHDEQWGVPVHDDKMLFELLVLSGA 237
+ ++N+ E++KRCSFIT +SDPIYVAYHD++WGVPVHDD +LFELLVL+GA
Sbjct: 153 -----EKNLNVEH---EKKKRCSFITTSSDPIYVAYHDKEWGVPVHDDNLLFELLVLTGA 204
Query: 238 QVGSDWTSILKKRQDFRTAFSEFDAATVANFTDKQMVSISLEYGIDISRVRGVVDNANQI 297
QVGSDWTS+LK+R FR AFS F+A VA+F +K++ SI +YGI++S+V VVDNA QI
Sbjct: 205 QVGSDWTSVLKRRNTFREAFSGFEAELVADFNEKKIQSIVNDYGINLSQVLAVVDNAKQI 264
Query: 298 LEIIKNFGSFDKYIWGFVNHKPISTQYKFSHKIPVKTSKSESISKDMIRRGFRFVGPTVL 357
L++ ++ GSF+KYIWGF+ HKP++T+Y KIPVKTSKSE+ISKDM+RRGFRFVGPTV+
Sbjct: 265 LKVKRDLGSFNKYIWGFMKHKPVTTKYTSCQKIPVKTSKSETISKDMVRRGFRFVGPTVI 324
Query: 358 HSFMQAAGLTNDH 370
HS MQAAGLTNDH
Sbjct: 325 HSLMQAAGLTNDH 337
>AT5G57970.2 | Symbols: | DNA glycosylase superfamily protein |
chr5:23467316-23468910 FORWARD LENGTH=347
Length = 347
Score = 219 bits (559), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 95/179 (53%), Positives = 133/179 (74%), Gaps = 2/179 (1%)
Query: 194 EEEKRCSFITANSDPIYVAYHDEQWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDF 253
E +KRC+++T NSDP Y+ +HDE+WGVPVHDDK LFELLVLSGA W +IL KRQ F
Sbjct: 151 ETKKRCTWVTPNSDPCYIVFHDEEWGVPVHDDKRLFELLVLSGALAEHTWPTILSKRQAF 210
Query: 254 RTAFSEFDAATVANFTDKQMVSISLEYGIDIS--RVRGVVDNANQILEIIKNFGSFDKYI 311
R F++FD + +K+++ +S ++R V++NA QIL++I+ +GSFDKYI
Sbjct: 211 REVFADFDPNAIVKINEKKIIGPGSPASTLLSDLKLRAVIENARQILKVIEEYGSFDKYI 270
Query: 312 WGFVNHKPISTQYKFSHKIPVKTSKSESISKDMIRRGFRFVGPTVLHSFMQAAGLTNDH 370
W FV +K I +++++ ++P KT K+E ISKD++RRGFR VGPTV++SFMQAAG+TNDH
Sbjct: 271 WSFVKNKAIVSKFRYQRQVPAKTPKAEVISKDLVRRGFRSVGPTVVYSFMQAAGITNDH 329
>AT5G57970.1 | Symbols: | DNA glycosylase superfamily protein |
chr5:23467316-23468910 FORWARD LENGTH=347
Length = 347
Score = 219 bits (559), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 95/179 (53%), Positives = 133/179 (74%), Gaps = 2/179 (1%)
Query: 194 EEEKRCSFITANSDPIYVAYHDEQWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDF 253
E +KRC+++T NSDP Y+ +HDE+WGVPVHDDK LFELLVLSGA W +IL KRQ F
Sbjct: 151 ETKKRCTWVTPNSDPCYIVFHDEEWGVPVHDDKRLFELLVLSGALAEHTWPTILSKRQAF 210
Query: 254 RTAFSEFDAATVANFTDKQMVSISLEYGIDIS--RVRGVVDNANQILEIIKNFGSFDKYI 311
R F++FD + +K+++ +S ++R V++NA QIL++I+ +GSFDKYI
Sbjct: 211 REVFADFDPNAIVKINEKKIIGPGSPASTLLSDLKLRAVIENARQILKVIEEYGSFDKYI 270
Query: 312 WGFVNHKPISTQYKFSHKIPVKTSKSESISKDMIRRGFRFVGPTVLHSFMQAAGLTNDH 370
W FV +K I +++++ ++P KT K+E ISKD++RRGFR VGPTV++SFMQAAG+TNDH
Sbjct: 271 WSFVKNKAIVSKFRYQRQVPAKTPKAEVISKDLVRRGFRSVGPTVVYSFMQAAGITNDH 329
>AT1G80850.1 | Symbols: | DNA glycosylase superfamily protein |
chr1:30385607-30387272 REVERSE LENGTH=327
Length = 327
Score = 219 bits (557), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 98/176 (55%), Positives = 129/176 (73%), Gaps = 2/176 (1%)
Query: 197 KRCSFITANSDPIYVAYHDEQWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDFRTA 256
KRC++IT SD Y+A+HDE+WGVPVHDDK LFELL LSGA W IL KRQ FR
Sbjct: 134 KRCAWITPKSDQCYIAFHDEEWGVPVHDDKRLFELLSLSGALAELSWKDILSKRQLFREV 193
Query: 257 FSEFDAATVANFTDKQMVSISLEYGIDIS--RVRGVVDNANQILEIIKNFGSFDKYIWGF 314
F +FD ++ T+K++ S + +S ++R +++NANQ+ +II FGSFDKYIW F
Sbjct: 194 FMDFDPIAISELTNKKITSPEIAATTLLSEQKLRSILENANQVCKIIGAFGSFDKYIWNF 253
Query: 315 VNHKPISTQYKFSHKIPVKTSKSESISKDMIRRGFRFVGPTVLHSFMQAAGLTNDH 370
VN KP +Q+++ ++PVKTSK+E ISKD++RRGFR V PTV++SFMQ AGLTNDH
Sbjct: 254 VNQKPTQSQFRYPRQVPVKTSKAELISKDLVRRGFRSVSPTVIYSFMQTAGLTNDH 309
>AT1G75090.1 | Symbols: | DNA glycosylase superfamily protein |
chr1:28187647-28189612 REVERSE LENGTH=329
Length = 329
Score = 218 bits (556), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 94/176 (53%), Positives = 131/176 (74%), Gaps = 2/176 (1%)
Query: 197 KRCSFITANSDPIYVAYHDEQWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDFRTA 256
KRC +IT NSDPIYV +HDE+WGVPV DDK LFELLV S A W SIL++R DFR
Sbjct: 119 KRCHWITPNSDPIYVLFHDEEWGVPVRDDKKLFELLVFSQALAEFSWPSILRRRDDFRKL 178
Query: 257 FSEFDAATVANFTDKQMVSISLEYGIDIS--RVRGVVDNANQILEIIKNFGSFDKYIWGF 314
F EFD + +A FT+K+++S+ + + +S ++R +V+NA +L++ + FGSF Y W F
Sbjct: 179 FEEFDPSAIAQFTEKRLMSLRVNGCLILSEQKLRAIVENAKSVLKVKQEFGSFSNYCWRF 238
Query: 315 VNHKPISTQYKFSHKIPVKTSKSESISKDMIRRGFRFVGPTVLHSFMQAAGLTNDH 370
VNHKP+ Y++ ++PVK+ K+E ISKDM++RGFR VGPTV++SF+QA+G+ NDH
Sbjct: 239 VNHKPLRNGYRYGRQVPVKSPKAEYISKDMMQRGFRCVGPTVMYSFLQASGIVNDH 294
>AT1G13635.2 | Symbols: | DNA glycosylase superfamily protein |
chr1:4674248-4675784 FORWARD LENGTH=311
Length = 311
Score = 214 bits (544), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 93/179 (51%), Positives = 132/179 (73%), Gaps = 2/179 (1%)
Query: 194 EEEKRCSFITANSDPIYVAYHDEQWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDF 253
+E KRC++IT SD +YV +HD+QWGVPV+DD +LFE L +SG + +WT ILK+++ F
Sbjct: 112 DEPKRCNWITKKSDEVYVMFHDQQWGVPVYDDNLLFEFLAMSGMLMDYNWTEILKRKEHF 171
Query: 254 RTAFSEFDAATVANFTDKQMVSISLEYGIDI--SRVRGVVDNANQILEIIKNFGSFDKYI 311
R AF EFD VA +K++ I+ I + SRVR +VDNA I +++ FGSF ++
Sbjct: 172 REAFCEFDPNRVAKMGEKEIAEIASNKAIMLQESRVRCIVDNAKCITKVVNEFGSFSSFV 231
Query: 312 WGFVNHKPISTQYKFSHKIPVKTSKSESISKDMIRRGFRFVGPTVLHSFMQAAGLTNDH 370
WGF+++KPI ++K+S +P+++ K+E ISKDMI+RGFRFVGP ++HSFMQAAGLT DH
Sbjct: 232 WGFMDYKPIINKFKYSRNVPLRSPKAEIISKDMIKRGFRFVGPVIVHSFMQAAGLTIDH 290
>AT1G13635.1 | Symbols: | DNA glycosylase superfamily protein |
chr1:4674248-4675784 FORWARD LENGTH=311
Length = 311
Score = 214 bits (544), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 93/179 (51%), Positives = 132/179 (73%), Gaps = 2/179 (1%)
Query: 194 EEEKRCSFITANSDPIYVAYHDEQWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDF 253
+E KRC++IT SD +YV +HD+QWGVPV+DD +LFE L +SG + +WT ILK+++ F
Sbjct: 112 DEPKRCNWITKKSDEVYVMFHDQQWGVPVYDDNLLFEFLAMSGMLMDYNWTEILKRKEHF 171
Query: 254 RTAFSEFDAATVANFTDKQMVSISLEYGIDI--SRVRGVVDNANQILEIIKNFGSFDKYI 311
R AF EFD VA +K++ I+ I + SRVR +VDNA I +++ FGSF ++
Sbjct: 172 REAFCEFDPNRVAKMGEKEIAEIASNKAIMLQESRVRCIVDNAKCITKVVNEFGSFSSFV 231
Query: 312 WGFVNHKPISTQYKFSHKIPVKTSKSESISKDMIRRGFRFVGPTVLHSFMQAAGLTNDH 370
WGF+++KPI ++K+S +P+++ K+E ISKDMI+RGFRFVGP ++HSFMQAAGLT DH
Sbjct: 232 WGFMDYKPIINKFKYSRNVPLRSPKAEIISKDMIKRGFRFVGPVIVHSFMQAAGLTIDH 290
>AT1G15970.1 | Symbols: | DNA glycosylase superfamily protein |
chr1:5486544-5488494 REVERSE LENGTH=352
Length = 352
Score = 211 bits (536), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 95/179 (53%), Positives = 126/179 (70%), Gaps = 8/179 (4%)
Query: 197 KRCSFITANSDPIYVAYHDEQWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDFRTA 256
KRC++IT +DP YVA+HDE+WGVPVHDDK LFELL LSGA WT IL +R R
Sbjct: 145 KRCAWITPKADPCYVAFHDEEWGVPVHDDKKLFELLCLSGALAELSWTDILSRRHILREV 204
Query: 257 FSEFDAATVANFTDKQM-----VSISLEYGIDISRVRGVVDNANQILEIIKNFGSFDKYI 311
F +FD VA DK++ +ISL + ++R ++DN+ + +II GS KY+
Sbjct: 205 FMDFDPVAVAELNDKKLTAPGTAAISLLSEV---KIRSILDNSRHVRKIIAECGSLKKYM 261
Query: 312 WGFVNHKPISTQYKFSHKIPVKTSKSESISKDMIRRGFRFVGPTVLHSFMQAAGLTNDH 370
W FVN+KP +Q+++ ++PVKTSK+E ISKD++RRGFR V PTV++SFMQAAGLTNDH
Sbjct: 262 WNFVNNKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFRSVSPTVIYSFMQAAGLTNDH 320
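The "Score" and "Expect" values in each alignment header are linked by the standard relation E ≈ m · n · 2^(−S'), where S' is the bit score, m the query length, and n the database size in letters. The sketch below plugs in the raw lengths from this report (370 letters, 14,482,855 total letters). This is an approximation under stated assumptions: BLAST uses edge-corrected effective lengths and an unrounded bit score, so only the order of magnitude is expected to agree with the reported value. The function name `approx_evalue` is hypothetical.

```python
def approx_evalue(bit_score, query_len, db_letters):
    """Rough E-value from a bit score, using raw (uncorrected) lengths."""
    return query_len * db_letters * 2.0 ** (-bit_score)

# Top hit AT3G12710.1: 355 bits, reported Expect = 3e-98.
top_hit = approx_evalue(355, 370, 14_482_855)

# Weakest listed hit AT1G15970.1: 211 bits, reported Expect = 1e-54.
last_hit = approx_evalue(211, 370, 14_482_855)
```

Because the raw lengths are larger than the effective lengths, the estimate tends to sit somewhat above the reported E-value.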