Miyakogusa Predicted Gene
- Lj3g3v3302770.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v3302770.1 Non Chatacterized Hit- tr|I1LJQ9|I1LJQ9_SOYBN
Uncharacterized protein OS=Glycine max PE=4
SV=1,93.09,0,DNA-glycosylase,DNA glycosylase; no description,DNA
glycosylase; no description,Helix-turn-helix, ba,CUFF.45545.1
(236 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT2G31450.2 | Symbols: ATNTH1 | DNA glycosylase superfamily prot... 355 1e-98
AT2G31450.1 | Symbols: ATNTH1 | DNA glycosylase superfamily prot... 355 1e-98
AT1G05900.2 | Symbols: ATNTH2, NTH2 | endonuclease III 2 | chr1:... 342 2e-94
AT1G05900.1 | Symbols: ATNTH2, NTH2 | endonuclease III 2 | chr1:... 248 2e-66
AT3G47830.1 | Symbols: | DNA glycosylase superfamily protein | ... 49 4e-06
>AT2G31450.2 | Symbols: ATNTH1 | DNA glycosylase superfamily protein
| chr2:13401318-13404134 REVERSE LENGTH=377
Length = 377
Score = 355 bits (911), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 162/217 (74%), Positives = 190/217 (87%)
Query: 1 MRCSAEAPVDTMGCEKAGNTLPPKERRFAVLVSSLLSSQTKDHVNHGAIQRLLQNDLLTA 60
MR S +APVD+MGC+KAG+ LPP ERRFAVL+ +LLSSQTKD VN+ AI RL QN LLT
Sbjct: 145 MRSSEDAPVDSMGCDKAGSFLPPTERRFAVLLGALLSSQTKDQVNNAAIHRLHQNGLLTP 204
Query: 61 DAINNADEETIKKLIYPVGFYTRKATNLKKIANICLMKYDGDIPNSLEQLLLLPGIGPKM 120
+A++ ADE TIK+LIYPVGFYTRKAT +KKIA ICL+KYDGDIP+SL+ LL LPGIGPKM
Sbjct: 205 EAVDKADESTIKELIYPVGFYTRKATYMKKIARICLVKYDGDIPSSLDDLLSLPGIGPKM 264
Query: 121 AHLVMNVGWNNVQGICVDTHVHRICNRLGWVSRLGTKQKTLTPEQTREALQRWLPREEWD 180
AHL++++ WN+VQGICVDTHVHRICNRLGWVSR GTKQKT +PE+TR ALQ+WLP+EEW
Sbjct: 265 AHLILHIAWNDVQGICVDTHVHRICNRLGWVSRPGTKQKTTSPEETRVALQQWLPKEEWV 324
Query: 181 PINPLLVGFGQTICTPLRPRCGECGISELCPSAFKET 217
INPLLVGFGQ ICTP+RPRC C +S+LCP+AFKET
Sbjct: 325 AINPLLVGFGQMICTPIRPRCEACSVSKLCPAAFKET 361
>AT2G31450.1 | Symbols: ATNTH1 | DNA glycosylase superfamily protein
| chr2:13401318-13404134 REVERSE LENGTH=379
Length = 379
Score = 355 bits (911), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 162/217 (74%), Positives = 190/217 (87%)
Query: 1 MRCSAEAPVDTMGCEKAGNTLPPKERRFAVLVSSLLSSQTKDHVNHGAIQRLLQNDLLTA 60
MR S +APVD+MGC+KAG+ LPP ERRFAVL+ +LLSSQTKD VN+ AI RL QN LLT
Sbjct: 147 MRSSEDAPVDSMGCDKAGSFLPPTERRFAVLLGALLSSQTKDQVNNAAIHRLHQNGLLTP 206
Query: 61 DAINNADEETIKKLIYPVGFYTRKATNLKKIANICLMKYDGDIPNSLEQLLLLPGIGPKM 120
+A++ ADE TIK+LIYPVGFYTRKAT +KKIA ICL+KYDGDIP+SL+ LL LPGIGPKM
Sbjct: 207 EAVDKADESTIKELIYPVGFYTRKATYMKKIARICLVKYDGDIPSSLDDLLSLPGIGPKM 266
Query: 121 AHLVMNVGWNNVQGICVDTHVHRICNRLGWVSRLGTKQKTLTPEQTREALQRWLPREEWD 180
AHL++++ WN+VQGICVDTHVHRICNRLGWVSR GTKQKT +PE+TR ALQ+WLP+EEW
Sbjct: 267 AHLILHIAWNDVQGICVDTHVHRICNRLGWVSRPGTKQKTTSPEETRVALQQWLPKEEWV 326
Query: 181 PINPLLVGFGQTICTPLRPRCGECGISELCPSAFKET 217
INPLLVGFGQ ICTP+RPRC C +S+LCP+AFKET
Sbjct: 327 AINPLLVGFGQMICTPIRPRCEACSVSKLCPAAFKET 363
>AT1G05900.2 | Symbols: ATNTH2, NTH2 | endonuclease III 2 |
chr1:1786896-1789502 FORWARD LENGTH=386
Length = 386
Score = 342 bits (876), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 155/217 (71%), Positives = 187/217 (86%)
Query: 1 MRCSAEAPVDTMGCEKAGNTLPPKERRFAVLVSSLLSSQTKDHVNHGAIQRLLQNDLLTA 60
M+ S EAPV+ + C++ G+ LPPKERRF VL+ +LLSSQTK+H+ A++RL QN LLT
Sbjct: 154 MKPSEEAPVNAVECDRTGSFLPPKERRFYVLIGTLLSSQTKEHITGAAVERLHQNGLLTP 213
Query: 61 DAINNADEETIKKLIYPVGFYTRKATNLKKIANICLMKYDGDIPNSLEQLLLLPGIGPKM 120
+AI+ ADE TIK+LIYPVGFYTRKATN+KK+A ICLM+YDGDIP +LE+LL LPG+GPK+
Sbjct: 214 EAIDKADESTIKELIYPVGFYTRKATNVKKVAKICLMEYDGDIPRTLEELLSLPGVGPKI 273
Query: 121 AHLVMNVGWNNVQGICVDTHVHRICNRLGWVSRLGTKQKTLTPEQTREALQRWLPREEWD 180
AHLV++V WN+VQGICVDTHVHRICNRLGWVS+ GTKQKT +PE+TR ALQ+WLP+ EW
Sbjct: 274 AHLVLHVAWNDVQGICVDTHVHRICNRLGWVSKPGTKQKTSSPEETRVALQQWLPKGEWV 333
Query: 181 PINPLLVGFGQTICTPLRPRCGECGISELCPSAFKET 217
IN LLVGFGQTICTPLRP CG C I+E+CPSAFKET
Sbjct: 334 AINFLLVGFGQTICTPLRPHCGTCSITEICPSAFKET 370
>AT1G05900.1 | Symbols: ATNTH2, NTH2 | endonuclease III 2 |
chr1:1786896-1789037 FORWARD LENGTH=314
Length = 314
Score = 248 bits (633), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 111/158 (70%), Positives = 137/158 (86%)
Query: 1 MRCSAEAPVDTMGCEKAGNTLPPKERRFAVLVSSLLSSQTKDHVNHGAIQRLLQNDLLTA 60
M+ S EAPV+ + C++ G+ LPPKERRF VL+ +LLSSQTK+H+ A++RL QN LLT
Sbjct: 154 MKPSEEAPVNAVECDRTGSFLPPKERRFYVLIGTLLSSQTKEHITGAAVERLHQNGLLTP 213
Query: 61 DAINNADEETIKKLIYPVGFYTRKATNLKKIANICLMKYDGDIPNSLEQLLLLPGIGPKM 120
+AI+ ADE TIK+LIYPVGFYTRKATN+KK+A ICLM+YDGDIP +LE+LL LPG+GPK+
Sbjct: 214 EAIDKADESTIKELIYPVGFYTRKATNVKKVAKICLMEYDGDIPRTLEELLSLPGVGPKI 273
Query: 121 AHLVMNVGWNNVQGICVDTHVHRICNRLGWVSRLGTKQ 158
AHLV++V WN+VQGICVDTHVHRICNRLGWVS+ GTKQ
Sbjct: 274 AHLVLHVAWNDVQGICVDTHVHRICNRLGWVSKPGTKQ 311
>AT3G47830.1 | Symbols: | DNA glycosylase superfamily protein |
chr3:17647069-17648346 FORWARD LENGTH=293
Length = 293
Score = 48.5 bits (114), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 52/176 (29%), Positives = 75/176 (42%), Gaps = 22/176 (12%)
Query: 31 LVSSLLSSQTKDHVNHGAIQRLLQNDLLTADAINNADEETIKKLIYPVGFYTRKATNLKK 90
LV LLS T + + A L + D + NA+ ++I+ I G +KA +K
Sbjct: 100 LVKILLSQNTTESNSQRAFASL-KATFPKWDDVLNAESKSIENAIRCGGLAPKKAVCIKN 158
Query: 91 IAN--------ICLMKYDG-DIPNSLEQLLLLPGIGPKMAH--LVMNVGWNNVQGICVDT 139
I N +CL G + +L G+GPK L+ N+ N+ VDT
Sbjct: 159 ILNRLQNERGRLCLEYLRGLSVEEVKTELSHFKGVGPKTVSCVLMFNLQHNDFP---VDT 215
Query: 140 HVHRICNRLGWVSRLGTKQKTLTPEQTREALQRWLPREEWDPINPLLVGFGQTICT 195
HV I LGWV + + KT L R +P E +N LL G+ IC+
Sbjct: 216 HVFEIAKALGWVPKTADRNKTYV------HLNRKIPDELKFDLNCLLYTHGK-ICS 264