Miyakogusa Predicted Gene
- Lj0g3v0274889.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0274889.1 Non Characterized Hit- tr|I1KDD7|I1KDD7_SOYBN
Uncharacterized protein OS=Glycine max PE=4 SV=1,86.05,0,SUBFAMILY NOT
NAMED,NULL; FAMILY NOT NAMED,NULL; DNA-glycosylase,DNA glycosylase; no
description,DNA,NODE_17350_length_2207_cov_53.992298.path2.1
(386 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                   Score   E
Sequences producing significant alignments:                        (bits)  Value
AT5G57970.2 | Symbols: | DNA glycosylase superfamily protein | ...   461   e-130
AT5G57970.1 | Symbols: | DNA glycosylase superfamily protein | ...   461   e-130
AT1G15970.1 | Symbols: | DNA glycosylase superfamily protein | ...   310   1e-84
AT1G80850.1 | Symbols: | DNA glycosylase superfamily protein | ...   293   2e-79
AT1G75090.1 | Symbols: | DNA glycosylase superfamily protein | ...   263   1e-70
AT1G13635.2 | Symbols: | DNA glycosylase superfamily protein | ...   231   6e-61
AT1G13635.1 | Symbols: | DNA glycosylase superfamily protein | ...   231   6e-61
AT3G12710.1 | Symbols: | DNA glycosylase superfamily protein | ...   215   4e-56
AT5G44680.1 | Symbols: | DNA glycosylase superfamily protein | ...   215   5e-56
>AT5G57970.2 | Symbols: | DNA glycosylase superfamily protein |
chr5:23467316-23468910 FORWARD LENGTH=347
Length = 347
Score = 461 bits (1185), Expect = e-130, Method: Compositional matrix adjust.
Identities = 224/346 (64%), Positives = 266/346 (76%), Gaps = 2/346 (0%)
Query: 1 MSGAPRLRSMNVADSEARPVLGPAGNKTGSLSSRKSASKPLRKVDKLLDEAASAVKEKKP 60
MSGAPR++SMNVA++E R LG K + K+ SK LRK+++ + EK
Sbjct: 1 MSGAPRVQSMNVAEAETRSTLGSTAKKASPFITHKAVSKSLRKLERS-SSGRTGSDEKTS 59
Query: 61 HQVLXXXXXXXPKSYPAARVSSLLSRHEQLLHSNLSLNASCSSDASTDSFHSRASTGRLT 120
+ + + +S+L RHEQ L+SNLSLNAS SSDAS DSFHSRASTGRL
Sbjct: 60 YATPTETVSSSSQKH-TLNAASILRRHEQNLNSNLSLNASFSSDASMDSFHSRASTGRLI 118
Query: 121 RSYSLGTRRKPYVSKPRSVASDGVLESPPDASQSKKRCAWVTPNTEPCYATFHDEEWGVP 180
RSYS+G+R K Y SKPRSV S+G L+SPP+ S++KKRC WVTPN++PCY FHDEEWGVP
Sbjct: 119 RSYSVGSRSKSYPSKPRSVVSEGALDSPPNGSETKKRCTWVTPNSDPCYIVFHDEEWGVP 178
Query: 181 VHDDKKLFELLVLSIALAELSWPVILSKRHSFREAFADFDPVAVSKLNEKKMMAPGTVAS 240
VHDDK+LFELLVLS ALAE +WP ILSKR +FRE FADFDP A+ K+NEKK++ PG+ AS
Sbjct: 179 VHDDKRLFELLVLSGALAEHTWPTILSKRQAFREVFADFDPNAIVKINEKKIIGPGSPAS 238
Query: 241 SLLSELKLRAVIENARQISKVIDEFGSFDKYIWSFVNHKPIVNRFRYPRQVPVKTPKADV 300
+LLS+LKLRAVIENARQI KVI+E+GSFDKYIWSFV +K IV++FRY RQVP KTPKA+V
Sbjct: 239 TLLSDLKLRAVIENARQILKVIEEYGSFDKYIWSFVKNKAIVSKFRYQRQVPAKTPKAEV 298
Query: 301 ISKDLVRRGFRGVGPTVIYSFMQVAGFTNDHLMSCFRFPECIAAAE 346
ISKDLVRRGFR VGPTV+YSFMQ AG TNDHL SCFRF CI E
Sbjct: 299 ISKDLVRRGFRSVGPTVVYSFMQAAGITNDHLTSCFRFHHCIFEHE 344
>AT5G57970.1 | Symbols: | DNA glycosylase superfamily protein |
chr5:23467316-23468910 FORWARD LENGTH=347
Length = 347
Score = 461 bits (1185), Expect = e-130, Method: Compositional matrix adjust.
Identities = 224/346 (64%), Positives = 266/346 (76%), Gaps = 2/346 (0%)
Query: 1 MSGAPRLRSMNVADSEARPVLGPAGNKTGSLSSRKSASKPLRKVDKLLDEAASAVKEKKP 60
MSGAPR++SMNVA++E R LG K + K+ SK LRK+++ + EK
Sbjct: 1 MSGAPRVQSMNVAEAETRSTLGSTAKKASPFITHKAVSKSLRKLERS-SSGRTGSDEKTS 59
Query: 61 HQVLXXXXXXXPKSYPAARVSSLLSRHEQLLHSNLSLNASCSSDASTDSFHSRASTGRLT 120
+ + + +S+L RHEQ L+SNLSLNAS SSDAS DSFHSRASTGRL
Sbjct: 60 YATPTETVSSSSQKH-TLNAASILRRHEQNLNSNLSLNASFSSDASMDSFHSRASTGRLI 118
Query: 121 RSYSLGTRRKPYVSKPRSVASDGVLESPPDASQSKKRCAWVTPNTEPCYATFHDEEWGVP 180
RSYS+G+R K Y SKPRSV S+G L+SPP+ S++KKRC WVTPN++PCY FHDEEWGVP
Sbjct: 119 RSYSVGSRSKSYPSKPRSVVSEGALDSPPNGSETKKRCTWVTPNSDPCYIVFHDEEWGVP 178
Query: 181 VHDDKKLFELLVLSIALAELSWPVILSKRHSFREAFADFDPVAVSKLNEKKMMAPGTVAS 240
VHDDK+LFELLVLS ALAE +WP ILSKR +FRE FADFDP A+ K+NEKK++ PG+ AS
Sbjct: 179 VHDDKRLFELLVLSGALAEHTWPTILSKRQAFREVFADFDPNAIVKINEKKIIGPGSPAS 238
Query: 241 SLLSELKLRAVIENARQISKVIDEFGSFDKYIWSFVNHKPIVNRFRYPRQVPVKTPKADV 300
+LLS+LKLRAVIENARQI KVI+E+GSFDKYIWSFV +K IV++FRY RQVP KTPKA+V
Sbjct: 239 TLLSDLKLRAVIENARQILKVIEEYGSFDKYIWSFVKNKAIVSKFRYQRQVPAKTPKAEV 298
Query: 301 ISKDLVRRGFRGVGPTVIYSFMQVAGFTNDHLMSCFRFPECIAAAE 346
ISKDLVRRGFR VGPTV+YSFMQ AG TNDHL SCFRF CI E
Sbjct: 299 ISKDLVRRGFRSVGPTVVYSFMQAAGITNDHLTSCFRFHHCIFEHE 344
>AT1G15970.1 | Symbols: | DNA glycosylase superfamily protein |
chr1:5486544-5488494 REVERSE LENGTH=352
Length = 352
Score = 310 bits (793), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 172/350 (49%), Positives = 222/350 (63%), Gaps = 19/350 (5%)
Query: 1 MSGAPRLRSMNVADSEARPVLGPAGNKTGSLSSRKSASKPLRK---VDKLLDEAASAVKE 57
MS PR RS+N + E R VLGP GNK KP+ + +D ++A
Sbjct: 1 MSVPPRFRSVNSDEREFRSVLGPTGNKLQRKPPGMKLEKPMMEKTIIDSKDEKAKKPTTP 60
Query: 58 KKPHQVLXXXXXXXPKSYPAARVSSLLSRHEQLLHSNLSLNASCSSDASTDSFHSRASTG 117
P L ++ SS+L ++ + ++ S +AS S ++S S S +S
Sbjct: 61 ASPRTTLKQC---------SSLCSSILRKNSASMTASYSSDASSSCESSPLSVASSSSCK 111
Query: 118 RLTR-SYSLGTRRKPYVSKPRSVASDGVLESPPDASQSKKRCAWVTPNTEPCYATFHDEE 176
++ R S S+ + RK V K S + +KRCAW+TP +PCY FHDEE
Sbjct: 112 KVVRRSGSVSSTRKLSVGKEEEKVSGDCF------ADGRKRCAWITPKADPCYVAFHDEE 165
Query: 177 WGVPVHDDKKLFELLVLSIALAELSWPVILSKRHSFREAFADFDPVAVSKLNEKKMMAPG 236
WGVPVHDDKKLFELL LS ALAELSW ILS+RH RE F DFDPVAV++LN+KK+ APG
Sbjct: 166 WGVPVHDDKKLFELLCLSGALAELSWTDILSRRHILREVFMDFDPVAVAELNDKKLTAPG 225
Query: 237 TVASSLLSELKLRAVIENARQISKVIDEFGSFDKYIWSFVNHKPIVNRFRYPRQVPVKTP 296
T A SLLSE+K+R++++N+R + K+I E GS KY+W+FVN+KP ++FRY RQVPVKT
Sbjct: 226 TAAISLLSEVKIRSILDNSRHVRKIIAECGSLKKYMWNFVNNKPTQSQFRYQRQVPVKTS 285
Query: 297 KADVISKDLVRRGFRGVGPTVIYSFMQVAGFTNDHLMSCFRFPECIAAAE 346
KA+ ISKDLVRRGFR V PTVIYSFMQ AG TNDHL+ CFR+ +C AE
Sbjct: 286 KAEFISKDLVRRGFRSVSPTVIYSFMQAAGLTNDHLIGCFRYQDCCVDAE 335
>AT1G80850.1 | Symbols: | DNA glycosylase superfamily protein |
chr1:30385607-30387272 REVERSE LENGTH=327
Length = 327
Score = 293 bits (749), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 130/192 (67%), Positives = 154/192 (80%)
Query: 155 KKRCAWVTPNTEPCYATFHDEEWGVPVHDDKKLFELLVLSIALAELSWPVILSKRHSFRE 214
+KRCAW+TP ++ CY FHDEEWGVPVHDDK+LFELL LS ALAELSW ILSKR FRE
Sbjct: 133 RKRCAWITPKSDQCYIAFHDEEWGVPVHDDKRLFELLSLSGALAELSWKDILSKRQLFRE 192
Query: 215 AFADFDPVAVSKLNEKKMMAPGTVASSLLSELKLRAVIENARQISKVIDEFGSFDKYIWS 274
F DFDP+A+S+L KK+ +P A++LLSE KLR+++ENA Q+ K+I FGSFDKYIW+
Sbjct: 193 VFMDFDPIAISELTNKKITSPEIAATTLLSEQKLRSILENANQVCKIIGAFGSFDKYIWN 252
Query: 275 FVNHKPIVNRFRYPRQVPVKTPKADVISKDLVRRGFRGVGPTVIYSFMQVAGFTNDHLMS 334
FVN KP ++FRYPRQVPVKT KA++ISKDLVRRGFR V PTVIYSFMQ AG TNDHL
Sbjct: 253 FVNQKPTQSQFRYPRQVPVKTSKAELISKDLVRRGFRSVSPTVIYSFMQTAGLTNDHLTC 312
Query: 335 CFRFPECIAAAE 346
CFR +C+ E
Sbjct: 313 CFRHHDCMTKDE 324
>AT1G75090.1 | Symbols: | DNA glycosylase superfamily protein |
chr1:28187647-28189612 REVERSE LENGTH=329
Length = 329
Score = 263 bits (673), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 115/191 (60%), Positives = 144/191 (75%)
Query: 156 KRCAWVTPNTEPCYATFHDEEWGVPVHDDKKLFELLVLSIALAELSWPVILSKRHSFREA 215
KRC W+TPN++P Y FHDEEWGVPV DDKKLFELLV S ALAE SWP IL +R FR+
Sbjct: 119 KRCHWITPNSDPIYVLFHDEEWGVPVRDDKKLFELLVFSQALAEFSWPSILRRRDDFRKL 178
Query: 216 FADFDPVAVSKLNEKKMMAPGTVASSLLSELKLRAVIENARQISKVIDEFGSFDKYIWSF 275
F +FDP A+++ EK++M+ +LSE KLRA++ENA+ + KV EFGSF Y W F
Sbjct: 179 FEEFDPSAIAQFTEKRLMSLRVNGCLILSEQKLRAIVENAKSVLKVKQEFGSFSNYCWRF 238
Query: 276 VNHKPIVNRFRYPRQVPVKTPKADVISKDLVRRGFRGVGPTVIYSFMQVAGFTNDHLMSC 335
VNHKP+ N +RY RQVPVK+PKA+ ISKD+++RGFR VGPTV+YSF+Q +G NDHL +C
Sbjct: 239 VNHKPLRNGYRYGRQVPVKSPKAEYISKDMMQRGFRCVGPTVMYSFLQASGIVNDHLTAC 298
Query: 336 FRFPECIAAAE 346
FR+ EC E
Sbjct: 299 FRYQECNVETE 309
>AT1G13635.2 | Symbols: | DNA glycosylase superfamily protein |
chr1:4674248-4675784 FORWARD LENGTH=311
Length = 311
Score = 231 bits (589), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 107/252 (42%), Positives = 166/252 (65%), Gaps = 17/252 (6%)
Query: 102 SSDASTDSFHSRASTGRLTRSYSLGT-----RRKPYVSK--PRSVASDGVLESPPDASQS 154
+ STDS ST S +LG RR+ +V K P+ + D ++S
Sbjct: 64 TDSVSTDS----NSTLEQKISLALGLISSPHRREIFVPKSIPQQLCQDF------NSSDE 113
Query: 155 KKRCAWVTPNTEPCYATFHDEEWGVPVHDDKKLFELLVLSIALAELSWPVILSKRHSFRE 214
KRC W+T ++ Y FHD++WGVPV+DD LFE L +S L + +W IL ++ FRE
Sbjct: 114 PKRCNWITKKSDEVYVMFHDQQWGVPVYDDNLLFEFLAMSGMLMDYNWTEILKRKEHFRE 173
Query: 215 AFADFDPVAVSKLNEKKMMAPGTVASSLLSELKLRAVIENARQISKVIDEFGSFDKYIWS 274
AF +FDP V+K+ EK++ + + +L E ++R +++NA+ I+KV++EFGSF ++W
Sbjct: 174 AFCEFDPNRVAKMGEKEIAEIASNKAIMLQESRVRCIVDNAKCITKVVNEFGSFSSFVWG 233
Query: 275 FVNHKPIVNRFRYPRQVPVKTPKADVISKDLVRRGFRGVGPTVIYSFMQVAGFTNDHLMS 334
F+++KPI+N+F+Y R VP+++PKA++ISKD+++RGFR VGP +++SFMQ AG T DHL+
Sbjct: 234 FMDYKPIINKFKYSRNVPLRSPKAEIISKDMIKRGFRFVGPVIVHSFMQAAGLTIDHLVD 293
Query: 335 CFRFPECIAAAE 346
CFR +C++ AE
Sbjct: 294 CFRHGDCVSLAE 305
>AT1G13635.1 | Symbols: | DNA glycosylase superfamily protein |
chr1:4674248-4675784 FORWARD LENGTH=311
Length = 311
Score = 231 bits (589), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 107/252 (42%), Positives = 166/252 (65%), Gaps = 17/252 (6%)
Query: 102 SSDASTDSFHSRASTGRLTRSYSLGT-----RRKPYVSK--PRSVASDGVLESPPDASQS 154
+ STDS ST S +LG RR+ +V K P+ + D ++S
Sbjct: 64 TDSVSTDS----NSTLEQKISLALGLISSPHRREIFVPKSIPQQLCQDF------NSSDE 113
Query: 155 KKRCAWVTPNTEPCYATFHDEEWGVPVHDDKKLFELLVLSIALAELSWPVILSKRHSFRE 214
KRC W+T ++ Y FHD++WGVPV+DD LFE L +S L + +W IL ++ FRE
Sbjct: 114 PKRCNWITKKSDEVYVMFHDQQWGVPVYDDNLLFEFLAMSGMLMDYNWTEILKRKEHFRE 173
Query: 215 AFADFDPVAVSKLNEKKMMAPGTVASSLLSELKLRAVIENARQISKVIDEFGSFDKYIWS 274
AF +FDP V+K+ EK++ + + +L E ++R +++NA+ I+KV++EFGSF ++W
Sbjct: 174 AFCEFDPNRVAKMGEKEIAEIASNKAIMLQESRVRCIVDNAKCITKVVNEFGSFSSFVWG 233
Query: 275 FVNHKPIVNRFRYPRQVPVKTPKADVISKDLVRRGFRGVGPTVIYSFMQVAGFTNDHLMS 334
F+++KPI+N+F+Y R VP+++PKA++ISKD+++RGFR VGP +++SFMQ AG T DHL+
Sbjct: 234 FMDYKPIINKFKYSRNVPLRSPKAEIISKDMIKRGFRFVGPVIVHSFMQAAGLTIDHLVD 293
Query: 335 CFRFPECIAAAE 346
CFR +C++ AE
Sbjct: 294 CFRHGDCVSLAE 305
>AT3G12710.1 | Symbols: | DNA glycosylase superfamily protein |
chr3:4040572-4041828 REVERSE LENGTH=312
Length = 312
Score = 215 bits (548), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 99/197 (50%), Positives = 134/197 (68%), Gaps = 2/197 (1%)
Query: 149 PDASQSKKRCAWVTPNTEPCYATFHDEEWGVPVHDDKKLFELLVLSIALAELSWPVILSK 208
P+ + +RC+++TP ++P Y +HDEEWGVPVHDDK LFELL LS A W L K
Sbjct: 115 PNPNPHPQRCSFLTPTSDPIYVAYHDEEWGVPVHDDKTLFELLTLSGAQVGSDWTSTLRK 174
Query: 209 RHSFREAFADFDPVAVSKLNEKKMMAPGTVASSLLSELKLRAVIENARQISKVIDEFGSF 268
RH +R+AF +F+ V+KL EK+M A +S K+R V+ENA++I ++ F S
Sbjct: 175 RHDYRKAFMEFEAEVVAKLTEKEMNAISIEYKIEMS--KVRGVVENAKKIVEIKKAFVSL 232
Query: 269 DKYIWSFVNHKPIVNRFRYPRQVPVKTPKADVISKDLVRRGFRGVGPTVIYSFMQVAGFT 328
+KY+W FVNHKPI ++ ++PVKT K++ ISKD+VRRGFR VGPTV++SFMQ AG T
Sbjct: 233 EKYLWGFVNHKPISTNYKLGHKIPVKTSKSESISKDMVRRGFRFVGPTVVHSFMQAAGLT 292
Query: 329 NDHLMSCFRFPECIAAA 345
NDHL++C R C A
Sbjct: 293 NDHLITCCRHAPCTLLA 309
>AT5G44680.1 | Symbols: | DNA glycosylase superfamily protein |
chr5:18024461-18025893 REVERSE LENGTH=353
Length = 353
Score = 215 bits (547), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 128/355 (36%), Positives = 198/355 (55%), Gaps = 28/355 (7%)
Query: 7 LRSMNVADSEARPVLGPAGNKTGSLSSRKSASKPLRKVDKLLDEAASAVKEKKPHQVLXX 66
L N++ RPVL P N+ +L R S K K L+ AS + +P ++
Sbjct: 9 LTQENISQINGRPVLQPKSNQVPTLDRRNSLKK---SPPKPLNPIASKIPSPRPISLISP 65
Query: 67 XXXXXPKSY--PAARVSSLLSRHEQLLHSNLSLNASCSSDASTDSFHSRASTGRLTRSY- 123
KS PA ++LL S+ + + S ++D + + +
Sbjct: 66 PLSPNTKSLRKPAGSC-------KELLRSSSTKSKPVISPENSDGGYKEVMPMVIVQKQP 118
Query: 124 -SLGTRRKPYVS-----KPRSVASDGVLESPPDASQS-------KKRCAWVTPNTEPCYA 170
S+ R+ V+ + + ++ G ++S ++ KKRC+++T +++P Y
Sbjct: 119 GSIAAARREEVAMKQEERKKKISHYGRIKSVKSNEKNLNVEHEKKKRCSFITTSSDPIYV 178
Query: 171 TFHDEEWGVPVHDDKKLFELLVLSIALAELSWPVILSKRHSFREAFADFDPVAVSKLNEK 230
+HD+EWGVPVHDD LFELLVL+ A W +L +R++FREAF+ F+ V+ NEK
Sbjct: 179 AYHDKEWGVPVHDDNLLFELLVLTGAQVGSDWTSVLKRRNTFREAFSGFEAELVADFNEK 238
Query: 231 KMMAPGTVASSLLSELKLRAVIENARQISKVIDEFGSFDKYIWSFVNHKPIVNRFRYPRQ 290
K+ + V ++ ++ AV++NA+QI KV + GSF+KYIW F+ HKP+ ++ ++
Sbjct: 239 KIQS--IVNDYGINLSQVLAVVDNAKQILKVKRDLGSFNKYIWGFMKHKPVTTKYTSCQK 296
Query: 291 VPVKTPKADVISKDLVRRGFRGVGPTVIYSFMQVAGFTNDHLMSCFRFPECIAAA 345
+PVKT K++ ISKD+VRRGFR VGPTVI+S MQ AG TNDHL++C R EC A A
Sbjct: 297 IPVKTSKSETISKDMVRRGFRFVGPTVIHSLMQAAGLTNDHLITCPRHLECTAMA 351