Miyakogusa Predicted Gene

Lj3g3v2098140.1
BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj3g3v2098140.1 Non Characterized Hit- tr|I1KGT2|I1KGT2_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.15645
PE,79.53,0,Adenine_glyco,Methyladenine glycosylase; seg,NULL; no
description,DNA glycosylase; SUBFAMILY NOT NAM,CUFF.43580.1
         (381 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G57970.2 | Symbols:  | DNA glycosylase superfamily protein | ...   344   5e-95
AT5G57970.1 | Symbols:  | DNA glycosylase superfamily protein | ...   344   5e-95
AT1G15970.1 | Symbols:  | DNA glycosylase superfamily protein | ...   307   7e-84
AT1G80850.1 | Symbols:  | DNA glycosylase superfamily protein | ...   306   1e-83
AT1G75090.1 | Symbols:  | DNA glycosylase superfamily protein | ...   261   5e-70
AT1G13635.2 | Symbols:  | DNA glycosylase superfamily protein | ...   237   1e-62
AT1G13635.1 | Symbols:  | DNA glycosylase superfamily protein | ...   237   1e-62
AT5G44680.1 | Symbols:  | DNA glycosylase superfamily protein | ...   215   5e-56
AT3G12710.1 | Symbols:  | DNA glycosylase superfamily protein | ...   209   4e-54
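
The summary table above can be read programmatically. The following is a minimal sketch, not part of the BLAST output itself: it assumes each hit line starts with the subject identifier, followed by a pipe-delimited description, and ends with the bit score and E-value. The names `HIT_TABLE`, `HIT_LINE`, and `parse_hits` are hypothetical and chosen only for illustration.

```python
import re

# Hit lines copied verbatim from the summary table above.
HIT_TABLE = """\
AT5G57970.2 | Symbols:  | DNA glycosylase superfamily protein | ...   344   5e-95
AT5G57970.1 | Symbols:  | DNA glycosylase superfamily protein | ...   344   5e-95
AT1G15970.1 | Symbols:  | DNA glycosylase superfamily protein | ...   307   7e-84
AT1G80850.1 | Symbols:  | DNA glycosylase superfamily protein | ...   306   1e-83
AT1G75090.1 | Symbols:  | DNA glycosylase superfamily protein | ...   261   5e-70
AT1G13635.2 | Symbols:  | DNA glycosylase superfamily protein | ...   237   1e-62
AT1G13635.1 | Symbols:  | DNA glycosylase superfamily protein | ...   237   1e-62
AT5G44680.1 | Symbols:  | DNA glycosylase superfamily protein | ...   215   5e-56
AT3G12710.1 | Symbols:  | DNA glycosylase superfamily protein | ...   209   4e-54
"""

# Assumed layout: identifier, " | description ...", integer bit score, E-value.
HIT_LINE = re.compile(r"^(\S+)\s+\|.*\s(\d+)\s+(\S+)\s*$")


def parse_hits(text):
    """Return a list of (subject_id, bit_score, e_value) tuples."""
    hits = []
    for line in text.splitlines():
        match = HIT_LINE.match(line)
        if match:
            subject_id, score, e_value = match.groups()
            hits.append((subject_id, int(score), float(e_value)))
    return hits


hits = parse_hits(HIT_TABLE)

# Collapse splice variants (e.g. AT5G57970.1 and AT5G57970.2) to gene loci,
# keeping the first occurrence, which is the best-scoring one in this table.
loci = []
for subject_id, _, _ in hits:
    locus = subject_id.split(".")[0]
    if locus not in loci:
        loci.append(locus)
```

Grouping by locus is a design choice for this sketch: the nine hits in the table correspond to seven distinct Arabidopsis loci, because two loci are each represented by two gene models.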

>AT5G57970.2 | Symbols:  | DNA glycosylase superfamily protein |
           chr5:23467316-23468910 FORWARD LENGTH=347
          Length = 347

 Score =  344 bits (883), Expect = 5e-95,   Method: Compositional matrix adjust.
 Identities = 180/344 (52%), Positives = 212/344 (61%), Gaps = 11/344 (3%)

Query: 1   MSGPPRVRSMNVAVGDHEARPVLVPACNKARPAAVDGRKPVKKSVLEREREKSRGAPPTP 60
           MSG PRV+SMNVA  + E R  L     KA P      K V KS+ + ER  S G   + 
Sbjct: 1   MSGAPRVQSMNVA--EAETRSTLGSTAKKASPFIT--HKAVSKSLRKLERSSS-GRTGSD 55

Query: 61  PQRVLVSPV--VSRRQDHHHLAVLKNL----XXXXXXXXXXXXXXXXXXXXXXXXXXKVA 114
            +    +P   VS     H L     L                                 
Sbjct: 56  EKTSYATPTETVSSSSQKHTLNAASILRRHEQNLNSNLSLNASFSSDASMDSFHSRASTG 115

Query: 115 RRVRKKQAGARTEKXXXXXXXXXXXXXXXXXXXGLESKKRCAWVTANTEPCYIAFHDEEW 174
           R +R    G+R++                    G E+KKRC WVT N++PCYI FHDEEW
Sbjct: 116 RLIRSYSVGSRSKSYPSKPRSVVSEGALDSPPNGSETKKRCTWVTPNSDPCYIVFHDEEW 175

Query: 175 GVPVHDDKKLFELLSFSGALAELTWPTILSKRQLFREVFLDFDPNVVSKMNEKKIVAPGN 234
           GVPVHDDK+LFELL  SGALAE TWPTILSKRQ FREVF DFDPN + K+NEKKI+ PG+
Sbjct: 176 GVPVHDDKRLFELLVLSGALAEHTWPTILSKRQAFREVFADFDPNAIVKINEKKIIGPGS 235

Query: 235 SACTLLSELRLRSIIENARQMCKVIEEFGSFDTFIWNFVNNKPIVSQFRYARQVPVKSPK 294
            A TLLS+L+LR++IENARQ+ KVIEE+GSFD +IW+FV NK IVS+FRY RQVP K+PK
Sbjct: 236 PASTLLSDLKLRAVIENARQILKVIEEYGSFDKYIWSFVKNKAIVSKFRYQRQVPAKTPK 295

Query: 295 AEFISKDLVRRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFKEC 338
           AE ISKDLVRRGFRSVGPTV+Y+FMQ AG+TNDHL SCFRF  C
Sbjct: 296 AEVISKDLVRRGFRSVGPTVVYSFMQAAGITNDHLTSCFRFHHC 339


>AT5G57970.1 | Symbols:  | DNA glycosylase superfamily protein |
           chr5:23467316-23468910 FORWARD LENGTH=347
          Length = 347

 Score =  344 bits (883), Expect = 5e-95,   Method: Compositional matrix adjust.
 Identities = 180/344 (52%), Positives = 212/344 (61%), Gaps = 11/344 (3%)

Query: 1   MSGPPRVRSMNVAVGDHEARPVLVPACNKARPAAVDGRKPVKKSVLEREREKSRGAPPTP 60
           MSG PRV+SMNVA  + E R  L     KA P      K V KS+ + ER  S G   + 
Sbjct: 1   MSGAPRVQSMNVA--EAETRSTLGSTAKKASPFIT--HKAVSKSLRKLERSSS-GRTGSD 55

Query: 61  PQRVLVSPV--VSRRQDHHHLAVLKNL----XXXXXXXXXXXXXXXXXXXXXXXXXXKVA 114
            +    +P   VS     H L     L                                 
Sbjct: 56  EKTSYATPTETVSSSSQKHTLNAASILRRHEQNLNSNLSLNASFSSDASMDSFHSRASTG 115

Query: 115 RRVRKKQAGARTEKXXXXXXXXXXXXXXXXXXXGLESKKRCAWVTANTEPCYIAFHDEEW 174
           R +R    G+R++                    G E+KKRC WVT N++PCYI FHDEEW
Sbjct: 116 RLIRSYSVGSRSKSYPSKPRSVVSEGALDSPPNGSETKKRCTWVTPNSDPCYIVFHDEEW 175

Query: 175 GVPVHDDKKLFELLSFSGALAELTWPTILSKRQLFREVFLDFDPNVVSKMNEKKIVAPGN 234
           GVPVHDDK+LFELL  SGALAE TWPTILSKRQ FREVF DFDPN + K+NEKKI+ PG+
Sbjct: 176 GVPVHDDKRLFELLVLSGALAEHTWPTILSKRQAFREVFADFDPNAIVKINEKKIIGPGS 235

Query: 235 SACTLLSELRLRSIIENARQMCKVIEEFGSFDTFIWNFVNNKPIVSQFRYARQVPVKSPK 294
            A TLLS+L+LR++IENARQ+ KVIEE+GSFD +IW+FV NK IVS+FRY RQVP K+PK
Sbjct: 236 PASTLLSDLKLRAVIENARQILKVIEEYGSFDKYIWSFVKNKAIVSKFRYQRQVPAKTPK 295

Query: 295 AEFISKDLVRRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFKEC 338
           AE ISKDLVRRGFRSVGPTV+Y+FMQ AG+TNDHL SCFRF  C
Sbjct: 296 AEVISKDLVRRGFRSVGPTVVYSFMQAAGITNDHLTSCFRFHHC 339


>AT1G15970.1 | Symbols:  | DNA glycosylase superfamily protein |
           chr1:5486544-5488494 REVERSE LENGTH=352
          Length = 352

 Score =  307 bits (787), Expect = 7e-84,   Method: Compositional matrix adjust.
 Identities = 164/348 (47%), Positives = 217/348 (62%), Gaps = 18/348 (5%)

Query: 1   MSGPPRVRSMNVAVGDHEARPVLVPACNKA--RPAAVDGRKPV-KKSVLEREREKSRGAP 57
           MS PPR RS+N    + E R VL P  NK   +P  +   KP+ +K++++ + EK++  P
Sbjct: 1   MSVPPRFRSVNS--DEREFRSVLGPTGNKLQRKPPGMKLEKPMMEKTIIDSKDEKAK-KP 57

Query: 58  PTPPQRVLVSPVVSRRQDHHHLAVL--KNLXXXXXXXXXXXXXXXXXXXXXXXXXXKVAR 115
            TP      SP  + +Q     + +  KN                              +
Sbjct: 58  TTP-----ASPRTTLKQCSSLCSSILRKNSASMTASYSSDASSSCESSPLSVASSSSCKK 112

Query: 116 RVRKKQAGARTEKXXXXXXXXXXXXXXXXXXXGLESKKRCAWVTANTEPCYIAFHDEEWG 175
            VR+  + + T K                     + +KRCAW+T   +PCY+AFHDEEWG
Sbjct: 113 VVRRSGSVSSTRKLSVGKEEEKVSGDCFA-----DGRKRCAWITPKADPCYVAFHDEEWG 167

Query: 176 VPVHDDKKLFELLSFSGALAELTWPTILSKRQLFREVFLDFDPNVVSKMNEKKIVAPGNS 235
           VPVHDDKKLFELL  SGALAEL+W  ILS+R + REVF+DFDP  V+++N+KK+ APG +
Sbjct: 168 VPVHDDKKLFELLCLSGALAELSWTDILSRRHILREVFMDFDPVAVAELNDKKLTAPGTA 227

Query: 236 ACTLLSELRLRSIIENARQMCKVIEEFGSFDTFIWNFVNNKPIVSQFRYARQVPVKSPKA 295
           A +LLSE+++RSI++N+R + K+I E GS   ++WNFVNNKP  SQFRY RQVPVK+ KA
Sbjct: 228 AISLLSEVKIRSILDNSRHVRKIIAECGSLKKYMWNFVNNKPTQSQFRYQRQVPVKTSKA 287

Query: 296 EFISKDLVRRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFKECTSNAE 343
           EFISKDLVRRGFRSV PTVIY+FMQ AGLTNDHLI CFR+++C  +AE
Sbjct: 288 EFISKDLVRRGFRSVSPTVIYSFMQAAGLTNDHLIGCFRYQDCCVDAE 335


>AT1G80850.1 | Symbols:  | DNA glycosylase superfamily protein |
           chr1:30385607-30387272 REVERSE LENGTH=327
          Length = 327

 Score =  306 bits (785), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 139/197 (70%), Positives = 161/197 (81%)

Query: 150 ESKKRCAWVTANTEPCYIAFHDEEWGVPVHDDKKLFELLSFSGALAELTWPTILSKRQLF 209
           + +KRCAW+T  ++ CYIAFHDEEWGVPVHDDK+LFELLS SGALAEL+W  ILSKRQLF
Sbjct: 131 DGRKRCAWITPKSDQCYIAFHDEEWGVPVHDDKRLFELLSLSGALAELSWKDILSKRQLF 190

Query: 210 REVFLDFDPNVVSKMNEKKIVAPGNSACTLLSELRLRSIIENARQMCKVIEEFGSFDTFI 269
           REVF+DFDP  +S++  KKI +P  +A TLLSE +LRSI+ENA Q+CK+I  FGSFD +I
Sbjct: 191 REVFMDFDPIAISELTNKKITSPEIAATTLLSEQKLRSILENANQVCKIIGAFGSFDKYI 250

Query: 270 WNFVNNKPIVSQFRYARQVPVKSPKAEFISKDLVRRGFRSVGPTVIYTFMQVAGLTNDHL 329
           WNFVN KP  SQFRY RQVPVK+ KAE ISKDLVRRGFRSV PTVIY+FMQ AGLTNDHL
Sbjct: 251 WNFVNQKPTQSQFRYPRQVPVKTSKAELISKDLVRRGFRSVSPTVIYSFMQTAGLTNDHL 310

Query: 330 ISCFRFKECTSNAETVN 346
             CFR  +C +  ET N
Sbjct: 311 TCCFRHHDCMTKDETGN 327


>AT1G75090.1 | Symbols:  | DNA glycosylase superfamily protein |
           chr1:28187647-28189612 REVERSE LENGTH=329
          Length = 329

 Score =  261 bits (667), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 111/203 (54%), Positives = 156/203 (76%)

Query: 153 KRCAWVTANTEPCYIAFHDEEWGVPVHDDKKLFELLSFSGALAELTWPTILSKRQLFREV 212
           KRC W+T N++P Y+ FHDEEWGVPV DDKKLFELL FS ALAE +WP+IL +R  FR++
Sbjct: 119 KRCHWITPNSDPIYVLFHDEEWGVPVRDDKKLFELLVFSQALAEFSWPSILRRRDDFRKL 178

Query: 213 FLDFDPNVVSKMNEKKIVAPGNSACTLLSELRLRSIIENARQMCKVIEEFGSFDTFIWNF 272
           F +FDP+ +++  EK++++   + C +LSE +LR+I+ENA+ + KV +EFGSF  + W F
Sbjct: 179 FEEFDPSAIAQFTEKRLMSLRVNGCLILSEQKLRAIVENAKSVLKVKQEFGSFSNYCWRF 238

Query: 273 VNNKPIVSQFRYARQVPVKSPKAEFISKDLVRRGFRSVGPTVIYTFMQVAGLTNDHLISC 332
           VN+KP+ + +RY RQVPVKSPKAE+ISKD+++RGFR VGPTV+Y+F+Q +G+ NDHL +C
Sbjct: 239 VNHKPLRNGYRYGRQVPVKSPKAEYISKDMMQRGFRCVGPTVMYSFLQASGIVNDHLTAC 298

Query: 333 FRFKECTSNAETVNKESSLNSKV 355
           FR++EC    E   K     +K+
Sbjct: 299 FRYQECNVETERETKSHETETKL 321


>AT1G13635.2 | Symbols:  | DNA glycosylase superfamily protein |
           chr1:4674248-4675784 FORWARD LENGTH=311
          Length = 311

 Score =  237 bits (604), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 98/191 (51%), Positives = 142/191 (74%)

Query: 153 KRCAWVTANTEPCYIAFHDEEWGVPVHDDKKLFELLSFSGALAELTWPTILSKRQLFREV 212
           KRC W+T  ++  Y+ FHD++WGVPV+DD  LFE L+ SG L +  W  IL +++ FRE 
Sbjct: 115 KRCNWITKKSDEVYVMFHDQQWGVPVYDDNLLFEFLAMSGMLMDYNWTEILKRKEHFREA 174

Query: 213 FLDFDPNVVSKMNEKKIVAPGNSACTLLSELRLRSIIENARQMCKVIEEFGSFDTFIWNF 272
           F +FDPN V+KM EK+I    ++   +L E R+R I++NA+ + KV+ EFGSF +F+W F
Sbjct: 175 FCEFDPNRVAKMGEKEIAEIASNKAIMLQESRVRCIVDNAKCITKVVNEFGSFSSFVWGF 234

Query: 273 VNNKPIVSQFRYARQVPVKSPKAEFISKDLVRRGFRSVGPTVIYTFMQVAGLTNDHLISC 332
           ++ KPI+++F+Y+R VP++SPKAE ISKD+++RGFR VGP ++++FMQ AGLT DHL+ C
Sbjct: 235 MDYKPIINKFKYSRNVPLRSPKAEIISKDMIKRGFRFVGPVIVHSFMQAAGLTIDHLVDC 294

Query: 333 FRFKECTSNAE 343
           FR  +C S AE
Sbjct: 295 FRHGDCVSLAE 305


>AT1G13635.1 | Symbols:  | DNA glycosylase superfamily protein |
           chr1:4674248-4675784 FORWARD LENGTH=311
          Length = 311

 Score =  237 bits (604), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 98/191 (51%), Positives = 142/191 (74%)

Query: 153 KRCAWVTANTEPCYIAFHDEEWGVPVHDDKKLFELLSFSGALAELTWPTILSKRQLFREV 212
           KRC W+T  ++  Y+ FHD++WGVPV+DD  LFE L+ SG L +  W  IL +++ FRE 
Sbjct: 115 KRCNWITKKSDEVYVMFHDQQWGVPVYDDNLLFEFLAMSGMLMDYNWTEILKRKEHFREA 174

Query: 213 FLDFDPNVVSKMNEKKIVAPGNSACTLLSELRLRSIIENARQMCKVIEEFGSFDTFIWNF 272
           F +FDPN V+KM EK+I    ++   +L E R+R I++NA+ + KV+ EFGSF +F+W F
Sbjct: 175 FCEFDPNRVAKMGEKEIAEIASNKAIMLQESRVRCIVDNAKCITKVVNEFGSFSSFVWGF 234

Query: 273 VNNKPIVSQFRYARQVPVKSPKAEFISKDLVRRGFRSVGPTVIYTFMQVAGLTNDHLISC 332
           ++ KPI+++F+Y+R VP++SPKAE ISKD+++RGFR VGP ++++FMQ AGLT DHL+ C
Sbjct: 235 MDYKPIINKFKYSRNVPLRSPKAEIISKDMIKRGFRFVGPVIVHSFMQAAGLTIDHLVDC 294

Query: 333 FRFKECTSNAE 343
           FR  +C S AE
Sbjct: 295 FRHGDCVSLAE 305


>AT5G44680.1 | Symbols:  | DNA glycosylase superfamily protein |
           chr5:18024461-18025893 REVERSE LENGTH=353
          Length = 353

 Score =  215 bits (547), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 95/193 (49%), Positives = 139/193 (72%), Gaps = 2/193 (1%)

Query: 150 ESKKRCAWVTANTEPCYIAFHDEEWGVPVHDDKKLFELLSFSGALAELTWPTILSKRQLF 209
           E KKRC+++T +++P Y+A+HD+EWGVPVHDD  LFELL  +GA     W ++L +R  F
Sbjct: 161 EKKKRCSFITTSSDPIYVAYHDKEWGVPVHDDNLLFELLVLTGAQVGSDWTSVLKRRNTF 220

Query: 210 REVFLDFDPNVVSKMNEKKIVAPGNSACTLLSELRLRSIIENARQMCKVIEEFGSFDTFI 269
           RE F  F+  +V+  NEKKI +  N     LS++   ++++NA+Q+ KV  + GSF+ +I
Sbjct: 221 REAFSGFEAELVADFNEKKIQSIVNDYGINLSQVL--AVVDNAKQILKVKRDLGSFNKYI 278

Query: 270 WNFVNNKPIVSQFRYARQVPVKSPKAEFISKDLVRRGFRSVGPTVIYTFMQVAGLTNDHL 329
           W F+ +KP+ +++   +++PVK+ K+E ISKD+VRRGFR VGPTVI++ MQ AGLTNDHL
Sbjct: 279 WGFMKHKPVTTKYTSCQKIPVKTSKSETISKDMVRRGFRFVGPTVIHSLMQAAGLTNDHL 338

Query: 330 ISCFRFKECTSNA 342
           I+C R  ECT+ A
Sbjct: 339 ITCPRHLECTAMA 351


>AT3G12710.1 | Symbols:  | DNA glycosylase superfamily protein |
           chr3:4040572-4041828 REVERSE LENGTH=312
          Length = 312

 Score =  209 bits (531), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 90/187 (48%), Positives = 136/187 (72%), Gaps = 2/187 (1%)

Query: 153 KRCAWVTANTEPCYIAFHDEEWGVPVHDDKKLFELLSFSGALAELTWPTILSKRQLFREV 212
           +RC+++T  ++P Y+A+HDEEWGVPVHDDK LFELL+ SGA     W + L KR  +R+ 
Sbjct: 122 QRCSFLTPTSDPIYVAYHDEEWGVPVHDDKTLFELLTLSGAQVGSDWTSTLRKRHDYRKA 181

Query: 213 FLDFDPNVVSKMNEKKIVAPGNSACTLLSELRLRSIIENARQMCKVIEEFGSFDTFIWNF 272
           F++F+  VV+K+ EK++ A        +S  ++R ++ENA+++ ++ + F S + ++W F
Sbjct: 182 FMEFEAEVVAKLTEKEMNAISIEYKIEMS--KVRGVVENAKKIVEIKKAFVSLEKYLWGF 239

Query: 273 VNNKPIVSQFRYARQVPVKSPKAEFISKDLVRRGFRSVGPTVIYTFMQVAGLTNDHLISC 332
           VN+KPI + ++   ++PVK+ K+E ISKD+VRRGFR VGPTV+++FMQ AGLTNDHLI+C
Sbjct: 240 VNHKPISTNYKLGHKIPVKTSKSESISKDMVRRGFRFVGPTVVHSFMQAAGLTNDHLITC 299

Query: 333 FRFKECT 339
            R   CT
Sbjct: 300 CRHAPCT 306