Miyakogusa Predicted Gene

Lj3g3v2098140.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj3g3v2098140.1 Non Characterized Hit- tr|I1KGT2|I1KGT2_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.15645
PE,79.53,0,Adenine_glyco,Methyladenine glycosylase; seg,NULL; no
description,DNA glycosylase; SUBFAMILY NOT NAM,CUFF.43580.1
         (381 letters)

Database: Medicago_aa4.0v1 
           62,319 sequences; 21,947,249 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

Medtr4g007070.1 | DNA-3-methyladenine glycosylase I | HC | chr4:...   470   e-133
Medtr4g130880.1 | DNA-3-methyladenine glycosylase I | HC | chr4:...   347   9e-96
Medtr3g111690.1 | DNA-3-methyladenine glycosylase I | HC | chr3:...   264   1e-70
Medtr0009s0230.1 | DNA-3-methyladenine glycosylase I | HC | scaf...   243   2e-64
Medtr0009s0230.2 | DNA-3-methyladenine glycosylase I | HC | scaf...   239   4e-63
Medtr2g063510.1 | DNA-3-methyladenine glycosylase I | HC | chr2:...   224   1e-58
Medtr8g447000.1 | DNA-3-methyladenine glycosylase I | HC | chr8:...   222   5e-58
Medtr8g066040.1 | DNA-3-methyladenine glycosylase I | HC | chr8:...   209   3e-54
Medtr4g130880.2 | DNA-3-methyladenine glycosylase I | HC | chr4:...   193   2e-49
Medtr2g063510.2 | DNA-3-methyladenine glycosylase I | HC | chr2:...   109   5e-24
Medtr8g447000.2 | DNA-3-methyladenine glycosylase I | HC | chr8:...   106   4e-23
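Note on reading the table above programmatically: each row ends with a bit score followed by an E-value, so the rows can be pulled apart with a small parser. The following is a minimal Python sketch, not part of the BLAST output. The names `Hit` and `parse_hit_line` are hypothetical, and it assumes that an E-value printed without a mantissa (such as `e-133`) stands for `1e-133`.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    seq_id: str
    description: str
    bits: float
    evalue: float


# A hit row is: <id> <free-text description> <bit score> <E-value>.
# The E-value may be written without a mantissa, e.g. "e-133".
_ROW = re.compile(
    r"^(\S+)\s+(.*?)\s+(\d+(?:\.\d+)?)\s+(\d*\.?\d*e-\d+|\d+(?:\.\d+)?)\s*$"
)


def parse_hit_line(line: str) -> Optional[Hit]:
    """Return a Hit for one summary row, or None if the line is not a hit row."""
    m = _ROW.match(line)
    if m is None:
        return None
    seq_id, desc, bits, ev = m.groups()
    if ev.startswith("e"):
        # Assumption: a bare exponent means a mantissa of 1.
        ev = "1" + ev
    return Hit(seq_id, desc.strip(), float(bits), float(ev))


if __name__ == "__main__":
    row = ("Medtr4g130880.1 | DNA-3-methyladenine glycosylase I | HC | "
           "chr4:...   347   9e-96")
    print(parse_hit_line(row))
```

Header and blank lines return `None`, so the function can be applied to every line of the report and the results filtered.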

>Medtr4g007070.1 | DNA-3-methyladenine glycosylase I | HC |
           chr4:935811-930134 | 20130731
          Length = 365

 Score =  470 bits (1210), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 252/383 (65%), Positives = 279/383 (72%), Gaps = 21/383 (5%)

Query: 1   MSGPPRVRSMNVAVG-DHEARPVLVPACNKARPAAVDGRKPVKKSVLEREREKSRGAPPT 59
           MSGPPRVRSMNV VG D +++         ARP      K VKK V   E EK +    T
Sbjct: 1   MSGPPRVRSMNVTVGADSDSK--------AARPV-----KNVKKPVPAPETEKKK----T 43

Query: 60  PPQRVLVSPVVSRRQDHHHLAVLKNLXXXXXXXXXXXXXXXXXXXXXXXXXXKVARRVRK 119
            PQ V+V+P V +R+DH  +  +KN+                          KVARRV K
Sbjct: 44  SPQCVVVTPAVLKRRDHCGVVGMKNMSMNASCSSDASSTDSSACSSGASSSGKVARRVGK 103

Query: 120 KQAGARTEKXXXXXXXXXXXXXXXXXXXGLESKKRCAWVTANTEPCYIAFHDEEWGVPVH 179
           KQ GA+ EK                   GLE KKRCAWVT NTEPCYIAFHDEEWGVP+H
Sbjct: 104 KQVGAKVEKVSIDAVVAVPAPVEVESIDGLEGKKRCAWVTPNTEPCYIAFHDEEWGVPIH 163

Query: 180 DDKKLFELLSFSGALAELTWPTILSKRQLFREVFLDFDPNVVSKMNEKKIVAPGNSACTL 239
           DDKKLFELLSFSGALAEL+WPTIL KRQLFR+VFLDFDP  VS+MNEKKIVAPG+ A +L
Sbjct: 164 DDKKLFELLSFSGALAELSWPTILGKRQLFRKVFLDFDPCAVSRMNEKKIVAPGSPASSL 223

Query: 240 LSELRLRSIIENARQMCKVIEEFGSFDTFIWNFVNNKPIVSQFRYARQVPVKSPKAEFIS 299
           LSELRLRSIIENARQMCKVIEEFGSFD++IWNFVNNKPIVSQFRY RQVP KSPKAEFIS
Sbjct: 224 LSELRLRSIIENARQMCKVIEEFGSFDSYIWNFVNNKPIVSQFRYPRQVPAKSPKAEFIS 283

Query: 300 KDLVRRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFKECT-SNAETVNKE-SSLNSKVKE 357
           KDLV+RGFRSVGPTVIYTFMQVAGLTNDHLI CFRFKEC  SNAE   KE SSLNSKVKE
Sbjct: 284 KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFKECIFSNAEAEGKESSSLNSKVKE 343

Query: 358 KANEEEPNEMGLLLAVNKLNFTS 380
           K+N E+P  +GLLL+VNKL+F+S
Sbjct: 344 KSN-EDPTNVGLLLSVNKLSFSS 365


>Medtr4g130880.1 | DNA-3-methyladenine glycosylase I | HC |
           chr4:54548746-54544404 | 20130731
          Length = 375

 Score =  347 bits (891), Expect = 9e-96,   Method: Compositional matrix adjust.
 Identities = 191/388 (49%), Positives = 235/388 (60%), Gaps = 24/388 (6%)

Query: 2   SGPPRVRSMNVAVGDHEARPVLVPACNKARPAAV--DGRKPVKKSV-----LEREREKSR 54
           SG PR+RSMNVA  D EARPV  PA NK    +   D  KP++K+      ++  +EK  
Sbjct: 4   SGGPRLRSMNVA--DSEARPVFGPAGNKTGSYSSRKDASKPLRKAEKLGKEVDLAKEKKE 61

Query: 55  GAPPTPPQRVLVSPVVSRRQDHHHLAVLKNLXXXXXXXXXXXXXXXXXXXXXXXXXXKVA 114
            +P +      VS V+ R +   H  +  N                              
Sbjct: 62  ASPQS--HSASVSSVLRRHEQLLHSNLSMN-----------ASCSSDASTDSFHSRASTG 108

Query: 115 RRVRKKQAG-ARTEKXXXXXXXXXXXXXXXXXXXGLESKKRCAWVTANTEPCYIAFHDEE 173
           R  R    G  R                      G + KKRCAW+T NTEP Y  FHDEE
Sbjct: 109 RLTRSNSYGLTRKRSVSKPRSVVSDGVLESPPPDGAQPKKRCAWITPNTEPYYATFHDEE 168

Query: 174 WGVPVHDDKKLFELLSFSGALAELTWPTILSKRQLFREVFLDFDPNVVSKMNEKKIVAPG 233
           WGVPVHDDKKLFE+L  S AL+ELTWP ILSKR +FREVF DFDP  VSK+NEKK++ PG
Sbjct: 169 WGVPVHDDKKLFEVLVLSSALSELTWPAILSKRHIFREVFADFDPVAVSKLNEKKVITPG 228

Query: 234 NSACTLLSELRLRSIIENARQMCKVIEEFGSFDTFIWNFVNNKPIVSQFRYARQVPVKSP 293
            +A +LLS+ +LR IIENARQ+ KVI EFGSFD +IW+FVN+KPI+S+FRY RQVPVK+P
Sbjct: 229 TTASSLLSDQKLRGIIENARQISKVIVEFGSFDNYIWSFVNHKPILSKFRYPRQVPVKTP 288

Query: 294 KAEFISKDLVRRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFKECTSNAETVNKESSLNS 353
           KAE ISKDLVRRGFR VGPTVIY+FMQV GLTNDHLISCFRF+EC + AE   + S  N 
Sbjct: 289 KAEVISKDLVRRGFRGVGPTVIYSFMQVVGLTNDHLISCFRFQECVAAAEGKEENSIKNE 348

Query: 354 KVKEKANEEEPNEMGLLLAVNKLNFTSK 381
             +  A +    E  L +A++ L+ +S+
Sbjct: 349 DAQPNACDSV-MESDLSIAIDNLSLSSE 375


>Medtr3g111690.1 | DNA-3-methyladenine glycosylase I | HC |
           chr3:52235165-52232483 | 20130731
          Length = 331

 Score =  264 bits (674), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 120/206 (58%), Positives = 158/206 (76%), Gaps = 6/206 (2%)

Query: 153 KRCAWVTANTEPCYIAFHDEEWGVPV-HDDKKLFELLSFSGALAELTWPTILSKRQLFRE 211
           KRC W+T N +P Y AFHD+EWGVPV  DD+KLFELL FS ALAE TWPTIL+ R +FR+
Sbjct: 128 KRCDWITPNADPLYTAFHDDEWGVPVLDDDRKLFELLVFSQALAEHTWPTILNHRDIFRK 187

Query: 212 VFLDFDPNVVSKMNEKKIVAPGNSACTLLSELRLRSIIENARQMCKVIEEFGSFDTFIWN 271
           +F +FDP+ +++  EK +V P  +   LLSE +LR+++ENA+Q  K+  EFGSF  + W 
Sbjct: 188 LFENFDPSSIAQFTEKNLVTPKLNGNPLLSEQKLRAVVENAKQFLKIQLEFGSFSNYCWK 247

Query: 272 FVNNKPIVSQFRYARQVPVKSPKAEFISKDLVRRGFRSVGPTVIYTFMQVAGLTNDHLIS 331
           FVNNKPI ++FRY RQVPVK+PKAE ISKD++RRGF+ VGP V+Y+FMQVAGL NDHLI+
Sbjct: 248 FVNNKPIKNEFRYGRQVPVKNPKAELISKDMMRRGFQCVGPKVVYSFMQVAGLVNDHLIT 307

Query: 332 CFRFKECTSNAETVNKESSLNSKVKE 357
           CFR++EC      V  ++ + ++VKE
Sbjct: 308 CFRYQEC-----NVAIKTEIKTEVKE 328


>Medtr0009s0230.1 | DNA-3-methyladenine glycosylase I | HC |
           scaffold0009:104042-101934 | 20130731
          Length = 308

 Score =  243 bits (621), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 100/191 (52%), Positives = 148/191 (77%)

Query: 153 KRCAWVTANTEPCYIAFHDEEWGVPVHDDKKLFELLSFSGALAELTWPTILSKRQLFREV 212
           KRC+W+T N +  YI FHDE WGVP +DDKKLFELL+ SG L +  W  IL ++++ R+V
Sbjct: 112 KRCSWITKNCDKAYIEFHDECWGVPAYDDKKLFELLALSGLLIDYNWTEILKRKEVLRQV 171

Query: 213 FLDFDPNVVSKMNEKKIVAPGNSACTLLSELRLRSIIENARQMCKVIEEFGSFDTFIWNF 272
           F  FDP  VSKM EK+++   ++   +L+E R++ I++NA+ M K+  EFGSF ++IW++
Sbjct: 172 FAGFDPYTVSKMEEKEVIDIASATELVLAECRVKCIVDNAKCMMKIRREFGSFSSYIWSY 231

Query: 273 VNNKPIVSQFRYARQVPVKSPKAEFISKDLVRRGFRSVGPTVIYTFMQVAGLTNDHLISC 332
           VN+KP+++++RY+R VP+++PKA+ ISKDL++RGFR +GP ++Y+FMQVAGLT DHL+ C
Sbjct: 232 VNHKPVINKYRYSRDVPLRTPKADAISKDLLKRGFRYLGPVIVYSFMQVAGLTIDHLVGC 291

Query: 333 FRFKECTSNAE 343
           +R KEC + AE
Sbjct: 292 YRHKECVNLAE 302


>Medtr0009s0230.2 | DNA-3-methyladenine glycosylase I | HC |
           scaffold0009:104053-101904 | 20130731
          Length = 307

 Score =  239 bits (609), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 100/191 (52%), Positives = 148/191 (77%), Gaps = 1/191 (0%)

Query: 153 KRCAWVTANTEPCYIAFHDEEWGVPVHDDKKLFELLSFSGALAELTWPTILSKRQLFREV 212
           KRC+W+T N +  YI FHDE WGVP +DDKKLFELL+ SG L +  W  IL ++++ R+V
Sbjct: 112 KRCSWITKNYKA-YIEFHDECWGVPAYDDKKLFELLALSGLLIDYNWTEILKRKEVLRQV 170

Query: 213 FLDFDPNVVSKMNEKKIVAPGNSACTLLSELRLRSIIENARQMCKVIEEFGSFDTFIWNF 272
           F  FDP  VSKM EK+++   ++   +L+E R++ I++NA+ M K+  EFGSF ++IW++
Sbjct: 171 FAGFDPYTVSKMEEKEVIDIASATELVLAECRVKCIVDNAKCMMKIRREFGSFSSYIWSY 230

Query: 273 VNNKPIVSQFRYARQVPVKSPKAEFISKDLVRRGFRSVGPTVIYTFMQVAGLTNDHLISC 332
           VN+KP+++++RY+R VP+++PKA+ ISKDL++RGFR +GP ++Y+FMQVAGLT DHL+ C
Sbjct: 231 VNHKPVINKYRYSRDVPLRTPKADAISKDLLKRGFRYLGPVIVYSFMQVAGLTIDHLVGC 290

Query: 333 FRFKECTSNAE 343
           +R KEC + AE
Sbjct: 291 YRHKECVNLAE 301


>Medtr2g063510.1 | DNA-3-methyladenine glycosylase I | HC |
           chr2:26805019-26800818 | 20130731
          Length = 390

 Score =  224 bits (571), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 95/190 (50%), Positives = 141/190 (74%), Gaps = 2/190 (1%)

Query: 150 ESKKRCAWVTANTEPCYIAFHDEEWGVPVHDDKKLFELLSFSGALAELTWPTILSKRQLF 209
           + +KRC+++T N++P YIA+HDEEWGVPVHDDK LFELL  SGA     W + L KR  F
Sbjct: 198 QEEKRCSFITTNSDPIYIAYHDEEWGVPVHDDKMLFELLILSGAQVGSDWTSTLKKRLDF 257

Query: 210 REVFLDFDPNVVSKMNEKKIVAPGNSACTLLSELRLRSIIENARQMCKVIEEFGSFDTFI 269
           R  F +FD  +V+ + +K++++  +     +S  ++R +++NA Q+ +V + FGSFD +I
Sbjct: 258 RAAFSEFDAEIVANLTDKQMMSISSEYGIDIS--KVRGVVDNANQILQVRKGFGSFDKYI 315

Query: 270 WNFVNNKPIVSQFRYARQVPVKSPKAEFISKDLVRRGFRSVGPTVIYTFMQVAGLTNDHL 329
           W FVN+KPI +Q+++  ++PVK+ K+E ISKD+++RGFR VGPTV+++FMQ AGLTNDHL
Sbjct: 316 WGFVNHKPISNQYKFGHKIPVKTSKSESISKDMIKRGFRYVGPTVVHSFMQAAGLTNDHL 375

Query: 330 ISCFRFKECT 339
           I+C R  +CT
Sbjct: 376 ITCHRHLQCT 385


>Medtr8g447000.1 | DNA-3-methyladenine glycosylase I | HC |
           chr8:18418523-18415122 | 20130731
          Length = 383

 Score =  222 bits (565), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 96/190 (50%), Positives = 140/190 (73%), Gaps = 2/190 (1%)

Query: 150 ESKKRCAWVTANTEPCYIAFHDEEWGVPVHDDKKLFELLSFSGALAELTWPTILSKRQLF 209
           E +KRC+++T N++P YIA+HDEEWGVPVHDDK LFELL  SGA     W +IL KRQ F
Sbjct: 192 EEEKRCSFITPNSDPIYIAYHDEEWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDF 251

Query: 210 REVFLDFDPNVVSKMNEKKIVAPGNSACTLLSELRLRSIIENARQMCKVIEEFGSFDTFI 269
           R  F +F    ++ + +K++++        +S  R+R +++NA ++ +V ++FGSFD +I
Sbjct: 252 RTAFSEFHAATLANLTDKQMLSISLEYGIDIS--RVRGVVDNANRILEVNKDFGSFDKYI 309

Query: 270 WNFVNNKPIVSQFRYARQVPVKSPKAEFISKDLVRRGFRSVGPTVIYTFMQVAGLTNDHL 329
           W FVN+KPI +Q+++  ++PVK+ K+E ISKD++RRGFR VGPTV+++FMQ AGLTNDHL
Sbjct: 310 WGFVNHKPISTQYKFGHKIPVKTSKSESISKDMIRRGFRFVGPTVVHSFMQAAGLTNDHL 369

Query: 330 ISCFRFKECT 339
           I+C    +CT
Sbjct: 370 ITCHSHLKCT 379


>Medtr8g066040.1 | DNA-3-methyladenine glycosylase I | HC |
           chr8:27430410-27434638 | 20130731
          Length = 329

 Score =  209 bits (533), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 98/208 (47%), Positives = 138/208 (66%), Gaps = 17/208 (8%)

Query: 153 KRCAWVTANTEPCYIAFHDEEWGVPVHDDKKLFELLSFSGALAELTWPTILSKRQLFREV 212
           +RC W+T N++  Y+ FHDE WGVP +DD KLFE+L+ SG L +  W  I+ +R+  REV
Sbjct: 116 RRCNWITKNSDKLYVEFHDECWGVPAYDDNKLFEMLAMSGLLMDYNWTEIIKRREPLREV 175

Query: 213 FLDFDPNVVSKMNEKKIV-APGNSACTLL--------------SELRLRSIIENARQMCK 257
           F  FDP  V+KM E++I+    N A +L               + LRLRS    A  +  
Sbjct: 176 FAGFDPYTVAKMEEQEIIEITSNKALSLADSRVMCIVDNVSFGATLRLRSYGYGAGFLVN 235

Query: 258 --VIEEFGSFDTFIWNFVNNKPIVSQFRYARQVPVKSPKAEFISKDLVRRGFRSVGPTVI 315
             V+ E GSF ++IW+FVN+KPI+++++Y R VP++SPKAE +SKD+V+RGFR VGP ++
Sbjct: 236 TPVVRECGSFSSYIWSFVNHKPIINKYKYPRNVPLRSPKAEALSKDMVKRGFRFVGPVIV 295

Query: 316 YTFMQVAGLTNDHLISCFRFKECTSNAE 343
           ++FMQ AGLT DHL+ C+R  EC S AE
Sbjct: 296 HSFMQAAGLTIDHLVDCYRHSECVSLAE 323


>Medtr4g130880.2 | DNA-3-methyladenine glycosylase I | HC |
           chr4:54548746-54544404 | 20130731
          Length = 265

 Score =  193 bits (491), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 115/266 (43%), Positives = 142/266 (53%), Gaps = 23/266 (8%)

Query: 2   SGPPRVRSMNVAVGDHEARPVLVPACNKARPAAV--DGRKPVKKSV-----LEREREKSR 54
           SG PR+RSMNVA  D EARPV  PA NK    +   D  KP++K+      ++  +EK  
Sbjct: 4   SGGPRLRSMNVA--DSEARPVFGPAGNKTGSYSSRKDASKPLRKAEKLGKEVDLAKEKKE 61

Query: 55  GAPPTPPQRVLVSPVVSRRQDHHHLAVLKNLXXXXXXXXXXXXXXXXXXXXXXXXXXKVA 114
            +P +      VS V+ R +   H  +  N                              
Sbjct: 62  ASPQS--HSASVSSVLRRHEQLLHSNLSMN-----------ASCSSDASTDSFHSRASTG 108

Query: 115 RRVRKKQAG-ARTEKXXXXXXXXXXXXXXXXXXXGLESKKRCAWVTANTEPCYIAFHDEE 173
           R  R    G  R                      G + KKRCAW+T NTEP Y  FHDEE
Sbjct: 109 RLTRSNSYGLTRKRSVSKPRSVVSDGVLESPPPDGAQPKKRCAWITPNTEPYYATFHDEE 168

Query: 174 WGVPVHDDKKLFELLSFSGALAELTWPTILSKRQLFREVFLDFDPNVVSKMNEKKIVAPG 233
           WGVPVHDDKKLFE+L  S AL+ELTWP ILSKR +FREVF DFDP  VSK+NEKK++ PG
Sbjct: 169 WGVPVHDDKKLFEVLVLSSALSELTWPAILSKRHIFREVFADFDPVAVSKLNEKKVITPG 228

Query: 234 NSACTLLSELRLRSIIENARQMCKVI 259
            +A +LLS+ +LR IIENARQ+ KV+
Sbjct: 229 TTASSLLSDQKLRGIIENARQISKVL 254


>Medtr2g063510.2 | DNA-3-methyladenine glycosylase I | HC |
           chr2:26804984-26802744 | 20130731
          Length = 316

 Score =  109 bits (272), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 50/118 (42%), Positives = 78/118 (66%), Gaps = 4/118 (3%)

Query: 150 ESKKRCAWVTANTEPCYIAFHDEEWGVPVHDDKKLFELLSFSGALAELTWPTILSKRQLF 209
           + +KRC+++T N++P YIA+HDEEWGVPVHDDK LFELL  SGA     W + L KR  F
Sbjct: 198 QEEKRCSFITTNSDPIYIAYHDEEWGVPVHDDKMLFELLILSGAQVGSDWTSTLKKRLDF 257

Query: 210 REVFLDFDPNVVSKMNEKKIVAPGNSACTLLSELRLRSIIENARQMCKVIEE--FGSF 265
           R  F +FD  +V+ + +K++++  +     +S  ++R +++NA Q+ +V +   F SF
Sbjct: 258 RAAFSEFDAEIVANLTDKQMMSISSEYGIDIS--KVRGVVDNANQILQVNKNVPFSSF 313


>Medtr8g447000.2 | DNA-3-methyladenine glycosylase I | HC |
           chr8:18418617-18415158 | 20130731
          Length = 323

 Score =  106 bits (264), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 48/109 (44%), Positives = 73/109 (66%), Gaps = 2/109 (1%)

Query: 150 ESKKRCAWVTANTEPCYIAFHDEEWGVPVHDDKKLFELLSFSGALAELTWPTILSKRQLF 209
           E +KRC+++T N++P YIA+HDEEWGVPVHDDK LFELL  SGA     W +IL KRQ F
Sbjct: 192 EEEKRCSFITPNSDPIYIAYHDEEWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDF 251

Query: 210 REVFLDFDPNVVSKMNEKKIVAPGNSACTLLSELRLRSIIENARQMCKV 258
           R  F +F    ++ + +K++++        +S  R+R +++NA ++ +V
Sbjct: 252 RTAFSEFHAATLANLTDKQMLSISLEYGIDIS--RVRGVVDNANRILEV 298
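

Note on the per-alignment statistics: the percentages printed in parentheses on the `Identities` / `Positives` / `Gaps` lines appear to be truncated to whole numbers rather than rounded (for example, 252/383 is about 65.8% but is shown as 65%). Where the exact value matters, it can be recomputed from the raw counts. The following is a minimal Python sketch, not part of the BLAST output. The function names are hypothetical, and the `Gaps` field is treated as optional because ungapped alignments omit it.

```python
import re
from typing import Dict, Tuple

# Matches "Identities = 252/383", "Positives = 279/383", "Gaps = 21/383".
_FIELD = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)")


def parse_alignment_stats(line: str) -> Dict[str, Tuple[int, int]]:
    """Map each field present on the line to its (count, alignment length) pair."""
    return {name: (int(n), int(d)) for name, n, d in _FIELD.findall(line)}


def percent(count: int, length: int) -> float:
    """Exact percentage from the raw counts."""
    return 100.0 * count / length


if __name__ == "__main__":
    line = " Identities = 252/383 (65%), Positives = 279/383 (72%), Gaps = 21/383 (5%)"
    for name, (count, length) in parse_alignment_stats(line).items():
        print(f"{name}: {count}/{length} = {percent(count, length):.1f}%")
```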