Miyakogusa Predicted Gene
- Lj1g3v2155540.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v2155540.1 Non Chatacterized Hit- tr|I1MNV7|I1MNV7_SOYBN
Uncharacterized protein OS=Glycine max PE=4 SV=1,80.13,0,glycine rich
nucleic binding domain,G-patch domain; coiled-coil,NULL; GCFC,GC-rich
sequence DNA-bind,CUFF.28650.1
(872 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G17070.1 | Symbols: | GC-rich sequence DNA-binding factor-li... 775 0.0
AT2G42330.2 | Symbols: | GC-rich sequence DNA-binding factor-li... 689 0.0
AT2G42330.1 | Symbols: | GC-rich sequence DNA-binding factor-li... 689 0.0
AT5G26610.3 | Symbols: | D111/G-patch domain-containing protein... 51 4e-06
AT5G26610.2 | Symbols: | D111/G-patch domain-containing protein... 51 4e-06
AT5G26610.1 | Symbols: | D111/G-patch domain-containing protein... 51 4e-06
>AT1G17070.1 | Symbols: | GC-rich sequence DNA-binding factor-like
protein with Tuftelin interacting domain |
chr1:5837653-5840202 FORWARD LENGTH=849
Length = 849
Score = 775 bits (2002), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/673 (58%), Positives = 471/673 (69%), Gaps = 19/673 (2%)
Query: 196 VSGDVGKFEKHTKGIGMKLLERMGYKGGGLGKNEQGILKPIEARLRAKNSGLGFNNETPA 255
+ D+G+FEK TKGIGMKLLE+MGYKGGGLGKN+QGI+ PIEA+LR KN G+G+N+
Sbjct: 189 LGSDIGQFEKSTKGIGMKLLEKMGYKGGGLGKNQQGIVAPIEAQLRPKNMGMGYND-FKE 247
Query: 256 APLPALQ-VESQSVSEAAQPTVGRTXXXXXXXXXXXXXXXXXXXYVTAEQLLASKQEEDS 314
A LP L+ VE + + + ++ YVTAE+LL KQE
Sbjct: 248 AKLPDLKKVEEKKIIGVSVSENEQSHGDRGGKNLWKKKKVRKAVYVTAEELLEKKQEAGF 307
Query: 315 EVVHRILDMRGPQVRVYTNLSDLNAEEKAKERDVPMPELQHNVGLIVRLAEAEIQEIDRD 374
I+DMRGPQVRV TNL +L+AEEKAKE DVPMPELQHN+ LIV L E EIQ+IDRD
Sbjct: 308 GGGQTIIDMRGPQVRVVTNLENLDAEEKAKEADVPMPELQHNLRLIVDLVEHEIQKIDRD 367
Query: 375 LRRERETXXXXXXXXXXXXXXXXFQKKQLDGFEKIMDVLDQIGEENTAGTLTLDSLAQCF 434
LR ERE+ QK+ L+ E I D + +I ENT+G LTLDSLA F
Sbjct: 368 LRNERESALSLQQEKEMLINEEEKQKRHLENMEYIADEISRIELENTSGNLTLDSLAIRF 427
Query: 435 RELHQKYADNYKLCNLSCIACSYALPLFIRVFQGWDPLRNPSHGLELVSQWKTLLQGEDC 494
+L Y D+YKLC+LS IACS ALPLFIR+FQGWDPL + HGL+ +S W+ LL+ E+
Sbjct: 428 EDLQTSYPDDYKLCSLSTIACSLALPLFIRMFQGWDPLSDAVHGLKAISSWRKLLEVEED 487
Query: 495 LDIWDDSSPYAQLVSEVVLPAVRISGINTWQARDPEPMLRFLESWEKLLPSSVLATILDN 554
+IW S+PY+QLVSEVVLPAVRI+GINTW+ RDPEPMLRFLE+WE LLPSSVL TILD
Sbjct: 488 HNIWVVSTPYSQLVSEVVLPAVRIAGINTWEPRDPEPMLRFLETWETLLPSSVLQTILDT 547
Query: 555 IVMPKLSSAVDTWEPHRETIPIHTWVHPWLPLLGHKLEGIYQVIRFKLSTVLGAWHPSDG 614
+V+PKLS+AV+ W+P RE + IH WVHPWLP+LG KLE +YQ+I+ KLS VL AWHPSD
Sbjct: 548 VVLPKLSTAVEYWDPRRELVAIHVWVHPWLPILGQKLEFLYQIIQMKLSNVLDAWHPSDS 607
Query: 615 SAYAILSPWKTVFDSVSWEQLMLRFIVPKLQLVLQEFQVNPANQNLDHFYWVMNWASAIP 674
SAY ILSPWKTVFD+ SWEQLM R+IVPKLQL LQEFQVNPANQNL+ F WVM WASA+P
Sbjct: 608 SAYTILSPWKTVFDTTSWEQLMRRYIVPKLQLALQEFQVNPANQNLERFDWVMKWASAVP 667
Query: 675 IHLMADMMEKFFFSKWLQVLYHWLCSNPNFEEVTKWYLGWKELIPKELLANESIRFQLNR 734
IHLMADMME+FFF KWL VLYHWL + P FEE+ WY GWKEL P+EL ANE IR QL R
Sbjct: 668 IHLMADMMERFFFPKWLDVLYHWLRAKPRFEEIQGWYYGWKELFPQELTANERIRIQLKR 727
Query: 735 GLGMMNQAVEGMEVVQPGLKENISYLRVLEQRQFEXXXXXXXXXXXXXXSLGGAVNADGA 794
GL M+ +AVEG+EV QP K N E+ Q D
Sbjct: 728 GLDMLMEAVEGVEVSQPRAKAN-------ERTQ----------SVPAQAQAQAKAQMDST 770
Query: 795 HELSLKEVIEAHAQQHGLLFKLKPGRMHNGHQIYGFGNVSIIIDSLNQKVYAQNEETWSL 854
LSLKEV+E AQ+ LLFK KP RMHNG QIYGFGNVS+IIDS+NQK+ AQ + W L
Sbjct: 771 EVLSLKEVLEVFAQEQELLFKPKPNRMHNGLQIYGFGNVSVIIDSVNQKLLAQKDGGWFL 830
Query: 855 ESLQGLVALHHKS 867
+ L+ +H+ +
Sbjct: 831 VTPDDLLRMHNNT 843
Score = 102 bits (253), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 52/101 (51%), Positives = 61/101 (60%), Gaps = 7/101 (6%)
Query: 1 MDEDQEMDRFGMDNDFEGGEWIAGEYYYRKRKEKHAQTKDDVLYGVFAXXXXXXXXXXXX 60
MDE Q+M+RF MDND+EGG W E+ Y+KRKEK QTK+D YG+FA
Sbjct: 1 MDEYQDMERFSMDNDYEGGRWEGDEFVYQKRKEKRKQTKNDATYGIFAESDSDSDDSGGG 60
Query: 61 XXXXXX-------XXQDLTKPVNFVSTGTFMPNEEIDKNSR 94
DLTKPVNFVSTGT MPN+EIDK+SR
Sbjct: 61 GSRRKRRKDRDSGRKADLTKPVNFVSTGTVMPNQEIDKDSR 101
>AT2G42330.2 | Symbols: | GC-rich sequence DNA-binding factor-like
protein with Tuftelin interacting domain |
chr2:17631831-17634089 REVERSE LENGTH=752
Length = 752
Score = 689 bits (1779), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/669 (52%), Positives = 433/669 (64%), Gaps = 22/669 (3%)
Query: 203 FEKHTKGIGMKLLERMGYKGGGLGKNEQGILKPIEARLRAKNSGLGFNN--ETPAAPLPA 260
FEK + GIGMKLLE+MGYKG GLGKN+QGI+ PIE +LR KN G+G+N+ E A P
Sbjct: 99 FEKFSGGIGMKLLEKMGYKGRGLGKNQQGIVAPIEVQLRPKNMGMGYNDFKEKNAPLFPC 158
Query: 261 L-QVESQSVSEAAQPTVGRTXXXXXXXXXXXXXXXXXXXYVTAEQLLASKQEEDSEVVHR 319
L +VE + S TV Y+TAE+ L KQEE
Sbjct: 159 LNKVEEKKKSVVV--TVSENHGDGRRDLWKKKNVRKEV-YITAEEFLGKKQEEGFGCDQL 215
Query: 320 ILDMRGPQVRVYTNLSDLNAEEKAKERDVPMPELQHNVGLIVRLAEAEIQEIDRDLRRER 379
I+D RGPQ RV +L +L AEEKA + +V PELQHN+ IV+ E I + D+DLR E+
Sbjct: 216 IIDKRGPQDRVVNSLRNLYAEEKATDANVQQPELQHNLRFIVKSLEHGILKTDKDLRNEK 275
Query: 380 ETXXXXXXXXXXXXXXXXFQKKQLDGFEKIMDVLDQIGEENTAGTLTLDSLAQCFRELHQ 439
QK D + + +D+I E +G LTLDSLA F++L
Sbjct: 276 GLALSLQQEKEKFKMGVKKQKTLFDNLGYVAEEIDRIEVEIASGNLTLDSLANRFKDLRS 335
Query: 440 KYADNYKLCNLSCIACSYALPLFIRVFQGWDPLRNPSHGLELVSQWKTLLQGEDCLDIWD 499
Y D+YK CNLSCIA S ALPLFIR+FQGWDPL + HG+E +S WK LL+ ED I
Sbjct: 336 SYPDDYKCCNLSCIASSLALPLFIRMFQGWDPLSDAEHGIEAISSWKMLLEVEDNQSI-- 393
Query: 500 DSSPYAQLVSEVVLPAVRISGINTWQARDPEPMLRFLESWEKLLPSSVLATILDNIVMPK 559
S+PY+QLVSEV+LPAVR+SGINTW+ RDPEPMLR LE+WEK+LPS + TIL +V+PK
Sbjct: 394 -STPYSQLVSEVILPAVRVSGINTWEPRDPEPMLRLLETWEKMLPSLIFETILTTVVLPK 452
Query: 560 LSSAVDTWEPHRETIPIHTWVHPWLPLLGHKLEGIYQVIRFKLSTVLGAWHPSDGSAYAI 619
LS A+++WEP ET+PIH WVHPWLP+LG KLE YQ+IR K +L AWHPSD S + I
Sbjct: 453 LSIAIESWEPRLETVPIHFWVHPWLPVLGQKLESAYQIIRMKFGNLLDAWHPSDVSVHTI 512
Query: 620 LSPWKTVFDSVSWEQLMLRFIVPKLQLVLQEFQVNPANQNLDHFYWVMNWASAIPIHLMA 679
LSPWKTVFD+ SWEQLM R+IVPKLQ+ LQEFQ+NPA+QNLD F VM W S++PIHLM
Sbjct: 513 LSPWKTVFDAASWEQLMRRYIVPKLQVALQEFQINPADQNLDEFNLVMGWVSSVPIHLMT 572
Query: 680 DMMEKFFFSKWLQVLYHWLCSNPNFEEVTKWYLGWKELIPKELLANESIRFQLNRGLGMM 739
D+ME+FFF KWL VLYHWLCS P F+E+ KW+LGWK P+EL AN I Q RGL M
Sbjct: 573 DLMERFFFPKWLDVLYHWLCSEPKFDEIMKWFLGWKGTFPQELSANRRIEIQFKRGLDMA 632
Query: 740 NQAVEGMEVVQPGLKENISYLRVLEQRQFEXXXXXXXXXXXXXXSLGGAVNADGAHELSL 799
+AVE ME+ QPG +ENISY + EQRQ E D ELS
Sbjct: 633 REAVERMEMSQPGARENISYHKAQEQRQSEGRAKVQ-------------AQVDDPEELSF 679
Query: 800 KEVIEAHAQQHGLLFKLKPGRMHNGHQIYGFGNVSIIIDSLNQKVYAQNEETWSLESLQG 859
KE +E AQ+ LL K KP RMHNG QIY FGNVS+++DS N K+ AQ E W L
Sbjct: 680 KEAVELFAQEKELLLKPKPHRMHNGLQIYRFGNVSVLLDSANSKLLAQEEGRWFPVDLDS 739
Query: 860 LVALHHKSL 868
L+ +H+ ++
Sbjct: 740 LLKMHYSAV 748
>AT2G42330.1 | Symbols: | GC-rich sequence DNA-binding factor-like
protein with Tuftelin interacting domain |
chr2:17631831-17634089 REVERSE LENGTH=752
Length = 752
Score = 689 bits (1779), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/669 (52%), Positives = 433/669 (64%), Gaps = 22/669 (3%)
Query: 203 FEKHTKGIGMKLLERMGYKGGGLGKNEQGILKPIEARLRAKNSGLGFNN--ETPAAPLPA 260
FEK + GIGMKLLE+MGYKG GLGKN+QGI+ PIE +LR KN G+G+N+ E A P
Sbjct: 99 FEKFSGGIGMKLLEKMGYKGRGLGKNQQGIVAPIEVQLRPKNMGMGYNDFKEKNAPLFPC 158
Query: 261 L-QVESQSVSEAAQPTVGRTXXXXXXXXXXXXXXXXXXXYVTAEQLLASKQEEDSEVVHR 319
L +VE + S TV Y+TAE+ L KQEE
Sbjct: 159 LNKVEEKKKSVVV--TVSENHGDGRRDLWKKKNVRKEV-YITAEEFLGKKQEEGFGCDQL 215
Query: 320 ILDMRGPQVRVYTNLSDLNAEEKAKERDVPMPELQHNVGLIVRLAEAEIQEIDRDLRRER 379
I+D RGPQ RV +L +L AEEKA + +V PELQHN+ IV+ E I + D+DLR E+
Sbjct: 216 IIDKRGPQDRVVNSLRNLYAEEKATDANVQQPELQHNLRFIVKSLEHGILKTDKDLRNEK 275
Query: 380 ETXXXXXXXXXXXXXXXXFQKKQLDGFEKIMDVLDQIGEENTAGTLTLDSLAQCFRELHQ 439
QK D + + +D+I E +G LTLDSLA F++L
Sbjct: 276 GLALSLQQEKEKFKMGVKKQKTLFDNLGYVAEEIDRIEVEIASGNLTLDSLANRFKDLRS 335
Query: 440 KYADNYKLCNLSCIACSYALPLFIRVFQGWDPLRNPSHGLELVSQWKTLLQGEDCLDIWD 499
Y D+YK CNLSCIA S ALPLFIR+FQGWDPL + HG+E +S WK LL+ ED I
Sbjct: 336 SYPDDYKCCNLSCIASSLALPLFIRMFQGWDPLSDAEHGIEAISSWKMLLEVEDNQSI-- 393
Query: 500 DSSPYAQLVSEVVLPAVRISGINTWQARDPEPMLRFLESWEKLLPSSVLATILDNIVMPK 559
S+PY+QLVSEV+LPAVR+SGINTW+ RDPEPMLR LE+WEK+LPS + TIL +V+PK
Sbjct: 394 -STPYSQLVSEVILPAVRVSGINTWEPRDPEPMLRLLETWEKMLPSLIFETILTTVVLPK 452
Query: 560 LSSAVDTWEPHRETIPIHTWVHPWLPLLGHKLEGIYQVIRFKLSTVLGAWHPSDGSAYAI 619
LS A+++WEP ET+PIH WVHPWLP+LG KLE YQ+IR K +L AWHPSD S + I
Sbjct: 453 LSIAIESWEPRLETVPIHFWVHPWLPVLGQKLESAYQIIRMKFGNLLDAWHPSDVSVHTI 512
Query: 620 LSPWKTVFDSVSWEQLMLRFIVPKLQLVLQEFQVNPANQNLDHFYWVMNWASAIPIHLMA 679
LSPWKTVFD+ SWEQLM R+IVPKLQ+ LQEFQ+NPA+QNLD F VM W S++PIHLM
Sbjct: 513 LSPWKTVFDAASWEQLMRRYIVPKLQVALQEFQINPADQNLDEFNLVMGWVSSVPIHLMT 572
Query: 680 DMMEKFFFSKWLQVLYHWLCSNPNFEEVTKWYLGWKELIPKELLANESIRFQLNRGLGMM 739
D+ME+FFF KWL VLYHWLCS P F+E+ KW+LGWK P+EL AN I Q RGL M
Sbjct: 573 DLMERFFFPKWLDVLYHWLCSEPKFDEIMKWFLGWKGTFPQELSANRRIEIQFKRGLDMA 632
Query: 740 NQAVEGMEVVQPGLKENISYLRVLEQRQFEXXXXXXXXXXXXXXSLGGAVNADGAHELSL 799
+AVE ME+ QPG +ENISY + EQRQ E D ELS
Sbjct: 633 REAVERMEMSQPGARENISYHKAQEQRQSEGRAKVQ-------------AQVDDPEELSF 679
Query: 800 KEVIEAHAQQHGLLFKLKPGRMHNGHQIYGFGNVSIIIDSLNQKVYAQNEETWSLESLQG 859
KE +E AQ+ LL K KP RMHNG QIY FGNVS+++DS N K+ AQ E W L
Sbjct: 680 KEAVELFAQEKELLLKPKPHRMHNGLQIYRFGNVSVLLDSANSKLLAQEEGRWFPVDLDS 739
Query: 860 LVALHHKSL 868
L+ +H+ ++
Sbjct: 740 LLKMHYSAV 748
>AT5G26610.3 | Symbols: | D111/G-patch domain-containing protein |
chr5:9375456-9376991 FORWARD LENGTH=301
Length = 301
Score = 50.8 bits (120), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 33/46 (71%)
Query: 207 TKGIGMKLLERMGYKGGGLGKNEQGILKPIEARLRAKNSGLGFNNE 252
+ +G +LL++MG+KG GLGK EQGI +PI++ +R + GLG E
Sbjct: 66 SSNVGFRLLQKMGWKGKGLGKQEQGITEPIKSGIRDRRLGLGKQEE 111
>AT5G26610.2 | Symbols: | D111/G-patch domain-containing protein |
chr5:9375456-9376991 FORWARD LENGTH=301
Length = 301
Score = 50.8 bits (120), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 33/46 (71%)
Query: 207 TKGIGMKLLERMGYKGGGLGKNEQGILKPIEARLRAKNSGLGFNNE 252
+ +G +LL++MG+KG GLGK EQGI +PI++ +R + GLG E
Sbjct: 66 SSNVGFRLLQKMGWKGKGLGKQEQGITEPIKSGIRDRRLGLGKQEE 111
>AT5G26610.1 | Symbols: | D111/G-patch domain-containing protein |
chr5:9375456-9376991 FORWARD LENGTH=301
Length = 301
Score = 50.8 bits (120), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 33/46 (71%)
Query: 207 TKGIGMKLLERMGYKGGGLGKNEQGILKPIEARLRAKNSGLGFNNE 252
+ +G +LL++MG+KG GLGK EQGI +PI++ +R + GLG E
Sbjct: 66 SSNVGFRLLQKMGWKGKGLGKQEQGITEPIKSGIRDRRLGLGKQEE 111