Miyakogusa Predicted Gene
- Lj0g3v0104059.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0104059.1 Non Chatacterized Hit- tr|E1ZBP1|E1ZBP1_CHLVA
Putative uncharacterized protein OS=Chlorella variabil,66.67,0.094,no
description,NULL; seg,NULL; DNA_MISMATCH_REPAIR_2,DNA mismatch repair
protein MutS, C-terminal; A,CUFF.5903.1
(272 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT3G20475.1 | Symbols: ATMSH5, MSH5 | MUTS-homologue 5 | chr3:71... 355 1e-98
AT4G25540.1 | Symbols: MSH3, ATMSH3 | homolog of DNA mismatch re... 117 6e-27
AT3G18524.1 | Symbols: MSH2, ATMSH2 | MUTS homolog 2 | chr3:6368... 113 2e-25
AT4G02070.2 | Symbols: MSH6 | MUTS homolog 6 | chr4:906079-91293... 112 3e-25
AT4G02070.1 | Symbols: MSH6, MSH6-1, ATMSH6 | MUTS homolog 6 | c... 112 3e-25
AT3G24495.1 | Symbols: MSH7, MSH6-2, ATMSH7 | MUTS homolog 7 | c... 104 5e-23
AT4G17380.1 | Symbols: MSH4, ATMSH4 | MUTS-like protein 4 | chr4... 95 4e-20
AT5G54090.1 | Symbols: | DNA mismatch repair protein MutS, type... 70 1e-12
AT3G24320.1 | Symbols: MSH1, CHM1, ATMSH1, CHM | MUTL protein ho... 59 5e-09
>AT3G20475.1 | Symbols: ATMSH5, MSH5 | MUTS-homologue 5 |
chr3:7143546-7150811 FORWARD LENGTH=807
Length = 807
Score = 355 bits (912), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 179/265 (67%), Positives = 197/265 (74%), Gaps = 30/265 (11%)
Query: 1 MERAITRDLFSRVLLFSTHLIKVATFAAELDW---------------------------- 32
MERAI RDL S LLFS HL+K F AELD
Sbjct: 479 MERAIIRDLLSHTLLFSAHLLKAVNFVAELDCILSLACVAHQNNYVRPVLTVESLLDIRN 538
Query: 33 --HVLQEMTVDTFIPNDTKMLHDGRINIITGPNFSGKSIYIKQVALIVFLSHIGCFVPAD 90
HVLQEM VDTFIPNDT++ +GRI+IITGPN+SGKSIY+KQVALIVFLSHIG FVPAD
Sbjct: 539 GRHVLQEMAVDTFIPNDTEINDNGRIHIITGPNYSGKSIYVKQVALIVFLSHIGSFVPAD 598
Query: 91 AATVGLTDRIFCATGSRLMTAEQSTFMIDLHQVGMMLRHATSRSLCLVDEFGKGTLTEDG 150
AATVGLTDRIFCA GS+ MTAEQSTFMIDLHQVGMMLR ATSRSLCL+DEFGKGTLTEDG
Sbjct: 599 AATVGLTDRIFCAMGSKFMTAEQSTFMIDLHQVGMMLRQATSRSLCLLDEFGKGTLTEDG 658
Query: 151 TGLLAGTINHFVTYDEPPKVFVCTHLMDLLHGHSLTKSEQIMFYTMSILRPDESSTQIED 210
GLL GTI+HF T EPP+V VCTHL +LL+ L SE+I FYTMS+LRPD S +E+
Sbjct: 659 IGLLGGTISHFATCAEPPRVVVCTHLTELLNESCLPVSEKIKFYTMSVLRPDTESANMEE 718
Query: 211 IVFLYRLVPGHAHHSYGLHCALLAG 235
IVFLYRL+PG SYGLHCALLAG
Sbjct: 719 IVFLYRLIPGQTLLSYGLHCALLAG 743
>AT4G25540.1 | Symbols: MSH3, ATMSH3 | homolog of DNA mismatch
repair protein MSH3 | chr4:13042700-13048115 REVERSE
LENGTH=1081
Length = 1081
Score = 117 bits (294), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 73/204 (35%), Positives = 111/204 (54%), Gaps = 4/204 (1%)
Query: 34 VLQEMTVDTFIPNDTKMLHDGRI-NIITGPNFSGKSIYIKQVALIVFLSHIGCFVPADAA 92
VL+ + D F+PNDT + +G IITGPN GKS YI+QVALI ++ +G FVPA A
Sbjct: 789 VLETILQDNFVPNDTILHAEGEYCQIITGPNMGGKSCYIRQVALISIMAQVGSFVPASFA 848
Query: 93 TVGLTDRIFCATG-SRLMTAEQSTFMIDLHQVGMMLRHATSRSLCLVDEFGKGTLTEDGT 151
+ + D +F G S + +STF+ +L + ++R +SRSL ++DE G+GT T DG
Sbjct: 849 KLHVLDGVFTRMGASDSIQHGRSTFLEELSEASHIIRTCSSRSLVILDELGRGTSTHDGV 908
Query: 152 GLLAGTINHFVTYDEPPKVFVCTHLMDLLH-GHSLTKSEQIMFYTMSILRPDESSTQIED 210
+ T+ H + +FV TH ++ + S + L+ D+ S +D
Sbjct: 909 AIAYATLQHLLAEKRCLVLFV-THYPEIAEISNGFPGSVGTYHVSYLTLQKDKGSYDHDD 967
Query: 211 IVFLYRLVPGHAHHSYGLHCALLA 234
+ +LY+LV G S+G A LA
Sbjct: 968 VTYLYKLVRGLCSRSFGFKVAQLA 991
>AT3G18524.1 | Symbols: MSH2, ATMSH2 | MUTS homolog 2 |
chr3:6368151-6372409 REVERSE LENGTH=937
Length = 937
Score = 113 bits (282), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 66/203 (32%), Positives = 103/203 (50%), Gaps = 16/203 (7%)
Query: 42 TFIPNDTKMLH-DGRINIITGPNFSGKSIYIKQVALIVFLSHIGCFVPADAATVGLTDRI 100
FIPND +++ I+TGPN GKS +I+QV +IV ++ +G FVP D A++ + D I
Sbjct: 646 NFIPNDCRLMRGKSWFQIVTGPNMGGKSTFIRQVGVIVLMAQVGSFVPCDKASISIRDCI 705
Query: 101 FCATGS-RLMTAEQSTFMIDLHQVGMMLRHATSRSLCLVDEFGKGTLTEDGTGLLAGTIN 159
F G+ STFM ++ + +L+ A+ +SL ++DE G+GT T DG GL
Sbjct: 706 FARVGAGDCQLRGVSTFMQEMLETASILKGASDKSLIIIDELGRGTSTYDGFGLAWAICE 765
Query: 160 HFVTYDEPPKVFVCTHLMDLLHGHSLT----KSEQIMFYTMSILRPDESS---TQIEDIV 212
H V P +F H H LT + ++ T+ + S+ T+ +
Sbjct: 766 HLVQVKRAPTLFAT-------HFHELTALAQANSEVSGNTVGVANFHVSAHIDTESRKLT 818
Query: 213 FLYRLVPGHAHHSYGLHCALLAG 235
LY++ PG S+G+H A A
Sbjct: 819 MLYKVEPGACDQSFGIHVAEFAN 841
>AT4G02070.2 | Symbols: MSH6 | MUTS homolog 6 | chr4:906079-912930
FORWARD LENGTH=1321
Length = 1321
Score = 112 bits (279), Expect = 3e-25, Method: Composition-based stats.
Identities = 68/197 (34%), Positives = 101/197 (51%), Gaps = 7/197 (3%)
Query: 42 TFIPNDTKM--LHDGRINIITGPNFSGKSIYIKQVALIVFLSHIGCFVPADAATVGLTDR 99
+F+PN+ K+ ++TGPN GKS ++QV L V L+ IG VPA+ V D+
Sbjct: 1059 SFVPNNVKIGGAEKASFILLTGPNMGGKSTLLRQVCLAVILAQIGADVPAETFEVSPVDK 1118
Query: 100 IFCATGSR-LMTAEQSTFMIDLHQVGMMLRHATSRSLCLVDEFGKGTLTEDGTGLLAGTI 158
I G++ + A QSTF+ +L + +ML AT SL ++DE G+GT T DG + +
Sbjct: 1119 ICVRMGAKDHIMAGQSTFLTELSETAVMLTSATRNSLVVLDELGRGTATSDGQAIAESVL 1178
Query: 159 NHFVTYDEPPKVFVCTHLMDLLHGHSLTKSEQIMFYTMSILRPDESSTQIEDIVFLYRLV 218
HF+ + + F TH L + + I E +E++ FLYRL
Sbjct: 1179 EHFIEKVQ-CRGFFSTHYHRLSVDYQTNPKVSLCHMACQI---GEGIGGVEEVTFLYRLT 1234
Query: 219 PGHAHHSYGLHCALLAG 235
PG SYG++ A LAG
Sbjct: 1235 PGACPKSYGVNVARLAG 1251
>AT4G02070.1 | Symbols: MSH6, MSH6-1, ATMSH6 | MUTS homolog 6 |
chr4:906079-912930 FORWARD LENGTH=1324
Length = 1324
Score = 112 bits (279), Expect = 3e-25, Method: Composition-based stats.
Identities = 68/197 (34%), Positives = 101/197 (51%), Gaps = 7/197 (3%)
Query: 42 TFIPNDTKM--LHDGRINIITGPNFSGKSIYIKQVALIVFLSHIGCFVPADAATVGLTDR 99
+F+PN+ K+ ++TGPN GKS ++QV L V L+ IG VPA+ V D+
Sbjct: 1062 SFVPNNVKIGGAEKASFILLTGPNMGGKSTLLRQVCLAVILAQIGADVPAETFEVSPVDK 1121
Query: 100 IFCATGSR-LMTAEQSTFMIDLHQVGMMLRHATSRSLCLVDEFGKGTLTEDGTGLLAGTI 158
I G++ + A QSTF+ +L + +ML AT SL ++DE G+GT T DG + +
Sbjct: 1122 ICVRMGAKDHIMAGQSTFLTELSETAVMLTSATRNSLVVLDELGRGTATSDGQAIAESVL 1181
Query: 159 NHFVTYDEPPKVFVCTHLMDLLHGHSLTKSEQIMFYTMSILRPDESSTQIEDIVFLYRLV 218
HF+ + + F TH L + + I E +E++ FLYRL
Sbjct: 1182 EHFIEKVQ-CRGFFSTHYHRLSVDYQTNPKVSLCHMACQI---GEGIGGVEEVTFLYRLT 1237
Query: 219 PGHAHHSYGLHCALLAG 235
PG SYG++ A LAG
Sbjct: 1238 PGACPKSYGVNVARLAG 1254
>AT3G24495.1 | Symbols: MSH7, MSH6-2, ATMSH7 | MUTS homolog 7 |
chr3:8912418-8918086 REVERSE LENGTH=1109
Length = 1109
Score = 104 bits (260), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 70/208 (33%), Positives = 105/208 (50%), Gaps = 22/208 (10%)
Query: 44 IPNDTKMLHDGRIN---------IITGPNFSGKSIYIKQVALIVFLSHIGCFVPADAATV 94
+PND +L + R + ++TGPN GKS ++ L V + +GC+VP ++ +
Sbjct: 828 VPNDI-LLGEARRSSGSIHPRSLLLTGPNMGGKSTLLRATCLAVIFAQLGCYVPCESCEI 886
Query: 95 GLTDRIFCATGS--RLMTAEQSTFMIDLHQVGMMLRHATSRSLCLVDEFGKGTLTEDGTG 152
L D IF G+ R+MT E STF+++ + +L++AT SL ++DE G+GT T DG
Sbjct: 887 SLVDTIFTRLGASDRIMTGE-STFLVECTETASVLQNATQDSLVILDELGRGTSTFDGYA 945
Query: 153 LLAGTINHFVTYDEPPKVFVCTHLMDLL-----HGHSLTKSEQIMFYTMSILRPDESSTQ 207
+ H V + +F TH L H +K F + S +P
Sbjct: 946 IAYSVFRHLVEKVQCRMLFA-THYHPLTKEFASHPRVTSKHMACAFKSRSDYQPRGCD-- 1002
Query: 208 IEDIVFLYRLVPGHAHHSYGLHCALLAG 235
+D+VFLYRL G SYGL AL+AG
Sbjct: 1003 -QDLVFLYRLTEGACPESYGLQVALMAG 1029
>AT4G17380.1 | Symbols: MSH4, ATMSH4 | MUTS-like protein 4 |
chr4:9708742-9714068 FORWARD LENGTH=792
Length = 792
Score = 95.1 bits (235), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 50/147 (34%), Positives = 81/147 (55%), Gaps = 6/147 (4%)
Query: 8 DLFSRVLLFSTHLIKVATFAAELDWHVLQEMTVDTFIPNDTKMLHDGRINIITGPNFSGK 67
D +SR L A + H + E + F+ N M + ++ GPN SGK
Sbjct: 505 DRYSR-----PELTDSGPLAIDAGRHPILESIHNDFVSNSIFMSEATNMLVVMGPNMSGK 559
Query: 68 SIYIKQVALIVFLSHIGCFVPADAATVGLTDRIFCATGS-RLMTAEQSTFMIDLHQVGMM 126
S Y++QV L+V L+ IGC+VPA AT+ + DRIF G+ + + STFM ++ + +
Sbjct: 560 STYLQQVCLVVILAQIGCYVPARFATIRVVDRIFTRMGTMDNLESNSSTFMTEMRETAFI 619
Query: 127 LRHATSRSLCLVDEFGKGTLTEDGTGL 153
+++ T+RSL ++DE G+ T + DG +
Sbjct: 620 MQNVTNRSLIVMDELGRATSSSDGLAM 646
>AT5G54090.1 | Symbols: | DNA mismatch repair protein MutS, type 2
| chr5:21948283-21952550 REVERSE LENGTH=796
Length = 796
Score = 70.1 bits (170), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 43/121 (35%), Positives = 64/121 (52%), Gaps = 2/121 (1%)
Query: 44 IPNDTKMLHDGRINIITGPNFSGKSIYIKQVALIVFLSHIGCFVPA-DAATVGLTDRIFC 102
IP D ++ R+ +ITGPN GK+I +K V L ++ G +V A ++A + D I+
Sbjct: 424 IPADFQISKGTRVLVITGPNTGGKTICLKSVGLAAMMAKSGLYVLATESARIPWFDNIYA 483
Query: 103 ATG-SRLMTAEQSTFMIDLHQVGMMLRHATSRSLCLVDEFGKGTLTEDGTGLLAGTINHF 161
G + + STF L Q+ +L H+TSRSL L+DE G GT +G L + F
Sbjct: 484 DIGDEQSLLQSLSTFSGHLKQISEILSHSTSRSLVLLDEVGAGTNPLEGAALGMAILESF 543
Query: 162 V 162
Sbjct: 544 A 544
>AT3G24320.1 | Symbols: MSH1, CHM1, ATMSH1, CHM | MUTL protein
homolog 1 | chr3:8823229-8829571 REVERSE LENGTH=1118
Length = 1118
Score = 58.5 bits (140), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 57/102 (55%), Gaps = 2/102 (1%)
Query: 58 IITGPNFSGKSIYIKQVALIVFLSHIGCFVPADAATVGLTDRIFCATGSRLMTAE-QSTF 116
++TGPN GKS ++ + L G VPA++A + D I S + +S+F
Sbjct: 765 LLTGPNGGGKSSLLRSICAAALLGISGLMVPAESACIPHFDSIMLHMKSYDSPVDGKSSF 824
Query: 117 MIDLHQVGMMLRHATSRSLCLVDEFGKGTLTEDGTGLLAGTI 158
+++ ++ ++ ATSRSL L+DE +GT T GT +AG++
Sbjct: 825 QVEMSEIRSIVSQATSRSLVLIDEICRGTETAKGT-CIAGSV 865