Miyakogusa Predicted Gene
- Lj2g3v0286920.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj2g3v0286920.1 tr|G7KLZ3|G7KLZ3_MEDTR Flap endonuclease GEN-like
protein OS=Medicago truncatula GN=MTR_6g055360
PE=,85.22,0,XPG_I,XPG/RAD2 endonuclease; XPG_N,XPG N-terminal;
XPGRADSUPER,DNA repair protein (XPGC)/yeast Rad; ,CUFF.34495.1
(407 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT3G48900.2 | Symbols: | single-stranded DNA endonuclease famil... 571 e-163
AT3G48900.1 | Symbols: | single-stranded DNA endonuclease famil... 426 e-119
AT1G01880.1 | Symbols: | 5'-3' exonuclease family protein | chr... 128 9e-30
AT1G01880.2 | Symbols: | 5'-3' exonuclease family protein | chr... 127 2e-29
AT3G28030.1 | Symbols: UVH3, UVR1 | 5'-3' exonuclease family pro... 108 6e-24
AT5G26680.1 | Symbols: | 5'-3' exonuclease family protein | chr... 64 3e-10
AT5G26680.2 | Symbols: | 5'-3' exonuclease family protein | chr... 64 3e-10
AT1G29630.2 | Symbols: | 5'-3' exonuclease family protein | chr... 53 5e-07
>AT3G48900.2 | Symbols: | single-stranded DNA endonuclease family
protein | chr3:18131854-18136239 FORWARD LENGTH=600
Length = 600
Score = 571 bits (1472), Expect = e-163, Method: Compositional matrix adjust.
Identities = 273/429 (63%), Positives = 334/429 (77%), Gaps = 25/429 (5%)
Query: 1 MGVKNLWDVLESCKKTVPLHLLQNKRVCVDLSCWMVQLHNVSKSHACVKEKVHLRGLFHR 60
MGVK LWDVLE CKKT PL LQNKRVCVDLSCWMV+LH V+KS+ KEKV+LRG FHR
Sbjct: 1 MGVKYLWDVLEPCKKTFPLDHLQNKRVCVDLSCWMVELHKVNKSYCATKEKVYLRGFFHR 60
Query: 61 LRALIALNCSVVLVADGSIPAIKLSTYRRRLNVGKEVMQDETNLPKVTSLRRNMGSEFSC 120
LRALIALNCS++LV+DG+IP IK+ TY+RRL E+ D K TSL+RNMGSEFSC
Sbjct: 61 LRALIALNCSIILVSDGAIPGIKVPTYKRRLKARFEIADDGVEPSKETSLKRNMGSEFSC 120
Query: 121 MIKEAKALGMALGISCLNGIEEAEAQCALLNLELLCDGCFSLDSDIFLFGARTVYRDICL 180
+IKEAK + LGI CL+GIEEAEAQCALLN E LCD CFS DSDIFLFGA+TVYR+ICL
Sbjct: 121 IIKEAKVIASTLGILCLDGIEEAEAQCALLNSESLCDACFSFDSDIFLFGAKTVYREICL 180
Query: 181 GDGGYAVCYEMADIERKLGFGRDSLIALSLLLGSDYYQGVHGLGPESACQIVKSIGDEFI 240
G+GGY VCYEM DI++KLG GR+SLIAL+LLLGSDY QGV GL E AC++V+SIGD I
Sbjct: 181 GEGGYVVCYEMDDIKKKLGLGRNSLIALALLLGSDYSQGVRGLRQEKACELVRSIGDNVI 240
Query: 241 LKKIASEGLGWVKKRR-----------------------GGGNNLHRDEKVLEVINAYMK 277
L+K+ASEGL + +K R G + R E++ +VI+A+M
Sbjct: 241 LEKVASEGLSFAEKPRKSKKQVRPSVCSKKGTLPLVVINGNNRDPERLEEIKQVIDAFMN 300
Query: 278 PKCHSADSDVVHRALANYPFQRIQLQQICAEFFEWPSDRTDGYILPSIAERDLRRFANLR 337
PKCH ADS+ V RALA + FQR +LQ+IC +FFEWP ++TD YILP +AER+LRRFANL+
Sbjct: 301 PKCHQADSNTVSRALAEFSFQRTKLQEICHQFFEWPPEKTDEYILPKVAERNLRRFANLQ 360
Query: 338 LTSSDLGLNLPLH--EIPVKCPVSEIVKSRKVQGKECYEVTWKDMDGLETSIVPADLIES 395
S+++ +NLPLH ++P KCPVSEI+K+RKVQG+EC+EV+W D++GLE+SIVPADL+E
Sbjct: 361 SRSTEVEVNLPLHKPQMPEKCPVSEIIKTRKVQGRECFEVSWNDLEGLESSIVPADLVER 420
Query: 396 ACPEKILEF 404
ACPEKI+EF
Sbjct: 421 ACPEKIIEF 429
>AT3G48900.1 | Symbols: | single-stranded DNA endonuclease family
protein | chr3:18132449-18136239 FORWARD LENGTH=536
Length = 536
Score = 426 bits (1095), Expect = e-119, Method: Compositional matrix adjust.
Identities = 203/334 (60%), Positives = 255/334 (76%), Gaps = 25/334 (7%)
Query: 96 EVMQDETNLPKVTSLRRNMGSEFSCMIKEAKALGMALGISCLNGIEEAEAQCALLNLELL 155
++ D K TSL+RNMGSEFSC+IKEAK + LGI CL+GIEEAEAQCALLN E L
Sbjct: 32 QIADDGVEPSKETSLKRNMGSEFSCIIKEAKVIASTLGILCLDGIEEAEAQCALLNSESL 91
Query: 156 CDGCFSLDSDIFLFGARTVYRDICLGDGGYAVCYEMADIERKLGFGRDSLIALSLLLGSD 215
CD CFS DSDIFLFGA+TVYR+ICLG+GGY VCYEM DI++KLG GR+SLIAL+LLLGSD
Sbjct: 92 CDACFSFDSDIFLFGAKTVYREICLGEGGYVVCYEMDDIKKKLGLGRNSLIALALLLGSD 151
Query: 216 YYQGVHGLGPESACQIVKSIGDEFILKKIASEGLGWVKKRR------------------- 256
Y QGV GL E AC++V+SIGD IL+K+ASEGL + +K R
Sbjct: 152 YSQGVRGLRQEKACELVRSIGDNVILEKVASEGLSFAEKPRKSKKQVRPSVCSKKGTLPL 211
Query: 257 ----GGGNNLHRDEKVLEVINAYMKPKCHSADSDVVHRALANYPFQRIQLQQICAEFFEW 312
G + R E++ +VI+A+M PKCH ADS+ V RALA + FQR +LQ+IC +FFEW
Sbjct: 212 VVINGNNRDPERLEEIKQVIDAFMNPKCHQADSNTVSRALAEFSFQRTKLQEICHQFFEW 271
Query: 313 PSDRTDGYILPSIAERDLRRFANLRLTSSDLGLNLPLH--EIPVKCPVSEIVKSRKVQGK 370
P ++TD YILP +AER+LRRFANL+ S+++ +NLPLH ++P KCPVSEI+K+RKVQG+
Sbjct: 272 PPEKTDEYILPKVAERNLRRFANLQSRSTEVEVNLPLHKPQMPEKCPVSEIIKTRKVQGR 331
Query: 371 ECYEVTWKDMDGLETSIVPADLIESACPEKILEF 404
EC+EV+W D++GLE+SIVPADL+E ACPEKI+EF
Sbjct: 332 ECFEVSWNDLEGLESSIVPADLVERACPEKIIEF 365
>AT1G01880.1 | Symbols: | 5'-3' exonuclease family protein |
chr1:306558-308991 REVERSE LENGTH=599
Length = 599
Score = 128 bits (321), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 98/264 (37%), Positives = 127/264 (48%), Gaps = 13/264 (4%)
Query: 1 MGVK-NLWDVLESCKKTVPLHLLQNKRVCVDLSCWMVQLHNVSKSHACVKEKVHLRGLFH 59
MGV N WD+L + L+NKRV VDLS W+VQ K K HLR F
Sbjct: 1 MGVGGNFWDLLRPYAQQQGFDFLRNKRVAVDLSFWIVQHETAVKGFVL---KPHLRLTFF 57
Query: 60 RLRALIA-LNCSVVLVADGSIPAIKLSTYRRRLNVGKEVMQDETNLPKV---TSLRRNMG 115
R L + V V DG+ +K R + D NLP + S+ RN
Sbjct: 58 RTINLFSKFGAYPVFVVDGTPSPLKSQARISRFFRSSGI--DTCNLPVIKDGVSVERN-- 113
Query: 116 SEFSCMIKEAKALGMALGISCLNGIEEAEAQCALLNLELLCDGCFSLDSDIFLFGARTVY 175
FS ++E L LGI L EAEA CA LN + D C + DSD FLFGA V
Sbjct: 114 KLFSEWVRECVELLELLGIPVLKANGEAEALCAQLNSQGFVDACITPDSDAFLFGAMCVI 173
Query: 176 RDICLGDGGYAVCYEMADIERKLGFGRDSLIALSLLLGSDYYQ-GVHGLGPESACQIVKS 234
+DI CY M+ IE LG R LIA+SLL+G+DY GV G+G + A +IV+
Sbjct: 174 KDIKPNSREPFECYHMSHIESGLGLKRKHLIAISLLVGNDYDSGGVLGIGVDKALRIVRE 233
Query: 235 IGDEFILKKIASEGLGWVKKRRGG 258
++ +L+++ G G GG
Sbjct: 234 FSEDQVLERLQDIGNGLQPAVPGG 257
>AT1G01880.2 | Symbols: | 5'-3' exonuclease family protein |
chr1:306558-308991 REVERSE LENGTH=598
Length = 598
Score = 127 bits (318), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 98/264 (37%), Positives = 129/264 (48%), Gaps = 14/264 (5%)
Query: 1 MGVK-NLWDVLESCKKTVPLHLLQNKRVCVDLSCWMVQLHNVSKSHACVKEKVHLRGLFH 59
MGV N WD+L + L+NKRV VDLS W+VQ K K HLR F
Sbjct: 1 MGVGGNFWDLLRPYAQQQGFDFLRNKRVAVDLSFWIVQHETAVKGFVL---KPHLRLTFF 57
Query: 60 RLRALIA-LNCSVVLVADGSIPAIKLSTYRRRLNVGKEVMQDETNLPKV---TSLRRNMG 115
R L + V V DG+ +K R + D NLP + S+ RN
Sbjct: 58 RTINLFSKFGAYPVFVVDGTPSPLKSQARISRFFRSSGI--DTCNLPVIKDGVSVERN-- 113
Query: 116 SEFSCMIKEAKALGMALGISCLNGIEEAEAQCALLNLELLCDGCFSLDSDIFLFGARTVY 175
FS ++E + L + LGI L EAEA CA LN + D C + DSD FLFGA V
Sbjct: 114 KLFSEWVRECELLEL-LGIPVLKANGEAEALCAQLNSQGFVDACITPDSDAFLFGAMCVI 172
Query: 176 RDICLGDGGYAVCYEMADIERKLGFGRDSLIALSLLLGSDYYQ-GVHGLGPESACQIVKS 234
+DI CY M+ IE LG R LIA+SLL+G+DY GV G+G + A +IV+
Sbjct: 173 KDIKPNSREPFECYHMSHIESGLGLKRKHLIAISLLVGNDYDSGGVLGIGVDKALRIVRE 232
Query: 235 IGDEFILKKIASEGLGWVKKRRGG 258
++ +L+++ G G GG
Sbjct: 233 FSEDQVLERLQDIGNGLQPAVPGG 256
>AT3G28030.1 | Symbols: UVH3, UVR1 | 5'-3' exonuclease family protein
| chr3:10424321-10431178 FORWARD LENGTH=1479
Length = 1479
Score = 108 bits (270), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 58/135 (42%), Positives = 81/135 (60%), Gaps = 1/135 (0%)
Query: 110 LRRNMGSEFSCMIKEAKALGMALGISCLNGIEEAEAQCALLNLELLCDGCFSLDSDIFLF 169
L RN S S M E + L GI + EAEAQCA + L DG + DSD+FLF
Sbjct: 914 LERNAESVSSEMFAECQELLQIFGIPYIIAPMEAEAQCAFMEQSNLVDGIVTDDSDVFLF 973
Query: 170 GARTVYRDICLGDGGYAVCYEMADIERKLGFGRDSLIALSLLLGSDYYQGVHGLGPESAC 229
GAR+VY++I D Y Y M DIE++LG RD +I +++LLGSDY +G+ G+G +A
Sbjct: 974 GARSVYKNI-FDDRKYVETYFMKDIEKELGLSRDKIIRMAMLLGSDYTEGISGIGIVNAI 1032
Query: 230 QIVKSIGDEFILKKI 244
++V + +E L+K
Sbjct: 1033 EVVTAFPEEDGLQKF 1047
Score = 63.9 bits (154), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 34/93 (36%), Positives = 50/93 (53%), Gaps = 3/93 (3%)
Query: 1 MGVKNLWDVLESCKKTVPLHLLQNKRVCVDLSCWMVQ-LHNVSKSHACVKEKVHLRGLFH 59
MGV+ LW++L + V + L NKR+ +D S WMVQ + + + + HL G F
Sbjct: 1 MGVQGLWELLAPVGRRVSVETLANKRLAIDASIWMVQFIKAMRDEKGDMVQNAHLIGFFR 60
Query: 60 RLRALIALNCSVVLVADGSIPAIKLSTY--RRR 90
R+ L+ L + V DG+ PA+K T RRR
Sbjct: 61 RICKLLFLRTKPIFVFDGATPALKRRTVIARRR 93
>AT5G26680.1 | Symbols: | 5'-3' exonuclease family protein |
chr5:9311882-9315458 REVERSE LENGTH=453
Length = 453
Score = 63.5 bits (153), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 68/267 (25%), Positives = 106/267 (39%), Gaps = 20/267 (7%)
Query: 1 MGVKNLWDVL----ESCKKTVPLHLLQNKRVCVDLSCWMVQLHNV---SKSHACVKE--- 50
MG+K L +L SC K +++ VD S + Q V + + E
Sbjct: 1 MGIKGLTKLLADNAPSCMKEQKFESYFGRKIAVDASMSIYQFLIVVGRTGTEMLTNEAGE 60
Query: 51 -KVHLRGLFHRLRALIALNCSVVLVADGSIPAIKLSTYRRRLNVGKEVMQDET------N 103
HL+G+F+R L+ V V DG P +K +R + + D T N
Sbjct: 61 VTSHLQGMFNRTIRLLEAGIKPVYVFDGKPPELKRQELAKRYSKRADATADLTGAIEAGN 120
Query: 104 LPKVTSLRRNMGSEFSCMIKEAKALGMALGISCLNGIEEAEAQCALLNLELLCDGCFSLD 163
+ + + K L +G+ + EAEAQCA L G S D
Sbjct: 121 KEDIEKYSKRTVKVTKQHNDDCKRLLRLMGVPVVEATSEAEAQCAALCKSGKVYGVASED 180
Query: 164 SDIFLFGARTVYRDICLGDGGY--AVCYEMADIERKLGFGRDSLIALSLLLGSDYYQGVH 221
D FGA R + + +E+A I +L D I L +L G DY +
Sbjct: 181 MDSLTFGAPKFLRHLMDPSSRKIPVMEFEVAKILEELQLTMDQFIDLCILSGCDYCDSIR 240
Query: 222 GLGPESACQIVKSIGD-EFILKKIASE 247
G+G ++A ++++ G E IL+ + E
Sbjct: 241 GIGGQTALKLIRQHGSIETILENLNKE 267
>AT5G26680.2 | Symbols: | 5'-3' exonuclease family protein |
chr5:9311882-9315458 REVERSE LENGTH=383
Length = 383
Score = 63.5 bits (153), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 68/267 (25%), Positives = 106/267 (39%), Gaps = 20/267 (7%)
Query: 1 MGVKNLWDVL----ESCKKTVPLHLLQNKRVCVDLSCWMVQLHNV---SKSHACVKE--- 50
MG+K L +L SC K +++ VD S + Q V + + E
Sbjct: 1 MGIKGLTKLLADNAPSCMKEQKFESYFGRKIAVDASMSIYQFLIVVGRTGTEMLTNEAGE 60
Query: 51 -KVHLRGLFHRLRALIALNCSVVLVADGSIPAIKLSTYRRRLNVGKEVMQDET------N 103
HL+G+F+R L+ V V DG P +K +R + + D T N
Sbjct: 61 VTSHLQGMFNRTIRLLEAGIKPVYVFDGKPPELKRQELAKRYSKRADATADLTGAIEAGN 120
Query: 104 LPKVTSLRRNMGSEFSCMIKEAKALGMALGISCLNGIEEAEAQCALLNLELLCDGCFSLD 163
+ + + K L +G+ + EAEAQCA L G S D
Sbjct: 121 KEDIEKYSKRTVKVTKQHNDDCKRLLRLMGVPVVEATSEAEAQCAALCKSGKVYGVASED 180
Query: 164 SDIFLFGARTVYRDICLGDGGY--AVCYEMADIERKLGFGRDSLIALSLLLGSDYYQGVH 221
D FGA R + + +E+A I +L D I L +L G DY +
Sbjct: 181 MDSLTFGAPKFLRHLMDPSSRKIPVMEFEVAKILEELQLTMDQFIDLCILSGCDYCDSIR 240
Query: 222 GLGPESACQIVKSIGD-EFILKKIASE 247
G+G ++A ++++ G E IL+ + E
Sbjct: 241 GIGGQTALKLIRQHGSIETILENLNKE 267
>AT1G29630.2 | Symbols: | 5'-3' exonuclease family protein |
chr1:10349587-10353538 FORWARD LENGTH=735
Length = 735
Score = 52.8 bits (125), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 63/256 (24%), Positives = 108/256 (42%), Gaps = 53/256 (20%)
Query: 1 MGVKNLWDVLESCKKTVPLHL--LQNKRVCVDLSCWMVQLHNVSKSHACVKE-------K 51
MG++ L +L+S VP+H+ L+ V VD W LH + S C +E K
Sbjct: 1 MGIQGLLPLLKSI--MVPIHIKELEGCIVAVDTYSW---LHKGALS--CSRELCKGLPTK 53
Query: 52 VHLRGLFHRLRALIALNCSVVLVADGSIPAIKLSTYRRRLNVGKE----VMQDETNLPKV 107
H++ HR+ L ++V DG +KL +R KE ++ E N
Sbjct: 54 RHIQYCMHRVNLLRHHGVKPIMVFDGGPLPMKLEQENKRARSRKENLARALEHEAN---- 109
Query: 108 TSLRRNMGSEFSCMIKEAKALGMALGIS-------------CLNGIEEAEAQCALLNLEL 154
N + + C +KA+ ++ I+ + EA+AQ A L +
Sbjct: 110 ----GNSSAAYECY---SKAVDISPSIAHELIQVLRQENVDYVVAPYEADAQMAFLAITK 162
Query: 155 LCDGCFSLDSDIFLFGA-RTVYRDICLGDGGYAVCYEMADIERK-----LGFGRDSLIAL 208
D + DSD+ FG R +++ + G+ V ++ + + + GF L+ +
Sbjct: 163 QVDAIITEDSDLIPFGCLRIIFK---MDKFGHGVEFQASKLPKNKDLSLSGFSSQMLLEM 219
Query: 209 SLLLGSDYYQGVHGLG 224
+L G DY Q + G+G
Sbjct: 220 CILSGCDYLQSLPGMG 235