Miyakogusa Predicted Gene - Lj5g3v0962480.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v0962480.1 tr|G7IEG4|G7IEG4_MEDTR Hepatoma-derived growth
factor-related protein OS=Medicago truncatula
GN=MTR_,67.7,0,PWWP,PWWP; CID,RNA polymerase II, large subunit, CTD;
domain with conserved PWWP motif,PWWP; no desc,CUFF.54394.1
(1372 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Sequences producing significant alignments:   Score (bits)   E value
AT3G63070.1 | Symbols: | Tudor/PWWP/MBT domain-containing prote... 322 2e-87
AT2G48160.1 | Symbols: | Tudor/PWWP/MBT domain-containing prote... 318 1e-86
AT5G23150.1 | Symbols: HUA2 | Tudor/PWWP/MBT domain-containing p... 244 3e-64
AT5G08230.1 | Symbols: | Tudor/PWWP/MBT domain-containing prote... 219 1e-56
>AT3G63070.1 | Symbols: | Tudor/PWWP/MBT domain-containing protein |
chr3:23302667-23309575 FORWARD LENGTH=1347
Length = 1347
Score = 322 bits (824), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 186/349 (53%), Positives = 238/349 (68%), Gaps = 25/349 (7%)
Query: 730 NSVSTSDDCLGEKDISGMPSSPSLTDGGDCIPQGSPPNTSVCNVSTSDSSNILQNGSC-S 788
NS+S S++ EK + SSP+ +G+P SVC +ST++S N +QN S S
Sbjct: 763 NSISVSENFSREK----LNSSPA---------RGTPNCNSVCRISTAESENAMQNNSYYS 809
Query: 789 PDVHLHPMQVQAVSGPLDGSKNGYTATQQSRSTGKSTEAARAALKYFQATLATLTRTKES 848
+V + V + SK TQ + + ++ F+ L +L RTKE+
Sbjct: 810 TNVQYGENKSLNVDTVKEESKVETGTTQVKKVVSSDVQCT---VESFETALDSLVRTKET 866
Query: 849 IGRATRIAIDCAKFGLAAKVMESLAHNLE-ESSLQRRVDLFFLVDSIAQ-SRGLKGDVCG 906
IGRATR+A+D AKFG++AK ME LAH LE ES+LQRRVDLFFLVDSIAQ S+GL GD G
Sbjct: 867 IGRATRLAMDLAKFGVSAKAMEILAHTLESESNLQRRVDLFFLVDSIAQCSKGLNGDAGG 926
Query: 907 VYPSAIQAVLPRLLSAAAPPGNTAQENRRQCRKVLKVWLERKILPESLIRHHIRELDLRS 966
VY S+IQA+LPRLL+AA P G T QENR+QC KVL++WLER+ILPES++RHHIRELD
Sbjct: 927 VYLSSIQAMLPRLLTAAVPAGATTQENRKQCLKVLRLWLERRILPESIVRHHIRELD-SL 985
Query: 967 SSAFAGPFSRRTSRTERALDDPIRDMEGMLVDEYGSNSSFQLPGFCMPRMLKXXXXXXXX 1026
S+ A +SRR++RTERALDDP+RDMEG+LVDEYGSNS+ QL GFC+P +L+
Sbjct: 986 SNVPACLYSRRSARTERALDDPVRDMEGILVDEYGSNSTLQLHGFCIPPILR--DEDEGS 1043
Query: 1027 XXXXXNFEAVTPEHDSE-VQEMV--STIEKHRHILEDVDGELEMEDVAP 1072
+FE+VTPEH+S ++E V S E+H ILEDVDGELEMEDVAP
Sbjct: 1044 DSDGGDFESVTPEHESRSLEEHVTPSITERHTRILEDVDGELEMEDVAP 1092
Score = 265 bits (676), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 185/464 (39%), Positives = 254/464 (54%), Gaps = 53/464 (11%)
Query: 22 WKVGDLVLAKVKGFPAWPATVSEPEKWGYSTDWKKVLVYFFGTQQIAFCNPADVEAFTEE 81
WKVGDLVLAKVKGFPAWPA V EPEKWG+S D KKV V+FFGTQQIAFCN DVE+FTEE
Sbjct: 22 WKVGDLVLAKVKGFPAWPAVVDEPEKWGHSADSKKVTVHFFGTQQIAFCNHGDVESFTEE 81
Query: 82 KKQSLLLKRQGKGADFVRAVQEIVDSYEKLKKESQ------LDEASLGG----------- 124
KKQSLL +R KG+DFVRAV+EI +SYEKLK++ Q +E + G
Sbjct: 82 KKQSLLTRRHAKGSDFVRAVKEITESYEKLKQQDQASGPKYAEETTAGSSGNTSQLPQAC 141
Query: 125 -NIVDANVSNPVSTSARDLTDAPKLTHELVCVAEDDSTAVLKEESHDKEALLEEPTDNVS 183
N++ + + + +S+ D EL ++ED S A D+ +
Sbjct: 142 ENLIGSRLDTQIESSSSHGRD------ELTLLSEDASAAEQMLALRHNTLAHNGACDSAA 195
Query: 184 AVQSPKPVTYSSRKRS----AGDLCPRGCVTHRHMPLRRSRISS-----RAQNFVLPCSD 234
A + TYSSR+R+ A P+ + +P+ S+ISS R Q +L CSD
Sbjct: 196 AKDLCEIATYSSRRRNERVRALKYAPQSII----LPVEHSKISSRLELDRVQRSMLQCSD 251
Query: 235 CGKSTGNPSTNEGLSSSVKRNTSVRKSPDSGCNDFDSSAFVSNGSMEDKGSGILTIDSDA 294
G PS N +++R +R S S +D SS +GS ED S I T++S+
Sbjct: 252 -----GGPSVNSINGKAIRRRKRIRTSGQSESDDVVSSDLNLHGSDEDNASEIATVESNN 306
Query: 295 FSLNEGSTMDSNFKLEHSDTI--ECLEKVELNKALDLDINTVIHXXXXXXXXXXVTNDAS 352
S NEG+ +DS K+E+SD + C ELNK LD I+T++ T+D
Sbjct: 307 NSRNEGNGVDSGSKVEYSDAVGEGCDGGHELNKGLDFHISTMVKRKKRKPTRKRETSDII 366
Query: 353 KPTGRLE------EACVQNTSQSSQNICGNSEQRGFEQDGDEHLPLVKRARVRMGKSSSM 406
P ++E AC ++ Q SQN +R E++GDEHLPLVKRARVRM ++
Sbjct: 367 DPPAKVEAEGLGPNAC--DSCQRSQNSHERLNERPCEENGDEHLPLVKRARVRMSRAFYA 424
Query: 407 EAELNSVLQAQEKSCKEDINSPHLMIT-SSNCENGSSADGDSSV 449
+ ++N+ Q +E+S K+ + S L + S N ENG + D+S
Sbjct: 425 DEKVNASSQVEERSSKDTLLSAALQTSPSVNHENGIGSGHDTSA 468
Score = 72.0 bits (175), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 41/130 (31%), Positives = 63/130 (48%), Gaps = 13/130 (10%)
Query: 1248 ERHMNHRREVXXXXXXXSYPNRQHFVENMERENFYNNHERIKPPPFDHRDRWN-GPGPYP 1306
+ H+ RRE SYP+R H+ + N+ +++ER++P P ++RD W P
Sbjct: 1226 DHHIKSRRE------GLSYPHRSHYTLEFDERNYQDSYERMRPEPCENRDNWRYHPPSSH 1279
Query: 1307 GPRYQDNGAPP----PYGCHPYESPRMPGHGWRFPPRSGNHNHRSSAPFRPPFEDAIPVA 1362
GPRY D P Y H +S R+ + W PR+ +N+R S ++ E +PV
Sbjct: 1280 GPRYHDRHKGPHQSSSYSGHHRDSGRLQNNRWSDSPRA--YNNRHSYHYKQHSEGPVPVG 1337
Query: 1363 NRGPNFWRPR 1372
R P W R
Sbjct: 1338 MRDPGTWHQR 1347
>AT2G48160.1 | Symbols: | Tudor/PWWP/MBT domain-containing protein |
chr2:19689784-19696584 REVERSE LENGTH=1366
Length = 1366
Score = 318 bits (816), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 210/453 (46%), Positives = 277/453 (61%), Gaps = 48/453 (10%)
Query: 675 KSDIILPQNSIDVPQNEVAA--CEDMKCLKPAVADVKITNDMSEVREIKCKGPEKDMNSV 732
+SD+I+ +N VP NE+ CED AV D + + E + K + + NSV
Sbjct: 700 ESDVIVGEN---VPLNEIGCTKCED------AVEDSRQLKMIGETNDQKQQ--VQTNNSV 748
Query: 733 STSDDCLGEKDISGMPSSPSLTDGGDCIPQGSPPNTSVC-NVSTSDSSNILQNGSC-SPD 790
S++ EK M SP++T D +G+P ++SV ++STS+S+N +QN S SP+
Sbjct: 749 LVSENLSREK----MSFSPAIT--ADTPARGTPHSSSVYYHISTSESANDMQNNSSGSPN 802
Query: 791 VHLHPMQ--VQAVSGPLDGSKNGYTATQQSRSTGKSTEAARAALKYFQATLATLTRTKES 848
+ + A+ + + G Q+ S ++ + ++ L +L RTKES
Sbjct: 803 IPTGEKKNDCDAIVKEEEKIETGVCQGQKVVSCD-----VQSTRESYEDALCSLVRTKES 857
Query: 849 IGRATRIAIDCAKFGLAAKVMESLAHNLE-ESSLQRRVDLFFLVDSIAQ-SRGLKGDVCG 906
IGRAT +A+D KFG++AK ME LAH LE ES+L+RRVDLFFLVDSIAQ S+GLKGD
Sbjct: 858 IGRATCLAMDLMKFGVSAKAMEILAHTLESESNLKRRVDLFFLVDSIAQCSKGLKGDTGC 917
Query: 907 VYPSAIQAVLPRLLSAAAPPGNTAQENRRQCRKVLKVWLERKILPESLIRHHIRELDLRS 966
VY SAIQ +LPRLL+AA P G T QENR+QC KVLK+WLER+ILPES++RHHIRELD
Sbjct: 918 VYLSAIQVILPRLLAAAVPAGATTQENRKQCLKVLKLWLERRILPESIVRHHIRELD-SH 976
Query: 967 SSAFAGPFSRRTSRTERALDDPIRDMEGMLVDEYGSNSSFQLPGFCMPRMLKXXX----- 1021
S A +SRR++RTER+LDDP+RDME MLVDEYGSNS+ QLPGFCMP +LK
Sbjct: 977 SIVPACLYSRRSARTERSLDDPVRDMEDMLVDEYGSNSTLQLPGFCMPALLKDEEGGSDS 1036
Query: 1022 -----XXXXXXXXXXNFEAVTPEHDSEVQE---MVSTIEKHRHILEDVDGELEMEDVAPS 1073
+FE+VTPEH+S + E ST E+H ILEDVDGELEMEDVAP
Sbjct: 1037 EGGCDSEGGSDSDGGDFESVTPEHESRILEENVSSSTAERHTLILEDVDGELEMEDVAPP 1096
Query: 1074 CDVE----MNSFCNGDRGNATQFEKNLPVFSTT 1102
E + N N +++ PVF T+
Sbjct: 1097 WGTENCTHTDQADNTKVSNCQLGQQHRPVFGTS 1129
Score = 262 bits (670), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 175/424 (41%), Positives = 239/424 (56%), Gaps = 20/424 (4%)
Query: 23 KVGDLVLAKVKGFPAWPATVSEPEKWGYSTDWKKVLVYFFGTQQIAFCNPADVEAFTEEK 82
KVGDLVLAKVKGFPAWPA VSEPEKW S D KKV V+FFGTQQIAFCNP DVEAFTEE+
Sbjct: 23 KVGDLVLAKVKGFPAWPAVVSEPEKWDASPDSKKVFVHFFGTQQIAFCNPGDVEAFTEER 82
Query: 83 KQSLLLKRQGKGADFVRAVQEIVDSYEKLKKESQ------LDEASLGG----NIVDANVS 132
KQSLL +R KG+DFVRAV+EI++SYEKLK++ + +E +LG ++ +
Sbjct: 83 KQSLLTRRHAKGSDFVRAVKEIIESYEKLKQQERASDPKSAEEGTLGSAENTTLMPQVIE 142
Query: 133 NPVSTSARDLTDAPKLTH-ELVCVAEDDSTAVLKEESHDKEALLEEPTDNVSAVQSPKPV 191
P +TS + P E + ED S A D + D+ + K
Sbjct: 143 IPTATSLTQMNSDPSHGRDESTLLNEDASAAEQMLALRDNSGPRNKACDSAVVKEPRKIA 202
Query: 192 TYSSRKRSAGDLCPRGCVTHRHMPLRRSRISSRAQNFVLPCSDCGKSTGNPSTNEGLSSS 251
TYSSRKR+ G + P++RS+ SR Q L S S G + ++ +
Sbjct: 203 TYSSRKRNGGVRSQNCAPQNETCPVQRSKSPSRLQTEKLQSSMLQNSDGGQTIDDVEDGA 262
Query: 252 VKRNTSVRKSPD-SGCNDFDSSAFVSNGSMEDKGSGILTIDSDAFSLNEGSTMDSNFKLE 310
++R +R+S S +D +S+ S+GS E+ S I T++SD + NEG+ +DS K+E
Sbjct: 263 LRREKRIRRSSGHSESDDVATSSLNSHGSDEENASEIATVESDN-NRNEGNGVDSGSKVE 321
Query: 311 HSDT-IECLE-KVELNKALDLDINTVIHXXXXXXXXXXVTNDASKPTGRLE-----EACV 363
D + LE +LNK L+ IN ++ T+D P ++E EA
Sbjct: 322 QIDIGGKFLEGDYDLNKGLNFQINIMVKRKKRKPTRKRGTSDVVDPQAKVEGEAVPEAGA 381
Query: 364 QNTSQSSQNICGNSEQRGFEQDGDEHLPLVKRARVRMGKSSSMEAELNSVLQAQEKSCKE 423
+N Q+SQN +R E++GDEHLPLVKRARVRM ++ E NS LQA+E+S K+
Sbjct: 382 RNNVQTSQNSHEKFTERPCEENGDEHLPLVKRARVRMSRAFYGNHEANSSLQAEERSPKD 441
Query: 424 DINS 427
+ S
Sbjct: 442 TVVS 445
Score = 73.9 bits (180), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 47/133 (35%), Positives = 61/133 (45%), Gaps = 15/133 (11%)
Query: 1241 QFSFVHEERHMNHRREVXXXXXXXSYPNRQHFVENMERENFYNNHERIKPPPFDHRDRWN 1300
QFSF + R+ SY +R H+V N + NF++NHER++ PF++RD W
Sbjct: 1246 QFSFREPGHVLKSHRDAP------SYSHRSHYVPNCDERNFHDNHERMRHAPFENRDNWR 1299
Query: 1301 G-PGPYPGPRYQDNGAPPPYGCHPYESPRMPGHGWRFPPRSGNHNHRSSAPFRPPFEDAI 1359
P G RYQD PY S G W PPR +N+R S +P E
Sbjct: 1300 YPPSSSYGSRYQDEHKA------PYPSSSYNGVRWDNPPR--QYNNRPSFHPKPHSEGPA 1351
Query: 1360 PVANRGPNFWRPR 1372
PV R P W R
Sbjct: 1352 PVGMRDPGMWHQR 1364
>AT5G23150.1 | Symbols: HUA2 | Tudor/PWWP/MBT domain-containing
protein | chr5:7786173-7792080 FORWARD LENGTH=1392
Length = 1392
Score = 244 bits (623), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 139/267 (52%), Positives = 171/267 (64%), Gaps = 11/267 (4%)
Query: 824 STEAARAALKYFQATLATLTRTKESIGRATRIAIDCAKFGLAAKVMESLAHNLE-ESSLQ 882
STEAA + F+ L TL+RT+ESIGRATR+AIDCAK+GLA++V+E L LE ES
Sbjct: 769 STEAA-ISRDAFEGMLETLSRTRESIGRATRLAIDCAKYGLASEVVELLIRKLESESHFH 827
Query: 883 RRVDLFFLVDSIAQ-SRGLKGDVCGVYPSAIQAVLPRLLSAAAPPGNTAQENRRQCRKVL 941
R+VDLFFLVDSI Q S KG Y +QA LPRLL AAAPPG A +NRR+C KVL
Sbjct: 828 RKVDLFFLVDSITQHSHSQKGIAGASYVPTVQAALPRLLGAAAPPGTGASDNRRKCLKVL 887
Query: 942 KVWLERKILPESLIRHHIRELDLRSSSAFAGPFSRRTSRTERALDDPIRDMEGMLVDEYG 1001
K+WLERK+ PESL+R +I ++ A G RR SR+ERA+DDPIR+MEGMLVDEYG
Sbjct: 888 KLWLERKVFPESLLRRYIDDIRASGDDATGGFSLRRPSRSERAVDDPIREMEGMLVDEYG 947
Query: 1002 SNSSFQLPGFCMPRMLKXXXX-----XXXXXXXXXNFEAVTPEHDSEVQEMVSTIEKHRH 1056
SN++FQLPGF + E V+ D E+ + S +K
Sbjct: 948 SNATFQLPGFFSSHNFEDDEEDDDLPTSQKEKSTSAGERVSALDDLEIHDTSS--DKCHR 1005
Query: 1057 ILEDVDGELEMEDVAPS-CDVEMNSFC 1082
+LEDVD ELEMEDV+ DV +SFC
Sbjct: 1006 VLEDVDHELEMEDVSGQRKDVAPSSFC 1032
Score = 106 bits (264), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 49/90 (54%), Positives = 63/90 (70%), Gaps = 1/90 (1%)
Query: 24 VGDLVLAKVKGFPAWPATVSEPEKWGYSTDWKKVLVYFFGTQQIAFCNPADVEAFTEEKK 83
+GDLVLAKVKGFPAWPA +S PE W + D KK V FFGT++IAF P D++AFT E K
Sbjct: 20 LGDLVLAKVKGFPAWPAKISRPEDWDRAPDPKKYFVQFFGTEEIAFVAPPDIQAFTSEAK 79
Query: 84 QSLLLKRQGKGAD-FVRAVQEIVDSYEKLK 112
LL + QGK F +AV++I ++E L+
Sbjct: 80 SKLLARCQGKTVKYFAQAVEQICTAFEGLQ 109
>AT5G08230.1 | Symbols: | Tudor/PWWP/MBT domain-containing protein |
chr5:2643846-2649788 REVERSE LENGTH=1445
Length = 1445
Score = 219 bits (558), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 127/273 (46%), Positives = 160/273 (58%), Gaps = 16/273 (5%)
Query: 819 RSTGKSTEAARAAL---KYFQATLATLTRTKESIGRATRIAIDCAKFGLAAKVMESLAHN 875
RS G S A A F+ + TL+RTKESI RATR+AIDCAK+G+A +V+E L
Sbjct: 827 RSVGGSLSGATEAAISRDTFEGMIETLSRTKESIRRATRVAIDCAKYGIANEVVELLIRK 886
Query: 876 LE-ESSLQRRVDLFFLVDSIAQS-RGLKGDVCGVYPSAIQAVLPRLLSAAAPPGNTAQEN 933
LE E R+VDLFFL+DSI QS KG +Y +QA LPRLL AAAPPG A+EN
Sbjct: 887 LEIEPHFPRKVDLFFLLDSIIQSSHSQKGRARSLYIPTVQAALPRLLGAAAPPGTGAREN 946
Query: 934 RRQCRKVLKVWLERKILPESLIRHHIRELDLRSSSAFAGPFSRRTSRTERALDDPIRDME 993
R QCRKVL++WL+RKI P+ L+R +I +L G RR SR+ERA+DDP+RDME
Sbjct: 947 RHQCRKVLRLWLKRKIFPDFLLRRYIGDLGASGDDKTVGFSLRRPSRSERAVDDPLRDME 1006
Query: 994 GMLVDEYGSNSSFQLPGFCMPRML---KXXXXXXXXXXXXXNFEAVTPEHDSEVQEMVST 1050
GMLVDEYGSN++FQLPG+ + V H E +
Sbjct: 1007 GMLVDEYGSNANFQLPGYLASLTFGDDEEEDLPSTSQEVKNTHMEVKITHMEEPVLALGK 1066
Query: 1051 IEKHR------HILEDVDGELEMEDVAPSCDVE 1077
+E H H + DV+G LEMED SC ++
Sbjct: 1067 LEAHDSSSDKPHCVVDVNGGLEMEDA--SCQLK 1097
Score = 102 bits (253), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 54/125 (43%), Positives = 73/125 (58%), Gaps = 5/125 (4%)
Query: 22 WKVGDLVLAKVKGFPAWPATVSEPEKWGYSTDWKKVLVYFFGTQQIAFCNPADVEAFTEE 81
++GDLVLAKVKGFPAWPA + +PE W + D KK V F+GT +I F P D++ FT E
Sbjct: 18 MRLGDLVLAKVKGFPAWPAKIGQPEDWNQAPDPKKHFVQFYGTGEIGFVTPPDIQPFTSE 77
Query: 82 KKQSLLLKRQGKGAD-FVRAVQEIVDSYEKLKKESQLDEASLGGNIVDANVSNPVSTSAR 140
K+ L + QGK F +AV+EI ++E ESQ ++ + GN N P T +
Sbjct: 78 TKKKLSARCQGKTVKYFSQAVEEISAAFE----ESQKQKSDIVGNEALLNAVEPSVTKPK 133
Query: 141 DLTDA 145
L A
Sbjct: 134 YLNQA 138