Miyakogusa Predicted Gene - Lj1g3v3891890.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v3891890.1 tr|G7L125|G7L125_MEDTR Regulation of nuclear
pre-mRNA domain-containing protein 1B OS=Medicago
trunc,76.12,0,ENTH/VHS domain,ENTH/VHS; CID,RNA polymerase II, large
subunit, CTD; no description,RNA polymerase
I,NODE_57586_length_2212_cov_73.393761.path2.1
(513 letters)
Database: Medicago_aa4.0v1
62,319 sequences; 21,947,249 total letters
                                                                       Score    E
Sequences producing significant alignments:                           (bits) Value

Medtr7g026740.1 | regulation of nuclear pre-mRNA domain protein ...     691   0.0
Medtr3g087840.1 | ENTH/VHS family protein | HC | chr3:39807660-3...     233   4e-61
Medtr6g027180.1 | RNA polymerase II-binding domain protein | HC ...     223   4e-58
Medtr6g027180.2 | RNA polymerase II-binding domain protein | HC ...     223   4e-58
Medtr7g087450.1 | RNA polymerase II-binding domain protein | HC ...     206   4e-53
Medtr7g087450.2 | RNA polymerase II-binding domain protein | HC ...      74   5e-13
>Medtr7g026740.1 | regulation of nuclear pre-mRNA domain protein |
HC | chr7:8884185-8894171 | 20130731
Length = 527
Score = 691 bits (1782), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/536 (67%), Positives = 398/536 (74%), Gaps = 32/536 (5%)
Query: 1 MGSTFNAQILVDKLTKLNGSQASIETLSHWCIFHMNKAKQVVETWARQFHSSPREKRLAF 60
MGSTFN QILV+KL KLN SQ SIETLSHWCIFHMNKAKQVVETWA+QFHSSPREK+LAF
Sbjct: 1 MGSTFNPQILVEKLAKLNSSQTSIETLSHWCIFHMNKAKQVVETWAKQFHSSPREKKLAF 60
Query: 61 LYLANDILQNSRRKGSEFVGEFWKVLPGALRDVIENGDEFAKNAALRLIGIWEERKVFGS 120
L+LANDILQNSRRKGSEFVGEFWKVLP +LRDVI+NGD+ A+N A RLIGIW+ERKVFGS
Sbjct: 61 LFLANDILQNSRRKGSEFVGEFWKVLPDSLRDVIQNGDDNARNQARRLIGIWDERKVFGS 120
Query: 121 RGQILKEGIVGKNVENNSRD----------AKPMNMKLRPSAGDALEKIVSGYRVIYGGQ 170
RGQILKE VG++ ENN+RD KPMN+KLRPSAG+AL++IVSGY+ IYGGQ
Sbjct: 121 RGQILKEEFVGRHAENNNRDVKPMNAKPTNVKPMNVKLRPSAGNALDRIVSGYQYIYGGQ 180
Query: 171 TDEDAVLSKCRNAISFLEKADKEIGHDSDSGK---------LQGHNATLKDCIERLTSIE 221
TDEDAVLSKC+NAIS LEK DKEI HDS+SGK LQGHN LKDCI++LT+IE
Sbjct: 181 TDEDAVLSKCQNAISSLEKVDKEIDHDSNSGKFHGPAVVNELQGHNGILKDCIDQLTAIE 240
Query: 222 SSRASLVSHLREALQDQEFKLGQIRCQIQAARVQSEQAGG---QLLNGNNVQPITEQSSK 278
SSRASLVSHLREALQDQEFKLGQ+R QIQAARVQ E + QLLNGNN+Q + EQSSK
Sbjct: 241 SSRASLVSHLREALQDQEFKLGQVRSQIQAARVQWEHSNNTCQQLLNGNNIQSLVEQSSK 300
Query: 279 EIQTSMAPASFVSAGDREQSAPLMYAPQVSFAQNSGPNEEDPRXXXXXXXXXXXXXXXXX 338
EIQTSM PASF+S G EQSAPLMY+PQV F+QNSG +EEDPR
Sbjct: 301 EIQTSMTPASFISGG--EQSAPLMYSPQVMFSQNSGHSEEDPRKSAAAAVAAKLTASTSS 358
Query: 339 XQMLSYVLSSLASEGVIGNPIRDSSADYQPEKRAKLENDXXXXXXXXXXXXXXXXXXXXX 398
QMLSYVLSSLASEGVIGN + SSADY EKR KLEND
Sbjct: 359 AQMLSYVLSSLASEGVIGNQMTGSSADYHAEKRTKLENDQSYVPSQNPQQPLPPFS---- 414
Query: 399 XXILHNAASTTNQQSTPNEXXXXXXXXXXXXXXXXXMA-QYPVPQYMQTSGSVNNMTYSY 457
L T+NQQSTPNE QYPVPQ+MQ GSVNNM YSY
Sbjct: 415 ---LSEPTQTSNQQSTPNEPPPPPSSSPPPLPPLPPTPQQYPVPQFMQNVGSVNNMAYSY 471
Query: 458 SVIQQPSMAAFPAVGPSLNNASSFAPPMNAYQGFQGPDGNYYNHPSSMPMTPISRQ 513
V+QQPSMA +PAVG S+NN S + P MNAYQGFQGPDGNYYN PSSMPM PISRQ
Sbjct: 472 GVMQQPSMATYPAVGVSMNNISPYTPQMNAYQGFQGPDGNYYNQPSSMPMVPISRQ 527
>Medtr3g087840.1 | ENTH/VHS family protein | HC |
chr3:39807660-39811898 | 20130731
Length = 533
Score = 233 bits (593), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 124/288 (43%), Positives = 168/288 (58%), Gaps = 34/288 (11%)
Query: 5 FNAQILVDKLTKLNGSQASIETLSHWCIFHMNKAKQVVETWARQFHSSPREKRLAFLYLA 64
F IL DKL KLN +Q IETLSHWCIFH + A+QVV TW QF S R+ L+LA
Sbjct: 7 FKEAILADKLAKLNSTQQCIETLSHWCIFHRSNAEQVVATWKNQFDKSDMIHRIPLLWLA 66
Query: 65 NDILQNSRRKGSEFVGEFWKVLPGALRDVIENGDEFAKNAALRLIGIWEERKVFGSRGQI 124
NDILQNS+R G EFV EFWKVLP AL+DVI D+ K A RL +WE+R VFGSR
Sbjct: 67 NDILQNSKRNGKEFVIEFWKVLPAALKDVIAKDDDRGKRAVSRLFEVWEQRNVFGSRVPN 126
Query: 125 LKEGIVGK----------------------NVENNSRDAKPMNMKLRPSAGDALEKIVSG 162
LK+ ++G+ +V+ RD++ + KL S G EKIVS
Sbjct: 127 LKDAMLGEGSPPPLEFGKKRPRSARIMKRDSVKILKRDSRSIKSKL--SIGGTTEKIVSA 184
Query: 163 YRVIYGGQTDEDAVLSKCRNAISFLEKADKEIGHDSDSGK----------LQGHNATLKD 212
+ ++ G Q +EDA +SKC++A+ + K +K++ + K L+ + LK
Sbjct: 185 FHLVLGEQANEDAEMSKCKSAVQRVRKIEKDVDIACATAKDPTRKTLRKELEEQQSLLKH 244
Query: 213 CIERLTSIESSRASLVSHLREALQDQEFKLGQIRCQIQAARVQSEQAG 260
CIE+L +E +R +LVS L+EAL +QE L +R Q+Q A+ Q E+A
Sbjct: 245 CIEKLKLVEENRVALVSQLKEALHEQESDLENVRTQMQVAQAQIEEAS 292
>Medtr6g027180.1 | RNA polymerase II-binding domain protein | HC |
chr6:9201234-9212785 | 20130731
Length = 542
Score = 223 bits (568), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 150/397 (37%), Positives = 207/397 (52%), Gaps = 42/397 (10%)
Query: 5 FNAQILVDKLTKLNGSQASIETLSHWCIFHMNKAKQVVETWARQFHSSPREKRLAFLYLA 64
++ Q+L +KL KLN SQ SIE++S C+ H +AK +VETW + F SS +E+R+ FL LA
Sbjct: 6 YDGQVLAEKLRKLNNSQQSIESVSRLCVSHRKRAKDIVETWNKSFGSSQKEQRVPFLNLA 65
Query: 65 NDILQNSRRKGSEFVGEFWKVLPGALRDVIENGDEFAKNAALRLIGIWEERKVFGSRGQI 124
NDILQNSRRKGSEFV EFWKVLP AL+ V + DE K + +RLI IWEERKVFGSR Q
Sbjct: 66 NDILQNSRRKGSEFVNEFWKVLPSALKRVYAS-DEPGKKSVIRLIDIWEERKVFGSRSQG 124
Query: 125 LKEGIVGKNVENNS-------RDAKPMNMKLRPSAGDALEKIVSGYRVIYGGQTDEDAVL 177
LKE I GK+ NS RDA + +KL + G EKI++ + +E+A L
Sbjct: 125 LKEEITGKSNGKNSNPIKIAKRDAHSLRLKL--AIGCLPEKIITALHFVNEEHPNEEASL 182
Query: 178 SKCRNAISFLEKADKEIGHDSDSGK---------LQGHNATLKDCIERLTSIESSRASLV 228
+KC A+S L K +++ + G LQ L I +L + E+ RA+L+
Sbjct: 183 NKCSAALSQLGKLVEDVENTLSQGNQPGSTLVNDLQQQEKELTQNIVQLENTEAIRATLL 242
Query: 229 SHLREALQDQEFKLGQIRCQIQAARVQSEQAGG---QLLNGNNVQPITEQSSKEIQTSMA 285
S L+EALQ+QE + + ++QAAR EQ + + EQ++ +Q +
Sbjct: 243 SQLKEALQEQESRQELVHSRLQAARDHIEQVASIRKRFSQAPETTRVPEQTTPSVQLNST 302
Query: 286 PASFVSAGDREQSAPLMYAPQVSFAQNSGPNEEDPRXXXXXXXXXXXXXXXXXXQMLSYV 345
P P P +SFA ED + ML+ +
Sbjct: 303 PP-----------PPSFTQPTMSFAPLQ--TTEDDKKAAAAAVAARLTASTSSALMLTSI 349
Query: 346 LSSLASEGVIGNPIRDSSAD-------YQPEKRAKLE 375
LSSL +E +SA+ + PEKR KLE
Sbjct: 350 LSSLVAEEAASKNGSLNSAEFNSAPPGFHPEKRPKLE 386
>Medtr6g027180.2 | RNA polymerase II-binding domain protein | HC |
chr6:9202019-9212786 | 20130731
Length = 542
Score = 223 bits (568), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 150/397 (37%), Positives = 207/397 (52%), Gaps = 42/397 (10%)
Query: 5 FNAQILVDKLTKLNGSQASIETLSHWCIFHMNKAKQVVETWARQFHSSPREKRLAFLYLA 64
++ Q+L +KL KLN SQ SIE++S C+ H +AK +VETW + F SS +E+R+ FL LA
Sbjct: 6 YDGQVLAEKLRKLNNSQQSIESVSRLCVSHRKRAKDIVETWNKSFGSSQKEQRVPFLNLA 65
Query: 65 NDILQNSRRKGSEFVGEFWKVLPGALRDVIENGDEFAKNAALRLIGIWEERKVFGSRGQI 124
NDILQNSRRKGSEFV EFWKVLP AL+ V + DE K + +RLI IWEERKVFGSR Q
Sbjct: 66 NDILQNSRRKGSEFVNEFWKVLPSALKRVYAS-DEPGKKSVIRLIDIWEERKVFGSRSQG 124
Query: 125 LKEGIVGKNVENNS-------RDAKPMNMKLRPSAGDALEKIVSGYRVIYGGQTDEDAVL 177
LKE I GK+ NS RDA + +KL + G EKI++ + +E+A L
Sbjct: 125 LKEEITGKSNGKNSNPIKIAKRDAHSLRLKL--AIGCLPEKIITALHFVNEEHPNEEASL 182
Query: 178 SKCRNAISFLEKADKEIGHDSDSGK---------LQGHNATLKDCIERLTSIESSRASLV 228
+KC A+S L K +++ + G LQ L I +L + E+ RA+L+
Sbjct: 183 NKCSAALSQLGKLVEDVENTLSQGNQPGSTLVNDLQQQEKELTQNIVQLENTEAIRATLL 242
Query: 229 SHLREALQDQEFKLGQIRCQIQAARVQSEQAGG---QLLNGNNVQPITEQSSKEIQTSMA 285
S L+EALQ+QE + + ++QAAR EQ + + EQ++ +Q +
Sbjct: 243 SQLKEALQEQESRQELVHSRLQAARDHIEQVASIRKRFSQAPETTRVPEQTTPSVQLNST 302
Query: 286 PASFVSAGDREQSAPLMYAPQVSFAQNSGPNEEDPRXXXXXXXXXXXXXXXXXXQMLSYV 345
P P P +SFA ED + ML+ +
Sbjct: 303 PP-----------PPSFTQPTMSFAPLQ--TTEDDKKAAAAAVAARLTASTSSALMLTSI 349
Query: 346 LSSLASEGVIGNPIRDSSAD-------YQPEKRAKLE 375
LSSL +E +SA+ + PEKR KLE
Sbjct: 350 LSSLVAEEAASKNGSLNSAEFNSAPPGFHPEKRPKLE 386
>Medtr7g087450.1 | RNA polymerase II-binding domain protein | HC |
chr7:34034205-34026117 | 20130731
Length = 489
Score = 206 bits (525), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 134/372 (36%), Positives = 190/372 (51%), Gaps = 41/372 (11%)
Query: 27 LSHWCIFHMNKAKQVVETWARQFHSSPREKRLAFLYLANDILQNSRRKGSEFVGEFWKVL 86
LS WCI H +AK +VE W + F++S +E+R++FL LANDILQNSRRKGSEFV EFWKVL
Sbjct: 10 LSRWCIPHQKRAKDIVEIWDKLFNASQKEQRVSFLNLANDILQNSRRKGSEFVNEFWKVL 69
Query: 87 PGALRDVIENGDEFAKNAALRLIGIWEERKVFGSRGQILKEGIVGKN----VENNSRDAK 142
P ALR V E+GD + A RLI IWEERKVFGSR Q LK+ ++ KN NN + +
Sbjct: 70 PAALRHVYESGDVQGRKAVNRLIDIWEERKVFGSRSQGLKDEVMSKNPLPFSANNGKGSD 129
Query: 143 PMNM--------KLRPSAGDALEKIVSGYRVIYGGQTDEDAVLSKC----RNAISFLEKA 190
P+ + +++ + G EKI++ + + +E+A L+KC + + LE
Sbjct: 130 PIKIVKRDAHSVRIKLAVGSLPEKILTAFHSVLDEHLNEEAALNKCNAGVHDVVKLLEDV 189
Query: 191 DKEIGHDSDSG-----KLQGHNATLKDCIERLTSIESSRASLVSHLREALQDQEFKLGQI 245
+ + G LQ LK +E+L E++RASL+S L++ALQ+ E K +
Sbjct: 190 ENTFAQGNQLGSTLVNNLQEREKELKHYMEQLEHAEAARASLLSQLKDALQEHESKQEHV 249
Query: 246 RCQIQAARVQSEQAGG--QLLNGNNVQPITEQSSKEIQTSMAPASFVSAGDREQSAPLMY 303
R Q+ R Q E+ G + LN TE + +Q + S P
Sbjct: 250 RAQLLIVRGQIEKTAGIRKWLNQT-----TEATHPSVQL-----------NGTTSQPTCA 293
Query: 304 APQVSFAQNSGPNEEDPRXXXXXXXXXXXXXXXXXXQMLSYVLSSLASEGVIGNPIRDSS 363
P +SF+ E++ QML+ VLSSLA+E
Sbjct: 294 QPSMSFSPFQTSEEDN--KKAAAAVAAKLAGSSSSAQMLASVLSSLAAEEAASKGFSSGL 351
Query: 364 ADYQPEKRAKLE 375
+ PEKR K+E
Sbjct: 352 PIFNPEKRQKIE 363
>Medtr7g087450.2 | RNA polymerase II-binding domain protein | HC |
chr7:34032142-34026117 | 20130731
Length = 377
Score = 73.6 bits (179), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 72/250 (28%), Positives = 107/250 (42%), Gaps = 35/250 (14%)
Query: 139 RDAKPMNMKLRPSAGDALEKIVSGYRVIYGGQTDEDAVLSKC----RNAISFLEKADKEI 194
RDA + +KL + G EKI++ + + +E+A L+KC + + LE +
Sbjct: 24 RDAHSVRIKL--AVGSLPEKILTAFHSVLDEHLNEEAALNKCNAGVHDVVKLLEDVENTF 81
Query: 195 GHDSDSGK-----LQGHNATLKDCIERLTSIESSRASLVSHLREALQDQEFKLGQIRCQI 249
+ G LQ LK +E+L E++RASL+S L++ALQ+ E K +R Q+
Sbjct: 82 AQGNQLGSTLVNNLQEREKELKHYMEQLEHAEAARASLLSQLKDALQEHESKQEHVRAQL 141
Query: 250 QAARVQSEQAGG--QLLNGNNVQPITEQSSKEIQTSMAPASFVSAGDREQSAPLMYAPQV 307
R Q E+ G + LN TE + +Q + S P P +
Sbjct: 142 LIVRGQIEKTAGIRKWLNQT-----TEATHPSVQL-----------NGTTSQPTCAQPSM 185
Query: 308 SFA--QNSGPNEEDPRXXXXXXXXXXXXXXXXXXQMLSYVLSSLASEGVIGNPIRDSSAD 365
SF+ Q S EED QML+ VLSSLA+E
Sbjct: 186 SFSPFQTS---EED-NKKAAAAVAAKLAGSSSSAQMLASVLSSLAAEEAASKGFSSGLPI 241
Query: 366 YQPEKRAKLE 375
+ PEKR K+E
Sbjct: 242 FNPEKRQKIE 251