Miyakogusa Predicted Gene
- Lj0g3v0103829.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0103829.1 tr|D8RN79|D8RN79_SELML DNA-directed RNA
polymerase OS=Selaginella moellendorffii
GN=SELMODRAFT_41306,30.46,9e-19,RNA_pol_Rpb1_5,RNA polymerase Rpb1,
domain 5; DNA-DIRECTED RNA POLYMERASE,NULL; beta and beta-prime
,CUFF.5895.1
(339 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT2G40030.1 | Symbols: NRPD1B, DRD3, ATNRPD1B, DMS5, NRPE1 | nuc... 374 e-104
AT1G63020.2 | Symbols: NRPD1A | nuclear RNA polymerase D1A | chr... 84 2e-16
AT1G63020.1 | Symbols: NRPD1A, POL IVA, SDE4, NRPD1, SMD2 | nucl... 84 2e-16
AT4G35800.1 | Symbols: NRPB1, RPB1, RNA_POL_II_LSRNA_POL_II_LS, ... 72 5e-13
AT5G60040.1 | Symbols: NRPC1 | nuclear RNA polymerase C1 | chr5:... 58 1e-08
AT5G60040.2 | Symbols: NRPC1 | nuclear RNA polymerase C1 | chr5:... 58 1e-08
>AT2G40030.1 | Symbols: NRPD1B, DRD3, ATNRPD1B, DMS5, NRPE1 |
nuclear RNA polymerase D1B | chr2:16715089-16723406
FORWARD LENGTH=1976
Length = 1976
Score = 374 bits (961), Expect = e-104, Method: Compositional matrix adjust.
Identities = 190/341 (55%), Positives = 255/341 (74%), Gaps = 7/341 (2%)
Query: 1 MESIFSNGFSVGLHDFSISGA-VKRVTDRNIGKVSPLLYQLRFIYNELVAQQLEKHMQDI 59
MES+F+ GFS+ L D S+S A + + + I ++SP++ +LR Y + + QLE + +
Sbjct: 609 MESLFAEGFSLSLEDLSMSRADMDVIHNLIIREISPMVSRLRLSYRDEL--QLENSIHKV 666
Query: 60 ELPAIKFVSKSSRLGDMIDSKSKSALDKVTQQIGFLGQQLFVRGRLYSKGLLEDVASHFK 119
+ A F+ KS + ++ID KS SA+ K+ QQ GFLG QL + + Y+K L+ED+A K
Sbjct: 667 KEVAANFMLKSYSIRNLIDIKSNSAITKLVQQTGFLGLQLSDKKKFYTKTLVEDMAIFCK 726
Query: 120 LKCDYDGDGYPSAEYGLLRGCFFHGLDPFEELVHSISTREIMVRSSRGLSEPGTLFKNLM 179
K G S ++G+++GCFFHGLDP+EE+ HSI+ RE++VRSSRGL+EPGTLFKNLM
Sbjct: 727 RKY---GRISSSGDFGIVKGCFFHGLDPYEEMAHSIAAREVIVRSSRGLAEPGTLFKNLM 783
Query: 180 AILRDVVVCYDGTVRNVCSNSIIQFEYGVKAGDKTRYLFPAGEPVGVLAATSMSNPAYKA 239
A+LRD+V+ DGTVRN CSNS+IQF+YGV + + LF AGEPVGVLAAT+MSNPAYKA
Sbjct: 784 AVLRDIVITNDGTVRNTCSNSVIQFKYGVDSERGHQGLFEAGEPVGVLAATAMSNPAYKA 843
Query: 240 VLDASPSSSCSWELMKEILLCKVNFRNEPIDRRVILYLNDCDCGRRYCRENAAYIVKNQL 299
VLD+SP+S+ SWELMKE+LLCKVNF+N DRRVILYLN+C CG+R+C+ENAA V+N+L
Sbjct: 844 VLDSSPNSNSSWELMKEVLLCKVNFQNTTNDRRVILYLNECHCGKRFCQENAACTVRNKL 903
Query: 300 RKVSLKDTAVDFIVEYQQQRK-REGSETDAGLVGHIHLDEV 339
KVSLKDTAV+F+VEY++Q E D+ L GHIHL++
Sbjct: 904 NKVSLKDTAVEFLVEYRKQPTISEIFGIDSCLHGHIHLNKT 944
>AT1G63020.2 | Symbols: NRPD1A | nuclear RNA polymerase D1A |
chr1:23355329-23361126 REVERSE LENGTH=1453
Length = 1453
Score = 83.6 bits (205), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 80/310 (25%), Positives = 135/310 (43%), Gaps = 34/310 (10%)
Query: 45 NELVAQQLEKHMQDIELPAIKFVSKSSRLGDMIDSKSKSALDKVTQQIGFLGQQ------ 98
+EL + +D++ A ++ +S+ M + SK + K+ Q +G Q
Sbjct: 697 SELAVSAFKDAYRDVQALAYRYGDQSNSFLIMSKAGSKGNIGKLVQHSMCIGLQNSAVSL 756
Query: 99 LFVRGRLYSKGLLEDVASHFKLKCDYDGDGYPS-AEYGLLRGCFFHGLDPFEELVHSIST 157
F R + D S + D S YG++ F GL+P E VHS+++
Sbjct: 757 SFGFPRELTCAAWNDPNSPLRGAKGKDSTTTESYVPYGVIENSFLTGLNPLESFVHSVTS 816
Query: 158 REIMVRSSRGLSEPGTLFKNLMAILRDVVVCYDGTVRNVCSNSIIQFEYGVKAGDKTRYL 217
R+ + L PGTL + LM +RD+ YDGTVRN N ++QF Y +
Sbjct: 817 RDSSFSGNADL--PGTLSRRLMFFMRDIYAAYDGTVRNSFGNQLVQFTYETDGPVEDI-- 872
Query: 218 FPAGEPVGVLAATSMSNPAYKA------VLDASPSSSCSWELMKEILLCKVNFRNEPIDR 271
GE +G L+A ++S AY A +L+ SP + +K +L C + ++
Sbjct: 873 --TGEALGSLSACALSEAAYSALDQPISLLETSPLLN-----LKNVLEC--GSKKGQREQ 923
Query: 272 RVILYLNDCDCGRRYCRENAAYIVKNQLRKVSLKDTAVDFIVEYQQQRKREGSETDAGL- 330
+ LYL++ +++ E + +KN L K+S + ++ + S T L
Sbjct: 924 TMSLYLSEYLSKKKHGFEYGSLEIKNHLEKLSFSEIVSTSMIIFS-----PSSNTKVPLS 978
Query: 331 --VGHIHLDE 338
V H H+ E
Sbjct: 979 PWVCHFHISE 988
>AT1G63020.1 | Symbols: NRPD1A, POL IVA, SDE4, NRPD1, SMD2 | nuclear
RNA polymerase D1A | chr1:23355329-23361126 REVERSE
LENGTH=1453
Length = 1453
Score = 83.6 bits (205), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 80/310 (25%), Positives = 135/310 (43%), Gaps = 34/310 (10%)
Query: 45 NELVAQQLEKHMQDIELPAIKFVSKSSRLGDMIDSKSKSALDKVTQQIGFLGQQ------ 98
+EL + +D++ A ++ +S+ M + SK + K+ Q +G Q
Sbjct: 697 SELAVSAFKDAYRDVQALAYRYGDQSNSFLIMSKAGSKGNIGKLVQHSMCIGLQNSAVSL 756
Query: 99 LFVRGRLYSKGLLEDVASHFKLKCDYDGDGYPS-AEYGLLRGCFFHGLDPFEELVHSIST 157
F R + D S + D S YG++ F GL+P E VHS+++
Sbjct: 757 SFGFPRELTCAAWNDPNSPLRGAKGKDSTTTESYVPYGVIENSFLTGLNPLESFVHSVTS 816
Query: 158 REIMVRSSRGLSEPGTLFKNLMAILRDVVVCYDGTVRNVCSNSIIQFEYGVKAGDKTRYL 217
R+ + L PGTL + LM +RD+ YDGTVRN N ++QF Y +
Sbjct: 817 RDSSFSGNADL--PGTLSRRLMFFMRDIYAAYDGTVRNSFGNQLVQFTYETDGPVEDI-- 872
Query: 218 FPAGEPVGVLAATSMSNPAYKA------VLDASPSSSCSWELMKEILLCKVNFRNEPIDR 271
GE +G L+A ++S AY A +L+ SP + +K +L C + ++
Sbjct: 873 --TGEALGSLSACALSEAAYSALDQPISLLETSPLLN-----LKNVLEC--GSKKGQREQ 923
Query: 272 RVILYLNDCDCGRRYCRENAAYIVKNQLRKVSLKDTAVDFIVEYQQQRKREGSETDAGL- 330
+ LYL++ +++ E + +KN L K+S + ++ + S T L
Sbjct: 924 TMSLYLSEYLSKKKHGFEYGSLEIKNHLEKLSFSEIVSTSMIIFS-----PSSNTKVPLS 978
Query: 331 --VGHIHLDE 338
V H H+ E
Sbjct: 979 PWVCHFHISE 988
>AT4G35800.1 | Symbols: NRPB1, RPB1, RNA_POL_II_LSRNA_POL_II_LS,
RNA_POL_II_LS | RNA polymerase II large subunit |
chr4:16961115-16967892 REVERSE LENGTH=1839
Length = 1839
Score = 72.4 bits (176), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 58/221 (26%), Positives = 95/221 (42%), Gaps = 25/221 (11%)
Query: 4 IFSNGFSVGLHDFSISGAVKRVTDRNIGK----VSPLLYQ-------------LRFIYNE 46
+ NGF++G+ D + + I V L+ Q +R +
Sbjct: 676 LLQNGFTIGIGDTIADSSTMEKINETISNAKTAVKDLIRQFQGKELDPEPGRTMRDTFEN 735
Query: 47 LVAQQLEKHMQDIELPAIKFVSKSSRLGDMIDSKSKSALDKVTQQIGFLGQQLFVRGRLY 106
V Q L K D A K +++++ L M+ + SK + ++Q +GQQ V G+
Sbjct: 736 RVNQVLNKARDDAGSSAQKSLAETNNLKAMVTAGSKGSFINISQMTACVGQQ-NVEGKRI 794
Query: 107 SKGLLEDVASHFKLKCDYDGDGYPSAEYGLLRGCFFHGLDPFEELVHSISTREIMVRSSR 166
G HF D Y G + + GL P E H++ RE ++ ++
Sbjct: 795 PFGFDGRTLPHFT------KDDYGPESRGFVENSYLRGLTPQEFFFHAMGGREGLIDTAV 848
Query: 167 GLSEPGTLFKNLMAILRDVVVCYDGTVRNVCSNSIIQFEYG 207
SE G + + L+ + D++V YDGTVRN + +IQF YG
Sbjct: 849 KTSETGYIQRRLVKAMEDIMVKYDGTVRNSLGD-VIQFLYG 888
>AT5G60040.1 | Symbols: NRPC1 | nuclear RNA polymerase C1 |
chr5:24173590-24183269 FORWARD LENGTH=1376
Length = 1376
Score = 57.8 bits (138), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 40/127 (31%), Positives = 62/127 (48%), Gaps = 8/127 (6%)
Query: 81 SKSALDKVTQQIGFLGQQLFVRGRLYSKGLLEDVASHFKLKCDYDGDGYPSAEYGLLRGC 140
SK + ++Q + +GQQ V G G ++ HF P+A+ G +
Sbjct: 786 SKGSPINISQMVACVGQQT-VNGHRAPDGFIDRSLPHFPRMSKS-----PAAK-GFVANS 838
Query: 141 FFHGLDPFEELVHSISTREIMVRSSRGLSEPGTLFKNLMAILRDVVVCYDGTVRNVCSNS 200
F+ GL E H++ RE +V ++ + G + + LM L D++V YD TVRN S
Sbjct: 839 FYSGLTATEFFFHTMGGREGLVDTAVKTASTGYMSRRLMKALEDLLVHYDNTVRNA-SGC 897
Query: 201 IIQFEYG 207
I+QF YG
Sbjct: 898 ILQFTYG 904
>AT5G60040.2 | Symbols: NRPC1 | nuclear RNA polymerase C1 |
chr5:24173590-24183269 FORWARD LENGTH=1391
Length = 1391
Score = 57.8 bits (138), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 40/127 (31%), Positives = 62/127 (48%), Gaps = 8/127 (6%)
Query: 81 SKSALDKVTQQIGFLGQQLFVRGRLYSKGLLEDVASHFKLKCDYDGDGYPSAEYGLLRGC 140
SK + ++Q + +GQQ V G G ++ HF P+A+ G +
Sbjct: 803 SKGSPINISQMVACVGQQT-VNGHRAPDGFIDRSLPHFPRMSKS-----PAAK-GFVANS 855
Query: 141 FFHGLDPFEELVHSISTREIMVRSSRGLSEPGTLFKNLMAILRDVVVCYDGTVRNVCSNS 200
F+ GL E H++ RE +V ++ + G + + LM L D++V YD TVRN S
Sbjct: 856 FYSGLTATEFFFHTMGGREGLVDTAVKTASTGYMSRRLMKALEDLLVHYDNTVRNA-SGC 914
Query: 201 IIQFEYG 207
I+QF YG
Sbjct: 915 ILQFTYG 921