Miyakogusa Predicted Gene
- Lj0g3v0149989.2
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0149989.2 tr|G7KER5|G7KER5_MEDTR DNA-directed RNA
polymerase OS=Medicago truncatula GN=MTR_5g011000 PE=3
SV=1,72.8,0,seg,NULL; DNA-DIRECTED RNA POLYMERASE,NULL; beta and
beta-prime subunits of DNA dependent RNA-polyme,CUFF.9202.2
(1048 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G63020.2 | Symbols: NRPD1A | nuclear RNA polymerase D1A | chr... 998 0.0
AT1G63020.1 | Symbols: NRPD1A, POL IVA, SDE4, NRPD1, SMD2 | nucl... 998 0.0
AT2G40030.1 | Symbols: NRPD1B, DRD3, ATNRPD1B, DMS5, NRPE1 | nuc... 199 9e-51
AT4G35800.1 | Symbols: NRPB1, RPB1, RNA_POL_II_LSRNA_POL_II_LS, ... 108 1e-23
AT5G60040.1 | Symbols: NRPC1 | nuclear RNA polymerase C1 | chr5:... 67 7e-11
AT5G60040.2 | Symbols: NRPC1 | nuclear RNA polymerase C1 | chr5:... 67 8e-11
AT1G45230.1 | Symbols: | Protein of unknown function (DUF3223) ... 61 5e-09
AT3G46630.1 | Symbols: | Protein of unknown function (DUF3223) ... 59 2e-08
AT1G45230.2 | Symbols: | Protein of unknown function (DUF3223) ... 57 5e-08
>AT1G63020.2 | Symbols: NRPD1A | nuclear RNA polymerase D1A |
chr1:23355329-23361126 REVERSE LENGTH=1453
Length = 1453
Score = 998 bits (2581), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 521/1049 (49%), Positives = 701/1049 (66%), Gaps = 46/1049 (4%)
Query: 1 MISLSVRVLPISSVVSINPLCCSPLRGDFDGDCLHGYIPQSVAARVELNELVALDRQLIN 60
+I+++VR+LP +SVVS+NP+CC P RGDFDGDCLHGY+PQS+ A+VEL+ELVALD+QLIN
Sbjct: 420 LIAMTVRILPTTSVVSLNPICCLPFRGDFDGDCLHGYVPQSIQAKVELDELVALDKQLIN 479
Query: 61 GQSGRNLLSLSQDSLTAAYML-MEDGVLLNLYEIQQLQMLCDKKLTPPPSIIKA-PSRNN 118
Q+GRNLLSL QDSLTAAY++ +E LN ++QQLQM C +L PPP+IIKA PS
Sbjct: 480 RQNGRNLLSLGQDSLTAAYLVNVEKNCYLNRAQMQQLQMYCPFQL-PPPAIIKASPSSTE 538
Query: 119 SLWSGKQLFSMLLPSNFDYSFPPNGVSVSDGELTSSFESSGWLRDSECNIFQRLVERFQD 178
W+G QLF ML P FDY++P N V VS+GEL S E S WLRD E N +RL++ +
Sbjct: 539 PQWTGMQLFGMLFPPGFDYTYPLNNVVVSNGELLSFSEGSAWLRDGEGNFIERLLKHDKG 598
Query: 179 KTLNLLYDAQKVLCEWLSMTGFXXXXXXXXXXXXXCARENMMEEISYGLQEAEQACDFNQ 238
K L+++Y AQ++L +WL M G +R+N+ EEISYGL+EAEQ C+ Q
Sbjct: 599 KVLDIIYSAQEMLSQWLLMRGLSVSLADLYLSSDLQSRKNLTEEISYGLREAEQVCNKQQ 658
Query: 239 LLVDHYCDFLSGSLQDSDNVASIDMDSLNYEKHISAALSEVSVDAFRHMFRNIQSLVDKY 298
L+V+ + DFL+ + +D + + D+ YE+ SA LSE++V AF+ +R++Q+L +Y
Sbjct: 659 LMVESWRDFLAVNGEDKEEDSVSDLARFCYERQKSATLSELAVSAFKDAYRDVQALAYRY 718
Query: 299 ASKGNAFLTMFKAGSKGNLLKLVQHSMCLGLQHSLVRLSYRMPRELSCAAWN-------S 351
+ N+FL M KAGSKGN+ KLVQHSMC+GLQ+S V LS+ PREL+CAAWN
Sbjct: 719 GDQSNSFLIMSKAGSKGNIGKLVQHSMCIGLQNSAVSLSFGFPRELTCAAWNDPNSPLRG 778
Query: 352 EKGLNSMPMFSNTLKSIQCYIPHAVVESSFLTGLNPLECFAHSVANRDSSFSDNADLPGT 411
KG +S + + Y+P+ V+E+SFLTGLNPLE F HSV +RDSSFS NADLPGT
Sbjct: 779 AKGKDST--------TTESYVPYGVIENSFLTGLNPLESFVHSVTSRDSSFSGNADLPGT 830
Query: 412 LTRRLMFFMRDLYQAYDGTVRNLYGNQLIQFSYDTDKDSSCDSGFQEGTVGGEPVGALSA 471
L+RRLMFFMRD+Y AYDGTVRN +GNQL+QF+Y+TD + GE +G+LSA
Sbjct: 831 LSRRLMFFMRDIYAAYDGTVRNSFGNQLVQFTYETDGPVE--------DITGEALGSLSA 882
Query: 472 CAISEAAYSALGQPISLLETSPLLNLKNVLECGSRKKSGDQTVSLFLSDKLGKQRYGFEY 531
CA+SEAAYSAL QPISLLETSPLLNLKNVLECGS+K +QT+SL+LS+ L K+++GFEY
Sbjct: 883 CALSEAAYSALDQPISLLETSPLLNLKNVLECGSKKGQREQTMSLYLSEYLSKKKHGFEY 942
Query: 532 AALEVKNYLERVMFSDIVSTVMIMFTPQSSSLEIFNPWVCHFHLDKEIVARRKLTVHSVI 591
+LE+KN+LE++ FS+IVST MI+F+P S++ +PWVCHFH+ ++++ R++L+ SV+
Sbjct: 943 GSLEIKNHLEKLSFSEIVSTSMIIFSPSSNTKVPLSPWVCHFHISEKVLKRKQLSAESVV 1002
Query: 592 ESLYRRYESLTKESKVTFPNLKISSNRKCSKEGGYASLNKEKEDVDCISVTIVESSRSSA 651
SL +Y+S +E K+ +L I + CS + K+D CI+VT+VE+S+ S
Sbjct: 1003 SSLNEQYKSRNRELKLDIVDLDIQNTNHCSSDD-----QAMKDDNVCITVTVVEASKHSV 1057
Query: 652 -KLEAVRDLMIPFLLGTVIKGFLEIKKVDILWSNRSKVSNSYAGS-SGELYLRVTMSSDG 709
+L+A+R ++IPFLL + +KG IKKV+ILW++R K +GELYL+VTM D
Sbjct: 1058 LELDAIRLVLIPFLLDSPVKGDQGIKKVNILWTDRPKAPKRNGNHLAGELYLKVTMYGDR 1117
Query: 710 DSGRFWGVLINHCHRIMPMIDWTRSHPDNIHHFCSAYGIDAGRQYFLHSLASATTETGKS 769
W L+ C IM MIDW RSHPDNI CS YGIDAGR F+ +L SA ++TGK
Sbjct: 1118 GKRNCWTALLETCLPIMDMIDWGRSHPDNIRQCCSVYGIDAGRSIFVANLESAVSDTGKE 1177
Query: 770 ILPKHLHLVANSLSASGEFVGLNAKGIGRQRKHASVSSPFVQACFSNPGTSFIKAAKSGV 829
IL +HL LVA+SLS +GEFV LNAKG +QR+ S +PF QACFS+P F+KAAK GV
Sbjct: 1178 ILREHLLLVADSLSVTGEFVALNAKGWSKQRQVESTPAPFTQACFSSPSQCFLKAAKEGV 1237
Query: 830 LDDLQGCLDALAWGKCMSMGTSGQFDIMHSEKVEEFPESADVYSLLIANFDQLNDKVDIP 889
DDLQG +DALAWGK GT QF+I+ S KV F DVY LL ++ + P
Sbjct: 1238 RDDLQGSIDALAWGKVPGFGTGDQFEIIISPKVHGFTTPVDVYDLL-SSTKTMRRTNSAP 1296
Query: 890 HYHNRSSNKCDSEFSRKNGGYALKEYKQ------SKSFIRNFVTVNDIQKLAFESRSILS 943
+S F + + LK+ K S +R T +I+ L+ + IL
Sbjct: 1297 ----KSDKATVQPFGLLHSAF-LKDIKVLDGKGIPMSLLRTIFTWKNIELLSQSLKRILH 1351
Query: 944 RYSIDQVISDHDKITMLRVLHFHPRKNEKLGCGPADIKVGWHPVHKDSRCFHIIRSDESV 1003
Y I++++++ D+ + VL HP EK+G G I+V H DS CF ++R D +
Sbjct: 1352 SYEINELLNERDEGLVKMVLQLHPNSVEKIGPGVKGIRVA-KSKHGDSCCFEVVRIDGTF 1410
Query: 1004 EDFSYRKCILRALEIVDPGKFRIQKKKWL 1032
EDFSY KC+L A +I+ P K K K+L
Sbjct: 1411 EDFSYHKCVLGATKIIAPKKMNFYKSKYL 1439
>AT1G63020.1 | Symbols: NRPD1A, POL IVA, SDE4, NRPD1, SMD2 | nuclear
RNA polymerase D1A | chr1:23355329-23361126 REVERSE
LENGTH=1453
Length = 1453
Score = 998 bits (2581), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 521/1049 (49%), Positives = 701/1049 (66%), Gaps = 46/1049 (4%)
Query: 1 MISLSVRVLPISSVVSINPLCCSPLRGDFDGDCLHGYIPQSVAARVELNELVALDRQLIN 60
+I+++VR+LP +SVVS+NP+CC P RGDFDGDCLHGY+PQS+ A+VEL+ELVALD+QLIN
Sbjct: 420 LIAMTVRILPTTSVVSLNPICCLPFRGDFDGDCLHGYVPQSIQAKVELDELVALDKQLIN 479
Query: 61 GQSGRNLLSLSQDSLTAAYML-MEDGVLLNLYEIQQLQMLCDKKLTPPPSIIKA-PSRNN 118
Q+GRNLLSL QDSLTAAY++ +E LN ++QQLQM C +L PPP+IIKA PS
Sbjct: 480 RQNGRNLLSLGQDSLTAAYLVNVEKNCYLNRAQMQQLQMYCPFQL-PPPAIIKASPSSTE 538
Query: 119 SLWSGKQLFSMLLPSNFDYSFPPNGVSVSDGELTSSFESSGWLRDSECNIFQRLVERFQD 178
W+G QLF ML P FDY++P N V VS+GEL S E S WLRD E N +RL++ +
Sbjct: 539 PQWTGMQLFGMLFPPGFDYTYPLNNVVVSNGELLSFSEGSAWLRDGEGNFIERLLKHDKG 598
Query: 179 KTLNLLYDAQKVLCEWLSMTGFXXXXXXXXXXXXXCARENMMEEISYGLQEAEQACDFNQ 238
K L+++Y AQ++L +WL M G +R+N+ EEISYGL+EAEQ C+ Q
Sbjct: 599 KVLDIIYSAQEMLSQWLLMRGLSVSLADLYLSSDLQSRKNLTEEISYGLREAEQVCNKQQ 658
Query: 239 LLVDHYCDFLSGSLQDSDNVASIDMDSLNYEKHISAALSEVSVDAFRHMFRNIQSLVDKY 298
L+V+ + DFL+ + +D + + D+ YE+ SA LSE++V AF+ +R++Q+L +Y
Sbjct: 659 LMVESWRDFLAVNGEDKEEDSVSDLARFCYERQKSATLSELAVSAFKDAYRDVQALAYRY 718
Query: 299 ASKGNAFLTMFKAGSKGNLLKLVQHSMCLGLQHSLVRLSYRMPRELSCAAWN-------S 351
+ N+FL M KAGSKGN+ KLVQHSMC+GLQ+S V LS+ PREL+CAAWN
Sbjct: 719 GDQSNSFLIMSKAGSKGNIGKLVQHSMCIGLQNSAVSLSFGFPRELTCAAWNDPNSPLRG 778
Query: 352 EKGLNSMPMFSNTLKSIQCYIPHAVVESSFLTGLNPLECFAHSVANRDSSFSDNADLPGT 411
KG +S + + Y+P+ V+E+SFLTGLNPLE F HSV +RDSSFS NADLPGT
Sbjct: 779 AKGKDST--------TTESYVPYGVIENSFLTGLNPLESFVHSVTSRDSSFSGNADLPGT 830
Query: 412 LTRRLMFFMRDLYQAYDGTVRNLYGNQLIQFSYDTDKDSSCDSGFQEGTVGGEPVGALSA 471
L+RRLMFFMRD+Y AYDGTVRN +GNQL+QF+Y+TD + GE +G+LSA
Sbjct: 831 LSRRLMFFMRDIYAAYDGTVRNSFGNQLVQFTYETDGPVE--------DITGEALGSLSA 882
Query: 472 CAISEAAYSALGQPISLLETSPLLNLKNVLECGSRKKSGDQTVSLFLSDKLGKQRYGFEY 531
CA+SEAAYSAL QPISLLETSPLLNLKNVLECGS+K +QT+SL+LS+ L K+++GFEY
Sbjct: 883 CALSEAAYSALDQPISLLETSPLLNLKNVLECGSKKGQREQTMSLYLSEYLSKKKHGFEY 942
Query: 532 AALEVKNYLERVMFSDIVSTVMIMFTPQSSSLEIFNPWVCHFHLDKEIVARRKLTVHSVI 591
+LE+KN+LE++ FS+IVST MI+F+P S++ +PWVCHFH+ ++++ R++L+ SV+
Sbjct: 943 GSLEIKNHLEKLSFSEIVSTSMIIFSPSSNTKVPLSPWVCHFHISEKVLKRKQLSAESVV 1002
Query: 592 ESLYRRYESLTKESKVTFPNLKISSNRKCSKEGGYASLNKEKEDVDCISVTIVESSRSSA 651
SL +Y+S +E K+ +L I + CS + K+D CI+VT+VE+S+ S
Sbjct: 1003 SSLNEQYKSRNRELKLDIVDLDIQNTNHCSSDD-----QAMKDDNVCITVTVVEASKHSV 1057
Query: 652 -KLEAVRDLMIPFLLGTVIKGFLEIKKVDILWSNRSKVSNSYAGS-SGELYLRVTMSSDG 709
+L+A+R ++IPFLL + +KG IKKV+ILW++R K +GELYL+VTM D
Sbjct: 1058 LELDAIRLVLIPFLLDSPVKGDQGIKKVNILWTDRPKAPKRNGNHLAGELYLKVTMYGDR 1117
Query: 710 DSGRFWGVLINHCHRIMPMIDWTRSHPDNIHHFCSAYGIDAGRQYFLHSLASATTETGKS 769
W L+ C IM MIDW RSHPDNI CS YGIDAGR F+ +L SA ++TGK
Sbjct: 1118 GKRNCWTALLETCLPIMDMIDWGRSHPDNIRQCCSVYGIDAGRSIFVANLESAVSDTGKE 1177
Query: 770 ILPKHLHLVANSLSASGEFVGLNAKGIGRQRKHASVSSPFVQACFSNPGTSFIKAAKSGV 829
IL +HL LVA+SLS +GEFV LNAKG +QR+ S +PF QACFS+P F+KAAK GV
Sbjct: 1178 ILREHLLLVADSLSVTGEFVALNAKGWSKQRQVESTPAPFTQACFSSPSQCFLKAAKEGV 1237
Query: 830 LDDLQGCLDALAWGKCMSMGTSGQFDIMHSEKVEEFPESADVYSLLIANFDQLNDKVDIP 889
DDLQG +DALAWGK GT QF+I+ S KV F DVY LL ++ + P
Sbjct: 1238 RDDLQGSIDALAWGKVPGFGTGDQFEIIISPKVHGFTTPVDVYDLL-SSTKTMRRTNSAP 1296
Query: 890 HYHNRSSNKCDSEFSRKNGGYALKEYKQ------SKSFIRNFVTVNDIQKLAFESRSILS 943
+S F + + LK+ K S +R T +I+ L+ + IL
Sbjct: 1297 ----KSDKATVQPFGLLHSAF-LKDIKVLDGKGIPMSLLRTIFTWKNIELLSQSLKRILH 1351
Query: 944 RYSIDQVISDHDKITMLRVLHFHPRKNEKLGCGPADIKVGWHPVHKDSRCFHIIRSDESV 1003
Y I++++++ D+ + VL HP EK+G G I+V H DS CF ++R D +
Sbjct: 1352 SYEINELLNERDEGLVKMVLQLHPNSVEKIGPGVKGIRVA-KSKHGDSCCFEVVRIDGTF 1410
Query: 1004 EDFSYRKCILRALEIVDPGKFRIQKKKWL 1032
EDFSY KC+L A +I+ P K K K+L
Sbjct: 1411 EDFSYHKCVLGATKIIAPKKMNFYKSKYL 1439
>AT2G40030.1 | Symbols: NRPD1B, DRD3, ATNRPD1B, DMS5, NRPE1 | nuclear
RNA polymerase D1B | chr2:16715089-16723406 FORWARD
LENGTH=1976
Length = 1976
Score = 199 bits (506), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 171/613 (27%), Positives = 283/613 (46%), Gaps = 56/613 (9%)
Query: 291 IQSLVDKYASKGNAFLTMFKAGSKGNLLKLVQHSMCLGLQHSLVRLSYRMPRELSCAAWN 350
++ + + K + + S + KLVQ + LGLQ S + Y
Sbjct: 666 VKEVAANFMLKSYSIRNLIDIKSNSAITKLVQQTGFLGLQLSDKKKFY------------ 713
Query: 351 SEKGLNSMPMF-SNTLKSIQCYIPHAVVESSFLTGLNPLECFAHSVANRDSSFSDNADL- 408
++ + M +F I +V+ F GL+P E AHS+A R+ + L
Sbjct: 714 TKTLVEDMAIFCKRKYGRISSSGDFGIVKGCFFHGLDPYEEMAHSIAAREVIVRSSRGLA 773
Query: 409 -PGTLTRRLMFFMRDLYQAYDGTVRNLYGNQLIQFSYDTDKDSSCDSGFQEGTVGGEPVG 467
PGTL + LM +RD+ DGTVRN N +IQF Y D + F+ GEPVG
Sbjct: 774 EPGTLFKNLMAVLRDIVITNDGTVRNTCSNSVIQFKYGVDSERGHQGLFE----AGEPVG 829
Query: 468 ALSACAISEAAYSALGQPISLLETSPLLN-----LKNVLECGS--RKKSGDQTVSLFLSD 520
L+A A+S AY A+ L++SP N +K VL C + + D+ V L+L++
Sbjct: 830 VLAATAMSNPAYKAV------LDSSPNSNSSWELMKEVLLCKVNFQNTTNDRRVILYLNE 883
Query: 521 KLGKQRYGFEYAALEVKNYLERVMFSDIVSTVMIMFTPQSSSLEIFNPWVC---HFHLDK 577
+R+ E AA V+N L +V D ++ + Q + EIF C H HL+K
Sbjct: 884 CHCGKRFCQENAACTVRNKLNKVSLKDTAVEFLVEYRKQPTISEIFGIDSCLHGHIHLNK 943
Query: 578 EIVARRKLTVHSVIESLYRRYESL--TKESKVT--FPNLKISSNRKCSKEGGYASLNKEK 633
++ +++ + + SL K+ K T F +S + CS S +
Sbjct: 944 TLLQDWNISMQDIHQKCEDVINSLGQKKKKKATDDFKRTSLSVSECCSFRDPCGS---KG 1000
Query: 634 EDVDCISVTIVESSRSSAKLEAVRDLM----IPFLLGTVIKGFLEIKKVDILWSN---RS 686
D+ C++ + + + LE D++ P LL VIKG I +I+W++ +
Sbjct: 1001 SDMPCLTFSY---NATDPDLERTLDVLCNTVYPVLLEIVIKGDSRICSANIIWNSSDMTT 1057
Query: 687 KVSNSYAGSSGELYLRVTM--SSDGDSGRFWGVLINHCHRIMPMIDWTRSHPDNIHHFCS 744
+ N +A GE L VT+ S+ SG W V+I+ C ++ +ID RS P ++
Sbjct: 1058 WIRNRHASRRGEWVLDVTVEKSAVKQSGDAWRVVIDSCLSVLHLIDTKRSIPYSVKQVQE 1117
Query: 745 AYGIDAGRQYFLHSLASATTETGKSILPKHLHLVANSLSASGEFVGLNAKGIGRQRKHAS 804
G+ + + L+++ K +L +H+ L+AN+++ SG +G N+ G + +
Sbjct: 1118 LLGLSCAFEQAVQRLSASVRMVSKGVLKEHIILLANNMTCSGTMLGFNSGGYKALTRSLN 1177
Query: 805 VSSPFVQACFSNPGTSFIKAAKSGVLDDLQGCLDALAWGKCMSMGTSGQFDIMHSEKVEE 864
+ +PF +A P F KAA+ D L + + +WGK + +GT QF+++ ++K
Sbjct: 1178 IKAPFTEATLIAPRKCFEKAAEKCHTDSLSTVVGSCSWGKRVDVGTGSQFELLWNQKETG 1237
Query: 865 F--PESADVYSLL 875
E DVYS L
Sbjct: 1238 LDDKEETDVYSFL 1250
Score = 90.9 bits (224), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 46/120 (38%), Positives = 76/120 (63%), Gaps = 3/120 (2%)
Query: 13 SVVSINPLCCSPLRGDFDGDCLHGYIPQSVAARVELNELVALDRQLINGQSGRNLLSLSQ 72
+ V INPL CSPL DFDGDC+H + PQS++A+ E+ EL ++++QL++ +G+ +L +
Sbjct: 434 NTVKINPLMCSPLSADFDGDCVHLFYPQSLSAKAEVMELFSVEKQLLSSHTGQLILQMGS 493
Query: 73 DSLTAAYMLMEDGVLLNLYEIQQLQMLCDKKLTPPPSIIKAPSRNNSLWSGKQLFSMLLP 132
DSL + +++E V L+ QQL M L PPP++ K+ S++ W+ Q+ + P
Sbjct: 494 DSLLSLRVMLER-VFLDKATAQQLAMYGSLSL-PPPALRKS-SKSGPAWTVFQILQLAFP 550
Score = 64.3 bits (155), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 35/88 (39%), Positives = 53/88 (60%), Gaps = 3/88 (3%)
Query: 928 VNDIQKLAFESRSIL--SRYSIDQVISDHDKITML-RVLHFHPRKNEKLGCGPADIKVGW 984
++D++ + R I+ S Y ISD DK +L ++L+FHP+K KLG G I V
Sbjct: 1740 LSDVEPVMRTLRKIMHPSAYPDGDPISDDDKTFVLEKILNFHPQKETKLGSGVDFITVDK 1799
Query: 985 HPVHKDSRCFHIIRSDESVEDFSYRKCI 1012
H + DSRCF ++ +D + +DFSYRK +
Sbjct: 1800 HTIFSDSRCFFVVSTDGAKQDFSYRKSL 1827
>AT4G35800.1 | Symbols: NRPB1, RPB1, RNA_POL_II_LSRNA_POL_II_LS,
RNA_POL_II_LS | RNA polymerase II large subunit |
chr4:16961115-16967892 REVERSE LENGTH=1839
Length = 1839
Score = 108 bits (271), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 121/468 (25%), Positives = 196/468 (41%), Gaps = 77/468 (16%)
Query: 6 VRVLPISSVVSINPLCCSPLRGDFDGDCLHGYIPQSVAARVELNELVALDRQLINGQSGR 65
+R++P S+ +N SP DFDGD ++ ++PQS R E+ EL+ + + +++ Q+ R
Sbjct: 474 IRIMPYSTF-RLNLSVTSPYNADFDGDEMNMHVPQSFETRAEVLELMMVPKCIVSPQANR 532
Query: 66 NLLSLSQDSL------TAAYMLMEDGVLLNLYEIQQLQMLCDKKLTPPPSIIKAPSRNNS 119
++ + QD+L T +E V +N + D K+ P P+I+K
Sbjct: 533 PVMGIVQDTLLGCRKITKRDTFIEKDVFMNTLMWWE---DFDGKV-PAPAILKP----RP 584
Query: 120 LWSGKQLFSMLLPSNFD---YS---------FPPNG---VSVSDGELTSSFESSGWLRDS 164
LW+GKQ+F++++P + YS F G V + GEL + L S
Sbjct: 585 LWTGKQVFNLIIPKQINLLRYSAWHADTETGFITPGDTQVRIERGELLAGTLCKKTLGTS 644
Query: 165 ECNIFQRLVERF-QDKTLNLLYDAQKVLCEWLSMTGFXXXXXXXXXXXXXCARENMMEEI 223
++ + E D L Q ++ WL GF A + ME+I
Sbjct: 645 NGSLVHVIWEEVGPDAARKFLGHTQWLVNYWLLQNGF------TIGIGDTIADSSTMEKI 698
Query: 224 SYGLQEAEQACDFNQLLVDHYCDFLSGSLQDSDNVASIDMDSLNYEKHISAALSEVSVDA 283
+ + A+ A D F L D +E ++ L++ DA
Sbjct: 699 NETISNAKTAVK------DLIRQFQGKELDPEPGRTMRD----TFENRVNQVLNKARDDA 748
Query: 284 FRHMFRNIQSLVDKYASKGNAFLTMFKAGSKGNLLKLVQHSMCLGLQHSLVRLSYRMPRE 343
S K ++ N M AGSKG+ + + Q + C+G Q+ + R+P
Sbjct: 749 --------GSSAQKSLAETNNLKAMVTAGSKGSFINISQMTACVGQQNVEGK---RIPFG 797
Query: 344 LSCAAWNSEKGLNSMPMFSNTLKSIQCYIPHA--VVESSFLTGLNPLECFAHSVANRDSS 401
++P F+ Y P + VE+S+L GL P E F H++ R+
Sbjct: 798 FDG---------RTLPHFTK-----DDYGPESRGFVENSYLRGLTPQEFFFHAMGGREGL 843
Query: 402 FSDNADL--PGTLTRRLMFFMRDLYQAYDGTVRNLYGNQLIQFSYDTD 447
G + RRL+ M D+ YDGTVRN G+ +IQF Y D
Sbjct: 844 IDTAVKTSETGYIQRRLVKAMEDIMVKYDGTVRNSLGD-VIQFLYGED 890
>AT5G60040.1 | Symbols: NRPC1 | nuclear RNA polymerase C1 |
chr5:24173590-24183269 FORWARD LENGTH=1376
Length = 1376
Score = 67.0 bits (162), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 70/135 (51%), Gaps = 15/135 (11%)
Query: 7 RVLPISSVVSINPLCCSPLRGDFDGDCLHGYIPQSVAARVELNELVALDRQLINGQSGRN 66
R++P ++ N C+P DFDGD ++ ++PQ+ AR E L+ + L ++G
Sbjct: 479 RIMPWRTL-RFNESVCNPYNADFDGDEMNMHVPQTEEARTEAITLMGVQNNLCTPKNGEI 537
Query: 67 LLSLSQDSLTAAYMLMEDGVLLNLYEIQQLQMLC-------DKKLTPPPSIIKAPSRNNS 119
L++ +QD LT+++++ Y+ ++C D P P+I+K
Sbjct: 538 LVASTQDFLTSSFLITRKDT---FYDRAAFSLICSYMGDGMDSIDLPTPTILKPI----E 590
Query: 120 LWSGKQLFSMLLPSN 134
LW+GKQ+FS+LL N
Sbjct: 591 LWTGKQIFSVLLRPN 605
Score = 60.1 bits (144), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 59/124 (47%), Gaps = 5/124 (4%)
Query: 727 PMIDWTRSHPDNIHHFCSAYGIDAGRQYFLHSLASATTETGKSILPKHLHLVANSLSASG 786
P I+ + +N+ GI+A R + + + G SI +H+ L+A+ ++ G
Sbjct: 1234 PGINGRTTTSNNVVEVSKTLGIEAARTTIIDEIGTVMGNHGMSIDIRHMMLLADVMTYRG 1293
Query: 787 EFVGLNAKGIGRQRKHASVSSPFVQACFSNPGTSFIKAAKSGVLDDLQGCLDALAWGKCM 846
E +G+ GI + K S +QA F G AA SG +D+++G + + G M
Sbjct: 1294 EVLGIQRTGIQKMDK-----SVLMQASFERTGDHLFSAAASGKVDNIEGVTECVIMGIPM 1348
Query: 847 SMGT 850
+GT
Sbjct: 1349 KLGT 1352
Score = 55.1 bits (131), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 68/165 (41%), Gaps = 25/165 (15%)
Query: 303 NAFLTMFKAGSKGNLLKLVQHSMCLGLQHSLVRLSYRMPRELSCAAWNSEKGL--NSMPM 360
N+ L M + GSKG+ + + Q C+G Q + + G S+P
Sbjct: 776 NSPLIMSQCGSKGSPINISQMVACVGQQ--------------TVNGHRAPDGFIDRSLPH 821
Query: 361 FSNTLKSIQCYIPHAVVESSFLTGLNPLECFAHSVANRDSSFSDNADLP--GTLTRRLMF 418
F KS V +SF +GL E F H++ R+ G ++RRLM
Sbjct: 822 FPRMSKSPAA---KGFVANSFYSGLTATEFFFHTMGGREGLVDTAVKTASTGYMSRRLMK 878
Query: 419 FMRDLYQAYDGTVRNLYGNQLIQFSYDTDKDSSCDSGFQEGTVGG 463
+ DL YD TVRN G ++QF+Y D D EG G
Sbjct: 879 ALEDLLVHYDNTVRNASGC-ILQFTYG---DDGMDPALMEGKDGA 919
>AT5G60040.2 | Symbols: NRPC1 | nuclear RNA polymerase C1 |
chr5:24173590-24183269 FORWARD LENGTH=1391
Length = 1391
Score = 66.6 bits (161), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 70/135 (51%), Gaps = 15/135 (11%)
Query: 7 RVLPISSVVSINPLCCSPLRGDFDGDCLHGYIPQSVAARVELNELVALDRQLINGQSGRN 66
R++P ++ N C+P DFDGD ++ ++PQ+ AR E L+ + L ++G
Sbjct: 489 RIMPWRTL-RFNESVCNPYNADFDGDEMNMHVPQTEEARTEAITLMGVQNNLCTPKNGEI 547
Query: 67 LLSLSQDSLTAAYMLMEDGVLLNLYEIQQLQMLC-------DKKLTPPPSIIKAPSRNNS 119
L++ +QD LT+++++ Y+ ++C D P P+I+K
Sbjct: 548 LVASTQDFLTSSFLITRKDT---FYDRAAFSLICSYMGDGMDSIDLPTPTILKPI----E 600
Query: 120 LWSGKQLFSMLLPSN 134
LW+GKQ+FS+LL N
Sbjct: 601 LWTGKQIFSVLLRPN 615
Score = 60.5 bits (145), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 45/176 (25%), Positives = 81/176 (46%), Gaps = 20/176 (11%)
Query: 702 RVTMSSDGDSGRFWGVLINHCHR--------IM--PMIDWTRSHPDNIHHFCSAYGIDAG 751
RV ++ D D + LI C R +M P I+ + +N+ GI+A
Sbjct: 1216 RVVVAEDMD--KMLAKLIIPCPRWACTNLLAVMGTPGINGRTTTSNNVVEVSKTLGIEAA 1273
Query: 752 RQYFLHSLASATTETGKSILPKHLHLVANSLSASGEFVGLNAKGIGRQRKHASVSSPFVQ 811
R + + + G SI +H+ L+A+ ++ GE +G+ GI + K S +Q
Sbjct: 1274 RTTIIDEIGTVMGNHGMSIDIRHMMLLADVMTYRGEVLGIQRTGIQKMDK-----SVLMQ 1328
Query: 812 ACFSNPGTSFIKAAKSGVLDDLQGCLDALAWGKCMSMGTSGQFDIMHSEKVEEFPE 867
A F G AA SG +D+++G + + G M +GT G ++ ++ ++ P+
Sbjct: 1329 ASFERTGDHLFSAAASGKVDNIEGVTECVIMGIPMKLGT-GILKVL--QRTDDLPK 1381
Score = 55.1 bits (131), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 68/165 (41%), Gaps = 25/165 (15%)
Query: 303 NAFLTMFKAGSKGNLLKLVQHSMCLGLQHSLVRLSYRMPRELSCAAWNSEKGL--NSMPM 360
N+ L M + GSKG+ + + Q C+G Q + + G S+P
Sbjct: 793 NSPLIMSQCGSKGSPINISQMVACVGQQ--------------TVNGHRAPDGFIDRSLPH 838
Query: 361 FSNTLKSIQCYIPHAVVESSFLTGLNPLECFAHSVANRDSSFSDNADLP--GTLTRRLMF 418
F KS V +SF +GL E F H++ R+ G ++RRLM
Sbjct: 839 FPRMSKSPAA---KGFVANSFYSGLTATEFFFHTMGGREGLVDTAVKTASTGYMSRRLMK 895
Query: 419 FMRDLYQAYDGTVRNLYGNQLIQFSYDTDKDSSCDSGFQEGTVGG 463
+ DL YD TVRN G ++QF+Y D D EG G
Sbjct: 896 ALEDLLVHYDNTVRNASGC-ILQFTYG---DDGMDPALMEGKDGA 936
>AT1G45230.1 | Symbols: | Protein of unknown function (DUF3223) |
chr1:17169874-17171381 REVERSE LENGTH=219
Length = 219
Score = 60.8 bits (146), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 40/65 (61%)
Query: 948 DQVISDHDKITMLRVLHFHPRKNEKLGCGPADIKVGWHPVHKDSRCFHIIRSDESVEDFS 1007
D++ +H++ + +L +HP +K+GCG I VG HP + SRC I+R D V DFS
Sbjct: 128 DRLSPEHERTIIEMLLPYHPECEKKIGCGIDYIMVGHHPDFESSRCMFIVRKDGEVVDFS 187
Query: 1008 YRKCI 1012
Y KCI
Sbjct: 188 YWKCI 192
>AT3G46630.1 | Symbols: | Protein of unknown function (DUF3223) |
chr3:17181138-17182346 REVERSE LENGTH=207
Length = 207
Score = 58.5 bits (140), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 35/91 (38%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 928 VNDIQKLAFESRSIL--SRYSIDQVISDHD-KITMLRVLHFHPRKNEKLGCGPADIKVGW 984
+ DI+ ++ ++ IL RY + + D KI M ++L +HP +K+GCG I V
Sbjct: 95 LRDIEPISLLAKEILHSDRYLDGERLDFEDEKIVMEKLLPYHPYSKDKIGCGLDFIMVDR 154
Query: 985 HPVHKDSRCFHIIRSDESVEDFSYRKCILRA 1015
HP + SRC ++R+D DFSY+KC LRA
Sbjct: 155 HPQFRHSRCLFVVRTDGGWIDFSYQKC-LRA 184
>AT1G45230.2 | Symbols: | Protein of unknown function (DUF3223) |
chr1:17169874-17171381 REVERSE LENGTH=219
Length = 219
Score = 57.4 bits (137), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 39/65 (60%)
Query: 948 DQVISDHDKITMLRVLHFHPRKNEKLGCGPADIKVGWHPVHKDSRCFHIIRSDESVEDFS 1007
D++ +H++ + +L +HP +K+GCG I V HP + SRC I+R D V DFS
Sbjct: 128 DRLSPEHERTIIEMLLPYHPECEKKIGCGIDYIMVWHHPDFESSRCMFIVRKDGEVVDFS 187
Query: 1008 YRKCI 1012
Y KCI
Sbjct: 188 YWKCI 192