Miyakogusa Predicted Gene
- Lj0g3v0353889.2
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0353889.2 tr|D7KU36|D7KU36_ARALL DNA-directed RNA
polymerase OS=Arabidopsis lyrata subsp. lyrata
GN=ARALYDRAFT,27.49,4e-18,seg,NULL; beta and beta-prime subunits of
DNA dependent RNA-polymerase,NULL,CUFF.24477.2
(788 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Sequences producing significant alignments:                          Score (bits)  E value
AT2G40030.1 | Symbols: NRPD1B, DRD3, ATNRPD1B, DMS5, NRPE1 | nuc...  391           e-108
AT1G63020.2 | Symbols: NRPD1A | nuclear RNA polymerase D1A | chr...  106           6e-23
AT1G63020.1 | Symbols: NRPD1A, POL IVA, SDE4, NRPD1, SMD2 | nucl...  106           6e-23
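
The E values listed for these hits are tied to the bit scores through the standard Karlin-Altschul relation E ≈ m · n · 2^(−S'), where S' is the bit score, m the query length, and n the database size. The Python sketch below plugs in the raw query length (788 letters) and database size (14,482,855 letters) from this report. It is only an approximation: BLAST uses effective (edge-corrected) lengths and compositional adjustment, so the reported values are expected to be somewhat smaller than these estimates. The function name `approx_evalue` is illustrative and not part of BLAST.

```python
def approx_evalue(bit_score: float, query_len: int, db_letters: int) -> float:
    """Rough E value from a bit score: E ~ m * n * 2**(-S').

    Uses raw lengths, not BLAST's effective lengths, so it is an
    approximation of what the program reports.
    """
    return query_len * db_letters * 2.0 ** (-bit_score)


# Values taken from this report.
QUERY_LEN = 788            # "(788 letters)"
DB_LETTERS = 14_482_855    # "14,482,855 total letters"

hits = [("AT2G40030.1", 391), ("AT1G63020.2", 106), ("AT1G63020.1", 106)]

for name, bits in hits:
    estimate = approx_evalue(bits, QUERY_LEN, DB_LETTERS)
    print(f"{name}: {bits} bits, approximate E = {estimate:.1e}")
```

With these inputs the estimates should land within roughly an order of magnitude of the reported E values (e-108 and 6e-23).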
>AT2G40030.1 | Symbols: NRPD1B, DRD3, ATNRPD1B, DMS5, NRPE1 | nuclear
RNA polymerase D1B | chr2:16715089-16723406 FORWARD
LENGTH=1976
Length = 1976
Score = 391 bits (1005), Expect = e-108, Method: Compositional matrix adjust.
Identities = 298/856 (34%), Positives = 438/856 (51%), Gaps = 113/856 (13%)
Query: 1 MLEELKINMAEVFQRCQEKLRSLNRKRKYYQT--LKSTELFFSESCA------SPNFSSP 52
+L++ I+M ++ Q+C++ + SL +K+K T K T L SE C+ S P
Sbjct: 945 LLQDWNISMQDIHQKCEDVINSLGQKKKKKATDDFKRTSLSVSECCSFRDPCGSKGSDMP 1004
Query: 53 CVTFI--SLDGDGLDKTTEILADVICPVLLGTIIQGDPRISSANITWINPDSNAWVRNPS 110
C+TF + D D L++T ++L + + PVLL +I+GD RI SANI W + D W+RN
Sbjct: 1005 CLTFSYNATDPD-LERTLDVLCNTVYPVLLEIVIKGDSRICSANIIWNSSDMTTWIRNRH 1063
Query: 111 KSSNGELALDIILEEAAVKQSGDAWRIVLDSCLPVLNLIDTRRSIPYAIKQVQELLGISC 170
S GE LD+ +E++AVKQSGDAWR+V+DSCL VL+LIDT+RSIPY++KQVQELLG+SC
Sbjct: 1064 ASRRGEWVLDVTVEKSAVKQSGDAWRVVIDSCLSVLHLIDTKRSIPYSVKQVQELLGLSC 1123
Query: 171 TFDQAIQRLAASVKMVAKGVLREHLILLASSMTCGGNLVGFNTGGYKALARQLNIQVPFT 230
F+QA+QRL+ASV+MV+KGVL+EH+ILLA++MTC G ++GFN+GGYKAL R LNI+ PFT
Sbjct: 1124 AFEQAVQRLSASVRMVSKGVLKEHIILLANNMTCSGTMLGFNSGGYKALTRSLNIKAPFT 1183
Query: 231 DATLFTPKKCFERAAEKCHTDSLSSIVASCSWGKPVAVGTGSRFDVLWDAKERKSNELEG 290
+ATL P+KCFE+AAEKCHTDSLS++V SCSWGK V VGTGS+F++LW+ KE ++ E
Sbjct: 1184 EATLIAPRKCFEKAAEKCHTDSLSTVVGSCSWGKRVDVGTGSQFELLWNQKETGLDDKEE 1243
Query: 291 MDVYNFLHMVKSLINGEEENDACXXXXXXXXXXXXNADYSLSPQHTSGV-DAVFEETFEA 349
DVY+FL MV S N DA A+++ SP+ S + + FE++ +
Sbjct: 1244 TDVYSFLQMVISTTNA----DAFVSSPGFDVTEEEMAEWAESPERDSALGEPKFEDSADF 1299
Query: 350 MN-----GPESNGWGATT--DRTETKSTQWSAWESNKAETK-----DGGSQRVHEDSWTS 397
N P W ++ D + ++W +S E + + ED+W+S
Sbjct: 1300 QNLHDEGKPSGANWEKSSSWDNGCSGGSEWGVSKSTGGEANPESNWEKTTNVEKEDAWSS 1359
Query: 398 RDVMKDDSQMTNAWDGNVEQTKTISNDWTAWGKNKSEIQDNVAEKAEGECGSSEMWKTAV 457
+ KD Q + S+ AWG + + E +S K ++
Sbjct: 1360 WNTRKD------------AQESSKSDSGGAWGIKTKDADADTTPNWE----TSPAPKDSI 1403
Query: 458 IQEGSSK-SNAW--KSNIEQRSDEDSWTSQKLKADVIQDSSKPSSWGAKPKSN-----DS 509
+ E + S+ W KS ++ D+ +W ++ A S+ + WG+ K N D+
Sbjct: 1404 VPENNEPTSDVWGHKSVSDKSWDKKNWGTE--SAPAAWGSTDAAVWGSSDKKNSETESDA 1461
Query: 510 SNWGRNKDEIQDVVSRRAEDDSWSSKKQLSSDATQEDSSKFSFWGGNKDTTKPKSNDWSS 569
+ WG DV S W+ K + E S + WG + D TK + W+S
Sbjct: 1462 AAWGSRDKNNSDVGSGAGVLGPWNKK-------SSETESNGATWGSS-DKTKSGAAAWNS 1513
Query: 570 WGGNRDGIQDGGSKRAQDDSWSSQKDVTRESSSKVDAWGA---NNEASNPKSNDWSA--- 623
W ++ I+ A W SQ E+ S AWGA + P W
Sbjct: 1514 W--DKKNIETDSEPAA----WGSQGKKNSETESGPAAWGAWDKKKSETEPGPAGWGMGDK 1567
Query: 624 --------------WGKNKDGTQDGGSERAQDDSSSLGKW-KAESQVEADTVQEDSSKSK 668
W K K T+ G + D+++ G K S+ E+D S K
Sbjct: 1568 KNSETELGPAAMGNWDKKKSDTKSGPAAWGSTDAAAWGSSDKNNSETESDAAAWGSRNKK 1627
Query: 669 AWERNSDNVKVGNSSWGKPKFPETQAWDPQKETNQGAGSRGWDS--QIASANSDSDRNFQ 726
E ++ G +WG P A D K+TN+ W S + S D Q
Sbjct: 1628 TSE-----IESGAGAWGSWGQPSPTAED--KDTNED-DRNPWVSLKETKSREKDDKERSQ 1679
Query: 727 WGKQGRESFKKSQGWGSNAGDMK-NKNRPGRAP------GP-------RLDMYSSEEQDI 772
WG ++ D K N+N R P P RLD ++SEEQ++
Sbjct: 1680 WGNPAKKFPSSGGWSNGGGADWKGNRNHTPRPPRSEDNLAPMFTATRQRLDSFTSEEQEL 1739
Query: 773 LKDIEPIVQSIRRIMQ 788
L D+EP+++++R+IM
Sbjct: 1740 LSDVEPVMRTLRKIMH 1755
>AT1G63020.2 | Symbols: NRPD1A | nuclear RNA polymerase D1A |
chr1:23355329-23361126 REVERSE LENGTH=1453
Length = 1453
Score = 106 bits (265), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 70/233 (30%), Positives = 118/233 (50%), Gaps = 12/233 (5%)
Query: 74 VICPVLLGTIIQGDPRISSANITWINPDSNAWVRNPSKSSN---GELALDIILEEAAVKQ 130
V+ P LL + ++GD I NI W + + P ++ N GEL L + + K+
Sbjct: 1066 VLIPFLLDSPVKGDQGIKKVNILWTDRP-----KAPKRNGNHLAGELYLKVTMYGDRGKR 1120
Query: 131 SGDAWRIVLDSCLPVLNLIDTRRSIPYAIKQVQELLGISCTFDQAIQRLAASVKMVAKGV 190
+ W +L++CLP++++ID RS P I+Q + GI + L ++V K +
Sbjct: 1121 --NCWTALLETCLPIMDMIDWGRSHPDNIRQCCSVYGIDAGRSIFVANLESAVSDTGKEI 1178
Query: 191 LREHLILLASSMTCGGNLVGFNTGGYKALARQLNIQVPFTDATLFTPKKCFERAAEKCHT 250
LREHL+L+A S++ G V N G+ + + PFT A +P +CF +AA++
Sbjct: 1179 LREHLLLVADSLSVTGEFVALNAKGWSKQRQVESTPAPFTQACFSSPSQCFLKAAKEGVR 1238
Query: 251 DSLSSIVASCSWGKPVAVGTGSRFDVLWDAKERKSNELEGMDVYNFLHMVKSL 303
D L + + +WGK GTG +F+++ K +DVY+ L K++
Sbjct: 1239 DDLQGSIDALAWGKVPGFGTGDQFEIIISPKVHGFT--TPVDVYDLLSSTKTM 1289
>AT1G63020.1 | Symbols: NRPD1A, POL IVA, SDE4, NRPD1, SMD2 | nuclear
RNA polymerase D1A | chr1:23355329-23361126 REVERSE
LENGTH=1453
Length = 1453
Score = 106 bits (265), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 70/233 (30%), Positives = 118/233 (50%), Gaps = 12/233 (5%)
Query: 74 VICPVLLGTIIQGDPRISSANITWINPDSNAWVRNPSKSSN---GELALDIILEEAAVKQ 130
V+ P LL + ++GD I NI W + + P ++ N GEL L + + K+
Sbjct: 1066 VLIPFLLDSPVKGDQGIKKVNILWTDRP-----KAPKRNGNHLAGELYLKVTMYGDRGKR 1120
Query: 131 SGDAWRIVLDSCLPVLNLIDTRRSIPYAIKQVQELLGISCTFDQAIQRLAASVKMVAKGV 190
+ W +L++CLP++++ID RS P I+Q + GI + L ++V K +
Sbjct: 1121 --NCWTALLETCLPIMDMIDWGRSHPDNIRQCCSVYGIDAGRSIFVANLESAVSDTGKEI 1178
Query: 191 LREHLILLASSMTCGGNLVGFNTGGYKALARQLNIQVPFTDATLFTPKKCFERAAEKCHT 250
LREHL+L+A S++ G V N G+ + + PFT A +P +CF +AA++
Sbjct: 1179 LREHLLLVADSLSVTGEFVALNAKGWSKQRQVESTPAPFTQACFSSPSQCFLKAAKEGVR 1238
Query: 251 DSLSSIVASCSWGKPVAVGTGSRFDVLWDAKERKSNELEGMDVYNFLHMVKSL 303
D L + + +WGK GTG +F+++ K +DVY+ L K++
Sbjct: 1239 DDLQGSIDALAWGKVPGFGTGDQFEIIISPKVHGFT--TPVDVYDLLSSTKTM 1289
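
Each alignment in this report carries an "Identities / Positives / Gaps" statistics line. The Python sketch below shows one way such a line could be parsed and its percentages re-derived. The regular expression and the helper names (`parse_stats`, `floor_pct`) are assumptions made for illustration, not part of BLAST. In this report the printed percentages agree with truncated (floor) values rather than rounded ones: 118/233 is about 50.6% but is printed as 50%.

```python
import re

# Matches the statistics line as it appears in this report.
STATS_RE = re.compile(
    r"Identities = (\d+)/(\d+) \((\d+)%\), "
    r"Positives = (\d+)/(\d+) \((\d+)%\), "
    r"Gaps = (\d+)/(\d+) \((\d+)%\)"
)


def floor_pct(count: int, total: int) -> int:
    """Percentage truncated toward zero, as the report appears to print it."""
    return count * 100 // total


def parse_stats(line: str) -> dict:
    """Extract counts and printed percentages from one statistics line."""
    match = STATS_RE.search(line)
    if match is None:
        raise ValueError(f"not a statistics line: {line!r}")
    ident, length, ident_pct, pos, _, pos_pct, gaps, _, gap_pct = map(int, match.groups())
    return {
        "length": length,
        "identities": ident, "identities_pct": ident_pct,
        "positives": pos, "positives_pct": pos_pct,
        "gaps": gaps, "gaps_pct": gap_pct,
    }


# The two distinct statistics lines from this report.
lines = [
    "Identities = 298/856 (34%), Positives = 438/856 (51%), Gaps = 113/856 (13%)",
    "Identities = 70/233 (30%), Positives = 118/233 (50%), Gaps = 12/233 (5%)",
]

for line in lines:
    stats = parse_stats(line)
    for key in ("identities", "positives", "gaps"):
        recomputed = floor_pct(stats[key], stats["length"])
        print(key, stats[key], stats["length"], stats[f"{key}_pct"], recomputed)
```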