Miyakogusa Predicted Gene
- Lj0g3v0206989.2
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0206989.2 tr|B9IDF9|B9IDF9_POPTR DNA-directed RNA
polymerase (Fragment) OS=Populus trichocarpa
GN=POPTRDRAFT_9,75.51,0,beta and beta-prime subunits of DNA dependent
RNA-polymerase,NULL; seg,NULL; RNA_pol_Rpb2_1,RNA poly,CUFF.13386.2
(486 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G45140.1 | Symbols: NRPC2 | nuclear RNA polymerase C2 | chr5:... 685 0.0
AT4G21710.1 | Symbols: NRPB2, EMB1989, RPB2 | DNA-directed RNA p... 238 9e-63
AT3G18090.1 | Symbols: NRPD2B | nuclear RNA polymerase D2B | chr... 145 7e-35
AT3G23780.2 | Symbols: NRPD2A | nuclear RNA polymerase D2A | chr... 141 1e-33
AT3G23780.1 | Symbols: NRPD2A, DRD2, NRPD2, DMS2, NRPE2 | nuclea... 141 1e-33
AT1G29940.1 | Symbols: NRPA2 | nuclear RNA polymerase A2 | chr1:... 73 5e-13
>AT5G45140.1 | Symbols: NRPC2 | nuclear RNA polymerase C2 |
chr5:18247416-18257713 REVERSE LENGTH=1161
Length = 1161
Score = 685 bits (1768), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 332/487 (68%), Positives = 394/487 (80%), Gaps = 5/487 (1%)
Query: 1 MPVMLRSCRCVLYEKDEAELAKLGECPLDPGGYFVIKGNEKVILIQEQLSKNRIIIVMDK 60
MP+MLRSCRCVL+ KDE ELA+LGECPLDPGGYF+IKG EKV+LIQEQLSKNRIII DK
Sbjct: 145 MPIMLRSCRCVLHGKDEEELARLGECPLDPGGYFIIKGTEKVLLIQEQLSKNRIIIDSDK 204
Query: 61 NRNIXXXXXXXXXXXXXXXDIVMEKEKMWLKLNRFAKQVPLMVVMKAMGMESDQEVVQMV 120
NI I MEKEK++L L+RF K++P+++V+KAMGMESDQE+VQMV
Sbjct: 205 KGNINASVTSSTEMTKSKTVIQMEKEKIYLFLHRFVKKIPIIIVLKAMGMESDQEIVQMV 264
Query: 121 GRDPRYSLLLLPSFEECAQNGVYTQEQALAHLDSKVKRSMFSNISSEKEGRGSPALKVLE 180
GRDPR+S LLPS EEC GV TQ+QAL +L++KVK+ + EK+GR AL +L
Sbjct: 265 GRDPRFSASLLPSIEECVSEGVNTQKQALDYLEAKVKKISYGT-PPEKDGR---ALSILR 320
Query: 181 EEFLSNIPVHQGNFRLKCIYAAVMMRRIMDAILNKDAMDDKDYVGNKRLELSGQLISLLF 240
+ FL+++PV NFR KC Y VM+RR+++A+LNKDAMDDKDYVGNKRLELSGQLISLLF
Sbjct: 321 DLFLAHVPVPDNNFRQKCFYVGVMLRRMIEAMLNKDAMDDKDYVGNKRLELSGQLISLLF 380
Query: 241 EDRFKTMGEQVRNSSDKLLDKPDKANRFDISSVLARHQDL-ITHGLESTLSTGNFEIRRF 299
ED FKTM + + D +L+KP +A+RFD S L + I+ GLE TLSTGNF+I+RF
Sbjct: 381 EDLFKTMLSEAIKNVDHILNKPIRASRFDFSQCLNKDSRYSISLGLERTLSTGNFDIKRF 440
Query: 300 RMERKGMTQVLQRLSFIGFLGQMTRVSPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEA 359
RM RKGMTQVL RLSFIG +G +T++SPQFEKSRKVSGPR+LQPSQWGMLCPCDTPEGE+
Sbjct: 441 RMHRKGMTQVLTRLSFIGSMGFITKISPQFEKSRKVSGPRSLQPSQWGMLCPCDTPEGES 500
Query: 360 CGLVKNLALMTHVTTDEEEAPLISLCYYLGVEDMEYLSGEELHTPDSFLVIFNGLILGKH 419
CGLVKNLALMTHVTTDEEE PL+++CY LGV D+E LS EELHTPDSFLVI NGLILGKH
Sbjct: 501 CGLVKNLALMTHVTTDEEEGPLVAMCYKLGVTDLEVLSAEELHTPDSFLVILNGLILGKH 560
Query: 420 RRPRGFVTAMRKLRRAGKIGEFVSVYVNEKQGCVYLASDGGRVCRPLVIADNGISRIKEY 479
RP+ F ++R+LRRAGKIGEFVSV+ NEKQ CVY+ASD GRVCRPLVIAD GISR+K++
Sbjct: 561 SRPQYFANSLRRLRRAGKIGEFVSVFTNEKQHCVYVASDVGRVCRPLVIADKGISRVKQH 620
Query: 480 HMKELMD 486
HMKEL D
Sbjct: 621 HMKELQD 627
>AT4G21710.1 | Symbols: NRPB2, EMB1989, RPB2 | DNA-directed RNA
polymerase family protein | chr4:11535684-11542200
REVERSE LENGTH=1188
Length = 1188
Score = 238 bits (606), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 153/491 (31%), Positives = 250/491 (50%), Gaps = 35/491 (7%)
Query: 1 MPVMLRSCRCVLYEKDEAELAKLGECPLDPGGYFVIKGNEKVILIQEQLSKNRIIIV--- 57
+P+MLRS C L++ E +L +LGECP D GGYF+I G+EKV++ QE++S N + +
Sbjct: 158 VPIMLRSSYCTLFQNSEKDLTELGECPYDQGGYFIINGSEKVLIAQEKMSTNHVYVFKKR 217
Query: 58 -------------MDKNRNIXXXXXXXXXXXXXXXDIVMEKEKMWLKLNRFAKQVPLMVV 104
M +N+N + + L ++P+++V
Sbjct: 218 QPNKYAYVGEVRSMAENQNRPPSTMFVRMLARASAKGGSSGQYIRCTLPYIRTEIPIIIV 277
Query: 105 MKAMGMESDQEVVQMVG---RDPRYSLLLLPSFEECAQNGVYTQEQALAHLDSKVKRSMF 161
+A+G +D+++++ + D + LL PS EE + + L LD KR
Sbjct: 278 FRALGFVADKDILEHICYDFADTQMMELLRPSLEEA-----FVIQNQLVALDYIGKRG-- 330
Query: 162 SNISSEKEGRGSPALKVLEEEFLSNIPVHQGNFRLKCIYAAVMMRRIMDAILNKDAMDDK 221
+ + KE R A +L++E L ++ + + K Y ++ R++ L + DD+
Sbjct: 331 ATVGVTKEKRIKYARDILQKEMLPHVGIGEHCETKKAYYFGYIIHRLLLCALGRRPEDDR 390
Query: 222 DYVGNKRLELSGQLISLLFEDRFKTMGEQVRNSSDKLLDKPDKAN-RFDISSVLARHQDL 280
D+ GNKRL+L+G L+ LF F+ + VR+ K +D + N +F I +
Sbjct: 391 DHYGNKRLDLAGPLLGGLFRMLFRKLTRDVRSYVQKCVDNGKEVNLQFAIKA------KT 444
Query: 281 ITHGLESTLSTGNFEIRRFRMERKGMTQVLQRLSFIGFLGQMTRVSPQFEKSRKVSGPRA 340
IT GL+ +L+TGN+ R G++QVL RL++ L + R++ + K++ PR
Sbjct: 445 ITSGLKYSLATGNWGQANAAGTRAGVSQVLNRLTYASTLSHLRRLNSPIGREGKLAKPRQ 504
Query: 341 LQPSQWGMLCPCDTPEGEACGLVKNLALMTHVTTDEEEAPLISLCYYLGVEDMEYLSGEE 400
L SQWGM+CP +TPEG+ACGLVKNLALM ++T P++ G E+ E +S
Sbjct: 505 LHNSQWGMMCPAETPEGQACGLVKNLALMVYITVGSAAYPILEFLEEWGTENFEEISPSV 564
Query: 401 LHTPDSFLVIFNGLILGKHRRPRGFVTAMRKLRRAGKIGEFVSVYVNEKQGCVYLASDGG 460
+ P + + NG+ +G HR P V +R+LRR + V V + + + + +D G
Sbjct: 565 I--PQATKIFVNGMWVGVHRDPDMLVKTLRRLRRRVDVNTEVGVVRDIRLKELRIYTDYG 622
Query: 461 RVCRPLVIADN 471
R RPL I DN
Sbjct: 623 RCSRPLFIVDN 633
>AT3G18090.1 | Symbols: NRPD2B | nuclear RNA polymerase D2B |
chr3:6195323-6200204 FORWARD LENGTH=1055
Length = 1055
Score = 145 bits (366), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 143/516 (27%), Positives = 235/516 (45%), Gaps = 66/516 (12%)
Query: 1 MPVMLRSCRCVLYEKDEAELAKLGECPLDPGGYFVIKGNEKVILIQEQLSKNRIIIV--- 57
+PVM++S C EK + E K G C D GGYFVIKG EKV + QEQ+ R+ I
Sbjct: 58 IPVMVKSVLCKTSEKGK-ENCKKGNCAFDQGGYFVIKGAEKVFIAQEQMCTKRLWISNSP 116
Query: 58 --------MDKNRNIXXXXXXXXXXXXXXXDIVMEKEKMWLKLNRFAKQVPLMVVMKAMG 109
+NR I +MEK L + + ++P+ ++ A+G
Sbjct: 117 WTVSFRSETKRNRFIVRLSENEKAEDYK----IMEK---VLTVYFLSTEIPVWLLFFALG 169
Query: 110 MESDQEVVQMV---GRDPRYSLLLLPSFEECAQ--NGVYTQEQALAHLDSKVKRSMFSNI 164
+ SD+E + ++ G D + L+ S E AL +++ ++K + F
Sbjct: 170 VSSDKEAMDLIAFDGDDASITNSLIASIHEADAVCEAFRCGNNALTYVEHQIKSTKFP-- 227
Query: 165 SSEKEGRGSPALKVLEEEFLSNIPVHQGNFRLKCIYAAVMMRRIMDAILNKDAMDDKDYV 224
PA V + L P QG + K + M++ ++ A K +++D
Sbjct: 228 ---------PAESVDDCLRLYLFPCLQG-LKKKARFLGYMVKCLLSAYAGKRKCENRDSF 277
Query: 225 GNKRLELSGQL------ISLLFEDRFKTMGEQVRNSSDKLLDKPDKANRFDISSVLARHQ 278
NKR+EL+G+L + L R T Q + S D L KP + + D S
Sbjct: 278 RNKRIELAGELLEREIRVHLAHARRKMTRAMQKQLSGDGDL-KPIE-HYLDAS------- 328
Query: 279 DLITHGLESTLSTGNFEIRRFRMER-KGMTQVLQRLSFIGFLGQMTRVSPQFEKSRKVSG 337
+IT+GL STG + +MER G+ L R + + L + R Q + KV
Sbjct: 329 -VITNGLNRAFSTGAWSHPFRKMERVSGVVANLGRANPLQTLIDLRRTRQQVLYTGKVGD 387
Query: 338 PRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVTTDEEEAPLISLCYYLGVEDMEYLS 397
R PS WG +C TP+GE CGLVKN++L+ V+T E+ ++ + + G+E++
Sbjct: 388 ARHPHPSHWGRVCFLSTPDGENCGLVKNMSLLGLVSTQGLES-VVEMLFTCGMEELM--- 443
Query: 398 GEELHTP--DSFLVIFNGLILGKHRRPRGFVTAMRKLRRAGKIGEFVSVYVNEKQGCVYL 455
+ TP V+ NG +G FV ++ RR ++ + + ++ V +
Sbjct: 444 -NDTSTPLCGKHKVLLNGDWVGLCADSESFVGELKSRRRQSELPLEMEIKRDKDDNEVRI 502
Query: 456 ASDGGRVCRPLVIADNGISRIK-----EYHMKELMD 486
+D GR+ RPL++ +N + ++K +Y K L+D
Sbjct: 503 FTDAGRLLRPLLVVEN-LHKLKQDKPTQYPFKHLLD 537
>AT3G23780.2 | Symbols: NRPD2A | nuclear RNA polymerase D2A |
chr3:8567971-8573819 REVERSE LENGTH=1172
Length = 1172
Score = 141 bits (355), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 135/491 (27%), Positives = 227/491 (46%), Gaps = 50/491 (10%)
Query: 1 MPVMLRSCRCVLYEKDEAELAKLGECPLDPGGYFVIKGNEKVILIQEQLSKNRIII---- 56
+PVM++S C EK + E K G+C D GGYFVIKG EKV + QEQ+ R+ I
Sbjct: 176 IPVMVKSILCKTSEKGK-ENCKKGDCAFDQGGYFVIKGAEKVFIAQEQMCTKRLWISNSP 234
Query: 57 --VMDKNRNIXXXXXXXXXXXXXXXDIVMEKEKMWLKLNRFAKQVPLMVVMKAMGMESDQ 114
V ++ N D +EK+ L + + ++P+ ++ A+G+ SD+
Sbjct: 235 WTVSFRSENKRNRFIVRLSENEKAEDY-KRREKV-LTVYFLSTEIPVWLLFFALGVSSDK 292
Query: 115 EVVQMV---GRDPRYSLLLLPSFE--ECAQNGVYTQEQALAHLDSKVKRSMFSNISSEKE 169
E + ++ G D + L+ S + AL +++ ++K + F
Sbjct: 293 EAMDLIAFDGDDASITNSLIASIHVADAVCEAFRCGNNALTYVEQQIKSTKFP------- 345
Query: 170 GRGSPALKVLEEEFLSNIPVHQGNFRLKCIYAAVMMRRIMDAILNKDAMDDKDYVGNKRL 229
PA V E L P Q + + K + M++ ++++ K +++D NKR+
Sbjct: 346 ----PAESVDECLHLYLFPGLQ-SLKKKARFLGYMVKCLLNSYAGKRKCENRDSFRNKRI 400
Query: 230 ELSGQL------ISLLFEDRFKTMGEQVRNSSDKLLDKPDKANRFDISSVLARHQDLITH 283
EL+G+L + L R T Q S D L KP + + D S +IT+
Sbjct: 401 ELAGELLEREIRVHLAHARRKMTRAMQKHLSGDGDL-KPIE-HYLDAS--------VITN 450
Query: 284 GLESTLSTGNFEIRRFRMER-KGMTQVLQRLSFIGFLGQMTRVSPQFEKSRKVSGPRALQ 342
GL STG + +MER G+ L R + + L + R Q + KV R
Sbjct: 451 GLSRAFSTGAWSHPFRKMERVSGVVANLGRANPLQTLIDLRRTRQQVLYTGKVGDARYPH 510
Query: 343 PSQWGMLCPCDTPEGEACGLVKNLALMTHVTTDEEEAPLISLCYYLGVEDMEYLSGEELH 402
PS WG +C TP+GE CGLVKN++L+ V+T E+ ++ + G+E++ ++
Sbjct: 511 PSHWGRVCFLSTPDGENCGLVKNMSLLGLVSTQSLES-VVEKLFACGMEELM----DDTC 565
Query: 403 TP--DSFLVIFNGLILGKHRRPRGFVTAMRKLRRAGKIGEFVSVYVNEKQGCVYLASDGG 460
TP V+ NG +G FV ++ RR ++ + + ++ V + +D G
Sbjct: 566 TPLFGKHKVLLNGDWVGLCADSESFVAELKSRRRQSELPREMEIKRDKDDNEVRIFTDAG 625
Query: 461 RVCRPLVIADN 471
R+ RPL++ +N
Sbjct: 626 RLLRPLLVVEN 636
>AT3G23780.1 | Symbols: NRPD2A, DRD2, NRPD2, DMS2, NRPE2 | nuclear
RNA polymerase D2A | chr3:8567971-8573819 REVERSE
LENGTH=1172
Length = 1172
Score = 141 bits (355), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 135/491 (27%), Positives = 227/491 (46%), Gaps = 50/491 (10%)
Query: 1 MPVMLRSCRCVLYEKDEAELAKLGECPLDPGGYFVIKGNEKVILIQEQLSKNRIII---- 56
+PVM++S C EK + E K G+C D GGYFVIKG EKV + QEQ+ R+ I
Sbjct: 176 IPVMVKSILCKTSEKGK-ENCKKGDCAFDQGGYFVIKGAEKVFIAQEQMCTKRLWISNSP 234
Query: 57 --VMDKNRNIXXXXXXXXXXXXXXXDIVMEKEKMWLKLNRFAKQVPLMVVMKAMGMESDQ 114
V ++ N D +EK+ L + + ++P+ ++ A+G+ SD+
Sbjct: 235 WTVSFRSENKRNRFIVRLSENEKAEDY-KRREKV-LTVYFLSTEIPVWLLFFALGVSSDK 292
Query: 115 EVVQMV---GRDPRYSLLLLPSFE--ECAQNGVYTQEQALAHLDSKVKRSMFSNISSEKE 169
E + ++ G D + L+ S + AL +++ ++K + F
Sbjct: 293 EAMDLIAFDGDDASITNSLIASIHVADAVCEAFRCGNNALTYVEQQIKSTKFP------- 345
Query: 170 GRGSPALKVLEEEFLSNIPVHQGNFRLKCIYAAVMMRRIMDAILNKDAMDDKDYVGNKRL 229
PA V E L P Q + + K + M++ ++++ K +++D NKR+
Sbjct: 346 ----PAESVDECLHLYLFPGLQ-SLKKKARFLGYMVKCLLNSYAGKRKCENRDSFRNKRI 400
Query: 230 ELSGQL------ISLLFEDRFKTMGEQVRNSSDKLLDKPDKANRFDISSVLARHQDLITH 283
EL+G+L + L R T Q S D L KP + + D S +IT+
Sbjct: 401 ELAGELLEREIRVHLAHARRKMTRAMQKHLSGDGDL-KPIE-HYLDAS--------VITN 450
Query: 284 GLESTLSTGNFEIRRFRMER-KGMTQVLQRLSFIGFLGQMTRVSPQFEKSRKVSGPRALQ 342
GL STG + +MER G+ L R + + L + R Q + KV R
Sbjct: 451 GLSRAFSTGAWSHPFRKMERVSGVVANLGRANPLQTLIDLRRTRQQVLYTGKVGDARYPH 510
Query: 343 PSQWGMLCPCDTPEGEACGLVKNLALMTHVTTDEEEAPLISLCYYLGVEDMEYLSGEELH 402
PS WG +C TP+GE CGLVKN++L+ V+T E+ ++ + G+E++ ++
Sbjct: 511 PSHWGRVCFLSTPDGENCGLVKNMSLLGLVSTQSLES-VVEKLFACGMEELM----DDTC 565
Query: 403 TP--DSFLVIFNGLILGKHRRPRGFVTAMRKLRRAGKIGEFVSVYVNEKQGCVYLASDGG 460
TP V+ NG +G FV ++ RR ++ + + ++ V + +D G
Sbjct: 566 TPLFGKHKVLLNGDWVGLCADSESFVAELKSRRRQSELPREMEIKRDKDDNEVRIFTDAG 625
Query: 461 RVCRPLVIADN 471
R+ RPL++ +N
Sbjct: 626 RLLRPLLVVEN 636
>AT1G29940.1 | Symbols: NRPA2 | nuclear RNA polymerase A2 |
chr1:10479322-10486670 REVERSE LENGTH=1178
Length = 1178
Score = 72.8 bits (177), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 98/427 (22%), Positives = 172/427 (40%), Gaps = 79/427 (18%)
Query: 1 MPVMLRSCRCVLYEKDEAELAKLGECPLDPGGYFVIKGNEKVILIQEQLSKNRIIIVMDK 60
P+ML S C L D +L K E + GGYF++ G E+V R +I +
Sbjct: 124 FPIMLMSKLCSLKGADCRKLLKCKESTSEMGGYFILNGIERVF---------RCVIAPKR 174
Query: 61 N------RNIXXXXXXXXXXXXXXXDIVMEKE-----KMWLKLNRFAKQ----------V 99
N RN V + + K++ N A+ +
Sbjct: 175 NHPTSMIRNSFRDRKEGYSSKAVVTRCVRDDQSSVTVKLYYLRNGSARVGFWIVGREYLL 234
Query: 100 PLMVVMKAMGMESDQEVVQMV--------GRDP----------RYSLLLLPSFEECAQNG 141
P+ +V+KA+ D+E+ + + GR R ++L +E G
Sbjct: 235 PVGLVLKALTNSCDEEIYESLNCCYSEHYGRGDGAIGTQLVRERAKIIL----DEVRDLG 290
Query: 142 VYTQEQALAHLDSKVKRSMFSNISSEKEGRGSPALKVLEEEFLSN-IPVHQGNFRLKCIY 200
++T+EQ HL + + +G +L ++ E L + + VH N K
Sbjct: 291 LFTREQCRKHLGQHFQPVL--------DGVKKESLSIVAEAVLRDYLFVHLDNDHDKFNL 342
Query: 201 AAVMMRRIMDAILNKDAMDDKDYVGNKRLELSGQLISLLFEDRFKTMGEQVRNSSDKLLD 260
+++++ + D+ D + N+ + + G +I++ +++ + E +R L D
Sbjct: 343 LIFIIQKLYSLVDQTSLPDNPDSLQNQEILVPGHVITIYLKEKLE---EWLRKCKSLLKD 399
Query: 261 KPDKAN-RFDISSVLARHQDLITH--------GLESTLSTGNFEIRRFR--MERKGMTQV 309
+ D N +F S LA + LI +E+ L TG + + +R G T
Sbjct: 400 ELDNTNSKFSFES-LADVKKLINKNPPRSIGTSIETLLKTGALKTQSGLDLQQRAGYTVQ 458
Query: 310 LQRLSFIGFLGQMTRV--SPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLA 367
+RL+F+ FL V F R + R L P WG LCP TP+G CGL+ ++
Sbjct: 459 AERLNFLRFLSFFRAVHRGASFAGLRTTT-VRKLLPESWGFLCPVHTPDGTPCGLLNHMT 517
Query: 368 LMTHVTT 374
+ +T+
Sbjct: 518 RTSRITS 524