Miyakogusa Predicted Gene
- Lj6g3v0727740.2
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj6g3v0727740.2 Non Chatacterized Hit- tr|I1N089|I1N089_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.3490 PE=,84.28,0,no
description,NULL; no description,DNA-directed RNA polymerase, insert
domain; RNA_POL_D_30KD,DNA-d,CUFF.58190.2
(387 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G60620.1 | Symbols: ATRPAC43, RPAC43 | RNA polymerase I subun... 518 e-147
AT1G60850.1 | Symbols: ATRPAC42, AAC42 | DNA-directed RNA polyme... 461 e-130
AT1G60850.2 | Symbols: ATRPAC42, AAC42 | DNA-directed RNA polyme... 461 e-130
AT1G60850.3 | Symbols: ATRPAC42, AAC42 | DNA-directed RNA polyme... 338 4e-93
AT2G15430.1 | Symbols: RBP36A, RPB35.5A, NRPB3, NRPD3, NRPE3A | ... 139 3e-33
AT2G15400.1 | Symbols: RBP36B, NRPE3B | DNA-directed RNA polymer... 126 3e-29
AT2G15400.2 | Symbols: RBP36B, NRPE3B | DNA-directed RNA polymer... 106 3e-23
>AT1G60620.1 | Symbols: ATRPAC43, RPAC43 | RNA polymerase I subunit
43 | chr1:22331225-22333370 FORWARD LENGTH=385
Length = 385
Score = 518 bits (1335), Expect = e-147, Method: Compositional matrix adjust.
Identities = 248/365 (67%), Positives = 297/365 (81%), Gaps = 2/365 (0%)
Query: 23 IMNLESVPQKLPPHLELLRTRVLCNVDAPQHTDTIQYSGAYAALGVDNSLRFDDFCKNFK 82
I +L VP LPPHLEL RTRV+C D+ H I +SGAY+++GVDNS+R ++F ++FK
Sbjct: 23 IFDLPDVPTGLPPHLELQRTRVVCKKDSNIHPTAITFSGAYSSMGVDNSVRLENFSEDFK 82
Query: 83 VEVKRLTDDEMEFDMIGIDPAIANAFRRILISEVPTMALERVYIANNTSLVQDEVLAHRL 142
V+V LT+ +M FDMIG+ IANAFRRIL++E+P+MA+E+VY+ANNTS++QDEVLAHRL
Sbjct: 83 VDVISLTETDMVFDMIGVHAGIANAFRRILLAELPSMAIEKVYVANNTSVIQDEVLAHRL 142
Query: 143 GLVPINADPRLFEYPDNAGENNEKNSIVFKLHVHCKKGQPRITVLSDQLKWLPNGSELIA 202
GL+PI ADPRLFEY + NEKN+IVFKLHV C KG PR VL+ +LKWLPNGSELI
Sbjct: 143 GLIPIAADPRLFEYLSENDQPNEKNTIVFKLHVKCLKGDPRRKVLTSELKWLPNGSELIK 202
Query: 203 EDTKPNTDSKPKTFTSFTCSQDSLPEFSSNPPGPRYLDIIIDKLGPGQEIELEAHAVKGI 262
E T PKT+TSF SQDS PEF+ NP P DI+I KLGPGQEIELEAHAVKGI
Sbjct: 203 ESGGSTT--TPKTYTSFNHSQDSFPEFAENPIRPTLKDILIAKLGPGQEIELEAHAVKGI 260
Query: 263 GKTHAKWSPVSTAWYRMLPEVVLLEDVQDELAEELKNKCPLKVFDIEDIGKGKRRAKVAR 322
GKTHAKWSPV+TAWYRMLPEVVLL++ + + AEEL CP KVFDIED+G+G++RA VAR
Sbjct: 261 GKTHAKWSPVATAWYRMLPEVVLLKEFEGKHAEELVKVCPKKVFDIEDMGQGRKRATVAR 320
Query: 323 PRDCTLCRECIRGGKEWEDRVSLRRVKDHFIFTVESTGALPPELLFTEAVKILEDKCERV 382
PRDC+LCRECIR G EWED+V LRRVK+HFIFT+ESTG+ PPE+LF EAVKILEDKCERV
Sbjct: 321 PRDCSLCRECIRDGVEWEDQVDLRRVKNHFIFTIESTGSQPPEVLFNEAVKILEDKCERV 380
Query: 383 ITELS 387
I+ELS
Sbjct: 381 ISELS 385
>AT1G60850.1 | Symbols: ATRPAC42, AAC42 | DNA-directed RNA
polymerase family protein | chr1:22398078-22400155
REVERSE LENGTH=375
Length = 375
Score = 461 bits (1186), Expect = e-130, Method: Compositional matrix adjust.
Identities = 233/361 (64%), Positives = 279/361 (77%), Gaps = 4/361 (1%)
Query: 23 IMNLESVPQKLPPHLELLRTRVLCNVDAPQHTDTIQYSGAY-AALGVDNSLRFDDFCKNF 81
I +L VP LPPHL+ +TRV+ +AP HT + YSG Y ++ D++++ +F NF
Sbjct: 15 IDDLPDVPAGLPPHLKAQQTRVVSKNNAPAHTASAIYSGTYVSSTEEDDNVKLGNFYDNF 74
Query: 82 KVEVKRLTDDEMEFDMIGIDPAIANAFRRILISEVPTMALERVYIANNTSLVQDEVLAHR 141
KV+V LT +MEFDMIGID A ANAFRRILI+EVP+MA+E+V IA NTS++ DEVLAHR
Sbjct: 75 KVDVVSLTKTDMEFDMIGIDAAFANAFRRILIAEVPSMAIEKVLIAYNTSVIIDEVLAHR 134
Query: 142 LGLVPINADPRLFEYPDNAGENNEKNSIVFKLHVHCKKGQPRITVLSDQLKWLPNGSELI 201
+GL+PI ADPRLFEY + NEKN+IVFKLHV C K +PR+ VL+ LKWLPNGSEL+
Sbjct: 135 MGLIPIAADPRLFEYLSEHDQANEKNTIVFKLHVKCPKNRPRLKVLTSDLKWLPNGSELL 194
Query: 202 AEDTKPNTDSKPKTFTSFTCSQDSLPEFSSNPPGPRYLDIIIDKLGPGQEIELEAHAVKG 261
E N SKPKT+TSF+CSQDSLPEF++NP P LDI+I KL PGQEIELEAHAVKG
Sbjct: 195 RESE--NKTSKPKTYTSFSCSQDSLPEFANNPITPCDLDILIAKLAPGQEIELEAHAVKG 252
Query: 262 IGKTHAKWSPVSTAWYRMLPEVVLLEDVQDELAEELKNKCPLKVFDIEDIGKGKRRAKVA 321
IGKTHAKWSPV TAWYRM PEVVL +V+DELAE L N CP VFDIED+GKGK+RA VA
Sbjct: 253 IGKTHAKWSPVGTAWYRMHPEVVLRGEVEDELAERLVNVCPQNVFDIEDMGKGKKRATVA 312
Query: 322 RPRDCTLCRECIRGGKEWEDRVSLRRVKDHFIFTVESTGALPPELLFTEAVKILEDKCER 381
+PR CTLC+EC+R D V L VK+HFIF +ESTG+LPPE+LFTEAVKILE KCE
Sbjct: 313 QPRKCTLCKECVRDDDL-VDHVDLGSVKNHFIFNIESTGSLPPEVLFTEAVKILEAKCEA 371
Query: 382 V 382
+
Sbjct: 372 I 372
>AT1G60850.2 | Symbols: ATRPAC42, AAC42 | DNA-directed RNA
polymerase family protein | chr1:22398078-22400155
REVERSE LENGTH=375
Length = 375
Score = 461 bits (1186), Expect = e-130, Method: Compositional matrix adjust.
Identities = 233/361 (64%), Positives = 279/361 (77%), Gaps = 4/361 (1%)
Query: 23 IMNLESVPQKLPPHLELLRTRVLCNVDAPQHTDTIQYSGAY-AALGVDNSLRFDDFCKNF 81
I +L VP LPPHL+ +TRV+ +AP HT + YSG Y ++ D++++ +F NF
Sbjct: 15 IDDLPDVPAGLPPHLKAQQTRVVSKNNAPAHTASAIYSGTYVSSTEEDDNVKLGNFYDNF 74
Query: 82 KVEVKRLTDDEMEFDMIGIDPAIANAFRRILISEVPTMALERVYIANNTSLVQDEVLAHR 141
KV+V LT +MEFDMIGID A ANAFRRILI+EVP+MA+E+V IA NTS++ DEVLAHR
Sbjct: 75 KVDVVSLTKTDMEFDMIGIDAAFANAFRRILIAEVPSMAIEKVLIAYNTSVIIDEVLAHR 134
Query: 142 LGLVPINADPRLFEYPDNAGENNEKNSIVFKLHVHCKKGQPRITVLSDQLKWLPNGSELI 201
+GL+PI ADPRLFEY + NEKN+IVFKLHV C K +PR+ VL+ LKWLPNGSEL+
Sbjct: 135 MGLIPIAADPRLFEYLSEHDQANEKNTIVFKLHVKCPKNRPRLKVLTSDLKWLPNGSELL 194
Query: 202 AEDTKPNTDSKPKTFTSFTCSQDSLPEFSSNPPGPRYLDIIIDKLGPGQEIELEAHAVKG 261
E N SKPKT+TSF+CSQDSLPEF++NP P LDI+I KL PGQEIELEAHAVKG
Sbjct: 195 RESE--NKTSKPKTYTSFSCSQDSLPEFANNPITPCDLDILIAKLAPGQEIELEAHAVKG 252
Query: 262 IGKTHAKWSPVSTAWYRMLPEVVLLEDVQDELAEELKNKCPLKVFDIEDIGKGKRRAKVA 321
IGKTHAKWSPV TAWYRM PEVVL +V+DELAE L N CP VFDIED+GKGK+RA VA
Sbjct: 253 IGKTHAKWSPVGTAWYRMHPEVVLRGEVEDELAERLVNVCPQNVFDIEDMGKGKKRATVA 312
Query: 322 RPRDCTLCRECIRGGKEWEDRVSLRRVKDHFIFTVESTGALPPELLFTEAVKILEDKCER 381
+PR CTLC+EC+R D V L VK+HFIF +ESTG+LPPE+LFTEAVKILE KCE
Sbjct: 313 QPRKCTLCKECVRDDDL-VDHVDLGSVKNHFIFNIESTGSLPPEVLFTEAVKILEAKCEA 371
Query: 382 V 382
+
Sbjct: 372 I 372
>AT1G60850.3 | Symbols: ATRPAC42, AAC42 | DNA-directed RNA
polymerase family protein | chr1:22398588-22400155
REVERSE LENGTH=302
Length = 302
Score = 338 bits (867), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 168/262 (64%), Positives = 202/262 (77%), Gaps = 3/262 (1%)
Query: 23 IMNLESVPQKLPPHLELLRTRVLCNVDAPQHTDTIQYSGAY-AALGVDNSLRFDDFCKNF 81
I +L VP LPPHL+ +TRV+ +AP HT + YSG Y ++ D++++ +F NF
Sbjct: 15 IDDLPDVPAGLPPHLKAQQTRVVSKNNAPAHTASAIYSGTYVSSTEEDDNVKLGNFYDNF 74
Query: 82 KVEVKRLTDDEMEFDMIGIDPAIANAFRRILISEVPTMALERVYIANNTSLVQDEVLAHR 141
KV+V LT +MEFDMIGID A ANAFRRILI+EVP+MA+E+V IA NTS++ DEVLAHR
Sbjct: 75 KVDVVSLTKTDMEFDMIGIDAAFANAFRRILIAEVPSMAIEKVLIAYNTSVIIDEVLAHR 134
Query: 142 LGLVPINADPRLFEYPDNAGENNEKNSIVFKLHVHCKKGQPRITVLSDQLKWLPNGSELI 201
+GL+PI ADPRLFEY + NEKN+IVFKLHV C K +PR+ VL+ LKWLPNGSEL+
Sbjct: 135 MGLIPIAADPRLFEYLSEHDQANEKNTIVFKLHVKCPKNRPRLKVLTSDLKWLPNGSELL 194
Query: 202 AEDTKPNTDSKPKTFTSFTCSQDSLPEFSSNPPGPRYLDIIIDKLGPGQEIELEAHAVKG 261
E N SKPKT+TSF+CSQDSLPEF++NP P LDI+I KL PGQEIELEAHAVKG
Sbjct: 195 RESE--NKTSKPKTYTSFSCSQDSLPEFANNPITPCDLDILIAKLAPGQEIELEAHAVKG 252
Query: 262 IGKTHAKWSPVSTAWYRMLPEV 283
IGKTHAKWSPV TAWYRM PEV
Sbjct: 253 IGKTHAKWSPVGTAWYRMHPEV 274
>AT2G15430.1 | Symbols: RBP36A, RPB35.5A, NRPB3, NRPD3, NRPE3A |
DNA-directed RNA polymerase family protein |
chr2:6733661-6735482 FORWARD LENGTH=319
Length = 319
Score = 139 bits (351), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 106/313 (33%), Positives = 160/313 (51%), Gaps = 39/313 (12%)
Query: 82 KVEVKRLTDDEMEFDMIGIDPAIANAFRRILISEVPTMALERVYIANNTSLVQDEVLAHR 141
K++++ L DD +F++ D ++ANA RR++ISEVPT+A++ V I N+S++ DE +AHR
Sbjct: 11 KIKIRELKDDYAKFELRETDVSMANALRRVMISEVPTVAIDLVEIEVNSSVLNDEFIAHR 70
Query: 142 LGLVPINADPRL---FEYPDNAGENN---EKNSIVFKLHVHCKKGQPRITVLSDQLKWLP 195
LGL+P+ ++ + F +A + + E S+ F+L C Q + V S L
Sbjct: 71 LGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFRLSSKCVTDQT-LDVTSRDLY--- 126
Query: 196 NGSELIAEDTKPNTDSKPKTFTSFTCSQDSLPEFSSNPPGPRYLDIIIDKLGPGQEIELE 255
S T T + DS SS G III KL GQE++L
Sbjct: 127 ---------------SADPTVTPVDFTIDSSVSDSSEHKG-----IIIVKLRRGQELKLR 166
Query: 256 AHAVKGIGKTHAKWSPVSTAWYRMLPEVVLLEDVQDELAEE----LKNKCPLKVFDIEDI 311
A A KGIGK HAKWSP +T + P++++ ED+ D L++E L P KVF ++ +
Sbjct: 167 AIARKGIGKDHAKWSPAATVTFMYEPDIIINEDMMDTLSDEEKIDLIESSPTKVFGMDPV 226
Query: 312 GKGKRRAKVARPRDCTLCRECIRGGKEWE--DRVSLRRVKDHFIFTVESTGALPPELLFT 369
R+ V P T E I+ + + + D FIFTVESTGA+ L
Sbjct: 227 ---TRQVVVVDPEAYTYDEEVIKKAEAMGKPGLIEISPKDDSFIFTVESTGAVKASQLVL 283
Query: 370 EAVKILEDKCERV 382
A+ +L+ K + V
Sbjct: 284 NAIDLLKQKLDAV 296
>AT2G15400.1 | Symbols: RBP36B, NRPE3B | DNA-directed RNA polymerase
family protein | chr2:6713022-6714386 FORWARD LENGTH=319
Length = 319
Score = 126 bits (316), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 103/312 (33%), Positives = 156/312 (50%), Gaps = 39/312 (12%)
Query: 83 VEVKRLTDDEMEFDMIGIDPAIANAFRRILISEVPTMALERVYIANNTSLVQDEVLAHRL 142
V+++ L DD +F++ D ++ANA RR++ISEVPTMA+ V I N+S++ DE +A RL
Sbjct: 12 VKIRELKDDYAKFELRETDVSMANALRRVMISEVPTMAIHLVKIEVNSSVLNDEFIAQRL 71
Query: 143 GLVPINADPRLF-----EYPD-NAGENNEKNSIVFKLHVHCKKGQPRITVLSDQLKWLPN 196
L+P+ ++ + + D N E+ E S+ F L C ++DQ
Sbjct: 72 SLIPLTSERAMSMRFCQDCEDCNGDEHCEFCSVEFPLSAKC---------VTDQ------ 116
Query: 197 GSELIAEDTKPNTDSKPKTFTSFTCSQDSLPEFSSNPPGPRYLDIIIDKLGPGQEIELEA 256
+ T + S T T + +S SS G III KL GQE++L+A
Sbjct: 117 ----TLDVTSRDLYSADPTVTPVDFTSNSSTSDSSEHKG-----IIIAKLRRGQELKLKA 167
Query: 257 HAVKGIGKTHAKWSPVSTAWYRMLPEVVLLEDVQDELAEE----LKNKCPLKVFDIEDIG 312
A KGIGK HAKWSP +T Y P++++ E++ + L +E L P KVF I+ +
Sbjct: 168 LARKGIGKDHAKWSPAATVTYMYEPDIIINEEMMNTLTDEEKIDLIESSPTKVFGIDPVT 227
Query: 313 KGKRRAKVARPRDCTLCRECIRGGKEWE--DRVSLRRVKDHFIFTVESTGALPPELLFTE 370
+ V P T E I+ + + + D F+FTVESTGAL L
Sbjct: 228 G---QVVVVDPEAYTYDEEVIKKAEAMGKPGLIEIHPKHDSFVFTVESTGALKASQLVLN 284
Query: 371 AVKILEDKCERV 382
A+ IL+ K + +
Sbjct: 285 AIDILKQKLDAI 296
>AT2G15400.2 | Symbols: RBP36B, NRPE3B | DNA-directed RNA polymerase
family protein | chr2:6713022-6713918 FORWARD LENGTH=235
Length = 235
Score = 106 bits (264), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 83/237 (35%), Positives = 122/237 (51%), Gaps = 34/237 (14%)
Query: 83 VEVKRLTDDEMEFDMIGIDPAIANAFRRILISEVPTMALERVYIANNTSLVQDEVLAHRL 142
V+++ L DD +F++ D ++ANA RR++ISEVPTMA+ V I N+S++ DE +A RL
Sbjct: 12 VKIRELKDDYAKFELRETDVSMANALRRVMISEVPTMAIHLVKIEVNSSVLNDEFIAQRL 71
Query: 143 GLVPINADPRLF-----EYPD-NAGENNEKNSIVFKLHVHCKKGQPRITVLSDQLKWLPN 196
L+P+ ++ + + D N E+ E S+ F L C Q + V S L
Sbjct: 72 SLIPLTSERAMSMRFCQDCEDCNGDEHCEFCSVEFPLSAKCVTDQT-LDVTSRDLY---- 126
Query: 197 GSELIAEDTKPNTDSKPKTFTSFTCSQDSLPEFSSNPPGPRYLDIIIDKLGPGQEIELEA 256
S T T + +S SS G III KL GQE++L+A
Sbjct: 127 --------------SADPTVTPVDFTSNSSTSDSSEHKG-----IIIAKLRRGQELKLKA 167
Query: 257 HAVKGIGKTHAKWSPVSTAWYRMLPEVVLLEDVQDELAEE----LKNKCPLKVFDIE 309
A KGIGK HAKWSP +T Y P++++ E++ + L +E L P KVF I+
Sbjct: 168 LARKGIGKDHAKWSPAATVTYMYEPDIIINEEMMNTLTDEEKIDLIESSPTKVFGID 224