Miyakogusa Predicted Gene

Lj6g3v0727740.2
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj6g3v0727740.2 Non Chatacterized Hit- tr|I1N089|I1N089_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.3490 PE=,84.28,0,no
description,NULL; no description,DNA-directed RNA polymerase, insert
domain; RNA_POL_D_30KD,DNA-d,CUFF.58190.2
         (387 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G60620.1 | Symbols: ATRPAC43, RPAC43 | RNA polymerase I subun...   518   e-147
AT1G60850.1 | Symbols: ATRPAC42, AAC42 | DNA-directed RNA polyme...   461   e-130
AT1G60850.2 | Symbols: ATRPAC42, AAC42 | DNA-directed RNA polyme...   461   e-130
AT1G60850.3 | Symbols: ATRPAC42, AAC42 | DNA-directed RNA polyme...   338   4e-93
AT2G15430.1 | Symbols: RBP36A, RPB35.5A, NRPB3, NRPD3, NRPE3A | ...   139   3e-33
AT2G15400.1 | Symbols: RBP36B, NRPE3B | DNA-directed RNA polymer...   126   3e-29
AT2G15400.2 | Symbols: RBP36B, NRPE3B | DNA-directed RNA polymer...   106   3e-23

>AT1G60620.1 | Symbols: ATRPAC43, RPAC43 | RNA polymerase I subunit
           43 | chr1:22331225-22333370 FORWARD LENGTH=385
          Length = 385

 Score =  518 bits (1335), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 248/365 (67%), Positives = 297/365 (81%), Gaps = 2/365 (0%)

Query: 23  IMNLESVPQKLPPHLELLRTRVLCNVDAPQHTDTIQYSGAYAALGVDNSLRFDDFCKNFK 82
           I +L  VP  LPPHLEL RTRV+C  D+  H   I +SGAY+++GVDNS+R ++F ++FK
Sbjct: 23  IFDLPDVPTGLPPHLELQRTRVVCKKDSNIHPTAITFSGAYSSMGVDNSVRLENFSEDFK 82

Query: 83  VEVKRLTDDEMEFDMIGIDPAIANAFRRILISEVPTMALERVYIANNTSLVQDEVLAHRL 142
           V+V  LT+ +M FDMIG+   IANAFRRIL++E+P+MA+E+VY+ANNTS++QDEVLAHRL
Sbjct: 83  VDVISLTETDMVFDMIGVHAGIANAFRRILLAELPSMAIEKVYVANNTSVIQDEVLAHRL 142

Query: 143 GLVPINADPRLFEYPDNAGENNEKNSIVFKLHVHCKKGQPRITVLSDQLKWLPNGSELIA 202
           GL+PI ADPRLFEY     + NEKN+IVFKLHV C KG PR  VL+ +LKWLPNGSELI 
Sbjct: 143 GLIPIAADPRLFEYLSENDQPNEKNTIVFKLHVKCLKGDPRRKVLTSELKWLPNGSELIK 202

Query: 203 EDTKPNTDSKPKTFTSFTCSQDSLPEFSSNPPGPRYLDIIIDKLGPGQEIELEAHAVKGI 262
           E     T   PKT+TSF  SQDS PEF+ NP  P   DI+I KLGPGQEIELEAHAVKGI
Sbjct: 203 ESGGSTT--TPKTYTSFNHSQDSFPEFAENPIRPTLKDILIAKLGPGQEIELEAHAVKGI 260

Query: 263 GKTHAKWSPVSTAWYRMLPEVVLLEDVQDELAEELKNKCPLKVFDIEDIGKGKRRAKVAR 322
           GKTHAKWSPV+TAWYRMLPEVVLL++ + + AEEL   CP KVFDIED+G+G++RA VAR
Sbjct: 261 GKTHAKWSPVATAWYRMLPEVVLLKEFEGKHAEELVKVCPKKVFDIEDMGQGRKRATVAR 320

Query: 323 PRDCTLCRECIRGGKEWEDRVSLRRVKDHFIFTVESTGALPPELLFTEAVKILEDKCERV 382
           PRDC+LCRECIR G EWED+V LRRVK+HFIFT+ESTG+ PPE+LF EAVKILEDKCERV
Sbjct: 321 PRDCSLCRECIRDGVEWEDQVDLRRVKNHFIFTIESTGSQPPEVLFNEAVKILEDKCERV 380

Query: 383 ITELS 387
           I+ELS
Sbjct: 381 ISELS 385


>AT1G60850.1 | Symbols: ATRPAC42, AAC42 | DNA-directed RNA
           polymerase family protein | chr1:22398078-22400155
           REVERSE LENGTH=375
          Length = 375

 Score =  461 bits (1186), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 233/361 (64%), Positives = 279/361 (77%), Gaps = 4/361 (1%)

Query: 23  IMNLESVPQKLPPHLELLRTRVLCNVDAPQHTDTIQYSGAY-AALGVDNSLRFDDFCKNF 81
           I +L  VP  LPPHL+  +TRV+   +AP HT +  YSG Y ++   D++++  +F  NF
Sbjct: 15  IDDLPDVPAGLPPHLKAQQTRVVSKNNAPAHTASAIYSGTYVSSTEEDDNVKLGNFYDNF 74

Query: 82  KVEVKRLTDDEMEFDMIGIDPAIANAFRRILISEVPTMALERVYIANNTSLVQDEVLAHR 141
           KV+V  LT  +MEFDMIGID A ANAFRRILI+EVP+MA+E+V IA NTS++ DEVLAHR
Sbjct: 75  KVDVVSLTKTDMEFDMIGIDAAFANAFRRILIAEVPSMAIEKVLIAYNTSVIIDEVLAHR 134

Query: 142 LGLVPINADPRLFEYPDNAGENNEKNSIVFKLHVHCKKGQPRITVLSDQLKWLPNGSELI 201
           +GL+PI ADPRLFEY     + NEKN+IVFKLHV C K +PR+ VL+  LKWLPNGSEL+
Sbjct: 135 MGLIPIAADPRLFEYLSEHDQANEKNTIVFKLHVKCPKNRPRLKVLTSDLKWLPNGSELL 194

Query: 202 AEDTKPNTDSKPKTFTSFTCSQDSLPEFSSNPPGPRYLDIIIDKLGPGQEIELEAHAVKG 261
            E    N  SKPKT+TSF+CSQDSLPEF++NP  P  LDI+I KL PGQEIELEAHAVKG
Sbjct: 195 RESE--NKTSKPKTYTSFSCSQDSLPEFANNPITPCDLDILIAKLAPGQEIELEAHAVKG 252

Query: 262 IGKTHAKWSPVSTAWYRMLPEVVLLEDVQDELAEELKNKCPLKVFDIEDIGKGKRRAKVA 321
           IGKTHAKWSPV TAWYRM PEVVL  +V+DELAE L N CP  VFDIED+GKGK+RA VA
Sbjct: 253 IGKTHAKWSPVGTAWYRMHPEVVLRGEVEDELAERLVNVCPQNVFDIEDMGKGKKRATVA 312

Query: 322 RPRDCTLCRECIRGGKEWEDRVSLRRVKDHFIFTVESTGALPPELLFTEAVKILEDKCER 381
           +PR CTLC+EC+R      D V L  VK+HFIF +ESTG+LPPE+LFTEAVKILE KCE 
Sbjct: 313 QPRKCTLCKECVRDDDL-VDHVDLGSVKNHFIFNIESTGSLPPEVLFTEAVKILEAKCEA 371

Query: 382 V 382
           +
Sbjct: 372 I 372


>AT1G60850.2 | Symbols: ATRPAC42, AAC42 | DNA-directed RNA
           polymerase family protein | chr1:22398078-22400155
           REVERSE LENGTH=375
          Length = 375

 Score =  461 bits (1186), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 233/361 (64%), Positives = 279/361 (77%), Gaps = 4/361 (1%)

Query: 23  IMNLESVPQKLPPHLELLRTRVLCNVDAPQHTDTIQYSGAY-AALGVDNSLRFDDFCKNF 81
           I +L  VP  LPPHL+  +TRV+   +AP HT +  YSG Y ++   D++++  +F  NF
Sbjct: 15  IDDLPDVPAGLPPHLKAQQTRVVSKNNAPAHTASAIYSGTYVSSTEEDDNVKLGNFYDNF 74

Query: 82  KVEVKRLTDDEMEFDMIGIDPAIANAFRRILISEVPTMALERVYIANNTSLVQDEVLAHR 141
           KV+V  LT  +MEFDMIGID A ANAFRRILI+EVP+MA+E+V IA NTS++ DEVLAHR
Sbjct: 75  KVDVVSLTKTDMEFDMIGIDAAFANAFRRILIAEVPSMAIEKVLIAYNTSVIIDEVLAHR 134

Query: 142 LGLVPINADPRLFEYPDNAGENNEKNSIVFKLHVHCKKGQPRITVLSDQLKWLPNGSELI 201
           +GL+PI ADPRLFEY     + NEKN+IVFKLHV C K +PR+ VL+  LKWLPNGSEL+
Sbjct: 135 MGLIPIAADPRLFEYLSEHDQANEKNTIVFKLHVKCPKNRPRLKVLTSDLKWLPNGSELL 194

Query: 202 AEDTKPNTDSKPKTFTSFTCSQDSLPEFSSNPPGPRYLDIIIDKLGPGQEIELEAHAVKG 261
            E    N  SKPKT+TSF+CSQDSLPEF++NP  P  LDI+I KL PGQEIELEAHAVKG
Sbjct: 195 RESE--NKTSKPKTYTSFSCSQDSLPEFANNPITPCDLDILIAKLAPGQEIELEAHAVKG 252

Query: 262 IGKTHAKWSPVSTAWYRMLPEVVLLEDVQDELAEELKNKCPLKVFDIEDIGKGKRRAKVA 321
           IGKTHAKWSPV TAWYRM PEVVL  +V+DELAE L N CP  VFDIED+GKGK+RA VA
Sbjct: 253 IGKTHAKWSPVGTAWYRMHPEVVLRGEVEDELAERLVNVCPQNVFDIEDMGKGKKRATVA 312

Query: 322 RPRDCTLCRECIRGGKEWEDRVSLRRVKDHFIFTVESTGALPPELLFTEAVKILEDKCER 381
           +PR CTLC+EC+R      D V L  VK+HFIF +ESTG+LPPE+LFTEAVKILE KCE 
Sbjct: 313 QPRKCTLCKECVRDDDL-VDHVDLGSVKNHFIFNIESTGSLPPEVLFTEAVKILEAKCEA 371

Query: 382 V 382
           +
Sbjct: 372 I 372


>AT1G60850.3 | Symbols: ATRPAC42, AAC42 | DNA-directed RNA
           polymerase family protein | chr1:22398588-22400155
           REVERSE LENGTH=302
          Length = 302

 Score =  338 bits (867), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 168/262 (64%), Positives = 202/262 (77%), Gaps = 3/262 (1%)

Query: 23  IMNLESVPQKLPPHLELLRTRVLCNVDAPQHTDTIQYSGAY-AALGVDNSLRFDDFCKNF 81
           I +L  VP  LPPHL+  +TRV+   +AP HT +  YSG Y ++   D++++  +F  NF
Sbjct: 15  IDDLPDVPAGLPPHLKAQQTRVVSKNNAPAHTASAIYSGTYVSSTEEDDNVKLGNFYDNF 74

Query: 82  KVEVKRLTDDEMEFDMIGIDPAIANAFRRILISEVPTMALERVYIANNTSLVQDEVLAHR 141
           KV+V  LT  +MEFDMIGID A ANAFRRILI+EVP+MA+E+V IA NTS++ DEVLAHR
Sbjct: 75  KVDVVSLTKTDMEFDMIGIDAAFANAFRRILIAEVPSMAIEKVLIAYNTSVIIDEVLAHR 134

Query: 142 LGLVPINADPRLFEYPDNAGENNEKNSIVFKLHVHCKKGQPRITVLSDQLKWLPNGSELI 201
           +GL+PI ADPRLFEY     + NEKN+IVFKLHV C K +PR+ VL+  LKWLPNGSEL+
Sbjct: 135 MGLIPIAADPRLFEYLSEHDQANEKNTIVFKLHVKCPKNRPRLKVLTSDLKWLPNGSELL 194

Query: 202 AEDTKPNTDSKPKTFTSFTCSQDSLPEFSSNPPGPRYLDIIIDKLGPGQEIELEAHAVKG 261
            E    N  SKPKT+TSF+CSQDSLPEF++NP  P  LDI+I KL PGQEIELEAHAVKG
Sbjct: 195 RESE--NKTSKPKTYTSFSCSQDSLPEFANNPITPCDLDILIAKLAPGQEIELEAHAVKG 252

Query: 262 IGKTHAKWSPVSTAWYRMLPEV 283
           IGKTHAKWSPV TAWYRM PEV
Sbjct: 253 IGKTHAKWSPVGTAWYRMHPEV 274


>AT2G15430.1 | Symbols: RBP36A, RPB35.5A, NRPB3, NRPD3, NRPE3A |
           DNA-directed RNA polymerase family protein |
           chr2:6733661-6735482 FORWARD LENGTH=319
          Length = 319

 Score =  139 bits (351), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 106/313 (33%), Positives = 160/313 (51%), Gaps = 39/313 (12%)

Query: 82  KVEVKRLTDDEMEFDMIGIDPAIANAFRRILISEVPTMALERVYIANNTSLVQDEVLAHR 141
           K++++ L DD  +F++   D ++ANA RR++ISEVPT+A++ V I  N+S++ DE +AHR
Sbjct: 11  KIKIRELKDDYAKFELRETDVSMANALRRVMISEVPTVAIDLVEIEVNSSVLNDEFIAHR 70

Query: 142 LGLVPINADPRL---FEYPDNAGENN---EKNSIVFKLHVHCKKGQPRITVLSDQLKWLP 195
           LGL+P+ ++  +   F    +A + +   E  S+ F+L   C   Q  + V S  L    
Sbjct: 71  LGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFRLSSKCVTDQT-LDVTSRDLY--- 126

Query: 196 NGSELIAEDTKPNTDSKPKTFTSFTCSQDSLPEFSSNPPGPRYLDIIIDKLGPGQEIELE 255
                          S   T T    + DS    SS   G     III KL  GQE++L 
Sbjct: 127 ---------------SADPTVTPVDFTIDSSVSDSSEHKG-----IIIVKLRRGQELKLR 166

Query: 256 AHAVKGIGKTHAKWSPVSTAWYRMLPEVVLLEDVQDELAEE----LKNKCPLKVFDIEDI 311
           A A KGIGK HAKWSP +T  +   P++++ ED+ D L++E    L    P KVF ++ +
Sbjct: 167 AIARKGIGKDHAKWSPAATVTFMYEPDIIINEDMMDTLSDEEKIDLIESSPTKVFGMDPV 226

Query: 312 GKGKRRAKVARPRDCTLCRECIRGGKEWE--DRVSLRRVKDHFIFTVESTGALPPELLFT 369
               R+  V  P   T   E I+  +       + +    D FIFTVESTGA+    L  
Sbjct: 227 ---TRQVVVVDPEAYTYDEEVIKKAEAMGKPGLIEISPKDDSFIFTVESTGAVKASQLVL 283

Query: 370 EAVKILEDKCERV 382
            A+ +L+ K + V
Sbjct: 284 NAIDLLKQKLDAV 296


>AT2G15400.1 | Symbols: RBP36B, NRPE3B | DNA-directed RNA polymerase
           family protein | chr2:6713022-6714386 FORWARD LENGTH=319
          Length = 319

 Score =  126 bits (316), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 103/312 (33%), Positives = 156/312 (50%), Gaps = 39/312 (12%)

Query: 83  VEVKRLTDDEMEFDMIGIDPAIANAFRRILISEVPTMALERVYIANNTSLVQDEVLAHRL 142
           V+++ L DD  +F++   D ++ANA RR++ISEVPTMA+  V I  N+S++ DE +A RL
Sbjct: 12  VKIRELKDDYAKFELRETDVSMANALRRVMISEVPTMAIHLVKIEVNSSVLNDEFIAQRL 71

Query: 143 GLVPINADPRLF-----EYPD-NAGENNEKNSIVFKLHVHCKKGQPRITVLSDQLKWLPN 196
            L+P+ ++  +      +  D N  E+ E  S+ F L   C         ++DQ      
Sbjct: 72  SLIPLTSERAMSMRFCQDCEDCNGDEHCEFCSVEFPLSAKC---------VTDQ------ 116

Query: 197 GSELIAEDTKPNTDSKPKTFTSFTCSQDSLPEFSSNPPGPRYLDIIIDKLGPGQEIELEA 256
                 + T  +  S   T T    + +S    SS   G     III KL  GQE++L+A
Sbjct: 117 ----TLDVTSRDLYSADPTVTPVDFTSNSSTSDSSEHKG-----IIIAKLRRGQELKLKA 167

Query: 257 HAVKGIGKTHAKWSPVSTAWYRMLPEVVLLEDVQDELAEE----LKNKCPLKVFDIEDIG 312
            A KGIGK HAKWSP +T  Y   P++++ E++ + L +E    L    P KVF I+ + 
Sbjct: 168 LARKGIGKDHAKWSPAATVTYMYEPDIIINEEMMNTLTDEEKIDLIESSPTKVFGIDPVT 227

Query: 313 KGKRRAKVARPRDCTLCRECIRGGKEWE--DRVSLRRVKDHFIFTVESTGALPPELLFTE 370
               +  V  P   T   E I+  +       + +    D F+FTVESTGAL    L   
Sbjct: 228 G---QVVVVDPEAYTYDEEVIKKAEAMGKPGLIEIHPKHDSFVFTVESTGALKASQLVLN 284

Query: 371 AVKILEDKCERV 382
           A+ IL+ K + +
Sbjct: 285 AIDILKQKLDAI 296


>AT2G15400.2 | Symbols: RBP36B, NRPE3B | DNA-directed RNA polymerase
           family protein | chr2:6713022-6713918 FORWARD LENGTH=235
          Length = 235

 Score =  106 bits (264), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 83/237 (35%), Positives = 122/237 (51%), Gaps = 34/237 (14%)

Query: 83  VEVKRLTDDEMEFDMIGIDPAIANAFRRILISEVPTMALERVYIANNTSLVQDEVLAHRL 142
           V+++ L DD  +F++   D ++ANA RR++ISEVPTMA+  V I  N+S++ DE +A RL
Sbjct: 12  VKIRELKDDYAKFELRETDVSMANALRRVMISEVPTMAIHLVKIEVNSSVLNDEFIAQRL 71

Query: 143 GLVPINADPRLF-----EYPD-NAGENNEKNSIVFKLHVHCKKGQPRITVLSDQLKWLPN 196
            L+P+ ++  +      +  D N  E+ E  S+ F L   C   Q  + V S  L     
Sbjct: 72  SLIPLTSERAMSMRFCQDCEDCNGDEHCEFCSVEFPLSAKCVTDQT-LDVTSRDLY---- 126

Query: 197 GSELIAEDTKPNTDSKPKTFTSFTCSQDSLPEFSSNPPGPRYLDIIIDKLGPGQEIELEA 256
                         S   T T    + +S    SS   G     III KL  GQE++L+A
Sbjct: 127 --------------SADPTVTPVDFTSNSSTSDSSEHKG-----IIIAKLRRGQELKLKA 167

Query: 257 HAVKGIGKTHAKWSPVSTAWYRMLPEVVLLEDVQDELAEE----LKNKCPLKVFDIE 309
            A KGIGK HAKWSP +T  Y   P++++ E++ + L +E    L    P KVF I+
Sbjct: 168 LARKGIGKDHAKWSPAATVTYMYEPDIIINEEMMNTLTDEEKIDLIESSPTKVFGID 224