Miyakogusa Predicted Gene

Lj4g3v0768700.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj4g3v0768700.1 tr|Q8S3Q2|Q8S3Q2_ORYSJ OSJNBa0011F23.5 protein
OS=Oryza sativa subsp. japonica GN=24K23.4 PE=4
SV=1,28.04,7e-19,SAGA-Tad1,Transcriptional coactivator SAGA-type
complex, Ada1/Tada1; SUBFAMILY NOT NAMED,NULL; FAMIL,CUFF.48018.1
         (307 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT2G14850.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   195   3e-50
AT4G33890.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...   177   8e-45
AT4G33890.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   177   8e-45
AT4G31440.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   136   2e-32
AT5G67410.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   129   2e-30
AT2G24530.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   129   2e-30
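
The summary table above can be read programmatically. The following is a minimal Python sketch, not part of the BLAST output itself: it assumes each hit line ends with the bit score and E-value as its last two whitespace-separated fields, and that the subject identifier is the first token. The names `Hit` and `parse_hit_line` are hypothetical helpers introduced here for illustration.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    subject_id: str
    bits: float
    evalue: float


def parse_hit_line(line: str) -> Hit:
    """Split one summary line into subject ID, bit score and E-value."""
    # The description may contain spaces, so split from the right:
    # the last two fields are the bit score and the E-value.
    description, bits, evalue = line.rsplit(None, 2)
    # Defensive assumption: treat a bare exponent such as "e-100" as "1e-100".
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return Hit(description.split()[0], float(bits), float(evalue))


# The six hit lines copied from the table above.
SUMMARY = """\
AT2G14850.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   195   3e-50
AT4G33890.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...   177   8e-45
AT4G33890.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   177   8e-45
AT4G31440.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   136   2e-32
AT5G67410.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   129   2e-30
AT2G24530.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   129   2e-30
"""

hits = [parse_hit_line(line) for line in SUMMARY.splitlines() if line.strip()]

for hit in hits:
    print(f"{hit.subject_id}\t{hit.bits:g}\t{hit.evalue:g}")
```

Splitting from the right keeps the parser independent of how the description column is truncated ("thal...").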

>AT2G14850.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G33890.2); Has 140 Blast hits to 132 proteins
           in 17 species: Archae - 0; Bacteria - 0; Metazoa - 1;
           Fungi - 2; Plants - 133; Viruses - 0; Other Eukaryotes -
           4 (source: NCBI BLink). | chr2:6386400-6387275 FORWARD
           LENGTH=291
          Length = 291

 Score =  195 bits (495), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 126/318 (39%), Positives = 179/318 (56%), Gaps = 59/318 (18%)

Query: 1   MPAARYFSPVDTLEQKLQIERKLGTVKACKYFNLLTRFLSVKISKHEFDRQCRATIGREN 60
           M + + FS +++LE K  I +K+G  +A  YF+ L +FL+ +ISK EFD+ C  T+GREN
Sbjct: 1   MGSDQCFSRLNSLEIKALIYQKIGHQRADTYFDQLGKFLTSRISKSEFDKLCSKTVGREN 60

Query: 61  IHLHNHFIRSILKKASLSK-------RGNIIGSSLNVKIPNGCNDLQFLCKDFLQSPRKV 113
           I LHN  +RSILK AS++K       + ++ G  +                 F  SPRK 
Sbjct: 61  ISLHNRLVRSILKNASVAKSPPPRYPKKSLYGDPV-----------------FPPSPRKC 103

Query: 114 RTPSLRDRRFKDRPSPLGPNGKNVNIGFEDSVREIHEQQSNKELDSAASRIPL---CVED 170
           R+     R+F+DRPSPLGP GK  ++             +N E  S A R+P+    VED
Sbjct: 104 RS-----RKFRDRPSPLGPLGKPQSL-----------TTTNDESMSKAQRLPMEVVSVED 147

Query: 171 GEEVDQDSEKVNIYMRSPIQPPLAIPTYNKG-TRTLLHNGLPSGTDTCQSIGELPDTPSL 229
           GEEV+Q +   ++  RSP+  PL +  + K   R   +NG+    +TCQS GELPD  +L
Sbjct: 148 GEEVEQMTGSPSVQSRSPLTAPLGVSFHLKSKARFSTYNGI--NRETCQSSGELPDMITL 205

Query: 230 TKRLEQKLEMEGFKISADAAALMNKALDTYLKRLIKPCLDLAASKAVNRSNGPIQPGLNE 289
             RLE+KLEMEG K+S D+A L+N+ L+ Y++RLI+PCL LA+ +    SN         
Sbjct: 206 RARLEKKLEMEGIKLSMDSANLLNRGLNAYMRRLIEPCLSLASQQKRAVSN--------- 256

Query: 290 QIGSVSVSDFRTATELNP 307
               VS+ DF  A E+NP
Sbjct: 257 ----VSMLDFHAAMEVNP 270


>AT4G33890.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G14850.1); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr4:16250057-16251085 FORWARD LENGTH=342
          Length = 342

 Score =  177 bits (449), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 128/320 (40%), Positives = 178/320 (55%), Gaps = 28/320 (8%)

Query: 8   SPVDTLEQKLQIERKLGTVKACKYFNLLTRFLSVKISKHEFDRQCRATIGRENIHLHNHF 67
           S +DTLE K  I R++G  +A  YFN L RF ++KI+K EFD+ C  TIGR+NIHLHN  
Sbjct: 8   SRLDTLEIKALIYREIGNQRAESYFNQLGRFFALKITKSEFDKLCIKTIGRQNIHLHNRL 67

Query: 68  IRSILKKASLSKRGNII--GSSLNVKIPNG----CNDLQFLCKDFLQSPRKVRTPSLRDR 121
           IRSI+K A ++K    I  G S  V+  NG     + +Q L  D   SP    T   R R
Sbjct: 68  IRSIIKNACIAKSPPFIKKGGSF-VRFGNGDSKKNSQIQPLHGDSAFSP---STRKCRSR 123

Query: 122 RFKDRPSPLGPNGK--NVNIGFEDSVREIHEQQSNKELDSAASRIPLCVEDGEEVDQDSE 179
           + +DRPSPLGP GK  ++    E+S   + + QS  EL S  SR P+ V   EE ++  +
Sbjct: 124 KLRDRPSPLGPLGKPHSLTTTNEES---MSKAQSATELLSLGSRPPVEVVSVEEGEEVEQ 180

Query: 180 KV----NIYMRSPIQPPLAIPTY--NKGTRTLLHN----GLPSGTDTCQSIGELPDTPSL 229
                 ++  R P+  PL +     N  TR  + N          +TCQ+ GELPDT +L
Sbjct: 181 IAGGSPSVQSRCPLTAPLGVSMSLRNGATRKSVSNVSMCSRSFNRETCQNNGELPDTRTL 240

Query: 230 TKRLEQKLEMEGFKISADAAALMNKALDTYLKRLIKPCLDLAASKAVNRSNGPIQPGLNE 289
             RLE++LEMEG KI+ D+ +L+N  LD +++RLI+PCL LA ++        +     +
Sbjct: 241 RSRLERRLEMEGLKITMDSVSLLNSGLDVFMRRLIEPCLSLANTRCGTDRVREMNYQYTQ 300

Query: 290 Q---IGSVSVSDFRTATELN 306
           Q   +  VS+SDFR   ELN
Sbjct: 301 QSRRLSYVSMSDFRAGMELN 320


>AT4G33890.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G14850.1); Has 133 Blast hits to 131 proteins
           in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 2; Plants - 129; Viruses - 0; Other Eukaryotes -
           2 (source: NCBI BLink). | chr4:16250057-16251085 FORWARD
           LENGTH=342
          Length = 342

 Score =  177 bits (449), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 128/320 (40%), Positives = 178/320 (55%), Gaps = 28/320 (8%)

Query: 8   SPVDTLEQKLQIERKLGTVKACKYFNLLTRFLSVKISKHEFDRQCRATIGRENIHLHNHF 67
           S +DTLE K  I R++G  +A  YFN L RF ++KI+K EFD+ C  TIGR+NIHLHN  
Sbjct: 8   SRLDTLEIKALIYREIGNQRAESYFNQLGRFFALKITKSEFDKLCIKTIGRQNIHLHNRL 67

Query: 68  IRSILKKASLSKRGNII--GSSLNVKIPNG----CNDLQFLCKDFLQSPRKVRTPSLRDR 121
           IRSI+K A ++K    I  G S  V+  NG     + +Q L  D   SP    T   R R
Sbjct: 68  IRSIIKNACIAKSPPFIKKGGSF-VRFGNGDSKKNSQIQPLHGDSAFSP---STRKCRSR 123

Query: 122 RFKDRPSPLGPNGK--NVNIGFEDSVREIHEQQSNKELDSAASRIPLCVEDGEEVDQDSE 179
           + +DRPSPLGP GK  ++    E+S   + + QS  EL S  SR P+ V   EE ++  +
Sbjct: 124 KLRDRPSPLGPLGKPHSLTTTNEES---MSKAQSATELLSLGSRPPVEVVSVEEGEEVEQ 180

Query: 180 KV----NIYMRSPIQPPLAIPTY--NKGTRTLLHN----GLPSGTDTCQSIGELPDTPSL 229
                 ++  R P+  PL +     N  TR  + N          +TCQ+ GELPDT +L
Sbjct: 181 IAGGSPSVQSRCPLTAPLGVSMSLRNGATRKSVSNVSMCSRSFNRETCQNNGELPDTRTL 240

Query: 230 TKRLEQKLEMEGFKISADAAALMNKALDTYLKRLIKPCLDLAASKAVNRSNGPIQPGLNE 289
             RLE++LEMEG KI+ D+ +L+N  LD +++RLI+PCL LA ++        +     +
Sbjct: 241 RSRLERRLEMEGLKITMDSVSLLNSGLDVFMRRLIEPCLSLANTRCGTDRVREMNYQYTQ 300

Query: 290 Q---IGSVSVSDFRTATELN 306
           Q   +  VS+SDFR   ELN
Sbjct: 301 QSRRLSYVSMSDFRAGMELN 320


>AT4G31440.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G24530.1); Has 210 Blast hits to 209 proteins
           in 55 species: Archae - 0; Bacteria - 72; Metazoa - 2;
           Fungi - 6; Plants - 128; Viruses - 0; Other Eukaryotes -
           2 (source: NCBI BLink). | chr4:15253731-15254870 FORWARD
           LENGTH=379
          Length = 379

 Score =  136 bits (343), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 108/346 (31%), Positives = 165/346 (47%), Gaps = 48/346 (13%)

Query: 10  VDTLEQKLQIERKLGTVKACKYFNLLTRFLSVKISKHEFDRQCRATIGRENIHLHNHFIR 69
           +D  E K+ I +K+G  ++ +YF  L RFLS K++K EFD+ C   +GREN+ LHN  IR
Sbjct: 9   IDLAELKVHIVKKVGVERSTRYFYYLGRFLSQKLTKSEFDKSCFRLLGRENLSLHNKLIR 68

Query: 70  SILKKASLSKRGNII------GSSLNVKIPNGCNDLQFLCKDFLQSP--------RKVRT 115
           SIL+ ASL+K    +      G SL +   +G  + + L  D +++          KVR 
Sbjct: 69  SILRNASLAKSPPSVHQSGHPGKSLVLGKEDGPEESRSLNPDHIRNDLALSNGVLAKVRP 128

Query: 116 PSLRDRRFKDRPSPLGPNGKNVN-IGFEDSVREIHEQQSNKELDSAASRIPLCVEDGEEV 174
            +  DR  +D+P PLG NGK +    +    R   E+ S     +    +    +    +
Sbjct: 129 GTCDDRTIRDKPCPLGSNGKVLGPFAYSRPGRYPDERDSAFLCPAEQKAVSGKDQVAAPI 188

Query: 175 DQDSE-KVNIYMRSPIQPPLAIPTYNK---GTRTLLHNGLPSGTDTCQSIGELPDTPSLT 230
            +D E +V I    P+  PL IP  +    G R  +     +   +C   G L DT  L 
Sbjct: 189 SRDDEAQVRILSTPPVMAPLGIPFCSASVGGDRRTVPVSTSAAAISCYDSGGLSDTEMLR 248

Query: 231 KRLEQKLEMEGF-KISADAAALMNKALDTYLKRLIKPCLDLAASKAVN------------ 277
           KR+E     +G   +SA+ + ++N  LD YLK+L+K C+DLA ++++N            
Sbjct: 249 KRMENIAVTQGLGGVSAECSIVLNNMLDLYLKKLMKSCVDLAGARSMNGTPGKHSLEKQQ 308

Query: 278 ---------RSNGPI------QPG-LNEQIGSVSVSDFRTATELNP 307
                    R+N         QP  +  +  SVS+ DFR A ELNP
Sbjct: 309 SRDELVNGVRTNNSFHIQTSNQPSDITREQHSVSLLDFRVAMELNP 354


>AT5G67410.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G14850.1); Has 1807 Blast hits to 1807 proteins
           in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
           Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
           - 339 (source: NCBI BLink). | chr5:26896600-26897463
           REVERSE LENGTH=287
          Length = 287

 Score =  129 bits (325), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 114/314 (36%), Positives = 158/314 (50%), Gaps = 53/314 (16%)

Query: 1   MPAARY-FSPVDTLEQKLQIERKLGTVKACKYFNLLTRFLSVKISKHEFDRQCRATIGRE 59
           MP +++     D  E K QIE+++G  K   Y NLL++FLS+KISK +FD+    T+ RE
Sbjct: 1   MPTSQHHVVRTDISELKSQIEKRIGRAKTESYLNLLSKFLSLKISKSDFDKLIIVTVKRE 60

Query: 60  NIHLHNHFIRSILKKASLSK------RGNIIGSSLNVKIPNGCNDLQFLCKDFLQSPRKV 113
           NI LHN  +R ILK   LSK      +  +   +   K  NG    Q LCK+  +SPRK 
Sbjct: 61  NISLHNALLRGILKNICLSKTLPPFVKNGVESDNKKKKQLNGA--FQSLCKELPRSPRKG 118

Query: 114 RTPSLRDRRFKDRPSPLGPNGKNVNIGFEDSVREIHEQQSNKELDSAASRIPLCVEDGEE 173
           RT     RR            K+ NI    S+          E+ S++ R    +E+ EE
Sbjct: 119 RT----QRRL----------NKDGNISKGKSLVT--------EVVSSSGRQQWSMENVEE 156

Query: 174 VDQDSEKVNIYMRSPIQPPLAIPTYNKGTRTLLHNGLPSGTDTCQSIGELPDTPSLTKRL 233
           VDQ    +  +   PI+ P  +       R ++       T  C S GELPD+ SL K+L
Sbjct: 157 VDQ---LIPCWRSQPIEAPFGV-----NLRDVIKKQHRIDT-CCYSSGELPDSVSLKKKL 207

Query: 234 EQKLEMEGFKISADAAALMNKALDTYLKRLIKPCLDLAASKAVNRSNGPIQPGLNEQIGS 293
           E  LE EG ++S   A  +N  LD +LKRLIKPCL+LAAS++ N S+            +
Sbjct: 208 EDDLE-EGLEVSVGFANSLNAGLDVFLKRLIKPCLELAASRSSNASS------------A 254

Query: 294 VSVSDFRTATELNP 307
            S+ DF+ A  LNP
Sbjct: 255 SSLVDFQVAMALNP 268


>AT2G24530.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G31440.1); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr2:10422597-10423820 FORWARD LENGTH=407
          Length = 407

 Score =  129 bits (324), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 116/382 (30%), Positives = 174/382 (45%), Gaps = 93/382 (24%)

Query: 10  VDTLEQKLQIERKLGTVKACKYFNLLTRFLSVKISKHEFDRQCRATIGRENIHLHNHFIR 69
           +   E K  I +K G  ++ +YF  L RFLS K++K EFD+ C   +GREN+ LHN  IR
Sbjct: 9   ISLCELKEHIVKKTGVERSRRYFYYLGRFLSQKLTKSEFDKTCLRLLGRENLSLHNQLIR 68

Query: 70  SILKKASLSK-------------------RGNIIGSSLNVKIPNGCNDLQFLCKDFLQ-S 109
           SIL+ A+++K                   RG+ +  S  + IPN            L  S
Sbjct: 69  SILRNATVAKSPPPDHEAGHSTKANAFQSRGDGLEQSGTL-IPNHSQHEPVWSNGVLPIS 127

Query: 110 PRKVRTPSLRDRRFKDRPSPLGPNGK--------------NVNIGFED-----SVREIHE 150
           PRKVR+  +++R+ +DRPSPLG NGK                ++G E+     S R + +
Sbjct: 128 PRKVRS-GMQNRKSRDRPSPLGSNGKVEHMLHQPVCREDNRGSVGMENGDYQRSGRYVAD 186

Query: 151 QQSNKELDSAAS-RIP-------LCVEDGEEVDQDSEKVNIYMRSPIQPPLAIP----TY 198
           ++  + L      RIP       + + D ++  ++  +VN+ M SP+  PL IP    + 
Sbjct: 187 EKDGEFLRPVEKPRIPNKEKIAAVSMRD-DQNQEEQARVNLSM-SPLIAPLGIPFCSASV 244

Query: 199 NKGTRTLLHNGLPSGTD----TCQSIGELPDTPSLTKRLEQKLEMEGFK-ISADAAALMN 253
               RT     +P  T+    +C   G LPD   L KR+E     +G + +S + A  +N
Sbjct: 245 GGSPRT-----IPVSTNAELISCYDSGGLPDIEMLRKRMENIAVAQGLEGVSMECAKTLN 299

Query: 254 KALDTYLKRLIKPCLDLAASKAVNRSNGPIQPG--------------------------- 286
             LD YLK+LI  C DL  +++ N   G  + G                           
Sbjct: 300 NMLDVYLKKLINSCFDLVGARSTNGDPGKQRIGKQQSQNKIVNGVWPTNSLKIQTPNGSS 359

Query: 287 -LNEQIGSVSVSDFRTATELNP 307
            + +   SVS+ DFRTA ELNP
Sbjct: 360 DIRQDHHSVSMLDFRTAMELNP 381
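

The percentages on the "Identities / Positives / Gaps" lines above can be checked against the raw counts. The sketch below is an illustration only: the truncating (round-down) rule is an assumption inferred from the figures in this report, which it reproduces for all six alignments, and `blast_percent` is a hypothetical helper name.

```python
def blast_percent(count: int, alignment_length: int) -> int:
    """Integer percentage, truncated rather than rounded (assumed rule)."""
    return (100 * count) // alignment_length


# (subject, identities, positives, gaps, alignment length), copied from the report.
ALIGNMENTS = [
    ("AT2G14850.1", 126, 179, 59, 318),
    ("AT4G33890.2", 128, 178, 28, 320),
    ("AT4G33890.1", 128, 178, 28, 320),
    ("AT4G31440.1", 108, 165, 48, 346),
    ("AT5G67410.1", 114, 158, 53, 314),
    ("AT2G24530.1", 116, 174, 93, 382),
]

percentages = {
    subject: tuple(blast_percent(n, length) for n in (ident, pos, gaps))
    for subject, ident, pos, gaps, length in ALIGNMENTS
}

for subject, (ident_pct, pos_pct, gap_pct) in percentages.items():
    print(f"{subject}: identities {ident_pct}%, positives {pos_pct}%, gaps {gap_pct}%")
```

For the top hit, 126/318 is about 39.6%, which the report prints as 39%; ordinary rounding would give 40%, so truncation is the simpler explanation for the printed values.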