Miyakogusa Predicted Gene

Lj1g3v2938400.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v2938400.1 Non Chatacterized Hit- tr|B8ARW8|B8ARW8_ORYSI
Putative uncharacterized protein OS=Oryza sativa
subsp,30.72,0.000000000005,SAGA-Tad1,Transcriptional coactivator
SAGA-type complex, Ada1/Tada1; SUBFAMILY NOT NAMED,NULL;
FAMIL,NODE_70194_length_1171_cov_35.951324.path1.1
         (327 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT2G14850.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   213   1e-55
AT4G33890.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...   200   1e-51
AT4G33890.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   200   1e-51
AT4G31440.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   152   3e-37
AT5G67410.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   147   7e-36
AT2G24530.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   140   1e-33

>AT2G14850.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G33890.2); Has 140 Blast hits to 132 proteins
           in 17 species: Archae - 0; Bacteria - 0; Metazoa - 1;
           Fungi - 2; Plants - 133; Viruses - 0; Other Eukaryotes -
           4 (source: NCBI BLink). | chr2:6386400-6387275 FORWARD
           LENGTH=291
          Length = 291

 Score =  213 bits (543), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 135/327 (41%), Positives = 191/327 (58%), Gaps = 49/327 (14%)

Query: 1   MPAARYFSPVDTLEQKLQIERKLGTVKACKYFNLLTRFLSVKISKHEFDRQCRATIGREN 60
           M + + FS +++LE K  I +K+G  +A  YF+ L +FL+ +ISK EFD+ C  T+GREN
Sbjct: 1   MGSDQCFSRLNSLEIKALIYQKIGHQRADTYFDQLGKFLTSRISKSEFDKLCSKTVGREN 60

Query: 61  IHLHNHFIRSILKKASLSKRGNIIGSSLNVKIPNGCNDLQFLCKD--FLQSPRKVRTPSL 118
           I LHN  +RSILK AS++K       S   + P      + L  D  F  SPRK R+   
Sbjct: 61  ISLHNRLVRSILKNASVAK-------SPPPRYPK-----KSLYGDPVFPPSPRKCRS--- 105

Query: 119 RDRRFKDRPSPLGPNGKNVNIGFEDSVREIHEQQSNKELDSAASRIPL---CVEDGEEVD 175
             R+F+DRPSPLGP GK  ++             +N E  S A R+P+    VEDGEEV+
Sbjct: 106 --RKFRDRPSPLGPLGKPQSL-----------TTTNDESMSKAQRLPMEVVSVEDGEEVE 152

Query: 176 QDSEKVNIYMRSPIQPPLAIPTYNKG-TRTLLHNGLPSGTDTCQSIGELPDTPSLTKRLE 234
           Q +   ++  RSP+  PL +  + K   R   +NG+    +TCQS GELPD  +L  RLE
Sbjct: 153 QMTGSPSVQSRSPLTAPLGVSFHLKSKARFSTYNGI--NRETCQSSGELPDMITLRARLE 210

Query: 235 QKLEMEGFKISADAAALMNKALDTYLKRLIKPCLDLAASKAVNRSNGPIQPGLNEQIGSV 294
           +KLEMEG K+S D+A L+N+ L+ Y++RLI+PCL LA+ +    SN             V
Sbjct: 211 KKLEMEGIKLSMDSANLLNRGLNAYMRRLIEPCLSLASQQKRAVSN-------------V 257

Query: 295 SVSDFRTATELNPNILGKDWSLHLEKV 321
           S+ DF  A E+NP +LG++W + LEK+
Sbjct: 258 SMLDFHAAMEVNPRVLGEEWPIQLEKI 284


>AT4G33890.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G14850.1); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr4:16250057-16251085 FORWARD LENGTH=342
          Length = 342

 Score =  200 bits (508), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 136/335 (40%), Positives = 190/335 (56%), Gaps = 28/335 (8%)

Query: 8   SPVDTLEQKLQIERKLGTVKACKYFNLLTRFLSVKISKHEFDRQCRATIGRENIHLHNHF 67
           S +DTLE K  I R++G  +A  YFN L RF ++KI+K EFD+ C  TIGR+NIHLHN  
Sbjct: 8   SRLDTLEIKALIYREIGNQRAESYFNQLGRFFALKITKSEFDKLCIKTIGRQNIHLHNRL 67

Query: 68  IRSILKKASLSKRGNII--GSSLNVKIPNG----CNDLQFLCKDFLQSPRKVRTPSLRDR 121
           IRSI+K A ++K    I  G S  V+  NG     + +Q L  D   SP    T   R R
Sbjct: 68  IRSIIKNACIAKSPPFIKKGGSF-VRFGNGDSKKNSQIQPLHGDSAFSP---STRKCRSR 123

Query: 122 RFKDRPSPLGPNGK--NVNIGFEDSVREIHEQQSNKELDSAASRIPLCVEDGEEVDQDSE 179
           + +DRPSPLGP GK  ++    E+S   + + QS  EL S  SR P+ V   EE ++  +
Sbjct: 124 KLRDRPSPLGPLGKPHSLTTTNEES---MSKAQSATELLSLGSRPPVEVVSVEEGEEVEQ 180

Query: 180 KV----NIYMRSPIQPPLAIPTY--NKGTRTLLHN----GLPSGTDTCQSIGELPDTPSL 229
                 ++  R P+  PL +     N  TR  + N          +TCQ+ GELPDT +L
Sbjct: 181 IAGGSPSVQSRCPLTAPLGVSMSLRNGATRKSVSNVSMCSRSFNRETCQNNGELPDTRTL 240

Query: 230 TKRLEQKLEMEGFKISADAAALMNKALDTYLKRLIKPCLDLAASKAVNRSNGPIQPGLNE 289
             RLE++LEMEG KI+ D+ +L+N  LD +++RLI+PCL LA ++        +     +
Sbjct: 241 RSRLERRLEMEGLKITMDSVSLLNSGLDVFMRRLIEPCLSLANTRCGTDRVREMNYQYTQ 300

Query: 290 Q---IGSVSVSDFRTATELNPNILGKDWSLHLEKV 321
           Q   +  VS+SDFR   ELN  ILG+DW +H+EK+
Sbjct: 301 QSRRLSYVSMSDFRAGMELNTEILGEDWPMHMEKI 335


>AT4G33890.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G14850.1); Has 133 Blast hits to 131 proteins
           in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 2; Plants - 129; Viruses - 0; Other Eukaryotes -
           2 (source: NCBI BLink). | chr4:16250057-16251085 FORWARD
           LENGTH=342
          Length = 342

 Score =  200 bits (508), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 136/335 (40%), Positives = 190/335 (56%), Gaps = 28/335 (8%)

Query: 8   SPVDTLEQKLQIERKLGTVKACKYFNLLTRFLSVKISKHEFDRQCRATIGRENIHLHNHF 67
           S +DTLE K  I R++G  +A  YFN L RF ++KI+K EFD+ C  TIGR+NIHLHN  
Sbjct: 8   SRLDTLEIKALIYREIGNQRAESYFNQLGRFFALKITKSEFDKLCIKTIGRQNIHLHNRL 67

Query: 68  IRSILKKASLSKRGNII--GSSLNVKIPNG----CNDLQFLCKDFLQSPRKVRTPSLRDR 121
           IRSI+K A ++K    I  G S  V+  NG     + +Q L  D   SP    T   R R
Sbjct: 68  IRSIIKNACIAKSPPFIKKGGSF-VRFGNGDSKKNSQIQPLHGDSAFSP---STRKCRSR 123

Query: 122 RFKDRPSPLGPNGK--NVNIGFEDSVREIHEQQSNKELDSAASRIPLCVEDGEEVDQDSE 179
           + +DRPSPLGP GK  ++    E+S   + + QS  EL S  SR P+ V   EE ++  +
Sbjct: 124 KLRDRPSPLGPLGKPHSLTTTNEES---MSKAQSATELLSLGSRPPVEVVSVEEGEEVEQ 180

Query: 180 KV----NIYMRSPIQPPLAIPTY--NKGTRTLLHN----GLPSGTDTCQSIGELPDTPSL 229
                 ++  R P+  PL +     N  TR  + N          +TCQ+ GELPDT +L
Sbjct: 181 IAGGSPSVQSRCPLTAPLGVSMSLRNGATRKSVSNVSMCSRSFNRETCQNNGELPDTRTL 240

Query: 230 TKRLEQKLEMEGFKISADAAALMNKALDTYLKRLIKPCLDLAASKAVNRSNGPIQPGLNE 289
             RLE++LEMEG KI+ D+ +L+N  LD +++RLI+PCL LA ++        +     +
Sbjct: 241 RSRLERRLEMEGLKITMDSVSLLNSGLDVFMRRLIEPCLSLANTRCGTDRVREMNYQYTQ 300

Query: 290 Q---IGSVSVSDFRTATELNPNILGKDWSLHLEKV 321
           Q   +  VS+SDFR   ELN  ILG+DW +H+EK+
Sbjct: 301 QSRRLSYVSMSDFRAGMELNTEILGEDWPMHMEKI 335


>AT4G31440.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G24530.1); Has 210 Blast hits to 209 proteins
           in 55 species: Archae - 0; Bacteria - 72; Metazoa - 2;
           Fungi - 6; Plants - 128; Viruses - 0; Other Eukaryotes -
           2 (source: NCBI BLink). | chr4:15253731-15254870 FORWARD
           LENGTH=379
          Length = 379

 Score =  152 bits (384), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 115/365 (31%), Positives = 178/365 (48%), Gaps = 48/365 (13%)

Query: 10  VDTLEQKLQIERKLGTVKACKYFNLLTRFLSVKISKHEFDRQCRATIGRENIHLHNHFIR 69
           +D  E K+ I +K+G  ++ +YF  L RFLS K++K EFD+ C   +GREN+ LHN  IR
Sbjct: 9   IDLAELKVHIVKKVGVERSTRYFYYLGRFLSQKLTKSEFDKSCFRLLGRENLSLHNKLIR 68

Query: 70  SILKKASLSKRGNII------GSSLNVKIPNGCNDLQFLCKDFLQSP--------RKVRT 115
           SIL+ ASL+K    +      G SL +   +G  + + L  D +++          KVR 
Sbjct: 69  SILRNASLAKSPPSVHQSGHPGKSLVLGKEDGPEESRSLNPDHIRNDLALSNGVLAKVRP 128

Query: 116 PSLRDRRFKDRPSPLGPNGKNVN-IGFEDSVREIHEQQSNKELDSAASRIPLCVEDGEEV 174
            +  DR  +D+P PLG NGK +    +    R   E+ S     +    +    +    +
Sbjct: 129 GTCDDRTIRDKPCPLGSNGKVLGPFAYSRPGRYPDERDSAFLCPAEQKAVSGKDQVAAPI 188

Query: 175 DQDSE-KVNIYMRSPIQPPLAIPTYNK---GTRTLLHNGLPSGTDTCQSIGELPDTPSLT 230
            +D E +V I    P+  PL IP  +    G R  +     +   +C   G L DT  L 
Sbjct: 189 SRDDEAQVRILSTPPVMAPLGIPFCSASVGGDRRTVPVSTSAAAISCYDSGGLSDTEMLR 248

Query: 231 KRLEQKLEMEGF-KISADAAALMNKALDTYLKRLIKPCLDLAASKAVN------------ 277
           KR+E     +G   +SA+ + ++N  LD YLK+L+K C+DLA ++++N            
Sbjct: 249 KRMENIAVTQGLGGVSAECSIVLNNMLDLYLKKLMKSCVDLAGARSMNGTPGKHSLEKQQ 308

Query: 278 ---------RSNGPI------QPG-LNEQIGSVSVSDFRTATELNPNILGKDWSLHLEKV 321
                    R+N         QP  +  +  SVS+ DFR A ELNP+ LG+DW L  E++
Sbjct: 309 SRDELVNGVRTNNSFHIQTSNQPSDITREQHSVSLLDFRVAMELNPHQLGEDWPLLRERI 368

Query: 322 TGSIL 326
           + S+ 
Sbjct: 369 SISLF 373


>AT5G67410.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G14850.1); Has 1807 Blast hits to 1807 proteins
           in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
           Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
           - 339 (source: NCBI BLink). | chr5:26896600-26897463
           REVERSE LENGTH=287
          Length = 287

 Score =  147 bits (372), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 122/329 (37%), Positives = 169/329 (51%), Gaps = 53/329 (16%)

Query: 1   MPAARY-FSPVDTLEQKLQIERKLGTVKACKYFNLLTRFLSVKISKHEFDRQCRATIGRE 59
           MP +++     D  E K QIE+++G  K   Y NLL++FLS+KISK +FD+    T+ RE
Sbjct: 1   MPTSQHHVVRTDISELKSQIEKRIGRAKTESYLNLLSKFLSLKISKSDFDKLIIVTVKRE 60

Query: 60  NIHLHNHFIRSILKKASLSK------RGNIIGSSLNVKIPNGCNDLQFLCKDFLQSPRKV 113
           NI LHN  +R ILK   LSK      +  +   +   K  NG    Q LCK+  +SPRK 
Sbjct: 61  NISLHNALLRGILKNICLSKTLPPFVKNGVESDNKKKKQLNGA--FQSLCKELPRSPRKG 118

Query: 114 RTPSLRDRRFKDRPSPLGPNGKNVNIGFEDSVREIHEQQSNKELDSAASRIPLCVEDGEE 173
           RT     RR            K+ NI    S+          E+ S++ R    +E+ EE
Sbjct: 119 RT----QRRL----------NKDGNISKGKSLVT--------EVVSSSGRQQWSMENVEE 156

Query: 174 VDQDSEKVNIYMRSPIQPPLAIPTYNKGTRTLLHNGLPSGTDTCQSIGELPDTPSLTKRL 233
           VDQ    +  +   PI+ P  +       R ++       T  C S GELPD+ SL K+L
Sbjct: 157 VDQ---LIPCWRSQPIEAPFGV-----NLRDVIKKQHRIDT-CCYSSGELPDSVSLKKKL 207

Query: 234 EQKLEMEGFKISADAAALMNKALDTYLKRLIKPCLDLAASKAVNRSNGPIQPGLNEQIGS 293
           E  LE EG ++S   A  +N  LD +LKRLIKPCL+LAAS++ N S+            +
Sbjct: 208 EDDLE-EGLEVSVGFANSLNAGLDVFLKRLIKPCLELAASRSSNASS------------A 254

Query: 294 VSVSDFRTATELNPNILGKDWSLHLEKVT 322
            S+ DF+ A  LNP+ILG+DW   LEK+ 
Sbjct: 255 SSLVDFQVAMALNPSILGEDWPTKLEKIA 283


>AT2G24530.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G31440.1); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr2:10422597-10423820 FORWARD LENGTH=407
          Length = 407

 Score =  140 bits (353), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 122/396 (30%), Positives = 183/396 (46%), Gaps = 91/396 (22%)

Query: 10  VDTLEQKLQIERKLGTVKACKYFNLLTRFLSVKISKHEFDRQCRATIGRENIHLHNHFIR 69
           +   E K  I +K G  ++ +YF  L RFLS K++K EFD+ C   +GREN+ LHN  IR
Sbjct: 9   ISLCELKEHIVKKTGVERSRRYFYYLGRFLSQKLTKSEFDKTCLRLLGRENLSLHNQLIR 68

Query: 70  SILKKASLSK-------------------RGNIIGSSLNVKIPNGCNDLQFLCKDFLQ-S 109
           SIL+ A+++K                   RG+ +  S  + IPN            L  S
Sbjct: 69  SILRNATVAKSPPPDHEAGHSTKANAFQSRGDGLEQSGTL-IPNHSQHEPVWSNGVLPIS 127

Query: 110 PRKVRTPSLRDRRFKDRPSPLGPNGK--------------NVNIGFED-----SVREIHE 150
           PRKVR+  +++R+ +DRPSPLG NGK                ++G E+     S R + +
Sbjct: 128 PRKVRS-GMQNRKSRDRPSPLGSNGKVEHMLHQPVCREDNRGSVGMENGDYQRSGRYVAD 186

Query: 151 QQSNKELDSAAS-RIP-----LCVEDGEEVDQDSE-KVNIYMRSPIQPPLAIP----TYN 199
           ++  + L      RIP       V   ++ +Q+ + +VN+ M SP+  PL IP    +  
Sbjct: 187 EKDGEFLRPVEKPRIPNKEKIAAVSMRDDQNQEEQARVNLSM-SPLIAPLGIPFCSASVG 245

Query: 200 KGTRTLLHNGLPSGTD----TCQSIGELPDTPSLTKRLEQKLEMEGFK-ISADAAALMNK 254
              RT     +P  T+    +C   G LPD   L KR+E     +G + +S + A  +N 
Sbjct: 246 GSPRT-----IPVSTNAELISCYDSGGLPDIEMLRKRMENIAVAQGLEGVSMECAKTLNN 300

Query: 255 ALDTYLKRLIKPCLDLAASKAVNRSNGPIQPG---------------------------- 286
            LD YLK+LI  C DL  +++ N   G  + G                            
Sbjct: 301 MLDVYLKKLINSCFDLVGARSTNGDPGKQRIGKQQSQNKIVNGVWPTNSLKIQTPNGSSD 360

Query: 287 LNEQIGSVSVSDFRTATELNPNILGKDWSLHLEKVT 322
           + +   SVS+ DFRTA ELNP  LG+DW    E+++
Sbjct: 361 IRQDHHSVSMLDFRTAMELNPRQLGEDWPTLRERIS 396