Miyakogusa Predicted Gene

Lj0g3v0317559.2
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0317559.2 Non Chatacterized Hit- tr|I1PJV6|I1PJV6_ORYGL
Uncharacterized protein OS=Oryza glaberrima PE=4
SV=1,40.1,4e-19,SAGA-Tad1,Transcriptional coactivator SAGA-type
complex, Ada1/Tada1; SUBFAMILY NOT NAMED,NULL; FAMIL,CUFF.21554.2
         (202 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT2G24530.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   204   3e-53
AT4G31440.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   186   1e-47
AT2G14850.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    87   8e-18
AT4G33890.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...    84   7e-17
AT4G33890.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    84   7e-17
AT5G67410.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    55   3e-08

>AT2G24530.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G31440.1); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr2:10422597-10423820 FORWARD LENGTH=407
          Length = 407

 Score =  204 bits (519), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 102/196 (52%), Positives = 139/196 (70%), Gaps = 5/196 (2%)

Query: 7   VEDGEEVEQLNRLSFARSPLIAPLGIPHCSASVGGARKSLPVNSASDFVSFCDSGSLPDT 66
           + D +  E+  R++ + SPLIAPLGIP CSASVGG+ +++PV++ ++ +S  DSG LPD 
Sbjct: 212 MRDDQNQEEQARVNLSMSPLIAPLGIPFCSASVGGSPRTIPVSTNAELISCYDSGGLPDI 271

Query: 67  DTLRRRMEQIATVQGLGGVSMECANMLNNVLDSYLKRLIRSCVDLVSARSANENEPTKLP 126
           + LR+RME IA  QGL GVSMECA  LNN+LD YLK+LI SC DLV ARS N  +P K  
Sbjct: 272 EMLRKRMENIAVAQGLEGVSMECAKTLNNMLDVYLKKLINSCFDLVGARSTN-GDPGKQR 330

Query: 127 VSKPQIQGKIMNGMWPNNQSHVQQSPGGPAEPELEHRPQISISLHDFKVAMELNPQQLGE 186
           + K Q Q KI+NG+WP N   + Q+P G ++   +H    S+S+ DF+ AMELNP+QLGE
Sbjct: 331 IGKQQSQNKIVNGVWPTNSLKI-QTPNGSSDIRQDHH---SVSMLDFRTAMELNPRQLGE 386

Query: 187 DWPLQLEKLSMQSLEK 202
           DWP   E++S++S E+
Sbjct: 387 DWPTLRERISLRSFEE 402


>AT4G31440.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G24530.1); Has 210 Blast hits to 209 proteins
           in 55 species: Archae - 0; Bacteria - 72; Metazoa - 2;
           Fungi - 6; Plants - 128; Viruses - 0; Other Eukaryotes -
           2 (source: NCBI BLink). | chr4:15253731-15254870 FORWARD
           LENGTH=379
          Length = 379

 Score =  186 bits (471), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 95/178 (53%), Positives = 127/178 (71%), Gaps = 5/178 (2%)

Query: 25  PLIAPLGIPHCSASVGGARKSLPVNSASDFVSFCDSGSLPDTDTLRRRMEQIATVQGLGG 84
           P++APLGIP CSASVGG R+++PV++++  +S  DSG L DT+ LR+RME IA  QGLGG
Sbjct: 203 PVMAPLGIPFCSASVGGDRRTVPVSTSAAAISCYDSGGLSDTEMLRKRMENIAVTQGLGG 262

Query: 85  VSMECANMLNNVLDSYLKRLIRSCVDLVSARSANENEPTKLPVSKPQIQGKIMNGMWPNN 144
           VS EC+ +LNN+LD YLK+L++SCVDL  ARS N   P K  + K Q + +++NG+  NN
Sbjct: 263 VSAECSIVLNNMLDLYLKKLMKSCVDLAGARSMN-GTPGKHSLEKQQSRDELVNGVRTNN 321

Query: 145 QSHVQQSPGGPAEPELEHRPQISISLHDFKVAMELNPQQLGEDWPLQLEKLSMQSLEK 202
             H+Q S     +P    R Q S+SL DF+VAMELNP QLGEDWPL  E++S+   E+
Sbjct: 322 SFHIQTS----NQPSDITREQHSVSLLDFRVAMELNPHQLGEDWPLLRERISISLFEE 375


>AT2G14850.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G33890.2); Has 140 Blast hits to 132 proteins
           in 17 species: Archae - 0; Bacteria - 0; Metazoa - 1;
           Fungi - 2; Plants - 133; Viruses - 0; Other Eukaryotes -
           4 (source: NCBI BLink). | chr2:6386400-6387275 FORWARD
           LENGTH=291
          Length = 291

 Score = 87.0 bits (214), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 65/206 (31%), Positives = 97/206 (47%), Gaps = 69/206 (33%)

Query: 7   VEDGEEVEQLNRLSF--ARSPLIAPLGIPHCSASVGGARKSLPVNSASDFVSF------- 57
           VEDGEEVEQ+       +RSPL APLG+            S  + S + F ++       
Sbjct: 145 VEDGEEVEQMTGSPSVQSRSPLTAPLGV------------SFHLKSKARFSTYNGINRET 192

Query: 58  C-DSGSLPDTDTLRRRMEQIATVQGLGGVSMECANMLNNVLDSYLKRLIRSCVDLVSARS 116
           C  SG LPD  TLR R+E+   ++G+  +SM+ AN+LN  L++Y++RLI  C+ L S   
Sbjct: 193 CQSSGELPDMITLRARLEKKLEMEGIK-LSMDSANLLNRGLNAYMRRLIEPCLSLASQ-- 249

Query: 117 ANENEPTKLPVSKPQIQGKIMNGMWPNNQSHVQQSPGGPAEPELEHRPQISISLHDFKVA 176
                                                       + R   ++S+ DF  A
Sbjct: 250 --------------------------------------------QKRAVSNVSMLDFHAA 265

Query: 177 MELNPQQLGEDWPLQLEKLSMQSLEK 202
           ME+NP+ LGE+WP+QLEK+  ++ E+
Sbjct: 266 MEVNPRVLGEEWPIQLEKICCRASEE 291


>AT4G33890.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G14850.1); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr4:16250057-16251085 FORWARD LENGTH=342
          Length = 342

 Score = 84.0 bits (206), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 59/191 (30%), Positives = 92/191 (48%), Gaps = 48/191 (25%)

Query: 22  ARSPLIAPLGIPHCSASVGGARKSLPVNSASDFVSFC----------DSGSLPDTDTLRR 71
           +R PL APLG+   S   G  RKS+        VS C          ++G LPDT TLR 
Sbjct: 190 SRCPLTAPLGVSM-SLRNGATRKSV------SNVSMCSRSFNRETCQNNGELPDTRTLRS 242

Query: 72  RMEQIATVQGLGGVSMECANMLNNVLDSYLKRLIRSCVDLVSARSANENEPTKLPVSKPQ 131
           R+E+   ++GL  ++M+  ++LN+ LD +++RLI  C+ L + R   +            
Sbjct: 243 RLERRLEMEGLK-ITMDSVSLLNSGLDVFMRRLIEPCLSLANTRCGTDR----------- 290

Query: 132 IQGKIMNGMWPNNQSHVQQSPGGPAEPELEHRPQISISLHDFKVAMELNPQQLGEDWPLQ 191
                   +   N  + QQS           R    +S+ DF+  MELN + LGEDWP+ 
Sbjct: 291 --------VREMNYQYTQQS-----------RRLSYVSMSDFRAGMELNTEILGEDWPMH 331

Query: 192 LEKLSMQSLEK 202
           +EK+  ++ +K
Sbjct: 332 MEKICSRASDK 342


>AT4G33890.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G14850.1); Has 133 Blast hits to 131 proteins
           in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 2; Plants - 129; Viruses - 0; Other Eukaryotes -
           2 (source: NCBI BLink). | chr4:16250057-16251085 FORWARD
           LENGTH=342
          Length = 342

 Score = 84.0 bits (206), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 59/191 (30%), Positives = 92/191 (48%), Gaps = 48/191 (25%)

Query: 22  ARSPLIAPLGIPHCSASVGGARKSLPVNSASDFVSFC----------DSGSLPDTDTLRR 71
           +R PL APLG+   S   G  RKS+        VS C          ++G LPDT TLR 
Sbjct: 190 SRCPLTAPLGVSM-SLRNGATRKSV------SNVSMCSRSFNRETCQNNGELPDTRTLRS 242

Query: 72  RMEQIATVQGLGGVSMECANMLNNVLDSYLKRLIRSCVDLVSARSANENEPTKLPVSKPQ 131
           R+E+   ++GL  ++M+  ++LN+ LD +++RLI  C+ L + R   +            
Sbjct: 243 RLERRLEMEGLK-ITMDSVSLLNSGLDVFMRRLIEPCLSLANTRCGTDR----------- 290

Query: 132 IQGKIMNGMWPNNQSHVQQSPGGPAEPELEHRPQISISLHDFKVAMELNPQQLGEDWPLQ 191
                   +   N  + QQS           R    +S+ DF+  MELN + LGEDWP+ 
Sbjct: 291 --------VREMNYQYTQQS-----------RRLSYVSMSDFRAGMELNTEILGEDWPMH 331

Query: 192 LEKLSMQSLEK 202
           +EK+  ++ +K
Sbjct: 332 MEKICSRASDK 342


>AT5G67410.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G14850.1); Has 1807 Blast hits to 1807 proteins
           in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
           Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
           - 339 (source: NCBI BLink). | chr5:26896600-26897463
           REVERSE LENGTH=287
          Length = 287

 Score = 55.5 bits (132), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 59/191 (30%), Positives = 88/191 (46%), Gaps = 59/191 (30%)

Query: 7   VEDGEEVEQLNRLSFARS-PLIAPLGIPHCSASVGGARKSLPVNSASDFVSFCDSGSLPD 65
           +E+ EEV+QL  +   RS P+ AP G+          R  +      D   +  SG LPD
Sbjct: 151 MENVEEVDQL--IPCWRSQPIEAPFGV--------NLRDVIKKQHRIDTCCY-SSGELPD 199

Query: 66  TDTLRRRMEQIATVQGLGGVSMECANMLNNVLDSYLKRLIRSCVDLVSARSANENEPTKL 125
           + +L++++E     +GL  VS+  AN LN  LD +LKRLI+ C++L ++RS+N +  + L
Sbjct: 200 SVSLKKKLED-DLEEGLE-VSVGFANSLNAGLDVFLKRLIKPCLELAASRSSNASSASSL 257

Query: 126 PVSKPQIQGKIMNGMWPNNQSHVQQSPGGPAEPELEHRPQISISLHDFKVAMELNPQQLG 185
                                                         DF+VAM LNP  LG
Sbjct: 258 V---------------------------------------------DFQVAMALNPSILG 272

Query: 186 EDWPLQLEKLS 196
           EDWP +LEK++
Sbjct: 273 EDWPTKLEKIA 283