Miyakogusa Predicted Gene
- Lj0g3v0317559.2
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0317559.2 Non Characterized Hit- tr|I1PJV6|I1PJV6_ORYGL
Uncharacterized protein OS=Oryza glaberrima PE=4
SV=1,40.1,4e-19,SAGA-Tad1,Transcriptional coactivator SAGA-type
complex, Ada1/Tada1; SUBFAMILY NOT NAMED,NULL; FAMIL,CUFF.21554.2
(202 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                   Score     E
Sequences producing significant alignments:                       (bits)  Value
AT2G24530.1 | Symbols: | unknown protein; BEST Arabidopsis thal...   204   3e-53
AT4G31440.1 | Symbols: | unknown protein; BEST Arabidopsis thal...   186   1e-47
AT2G14850.1 | Symbols: | unknown protein; BEST Arabidopsis thal...    87   8e-18
AT4G33890.2 | Symbols: | unknown protein; BEST Arabidopsis thal...    84   7e-17
AT4G33890.1 | Symbols: | unknown protein; BEST Arabidopsis thal...    84   7e-17
AT5G67410.1 | Symbols: | unknown protein; BEST Arabidopsis thal...    55   3e-08
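
The hit summary above can be read programmatically. What follows is a minimal Python sketch, assuming every summary line ends with the bit score and the E-value as its last two whitespace-separated fields and starts with the subject identifier. `parse_hit_line` is a hypothetical helper written for this illustration, not part of BLAST.

```python
def parse_hit_line(line):
    """Split one summary line into (subject_id, bit_score, e_value).

    Assumption: the last two whitespace-separated fields are the bit score
    and the E-value, and the first field is the subject identifier.
    """
    rest, score, evalue = line.rsplit(None, 2)
    # Some BLAST versions print very small E-values as "e-100" with no
    # leading digit; prefix "1" so float() accepts the string.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return rest.split()[0], float(score), float(evalue)


# Two lines copied from the summary table of this report.
summary_lines = [
    "AT2G24530.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 204 3e-53",
    "AT5G67410.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 55 3e-08",
]

hits = [parse_hit_line(line) for line in summary_lines]
for subject_id, bits, evalue in hits:
    print(subject_id, bits, evalue)
```

Splitting from the right keeps the parser independent of how many spaces separate the columns, so it works on both tightly packed and column-aligned versions of the table.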
>AT2G24530.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G31440.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr2:10422597-10423820 FORWARD LENGTH=407
Length = 407
Score = 204 bits (519), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 102/196 (52%), Positives = 139/196 (70%), Gaps = 5/196 (2%)
Query: 7 VEDGEEVEQLNRLSFARSPLIAPLGIPHCSASVGGARKSLPVNSASDFVSFCDSGSLPDT 66
+ D + E+ R++ + SPLIAPLGIP CSASVGG+ +++PV++ ++ +S DSG LPD
Sbjct: 212 MRDDQNQEEQARVNLSMSPLIAPLGIPFCSASVGGSPRTIPVSTNAELISCYDSGGLPDI 271
Query: 67 DTLRRRMEQIATVQGLGGVSMECANMLNNVLDSYLKRLIRSCVDLVSARSANENEPTKLP 126
+ LR+RME IA QGL GVSMECA LNN+LD YLK+LI SC DLV ARS N +P K
Sbjct: 272 EMLRKRMENIAVAQGLEGVSMECAKTLNNMLDVYLKKLINSCFDLVGARSTN-GDPGKQR 330
Query: 127 VSKPQIQGKIMNGMWPNNQSHVQQSPGGPAEPELEHRPQISISLHDFKVAMELNPQQLGE 186
+ K Q Q KI+NG+WP N + Q+P G ++ +H S+S+ DF+ AMELNP+QLGE
Sbjct: 331 IGKQQSQNKIVNGVWPTNSLKI-QTPNGSSDIRQDHH---SVSMLDFRTAMELNPRQLGE 386
Query: 187 DWPLQLEKLSMQSLEK 202
DWP E++S++S E+
Sbjct: 387 DWPTLRERISLRSFEE 402
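
The percentages on the statistics line of this alignment appear to be the raw fractions truncated to whole numbers rather than rounded: 139/196 is about 70.9% but is reported as 70%. The short Python sketch below reproduces the figures under that assumption. `blast_percent` is an illustrative name, not a BLAST function.

```python
def blast_percent(count, alignment_length):
    """Percentage truncated to a whole number (integer division).

    Assumption: the report truncates rather than rounds, as the
    Positives value 139/196 (70%) in this alignment suggests.
    """
    return 100 * count // alignment_length


# Counts taken from the first alignment (AT2G24530.1), alignment length 196.
stats = {"Identities": (102, 196), "Positives": (139, 196), "Gaps": (5, 196)}
for label, (count, length) in stats.items():
    print(f"{label} = {count}/{length} ({blast_percent(count, length)}%)")
```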
>AT4G31440.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G24530.1); Has 210 Blast hits to 209 proteins
in 55 species: Archae - 0; Bacteria - 72; Metazoa - 2;
Fungi - 6; Plants - 128; Viruses - 0; Other Eukaryotes -
2 (source: NCBI BLink). | chr4:15253731-15254870 FORWARD
LENGTH=379
Length = 379
Score = 186 bits (471), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 95/178 (53%), Positives = 127/178 (71%), Gaps = 5/178 (2%)
Query: 25 PLIAPLGIPHCSASVGGARKSLPVNSASDFVSFCDSGSLPDTDTLRRRMEQIATVQGLGG 84
P++APLGIP CSASVGG R+++PV++++ +S DSG L DT+ LR+RME IA QGLGG
Sbjct: 203 PVMAPLGIPFCSASVGGDRRTVPVSTSAAAISCYDSGGLSDTEMLRKRMENIAVTQGLGG 262
Query: 85 VSMECANMLNNVLDSYLKRLIRSCVDLVSARSANENEPTKLPVSKPQIQGKIMNGMWPNN 144
VS EC+ +LNN+LD YLK+L++SCVDL ARS N P K + K Q + +++NG+ NN
Sbjct: 263 VSAECSIVLNNMLDLYLKKLMKSCVDLAGARSMN-GTPGKHSLEKQQSRDELVNGVRTNN 321
Query: 145 QSHVQQSPGGPAEPELEHRPQISISLHDFKVAMELNPQQLGEDWPLQLEKLSMQSLEK 202
H+Q S +P R Q S+SL DF+VAMELNP QLGEDWPL E++S+ E+
Sbjct: 322 SFHIQTS----NQPSDITREQHSVSLLDFRVAMELNPHQLGEDWPLLRERISISLFEE 375
>AT2G14850.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G33890.2); Has 140 Blast hits to 132 proteins
in 17 species: Archae - 0; Bacteria - 0; Metazoa - 1;
Fungi - 2; Plants - 133; Viruses - 0; Other Eukaryotes -
4 (source: NCBI BLink). | chr2:6386400-6387275 FORWARD
LENGTH=291
Length = 291
Score = 87.0 bits (214), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 65/206 (31%), Positives = 97/206 (47%), Gaps = 69/206 (33%)
Query: 7 VEDGEEVEQLNRLSF--ARSPLIAPLGIPHCSASVGGARKSLPVNSASDFVSF------- 57
VEDGEEVEQ+ +RSPL APLG+ S + S + F ++
Sbjct: 145 VEDGEEVEQMTGSPSVQSRSPLTAPLGV------------SFHLKSKARFSTYNGINRET 192
Query: 58 C-DSGSLPDTDTLRRRMEQIATVQGLGGVSMECANMLNNVLDSYLKRLIRSCVDLVSARS 116
C SG LPD TLR R+E+ ++G+ +SM+ AN+LN L++Y++RLI C+ L S
Sbjct: 193 CQSSGELPDMITLRARLEKKLEMEGIK-LSMDSANLLNRGLNAYMRRLIEPCLSLASQ-- 249
Query: 117 ANENEPTKLPVSKPQIQGKIMNGMWPNNQSHVQQSPGGPAEPELEHRPQISISLHDFKVA 176
+ R ++S+ DF A
Sbjct: 250 --------------------------------------------QKRAVSNVSMLDFHAA 265
Query: 177 MELNPQQLGEDWPLQLEKLSMQSLEK 202
ME+NP+ LGE+WP+QLEK+ ++ E+
Sbjct: 266 MEVNPRVLGEEWPIQLEKICCRASEE 291
>AT4G33890.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G14850.1); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr4:16250057-16251085 FORWARD LENGTH=342
Length = 342
Score = 84.0 bits (206), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 92/191 (48%), Gaps = 48/191 (25%)
Query: 22 ARSPLIAPLGIPHCSASVGGARKSLPVNSASDFVSFC----------DSGSLPDTDTLRR 71
+R PL APLG+ S G RKS+ VS C ++G LPDT TLR
Sbjct: 190 SRCPLTAPLGVSM-SLRNGATRKSV------SNVSMCSRSFNRETCQNNGELPDTRTLRS 242
Query: 72 RMEQIATVQGLGGVSMECANMLNNVLDSYLKRLIRSCVDLVSARSANENEPTKLPVSKPQ 131
R+E+ ++GL ++M+ ++LN+ LD +++RLI C+ L + R +
Sbjct: 243 RLERRLEMEGLK-ITMDSVSLLNSGLDVFMRRLIEPCLSLANTRCGTDR----------- 290
Query: 132 IQGKIMNGMWPNNQSHVQQSPGGPAEPELEHRPQISISLHDFKVAMELNPQQLGEDWPLQ 191
+ N + QQS R +S+ DF+ MELN + LGEDWP+
Sbjct: 291 --------VREMNYQYTQQS-----------RRLSYVSMSDFRAGMELNTEILGEDWPMH 331
Query: 192 LEKLSMQSLEK 202
+EK+ ++ +K
Sbjct: 332 MEKICSRASDK 342
>AT4G33890.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G14850.1); Has 133 Blast hits to 131 proteins
in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 2; Plants - 129; Viruses - 0; Other Eukaryotes -
2 (source: NCBI BLink). | chr4:16250057-16251085 FORWARD
LENGTH=342
Length = 342
Score = 84.0 bits (206), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 92/191 (48%), Gaps = 48/191 (25%)
Query: 22 ARSPLIAPLGIPHCSASVGGARKSLPVNSASDFVSFC----------DSGSLPDTDTLRR 71
+R PL APLG+ S G RKS+ VS C ++G LPDT TLR
Sbjct: 190 SRCPLTAPLGVSM-SLRNGATRKSV------SNVSMCSRSFNRETCQNNGELPDTRTLRS 242
Query: 72 RMEQIATVQGLGGVSMECANMLNNVLDSYLKRLIRSCVDLVSARSANENEPTKLPVSKPQ 131
R+E+ ++GL ++M+ ++LN+ LD +++RLI C+ L + R +
Sbjct: 243 RLERRLEMEGLK-ITMDSVSLLNSGLDVFMRRLIEPCLSLANTRCGTDR----------- 290
Query: 132 IQGKIMNGMWPNNQSHVQQSPGGPAEPELEHRPQISISLHDFKVAMELNPQQLGEDWPLQ 191
+ N + QQS R +S+ DF+ MELN + LGEDWP+
Sbjct: 291 --------VREMNYQYTQQS-----------RRLSYVSMSDFRAGMELNTEILGEDWPMH 331
Query: 192 LEKLSMQSLEK 202
+EK+ ++ +K
Sbjct: 332 MEKICSRASDK 342
>AT5G67410.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G14850.1); Has 1807 Blast hits to 1807 proteins
in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
- 339 (source: NCBI BLink). | chr5:26896600-26897463
REVERSE LENGTH=287
Length = 287
Score = 55.5 bits (132), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 88/191 (46%), Gaps = 59/191 (30%)
Query: 7 VEDGEEVEQLNRLSFARS-PLIAPLGIPHCSASVGGARKSLPVNSASDFVSFCDSGSLPD 65
+E+ EEV+QL + RS P+ AP G+ R + D + SG LPD
Sbjct: 151 MENVEEVDQL--IPCWRSQPIEAPFGV--------NLRDVIKKQHRIDTCCY-SSGELPD 199
Query: 66 TDTLRRRMEQIATVQGLGGVSMECANMLNNVLDSYLKRLIRSCVDLVSARSANENEPTKL 125
+ +L++++E +GL VS+ AN LN LD +LKRLI+ C++L ++RS+N + + L
Sbjct: 200 SVSLKKKLED-DLEEGLE-VSVGFANSLNAGLDVFLKRLIKPCLELAASRSSNASSASSL 257
Query: 126 PVSKPQIQGKIMNGMWPNNQSHVQQSPGGPAEPELEHRPQISISLHDFKVAMELNPQQLG 185
DF+VAM LNP LG
Sbjct: 258 V---------------------------------------------DFQVAMALNPSILG 272
Query: 186 EDWPLQLEKLS 196
EDWP +LEK++
Sbjct: 273 EDWPTKLEKIA 283
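
The relation between the bit scores and the Expect values in this report can be approximated with the standard formula E = m * n * 2^(-S'), where S' is the bit score, m the query length and n the total database length. The Python sketch below plugs in the raw lengths printed in the header (202 letters, 14,482,855 total letters). This is an assumption for illustration: BLAST itself uses effective lengths that are shorter than the raw ones, and the printed bit score is rounded, so the result should agree with the reported Expect only to within roughly an order of magnitude. `approx_evalue` is a hypothetical helper name.

```python
def approx_evalue(bit_score, query_len, db_len):
    """Approximate E-value from a bit score: E = m * n * 2**(-S').

    Assumption: raw query and database lengths are used here, whereas
    BLAST applies an effective-length correction before computing E.
    """
    return query_len * db_len * 2.0 ** (-bit_score)


# Top hit AT2G24530.1: Score = 204 bits, reported Expect = 3e-53.
estimate = approx_evalue(204, 202, 14_482_855)
print(f"approximate E-value: {estimate:.1e}")
```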