Miyakogusa Predicted Gene
- Lj4g3v0768700.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v0768700.1 tr|Q8S3Q2|Q8S3Q2_ORYSJ OSJNBa0011F23.5 protein
OS=Oryza sativa subsp. japonica GN=24K23.4 PE=4
SV=1,28.04,7e-19,SAGA-Tad1,Transcriptional coactivator SAGA-type
complex, Ada1/Tada1; SUBFAMILY NOT NAMED,NULL; FAMIL,CUFF.48018.1
(307 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT2G14850.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 195 3e-50
AT4G33890.2 | Symbols: | unknown protein; BEST Arabidopsis thal... 177 8e-45
AT4G33890.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 177 8e-45
AT4G31440.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 136 2e-32
AT5G67410.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 129 2e-30
AT2G24530.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 129 2e-30
>AT2G14850.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G33890.2); Has 140 Blast hits to 132 proteins
in 17 species: Archae - 0; Bacteria - 0; Metazoa - 1;
Fungi - 2; Plants - 133; Viruses - 0; Other Eukaryotes -
4 (source: NCBI BLink). | chr2:6386400-6387275 FORWARD
LENGTH=291
Length = 291
Score = 195 bits (495), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 126/318 (39%), Positives = 179/318 (56%), Gaps = 59/318 (18%)
Query: 1 MPAARYFSPVDTLEQKLQIERKLGTVKACKYFNLLTRFLSVKISKHEFDRQCRATIGREN 60
M + + FS +++LE K I +K+G +A YF+ L +FL+ +ISK EFD+ C T+GREN
Sbjct: 1 MGSDQCFSRLNSLEIKALIYQKIGHQRADTYFDQLGKFLTSRISKSEFDKLCSKTVGREN 60
Query: 61 IHLHNHFIRSILKKASLSK-------RGNIIGSSLNVKIPNGCNDLQFLCKDFLQSPRKV 113
I LHN +RSILK AS++K + ++ G + F SPRK
Sbjct: 61 ISLHNRLVRSILKNASVAKSPPPRYPKKSLYGDPV-----------------FPPSPRKC 103
Query: 114 RTPSLRDRRFKDRPSPLGPNGKNVNIGFEDSVREIHEQQSNKELDSAASRIPL---CVED 170
R+ R+F+DRPSPLGP GK ++ +N E S A R+P+ VED
Sbjct: 104 RS-----RKFRDRPSPLGPLGKPQSL-----------TTTNDESMSKAQRLPMEVVSVED 147
Query: 171 GEEVDQDSEKVNIYMRSPIQPPLAIPTYNKG-TRTLLHNGLPSGTDTCQSIGELPDTPSL 229
GEEV+Q + ++ RSP+ PL + + K R +NG+ +TCQS GELPD +L
Sbjct: 148 GEEVEQMTGSPSVQSRSPLTAPLGVSFHLKSKARFSTYNGI--NRETCQSSGELPDMITL 205
Query: 230 TKRLEQKLEMEGFKISADAAALMNKALDTYLKRLIKPCLDLAASKAVNRSNGPIQPGLNE 289
RLE+KLEMEG K+S D+A L+N+ L+ Y++RLI+PCL LA+ + SN
Sbjct: 206 RARLEKKLEMEGIKLSMDSANLLNRGLNAYMRRLIEPCLSLASQQKRAVSN--------- 256
Query: 290 QIGSVSVSDFRTATELNP 307
VS+ DF A E+NP
Sbjct: 257 ----VSMLDFHAAMEVNP 270
>AT4G33890.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G14850.1); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr4:16250057-16251085 FORWARD LENGTH=342
Length = 342
Score = 177 bits (449), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 128/320 (40%), Positives = 178/320 (55%), Gaps = 28/320 (8%)
Query: 8 SPVDTLEQKLQIERKLGTVKACKYFNLLTRFLSVKISKHEFDRQCRATIGRENIHLHNHF 67
S +DTLE K I R++G +A YFN L RF ++KI+K EFD+ C TIGR+NIHLHN
Sbjct: 8 SRLDTLEIKALIYREIGNQRAESYFNQLGRFFALKITKSEFDKLCIKTIGRQNIHLHNRL 67
Query: 68 IRSILKKASLSKRGNII--GSSLNVKIPNG----CNDLQFLCKDFLQSPRKVRTPSLRDR 121
IRSI+K A ++K I G S V+ NG + +Q L D SP T R R
Sbjct: 68 IRSIIKNACIAKSPPFIKKGGSF-VRFGNGDSKKNSQIQPLHGDSAFSP---STRKCRSR 123
Query: 122 RFKDRPSPLGPNGK--NVNIGFEDSVREIHEQQSNKELDSAASRIPLCVEDGEEVDQDSE 179
+ +DRPSPLGP GK ++ E+S + + QS EL S SR P+ V EE ++ +
Sbjct: 124 KLRDRPSPLGPLGKPHSLTTTNEES---MSKAQSATELLSLGSRPPVEVVSVEEGEEVEQ 180
Query: 180 KV----NIYMRSPIQPPLAIPTY--NKGTRTLLHN----GLPSGTDTCQSIGELPDTPSL 229
++ R P+ PL + N TR + N +TCQ+ GELPDT +L
Sbjct: 181 IAGGSPSVQSRCPLTAPLGVSMSLRNGATRKSVSNVSMCSRSFNRETCQNNGELPDTRTL 240
Query: 230 TKRLEQKLEMEGFKISADAAALMNKALDTYLKRLIKPCLDLAASKAVNRSNGPIQPGLNE 289
RLE++LEMEG KI+ D+ +L+N LD +++RLI+PCL LA ++ + +
Sbjct: 241 RSRLERRLEMEGLKITMDSVSLLNSGLDVFMRRLIEPCLSLANTRCGTDRVREMNYQYTQ 300
Query: 290 Q---IGSVSVSDFRTATELN 306
Q + VS+SDFR ELN
Sbjct: 301 QSRRLSYVSMSDFRAGMELN 320
>AT4G33890.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G14850.1); Has 133 Blast hits to 131 proteins
in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 2; Plants - 129; Viruses - 0; Other Eukaryotes -
2 (source: NCBI BLink). | chr4:16250057-16251085 FORWARD
LENGTH=342
Length = 342
Score = 177 bits (449), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 128/320 (40%), Positives = 178/320 (55%), Gaps = 28/320 (8%)
Query: 8 SPVDTLEQKLQIERKLGTVKACKYFNLLTRFLSVKISKHEFDRQCRATIGRENIHLHNHF 67
S +DTLE K I R++G +A YFN L RF ++KI+K EFD+ C TIGR+NIHLHN
Sbjct: 8 SRLDTLEIKALIYREIGNQRAESYFNQLGRFFALKITKSEFDKLCIKTIGRQNIHLHNRL 67
Query: 68 IRSILKKASLSKRGNII--GSSLNVKIPNG----CNDLQFLCKDFLQSPRKVRTPSLRDR 121
IRSI+K A ++K I G S V+ NG + +Q L D SP T R R
Sbjct: 68 IRSIIKNACIAKSPPFIKKGGSF-VRFGNGDSKKNSQIQPLHGDSAFSP---STRKCRSR 123
Query: 122 RFKDRPSPLGPNGK--NVNIGFEDSVREIHEQQSNKELDSAASRIPLCVEDGEEVDQDSE 179
+ +DRPSPLGP GK ++ E+S + + QS EL S SR P+ V EE ++ +
Sbjct: 124 KLRDRPSPLGPLGKPHSLTTTNEES---MSKAQSATELLSLGSRPPVEVVSVEEGEEVEQ 180
Query: 180 KV----NIYMRSPIQPPLAIPTY--NKGTRTLLHN----GLPSGTDTCQSIGELPDTPSL 229
++ R P+ PL + N TR + N +TCQ+ GELPDT +L
Sbjct: 181 IAGGSPSVQSRCPLTAPLGVSMSLRNGATRKSVSNVSMCSRSFNRETCQNNGELPDTRTL 240
Query: 230 TKRLEQKLEMEGFKISADAAALMNKALDTYLKRLIKPCLDLAASKAVNRSNGPIQPGLNE 289
RLE++LEMEG KI+ D+ +L+N LD +++RLI+PCL LA ++ + +
Sbjct: 241 RSRLERRLEMEGLKITMDSVSLLNSGLDVFMRRLIEPCLSLANTRCGTDRVREMNYQYTQ 300
Query: 290 Q---IGSVSVSDFRTATELN 306
Q + VS+SDFR ELN
Sbjct: 301 QSRRLSYVSMSDFRAGMELN 320
>AT4G31440.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G24530.1); Has 210 Blast hits to 209 proteins
in 55 species: Archae - 0; Bacteria - 72; Metazoa - 2;
Fungi - 6; Plants - 128; Viruses - 0; Other Eukaryotes -
2 (source: NCBI BLink). | chr4:15253731-15254870 FORWARD
LENGTH=379
Length = 379
Score = 136 bits (343), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 108/346 (31%), Positives = 165/346 (47%), Gaps = 48/346 (13%)
Query: 10 VDTLEQKLQIERKLGTVKACKYFNLLTRFLSVKISKHEFDRQCRATIGRENIHLHNHFIR 69
+D E K+ I +K+G ++ +YF L RFLS K++K EFD+ C +GREN+ LHN IR
Sbjct: 9 IDLAELKVHIVKKVGVERSTRYFYYLGRFLSQKLTKSEFDKSCFRLLGRENLSLHNKLIR 68
Query: 70 SILKKASLSKRGNII------GSSLNVKIPNGCNDLQFLCKDFLQSP--------RKVRT 115
SIL+ ASL+K + G SL + +G + + L D +++ KVR
Sbjct: 69 SILRNASLAKSPPSVHQSGHPGKSLVLGKEDGPEESRSLNPDHIRNDLALSNGVLAKVRP 128
Query: 116 PSLRDRRFKDRPSPLGPNGKNVN-IGFEDSVREIHEQQSNKELDSAASRIPLCVEDGEEV 174
+ DR +D+P PLG NGK + + R E+ S + + + +
Sbjct: 129 GTCDDRTIRDKPCPLGSNGKVLGPFAYSRPGRYPDERDSAFLCPAEQKAVSGKDQVAAPI 188
Query: 175 DQDSE-KVNIYMRSPIQPPLAIPTYNK---GTRTLLHNGLPSGTDTCQSIGELPDTPSLT 230
+D E +V I P+ PL IP + G R + + +C G L DT L
Sbjct: 189 SRDDEAQVRILSTPPVMAPLGIPFCSASVGGDRRTVPVSTSAAAISCYDSGGLSDTEMLR 248
Query: 231 KRLEQKLEMEGF-KISADAAALMNKALDTYLKRLIKPCLDLAASKAVN------------ 277
KR+E +G +SA+ + ++N LD YLK+L+K C+DLA ++++N
Sbjct: 249 KRMENIAVTQGLGGVSAECSIVLNNMLDLYLKKLMKSCVDLAGARSMNGTPGKHSLEKQQ 308
Query: 278 ---------RSNGPI------QPG-LNEQIGSVSVSDFRTATELNP 307
R+N QP + + SVS+ DFR A ELNP
Sbjct: 309 SRDELVNGVRTNNSFHIQTSNQPSDITREQHSVSLLDFRVAMELNP 354
>AT5G67410.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G14850.1); Has 1807 Blast hits to 1807 proteins
in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
- 339 (source: NCBI BLink). | chr5:26896600-26897463
REVERSE LENGTH=287
Length = 287
Score = 129 bits (325), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 114/314 (36%), Positives = 158/314 (50%), Gaps = 53/314 (16%)
Query: 1 MPAARY-FSPVDTLEQKLQIERKLGTVKACKYFNLLTRFLSVKISKHEFDRQCRATIGRE 59
MP +++ D E K QIE+++G K Y NLL++FLS+KISK +FD+ T+ RE
Sbjct: 1 MPTSQHHVVRTDISELKSQIEKRIGRAKTESYLNLLSKFLSLKISKSDFDKLIIVTVKRE 60
Query: 60 NIHLHNHFIRSILKKASLSK------RGNIIGSSLNVKIPNGCNDLQFLCKDFLQSPRKV 113
NI LHN +R ILK LSK + + + K NG Q LCK+ +SPRK
Sbjct: 61 NISLHNALLRGILKNICLSKTLPPFVKNGVESDNKKKKQLNGA--FQSLCKELPRSPRKG 118
Query: 114 RTPSLRDRRFKDRPSPLGPNGKNVNIGFEDSVREIHEQQSNKELDSAASRIPLCVEDGEE 173
RT RR K+ NI S+ E+ S++ R +E+ EE
Sbjct: 119 RT----QRRL----------NKDGNISKGKSLVT--------EVVSSSGRQQWSMENVEE 156
Query: 174 VDQDSEKVNIYMRSPIQPPLAIPTYNKGTRTLLHNGLPSGTDTCQSIGELPDTPSLTKRL 233
VDQ + + PI+ P + R ++ T C S GELPD+ SL K+L
Sbjct: 157 VDQ---LIPCWRSQPIEAPFGV-----NLRDVIKKQHRIDT-CCYSSGELPDSVSLKKKL 207
Query: 234 EQKLEMEGFKISADAAALMNKALDTYLKRLIKPCLDLAASKAVNRSNGPIQPGLNEQIGS 293
E LE EG ++S A +N LD +LKRLIKPCL+LAAS++ N S+ +
Sbjct: 208 EDDLE-EGLEVSVGFANSLNAGLDVFLKRLIKPCLELAASRSSNASS------------A 254
Query: 294 VSVSDFRTATELNP 307
S+ DF+ A LNP
Sbjct: 255 SSLVDFQVAMALNP 268
>AT2G24530.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G31440.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr2:10422597-10423820 FORWARD LENGTH=407
Length = 407
Score = 129 bits (324), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 116/382 (30%), Positives = 174/382 (45%), Gaps = 93/382 (24%)
Query: 10 VDTLEQKLQIERKLGTVKACKYFNLLTRFLSVKISKHEFDRQCRATIGRENIHLHNHFIR 69
+ E K I +K G ++ +YF L RFLS K++K EFD+ C +GREN+ LHN IR
Sbjct: 9 ISLCELKEHIVKKTGVERSRRYFYYLGRFLSQKLTKSEFDKTCLRLLGRENLSLHNQLIR 68
Query: 70 SILKKASLSK-------------------RGNIIGSSLNVKIPNGCNDLQFLCKDFLQ-S 109
SIL+ A+++K RG+ + S + IPN L S
Sbjct: 69 SILRNATVAKSPPPDHEAGHSTKANAFQSRGDGLEQSGTL-IPNHSQHEPVWSNGVLPIS 127
Query: 110 PRKVRTPSLRDRRFKDRPSPLGPNGK--------------NVNIGFED-----SVREIHE 150
PRKVR+ +++R+ +DRPSPLG NGK ++G E+ S R + +
Sbjct: 128 PRKVRS-GMQNRKSRDRPSPLGSNGKVEHMLHQPVCREDNRGSVGMENGDYQRSGRYVAD 186
Query: 151 QQSNKELDSAAS-RIP-------LCVEDGEEVDQDSEKVNIYMRSPIQPPLAIP----TY 198
++ + L RIP + + D ++ ++ +VN+ M SP+ PL IP +
Sbjct: 187 EKDGEFLRPVEKPRIPNKEKIAAVSMRD-DQNQEEQARVNLSM-SPLIAPLGIPFCSASV 244
Query: 199 NKGTRTLLHNGLPSGTD----TCQSIGELPDTPSLTKRLEQKLEMEGFK-ISADAAALMN 253
RT +P T+ +C G LPD L KR+E +G + +S + A +N
Sbjct: 245 GGSPRT-----IPVSTNAELISCYDSGGLPDIEMLRKRMENIAVAQGLEGVSMECAKTLN 299
Query: 254 KALDTYLKRLIKPCLDLAASKAVNRSNGPIQPG--------------------------- 286
LD YLK+LI C DL +++ N G + G
Sbjct: 300 NMLDVYLKKLINSCFDLVGARSTNGDPGKQRIGKQQSQNKIVNGVWPTNSLKIQTPNGSS 359
Query: 287 -LNEQIGSVSVSDFRTATELNP 307
+ + SVS+ DFRTA ELNP
Sbjct: 360 DIRQDHHSVSMLDFRTAMELNP 381