Miyakogusa Predicted Gene
- Lj1g3v2938400.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v2938400.1 Non Chatacterized Hit- tr|B8ARW8|B8ARW8_ORYSI
Putative uncharacterized protein OS=Oryza sativa
subsp,30.72,0.000000000005,SAGA-Tad1,Transcriptional coactivator
SAGA-type complex, Ada1/Tada1; SUBFAMILY NOT NAMED,NULL;
FAMIL,NODE_70194_length_1171_cov_35.951324.path1.1
(327 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT2G14850.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 213 1e-55
AT4G33890.2 | Symbols: | unknown protein; BEST Arabidopsis thal... 200 1e-51
AT4G33890.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 200 1e-51
AT4G31440.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 152 3e-37
AT5G67410.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 147 7e-36
AT2G24530.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 140 1e-33
>AT2G14850.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G33890.2); Has 140 Blast hits to 132 proteins
in 17 species: Archae - 0; Bacteria - 0; Metazoa - 1;
Fungi - 2; Plants - 133; Viruses - 0; Other Eukaryotes -
4 (source: NCBI BLink). | chr2:6386400-6387275 FORWARD
LENGTH=291
Length = 291
Score = 213 bits (543), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 135/327 (41%), Positives = 191/327 (58%), Gaps = 49/327 (14%)
Query: 1 MPAARYFSPVDTLEQKLQIERKLGTVKACKYFNLLTRFLSVKISKHEFDRQCRATIGREN 60
M + + FS +++LE K I +K+G +A YF+ L +FL+ +ISK EFD+ C T+GREN
Sbjct: 1 MGSDQCFSRLNSLEIKALIYQKIGHQRADTYFDQLGKFLTSRISKSEFDKLCSKTVGREN 60
Query: 61 IHLHNHFIRSILKKASLSKRGNIIGSSLNVKIPNGCNDLQFLCKD--FLQSPRKVRTPSL 118
I LHN +RSILK AS++K S + P + L D F SPRK R+
Sbjct: 61 ISLHNRLVRSILKNASVAK-------SPPPRYPK-----KSLYGDPVFPPSPRKCRS--- 105
Query: 119 RDRRFKDRPSPLGPNGKNVNIGFEDSVREIHEQQSNKELDSAASRIPL---CVEDGEEVD 175
R+F+DRPSPLGP GK ++ +N E S A R+P+ VEDGEEV+
Sbjct: 106 --RKFRDRPSPLGPLGKPQSL-----------TTTNDESMSKAQRLPMEVVSVEDGEEVE 152
Query: 176 QDSEKVNIYMRSPIQPPLAIPTYNKG-TRTLLHNGLPSGTDTCQSIGELPDTPSLTKRLE 234
Q + ++ RSP+ PL + + K R +NG+ +TCQS GELPD +L RLE
Sbjct: 153 QMTGSPSVQSRSPLTAPLGVSFHLKSKARFSTYNGI--NRETCQSSGELPDMITLRARLE 210
Query: 235 QKLEMEGFKISADAAALMNKALDTYLKRLIKPCLDLAASKAVNRSNGPIQPGLNEQIGSV 294
+KLEMEG K+S D+A L+N+ L+ Y++RLI+PCL LA+ + SN V
Sbjct: 211 KKLEMEGIKLSMDSANLLNRGLNAYMRRLIEPCLSLASQQKRAVSN-------------V 257
Query: 295 SVSDFRTATELNPNILGKDWSLHLEKV 321
S+ DF A E+NP +LG++W + LEK+
Sbjct: 258 SMLDFHAAMEVNPRVLGEEWPIQLEKI 284
>AT4G33890.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G14850.1); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr4:16250057-16251085 FORWARD LENGTH=342
Length = 342
Score = 200 bits (508), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 136/335 (40%), Positives = 190/335 (56%), Gaps = 28/335 (8%)
Query: 8 SPVDTLEQKLQIERKLGTVKACKYFNLLTRFLSVKISKHEFDRQCRATIGRENIHLHNHF 67
S +DTLE K I R++G +A YFN L RF ++KI+K EFD+ C TIGR+NIHLHN
Sbjct: 8 SRLDTLEIKALIYREIGNQRAESYFNQLGRFFALKITKSEFDKLCIKTIGRQNIHLHNRL 67
Query: 68 IRSILKKASLSKRGNII--GSSLNVKIPNG----CNDLQFLCKDFLQSPRKVRTPSLRDR 121
IRSI+K A ++K I G S V+ NG + +Q L D SP T R R
Sbjct: 68 IRSIIKNACIAKSPPFIKKGGSF-VRFGNGDSKKNSQIQPLHGDSAFSP---STRKCRSR 123
Query: 122 RFKDRPSPLGPNGK--NVNIGFEDSVREIHEQQSNKELDSAASRIPLCVEDGEEVDQDSE 179
+ +DRPSPLGP GK ++ E+S + + QS EL S SR P+ V EE ++ +
Sbjct: 124 KLRDRPSPLGPLGKPHSLTTTNEES---MSKAQSATELLSLGSRPPVEVVSVEEGEEVEQ 180
Query: 180 KV----NIYMRSPIQPPLAIPTY--NKGTRTLLHN----GLPSGTDTCQSIGELPDTPSL 229
++ R P+ PL + N TR + N +TCQ+ GELPDT +L
Sbjct: 181 IAGGSPSVQSRCPLTAPLGVSMSLRNGATRKSVSNVSMCSRSFNRETCQNNGELPDTRTL 240
Query: 230 TKRLEQKLEMEGFKISADAAALMNKALDTYLKRLIKPCLDLAASKAVNRSNGPIQPGLNE 289
RLE++LEMEG KI+ D+ +L+N LD +++RLI+PCL LA ++ + +
Sbjct: 241 RSRLERRLEMEGLKITMDSVSLLNSGLDVFMRRLIEPCLSLANTRCGTDRVREMNYQYTQ 300
Query: 290 Q---IGSVSVSDFRTATELNPNILGKDWSLHLEKV 321
Q + VS+SDFR ELN ILG+DW +H+EK+
Sbjct: 301 QSRRLSYVSMSDFRAGMELNTEILGEDWPMHMEKI 335
>AT4G33890.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G14850.1); Has 133 Blast hits to 131 proteins
in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 2; Plants - 129; Viruses - 0; Other Eukaryotes -
2 (source: NCBI BLink). | chr4:16250057-16251085 FORWARD
LENGTH=342
Length = 342
Score = 200 bits (508), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 136/335 (40%), Positives = 190/335 (56%), Gaps = 28/335 (8%)
Query: 8 SPVDTLEQKLQIERKLGTVKACKYFNLLTRFLSVKISKHEFDRQCRATIGRENIHLHNHF 67
S +DTLE K I R++G +A YFN L RF ++KI+K EFD+ C TIGR+NIHLHN
Sbjct: 8 SRLDTLEIKALIYREIGNQRAESYFNQLGRFFALKITKSEFDKLCIKTIGRQNIHLHNRL 67
Query: 68 IRSILKKASLSKRGNII--GSSLNVKIPNG----CNDLQFLCKDFLQSPRKVRTPSLRDR 121
IRSI+K A ++K I G S V+ NG + +Q L D SP T R R
Sbjct: 68 IRSIIKNACIAKSPPFIKKGGSF-VRFGNGDSKKNSQIQPLHGDSAFSP---STRKCRSR 123
Query: 122 RFKDRPSPLGPNGK--NVNIGFEDSVREIHEQQSNKELDSAASRIPLCVEDGEEVDQDSE 179
+ +DRPSPLGP GK ++ E+S + + QS EL S SR P+ V EE ++ +
Sbjct: 124 KLRDRPSPLGPLGKPHSLTTTNEES---MSKAQSATELLSLGSRPPVEVVSVEEGEEVEQ 180
Query: 180 KV----NIYMRSPIQPPLAIPTY--NKGTRTLLHN----GLPSGTDTCQSIGELPDTPSL 229
++ R P+ PL + N TR + N +TCQ+ GELPDT +L
Sbjct: 181 IAGGSPSVQSRCPLTAPLGVSMSLRNGATRKSVSNVSMCSRSFNRETCQNNGELPDTRTL 240
Query: 230 TKRLEQKLEMEGFKISADAAALMNKALDTYLKRLIKPCLDLAASKAVNRSNGPIQPGLNE 289
RLE++LEMEG KI+ D+ +L+N LD +++RLI+PCL LA ++ + +
Sbjct: 241 RSRLERRLEMEGLKITMDSVSLLNSGLDVFMRRLIEPCLSLANTRCGTDRVREMNYQYTQ 300
Query: 290 Q---IGSVSVSDFRTATELNPNILGKDWSLHLEKV 321
Q + VS+SDFR ELN ILG+DW +H+EK+
Sbjct: 301 QSRRLSYVSMSDFRAGMELNTEILGEDWPMHMEKI 335
>AT4G31440.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G24530.1); Has 210 Blast hits to 209 proteins
in 55 species: Archae - 0; Bacteria - 72; Metazoa - 2;
Fungi - 6; Plants - 128; Viruses - 0; Other Eukaryotes -
2 (source: NCBI BLink). | chr4:15253731-15254870 FORWARD
LENGTH=379
Length = 379
Score = 152 bits (384), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 115/365 (31%), Positives = 178/365 (48%), Gaps = 48/365 (13%)
Query: 10 VDTLEQKLQIERKLGTVKACKYFNLLTRFLSVKISKHEFDRQCRATIGRENIHLHNHFIR 69
+D E K+ I +K+G ++ +YF L RFLS K++K EFD+ C +GREN+ LHN IR
Sbjct: 9 IDLAELKVHIVKKVGVERSTRYFYYLGRFLSQKLTKSEFDKSCFRLLGRENLSLHNKLIR 68
Query: 70 SILKKASLSKRGNII------GSSLNVKIPNGCNDLQFLCKDFLQSP--------RKVRT 115
SIL+ ASL+K + G SL + +G + + L D +++ KVR
Sbjct: 69 SILRNASLAKSPPSVHQSGHPGKSLVLGKEDGPEESRSLNPDHIRNDLALSNGVLAKVRP 128
Query: 116 PSLRDRRFKDRPSPLGPNGKNVN-IGFEDSVREIHEQQSNKELDSAASRIPLCVEDGEEV 174
+ DR +D+P PLG NGK + + R E+ S + + + +
Sbjct: 129 GTCDDRTIRDKPCPLGSNGKVLGPFAYSRPGRYPDERDSAFLCPAEQKAVSGKDQVAAPI 188
Query: 175 DQDSE-KVNIYMRSPIQPPLAIPTYNK---GTRTLLHNGLPSGTDTCQSIGELPDTPSLT 230
+D E +V I P+ PL IP + G R + + +C G L DT L
Sbjct: 189 SRDDEAQVRILSTPPVMAPLGIPFCSASVGGDRRTVPVSTSAAAISCYDSGGLSDTEMLR 248
Query: 231 KRLEQKLEMEGF-KISADAAALMNKALDTYLKRLIKPCLDLAASKAVN------------ 277
KR+E +G +SA+ + ++N LD YLK+L+K C+DLA ++++N
Sbjct: 249 KRMENIAVTQGLGGVSAECSIVLNNMLDLYLKKLMKSCVDLAGARSMNGTPGKHSLEKQQ 308
Query: 278 ---------RSNGPI------QPG-LNEQIGSVSVSDFRTATELNPNILGKDWSLHLEKV 321
R+N QP + + SVS+ DFR A ELNP+ LG+DW L E++
Sbjct: 309 SRDELVNGVRTNNSFHIQTSNQPSDITREQHSVSLLDFRVAMELNPHQLGEDWPLLRERI 368
Query: 322 TGSIL 326
+ S+
Sbjct: 369 SISLF 373
>AT5G67410.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G14850.1); Has 1807 Blast hits to 1807 proteins
in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
- 339 (source: NCBI BLink). | chr5:26896600-26897463
REVERSE LENGTH=287
Length = 287
Score = 147 bits (372), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 122/329 (37%), Positives = 169/329 (51%), Gaps = 53/329 (16%)
Query: 1 MPAARY-FSPVDTLEQKLQIERKLGTVKACKYFNLLTRFLSVKISKHEFDRQCRATIGRE 59
MP +++ D E K QIE+++G K Y NLL++FLS+KISK +FD+ T+ RE
Sbjct: 1 MPTSQHHVVRTDISELKSQIEKRIGRAKTESYLNLLSKFLSLKISKSDFDKLIIVTVKRE 60
Query: 60 NIHLHNHFIRSILKKASLSK------RGNIIGSSLNVKIPNGCNDLQFLCKDFLQSPRKV 113
NI LHN +R ILK LSK + + + K NG Q LCK+ +SPRK
Sbjct: 61 NISLHNALLRGILKNICLSKTLPPFVKNGVESDNKKKKQLNGA--FQSLCKELPRSPRKG 118
Query: 114 RTPSLRDRRFKDRPSPLGPNGKNVNIGFEDSVREIHEQQSNKELDSAASRIPLCVEDGEE 173
RT RR K+ NI S+ E+ S++ R +E+ EE
Sbjct: 119 RT----QRRL----------NKDGNISKGKSLVT--------EVVSSSGRQQWSMENVEE 156
Query: 174 VDQDSEKVNIYMRSPIQPPLAIPTYNKGTRTLLHNGLPSGTDTCQSIGELPDTPSLTKRL 233
VDQ + + PI+ P + R ++ T C S GELPD+ SL K+L
Sbjct: 157 VDQ---LIPCWRSQPIEAPFGV-----NLRDVIKKQHRIDT-CCYSSGELPDSVSLKKKL 207
Query: 234 EQKLEMEGFKISADAAALMNKALDTYLKRLIKPCLDLAASKAVNRSNGPIQPGLNEQIGS 293
E LE EG ++S A +N LD +LKRLIKPCL+LAAS++ N S+ +
Sbjct: 208 EDDLE-EGLEVSVGFANSLNAGLDVFLKRLIKPCLELAASRSSNASS------------A 254
Query: 294 VSVSDFRTATELNPNILGKDWSLHLEKVT 322
S+ DF+ A LNP+ILG+DW LEK+
Sbjct: 255 SSLVDFQVAMALNPSILGEDWPTKLEKIA 283
>AT2G24530.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G31440.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr2:10422597-10423820 FORWARD LENGTH=407
Length = 407
Score = 140 bits (353), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 122/396 (30%), Positives = 183/396 (46%), Gaps = 91/396 (22%)
Query: 10 VDTLEQKLQIERKLGTVKACKYFNLLTRFLSVKISKHEFDRQCRATIGRENIHLHNHFIR 69
+ E K I +K G ++ +YF L RFLS K++K EFD+ C +GREN+ LHN IR
Sbjct: 9 ISLCELKEHIVKKTGVERSRRYFYYLGRFLSQKLTKSEFDKTCLRLLGRENLSLHNQLIR 68
Query: 70 SILKKASLSK-------------------RGNIIGSSLNVKIPNGCNDLQFLCKDFLQ-S 109
SIL+ A+++K RG+ + S + IPN L S
Sbjct: 69 SILRNATVAKSPPPDHEAGHSTKANAFQSRGDGLEQSGTL-IPNHSQHEPVWSNGVLPIS 127
Query: 110 PRKVRTPSLRDRRFKDRPSPLGPNGK--------------NVNIGFED-----SVREIHE 150
PRKVR+ +++R+ +DRPSPLG NGK ++G E+ S R + +
Sbjct: 128 PRKVRS-GMQNRKSRDRPSPLGSNGKVEHMLHQPVCREDNRGSVGMENGDYQRSGRYVAD 186
Query: 151 QQSNKELDSAAS-RIP-----LCVEDGEEVDQDSE-KVNIYMRSPIQPPLAIP----TYN 199
++ + L RIP V ++ +Q+ + +VN+ M SP+ PL IP +
Sbjct: 187 EKDGEFLRPVEKPRIPNKEKIAAVSMRDDQNQEEQARVNLSM-SPLIAPLGIPFCSASVG 245
Query: 200 KGTRTLLHNGLPSGTD----TCQSIGELPDTPSLTKRLEQKLEMEGFK-ISADAAALMNK 254
RT +P T+ +C G LPD L KR+E +G + +S + A +N
Sbjct: 246 GSPRT-----IPVSTNAELISCYDSGGLPDIEMLRKRMENIAVAQGLEGVSMECAKTLNN 300
Query: 255 ALDTYLKRLIKPCLDLAASKAVNRSNGPIQPG---------------------------- 286
LD YLK+LI C DL +++ N G + G
Sbjct: 301 MLDVYLKKLINSCFDLVGARSTNGDPGKQRIGKQQSQNKIVNGVWPTNSLKIQTPNGSSD 360
Query: 287 LNEQIGSVSVSDFRTATELNPNILGKDWSLHLEKVT 322
+ + SVS+ DFRTA ELNP LG+DW E+++
Sbjct: 361 IRQDHHSVSMLDFRTAMELNPRQLGEDWPTLRERIS 396