Miyakogusa Predicted Gene
- Lj0g3v0327029.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0327029.1 Non Characterized Hit- tr|A3BER1|A3BER1_ORYSJ
Putative uncharacterized protein OS=Oryza sativa
subsp,43.75,0.0000000001,ADP-ribosylation,NULL; seg,NULL; SUBFAMILY
NOT NAMED,NULL; FAMILY NOT NAMED,NULL; ZINC_FINGER_C2H2_1,CUFF.22245.1
(426 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                    Score   E
Sequences producing significant alignments:                         (bits)  Value
AT5G54630.1 | Symbols: | zinc finger protein-related | chr5:221...     508  e-144
AT4G27240.1 | Symbols: | zinc finger (C2H2 type) family protein...     465  e-131
AT1G11490.1 | Symbols: | zinc finger (C2H2 type) family protein...     199  2e-51
AT1G75710.1 | Symbols: | C2H2-like zinc finger protein | chr1:2...     170  2e-42
AT2G29660.1 | Symbols: | zinc finger (C2H2 type) family protein...     145  4e-35
AT4G22560.1 | Symbols: | unknown protein; BEST Arabidopsis thal...     107  2e-23
AT1G62520.1 | Symbols: | unknown protein; BEST Arabidopsis thal...     106  4e-23
AT4G12450.1 | Symbols: | unknown protein; BEST Arabidopsis thal...      95  1e-19
>AT5G54630.1 | Symbols: | zinc finger protein-related |
chr5:22192607-22194260 REVERSE LENGTH=472
Length = 472
Score = 508 bits (1309), Expect = e-144, Method: Compositional matrix adjust.
Identities = 280/464 (60%), Positives = 323/464 (69%), Gaps = 49/464 (10%)
Query: 1 MPTVWFNLKRSLHCKSEPSEVHDP----KSRKQLSTILTKK-------------PGRSGC 43
+PTVWF+LK+SLHCKSEPS+VHDP K ++ LSTI TKK G SGC
Sbjct: 20 IPTVWFSLKKSLHCKSEPSDVHDPISTTKQQQHLSTISTKKISGISSGGAAVCGGGLSGC 79
Query: 44 SRSIANLKDVIHGSKRHLEDKPPTCSPRSIGSSEFLNPITHEVILSNSRCELKITGYGGF 103
SRSIANLKDVIHGSKRH E KPP SPRSIGS+EFLNPITHEVILSNS CELKITG G
Sbjct: 80 SRSIANLKDVIHGSKRHFE-KPPISSPRSIGSNEFLNPITHEVILSNSTCELKITGVGDM 138
Query: 104 QEXXXXXXXXXXXXXXXXXST-FVGTLRXXXXXXXXXXXMHYFN--PSFRTSSTPPRKSP 160
ST +VG LR MHY N S+R+ + RK
Sbjct: 139 ASPVGAADSGGGGGGGNGRSTTYVGMLRPGTP-------MHYLNHSASYRSQT---RKGS 188
Query: 161 FSSSDKE--------GSGLHSSNR----FHPETTTDSNGSSSVTCHKCGEQFNKWEAAEA 208
F+ S+++ G G H++ R + E+T + +SSV+CHKCGEQFNK EAAEA
Sbjct: 189 FALSERDRGGGGGGEGLGFHTNRRVSLEMNRESTINGGNNSSVSCHKCGEQFNKLEAAEA 248
Query: 209 HHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENNCGRIERVLKVHNMQRTLARFEEYRE 268
HHLSKHAVTELVEGDSSRKIVEIICRTSWLKSEN CGRI+RVLKVHNMQ+TLARFEEYRE
Sbjct: 249 HHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQCGRIDRVLKVHNMQKTLARFEEYRE 308
Query: 269 MVKTKASKLQKKHPRCLADGNELLRFYGTTLAXXXXXXXXXXXXXXDKCCVCRIIRNGFS 328
VK +ASKLQKKHPRCLADGNELLRF+GTT+A +KCCVCRIIRNGFS
Sbjct: 309 TVKIRASKLQKKHPRCLADGNELLRFHGTTVACGLGINGSTSVCTAEKCCVCRIIRNGFS 368
Query: 329 AKKELKXXXXXXXXXXXXRAFETI-----ESFGNEPPSLRKALIVCRVIAGRVHRPLENI 383
+K+E RAFE+I + G+ ++RK LIVCRVIAGRVHRP+EN+
Sbjct: 369 SKREKNNGVGVFTASTSGRAFESILVNGGDESGDVDRTVRKVLIVCRVIAGRVHRPVENV 428
Query: 384 QEIAS-QTGFDSLAGKVGLYSNIEELYLLNPRALLPCFVVICKP 426
+E+ +GFDSLAGKVGLY+N+EELYLLNP+ALLPCFVVICKP
Sbjct: 429 EEMNGLMSGFDSLAGKVGLYTNVEELYLLNPKALLPCFVVICKP 472
>AT4G27240.1 | Symbols: | zinc finger (C2H2 type) family protein |
chr4:13640160-13641640 FORWARD LENGTH=431
Length = 431
Score = 465 bits (1196), Expect = e-131, Method: Compositional matrix adjust.
Identities = 251/436 (57%), Positives = 301/436 (69%), Gaps = 44/436 (10%)
Query: 1 MPTVWFNLKRSLHCKSEPSEVHDPKSRKQLSTILTKKPGRSG---------CSRSIANLK 51
+P+VWF+LK+SL CKS+ S+VH P+S+K+L+ I TK+ S CSRSIANLK
Sbjct: 30 LPSVWFSLKKSLPCKSDVSDVHIPRSKKELAPISTKRTTTSSGGGVGGRSGCSRSIANLK 89
Query: 52 DVIHGSKRHLEDKPPTCSPRSIGSSEFLNPITHEVILSNSRCELKITGYGGFQEXXXXXX 111
DVIHG++RHLE KP SPRSIGSSEFLNPITH+VI SNS CELKIT G +
Sbjct: 90 DVIHGNQRHLE-KPLCSSPRSIGSSEFLNPITHDVIFSNSTCELKITAAGATE------- 141
Query: 112 XXXXXXXXXXXSTFVGTLRXXXXXXXXXXXMHYFNPSFRTSSTPPRKSPFSSSDKEGSGL 171
FVG LR ++ +S SS D+EG G
Sbjct: 142 -------------FVGNLRPGTPV------------NYSSSRRSQTSRKASSLDREGLGF 176
Query: 172 HSSNRFHPETTTDSNGSSSVTCHKCGEQFNKWEAAEAHHLSKHAVTELVEGDSSRKIVEI 231
H S R + + +SSV+CHKCGE+F+K EAAEAHHL+KHAVTEL+EGDSSR+IVEI
Sbjct: 177 HQSRRENDREAAINGDNSSVSCHKCGEKFSKLEAAEAHHLTKHAVTELMEGDSSRRIVEI 236
Query: 232 ICRTSWLKSENNCGRIERVLKVHNMQRTLARFEEYREMVKTKASKLQKKHPRCLADGNEL 291
ICRTSWLK+EN GRI+R+LKVHNMQ+TLARFEEYR+ VK +ASKLQKKHPRC+ADGNEL
Sbjct: 237 ICRTSWLKTENQGGRIDRILKVHNMQKTLARFEEYRDTVKIRASKLQKKHPRCIADGNEL 296
Query: 292 LRFYGTTLAXXXXXXXXXXXXXXDKCCVCRIIRNGFSAKKELKXXXXXXXXXXXXRAFET 351
LRF+GTT+A +KCCVCRIIRNGFSAK+E+ RAFE+
Sbjct: 297 LRFHGTTVACALGINGSTSLCSSEKCCVCRIIRNGFSAKREMNNGIGVFTASTSERAFES 356
Query: 352 IESFGNEPPSLRKALIVCRVIAGRVHRPLENIQEIAS-QTGFDSLAGKVGLYSNIEELYL 410
I G+ RKALIVCRVIAGRVHRP+EN++E+ +GFDSLAGKVGLY+N+EELYL
Sbjct: 357 I-VIGDGGGGDRKALIVCRVIAGRVHRPVENVEEMGGLLSGFDSLAGKVGLYTNVEELYL 415
Query: 411 LNPRALLPCFVVICKP 426
LN RALLPCFV+ICKP
Sbjct: 416 LNSRALLPCFVLICKP 431
>AT1G11490.1 | Symbols: | zinc finger (C2H2 type) family protein |
chr1:3868884-3870065 REVERSE LENGTH=365
Length = 365
Score = 199 bits (507), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 109/251 (43%), Positives = 145/251 (57%), Gaps = 11/251 (4%)
Query: 183 TDSNGSSSVTCHKCGEQFNKWEAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSW----- 237
+D G + C KC E+ +A EAH+LS H+V L+ GD SR VE+IC T +
Sbjct: 119 SDICGFGVLACQKCHERVRDLDAFEAHYLSNHSVVRLLAGDFSRTTVELICNTGYSHKLG 178
Query: 238 -LKSENNCGRIERVLKVHNMQRTLARFEEYREMVKTKASKLQKKHPRCLADGNELLRFYG 296
+K N I + K+ N+QR +A FE+YRE+VK +A+KL KKH RC+ADGNE L F+G
Sbjct: 179 KMKGNN----ISAIFKIQNLQRVVADFEDYRELVKIRANKLSKKHSRCMADGNEFLGFHG 234
Query: 297 TTLAXXXXXXXXXXXX-XXDKCCVCRIIRNGFSAKKELKXXXXXXXXXXXXRAFETIESF 355
TTL+ D C VC I+R+GFS K A E+IE+
Sbjct: 235 TTLSCTLGFSNSSSNLCFSDHCEVCHILRHGFSPKTRPDGIKGVLTASTSSTALESIETD 294
Query: 356 GNEPPSLRKALIVCRVIAGRVHRPLENIQEIASQTGFDSLAGKVGLYSNIEELYLLNPRA 415
A+++CRVIAGRVH+P++ + + FDSLA KVG S IEELYLL+ +A
Sbjct: 295 QGRNRGSLIAVVLCRVIAGRVHKPMQTFENSLGFSEFDSLALKVGQNSRIEELYLLSTKA 354
Query: 416 LLPCFVVICKP 426
LLPCFV+I KP
Sbjct: 355 LLPCFVIIFKP 365
>AT1G75710.1 | Symbols: | C2H2-like zinc finger protein |
chr1:28428806-28431128 FORWARD LENGTH=462
Length = 462
Score = 170 bits (430), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 103/253 (40%), Positives = 134/253 (52%), Gaps = 20/253 (7%)
Query: 193 CHKCGEQFNKWEAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENNCGRIERVLK 252
C +CGE F K E+ E H +HAV+EL DS R IVEII ++SWLK ++ +IER+LK
Sbjct: 206 CSQCGEVFPKLESLELHQAVRHAVSELGPEDSGRNIVEIIFKSSWLKKDSPICQIERILK 265
Query: 253 VHNMQRTLARFEEYREMVKTKASKLQKKHPRCLADGNELLRFYGTTLAXXXXXXXXXXXX 312
VHN QRT+ RFE+ R+ VK +A + +K RC ADGNELLRF+ TTL
Sbjct: 266 VHNTQRTIQRFEDCRDAVKARALQATRKDARCAADGNELLRFHCTTLTCSLGARGSSSLC 325
Query: 313 XXDKCC-VCRIIRNGFSAKKELKXXXXXXXXXXXXRAFETIESFGNEPPSLRKALIVCRV 371
C VC +IR+GF K + + R+ ++VCRV
Sbjct: 326 SNLPVCGVCTVIRHGFQGKSGGGGANVANAGVRTTASSGRADDLLRCSDDARRVMLVCRV 385
Query: 372 IAGRVHR---PLENIQEIASQTG----------------FDSLAGKVGLYSNIEELYLLN 412
IAGRV R P + A + FDS+A G+YSN+EEL + N
Sbjct: 386 IAGRVKRVDLPAADASATAEKKSTVEDNSVVGVSSSGGTFDSVAVNAGVYSNLEELVVYN 445
Query: 413 PRALLPCFVVICK 425
PRA+LPCFVVI K
Sbjct: 446 PRAILPCFVVIYK 458
>AT2G29660.1 | Symbols: | zinc finger (C2H2 type) family protein |
chr2:12679346-12680467 FORWARD LENGTH=373
Length = 373
Score = 145 bits (367), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 97/263 (36%), Positives = 138/263 (52%), Gaps = 17/263 (6%)
Query: 176 RFHPETTTDSNGSSSVT-CHKCGEQFNKWEAAEAHHLSKHAVTELVEGDSSRKIVEIICR 234
R H +T + + S + C+ CGE F K E H KHAV+EL+ G+SS IV+II +
Sbjct: 110 RIHQQTEFEISSSDEIFPCNSCGEIFPKINLLENHIAIKHAVSELIAGESSTNIVKIIFK 169
Query: 235 TSWLKSEN-NCGRIERVLKVHNMQRTLARFEEYREMVKTKASKLQK-----KHPRCLADG 288
+ W + N I R+LK+HN + L RFEEYRE VK KA++ RC+ADG
Sbjct: 170 SGWPEQGNYKSPVINRILKIHNSSKILTRFEEYREFVKAKAARSNGGGRRWDDERCVADG 229
Query: 289 NELLRFYGTTLAXXXXXXXXXXXXXXDKCCVCRIIRNGFSAKKELKXXXXXXXXXXXXRA 348
NELLRFY +T C +C II +GFS K +
Sbjct: 230 NELLRFYCSTFMCDLGQNGKSNLCGHQYCSICGIIGSGFSPKLDGIATLATGWRGHVAVP 289
Query: 349 FETIESFGNEPPSLRKALIVCRVIAGRV--HRPLENIQEIASQTGFDSLAGKVG------ 400
E E FG ++++A++VCRV+AGRV ++ + + G+DSL G+ G
Sbjct: 290 EEVEEEFGFM--NVKRAMLVCRVVAGRVGCDLIDDDDVDKSDGGGYDSLVGQSGNKSGAL 347
Query: 401 LYSNIEELYLLNPRALLPCFVVI 423
L + +EL + NPRA+LPCFV++
Sbjct: 348 LRIDDDELLVFNPRAVLPCFVIV 370
>AT4G22560.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G12450.1); Has 380 Blast hits to 380 proteins
in 21 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 6; Plants - 374; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr4:11880178-11880972 FORWARD
LENGTH=264
Length = 264
Score = 107 bits (267), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 74/216 (34%), Positives = 108/216 (50%), Gaps = 47/216 (21%)
Query: 212 SKHAVTELVEGDSSRKIVEIICRTSWLKSENNCGRIERVLKVHNMQRTLARFEEYREMVK 271
+ A+TEL +G SR +VEII +SW S+ GRIE + KV + RT+ RFEEYRE+VK
Sbjct: 89 TSDALTELPDGHPSRNVVEIIFHSSW-SSDEFPGRIEMIFKVEHGSRTVTRFEEYREVVK 147
Query: 272 TKA----SKLQKKHPRCLADGNELLRFYGTTLAXXXXXXXXXXXXXXDKCCVCRIIRNGF 327
++A +++ RCLADGNE++RFY + +GF
Sbjct: 148 SRAGFNGGTCEEEDARCLADGNEMMRFY--------------------------PVLDGF 181
Query: 328 SAKKELKXXXXXXXXXXXXRAFETIESFGNEPPSLRKALIVCRVIAGRVHRPLENIQEIA 387
+ + + E S G RKA+++CRVIAGRV +
Sbjct: 182 NGGACVFAGGKGQAVCTFSGSGEAYVSSGG--GGGRKAMMICRVIAGRVDDVI------- 232
Query: 388 SQTGFDSLAGKVGLYSNIEELYLLNPRALLPCFVVI 423
G DS+AG+ G EL++ + RA+LPCF++I
Sbjct: 233 -GFGSDSVAGRDG------ELFVFDTRAVLPCFLII 261
>AT1G62520.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G12450.1); Has 388 Blast hits to 388 proteins
in 26 species: Archae - 0; Bacteria - 1; Metazoa - 0;
Fungi - 8; Plants - 376; Viruses - 0; Other Eukaryotes -
3 (source: NCBI BLink). | chr1:23144506-23145348 FORWARD
LENGTH=280
Length = 280
Score = 106 bits (264), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 75/211 (35%), Positives = 106/211 (50%), Gaps = 32/211 (15%)
Query: 216 VTELVEGDSSRKIVEIICRTSWLKSENNCGRIERVLKVHNMQRTLARFEEYREMVKTKA- 274
+TEL EG SR +VEII +TSW + GR+E + KV N +TL RFEEYRE VK ++
Sbjct: 100 LTELSEGHQSRNVVEIIFQTSW-GPKPFSGRVEMIFKVQNGSKTLTRFEEYREAVKARSV 158
Query: 275 SKLQKKHPRCLADGNELLRFYGTTLAXXXXXXXXXXXXXXDKCCVCRIIRNGFSAKKELK 334
K ++++ R +ADGNE +RFY C+ G SA L
Sbjct: 159 GKAREENARSVADGNETMRFY----------------------CLGPSYGGGGSAWGILG 196
Query: 335 XXXXXXXXXXXXRAFETIESFGNEPPSLRKALIVCRVIAGRVHRPLENIQEIASQTGFDS 394
+ E G RKA++VCRVIAGRV + E + ++ FDS
Sbjct: 197 GKGGGASIYTFAGSSTANEKAGGGKG--RKAMLVCRVIAGRVTKQNELKYDSDLRSRFDS 254
Query: 395 LAGKVGLYSNIEELYLLNPRALLPCFVVICK 425
++G G EL + + RA+LPCF++I +
Sbjct: 255 VSGDDG------ELLVFDTRAVLPCFLIIYR 279
>AT4G12450.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G22560.1); Has 380 Blast hits to 380 proteins
in 23 species: Archae - 0; Bacteria - 0; Metazoa - 1;
Fungi - 4; Plants - 374; Viruses - 0; Other Eukaryotes -
1 (source: NCBI BLink). | chr4:7385841-7386674 REVERSE
LENGTH=277
Length = 277
Score = 94.7 bits (234), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 69/218 (31%), Positives = 103/218 (47%), Gaps = 50/218 (22%)
Query: 216 VTELVEGDSSRKIVEIICRTSWLKSENNCGRIERVLKVHNMQRTLARFEEYREMVKTKA- 274
+T+L +G SR +VEII ++SW S+ GR+E + KV N + + RFEEYRE VK+++
Sbjct: 97 LTDLPDGHPSRNVVEIIFQSSW-SSDEFPGRVEMIFKVENGSKAVTRFEEYREAVKSRSC 155
Query: 275 SKLQ---------KKHPRCLADGNELLRFYGTTLAXXXXXXXXXXXXXXDKCCVCRIIRN 325
SK+ ++ RC ADGNE++RF+ VC
Sbjct: 156 SKVDSDRVDGSACDENARCSADGNEMMRFFPLGPIPGGINGGAWGFPGGKGAAVCT---- 211
Query: 326 GFSAKKELKXXXXXXXXXXXXRAFETIESFGNEPPSLRKALIVCRVIAGRVHRPLENIQE 385
FS E A + G R+A+++CRVIAGRV +
Sbjct: 212 -FSGSGE---------------AHASTGGGGG-----RRAMLICRVIAGRVAK------- 243
Query: 386 IASQTGFDSLAGKVGLYSNIEELYLLNPRALLPCFVVI 423
+ G DS+AG+ G EL + + RA+LPCF++
Sbjct: 244 -KGEFGSDSVAGRAG------ELIVFDARAVLPCFLIF 274