Miyakogusa Predicted Gene
- Lj0g3v0075609.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0075609.1 Non Characterized Hit- tr|A3BER1|A3BER1_ORYSJ
Putative uncharacterized protein OS=Oryza sativa
subsp,42.61,0.0000000001,no description,NULL; seg,NULL; SUBFAMILY NOT
NAMED,NULL; FAMILY NOT NAMED,NULL;
ZINC_FINGER_C2H2_1,Z,NODE_72166_length_2066_cov_36.444336.path1.1
(430 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                 Score   E
Sequences producing significant alignments:                     (bits) Value
AT5G54630.1 | Symbols: | zinc finger protein-related | chr5:221... 528 e-150
AT4G27240.1 | Symbols: | zinc finger (C2H2 type) family protein... 481 e-136
AT1G11490.1 | Symbols: | zinc finger (C2H2 type) family protein... 206 2e-53
AT1G75710.1 | Symbols: | C2H2-like zinc finger protein | chr1:2... 180 2e-45
AT2G29660.1 | Symbols: | zinc finger (C2H2 type) family protein... 145 4e-35
AT4G22560.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 114 2e-25
AT1G62520.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 107 2e-23
AT4G12450.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 102 4e-22
>AT5G54630.1 | Symbols: | zinc finger protein-related |
chr5:22192607-22194260 REVERSE LENGTH=472
Length = 472
Score = 528 bits (1360), Expect = e-150, Method: Compositional matrix adjust.
Identities = 295/464 (63%), Positives = 328/464 (70%), Gaps = 45/464 (9%)
Query: 1 MPTVWFSLKRSLHCKSEPTDVHVP----KSRKHLATILTKR-----------AGTGRSGC 45
+PTVWFSLK+SLHCKSEP+DVH P K ++HL+TI TK+ G G SGC
Sbjct: 20 IPTVWFSLKKSLHCKSEPSDVHDPISTTKQQQHLSTISTKKISGISSGGAAVCGGGLSGC 79
Query: 46 SRSIANLKDVIHGSKRHLEKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKITGYGGFQ 105
SRSIANLKDVIHGSKRH EKPP SPRSIGS+EFLNPITHEVILSNS CELKITG G
Sbjct: 80 SRSIANLKDVIHGSKRHFEKPPISSPRSIGSNEFLNPITHEVILSNSTCELKITGVGDMA 139
Query: 106 E--GGVASDGNNNGGETGDSTFVGTLRXXXXXXXXXXXMHYFN--PSYKTPATPPRKLSP 161
G S G GG +T+VG LR MHY N SY++ RK S
Sbjct: 140 SPVGAADSGGGGGGGNGRSTTYVGMLRPGTP-------MHYLNHSASYRSQT---RKGSF 189
Query: 162 FLSSDKEXXXXXXXXXX--XXXRLSLETDSNGPCN------VTCHKCGEQFSKWEAAEAH 213
LS R+SLE + N V+CHKCGEQF+K EAAEAH
Sbjct: 190 ALSERDRGGGGGGEGLGFHTNRRVSLEMNRESTINGGNNSSVSCHKCGEQFNKLEAAEAH 249
Query: 214 HLSKHAVTELVEGDSSRKIVEIICRTSWLKSENHCGRIERVLKVHNMQKTLARFEEYREM 273
HLSKHAVTELVEGDSSRKIVEIICRTSWLKSEN CGRI+RVLKVHNMQKTLARFEEYRE
Sbjct: 250 HLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQCGRIDRVLKVHNMQKTLARFEEYRET 309
Query: 274 VKIKASKLQKKHPRCLADGNELLRFYGTTXXXXXXXXXXXXXXQYERCCVCRIIRNGFSS 333
VKI+ASKLQKKHPRCLADGNELLRF+GTT E+CCVCRIIRNGFSS
Sbjct: 310 VKIRASKLQKKHPRCLADGNELLRFHGTTVACGLGINGSTSVCTAEKCCVCRIIRNGFSS 369
Query: 334 NKEELKGGIGVFTTSTSGRAFESIEILGHDPS------LRKALIVCRVIAGRVHRPLENI 387
+E+ G+GVFT STSGRAFESI + G D S +RK LIVCRVIAGRVHRP+EN+
Sbjct: 370 KREK-NNGVGVFTASTSGRAFESILVNGGDESGDVDRTVRKVLIVCRVIAGRVHRPVENV 428
Query: 388 QEMAG-QTGFDSLAGKVGLYSNIEELYLLNPRALLPCFVVVCKP 430
+EM G +GFDSLAGKVGLY+N+EELYLLNP+ALLPCFVV+CKP
Sbjct: 429 EEMNGLMSGFDSLAGKVGLYTNVEELYLLNPKALLPCFVVICKP 472
>AT4G27240.1 | Symbols: | zinc finger (C2H2 type) family protein |
chr4:13640160-13641640 FORWARD LENGTH=431
Length = 431
Score = 481 bits (1237), Expect = e-136, Method: Compositional matrix adjust.
Identities = 263/443 (59%), Positives = 299/443 (67%), Gaps = 54/443 (12%)
Query: 1 MPTVWFSLKRSLHCKSEPTDVHVPKSRKHLATILTKRAGTGRSG-------CSRSIANLK 53
+P+VWFSLK+SL CKS+ +DVH+P+S+K LA I TKR T G CSRSIANLK
Sbjct: 30 LPSVWFSLKKSLPCKSDVSDVHIPRSKKELAPISTKRTTTSSGGGVGGRSGCSRSIANLK 89
Query: 54 DVIHGSKRHLEKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKITGYGGFQEGGVASDG 113
DVIHG++RHLEKP SPRSIGSSEFLNPITH+VI SNS CELKIT G +
Sbjct: 90 DVIHGNQRHLEKPLCSSPRSIGSSEFLNPITHDVIFSNSTCELKITAAGATE-------- 141
Query: 114 NNNGGETGDSTFVGTLRXXXXXXXXXXXMHYFNPSYKTPATPPRKLSPFLSSDKEXXXXX 173
FVG LR P TP S S
Sbjct: 142 -----------FVGNLR---------------------PGTPVNYSSSRRSQTSRKASSL 169
Query: 174 XXXXXXXXRLSLETDSNGPCN-----VTCHKCGEQFSKWEAAEAHHLSKHAVTELVEGDS 228
+ E D N V+CHKCGE+FSK EAAEAHHL+KHAVTEL+EGDS
Sbjct: 170 DREGLGFHQSRRENDREAAINGDNSSVSCHKCGEKFSKLEAAEAHHLTKHAVTELMEGDS 229
Query: 229 SRKIVEIICRTSWLKSENHCGRIERVLKVHNMQKTLARFEEYREMVKIKASKLQKKHPRC 288
SR+IVEIICRTSWLK+EN GRI+R+LKVHNMQKTLARFEEYR+ VKI+ASKLQKKHPRC
Sbjct: 230 SRRIVEIICRTSWLKTENQGGRIDRILKVHNMQKTLARFEEYRDTVKIRASKLQKKHPRC 289
Query: 289 LADGNELLRFYGTTXXXXXXXXXXXXXXQYERCCVCRIIRNGFSSNKEELKGGIGVFTTS 348
+ADGNELLRF+GTT E+CCVCRIIRNGFS+ K E+ GIGVFT S
Sbjct: 290 IADGNELLRFHGTTVACALGINGSTSLCSSEKCCVCRIIRNGFSA-KREMNNGIGVFTAS 348
Query: 349 TSGRAFESIEILGHDPSLRKALIVCRVIAGRVHRPLENIQEMAG-QTGFDSLAGKVGLYS 407
TS RAFESI I RKALIVCRVIAGRVHRP+EN++EM G +GFDSLAGKVGLY+
Sbjct: 349 TSERAFESIVIGDGGGGDRKALIVCRVIAGRVHRPVENVEEMGGLLSGFDSLAGKVGLYT 408
Query: 408 NIEELYLLNPRALLPCFVVVCKP 430
N+EELYLLN RALLPCFV++CKP
Sbjct: 409 NVEELYLLNSRALLPCFVLICKP 431
>AT1G11490.1 | Symbols: | zinc finger (C2H2 type) family protein |
chr1:3868884-3870065 REVERSE LENGTH=365
Length = 365
Score = 206 bits (524), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 114/244 (46%), Positives = 151/244 (61%), Gaps = 13/244 (5%)
Query: 195 VTCHKCGEQFSKWEAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSW------LKSENHC 248
+ C KC E+ +A EAH+LS H+V L+ GD SR VE+IC T + +K N
Sbjct: 127 LACQKCHERVRDLDAFEAHYLSNHSVVRLLAGDFSRTTVELICNTGYSHKLGKMKGNN-- 184
Query: 249 GRIERVLKVHNMQKTLARFEEYREMVKIKASKLQKKHPRCLADGNELLRFYGTTXXXXXX 308
I + K+ N+Q+ +A FE+YRE+VKI+A+KL KKH RC+ADGNE L F+GTT
Sbjct: 185 --ISAIFKIQNLQRVVADFEDYRELVKIRANKLSKKHSRCMADGNEFLGFHGTTLSCTLG 242
Query: 309 XXXXXXXXQY-ERCCVCRIIRNGFSSNKEELKGGIGVFTTSTSGRAFESIEI-LGHDPSL 366
+ + C VC I+R+GFS K G GV T STS A ESIE G +
Sbjct: 243 FSNSSSNLCFSDHCEVCHILRHGFSP-KTRPDGIKGVLTASTSSTALESIETDQGRNRGS 301
Query: 367 RKALIVCRVIAGRVHRPLENIQEMAGQTGFDSLAGKVGLYSNIEELYLLNPRALLPCFVV 426
A+++CRVIAGRVH+P++ + G + FDSLA KVG S IEELYLL+ +ALLPCFV+
Sbjct: 302 LIAVVLCRVIAGRVHKPMQTFENSLGFSEFDSLALKVGQNSRIEELYLLSTKALLPCFVI 361
Query: 427 VCKP 430
+ KP
Sbjct: 362 IFKP 365
>AT1G75710.1 | Symbols: | C2H2-like zinc finger protein |
chr1:28428806-28431128 FORWARD LENGTH=462
Length = 462
Score = 180 bits (456), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 108/256 (42%), Positives = 144/256 (56%), Gaps = 26/256 (10%)
Query: 197 CHKCGEQFSKWEAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENHCGRIERVLK 256
C +CGE F K E+ E H +HAV+EL DS R IVEII ++SWLK ++ +IER+LK
Sbjct: 206 CSQCGEVFPKLESLELHQAVRHAVSELGPEDSGRNIVEIIFKSSWLKKDSPICQIERILK 265
Query: 257 VHNMQKTLARFEEYREMVKIKASKLQKKHPRCLADGNELLRFYGTTXX-XXXXXXXXXXX 315
VHN Q+T+ RFE+ R+ VK +A + +K RC ADGNELLRF+ TT
Sbjct: 266 VHNTQRTIQRFEDCRDAVKARALQATRKDARCAADGNELLRFHCTTLTCSLGARGSSSLC 325
Query: 316 XQYERCCVCRIIRNGFSSNKEELKGGI---GVFTTSTSGRAFESIEILGHDPSLRKALIV 372
C VC +IR+GF + GV TT++SGRA ++L R+ ++V
Sbjct: 326 SNLPVCGVCTVIRHGFQGKSGGGGANVANAGVRTTASSGRAD---DLLRCSDDARRVMLV 382
Query: 373 CRVIAGRVHR---PLENIQEMAGQTG----------------FDSLAGKVGLYSNIEELY 413
CRVIAGRV R P + A + FDS+A G+YSN+EEL
Sbjct: 383 CRVIAGRVKRVDLPAADASATAEKKSTVEDNSVVGVSSSGGTFDSVAVNAGVYSNLEELV 442
Query: 414 LLNPRALLPCFVVVCK 429
+ NPRA+LPCFVV+ K
Sbjct: 443 VYNPRAILPCFVVIYK 458
>AT2G29660.1 | Symbols: | zinc finger (C2H2 type) family protein |
chr2:12679346-12680467 FORWARD LENGTH=373
Length = 373
Score = 145 bits (367), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 94/249 (37%), Positives = 138/249 (55%), Gaps = 24/249 (9%)
Query: 197 CHKCGEQFSKWEAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENH-CGRIERVL 255
C+ CGE F K E H KHAV+EL+ G+SS IV+II ++ W + N+ I R+L
Sbjct: 128 CNSCGEIFPKINLLENHIAIKHAVSELIAGESSTNIVKIIFKSGWPEQGNYKSPVINRIL 187
Query: 256 KVHNMQKTLARFEEYREMVKIKASK-----LQKKHPRCLADGNELLRFYGTTXXXXXXXX 310
K+HN K L RFEEYRE VK KA++ + RC+ADGNELLRFY +T
Sbjct: 188 KIHNSSKILTRFEEYREFVKAKAARSNGGGRRWDDERCVADGNELLRFYCSTFMCDLGQN 247
Query: 311 XXXXXXQYERCCVCRIIRNGFSSNKEELKGGIGVFTTSTSGRAFESIEILGHDP----SL 366
++ C +C II +GFS + G+ T +T R ++ + ++
Sbjct: 248 GKSNLCGHQYCSICGIIGSGFSPKLD------GIATLATGWRGHVAVPEEVEEEFGFMNV 301
Query: 367 RKALIVCRVIAGRV--HRPLENIQEMAGQTGFDSLAGKVG------LYSNIEELYLLNPR 418
++A++VCRV+AGRV ++ + + G+DSL G+ G L + +EL + NPR
Sbjct: 302 KRAMLVCRVVAGRVGCDLIDDDDVDKSDGGGYDSLVGQSGNKSGALLRIDDDELLVFNPR 361
Query: 419 ALLPCFVVV 427
A+LPCFV+V
Sbjct: 362 AVLPCFVIV 370
>AT4G22560.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G12450.1); Has 380 Blast hits to 380 proteins
in 21 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 6; Plants - 374; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr4:11880178-11880972 FORWARD
LENGTH=264
Length = 264
Score = 114 bits (284), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 80/218 (36%), Positives = 114/218 (52%), Gaps = 51/218 (23%)
Query: 216 SKHAVTELVEGDSSRKIVEIICRTSWLKSENHCGRIERVLKVHNMQKTLARFEEYREMVK 275
+ A+TEL +G SR +VEII +SW S+ GRIE + KV + +T+ RFEEYRE+VK
Sbjct: 89 TSDALTELPDGHPSRNVVEIIFHSSW-SSDEFPGRIEMIFKVEHGSRTVTRFEEYREVVK 147
Query: 276 IKA----SKLQKKHPRCLADGNELLRFYGTTXXXXXXXXXXXXXXQYERCCVCRIIRNGF 331
+A +++ RCLADGNE++RFY + +GF
Sbjct: 148 SRAGFNGGTCEEEDARCLADGNEMMRFY--------------------------PVLDGF 181
Query: 332 SSNKEELKGGIG--VFTTSTSGRAFESIEILGHDPSLRKALIVCRVIAGRVHRPLENIQE 389
+ GG G V T S SG A+ S G RKA+++CRVIAGRV +
Sbjct: 182 NGGACVFAGGKGQAVCTFSGSGEAYVSSGGGGG----RKAMMICRVIAGRVDDVI----- 232
Query: 390 MAGQTGFDSLAGKVGLYSNIEELYLLNPRALLPCFVVV 427
G DS+AG+ G EL++ + RA+LPCF+++
Sbjct: 233 ---GFGSDSVAGRDG------ELFVFDTRAVLPCFLII 261
>AT1G62520.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G12450.1); Has 388 Blast hits to 388 proteins
in 26 species: Archae - 0; Bacteria - 1; Metazoa - 0;
Fungi - 8; Plants - 376; Viruses - 0; Other Eukaryotes -
3 (source: NCBI BLink). | chr1:23144506-23145348 FORWARD
LENGTH=280
Length = 280
Score = 107 bits (266), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 74/211 (35%), Positives = 105/211 (49%), Gaps = 32/211 (15%)
Query: 220 VTELVEGDSSRKIVEIICRTSWLKSENHCGRIERVLKVHNMQKTLARFEEYREMVKIKA- 278
+TEL EG SR +VEII +TSW + GR+E + KV N KTL RFEEYRE VK ++
Sbjct: 100 LTELSEGHQSRNVVEIIFQTSW-GPKPFSGRVEMIFKVQNGSKTLTRFEEYREAVKARSV 158
Query: 279 SKLQKKHPRCLADGNELLRFYGTTXXXXXXXXXXXXXXQYERCCVCRIIRNGFSSNKEEL 338
K ++++ R +ADGNE +RFY G S+ E+
Sbjct: 159 GKAREENARSVADGNETMRFYCLGPSYGGGGSAWGILGGKGGGASIYTF-AGSSTANEKA 217
Query: 339 KGGIGVFTTSTSGRAFESIEILGHDPSLRKALIVCRVIAGRVHRPLENIQEMAGQTGFDS 398
GG G RKA++VCRVIAGRV + E + ++ FDS
Sbjct: 218 GGGKG-----------------------RKAMLVCRVIAGRVTKQNELKYDSDLRSRFDS 254
Query: 399 LAGKVGLYSNIEELYLLNPRALLPCFVVVCK 429
++G G EL + + RA+LPCF+++ +
Sbjct: 255 VSGDDG------ELLVFDTRAVLPCFLIIYR 279
>AT4G12450.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G22560.1); Has 380 Blast hits to 380 proteins
in 23 species: Archae - 0; Bacteria - 0; Metazoa - 1;
Fungi - 4; Plants - 374; Viruses - 0; Other Eukaryotes -
1 (source: NCBI BLink). | chr4:7385841-7386674 REVERSE
LENGTH=277
Length = 277
Score = 102 bits (255), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 76/217 (35%), Positives = 107/217 (49%), Gaps = 50/217 (23%)
Query: 220 VTELVEGDSSRKIVEIICRTSWLKSENHCGRIERVLKVHNMQKTLARFEEYREMVKIKA- 278
+T+L +G SR +VEII ++SW S+ GR+E + KV N K + RFEEYRE VK ++
Sbjct: 97 LTDLPDGHPSRNVVEIIFQSSW-SSDEFPGRVEMIFKVENGSKAVTRFEEYREAVKSRSC 155
Query: 279 SKLQ---------KKHPRCLADGNELLRFYGTTXXXXXXXXXXXXXXQYERCCVCRIIRN 329
SK+ ++ RC ADGNE++RF+
Sbjct: 156 SKVDSDRVDGSACDENARCSADGNEMMRFFPLGPIPGGINGGAW---------------- 199
Query: 330 GFSSNKEELKGGIGVFTTSTSGRAFESIEILGHDPSLRKALIVCRVIAGRVHRPLENIQE 389
GF K G V T S SG A S G R+A+++CRVIAGRV +
Sbjct: 200 GFPGGK-----GAAVCTFSGSGEAHASTGGGGG----RRAMLICRVIAGRVAK------- 243
Query: 390 MAGQTGFDSLAGKVGLYSNIEELYLLNPRALLPCFVV 426
G+ G DS+AG+ G EL + + RA+LPCF++
Sbjct: 244 -KGEFGSDSVAGRAG------ELIVFDARAVLPCFLI 273