Miyakogusa Predicted Gene

Lj0g3v0327029.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0327029.1 Non Characterized Hit- tr|A3BER1|A3BER1_ORYSJ
Putative uncharacterized protein OS=Oryza sativa
subsp,43.75,0.0000000001,ADP-ribosylation,NULL; seg,NULL; SUBFAMILY
NOT NAMED,NULL; FAMILY NOT NAMED,NULL; ZINC_FINGER_C2H2_1,CUFF.22245.1
         (426 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G54630.1 | Symbols:  | zinc finger protein-related | chr5:221...   508   e-144
AT4G27240.1 | Symbols:  | zinc finger (C2H2 type) family protein...   465   e-131
AT1G11490.1 | Symbols:  | zinc finger (C2H2 type) family protein...   199   2e-51
AT1G75710.1 | Symbols:  | C2H2-like zinc finger protein | chr1:2...   170   2e-42
AT2G29660.1 | Symbols:  | zinc finger (C2H2 type) family protein...   145   4e-35
AT4G22560.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   107   2e-23
AT1G62520.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   106   4e-23
AT4G12450.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    95   1e-19

>AT5G54630.1 | Symbols:  | zinc finger protein-related |
           chr5:22192607-22194260 REVERSE LENGTH=472
          Length = 472

 Score =  508 bits (1309), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 280/464 (60%), Positives = 323/464 (69%), Gaps = 49/464 (10%)

Query: 1   MPTVWFNLKRSLHCKSEPSEVHDP----KSRKQLSTILTKK-------------PGRSGC 43
           +PTVWF+LK+SLHCKSEPS+VHDP    K ++ LSTI TKK              G SGC
Sbjct: 20  IPTVWFSLKKSLHCKSEPSDVHDPISTTKQQQHLSTISTKKISGISSGGAAVCGGGLSGC 79

Query: 44  SRSIANLKDVIHGSKRHLEDKPPTCSPRSIGSSEFLNPITHEVILSNSRCELKITGYGGF 103
           SRSIANLKDVIHGSKRH E KPP  SPRSIGS+EFLNPITHEVILSNS CELKITG G  
Sbjct: 80  SRSIANLKDVIHGSKRHFE-KPPISSPRSIGSNEFLNPITHEVILSNSTCELKITGVGDM 138

Query: 104 QEXXXXXXXXXXXXXXXXXST-FVGTLRXXXXXXXXXXXMHYFN--PSFRTSSTPPRKSP 160
                              ST +VG LR           MHY N   S+R+ +   RK  
Sbjct: 139 ASPVGAADSGGGGGGGNGRSTTYVGMLRPGTP-------MHYLNHSASYRSQT---RKGS 188

Query: 161 FSSSDKE--------GSGLHSSNR----FHPETTTDSNGSSSVTCHKCGEQFNKWEAAEA 208
           F+ S+++        G G H++ R     + E+T +   +SSV+CHKCGEQFNK EAAEA
Sbjct: 189 FALSERDRGGGGGGEGLGFHTNRRVSLEMNRESTINGGNNSSVSCHKCGEQFNKLEAAEA 248

Query: 209 HHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENNCGRIERVLKVHNMQRTLARFEEYRE 268
           HHLSKHAVTELVEGDSSRKIVEIICRTSWLKSEN CGRI+RVLKVHNMQ+TLARFEEYRE
Sbjct: 249 HHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQCGRIDRVLKVHNMQKTLARFEEYRE 308

Query: 269 MVKTKASKLQKKHPRCLADGNELLRFYGTTLAXXXXXXXXXXXXXXDKCCVCRIIRNGFS 328
            VK +ASKLQKKHPRCLADGNELLRF+GTT+A              +KCCVCRIIRNGFS
Sbjct: 309 TVKIRASKLQKKHPRCLADGNELLRFHGTTVACGLGINGSTSVCTAEKCCVCRIIRNGFS 368

Query: 329 AKKELKXXXXXXXXXXXXRAFETI-----ESFGNEPPSLRKALIVCRVIAGRVHRPLENI 383
           +K+E              RAFE+I     +  G+   ++RK LIVCRVIAGRVHRP+EN+
Sbjct: 369 SKREKNNGVGVFTASTSGRAFESILVNGGDESGDVDRTVRKVLIVCRVIAGRVHRPVENV 428

Query: 384 QEIAS-QTGFDSLAGKVGLYSNIEELYLLNPRALLPCFVVICKP 426
           +E+    +GFDSLAGKVGLY+N+EELYLLNP+ALLPCFVVICKP
Sbjct: 429 EEMNGLMSGFDSLAGKVGLYTNVEELYLLNPKALLPCFVVICKP 472


>AT4G27240.1 | Symbols:  | zinc finger (C2H2 type) family protein |
           chr4:13640160-13641640 FORWARD LENGTH=431
          Length = 431

 Score =  465 bits (1196), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 251/436 (57%), Positives = 301/436 (69%), Gaps = 44/436 (10%)

Query: 1   MPTVWFNLKRSLHCKSEPSEVHDPKSRKQLSTILTKKPGRSG---------CSRSIANLK 51
           +P+VWF+LK+SL CKS+ S+VH P+S+K+L+ I TK+   S          CSRSIANLK
Sbjct: 30  LPSVWFSLKKSLPCKSDVSDVHIPRSKKELAPISTKRTTTSSGGGVGGRSGCSRSIANLK 89

Query: 52  DVIHGSKRHLEDKPPTCSPRSIGSSEFLNPITHEVILSNSRCELKITGYGGFQEXXXXXX 111
           DVIHG++RHLE KP   SPRSIGSSEFLNPITH+VI SNS CELKIT  G  +       
Sbjct: 90  DVIHGNQRHLE-KPLCSSPRSIGSSEFLNPITHDVIFSNSTCELKITAAGATE------- 141

Query: 112 XXXXXXXXXXXSTFVGTLRXXXXXXXXXXXMHYFNPSFRTSSTPPRKSPFSSSDKEGSGL 171
                        FVG LR                 ++ +S         SS D+EG G 
Sbjct: 142 -------------FVGNLRPGTPV------------NYSSSRRSQTSRKASSLDREGLGF 176

Query: 172 HSSNRFHPETTTDSNGSSSVTCHKCGEQFNKWEAAEAHHLSKHAVTELVEGDSSRKIVEI 231
           H S R +      +  +SSV+CHKCGE+F+K EAAEAHHL+KHAVTEL+EGDSSR+IVEI
Sbjct: 177 HQSRRENDREAAINGDNSSVSCHKCGEKFSKLEAAEAHHLTKHAVTELMEGDSSRRIVEI 236

Query: 232 ICRTSWLKSENNCGRIERVLKVHNMQRTLARFEEYREMVKTKASKLQKKHPRCLADGNEL 291
           ICRTSWLK+EN  GRI+R+LKVHNMQ+TLARFEEYR+ VK +ASKLQKKHPRC+ADGNEL
Sbjct: 237 ICRTSWLKTENQGGRIDRILKVHNMQKTLARFEEYRDTVKIRASKLQKKHPRCIADGNEL 296

Query: 292 LRFYGTTLAXXXXXXXXXXXXXXDKCCVCRIIRNGFSAKKELKXXXXXXXXXXXXRAFET 351
           LRF+GTT+A              +KCCVCRIIRNGFSAK+E+             RAFE+
Sbjct: 297 LRFHGTTVACALGINGSTSLCSSEKCCVCRIIRNGFSAKREMNNGIGVFTASTSERAFES 356

Query: 352 IESFGNEPPSLRKALIVCRVIAGRVHRPLENIQEIAS-QTGFDSLAGKVGLYSNIEELYL 410
           I   G+     RKALIVCRVIAGRVHRP+EN++E+    +GFDSLAGKVGLY+N+EELYL
Sbjct: 357 I-VIGDGGGGDRKALIVCRVIAGRVHRPVENVEEMGGLLSGFDSLAGKVGLYTNVEELYL 415

Query: 411 LNPRALLPCFVVICKP 426
           LN RALLPCFV+ICKP
Sbjct: 416 LNSRALLPCFVLICKP 431


>AT1G11490.1 | Symbols:  | zinc finger (C2H2 type) family protein |
           chr1:3868884-3870065 REVERSE LENGTH=365
          Length = 365

 Score =  199 bits (507), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 109/251 (43%), Positives = 145/251 (57%), Gaps = 11/251 (4%)

Query: 183 TDSNGSSSVTCHKCGEQFNKWEAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSW----- 237
           +D  G   + C KC E+    +A EAH+LS H+V  L+ GD SR  VE+IC T +     
Sbjct: 119 SDICGFGVLACQKCHERVRDLDAFEAHYLSNHSVVRLLAGDFSRTTVELICNTGYSHKLG 178

Query: 238 -LKSENNCGRIERVLKVHNMQRTLARFEEYREMVKTKASKLQKKHPRCLADGNELLRFYG 296
            +K  N    I  + K+ N+QR +A FE+YRE+VK +A+KL KKH RC+ADGNE L F+G
Sbjct: 179 KMKGNN----ISAIFKIQNLQRVVADFEDYRELVKIRANKLSKKHSRCMADGNEFLGFHG 234

Query: 297 TTLAXXXXXXXXXXXX-XXDKCCVCRIIRNGFSAKKELKXXXXXXXXXXXXRAFETIESF 355
           TTL+               D C VC I+R+GFS K                 A E+IE+ 
Sbjct: 235 TTLSCTLGFSNSSSNLCFSDHCEVCHILRHGFSPKTRPDGIKGVLTASTSSTALESIETD 294

Query: 356 GNEPPSLRKALIVCRVIAGRVHRPLENIQEIASQTGFDSLAGKVGLYSNIEELYLLNPRA 415
                    A+++CRVIAGRVH+P++  +     + FDSLA KVG  S IEELYLL+ +A
Sbjct: 295 QGRNRGSLIAVVLCRVIAGRVHKPMQTFENSLGFSEFDSLALKVGQNSRIEELYLLSTKA 354

Query: 416 LLPCFVVICKP 426
           LLPCFV+I KP
Sbjct: 355 LLPCFVIIFKP 365


>AT1G75710.1 | Symbols:  | C2H2-like zinc finger protein |
           chr1:28428806-28431128 FORWARD LENGTH=462
          Length = 462

 Score =  170 bits (430), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 103/253 (40%), Positives = 134/253 (52%), Gaps = 20/253 (7%)

Query: 193 CHKCGEQFNKWEAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENNCGRIERVLK 252
           C +CGE F K E+ E H   +HAV+EL   DS R IVEII ++SWLK ++   +IER+LK
Sbjct: 206 CSQCGEVFPKLESLELHQAVRHAVSELGPEDSGRNIVEIIFKSSWLKKDSPICQIERILK 265

Query: 253 VHNMQRTLARFEEYREMVKTKASKLQKKHPRCLADGNELLRFYGTTLAXXXXXXXXXXXX 312
           VHN QRT+ RFE+ R+ VK +A +  +K  RC ADGNELLRF+ TTL             
Sbjct: 266 VHNTQRTIQRFEDCRDAVKARALQATRKDARCAADGNELLRFHCTTLTCSLGARGSSSLC 325

Query: 313 XXDKCC-VCRIIRNGFSAKKELKXXXXXXXXXXXXRAFETIESFGNEPPSLRKALIVCRV 371
                C VC +IR+GF  K                 +    +         R+ ++VCRV
Sbjct: 326 SNLPVCGVCTVIRHGFQGKSGGGGANVANAGVRTTASSGRADDLLRCSDDARRVMLVCRV 385

Query: 372 IAGRVHR---PLENIQEIASQTG----------------FDSLAGKVGLYSNIEELYLLN 412
           IAGRV R   P  +    A +                  FDS+A   G+YSN+EEL + N
Sbjct: 386 IAGRVKRVDLPAADASATAEKKSTVEDNSVVGVSSSGGTFDSVAVNAGVYSNLEELVVYN 445

Query: 413 PRALLPCFVVICK 425
           PRA+LPCFVVI K
Sbjct: 446 PRAILPCFVVIYK 458


>AT2G29660.1 | Symbols:  | zinc finger (C2H2 type) family protein |
           chr2:12679346-12680467 FORWARD LENGTH=373
          Length = 373

 Score =  145 bits (367), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 97/263 (36%), Positives = 138/263 (52%), Gaps = 17/263 (6%)

Query: 176 RFHPETTTDSNGSSSVT-CHKCGEQFNKWEAAEAHHLSKHAVTELVEGDSSRKIVEIICR 234
           R H +T  + + S  +  C+ CGE F K    E H   KHAV+EL+ G+SS  IV+II +
Sbjct: 110 RIHQQTEFEISSSDEIFPCNSCGEIFPKINLLENHIAIKHAVSELIAGESSTNIVKIIFK 169

Query: 235 TSWLKSEN-NCGRIERVLKVHNMQRTLARFEEYREMVKTKASKLQK-----KHPRCLADG 288
           + W +  N     I R+LK+HN  + L RFEEYRE VK KA++           RC+ADG
Sbjct: 170 SGWPEQGNYKSPVINRILKIHNSSKILTRFEEYREFVKAKAARSNGGGRRWDDERCVADG 229

Query: 289 NELLRFYGTTLAXXXXXXXXXXXXXXDKCCVCRIIRNGFSAKKELKXXXXXXXXXXXXRA 348
           NELLRFY +T                  C +C II +GFS K +                
Sbjct: 230 NELLRFYCSTFMCDLGQNGKSNLCGHQYCSICGIIGSGFSPKLDGIATLATGWRGHVAVP 289

Query: 349 FETIESFGNEPPSLRKALIVCRVIAGRV--HRPLENIQEIASQTGFDSLAGKVG------ 400
            E  E FG    ++++A++VCRV+AGRV      ++  + +   G+DSL G+ G      
Sbjct: 290 EEVEEEFGFM--NVKRAMLVCRVVAGRVGCDLIDDDDVDKSDGGGYDSLVGQSGNKSGAL 347

Query: 401 LYSNIEELYLLNPRALLPCFVVI 423
           L  + +EL + NPRA+LPCFV++
Sbjct: 348 LRIDDDELLVFNPRAVLPCFVIV 370


>AT4G22560.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G12450.1); Has 380 Blast hits to 380 proteins
           in 21 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 6; Plants - 374; Viruses - 0; Other Eukaryotes -
           0 (source: NCBI BLink). | chr4:11880178-11880972 FORWARD
           LENGTH=264
          Length = 264

 Score =  107 bits (267), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 74/216 (34%), Positives = 108/216 (50%), Gaps = 47/216 (21%)

Query: 212 SKHAVTELVEGDSSRKIVEIICRTSWLKSENNCGRIERVLKVHNMQRTLARFEEYREMVK 271
           +  A+TEL +G  SR +VEII  +SW  S+   GRIE + KV +  RT+ RFEEYRE+VK
Sbjct: 89  TSDALTELPDGHPSRNVVEIIFHSSW-SSDEFPGRIEMIFKVEHGSRTVTRFEEYREVVK 147

Query: 272 TKA----SKLQKKHPRCLADGNELLRFYGTTLAXXXXXXXXXXXXXXDKCCVCRIIRNGF 327
           ++A       +++  RCLADGNE++RFY                           + +GF
Sbjct: 148 SRAGFNGGTCEEEDARCLADGNEMMRFY--------------------------PVLDGF 181

Query: 328 SAKKELKXXXXXXXXXXXXRAFETIESFGNEPPSLRKALIVCRVIAGRVHRPLENIQEIA 387
           +    +              + E   S G      RKA+++CRVIAGRV   +       
Sbjct: 182 NGGACVFAGGKGQAVCTFSGSGEAYVSSGG--GGGRKAMMICRVIAGRVDDVI------- 232

Query: 388 SQTGFDSLAGKVGLYSNIEELYLLNPRALLPCFVVI 423
              G DS+AG+ G      EL++ + RA+LPCF++I
Sbjct: 233 -GFGSDSVAGRDG------ELFVFDTRAVLPCFLII 261


>AT1G62520.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G12450.1); Has 388 Blast hits to 388 proteins
           in 26 species: Archae - 0; Bacteria - 1; Metazoa - 0;
           Fungi - 8; Plants - 376; Viruses - 0; Other Eukaryotes -
           3 (source: NCBI BLink). | chr1:23144506-23145348 FORWARD
           LENGTH=280
          Length = 280

 Score =  106 bits (264), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 75/211 (35%), Positives = 106/211 (50%), Gaps = 32/211 (15%)

Query: 216 VTELVEGDSSRKIVEIICRTSWLKSENNCGRIERVLKVHNMQRTLARFEEYREMVKTKA- 274
           +TEL EG  SR +VEII +TSW   +   GR+E + KV N  +TL RFEEYRE VK ++ 
Sbjct: 100 LTELSEGHQSRNVVEIIFQTSW-GPKPFSGRVEMIFKVQNGSKTLTRFEEYREAVKARSV 158

Query: 275 SKLQKKHPRCLADGNELLRFYGTTLAXXXXXXXXXXXXXXDKCCVCRIIRNGFSAKKELK 334
            K ++++ R +ADGNE +RFY                      C+      G SA   L 
Sbjct: 159 GKAREENARSVADGNETMRFY----------------------CLGPSYGGGGSAWGILG 196

Query: 335 XXXXXXXXXXXXRAFETIESFGNEPPSLRKALIVCRVIAGRVHRPLENIQEIASQTGFDS 394
                        +    E  G      RKA++VCRVIAGRV +  E   +   ++ FDS
Sbjct: 197 GKGGGASIYTFAGSSTANEKAGGGKG--RKAMLVCRVIAGRVTKQNELKYDSDLRSRFDS 254

Query: 395 LAGKVGLYSNIEELYLLNPRALLPCFVVICK 425
           ++G  G      EL + + RA+LPCF++I +
Sbjct: 255 VSGDDG------ELLVFDTRAVLPCFLIIYR 279


>AT4G12450.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G22560.1); Has 380 Blast hits to 380 proteins
           in 23 species: Archae - 0; Bacteria - 0; Metazoa - 1;
           Fungi - 4; Plants - 374; Viruses - 0; Other Eukaryotes -
           1 (source: NCBI BLink). | chr4:7385841-7386674 REVERSE
           LENGTH=277
          Length = 277

 Score = 94.7 bits (234), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 69/218 (31%), Positives = 103/218 (47%), Gaps = 50/218 (22%)

Query: 216 VTELVEGDSSRKIVEIICRTSWLKSENNCGRIERVLKVHNMQRTLARFEEYREMVKTKA- 274
           +T+L +G  SR +VEII ++SW  S+   GR+E + KV N  + + RFEEYRE VK+++ 
Sbjct: 97  LTDLPDGHPSRNVVEIIFQSSW-SSDEFPGRVEMIFKVENGSKAVTRFEEYREAVKSRSC 155

Query: 275 SKLQ---------KKHPRCLADGNELLRFYGTTLAXXXXXXXXXXXXXXDKCCVCRIIRN 325
           SK+           ++ RC ADGNE++RF+                       VC     
Sbjct: 156 SKVDSDRVDGSACDENARCSADGNEMMRFFPLGPIPGGINGGAWGFPGGKGAAVCT---- 211

Query: 326 GFSAKKELKXXXXXXXXXXXXRAFETIESFGNEPPSLRKALIVCRVIAGRVHRPLENIQE 385
            FS   E               A  +    G      R+A+++CRVIAGRV +       
Sbjct: 212 -FSGSGE---------------AHASTGGGGG-----RRAMLICRVIAGRVAK------- 243

Query: 386 IASQTGFDSLAGKVGLYSNIEELYLLNPRALLPCFVVI 423
              + G DS+AG+ G      EL + + RA+LPCF++ 
Sbjct: 244 -KGEFGSDSVAGRAG------ELIVFDARAVLPCFLIF 274