Miyakogusa Predicted Gene

Lj6g3v2145450.1
BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj6g3v2145450.1 Non Characterized Hit- tr|I1ME64|I1ME64_SOYBN
Uncharacterized protein OS=Glycine max PE=4
SV=1,62.81,0,ADP-ribosylation,NULL; ZINC_FINGER_C2H2_1,Zinc finger,
C2H2; no description,NULL; SUBFAMILY NOT NAME,CUFF.60702.1
         (375 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT4G27240.1 | Symbols:  | zinc finger (C2H2 type) family protein...   268   4e-72
AT5G54630.1 | Symbols:  | zinc finger protein-related | chr5:221...   265   3e-71
AT1G11490.1 | Symbols:  | zinc finger (C2H2 type) family protein...   215   5e-56
AT1G75710.1 | Symbols:  | C2H2-like zinc finger protein | chr1:2...   162   3e-40
AT2G29660.1 | Symbols:  | zinc finger (C2H2 type) family protein...   120   1e-27
AT1G62520.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    86   5e-17
AT4G22560.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    82   9e-16
AT4G12450.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    69   5e-12

>AT4G27240.1 | Symbols:  | zinc finger (C2H2 type) family protein |
           chr4:13640160-13641640 FORWARD LENGTH=431
          Length = 431

 Score =  268 bits (686), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 143/270 (52%), Positives = 185/270 (68%), Gaps = 15/270 (5%)

Query: 97  DSEG-SSHRTRRTAPTTTSMLDQTDSSDVVSMLICQKCGEKLKNLDAVEPHHISNHSVTE 155
           D EG   H++RR      ++    D+S V     C KCGEK   L+A E HH++ H+VTE
Sbjct: 170 DREGLGFHQSRRENDREAAI--NGDNSSVS----CHKCGEKFSKLEAAEAHHLTKHAVTE 223

Query: 156 LQE-DSSRQIIETICGTSSVNSENILGQIDSILKVQNMPKTLACFEEYREKVKIKAEKLQ 214
           L E DSSR+I+E IC TS + +EN  G+ID ILKV NM KTLA FEEYR+ VKI+A KLQ
Sbjct: 224 LMEGDSSRRIVEIICRTSWLKTENQGGRIDRILKVHNMQKTLARFEEYRDTVKIRASKLQ 283

Query: 215 KNHPRCLVDGNELLRIHGTNIACSLGTNSSYSLCTLDHCGACQILRHGFSTNKEFQCALG 274
           K HPRC+ DGNELLR HGT +AC+LG N S SLC+ + C  C+I+R+GFS  +E    +G
Sbjct: 284 KKHPRCIADGNELLRFHGTTVACALGINGSTSLCSSEKCCVCRIIRNGFSAKREMNNGIG 343

Query: 275 VYTTSTSAKAFDSIVLSNERQFERKTVIVCRVIAGIVYSPHTQEKVDH-----SEFDSLA 329
           V+T STS +AF+SIV+ +    +RK +IVCRVIAG V+ P   E V+      S FDSLA
Sbjct: 344 VFTASTSERAFESIVIGDGGGGDRKALIVCRVIAGRVHRP--VENVEEMGGLLSGFDSLA 401

Query: 330 EKISNHSDFEELYVLSPRALLPCFVVIYKP 359
            K+  +++ EELY+L+ RALLPCFV+I KP
Sbjct: 402 GKVGLYTNVEELYLLNSRALLPCFVLICKP 431


>AT5G54630.1 | Symbols:  | zinc finger protein-related |
           chr5:22192607-22194260 REVERSE LENGTH=472
          Length = 472

 Score =  265 bits (678), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 136/242 (56%), Positives = 172/242 (71%), Gaps = 14/242 (5%)

Query: 130 CQKCGEKLKNLDAVEPHHISNHSVTELQE-DSSRQIIETICGTSSVNSENILGQIDSILK 188
           C KCGE+   L+A E HH+S H+VTEL E DSSR+I+E IC TS + SEN  G+ID +LK
Sbjct: 233 CHKCGEQFNKLEAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQCGRIDRVLK 292

Query: 189 VQNMPKTLACFEEYREKVKIKAEKLQKNHPRCLVDGNELLRIHGTNIACSLGTNSSYSLC 248
           V NM KTLA FEEYRE VKI+A KLQK HPRCL DGNELLR HGT +AC LG N S S+C
Sbjct: 293 VHNMQKTLARFEEYRETVKIRASKLQKKHPRCLADGNELLRFHGTTVACGLGINGSTSVC 352

Query: 249 TLDHCGACQILRHGFSTNKEFQCALGVYTTSTSAKAFDSIVLSNERQFE------RKTVI 302
           T + C  C+I+R+GFS+ +E    +GV+T STS +AF+SI+++   +        RK +I
Sbjct: 353 TAEKCCVCRIIRNGFSSKREKNNGVGVFTASTSGRAFESILVNGGDESGDVDRTVRKVLI 412

Query: 303 VCRVIAGIVYSPHTQEKVDH-----SEFDSLAEKISNHSDFEELYVLSPRALLPCFVVIY 357
           VCRVIAG V+ P   E V+      S FDSLA K+  +++ EELY+L+P+ALLPCFVVI 
Sbjct: 413 VCRVIAGRVHRP--VENVEEMNGLMSGFDSLAGKVGLYTNVEELYLLNPKALLPCFVVIC 470

Query: 358 KP 359
           KP
Sbjct: 471 KP 472


>AT1G11490.1 | Symbols:  | zinc finger (C2H2 type) family protein |
           chr1:3868884-3870065 REVERSE LENGTH=365
          Length = 365

 Score =  215 bits (547), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 116/242 (47%), Positives = 153/242 (63%), Gaps = 7/242 (2%)

Query: 125 VSMLICQKCGEKLKNLDAVEPHHISNHSVTEL-QEDSSRQIIETICGTSSVNSENIL--G 181
             +L CQKC E++++LDA E H++SNHSV  L   D SR  +E IC T   +    +   
Sbjct: 124 FGVLACQKCHERVRDLDAFEAHYLSNHSVVRLLAGDFSRTTVELICNTGYSHKLGKMKGN 183

Query: 182 QIDSILKVQNMPKTLACFEEYREKVKIKAEKLQKNHPRCLVDGNELLRIHGTNIACSLG- 240
            I +I K+QN+ + +A FE+YRE VKI+A KL K H RC+ DGNE L  HGT ++C+LG 
Sbjct: 184 NISAIFKIQNLQRVVADFEDYRELVKIRANKLSKKHSRCMADGNEFLGFHGTTLSCTLGF 243

Query: 241 TNSSYSLCTLDHCGACQILRHGFSTNKEFQCALGVYTTSTSAKAFDSIVLSNER-QFERK 299
           +NSS +LC  DHC  C ILRHGFS         GV T STS+ A +SI     R +    
Sbjct: 244 SNSSSNLCFSDHCEVCHILRHGFSPKTRPDGIKGVLTASTSSTALESIETDQGRNRGSLI 303

Query: 300 TVIVCRVIAGIVYSPHT--QEKVDHSEFDSLAEKISNHSDFEELYVLSPRALLPCFVVIY 357
            V++CRVIAG V+ P    +  +  SEFDSLA K+  +S  EELY+LS +ALLPCFV+I+
Sbjct: 304 AVVLCRVIAGRVHKPMQTFENSLGFSEFDSLALKVGQNSRIEELYLLSTKALLPCFVIIF 363

Query: 358 KP 359
           KP
Sbjct: 364 KP 365


>AT1G75710.1 | Symbols:  | C2H2-like zinc finger protein |
           chr1:28428806-28431128 FORWARD LENGTH=462
          Length = 462

 Score =  162 bits (410), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 103/256 (40%), Positives = 142/256 (55%), Gaps = 30/256 (11%)

Query: 130 CQKCGEKLKNLDAVEPHHISNHSVTEL-QEDSSRQIIETICGTSSVNSENILGQIDSILK 188
           C +CGE    L+++E H    H+V+EL  EDS R I+E I  +S +  ++ + QI+ ILK
Sbjct: 206 CSQCGEVFPKLESLELHQAVRHAVSELGPEDSGRNIVEIIFKSSWLKKDSPICQIERILK 265

Query: 189 VQNMPKTLACFEEYREKVKIKAEKLQKNHPRCLVDGNELLRIHGTNIACSLGTNSSYSLC 248
           V N  +T+  FE+ R+ VK +A +  +   RC  DGNELLR H T + CSLG   S SLC
Sbjct: 266 VHNTQRTIQRFEDCRDAVKARALQATRKDARCAADGNELLRFHCTTLTCSLGARGSSSLC 325

Query: 249 T-LDHCGACQILRHGFSTNKEFQCA----LGVYTTSTSAKAFDSIVLSNERQFERKTVIV 303
           + L  CG C ++RHGF        A     GV TT++S +A D +  S++    R+ ++V
Sbjct: 326 SNLPVCGVCTVIRHGFQGKSGGGGANVANAGVRTTASSGRADDLLRCSDD---ARRVMLV 382

Query: 304 CRVIAGIVY--------SPHTQEKVDHSE-------------FDSLAEKISNHSDFEELY 342
           CRVIAG V         +  T EK    E             FDS+A     +S+ EEL 
Sbjct: 383 CRVIAGRVKRVDLPAADASATAEKKSTVEDNSVVGVSSSGGTFDSVAVNAGVYSNLEELV 442

Query: 343 VLSPRALLPCFVVIYK 358
           V +PRA+LPCFVVIYK
Sbjct: 443 VYNPRAILPCFVVIYK 458


>AT2G29660.1 | Symbols:  | zinc finger (C2H2 type) family protein |
           chr2:12679346-12680467 FORWARD LENGTH=373
          Length = 373

 Score =  120 bits (302), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 88/261 (33%), Positives = 130/261 (49%), Gaps = 28/261 (10%)

Query: 118 QTDSSDVVSMLICQKCGEKLKNLDAVEPHHISNHSVTEL-QEDSSRQIIETICGTSSVNS 176
           +  SSD   +  C  CGE    ++ +E H    H+V+EL   +SS  I++ I  +     
Sbjct: 118 EISSSD--EIFPCNSCGEIFPKINLLENHIAIKHAVSELIAGESSTNIVKIIFKSGWPEQ 175

Query: 177 ENILGQ-IDSILKVQNMPKTLACFEEYREKVKIKAEK-----LQKNHPRCLVDGNELLRI 230
            N     I+ ILK+ N  K L  FEEYRE VK KA +      + +  RC+ DGNELLR 
Sbjct: 176 GNYKSPVINRILKIHNSSKILTRFEEYREFVKAKAARSNGGGRRWDDERCVADGNELLRF 235

Query: 231 HGTNIACSLGTNSSYSLCTLDHCGACQILRHGFSTNKEFQCALGVYTTSTSAKAFDSIVL 290
           + +   C LG N   +LC   +C  C I+  GFS   +     G+ T +T  +   ++  
Sbjct: 236 YCSTFMCDLGQNGKSNLCGHQYCSICGIIGSGFSPKLD-----GIATLATGWRGHVAVPE 290

Query: 291 SNERQFE----RKTVIVCRVIAGIV----YSPHTQEKVDHSEFDSLAEKISNHS------ 336
             E +F     ++ ++VCRV+AG V          +K D   +DSL  +  N S      
Sbjct: 291 EVEEEFGFMNVKRAMLVCRVVAGRVGCDLIDDDDVDKSDGGGYDSLVGQSGNKSGALLRI 350

Query: 337 DFEELYVLSPRALLPCFVVIY 357
           D +EL V +PRA+LPCFV++Y
Sbjct: 351 DDDELLVFNPRAVLPCFVIVY 371


>AT1G62520.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G12450.1); Has 388 Blast hits to 388 proteins
           in 26 species: Archae - 0; Bacteria - 1; Metazoa - 0;
           Fungi - 8; Plants - 376; Viruses - 0; Other Eukaryotes -
           3 (source: NCBI BLink). | chr1:23144506-23145348 FORWARD
           LENGTH=280
          Length = 280

 Score = 85.9 bits (211), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 71/215 (33%), Positives = 103/215 (47%), Gaps = 42/215 (19%)

Query: 152 SVTELQED-SSRQIIETICGTSSVNSENILGQIDSILKVQNMPKTLACFEEYREKVKIKA 210
           ++TEL E   SR ++E I  TS    +   G+++ I KVQN  KTL  FEEYRE VK ++
Sbjct: 99  TLTELSEGHQSRNVVEIIFQTS-WGPKPFSGRVEMIFKVQNGSKTLTRFEEYREAVKARS 157

Query: 211 -EKLQKNHPRCLVDGNELLRIHGTNIACSLGTNSSYSLCTLDHCGACQILRHGFSTNKEF 269
             K ++ + R + DGNE +R +   +  S G   S                  +      
Sbjct: 158 VGKAREENARSVADGNETMRFY--CLGPSYGGGGS-----------------AWGILGGK 198

Query: 270 QCALGVYTTSTSAKAFDSIVLSNERQF---ERKTVIVCRVIAGIVYSPHTQEKVD---HS 323
                +YT + S+ A       NE+      RK ++VCRVIAG V +   + K D    S
Sbjct: 199 GGGASIYTFAGSSTA-------NEKAGGGKGRKAMLVCRVIAGRV-TKQNELKYDSDLRS 250

Query: 324 EFDSLAEKISNHSDFEELYVLSPRALLPCFVVIYK 358
            FDS++       D  EL V   RA+LPCF++IY+
Sbjct: 251 RFDSVS------GDDGELLVFDTRAVLPCFLIIYR 279


>AT4G22560.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G12450.1); Has 380 Blast hits to 380 proteins
           in 21 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 6; Plants - 374; Viruses - 0; Other Eukaryotes -
           0 (source: NCBI BLink). | chr4:11880178-11880972 FORWARD
           LENGTH=264
          Length = 264

 Score = 81.6 bits (200), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 65/220 (29%), Positives = 101/220 (45%), Gaps = 55/220 (25%)

Query: 149 SNHSVTELQED-SSRQIIETICGTSSVNSENILGQIDSILKVQNMPKTLACFEEYREKVK 207
           ++ ++TEL +   SR ++E I   SS +S+   G+I+ I KV++  +T+  FEEYRE VK
Sbjct: 89  TSDALTELPDGHPSRNVVEIIF-HSSWSSDEFPGRIEMIFKVEHGSRTVTRFEEYREVVK 147

Query: 208 IKAE----KLQKNHPRCLVDGNELLRIH----GTNI-ACSLGTNSSYSLCTLDHCGACQI 258
            +A       ++   RCL DGNE++R +    G N  AC        ++CT    G    
Sbjct: 148 SRAGFNGGTCEEEDARCLADGNEMMRFYPVLDGFNGGACVFAGGKGQAVCTFSGSGE--- 204

Query: 259 LRHGFSTNKEFQCALGVYTTSTSAKAFDSIVLSNERQFERKTVIVCRVIAGIVYSPHTQE 318
                            Y +S                  RK +++CRVIAG V      +
Sbjct: 205 ----------------AYVSSGGGGG-------------RKAMMICRVIAGRV------D 229

Query: 319 KVDHSEFDSLAEKISNHSDFEELYVLSPRALLPCFVVIYK 358
            V     DS+A +        EL+V   RA+LPCF++I++
Sbjct: 230 DVIGFGSDSVAGRDG------ELFVFDTRAVLPCFLIIFR 263


>AT4G12450.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G22560.1); Has 380 Blast hits to 380 proteins
           in 23 species: Archae - 0; Bacteria - 0; Metazoa - 1;
           Fungi - 4; Plants - 374; Viruses - 0; Other Eukaryotes -
           1 (source: NCBI BLink). | chr4:7385841-7386674 REVERSE
           LENGTH=277
          Length = 277

 Score = 69.3 bits (168), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 59/210 (28%), Positives = 91/210 (43%), Gaps = 51/210 (24%)

Query: 161 SRQIIETICGTSSVNSENILGQIDSILKVQNMPKTLACFEEYREKVK------IKAEKLQ 214
           SR ++E I   SS +S+   G+++ I KV+N  K +  FEEYRE VK      + ++++ 
Sbjct: 106 SRNVVEIIF-QSSWSSDEFPGRVEMIFKVENGSKAVTRFEEYREAVKSRSCSKVDSDRVD 164

Query: 215 KN----HPRCLVDGNELLRIHGTNIACSLGTNSSYSLCTLDHCGACQILRHGFSTNKEFQ 270
            +    + RC  DGNE++R              ++                GF   K   
Sbjct: 165 GSACDENARCSADGNEMMRFFPLGPIPGGINGGAW----------------GFPGGK--- 205

Query: 271 CALGVYTTSTSAKAFDSIVLSNERQFERKTVIVCRVIAGIVYSPHTQEKVDHSEF--DSL 328
               V T S S +A  S      R    + +++CRVIAG V            EF  DS+
Sbjct: 206 -GAAVCTFSGSGEAHASTGGGGGR----RAMLICRVIAGRV--------AKKGEFGSDSV 252

Query: 329 AEKISNHSDFEELYVLSPRALLPCFVVIYK 358
           A +        EL V   RA+LPCF++ ++
Sbjct: 253 AGRAG------ELIVFDARAVLPCFLIFFR 276
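

The summary table and the per-hit statistics lines above follow the fixed layout of legacy BLASTP text output, so they can be read back with a few lines of code. The following is a minimal sketch, not part of the BLAST distribution: the function names `parse_hit_line`, `parse_stats_line`, and `truncated_percent` are hypothetical, and the observation that percentages are truncated rather than rounded is an inference from the figures in this report (for example 143/270 is shown as 52%).

```python
import re

# Matches "Identities = 143/270 (52%)" style fields on a statistics line.
STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_hit_line(line):
    """Split one summary-table row into (subject_id, bit_score, e_value)."""
    # The last two whitespace-separated fields are the bit score and E-value;
    # everything before them is the (possibly truncated) description.
    description, bits, evalue = line.rsplit(None, 2)
    # Legacy BLAST can print very small E-values without a leading digit,
    # e.g. "e-100"; prefix "1" so float() accepts it.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return description.split()[0], float(bits), float(evalue)


def parse_stats_line(line):
    """Return {field: (count, alignment_length, reported_percent)}."""
    return {
        m.group(1): (int(m.group(2)), int(m.group(3)), int(m.group(4)))
        for m in STAT.finditer(line)
    }


def truncated_percent(count, length):
    """Integer percentage, truncated (assumed to match this report)."""
    return 100 * count // length
```

Applied to the first hit, `parse_hit_line` separates the identifier `AT4G27240.1` from its score and E-value, and `parse_stats_line` recovers the three count/length pairs, which can then be checked against `truncated_percent`.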