Miyakogusa Predicted Gene
- Lj6g3v2145450.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj6g3v2145450.1 Non Characterized Hit- tr|I1ME64|I1ME64_SOYBN
Uncharacterized protein OS=Glycine max PE=4
SV=1,62.81,0,ADP-ribosylation,NULL; ZINC_FINGER_C2H2_1,Zinc finger,
C2H2; no description,NULL; SUBFAMILY NOT NAME,CUFF.60702.1
(375 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value
AT4G27240.1 | Symbols: | zinc finger (C2H2 type) family protein...   268   4e-72
AT5G54630.1 | Symbols: | zinc finger protein-related | chr5:221...   265   3e-71
AT1G11490.1 | Symbols: | zinc finger (C2H2 type) family protein...   215   5e-56
AT1G75710.1 | Symbols: | C2H2-like zinc finger protein | chr1:2...   162   3e-40
AT2G29660.1 | Symbols: | zinc finger (C2H2 type) family protein...   120   1e-27
AT1G62520.1 | Symbols: | unknown protein; BEST Arabidopsis thal...    86   5e-17
AT4G22560.1 | Symbols: | unknown protein; BEST Arabidopsis thal...    82   9e-16
AT4G12450.1 | Symbols: | unknown protein; BEST Arabidopsis thal...    69   5e-12
>AT4G27240.1 | Symbols: | zinc finger (C2H2 type) family protein |
chr4:13640160-13641640 FORWARD LENGTH=431
Length = 431
Score = 268 bits (686), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 143/270 (52%), Positives = 185/270 (68%), Gaps = 15/270 (5%)
Query: 97 DSEG-SSHRTRRTAPTTTSMLDQTDSSDVVSMLICQKCGEKLKNLDAVEPHHISNHSVTE 155
D EG H++RR ++ D+S V C KCGEK L+A E HH++ H+VTE
Sbjct: 170 DREGLGFHQSRRENDREAAI--NGDNSSVS----CHKCGEKFSKLEAAEAHHLTKHAVTE 223
Query: 156 LQE-DSSRQIIETICGTSSVNSENILGQIDSILKVQNMPKTLACFEEYREKVKIKAEKLQ 214
L E DSSR+I+E IC TS + +EN G+ID ILKV NM KTLA FEEYR+ VKI+A KLQ
Sbjct: 224 LMEGDSSRRIVEIICRTSWLKTENQGGRIDRILKVHNMQKTLARFEEYRDTVKIRASKLQ 283
Query: 215 KNHPRCLVDGNELLRIHGTNIACSLGTNSSYSLCTLDHCGACQILRHGFSTNKEFQCALG 274
K HPRC+ DGNELLR HGT +AC+LG N S SLC+ + C C+I+R+GFS +E +G
Sbjct: 284 KKHPRCIADGNELLRFHGTTVACALGINGSTSLCSSEKCCVCRIIRNGFSAKREMNNGIG 343
Query: 275 VYTTSTSAKAFDSIVLSNERQFERKTVIVCRVIAGIVYSPHTQEKVDH-----SEFDSLA 329
V+T STS +AF+SIV+ + +RK +IVCRVIAG V+ P E V+ S FDSLA
Sbjct: 344 VFTASTSERAFESIVIGDGGGGDRKALIVCRVIAGRVHRP--VENVEEMGGLLSGFDSLA 401
Query: 330 EKISNHSDFEELYVLSPRALLPCFVVIYKP 359
K+ +++ EELY+L+ RALLPCFV+I KP
Sbjct: 402 GKVGLYTNVEELYLLNSRALLPCFVLICKP 431
>AT5G54630.1 | Symbols: | zinc finger protein-related |
chr5:22192607-22194260 REVERSE LENGTH=472
Length = 472
Score = 265 bits (678), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 136/242 (56%), Positives = 172/242 (71%), Gaps = 14/242 (5%)
Query: 130 CQKCGEKLKNLDAVEPHHISNHSVTELQE-DSSRQIIETICGTSSVNSENILGQIDSILK 188
C KCGE+ L+A E HH+S H+VTEL E DSSR+I+E IC TS + SEN G+ID +LK
Sbjct: 233 CHKCGEQFNKLEAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQCGRIDRVLK 292
Query: 189 VQNMPKTLACFEEYREKVKIKAEKLQKNHPRCLVDGNELLRIHGTNIACSLGTNSSYSLC 248
V NM KTLA FEEYRE VKI+A KLQK HPRCL DGNELLR HGT +AC LG N S S+C
Sbjct: 293 VHNMQKTLARFEEYRETVKIRASKLQKKHPRCLADGNELLRFHGTTVACGLGINGSTSVC 352
Query: 249 TLDHCGACQILRHGFSTNKEFQCALGVYTTSTSAKAFDSIVLSNERQFE------RKTVI 302
T + C C+I+R+GFS+ +E +GV+T STS +AF+SI+++ + RK +I
Sbjct: 353 TAEKCCVCRIIRNGFSSKREKNNGVGVFTASTSGRAFESILVNGGDESGDVDRTVRKVLI 412
Query: 303 VCRVIAGIVYSPHTQEKVDH-----SEFDSLAEKISNHSDFEELYVLSPRALLPCFVVIY 357
VCRVIAG V+ P E V+ S FDSLA K+ +++ EELY+L+P+ALLPCFVVI
Sbjct: 413 VCRVIAGRVHRP--VENVEEMNGLMSGFDSLAGKVGLYTNVEELYLLNPKALLPCFVVIC 470
Query: 358 KP 359
KP
Sbjct: 471 KP 472
>AT1G11490.1 | Symbols: | zinc finger (C2H2 type) family protein |
chr1:3868884-3870065 REVERSE LENGTH=365
Length = 365
Score = 215 bits (547), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 116/242 (47%), Positives = 153/242 (63%), Gaps = 7/242 (2%)
Query: 125 VSMLICQKCGEKLKNLDAVEPHHISNHSVTEL-QEDSSRQIIETICGTSSVNSENIL--G 181
+L CQKC E++++LDA E H++SNHSV L D SR +E IC T + +
Sbjct: 124 FGVLACQKCHERVRDLDAFEAHYLSNHSVVRLLAGDFSRTTVELICNTGYSHKLGKMKGN 183
Query: 182 QIDSILKVQNMPKTLACFEEYREKVKIKAEKLQKNHPRCLVDGNELLRIHGTNIACSLG- 240
I +I K+QN+ + +A FE+YRE VKI+A KL K H RC+ DGNE L HGT ++C+LG
Sbjct: 184 NISAIFKIQNLQRVVADFEDYRELVKIRANKLSKKHSRCMADGNEFLGFHGTTLSCTLGF 243
Query: 241 TNSSYSLCTLDHCGACQILRHGFSTNKEFQCALGVYTTSTSAKAFDSIVLSNER-QFERK 299
+NSS +LC DHC C ILRHGFS GV T STS+ A +SI R +
Sbjct: 244 SNSSSNLCFSDHCEVCHILRHGFSPKTRPDGIKGVLTASTSSTALESIETDQGRNRGSLI 303
Query: 300 TVIVCRVIAGIVYSPHT--QEKVDHSEFDSLAEKISNHSDFEELYVLSPRALLPCFVVIY 357
V++CRVIAG V+ P + + SEFDSLA K+ +S EELY+LS +ALLPCFV+I+
Sbjct: 304 AVVLCRVIAGRVHKPMQTFENSLGFSEFDSLALKVGQNSRIEELYLLSTKALLPCFVIIF 363
Query: 358 KP 359
KP
Sbjct: 364 KP 365
>AT1G75710.1 | Symbols: | C2H2-like zinc finger protein |
chr1:28428806-28431128 FORWARD LENGTH=462
Length = 462
Score = 162 bits (410), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 103/256 (40%), Positives = 142/256 (55%), Gaps = 30/256 (11%)
Query: 130 CQKCGEKLKNLDAVEPHHISNHSVTEL-QEDSSRQIIETICGTSSVNSENILGQIDSILK 188
C +CGE L+++E H H+V+EL EDS R I+E I +S + ++ + QI+ ILK
Sbjct: 206 CSQCGEVFPKLESLELHQAVRHAVSELGPEDSGRNIVEIIFKSSWLKKDSPICQIERILK 265
Query: 189 VQNMPKTLACFEEYREKVKIKAEKLQKNHPRCLVDGNELLRIHGTNIACSLGTNSSYSLC 248
V N +T+ FE+ R+ VK +A + + RC DGNELLR H T + CSLG S SLC
Sbjct: 266 VHNTQRTIQRFEDCRDAVKARALQATRKDARCAADGNELLRFHCTTLTCSLGARGSSSLC 325
Query: 249 T-LDHCGACQILRHGFSTNKEFQCA----LGVYTTSTSAKAFDSIVLSNERQFERKTVIV 303
+ L CG C ++RHGF A GV TT++S +A D + S++ R+ ++V
Sbjct: 326 SNLPVCGVCTVIRHGFQGKSGGGGANVANAGVRTTASSGRADDLLRCSDD---ARRVMLV 382
Query: 304 CRVIAGIVY--------SPHTQEKVDHSE-------------FDSLAEKISNHSDFEELY 342
CRVIAG V + T EK E FDS+A +S+ EEL
Sbjct: 383 CRVIAGRVKRVDLPAADASATAEKKSTVEDNSVVGVSSSGGTFDSVAVNAGVYSNLEELV 442
Query: 343 VLSPRALLPCFVVIYK 358
V +PRA+LPCFVVIYK
Sbjct: 443 VYNPRAILPCFVVIYK 458
>AT2G29660.1 | Symbols: | zinc finger (C2H2 type) family protein |
chr2:12679346-12680467 FORWARD LENGTH=373
Length = 373
Score = 120 bits (302), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 88/261 (33%), Positives = 130/261 (49%), Gaps = 28/261 (10%)
Query: 118 QTDSSDVVSMLICQKCGEKLKNLDAVEPHHISNHSVTEL-QEDSSRQIIETICGTSSVNS 176
+ SSD + C CGE ++ +E H H+V+EL +SS I++ I +
Sbjct: 118 EISSSD--EIFPCNSCGEIFPKINLLENHIAIKHAVSELIAGESSTNIVKIIFKSGWPEQ 175
Query: 177 ENILGQ-IDSILKVQNMPKTLACFEEYREKVKIKAEK-----LQKNHPRCLVDGNELLRI 230
N I+ ILK+ N K L FEEYRE VK KA + + + RC+ DGNELLR
Sbjct: 176 GNYKSPVINRILKIHNSSKILTRFEEYREFVKAKAARSNGGGRRWDDERCVADGNELLRF 235
Query: 231 HGTNIACSLGTNSSYSLCTLDHCGACQILRHGFSTNKEFQCALGVYTTSTSAKAFDSIVL 290
+ + C LG N +LC +C C I+ GFS + G+ T +T + ++
Sbjct: 236 YCSTFMCDLGQNGKSNLCGHQYCSICGIIGSGFSPKLD-----GIATLATGWRGHVAVPE 290
Query: 291 SNERQFE----RKTVIVCRVIAGIV----YSPHTQEKVDHSEFDSLAEKISNHS------ 336
E +F ++ ++VCRV+AG V +K D +DSL + N S
Sbjct: 291 EVEEEFGFMNVKRAMLVCRVVAGRVGCDLIDDDDVDKSDGGGYDSLVGQSGNKSGALLRI 350
Query: 337 DFEELYVLSPRALLPCFVVIY 357
D +EL V +PRA+LPCFV++Y
Sbjct: 351 DDDELLVFNPRAVLPCFVIVY 371
>AT1G62520.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G12450.1); Has 388 Blast hits to 388 proteins
in 26 species: Archae - 0; Bacteria - 1; Metazoa - 0;
Fungi - 8; Plants - 376; Viruses - 0; Other Eukaryotes -
3 (source: NCBI BLink). | chr1:23144506-23145348 FORWARD
LENGTH=280
Length = 280
Score = 85.9 bits (211), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 71/215 (33%), Positives = 103/215 (47%), Gaps = 42/215 (19%)
Query: 152 SVTELQED-SSRQIIETICGTSSVNSENILGQIDSILKVQNMPKTLACFEEYREKVKIKA 210
++TEL E SR ++E I TS + G+++ I KVQN KTL FEEYRE VK ++
Sbjct: 99 TLTELSEGHQSRNVVEIIFQTS-WGPKPFSGRVEMIFKVQNGSKTLTRFEEYREAVKARS 157
Query: 211 -EKLQKNHPRCLVDGNELLRIHGTNIACSLGTNSSYSLCTLDHCGACQILRHGFSTNKEF 269
K ++ + R + DGNE +R + + S G S +
Sbjct: 158 VGKAREENARSVADGNETMRFY--CLGPSYGGGGS-----------------AWGILGGK 198
Query: 270 QCALGVYTTSTSAKAFDSIVLSNERQF---ERKTVIVCRVIAGIVYSPHTQEKVD---HS 323
+YT + S+ A NE+ RK ++VCRVIAG V + + K D S
Sbjct: 199 GGGASIYTFAGSSTA-------NEKAGGGKGRKAMLVCRVIAGRV-TKQNELKYDSDLRS 250
Query: 324 EFDSLAEKISNHSDFEELYVLSPRALLPCFVVIYK 358
FDS++ D EL V RA+LPCF++IY+
Sbjct: 251 RFDSVS------GDDGELLVFDTRAVLPCFLIIYR 279
>AT4G22560.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G12450.1); Has 380 Blast hits to 380 proteins
in 21 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 6; Plants - 374; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr4:11880178-11880972 FORWARD
LENGTH=264
Length = 264
Score = 81.6 bits (200), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 65/220 (29%), Positives = 101/220 (45%), Gaps = 55/220 (25%)
Query: 149 SNHSVTELQED-SSRQIIETICGTSSVNSENILGQIDSILKVQNMPKTLACFEEYREKVK 207
++ ++TEL + SR ++E I SS +S+ G+I+ I KV++ +T+ FEEYRE VK
Sbjct: 89 TSDALTELPDGHPSRNVVEIIF-HSSWSSDEFPGRIEMIFKVEHGSRTVTRFEEYREVVK 147
Query: 208 IKAE----KLQKNHPRCLVDGNELLRIH----GTNI-ACSLGTNSSYSLCTLDHCGACQI 258
+A ++ RCL DGNE++R + G N AC ++CT G
Sbjct: 148 SRAGFNGGTCEEEDARCLADGNEMMRFYPVLDGFNGGACVFAGGKGQAVCTFSGSGE--- 204
Query: 259 LRHGFSTNKEFQCALGVYTTSTSAKAFDSIVLSNERQFERKTVIVCRVIAGIVYSPHTQE 318
Y +S RK +++CRVIAG V +
Sbjct: 205 ----------------AYVSSGGGGG-------------RKAMMICRVIAGRV------D 229
Query: 319 KVDHSEFDSLAEKISNHSDFEELYVLSPRALLPCFVVIYK 358
V DS+A + EL+V RA+LPCF++I++
Sbjct: 230 DVIGFGSDSVAGRDG------ELFVFDTRAVLPCFLIIFR 263
>AT4G12450.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G22560.1); Has 380 Blast hits to 380 proteins
in 23 species: Archae - 0; Bacteria - 0; Metazoa - 1;
Fungi - 4; Plants - 374; Viruses - 0; Other Eukaryotes -
1 (source: NCBI BLink). | chr4:7385841-7386674 REVERSE
LENGTH=277
Length = 277
Score = 69.3 bits (168), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 59/210 (28%), Positives = 91/210 (43%), Gaps = 51/210 (24%)
Query: 161 SRQIIETICGTSSVNSENILGQIDSILKVQNMPKTLACFEEYREKVK------IKAEKLQ 214
SR ++E I SS +S+ G+++ I KV+N K + FEEYRE VK + ++++
Sbjct: 106 SRNVVEIIF-QSSWSSDEFPGRVEMIFKVENGSKAVTRFEEYREAVKSRSCSKVDSDRVD 164
Query: 215 KN----HPRCLVDGNELLRIHGTNIACSLGTNSSYSLCTLDHCGACQILRHGFSTNKEFQ 270
+ + RC DGNE++R ++ GF K
Sbjct: 165 GSACDENARCSADGNEMMRFFPLGPIPGGINGGAW----------------GFPGGK--- 205
Query: 271 CALGVYTTSTSAKAFDSIVLSNERQFERKTVIVCRVIAGIVYSPHTQEKVDHSEF--DSL 328
V T S S +A S R + +++CRVIAG V EF DS+
Sbjct: 206 -GAAVCTFSGSGEAHASTGGGGGR----RAMLICRVIAGRV--------AKKGEFGSDSV 252
Query: 329 AEKISNHSDFEELYVLSPRALLPCFVVIYK 358
A + EL V RA+LPCF++ ++
Sbjct: 253 AGRAG------ELIVFDARAVLPCFLIFFR 276
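
The one-line hit summaries near the top of this report (subject ID, truncated description, bit score, E-value) can be pulled into structured records for sorting or filtering. The following is a minimal Python sketch, not part of the BLAST output itself. The names `Hit`, `parse_hit_line`, and `parse_hits` are hypothetical, and the regular expression assumes the classic pairwise text layout shown here, where each summary line ends with the bit score followed by the E-value.

```python
import re
from typing import List, NamedTuple, Optional


class Hit(NamedTuple):
    """One row of the 'Sequences producing significant alignments' table."""
    subject_id: str
    description: str
    bits: float
    evalue: float


# Assumed layout: "<id> | <description> <bits> <evalue>" with flexible spacing.
# The E-value may be written as "4e-72", "0.0", or (in some older outputs) "e-100".
HIT_LINE = re.compile(
    r"^(?P<id>\S+)\s+\|\s*(?P<desc>.*?)\s+"
    r"(?P<bits>\d+(?:\.\d+)?)\s+"
    r"(?P<evalue>(?:\d+\.?\d*)?e[+-]?\d+|\d+\.?\d*)\s*$"
)


def parse_hit_line(line: str) -> Optional[Hit]:
    """Return a Hit for a summary line, or None if the line does not match."""
    match = HIT_LINE.match(line.strip())
    if match is None:
        return None
    evalue_text = match.group("evalue")
    if evalue_text.startswith("e"):
        # A bare exponent such as "e-100" is read as 1e-100.
        evalue_text = "1" + evalue_text
    return Hit(
        subject_id=match.group("id"),
        description=match.group("desc"),
        bits=float(match.group("bits")),
        evalue=float(evalue_text),
    )


def parse_hits(report_text: str) -> List[Hit]:
    """Collect every line of the text that looks like a hit summary line."""
    hits = []
    for line in report_text.splitlines():
        hit = parse_hit_line(line)
        if hit is not None:
            hits.append(hit)
    return hits


if __name__ == "__main__":
    sample = (
        "AT4G27240.1 | Symbols: | zinc finger (C2H2 type) family protein... 268 4e-72\n"
        "AT4G12450.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 69 5e-12\n"
    )
    for hit in parse_hits(sample):
        print(hit.subject_id, hit.bits, hit.evalue)
```

Because the pattern requires a `|` right after the first token and two numeric fields at the end of the line, the `Query:`/`Sbjct:` alignment rows and the `>`-prefixed subject headers are skipped. For anything beyond a quick look, BLAST's tabular or XML output is generally easier to parse reliably than this text layout.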