Miyakogusa Predicted Gene
- Lj5g3v0176630.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v0176630.1 Non Characterized Hit- tr|G7I760|G7I760_MEDTR
Putative uncharacterized protein OS=Medicago
truncatul,79.06,0,seg,NULL; ZINC_FINGER_C2H2_1,Zinc finger, C2H2;
ADP-ribosylation,NULL; SUBFAMILY NOT NAMED,NULL; FAM,CUFF.52623.1
(297 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
                                                                    Score    E
Sequences producing significant alignments:                        (bits)  Value
AT1G75710.1 | Symbols: | C2H2-like zinc finger protein | chr1:2...    349   1e-96
AT4G27240.1 | Symbols: | zinc finger (C2H2 type) family protein...    156   1e-38
AT5G54630.1 | Symbols: | zinc finger protein-related | chr5:221...    151   6e-37
AT2G29660.1 | Symbols: | zinc finger (C2H2 type) family protein...    131   6e-31
AT1G11490.1 | Symbols: | zinc finger (C2H2 type) family protein...     97   2e-20
AT1G62520.1 | Symbols: | unknown protein; BEST Arabidopsis thal...     76   3e-14
AT4G22560.1 | Symbols: | unknown protein; BEST Arabidopsis thal...     76   3e-14
AT4G12450.1 | Symbols: | unknown protein; BEST Arabidopsis thal...     73   3e-13
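
The summary rows above can be read programmatically. The following is a minimal sketch, not part of the BLAST output: it assumes each hit row ends with the bit score and the E-value as its last two whitespace-separated fields, and that everything before the first `|` is the sequence identifier. The function name `parse_hit_row` is hypothetical.

```python
def parse_hit_row(line):
    """Parse one row of the 'Sequences producing significant alignments' table.

    Returns a dict, or None if the line does not look like a hit row.
    Assumption: the last two whitespace-separated fields are the bit score
    and the E-value; the text before the first '|' is the sequence id.
    """
    try:
        head, bits, evalue = line.rstrip().rsplit(None, 2)
    except ValueError:
        return None  # fewer than three fields, e.g. a blank line

    # Legacy BLAST can print very small E-values without a mantissa
    # (for example "e-120"); treat that as 1e-120 so float() accepts it.
    if evalue.startswith("e"):
        evalue = "1" + evalue

    try:
        bits_value = float(bits)
        e_value = float(evalue)
    except ValueError:
        return None  # header lines such as "... (bits) Value" end up here

    seq_id, _, description = head.partition("|")
    return {
        "id": seq_id.strip(),
        "description": description.strip(),
        "bits": bits_value,
        "evalue": e_value,
    }


if __name__ == "__main__":
    row = "AT4G27240.1 | Symbols: | zinc finger (C2H2 type) family protein... 156 1e-38"
    print(parse_hit_row(row))
```

Splitting from the right avoids having to parse the free-text description, which itself contains `|` separators and a truncation marker (`...`).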
>AT1G75710.1 | Symbols: | C2H2-like zinc finger protein |
chr1:28428806-28431128 FORWARD LENGTH=462
Length = 462
Score = 349 bits (895), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 183/285 (64%), Positives = 210/285 (73%), Gaps = 21/285 (7%)
Query: 26 DQIKNLISCKQIEGSRVIDPSKGHXXXXXXXXXXXXX----------XFKDVVHGNTRVV 75
DQIKNL++CKQIEGSRV DPSK F+DV HGNTRVV
Sbjct: 57 DQIKNLLTCKQIEGSRVHDPSKNSQSGPSMTTNLSPSKLGSSCSSICSFRDVAHGNTRVV 116
Query: 76 HRSDNSSP--ESSTLGQETGLLSRKPAPHHSTSGSAG-----KSNCXXXXXXXXXXX-XX 127
HR+D+S S+T ET LL+RKP H S+S + +SN
Sbjct: 117 HRADHSPDVGNSATPNSETRLLTRKPGQHGSSSSRSLTSGSTRSNASGSYTSSSTTSFRA 176
Query: 128 MQFRKLSGCYECHMIIDPSRQPM-PRSTICVCSQCGEVFPKMESLELHQAVRHAVSELGP 186
MQFRKLSGCYECHMI+DPSR P+ PR +C CSQCGEVFPK+ESLELHQAVRHAVSELGP
Sbjct: 177 MQFRKLSGCYECHMIVDPSRYPISPR--VCACSQCGEVFPKLESLELHQAVRHAVSELGP 234
Query: 187 EDSGRNIVEIIFKSSWLKRDTPMCKIERILKVHNTQRTIQRFEECRDAVKSRAVSSTKKN 246
EDSGRNIVEIIFKSSWLK+D+P+C+IERILKVHNTQRTIQRFE+CRDAVK+RA+ +T+K+
Sbjct: 235 EDSGRNIVEIIFKSSWLKKDSPICQIERILKVHNTQRTIQRFEDCRDAVKARALQATRKD 294
Query: 247 PRCAADGNELLRFHCTALTCALGERGSSALCSAGPSCGVCTIIRH 291
RCAADGNELLRFHCT LTC+LG RGSS+LCS P CGVCT+IRH
Sbjct: 295 ARCAADGNELLRFHCTTLTCSLGARGSSSLCSNLPVCGVCTVIRH 339
>AT4G27240.1 | Symbols: | zinc finger (C2H2 type) family protein |
chr4:13640160-13641640 FORWARD LENGTH=431
Length = 431
Score = 156 bits (395), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 79/140 (56%), Positives = 98/140 (70%), Gaps = 1/140 (0%)
Query: 158 CSQCGEVFPKMESLELHQAVRHAVSELGPEDSGRNIVEIIFKSSWLKRDTPMCKIERILK 217
C +CGE F K+E+ E H +HAV+EL DS R IVEII ++SWLK + +I+RILK
Sbjct: 198 CHKCGEKFSKLEAAEAHHLTKHAVTELMEGDSSRRIVEIICRTSWLKTENQGGRIDRILK 257
Query: 218 VHNTQRTIQRFEECRDAVKSRAVSSTKKNPRCAADGNELLRFHCTALTCALGERGSSALC 277
VHN Q+T+ RFEE RD VK RA KK+PRC ADGNELLRFH T + CALG GS++LC
Sbjct: 258 VHNMQKTLARFEEYRDTVKIRASKLQKKHPRCIADGNELLRFHGTTVACALGINGSTSLC 317
Query: 278 SAGPSCGVCTIIRHGFQGGK 297
S+ C VC IIR+GF +
Sbjct: 318 SS-EKCCVCRIIRNGFSAKR 336
>AT5G54630.1 | Symbols: | zinc finger protein-related |
chr5:22192607-22194260 REVERSE LENGTH=472
Length = 472
Score = 151 bits (381), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 75/136 (55%), Positives = 96/136 (70%), Gaps = 1/136 (0%)
Query: 158 CSQCGEVFPKMESLELHQAVRHAVSELGPEDSGRNIVEIIFKSSWLKRDTPMCKIERILK 217
C +CGE F K+E+ E H +HAV+EL DS R IVEII ++SWLK + +I+R+LK
Sbjct: 233 CHKCGEQFNKLEAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQCGRIDRVLK 292
Query: 218 VHNTQRTIQRFEECRDAVKSRAVSSTKKNPRCAADGNELLRFHCTALTCALGERGSSALC 277
VHN Q+T+ RFEE R+ VK RA KK+PRC ADGNELLRFH T + C LG GS+++C
Sbjct: 293 VHNMQKTLARFEEYRETVKIRASKLQKKHPRCLADGNELLRFHGTTVACGLGINGSTSVC 352
Query: 278 SAGPSCGVCTIIRHGF 293
+A C VC IIR+GF
Sbjct: 353 TA-EKCCVCRIIRNGF 367
>AT2G29660.1 | Symbols: | zinc finger (C2H2 type) family protein |
chr2:12679346-12680467 FORWARD LENGTH=373
Length = 373
Score = 131 bits (329), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 70/147 (47%), Positives = 93/147 (63%), Gaps = 11/147 (7%)
Query: 155 ICVCSQCGEVFPKMESLELHQAVRHAVSELGPEDSGRNIVEIIFKSSWLKR---DTPMCK 211
I C+ CGE+FPK+ LE H A++HAVSEL +S NIV+IIFKS W ++ +P+
Sbjct: 125 IFPCNSCGEIFPKINLLENHIAIKHAVSELIAGESSTNIVKIIFKSGWPEQGNYKSPV-- 182
Query: 212 IERILKVHNTQRTIQRFEECRDAVKSRAVSST-----KKNPRCAADGNELLRFHCTALTC 266
I RILK+HN+ + + RFEE R+ VK++A S + RC ADGNELLRF+C+ C
Sbjct: 183 INRILKIHNSSKILTRFEEYREFVKAKAARSNGGGRRWDDERCVADGNELLRFYCSTFMC 242
Query: 267 ALGERGSSALCSAGPSCGVCTIIRHGF 293
LG+ G S LC C +C II GF
Sbjct: 243 DLGQNGKSNLC-GHQYCSICGIIGSGF 268
>AT1G11490.1 | Symbols: | zinc finger (C2H2 type) family protein |
chr1:3868884-3870065 REVERSE LENGTH=365
Length = 365
Score = 96.7 bits (239), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 54/141 (38%), Positives = 73/141 (51%), Gaps = 2/141 (1%)
Query: 155 ICVCSQCGEVFPKMESLELHQAVRHAVSELGPEDSGRNIVEIIFKSSWLKRDTPM--CKI 212
+ C +C E +++ E H H+V L D R VE+I + + + M I
Sbjct: 126 VLACQKCHERVRDLDAFEAHYLSNHSVVRLLAGDFSRTTVELICNTGYSHKLGKMKGNNI 185
Query: 213 ERILKVHNTQRTIQRFEECRDAVKSRAVSSTKKNPRCAADGNELLRFHCTALTCALGERG 272
I K+ N QR + FE+ R+ VK RA +KK+ RC ADGNE L FH T L+C LG
Sbjct: 186 SAIFKIQNLQRVVADFEDYRELVKIRANKLSKKHSRCMADGNEFLGFHGTTLSCTLGFSN 245
Query: 273 SSALCSAGPSCGVCTIIRHGF 293
SS+ C VC I+RHGF
Sbjct: 246 SSSNLCFSDHCEVCHILRHGF 266
>AT1G62520.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G12450.1); Has 388 Blast hits to 388 proteins
in 26 species: Archae - 0; Bacteria - 1; Metazoa - 0;
Fungi - 8; Plants - 376; Viruses - 0; Other Eukaryotes -
3 (source: NCBI BLink). | chr1:23144506-23145348 FORWARD
LENGTH=280
Length = 280
Score = 76.3 bits (186), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 38/83 (45%), Positives = 55/83 (66%), Gaps = 4/83 (4%)
Query: 181 VSELGPEDSGRNIVEIIFKSSWLKRDTPMC-KIERILKVHNTQRTIQRFEECRDAVKSRA 239
++EL RN+VEIIF++SW + P ++E I KV N +T+ RFEE R+AVK+R+
Sbjct: 100 LTELSEGHQSRNVVEIIFQTSWGPK--PFSGRVEMIFKVQNGSKTLTRFEEYREAVKARS 157
Query: 240 VSSTK-KNPRCAADGNELLRFHC 261
V + +N R ADGNE +RF+C
Sbjct: 158 VGKAREENARSVADGNETMRFYC 180
>AT4G22560.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G12450.1); Has 380 Blast hits to 380 proteins
in 21 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 6; Plants - 374; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr4:11880178-11880972 FORWARD
LENGTH=264
Length = 264
Score = 75.9 bits (185), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 46/113 (40%), Positives = 64/113 (56%), Gaps = 9/113 (7%)
Query: 180 AVSELGPEDSGRNIVEIIFKSSWLKRDTPMCKIERILKVHNTQRTIQRFEECRDAVKSRA 239
A++EL RN+VEIIF SSW + P +IE I KV + RT+ RFEE R+ VKSRA
Sbjct: 92 ALTELPDGHPSRNVVEIIFHSSWSSDEFP-GRIEMIFKVEHGSRTVTRFEEYREVVKSRA 150
Query: 240 VSS----TKKNPRCAADGNELLRFHCTALTCALGERGSSALCSAGPSCGVCTI 288
+ +++ RC ADGNE++RF+ G G + + + G VCT
Sbjct: 151 GFNGGTCEEEDARCLADGNEMMRFY----PVLDGFNGGACVFAGGKGQAVCTF 199
>AT4G12450.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G22560.1); Has 380 Blast hits to 380 proteins
in 23 species: Archae - 0; Bacteria - 0; Metazoa - 1;
Fungi - 4; Plants - 374; Viruses - 0; Other Eukaryotes -
1 (source: NCBI BLink). | chr4:7385841-7386674 REVERSE
LENGTH=277
Length = 277
Score = 72.8 bits (177), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 39/95 (41%), Positives = 57/95 (60%), Gaps = 11/95 (11%)
Query: 175 QAVRHAVSELGPEDSGRNIVEIIFKSSWLKRDTPMCKIERILKVHNTQRTIQRFEECRDA 234
++V +++L RN+VEIIF+SSW + P ++E I KV N + + RFEE R+A
Sbjct: 91 ESVLPVLTDLPDGHPSRNVVEIIFQSSWSSDEFP-GRVEMIFKVENGSKAVTRFEEYREA 149
Query: 235 VKSRAVSST----------KKNPRCAADGNELLRF 259
VKSR+ S +N RC+ADGNE++RF
Sbjct: 150 VKSRSCSKVDSDRVDGSACDENARCSADGNEMMRF 184
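
Each alignment above is introduced by a `Score = ...` line and an `Identities = ...` line. The sketch below shows one way to extract those statistics; the regular expressions are assumptions based only on the lines in this report, and the names `parse_hsp_stats` and `percent_truncated` are hypothetical. The percentages printed in this report appear to be truncated rather than rounded (210/285 is about 73.7% but is printed as 73%), so the helper uses integer division to reproduce them.

```python
import re

# Patterns modeled on the header lines in this report (an assumption, not a
# general BLAST grammar).
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)
IDENT_RE = re.compile(
    r"Identities\s*=\s*(\d+)/(\d+).*?"
    r"Positives\s*=\s*(\d+)/(\d+).*?"
    r"Gaps\s*=\s*(\d+)/(\d+)"
)


def percent_truncated(numerator, denominator):
    """Integer percentage, truncated toward zero as in the report."""
    return 100 * numerator // denominator


def parse_hsp_stats(score_line, ident_line):
    """Return the statistics of one alignment, or None if a line does not match."""
    score = SCORE_RE.search(score_line)
    ident = IDENT_RE.search(ident_line)
    if score is None or ident is None:
        return None

    expect = score.group(3)
    if expect.startswith("e"):  # legacy form such as "e-120"
        expect = "1" + expect

    counts = [int(x) for x in ident.groups()]
    return {
        "bits": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "expect": float(expect),
        "identities": (counts[0], counts[1]),
        "positives": (counts[2], counts[3]),
        "gaps": (counts[4], counts[5]),
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Score = 349 bits (895), Expect = 1e-96, Method: Compositional matrix adjust.",
        "Identities = 183/285 (64%), Positives = 210/285 (73%), Gaps = 21/285 (7%)",
    )
    print(stats)
    print(percent_truncated(*stats["positives"]))
```

Keeping the raw counts as `(matches, aligned_length)` pairs lets the alignment length (285 columns for the top hit) be compared against the 297-residue query without relying on the printed percentages.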