Miyakogusa Predicted Gene
- Lj1g3v0726890.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v0726890.1 Non Characterized Hit- tr|G7J3K8|G7J3K8_MEDTR
Putative uncharacterized protein OS=Medicago
truncatul,73.33,0,seg,NULL; ADP-ribosylation,NULL; no
description,NULL; SUBFAMILY NOT NAMED,NULL; FAMILY NOT
NAMED,NUL,CUFF.26222.1
(440 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value
AT1G75710.1 | Symbols: | C2H2-like zinc finger protein | chr1:2...   299   2e-81
AT4G27240.1 | Symbols: | zinc finger (C2H2 type) family protein...   236   2e-62
AT5G54630.1 | Symbols: | zinc finger protein-related | chr5:221...   232   4e-61
AT2G29660.1 | Symbols: | zinc finger (C2H2 type) family protein...   189   2e-48
AT1G11490.1 | Symbols: | zinc finger (C2H2 type) family protein...   157   1e-38
AT1G62520.1 | Symbols: | unknown protein; BEST Arabidopsis thal...   122   4e-28
AT4G22560.1 | Symbols: | unknown protein; BEST Arabidopsis thal...   119   4e-27
AT4G12450.1 | Symbols: | unknown protein; BEST Arabidopsis thal...   107   2e-23
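Note on reading the hit table programmatically: each row above ends with the bit score and the E-value, so the row can be split from the right without caring how the description in the middle was truncated. The sketch below is illustrative only; the function name parse_hit_line is hypothetical and is not part of BLAST. It assumes the first whitespace-delimited token is the subject identifier, and it pads a bare exponent such as "e-100", which legacy BLAST may print for very small E-values.

```python
def parse_hit_line(line):
    """Split one summary row into (subject_id, bit_score, e_value).

    Hypothetical helper: assumes the last two whitespace-separated
    fields are the bit score and the E-value, as in the table above.
    """
    head, score, evalue = line.rsplit(None, 2)
    subject_id = head.split(None, 1)[0]
    # Legacy BLAST may print "e-100" with no leading digit; make it parseable.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return subject_id, float(score), float(evalue)


row = "AT1G75710.1 | Symbols: | C2H2-like zinc finger protein | chr1:2... 299 2e-81"
print(parse_hit_line(row))
```

Splitting from the right keeps the parser independent of the "|" separators, which also occur inside some descriptions.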
>AT1G75710.1 | Symbols: | C2H2-like zinc finger protein |
chr1:28428806-28431128 FORWARD LENGTH=462
Length = 462
Score = 299 bits (766), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 158/293 (53%), Positives = 199/293 (67%), Gaps = 34/293 (11%)
Query: 172 FRAIPFRRLSGCYECRMVVDPVLGFTRDPSLRSSICSCPDCGEIM-KAESLEHHQAVKHA 230
FRA+ FR+LSGCYEC M+VDP +R P + +C+C CGE+ K ESLE HQAV+HA
Sbjct: 174 FRAMQFRKLSGCYECHMIVDP----SRYP-ISPRVCACSQCGEVFPKLESLELHQAVRHA 228
Query: 231 VSELGPEDTSKNIVEIIFHSSWLKKQSPVCKIDRILKVHNTQRTITKFEEYRDSIKAKAT 290
VSELGPED+ +NIVEIIF SSWLKK SP+C+I+RILKVHNTQRTI +FE+ RD++KA+A
Sbjct: 229 VSELGPEDSGRNIVEIIFKSSWLKKDSPICQIERILKVHNTQRTIQRFEDCRDAVKARAL 288
Query: 291 KLPKKHPRCIADGNELLRFHCTTFVCSLGLNGSSNICNSTSQCNVCSVIKHGFKFNRXXX 350
+ +K RC ADGNELLRFHCTT CSLG GSS++C++ C VC+VI+HGF+
Sbjct: 289 QATRKDARCAADGNELLRFHCTTLTCSLGARGSSSLCSNLPVCGVCTVIRHGFQGKSGGG 348
Query: 351 XXXIL-----TTATSGKAHDKASIAPEDDNDKRAMLVCRVIAGRVKK------------- 392
+ TTA+SG+A D + D+ +R MLVCRVIAGRVK+
Sbjct: 349 GANVANAGVRTTASSGRADDLLRCS---DDARRVMLVCRVIAGRVKRVDLPAADASATAE 405
Query: 393 -------NTEGGSGMMEEEYDSVAGDVGAYSNLDELYVFNPRAILPCFVVIYR 438
N+ G +DSVA + G YSNL+EL V+NPRAILPCFVVIY+
Sbjct: 406 KKSTVEDNSVVGVSSSGGTFDSVAVNAGVYSNLEELVVYNPRAILPCFVVIYK 458
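Note on the percentages in the statistics line: for the alignment above, 158/293 is about 53.9%, yet the report prints 53%. The printed values in this report are consistent with truncation (integer division) rather than rounding. The sketch below recomputes them from the counts; the regular expression and the name check_stats are assumptions made for illustration, not part of BLAST.

```python
import re

# Matches e.g. "Identities = 158/293 (53%)" in a legacy BLAST statistics line.
STATS = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def check_stats(line):
    """Return {label: (count, total, reported_pct, truncated_pct)}."""
    out = {}
    for label, num, den, pct in STATS.findall(line):
        num, den, pct = int(num), int(den), int(pct)
        # Integer division truncates toward zero for these positive counts.
        out[label] = (num, den, pct, num * 100 // den)
    return out


line = "Identities = 158/293 (53%), Positives = 199/293 (67%), Gaps = 34/293 (11%)"
for label, (num, den, reported, truncated) in check_stats(line).items():
    print(label, f"{num}/{den}", "reported:", reported, "recomputed:", truncated)
```

When comparing hits by percent identity, it is safer to work from the raw counts than from the truncated percentages.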
>AT4G27240.1 | Symbols: | zinc finger (C2H2 type) family protein |
chr4:13640160-13641640 FORWARD LENGTH=431
Length = 431
Score = 236 bits (602), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 121/238 (50%), Positives = 166/238 (69%), Gaps = 7/238 (2%)
Query: 204 SSICSCPDCGE-IMKAESLEHHQAVKHAVSELGPEDTSKNIVEIIFHSSWLKKQSPVCKI 262
+S SC CGE K E+ E H KHAV+EL D+S+ IVEII +SWLK ++ +I
Sbjct: 193 NSSVSCHKCGEKFSKLEAAEAHHLTKHAVTELMEGDSSRRIVEIICRTSWLKTENQGGRI 252
Query: 263 DRILKVHNTQRTITKFEEYRDSIKAKATKLPKKHPRCIADGNELLRFHCTTFVCSLGLNG 322
DRILKVHN Q+T+ +FEEYRD++K +A+KL KKHPRCIADGNELLRFH TT C+LG+NG
Sbjct: 253 DRILKVHNMQKTLARFEEYRDTVKIRASKLQKKHPRCIADGNELLRFHGTTVACALGING 312
Query: 323 SSNICNSTSQCNVCSVIKHGFKFNRXXXX-XXILTTATSGKAHDKASIAPEDDNDKRAML 381
S+++C S+ +C VC +I++GF R + T +TS +A + I D++A++
Sbjct: 313 STSLC-SSEKCCVCRIIRNGFSAKREMNNGIGVFTASTSERAFESIVIGDGGGGDRKALI 371
Query: 382 VCRVIAGRVKK---NTEGGSGMMEEEYDSVAGDVGAYSNLDELYVFNPRAILPCFVVI 436
VCRVIAGRV + N E G++ +DS+AG VG Y+N++ELY+ N RA+LPCFV+I
Sbjct: 372 VCRVIAGRVHRPVENVEEMGGLL-SGFDSLAGKVGLYTNVEELYLLNSRALLPCFVLI 428
>AT5G54630.1 | Symbols: | zinc finger protein-related |
chr5:22192607-22194260 REVERSE LENGTH=472
Length = 472
Score = 232 bits (591), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 118/244 (48%), Positives = 168/244 (68%), Gaps = 13/244 (5%)
Query: 204 SSICSCPDCGE-IMKAESLEHHQAVKHAVSELGPEDTSKNIVEIIFHSSWLKKQSPVCKI 262
+S SC CGE K E+ E H KHAV+EL D+S+ IVEII +SWLK ++ +I
Sbjct: 228 NSSVSCHKCGEQFNKLEAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQCGRI 287
Query: 263 DRILKVHNTQRTITKFEEYRDSIKAKATKLPKKHPRCIADGNELLRFHCTTFVCSLGLNG 322
DR+LKVHN Q+T+ +FEEYR+++K +A+KL KKHPRC+ADGNELLRFH TT C LG+NG
Sbjct: 288 DRVLKVHNMQKTLARFEEYRETVKIRASKLQKKHPRCLADGNELLRFHGTTVACGLGING 347
Query: 323 SSNICNSTSQCNVCSVIKHGFKFNRXXXX-XXILTTATSGKAHDKASIAPEDDND----- 376
S+++C + +C VC +I++GF R + T +TSG+A + + D++
Sbjct: 348 STSVC-TAEKCCVCRIIRNGFSSKREKNNGVGVFTASTSGRAFESILVNGGDESGDVDRT 406
Query: 377 -KRAMLVCRVIAGRVKK---NTEGGSGMMEEEYDSVAGDVGAYSNLDELYVFNPRAILPC 432
++ ++VCRVIAGRV + N E +G+M +DS+AG VG Y+N++ELY+ NP+A+LPC
Sbjct: 407 VRKVLIVCRVIAGRVHRPVENVEEMNGLM-SGFDSLAGKVGLYTNVEELYLLNPKALLPC 465
Query: 433 FVVI 436
FVVI
Sbjct: 466 FVVI 469
>AT2G29660.1 | Symbols: | zinc finger (C2H2 type) family protein |
chr2:12679346-12680467 FORWARD LENGTH=373
Length = 373
Score = 189 bits (481), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 115/254 (45%), Positives = 150/254 (59%), Gaps = 29/254 (11%)
Query: 206 ICSCPDCGEIM-KAESLEHHQAVKHAVSELGPEDTSKNIVEIIFHSSWLKK---QSPVCK 261
I C CGEI K LE+H A+KHAVSEL ++S NIV+IIF S W ++ +SPV
Sbjct: 125 IFPCNSCGEIFPKINLLENHIAIKHAVSELIAGESSTNIVKIIFKSGWPEQGNYKSPV-- 182
Query: 262 IDRILKVHNTQRTITKFEEYRDSIKAKATKLPK-----KHPRCIADGNELLRFHCTTFVC 316
I+RILK+HN+ + +T+FEEYR+ +KAKA + RC+ADGNELLRF+C+TF+C
Sbjct: 183 INRILKIHNSSKILTRFEEYREFVKAKAARSNGGGRRWDDERCVADGNELLRFYCSTFMC 242
Query: 317 SLGLNGSSNICNSTSQCNVCSVIKHGFKFNRXXXXXXILTTATSGKAHDKASIAPEDD-- 374
LG NG SN+C C++C +I GF I T AT + H E++
Sbjct: 243 DLGQNGKSNLCGH-QYCSICGIIGSGFS----PKLDGIATLATGWRGHVAVPEEVEEEFG 297
Query: 375 --NDKRAMLVCRVIAGRV---KKNTEGGSGMMEEEYDSVAGDVGAYSNL------DELYV 423
N KRAMLVCRV+AGRV + + YDS+ G G S DEL V
Sbjct: 298 FMNVKRAMLVCRVVAGRVGCDLIDDDDVDKSDGGGYDSLVGQSGNKSGALLRIDDDELLV 357
Query: 424 FNPRAILPCFVVIY 437
FNPRA+LPCFV++Y
Sbjct: 358 FNPRAVLPCFVIVY 371
>AT1G11490.1 | Symbols: | zinc finger (C2H2 type) family protein |
chr1:3868884-3870065 REVERSE LENGTH=365
Length = 365
Score = 157 bits (398), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 95/244 (38%), Positives = 139/244 (56%), Gaps = 16/244 (6%)
Query: 206 ICSCPDCGE-IMKAESLEHHQAVKHAVSELGPEDTSKNIVEIIFHSSWLKKQSPV--CKI 262
+ +C C E + ++ E H H+V L D S+ VE+I ++ + K + I
Sbjct: 126 VLACQKCHERVRDLDAFEAHYLSNHSVVRLLAGDFSRTTVELICNTGYSHKLGKMKGNNI 185
Query: 263 DRILKVHNTQRTITKFEEYRDSIKAKATKLPKKHPRCIADGNELLRFHCTTFVCSLGL-N 321
I K+ N QR + FE+YR+ +K +A KL KKH RC+ADGNE L FH TT C+LG N
Sbjct: 186 SAIFKIQNLQRVVADFEDYRELVKIRANKLSKKHSRCMADGNEFLGFHGTTLSCTLGFSN 245
Query: 322 GSSNICNSTSQCNVCSVIKHGFK-FNRXXXXXXILTTATSGKAHDKASIAPEDDNDKR-- 378
SSN+C S C VC +++HGF R +LT +TS A + SI + ++
Sbjct: 246 SSSNLCFS-DHCEVCHILRHGFSPKTRPDGIKGVLTASTSSTALE--SIETDQGRNRGSL 302
Query: 379 -AMLVCRVIAGRVKK---NTEGGSGMMEEEYDSVAGDVGAYSNLDELYVFNPRAILPCFV 434
A+++CRVIAGRV K E G E+DS+A VG S ++ELY+ + +A+LPCFV
Sbjct: 303 IAVVLCRVIAGRVHKPMQTFENSLGF--SEFDSLALKVGQNSRIEELYLLSTKALLPCFV 360
Query: 435 VIYR 438
+I++
Sbjct: 361 IIFK 364
>AT1G62520.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G12450.1); Has 388 Blast hits to 388 proteins
in 26 species: Archae - 0; Bacteria - 1; Metazoa - 0;
Fungi - 8; Plants - 376; Viruses - 0; Other Eukaryotes -
3 (source: NCBI BLink). | chr1:23144506-23145348 FORWARD
LENGTH=280
Length = 280
Score = 122 bits (307), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 80/211 (37%), Positives = 114/211 (54%), Gaps = 34/211 (16%)
Query: 231 VSELGPEDTSKNIVEIIFHSSWLKKQSPVC-KIDRILKVHNTQRTITKFEEYRDSIKAKA 289
++EL S+N+VEIIF +SW K P +++ I KV N +T+T+FEEYR+++KA++
Sbjct: 100 LTELSEGHQSRNVVEIIFQTSWGPK--PFSGRVEMIFKVQNGSKTLTRFEEYREAVKARS 157
Query: 290 T-KLPKKHPRCIADGNELLRFHCTTFVCSLGLNGSSNICNSTSQCNVCSVIKHGFKFNRX 348
K +++ R +ADGNE +RF+C S G GS+
Sbjct: 158 VGKAREENARSVADGNETMRFYC--LGPSYGGGGSAWGILGGKGGGAS------------ 203
Query: 349 XXXXXILTTATSGKAHDKASIAPEDDNDKRAMLVCRVIAGRVKKNTE-GGSGMMEEEYDS 407
I T A S A++KA ++AMLVCRVIAGRV K E + +DS
Sbjct: 204 -----IYTFAGSSTANEKAG----GGKGRKAMLVCRVIAGRVTKQNELKYDSDLRSRFDS 254
Query: 408 VAGDVGAYSNLDELYVFNPRAILPCFVVIYR 438
V+GD G EL VF+ RA+LPCF++IYR
Sbjct: 255 VSGDDG------ELLVFDTRAVLPCFLIIYR 279
>AT4G22560.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G12450.1); Has 380 Blast hits to 380 proteins
in 21 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 6; Plants - 374; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr4:11880178-11880972 FORWARD
LENGTH=264
Length = 264
Score = 119 bits (299), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 77/213 (36%), Positives = 114/213 (53%), Gaps = 45/213 (21%)
Query: 230 AVSELGPEDTSKNIVEIIFHSSWLKKQSPVCKIDRILKVHNTQRTITKFEEYRDSIKAKA 289
A++EL S+N+VEIIFHSSW + P +I+ I KV + RT+T+FEEYR+ +K++A
Sbjct: 92 ALTELPDGHPSRNVVEIIFHSSWSSDEFP-GRIEMIFKVEHGSRTVTRFEEYREVVKSRA 150
Query: 290 ----TKLPKKHPRCIADGNELLRFHCTTFVCSLGLNGSSNICNSTSQCNVCSVIKHGFKF 345
++ RC+ADGNE++RF+ G NG + + VC
Sbjct: 151 GFNGGTCEEEDARCLADGNEMMRFYPVL----DGFNGGACVFAGGKGQAVC--------- 197
Query: 346 NRXXXXXXILTTATSGKAHDKASIAPEDDNDKRAMLVCRVIAGRVKKNTEGGSGMMEEEY 405
T + SG+A+ ++ ++AM++CRVIAGRV GS
Sbjct: 198 ----------TFSGSGEAY----VSSGGGGGRKAMMICRVIAGRVDDVIGFGS------- 236
Query: 406 DSVAGDVGAYSNLDELYVFNPRAILPCFVVIYR 438
DSVAG G EL+VF+ RA+LPCF++I+R
Sbjct: 237 DSVAGRDG------ELFVFDTRAVLPCFLIIFR 263
>AT4G12450.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G22560.1); Has 380 Blast hits to 380 proteins
in 23 species: Archae - 0; Bacteria - 0; Metazoa - 1;
Fungi - 4; Plants - 374; Viruses - 0; Other Eukaryotes -
1 (source: NCBI BLink). | chr4:7385841-7386674 REVERSE
LENGTH=277
Length = 277
Score = 107 bits (266), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 74/224 (33%), Positives = 112/224 (50%), Gaps = 48/224 (21%)
Query: 225 QAVKHAVSELGPEDTSKNIVEIIFHSSWLKKQSPVCKIDRILKVHNTQRTITKFEEYRDS 284
++V +++L S+N+VEIIF SSW + P +++ I KV N + +T+FEEYR++
Sbjct: 91 ESVLPVLTDLPDGHPSRNVVEIIFQSSWSSDEFP-GRVEMIFKVENGSKAVTRFEEYREA 149
Query: 285 IKAKA-TKLPK---------KHPRCIADGNELLRFHCTTFVCSLGLNGSSNICNSTSQCN 334
+K+++ +K+ ++ RC ADGNE++RF
Sbjct: 150 VKSRSCSKVDSDRVDGSACDENARCSADGNEMMRFFPL-----------------GPIPG 192
Query: 335 VCSVIKHGFKFNRXXXXXXILTTATSGKAHDKASIAPEDDNDKRAMLVCRVIAGRVKKNT 394
+ GF + + T + SG+AH +RAML+CRVIAGRV K
Sbjct: 193 GINGGAWGFPGGK---GAAVCTFSGSGEAHASTG----GGGGRRAMLICRVIAGRVAKKG 245
Query: 395 EGGSGMMEEEYDSVAGDVGAYSNLDELYVFNPRAILPCFVVIYR 438
E GS DSVAG G EL VF+ RA+LPCF++ +R
Sbjct: 246 EFGS-------DSVAGRAG------ELIVFDARAVLPCFLIFFR 276