Miyakogusa Predicted Gene

Lj1g3v0726890.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v0726890.1 Non Characterized Hit- tr|G7J3K8|G7J3K8_MEDTR
Putative uncharacterized protein OS=Medicago
truncatul,73.33,0,seg,NULL; ADP-ribosylation,NULL; no
description,NULL; SUBFAMILY NOT NAMED,NULL; FAMILY NOT
NAMED,NUL,CUFF.26222.1
         (440 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G75710.1 | Symbols:  | C2H2-like zinc finger protein | chr1:2...   299   2e-81
AT4G27240.1 | Symbols:  | zinc finger (C2H2 type) family protein...   236   2e-62
AT5G54630.1 | Symbols:  | zinc finger protein-related | chr5:221...   232   4e-61
AT2G29660.1 | Symbols:  | zinc finger (C2H2 type) family protein...   189   2e-48
AT1G11490.1 | Symbols:  | zinc finger (C2H2 type) family protein...   157   1e-38
AT1G62520.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   122   4e-28
AT4G22560.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   119   4e-27
AT4G12450.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   107   2e-23

>AT1G75710.1 | Symbols:  | C2H2-like zinc finger protein |
           chr1:28428806-28431128 FORWARD LENGTH=462
          Length = 462

 Score =  299 bits (766), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 158/293 (53%), Positives = 199/293 (67%), Gaps = 34/293 (11%)

Query: 172 FRAIPFRRLSGCYECRMVVDPVLGFTRDPSLRSSICSCPDCGEIM-KAESLEHHQAVKHA 230
           FRA+ FR+LSGCYEC M+VDP    +R P +   +C+C  CGE+  K ESLE HQAV+HA
Sbjct: 174 FRAMQFRKLSGCYECHMIVDP----SRYP-ISPRVCACSQCGEVFPKLESLELHQAVRHA 228

Query: 231 VSELGPEDTSKNIVEIIFHSSWLKKQSPVCKIDRILKVHNTQRTITKFEEYRDSIKAKAT 290
           VSELGPED+ +NIVEIIF SSWLKK SP+C+I+RILKVHNTQRTI +FE+ RD++KA+A 
Sbjct: 229 VSELGPEDSGRNIVEIIFKSSWLKKDSPICQIERILKVHNTQRTIQRFEDCRDAVKARAL 288

Query: 291 KLPKKHPRCIADGNELLRFHCTTFVCSLGLNGSSNICNSTSQCNVCSVIKHGFKFNRXXX 350
           +  +K  RC ADGNELLRFHCTT  CSLG  GSS++C++   C VC+VI+HGF+      
Sbjct: 289 QATRKDARCAADGNELLRFHCTTLTCSLGARGSSSLCSNLPVCGVCTVIRHGFQGKSGGG 348

Query: 351 XXXIL-----TTATSGKAHDKASIAPEDDNDKRAMLVCRVIAGRVKK------------- 392
              +      TTA+SG+A D    +   D+ +R MLVCRVIAGRVK+             
Sbjct: 349 GANVANAGVRTTASSGRADDLLRCS---DDARRVMLVCRVIAGRVKRVDLPAADASATAE 405

Query: 393 -------NTEGGSGMMEEEYDSVAGDVGAYSNLDELYVFNPRAILPCFVVIYR 438
                  N+  G       +DSVA + G YSNL+EL V+NPRAILPCFVVIY+
Sbjct: 406 KKSTVEDNSVVGVSSSGGTFDSVAVNAGVYSNLEELVVYNPRAILPCFVVIYK 458


>AT4G27240.1 | Symbols:  | zinc finger (C2H2 type) family protein |
           chr4:13640160-13641640 FORWARD LENGTH=431
          Length = 431

 Score =  236 bits (602), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 121/238 (50%), Positives = 166/238 (69%), Gaps = 7/238 (2%)

Query: 204 SSICSCPDCGE-IMKAESLEHHQAVKHAVSELGPEDTSKNIVEIIFHSSWLKKQSPVCKI 262
           +S  SC  CGE   K E+ E H   KHAV+EL   D+S+ IVEII  +SWLK ++   +I
Sbjct: 193 NSSVSCHKCGEKFSKLEAAEAHHLTKHAVTELMEGDSSRRIVEIICRTSWLKTENQGGRI 252

Query: 263 DRILKVHNTQRTITKFEEYRDSIKAKATKLPKKHPRCIADGNELLRFHCTTFVCSLGLNG 322
           DRILKVHN Q+T+ +FEEYRD++K +A+KL KKHPRCIADGNELLRFH TT  C+LG+NG
Sbjct: 253 DRILKVHNMQKTLARFEEYRDTVKIRASKLQKKHPRCIADGNELLRFHGTTVACALGING 312

Query: 323 SSNICNSTSQCNVCSVIKHGFKFNRXXXX-XXILTTATSGKAHDKASIAPEDDNDKRAML 381
           S+++C S+ +C VC +I++GF   R       + T +TS +A +   I      D++A++
Sbjct: 313 STSLC-SSEKCCVCRIIRNGFSAKREMNNGIGVFTASTSERAFESIVIGDGGGGDRKALI 371

Query: 382 VCRVIAGRVKK---NTEGGSGMMEEEYDSVAGDVGAYSNLDELYVFNPRAILPCFVVI 436
           VCRVIAGRV +   N E   G++   +DS+AG VG Y+N++ELY+ N RA+LPCFV+I
Sbjct: 372 VCRVIAGRVHRPVENVEEMGGLL-SGFDSLAGKVGLYTNVEELYLLNSRALLPCFVLI 428


>AT5G54630.1 | Symbols:  | zinc finger protein-related |
           chr5:22192607-22194260 REVERSE LENGTH=472
          Length = 472

 Score =  232 bits (591), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 118/244 (48%), Positives = 168/244 (68%), Gaps = 13/244 (5%)

Query: 204 SSICSCPDCGE-IMKAESLEHHQAVKHAVSELGPEDTSKNIVEIIFHSSWLKKQSPVCKI 262
           +S  SC  CGE   K E+ E H   KHAV+EL   D+S+ IVEII  +SWLK ++   +I
Sbjct: 228 NSSVSCHKCGEQFNKLEAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQCGRI 287

Query: 263 DRILKVHNTQRTITKFEEYRDSIKAKATKLPKKHPRCIADGNELLRFHCTTFVCSLGLNG 322
           DR+LKVHN Q+T+ +FEEYR+++K +A+KL KKHPRC+ADGNELLRFH TT  C LG+NG
Sbjct: 288 DRVLKVHNMQKTLARFEEYRETVKIRASKLQKKHPRCLADGNELLRFHGTTVACGLGING 347

Query: 323 SSNICNSTSQCNVCSVIKHGFKFNRXXXX-XXILTTATSGKAHDKASIAPEDDND----- 376
           S+++C +  +C VC +I++GF   R       + T +TSG+A +   +   D++      
Sbjct: 348 STSVC-TAEKCCVCRIIRNGFSSKREKNNGVGVFTASTSGRAFESILVNGGDESGDVDRT 406

Query: 377 -KRAMLVCRVIAGRVKK---NTEGGSGMMEEEYDSVAGDVGAYSNLDELYVFNPRAILPC 432
            ++ ++VCRVIAGRV +   N E  +G+M   +DS+AG VG Y+N++ELY+ NP+A+LPC
Sbjct: 407 VRKVLIVCRVIAGRVHRPVENVEEMNGLM-SGFDSLAGKVGLYTNVEELYLLNPKALLPC 465

Query: 433 FVVI 436
           FVVI
Sbjct: 466 FVVI 469


>AT2G29660.1 | Symbols:  | zinc finger (C2H2 type) family protein |
           chr2:12679346-12680467 FORWARD LENGTH=373
          Length = 373

 Score =  189 bits (481), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 115/254 (45%), Positives = 150/254 (59%), Gaps = 29/254 (11%)

Query: 206 ICSCPDCGEIM-KAESLEHHQAVKHAVSELGPEDTSKNIVEIIFHSSWLKK---QSPVCK 261
           I  C  CGEI  K   LE+H A+KHAVSEL   ++S NIV+IIF S W ++   +SPV  
Sbjct: 125 IFPCNSCGEIFPKINLLENHIAIKHAVSELIAGESSTNIVKIIFKSGWPEQGNYKSPV-- 182

Query: 262 IDRILKVHNTQRTITKFEEYRDSIKAKATKLPK-----KHPRCIADGNELLRFHCTTFVC 316
           I+RILK+HN+ + +T+FEEYR+ +KAKA +           RC+ADGNELLRF+C+TF+C
Sbjct: 183 INRILKIHNSSKILTRFEEYREFVKAKAARSNGGGRRWDDERCVADGNELLRFYCSTFMC 242

Query: 317 SLGLNGSSNICNSTSQCNVCSVIKHGFKFNRXXXXXXILTTATSGKAHDKASIAPEDD-- 374
            LG NG SN+C     C++C +I  GF          I T AT  + H       E++  
Sbjct: 243 DLGQNGKSNLCGH-QYCSICGIIGSGFS----PKLDGIATLATGWRGHVAVPEEVEEEFG 297

Query: 375 --NDKRAMLVCRVIAGRV---KKNTEGGSGMMEEEYDSVAGDVGAYSNL------DELYV 423
             N KRAMLVCRV+AGRV     + +         YDS+ G  G  S        DEL V
Sbjct: 298 FMNVKRAMLVCRVVAGRVGCDLIDDDDVDKSDGGGYDSLVGQSGNKSGALLRIDDDELLV 357

Query: 424 FNPRAILPCFVVIY 437
           FNPRA+LPCFV++Y
Sbjct: 358 FNPRAVLPCFVIVY 371


>AT1G11490.1 | Symbols:  | zinc finger (C2H2 type) family protein |
           chr1:3868884-3870065 REVERSE LENGTH=365
          Length = 365

 Score =  157 bits (398), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 95/244 (38%), Positives = 139/244 (56%), Gaps = 16/244 (6%)

Query: 206 ICSCPDCGE-IMKAESLEHHQAVKHAVSELGPEDTSKNIVEIIFHSSWLKKQSPV--CKI 262
           + +C  C E +   ++ E H    H+V  L   D S+  VE+I ++ +  K   +    I
Sbjct: 126 VLACQKCHERVRDLDAFEAHYLSNHSVVRLLAGDFSRTTVELICNTGYSHKLGKMKGNNI 185

Query: 263 DRILKVHNTQRTITKFEEYRDSIKAKATKLPKKHPRCIADGNELLRFHCTTFVCSLGL-N 321
             I K+ N QR +  FE+YR+ +K +A KL KKH RC+ADGNE L FH TT  C+LG  N
Sbjct: 186 SAIFKIQNLQRVVADFEDYRELVKIRANKLSKKHSRCMADGNEFLGFHGTTLSCTLGFSN 245

Query: 322 GSSNICNSTSQCNVCSVIKHGFK-FNRXXXXXXILTTATSGKAHDKASIAPEDDNDKR-- 378
            SSN+C S   C VC +++HGF    R      +LT +TS  A +  SI  +   ++   
Sbjct: 246 SSSNLCFS-DHCEVCHILRHGFSPKTRPDGIKGVLTASTSSTALE--SIETDQGRNRGSL 302

Query: 379 -AMLVCRVIAGRVKK---NTEGGSGMMEEEYDSVAGDVGAYSNLDELYVFNPRAILPCFV 434
            A+++CRVIAGRV K     E   G    E+DS+A  VG  S ++ELY+ + +A+LPCFV
Sbjct: 303 IAVVLCRVIAGRVHKPMQTFENSLGF--SEFDSLALKVGQNSRIEELYLLSTKALLPCFV 360

Query: 435 VIYR 438
           +I++
Sbjct: 361 IIFK 364


>AT1G62520.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G12450.1); Has 388 Blast hits to 388 proteins
           in 26 species: Archae - 0; Bacteria - 1; Metazoa - 0;
           Fungi - 8; Plants - 376; Viruses - 0; Other Eukaryotes -
           3 (source: NCBI BLink). | chr1:23144506-23145348 FORWARD
           LENGTH=280
          Length = 280

 Score =  122 bits (307), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 80/211 (37%), Positives = 114/211 (54%), Gaps = 34/211 (16%)

Query: 231 VSELGPEDTSKNIVEIIFHSSWLKKQSPVC-KIDRILKVHNTQRTITKFEEYRDSIKAKA 289
           ++EL     S+N+VEIIF +SW  K  P   +++ I KV N  +T+T+FEEYR+++KA++
Sbjct: 100 LTELSEGHQSRNVVEIIFQTSWGPK--PFSGRVEMIFKVQNGSKTLTRFEEYREAVKARS 157

Query: 290 T-KLPKKHPRCIADGNELLRFHCTTFVCSLGLNGSSNICNSTSQCNVCSVIKHGFKFNRX 348
             K  +++ R +ADGNE +RF+C     S G  GS+                        
Sbjct: 158 VGKAREENARSVADGNETMRFYC--LGPSYGGGGSAWGILGGKGGGAS------------ 203

Query: 349 XXXXXILTTATSGKAHDKASIAPEDDNDKRAMLVCRVIAGRVKKNTE-GGSGMMEEEYDS 407
                I T A S  A++KA         ++AMLVCRVIAGRV K  E      +   +DS
Sbjct: 204 -----IYTFAGSSTANEKAG----GGKGRKAMLVCRVIAGRVTKQNELKYDSDLRSRFDS 254

Query: 408 VAGDVGAYSNLDELYVFNPRAILPCFVVIYR 438
           V+GD G      EL VF+ RA+LPCF++IYR
Sbjct: 255 VSGDDG------ELLVFDTRAVLPCFLIIYR 279


>AT4G22560.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G12450.1); Has 380 Blast hits to 380 proteins
           in 21 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 6; Plants - 374; Viruses - 0; Other Eukaryotes -
           0 (source: NCBI BLink). | chr4:11880178-11880972 FORWARD
           LENGTH=264
          Length = 264

 Score =  119 bits (299), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 77/213 (36%), Positives = 114/213 (53%), Gaps = 45/213 (21%)

Query: 230 AVSELGPEDTSKNIVEIIFHSSWLKKQSPVCKIDRILKVHNTQRTITKFEEYRDSIKAKA 289
           A++EL     S+N+VEIIFHSSW   + P  +I+ I KV +  RT+T+FEEYR+ +K++A
Sbjct: 92  ALTELPDGHPSRNVVEIIFHSSWSSDEFP-GRIEMIFKVEHGSRTVTRFEEYREVVKSRA 150

Query: 290 ----TKLPKKHPRCIADGNELLRFHCTTFVCSLGLNGSSNICNSTSQCNVCSVIKHGFKF 345
                   ++  RC+ADGNE++RF+        G NG + +        VC         
Sbjct: 151 GFNGGTCEEEDARCLADGNEMMRFYPVL----DGFNGGACVFAGGKGQAVC--------- 197

Query: 346 NRXXXXXXILTTATSGKAHDKASIAPEDDNDKRAMLVCRVIAGRVKKNTEGGSGMMEEEY 405
                     T + SG+A+    ++      ++AM++CRVIAGRV      GS       
Sbjct: 198 ----------TFSGSGEAY----VSSGGGGGRKAMMICRVIAGRVDDVIGFGS------- 236

Query: 406 DSVAGDVGAYSNLDELYVFNPRAILPCFVVIYR 438
           DSVAG  G      EL+VF+ RA+LPCF++I+R
Sbjct: 237 DSVAGRDG------ELFVFDTRAVLPCFLIIFR 263


>AT4G12450.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G22560.1); Has 380 Blast hits to 380 proteins
           in 23 species: Archae - 0; Bacteria - 0; Metazoa - 1;
           Fungi - 4; Plants - 374; Viruses - 0; Other Eukaryotes -
           1 (source: NCBI BLink). | chr4:7385841-7386674 REVERSE
           LENGTH=277
          Length = 277

 Score =  107 bits (266), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 74/224 (33%), Positives = 112/224 (50%), Gaps = 48/224 (21%)

Query: 225 QAVKHAVSELGPEDTSKNIVEIIFHSSWLKKQSPVCKIDRILKVHNTQRTITKFEEYRDS 284
           ++V   +++L     S+N+VEIIF SSW   + P  +++ I KV N  + +T+FEEYR++
Sbjct: 91  ESVLPVLTDLPDGHPSRNVVEIIFQSSWSSDEFP-GRVEMIFKVENGSKAVTRFEEYREA 149

Query: 285 IKAKA-TKLPK---------KHPRCIADGNELLRFHCTTFVCSLGLNGSSNICNSTSQCN 334
           +K+++ +K+           ++ RC ADGNE++RF                         
Sbjct: 150 VKSRSCSKVDSDRVDGSACDENARCSADGNEMMRFFPL-----------------GPIPG 192

Query: 335 VCSVIKHGFKFNRXXXXXXILTTATSGKAHDKASIAPEDDNDKRAMLVCRVIAGRVKKNT 394
             +    GF   +      + T + SG+AH            +RAML+CRVIAGRV K  
Sbjct: 193 GINGGAWGFPGGK---GAAVCTFSGSGEAHASTG----GGGGRRAMLICRVIAGRVAKKG 245

Query: 395 EGGSGMMEEEYDSVAGDVGAYSNLDELYVFNPRAILPCFVVIYR 438
           E GS       DSVAG  G      EL VF+ RA+LPCF++ +R
Sbjct: 246 EFGS-------DSVAGRAG------ELIVFDARAVLPCFLIFFR 276