Miyakogusa Predicted Gene

Lj1g3v4692840.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v4692840.1 tr|G7IKD9|G7IKD9_MEDTR PIF-like protein
OS=Medicago truncatula GN=MTR_2g021280 PE=4
SV=1,34.18,8e-19,Myb_DNA-bind_3,Myb/SANT-like domain;
seg,NULL,gene.Ljchr1_pseudomol_20120830.path1.gene10411.1
         (260 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G05800.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    63   2e-10
AT5G05800.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    63   2e-10
AT3G11290.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    54   7e-08
AT2G24960.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    53   2e-07
AT2G24960.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    53   2e-07
AT1G30140.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    50   1e-06

>AT5G05800.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 24 plant
           structures; EXPRESSED DURING: 15 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT3G11290.1); Has 881 Blast hits to 512 proteins
           in 30 species: Archae - 0; Bacteria - 2; Metazoa - 0;
           Fungi - 38; Plants - 833; Viruses - 0; Other Eukaryotes
           - 8 (source: NCBI BLink). | chr5:1743234-1744751 REVERSE
           LENGTH=449
          Length = 449

 Score = 62.8 bits (151), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 39/143 (27%), Positives = 60/143 (41%), Gaps = 1/143 (0%)

Query: 61  STSKIPKASWDVQVTGYFVQACLIQVANGEHQGTNFTKKGWIGIEKQFNELSGRXXXXXX 120
           S++   K  W       F+   + +   G    T+F K+GW  I    NE +G       
Sbjct: 161 SSNPQTKGYWSPSTHKLFLDLLVQETLKGNRPDTHFNKEGWKTILGTINENTGLGYTRPQ 220

Query: 121 XXXXXXXXRRKWRTWYKVCGKETGVGWDHVKHNFDAPDEWWEKKIMENPNYEKLREKGLP 180
                   R+ W+ W ++ G  + + WD    +F A +E W   I ENP   + R K +P
Sbjct: 221 LKNHWDCTRKAWKIWCQLVGA-SSMKWDPESRSFGATEEEWRIYIRENPRAGQFRHKEVP 279

Query: 181 FAHDLTILYKDVVATGEHAWAPS 203
            A  L I++  V+  GE    PS
Sbjct: 280 HADQLAIIFNGVIEPGETYTPPS 302



 Score = 58.9 bits (141), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 36/123 (29%), Positives = 56/123 (45%), Gaps = 4/123 (3%)

Query: 66  PKASWDVQVTGYFVQACLIQVANGEHQGTNFTKKGWIGIEKQFNELSGRXXXXXXXXXXX 125
           PKA W+ +    FV  C+ Q   G   GT+F+K+GW  I   F E +G            
Sbjct: 3   PKAVWEPEYHRVFVDLCVEQTMLGNKPGTHFSKEGWRNILISFQEQTGAMYDRMQLKNHW 62

Query: 126 XXXRRKWRTWYKVCGKETGVGWDHVKHNFDAPDEWWEKKIMENPNYEKLREKGLPFAHDL 185
               R+W+ W ++  + + + W+   + F A D+ W   + ENP+  + R   L   HDL
Sbjct: 63  DTMSRQWKIWRRLV-ETSFMNWNPESNRFRATDDDWANYLQENPDAGQYR---LSVPHDL 118

Query: 186 TIL 188
             L
Sbjct: 119 KKL 121


>AT5G05800.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G11290.1); Has 1807 Blast hits to 1807 proteins
           in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
           Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
           - 339 (source: NCBI BLink). | chr5:1743234-1744751
           REVERSE LENGTH=449
          Length = 449

 Score = 62.8 bits (151), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 39/143 (27%), Positives = 60/143 (41%), Gaps = 1/143 (0%)

Query: 61  STSKIPKASWDVQVTGYFVQACLIQVANGEHQGTNFTKKGWIGIEKQFNELSGRXXXXXX 120
           S++   K  W       F+   + +   G    T+F K+GW  I    NE +G       
Sbjct: 161 SSNPQTKGYWSPSTHKLFLDLLVQETLKGNRPDTHFNKEGWKTILGTINENTGLGYTRPQ 220

Query: 121 XXXXXXXXRRKWRTWYKVCGKETGVGWDHVKHNFDAPDEWWEKKIMENPNYEKLREKGLP 180
                   R+ W+ W ++ G  + + WD    +F A +E W   I ENP   + R K +P
Sbjct: 221 LKNHWDCTRKAWKIWCQLVGA-SSMKWDPESRSFGATEEEWRIYIRENPRAGQFRHKEVP 279

Query: 181 FAHDLTILYKDVVATGEHAWAPS 203
            A  L I++  V+  GE    PS
Sbjct: 280 HADQLAIIFNGVIEPGETYTPPS 302



 Score = 58.9 bits (141), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 36/123 (29%), Positives = 56/123 (45%), Gaps = 4/123 (3%)

Query: 66  PKASWDVQVTGYFVQACLIQVANGEHQGTNFTKKGWIGIEKQFNELSGRXXXXXXXXXXX 125
           PKA W+ +    FV  C+ Q   G   GT+F+K+GW  I   F E +G            
Sbjct: 3   PKAVWEPEYHRVFVDLCVEQTMLGNKPGTHFSKEGWRNILISFQEQTGAMYDRMQLKNHW 62

Query: 126 XXXRRKWRTWYKVCGKETGVGWDHVKHNFDAPDEWWEKKIMENPNYEKLREKGLPFAHDL 185
               R+W+ W ++  + + + W+   + F A D+ W   + ENP+  + R   L   HDL
Sbjct: 63  DTMSRQWKIWRRLV-ETSFMNWNPESNRFRATDDDWANYLQENPDAGQYR---LSVPHDL 118

Query: 186 TIL 188
             L
Sbjct: 119 KKL 121


>AT3G11290.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G11310.1); Has 720 Blast hits to 435 proteins
           in 28 species: Archae - 0; Bacteria - 2; Metazoa - 0;
           Fungi - 32; Plants - 682; Viruses - 0; Other Eukaryotes
           - 4 (source: NCBI BLink). | chr3:3535766-3537295 REVERSE
           LENGTH=460
          Length = 460

 Score = 54.3 bits (129), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 35/140 (25%), Positives = 57/140 (40%), Gaps = 2/140 (1%)

Query: 58  SNMSTSKIPKASWDVQVTGYFVQACLIQVANGEHQGTNFTKKGWIGIEKQFNELSGRXXX 117
           SN S+ +  K  W       FV     +   G    +++ K+ W  I +  N+ +G+   
Sbjct: 157 SNFSSPQ-SKGYWSPSSHELFVDLLFQEALKGNRPDSHYPKETWKMILETINQNTGKSFT 215

Query: 118 XXXXXXXXXXXRRKWRTWYKVCGKETGVGWDHVKHNFDAPDEWWEKKIMENPNYEKLREK 177
                      R+ W+ W +V G    + WD     F A DE W+  + EN      R K
Sbjct: 216 RPQLKNHWDCTRKSWKIWCQVIGAPV-MKWDATSRTFGATDEDWKNYLKENHRAAPFRRK 274

Query: 178 GLPFAHDLTILYKDVVATGE 197
            LP A  L  ++K ++  G+
Sbjct: 275 QLPHADKLATIFKGLIEPGK 294


>AT2G24960.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G02210.2); Has 1453 Blast hits to 509 proteins
           in 26 species: Archae - 0; Bacteria - 0; Metazoa - 1;
           Fungi - 39; Plants - 1363; Viruses - 0; Other Eukaryotes
           - 50 (source: NCBI BLink). | chr2:10617263-10620034
           FORWARD LENGTH=797
          Length = 797

 Score = 53.1 bits (126), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 36/134 (26%), Positives = 56/134 (41%), Gaps = 9/134 (6%)

Query: 60  MSTSKIPKASWDVQVTGYFVQACLIQVANGEHQGTNFTKKGWIGIEKQFN-ELSGRXXXX 118
           +S  +  K  W +++  YFV+  + Q+  G   G  F+K+ WI +   FN   SG+    
Sbjct: 162 LSGKESSKTEWTLEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQYGKR 221

Query: 119 XXXXXXXXXXRRKWRTWYK---VCGKETGVGWDHVKHNFDAPDEWWEKKIMENPNYEKLR 175
                       K   +YK      KE G  WD  +    A D  W+  I ++P     R
Sbjct: 222 VLRHRY-----NKLLKYYKDMEAILKEDGFSWDETRLMISADDAVWDSYIKDHPLARTYR 276

Query: 176 EKGLPFAHDLTILY 189
            K LP  +DL  ++
Sbjct: 277 MKSLPSYNDLDTIF 290


>AT2G24960.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 21 plant
           structures; EXPRESSED DURING: 12 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT4G02210.2); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr2:10617263-10620034 FORWARD LENGTH=774
          Length = 774

 Score = 52.8 bits (125), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 36/134 (26%), Positives = 56/134 (41%), Gaps = 9/134 (6%)

Query: 60  MSTSKIPKASWDVQVTGYFVQACLIQVANGEHQGTNFTKKGWIGIEKQFN-ELSGRXXXX 118
           +S  +  K  W +++  YFV+  + Q+  G   G  F+K+ WI +   FN   SG+    
Sbjct: 162 LSGKESSKTEWTLEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQYGKR 221

Query: 119 XXXXXXXXXXRRKWRTWYK---VCGKETGVGWDHVKHNFDAPDEWWEKKIMENPNYEKLR 175
                       K   +YK      KE G  WD  +    A D  W+  I ++P     R
Sbjct: 222 VLRHRY-----NKLLKYYKDMEAILKEDGFSWDETRLMISADDAVWDSYIKDHPLARTYR 276

Query: 176 EKGLPFAHDLTILY 189
            K LP  +DL  ++
Sbjct: 277 MKSLPSYNDLDTIF 290


>AT1G30140.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G27260.1); Has 313 Blast hits to 256 proteins
           in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 8; Plants - 295; Viruses - 0; Other Eukaryotes -
           10 (source: NCBI BLink). | chr1:10598764-10599527
           FORWARD LENGTH=222
          Length = 222

 Score = 50.4 bits (119), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 24/66 (36%), Positives = 35/66 (53%)

Query: 143 TGVGWDHVKHNFDAPDEWWEKKIMENPNYEKLREKGLPFAHDLTILYKDVVATGEHAWAP 202
           +G GWD     F APDE W   +  +PN++ ++ + +    DL I++ DVVATG  A   
Sbjct: 88  SGFGWDPETKKFTAPDEVWRDYLKAHPNHKHMQTESIDHFEDLQIIFGDVVATGSFAVGM 147

Query: 203 SSGVMP 208
           S    P
Sbjct: 148 SDSTCP 153