Miyakogusa Predicted Gene

Lj1g3v2927350.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v2927350.1 Non Chatacterized Hit- tr|G7JNM1|G7JNM1_MEDTR
Putative uncharacterized protein OS=Medicago
truncatul,34.05,4e-18,seg,NULL; coiled-coil,NULL,CUFF.29702.1
         (476 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT4G37440.2 | Symbols:  | unknown protein; LOCATED IN: cellular_...   153   3e-37
AT4G37440.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   152   3e-37
AT3G59670.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   147   1e-35
AT3G50040.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   108   9e-24

>AT4G37440.2 | Symbols:  | unknown protein; LOCATED IN:
           cellular_component unknown; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT3G50040.1);
           Has 121 Blast hits to 117 proteins in 32 species: Archae
           - 0; Bacteria - 6; Metazoa - 13; Fungi - 5; Plants - 66;
           Viruses - 0; Other Eukaryotes - 31 (source: NCBI BLink).
           | chr4:17601647-17603766 FORWARD LENGTH=444
          Length = 444

 Score =  153 bits (386), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 98/269 (36%), Positives = 142/269 (52%), Gaps = 13/269 (4%)

Query: 78  IDNTECXXXXXXXDTGSGVETDS-GSAXGLTDSEGE------SSVSDDWSEPLLFRKKKL 130
           +D  EC        +G    TD   S+ G TDSE E      S + ++ S PL  RK+KL
Sbjct: 73  VDILECNDNIEIQVSGCDDGTDGYSSSFGGTDSEHENDQEVDSMICNETSLPLWVRKRKL 132

Query: 131 TDHWRKFIRP-LMWRCKWIELHVKKLNSQALKYEKELAEYDYRKQLEFLKFSIDDFGVKS 189
           TDHWR+F++P LMWRCKWIEL  K+L +QA KY+KE+ EY   K+LE      ++ GVK+
Sbjct: 133 TDHWRRFVQPTLMWRCKWIELKYKELQNQAQKYDKEVEEYYQAKKLELENVKSEELGVKA 192

Query: 190 V-PVSDSIYRNRVMXXXXXXXAEE-CDLSSYVSNHNIFSYYENKNCRHDAGLEDFHSDAV 247
           + P+     + R+M        EE  D++SY SNHN+FSYY+ +    D  L D   ++ 
Sbjct: 193 LPPLPCYTQKTRLMKRKTRKRVEETADVTSYASNHNLFSYYDCRKSLADIALND---NSR 249

Query: 248 AIPMSNVDNIEELKLNDMLSSLHREDNDKSFNDILQKIEKLQSQVGNLKTRIDNVISENP 307
            +   N    +E   ++    L   + D     IL KIE  +S+  NLK R+D V+SENP
Sbjct: 250 NLDKKNKSAKDETAFSEETPPLEFREGDAYLEQILLKIEAAKSEARNLKIRVDKVLSENP 309

Query: 308 GNFCYVSQLSMIEPSDGFNHSGHGSASLA 336
             F   + ++ +  +D +  S      LA
Sbjct: 310 SIFPLANTVNPLGAADVYTSSEQQKPLLA 338


>AT4G37440.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G50040.1); Has 220 Blast hits to 205 proteins
           in 55 species: Archae - 0; Bacteria - 15; Metazoa - 50;
           Fungi - 11; Plants - 76; Viruses - 3; Other Eukaryotes -
           65 (source: NCBI BLink). | chr4:17601647-17603846
           FORWARD LENGTH=471
          Length = 471

 Score =  152 bits (385), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 98/269 (36%), Positives = 142/269 (52%), Gaps = 13/269 (4%)

Query: 78  IDNTECXXXXXXXDTGSGVETDS-GSAXGLTDSEGE------SSVSDDWSEPLLFRKKKL 130
           +D  EC        +G    TD   S+ G TDSE E      S + ++ S PL  RK+KL
Sbjct: 73  VDILECNDNIEIQVSGCDDGTDGYSSSFGGTDSEHENDQEVDSMICNETSLPLWVRKRKL 132

Query: 131 TDHWRKFIRP-LMWRCKWIELHVKKLNSQALKYEKELAEYDYRKQLEFLKFSIDDFGVKS 189
           TDHWR+F++P LMWRCKWIEL  K+L +QA KY+KE+ EY   K+LE      ++ GVK+
Sbjct: 133 TDHWRRFVQPTLMWRCKWIELKYKELQNQAQKYDKEVEEYYQAKKLELENVKSEELGVKA 192

Query: 190 V-PVSDSIYRNRVMXXXXXXXAEE-CDLSSYVSNHNIFSYYENKNCRHDAGLEDFHSDAV 247
           + P+     + R+M        EE  D++SY SNHN+FSYY+ +    D  L D   ++ 
Sbjct: 193 LPPLPCYTQKTRLMKRKTRKRVEETADVTSYASNHNLFSYYDCRKSLADIALND---NSR 249

Query: 248 AIPMSNVDNIEELKLNDMLSSLHREDNDKSFNDILQKIEKLQSQVGNLKTRIDNVISENP 307
            +   N    +E   ++    L   + D     IL KIE  +S+  NLK R+D V+SENP
Sbjct: 250 NLDKKNKSAKDETAFSEETPPLEFREGDAYLEQILLKIEAAKSEARNLKIRVDKVLSENP 309

Query: 308 GNFCYVSQLSMIEPSDGFNHSGHGSASLA 336
             F   + ++ +  +D +  S      LA
Sbjct: 310 SIFPLANTVNPLGAADVYTSSEQQKPLLA 338


>AT3G59670.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G37440.2); Has 77 Blast hits to 77 proteins in
           14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
           - 0; Plants - 73; Viruses - 0; Other Eukaryotes - 4
           (source: NCBI BLink). | chr3:22040485-22042380 FORWARD
           LENGTH=517
          Length = 517

 Score =  147 bits (372), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 97/259 (37%), Positives = 141/259 (54%), Gaps = 21/259 (8%)

Query: 117 DDWSEPLLFRKKKLTDHWRKFIRPLMWRCKWIELHVKKLNSQALKYEKELAEYDYRKQLE 176
           D +S    FRKK+LT+HWR+FIRPLMWR KW+EL +++L S+AL+Y KEL  YD  K   
Sbjct: 122 DSFSSIFHFRKKRLTNHWRRFIRPLMWRSKWVELRIRELESRALEYPKELELYDQEK--- 178

Query: 177 FLKFSIDDF-------GVKSVPVSDSIYRNRVMXXXXXXXAEEC--DLSSYVSNHNIFSY 227
            L+ +ID         G+KS+P S+  Y+ R           E   D++SY++ HN+FSY
Sbjct: 179 -LEANIDPSVLESCGEGIKSLPFSNPCYKKRAAKKRRKRKKVESTDDIASYMACHNLFSY 237

Query: 228 YENKNCRHDA-GLEDFHSDAVAIPMSNVDNIEELKLNDMLSSLHREDNDKSFNDILQKIE 286
            E K    D  GL D   DA   P S  D+ E + L+D  S  H  D D    ++L KIE
Sbjct: 238 IETKRLSSDGMGLADDFGDA-KDPRS--DSNEPVDLDDADSLFHHRDGDSVLEEVLWKIE 294

Query: 287 KLQSQVGNLKTRIDNVISENPGNFCYVSQLSMIEPSDGFNHSGHGSASLAGNENQLPVSF 346
            + SQV  LKT++D V+S+N   F     LS++  S   +     + S  GN + +    
Sbjct: 295 LVHSQVHRLKTQVDVVLSKNTARFSSSENLSLLAASSAPS----PTVSAGGNGDVISFGA 350

Query: 347 IHASSQHKSELYVEDQLLA 365
           I+ +SQH ++  + D + +
Sbjct: 351 IYNASQHMADYGLGDIVFS 369


>AT3G50040.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G37440.2); Has 70 Blast hits to 70 proteins in
           12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
           - 0; Plants - 70; Viruses - 0; Other Eukaryotes - 0
           (source: NCBI BLink). | chr3:18549489-18551019 REVERSE
           LENGTH=421
          Length = 421

 Score =  108 bits (270), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 77/270 (28%), Positives = 128/270 (47%), Gaps = 17/270 (6%)

Query: 115 VSDDWSEPLLFRKKKLTDHWRKFIRPLMWRCKWIELHVKKLNSQALKYEKELAEYDYRKQ 174
             DD +E L   KKK  D WR+  +P+MWRCKWIEL VK++ SQA  YEKE+ +Y   KQ
Sbjct: 96  TCDDGTEFLGLPKKKTNDRWRRLTKPIMWRCKWIELKVKEIQSQARGYEKEVKDYYLTKQ 155

Query: 175 LEFLKFSIDDFGVKSVPVSDSIYRNRVMXXXXXXXAEE-CDLSSYVSNHNIFSYYENK-- 231
            +  K  ++ F  KS+P  ++  R  V         EE  D+++Y+SNHN+FSY + +  
Sbjct: 156 FDLEKSKLEGFDGKSIPFRENNQRRNVFKRGRRKRVEETTDVAAYMSNHNLFSYADKRVP 215

Query: 232 -NCRHDAGLEDFHSDAVAIPMSNVDNIEELKLNDMLSSLHREDNDKSFNDILQKIEKLQS 290
            N +      DF +   A      D IE+   + ++S L  + +D      L KI++ Q 
Sbjct: 216 VNVKGQYLDSDFGTGRKAT--GKQDAIED---DSLISEL--DCSDDVLAKFLCKIDEAQG 268

Query: 291 QVGNLKTRIDNVI-SENPGNFCYVSQLSMIEPSDGFNHSGHGSASLAGNENQLPVSFIHA 349
           +   L+ R+D ++    P +   + Q+      D    +G   A +     + P++ +  
Sbjct: 269 KARRLRKRVDQLMWDSQPAHTSSMPQMVAPCHRDSMIQTGKKCALV-----EAPLTHVQN 323

Query: 350 SSQHKSELYVEDQLLAKNTLSTLEANTNRP 379
             Q     ++E  ++ +  +       N P
Sbjct: 324 GQQCIPADHIEHLMVPQTHIGGQCLTNNSP 353