Miyakogusa Predicted Gene

Lj3g3v2062690.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj3g3v2062690.1 Non Characterized Hit- tr|D7M093|D7M093_ARALL
Putative uncharacterized protein OS=Arabidopsis
lyrata,42.75,4e-18,seg,NULL,CUFF.43552.1
         (255 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT2G24100.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   267   6e-72
AT4G30780.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   261   3e-70
AT1G54300.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   120   8e-28
AT3G05770.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   117   9e-27

>AT2G24100.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G30780.1); Has 101 Blast hits to 101 proteins
           in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 95; Viruses - 0; Other Eukaryotes -
           6 (source: NCBI BLink). | chr2:10244921-10246715 FORWARD
           LENGTH=466
          Length = 466

 Score =  267 bits (682), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 132/222 (59%), Positives = 158/222 (71%), Gaps = 24/222 (10%)

Query: 31  NFPGTVLKIGTWEYKSRYEGDLVAKCYFAKHKLVWEVLDGCLKNKIEIPWSDIMALKANY 90
           NFP T+L+IG WEYKSRYEGDLVAKCYFAKHKLVWEVL+  LK+KIEI WSDIMALKAN 
Sbjct: 113 NFPATILRIGQWEYKSRYEGDLVAKCYFAKHKLVWEVLEQGLKSKIEIQWSDIMALKANL 172

Query: 91  PEDAPGTLEVVLARRPLFFREINPQPRKHTLWQATSDFTGGQASIQRRHFMQCPQGLLGK 150
           PED PGTL +VLARRPLFFRE NPQPRKHTLWQATSDFT GQAS+ R+HF+QCP G++ K
Sbjct: 173 PEDEPGTLTIVLARRPLFFRETNPQPRKHTLWQATSDFTDGQASMNRQHFLQCPPGIMNK 232

Query: 151 HFEKLIQCDPRLNFLSQQPELVLESPYFESGTAIHDHIESSDGFDSKSEEQPSLFGLHEV 210
           HFEKL+QCD RL  LS+QPE+ L +P+F+S  +I               E PS+ G H +
Sbjct: 233 HFEKLVQCDHRLFCLSRQPEINLAAPFFDSRLSIF--------------EDPSVSGSHNI 278

Query: 211 EXXXXXXXXXXXXEHNLMGKAVENVSQEITSPSTVMNSHAIK 252
                        EH        ++S +  SPS+VM++ AI+
Sbjct: 279 ---ASPVGAQSSSEH-------VSLSHDALSPSSVMDARAIE 310


>AT4G30780.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G24100.1); Has 109 Blast hits to 109 proteins
           in 18 species: Archae - 0; Bacteria - 0; Metazoa - 1;
           Fungi - 0; Plants - 95; Viruses - 0; Other Eukaryotes -
           13 (source: NCBI BLink). | chr4:14990523-14992855
           FORWARD LENGTH=589
          Length = 589

 Score =  261 bits (668), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 118/163 (72%), Positives = 138/163 (84%)

Query: 31  NFPGTVLKIGTWEYKSRYEGDLVAKCYFAKHKLVWEVLDGCLKNKIEIPWSDIMALKANY 90
           NFP ++LKIG WEYKSRYEGDLVAKCYFAKHKLVWEVL+  LK+KIEI WSDIMALKAN 
Sbjct: 140 NFPASLLKIGQWEYKSRYEGDLVAKCYFAKHKLVWEVLEQGLKSKIEIQWSDIMALKANC 199

Query: 91  PEDAPGTLEVVLARRPLFFREINPQPRKHTLWQATSDFTGGQASIQRRHFMQCPQGLLGK 150
           PED PGTL +VLAR+PLFFRE NPQPRKHTLWQATSDFT GQAS+ R+HF+QC QG++ K
Sbjct: 200 PEDGPGTLTLVLARQPLFFRETNPQPRKHTLWQATSDFTDGQASMNRQHFLQCAQGIMNK 259

Query: 151 HFEKLIQCDPRLNFLSQQPELVLESPYFESGTAIHDHIESSDG 193
           HFEKL+QCD RL  LS+QPE+ ++SPYF++  +I +    S G
Sbjct: 260 HFEKLVQCDHRLFHLSRQPEIAIDSPYFDARQSIFEDPSESKG 302


>AT1G54300.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G05770.1); Has 107 Blast hits to 107 proteins
           in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 94; Viruses - 0; Other Eukaryotes -
           13 (source: NCBI BLink). | chr1:20270810-20272009
           FORWARD LENGTH=314
          Length = 314

 Score =  120 bits (301), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 71/160 (44%), Positives = 93/160 (58%), Gaps = 10/160 (6%)

Query: 31  NFPGTVLKIGTWEYKSRYEGDLVAKCYFAKHKLVWEVLDG-------CLKNKIEIPWSDI 83
           NFP + ++IG W   ++   D+VAK YFAK KL+WE L G        LK KIEI W+D+
Sbjct: 2   NFPISTIRIGGWVVVAKNPDDIVAKFYFAKKKLIWEFLFGEPETNTLRLKRKIEIQWNDV 61

Query: 84  MALKANY-PEDAPGTLEVVLARRPLFFREINPQPRKHTLW-QATSDFTGGQASIQRRHFM 141
            + + +    D  G L++ L +RP FF E NPQ  KHT W Q   DFTG  AS  RRH +
Sbjct: 62  SSFEESISSRDETGILKIELKKRPTFFIETNPQAGKHTQWKQLDHDFTGDHASNYRRHTL 121

Query: 142 QCPQGLLGKHFEKLIQCDPRLNFLSQQPELVLESPYFESG 181
             P G+L K+ EKL+  D   + L + P  V ES YF+SG
Sbjct: 122 HFPPGVLQKNLEKLV-TDSFWSKLYEVPFPVHESRYFDSG 160


>AT3G05770.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G54300.1); Has 105 Blast hits to 105 proteins
           in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 99; Viruses - 0; Other Eukaryotes -
           6 (source: NCBI BLink). | chr3:1710328-1712165 REVERSE
           LENGTH=410
          Length = 410

 Score =  117 bits (292), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 71/160 (44%), Positives = 94/160 (58%), Gaps = 10/160 (6%)

Query: 31  NFPGTVLKIGTWEYKSRYEGDLVAKCYFAKHKLVWEVLDG-------CLKNKIEIPWSDI 83
           NFP + +KIG   + ++   D+VAK YFAK KL+WE L G        LK+KIEI W+D+
Sbjct: 75  NFPISTIKIGDCVFVAKNPDDIVAKFYFAKKKLLWEFLFGEPVANMPRLKSKIEIQWNDV 134

Query: 84  MALKANY-PEDAPGTLEVVLARRPLFFREINPQPRKHTLW-QATSDFTGGQASIQRRHFM 141
            + + +    D  G L++ L +RP FF E NPQ  KHT W Q   DFTG QAS  RRH +
Sbjct: 135 SSFEESINSRDETGILKIELKKRPTFFTETNPQAGKHTQWKQLDYDFTGDQASYYRRHTL 194

Query: 142 QCPQGLLGKHFEKLIQCDPRLNFLSQQPELVLESPYFESG 181
             P G+L K+ EKL+  D   + L + P  V ES YF+ G
Sbjct: 195 HFPPGVLQKNLEKLL-TDSFWSKLYKVPFPVHESLYFDIG 233