Miyakogusa Predicted Gene

Lj1g3v1465200.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v1465200.1 Non Characterized Hit- tr|I1N7W1|I1N7W1_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.27482
PE,29.51,0.000000000005,OS09G0509400 PROTEIN,NULL; FAMILY NOT
NAMED,NULL; seg,NULL; DUF4210,Domain of unknown function
DUF42,CUFF.27351.1
         (499 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G41110.1 | Symbols:  | unknown protein; INVOLVED IN: biologic...   124   2e-28
AT3G26890.5 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   117   2e-26
AT3G26890.4 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   117   2e-26
AT3G26890.3 | Symbols:  | unknown protein; BEST Arabidopsis thal...   117   2e-26
AT3G26890.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...   117   2e-26
AT3G26890.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   117   2e-26
AT3G26890.6 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   110   2e-24

>AT5G41110.1 | Symbols:  | unknown protein; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast;
           BEST Arabidopsis thaliana protein match is: unknown
           protein (TAIR:AT3G26890.5); Has 1807 Blast hits to 1807
           proteins in 277 species: Archae - 0; Bacteria - 0;
           Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0;
           Other Eukaryotes - 339 (source: NCBI BLink). |
           chr5:16452598-16454989 FORWARD LENGTH=621
          Length = 621

 Score =  124 bits (311), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 131/411 (31%), Positives = 185/411 (45%), Gaps = 58/411 (14%)

Query: 94  VNTEQTPTSRTVGFRIRASTPCVNGFGGNGYSSSVFNATNDATEA---SKSQVRKRLLSP 150
           VN  +T   R VGF     +        N Y+S   N T  + E      S VRKR+ SP
Sbjct: 93  VNLSRTEVRRIVGFESNEVSSV-----SNEYTSITVNRTGGSNEVDIIGGSLVRKRVSSP 147

Query: 151 LNGMMLADHFRGDPLDIGEGNHRTCPKVGDDCHNAIHEYKKVHIGDD---INIQPTIWSS 207
           LN  +  + FRGD LDI   N    P          H    + +G D    NI      S
Sbjct: 148 LN-KLFPEKFRGDSLDISHSNQ---PMTSAGLPKGFH----IPVGQDHKKANI------S 193

Query: 208 AYFQEFVNSSCNDSGMNKNKIVXXXXXXXXXXXXXMCYKHLKSSSELNDSKERTKLKSLP 267
              + F  S+C++ G + +                     L  S++L  S++   L S P
Sbjct: 194 GRLRLFTTSNCSEWGNDTSHT-------GKLSSTVFTDGPLLDSNDLQPSQDVHCLYS-P 245

Query: 268 V--ALSIPQKKVS-------SPRFPLSPLGKKSSTNEK-LGGCRTIDVMLDDGNLSFKDV 317
           V     +P K +        SP   LSPLG + S   K L G    ++  DD  L     
Sbjct: 246 VHETFQVPNKPLPCHRNISVSPPLSLSPLGPRFSERMKALQGGLNGNIFEDDVCLKNTGE 305

Query: 318 EESLDKTCQGFKSQLYYNSMQQKSDMFTRDKIIDMNDYWTRPASFPPKHANFCGTMSRLP 377
           E  L    + F      N +Q+    F+ D+ I+     + P S P K   F  ++S  P
Sbjct: 306 EAELRTGHRSFDDT---NGIQR---AFSMDRAIE-----SVPTS-PCKR--FSRSLSGRP 351

Query: 378 SRRSLIXXXXXXXXXXXXXXXKVSQKIEGFLAVLNVTGGNFSPQSQKVPFGVSSVQGDKY 437
            +RSL+               + +QKI+GFLA+L++ GGN SP+SQK+PF V+SV  D +
Sbjct: 352 IQRSLVGSFEESLFSGRLSYGQANQKIDGFLAILSIAGGNISPKSQKLPFSVTSVGDDCF 411

Query: 438 LLYYSSINLS-GKLSSKSRVSKFQRTLSMDESRSEKSRIRIPMKGRIQLVL 487
           LLYY+SI+LS G L SK    K +   +  ++++   R+RIPMKGRIQLVL
Sbjct: 412 LLYYASIDLSGGSLPSKLWGQKLKTNQNKSDAQTINKRLRIPMKGRIQLVL 462


>AT3G26890.5 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; EXPRESSED IN: 24 plant
           structures; EXPRESSED DURING: 15 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT5G41110.1); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr3:9907456-9910463 REVERSE LENGTH=649
          Length = 649

 Score =  117 bits (292), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 115/359 (32%), Positives = 175/359 (48%), Gaps = 48/359 (13%)

Query: 143 VRKRLLSPLNGMMLADHFRGDPLDIGEGNHRTCPKVGDD---CHNAIHEYKKVHIGDDIN 199
           VRKR+LSPLN  +    FRGD  DI  GNH+     G     C++   ++ K +    ++
Sbjct: 144 VRKRVLSPLN-TLFPVKFRGDLHDISCGNHQQITYSGQSNGFCNSVAQDHIKANTPSRLH 202

Query: 200 IQPT-IWSSAYFQEFVNSSCNDSGM--NKNKIVXXXXXXXXXXXXXMC-YKHLKSSSELN 255
           +  T   +S +  + V++S   S M      ++             +C Y  L  +S   
Sbjct: 203 LSTTPTTTSCWEWKNVSNSGRLSSMVFTDGPLLDSVDLRQPVKGGEVCLYSPLYETSS-- 260

Query: 256 DSKERTKLKSLPVALSIPQKKVS-SPRFPLSPLGKKSSTNEK-LGGCRTIDVMLDDGNLS 313
                T  K LP       K++S SP   LSPLG K S   K +  C+T  ++ D  N+S
Sbjct: 261 -----TPNKPLPC-----DKEISVSPPLCLSPLGPKFSERVKAVRSCQTGKILEDLRNIS 310

Query: 314 FKDVEESLDKTCQGFKSQLYYNSMQQKSDMFTRDKIIDMNDYWTRPASFPPKHANFCGTM 373
            ++ E  +D+       +L+        D +   +   M        + P K   F  ++
Sbjct: 311 -EEAELRVDR-------RLF-------DDAYAIRRAFSMERSTESAPASPCKR--FIRSL 353

Query: 374 SRLPSRRSLIXXXXXXXXXXXXXXXKVSQKIEGFLAVLNVTGGNFSPQSQKVPFGVSSVQ 433
           S  P +RSL+                 +QKI+GFLAVL++ GGN SP+SQK+PF V+S  
Sbjct: 354 SGRPIQRSLVGSFEESLLTGRLSCGPTNQKIDGFLAVLSIAGGNISPKSQKLPFSVTSAG 413

Query: 434 GDKYLLYYSSINLSGKLSSKSRVSKFQ----RTLSMD-ESRSEKSRIRIPMKGRIQLVL 487
            D  LLYY+SI+L+G     S+++KF     +T  M+ +++S KS++RIPMKGRIQLVL
Sbjct: 414 DDCLLLYYASIDLAG----GSKLNKFWGQKVKTSQMNSDAQSSKSQLRIPMKGRIQLVL 468


>AT3G26890.4 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; EXPRESSED IN: 24 plant
           structures; EXPRESSED DURING: 15 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT5G41110.1); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr3:9907456-9910463 REVERSE LENGTH=649
          Length = 649

 Score =  117 bits (292), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 115/359 (32%), Positives = 175/359 (48%), Gaps = 48/359 (13%)

Query: 143 VRKRLLSPLNGMMLADHFRGDPLDIGEGNHRTCPKVGDD---CHNAIHEYKKVHIGDDIN 199
           VRKR+LSPLN  +    FRGD  DI  GNH+     G     C++   ++ K +    ++
Sbjct: 144 VRKRVLSPLN-TLFPVKFRGDLHDISCGNHQQITYSGQSNGFCNSVAQDHIKANTPSRLH 202

Query: 200 IQPT-IWSSAYFQEFVNSSCNDSGM--NKNKIVXXXXXXXXXXXXXMC-YKHLKSSSELN 255
           +  T   +S +  + V++S   S M      ++             +C Y  L  +S   
Sbjct: 203 LSTTPTTTSCWEWKNVSNSGRLSSMVFTDGPLLDSVDLRQPVKGGEVCLYSPLYETSS-- 260

Query: 256 DSKERTKLKSLPVALSIPQKKVS-SPRFPLSPLGKKSSTNEK-LGGCRTIDVMLDDGNLS 313
                T  K LP       K++S SP   LSPLG K S   K +  C+T  ++ D  N+S
Sbjct: 261 -----TPNKPLPC-----DKEISVSPPLCLSPLGPKFSERVKAVRSCQTGKILEDLRNIS 310

Query: 314 FKDVEESLDKTCQGFKSQLYYNSMQQKSDMFTRDKIIDMNDYWTRPASFPPKHANFCGTM 373
            ++ E  +D+       +L+        D +   +   M        + P K   F  ++
Sbjct: 311 -EEAELRVDR-------RLF-------DDAYAIRRAFSMERSTESAPASPCKR--FIRSL 353

Query: 374 SRLPSRRSLIXXXXXXXXXXXXXXXKVSQKIEGFLAVLNVTGGNFSPQSQKVPFGVSSVQ 433
           S  P +RSL+                 +QKI+GFLAVL++ GGN SP+SQK+PF V+S  
Sbjct: 354 SGRPIQRSLVGSFEESLLTGRLSCGPTNQKIDGFLAVLSIAGGNISPKSQKLPFSVTSAG 413

Query: 434 GDKYLLYYSSINLSGKLSSKSRVSKFQ----RTLSMD-ESRSEKSRIRIPMKGRIQLVL 487
            D  LLYY+SI+L+G     S+++KF     +T  M+ +++S KS++RIPMKGRIQLVL
Sbjct: 414 DDCLLLYYASIDLAG----GSKLNKFWGQKVKTSQMNSDAQSSKSQLRIPMKGRIQLVL 468


>AT3G26890.3 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G41110.1); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr3:9907456-9910463 REVERSE LENGTH=649
          Length = 649

 Score =  117 bits (292), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 115/359 (32%), Positives = 175/359 (48%), Gaps = 48/359 (13%)

Query: 143 VRKRLLSPLNGMMLADHFRGDPLDIGEGNHRTCPKVGDD---CHNAIHEYKKVHIGDDIN 199
           VRKR+LSPLN  +    FRGD  DI  GNH+     G     C++   ++ K +    ++
Sbjct: 144 VRKRVLSPLN-TLFPVKFRGDLHDISCGNHQQITYSGQSNGFCNSVAQDHIKANTPSRLH 202

Query: 200 IQPT-IWSSAYFQEFVNSSCNDSGM--NKNKIVXXXXXXXXXXXXXMC-YKHLKSSSELN 255
           +  T   +S +  + V++S   S M      ++             +C Y  L  +S   
Sbjct: 203 LSTTPTTTSCWEWKNVSNSGRLSSMVFTDGPLLDSVDLRQPVKGGEVCLYSPLYETSS-- 260

Query: 256 DSKERTKLKSLPVALSIPQKKVS-SPRFPLSPLGKKSSTNEK-LGGCRTIDVMLDDGNLS 313
                T  K LP       K++S SP   LSPLG K S   K +  C+T  ++ D  N+S
Sbjct: 261 -----TPNKPLPC-----DKEISVSPPLCLSPLGPKFSERVKAVRSCQTGKILEDLRNIS 310

Query: 314 FKDVEESLDKTCQGFKSQLYYNSMQQKSDMFTRDKIIDMNDYWTRPASFPPKHANFCGTM 373
            ++ E  +D+       +L+        D +   +   M        + P K   F  ++
Sbjct: 311 -EEAELRVDR-------RLF-------DDAYAIRRAFSMERSTESAPASPCKR--FIRSL 353

Query: 374 SRLPSRRSLIXXXXXXXXXXXXXXXKVSQKIEGFLAVLNVTGGNFSPQSQKVPFGVSSVQ 433
           S  P +RSL+                 +QKI+GFLAVL++ GGN SP+SQK+PF V+S  
Sbjct: 354 SGRPIQRSLVGSFEESLLTGRLSCGPTNQKIDGFLAVLSIAGGNISPKSQKLPFSVTSAG 413

Query: 434 GDKYLLYYSSINLSGKLSSKSRVSKFQ----RTLSMD-ESRSEKSRIRIPMKGRIQLVL 487
            D  LLYY+SI+L+G     S+++KF     +T  M+ +++S KS++RIPMKGRIQLVL
Sbjct: 414 DDCLLLYYASIDLAG----GSKLNKFWGQKVKTSQMNSDAQSSKSQLRIPMKGRIQLVL 468


>AT3G26890.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G41110.1); Has 393 Blast hits to 383 proteins
           in 134 species: Archae - 0; Bacteria - 61; Metazoa -
           171; Fungi - 74; Plants - 80; Viruses - 0; Other
           Eukaryotes - 7 (source: NCBI BLink). |
           chr3:9907456-9910463 REVERSE LENGTH=649
          Length = 649

 Score =  117 bits (292), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 115/359 (32%), Positives = 175/359 (48%), Gaps = 48/359 (13%)

Query: 143 VRKRLLSPLNGMMLADHFRGDPLDIGEGNHRTCPKVGDD---CHNAIHEYKKVHIGDDIN 199
           VRKR+LSPLN  +    FRGD  DI  GNH+     G     C++   ++ K +    ++
Sbjct: 144 VRKRVLSPLN-TLFPVKFRGDLHDISCGNHQQITYSGQSNGFCNSVAQDHIKANTPSRLH 202

Query: 200 IQPT-IWSSAYFQEFVNSSCNDSGM--NKNKIVXXXXXXXXXXXXXMC-YKHLKSSSELN 255
           +  T   +S +  + V++S   S M      ++             +C Y  L  +S   
Sbjct: 203 LSTTPTTTSCWEWKNVSNSGRLSSMVFTDGPLLDSVDLRQPVKGGEVCLYSPLYETSS-- 260

Query: 256 DSKERTKLKSLPVALSIPQKKVS-SPRFPLSPLGKKSSTNEK-LGGCRTIDVMLDDGNLS 313
                T  K LP       K++S SP   LSPLG K S   K +  C+T  ++ D  N+S
Sbjct: 261 -----TPNKPLPC-----DKEISVSPPLCLSPLGPKFSERVKAVRSCQTGKILEDLRNIS 310

Query: 314 FKDVEESLDKTCQGFKSQLYYNSMQQKSDMFTRDKIIDMNDYWTRPASFPPKHANFCGTM 373
            ++ E  +D+       +L+        D +   +   M        + P K   F  ++
Sbjct: 311 -EEAELRVDR-------RLF-------DDAYAIRRAFSMERSTESAPASPCKR--FIRSL 353

Query: 374 SRLPSRRSLIXXXXXXXXXXXXXXXKVSQKIEGFLAVLNVTGGNFSPQSQKVPFGVSSVQ 433
           S  P +RSL+                 +QKI+GFLAVL++ GGN SP+SQK+PF V+S  
Sbjct: 354 SGRPIQRSLVGSFEESLLTGRLSCGPTNQKIDGFLAVLSIAGGNISPKSQKLPFSVTSAG 413

Query: 434 GDKYLLYYSSINLSGKLSSKSRVSKFQ----RTLSMD-ESRSEKSRIRIPMKGRIQLVL 487
            D  LLYY+SI+L+G     S+++KF     +T  M+ +++S KS++RIPMKGRIQLVL
Sbjct: 414 DDCLLLYYASIDLAG----GSKLNKFWGQKVKTSQMNSDAQSSKSQLRIPMKGRIQLVL 468


>AT3G26890.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G41110.1); Has 393 Blast hits to 383 proteins
           in 134 species: Archae - 0; Bacteria - 61; Metazoa -
           171; Fungi - 74; Plants - 80; Viruses - 0; Other
           Eukaryotes - 7 (source: NCBI BLink). |
           chr3:9907456-9910463 REVERSE LENGTH=649
          Length = 649

 Score =  117 bits (292), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 115/359 (32%), Positives = 175/359 (48%), Gaps = 48/359 (13%)

Query: 143 VRKRLLSPLNGMMLADHFRGDPLDIGEGNHRTCPKVGDD---CHNAIHEYKKVHIGDDIN 199
           VRKR+LSPLN  +    FRGD  DI  GNH+     G     C++   ++ K +    ++
Sbjct: 144 VRKRVLSPLN-TLFPVKFRGDLHDISCGNHQQITYSGQSNGFCNSVAQDHIKANTPSRLH 202

Query: 200 IQPT-IWSSAYFQEFVNSSCNDSGM--NKNKIVXXXXXXXXXXXXXMC-YKHLKSSSELN 255
           +  T   +S +  + V++S   S M      ++             +C Y  L  +S   
Sbjct: 203 LSTTPTTTSCWEWKNVSNSGRLSSMVFTDGPLLDSVDLRQPVKGGEVCLYSPLYETSS-- 260

Query: 256 DSKERTKLKSLPVALSIPQKKVS-SPRFPLSPLGKKSSTNEK-LGGCRTIDVMLDDGNLS 313
                T  K LP       K++S SP   LSPLG K S   K +  C+T  ++ D  N+S
Sbjct: 261 -----TPNKPLPC-----DKEISVSPPLCLSPLGPKFSERVKAVRSCQTGKILEDLRNIS 310

Query: 314 FKDVEESLDKTCQGFKSQLYYNSMQQKSDMFTRDKIIDMNDYWTRPASFPPKHANFCGTM 373
            ++ E  +D+       +L+        D +   +   M        + P K   F  ++
Sbjct: 311 -EEAELRVDR-------RLF-------DDAYAIRRAFSMERSTESAPASPCKR--FIRSL 353

Query: 374 SRLPSRRSLIXXXXXXXXXXXXXXXKVSQKIEGFLAVLNVTGGNFSPQSQKVPFGVSSVQ 433
           S  P +RSL+                 +QKI+GFLAVL++ GGN SP+SQK+PF V+S  
Sbjct: 354 SGRPIQRSLVGSFEESLLTGRLSCGPTNQKIDGFLAVLSIAGGNISPKSQKLPFSVTSAG 413

Query: 434 GDKYLLYYSSINLSGKLSSKSRVSKFQ----RTLSMD-ESRSEKSRIRIPMKGRIQLVL 487
            D  LLYY+SI+L+G     S+++KF     +T  M+ +++S KS++RIPMKGRIQLVL
Sbjct: 414 DDCLLLYYASIDLAG----GSKLNKFWGQKVKTSQMNSDAQSSKSQLRIPMKGRIQLVL 468


>AT3G26890.6 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; EXPRESSED IN: 24 plant
           structures; EXPRESSED DURING: 15 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT5G41110.1). | chr3:9907456-9910463 REVERSE
           LENGTH=648
          Length = 648

 Score =  110 bits (276), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 114/359 (31%), Positives = 174/359 (48%), Gaps = 49/359 (13%)

Query: 143 VRKRLLSPLNGMMLADHFRGDPLDIGEGNHRTCPKVGDD---CHNAIHEYKKVHIGDDIN 199
           VRKR+LSPLN  +    FRGD  DI  GNH+     G     C++   ++ K +    ++
Sbjct: 144 VRKRVLSPLN-TLFPVKFRGDLHDISCGNHQQITYSGQSNGFCNSVAQDHIKANTPSRLH 202

Query: 200 IQPT-IWSSAYFQEFVNSSCNDSGM--NKNKIVXXXXXXXXXXXXXMC-YKHLKSSSELN 255
           +  T   +S +  + V++S   S M      ++             +C Y  L  +S   
Sbjct: 203 LSTTPTTTSCWEWKNVSNSGRLSSMVFTDGPLLDSVDLRQPVKGGEVCLYSPLYETSS-- 260

Query: 256 DSKERTKLKSLPVALSIPQKKVS-SPRFPLSPLGKKSSTNEK-LGGCRTIDVMLDDGNLS 313
                T  K LP       K++S SP   LSPLG K S   K +  C+T  ++ D  N+S
Sbjct: 261 -----TPNKPLPC-----DKEISVSPPLCLSPLGPKFSERVKAVRSCQTGKILEDLRNIS 310

Query: 314 FKDVEESLDKTCQGFKSQLYYNSMQQKSDMFTRDKIIDMNDYWTRPASFPPKHANFCGTM 373
            ++ E  +D+       +L+        D +   +   M        + P K   F  ++
Sbjct: 311 -EEAELRVDR-------RLF-------DDAYAIRRAFSMERSTESAPASPCKR--FIRSL 353

Query: 374 SRLPSRRSLIXXXXXXXXXXXXXXXKVSQKIEGFLAVLNVTGGNFSPQSQKVPFGVSSVQ 433
           S  P +RSL+                 +Q I+GFLAVL++ GGN SP+SQK+PF V+S  
Sbjct: 354 SGRPIQRSLVGSFEESLLTGRLSCGPTNQ-IDGFLAVLSIAGGNISPKSQKLPFSVTSAG 412

Query: 434 GDKYLLYYSSINLSGKLSSKSRVSKFQ----RTLSMD-ESRSEKSRIRIPMKGRIQLVL 487
            D  LLYY+SI+L+G     S+++KF     +T  M+ +++S KS++RIPMKGRIQLVL
Sbjct: 413 DDCLLLYYASIDLAG----GSKLNKFWGQKVKTSQMNSDAQSSKSQLRIPMKGRIQLVL 467
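

The one-line hit summaries near the top of this report (subject ID, truncated
description, bit score, E value) can be pulled into a small table for
downstream filtering. The following is a minimal sketch, not part of the BLAST
output itself. It assumes the legacy text layout shown in this report, where
the last two whitespace-separated fields are the bit score and the E value, and
it assumes TAIR-style descriptions that contain a "|" separator. The function
name parse_hit_summary is hypothetical.

```python
def parse_hit_summary(lines):
    """Return (subject_id, bit_score, e_value) tuples from summary lines.

    Assumption: each summary line ends with "<bits> <E value>" and the
    description contains a "|" (as in TAIR10 deflines). Other lines are skipped.
    """
    hits = []
    for line in lines:
        # Split off the last two whitespace-separated fields.
        parts = line.rstrip().rsplit(None, 2)
        if len(parts) != 3 or "|" not in parts[0]:
            continue
        description, bits, evalue = parts
        # Legacy BLAST may print very small E values as "e-100" (no mantissa).
        if evalue.startswith("e"):
            evalue = "1" + evalue
        try:
            hits.append((description.split()[0], float(bits), float(evalue)))
        except ValueError:
            # Not a summary line (for example, a wrapped defline); skip it.
            continue
    return hits


# Three summary lines copied from the report above.
summary = [
    "AT5G41110.1 | Symbols:  | unknown protein; INVOLVED IN: biologic...   124   2e-28",
    "AT3G26890.5 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   117   2e-26",
    "AT3G26890.6 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   110   2e-24",
]

hits = parse_hit_summary(summary)
for subject_id, bits, evalue in hits:
    print(subject_id, bits, evalue)
```

Splitting from the right keeps the sketch independent of how long the truncated
description is. For anything beyond a quick look, a maintained BLAST parser is
likely a better choice than this hand-rolled split.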