Miyakogusa Predicted Gene

Lj4g3v2351040.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj4g3v2351040.1 Non Chatacterized Hit- tr|I1KN07|I1KN07_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.48866
PE,68.03,0,SANT,SANT domain; seg,NULL;
Homeodomain-like,Homeodomain-like; FAMILY NOT NAMED,NULL,CUFF.50796.1
         (924 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G09040.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   328   1e-89
AT1G09050.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   315   1e-85
AT2G47820.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   296   6e-80
AT2G47820.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   296   6e-80
AT1G55050.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   271   1e-72
AT1G55050.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   271   1e-72

>AT1G09040.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: membrane;
           EXPRESSED IN: leaf; BEST Arabidopsis thaliana protein
           match is: unknown protein (TAIR:AT1G09050.1); Has 614
           Blast hits to 567 proteins in 104 species: Archae - 2;
           Bacteria - 12; Metazoa - 344; Fungi - 31; Plants - 81;
           Viruses - 0; Other Eukaryotes - 144 (source: NCBI
           BLink). | chr1:2912362-2915174 FORWARD LENGTH=911
          Length = 911

 Score =  328 bits (840), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 203/487 (41%), Positives = 281/487 (57%), Gaps = 50/487 (10%)

Query: 44  GDPEIFPRVGEKYQVEIPPLTSKSDHSWFQRKIPKKEGGSLNKYLVGLPIPIIWIKDEVE 103
           GDP++ PRVG+++QV+IPP+ S +  + F    P     S   +L+GLP+ ++WI     
Sbjct: 31  GDPQVEPRVGDEFQVDIPPMMSATKRAVFL-STPVALDDSSYSFLIGLPVQVMWIDKH-- 87

Query: 104 SNKPDLLKNECKFIGVANKIESSGGECIKEIQIVKKLNPNLEAIDSTLVNGVHLGGLENS 163
                             + + +G + +   Q +K L    ++  S  + G        S
Sbjct: 88  -----------------RRGQGNGDDNVDMNQSLKSLRAK-KSRCSAKIRG-------KS 122

Query: 164 NAQQETKIGMHDKLRGGGDCLVPGSASDAWNEIEEASFTLGLYIFGKNLDQVKRFIGNKK 223
           +   ETK     K R   +  VP   S +W ++E ASF LGLY FGKN  QVK F+ NK 
Sbjct: 123 DKNSETK-----KQRSNLEA-VPVIPSSSWEDLEVASFVLGLYTFGKNFTQVKNFMENKG 176

Query: 224 MGDVLLFYYGKFYKSEKYQRWSRCRKMRSRKCIFGQKIFTGPRQQELLSRLLPNVSEEGR 283
           +G+++LFYYGKFY S KY  WS  RK R+RKC+FG+ +++G RQQ+LL+RL+P++ +E +
Sbjct: 177 IGEIMLFYYGKFYNSAKYHSWSESRKKRNRKCVFGRTLYSGWRQQQLLTRLMPSIPDEPQ 236

Query: 284 SRLL-EVSKTFVEGKILLEDYVSILKASXXXXXXXXXXXXXXXXXDLTGLTTDSVKPTQT 342
            ++L +VSK+F EG I LE YVS +K                   DLT  T+  +K    
Sbjct: 237 KQILVDVSKSFAEGTITLEKYVSAVKNLVGLRLLVDAVAIGKEKEDLTVPTSTPMKTKPW 296

Query: 343 LPVHPE---IPAGKACSMLTHSEIISFLTGNFRLSKARTSDLFWEAVWPRLLARGWHSEQ 399
             V  +   +P     + LT + II+ LTG  RLSKAR +D+FW AVWPRLLARGWHS+Q
Sbjct: 297 FTVSSKSSLVPGEGDYNSLTSAGIINQLTGCSRLSKARCNDIFWGAVWPRLLARGWHSQQ 356

Query: 400 PGGSNYAYASKNPLVFLVPGVKKFSRK-LVKGNHYYDSVSDVLGKVASDPELIELETIAD 458
           P    Y + SK+ +VF+VPGVKKFSR+ LVKG+HY+DSVSD+L KV S+PEL+E E    
Sbjct: 357 PEDRGY-FKSKDYIVFIVPGVKKFSRQELVKGDHYFDSVSDILTKVVSEPELLENE---- 411

Query: 459 NDCTGKEGNGCTKDTKPDHENSP-DRPRHCYLKVKTPNRIADGMKFTVVDTSLAS-EKMT 516
              TG      + D K D E+ P D  RH YL+    NR   GMKFTVVDTSLA+  K+ 
Sbjct: 412 ---TGGVAAELSSD-KSDEESVPSDSLRHRYLRSPCSNRGTLGMKFTVVDTSLATGGKLC 467

Query: 517 KVRELRS 523
            +R L +
Sbjct: 468 DLRNLNA 474


>AT1G09050.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G09040.1); Has 552 Blast hits to 499 proteins
           in 115 species: Archae - 0; Bacteria - 86; Metazoa -
           259; Fungi - 14; Plants - 77; Viruses - 0; Other
           Eukaryotes - 116 (source: NCBI BLink). |
           chr1:2918031-2920858 FORWARD LENGTH=916
          Length = 916

 Score =  315 bits (806), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 202/486 (41%), Positives = 277/486 (56%), Gaps = 48/486 (9%)

Query: 44  GDPEIFPRVGEKYQVEIPPLTSKSDHSWFQRKIPKKEGGSLNKYLVGLPIPIIWIKDEVE 103
           GDP++ PRVG+++QV+IP + S S  + F    P     S   +LVGLP+ ++WI D+V 
Sbjct: 31  GDPQVEPRVGDEFQVDIPLMMSASKRAVFLSN-PVALDDSTCSFLVGLPVQVMWI-DKVG 88

Query: 104 SNKPDLLKNECKFIGVANKIESSGGECIKEIQIVKKLNPNLEAIDSTLVNGVHLGGLENS 163
                        IG  N     G   +   Q +K L    +   S  + G        S
Sbjct: 89  -------------IGQGN-----GDGNVDMNQSLKSLRAK-KGRCSAKIRG-------KS 122

Query: 164 NAQQETKIGMHDKLRGGGDCLVPGSASDAWNEIEEASFTLGLYIFGKNLDQVKRFIGNKK 223
           +   ETK     K R   +  VP   S +W+++E ASF LGLY FGKN  Q+  F+ NK 
Sbjct: 123 DKNSETK-----KQRLNLEA-VPAIPSSSWDDLEVASFVLGLYTFGKNFTQMNNFMENKG 176

Query: 224 MGDVLLFYYGKFYKSEKYQRWSRCRKMRSRKCIFGQKIFTGPRQQELLSRLLPNVSEEGR 283
           +G+++LFYYGKFY S KY  WS  RK R+RKC++G+K+++G RQQ+LL+RL+P++ +E +
Sbjct: 177 IGEIMLFYYGKFYNSAKYHTWSESRKKRNRKCVYGRKLYSGWRQQQLLTRLMPSIPDEPQ 236

Query: 284 SRLL-EVSKTFVEGKILLEDYVSILKASXXXXXXXXXXXXXXXXXDLTGLTTDSVKPTQT 342
            ++L +VSK+F EG I LE YVS +K                   DLT  T+  +K    
Sbjct: 237 KQMLVDVSKSFAEGTITLEKYVSAVKNLVGLRLLVDAVAIGKEKEDLTVPTSTPMKTKPW 296

Query: 343 LPVHPE---IPAGKACSMLTHSEIISFLTGNFRLSKARTSDLFWEAVWPRLLARGWHSEQ 399
             V  +   +P     + LT + II+ LTG  RLSKAR +D+FW AVWPRLLARGW S+Q
Sbjct: 297 FTVSSKSSLVPGEGDYNSLTSAGIINQLTGCSRLSKARCNDIFWGAVWPRLLARGWRSQQ 356

Query: 400 PGGSNYAYASKNPLVFLVPGVKKFSRK-LVKGNHYYDSVSDVLGKVASDPELIELETIAD 458
           P    Y + SK+ +VF+VPGVKKFSR+ LVKG+HY+DSVSD+L KV S+PEL+E E    
Sbjct: 357 PEDRGY-FKSKDYIVFIVPGVKKFSRQELVKGDHYFDSVSDILTKVVSEPELLENE---- 411

Query: 459 NDCTGKEGNGCTKDTKPDHENSPDRPRHCYLKVKTPNRIADGMKFTVVDTSLAS-EKMTK 517
              TG        D   +  +  D  RH YL+    NR   GMKFTVVDTSLA+  K+  
Sbjct: 412 ---TGGVAAENPSDQSDEESSPSDSLRHRYLRSPCSNRGTLGMKFTVVDTSLATGGKLCD 468

Query: 518 VRELRS 523
           +R L +
Sbjct: 469 LRNLNA 474


>AT2G47820.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 17 plant
           structures; EXPRESSED DURING: 13 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT1G09040.1). | chr2:19588122-19590629 FORWARD
           LENGTH=805
          Length = 805

 Score =  296 bits (757), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 204/523 (39%), Positives = 282/523 (53%), Gaps = 72/523 (13%)

Query: 27  ADEQSL---SPELSDVYDVFGDPEIFPRVGEKYQVEIPPLTSKSDH----SWFQRKIPKK 79
            DE S+   SP L+ ++   GDP++ PRVG++YQ ++P L ++SD     + F  + P  
Sbjct: 14  VDESSMLLNSPYLNGIH---GDPDVLPRVGDQYQADLPVLLTESDRLKLITCFHSEPP-- 68

Query: 80  EGGSLNKYLV-GLPIPIIWIKDEVESNKPDLLKNECKFIGVANKIESSGGECIKEIQIVK 138
               L K L  GLPIP++W + E             KF G             +E  I K
Sbjct: 69  ----LQKLLTFGLPIPLMWTRSE-------------KFRG------------FREADIDK 99

Query: 139 KLNPNLEAIDSTLVNG-------VHLGGLENSNAQQETKIGMHDKLRGGGDCLVPGSASD 191
              P     D +L N        + L      NA+   K    DK         PG+   
Sbjct: 100 ASPP---VDDQSLQNAACMKPRSIVLALPCQKNAK--FKFDWLDKTLYP----FPGTLGQ 150

Query: 192 AWNEIEEASFTLGLYIFGKNLDQVKRFIGNKKMGDVLLFYYGKFYKSEKYQRWSRCRKMR 251
            W + E+  F LGLY  GKNL  V+RF+G+K MGD+L +YYG FY+S +Y+RW   RK R
Sbjct: 151 PWEDAEQERFLLGLYCLGKNLVLVQRFVGSKHMGDMLSYYYGSFYRSTEYRRWVDGRKSR 210

Query: 252 SRKCIFGQKIFTGPRQQELLSRLLPNVSEEGRSRLLEVSKTFVEGKILLEDYVSILKASX 311
           SR+ + GQK+ +G RQQELLSR+  +VSEE +  LL+VSK F E KI LEDYV  LK + 
Sbjct: 211 SRRSVQGQKLLSGWRQQELLSRISSHVSEECKITLLKVSKAFREDKIALEDYVFTLKNTV 270

Query: 312 XXXXXXXXXXXXXXXXDLTGLTTDSVKPTQTLPVHPEIPAGKACSMLTHSEIISFLTGNF 371
                           DLT    +  K       + ++   +  + L  ++I+ FLTG +
Sbjct: 271 GIDMLTQVIGIGKGKRDLTNCALEPTKLNHGASGNSQV---RIRNDLPIADIVKFLTGEY 327

Query: 372 RLSKARTSDLFWEAVWPRLLARGWHSEQPGGSNYAYASKNPLVFLVPGVKKFS-RKLVKG 430
           R+SK R+SDLFWEAVWPRLLARGWHSEQP         KN LVFLVP   KFS RK+ KG
Sbjct: 328 RMSKTRSSDLFWEAVWPRLLARGWHSEQPKD-----GPKNSLVFLVPEANKFSRRKMSKG 382

Query: 431 NHYYDSVSDVLGKVASDPELIELETIADNDCTGKE--GNGCTKDTKPDHENSPD-RPRHC 487
           NHY+DS++DVL KVA DP L+EL+   +   + +E   N    + +   ++SP+ + +  
Sbjct: 383 NHYFDSLTDVLNKVALDPTLLELDEDLERKGSKEEVIKNDPPTNLEEFDDSSPNSKKKKK 442

Query: 488 YLKVKTPNR-IADGMKFTVVDTS-LASEKMTKVRELRSLPFGV 528
           YL+ ++  R I + M FT++DTS   S +   ++ELRSLP G 
Sbjct: 443 YLQPRSKTRKIQEVMLFTIIDTSETNSIEGCTLKELRSLPVGT 485


>AT2G47820.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G09040.1); Has 628 Blast hits to 543 proteins
           in 149 species: Archae - 0; Bacteria - 106; Metazoa -
           145; Fungi - 69; Plants - 97; Viruses - 10; Other
           Eukaryotes - 201 (source: NCBI BLink). |
           chr2:19588122-19590629 FORWARD LENGTH=805
          Length = 805

 Score =  296 bits (757), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 204/523 (39%), Positives = 282/523 (53%), Gaps = 72/523 (13%)

Query: 27  ADEQSL---SPELSDVYDVFGDPEIFPRVGEKYQVEIPPLTSKSDH----SWFQRKIPKK 79
            DE S+   SP L+ ++   GDP++ PRVG++YQ ++P L ++SD     + F  + P  
Sbjct: 14  VDESSMLLNSPYLNGIH---GDPDVLPRVGDQYQADLPVLLTESDRLKLITCFHSEPP-- 68

Query: 80  EGGSLNKYLV-GLPIPIIWIKDEVESNKPDLLKNECKFIGVANKIESSGGECIKEIQIVK 138
               L K L  GLPIP++W + E             KF G             +E  I K
Sbjct: 69  ----LQKLLTFGLPIPLMWTRSE-------------KFRG------------FREADIDK 99

Query: 139 KLNPNLEAIDSTLVNG-------VHLGGLENSNAQQETKIGMHDKLRGGGDCLVPGSASD 191
              P     D +L N        + L      NA+   K    DK         PG+   
Sbjct: 100 ASPP---VDDQSLQNAACMKPRSIVLALPCQKNAK--FKFDWLDKTLYP----FPGTLGQ 150

Query: 192 AWNEIEEASFTLGLYIFGKNLDQVKRFIGNKKMGDVLLFYYGKFYKSEKYQRWSRCRKMR 251
            W + E+  F LGLY  GKNL  V+RF+G+K MGD+L +YYG FY+S +Y+RW   RK R
Sbjct: 151 PWEDAEQERFLLGLYCLGKNLVLVQRFVGSKHMGDMLSYYYGSFYRSTEYRRWVDGRKSR 210

Query: 252 SRKCIFGQKIFTGPRQQELLSRLLPNVSEEGRSRLLEVSKTFVEGKILLEDYVSILKASX 311
           SR+ + GQK+ +G RQQELLSR+  +VSEE +  LL+VSK F E KI LEDYV  LK + 
Sbjct: 211 SRRSVQGQKLLSGWRQQELLSRISSHVSEECKITLLKVSKAFREDKIALEDYVFTLKNTV 270

Query: 312 XXXXXXXXXXXXXXXXDLTGLTTDSVKPTQTLPVHPEIPAGKACSMLTHSEIISFLTGNF 371
                           DLT    +  K       + ++   +  + L  ++I+ FLTG +
Sbjct: 271 GIDMLTQVIGIGKGKRDLTNCALEPTKLNHGASGNSQV---RIRNDLPIADIVKFLTGEY 327

Query: 372 RLSKARTSDLFWEAVWPRLLARGWHSEQPGGSNYAYASKNPLVFLVPGVKKFS-RKLVKG 430
           R+SK R+SDLFWEAVWPRLLARGWHSEQP         KN LVFLVP   KFS RK+ KG
Sbjct: 328 RMSKTRSSDLFWEAVWPRLLARGWHSEQPKD-----GPKNSLVFLVPEANKFSRRKMSKG 382

Query: 431 NHYYDSVSDVLGKVASDPELIELETIADNDCTGKE--GNGCTKDTKPDHENSPD-RPRHC 487
           NHY+DS++DVL KVA DP L+EL+   +   + +E   N    + +   ++SP+ + +  
Sbjct: 383 NHYFDSLTDVLNKVALDPTLLELDEDLERKGSKEEVIKNDPPTNLEEFDDSSPNSKKKKK 442

Query: 488 YLKVKTPNR-IADGMKFTVVDTS-LASEKMTKVRELRSLPFGV 528
           YL+ ++  R I + M FT++DTS   S +   ++ELRSLP G 
Sbjct: 443 YLQPRSKTRKIQEVMLFTIIDTSETNSIEGCTLKELRSLPVGT 485


>AT1G55050.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: cultured cell;
           BEST Arabidopsis thaliana protein match is: unknown
           protein (TAIR:AT1G09040.1); Has 30201 Blast hits to
           17322 proteins in 780 species: Archae - 12; Bacteria -
           1396; Metazoa - 17338; Fungi - 3422; Plants - 5037;
           Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
           BLink). | chr1:20542779-20545612 FORWARD LENGTH=915
          Length = 915

 Score =  271 bits (694), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 185/486 (38%), Positives = 268/486 (55%), Gaps = 62/486 (12%)

Query: 42  VFGDPEIFPRVGEKYQVEIPPLTSKSDHSWFQRKIPKKEGGSLNKYLVGLPIPIIWIKDE 101
           V GDP++  RVG++YQVEIPP+ S+S  +     +   E  S   + VGLP+ ++WI+  
Sbjct: 18  VCGDPKVDIRVGDEYQVEIPPMMSESQRAELL--LNPLEFDSSCSFAVGLPVEVMWIE-- 73

Query: 102 VESNKPDLLKNECKFIGVANKIESSGGECIKEIQIVKKLNPNLEAIDSTLVNGVHLGGLE 161
                      +C+            G+ +    I   +N +L++    L       G  
Sbjct: 74  ----------TKCR-----------DGDGLGSDNI--DMNESLKS----LKRKRSRRGGS 106

Query: 162 NSNAQQETKIGMHDKLRGGGDCLVPGSASDAWNEIEEASFTLGLYIFGKNLDQVKRFIGN 221
           + N+  + ++ +           VP  +S +W ++E   F LGLY FGKN  QV++ + +
Sbjct: 107 DGNSGSKRRMNLE---------AVPEKSSSSWEDLEVDGFVLGLYTFGKNFAQVQKLLES 157

Query: 222 KKMGDVLLFYYGKFYKSEKYQRWSRCRKMRSRKCIFGQKIFTGPRQQELLSRLLPNVSEE 281
           K  G++LLFYYGKFY S KY+ WS   K RS +CI G+K+++  R Q LLSRL+ ++++E
Sbjct: 158 KATGEILLFYYGKFYGSAKYKTWSNYLKKRSTRCIQGKKLYSDWRLQLLLSRLIRSITDE 217

Query: 282 GR-SRLLEVSKTFVEGKILLEDYVSILKASXXXXXXXXXXXXXXXXXDLTGLTTDSVKPT 340
            +  +L++VSK+F EGK  LE+Y++ +K                   DLT LTT  V   
Sbjct: 218 SKEQKLVDVSKSFAEGKKSLEEYINAVKKLVGLRCLVEAVAIGKDKEDLTVLTTKPVDVE 277

Query: 341 QTLPVHPEIPAGKA-CSMLTHSEIISFLTGNFRLSKARTSDLFWEAVWPRLLARGWHSEQ 399
           Q   V   +PAG    + LT   II  L+G  R+SKAR +D+FW+AVWPRLL RGW SE 
Sbjct: 278 QWFRVSSAVPAGLGEYNSLTVEGIIEKLSGGSRVSKARCNDIFWDAVWPRLLHRGWRSEL 337

Query: 400 PGGSNYAYASKNPLVFLVPGVKKFSR-KLVKGNHYYDSVSDVLGKVASDPELIELETIAD 458
           P    Y   SK  +VFLVPGVKKFSR KLVK +HY+DS+SD+L KV S+PEL+E      
Sbjct: 338 PKDQGY-IKSKEHIVFLVPGVKKFSRKKLVKRDHYFDSISDILKKVVSEPELLEETA--- 393

Query: 459 NDCTGKEGNGCTKDTKPDHENSPDRPRHCYLKVKTPNRIADGMKFTVVDTS-LASE-KMT 516
                       ++ + +  N   + +HCYL  ++P+  +  MKFTVVDTS  AS  K+ 
Sbjct: 394 -----------EEEREENTYNQSKQEKHCYL--RSPSSSSTHMKFTVVDTSRFASRGKLY 440

Query: 517 KVRELR 522
           + RELR
Sbjct: 441 EFRELR 446


>AT1G55050.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: cultured cell;
           BEST Arabidopsis thaliana protein match is: unknown
           protein (TAIR:AT1G09040.1); Has 2440 Blast hits to 1999
           proteins in 271 species: Archae - 0; Bacteria - 138;
           Metazoa - 960; Fungi - 166; Plants - 162; Viruses - 14;
           Other Eukaryotes - 1000 (source: NCBI BLink). |
           chr1:20542779-20545612 FORWARD LENGTH=915
          Length = 915

 Score =  271 bits (694), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 185/486 (38%), Positives = 268/486 (55%), Gaps = 62/486 (12%)

Query: 42  VFGDPEIFPRVGEKYQVEIPPLTSKSDHSWFQRKIPKKEGGSLNKYLVGLPIPIIWIKDE 101
           V GDP++  RVG++YQVEIPP+ S+S  +     +   E  S   + VGLP+ ++WI+  
Sbjct: 18  VCGDPKVDIRVGDEYQVEIPPMMSESQRAELL--LNPLEFDSSCSFAVGLPVEVMWIE-- 73

Query: 102 VESNKPDLLKNECKFIGVANKIESSGGECIKEIQIVKKLNPNLEAIDSTLVNGVHLGGLE 161
                      +C+            G+ +    I   +N +L++    L       G  
Sbjct: 74  ----------TKCR-----------DGDGLGSDNI--DMNESLKS----LKRKRSRRGGS 106

Query: 162 NSNAQQETKIGMHDKLRGGGDCLVPGSASDAWNEIEEASFTLGLYIFGKNLDQVKRFIGN 221
           + N+  + ++ +           VP  +S +W ++E   F LGLY FGKN  QV++ + +
Sbjct: 107 DGNSGSKRRMNLE---------AVPEKSSSSWEDLEVDGFVLGLYTFGKNFAQVQKLLES 157

Query: 222 KKMGDVLLFYYGKFYKSEKYQRWSRCRKMRSRKCIFGQKIFTGPRQQELLSRLLPNVSEE 281
           K  G++LLFYYGKFY S KY+ WS   K RS +CI G+K+++  R Q LLSRL+ ++++E
Sbjct: 158 KATGEILLFYYGKFYGSAKYKTWSNYLKKRSTRCIQGKKLYSDWRLQLLLSRLIRSITDE 217

Query: 282 GR-SRLLEVSKTFVEGKILLEDYVSILKASXXXXXXXXXXXXXXXXXDLTGLTTDSVKPT 340
            +  +L++VSK+F EGK  LE+Y++ +K                   DLT LTT  V   
Sbjct: 218 SKEQKLVDVSKSFAEGKKSLEEYINAVKKLVGLRCLVEAVAIGKDKEDLTVLTTKPVDVE 277

Query: 341 QTLPVHPEIPAGKA-CSMLTHSEIISFLTGNFRLSKARTSDLFWEAVWPRLLARGWHSEQ 399
           Q   V   +PAG    + LT   II  L+G  R+SKAR +D+FW+AVWPRLL RGW SE 
Sbjct: 278 QWFRVSSAVPAGLGEYNSLTVEGIIEKLSGGSRVSKARCNDIFWDAVWPRLLHRGWRSEL 337

Query: 400 PGGSNYAYASKNPLVFLVPGVKKFSR-KLVKGNHYYDSVSDVLGKVASDPELIELETIAD 458
           P    Y   SK  +VFLVPGVKKFSR KLVK +HY+DS+SD+L KV S+PEL+E      
Sbjct: 338 PKDQGY-IKSKEHIVFLVPGVKKFSRKKLVKRDHYFDSISDILKKVVSEPELLEETA--- 393

Query: 459 NDCTGKEGNGCTKDTKPDHENSPDRPRHCYLKVKTPNRIADGMKFTVVDTS-LASE-KMT 516
                       ++ + +  N   + +HCYL  ++P+  +  MKFTVVDTS  AS  K+ 
Sbjct: 394 -----------EEEREENTYNQSKQEKHCYL--RSPSSSSTHMKFTVVDTSRFASRGKLY 440

Query: 517 KVRELR 522
           + RELR
Sbjct: 441 EFRELR 446