Miyakogusa Predicted Gene
- Lj4g3v2351040.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v2351040.1 Non Characterized Hit- tr|I1KN07|I1KN07_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.48866
PE,68.03,0,SANT,SANT domain; seg,NULL;
Homeodomain-like,Homeodomain-like; FAMILY NOT NAMED,NULL,CUFF.50796.1
(924 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
AT1G09040.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 328 1e-89
AT1G09050.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 315 1e-85
AT2G47820.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 296 6e-80
AT2G47820.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 296 6e-80
AT1G55050.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 271 1e-72
AT1G55050.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 271 1e-72
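The hit summary above can be read programmatically. The following is a minimal sketch, assuming each hit line ends with the bit score and then the E-value, separated by whitespace, as in the lines listed here. The helper name `parse_hit_line` is hypothetical and not part of BLAST or any library.

```python
def parse_hit_line(line):
    """Return (subject_id, bit_score, e_value) for one summary line, or None.

    Assumption: the last two whitespace-separated fields are the bit score
    and the E-value; the first field is the subject identifier.
    """
    fields = line.rsplit(None, 2)
    if len(fields) != 3:
        return None
    description, bits, evalue = fields
    # Assumption: an E-value printed without a leading digit (e.g. "e-100")
    # is treated as "1e-100" so that float() can parse it.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    try:
        return description.split()[0], float(bits), float(evalue)
    except ValueError:
        # Header lines such as the column titles end up here.
        return None


if __name__ == "__main__":
    line = "AT1G09040.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 328 1e-89"
    print(parse_hit_line(line))
```

Splitting from the right avoids depending on the truncated description text in the middle of each line.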
>AT1G09040.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: membrane;
EXPRESSED IN: leaf; BEST Arabidopsis thaliana protein
match is: unknown protein (TAIR:AT1G09050.1); Has 614
Blast hits to 567 proteins in 104 species: Archae - 2;
Bacteria - 12; Metazoa - 344; Fungi - 31; Plants - 81;
Viruses - 0; Other Eukaryotes - 144 (source: NCBI
BLink). | chr1:2912362-2915174 FORWARD LENGTH=911
Length = 911
Score = 328 bits (840), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 203/487 (41%), Positives = 281/487 (57%), Gaps = 50/487 (10%)
Query: 44 GDPEIFPRVGEKYQVEIPPLTSKSDHSWFQRKIPKKEGGSLNKYLVGLPIPIIWIKDEVE 103
GDP++ PRVG+++QV+IPP+ S + + F P S +L+GLP+ ++WI
Sbjct: 31 GDPQVEPRVGDEFQVDIPPMMSATKRAVFL-STPVALDDSSYSFLIGLPVQVMWIDKH-- 87
Query: 104 SNKPDLLKNECKFIGVANKIESSGGECIKEIQIVKKLNPNLEAIDSTLVNGVHLGGLENS 163
+ + +G + + Q +K L ++ S + G S
Sbjct: 88 -----------------RRGQGNGDDNVDMNQSLKSLRAK-KSRCSAKIRG-------KS 122
Query: 164 NAQQETKIGMHDKLRGGGDCLVPGSASDAWNEIEEASFTLGLYIFGKNLDQVKRFIGNKK 223
+ ETK K R + VP S +W ++E ASF LGLY FGKN QVK F+ NK
Sbjct: 123 DKNSETK-----KQRSNLEA-VPVIPSSSWEDLEVASFVLGLYTFGKNFTQVKNFMENKG 176
Query: 224 MGDVLLFYYGKFYKSEKYQRWSRCRKMRSRKCIFGQKIFTGPRQQELLSRLLPNVSEEGR 283
+G+++LFYYGKFY S KY WS RK R+RKC+FG+ +++G RQQ+LL+RL+P++ +E +
Sbjct: 177 IGEIMLFYYGKFYNSAKYHSWSESRKKRNRKCVFGRTLYSGWRQQQLLTRLMPSIPDEPQ 236
Query: 284 SRLL-EVSKTFVEGKILLEDYVSILKASXXXXXXXXXXXXXXXXXDLTGLTTDSVKPTQT 342
++L +VSK+F EG I LE YVS +K DLT T+ +K
Sbjct: 237 KQILVDVSKSFAEGTITLEKYVSAVKNLVGLRLLVDAVAIGKEKEDLTVPTSTPMKTKPW 296
Query: 343 LPVHPE---IPAGKACSMLTHSEIISFLTGNFRLSKARTSDLFWEAVWPRLLARGWHSEQ 399
V + +P + LT + II+ LTG RLSKAR +D+FW AVWPRLLARGWHS+Q
Sbjct: 297 FTVSSKSSLVPGEGDYNSLTSAGIINQLTGCSRLSKARCNDIFWGAVWPRLLARGWHSQQ 356
Query: 400 PGGSNYAYASKNPLVFLVPGVKKFSRK-LVKGNHYYDSVSDVLGKVASDPELIELETIAD 458
P Y + SK+ +VF+VPGVKKFSR+ LVKG+HY+DSVSD+L KV S+PEL+E E
Sbjct: 357 PEDRGY-FKSKDYIVFIVPGVKKFSRQELVKGDHYFDSVSDILTKVVSEPELLENE---- 411
Query: 459 NDCTGKEGNGCTKDTKPDHENSP-DRPRHCYLKVKTPNRIADGMKFTVVDTSLAS-EKMT 516
TG + D K D E+ P D RH YL+ NR GMKFTVVDTSLA+ K+
Sbjct: 412 ---TGGVAAELSSD-KSDEESVPSDSLRHRYLRSPCSNRGTLGMKFTVVDTSLATGGKLC 467
Query: 517 KVRELRS 523
+R L +
Sbjct: 468 DLRNLNA 474
>AT1G09050.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G09040.1); Has 552 Blast hits to 499 proteins
in 115 species: Archae - 0; Bacteria - 86; Metazoa -
259; Fungi - 14; Plants - 77; Viruses - 0; Other
Eukaryotes - 116 (source: NCBI BLink). |
chr1:2918031-2920858 FORWARD LENGTH=916
Length = 916
Score = 315 bits (806), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 202/486 (41%), Positives = 277/486 (56%), Gaps = 48/486 (9%)
Query: 44 GDPEIFPRVGEKYQVEIPPLTSKSDHSWFQRKIPKKEGGSLNKYLVGLPIPIIWIKDEVE 103
GDP++ PRVG+++QV+IP + S S + F P S +LVGLP+ ++WI D+V
Sbjct: 31 GDPQVEPRVGDEFQVDIPLMMSASKRAVFLSN-PVALDDSTCSFLVGLPVQVMWI-DKVG 88
Query: 104 SNKPDLLKNECKFIGVANKIESSGGECIKEIQIVKKLNPNLEAIDSTLVNGVHLGGLENS 163
IG N G + Q +K L + S + G S
Sbjct: 89 -------------IGQGN-----GDGNVDMNQSLKSLRAK-KGRCSAKIRG-------KS 122
Query: 164 NAQQETKIGMHDKLRGGGDCLVPGSASDAWNEIEEASFTLGLYIFGKNLDQVKRFIGNKK 223
+ ETK K R + VP S +W+++E ASF LGLY FGKN Q+ F+ NK
Sbjct: 123 DKNSETK-----KQRLNLEA-VPAIPSSSWDDLEVASFVLGLYTFGKNFTQMNNFMENKG 176
Query: 224 MGDVLLFYYGKFYKSEKYQRWSRCRKMRSRKCIFGQKIFTGPRQQELLSRLLPNVSEEGR 283
+G+++LFYYGKFY S KY WS RK R+RKC++G+K+++G RQQ+LL+RL+P++ +E +
Sbjct: 177 IGEIMLFYYGKFYNSAKYHTWSESRKKRNRKCVYGRKLYSGWRQQQLLTRLMPSIPDEPQ 236
Query: 284 SRLL-EVSKTFVEGKILLEDYVSILKASXXXXXXXXXXXXXXXXXDLTGLTTDSVKPTQT 342
++L +VSK+F EG I LE YVS +K DLT T+ +K
Sbjct: 237 KQMLVDVSKSFAEGTITLEKYVSAVKNLVGLRLLVDAVAIGKEKEDLTVPTSTPMKTKPW 296
Query: 343 LPVHPE---IPAGKACSMLTHSEIISFLTGNFRLSKARTSDLFWEAVWPRLLARGWHSEQ 399
V + +P + LT + II+ LTG RLSKAR +D+FW AVWPRLLARGW S+Q
Sbjct: 297 FTVSSKSSLVPGEGDYNSLTSAGIINQLTGCSRLSKARCNDIFWGAVWPRLLARGWRSQQ 356
Query: 400 PGGSNYAYASKNPLVFLVPGVKKFSRK-LVKGNHYYDSVSDVLGKVASDPELIELETIAD 458
P Y + SK+ +VF+VPGVKKFSR+ LVKG+HY+DSVSD+L KV S+PEL+E E
Sbjct: 357 PEDRGY-FKSKDYIVFIVPGVKKFSRQELVKGDHYFDSVSDILTKVVSEPELLENE---- 411
Query: 459 NDCTGKEGNGCTKDTKPDHENSPDRPRHCYLKVKTPNRIADGMKFTVVDTSLAS-EKMTK 517
TG D + + D RH YL+ NR GMKFTVVDTSLA+ K+
Sbjct: 412 ---TGGVAAENPSDQSDEESSPSDSLRHRYLRSPCSNRGTLGMKFTVVDTSLATGGKLCD 468
Query: 518 VRELRS 523
+R L +
Sbjct: 469 LRNLNA 474
>AT2G47820.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 17 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT1G09040.1). | chr2:19588122-19590629 FORWARD
LENGTH=805
Length = 805
Score = 296 bits (757), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 204/523 (39%), Positives = 282/523 (53%), Gaps = 72/523 (13%)
Query: 27 ADEQSL---SPELSDVYDVFGDPEIFPRVGEKYQVEIPPLTSKSDH----SWFQRKIPKK 79
DE S+ SP L+ ++ GDP++ PRVG++YQ ++P L ++SD + F + P
Sbjct: 14 VDESSMLLNSPYLNGIH---GDPDVLPRVGDQYQADLPVLLTESDRLKLITCFHSEPP-- 68
Query: 80 EGGSLNKYLV-GLPIPIIWIKDEVESNKPDLLKNECKFIGVANKIESSGGECIKEIQIVK 138
L K L GLPIP++W + E KF G +E I K
Sbjct: 69 ----LQKLLTFGLPIPLMWTRSE-------------KFRG------------FREADIDK 99
Query: 139 KLNPNLEAIDSTLVNG-------VHLGGLENSNAQQETKIGMHDKLRGGGDCLVPGSASD 191
P D +L N + L NA+ K DK PG+
Sbjct: 100 ASPP---VDDQSLQNAACMKPRSIVLALPCQKNAK--FKFDWLDKTLYP----FPGTLGQ 150
Query: 192 AWNEIEEASFTLGLYIFGKNLDQVKRFIGNKKMGDVLLFYYGKFYKSEKYQRWSRCRKMR 251
W + E+ F LGLY GKNL V+RF+G+K MGD+L +YYG FY+S +Y+RW RK R
Sbjct: 151 PWEDAEQERFLLGLYCLGKNLVLVQRFVGSKHMGDMLSYYYGSFYRSTEYRRWVDGRKSR 210
Query: 252 SRKCIFGQKIFTGPRQQELLSRLLPNVSEEGRSRLLEVSKTFVEGKILLEDYVSILKASX 311
SR+ + GQK+ +G RQQELLSR+ +VSEE + LL+VSK F E KI LEDYV LK +
Sbjct: 211 SRRSVQGQKLLSGWRQQELLSRISSHVSEECKITLLKVSKAFREDKIALEDYVFTLKNTV 270
Query: 312 XXXXXXXXXXXXXXXXDLTGLTTDSVKPTQTLPVHPEIPAGKACSMLTHSEIISFLTGNF 371
DLT + K + ++ + + L ++I+ FLTG +
Sbjct: 271 GIDMLTQVIGIGKGKRDLTNCALEPTKLNHGASGNSQV---RIRNDLPIADIVKFLTGEY 327
Query: 372 RLSKARTSDLFWEAVWPRLLARGWHSEQPGGSNYAYASKNPLVFLVPGVKKFS-RKLVKG 430
R+SK R+SDLFWEAVWPRLLARGWHSEQP KN LVFLVP KFS RK+ KG
Sbjct: 328 RMSKTRSSDLFWEAVWPRLLARGWHSEQPKD-----GPKNSLVFLVPEANKFSRRKMSKG 382
Query: 431 NHYYDSVSDVLGKVASDPELIELETIADNDCTGKE--GNGCTKDTKPDHENSPD-RPRHC 487
NHY+DS++DVL KVA DP L+EL+ + + +E N + + ++SP+ + +
Sbjct: 383 NHYFDSLTDVLNKVALDPTLLELDEDLERKGSKEEVIKNDPPTNLEEFDDSSPNSKKKKK 442
Query: 488 YLKVKTPNR-IADGMKFTVVDTS-LASEKMTKVRELRSLPFGV 528
YL+ ++ R I + M FT++DTS S + ++ELRSLP G
Sbjct: 443 YLQPRSKTRKIQEVMLFTIIDTSETNSIEGCTLKELRSLPVGT 485
>AT2G47820.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G09040.1); Has 628 Blast hits to 543 proteins
in 149 species: Archae - 0; Bacteria - 106; Metazoa -
145; Fungi - 69; Plants - 97; Viruses - 10; Other
Eukaryotes - 201 (source: NCBI BLink). |
chr2:19588122-19590629 FORWARD LENGTH=805
Length = 805
Score = 296 bits (757), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 204/523 (39%), Positives = 282/523 (53%), Gaps = 72/523 (13%)
Query: 27 ADEQSL---SPELSDVYDVFGDPEIFPRVGEKYQVEIPPLTSKSDH----SWFQRKIPKK 79
DE S+ SP L+ ++ GDP++ PRVG++YQ ++P L ++SD + F + P
Sbjct: 14 VDESSMLLNSPYLNGIH---GDPDVLPRVGDQYQADLPVLLTESDRLKLITCFHSEPP-- 68
Query: 80 EGGSLNKYLV-GLPIPIIWIKDEVESNKPDLLKNECKFIGVANKIESSGGECIKEIQIVK 138
L K L GLPIP++W + E KF G +E I K
Sbjct: 69 ----LQKLLTFGLPIPLMWTRSE-------------KFRG------------FREADIDK 99
Query: 139 KLNPNLEAIDSTLVNG-------VHLGGLENSNAQQETKIGMHDKLRGGGDCLVPGSASD 191
P D +L N + L NA+ K DK PG+
Sbjct: 100 ASPP---VDDQSLQNAACMKPRSIVLALPCQKNAK--FKFDWLDKTLYP----FPGTLGQ 150
Query: 192 AWNEIEEASFTLGLYIFGKNLDQVKRFIGNKKMGDVLLFYYGKFYKSEKYQRWSRCRKMR 251
W + E+ F LGLY GKNL V+RF+G+K MGD+L +YYG FY+S +Y+RW RK R
Sbjct: 151 PWEDAEQERFLLGLYCLGKNLVLVQRFVGSKHMGDMLSYYYGSFYRSTEYRRWVDGRKSR 210
Query: 252 SRKCIFGQKIFTGPRQQELLSRLLPNVSEEGRSRLLEVSKTFVEGKILLEDYVSILKASX 311
SR+ + GQK+ +G RQQELLSR+ +VSEE + LL+VSK F E KI LEDYV LK +
Sbjct: 211 SRRSVQGQKLLSGWRQQELLSRISSHVSEECKITLLKVSKAFREDKIALEDYVFTLKNTV 270
Query: 312 XXXXXXXXXXXXXXXXDLTGLTTDSVKPTQTLPVHPEIPAGKACSMLTHSEIISFLTGNF 371
DLT + K + ++ + + L ++I+ FLTG +
Sbjct: 271 GIDMLTQVIGIGKGKRDLTNCALEPTKLNHGASGNSQV---RIRNDLPIADIVKFLTGEY 327
Query: 372 RLSKARTSDLFWEAVWPRLLARGWHSEQPGGSNYAYASKNPLVFLVPGVKKFS-RKLVKG 430
R+SK R+SDLFWEAVWPRLLARGWHSEQP KN LVFLVP KFS RK+ KG
Sbjct: 328 RMSKTRSSDLFWEAVWPRLLARGWHSEQPKD-----GPKNSLVFLVPEANKFSRRKMSKG 382
Query: 431 NHYYDSVSDVLGKVASDPELIELETIADNDCTGKE--GNGCTKDTKPDHENSPD-RPRHC 487
NHY+DS++DVL KVA DP L+EL+ + + +E N + + ++SP+ + +
Sbjct: 383 NHYFDSLTDVLNKVALDPTLLELDEDLERKGSKEEVIKNDPPTNLEEFDDSSPNSKKKKK 442
Query: 488 YLKVKTPNR-IADGMKFTVVDTS-LASEKMTKVRELRSLPFGV 528
YL+ ++ R I + M FT++DTS S + ++ELRSLP G
Sbjct: 443 YLQPRSKTRKIQEVMLFTIIDTSETNSIEGCTLKELRSLPVGT 485
>AT1G55050.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: cultured cell;
BEST Arabidopsis thaliana protein match is: unknown
protein (TAIR:AT1G09040.1); Has 30201 Blast hits to
17322 proteins in 780 species: Archae - 12; Bacteria -
1396; Metazoa - 17338; Fungi - 3422; Plants - 5037;
Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
BLink). | chr1:20542779-20545612 FORWARD LENGTH=915
Length = 915
Score = 271 bits (694), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 185/486 (38%), Positives = 268/486 (55%), Gaps = 62/486 (12%)
Query: 42 VFGDPEIFPRVGEKYQVEIPPLTSKSDHSWFQRKIPKKEGGSLNKYLVGLPIPIIWIKDE 101
V GDP++ RVG++YQVEIPP+ S+S + + E S + VGLP+ ++WI+
Sbjct: 18 VCGDPKVDIRVGDEYQVEIPPMMSESQRAELL--LNPLEFDSSCSFAVGLPVEVMWIE-- 73
Query: 102 VESNKPDLLKNECKFIGVANKIESSGGECIKEIQIVKKLNPNLEAIDSTLVNGVHLGGLE 161
+C+ G+ + I +N +L++ L G
Sbjct: 74 ----------TKCR-----------DGDGLGSDNI--DMNESLKS----LKRKRSRRGGS 106
Query: 162 NSNAQQETKIGMHDKLRGGGDCLVPGSASDAWNEIEEASFTLGLYIFGKNLDQVKRFIGN 221
+ N+ + ++ + VP +S +W ++E F LGLY FGKN QV++ + +
Sbjct: 107 DGNSGSKRRMNLE---------AVPEKSSSSWEDLEVDGFVLGLYTFGKNFAQVQKLLES 157
Query: 222 KKMGDVLLFYYGKFYKSEKYQRWSRCRKMRSRKCIFGQKIFTGPRQQELLSRLLPNVSEE 281
K G++LLFYYGKFY S KY+ WS K RS +CI G+K+++ R Q LLSRL+ ++++E
Sbjct: 158 KATGEILLFYYGKFYGSAKYKTWSNYLKKRSTRCIQGKKLYSDWRLQLLLSRLIRSITDE 217
Query: 282 GR-SRLLEVSKTFVEGKILLEDYVSILKASXXXXXXXXXXXXXXXXXDLTGLTTDSVKPT 340
+ +L++VSK+F EGK LE+Y++ +K DLT LTT V
Sbjct: 218 SKEQKLVDVSKSFAEGKKSLEEYINAVKKLVGLRCLVEAVAIGKDKEDLTVLTTKPVDVE 277
Query: 341 QTLPVHPEIPAGKA-CSMLTHSEIISFLTGNFRLSKARTSDLFWEAVWPRLLARGWHSEQ 399
Q V +PAG + LT II L+G R+SKAR +D+FW+AVWPRLL RGW SE
Sbjct: 278 QWFRVSSAVPAGLGEYNSLTVEGIIEKLSGGSRVSKARCNDIFWDAVWPRLLHRGWRSEL 337
Query: 400 PGGSNYAYASKNPLVFLVPGVKKFSR-KLVKGNHYYDSVSDVLGKVASDPELIELETIAD 458
P Y SK +VFLVPGVKKFSR KLVK +HY+DS+SD+L KV S+PEL+E
Sbjct: 338 PKDQGY-IKSKEHIVFLVPGVKKFSRKKLVKRDHYFDSISDILKKVVSEPELLEETA--- 393
Query: 459 NDCTGKEGNGCTKDTKPDHENSPDRPRHCYLKVKTPNRIADGMKFTVVDTS-LASE-KMT 516
++ + + N + +HCYL ++P+ + MKFTVVDTS AS K+
Sbjct: 394 -----------EEEREENTYNQSKQEKHCYL--RSPSSSSTHMKFTVVDTSRFASRGKLY 440
Query: 517 KVRELR 522
+ RELR
Sbjct: 441 EFRELR 446
>AT1G55050.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: cultured cell;
BEST Arabidopsis thaliana protein match is: unknown
protein (TAIR:AT1G09040.1); Has 2440 Blast hits to 1999
proteins in 271 species: Archae - 0; Bacteria - 138;
Metazoa - 960; Fungi - 166; Plants - 162; Viruses - 14;
Other Eukaryotes - 1000 (source: NCBI BLink). |
chr1:20542779-20545612 FORWARD LENGTH=915
Length = 915
Score = 271 bits (694), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 185/486 (38%), Positives = 268/486 (55%), Gaps = 62/486 (12%)
Query: 42 VFGDPEIFPRVGEKYQVEIPPLTSKSDHSWFQRKIPKKEGGSLNKYLVGLPIPIIWIKDE 101
V GDP++ RVG++YQVEIPP+ S+S + + E S + VGLP+ ++WI+
Sbjct: 18 VCGDPKVDIRVGDEYQVEIPPMMSESQRAELL--LNPLEFDSSCSFAVGLPVEVMWIE-- 73
Query: 102 VESNKPDLLKNECKFIGVANKIESSGGECIKEIQIVKKLNPNLEAIDSTLVNGVHLGGLE 161
+C+ G+ + I +N +L++ L G
Sbjct: 74 ----------TKCR-----------DGDGLGSDNI--DMNESLKS----LKRKRSRRGGS 106
Query: 162 NSNAQQETKIGMHDKLRGGGDCLVPGSASDAWNEIEEASFTLGLYIFGKNLDQVKRFIGN 221
+ N+ + ++ + VP +S +W ++E F LGLY FGKN QV++ + +
Sbjct: 107 DGNSGSKRRMNLE---------AVPEKSSSSWEDLEVDGFVLGLYTFGKNFAQVQKLLES 157
Query: 222 KKMGDVLLFYYGKFYKSEKYQRWSRCRKMRSRKCIFGQKIFTGPRQQELLSRLLPNVSEE 281
K G++LLFYYGKFY S KY+ WS K RS +CI G+K+++ R Q LLSRL+ ++++E
Sbjct: 158 KATGEILLFYYGKFYGSAKYKTWSNYLKKRSTRCIQGKKLYSDWRLQLLLSRLIRSITDE 217
Query: 282 GR-SRLLEVSKTFVEGKILLEDYVSILKASXXXXXXXXXXXXXXXXXDLTGLTTDSVKPT 340
+ +L++VSK+F EGK LE+Y++ +K DLT LTT V
Sbjct: 218 SKEQKLVDVSKSFAEGKKSLEEYINAVKKLVGLRCLVEAVAIGKDKEDLTVLTTKPVDVE 277
Query: 341 QTLPVHPEIPAGKA-CSMLTHSEIISFLTGNFRLSKARTSDLFWEAVWPRLLARGWHSEQ 399
Q V +PAG + LT II L+G R+SKAR +D+FW+AVWPRLL RGW SE
Sbjct: 278 QWFRVSSAVPAGLGEYNSLTVEGIIEKLSGGSRVSKARCNDIFWDAVWPRLLHRGWRSEL 337
Query: 400 PGGSNYAYASKNPLVFLVPGVKKFSR-KLVKGNHYYDSVSDVLGKVASDPELIELETIAD 458
P Y SK +VFLVPGVKKFSR KLVK +HY+DS+SD+L KV S+PEL+E
Sbjct: 338 PKDQGY-IKSKEHIVFLVPGVKKFSRKKLVKRDHYFDSISDILKKVVSEPELLEETA--- 393
Query: 459 NDCTGKEGNGCTKDTKPDHENSPDRPRHCYLKVKTPNRIADGMKFTVVDTS-LASE-KMT 516
++ + + N + +HCYL ++P+ + MKFTVVDTS AS K+
Sbjct: 394 -----------EEEREENTYNQSKQEKHCYL--RSPSSSSTHMKFTVVDTSRFASRGKLY 440
Query: 517 KVRELR 522
+ RELR
Sbjct: 441 EFRELR 446
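The "Identities / Positives / Gaps" lines in the alignments above can be checked by recomputing each percentage from its fraction. The sketch below assumes the line layout shown in this report. The names `parse_alignment_stats` and `floor_percent` are hypothetical. In this report the printed percentages agree with integer floor division; that is an observation about these lines, not a statement about how BLAST rounds in general.

```python
import re

# Matches e.g. "Identities = 203/487 (41%)" within a statistics line.
STATS = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_alignment_stats(line):
    """Map each statistic name to (count, alignment_length, printed_percent)."""
    return {
        name: (int(count), int(length), int(percent))
        for name, count, length, percent in STATS.findall(line)
    }


def floor_percent(count, length):
    """Percentage truncated toward zero using integer arithmetic."""
    return count * 100 // length


if __name__ == "__main__":
    line = "Identities = 203/487 (41%), Positives = 281/487 (57%), Gaps = 50/487 (10%)"
    for name, (count, length, printed) in parse_alignment_stats(line).items():
        print(name, count, length, printed, floor_percent(count, length))
```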