Miyakogusa Predicted Gene
- Lj1g3v4913230.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v4913230.1 Non Characterized Hit- tr|I1JQJ5|I1JQJ5_SOYBN
Uncharacterized protein OS=Glycine max PE=4 SV=1,68.05,0,SANT,SANT
domain; seg,NULL; FAMILY NOT NAMED,NULL;
Homeodomain-like,Homeodomain-like,CUFF.33565.1
(842 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                     Score    E
Sequences producing significant alignments:                         (bits)  Value
AT2G47820.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul...     377  e-104
AT2G47820.1 | Symbols: | unknown protein; BEST Arabidopsis thal...     377  e-104
AT1G09040.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...     322  9e-88
AT1G09050.1 | Symbols: | unknown protein; BEST Arabidopsis thal...     315  7e-86
AT1G55050.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul...     273  5e-73
AT1G55050.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...     273  5e-73
>AT2G47820.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 17 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT1G09040.1). | chr2:19588122-19590629 FORWARD
LENGTH=805
Length = 805
Score = 377 bits (967), Expect = e-104, Method: Compositional matrix adjust.
Identities = 221/499 (44%), Positives = 287/499 (57%), Gaps = 65/499 (13%)
Query: 22 LHGVDDKFGDPEVLPRVGDEYQVEIPSLTAAPYLSQLAKKTKDSEIRVNVPEP-----FS 76
L+G+ GDP+VLPRVGD+YQ ++P L L + + I EP +
Sbjct: 26 LNGI---HGDPDVLPRVGDQYQADLPVL--------LTESDRLKLITCFHSEPPLQKLLT 74
Query: 77 LGLPIPIMWAHC-SFGCENSESVTSGEGKVSSEH----ECTKVKGGNLGGFSNSQNSSKS 131
GLPIP+MW F + V + C K + L K
Sbjct: 75 FGLPIPLMWTRSEKFRGFREADIDKASPPVDDQSLQNAACMKPRSIVLALPCQKNAKFKF 134
Query: 132 DETDIDSCKGLKTELNQPRGKYLLPGLLDDQPWTDIEYDSFLLGLYVFGKNLNLLKRFVG 191
D D + Y PG L QPW D E + FLLGLY GKNL L++RFVG
Sbjct: 135 DWLD--------------KTLYPFPGTLG-QPWEDAEQERFLLGLYCLGKNLVLVQRFVG 179
Query: 192 SKKMGDILSFYYGKFFRSKGHSRWSECRKSRTRRCAHGQKIFMGWRQQESLSRLFSHVPG 251
SK MGD+LS+YYG F+RS + RW + RKSR+RR GQK+ GWRQQE LSR+ SHV
Sbjct: 180 SKHMGDMLSYYYGSFYRSTEYRRWVDGRKSRSRRSVQGQKLLSGWRQQELLSRISSHVSE 239
Query: 252 ECQTLLVEISRNFVEKKILFEEYIFALKDAVGIELLIAAVGIGKGKHDLTGTAVEPSKTY 311
EC+ L+++S+ F E KI E+Y+F LK+ VGI++L +GIGKGK DLT A+EP+K
Sbjct: 240 ECKITLLKVSKAFREDKIALEDYVFTLKNTVGIDMLTQVIGIGKGKRDLTNCALEPTKLN 299
Query: 312 H-------VSVRPELPIGKACSSLTSADIIKFLTGNFRLSKARSSDLFWEAVWPRLLANG 364
H V +R +LPI ADI+KFLTG +R+SK RSSDLFWEAVWPRLLA G
Sbjct: 300 HGASGNSQVRIRNDLPI---------ADIVKFLTGEYRMSKTRSSDLFWEAVWPRLLARG 350
Query: 365 WHSEQPMDQVVSGSKQSLVFLIPGVKKFSKRKLVKGNHYFDSISDMLNKVASDPGLLET- 423
WHSEQP D G K SLVFL+P KFS+RK+ KGNHYFDS++D+LNKVA DP LLE
Sbjct: 351 WHSEQPKD----GPKNSLVFLVPEANKFSRRKMSKGNHYFDSLTDVLNKVALDPTLLELD 406
Query: 424 EIQATEGSSDGGKRQD-----KRDVDGLPNG-QQCHYLQSRSKCN--EDLAKLTIIDTSM 475
E +GS + + D + D PN ++ YLQ RSK +++ TIIDTS
Sbjct: 407 EDLERKGSKEEVIKNDPPTNLEEFDDSSPNSKKKKKYLQPRSKTRKIQEVMLFTIIDTSE 466
Query: 476 VHNMDQRKVRQMRSLSFQT 494
++++ ++++RSL T
Sbjct: 467 TNSIEGCTLKELRSLPVGT 485
Score = 54.7 bits (130), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 42/122 (34%), Positives = 64/122 (52%), Gaps = 9/122 (7%)
Query: 676 TSSQAKDSVTENHMVGEVSAENSETRMLIDLNFPQVSPEF---GIDLEIPS--SQNDNQC 730
TSS A+DS ++ E+S E SE+R DLN Q+S E G D + +++ C
Sbjct: 635 TSSFARDSSCRRNIDREISPERSESREDFDLNVSQISLEREADGTDTVMADVVQNSESSC 694
Query: 731 ANTLSSQSEITQLNAMHDFPDDNKEQQSTIVNLRQSTRNRPLTTKALEALQYRFLNSTKR 790
A S Q ++ + P + + + RQSTR RPLTTKALEA + +L ++ +
Sbjct: 695 AEQSSVQVDVEKQCK----PQELQVTADLLPERRQSTRTRPLTTKALEAFAFGYLGNSNK 750
Query: 791 KR 792
+R
Sbjct: 751 ER 752
>AT2G47820.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G09040.1); Has 628 Blast hits to 543 proteins
in 149 species: Archae - 0; Bacteria - 106; Metazoa -
145; Fungi - 69; Plants - 97; Viruses - 10; Other
Eukaryotes - 201 (source: NCBI BLink). |
chr2:19588122-19590629 FORWARD LENGTH=805
Length = 805
Score = 377 bits (967), Expect = e-104, Method: Compositional matrix adjust.
Identities = 221/499 (44%), Positives = 287/499 (57%), Gaps = 65/499 (13%)
Query: 22 LHGVDDKFGDPEVLPRVGDEYQVEIPSLTAAPYLSQLAKKTKDSEIRVNVPEP-----FS 76
L+G+ GDP+VLPRVGD+YQ ++P L L + + I EP +
Sbjct: 26 LNGI---HGDPDVLPRVGDQYQADLPVL--------LTESDRLKLITCFHSEPPLQKLLT 74
Query: 77 LGLPIPIMWAHC-SFGCENSESVTSGEGKVSSEH----ECTKVKGGNLGGFSNSQNSSKS 131
GLPIP+MW F + V + C K + L K
Sbjct: 75 FGLPIPLMWTRSEKFRGFREADIDKASPPVDDQSLQNAACMKPRSIVLALPCQKNAKFKF 134
Query: 132 DETDIDSCKGLKTELNQPRGKYLLPGLLDDQPWTDIEYDSFLLGLYVFGKNLNLLKRFVG 191
D D + Y PG L QPW D E + FLLGLY GKNL L++RFVG
Sbjct: 135 DWLD--------------KTLYPFPGTLG-QPWEDAEQERFLLGLYCLGKNLVLVQRFVG 179
Query: 192 SKKMGDILSFYYGKFFRSKGHSRWSECRKSRTRRCAHGQKIFMGWRQQESLSRLFSHVPG 251
SK MGD+LS+YYG F+RS + RW + RKSR+RR GQK+ GWRQQE LSR+ SHV
Sbjct: 180 SKHMGDMLSYYYGSFYRSTEYRRWVDGRKSRSRRSVQGQKLLSGWRQQELLSRISSHVSE 239
Query: 252 ECQTLLVEISRNFVEKKILFEEYIFALKDAVGIELLIAAVGIGKGKHDLTGTAVEPSKTY 311
EC+ L+++S+ F E KI E+Y+F LK+ VGI++L +GIGKGK DLT A+EP+K
Sbjct: 240 ECKITLLKVSKAFREDKIALEDYVFTLKNTVGIDMLTQVIGIGKGKRDLTNCALEPTKLN 299
Query: 312 H-------VSVRPELPIGKACSSLTSADIIKFLTGNFRLSKARSSDLFWEAVWPRLLANG 364
H V +R +LPI ADI+KFLTG +R+SK RSSDLFWEAVWPRLLA G
Sbjct: 300 HGASGNSQVRIRNDLPI---------ADIVKFLTGEYRMSKTRSSDLFWEAVWPRLLARG 350
Query: 365 WHSEQPMDQVVSGSKQSLVFLIPGVKKFSKRKLVKGNHYFDSISDMLNKVASDPGLLET- 423
WHSEQP D G K SLVFL+P KFS+RK+ KGNHYFDS++D+LNKVA DP LLE
Sbjct: 351 WHSEQPKD----GPKNSLVFLVPEANKFSRRKMSKGNHYFDSLTDVLNKVALDPTLLELD 406
Query: 424 EIQATEGSSDGGKRQD-----KRDVDGLPNG-QQCHYLQSRSKCN--EDLAKLTIIDTSM 475
E +GS + + D + D PN ++ YLQ RSK +++ TIIDTS
Sbjct: 407 EDLERKGSKEEVIKNDPPTNLEEFDDSSPNSKKKKKYLQPRSKTRKIQEVMLFTIIDTSE 466
Query: 476 VHNMDQRKVRQMRSLSFQT 494
++++ ++++RSL T
Sbjct: 467 TNSIEGCTLKELRSLPVGT 485
Score = 54.7 bits (130), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 42/122 (34%), Positives = 64/122 (52%), Gaps = 9/122 (7%)
Query: 676 TSSQAKDSVTENHMVGEVSAENSETRMLIDLNFPQVSPEF---GIDLEIPS--SQNDNQC 730
TSS A+DS ++ E+S E SE+R DLN Q+S E G D + +++ C
Sbjct: 635 TSSFARDSSCRRNIDREISPERSESREDFDLNVSQISLEREADGTDTVMADVVQNSESSC 694
Query: 731 ANTLSSQSEITQLNAMHDFPDDNKEQQSTIVNLRQSTRNRPLTTKALEALQYRFLNSTKR 790
A S Q ++ + P + + + RQSTR RPLTTKALEA + +L ++ +
Sbjct: 695 AEQSSVQVDVEKQCK----PQELQVTADLLPERRQSTRTRPLTTKALEAFAFGYLGNSNK 750
Query: 791 KR 792
+R
Sbjct: 751 ER 752
>AT1G09040.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: membrane;
EXPRESSED IN: leaf; BEST Arabidopsis thaliana protein
match is: unknown protein (TAIR:AT1G09050.1); Has 614
Blast hits to 567 proteins in 104 species: Archae - 2;
Bacteria - 12; Metazoa - 344; Fungi - 31; Plants - 81;
Viruses - 0; Other Eukaryotes - 144 (source: NCBI
BLink). | chr1:2912362-2915174 FORWARD LENGTH=911
Length = 911
Score = 322 bits (824), Expect = 9e-88, Method: Compositional matrix adjust.
Identities = 195/479 (40%), Positives = 276/479 (57%), Gaps = 56/479 (11%)
Query: 18 VEEDLHGVDDKF--GDPEVLPRVGDEYQVEIPSLTAAP----YLSQLAKKTKDSEIRVNV 71
EED + DD+F GDP+V PRVGDE+QV+IP + +A +LS S
Sbjct: 19 TEEDSY--DDEFPCGDPQVEPRVGDEFQVDIPPMMSATKRAVFLSTPVALDDSSY----- 71
Query: 72 PEPFSLGLPIPIMWAHC-----SFGCENSESVTSGEGKVSSEHECTKVKGGNLGGFSNSQ 126
F +GLP+ +MW G +N + S + + + C+ + G S+
Sbjct: 72 --SFLIGLPVQVMWIDKHRRGQGNGDDNVDMNQSLKSLRAKKSRCS----AKIRGKSDKN 125
Query: 127 NSSKSDETDIDSCKGLKTELNQPRGKYLLPGLLDDQPWTDIEYDSFLLGLYVFGKNLNLL 186
+ +K +++++ ++ W D+E SF+LGLY FGKN +
Sbjct: 126 SETKKQRSNLEAVP-----------------VIPSSSWEDLEVASFVLGLYTFGKNFTQV 168
Query: 187 KRFVGSKKMGDILSFYYGKFFRSKGHSRWSECRKSRTRRCAHGQKIFMGWRQQESLSRLF 246
K F+ +K +G+I+ FYYGKF+ S + WSE RK R R+C G+ ++ GWRQQ+ L+RL
Sbjct: 169 KNFMENKGIGEIMLFYYGKFYNSAKYHSWSESRKKRNRKCVFGRTLYSGWRQQQLLTRLM 228
Query: 247 SHVPGECQT-LLVEISRNFVEKKILFEEYIFALKDAVGIELLIAAVGIGKGKHDLTGTAV 305
+P E Q +LV++S++F E I E+Y+ A+K+ VG+ LL+ AV IGK K DLT
Sbjct: 229 PSIPDEPQKQILVDVSKSFAEGTITLEKYVSAVKNLVGLRLLVDAVAIGKEKEDLTVPTS 288
Query: 306 EPSKT---YHVSVRPELPIGKA-CSSLTSADIIKFLTGNFRLSKARSSDLFWEAVWPRLL 361
P KT + VS + L G+ +SLTSA II LTG RLSKAR +D+FW AVWPRLL
Sbjct: 289 TPMKTKPWFTVSSKSSLVPGEGDYNSLTSAGIINQLTGCSRLSKARCNDIFWGAVWPRLL 348
Query: 362 ANGWHSEQPMDQVVSGSKQSLVFLIPGVKKFSKRKLVKGNHYFDSISDMLNKVASDPGLL 421
A GWHS+QP D+ SK +VF++PGVKKFS+++LVKG+HYFDS+SD+L KV S+P LL
Sbjct: 349 ARGWHSQQPEDRGYFKSKDYIVFIVPGVKKFSRQELVKGDHYFDSVSDILTKVVSEPELL 408
Query: 422 ETEIQ--ATEGSSDGGKRQDKRDVDGLPNGQQCH-YLQSRSKCNEDLA-KLTIIDTSMV 476
E E A E SS DK D + +P+ H YL+S L K T++DTS+
Sbjct: 409 ENETGGVAAELSS------DKSDEESVPSDSLRHRYLRSPCSNRGTLGMKFTVVDTSLA 461
>AT1G09050.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G09040.1); Has 552 Blast hits to 499 proteins
in 115 species: Archae - 0; Bacteria - 86; Metazoa -
259; Fungi - 14; Plants - 77; Viruses - 0; Other
Eukaryotes - 116 (source: NCBI BLink). |
chr1:2918031-2920858 FORWARD LENGTH=916
Length = 916
Score = 315 bits (808), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 184/426 (43%), Positives = 255/426 (59%), Gaps = 32/426 (7%)
Query: 18 VEEDLHGVDDKF--GDPEVLPRVGDEYQVEIPSLTAAPYLSQLAKKTKDSEIRVNVPEPF 75
+EED + DD+F GDP+V PRVGDE+QV+IP + +A S+ A + + F
Sbjct: 19 IEEDSY--DDEFPCGDPQVEPRVGDEFQVDIPLMMSA---SKRAVFLSNPVALDDSTCSF 73
Query: 76 SLGLPIPIMWA-HCSFGCENSESVTSGEGKVSSEHECTKVKGGNLGGFSNSQNSSKSDET 134
+GLP+ +MW G N G+G V ++ G +++ KSD+
Sbjct: 74 LVGLPVQVMWIDKVGIGQGN------GDGNVDMNQSLKSLRAKK--GRCSAKIRGKSDKN 125
Query: 135 DIDSCKGLKTELNQPRGKYLLPGLLDDQPWTDIEYDSFLLGLYVFGKNLNLLKRFVGSKK 194
+ L E +P + W D+E SF+LGLY FGKN + F+ +K
Sbjct: 126 SETKKQRLNLEA--------VPAIPSSS-WDDLEVASFVLGLYTFGKNFTQMNNFMENKG 176
Query: 195 MGDILSFYYGKFFRSKGHSRWSECRKSRTRRCAHGQKIFMGWRQQESLSRLFSHVPGECQ 254
+G+I+ FYYGKF+ S + WSE RK R R+C +G+K++ GWRQQ+ L+RL +P E Q
Sbjct: 177 IGEIMLFYYGKFYNSAKYHTWSESRKKRNRKCVYGRKLYSGWRQQQLLTRLMPSIPDEPQ 236
Query: 255 T-LLVEISRNFVEKKILFEEYIFALKDAVGIELLIAAVGIGKGKHDLTGTAVEPSKT--- 310
+LV++S++F E I E+Y+ A+K+ VG+ LL+ AV IGK K DLT P KT
Sbjct: 237 KQMLVDVSKSFAEGTITLEKYVSAVKNLVGLRLLVDAVAIGKEKEDLTVPTSTPMKTKPW 296
Query: 311 YHVSVRPELPIGKA-CSSLTSADIIKFLTGNFRLSKARSSDLFWEAVWPRLLANGWHSEQ 369
+ VS + L G+ +SLTSA II LTG RLSKAR +D+FW AVWPRLLA GW S+Q
Sbjct: 297 FTVSSKSSLVPGEGDYNSLTSAGIINQLTGCSRLSKARCNDIFWGAVWPRLLARGWRSQQ 356
Query: 370 PMDQVVSGSKQSLVFLIPGVKKFSKRKLVKGNHYFDSISDMLNKVASDPGLLETEIQ--A 427
P D+ SK +VF++PGVKKFS+++LVKG+HYFDS+SD+L KV S+P LLE E A
Sbjct: 357 PEDRGYFKSKDYIVFIVPGVKKFSRQELVKGDHYFDSVSDILTKVVSEPELLENETGGVA 416
Query: 428 TEGSSD 433
E SD
Sbjct: 417 AENPSD 422
>AT1G55050.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: cultured cell;
BEST Arabidopsis thaliana protein match is: unknown
protein (TAIR:AT1G09040.1); Has 30201 Blast hits to
17322 proteins in 780 species: Archae - 12; Bacteria -
1396; Metazoa - 17338; Fungi - 3422; Plants - 5037;
Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
BLink). | chr1:20542779-20545612 FORWARD LENGTH=915
Length = 915
Score = 273 bits (697), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 166/407 (40%), Positives = 226/407 (55%), Gaps = 52/407 (12%)
Query: 26 DDKF--GDPEVLPRVGDEYQVEIPSLTAAPYLSQLAKKTKDSEIRVNVPE-----PFSLG 78
D++F GDP+V RVGDEYQVEIP P +S+ ++ +E+ +N E F++G
Sbjct: 14 DEEFVCGDPKVDIRVGDEYQVEIP-----PMMSE----SQRAELLLNPLEFDSSCSFAVG 64
Query: 79 LPIPIMWAHCSFGCENSESVTSGEGKVSSEHECTKVKGGNLGGFSNSQNSSKSDETDIDS 138
LP+ +MW C + + + S ++ + K K GG D
Sbjct: 65 LPVEVMWIETK--CRDGDGLGSDNIDMNESLKSLKRKRSRRGG--------------SDG 108
Query: 139 CKGLKTELNQPRGKYLLPGLLDDQP------WTDIEYDSFLLGLYVFGKNLNLLKRFVGS 192
G K +N L+ P W D+E D F+LGLY FGKN +++ + S
Sbjct: 109 NSGSKRRMN-----------LEAVPEKSSSSWEDLEVDGFVLGLYTFGKNFAQVQKLLES 157
Query: 193 KKMGDILSFYYGKFFRSKGHSRWSECRKSRTRRCAHGQKIFMGWRQQESLSRLFSHVPGE 252
K G+IL FYYGKF+ S + WS K R+ RC G+K++ WR Q LSRL + E
Sbjct: 158 KATGEILLFYYGKFYGSAKYKTWSNYLKKRSTRCIQGKKLYSDWRLQLLLSRLIRSITDE 217
Query: 253 C-QTLLVEISRNFVEKKILFEEYIFALKDAVGIELLIAAVGIGKGKHDLTGTAVEPSKTY 311
+ LV++S++F E K EEYI A+K VG+ L+ AV IGK K DLT +P
Sbjct: 218 SKEQKLVDVSKSFAEGKKSLEEYINAVKKLVGLRCLVEAVAIGKDKEDLTVLTTKPVDVE 277
Query: 312 H-VSVRPELPIGKA-CSSLTSADIIKFLTGNFRLSKARSSDLFWEAVWPRLLANGWHSEQ 369
V +P G +SLT II+ L+G R+SKAR +D+FW+AVWPRLL GW SE
Sbjct: 278 QWFRVSSAVPAGLGEYNSLTVEGIIEKLSGGSRVSKARCNDIFWDAVWPRLLHRGWRSEL 337
Query: 370 PMDQVVSGSKQSLVFLIPGVKKFSKRKLVKGNHYFDSISDMLNKVAS 416
P DQ SK+ +VFL+PGVKKFS++KLVK +HYFDSISD+L KV S
Sbjct: 338 PKDQGYIKSKEHIVFLVPGVKKFSRKKLVKRDHYFDSISDILKKVVS 384
>AT1G55050.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: cultured cell;
BEST Arabidopsis thaliana protein match is: unknown
protein (TAIR:AT1G09040.1); Has 2440 Blast hits to 1999
proteins in 271 species: Archae - 0; Bacteria - 138;
Metazoa - 960; Fungi - 166; Plants - 162; Viruses - 14;
Other Eukaryotes - 1000 (source: NCBI BLink). |
chr1:20542779-20545612 FORWARD LENGTH=915
Length = 915
Score = 273 bits (697), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 166/407 (40%), Positives = 226/407 (55%), Gaps = 52/407 (12%)
Query: 26 DDKF--GDPEVLPRVGDEYQVEIPSLTAAPYLSQLAKKTKDSEIRVNVPE-----PFSLG 78
D++F GDP+V RVGDEYQVEIP P +S+ ++ +E+ +N E F++G
Sbjct: 14 DEEFVCGDPKVDIRVGDEYQVEIP-----PMMSE----SQRAELLLNPLEFDSSCSFAVG 64
Query: 79 LPIPIMWAHCSFGCENSESVTSGEGKVSSEHECTKVKGGNLGGFSNSQNSSKSDETDIDS 138
LP+ +MW C + + + S ++ + K K GG D
Sbjct: 65 LPVEVMWIETK--CRDGDGLGSDNIDMNESLKSLKRKRSRRGG--------------SDG 108
Query: 139 CKGLKTELNQPRGKYLLPGLLDDQP------WTDIEYDSFLLGLYVFGKNLNLLKRFVGS 192
G K +N L+ P W D+E D F+LGLY FGKN +++ + S
Sbjct: 109 NSGSKRRMN-----------LEAVPEKSSSSWEDLEVDGFVLGLYTFGKNFAQVQKLLES 157
Query: 193 KKMGDILSFYYGKFFRSKGHSRWSECRKSRTRRCAHGQKIFMGWRQQESLSRLFSHVPGE 252
K G+IL FYYGKF+ S + WS K R+ RC G+K++ WR Q LSRL + E
Sbjct: 158 KATGEILLFYYGKFYGSAKYKTWSNYLKKRSTRCIQGKKLYSDWRLQLLLSRLIRSITDE 217
Query: 253 C-QTLLVEISRNFVEKKILFEEYIFALKDAVGIELLIAAVGIGKGKHDLTGTAVEPSKTY 311
+ LV++S++F E K EEYI A+K VG+ L+ AV IGK K DLT +P
Sbjct: 218 SKEQKLVDVSKSFAEGKKSLEEYINAVKKLVGLRCLVEAVAIGKDKEDLTVLTTKPVDVE 277
Query: 312 H-VSVRPELPIGKA-CSSLTSADIIKFLTGNFRLSKARSSDLFWEAVWPRLLANGWHSEQ 369
V +P G +SLT II+ L+G R+SKAR +D+FW+AVWPRLL GW SE
Sbjct: 278 QWFRVSSAVPAGLGEYNSLTVEGIIEKLSGGSRVSKARCNDIFWDAVWPRLLHRGWRSEL 337
Query: 370 PMDQVVSGSKQSLVFLIPGVKKFSKRKLVKGNHYFDSISDMLNKVAS 416
P DQ SK+ +VFL+PGVKKFS++KLVK +HYFDSISD+L KV S
Sbjct: 338 PKDQGYIKSKEHIVFLVPGVKKFSRKKLVKRDHYFDSISDILKKVVS 384