Miyakogusa Predicted Gene
- Lj3g3v3557050.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v3557050.1 Non Chatacterized Hit- tr|I1LPE6|I1LPE6_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.57851
PE,54.3,0,Prefoldin,Prefoldin; seg,NULL; coiled-coil,NULL,CUFF.46006.1
(1267 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT2G32240.1 | Symbols: | FUNCTIONS IN: molecular_function unkno... 436 e-122
AT1G05320.3 | Symbols: | FUNCTIONS IN: molecular_function unkno... 201 4e-51
AT1G05320.2 | Symbols: | FUNCTIONS IN: molecular_function unkno... 201 4e-51
AT1G05320.1 | Symbols: | FUNCTIONS IN: molecular_function unkno... 201 4e-51
>AT2G32240.1 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: response to cadmium ion; LOCATED IN: plasma
membrane; EXPRESSED IN: 25 plant structures; EXPRESSED
DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s:
Prefoldin (InterPro:IPR009053); BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G05320.3); Has 470429 Blast hits to 168274
proteins in 4282 species: Archae - 6896; Bacteria -
131956; Metazoa - 175525; Fungi - 33166; Plants - 25441;
Viruses - 2243; Other Eukaryotes - 95202 (source: NCBI
BLink). | chr2:13684614-13689330 REVERSE LENGTH=1333
Length = 1333
Score = 436 bits (1121), Expect = e-122, Method: Compositional matrix adjust.
Identities = 371/911 (40%), Positives = 521/911 (57%), Gaps = 74/911 (8%)
Query: 1 MEEETKVIS-EVSVTKVAEDEAAHNDDSIKEMNGDSPKVITEGKKEDEENAFEGEFIKVE 59
MEE T+V S EV V K D+ D S+K +NG+ PK E + + FIKVE
Sbjct: 1 MEEATQVTSSEVPVVKGDVDDLKTADISVKAVNGEVPKEEKEEEDGE--------FIKVE 52
Query: 60 KE------------ENPLDDKSHKTERSSDPPGREFLEAQEKTRXXXXXXXXXXXXXKSS 107
KE P++++ ERSS RE E+QEK + K
Sbjct: 53 KEAFDAKDDAEKADHVPVEEQKEVIERSSSGSQRELHESQEKAKELELELERVAGELKRY 112
Query: 108 EHENAQLKGEISVTKDKLEESGKKFEELELSHKKLQEQIIEAESKYNLQLSTLEEALQSQ 167
E EN LK E+ K+KLEE+ KK +LE+ KK QE+I+E E +++ QL +LE+ALQS
Sbjct: 113 ESENTHLKDELLSAKEKLEETEKKHGDLEVVQKKQQEKIVEGEERHSSQLKSLEDALQSH 172
Query: 168 EVKQKELLHVKXXXXXXXXXXXXTRKRIEELQHELQLSADEAQKFEELHKQSGSHXXXXX 227
+ K KEL VK +RK++ EL+ L+ SA+EAQKFEELHKQS SH
Sbjct: 173 DAKDKELTEVKEAFDALGIELESSRKKLIELEEGLKRSAEEAQKFEELHKQSASHADSES 232
Query: 228 XXXXXXXXXXXXXXXXXXGVEEEMASLKEELKGVYDKIAENQKVEEALKTTEAELSTIXX 287
+EE+MASL++E+K + +K++EN+KVE ALK++ EL+ +
Sbjct: 233 QKALEFSELLKSTKESAKEMEEKMASLQQEIKELNEKMSENEKVEAALKSSAGELAAVQE 292
Query: 288 XXXXXXXXXXXVETRLSSRDSLVDELTQELNLIKTSETQLKEDISALQNLFASTKEELQE 347
E ++SS ++L+DELTQEL K SE++ KE++S LQ+L A TK LQ
Sbjct: 293 ELALSKSRLLETEQKVSSTEALIDELTQELEQKKASESRFKEELSVLQDLDAQTKG-LQA 351
Query: 348 KDSDLETSRLKLQEEEKSRESIEVAFKSQEAQFLIVQEELTKIKVEKESLEATVEDLNSN 407
K S+ E KL EE K +E +E K QE + E+L ++ EKE+LEA V ++ SN
Sbjct: 352 KLSEQEGINSKLAEELKEKELLESLSKDQEEKLRTANEKLAEVLKEKEALEANVAEVTSN 411
Query: 408 VKQFKELCTDLEEKLKLSDESFHKTDSLLSEAISNNAELEQKIKSLEDLHNESGTAAATA 467
V E+C +LEEKLK SDE+F KTD+LLS+A+SNN+ELEQK+KSLE+LH+E+G+AAA A
Sbjct: 412 VATVTEVCNELEEKLKTSDENFSKTDALLSQALSNNSELEQKLKSLEELHSEAGSAAAAA 471
Query: 468 NQRSLELEEHVQASNAVAEEVKSQMRELETRSIAAXXXXXXXXXXXXXXXXXINDAEREV 527
Q++LELE+ V++S+ AEE KSQ++ELET+ AA +DAERE+
Sbjct: 472 TQKNLELEDVVRSSSQAAEEAKSQIKELETKFTAAEQKNAELEQQLNLLQLKSSDAEREL 531
Query: 528 TEFSDKISHXXXXXXXXXXXXXXXXXXXXXHMEKVIQLESDVTQSSTRSSQLEQELKIAN 587
E S+K S + +K +LE +TQSS R+S+LE++L+IA
Sbjct: 532 KELSEKSSELQTAIEVAEEEKKQATTQMQEYKQKASELELSLTQSSARNSELEEDLRIAL 591
Query: 588 EKCAEHEDRSTMNHQRSLELEDLVQSSHSKWEDAGKKVSELELLLKTEQYRIQELEQQIS 647
+K AEHEDR+ HQRS+ELE L QSS SK EDA ++ +LELLL+TE+YRIQELE+Q+S
Sbjct: 592 QKGAEHEDRANTTHQRSIELEGLCQSSQSKHEDAEGRLKDLELLLQTEKYRIQELEEQVS 651
Query: 648 TLEKRCSDSEADANKSLDNVSYLTSELEASQARVSSLEITLQEANERERKLEDSFTAVTD 707
+LEK+ ++EAD+ L V+ L S LEA Q + SSLE L A E E++L ++ AVT
Sbjct: 652 SLEKKHGETEADSKGYLGQVAELQSTLEAFQVKSSSLEAALNIATENEKELTENLNAVTS 711
Query: 708 EKKMFEDASNSLSEKFAEAENLLAIVRDDLNLTQVKSTSLLNELQESLSKVASLESENED 767
EKK E + S K +E+ENLL +R++LN+TQ K S+ N+L K A L+ E+E
Sbjct: 712 EKKKLEATVDEYSVKISESENLLESIRNELNVTQGKLESIENDL-----KAAGLQ-ESEV 765
Query: 768 FKRQIREAEEKCSQSLAENELLVETNNQLKTKIDELQEALNLVVSEKEATAQELVSHKNT 827
++ ++ AEE Q E ID EAT + +
Sbjct: 766 MEK-LKSAEESLEQKGRE--------------ID-------------EATTKRM------ 791
Query: 828 MAELNDLQSRSSETLSANEARILEVESQLQEALQRHAEKEAEAKDLNEKLNALEGQIKLH 887
EL L ++LS ++ E +LQ+A++ +++EA L EKL LEG+IK +
Sbjct: 792 --ELEALH----QSLS------IDSEHRLQKAMEEFTSRDSEASSLTEKLRDLEGKIKSY 839
Query: 888 EEQAREAVAVS 898
EEQ EA S
Sbjct: 840 EEQLAEASGKS 850
>AT1G05320.3 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: fruit, egg
cell; CONTAINS InterPro DOMAIN/s: Prefoldin
(InterPro:IPR009053); BEST Arabidopsis thaliana protein
match is: unknown protein (TAIR:AT2G32240.1); Has 267444
Blast hits to 119656 proteins in 3886 species: Archae -
3699; Bacteria - 62432; Metazoa - 106542; Fungi - 19875;
Plants - 13918; Viruses - 1288; Other Eukaryotes - 59690
(source: NCBI BLink). | chr1:1554856-1557658 FORWARD
LENGTH=790
Length = 790
Score = 201 bits (510), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 228/729 (31%), Positives = 336/729 (46%), Gaps = 150/729 (20%)
Query: 558 HMEKVIQLESDVTQSSTRSSQLEQELKIANEKCAEHEDRSTMNHQRSLELEDLVQSSHSK 617
+ EKV +LES + QSS R+S+LE++L+IA +K AEHED ++ +RS+EL+ L Q+S K
Sbjct: 193 YQEKVSKLESSLNQSSARNSELEEDLRIALQKGAEHEDIGNVSTKRSVELQGLFQTSQLK 252
Query: 618 WEDAGKKVSELELLLKTEQYRIQELEQQISTLEKRCSDSEADANKSLDNVSYLTSELEAS 677
E A +K+ +L EA
Sbjct: 253 LEKAEEKLKDL----------------------------------------------EAI 266
Query: 678 QARVSSLEITLQEANERERKLEDSFTAVTDEKK----MFEDASNSLSEKFAEAENLLAIV 733
Q + SSLE TL A E+ER L ++ AV ++ K E + + E + L A+
Sbjct: 267 QVKNSSLEATLSVAMEKERDLSENLNAVMEKLKSSEERLEKQAREIDEATTRSIELEALH 326
Query: 734 RDDLNLTQVKSTSLLNELQESLSKVASLESENEDFKRQIREAE----EKCSQSL------ 783
+ +++K + + ++ SL +++D + +IR E E C QSL
Sbjct: 327 KH----SELKVQKTMEDFSSRDTEAKSLTEKSKDLEEKIRVYEGKLAEACGQSLSLQEEL 382
Query: 784 ----AENELLVETNNQLKTKIDELQEALNLVVSEKEATAQELVSHKNTMAELNDLQSRSS 839
AENELL +TNNQLK KI EL+ L+ SEKE
Sbjct: 383 DQSSAENELLADTNNQLKIKIQELEGYLD---SEKET----------------------- 416
Query: 840 ETLSANEARILEVESQLQEALQRHAEKEAEAKDLNEKLNALEGQIKLHEEQAREAVAVSE 899
A+++ +K+ EAKDL KL + E I+ H+ Q EA V++
Sbjct: 417 -------------------AIEKLNQKDTEAKDLITKLKSHENVIEEHKRQVLEASGVAD 457
Query: 900 TXXXXXXXXXXXXXXXXTVVDELQNKLLHHEKETAGLNEENTKLNQEIAIYESKLSDLQS 959
T + ++EL EKE L E N KLNQ++A S+ D Q+
Sbjct: 458 TRKVEVEEALLKLNTLESTIEEL-------EKENGDLAEVNIKLNQKLANQGSETDDFQA 510
Query: 960 KLSAALVEKDETVKEILTSKDAIEELVAKHSAEAETLKSQISSATEEKNLLNETNQNLKK 1019
KLS EK + KE+ + IE+L + ++E E L+SQISS EEKN +NE Q+ K
Sbjct: 511 KLSVLEAEKYQQAKELQIT---IEDLTKQLTSERERLRSQISSLEEEKNQVNEIYQSTKN 567
Query: 1020 ELQSLIFDXXXXXXXXXXXXXSLRSEVETLKIEISEKSVLQSRLKEIEEQLIKSESRLNE 1079
EL L + S++E L ++EKSVL+S+ +++E L E
Sbjct: 568 ELVKL----QAQLQVDKSKSDDMVSQIEKLSALVAEKSVLESKFEQVE-------IHLKE 616
Query: 1080 EVGNVKVAASQREIELSSKLEDYAQKINDRDVLNGKVAELEKELQLSQATIANQKVAESQ 1139
EV V EL+SKL+++ K +DRDVL K +L KELQ S I+ QK A S
Sbjct: 617 EVEKV--------AELTSKLQEHKHKASDRDVLEEKAIQLHKELQASHTAISEQKEALSH 668
Query: 1140 KL-ELEASLKNSLEELEIKKNEITLLQKQVTDLEQKLQLTSDKLSVKGEVGVDQKDGLEV 1198
K ELEA+LK S EEL+ KK+ I L+ ++ +LEQK++L K G +++ +EV
Sbjct: 669 KHSELEATLKKSQEELDAKKSVIVHLESKLNELEQKVKLADAKSKETESTG--KEEEVEV 726
Query: 1199 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHVQTSQASPVINFKFILGVALVSII 1258
+Q ++ ++ K LGVALVS+I
Sbjct: 727 KSRDSDLSFSNPKQTKIKKNLDAASSSGHVM-----IQKAETWHLMTLKIALGVALVSVI 781
Query: 1259 FGIILGKRY 1267
GII+GK Y
Sbjct: 782 LGIIVGKNY 790
>AT1G05320.2 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: fruit, egg
cell; CONTAINS InterPro DOMAIN/s: Prefoldin
(InterPro:IPR009053); BEST Arabidopsis thaliana protein
match is: unknown protein (TAIR:AT2G32240.1); Has 267650
Blast hits to 119772 proteins in 3899 species: Archae -
3706; Bacteria - 62589; Metazoa - 106546; Fungi - 19914;
Plants - 13918; Viruses - 1287; Other Eukaryotes - 59690
(source: NCBI BLink). | chr1:1554856-1557658 FORWARD
LENGTH=790
Length = 790
Score = 201 bits (510), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 228/729 (31%), Positives = 336/729 (46%), Gaps = 150/729 (20%)
Query: 558 HMEKVIQLESDVTQSSTRSSQLEQELKIANEKCAEHEDRSTMNHQRSLELEDLVQSSHSK 617
+ EKV +LES + QSS R+S+LE++L+IA +K AEHED ++ +RS+EL+ L Q+S K
Sbjct: 193 YQEKVSKLESSLNQSSARNSELEEDLRIALQKGAEHEDIGNVSTKRSVELQGLFQTSQLK 252
Query: 618 WEDAGKKVSELELLLKTEQYRIQELEQQISTLEKRCSDSEADANKSLDNVSYLTSELEAS 677
E A +K+ +L EA
Sbjct: 253 LEKAEEKLKDL----------------------------------------------EAI 266
Query: 678 QARVSSLEITLQEANERERKLEDSFTAVTDEKK----MFEDASNSLSEKFAEAENLLAIV 733
Q + SSLE TL A E+ER L ++ AV ++ K E + + E + L A+
Sbjct: 267 QVKNSSLEATLSVAMEKERDLSENLNAVMEKLKSSEERLEKQAREIDEATTRSIELEALH 326
Query: 734 RDDLNLTQVKSTSLLNELQESLSKVASLESENEDFKRQIREAE----EKCSQSL------ 783
+ +++K + + ++ SL +++D + +IR E E C QSL
Sbjct: 327 KH----SELKVQKTMEDFSSRDTEAKSLTEKSKDLEEKIRVYEGKLAEACGQSLSLQEEL 382
Query: 784 ----AENELLVETNNQLKTKIDELQEALNLVVSEKEATAQELVSHKNTMAELNDLQSRSS 839
AENELL +TNNQLK KI EL+ L+ SEKE
Sbjct: 383 DQSSAENELLADTNNQLKIKIQELEGYLD---SEKET----------------------- 416
Query: 840 ETLSANEARILEVESQLQEALQRHAEKEAEAKDLNEKLNALEGQIKLHEEQAREAVAVSE 899
A+++ +K+ EAKDL KL + E I+ H+ Q EA V++
Sbjct: 417 -------------------AIEKLNQKDTEAKDLITKLKSHENVIEEHKRQVLEASGVAD 457
Query: 900 TXXXXXXXXXXXXXXXXTVVDELQNKLLHHEKETAGLNEENTKLNQEIAIYESKLSDLQS 959
T + ++EL EKE L E N KLNQ++A S+ D Q+
Sbjct: 458 TRKVEVEEALLKLNTLESTIEEL-------EKENGDLAEVNIKLNQKLANQGSETDDFQA 510
Query: 960 KLSAALVEKDETVKEILTSKDAIEELVAKHSAEAETLKSQISSATEEKNLLNETNQNLKK 1019
KLS EK + KE+ + IE+L + ++E E L+SQISS EEKN +NE Q+ K
Sbjct: 511 KLSVLEAEKYQQAKELQIT---IEDLTKQLTSERERLRSQISSLEEEKNQVNEIYQSTKN 567
Query: 1020 ELQSLIFDXXXXXXXXXXXXXSLRSEVETLKIEISEKSVLQSRLKEIEEQLIKSESRLNE 1079
EL L + S++E L ++EKSVL+S+ +++E L E
Sbjct: 568 ELVKL----QAQLQVDKSKSDDMVSQIEKLSALVAEKSVLESKFEQVE-------IHLKE 616
Query: 1080 EVGNVKVAASQREIELSSKLEDYAQKINDRDVLNGKVAELEKELQLSQATIANQKVAESQ 1139
EV V EL+SKL+++ K +DRDVL K +L KELQ S I+ QK A S
Sbjct: 617 EVEKV--------AELTSKLQEHKHKASDRDVLEEKAIQLHKELQASHTAISEQKEALSH 668
Query: 1140 KL-ELEASLKNSLEELEIKKNEITLLQKQVTDLEQKLQLTSDKLSVKGEVGVDQKDGLEV 1198
K ELEA+LK S EEL+ KK+ I L+ ++ +LEQK++L K G +++ +EV
Sbjct: 669 KHSELEATLKKSQEELDAKKSVIVHLESKLNELEQKVKLADAKSKETESTG--KEEEVEV 726
Query: 1199 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHVQTSQASPVINFKFILGVALVSII 1258
+Q ++ ++ K LGVALVS+I
Sbjct: 727 KSRDSDLSFSNPKQTKIKKNLDAASSSGHVM-----IQKAETWHLMTLKIALGVALVSVI 781
Query: 1259 FGIILGKRY 1267
GII+GK Y
Sbjct: 782 LGIIVGKNY 790
>AT1G05320.1 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: fruit, egg
cell; CONTAINS InterPro DOMAIN/s: Prefoldin
(InterPro:IPR009053); BEST Arabidopsis thaliana protein
match is: unknown protein (TAIR:AT2G32240.1); Has 267650
Blast hits to 119772 proteins in 3899 species: Archae -
3706; Bacteria - 62589; Metazoa - 106546; Fungi - 19914;
Plants - 13918; Viruses - 1287; Other Eukaryotes - 59690
(source: NCBI BLink). | chr1:1554856-1557658 FORWARD
LENGTH=790
Length = 790
Score = 201 bits (510), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 228/729 (31%), Positives = 336/729 (46%), Gaps = 150/729 (20%)
Query: 558 HMEKVIQLESDVTQSSTRSSQLEQELKIANEKCAEHEDRSTMNHQRSLELEDLVQSSHSK 617
+ EKV +LES + QSS R+S+LE++L+IA +K AEHED ++ +RS+EL+ L Q+S K
Sbjct: 193 YQEKVSKLESSLNQSSARNSELEEDLRIALQKGAEHEDIGNVSTKRSVELQGLFQTSQLK 252
Query: 618 WEDAGKKVSELELLLKTEQYRIQELEQQISTLEKRCSDSEADANKSLDNVSYLTSELEAS 677
E A +K+ +L EA
Sbjct: 253 LEKAEEKLKDL----------------------------------------------EAI 266
Query: 678 QARVSSLEITLQEANERERKLEDSFTAVTDEKK----MFEDASNSLSEKFAEAENLLAIV 733
Q + SSLE TL A E+ER L ++ AV ++ K E + + E + L A+
Sbjct: 267 QVKNSSLEATLSVAMEKERDLSENLNAVMEKLKSSEERLEKQAREIDEATTRSIELEALH 326
Query: 734 RDDLNLTQVKSTSLLNELQESLSKVASLESENEDFKRQIREAE----EKCSQSL------ 783
+ +++K + + ++ SL +++D + +IR E E C QSL
Sbjct: 327 KH----SELKVQKTMEDFSSRDTEAKSLTEKSKDLEEKIRVYEGKLAEACGQSLSLQEEL 382
Query: 784 ----AENELLVETNNQLKTKIDELQEALNLVVSEKEATAQELVSHKNTMAELNDLQSRSS 839
AENELL +TNNQLK KI EL+ L+ SEKE
Sbjct: 383 DQSSAENELLADTNNQLKIKIQELEGYLD---SEKET----------------------- 416
Query: 840 ETLSANEARILEVESQLQEALQRHAEKEAEAKDLNEKLNALEGQIKLHEEQAREAVAVSE 899
A+++ +K+ EAKDL KL + E I+ H+ Q EA V++
Sbjct: 417 -------------------AIEKLNQKDTEAKDLITKLKSHENVIEEHKRQVLEASGVAD 457
Query: 900 TXXXXXXXXXXXXXXXXTVVDELQNKLLHHEKETAGLNEENTKLNQEIAIYESKLSDLQS 959
T + ++EL EKE L E N KLNQ++A S+ D Q+
Sbjct: 458 TRKVEVEEALLKLNTLESTIEEL-------EKENGDLAEVNIKLNQKLANQGSETDDFQA 510
Query: 960 KLSAALVEKDETVKEILTSKDAIEELVAKHSAEAETLKSQISSATEEKNLLNETNQNLKK 1019
KLS EK + KE+ + IE+L + ++E E L+SQISS EEKN +NE Q+ K
Sbjct: 511 KLSVLEAEKYQQAKELQIT---IEDLTKQLTSERERLRSQISSLEEEKNQVNEIYQSTKN 567
Query: 1020 ELQSLIFDXXXXXXXXXXXXXSLRSEVETLKIEISEKSVLQSRLKEIEEQLIKSESRLNE 1079
EL L + S++E L ++EKSVL+S+ +++E L E
Sbjct: 568 ELVKL----QAQLQVDKSKSDDMVSQIEKLSALVAEKSVLESKFEQVE-------IHLKE 616
Query: 1080 EVGNVKVAASQREIELSSKLEDYAQKINDRDVLNGKVAELEKELQLSQATIANQKVAESQ 1139
EV V EL+SKL+++ K +DRDVL K +L KELQ S I+ QK A S
Sbjct: 617 EVEKV--------AELTSKLQEHKHKASDRDVLEEKAIQLHKELQASHTAISEQKEALSH 668
Query: 1140 KL-ELEASLKNSLEELEIKKNEITLLQKQVTDLEQKLQLTSDKLSVKGEVGVDQKDGLEV 1198
K ELEA+LK S EEL+ KK+ I L+ ++ +LEQK++L K G +++ +EV
Sbjct: 669 KHSELEATLKKSQEELDAKKSVIVHLESKLNELEQKVKLADAKSKETESTG--KEEEVEV 726
Query: 1199 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHVQTSQASPVINFKFILGVALVSII 1258
+Q ++ ++ K LGVALVS+I
Sbjct: 727 KSRDSDLSFSNPKQTKIKKNLDAASSSGHVM-----IQKAETWHLMTLKIALGVALVSVI 781
Query: 1259 FGIILGKRY 1267
GII+GK Y
Sbjct: 782 LGIIVGKNY 790