Miyakogusa Predicted Gene
- Lj5g3v1872900.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v1872900.1 Non Chatacterized Hit- tr|I1NH92|I1NH92_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.39746
PE,84.45,0,DUF668,Protein of unknown function DUF668; DUF3475,Protein
of unknown function DUF3475; seg,NULL; co,CUFF.56247.1
(654 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G34320.1 | Symbols: | Protein of unknown function (DUF668) |... 912 0.0
AT5G08660.1 | Symbols: | Protein of unknown function (DUF668) |... 627 e-180
AT1G30755.1 | Symbols: | Protein of unknown function (DUF668) |... 499 e-141
AT5G51670.1 | Symbols: | Protein of unknown function (DUF668) |... 123 4e-28
AT3G23160.1 | Symbols: | Protein of unknown function (DUF668) |... 95 2e-19
AT5G04550.1 | Symbols: | Protein of unknown function (DUF668) |... 76 7e-14
>AT1G34320.1 | Symbols: | Protein of unknown function (DUF668) |
chr1:12520713-12524046 FORWARD LENGTH=657
Length = 657
Score = 912 bits (2357), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 469/668 (70%), Positives = 535/668 (80%), Gaps = 27/668 (4%)
Query: 1 MGGICSRSWKATVDGVAVDNALDGSSRNANGHA--NNEPGTTYQTGHINSNSNLRPIDDE 58
MGG+CSRS +V+NA G+ + NGH NN +G + P+ +
Sbjct: 1 MGGLCSRS-------SSVNNAPGGTFAHVNGHHLNNNASDLNSHSGESGLKDDPSPVTEN 53
Query: 59 SD--KHQRESFSFSALDKLSYGTSGDDINDGIPSLSRALSHKSRSTRSRQAAV-KVSEVS 115
D KH ESFSF + S G+ +I DGIP LSR LS KSRST+SRQAAV KVSEVS
Sbjct: 54 VDDNKHTSESFSFPIV---SSGSHPQNIEDGIPRLSRVLSQKSRSTKSRQAAVAKVSEVS 110
Query: 116 SLLGRAGTAGLGKAVEVLDTLGSSMTNLNLSSGFTSGVATKGNKISILAFEVANTIVKGA 175
SLLGRAGT GLGKAV+VLDTLGSSMTNLNLS GF+S KGNKISIL+FEVANTIVKGA
Sbjct: 111 SLLGRAGTMGLGKAVDVLDTLGSSMTNLNLSGGFSSATTVKGNKISILSFEVANTIVKGA 170
Query: 176 NLMQSLSQENIKHLKEVVLPSEGVQNLISRDMNELLRIAAADKREELKIFSGEVVRFGNR 235
NLM SLS+++I HLKEVVLPSEGVQNLIS+DM+ELLRIAAADKREEL+IFSGEVVRFGNR
Sbjct: 171 NLMHSLSKDSITHLKEVVLPSEGVQNLISKDMDELLRIAAADKREELRIFSGEVVRFGNR 230
Query: 236 CKDPQWHNLDRYFEKLGSELTPQKQLKEEAEVVMQQLMTYVQYTAELYHELHALDRFDQD 295
CKDPQ+HNLDR+F++LGSE TPQK LK+EAE +M Q+M++V +TA+LYHELHALDRF+QD
Sbjct: 231 CKDPQYHNLDRFFDRLGSEFTPQKHLKQEAETIMHQMMSFVHFTADLYHELHALDRFEQD 290
Query: 296 YRRKLQEEDNSNATQR--GDSLAILRSELKSQKKHVRNLKKKSLWSKILEEVMEKFVDVV 353
Y+RK+QEE+N + QR GD+LAILR+ELKSQKKHVRNLKKKSLWS+ILEEVMEK VDVV
Sbjct: 291 YQRKIQEEENPSTAQRGVGDTLAILRTELKSQKKHVRNLKKKSLWSRILEEVMEKLVDVV 350
Query: 354 NFLYLEIHEAFGSADPDKQVKDSQGNHKKLGSAGLALHYANIITQIDTLVSRSSSVPPNT 413
+FL+LEIHEAFG ADPDK D NHKKLGSAGLALHYANIITQIDTLVSRSS++P +T
Sbjct: 351 HFLHLEIHEAFGGADPDKPANDPPINHKKLGSAGLALHYANIITQIDTLVSRSSTMPAST 410
Query: 414 RDTLYQGLPPNVKSALRTRLQSFQVKEELTVQQIKAEMEKTLQWLVPIATNTTKAHHGFG 473
RD LYQGLPP++KSALR+R+QSFQVKEELTV QIKAEMEKTLQWLVP+ATNTTKAHHGFG
Sbjct: 411 RDALYQGLPPSIKSALRSRIQSFQVKEELTVPQIKAEMEKTLQWLVPVATNTTKAHHGFG 470
Query: 474 WVGEWANTGSDVNRKPAGQTDLLRIETLHHADKEKTEAYILELVVWLHHLVSQVRVGNG- 532
WVGEWA++GS+ N++PAGQT +LRI+TLHHADKEKTEAYIL+LVVWLHHLV+QVR G
Sbjct: 471 WVGEWASSGSEANQRPAGQT-ILRIDTLHHADKEKTEAYILDLVVWLHHLVTQVRATTGY 529
Query: 533 GIRSPAKSPIRSPTQKTGQLF--THKGCSTSPMLTFEDQQMLRDVSKRKLTPGISKSQEF 590
G+RSP KSPIRSP QKT QL +H P+LT EDQ+MLRDVSKR+ TPGISKSQEF
Sbjct: 530 GLRSPVKSPIRSPNQKTIQLSSGSHNPSMGLPLLTTEDQEMLRDVSKRRKTPGISKSQEF 589
Query: 591 ST-AKTRLXX----XXXXXXXXXXXXXXXXXNDMFSTRRVLPSVPVIDFDIDRMKALDVI 645
T AK RL D FSTRR SVP+IDFDIDRMKALDVI
Sbjct: 590 ETVAKARLCKHHRLSKSSSHSPMMGEMMKNKKDTFSTRRP-SSVPIIDFDIDRMKALDVI 648
Query: 646 DRVDTIGS 653
DRVDTI S
Sbjct: 649 DRVDTIRS 656
>AT5G08660.1 | Symbols: | Protein of unknown function (DUF668) |
chr5:2814526-2817845 FORWARD LENGTH=649
Length = 649
Score = 627 bits (1617), Expect = e-180, Method: Compositional matrix adjust.
Identities = 347/656 (52%), Positives = 450/656 (68%), Gaps = 28/656 (4%)
Query: 1 MGGICSRSWKATVDGVAVDNALDGSSRNANGH----ANNEPGTTYQTGHINSNSNLRPID 56
MG CS+S G+ + GSS +G +++P QT I +
Sbjct: 1 MGSFCSKSL-----GINFGSEYSGSSVADDGREPDFGHSQPNG--QTSLIVPGMRQLMVK 53
Query: 57 DESDKHQ-RESFSFSALDKLSYGTSGDDINDGIPSLSRALSHKSRSTRSRQAAV-KVSEV 114
D +++Q ++ FSF + + D+ DGIP+ + A S K RS +S Q AV KV+E
Sbjct: 54 DVKEQNQLKDVFSFRERE------AEDNFYDGIPTYTMAPSQKIRSAKSTQTAVSKVTEA 107
Query: 115 SSLLGRAGTAGLGKAVEVLDTLGSSMTNLNLSSGFTSGVATKGNKISILAFEVANTIVKG 174
S LLG+AG LG+A +VLDTLGSSMT+L+ S GFTSGVATKGN++ ILAFEVANTIVK
Sbjct: 108 SKLLGKAG---LGRAKDVLDTLGSSMTDLS-SGGFTSGVATKGNELGILAFEVANTIVKS 163
Query: 175 ANLMQSLSQENIKHLKEVVLPSEGVQNLISRDMNELLRIAAADKREELKIFSGEVVRFGN 234
+NL++SLS+ NI+HLK +L SEGVQNL+S D +ELLR+ AADKR+EL++FSGEVVRFGN
Sbjct: 164 SNLIESLSKRNIEHLKGTILYSEGVQNLVSNDFDELLRLVAADKRQELQVFSGEVVRFGN 223
Query: 235 RCKDPQWHNLDRYFEKLGSELTPQKQLKEEAEVVMQQLMTYVQYTAELYHELHALDRFDQ 294
R KD QWHNL RYF+++ ELTPQ+QLKE+A +V+ QLM VQYTAELY EL L R ++
Sbjct: 224 RSKDFQWHNLQRYFDRISKELTPQRQLKEDAVLVVDQLMVLVQYTAELYQELQVLYRLEK 283
Query: 295 DYRRKLQEEDNSNATQRGDSLAILRSELKSQKKHVRNLKKKSLWSKILEEVMEKFVDVVN 354
DY +K +EE+NS + +GD LAIL++ELK+Q+K V++LKKKSLWS+ EEVMEK VD+V+
Sbjct: 284 DYEQKRREEENSANSSKGDGLAILKTELKAQRKVVKSLKKKSLWSRGFEEVMEKLVDIVH 343
Query: 355 FLYLEIHEAFGSADPDKQVKDSQGNHKKLGSAGLALHYANIITQIDTLVSRSSSVPPNTR 414
FL LEIH FG AD K + K+LG AGLALHYANII QIDTLV+R+SS+ N R
Sbjct: 344 FLLLEIHNIFGGADDQPSKKGAAEYDKRLGPAGLALHYANIIVQIDTLVARASSITSNAR 403
Query: 415 DTLYQGLPPNVKSALRTRLQSFQVKEELTVQQIKAEMEKTLQWLVPIATNTTKAHHGFGW 474
D+LYQ LPP +K ALR++++SF V +EL+V QIK EME+TL WLVP+A NTTKAHHGFGW
Sbjct: 404 DSLYQSLPPGIKLALRSKIKSFNVDKELSVTQIKDEMERTLHWLVPVAGNTTKAHHGFGW 463
Query: 475 VGEWANTGSDVNRKPAGQTDLLRIETLHHADKEKTEAYILELVVWLHHLVSQVRVGNGGI 534
VGEWANTG+D KP+G D+LRIETL+HA KEKTE YIL ++WL HLV++ + G
Sbjct: 464 VGEWANTGTDFTSKPSG-GDILRIETLYHASKEKTEIYILGQIIWLQHLVTKAKSDARG- 521
Query: 535 RSPAKSPIRSPTQKTGQLFTHKGCSTSPMLTFEDQQMLRDVSKRKLTPGISKSQEFSTAK 594
P S I+SP T Q + S P++T E+Q+ML++ SKRK TP +SKSQ+F +
Sbjct: 522 -GPRLSSIKSPLDTTNQQLISEPLSV-PIVTDEEQKMLQEASKRKRTPCVSKSQDFDSEY 579
Query: 595 TRLXXXXXXXXXXXXXXXXXXXNDMFSTRRVLPSVPVIDFDIDRMKALDVIDRVDT 650
+R + +R P++DF ID+ K LDVIDRVD
Sbjct: 580 SRARKCDPLSKSSEYFRGVRRSKSA-AVKRYSSGFPLLDFAIDKEKVLDVIDRVDV 634
>AT1G30755.1 | Symbols: | Protein of unknown function (DUF668) |
chr1:10905991-10908773 REVERSE LENGTH=615
Length = 615
Score = 499 bits (1285), Expect = e-141, Method: Compositional matrix adjust.
Identities = 260/498 (52%), Positives = 349/498 (70%), Gaps = 19/498 (3%)
Query: 105 RQAAVKVSEVSSLLGRAGTAGLGKAVEVLDTLGSSMTNLNLSSGFTSGV-ATKGNKISIL 163
R + K + +S +GRAG GL KAVEVLDTLGSSMT +N S+ + SGV +++G K++IL
Sbjct: 78 RNDSTKFMQRNSFMGRAGVMGLEKAVEVLDTLGSSMTRMNPSNAYLSGVTSSRGGKVTIL 137
Query: 164 AFEVANTIVKGANLMQSLSQENIKHLKEVVLPSEGVQNLISRDMNELLRIAAADKREELK 223
AFEVANTI KGA L+QSLS+EN+K +K+ +L SE V+ L+S D EL +AA+DKREEL
Sbjct: 138 AFEVANTIAKGAALLQSLSEENLKFMKKDMLHSEEVKKLVSTDTTELQILAASDKREELD 197
Query: 224 IFSGEVVRFGNRCKDPQWHNLDRYFEKLGSELTPQKQLKEEAEVVMQQLMTYVQYTAELY 283
+FSGEV+RFGN CKD QWHNLDRYF KL +E + K LK++AE MQ+L+T + T+ELY
Sbjct: 198 LFSGEVIRFGNMCKDLQWHNLDRYFMKLDTENSQHKLLKDDAEARMQELVTLARITSELY 257
Query: 284 HELHALDRFDQDYRRKLQEEDNSNATQRGDSLAILRSELKSQKKHVRNLKKKSLWSKILE 343
HEL ALDRF+QDYRRKL E ++ N +RG+ + IL++ELK QKK V++L+KKSLWS+ L
Sbjct: 258 HELQALDRFEQDYRRKLAEVESLNLPRRGEGIVILQNELKQQKKLVKSLQKKSLWSQNLA 317
Query: 344 EVMEKFVDVVNFLYLEIHEAFGSADPDKQVKDSQGNH--KKLGSAGLALHYANIITQIDT 401
E++EK VDVV+++ I E FG + ++D++G ++LG AGL+LHYAN+I QID
Sbjct: 318 EIIEKLVDVVSYIRQTIVEVFG----NNGLRDNEGEQGRERLGEAGLSLHYANLIQQIDN 373
Query: 402 LVSRSSSVPPNTRDTLYQGLPPNVKSALRTRLQSFQVKEELTVQQIKAEMEKTLQWLVPI 461
+ SR SS+P N RDTLY LP VK+ALR RLQ+ +EEL+V +IKAEMEK+LQWLVP
Sbjct: 374 IASRPSSLPSNVRDTLYNALPATVKTALRPRLQTLDQEEELSVPEIKAEMEKSLQWLVPF 433
Query: 462 ATNTTKAHHGFGWVGEWANTGSDVNR---KPAGQTDLLRIETLHHADKEKTEAYILELVV 518
A NTTKAH GFGWVGEWAN+ + + K + R++TLHHADK ++Y+LELVV
Sbjct: 434 AENTTKAHQGFGWVGEWANSRIEFGKGKGKGENNGNPTRLQTLHHADKPIVDSYVLELVV 493
Query: 519 WLHHLVSQVRVGNGGIRSPAKSPIRSPTQKTGQLFTHKGCSTSPMLTFEDQQMLRD---- 574
WLH L+ + G++ + + P +T ++ S SP T+++Q L D
Sbjct: 494 WLHRLMKSSKKRAHGVKLQETNHVSPPNNRT---ISNTQLSLSPDFTYKNQLSLEDRLLL 550
Query: 575 --VSKRKLTPGISKSQEF 590
V + P +SKSQE
Sbjct: 551 DRVQSIRFGPNLSKSQEL 568
>AT5G51670.1 | Symbols: | Protein of unknown function (DUF668) |
chr5:20993560-20995096 FORWARD LENGTH=474
Length = 474
Score = 123 bits (309), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 105/428 (24%), Positives = 191/428 (44%), Gaps = 63/428 (14%)
Query: 153 VATKGNKISILAFEVANTIVKGANLMQSLSQENIKHLKEVVLPSEGVQNLISRDMNELLR 212
++T + + +L+FEVA + K +L SL+ N+ ++ L EG+ +++ D L
Sbjct: 30 ISTTTSSVGVLSFEVARVMTKLLHLTHSLTDSNLLTPRDHSLSLEGLTKIVNGDETFHLS 89
Query: 213 IAAADKREELKIFSGEVVRFGNRCKDPQ---WHNLDRYFEKLGSELTPQKQLKEEAEVVM 269
+ A+ + L + V R NRC +H L F +G + ++ E
Sbjct: 90 LVCAELADSLAHAANSVSRLSNRCTTASLRSFHRLFHEFADMGRDPHGWVMNCKDTEAKN 149
Query: 270 QQLMTYVQYTAELYHELHALDRFDQDYRRK-----LQEEDNSNATQRGDSLAI--LRSEL 322
+++ YV T LY E+ + + R++ ++ E+ + + D + + L++++
Sbjct: 150 KKIERYVSVTTALYREMEEMAILENSLRKQSLQIGIEFEEEEDYENKKDVMKVIDLQNKI 209
Query: 323 KSQKKHVRNLKKKSLWSKILEEVMEKFVDVVNFLYLEIHEAFGSA--------------- 367
+ QK+HV+ LK +SLW+K + V+ V + F SA
Sbjct: 210 ERQKQHVKYLKDRSLWNKSFDTVVLILARSVFTALARLKSVFSSAAATGYMVPTVVSSLP 269
Query: 368 ---------------DPDKQVKDSQGN---------------HKKLGSAGLALHYANIIT 397
P+ + +D LG AG+ALHYAN+I
Sbjct: 270 RSLSSSSSSMNLVHPSPNDEERDKTTTSSAFLEESSRLLKPPETTLGGAGVALHYANLIV 329
Query: 398 QIDTLVSRSSSVPPNTRDTLYQGLPPNVKSALRTRLQ--SFQVKEELTVQQIKAEMEKTL 455
++ ++ + V + RD LY LP +V+S+LR+RL+ F + + KA + + L
Sbjct: 330 VMEKMIKQPQLVGLDARDDLYSMLPASVRSSLRSRLKGVGFTATDGGLATEWKAALGRIL 389
Query: 456 QWLVPIATNTTKAHHGFGWVGEWANTGSDVNRKPAGQTDLLRIETLHHADKEKTEAYILE 515
+WL+P+A N + W E + + Q ++ ++TL ADK KTEA I E
Sbjct: 390 RWLLPLAQNMIR------WQSERSFEQQHMATATNSQNRVMLVQTLVFADKVKTEAAITE 443
Query: 516 LVVWLHHL 523
L+V L+++
Sbjct: 444 LLVGLNYI 451
>AT3G23160.1 | Symbols: | Protein of unknown function (DUF668) |
chr3:8260059-8261654 REVERSE LENGTH=531
Length = 531
Score = 94.7 bits (234), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 62/209 (29%), Positives = 110/209 (52%), Gaps = 9/209 (4%)
Query: 160 ISILAFEVANTIVKGANLMQSLSQENIKHLKEVVLPSEGVQNLISRDMNELLRIAAADKR 219
I IL+FEVAN + K +L +SLS I LK V SEGV+ L+S D N LL ++ ++K
Sbjct: 43 IGILSFEVANVMSKTIHLHRSLSDTEISKLKAEVFHSEGVRKLVSSDENHLLDLSVSEKL 102
Query: 220 EELKIFSGEVVRFGNRCKDPQWHNLDRYFEKLGSELTPQKQLK---EEAEVVMQQLMTYV 276
++L + V R G +C +P + +E + + ++L ++ E +++++ +V
Sbjct: 103 DDLSRVASVVSRLGKKCNEPALQGFEHVYEDIVNGAIDFRKLGFLVKDMESMVKKMERFV 162
Query: 277 QYTAELYHELHALDRFDQDYRRKLQEEDNSNATQRGDSLAILRSELKSQKKHVRNLKKKS 336
T LY E+ ++ +Q KLQ + Q +S+ +L Q++ V++L+ S
Sbjct: 163 NATCSLYCEMEVMNELEQAI-VKLQR-----SQQHQESVKAFEQKLMWQRQDVKSLRDGS 216
Query: 337 LWSKILEEVMEKFVDVVNFLYLEIHEAFG 365
LW++ ++V+E V +Y I FG
Sbjct: 217 LWNQTYDKVVEMLARTVCTIYGRIETVFG 245
Score = 75.5 bits (184), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 78/146 (53%), Gaps = 13/146 (8%)
Query: 383 LGSAGLALHYANIITQIDTLVSRSSSVPPNTRDTLYQGLPPNVKSALRTRLQSF----QV 438
+G + L+LHYAN++ ++ L+ + RD LYQ LP ++K+ L+ L+S+ +
Sbjct: 361 IGGSALSLHYANVVIVVEKLLKYPHLIGEEARDDLYQMLPTSLKTTLKASLRSYLKNISI 420
Query: 439 KEELTVQQIKAEMEKTLQWLVPIATNTTKAHHGFGWVGEWANTGSDVNRKPAGQTDLLRI 498
+ K ++ L WL P+A N + W E + + +T++L +
Sbjct: 421 YDAPLAHDWKETIDGILSWLAPLAHNMIR------WQSE---RNFEQQNQIVKRTNVLLL 471
Query: 499 ETLHHADKEKTEAYILELVVWLHHLV 524
+TL+ AD+EKTEA I +L+V L+++
Sbjct: 472 QTLYFADREKTEAAICKLLVGLNYIC 497
>AT5G04550.1 | Symbols: | Protein of unknown function (DUF668) |
chr5:1303757-1305556 REVERSE LENGTH=599
Length = 599
Score = 76.3 bits (186), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 48/148 (32%), Positives = 79/148 (53%), Gaps = 17/148 (11%)
Query: 383 LGSAGLALHYANIITQIDTLVSRSSSVPPNTRDTLYQGLPPNVKSALRTRLQSFQ----- 437
LG+A LALHYAN+I I+ V+ + + RD LY LP +V+++LR RL+ +
Sbjct: 433 LGTACLALHYANVIIVIERFVASPHLIGDDARDDLYNMLPASVRTSLRERLKPYSKNLSS 492
Query: 438 --VKEELTVQQIKAEMEKTLQWLVPIATNTTKAHHGFGWVGEWANTGSDVNRKPAGQTDL 495
V + ++ M L+WL P+A N K W + S ++ +T +
Sbjct: 493 STVYDPGLAREWTDAMAGILEWLGPLAHNMIK----------WQSERSYEHQSLVSRTHI 542
Query: 496 LRIETLHHADKEKTEAYILELVVWLHHL 523
+ +TL A+++KTEA I EL+V L+++
Sbjct: 543 VLAQTLFFANQQKTEAIITELLVGLNYV 570
Score = 70.9 bits (172), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 51/209 (24%), Positives = 104/209 (49%), Gaps = 9/209 (4%)
Query: 160 ISILAFEVANTIVKGANLMQSLSQENIKHLKEVVLPSEGVQNLISRDMNELLRIAAADKR 219
+ +LAFEVA+ + K +L QSLS +N+ L++ + S G++ L+S D + ++R+ +
Sbjct: 26 LGVLAFEVASLLSKLVHLWQSLSDKNVARLRDEITHSTGIKKLVSEDDDFIVRLIRDEMM 85
Query: 220 EELKIFSGEVVRFGNRCKDPQWHNLDRYFE---KLGSELTPQKQLKEEAEVVMQQLMTYV 276
E ++ + V R +C DP+ + F K G++ + ++ + +++ ++
Sbjct: 86 ENVENVAKAVARLARKCNDPKLKCFENCFSDMMKTGADPYGWQFGWKKMDKKAKKMERFI 145
Query: 277 QYTAELYHELHALDRFDQDYRRKLQEEDNSNATQRGDSLAILRSELKSQKKHVRNLKKKS 336
A LY E L +Q ++R + N +AT D+L + ++ ++ V+NL+ S
Sbjct: 146 SSNASLYQETEILADLEQTFKRM---KSNESAT---DNLLEYQKKVTWKRHEVKNLRDVS 199
Query: 337 LWSKILEEVMEKFVDVVNFLYLEIHEAFG 365
LW++ + + V V + FG
Sbjct: 200 LWNRTYDYTVILLVRSVFTILSRTKHVFG 228