Miyakogusa Predicted Gene
- Lj1g3v4277590.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v4277590.1 tr|A4RVW9|A4RVW9_OSTLU Predicted protein
OS=Ostreococcus lucimarinus (strain CCE9901)
GN=OSTLU_31127,27.1,3e-17,ARM repeat,Armadillo-type fold; seg,NULL; no
description,Armadillo-like helical,CUFF.32286.1
(432 letters)
Database: trembl
41,451,118 sequences; 13,208,986,710 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
I1N927_SOYBN (tr|I1N927) Uncharacterized protein OS=Glycine max ... 670 0.0
D7THZ6_VITVI (tr|D7THZ6) Putative uncharacterized protein OS=Vit... 502 e-139
M5VJB6_PRUPE (tr|M5VJB6) Uncharacterized protein OS=Prunus persi... 494 e-137
G7KTK5_MEDTR (tr|G7KTK5) Putative uncharacterized protein OS=Med... 475 e-131
B9SL51_RICCO (tr|B9SL51) Putative uncharacterized protein OS=Ric... 473 e-131
D7LVU5_ARALL (tr|D7LVU5) Binding protein OS=Arabidopsis lyrata s... 424 e-116
Q9SCL6_ARATH (tr|Q9SCL6) Putative uncharacterized protein T8H10.... 416 e-114
F4J3F3_ARATH (tr|F4J3F3) Armadillo/beta-catenin-like repeat-cont... 416 e-113
F4J3F2_ARATH (tr|F4J3F2) Armadillo/beta-catenin-like repeat-cont... 416 e-113
A2Q3M3_MEDTR (tr|A2Q3M3) Zinc finger, C2H2-type OS=Medicago trun... 410 e-112
M4CSZ8_BRARP (tr|M4CSZ8) Uncharacterized protein OS=Brassica rap... 407 e-111
K4BHB1_SOLLC (tr|K4BHB1) Uncharacterized protein OS=Solanum lyco... 303 8e-80
Q9SVX2_ARATH (tr|Q9SVX2) Putative uncharacterized protein F15B8.... 287 6e-75
M1AVC6_SOLTU (tr|M1AVC6) Uncharacterized protein OS=Solanum tube... 279 2e-72
R0FLZ2_9BRAS (tr|R0FLZ2) Uncharacterized protein OS=Capsella rub... 236 1e-59
A9SRE8_PHYPA (tr|A9SRE8) Predicted protein OS=Physcomitrella pat... 232 2e-58
A5BCL5_VITVI (tr|A5BCL5) Putative uncharacterized protein OS=Vit... 193 1e-46
D8RWI0_SELML (tr|D8RWI0) Putative uncharacterized protein OS=Sel... 192 2e-46
D8QZC6_SELML (tr|D8QZC6) Putative uncharacterized protein (Fragm... 169 2e-39
A2Q3M2_MEDTR (tr|A2Q3M2) Putative uncharacterized protein OS=Med... 150 8e-34
A5BCL6_VITVI (tr|A5BCL6) Putative uncharacterized protein OS=Vit... 139 1e-30
B9HA58_POPTR (tr|B9HA58) Predicted protein OS=Populus trichocarp... 129 2e-27
A4RVW9_OSTLU (tr|A4RVW9) Predicted protein OS=Ostreococcus lucim... 98 6e-18
L8GVB8_ACACA (tr|L8GVB8) Uncharacterized protein OS=Acanthamoeba... 94 1e-16
M1AVC4_SOLTU (tr|M1AVC4) Uncharacterized protein OS=Solanum tube... 69 4e-09
C1E222_MICSR (tr|C1E222) Predicted protein OS=Micromonas sp. (st... 68 7e-09
Q01B42_OSTTA (tr|Q01B42) WGS project CAID00000000 data, contig c... 66 3e-08
D2VH40_NAEGR (tr|D2VH40) Predicted protein OS=Naegleria gruberi ... 60 2e-06
>I1N927_SOYBN (tr|I1N927) Uncharacterized protein OS=Glycine max PE=4 SV=2
Length = 1101
Score = 670 bits (1728), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/437 (78%), Positives = 377/437 (86%), Gaps = 5/437 (1%)
Query: 1 MFTDPSNATVVKFLSYISEDLANVADLVLHHVMLHVREQKKIHESFLSRWESRTYTSDEF 60
MF DPSNAT+VKFLSYISE+LANVADLVLHHV+LHV+EQKKI ESFLSRWE RTYT DEF
Sbjct: 665 MFGDPSNATIVKFLSYISENLANVADLVLHHVLLHVKEQKKIDESFLSRWEQRTYTCDEF 724
Query: 61 EEMQRSLFERLCPLLIIKMLPMKTFNDLNSSIMYGKLGQNVIHDSDSGDTETSYKCIANL 120
EEMQ+SLFE LCPLLIIK+LP+KTFNDLNSSIMYG L QN+I D+ S DT+ Y CIA
Sbjct: 725 EEMQQSLFEHLCPLLIIKILPLKTFNDLNSSIMYGHLSQNIIQDAGSRDTDIDYDCIAAF 784
Query: 121 LLNRAFCEFEFEDVRKLSAELCGRIHPQVLIPFVCSKLEVAMDSKDIVKIKACLFSICTS 180
LLNRAFCEFEFE+VRKLSAELCGRIHPQVL+PFVCS LE A+DSK+++KIKACLFSICTS
Sbjct: 785 LLNRAFCEFEFEEVRKLSAELCGRIHPQVLLPFVCSLLERAVDSKNVLKIKACLFSICTS 844
Query: 181 LVVRGWESLSHPSMLAIRRMIETVLLWPCLNADSVSKAQHGCIDCLALMICAELQAKESI 240
L+VRGWESLSHPSM +IR+MIETVLLWPCLNADSVSKAQHGCIDCLALMICAELQAKESI
Sbjct: 845 LMVRGWESLSHPSMYSIRKMIETVLLWPCLNADSVSKAQHGCIDCLALMICAELQAKESI 904
Query: 241 KDSTPDRITVAEKKGNSVVTYVIHQFVNDK-EQASIPELGDG----VAAVPLSFRLCMGN 295
+S PD + KKGNSVVTYVI+QF N+K EQ S PE GD VAAV LSF LCMGN
Sbjct: 905 NNSIPDTVRALGKKGNSVVTYVINQFFNNKNEQTSTPEFGDENSEFVAAVSLSFCLCMGN 964
Query: 296 VLISASQKISKSCKKRFAAQVLPFLLHSLEFEMKSEIRAACIQVLFSAVYHLGYAVLPYA 355
VLIS QKIS+SCKK FAAQV+PFLLHSLEFE KSEIRAAC QVLFSAVYHL AVLPYA
Sbjct: 965 VLISTCQKISESCKKPFAAQVIPFLLHSLEFETKSEIRAACTQVLFSAVYHLRSAVLPYA 1024
Query: 356 SELLKISLKALRKESEKERMAGAKLIASLMASEDVILEKISGGLLEARXXXXXXXXXXXX 415
S+LL+++LKALRKES+KERMAGAKLIASLMASED+ILE IS GLL+AR
Sbjct: 1025 SDLLRMALKALRKESDKERMAGAKLIASLMASEDMILENISVGLLQARSVLSTISSSDPS 1084
Query: 416 XELRQLCSKLLACISSP 432
EL+QLC KLLACISSP
Sbjct: 1085 PELQQLCCKLLACISSP 1101
>D7THZ6_VITVI (tr|D7THZ6) Putative uncharacterized protein OS=Vitis vinifera
GN=VIT_08s0007g08180 PE=2 SV=1
Length = 1112
Score = 502 bits (1292), Expect = e-139, Method: Compositional matrix adjust.
Identities = 264/444 (59%), Positives = 334/444 (75%), Gaps = 15/444 (3%)
Query: 1 MFTDPSNATVVKFLSYISEDLANVADLVLHHVMLHVREQKKIHESFLSRWESRTYTSDEF 60
MF +PSNAT+V+FLSYISE LA AD+V H ++LH++ QK++ ESF ++WES+TY +D+
Sbjct: 668 MFAEPSNATLVRFLSYISEHLAEAADIVFHRILLHMKGQKELDESFFTKWESKTYAADDS 727
Query: 61 EEMQRSLFERLCPLLIIKMLPMKTFNDLNSSIMYGKL-GQNVIHDSDSGDTETSYKCIAN 119
++Q SLF+RLCPLL+I++LPM+ FNDLNSS++YG+L Q V+H S D ++C+A
Sbjct: 728 MKLQHSLFDRLCPLLVIRLLPMRVFNDLNSSVIYGQLPDQVVVHGYGSIDI-NDHECVAM 786
Query: 120 LLLNRAFCEFEFEDVRKLSAELCGRIHPQVLIPFVCSKLEVAMDSKDIVKIKACLFSICT 179
LLLNRA +FEFEDVRKL+AELCGRIHPQVL+P + S LE+A DS+DIVKIKACLFS+CT
Sbjct: 787 LLLNRALGKFEFEDVRKLAAELCGRIHPQVLLPILSSHLELAADSQDIVKIKACLFSVCT 846
Query: 180 SLVVRGWESLSHPSMLAIRRMIETVLLWPCLNADSVSKAQHGCIDCLALMICAELQAKES 239
SLV RG +SLS P+ML I++ I+T+LLWP L+ D VSKAQHGCIDCLALMIC ELQA +S
Sbjct: 847 SLVARGRDSLSQPAMLKIQKTIKTILLWPSLDGDEVSKAQHGCIDCLALMICTELQAPKS 906
Query: 240 IKDSTPDRITVAEKK--------GNSVVTYVIHQFVNDKEQASIPEL--GDGVA---AVP 286
S D+I++ K G+SVVTYVIHQ D +A+ + D A +VP
Sbjct: 907 FIGSVSDKISIIGKNFHPGDSALGDSVVTYVIHQLSLDAVEAASTSMLCSDNCASEPSVP 966
Query: 287 LSFRLCMGNVLISASQKISKSCKKRFAAQVLPFLLHSLEFEMKSEIRAACIQVLFSAVYH 346
LSFRLCM NVLISA QKIS S KK FA ++LP+L+H ++ SEIR AC+QVLFSAVYH
Sbjct: 967 LSFRLCMANVLISACQKISDSGKKAFARRILPYLIHFVQVIKDSEIRVACVQVLFSAVYH 1026
Query: 347 LGYAVLPYASELLKISLKALRKESEKERMAGAKLIASLMASEDVILEKISGGLLEARXXX 406
L +LPY+SELLK+SLK+L SEKERMAG KL+ASLMASED I+E IS GLLEAR
Sbjct: 1027 LKSMILPYSSELLKLSLKSLEGNSEKERMAGVKLMASLMASEDAIVENISEGLLEARLVL 1086
Query: 407 XXXXXXXXXXELRQLCSKLLACIS 430
E++Q+C KLLAC++
Sbjct: 1087 LSMYMADPSLEVQQMCQKLLACLT 1110
>M5VJB6_PRUPE (tr|M5VJB6) Uncharacterized protein OS=Prunus persica
GN=PRUPE_ppa000620mg PE=4 SV=1
Length = 1068
Score = 494 bits (1272), Expect = e-137, Method: Compositional matrix adjust.
Identities = 262/443 (59%), Positives = 319/443 (72%), Gaps = 22/443 (4%)
Query: 1 MFTDPSNATVVKFLSYISEDLANVADLVLHHVMLHVREQKKIHESFLSRWESRTYTSDEF 60
MF +PSNAT+VKFLSYISE LA AD VL V+LH + +++I E+ S E +TY SD+
Sbjct: 636 MFAEPSNATIVKFLSYISEHLAEAADAVLSCVLLHAKRREEIDENSFSGRECQTYRSDDS 695
Query: 61 EEMQRSLFERLCPLLIIKMLPMKTFNDLNSSIMYGKL-GQNVIHDSDSGDTET-SYKCIA 118
E+MQ++LFE LCPLLII+MLP++ FNDLNSSI+YG+L Q + HD GD S C+
Sbjct: 696 EKMQQTLFEHLCPLLIIRMLPLRVFNDLNSSIVYGQLFNQGIFHDC--GDINAISEDCVT 753
Query: 119 NLLLNRAFCEFEFEDVRKLSAELCGRIHPQVLIPFVCSKLEVAMDSKDIVKIKACLFSIC 178
LLL R FCEFEF DVRKL+AELCGR+HP+VLIP V S+LE+A S+DI+KIKA LFS+C
Sbjct: 754 ILLLKRTFCEFEFNDVRKLAAELCGRLHPKVLIPVVSSQLEIATGSRDILKIKASLFSVC 813
Query: 179 TSLVVRGWESLSHPSMLAIRRMIETVLLWPCLNADSVSKAQHGCIDCLALMICAELQAKE 238
TSLVVRG ESLSHP ML IR+ +ET+LLWP ++ D VSKAQHGCID LALMICAELQ
Sbjct: 814 TSLVVRGRESLSHPLMLKIRKTLETMLLWPSVDGDEVSKAQHGCIDSLALMICAELQ--- 870
Query: 239 SIKDSTPDRITVAEKK-----GNSVVTYVIHQFVNDKEQASIPELGDGV-----AAVPLS 288
P+ ++ KK GNSV+T VI++ + D Q + D V VPLS
Sbjct: 871 -----DPESFSIVGKKGDASSGNSVLTCVINKLIQDNHQPVLLSNLDDVKCSSEVPVPLS 925
Query: 289 FRLCMGNVLISASQKISKSCKKRFAAQVLPFLLHSLEFEMKSEIRAACIQVLFSAVYHLG 348
F +CM NVLISA QKI S KK F + LP L+HS++ SEIRAACIQVLFS+VYHL
Sbjct: 926 FYMCMANVLISACQKILDSGKKPFVRKTLPCLIHSVKVMTNSEIRAACIQVLFSSVYHLK 985
Query: 349 YAVLPYASELLKISLKALRKESEKERMAGAKLIASLMASEDVILEKISGGLLEARXXXXX 408
VLPY+++LL++SLKALRK SEKERMAGAKL+ SLMAS+D ILE ISGGL+EAR
Sbjct: 986 STVLPYSADLLEVSLKALRKGSEKERMAGAKLLGSLMASDDAILETISGGLVEARSILSS 1045
Query: 409 XXXXXXXXELRQLCSKLLACISS 431
ELRQ+C KLLAC+ S
Sbjct: 1046 ISSTDPSVELRQVCGKLLACLIS 1068
>G7KTK5_MEDTR (tr|G7KTK5) Putative uncharacterized protein OS=Medicago truncatula
GN=MTR_7g092150 PE=4 SV=1
Length = 524
Score = 475 bits (1222), Expect = e-131, Method: Compositional matrix adjust.
Identities = 251/356 (70%), Positives = 277/356 (77%), Gaps = 28/356 (7%)
Query: 105 SDSGDTETSYKCIANLLLNRAFCEFEFEDVRKLSAELCGRIHPQVLIPFVCSKLEVAMDS 164
S S E Y+CIA LLNRA CEFEFEDVRKLSAELCGRIHPQVL P +CSKL+ A+D
Sbjct: 169 SISRSAELDYECIAAFLLNRALCEFEFEDVRKLSAELCGRIHPQVLFPVICSKLDRAVDL 228
Query: 165 KDIVKIKACLFSICTSLVVRGWESLSHPSMLAIRRMIETVLLWPCLNADSVSKAQHGCID 224
K++ +IKACLFSICTSLVVRGWESLSHP + +I+RMIETVLLWPCLNADSVSK QHGCID
Sbjct: 229 KNVPEIKACLFSICTSLVVRGWESLSHPLVHSIKRMIETVLLWPCLNADSVSKVQHGCID 288
Query: 225 CLALMICAELQAKESIKDSTPDRITVAEKK--GNSVVTYVIHQFVNDKEQ-ASIPELG-- 279
CLALMI ELQA+ESI D PDR+ V KK GNS++TYV++QF NDKE+ +S PELG
Sbjct: 289 CLALMISVELQAEESITDYMPDRVLVIGKKAAGNSIITYVMNQFFNDKEELSSTPELGED 348
Query: 280 --DGVAAVPLSFRLCMGNVLISASQKISKSCKKRFAAQVLPFLLHSL------------- 324
+ VAAVPL FRLCMGNVLIS QKIS+SCKK FAAQVLPFLLHSL
Sbjct: 349 KCESVAAVPLYFRLCMGNVLISTCQKISESCKKLFAAQVLPFLLHSLKLIVLETLPSLLL 408
Query: 325 --------EFEMKSEIRAACIQVLFSAVYHLGYAVLPYASELLKISLKALRKESEKERMA 376
+FE +SEIRAACIQVLFSAVYHL AVLPYAS+LLKISLK+LRK+SEKERMA
Sbjct: 409 VMKLILRQQFEKRSEIRAACIQVLFSAVYHLRSAVLPYASDLLKISLKSLRKKSEKERMA 468
Query: 377 GAKLIASLMASEDVILEKISGGLLEARXXXXXXXXXXXXXELRQLCSKLLACISSP 432
GAKLIASLMASEDVILE IS GLLEAR EL+QLC KLLACISSP
Sbjct: 469 GAKLIASLMASEDVILENISVGLLEARSVLSTVSSSDPSHELQQLCRKLLACISSP 524
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/69 (71%), Positives = 58/69 (84%)
Query: 42 IHESFLSRWESRTYTSDEFEEMQRSLFERLCPLLIIKMLPMKTFNDLNSSIMYGKLGQNV 101
I ESFL+RWE R+Y+SDE+EEMQR+LFE LCPLLIIKMLPMKTF++LNSS+MYG L QN
Sbjct: 17 IDESFLARWECRSYSSDEYEEMQRTLFEHLCPLLIIKMLPMKTFDNLNSSVMYGHLSQNK 76
Query: 102 IHDSDSGDT 110
HD+ T
Sbjct: 77 THDTHGRQT 85
>B9SL51_RICCO (tr|B9SL51) Putative uncharacterized protein OS=Ricinus communis
GN=RCOM_0089360 PE=4 SV=1
Length = 1054
Score = 473 bits (1216), Expect = e-131, Method: Compositional matrix adjust.
Identities = 257/443 (58%), Positives = 321/443 (72%), Gaps = 14/443 (3%)
Query: 1 MFTDPSNATVVKFLSYISEDLANVADLVLHHVMLHVREQKKIHESFLSRWESRTYTSDEF 60
MF +P+NA +VKFLSYISE LA AD+VL++V+ ++ QK I+E LS W+SR+ +++
Sbjct: 613 MFAEPANAIIVKFLSYISERLAEAADVVLYYVLSQMKPQKGINEGLLSTWKSRSCNNEDL 672
Query: 61 EEMQRSLFERLCPLLIIKMLPMKTFNDLNSSIMYGKLGQNVIHDSDSGDTETSYKCIANL 120
+MQ++LFERLCPLLII++LP++ FNDL SS MYG+L VI + GD + CIA
Sbjct: 673 MKMQQTLFERLCPLLIIRLLPLRVFNDLESSTMYGQLPSQVI-TQECGDVNIADDCIAAF 731
Query: 121 LLNRAFCEFEFEDVRKLSAELCGRIHPQVLIPFVCSKLEVAMDSKDIVKIKACLFSICTS 180
LL RAF ++EFEDVRKL+AELCGR+HPQVL P V + LE A + DI+KIKACLF+ICTS
Sbjct: 732 LLQRAFNKYEFEDVRKLAAELCGRLHPQVLFPVVLTILENAANFHDILKIKACLFAICTS 791
Query: 181 LVVRGWESLSHPSMLAIRRMIETVLLWPCLNADSVSKAQHGCIDCLALMICAELQAKESI 240
LVV+G +S+ HP + IR+ IE VLLWP L+ D VSKAQHGCIDCLALMICAELQA ES+
Sbjct: 792 LVVKGKDSVYHPVIFQIRKTIEAVLLWPSLDGDEVSKAQHGCIDCLALMICAELQATESL 851
Query: 241 KDSTPDRITVAEK--------KGNSVVTYVIHQFVNDKEQASIPELG----DGVAAVPLS 288
KDS+ ++ +A K GNS + YVIHQ NDK + S+ L + A +P S
Sbjct: 852 KDSS-NKFRIAGKIIDSGKSTAGNSALAYVIHQLANDKNEVSVSSLNIENCEFEATIPCS 910
Query: 289 FRLCMGNVLISASQKISKSCKKRFAAQVLPFLLHSLEFEMKSEIRAACIQVLFSAVYHLG 348
RLCM N LISA QKIS S KK FA + LP L+HS+E EIRAACIQV+FSAVYHL
Sbjct: 911 LRLCMANALISACQKISDSGKKSFARRSLPNLIHSVEMISHPEIRAACIQVMFSAVYHLK 970
Query: 349 YAVLPYASELLKISLKALRKESEKERMAGAKLIASLMASEDVILEKISGGLLEARXXXXX 408
AV+PY+++LLK+SLK LRK S+KERMAGAKL+ASLMASED ILE IS GLLEAR
Sbjct: 971 SAVVPYSADLLKLSLKFLRKGSDKERMAGAKLMASLMASEDDILESISEGLLEARIVLSA 1030
Query: 409 XXXXXXXXELRQLCSKLLACISS 431
+L+ +C LLACI+S
Sbjct: 1031 ISSSDPSPDLQVVCKNLLACITS 1053
>D7LVU5_ARALL (tr|D7LVU5) Binding protein OS=Arabidopsis lyrata subsp. lyrata
GN=ARALYDRAFT_486206 PE=4 SV=1
Length = 1082
Score = 424 bits (1089), Expect = e-116, Method: Compositional matrix adjust.
Identities = 238/441 (53%), Positives = 313/441 (70%), Gaps = 22/441 (4%)
Query: 1 MFTDPSNATVVKFLSYISEDLANVADLVLHHVMLHVREQKKIHESFLSRWESRTYTSDEF 60
MF +PSNA +V+FLS ISE LA+ +DLVL HV+ H+++ K+ ESF+SR S T +S +
Sbjct: 651 MFLEPSNAIIVRFLSCISEYLADTSDLVLPHVLSHMKKLNKVDESFISR--SDTKSSVDK 708
Query: 61 EEMQRSLFERLCPLLIIKMLPMKTFNDLNSSIMYGKLGQNVIHDSDSGDTETSYK----- 115
+ ++SLF+ LCPLLI+++LP + F+D++SS +YGK SGD+ Y+
Sbjct: 709 AKSEKSLFDHLCPLLILRLLPQRVFDDIDSSTIYGKFL--------SGDSVNDYQDIKFE 760
Query: 116 ---CIANLLLNRAFCEFEFEDVRKLSAELCGRIHPQVLIPFVCSKLEVAMDSKDIVKIKA 172
CIA +L RAF +FEFE+VRKLSAELCGR+HPQVL P V +LE A + +D +KIKA
Sbjct: 761 DCQCIAAFILERAFSKFEFEEVRKLSAELCGRLHPQVLFPTVLLQLEKATELQDSLKIKA 820
Query: 173 CLFSICTSLVVRGWESLSHPSMLAIRRMIETVLLWPCLNADSVSKAQHGCIDCLALMICA 232
CLFSICTSLVVRGWES SH IR+++E +LLWP + D +SK QHGCIDCLALMICA
Sbjct: 821 CLFSICTSLVVRGWESFSHSVTPKIRKVLENILLWPSVE-DEISKVQHGCIDCLALMICA 879
Query: 233 ELQAKESIKDSTPD--RITVAEKKGNSVVTYVIHQFVNDKEQ-ASIPELGDGVAAVPLSF 289
ELQ +S+K S + R T + GNSV+ Y IH V D+ +SIP+L G +P+ F
Sbjct: 880 ELQDLKSLKTSGGEQMRTTEEDASGNSVLDYTIHCLVEDRSNCSSIPKLSTGENPLPIPF 939
Query: 290 RLCMGNVLISASQKISKSCKKRFAAQVLPFLLHSLEFEMKSEIRAACIQVLFSAVYHLGY 349
RLCM NV+ISA QKI +S KK FA + LP L+HSL+ E+RAACIQVLFSA+Y+L
Sbjct: 940 RLCMANVIISACQKIPESTKKTFARKALPPLVHSLKVISVPEVRAACIQVLFSAMYYLKS 999
Query: 350 AVLPYASELLKISLKALRKESEKERMAGAKLIASLMASEDVILEKISGGLLEARXXXXXX 409
+LP +S+LLK+SL+ L + SEKE++AGAKL+ASLMASED+ILE IS GLLEAR
Sbjct: 1000 TLLPVSSDLLKLSLRFLEQGSEKEKLAGAKLMASLMASEDMILENISEGLLEARSVLSKA 1059
Query: 410 XXXXXXXELRQLCSKLLACIS 430
++R++C+KLLACI+
Sbjct: 1060 SLSDPSQDVREVCAKLLACIT 1080
>Q9SCL6_ARATH (tr|Q9SCL6) Putative uncharacterized protein T8H10.170 OS=Arabidopsis
thaliana GN=T8H10.170 PE=4 SV=1
Length = 1057
Score = 416 bits (1069), Expect = e-114, Method: Compositional matrix adjust.
Identities = 236/445 (53%), Positives = 311/445 (69%), Gaps = 26/445 (5%)
Query: 1 MFTDPSNATVVKFLSYISEDLANVADLVLHHVMLHVREQKKIHESFLSRWESRTYTSDEF 60
MF +PSNA +V+FLS ISE LA+ +DLVL HV+ H+++Q K+ SF+SR S T +S +
Sbjct: 622 MFLEPSNAIMVRFLSCISESLADTSDLVLPHVLSHMKKQNKVDASFISR--SDTKSSVDK 679
Query: 61 EEMQRSLFERLCPLLIIKMLPMKTFNDLNSSIMYGKLGQNVIHDSDSGDTETSYK----- 115
+ ++SLF+ LCPLLI+++LP + F+D++SS +YGK SGD+ Y+
Sbjct: 680 TKSEKSLFDHLCPLLILRLLPQRVFDDIDSSTIYGKFL--------SGDSVNDYQDIKFE 731
Query: 116 ---CIANLLLNRAFCEFEFEDVRKLSAELCGRIHPQVLIPFVCSKLEVAMDSKDIVKIKA 172
CIA +L RAF +FEFE+VRKLSAELCGR+HPQVL P V +LE A + +D +KIKA
Sbjct: 732 DCQCIATFILERAFSKFEFEEVRKLSAELCGRLHPQVLFPTVLLQLEKATEIQDSLKIKA 791
Query: 173 CLFSICTSLVVRGWESLSHPSMLAIRRMIETVLLWPCLNADSVSKAQHGCIDCLALMICA 232
CLFSICTSL+VRGWESLSH IR+++E +LLWP + D +SK QHGCIDCLALMICA
Sbjct: 792 CLFSICTSLMVRGWESLSHRVTPKIRKVLENILLWPSVE-DEISKVQHGCIDCLALMICA 850
Query: 233 ELQAKESIKDSTPDRI--TVAEKKGNSVVTYVIHQFVNDKEQ-ASIPELGDGVAA----V 285
ELQ +S K S ++I T + G SV+ Y IH + D+ +SIP+L + +
Sbjct: 851 ELQHLKSSKTSGGEKIRSTGKDTSGYSVLDYTIHCLIEDRSNCSSIPKLSTDILTCENPL 910
Query: 286 PLSFRLCMGNVLISASQKISKSCKKRFAAQVLPFLLHSLEFEMKSEIRAACIQVLFSAVY 345
P+ FRLCM NV+ISA QK +S KK FA + LP L+HSL+ E+RAACIQVLFSA Y
Sbjct: 911 PIPFRLCMANVIISACQKNPESSKKTFARKALPPLIHSLKVISVPEVRAACIQVLFSATY 970
Query: 346 HLGYAVLPYASELLKISLKALRKESEKERMAGAKLIASLMASEDVILEKISGGLLEARXX 405
HL +LP +S+LLK+SL+ L + SEKE++AGAKL+ASLMASEDVILE IS GLLEAR
Sbjct: 971 HLKSTLLPVSSDLLKLSLRFLEQGSEKEKLAGAKLMASLMASEDVILENISEGLLEARSV 1030
Query: 406 XXXXXXXXXXXELRQLCSKLLACIS 430
++R++C+KLLACI+
Sbjct: 1031 LSKASLSDPSRDVREVCAKLLACIT 1055
>F4J3F3_ARATH (tr|F4J3F3) Armadillo/beta-catenin-like repeat-containing protein
OS=Arabidopsis thaliana GN=AT3G57570 PE=2 SV=1
Length = 1092
Score = 416 bits (1068), Expect = e-113, Method: Compositional matrix adjust.
Identities = 236/445 (53%), Positives = 311/445 (69%), Gaps = 26/445 (5%)
Query: 1 MFTDPSNATVVKFLSYISEDLANVADLVLHHVMLHVREQKKIHESFLSRWESRTYTSDEF 60
MF +PSNA +V+FLS ISE LA+ +DLVL HV+ H+++Q K+ SF+SR S T +S +
Sbjct: 657 MFLEPSNAIMVRFLSCISESLADTSDLVLPHVLSHMKKQNKVDASFISR--SDTKSSVDK 714
Query: 61 EEMQRSLFERLCPLLIIKMLPMKTFNDLNSSIMYGKLGQNVIHDSDSGDTETSYK----- 115
+ ++SLF+ LCPLLI+++LP + F+D++SS +YGK SGD+ Y+
Sbjct: 715 TKSEKSLFDHLCPLLILRLLPQRVFDDIDSSTIYGKFL--------SGDSVNDYQDIKFE 766
Query: 116 ---CIANLLLNRAFCEFEFEDVRKLSAELCGRIHPQVLIPFVCSKLEVAMDSKDIVKIKA 172
CIA +L RAF +FEFE+VRKLSAELCGR+HPQVL P V +LE A + +D +KIKA
Sbjct: 767 DCQCIATFILERAFSKFEFEEVRKLSAELCGRLHPQVLFPTVLLQLEKATEIQDSLKIKA 826
Query: 173 CLFSICTSLVVRGWESLSHPSMLAIRRMIETVLLWPCLNADSVSKAQHGCIDCLALMICA 232
CLFSICTSL+VRGWESLSH IR+++E +LLWP + D +SK QHGCIDCLALMICA
Sbjct: 827 CLFSICTSLMVRGWESLSHRVTPKIRKVLENILLWPSVE-DEISKVQHGCIDCLALMICA 885
Query: 233 ELQAKESIKDSTPDRI--TVAEKKGNSVVTYVIHQFVNDKEQ-ASIPELGDGV----AAV 285
ELQ +S K S ++I T + G SV+ Y IH + D+ +SIP+L + +
Sbjct: 886 ELQHLKSSKTSGGEKIRSTGKDTSGYSVLDYTIHCLIEDRSNCSSIPKLSTDILTCENPL 945
Query: 286 PLSFRLCMGNVLISASQKISKSCKKRFAAQVLPFLLHSLEFEMKSEIRAACIQVLFSAVY 345
P+ FRLCM NV+ISA QK +S KK FA + LP L+HSL+ E+RAACIQVLFSA Y
Sbjct: 946 PIPFRLCMANVIISACQKNPESSKKTFARKALPPLIHSLKVISVPEVRAACIQVLFSATY 1005
Query: 346 HLGYAVLPYASELLKISLKALRKESEKERMAGAKLIASLMASEDVILEKISGGLLEARXX 405
HL +LP +S+LLK+SL+ L + SEKE++AGAKL+ASLMASEDVILE IS GLLEAR
Sbjct: 1006 HLKSTLLPVSSDLLKLSLRFLEQGSEKEKLAGAKLMASLMASEDVILENISEGLLEARSV 1065
Query: 406 XXXXXXXXXXXELRQLCSKLLACIS 430
++R++C+KLLACI+
Sbjct: 1066 LSKASLSDPSRDVREVCAKLLACIT 1090
>F4J3F2_ARATH (tr|F4J3F2) Armadillo/beta-catenin-like repeat-containing protein
OS=Arabidopsis thaliana GN=AT3G57570 PE=2 SV=1
Length = 1096
Score = 416 bits (1068), Expect = e-113, Method: Compositional matrix adjust.
Identities = 236/445 (53%), Positives = 311/445 (69%), Gaps = 26/445 (5%)
Query: 1 MFTDPSNATVVKFLSYISEDLANVADLVLHHVMLHVREQKKIHESFLSRWESRTYTSDEF 60
MF +PSNA +V+FLS ISE LA+ +DLVL HV+ H+++Q K+ SF+SR S T +S +
Sbjct: 661 MFLEPSNAIMVRFLSCISESLADTSDLVLPHVLSHMKKQNKVDASFISR--SDTKSSVDK 718
Query: 61 EEMQRSLFERLCPLLIIKMLPMKTFNDLNSSIMYGKLGQNVIHDSDSGDTETSYK----- 115
+ ++SLF+ LCPLLI+++LP + F+D++SS +YGK SGD+ Y+
Sbjct: 719 TKSEKSLFDHLCPLLILRLLPQRVFDDIDSSTIYGKFL--------SGDSVNDYQDIKFE 770
Query: 116 ---CIANLLLNRAFCEFEFEDVRKLSAELCGRIHPQVLIPFVCSKLEVAMDSKDIVKIKA 172
CIA +L RAF +FEFE+VRKLSAELCGR+HPQVL P V +LE A + +D +KIKA
Sbjct: 771 DCQCIATFILERAFSKFEFEEVRKLSAELCGRLHPQVLFPTVLLQLEKATEIQDSLKIKA 830
Query: 173 CLFSICTSLVVRGWESLSHPSMLAIRRMIETVLLWPCLNADSVSKAQHGCIDCLALMICA 232
CLFSICTSL+VRGWESLSH IR+++E +LLWP + D +SK QHGCIDCLALMICA
Sbjct: 831 CLFSICTSLMVRGWESLSHRVTPKIRKVLENILLWPSVE-DEISKVQHGCIDCLALMICA 889
Query: 233 ELQAKESIKDSTPDRI--TVAEKKGNSVVTYVIHQFVNDKEQ-ASIPELGDGV----AAV 285
ELQ +S K S ++I T + G SV+ Y IH + D+ +SIP+L + +
Sbjct: 890 ELQHLKSSKTSGGEKIRSTGKDTSGYSVLDYTIHCLIEDRSNCSSIPKLSTDILTCENPL 949
Query: 286 PLSFRLCMGNVLISASQKISKSCKKRFAAQVLPFLLHSLEFEMKSEIRAACIQVLFSAVY 345
P+ FRLCM NV+ISA QK +S KK FA + LP L+HSL+ E+RAACIQVLFSA Y
Sbjct: 950 PIPFRLCMANVIISACQKNPESSKKTFARKALPPLIHSLKVISVPEVRAACIQVLFSATY 1009
Query: 346 HLGYAVLPYASELLKISLKALRKESEKERMAGAKLIASLMASEDVILEKISGGLLEARXX 405
HL +LP +S+LLK+SL+ L + SEKE++AGAKL+ASLMASEDVILE IS GLLEAR
Sbjct: 1010 HLKSTLLPVSSDLLKLSLRFLEQGSEKEKLAGAKLMASLMASEDVILENISEGLLEARSV 1069
Query: 406 XXXXXXXXXXXELRQLCSKLLACIS 430
++R++C+KLLACI+
Sbjct: 1070 LSKASLSDPSRDVREVCAKLLACIT 1094
>A2Q3M3_MEDTR (tr|A2Q3M3) Zinc finger, C2H2-type OS=Medicago truncatula
GN=MtrDRAFT_AC155885g20v2 PE=4 SV=1
Length = 340
Score = 410 bits (1055), Expect = e-112, Method: Compositional matrix adjust.
Identities = 208/275 (75%), Positives = 233/275 (84%), Gaps = 7/275 (2%)
Query: 105 SDSGDTETSYKCIANLLLNRAFCEFEFEDVRKLSAELCGRIHPQVLIPFVCSKLEVAMDS 164
S S E Y+CIA LLNRA CEFEFEDVRKLSAELCGRIHPQVL P +CSKL+ A+D
Sbjct: 35 SISRSAELDYECIAAFLLNRALCEFEFEDVRKLSAELCGRIHPQVLFPVICSKLDRAVDL 94
Query: 165 KDIVKIKACLFSICTSLVVRGWESLSHPSMLAIRRMIETVLLWPCLNADSVSKAQHGCID 224
K++ +IKACLFSICTSLVVRGWESLSHP + +I+RMIETVLLWPCLNADSVSK QHGCID
Sbjct: 95 KNVPEIKACLFSICTSLVVRGWESLSHPLVHSIKRMIETVLLWPCLNADSVSKVQHGCID 154
Query: 225 CLALMICAELQAKESIKDSTPDRITVAEKK--GNSVVTYVIHQFVNDKEQ-ASIPELG-- 279
CLALMI ELQA+ESI D PDR+ V KK GNS++TYV++QF NDKE+ +S PELG
Sbjct: 155 CLALMISVELQAEESITDYMPDRVLVIGKKAAGNSIITYVMNQFFNDKEELSSTPELGED 214
Query: 280 --DGVAAVPLSFRLCMGNVLISASQKISKSCKKRFAAQVLPFLLHSLEFEMKSEIRAACI 337
+ VAAVPL FRLCMGNVLIS QKIS+SCKK FAAQVLPFLLHSL+FE +SEIRAACI
Sbjct: 215 KCESVAAVPLYFRLCMGNVLISTCQKISESCKKLFAAQVLPFLLHSLKFEKRSEIRAACI 274
Query: 338 QVLFSAVYHLGYAVLPYASELLKISLKALRKESEK 372
QVLFSAVYHL AVLPYAS+LLKISLK+LRK+SEK
Sbjct: 275 QVLFSAVYHLRSAVLPYASDLLKISLKSLRKKSEK 309
>M4CSZ8_BRARP (tr|M4CSZ8) Uncharacterized protein OS=Brassica rapa subsp.
pekinensis GN=Bra007340 PE=4 SV=1
Length = 1088
Score = 407 bits (1045), Expect = e-111, Method: Compositional matrix adjust.
Identities = 224/436 (51%), Positives = 311/436 (71%), Gaps = 15/436 (3%)
Query: 1 MFTDPSNATVVKFLSYISEDLANVADLVLHHVMLHVREQKKIHESFLSRWESRTYTSDEF 60
MF +PSNA +V+FLS ISE LA+ +D+VL HV+ H++EQ K+ E+F++ +S ++
Sbjct: 660 MFLEPSNAIMVRFLSCISEHLADASDMVLLHVLSHMKEQDKMDENFINMSKSSVDKTN-- 717
Query: 61 EEMQRSLFERLCPLLIIKMLPMKTFNDLNSSIMYGK-LGQNVIHDSDSGDTETSYKCIAN 119
+++SLF+ LCPLLI+++LP + F+D++SS +YG+ L ++ +++ E +CIA
Sbjct: 718 --VEKSLFDHLCPLLILRLLPQRVFDDIDSSTIYGRFLREDSVNEYRDIKFEDC-QCIAA 774
Query: 120 LLLNRAFCEFEFEDVRKLSAELCGRIHPQVLIPFVCSKLEVAMDSKDIVKIKACLFSICT 179
L RAF +FEFE+VRKL+AELCGRIHPQVL P V +LE A + +D +KI+ACLFSICT
Sbjct: 775 FLFERAFSKFEFEEVRKLAAELCGRIHPQVLFPTVLLQLEKATELQDSLKIRACLFSICT 834
Query: 180 SLVVRGWESLSHPSMLAIRRMIETVLLWPCLNADSVSKAQHGCIDCLALMICAELQAKES 239
SLVVRGWES SH +R+++E +LLWP + D +SK QHGCIDCLALMICAELQ ES
Sbjct: 835 SLVVRGWESFSHSVTPKVRKVLEKILLWPS-DEDEISKVQHGCIDCLALMICAELQHPES 893
Query: 240 IKDSTPDRITVAEKKGNSVVTYVIHQFVNDKEQ-ASIPELGDGVA----AVPLSFRLCMG 294
K +++ + +SV+ + IH + D+ +S+PE + P+ FRLCM
Sbjct: 894 SKTMKGEKLRAS---SSSVLDFTIHCLIEDRSDCSSMPEPNTEHSIREKPFPIPFRLCMA 950
Query: 295 NVLISASQKISKSCKKRFAAQVLPFLLHSLEFEMKSEIRAACIQVLFSAVYHLGYAVLPY 354
NVLISA QKI +S KK FA + LP L+HSL+F E+RAACIQVLFSA+YHL +LP+
Sbjct: 951 NVLISACQKIPQSAKKTFARKALPPLVHSLKFISAPEVRAACIQVLFSAMYHLKSTLLPF 1010
Query: 355 ASELLKISLKALRKESEKERMAGAKLIASLMASEDVILEKISGGLLEARXXXXXXXXXXX 414
AS+LLK++L+ L + SEKE++AGAKL+ASLMASEDV+LE+ISGGL+EAR
Sbjct: 1011 ASDLLKLALRFLEQGSEKEKLAGAKLMASLMASEDVVLERISGGLIEARSVLSKASLSDP 1070
Query: 415 XXELRQLCSKLLACIS 430
++R++C KLLACI+
Sbjct: 1071 SQDVREVCDKLLACIT 1086
>K4BHB1_SOLLC (tr|K4BHB1) Uncharacterized protein OS=Solanum lycopersicum
GN=Solyc03g063700.1 PE=4 SV=1
Length = 320
Score = 303 bits (776), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 159/309 (51%), Positives = 218/309 (70%), Gaps = 10/309 (3%)
Query: 103 HDSDSGDTETSYKCIANLLLNRAFCEFEFEDVRKLSAELCGRIHPQVLIPFVCSKLEVAM 162
+D+ G + S + + LL+ RA +FEFEDVR+L+AELCGRIHP+VLIP + +L+ A
Sbjct: 13 YDAPEGQIDLSNR-LCPLLVVRALSKFEFEDVRRLAAELCGRIHPKVLIPIMSYQLKNAT 71
Query: 163 DSKDIVKIKACLFSICTSLVVRGWESLSHPSMLAIRRMIETVLLWPCLNADSVSKAQHGC 222
KD++KIKACLFSIC SL+V G ++ +HP M I + IET+LLWP ++ D +SKAQHGC
Sbjct: 72 CVKDLLKIKACLFSICISLLVNGTDAYAHPDMFWIHQAIETILLWPSVDGDDISKAQHGC 131
Query: 223 IDCLALMICAELQAKESIKDSTPDRITVAEK--------KGNSVVTYVIHQFVNDKEQAS 274
IDCL L++C ELQA +++K+S + + +SV +YVIH V D E S
Sbjct: 132 IDCLPLILCTELQATKAVKNSISIEVCFEQSIVSSGDSLTKDSVCSYVIHHLVCD-EDIS 190
Query: 275 IPELGDGVAAVPLSFRLCMGNVLISASQKISKSCKKRFAAQVLPFLLHSLEFEMKSEIRA 334
+ + V SFRL M NVLI+A QK+ + KK F +++LP +LH +E SE+R+
Sbjct: 191 VMLGRNEVVKAHQSFRLRMANVLITACQKVPSASKKPFVSKILPRVLHCVEEIANSEVRS 250
Query: 335 ACIQVLFSAVYHLGYAVLPYASELLKISLKALRKESEKERMAGAKLIASLMASEDVILEK 394
ACIQV FS +YHL VLPY+S+LLK+S+K+ R+ SEKER+AGAKL+ASLMASE+ +L+K
Sbjct: 251 ACIQVFFSMLYHLKSLVLPYSSDLLKVSIKSPREGSEKERIAGAKLLASLMASEEAVLQK 310
Query: 395 ISGGLLEAR 403
ISGGL+EAR
Sbjct: 311 ISGGLVEAR 319
>Q9SVX2_ARATH (tr|Q9SVX2) Putative uncharacterized protein F15B8.240
OS=Arabidopsis thaliana GN=F15B8.240 PE=4 SV=1
Length = 305
Score = 287 bits (734), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 160/299 (53%), Positives = 204/299 (68%), Gaps = 18/299 (6%)
Query: 149 VLIPFVCSKLEVAMDSKDIVKIKACLFSICTSLVVRGWESLSHPSMLAIRRMIETVLLWP 208
VL P V +LE A + +D +KIKACLFSICTSL+VRGWESLSH IR+++E +LLWP
Sbjct: 6 VLFPTVLLQLEKATEIQDSLKIKACLFSICTSLMVRGWESLSHRVTPKIRKVLENILLWP 65
Query: 209 CLNADSVSKAQHGCIDCLALMICAELQAKESIKDSTPDRI--TVAEKKGNSVVTYVIHQF 266
+ D +SK QHGCIDCLALMICAELQ +S K S ++I T + G SV+ Y IH
Sbjct: 66 SVE-DEISKVQHGCIDCLALMICAELQHLKSSKTSGGEKIRSTGKDTSGYSVLDYTIHCL 124
Query: 267 VNDKEQ-ASIPELGDGVAA----VPLSFRLCMGNVLISASQKISKSCKKRFAAQVLPFLL 321
+ D+ +SIP+L + +P+ FRLCM NV+ISA QK +S KK FA + LP L+
Sbjct: 125 IEDRSNCSSIPKLSTDILTCENPLPIPFRLCMANVIISACQKNPESSKKTFARKALPPLI 184
Query: 322 HSLEFEMK----------SEIRAACIQVLFSAVYHLGYAVLPYASELLKISLKALRKESE 371
HSL+F K E+RAACIQVLFSA YHL +LP +S+LLK+SL+ L + SE
Sbjct: 185 HSLKFSTKFLNFVQVISVPEVRAACIQVLFSATYHLKSTLLPVSSDLLKLSLRFLEQGSE 244
Query: 372 KERMAGAKLIASLMASEDVILEKISGGLLEARXXXXXXXXXXXXXELRQLCSKLLACIS 430
KE++AGAKL+ASLMASEDVILE IS GLLEAR ++R++C+KLLACI+
Sbjct: 245 KEKLAGAKLMASLMASEDVILENISEGLLEARSVLSKASLSDPSRDVREVCAKLLACIT 303
>M1AVC6_SOLTU (tr|M1AVC6) Uncharacterized protein OS=Solanum tuberosum
GN=PGSC0003DMG402011958 PE=4 SV=1
Length = 286
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 147/284 (51%), Positives = 199/284 (70%), Gaps = 11/284 (3%)
Query: 157 KLEVAMDSKDIVKIKACLFSICTSLVVRGWESLSHPSMLAIRRMIETVLLWPCLNADSVS 216
+L+ A +KD++KIKACLFSICTSL+V G ++ +HP M IR+ IET+LLWP ++ D +S
Sbjct: 4 QLKNATSAKDLLKIKACLFSICTSLLVNGTDAYAHPDMFWIRKAIETILLWPSVDGDDIS 63
Query: 217 KAQHGCIDCLALMICAELQAKESIKDSTPDRITVAEK---------KGNSVVTYVIHQFV 267
KAQHGCIDCLALM+C ELQA +++K+S + + KG SV +YVIH V
Sbjct: 64 KAQHGCIDCLALMLCTELQATKAVKNSISIEVCFEQSIVSSGDSLTKG-SVCSYVIHHLV 122
Query: 268 NDKEQASIPELGDGVAAVPLSFRLCMGNVLISASQKISKSCKKRFAAQVLPFLLHSLEFE 327
E S+ + V SFRLCM NVLISA QK+ + KK F +++LP +LHS+E
Sbjct: 123 CG-EDISVMLGRNEVVKAHHSFRLCMANVLISACQKVPCASKKPFVSKILPRVLHSVEEI 181
Query: 328 MKSEIRAACIQVLFSAVYHLGYAVLPYASELLKISLKALRKESEKERMAGAKLIASLMAS 387
SE+R+ACIQV FS VYHL VLPY+S+LLK+S+K+LR+ SEKER+AGAKL+ASLMAS
Sbjct: 182 ANSEVRSACIQVFFSMVYHLKSLVLPYSSDLLKVSIKSLREGSEKERIAGAKLLASLMAS 241
Query: 388 EDVILEKISGGLLEARXXXXXXXXXXXXXELRQLCSKLLACISS 431
E+ +L+KISGGL+EAR ++R++C +LL C++S
Sbjct: 242 EEAVLQKISGGLVEARTLLQDICSSDLPLDVRKMCQRLLVCLTS 285
>R0FLZ2_9BRAS (tr|R0FLZ2) Uncharacterized protein OS=Capsella rubella
GN=CARUB_v10016664mg PE=4 SV=1
Length = 911
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 138/254 (54%), Positives = 185/254 (72%), Gaps = 5/254 (1%)
Query: 1 MFTDPSNATVVKFLSYISEDLANVADLVLHHVMLHVREQKKIHESFLSRWESRTYTSDEF 60
MF +PSNA +V+FLS ISE LA+++++VL HV+ H++EQ K++E F+ R S T S +
Sbjct: 658 MFLEPSNAIMVRFLSCISEYLADMSNIVLLHVLSHIKEQNKVNEIFICR--SDTKISIDK 715
Query: 61 EEMQRSLFERLCPLLIIKMLPMKTFNDLNSSIMYGKLGQNV-IHDSDSGDTETSYKCIAN 119
+ ++SLF+RLCPLLI+++LP + F+D++SS +YGK V ++D E +CIA
Sbjct: 716 TKSEKSLFDRLCPLLILRLLPQRVFDDIDSSTIYGKFLSGVSVNDYQDIKFEDC-QCIAA 774
Query: 120 LLLNRAFCEFEFEDVRKLSAELCGRIHPQVLIPFVCSKLEVAMDSKDIVKIKACLFSICT 179
+L RAF +FEFE+V+KLSAELCGRIHPQVL P V +LE A + +D +KIKACLFSICT
Sbjct: 775 FILERAFSKFEFEEVQKLSAELCGRIHPQVLFPTVLLQLEKAREFQDNLKIKACLFSICT 834
Query: 180 SLVVRGWESLSHPSMLAIRRMIETVLLWPCLNADSVSKAQHGCIDCLALMICAELQAKES 239
SLVVRGWES SH IR+++E +LLWP + D +SK QHGCIDCLALMICAELQ +S
Sbjct: 835 SLVVRGWESFSHSVTPKIRKVLEIILLWPSVE-DEISKVQHGCIDCLALMICAELQHLKS 893
Query: 240 IKDSTPDRITVAEK 253
K S ++I K
Sbjct: 894 SKTSGEEKIRTRGK 907
>A9SRE8_PHYPA (tr|A9SRE8) Predicted protein OS=Physcomitrella patens subsp. patens
GN=PHYPADRAFT_166246 PE=4 SV=1
Length = 1252
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 155/468 (33%), Positives = 253/468 (54%), Gaps = 63/468 (13%)
Query: 1 MFTDPSNATVVKFLSYISEDLANVADLVLHHVMLHVREQKKIHESFLSRWESRTYTSDEF 60
M+ + SN + +FLS+IS LA++ LV + + +Q ++ E LS + + D
Sbjct: 807 MYKEASNPVIPRFLSHISGQLADLPHLVFPQIHQRMHDQPRMSEEVLS---NISKNEDAL 863
Query: 61 EEMQRSLFERLCPLLIIKMLPMKTFNDLNSSIMYGKLGQNVIHDSDSGDTETSYK--CIA 118
E+ LF+RL PLL++++LP+K FN +S +YG + + ++ SG+++ + I
Sbjct: 864 AEL---LFQRLSPLLVLRILPLKAFNHSSSKDLYGGM----LDEAISGESQDQGRISTIT 916
Query: 119 NLLLNRAFCEFEFEDVRKLSAELCGRIHPQVLIPFVCSKLEVAMDSKDIVKIKACLFSIC 178
LLL R +E +DVRK++AEL GR+ P ++ V + LE A + + V KACLFS+C
Sbjct: 917 GLLLERMCNVWELDDVRKIAAELTGRLLPNAMLSAVSAALEEAAHNVNTVTAKACLFSLC 976
Query: 179 TSLVVRGWESLSHPSMLAIRRMIETVLLWPCLNADSVSKAQHGCIDCLALMICAELQAKE 238
TSL++RG E+L HP ++ V+ CL + S+ H CL MICAE+
Sbjct: 977 TSLMIRGKETLEHP-------LLAQVI---CLFNNGRSEWNH----CLTAMICAEV-GTL 1021
Query: 239 SIKDSTPDRITVAEKKGNSVVTYV----IHQFVNDKEQ----------------ASIP-- 276
+ P + +++K ++ + N K++ S P
Sbjct: 1022 GLVSQLPSKPVQSDQKMKKLIEEIPEGEWRSHANGKQESQKRWEILCSVVQCVLGSQPFP 1081
Query: 277 -----------ELGDG-VAAVPLSFRLCMGNVLISASQKISKSCKKRFAAQVLPFLLHSL 324
E G G V P +R+CM NVLIS++QKIS S + +A V+P +++ +
Sbjct: 1082 PISATLGLSEVEKGRGDVQLAPEVYRVCMANVLISSAQKISASARPAYAQAVIPPIMNFI 1141
Query: 325 EFEMKSEIRAACIQVLFSAVYHL-GYAVLPYASELLKISLKALRKE-SEKERMAGAKLIA 382
+ S++R AC QVLF++VYHL G A++P+A++LL +S+ + S +ER AG +L+A
Sbjct: 1142 QTSSASKLRGACFQVLFTSVYHLKGLAIVPFAADLLNLSISTIGGRFSIEERTAGTRLLA 1201
Query: 383 SLMASEDVILEKISGGLLEARXXXXXXXXXXXXXELRQLCSKLLACIS 430
SL+ASED +LEKI+ + +A+ +LR LC +LL C++
Sbjct: 1202 SLLASEDSVLEKIAPYIEDAQRAVATVANMDSSPQLRALCEQLLGCMT 1249
>A5BCL5_VITVI (tr|A5BCL5) Putative uncharacterized protein OS=Vitis vinifera
GN=VITISV_013819 PE=4 SV=1
Length = 831
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 88/149 (59%), Positives = 120/149 (80%), Gaps = 2/149 (1%)
Query: 4 DPSNATVVKFLSYISEDLANVADLVLHHVMLHVREQKKIHESFLSRWESRTYTSDEFEEM 63
+PSNAT+V+FLSYISE LA AD+V H ++LH++ QK++ ESF ++WES+TY +D+ ++
Sbjct: 654 EPSNATLVRFLSYISEHLAEAADIVFHRILLHMKGQKELDESFFTKWESKTYAADDSMKL 713
Query: 64 QRSLFERLCPLLIIKMLPMKTFNDLNSSIMYGKL-GQNVIHDSDSGDTETSYKCIANLLL 122
Q SLF+RLCPLL+I++LPM+ FNDLNSS++YG+L Q V+H S D ++C+A LLL
Sbjct: 714 QHSLFDRLCPLLVIRLLPMRVFNDLNSSVIYGQLPDQVVVHGYGSIDI-NDHECVAMLLL 772
Query: 123 NRAFCEFEFEDVRKLSAELCGRIHPQVLI 151
NRA +FEFEDVRKL+AELCGRIHPQ +
Sbjct: 773 NRALGKFEFEDVRKLAAELCGRIHPQCYV 801
>D8RWI0_SELML (tr|D8RWI0) Putative uncharacterized protein OS=Selaginella
moellendorffii GN=SELMODRAFT_442836 PE=4 SV=1
Length = 933
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 135/322 (41%), Positives = 190/322 (59%), Gaps = 19/322 (5%)
Query: 125 AFCEFEFEDVRKLSAELCGRIHPQ--VLIPFVCSKLEVAMDSKDIVKIKACLFSICTSLV 182
C EF+DVRKLSAEL GR+ P+ VL+P V L +AM++ DIV +K+CL+SIC+SL+
Sbjct: 616 TMCTSEFDDVRKLSAELSGRLLPEASVLLPLVIDILRIAMNNMDIVAVKSCLYSICSSLM 675
Query: 183 VRGWESLSHPSMLAIRRMIETVLLWPCLNADSVSKAQHGCIDCLALMICAELQAKESIKD 242
RG ++LS+ M I+ ++ T+LLWP N++ V KAQHGCID A MI AE I+
Sbjct: 676 TRGCQNLSNDVMGQIKALLTTMLLWPT-NSEEVQKAQHGCIDSFAFMIRAEFNT-SMIQM 733
Query: 243 STPDRITVAEKKGNSVVT------YVIHQFVNDKEQ---ASIPELGDGVAAVPLSFRLCM 293
D E+ N V++ Y I +S+ A P+SFR+CM
Sbjct: 734 IDKDEGNKVERVINGVMSCLQNRPYAIATLCGTSPADFCSSVSIQDATCVAAPVSFRVCM 793
Query: 294 GNVLISASQKISKSCKKRFAAQVLPFLLHSLEFEMKSEIRAACIQVLFSAVYHLGYAV-L 352
NVLIS+S KI+ S F + V+P + L+ E +S +RAAC+QVLFSAV +L A L
Sbjct: 794 ANVLISSSSKITASSHDAFLS-VIPVVSALLKRESESIVRAACVQVLFSAVCNLKEAASL 852
Query: 353 P-YASELLKISLKALR--KESEKERMAGAKLIASLMASEDVILEKISGGLLEARXXXXXX 409
P YA LL ISL +L K +++ER+AG KL+A+L+ E IL++ L+ A
Sbjct: 853 PAYAKTLLDISLDSLNNDKAADEERLAGVKLLAALLVHES-ILKEGGSSLVNALQVLSRV 911
Query: 410 XXXXXXXELRQLCSKLLACISS 431
E+R+LC +LL+C+SS
Sbjct: 912 SRADPSPEVRKLCEQLLSCVSS 933
>D8QZC6_SELML (tr|D8QZC6) Putative uncharacterized protein (Fragment)
OS=Selaginella moellendorffii GN=SELMODRAFT_82324 PE=4
SV=1
Length = 309
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 122/311 (39%), Positives = 178/311 (57%), Gaps = 30/311 (9%)
Query: 149 VLIPFVCSKLEVAMDSKDIVKIKACLFSICTSLVVRGWESLSHPSMLAIRRMIETVLLWP 208
VL+P V +L AM++ DI+ +K+CL+SIC+SL+ RG ++LS+ M I+ ++ T+LLWP
Sbjct: 1 VLLPLVIDQLRNAMNNMDILAVKSCLYSICSSLMTRGCQNLSNDVMGQIKALLTTMLLWP 60
Query: 209 CLNADSVSKAQHGCIDCLALMICAELQAK--ESIKDSTPDRITVAEK------KGNSV-- 258
N++ V KAQHGCID A MI AE + I T D + +GN V
Sbjct: 61 -TNSEEVQKAQHGCIDSFAFMIRAEFNTSMIQMIDKGTKDGRSSFHSNYRFADEGNKVER 119
Query: 259 VTYVIHQFVNDKEQASIPELGDGVA--------------AVPLSFRLCMGNVLISASQKI 304
V V+ + ++ A G A A P+SFR+CM NVLIS+S KI
Sbjct: 120 VINVVMSCLQNRPYAIATLCGTSPADFCGSVSIQDASCVAAPVSFRVCMANVLISSSSKI 179
Query: 305 SKSCKKRFAAQVLPFLLHSLEFEMKSEIRAACIQVLFSAVYHLGYAV-LP-YASELLKIS 362
+ S F ++V+P + L+ E +S +RAAC+QVLFSAV +L A LP YA LL IS
Sbjct: 180 TASSHDAFLSEVIPVVSALLKSESESIVRAACVQVLFSAVCNLKEAASLPAYAKTLLHIS 239
Query: 363 LKALR--KESEKERMAGAKLIASLMASEDVILEKISGGLLEARXXXXXXXXXXXXXELRQ 420
L +L K +++ER+AG KL+A+L+ E IL+++ L+ A E+R+
Sbjct: 240 LDSLNNDKAADEERLAGVKLLAALLVHES-ILKEVGSSLVNALQVLSRVSRADPLPEVRK 298
Query: 421 LCSKLLACISS 431
LC +LL+C+SS
Sbjct: 299 LCEQLLSCVSS 309
>A2Q3M2_MEDTR (tr|A2Q3M2) Putative uncharacterized protein OS=Medicago truncatula
GN=MtrDRAFT_AC155885g22v2 PE=4 SV=1
Length = 172
Score = 150 bits (379), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 80/103 (77%), Positives = 91/103 (88%)
Query: 1 MFTDPSNATVVKFLSYISEDLANVADLVLHHVMLHVREQKKIHESFLSRWESRTYTSDEF 60
MFTDPSN +VKF SYISEDL NV DLVLHHV+LHVREQK+I ESFL+RWE R+Y+SDE+
Sbjct: 40 MFTDPSNPVIVKFFSYISEDLTNVVDLVLHHVLLHVREQKEIDESFLARWECRSYSSDEY 99
Query: 61 EEMQRSLFERLCPLLIIKMLPMKTFNDLNSSIMYGKLGQNVIH 103
EEMQR+LFE LCPLLIIKMLPMKTF++LNSS+MYG L QN H
Sbjct: 100 EEMQRTLFEHLCPLLIIKMLPMKTFDNLNSSVMYGHLSQNKTH 142
>A5BCL6_VITVI (tr|A5BCL6) Putative uncharacterized protein OS=Vitis vinifera
GN=VITISV_013820 PE=2 SV=1
Length = 140
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 83/138 (60%), Positives = 101/138 (73%)
Query: 293 MGNVLISASQKISKSCKKRFAAQVLPFLLHSLEFEMKSEIRAACIQVLFSAVYHLGYAVL 352
M NVLISA QKIS S KK FA ++LP+L+H ++ SE R AC+QVLFSAVYHL +L
Sbjct: 1 MANVLISACQKISDSGKKAFARRILPYLIHXVQVIKDSEXRVACVQVLFSAVYHLKSMIL 60
Query: 353 PYASELLKISLKALRKESEKERMAGAKLIASLMASEDVILEKISGGLLEARXXXXXXXXX 412
PY+SELLK+SLK+L SEKERMAG KL+ASLMASED I+E IS GLLEAR
Sbjct: 61 PYSSELLKLSLKSLEGNSEKERMAGVKLMASLMASEDAIVENISEGLLEARLVLLSMYMA 120
Query: 413 XXXXELRQLCSKLLACIS 430
E++Q+C KLLAC++
Sbjct: 121 DPSLEVQQMCQKLLACLT 138
>B9HA58_POPTR (tr|B9HA58) Predicted protein OS=Populus trichocarpa
GN=POPTRDRAFT_652755 PE=4 SV=1
Length = 182
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 79/157 (50%), Positives = 100/157 (63%), Gaps = 14/157 (8%)
Query: 229 MICAELQAKESIKDSTPDRITVAEK--------KGNSVVTYVIHQFVNDKE---QASIPE 277
MICA+LQ S K+S+ + + A K GN V+ YVI+ +ND+ AS+
Sbjct: 1 MICAKLQVPASFKESSKN-LGAARKTSYCGNAASGNCVLLYVINLLINDENALVSASMSG 59
Query: 278 LGDGVAAVPL--SFRLCMGNVLISASQKISKSCKKRFAAQVLPFLLHSLEFEMKSEIRAA 335
+ P SFR+CM NVLISA QKIS S KKRFA + +P LL ++E M +IRAA
Sbjct: 60 SENSAFEAPTTHSFRVCMANVLISACQKISDSGKKRFAKKTVPHLLQAVEGIMHPDIRAA 119
Query: 336 CIQVLFSAVYHLGYAVLPYASELLKISLKALRKESEK 372
CIQVLFSAVYHL AVLPY+S+LL +SLK L + SEK
Sbjct: 120 CIQVLFSAVYHLKSAVLPYSSDLLNLSLKFLSRGSEK 156
>A4RVW9_OSTLU (tr|A4RVW9) Predicted protein OS=Ostreococcus lucimarinus (strain
CCE9901) GN=OSTLU_31127 PE=4 SV=1
Length = 930
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 98/379 (25%), Positives = 171/379 (45%), Gaps = 55/379 (14%)
Query: 102 IHDSDSGDTETSYKCIANLLLNRAFCEFEFEDVRKLSAELCGRIHPQVLIPFVCSKL--- 158
+ D S + + KC+ +LN EFEDVR++ +E+ GR+ +L + L
Sbjct: 558 VWDESSENKDAIRKCMRRRMLN---IRGEFEDVRRVCSEMFGRLPQNILDEELFEDLRRL 614
Query: 159 -EVAMD-SKDIVKIKACLFSICTSLVVRGWESLSHPSMLAIRRMIETVLLWPCLNADSV- 215
E A S D+ ++++C+FS ++L +RG +LS S +R + +L W D V
Sbjct: 615 RETARSVSDDLARVRSCMFSCNSALALRGESALSERSRREVRSLSMDILTWVDNAQDDVE 674
Query: 216 -SKAQHGCIDCLALMICAELQAKE-------------SIKDSTP------------DRIT 249
SKA G ++ LA +I AE++A++ ++ D+ P R
Sbjct: 675 LSKAHMGAMETLASLIVAEIEARKESPHQSAKKKTSSALLDTKPPLLALNESKASSGRAL 734
Query: 250 VAEKKGNSVVTY------------VIHQFVNDKEQASIPELGDGVAAVPLSFRLCMGNVL 297
+ E G+ VT V+H + KE +P+ D LS RL M NV+
Sbjct: 735 IVELDGDEDVTRERVDCARETLRGVLHLACHTKE--GVPKWVDDALTKDLSLRLAMMNVI 792
Query: 298 ISASQKISKSCKKRFAAQVLPFLLHSLEFEMKSEIRAACIQVLFSAVYHLGYAVL--PYA 355
I+A+++ S K + P L+ +E + E+RAA +Q L ++H + +A
Sbjct: 793 IAAARRPSIE-KASLMDECFPPLIACVEHRGEPEVRAAALQALM-MIFHGAERDVDETHA 850
Query: 356 SELLKISLKALRKESEKE--RMAGAKLIASLMASEDVILEKISGGLLEARXXXXXXXXXX 413
L++ LR + + RM K+ +L+A+++ +L+ I L R
Sbjct: 851 VTLVRTVTNILRDHAAGDGPRMGATKVATALLAADEDVLKAIEPHLEALRQGLETAARIA 910
Query: 414 XXXELRQLCSKLLACISSP 432
E+ L +KL C+++P
Sbjct: 911 VDPEVAALATKLATCMTTP 929
>L8GVB8_ACACA (tr|L8GVB8) Uncharacterized protein OS=Acanthamoeba castellanii
str. Neff GN=ACA1_223230 PE=4 SV=1
Length = 338
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 86/283 (30%), Positives = 140/283 (49%), Gaps = 26/283 (9%)
Query: 129 FEFEDVRKLSAELCGRIHPQVLIPFVCSKLEVAMDSKDIVKIKACLFSICTSLVVRGWES 188
+EFEDVRKL+AE+ + P++ +P + L +D +V IK +++IC +L+V G
Sbjct: 5 YEFEDVRKLAAEVVAFLPPKLTLPAISQSLLRHIDEAALVHIKVDIYAICNALLVHG--K 62
Query: 189 LSHPSMLAIRRMIETVLLWPCLNADS----VSKAQHGCIDCLALM-ICAELQAKESIKDS 243
+HP ++ + I ++L P ++ + V K QH CID LAL+ + A A D
Sbjct: 63 AAHPFVIDLVPAILSLLKLPIPSSSTGSLEVQKLQHACIDFLALVTLTARQPALREGADL 122
Query: 244 TPDRI----TVAEKKGNSVVTYVIHQFVNDKEQASIPELGDGV-AAVPLSFRLCMGNVLI 298
P + T +G ++ N + + E G V + SFR+C NVLI
Sbjct: 123 LPLLLHFVCTGRASRGLEALSSSSSSRANARLEVG-EEQGAAVEEQLSPSFRICAINVLI 181
Query: 299 SASQKISKSCKKRFAAQVLPFLLHSL-----------EFEMKSEIRAACIQVLFSAVYHL 347
+ ++ I S + A ++P LL S EF + +RAA +Q+LF+ Y L
Sbjct: 182 TLAKGIGLSVLQALGAWLIPQLLVSYFAVSGTGSTSHEFGLHF-LRAAHLQLLFTIAYQL 240
Query: 348 GYAVLPYASELLKISLKALRKES-EKERMAGAKLIASLMASED 389
ELL ++ AL+ + E RMAG KL+ ++++S D
Sbjct: 241 KGEFRGEGKELLAAAVHALQDPAPEVVRMAGMKLLGAVLSSPD 283
>M1AVC4_SOLTU (tr|M1AVC4) Uncharacterized protein OS=Solanum tuberosum
GN=PGSC0003DMG401011958 PE=4 SV=1
Length = 208
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 42/105 (40%), Positives = 63/105 (60%), Gaps = 8/105 (7%)
Query: 1 MFTDPSNATVVKFLSYISEDLANVADLVLHHVMLHVREQKKIHESFLSRWESRTYTSDEF 60
+F +PSNA +V+FLS ISE LA+ D V ++ + R QK E +++
Sbjct: 108 LFAEPSNAVIVRFLSSISEHLASATDFVFQRIISYSRRQKDPDEGVYPNYDA-------- 159
Query: 61 EEMQRSLFERLCPLLIIKMLPMKTFNDLNSSIMYGKLGQNVIHDS 105
E Q LF RLCPLL++++LP++ FNDLNSS +Y +L + H +
Sbjct: 160 PEGQIDLFNRLCPLLVVRLLPLQVFNDLNSSALYDELPTKLAHGT 204
>C1E222_MICSR (tr|C1E222) Predicted protein OS=Micromonas sp. (strain RCC299 /
NOUM17) GN=MICPUN_99558 PE=4 SV=1
Length = 1203
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 51/178 (28%), Positives = 96/178 (53%), Gaps = 30/178 (16%)
Query: 67 LFERLCPLLIIKMLPMKTFNDLNSSIMYGKLGQNVIHDSDSGDTETSYKCIANLLLNRAF 126
+FERL PLL+++ LP++ ++D + ++ + +S S + ++LL+ A
Sbjct: 715 VFERLAPLLVLRTLPLEAWDDDH---LF------SVDESPS---------VPDVLLDIAL 756
Query: 127 C-EFEFEDVRKLSAELCGRIHPQVLIPFVCSKLEVAMDSKDIVKIKACLFSICTSLVVRG 185
E ++VR+++AEL GR HP +L A+ + + +AC+FS+C+SL RG
Sbjct: 757 AGAGEHDEVRRVAAELHGRAHPDATSESRGVRLAEAVVDGALGRARACMFSLCSSLAFRG 816
Query: 186 WES-LSHPSM---LAIR----RMIETVLLWPCLNADSVSKAQHGCIDCLALMICAELQ 235
++ SH S A R R++E++ + ++D +K + G + LA ++ AEL+
Sbjct: 817 IDACPSHSSTASPAATREVCARILESLGM---DDSDEATKTRMGATETLASLVRAELE 871
>Q01B42_OSTTA (tr|Q01B42) WGS project CAID00000000 data, contig chromosome 04
OS=Ostreococcus tauri GN=Ot04g02070 PE=4 SV=1
Length = 753
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 44/142 (30%), Positives = 73/142 (51%), Gaps = 10/142 (7%)
Query: 102 IHDSDSGDTETSYKCIANLLLNRAFCEFEFEDVRKLSAELCGRIHPQVLIPFVCSKLEVA 161
I D +S + KC+ +LN + EFEDVR++ +E+ ++ Q L + S+LE A
Sbjct: 377 IWDRESRERTDIQKCLRYRMLN---VQGEFEDVRRVCSEMFAKVPQQTLDKELLSELERA 433
Query: 162 -----MDSKDIVKIKACLFSICTSLVVRGWESLSHPSMLAIRRMIETVLLWPC--LNADS 214
DS D+ +++ C+F L VRG +LS + +R + +L W + D
Sbjct: 434 YRSARTDSDDLARVRTCMFCCSAGLAVRGKSALSESFVNVLRSVCVNILAWSACDTDGDD 493
Query: 215 VSKAQHGCIDCLALMICAELQA 236
KAQ G ++ LA +I AE+ +
Sbjct: 494 FFKAQMGAMETLASIIVAEVDS 515
>D2VH40_NAEGR (tr|D2VH40) Predicted protein OS=Naegleria gruberi
GN=NAEGRDRAFT_68267 PE=4 SV=1
Length = 1020
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 63/313 (20%), Positives = 132/313 (42%), Gaps = 57/313 (18%)
Query: 64 QRSLFERLCPLLIIKMLPMKTFNDLNSSIMYGKLGQNVIHDSDSGDTETSYKCIANLLLN 123
+ ++F+ L PLL++K LP+ F ++ + + K+ V+ L++
Sbjct: 689 EETIFDMLAPLLVLKTLPVSIFYYIHPNAEHDKIFTEVL-----------------LIMI 731
Query: 124 RAFCEF-EFEDVRKLSAELCGRIHPQVLIPFVCSKLEVAMDSKDIVKIKACLFSICTSLV 182
R + + E+VR++SAE+ ++ P++ I ++ + + K K+ LF IC ++
Sbjct: 732 RNVIDLSKPEEVRRVSAEVASKLPPRITISSFVARFSNYLQKNSLTKAKSVLFCICQTIA 791
Query: 183 VRGWESLSHPSMLAIRRMIETVL-LWPCLNADS-VSKAQHGCIDCLALMICAELQAKESI 240
+ + + + +I+++ W + D +SK GC+DC+ ++I L S
Sbjct: 792 LYEEDPYLYSFLTGESLLIQSIKNAWNLQSNDQELSKIHQGCMDCMVMIIKYMLNYSHS- 850
Query: 241 KDSTPDRITVAEKKGNSVVTYVIHQFVNDKEQASIPELGDGVAAVPLSFRLCMGNVLISA 300
GN + ++ N+ E + R+ + N++
Sbjct: 851 -------------DGNFISDNILKPLSNNSESMMV--------------RVSISNIITKC 883
Query: 301 SQKISKSCKKRFAAQVLPFLLHSLEFEMKSEIRA-------ACIQVLFSAVYHLGYAVLP 353
+ ++ V+P ++S +K E A + +Q L + V+ + AV P
Sbjct: 884 TNVFPLKHLLSLSSMVIP--IYSKVLLLKREPNAETSLFYCSSLQALMNLVFKIKTAVHP 941
Query: 354 YASELLKISLKAL 366
Y+ LL+I K L
Sbjct: 942 YSKSLLEIISKCL 954