Miyakogusa Predicted Gene
- Lj5g3v0589900.3
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v0589900.3 tr|G7I5R4|G7I5R4_MEDTR Myosin-like protein
OS=Medicago truncatula GN=MTR_1g009610 PE=4
SV=1,65.7,0,coiled-coil,NULL; seg,NULL; SUBFAMILY NOT NAMED,NULL;
FAMILY NOT NAMED,NULL; Spectrin repeat,NULL,CUFF.53325.3
(2747 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT4G31570.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Prefoldin ... 1304 0.0
AT1G24460.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 92 4e-18
AT1G24460.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 92 5e-18
>AT4G31570.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Prefoldin
(InterPro:IPR009053); BEST Arabidopsis thaliana protein
match is: unknown protein (TAIR:AT1G24460.1); Has 194354
Blast hits to 66887 proteins in 3244 species: Archae -
3688; Bacteria - 38556; Metazoa - 84828; Fungi - 17265;
Plants - 10589; Viruses - 805; Other Eukaryotes - 38623
(source: NCBI BLink). | chr4:15297061-15306036 FORWARD
LENGTH=2730
Length = 2730
Score = 1304 bits (3375), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 915/2434 (37%), Positives = 1362/2434 (55%), Gaps = 181/2434 (7%)
Query: 403 SVNLLKLAEIIRGLNEEEYQFLLEARGAVSDADPLPSSSVLTVLEFAEAFQRLKEDFFLA 462
SV+ L+L +I++GL ++EYQ L AR A S +P SS +RL+E+ F++
Sbjct: 389 SVSFLQLMDIVKGLGQDEYQILCNAREAASSTEPGTSS-----------LERLREELFVS 437
Query: 463 NLMENIFNIQLVEQLELQVESDNQRFQLIGELSQLRASHNEVNEKNQQLTDELANCHFEL 522
+ ME+I ++QL EQ LQ+E D+Q Q + E+SQLRAS++ V E+N L +EL+ C +L
Sbjct: 438 STMEDILHVQLTEQSHLQIEFDHQHNQFVAEISQLRASYSAVTERNDSLAEELSECQSKL 497
Query: 523 RDISSKRVEVQNHFSAAMAEVDALSARLVELQVNFEMSQKDSSD---------------- 566
+S ++N A A+V+ +A++ ELQ++ E S D S+
Sbjct: 498 YAATSSNTNLENQLLATEAQVEDFTAKMNELQLSLEKSLLDLSETKEKFINLQVENDTLV 557
Query: 567 --------------------------LSTELMDCRGLISSLQAEKKGTNETLELVTAEKN 600
LS+EL +C+ L + L+AE + T+ +T EK
Sbjct: 558 AVISSMNDEKKELIEEKESKNYEIKHLSSELCNCKNLAAILKAEVEQFENTIGPLTDEKI 617
Query: 601 KLMEEKEFHMCESTKLAAEVADIKSSTEAAKVENSNLIDMISLLTEESHKTKAEVEHXXX 660
L+EEK + E+ KL E+A+ K+ +VENSN+ + +SLLT + +T E +
Sbjct: 618 HLVEEKYSLLGEAEKLQEELANCKTVVTLQEVENSNMKETLSLLTRQ--QTMFEENNIHL 675
Query: 661 XXXXXXXXXXXNKDLVAS--LQAEXXXXXXXXXXXXDKIKNLEDENHSVFIENQGLSSQI 718
+ L++ L +E +K+ + E + EN L+ ++
Sbjct: 676 REENEKAHLELSAHLISETYLLSEYSNLKEGYTLLNNKLLKFQGEKEHLVEENDKLTQEL 735
Query: 719 VVLQEQLSIEKGERVRFEGDLQEVTIHLVQLSNENVFLNSTLEEHRAKLAEIGKKHSQLL 778
+ LQE +S + ER E +L+E L +L+ EN L S++ +A++ + G
Sbjct: 736 LTLQEHMSTVEEERTHLEVELREAIARLDKLAEENTSLTSSIMVEKARMVDNGS------ 789
Query: 779 SQPADLGNQAQITRGHSEGLEIAVAEDSMHVDQEPDEGAPSELEVFNDSHGFVSLKTCLD 838
+ + L NQ +I+ EI V++ S + E+ + F +L L+
Sbjct: 790 ADVSGLINQ-EISEKLGRSSEIGVSKQSASFLENTQYTNLEEVREYTSE--FSALMKNLE 846
Query: 839 EGENLLVKLEKAINELHSRSVFASRSDGKVSSPAVSKLIQAFESKVLEDEHEAEERDXXX 898
+GE ++ LE+AI ++ + S + SD ++PAVSKLIQAFESK +E E+E
Sbjct: 847 KGEKMVQNLEEAIKQILTDSSVSKSSDKG-ATPAVSKLIQAFESKRKPEEPESENAQLTD 905
Query: 899 XXXXXXXLIMFTMEQIENLRKLLSKWELEVQSAAALFKGERDGRKIGDAKYSDLEDQFKG 958
+ + QI NLR LL + L + A F D R + + +L +F
Sbjct: 906 DLSEADQFVSVNV-QIRNLRGLLDQLLLNARKAGIQFNQLNDDRTSTNQRLEELNVEFAS 964
Query: 959 LTQHCSDLETSNIELSVQYEIVKQLLGEIQENKCQLEELYEALKQENTCRKGKDTELHQK 1018
H + LE IE V +E +K E+Q LE L ++LK N ++TEL++K
Sbjct: 965 HQDHINVLEADTIESKVSFEALKHYSYELQHKNHDLELLCDSLKLRNDNISVENTELNKK 1024
Query: 1019 LGYCLSKVSELHTEMYGVRQSSNEMAIIIGSQLENLQKEVAERGMQLEQGWNASIAEIVE 1078
L YC ++ EL ++ ++Q+ + QL LQ E +ER M +E + ++E E
Sbjct: 1025 LNYCSLRIDELEIQLENLQQNLTSFLSTMEEQLVALQDE-SERAMMVEHELTSLMSEFGE 1083
Query: 1079 LVGKLNESVGETLDSTISSDTHDSLDISHQLEVSVKAATELIFDLRKKLEATYSEHEMMC 1138
V +L++ L + +S H LD++ ++ SV A +I DL++KLEA Y +HE
Sbjct: 1084 AVVRLDD----CLLRSGTSGAHTGLDMTKRISGSVDVAVNVIEDLKEKLEAAYVKHESTS 1139
Query: 1139 TSYKEMNMKCDHLLGRNEMSIGVLHKMYSDTRKLVLKSGGSTDENVIDEQCEALPDLLNY 1198
Y+E+ + L +NE + + K+Y+D KL+ +S GS + ++ + A+ D
Sbjct: 1140 NKYEELKQSFNTLFEKNEFTASSMQKVYADLTKLITESCGSAEMTSLEVENVAVFDPFRD 1199
Query: 1199 DSYQPIMKHLVDLLNEKLKLESGTKEMKSELMHKETELEELKMKCRGLDSIGKLIEDVAR 1258
S++ +++ + +L+E+L+L+S +++S+L K ++EE+ + S+ +L+E V
Sbjct: 1200 GSFENLLEAVRKILSERLELQSVIDKLQSDLSSKSNDMEEMTQRSLDSTSLRELVEKVEG 1259
Query: 1259 VLNVENPNVEINKXXXXXXXXXXXXXVQKTKEAEIQYHTTK---EGYGSXXXXXXXXXXX 1315
+L +E+ I + VQK E E + + E G+
Sbjct: 1260 LLELESGV--IFESPSSQVEFLVSQLVQKFIEIEELANLLRKQLEAKGNELME------- 1310
Query: 1316 XYYLDTLRLENENEILVLRESLDQADEALTAARSELHEKANELEHSEQRLSSIREKLSIA 1375
++ L ++ +I LRESL QA+E+L A RSEL +K+NELE SEQRL S REKLSIA
Sbjct: 1311 ---IEESLLHHKTKIAGLRESLTQAEESLVAVRSELQDKSNELEQSEQRLLSTREKLSIA 1367
Query: 1376 VAKGKGLVVQRDGLKQSLSETSSELERCLQELNLKDTRLHEVETKLKTYAEAGERAEALE 1435
V KGKGL+VQRD +KQSL+E S++L++C +ELN KD RL EVE KLKTY EAGER EALE
Sbjct: 1368 VTKGKGLIVQRDNVKQSLAEASAKLQKCSEELNSKDARLVEVEKKLKTYIEAGERVEALE 1427
Query: 1436 SELSYIRNSANALRESFLLKDSMLQRIEEVLEDLDLPEQFHSSDIIEKIDWLVRSVAGNS 1495
SELSYIRNSA ALRESFLLKDS+L RIEE+LEDLDLPE FH+ DI+EK++WL RS GNS
Sbjct: 1428 SELSYIRNSATALRESFLLKDSLLHRIEEILEDLDLPEHFHARDILEKVEWLARSANGNS 1487
Query: 1496 LPMNDWERKDSVGGGSNSDAGYVVTDSWKDDIQLQPESQDDFRKNPEEMQSKYYELAEQN 1555
+ W++K S GG AG+V+++ W++D+Q S+DD R EE++ K+Y LAEQN
Sbjct: 1488 SRPSGWDQKSSDGG-----AGFVLSEPWREDVQTGTSSEDDLRIKFEELKGKFYGLAEQN 1542
Query: 1556 EMLEQSLMERNSLVQRWEELVDRIDMPSHLRSMEMEDRIEWVGRALAEANHHVDSLQLKI 1615
EMLEQSLMERN+LVQRWE+L++ ID+P L SME+E++IEW+ + EA H D+LQ KI
Sbjct: 1543 EMLEQSLMERNTLVQRWEKLLENIDIPPQLHSMEVENKIEWLASTITEATHDRDNLQQKI 1602
Query: 1616 EKYDSYCGLVNADLEESQRRVSALQADLRALTSEREFLSEKLEALTYEYEKISVQARGAE 1675
+ + YC V DLE SQ++V ++ +L++ SER LSE+LE+L ++E +S ARG
Sbjct: 1603 DNLEVYCQSVTTDLEVSQKQVGDVEGNLQSCVSERVNLSERLESLIGDHESLS--ARGIH 1660
Query: 1676 LEI--ERLNNEVTSLKDNLEQKAEIEEQNFTIDGRISKLRDLVVEALSKSETE--YLVAD 1731
LE+ E+L N+V L + L +K EE TI+G + LR ++ + + + + L ++
Sbjct: 1661 LEVENEKLQNQVKDLHEKLVEKLGNEEHFQTIEGDLLSLRYMIDDVIQEDGLQDLALASN 1720
Query: 1732 RENIDSLEELLQKLIESHASL-------------SSTKPT-----CGVVFDEH--NSQND 1771
EN+D +L+KLI+ + +L T+P+ G H S
Sbjct: 1721 SENLDG---VLRKLIDYYKNLVKSSLPGETDDNVCETRPSDADVRSGESLGAHGATSHGQ 1777
Query: 1772 HI-----NLHSERSIDMHDKEGADIDRYKADLEEALSELVHLKEERDRNLEKQMSLSGEV 1826
H N+ S D+ E D+ DL++AL +EERD + KQ SL E
Sbjct: 1778 HFELSDSNVVEATSRDIAVVETPDVASLTKDLDQALHVQKLTREERDLYMAKQQSLVAEN 1837
Query: 1827 ETLNKRTXXXXXXXXXXXXKSASVREKLNVAVRKGKSLVQQRDSLKQTIEGMSVEIERLK 1886
E L+K+ KSASVREKLNVAVRKGK+LVQQRDSLKQTIE ++ E+ RLK
Sbjct: 1838 EALDKKIIELQEFLKQEEQKSASVREKLNVAVRKGKALVQQRDSLKQTIEEVNAELGRLK 1897
Query: 1887 SEINNRESTLADHEQKLRQLSTYPDRVEALESESLLLKNXXXXXXXXXXXXXYSLKLILN 1946
SEI R+ L ++E+K R+L +Y RVE+LESE LLK +L + LN
Sbjct: 1898 SEIIKRDEKLLENEKKFRELESYSVRVESLESECQLLKIHSQETEYLLQERSGNLSMTLN 1957
Query: 1947 KLGEIEVGGEGHISDPVKKLEWLGKLCSDLHDAVTSLEQESRKSKRASELLLAELNEVQE 2006
L I++G EG I+DPV KL+ + +L + VTS EQESRKS+RA+ELLLAELNEVQE
Sbjct: 1958 ALNSIDIGDEGDINDPVMKLQRISQLFQTMSTTVTSAEQESRKSRRAAELLLAELNEVQE 2017
Query: 2007 RNDSFQDELAKVAAELVDLKKEKDSAEAAKLEALSHLEKLSTVHEEVKISQSSKIMELKS 2066
NDS Q++L+K E+ L +EKD+AEAAK+EA+S E LS V E K ++++ +
Sbjct: 2018 TNDSLQEDLSKFTYEIQQLSREKDAAEAAKVEAISRFENLSAVSNEEKNKLYAQLLSCGT 2077
Query: 2067 SMNQVCKSLGEVHNLLTNVLLMDLDSFRNLKAGLESCLK--GNKTANMVN-SSVNREQAG 2123
S+N + K L ++ L ++ +MD++ +LKA +E C K G + + S+ N
Sbjct: 2078 SVNSLRKILAGTNSCLADIFIMDMEFLHHLKANMELCAKKTGTDLSGLPQLSTENLVDKE 2137
Query: 2124 IVCRXXXXXXXXXXXXWPDFSSIDHHDYNNVVEIFHLFGNQLKEFLEVVGYLKERIDMHS 2183
I R W + + + N+ EI L +F+ V +L+E++ H
Sbjct: 2138 IFAR--------LSAAWSNINLHETSSGGNIAEICGSLSQNLDQFVVGVSHLEEKVSKH- 2188
Query: 2184 SSALEQDKSLFKLMADIQREITSQRESCETMKKEV-SERDRELVALRGNILYLYEACINS 2242
+A +I S +T K + + D E+ AL I L+ AC +
Sbjct: 2189 -------------LATWHDQINIVSNSIDTFFKSIGTGTDSEVAALGERIALLHGACSSV 2235
Query: 2243 VSVLENGKADLVGKMFDSSNLGINLKAPFSDEISEELIKTMADRLLLSAKGFASMKTEFL 2302
+ +E KA+LVG + + ++L D S E +++M +RL + K E L
Sbjct: 2236 LVEIERRKAELVG----NDDFNMSLHQVDEDFSSMESVRSMVNRLSSAVKELVVANAETL 2291
Query: 2303 DANQKEMKATITTLQRELQEKDVQRDSICSELVKQIKDAENAANSCSQDLQSFRTQESNL 2362
+ N+KEMK I LQREL EKD+Q + C+ELV Q+K+A+ A ++DLQS + ++
Sbjct: 2292 ERNEKEMKVIIANLQRELHEKDIQNNRTCNELVGQVKEAQAGAKIFAEDLQSASARMRDM 2351
Query: 2363 KKQVEVIESERKILEQRVTELQDRQGAAAELEQKVRSQTGMLAAKDQEIEALMHALDEEE 2422
+ Q+ ++ ER +++RV EL Q + +EL++KV S + +LAAKD EIEALM ALDEEE
Sbjct: 2352 QDQLGILVRERDSMKERVKELLAGQASHSELQEKVTSLSDLLAAKDLEIEALMQALDEEE 2411
Query: 2423 MQMXXXXXXXXXXXXXXXXXNLEIENLDSSRSKVMKKLSITVSKFDELHDLSASLLSEVE 2482
QM NL+++ ++SR K+ KKLSITV KFDELH LS +LL+E+E
Sbjct: 2412 SQMEDLKLRVTELEQEVQQKNLDLQKAEASRGKISKKLSITVDKFDELHHLSENLLAEIE 2471
Query: 2483 KLQSQLQEKDSEISFLRQEVTRCTNDVLIASQTSNQRSLDEIFAFLMWVDTIVSQDGMDE 2542
KLQ Q+Q++D+E+SFLRQEVTRCTN+ L ASQ +R +EI L W DTI S G+++
Sbjct: 2472 KLQQQVQDRDTEVSFLRQEVTRCTNEALAASQMGTKRDSEEIQTVLSWFDTIASLLGIED 2531
Query: 2543 IHPEVKSSSHLNEYKEVLHKKLMAILSELENLREVAESKDSMLQVEKIKVEELNRKTETL 2602
+ SH+N Y E K++ ++LSE++ LR V +SKD +L+ E+ +V EL +K TL
Sbjct: 2532 -SLSTDADSHINHYMETFEKRIASMLSEIDELRLVGQSKDVLLEGERSRVAELRQKEATL 2590
Query: 2603 EKSLHEKALQLNLLEGVEETGKRVGSSSEILEVDEPVVNEWTASGAFVTPQVRSLRKGNN 2662
EK L EK Q ++ S+SEI+EV EP++N+WT + + QVRSLRKGN
Sbjct: 2591 EKFLLEKESQQDI---------STSSTSEIVEV-EPLINKWTKTS--IPSQVRSLRKGNM 2638
Query: 2663 DHVAIAVDEDPGSTNRIEDEEDDKVHGFKS---LSSSKIVPRFTRPL----TDLVDG--L 2713
D VAI++D D + +E+DDK H + L S P T + T + G
Sbjct: 2639 DQVAISIDADQTDQSGSLEEDDDKDHSLRQESFLDSQD--PSLTWSMVYGQTLFIHGSRS 2696
Query: 2714 WVSCDRTLMRQPVLRLGIIMYWAIMHALLAFFVV 2747
VSCDRTLMRQP LRLGI++YWAI+HALLA FVV
Sbjct: 2697 VVSCDRTLMRQPALRLGIMLYWAILHALLAAFVV 2730
>AT1G24460.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 22 plant
structures; EXPRESSED DURING: 14 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT4G31570.1). | chr1:8666072-8672338 FORWARD
LENGTH=1732
Length = 1732
Score = 92.4 bits (228), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 88/299 (29%), Positives = 149/299 (49%), Gaps = 24/299 (8%)
Query: 2446 IENLDSSRSKVMKKLSITVSKFDEL-HDLSASLLSEVEK-LQSQLQEKDSEISFLRQEVT 2503
+ LD +KKL V E+ H + +LS +K L S L EKD EI L++
Sbjct: 1435 VNGLDPQSPYDVKKLFAIVDSVTEMQHQID--ILSYGQKELNSTLAEKDLEIQGLKKATE 1492
Query: 2504 RCTNDVLIASQTSNQRS-----LDEIFAFLMWVDTIVSQDGMDEIHPEVKSSSHLNEYKE 2558
+ L + N+ S L+++ L + +V P S L +
Sbjct: 1493 AESTTELELVKAKNELSKLISGLEKLLGILASNNPVV--------DPNFSESWTL---VQ 1541
Query: 2559 VLHKKLMAILSELENLREVAESKDSMLQVEKIKVEELNRKTETLEKSLHEKALQLNLLE- 2617
L KK+ ++L E E+ + A+ L + V++L+ + + E+ L KA+Q ++++
Sbjct: 1542 ALEKKITSLLLESESSKSRAQELGLKLAGSEKLVDKLSLRVKEFEEKLQTKAIQPDIVQE 1601
Query: 2618 -GVEETGKRVGSSSEILEV-DEPVVNEWTASGAFVTPQVRSLRKGNNDHVAIAVDEDPGS 2675
+ ET R S+SEI E+ D+ + + S QVR++RKG+ DH++I +D +
Sbjct: 1602 RSIFET-PRAPSTSEISEIEDKGALGIKSISPVPTAAQVRTVRKGSTDHLSINIDSESEH 1660
Query: 2676 TNRIEDEEDDKVHGFKSLSSSKIVPRFTRPLTDLVDGLWVSCDRTLMRQPVLRLGIIMY 2734
+ ++DK H FKSL+ S ++P + + D VDG+WVS R LM +P RLG+++Y
Sbjct: 1661 LMNNNETDEDKGHVFKSLNMSGLIPTQGKIIADRVDGIWVSGGRVLMSRPQARLGVMVY 1719
Score = 70.5 bits (171), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/132 (34%), Positives = 76/132 (57%)
Query: 1357 ELEHSEQRLSSIREKLSIAVAKGKGLVVQRDGLKQSLSETSSELERCLQELNLKDTRLHE 1416
+LE SE++ + +R+KLS+A+ KGKGLV R+ K L E SE+E+ + EL +
Sbjct: 711 DLERSEEKSALLRDKLSMAIKKGKGLVQDREKFKTQLDEKKSEIEKLMLELQQLGGTVDG 770
Query: 1417 VETKLKTYAEAGERAEALESELSYIRNSANALRESFLLKDSMLQRIEEVLEDLDLPEQFH 1476
+ ++ + ER + LE+EL + + L++S L D++LQ++ + +E + LP
Sbjct: 771 YKNQIDMLSRDLERTKELETELVATKEERDQLQQSLSLIDTLLQKVMKSVEIIALPVDLA 830
Query: 1477 SSDIIEKIDWLV 1488
S D EKID L
Sbjct: 831 SEDPSEKIDRLA 842
Score = 66.6 bits (161), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 66/225 (29%), Positives = 103/225 (45%), Gaps = 62/225 (27%)
Query: 1324 LENENEILVLRESLDQADEALTAARSELHEKANELEHSEQRLSSIREKLSIAVAKGKGLV 1383
LE+EN V E +++ E + R+E + ELE + + ++ +EKLS+AV KGK LV
Sbjct: 279 LEDENRNFV--EQVNREKEMCESMRTEFEKLKAELELEKTKCTNTKEKLSMAVTKGKALV 336
Query: 1384 VQRDGLK-----------------------------------QSLSETSSELERCLQELN 1408
RD LK QSL+E + ELE+C ELN
Sbjct: 337 QNRDALKHQLSEKTTELANRLTELQEKEIALESSEVMKGQLEQSLTEKTDELEKCYAELN 396
Query: 1409 LKDTRLHEVE-TKLKTYAEAGERAEALESELSYIRNSANALRESFL-------------- 1453
+ L E TK + E+ + LE L+ ++ + AL +S L
Sbjct: 397 DRSVSLEAYELTKKELEQSLAEKTKELEECLTKLQEMSTALDQSELDKGELAKSDAMVAS 456
Query: 1454 ------LKDSMLQRIEEVLEDLDLPEQFHSSDIIEKIDWLVRSVA 1492
+++S+++ IE +L ++ PE+ HS DI+EK VRS+A
Sbjct: 457 YQEMLSVRNSIIENIETILSNIYTPEEGHSFDIVEK----VRSLA 497
>AT1G24460.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 23 plant
structures; EXPRESSED DURING: 14 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT4G31570.1); Has 181008 Blast hits to 85359
proteins in 3551 species: Archae - 3290; Bacteria -
48304; Metazoa - 70793; Fungi - 13943; Plants - 10118;
Viruses - 785; Other Eukaryotes - 33775 (source: NCBI
BLink). | chr1:8666072-8672338 FORWARD LENGTH=1807
Length = 1807
Score = 92.4 bits (228), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 59/180 (32%), Positives = 104/180 (57%), Gaps = 4/180 (2%)
Query: 2558 EVLHKKLMAILSELENLREVAESKDSMLQVEKIKVEELNRKTETLEKSLHEKALQLNLLE 2617
+ L KK+ ++L E E+ + A+ L + V++L+ + + E+ L KA+Q ++++
Sbjct: 1616 QALEKKITSLLLESESSKSRAQELGLKLAGSEKLVDKLSLRVKEFEEKLQTKAIQPDIVQ 1675
Query: 2618 --GVEETGKRVGSSSEILEV-DEPVVNEWTASGAFVTPQVRSLRKGNNDHVAIAVDEDPG 2674
+ ET R S+SEI E+ D+ + + S QVR++RKG+ DH++I +D +
Sbjct: 1676 ERSIFET-PRAPSTSEISEIEDKGALGIKSISPVPTAAQVRTVRKGSTDHLSINIDSESE 1734
Query: 2675 STNRIEDEEDDKVHGFKSLSSSKIVPRFTRPLTDLVDGLWVSCDRTLMRQPVLRLGIIMY 2734
+ ++DK H FKSL+ S ++P + + D VDG+WVS R LM +P RLG+++Y
Sbjct: 1735 HLMNNNETDEDKGHVFKSLNMSGLIPTQGKIIADRVDGIWVSGGRVLMSRPQARLGVMVY 1794
Score = 70.5 bits (171), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/132 (34%), Positives = 76/132 (57%)
Query: 1357 ELEHSEQRLSSIREKLSIAVAKGKGLVVQRDGLKQSLSETSSELERCLQELNLKDTRLHE 1416
+LE SE++ + +R+KLS+A+ KGKGLV R+ K L E SE+E+ + EL +
Sbjct: 711 DLERSEEKSALLRDKLSMAIKKGKGLVQDREKFKTQLDEKKSEIEKLMLELQQLGGTVDG 770
Query: 1417 VETKLKTYAEAGERAEALESELSYIRNSANALRESFLLKDSMLQRIEEVLEDLDLPEQFH 1476
+ ++ + ER + LE+EL + + L++S L D++LQ++ + +E + LP
Sbjct: 771 YKNQIDMLSRDLERTKELETELVATKEERDQLQQSLSLIDTLLQKVMKSVEIIALPVDLA 830
Query: 1477 SSDIIEKIDWLV 1488
S D EKID L
Sbjct: 831 SEDPSEKIDRLA 842
Score = 66.2 bits (160), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 66/225 (29%), Positives = 103/225 (45%), Gaps = 62/225 (27%)
Query: 1324 LENENEILVLRESLDQADEALTAARSELHEKANELEHSEQRLSSIREKLSIAVAKGKGLV 1383
LE+EN V E +++ E + R+E + ELE + + ++ +EKLS+AV KGK LV
Sbjct: 279 LEDENRNFV--EQVNREKEMCESMRTEFEKLKAELELEKTKCTNTKEKLSMAVTKGKALV 336
Query: 1384 VQRDGLK-----------------------------------QSLSETSSELERCLQELN 1408
RD LK QSL+E + ELE+C ELN
Sbjct: 337 QNRDALKHQLSEKTTELANRLTELQEKEIALESSEVMKGQLEQSLTEKTDELEKCYAELN 396
Query: 1409 LKDTRLHEVE-TKLKTYAEAGERAEALESELSYIRNSANALRESFL-------------- 1453
+ L E TK + E+ + LE L+ ++ + AL +S L
Sbjct: 397 DRSVSLEAYELTKKELEQSLAEKTKELEECLTKLQEMSTALDQSELDKGELAKSDAMVAS 456
Query: 1454 ------LKDSMLQRIEEVLEDLDLPEQFHSSDIIEKIDWLVRSVA 1492
+++S+++ IE +L ++ PE+ HS DI+EK VRS+A
Sbjct: 457 YQEMLSVRNSIIENIETILSNIYTPEEGHSFDIVEK----VRSLA 497