Miyakogusa Predicted Gene
- Lj0g3v0285959.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0285959.1 tr|E2FKJ1|E2FKJ1_MEDTR Sieve element occlusion by
forisomes 2 OS=Medicago truncatula GN=SEO-F2 PE=2
,63.76,0,seg,NULL,CUFF.19131.1
(675 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                      Score     E
Sequences producing significant alignments:                           (bits)  Value
AT3G01680.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Mediator c...      202   8e-52
AT3G01670.1 | Symbols: | unknown protein; BEST Arabidopsis thal...      137   2e-32
AT1G67790.1 | Symbols: | unknown protein; BEST Arabidopsis thal...       58   2e-08
>AT3G01680.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Mediator
complex subunit Med28 (InterPro:IPR021640); BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT3G01670.1); Has 122 Blast hits to 112 proteins
in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 122; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr3:252033-255246 FORWARD
LENGTH=740
Length = 740
Score = 202 bits (513), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 175/647 (27%), Positives = 310/647 (47%), Gaps = 83/647 (12%)
Query: 90 ALKRISCQMITTRGTAQCAHQKTIWILQQLRSFSWDAKALIALAAFTLEYGEFWLLYRIP 149
A+ R++C++ T +H+ T+ + + L SF WD K ++ LAAF L YGEFWLL +
Sbjct: 109 AIDRVACEIAYKSLTGSDSHEITMSVFEHLSSFQWDGKLVLTLAAFALNYGEFWLLVQFY 168
Query: 150 TSDPLGNSLKLL------NQVQIRKVPTDLTDLVSFLVQVFQEIKKWASWSAFGYDLEEV 203
+ + L SL +L N+V + V L DL+ + V + + + Y +V
Sbjct: 169 SKNQLAKSLAMLKLVPVQNRVTLESVSQGLNDLIREMKSVTACVVELSELPD-RYITPDV 227
Query: 204 HSLSDAIQEIPLVVYWTVASIVACSGNLVGVSKYNLSEFKTRL-----SIMVDKLK---- 254
LS + IP+ VYWT+ S++AC + ++ T++ S++ +KLK
Sbjct: 228 PQLSRILSTIPIAVYWTIRSVIACISQINMITAMGHEMMNTQMDLWETSMLANKLKNIHD 287
Query: 255 ---EHLQKCQVQIDRIDHYRSRMNASKNIKDV--VDFLKLLI-LNDDGSHIPQLYEDNII 308
E L+ C I++ S + ++ D +D +K+L L HI L +
Sbjct: 288 HLAETLRLCYRHIEKQRSSES-LKVLHSLFDTTHIDNMKILTALVHPKPHITPLQDGLTK 346
Query: 309 IKKGLEVFKQKYVLLFISSLDSIGDEIMLLNSVYNRLQENPKEAKKGFRKEDFKILWIPI 368
K L+V ++K VLL IS L+ + DE+ + +Y + N G ++++W+P+
Sbjct: 347 RKVHLDVLRRKTVLLLISDLNILQDELSIFEQIYTESRRNLV-GVDGKSHMPYEVVWVPV 405
Query: 369 VDIWDE-----VLKTQFKTLKESMKWHVLEY--FFELPGLRIIREKLNYFNGKPIVAVIN 421
VD ++ +L+ +F+ L++ M W+ ++ E + +R + ++ N KPI+ VI+
Sbjct: 406 VDPIEDFERSPILQKKFEDLRDPMPWYSVDSPKLIERHVVEFMRGRWHFMN-KPILVVID 464
Query: 422 PQGVIMNDNALDIIFQWGFDAFPFRKSDGDDLIKKWSWFWNLMKKA-DLNIEDF-GSDSY 479
PQG + NAL +I+ WG +AFPF +S ++L ++ ++ NL+ D I ++ D+Y
Sbjct: 465 PQGNEASLNALHMIWIWGTEAFPFTRSREEELWRRETFSLNLIVDGIDSVIFNWIKPDNY 524
Query: 480 IFIYGGNDPKWIRDFTTXXXXXXXXXXXXNVDVTIEHYQLGKNN---------------- 523
IF+YGG+D WIR FT + +V +E +GK N
Sbjct: 525 IFLYGGDDLDWIRRFT-----MAAKATAKDSNVNLEMAYVGKRNHSHREQIRRISEVIRS 579
Query: 524 ---------PTKVPYFWMGVDGKKVSQ----KCQDPVDCEIQEAVKSLLCLKQDPTGWVL 570
P + +FW ++ S+ K D D + + +K +L + GW L
Sbjct: 580 ENLSHSWAEPALMWFFWTRLESMLYSKIQLGKADDHDD--VMQGIKKILSYDK-LGGWAL 636
Query: 571 LSKGYHVMLLGHGEPVYQTVADFE-LWKHKVLEKEGFDVAFKEYYNGKVKELYSRNQCAV 629
LSKG ++++ HG + +T++ ++ WK V K G+ A ++++ +V + C
Sbjct: 637 LSKGPEIVMIAHG-AIERTMSVYDRTWKTHVPTK-GYTKAMSDHHHDEVLRETGK-PCGH 693
Query: 630 INVDNHAASNLL-ATITCPNPPCGRVMEVTSVNYRCCH----HDDPN 671
+ A S + + C C R ME +++ CCH H+D N
Sbjct: 694 FDFHITARSGRIPEKMNCFE--CQRPME-KYMSFSCCHDEKLHEDEN 737
>AT3G01670.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G01680.1); Has 121 Blast hits to 111 proteins
in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 121; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr3:247288-250261 FORWARD
LENGTH=822
Length = 822
Score = 137 bits (346), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 151/646 (23%), Positives = 265/646 (41%), Gaps = 121/646 (18%)
Query: 95 SCQMITTRGTAQCAHQKTIWILQQLRSFSWDAKALIALAAFTLEYGEFWLLYRIPTSDPL 154
S M+T+ + T +L + + WDAK ++ L+A ++YG F LL ++ L
Sbjct: 221 SHGMMTSGLHLDSRNTTTFSVLSLVSKYRWDAKLVLVLSALAVKYGVFLLLAETHATNQL 280
Query: 155 GNSLKLLNQV---------------QIRKVPTDLTDLVSFLVQVFQEIKKWASWSAFGYD 199
SL L+ Q+ + R + D+ DL + ++ ++Q
Sbjct: 281 TKSLALIKQLPSIFSRQNALHQRLDKTRILMQDMVDLTTTIIDIYQ-------------- 326
Query: 200 LEEVHSLSDAIQEIPLVVYWTVASIVACSGNLVGVSKY------------NLSEFKTRLS 247
L H + IP VYW V ++ C ++ G S + + E RL
Sbjct: 327 LPPNHITAAFTDHIPTAVYWIVRCVLICVSHISGASGFKQDQIMSFMEVSEIHENSERLR 386
Query: 248 IMVDKLKEHLQKCQVQIDRIDH---YRSRMNASKNI--KDVVDFLKLLILNDDGSHIPQL 302
+ L E +K ++ I+ Y+ + I DVV L L+ D L
Sbjct: 387 KINAYLLEQFKKSKMTIEEGIIEEEYQELIQTFTTIIHVDVVPPLLRLLRPIDF-----L 441
Query: 303 YEDNIIIKK--GLEVFKQKYVLLFISSLDSIGDEIMLLNSVYNRLQENPKEAKKGFRKED 360
Y + K+ G+ V QK+VLL IS L++I E+ +L S+Y + +
Sbjct: 442 YHGAGVSKRRVGINVLTQKHVLLLISDLENIEKELYILESLYTEAWQ-----------QS 490
Query: 361 FKILWIPIVDIWDEVLKTQFKTLKESMKWHVLEYFFEL--PGLRIIREKLNYFNGKPIVA 418
F+ILW+P+ D W E +F+ L +M+W+VL +L +R +RE F +PI+
Sbjct: 491 FEILWVPVQDFWTEADDAKFEALHMNMRWYVLGEPRKLRRAAIRFVREWWG-FKNRPILV 549
Query: 419 VINPQGVIMNDNALDIIFQWGFDAFPFRKSDGDDLIKKWSWFWNLMKKAD--LNIEDFGS 476
++P+G +M+ NA +++ W A PF + DL + W + ++
Sbjct: 550 ALDPKGQVMSTNAFPMVWIWQPFAHPFTTARERDLWSEQEWNLEFLIDGTDPHSLNQLVD 609
Query: 477 DSYIFIYGGNDPKWIRDFTTXXXXXXXXXXXXNVDVTIEHYQLGKNNPT----------- 525
YI +YGG D +WI++FT+ ++ +E +GK NP
Sbjct: 610 GKYICLYGGEDMQWIKNFTSLWRNVAKA-----ANIQLEMVYVGKRNPKNGIQPIINTIR 664
Query: 526 ------------KVPYFWMGVDGKKVSQKC--------------QDPVDCEIQEAVKSLL 559
++ +FW V+ S++ ++ D +QE V ++L
Sbjct: 665 EENLSHTLPDLFQIWFFWTRVESMWESKQRMLKAHGIKGREGFKEEEKDLVLQEVV-AML 723
Query: 560 CLKQDPTGWVLLSKGYHVMLLGHGEPVYQTVADFELWKHKVLEKEGFDVAFKEYYNGKVK 619
+ GW L+SK +M+ G + +A+F W+ + K GF A ++ ++
Sbjct: 724 GYGGEGDGWGLVSKASDMMVRAKGNLFSRGLAEFNEWEVNIPTK-GFLTALNDHLLMRLP 782
Query: 620 ELYSRNQCAVINVDNHAASNLLATITCPNPPCGRVMEVTSVNYRCC 665
+ C + A + + C C R ME + Y+CC
Sbjct: 783 P----HHCTRFMLPE-TAGIIPNEVECTE--CRRTMEKYYL-YQCC 820
>AT1G67790.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G01680.1); Has 208 Blast hits to 125 proteins
in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 208; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr1:25417542-25420099 REVERSE
LENGTH=576
Length = 576
Score = 57.8 bits (138), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 37/143 (25%), Positives = 69/143 (48%), Gaps = 7/143 (4%)
Query: 358 KEDFKILWIPI--VDIWDEVLKTQFKTLKESMKWHVLE--YFFELPGLRIIREKLNYFNG 413
+++++I+W+PI W + K F S+ W + + L +++ +Y +
Sbjct: 269 EQNYEIIWVPIPSSQKWTDEEKEIFDFYSNSLPWISVRQPWLMSSTILNFFKQEWHYKDN 328
Query: 414 KPIVAVINPQGVIMNDNALDIIFQWGFDAFPFRKSDGDDLIKKWSWFWNLMKKADLNIED 473
+ ++ VI+ G +N NA+D++ WG A+PF S D+L K+ W NL+ I
Sbjct: 329 EAMLVVIDSNGRFVNMNAMDMVLIWGVKAYPFSVSREDELWKEHGWSINLLLDG---IHP 385
Query: 474 FGSDSYIFIYGGNDPKWIRDFTT 496
I I+G + WI +F +
Sbjct: 386 TFEGREICIFGSENLDWIDEFVS 408