Miyakogusa Predicted Gene - Lj0g3v0303369.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0303369.1 tr|E2FKI7|E2FKI7_SOYBN Sieve element occlusion p
OS=Glycine max GN=SEOp PE=2 SV=1,79.88,0,coiled-coil,NULL,CUFF.20396.1
(673 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value
AT3G01680.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Mediator c...   177   3e-44
AT3G01670.1 | Symbols: | unknown protein; BEST Arabidopsis thal...   139   5e-33
AT1G67790.1 | Symbols: | unknown protein; BEST Arabidopsis thal...    67   5e-11
>AT3G01680.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Mediator
complex subunit Med28 (InterPro:IPR021640); BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT3G01670.1); Has 122 Blast hits to 112 proteins
in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 122; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr3:252033-255246 FORWARD
LENGTH=740
Length = 740
Score = 177 bits (448), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 153/627 (24%), Positives = 284/627 (45%), Gaps = 60/627 (9%)
Query: 90 LSAKLKRIACQMICTARGDHYAHHTTMLILEQLKSYSWDAKALIVQAAFALEYGKFLFLP 149
+S + R+AC++ + +H TM + E L S+ WD K ++ AAFAL YG+F L
Sbjct: 106 VSYAIDRVACEIAYKSLTGSDSHEITMSVFEHLSSFQWDGKLVLTLAAFALNYGEFWLLV 165
Query: 150 QI-PKYPVERSLAELNGLLLIHQNTQHLIY--FSSVVKKVMQVIECITEWKRLTSAGYDI 206
Q K + +SLA L + + ++ T + + +++++ V C+ E L Y
Sbjct: 166 QFYSKNQLAKSLAMLKLVPVQNRVTLESVSQGLNDLIREMKSVTACVVELSELPDR-YIT 224
Query: 207 KDVPALSDTLHEIPVVVYWAIFTFVTCTGQLDDFTTDNKGQRHEL---------SKNFEN 257
DVP LS L IP+ VYW I + + C Q++ T HE+ + N
Sbjct: 225 PDVPQLSRILSTIPIAVYWTIRSVIACISQINMIT----AMGHEMMNTQMDLWETSMLAN 280
Query: 258 KLDIILRSFKEHLEECSKQI---GAIEDYTRRRNIVIHTGKDIVKVLKALIVSGDNRESR 314
KL I E L C + I + E ++ T D +K+L AL+ +
Sbjct: 281 KLKNIHDHLAETLRLCYRHIEKQRSSESLKVLHSLFDTTHIDNMKILTALVHPKPHITPL 340
Query: 315 QLVHNGLTGEQVRIEEFKKKHVLLFISGLENIEDETQLLRSIFEKLKDNPKEVEGYRKDD 374
Q +GLT +V ++ ++K VLL IS L ++DE + I+ + + N V+G
Sbjct: 341 Q---DGLTKRKVHLDVLRRKTVLLLISDLNILQDELSIFEQIYTESRRNLVGVDGKSHMP 397
Query: 375 FKILWIPIVDEWND-RYKKMLESHLQ--RTKIGWYVVKDFRFPTG--IKLIREVFNYKDR 429
++++W+P+VD D +L+ + R + WY V + ++ +R +++ ++
Sbjct: 398 YEVVWVPVVDPIEDFERSPILQKKFEDLRDPMPWYSVDSPKLIERHVVEFMRGRWHFMNK 457
Query: 430 AVIPLISPEGKVENIDTKNIISVWGIDGFPFRTSDHTRLTQQWNWFWAEMTK-LNPKIGD 488
++ +I P+G +++ ++I +WG + FPF S L ++ + + ++ I +
Sbjct: 458 PILVVIDPQGNEASLNALHMIWIWGTEAFPFTRSREEELWRRETFSLNLIVDGIDSVIFN 517
Query: 489 LIEEDCYLFIYGGTDSKWMQEITSAVETMKRQIETVLQLD-------------------I 529
I+ D Y+F+YGG D W++ T A + + L++ I
Sbjct: 518 WIKPDNYIFLYGGDDLDWIRRFTMAAKATAKDSNVNLEMAYVGKRNHSHREQIRRISEVI 577
Query: 530 TIEPYPLGKDDPKVVPRFWIAIDSLFASRKQKKGGDQGVQDFATREIKRLLFLKQDPKGW 589
E +P ++ FW ++S+ S+ Q D D + IK++L + GW
Sbjct: 578 RSENLSHSWAEPALMWFFWTRLESMLYSKIQLGKADD--HDDVMQGIKKILSYDK-LGGW 634
Query: 590 VILSRGSNVKLLGQGEAMYHTVKDFE-IWHGKLHQDV-SFDVAFKEYYEGIKAKKIGQKC 647
+LS+G + ++ G A+ T+ ++ W K H + A +++ ++ G+ C
Sbjct: 635 ALLSKGPEIVMIAHG-AIERTMSVYDRTW--KTHVPTKGYTKAMSDHHHDEVLRETGKPC 691
Query: 648 EHSE--IADYPTDILARINCPNMDCGR 672
H + I I ++NC +C R
Sbjct: 692 GHFDFHITARSGRIPEKMNC--FECQR 716
>AT3G01670.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G01680.1); Has 121 Blast hits to 111 proteins
in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 121; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr3:247288-250261 FORWARD
LENGTH=822
Length = 822
Score = 139 bits (351), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 158/667 (23%), Positives = 285/667 (42%), Gaps = 112/667 (16%)
Query: 23 FEFNDEQIL-ESVYRTHFHCVDKFDVGSLYCVASKVINHSIEITDTMIAKAGQLSDQFRE 81
F +D++++ + V +TH + FDV SL V + + + D+ K + + +
Sbjct: 134 FSLSDDRVMADRVLKTHSPDMIFFDVTSLLSVVNDIFKSHVPSIDSSAPKPSLVFKDYAD 193
Query: 82 ETSFTSQQLSAKLKRIACQMICTARGDHYAH-------------HTTMLILEQLKSYSWD 128
TSF + + + +I+C++ C +H TT +L + Y WD
Sbjct: 194 HTSF--ETFADLIDQISCEIDCKCLHGGESHGMMTSGLHLDSRNTTTFSVLSLVSKYRWD 251
Query: 129 AKALIVQAAFALEYGKFLFLPQI-PKYPVERSLAELNGLLLIHQNTQHLIYFSSVVKKVM 187
AK ++V +A A++YG FL L + + +SLA + L I L + +M
Sbjct: 252 AKLVLVLSALAVKYGVFLLLAETHATNQLTKSLALIKQLPSIFSRQNALHQRLDKTRILM 311
Query: 188 QVIECITEWKRLTSAGYDIKDVP------ALSDTLHEIPVVVYWAIFTFVTCTGQL---D 238
Q + LT+ DI +P A +D IP VYW + + C +
Sbjct: 312 Q------DMVDLTTTIIDIYQLPPNHITAAFTD---HIPTAVYWIVRCVLICVSHISGAS 362
Query: 239 DFTTDNKGQRHELSKNFENKLDIILRSFKEHLEECSKQ----------IGAIEDYTRRRN 288
F D E+S+ EN LR +L E K+ ++ +
Sbjct: 363 GFKQDQIMSFMEVSEIHENSER--LRKINAYLLEQFKKSKMTIEEGIIEEEYQELIQTFT 420
Query: 289 IVIHTGKDIVKVLKALIVSGDNRESRQLVHN-GLTGEQVRIEEFKKKHVLLFISGLENIE 347
+IH D+V L L+ R L H G++ +V I +KHVLL IS LENIE
Sbjct: 421 TIIHV--DVVPPLLRLL-----RPIDFLYHGAGVSKRRVGINVLTQKHVLLLISDLENIE 473
Query: 348 DETQLLRSIFEKLKDNPKEVEGYRKDDFKILWIPIVDEWNDRYKKMLESHLQRTKIGWYV 407
E +L S++ E +++ F+ILW+P+ D W + E+ + WYV
Sbjct: 474 KELYILESLY---------TEAWQQ-SFEILWVPVQDFWTEADDAKFEA--LHMNMRWYV 521
Query: 408 VKDFR--FPTGIKLIREVFNYKDRAVIPLISPEGKVENIDTKNIISVWGIDGFPFRTSDH 465
+ + R I+ +RE + +K+R ++ + P+G+V + + ++ +W PF T+
Sbjct: 522 LGEPRKLRRAAIRFVREWWGFKNRPILVALDPKGQVMSTNAFPMVWIWQPFAHPFTTARE 581
Query: 466 TRL--TQQWNW-FWAEMTKLNPKIGDLIEEDCYLFIYGGTDSKWMQEITSAVETMKRQIE 522
L Q+WN F + T +P + + + Y+ +YGG D +W++ TS + +
Sbjct: 582 RDLWSEQEWNLEFLIDGT--DPHSLNQLVDGKYICLYGGEDMQWIKNFTSLWRNVAKAA- 638
Query: 523 TVLQLDITIEPYPLGKDDPK-----------------VVPR------FWIAIDSLFASR- 558
+I +E +GK +PK +P FW ++S++ S+
Sbjct: 639 -----NIQLEMVYVGKRNPKNGIQPIINTIREENLSHTLPDLFQIWFFWTRVESMWESKQ 693
Query: 559 --------KQKKGGDQGVQDFATREIKRLLFLKQDPKGWVILSRGSNVKLLGQGEAMYHT 610
K ++G + +D +E+ +L + GW ++S+ S++ + +G
Sbjct: 694 RMLKAHGIKGREGFKEEEKDLVLQEVVAMLGYGGEGDGWGLVSKASDMMVRAKGNLFSRG 753
Query: 611 VKDFEIW 617
+ +F W
Sbjct: 754 LAEFNEW 760
>AT1G67790.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G01680.1); Has 208 Blast hits to 125 proteins
in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 208; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr1:25417542-25420099 REVERSE
LENGTH=576
Length = 576
Score = 66.6 bits (161), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 81/159 (50%), Gaps = 7/159 (4%)
Query: 86 TSQQLSAKLKRIACQMICTARGDHYAHHTTMLILEQLKSYSWDAKALIVQAAFALEYGKF 145
+ + L + RI+ QM+C G++ TM++ + LK Y WDAKA++V A YG
Sbjct: 69 SKETLPYAIFRISVQMLCPCTGENEIRKRTMVLFDLLKEYRWDAKAVLVLGVLAATYGGL 128
Query: 146 LFLPQIPKY-PVERSLAELNGLLLIHQNTQHLIYFSS---VVKKVMQVIECITEWKRLTS 201
L + PV S+A+LN L + + T+ + S ++K ++ V +CI +++++
Sbjct: 129 LLPVHLAICDPVAASIAKLNQLPI--ERTKFRPWLESLNLLIKAMVDVTKCIIKFEKIPF 186
Query: 202 AGYDIKDVPALSDTLHEIPVVVYWAIFTFVTCTGQLDDF 240
+ D L +TL I + Y + + +TC Q+ F
Sbjct: 187 KQAKL-DNNILGETLSNIYLTTYRVVKSALTCMQQIPYF 224
Score = 65.9 bits (159), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 77/348 (22%), Positives = 154/348 (44%), Gaps = 40/348 (11%)
Query: 324 EQVRIEEFKKKHVLLFISGLENIEDETQLLRSIFEKLKDNPKEVEGYRKDDFKILWIPIV 383
+Q+ I E + K LL +S + + L + ++L D+P + +++I+W+PI
Sbjct: 228 QQISITEVQDKVTLLLLS-----KPPVEPLFFLLQQLYDHPSNTNT--EQNYEIIWVPIP 280
Query: 384 D--EWNDRYKKMLESHLQRTKIGWYVVKD--FRFPTGIKLIREVFNYKD-RAVIPLISPE 438
+W D K++ + + + W V+ T + ++ ++YKD A++ +I
Sbjct: 281 SSQKWTDEEKEIFDFY--SNSLPWISVRQPWLMSSTILNFFKQEWHYKDNEAMLVVIDSN 338
Query: 439 GKVENIDTKNIISVWGIDGFPFRTSDHTRLTQQWNWFWAEMTKLNPKIGDLIE--EDCYL 496
G+ N++ +++ +WG+ +PF S L ++ W + L I E E C
Sbjct: 339 GRFVNMNAMDMVLIWGVKAYPFSVSREDELWKEHGW---SINLLLDGIHPTFEGREIC-- 393
Query: 497 FIYGGTDSKWMQEITS---AVETMKRQIETVLQLDITIEPYPLGKD----DPKVVPRFWI 549
I+G + W+ E S ++ + Q+E + + + + + P + FW+
Sbjct: 394 -IFGSENLDWIDEFVSLARKIQNLGFQLELIYLSNQRRDERAMEESSILFSPTLQQLFWL 452
Query: 550 AIDSLFASRKQKKGGDQGVQDFATREIKRLL-FLKQDPKGWVILSRGSNVKLLGQGEAMY 608
++S+ S+ ++ + D E++ LL F +GW I+ GS + + GE M
Sbjct: 453 RLESIERSKLKRIVIEPSKPDRVFEEVRNLLDFDYGKHRGWGIIGNGSTAETV-DGEKMT 511
Query: 609 HTVKDFEIWHGKLHQDVSFDVAFKEYYEGIKAKKIGQKCEHSEIADYP 656
++ W G+ + + F A + I A+K CE S A P
Sbjct: 512 ERMRKIVRW-GEYAKGLGFTEAIE-----IAAEK---PCELSHTAVVP 550