Miyakogusa Predicted Gene
- Lj0g3v0285979.2
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0285979.2 tr|A8C978|A8C978_MEDTR Sieve element occlusion by
forisomes 1 OS=Medicago truncatula GN=SEO-F1 PE=2
,76.45,0,coiled-coil,NULL,CUFF.19125.2
(647 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT3G01680.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Mediator c... 191 1e-48
AT3G01670.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 138 1e-32
AT1G67790.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 60 4e-09
>AT3G01680.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Mediator
complex subunit Med28 (InterPro:IPR021640); BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT3G01670.1); Has 122 Blast hits to 112 proteins
in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 122; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr3:252033-255246 FORWARD
LENGTH=740
Length = 740
Score = 191 bits (485), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 172/634 (27%), Positives = 292/634 (46%), Gaps = 82/634 (12%)
Query: 73 ISCQMINTPRGEHHVHQTTMWILQHLKTYSWDAKILVTLAAFTLEYGNLLYLIQTPTSDP 132
++C++ H+ TM + +HL ++ WD K+++TLAAF L YG L+Q + +
Sbjct: 113 VACEIAYKSLTGSDSHEITMSVFEHLSSFQWDGKLVLTLAAFALNYGEFWLLVQFYSKNQ 172
Query: 133 LVNSLKQLN--QVQNRKVPAA------DLVLPITEVLKNIQEWATWSSVGYDSVEVPSLS 184
L SL L VQNR + DL+ + V + E + Y + +VP LS
Sbjct: 173 LAKSLAMLKLVPVQNRVTLESVSQGLNDLIREMKSVTACVVELSELPD-RYITPDVPQLS 231
Query: 185 DALQEIPVLVYWTIASIVAATG--NLVGVSDYAL----------SRFTERLSLANSNLKE 232
L IP+ VYWTI S++A N++ + + S +L + +L E
Sbjct: 232 RILSTIPIAVYWTIRSVIACISQINMITAMGHEMMNTQMDLWETSMLANKLKNIHDHLAE 291
Query: 233 HLKLSKNQID---SVEDYLTRKKAFSNPK-DIVDFLKLLIQRNGSNLLIHDGSTKMKTDV 288
L+L I+ S E F D + L L+ + DG TK K +
Sbjct: 292 TLRLCYRHIEKQRSSESLKVLHSLFDTTHIDNMKILTALVHPKPHITPLQDGLTKRKVHL 351
Query: 289 EVFRQKHVLLFISGLDRVRDEILLLNSIHDRLKENPKEVIKGFKKEDFVILWIPIVDLWD 348
+V R+K VLL IS L+ ++DE+ + I+ + N V G + ++W+P+VD +
Sbjct: 352 DVLRRKTVLLLISDLNILQDELSIFEQIYTESRRNLVGV-DGKSHMPYEVVWVPVVDPIE 410
Query: 349 DEQ-----EHKFNLLKNIIKFYAVEC--FSELPGLGLIREKFNYWGKPIVPVLNPQGDSM 401
D + + KF L++ + +Y+V+ E + +R ++++ KPI+ V++PQG+
Sbjct: 411 DFERSPILQKKFEDLRDPMPWYSVDSPKLIERHVVEFMRGRWHFMNKPILVVIDPQGNEA 470
Query: 402 NEDAMDLIFQWGIDAFPFRKSDGYDLTLKWKWFWDVTRKVNL---GIE------VKGDRY 452
+ +A+ +I+ WG +AFPF +S +L W+ T +NL GI+ +K D Y
Sbjct: 471 SLNALHMIWIWGTEAFPFTRSREEEL---WR---RETFSLNLIVDGIDSVIFNWIKPDNY 524
Query: 453 IFIYGGADKKWVQDFTMAVEKTKR------------------HETVLRADVMIEHYHLGK 494
IF+YGG D W++ FTMA + T + E + R +I +L
Sbjct: 525 IFLYGGDDLDWIRRFTMAAKATAKDSNVNLEMAYVGKRNHSHREQIRRISEVIRSENLSH 584
Query: 495 D--EPKIVPRFWIEIES---KRLKKHHQDPLDCEIQDIVKSLLCLKQDPQGWAILTKGYN 549
EP ++ FW +ES +++ D D +Q I K L K GWA+L+KG
Sbjct: 585 SWAEPALMWFFWTRLESMLYSKIQLGKADDHDDVMQGIKKILSYDKLG--GWALLSKGPE 642
Query: 550 VKLLGHGEPMYETLKDLEIWKDKVLQKEGFDIAFKE-YYDTKLKDMIARQPCEFLN--VG 606
+ ++ HG WK V K G+ A + ++D L++ +PC + +
Sbjct: 643 IVMIAHGAIERTMSVYDRTWKTHVPTK-GYTKAMSDHHHDEVLRE--TGKPCGHFDFHIT 699
Query: 607 NENANVIATITCPNPTCGRVMDVTSVHYKCCHND 640
+ + + C C R M+ + + CCH++
Sbjct: 700 ARSGRIPEKMNCFE--CQRPME-KYMSFSCCHDE 730
>AT3G01670.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G01680.1); Has 121 Blast hits to 111 proteins
in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 121; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr3:247288-250261 FORWARD
LENGTH=822
Length = 822
Score = 138 bits (347), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 173/720 (24%), Positives = 304/720 (42%), Gaps = 128/720 (17%)
Query: 13 FDLSDSQIL-DKVYLTHLHDDDKCDKDILLNIVSNVILKTRFAESRAAV----------- 60
F LSD +++ D+V TH D D LL++V N I K+ ++
Sbjct: 134 FSLSDDRVMADRVLKTHSPDMIFFDVTSLLSVV-NDIFKSHVPSIDSSAPKPSLVFKDYA 192
Query: 61 --TSFHPEFRVLKLISCQM--INTPRGEHH-----------VHQTTMWILQHLKTYSWDA 105
TSF ++ ISC++ GE H + TT +L + Y WDA
Sbjct: 193 DHTSFETFADLIDQISCEIDCKCLHGGESHGMMTSGLHLDSRNTTTFSVLSLVSKYRWDA 252
Query: 106 KILVTLAAFTLEYGNLLYLIQTPTSDPLVNSL---KQLNQVQNRKVPAADLVLPITEVLK 162
K+++ L+A ++YG L L +T ++ L SL KQL + +R+ A L T +L
Sbjct: 253 KLVLVLSALAVKYGVFLLLAETHATNQLTKSLALIKQLPSIFSRQ-NALHQRLDKTRIL- 310
Query: 163 NIQEWATWSSVGYDSVEVPS--LSDALQE-IPVLVYWTIASIVAATGNLVGVSDYALSRF 219
+Q+ ++ D ++P ++ A + IP VYW + ++ ++ G S + +
Sbjct: 311 -MQDMVDLTTTIIDIYQLPPNHITAAFTDHIPTAVYWIVRCVLICVSHISGASGFKQDQI 369
Query: 220 ------------TERLSLANSNLKEHLKLSKNQIDSVEDYLTRKKAFSNPKDIVD----- 262
+ERL N+ L E K SK I+ ++ I+
Sbjct: 370 MSFMEVSEIHENSERLRKINAYLLEQFKKSKMTIEEGIIEEEYQELIQTFTTIIHVDVVP 429
Query: 263 -FLKLLIQRNGSNLLIHD-GSTKMKTDVEVFRQKHVLLFISGLDRVRDEILLLNSIHDRL 320
L+LL + L H G +K + + V QKHVLL IS L+ + E+ +L S++
Sbjct: 430 PLLRLL---RPIDFLYHGAGVSKRRVGINVLTQKHVLLLISDLENIEKELYILESLYTEA 486
Query: 321 KENPKEVIKGFKKEDFVILWIPIVDLWDDEQEHKFNLLKNIIKFYAVECFSEL--PGLGL 378
+ + F ILW+P+ D W + + KF L +++Y + +L +
Sbjct: 487 WQ-----------QSFEILWVPVQDFWTEADDAKFEALHMNMRWYVLGEPRKLRRAAIRF 535
Query: 379 IREKFNYWGKPIVPVLNPQGDSMNEDAMDLIFQWGIDAFPFRKSDGYDLTLKWKW----F 434
+RE + + +PI+ L+P+G M+ +A +++ W A PF + DL + +W
Sbjct: 536 VREWWGFKNRPILVALDPKGQVMSTNAFPMVWIWQPFAHPFTTARERDLWSEQEWNLEFL 595
Query: 435 WDVTRKVNLGIEVKGDRYIFIYGGADKKWVQDFTMAVEKTKRHETVLRADVMIEHYHLGK 494
D T +L V G +YI +YGG D +W+++FT + A++ +E ++GK
Sbjct: 596 IDGTDPHSLNQLVDG-KYICLYGGEDMQWIKNFTSLWRNVAK-----AANIQLEMVYVGK 649
Query: 495 DEPK--IVP---------------------RFWIEIES-----KRLKKHH---------Q 517
PK I P FW +ES +R+ K H +
Sbjct: 650 RNPKNGIQPIINTIREENLSHTLPDLFQIWFFWTRVESMWESKQRMLKAHGIKGREGFKE 709
Query: 518 DPLDCEIQDIVKSLLCLKQDPQGWAILTKGYNVKLLGHGEPMYETLKDLEIWKDKVLQKE 577
+ D +Q++V ++L + GW +++K ++ + G L + W+ + K
Sbjct: 710 EEKDLVLQEVV-AMLGYGGEGDGWGLVSKASDMMVRAKGNLFSRGLAEFNEWEVNIPTK- 767
Query: 578 GFDIAFKEYYDTKLKDMIARQPCEFLNVGNENANVIATITCPNPTCGRVMDVTSVHYKCC 637
GF A ++ +L C + + + C C R M+ + Y+CC
Sbjct: 768 GFLTALNDHLLMRLPP----HHCTRFMLPETAGIIPNEVECTE--CRRTMEKYYL-YQCC 820
>AT1G67790.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G01680.1); Has 208 Blast hits to 125 proteins
in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 208; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr1:25417542-25420099 REVERSE
LENGTH=576
Length = 576
Score = 60.5 bits (145), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 73/314 (23%), Positives = 140/314 (44%), Gaps = 48/314 (15%)
Query: 333 KEDFVILWIPI--VDLWDDEQEHKFNLLKNIIKFYAVE--CFSELPGLGLIREKFNYW-G 387
++++ I+W+PI W DE++ F+ N + + +V L +++++Y
Sbjct: 269 EQNYEIIWVPIPSSQKWTDEEKEIFDFYSNSLPWISVRQPWLMSSTILNFFKQEWHYKDN 328
Query: 388 KPIVPVLNPQGDSMNEDAMDLIFQWGIDAFPFRKSDGYDLTLKWKWFWDVTRKVNL---G 444
+ ++ V++ G +N +AMD++ WG+ A+PF S +L + W +NL G
Sbjct: 329 EAMLVVIDSNGRFVNMNAMDMVLIWGVKAYPFSVSREDELWKEHGW------SINLLLDG 382
Query: 445 IEVKGD-RYIFIYGGADKKWVQDFTMAVEKTKRHETVLRADVMIEHYHLGKDE------- 496
I + R I I+G + W+ +F K + L +I + +DE
Sbjct: 383 IHPTFEGREICIFGSENLDWIDEFVSLARKIQNLGFQLE---LIYLSNQRRDERAMEESS 439
Query: 497 ----PKIVPRFWIEIES---KRLKKHHQDPLDCE-IQDIVKSLLCLKQDP-QGWAILTKG 547
P + FW+ +ES +LK+ +P + + + V++LL +GW I+ G
Sbjct: 440 ILFSPTLQQLFWLRLESIERSKLKRIVIEPSKPDRVFEEVRNLLDFDYGKHRGWGIIGNG 499
Query: 548 YNVKLLGHGEPMYETLKDLEIWKDKVLQKEGFDIAFKEYYDTKLKDMIARQPCEFLN--- 604
+ + GE M E ++ + W + + GF T+ ++ A +PCE +
Sbjct: 500 STAETV-DGEKMTERMRKIVRWGEYA-KGLGF---------TEAIEIAAEKPCELSHTAV 548
Query: 605 VGNENANVIATITC 618
V E A + +TC
Sbjct: 549 VPFEEALTMKVVTC 562
Score = 54.7 bits (130), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 38/71 (53%)
Query: 73 ISCQMINTPRGEHHVHQTTMWILQHLKTYSWDAKILVTLAAFTLEYGNLLYLIQTPTSDP 132
IS QM+ GE+ + + TM + LK Y WDAK ++ L YG LL + DP
Sbjct: 80 ISVQMLCPCTGENEIRKRTMVLFDLLKEYRWDAKAVLVLGVLAATYGGLLLPVHLAICDP 139
Query: 133 LVNSLKQLNQV 143
+ S+ +LNQ+
Sbjct: 140 VAASIAKLNQL 150