Miyakogusa Predicted Gene
- Lj0g3v0256639.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0256639.1 tr|E2FKH3|E2FKH3_SOYBN Sieve element occlusion c
OS=Glycine max GN=SEOc PE=2 SV=1,82.14,0,coiled-coil,NULL; SUBFAMILY
NOT NAMED,NULL; THIOREDOXIN,NULL,CUFF.16866.1
(702 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
                                                                   Score       E
Sequences producing significant alignments:                       (bits)   Value
AT3G01680.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Mediator c...   548   e-156
AT3G01670.1 | Symbols: | unknown protein; BEST Arabidopsis thal...   449   e-126
AT1G67790.1 | Symbols: | unknown protein; BEST Arabidopsis thal...   166   7e-41
>AT3G01680.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Mediator
complex subunit Med28 (InterPro:IPR021640); BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT3G01670.1); Has 122 Blast hits to 112 proteins
in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 122; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr3:252033-255246 FORWARD
LENGTH=740
Length = 740
Score = 548 bits (1411), Expect = e-156, Method: Compositional matrix adjust.
Identities = 284/698 (40%), Positives = 426/698 (61%), Gaps = 28/698 (4%)
Query: 24 ASDDSAMMKQVQGTHAPDGREIDVKHIIQIVDEILIQVIGRGVEGHDVKREQETLEISAA 83
+SD+S M+K +Q TH+PD RE+ V+ ++ +V++IL + ++ D L
Sbjct: 37 SSDESMMLKLIQQTHSPDAREVQVRGLLSLVEDILDRAT---LDSEDTNASMLPLPTEDK 93
Query: 84 LAEFDM---LDSLAFVINKISCELSCKWSGGGDAHASTMVLLTYMSNYAWHAKVVLTLAA 140
L + M LDS+++ I++++CE++ K G D+H TM + ++S++ W K+VLTLAA
Sbjct: 94 LMQSSMMSVLDSVSYAIDRVACEIAYKSLTGSDSHEITMSVFEHLSSFQWDGKLVLTLAA 153
Query: 141 FAVISGEFWLVANMSALNTLAKSVALLKQLPDMVENSASLRPQFDALNKLVKAALDVTYC 200
FA+ GEFWL+ + N LAKS+A+LK +P V+N +L LN L++ VT C
Sbjct: 154 FALNYGEFWLLVQFYSKNQLAKSLAMLKLVP--VQNRVTLESVSQGLNDLIREMKSVTAC 211
Query: 201 IIEFKELPSEYISEDMPPMSVASAHIPIAAYWVIRSIVACASQIALLIGSRNEAISSATE 260
++E ELP YI+ D+P +S + IPIA YW IRS++AC SQI ++ +E +++ +
Sbjct: 212 VVELSELPDRYITPDVPQLSRILSTIPIAVYWTIRSVIACISQINMITAMGHEMMNTQMD 271
Query: 261 AWELSSLAHKVTSIHEHLKNQLELCYQYIDDKRHVEAFHNLIRLFETSHVDNMKILRALI 320
WE S LA+K+ +IH+HL L LCY++I+ +R E+ L LF+T+H+DNMKIL AL+
Sbjct: 272 LWETSMLANKLKNIHDHLAETLRLCYRHIEKQRSSESLKVLHSLFDTTHIDNMKILTALV 331
Query: 321 YPKDDIPPLIDGTTKSKVSLEVLRRKHVLLLISDLDLAQEEIMVLDNLYKDAR------- 373
+PK I PL DG TK KV L+VLRRK VLLLISDL++ Q+E+ + + +Y ++R
Sbjct: 332 HPKPHITPLQDGLTKRKVHLDVLRRKTVLLLISDLNILQDELSIFEQIYTESRRNLVGVD 391
Query: 374 SRGEMHYEMVWIPVVDKA---TWNDVNKQKFEYLQSLMAWHSVRDPFIIEPSVIRYNKEV 430
+ M YE+VW+PVVD + + ++KFE L+ M W+SV P +IE V+ + +
Sbjct: 392 GKSHMPYEVVWVPVVDPIEDFERSPILQKKFEDLRDPMPWYSVDSPKLIERHVVEFMRGR 451
Query: 431 WNFTKRAIVVALDPQGRLSSPNALHMIWIWGNLAFPFTREKEESLWKQEIWSLELLVDGI 490
W+F + I+V +DPQG +S NALHMIWIWG AFPFTR +EE LW++E +SL L+VDGI
Sbjct: 452 WHFMNKPILVVIDPQGNEASLNALHMIWIWGTEAFPFTRSREEELWRRETFSLNLIVDGI 511
Query: 491 DPMVLEWMAEEKIVCLYGGEDLEWIETFTATAMNVARAGKFDLEMVYVGKSN--AKERMQ 548
D ++ W+ + + LYGG+DL+WI FT A A+ +LEM YVGK N +E+++
Sbjct: 512 DSVIFNWIKPDNYIFLYGGDDLDWIRRFTMAAKATAKDSNVNLEMAYVGKRNHSHREQIR 571
Query: 549 RMISTFANRKFSYFWPNVTSIWFFWARLESMLYSKLQHGSTVENDPIMSEVMTVLSFDGS 608
R+ + S+ W +WFFW RLESMLYSK+Q G ++D +M + +LS+D
Sbjct: 572 RISEVIRSENLSHSWAEPALMWFFWTRLESMLYSKIQLGKADDHDDVMQGIKKILSYDKL 631
Query: 609 DRGWAIFCRGASEMARAKGDTALTSLRDFDK-WKHKIEQDGLVPALNDYLHQ---IHTPD 664
GWA+ +G + A G T + +D+ WK + G A++D+ H T
Sbjct: 632 G-GWALLSKGPEIVMIAHGAIERT-MSVYDRTWKTHVPTKGYTKAMSDHHHDEVLRETGK 689
Query: 665 HCNRLI--LPGSTGGIPEKVVCAECGRQMEKYFMYRCC 700
C + +G IPEK+ C EC R MEKY + CC
Sbjct: 690 PCGHFDFHITARSGRIPEKMNCFECQRPMEKYMSFSCC 727
>AT3G01670.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G01680.1); Has 121 Blast hits to 111 proteins
in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 121; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr3:247288-250261 FORWARD
LENGTH=822
Length = 822
Score = 449 bits (1156), Expect = e-126, Method: Compositional matrix adjust.
Identities = 263/721 (36%), Positives = 399/721 (55%), Gaps = 44/721 (6%)
Query: 9 APRKMQ--QRKERRMFSASDDSAMMKQVQGTHAPDGREIDVKHIIQIVDEILIQVIGRGV 66
P K Q R R MFS SDD M +V TH+PD DV ++ +V++I V
Sbjct: 119 GPGKKQAFHRNGRPMFSLSDDRVMADRVLKTHSPDMIFFDVTSLLSVVNDIF----KSHV 174
Query: 67 EGHDVKREQETLEISAALAEFDMLDSLAFVINKISCELSCKWSGGGDAHA---------- 116
D + +L + A+ ++ A +I++ISCE+ CK GG++H
Sbjct: 175 PSIDSSAPKPSL-VFKDYADHTSFETFADLIDQISCEIDCKCLHGGESHGMMTSGLHLDS 233
Query: 117 ---STMVLLTYMSNYAWHAKVVLTLAAFAVISGEFWLVANMSALNTLAKSVALLKQLPDM 173
+T +L+ +S Y W AK+VL L+A AV G F L+A A N L KS+AL+KQLP +
Sbjct: 234 RNTTTFSVLSLVSKYRWDAKLVLVLSALAVKYGVFLLLAETHATNQLTKSLALIKQLPSI 293
Query: 174 VENSASLRPQFDALNKLVKAALDVTYCIIEFKELPSEYISEDMPPMSVASAHIPIAAYWV 233
+L + D L++ +D+T II+ +LP +I+ + + HIP A YW+
Sbjct: 294 FSRQNALHQRLDKTRILMQDMVDLTTTIIDIYQLPPNHIT------AAFTDHIPTAVYWI 347
Query: 234 IRSIVACASQIALLIGSRNEAISSATEAWELSSLAHKVTSIHEHLKNQLELCYQYIDDKR 293
+R ++ C S I+ G + + I S E E+ + ++ I+ +L Q + I++
Sbjct: 348 VRCVLICVSHISGASGFKQDQIMSFMEVSEIHENSERLRKINAYLLEQFKKSKMTIEEGI 407
Query: 294 HVEAFHNLIRLFETS-HVDNMKILRALIYPKDDIPPLIDGTTKSKVSLEVLRRKHVLLLI 352
E + LI+ F T HVD + L L+ P D + G +K +V + VL +KHVLLLI
Sbjct: 408 IEEEYQELIQTFTTIIHVDVVPPLLRLLRPIDFLYHGA-GVSKRRVGINVLTQKHVLLLI 466
Query: 353 SDLDLAQEEIMVLDNLYKDARSRGEMHYEMVWIPVVDKATWNDVNKQKFEYLQSLMAWHS 412
SDL+ ++E+ +L++LY +A + +E++W+PV D W + + KFE L M W+
Sbjct: 467 SDLENIEKELYILESLYTEAWQQS---FEILWVPVQD--FWTEADDAKFEALHMNMRWYV 521
Query: 413 VRDPFIIEPSVIRYNKEVWNFTKRAIVVALDPQGRLSSPNALHMIWIWGNLAFPFTREKE 472
+ +P + + IR+ +E W F R I+VALDP+G++ S NA M+WIW A PFT +E
Sbjct: 522 LGEPRKLRRAAIRFVREWWGFKNRPILVALDPKGQVMSTNAFPMVWIWQPFAHPFTTARE 581
Query: 473 ESLWKQEIWSLELLVDGIDPMVLEWMAEEKIVCLYGGEDLEWIETFTATAMNVARAGKFD 532
LW ++ W+LE L+DG DP L + + K +CLYGGED++WI+ FT+ NVA+A
Sbjct: 582 RDLWSEQEWNLEFLIDGTDPHSLNQLVDGKYICLYGGEDMQWIKNFTSLWRNVAKAANIQ 641
Query: 533 LEMVYVGKSNAKERMQRMISTFANRKFSYFWPNVTSIWFFWARLESMLYSKLQ----HG- 587
LEMVYVGK N K +Q +I+T S+ P++ IWFFW R+ESM SK + HG
Sbjct: 642 LEMVYVGKRNPKNGIQPIINTIREENLSHTLPDLFQIWFFWTRVESMWESKQRMLKAHGI 701
Query: 588 ------STVENDPIMSEVMTVLSFDGSDRGWAIFCRGASEMARAKGDTALTSLRDFDKWK 641
E D ++ EV+ +L + G GW + + + M RAKG+ L +F++W+
Sbjct: 702 KGREGFKEEEKDLVLQEVVAMLGYGGEGDGWGLVSKASDMMVRAKGNLFSRGLAEFNEWE 761
Query: 642 HKIEQDGLVPALNDYLHQIHTPDHCNRLILPGSTGGIPEKVVCAECGRQMEKYFMYRCCV 701
I G + ALND+L P HC R +LP + G IP +V C EC R MEKY++Y+CC+
Sbjct: 762 VNIPTKGFLTALNDHLLMRLPPHHCTRFMLPETAGIIPNEVECTECRRTMEKYYLYQCCL 821
Query: 702 E 702
E
Sbjct: 822 E 822
>AT1G67790.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G01680.1); Has 208 Blast hits to 125 proteins
in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 208; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr1:25417542-25420099 REVERSE
LENGTH=576
Length = 576
Score = 166 bits (419), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 104/370 (28%), Positives = 184/370 (49%), Gaps = 30/370 (8%)
Query: 337 KVSLEVLRRKHVLLLISDLDLAQEEIMVLDNLYK-DARSRGEMHYEMVWIPVVDKATWND 395
++S+ ++ K LLL+S + + +L LY + + E +YE++W+P+ W D
Sbjct: 229 QISITEVQDKVTLLLLSKPPV-EPLFFLLQQLYDHPSNTNTEQNYEIIWVPIPSSQKWTD 287
Query: 396 VNKQKFEYLQSLMAWHSVRDPFIIEPSVIRYNKEVWNFT-KRAIVVALDPQGRLSSPNAL 454
K+ F++ + + W SVR P+++ +++ + K+ W++ A++V +D GR + NA+
Sbjct: 288 EEKEIFDFYSNSLPWISVRQPWLMSSTILNFFKQEWHYKDNEAMLVVIDSNGRFVNMNAM 347
Query: 455 HMIWIWGNLAFPFTREKEESLWKQEIWSLELLVDGIDPMVLEWMAEEKIVCLYGGEDLEW 514
M+ IWG A+PF+ +E+ LWK+ WS+ LL+DGI P E + +C++G E+L+W
Sbjct: 348 DMVLIWGVKAYPFSVSREDELWKEHGWSINLLLDGIHPTF-----EGREICIFGSENLDW 402
Query: 515 IETFTATAMNVARAGKFDLEMVYVGKSNAKERMQRMISTFANRKFSYFWPNVTSIWFFWA 574
I+ F + A + G F LE++Y+ ER S F P + + FW
Sbjct: 403 IDEFVSLARKIQNLG-FQLELIYLSNQRRDERAMEESSIL-------FSPTLQQL--FWL 452
Query: 575 RLESMLYSKLQHGSTVENDP--IMSEVMTVLSFD-GSDRGWAIFCRGASEMARAKGDTAL 631
RLES+ SKL+ + P + EV +L FD G RGW I G++ G+
Sbjct: 453 RLESIERSKLKRIVIEPSKPDRVFEEVRNLLDFDYGKHRGWGIIGNGST-AETVDGEKMT 511
Query: 632 TSLRDFDKWKHKIEQDGLVPALNDYLHQIHTPDHC---NRLILPGSTGGIPEKVVCAECG 688
+R +W + G A+ +I C + ++P + V C +C
Sbjct: 512 ERMRKIVRWGEYAKGLGFTEAI-----EIAAEKPCELSHTAVVPFEEALTMKVVTCEKCK 566
Query: 689 RQMEKYFMYR 698
M+++ Y+
Sbjct: 567 WPMKRFVAYQ 576
Score = 121 bits (303), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 74/242 (30%), Positives = 127/242 (52%), Gaps = 8/242 (3%)
Query: 19 RRMFSASDDSAMMKQVQGTHAPDGREIDVKHIIQIVDEILIQVIGRGVEGHDVKREQETL 78
RR SA ++ +++Q+ +H PDGR +D + ++Q V+ IL V+ +DV R T
Sbjct: 4 RRDISALNEDIIVEQLLRSHDPDGRWLDSEMLLQEVETILSFVLQ-----NDVSRPLLTE 58
Query: 79 EISAALAEFDMLDSLAFVINKISCELSCKWSGGGDAHASTMVLLTYMSNYAWHAKVVLTL 138
+ FD ++L + I +IS ++ C +G + TMVL + Y W AK VL L
Sbjct: 59 NCITTIEVFDSKETLPYAIFRISVQMLCPCTGENEIRKRTMVLFDLLKEYRWDAKAVLVL 118
Query: 139 AAFAVISGEFWLVANMSALNTLAKSVALLKQLPDMVENSASLRPQFDALNKLVKAALDVT 198
A G L +++ + +A S+A L QLP +E + RP ++LN L+KA +DVT
Sbjct: 119 GVLAATYGGLLLPVHLAICDPVAASIAKLNQLP--IERT-KFRPWLESLNLLIKAMVDVT 175
Query: 199 YCIIEFKELPSEYISEDMPPMSVASAHIPIAAYWVIRSIVACASQIALLIGSRNEAISSA 258
CII+F+++P + D + ++I + Y V++S + C QI ++ +I+
Sbjct: 176 KCIIKFEKIPFKQAKLDNNILGETLSNIYLTTYRVVKSALTCMQQIPYFKQTQQISITEV 235
Query: 259 TE 260
+
Sbjct: 236 QD 237