Miyakogusa Predicted Gene
- Lj4g3v2717170.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v2717170.1 tr|E2FKH5|E2FKH5_SOYBN Sieve element occlusion e
OS=Glycine max GN=SEOe PE=2 SV=1,76.81,0,seg,NULL; SUBFAMILY NOT
NAMED,NULL; THIOREDOXIN,NULL,CUFF.51552.1
(702 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT3G01680.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Mediator c... 543 e-154
AT3G01670.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 458 e-129
AT1G67790.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 148 1e-35
>AT3G01680.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Mediator
complex subunit Med28 (InterPro:IPR021640); BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT3G01670.1); Has 122 Blast hits to 112 proteins
in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 122; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr3:252033-255246 FORWARD
LENGTH=740
Length = 740
Score = 543 bits (1400), Expect = e-154, Method: Compositional matrix adjust.
Identities = 283/702 (40%), Positives = 426/702 (60%), Gaps = 27/702 (3%)
Query: 19 ASDDTVMTKQIRATHAPVGGHINVRPLLNVVQDIFHRASFIPHIVQGKQEQLGAIEDNSH 78
+SD+++M K I+ TH+P + VR LL++V+DI RA+ L ED
Sbjct: 37 SSDESMMLKLIQQTHSPDAREVQVRGLLSLVEDILDRATLDSEDTNASMLPL-PTEDKLM 95
Query: 79 QSDLADMLDISYHTINKISCEISCKCVGGGDAHATTIEILSMLSSYSWDTKVVIGLAAFA 138
QS + +LD + I++++CEI+ K + G D+H T+ + LSS+ WD K+V+ LAAFA
Sbjct: 96 QSSMMSVLDSVSYAIDRVACEIAYKSLTGSDSHEITMSVFEHLSSFQWDGKLVLTLAAFA 155
Query: 139 ANLGEFWLMAQLYATNRLAKSVAMLK----QIHETLEQVEALGPKFETVNNLLKAMLDVA 194
N GEFWL+ Q Y+ N+LAKS+AMLK Q TLE V + +N+L++ M V
Sbjct: 156 LNYGEFWLLVQFYSKNQLAKSLAMLKLVPVQNRVTLESVS------QGLNDLIREMKSVT 209
Query: 195 NCIVEFHELPSQYIDPEAPEMLNVSTLIPSAVYWTIRSIVACASHVLGIVGLAQGYLIST 254
C+VE ELP +YI P+ P++ + + IP AVYWTIRS++AC S + I + + +
Sbjct: 210 ACVVELSELPDRYITPDVPQLSRILSTIPIAVYWTIRSVIACISQINMITAMGHEMMNTQ 269
Query: 255 IETWELSSLVHKLDNINSHLRKQLALCHQHLDDNKQIEAFQTLRRLFETLHLDNMKVLKA 314
++ WE S L +KL NI+ HL + L LC++H++ + E+ + L LF+T H+DNMK+L A
Sbjct: 270 MDLWETSMLANKLKNIHDHLAETLRLCYRHIEKQRSSESLKVLHSLFDTTHIDNMKILTA 329
Query: 315 LFCSKDDLLPLFDGSTKKRVSIDVLKRKIVLFYITDLHHVSEQEIMIFEQMYQESRQDST 374
L K + PL DG TK++V +DVL+RK VL I+DL ++ + E+ IFEQ+Y ESR++
Sbjct: 330 LVHPKPHITPLQDGLTKRKVHLDVLRRKTVLLLISDL-NILQDELSIFEQIYTESRRNLV 388
Query: 375 RVEGQ----YEFVWLPVV---EKGTSGAEAKQRFEKLQSMMPWFSVYDPSLLEPSTIKYI 427
V+G+ YE VW+PVV E +++FE L+ MPW+SV P L+E ++++
Sbjct: 389 GVDGKSHMPYEVVWVPVVDPIEDFERSPILQKKFEDLRDPMPWYSVDSPKLIERHVVEFM 448
Query: 428 KEVWLFSGKPILVVLDPHGKVMNLNAIHMMWIWGSLAYPFXXXXXXXXXXXXTWGLTLLA 487
+ W F KPILVV+DP G +LNA+HM+WIWG+ A+PF T+ L L+
Sbjct: 449 RGRWHFMNKPILVVIDPQGNEASLNALHMIWIWGTEAFPFTRSREEELWRRETFSLNLIV 508
Query: 488 DTIDPLLIDWVSAGKYICLYGGDDMEWIRKFTSTAKSVARELQIPLEIIYVGKSNPG--E 545
D ID ++ +W+ YI LYGGDD++WIR+FT AK+ A++ + LE+ YVGK N E
Sbjct: 509 DGIDSVIFNWIKPDNYIFLYGGDDLDWIRRFTMAAKATAKDSNVNLEMAYVGKRNHSHRE 568
Query: 546 RVRKINKAIREENLSNVLADLTIIWFFWVRLESMWHSKLQQSKTVENDQIMHGIMRILSF 605
++R+I++ IR ENLS+ A+ ++WFFW RLESM +SK+Q K ++D +M GI +ILS+
Sbjct: 569 QIRRISEVIRSENLSHSWAEPALMWFFWTRLESMLYSKIQLGKADDHDDVMQGIKKILSY 628
Query: 606 DSSDQGWAVINQGTGRMEQGKGDTFIKCLEEHEHWKDKVNDKGFLPAMDDY-----MQEL 660
D GWA++++G + G WK V KG+ AM D+ ++E
Sbjct: 629 DKLG-GWALLSKGPEIVMIAHGAIERTMSVYDRTWKTHVPTKGYTKAMSDHHHDEVLRET 687
Query: 661 QTPHHCNRLILPGVNGRIPQKVVCAECGRSMEKFFMYRCCNE 702
P + +GRIP+K+ C EC R MEK+ + CC++
Sbjct: 688 GKPCGHFDFHITARSGRIPEKMNCFECQRPMEKYMSFSCCHD 729
>AT3G01670.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G01680.1); Has 121 Blast hits to 111 proteins
in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 121; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr3:247288-250261 FORWARD
LENGTH=822
Length = 822
Score = 458 bits (1178), Expect = e-129, Method: Compositional matrix adjust.
Identities = 262/725 (36%), Positives = 400/725 (55%), Gaps = 49/725 (6%)
Query: 5 PRKMQS--RANRHIFSASDDTVMTKQIRATHAPVGGHINVRPLLNVVQDIFHRASFIPHI 62
P K Q+ R R +FS SDD VM ++ TH+P +V LL+VV DIF S +P I
Sbjct: 120 PGKKQAFHRNGRPMFSLSDDRVMADRVLKTHSPDMIFFDVTSLLSVVNDIF--KSHVPSI 177
Query: 63 VQGKQEQLGAIEDNSHQSDLADMLDISYHTINKISCEISCKCVGGGDAHA---------- 112
+ +D + + D+ I++ISCEI CKC+ GG++H
Sbjct: 178 DSSAPKPSLVFKDYADHTSFETFADL----IDQISCEIDCKCLHGGESHGMMTSGLHLDS 233
Query: 113 ---TTIEILSMLSSYSWDTKVVIGLAAFAANLGEFWLMAQLYATNRLAKSVAMLKQIHET 169
TT +LS++S Y WD K+V+ L+A A G F L+A+ +ATN+L KS+A++KQ+
Sbjct: 234 RNTTTFSVLSLVSKYRWDAKLVLVLSALAVKYGVFLLLAETHATNQLTKSLALIKQLPSI 293
Query: 170 LEQVEALGPKFETVNNLLKAMLDVANCIVEFHELPSQYIDPEAPEMLNVSTLIPSAVYWT 229
+ AL + + L++ M+D+ I++ ++LP +I + IP+AVYW
Sbjct: 294 FSRQNALHQRLDKTRILMQDMVDLTTTIIDIYQLPPNHITAA------FTDHIPTAVYWI 347
Query: 230 IRSIVACASHVLGIVGLAQGYLISTIETWELSSLVHKLDNINSHLRKQLALCHQHLDDNK 289
+R ++ C SH+ G G Q ++S +E E+ +L IN++L +Q +++
Sbjct: 348 VRCVLICVSHISGASGFKQDQIMSFMEVSEIHENSERLRKINAYLLEQFKKSKMTIEEGI 407
Query: 290 QIEAFQTLRRLFETL-HLDNMKVLKALFCSKDDLLPLFDGSTKKRVSIDVLKRKIVLFYI 348
E +Q L + F T+ H+D + L L D L G +K+RV I+VL +K VL I
Sbjct: 408 IEEEYQELIQTFTTIIHVDVVPPLLRLL-RPIDFLYHGAGVSKRRVGINVLTQKHVLLLI 466
Query: 349 TDLHHVSEQEIMIFEQMYQESRQDSTRVEGQYEFVWLPVVEKGTSGAEAKQRFEKLQSMM 408
+DL ++ E+E+ I E +Y E+ Q S +E +W+PV + T +AK FE L M
Sbjct: 467 SDLENI-EKELYILESLYTEAWQQS------FEILWVPVQDFWTEADDAK--FEALHMNM 517
Query: 409 PWFSVYDPSLLEPSTIKYIKEVWLFSGKPILVVLDPHGKVMNLNAIHMMWIWGSLAYPFX 468
W+ + +P L + I++++E W F +PILV LDP G+VM+ NA M+WIW A+PF
Sbjct: 518 RWYVLGEPRKLRRAAIRFVREWWGFKNRPILVALDPKGQVMSTNAFPMVWIWQPFAHPFT 577
Query: 469 XXXXXXXXXXXTWGLTLLADTIDPLLIDWVSAGKYICLYGGDDMEWIRKFTSTAKSVARE 528
W L L D DP ++ + GKYICLYGG+DM+WI+ FTS ++VA+
Sbjct: 578 TARERDLWSEQEWNLEFLIDGTDPHSLNQLVDGKYICLYGGEDMQWIKNFTSLWRNVAKA 637
Query: 529 LQIPLEIIYVGKSNPGERVRKINKAIREENLSNVLADLTIIWFFWVRLESMWHSKLQQSK 588
I LE++YVGK NP ++ I IREENLS+ L DL IWFFW R+ESMW SK + K
Sbjct: 638 ANIQLEMVYVGKRNPKNGIQPIINTIREENLSHTLPDLFQIWFFWTRVESMWESKQRMLK 697
Query: 589 T-----------VENDQIMHGIMRILSFDSSDQGWAVINQGTGRMEQGKGDTFIKCLEEH 637
E D ++ ++ +L + GW ++++ + M + KG+ F + L E
Sbjct: 698 AHGIKGREGFKEEEKDLVLQEVVAMLGYGGEGDGWGLVSKASDMMVRAKGNLFSRGLAEF 757
Query: 638 EHWKDKVNDKGFLPAMDDYMQELQTPHHCNRLILPGVNGRIPQKVVCAECGRSMEKFFMY 697
W+ + KGFL A++D++ PHHC R +LP G IP +V C EC R+MEK+++Y
Sbjct: 758 NEWEVNIPTKGFLTALNDHLLMRLPPHHCTRFMLPETAGIIPNEVECTECRRTMEKYYLY 817
Query: 698 RCCNE 702
+CC E
Sbjct: 818 QCCLE 822
>AT1G67790.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G01680.1); Has 208 Blast hits to 125 proteins
in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 208; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr1:25417542-25420099 REVERSE
LENGTH=576
Length = 576
Score = 148 bits (373), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 106/400 (26%), Positives = 194/400 (48%), Gaps = 31/400 (7%)
Query: 304 LHLDNMKVLK-ALFCSKDDLLPLFDGSTKKRVSIDVLKRKIVLFYITDLHHVSEQEIMIF 362
++L +V+K AL C + +P F + +++SI ++ K+ L ++ E +
Sbjct: 203 IYLTTYRVVKSALTCMQQ--IPYFKQT--QQISITEVQDKVTLLLLS--KPPVEPLFFLL 256
Query: 363 EQMYQESRQDSTRVEGQYEFVWLPVVEKGTSGAEAKQRFEKLQSMMPWFSVYDPSLLEPS 422
+Q+Y +T E YE +W+P+ E K+ F+ + +PW SV P L+ +
Sbjct: 257 QQLYDHP--SNTNTEQNYEIIWVPIPSSQKWTDEEKEIFDFYSNSLPWISVRQPWLMSST 314
Query: 423 TIKYIKEVWLFS-GKPILVVLDPHGKVMNLNAIHMMWIWGSLAYPFXXXXXXXXXXXXTW 481
+ + K+ W + + +LVV+D +G+ +N+NA+ M+ IWG AYPF W
Sbjct: 315 ILNFFKQEWHYKDNEAMLVVIDSNGRFVNMNAMDMVLIWGVKAYPFSVSREDELWKEHGW 374
Query: 482 GLTLLADTIDPLLIDWVSAGKYICLYGGDDMEWIRKFTSTAKSVARELQIPLEIIYVGKS 541
+ LL D I P G+ IC++G ++++WI +F S A+ + + L LE+IY+
Sbjct: 375 SINLLLDGIHPTF-----EGREICIFGSENLDWIDEFVSLARKI-QNLGFQLELIYLSNQ 428
Query: 542 NPGERVRKINKAIREENLSNVLADLTIIWFFWVRLESMWHSKLQQ--SKTVENDQIMHGI 599
ER A+ E S++L T+ FW+RLES+ SKL++ + + D++ +
Sbjct: 429 RRDER------AMEE---SSILFSPTLQQLFWLRLESIERSKLKRIVIEPSKPDRVFEEV 479
Query: 600 MRILSFD-SSDQGWAVINQGTGRMEQGKGDTFIKCLEEHEHWKDKVNDKGFLPAMDDYMQ 658
+L FD +GW +I G+ E G+ + + + W + GF A++ +
Sbjct: 480 RNLLDFDYGKHRGWGIIGNGST-AETVDGEKMTERMRKIVRWGEYAKGLGFTEAIEIAAE 538
Query: 659 ELQTPHHCNRLILPGVNGRIPQKVVCAECGRSMEKFFMYR 698
+ H ++P + V C +C M++F Y+
Sbjct: 539 KPCELSHTA--VVPFEEALTMKVVTCEKCKWPMKRFVAYQ 576
Score = 82.8 bits (203), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 58/227 (25%), Positives = 107/227 (47%), Gaps = 9/227 (3%)
Query: 14 RHIFSASDDTVMTKQIRATHAPVGGHINVRPLLNVVQDIFHRASFIPHIVQGKQEQLGAI 73
R SA ++ ++ +Q+ +H P G ++ LL V+ I SF+ +Q +
Sbjct: 4 RRDISALNEDIIVEQLLRSHDPDGRWLDSEMLLQEVETIL---SFV---LQNDVSRPLLT 57
Query: 74 EDNSHQSDLADMLDISYHTINKISCEISCKCVGGGDAHATTIEILSMLSSYSWDTKVVIG 133
E+ ++ D + + I +IS ++ C C G + T+ + +L Y WD K V+
Sbjct: 58 ENCITTIEVFDSKETLPYAIFRISVQMLCPCTGENEIRKRTMVLFDLLKEYRWDAKAVLV 117
Query: 134 LAAFAANLGEFWLMAQLYATNRLAKSVAMLKQIHETLEQVEALGPKFETVNNLLKAMLDV 193
L AA G L L + +A S+A L Q+ +E+ + P E++N L+KAM+DV
Sbjct: 118 LGVLAATYGGLLLPVHLAICDPVAASIAKLNQL--PIERTK-FRPWLESLNLLIKAMVDV 174
Query: 194 ANCIVEFHELPSQYIDPEAPEMLNVSTLIPSAVYWTIRSIVACASHV 240
CI++F ++P + + + + I Y ++S + C +
Sbjct: 175 TKCIIKFEKIPFKQAKLDNNILGETLSNIYLTTYRVVKSALTCMQQI 221