Miyakogusa Predicted Gene
- Lj0g3v0074399.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0074399.1 Non Chatacterized Hit- tr|I1M968|I1M968_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.42277
PE,66.13,0,seg,NULL; NT-C2,EEIG1/EHBP1 N-terminal domain,CUFF.3718.1
(946 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G26160.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 495 e-140
AT5G20610.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 311 1e-84
AT1G42550.1 | Symbols: PMI1 | plastid movement impaired1 | chr1:... 106 8e-23
>AT5G26160.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G20610.1); Has 197 Blast hits to 158 proteins
in 44 species: Archae - 0; Bacteria - 14; Metazoa - 28;
Fungi - 15; Plants - 117; Viruses - 2; Other Eukaryotes
- 21 (source: NCBI BLink). | chr5:9143269-9146312
FORWARD LENGTH=976
Length = 976
Score = 495 bits (1274), Expect = e-140, Method: Compositional matrix adjust.
Identities = 343/952 (36%), Positives = 497/952 (52%), Gaps = 159/952 (16%)
Query: 27 QLLHDIEEISKALYLHNTPFKASFS-SVDNRSKSAGKTRVSKRQVDSTTRFRREDLFPRD 85
QLL DI+E+SKALYL N P + S S RS+S +T + + +++ L P
Sbjct: 17 QLLRDIKEVSKALYLTNGPQRPVLSLSPPVRSQSVSRTTEIGLVLSNK---KKKSLVP-- 71
Query: 86 KKFLSSAWIWKKPLKALAHVGHQKFNCCFNLHVHSIEGLPLNFNGISLSVNWKRKSSILR 145
W WKKPL A+AH G ++F+ CF LHVHSIEGLPLN +G L V WKRK ++
Sbjct: 72 -------WNWKKPLNAIAHFGQRRFDVCFLLHVHSIEGLPLNLDGTKLVVQWKRKDEVMT 124
Query: 146 TRPARVLKGAVEFNETLTHRCSIYGGRTGSGHALKYESRRFLIYASIVGAPEHDLGDHHV 205
T+P++VL+G EF ETLTHRCS+YG + G + KY+ + FLIY S V AP LG H +
Sbjct: 125 TQPSKVLQGTAEFEETLTHRCSVYGSKHGPHRSAKYQVKLFLIYVSPVDAPWLVLGKHWI 184
Query: 206 DVTRILPLTLEELLGDKRSGAWSTNFRLAGKALGASLNVSFSYQVMXXXXXXXXXXXXXX 265
D+TRILPL+LEE+ G + + W+T+F+L+G A A LN+SF Y V+
Sbjct: 185 DLTRILPLSLEEMEGTRSTRKWNTSFKLSGLAESAVLNLSFDYSVVTSSV---------- 234
Query: 266 XXXXXXXVMGCTPSNREVKLWQGGSFPHDVYNGSVLPSHSNGDMLPHEALLKSGFSISKS 325
C +++ V L + GS P + P +G ++ +E ++S+S
Sbjct: 235 ----------CDSTSKNVMLRRVGSVPSMDHRS---PPLDDGKVV-NEVSPSLSLNLSQS 280
Query: 326 MTILYQKLDEGNFHTSESEDSEHLEPLKSQILLKPDSPRESNLDDSDDTEFSITEQGIET 385
+ LY+KL E N S + E + L+ D DSDD + +G+ET
Sbjct: 281 IDFLYEKLGEQNPQRSTGTEVE--------LGLETD----KQAADSDD-----SGKGVET 323
Query: 386 LEEDSLKLDQTGVQTFDMSTVEIIDVDEILKDDD--IFDENNTSFDSMDAICSSYVNGAI 443
+++ L+++ + S +EIIDV EILKD+D +F+E T F +D + V
Sbjct: 324 FQQERSGLEESNDPNTESSRIEIIDVHEILKDEDESVFEE--TYF--IDQLS---VAALK 376
Query: 444 ADDSKHICMKIVDTVPETS-------------------DSFGKEHYLSIKSNYKSHKMAK 484
++ S + VD P+++ DS KE++L +KS+YK+ K++
Sbjct: 377 SEPSNLLPKHSVDGTPKSTFSSQVISESSESKSPSAMDDSTEKENFLEVKSSYKAAKISM 436
Query: 485 KSHSLDDIIESVAGDFLNMLAMENGSFGSGCGGDPQSPREQLLRQFEQEALASGDFTFDF 544
S SLDDI ESVA DFLNML +E S+ G+P SPRE LLR+FE+EA ASG+F D
Sbjct: 437 TSLSLDDITESVANDFLNMLELEECSYVYTSDGEPTSPRESLLREFEKEAFASGNFLLDL 496
Query: 545 DANGEELG-TDTLEDGFGYSAEDSDLSSIIQAAEEEYATEHQSIMQRRKAKLLEDLETDS 603
+ E + D + F +SA S + E + + Q ++ RRKAK+LEDLET++
Sbjct: 497 NGEAEYVSDIDEKSNDFSFSA------SSLDVGENKREGKSQLLIDRRKAKVLEDLETET 550
Query: 604 LMQQWGLNERDFENSPRTWSGGFGSPIEL-CDNEPSMLPSIGQGLGSFVQTRGGGFLRSM 662
L+++ ++ F+NS S GFGSPIEL D +LP +G +G V T+GGG +RSM
Sbjct: 551 LLRECDFDDNSFDNSLCVCSDGFGSPIELPVDKGLDLLP-LGDNIGPSVWTKGGGCIRSM 609
Query: 663 RPSLFVNAKNCGNLIIQASNPVVLPAKMGNDILEILLHVASSGVEDLCDHIYKLMPLKDI 722
LF +K LI+Q S PVVL +++G+DILEIL A+SG+E LC + L+PL+DI
Sbjct: 610 NHLLFRESKEASQLIMQVSVPVVLVSELGSDILEILQIFAASGIEGLCSEVNALIPLEDI 669
Query: 723 TGKSIKHILRDHRTPKGALQRQGSWEDLFEECPQGYL----TDESMGLDF----VPLEAI 774
GK+I ++ + + + + + P G L ++E G VPLE +
Sbjct: 670 MGKTIHEVVDVTKFKRTGQDCSDKSKGVVVQKPPGQLHLCSSNEEFGSSMCPSNVPLEDV 729
Query: 775 APMTIDKIEALLIEGLRTQYDMSNEQAPSYIHSQHTNTPDFGDIRANMRRYPTLEKVTKL 834
+ ID+I L IEGL+ Q MS++ PS I + + D LE
Sbjct: 730 TSLAIDEIYILSIEGLKIQCSMSDQDPPSGIAPKPMDQSD------------ALE----- 772
Query: 835 QLEDSGEIYNDVDGLMGLSLTLDQWLRLDSGIIDADQNLGKMLKILKAHHCKIPEIDDDR 894
L+ SLTLD+WLRLD G+
Sbjct: 773 --------------LIRFSLTLDEWLRLDQGM---------------------------- 790
Query: 895 LKNAVDEVKTCGEKHGLLGNHITVAFMIQLRDPLRNYEPVGVPMLVLTHVER 946
L+N ++ + G+ H L N +T+A + LRDP N EP+G ML L VER
Sbjct: 791 LENKDQDLASNGKGHT-LRNKLTLALQVLLRDPSLNNEPIGASMLALIQVER 841
>AT5G20610.1 | Symbols: | unknown protein; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G26160.1); Has
918 Blast hits to 759 proteins in 180 species: Archae -
6; Bacteria - 105; Metazoa - 264; Fungi - 89; Plants -
167; Viruses - 5; Other Eukaryotes - 282 (source: NCBI
BLink). | chr5:6969184-6972794 FORWARD LENGTH=1164
Length = 1164
Score = 311 bits (797), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 213/490 (43%), Positives = 284/490 (57%), Gaps = 48/490 (9%)
Query: 491 DIIESVAGDFLNMLAMENGSFGSGCGGDPQSPREQLLRQFEQEALASGDFTFDFDANGE- 549
D+ ESVA +FL+ML +E+ FG +P+SPRE+LLR+FE E LA+G FDF G+
Sbjct: 549 DVAESVACEFLDMLGIEHSPFGLSSESEPESPRERLLREFEMETLAAGSL-FDFSIEGDD 607
Query: 550 -ELGTDT---------LEDGFGYSAEDSDLSSIIQAAEEEYATEHQSIMQRRKAKLLEDL 599
+L D E+GF DL+S++ EEEY E Q+ + +AK+LE L
Sbjct: 608 PQLECDENFPNEYESDFEEGF-------DLASLVHDIEEEYQLEAQARVSHPRAKMLEGL 660
Query: 600 ETDSLMQQWGLNERDFENSPRTWSGGFGSPIELCDNEPSMLPSIGQGLGSFVQTRGGGFL 659
ET+SLM++WG+NE F+NSP P + EP LP +G GLG VQT+ GGFL
Sbjct: 661 ETESLMREWGMNENTFQNSPPHNGRDAFHPADFPVKEPFDLPPLGDGLGPVVQTKNGGFL 720
Query: 660 RSMRPSLFVNAKNCGNLIIQASNPVVLPAKMGNDILEILLHVASSGVEDLCDHIYKLMPL 719
RSM P LF N+K G+LI+Q S PVV+PA+MG+ I+EIL +A++G+E L K+MPL
Sbjct: 721 RSMNPLLFRNSKAGGSLIMQVSTPVVVPAEMGSGIMEILQKLATAGIEKLSMQANKVMPL 780
Query: 720 KDITGKSIKHIL-----------RDHRTPK-----GALQRQGSWEDLFEECPQGYLT--- 760
DITGK+++ +L RDH + + R G F P+ + +
Sbjct: 781 DDITGKTMEEVLWETSPTIDIGDRDHVSERESGDASGFVRGGERRTSFAAKPKKFGSSSG 840
Query: 761 DESMGLDFVPLEAIAPMTIDKIEALLIEGLRTQYDMSNEQAPSYIHSQHTNTPDFGDIRA 820
+ + ++V LE +AP+ +D+IEAL +EGLR Q MS+E APS I +Q GDI A
Sbjct: 841 NNNFDSEYVSLEDLAPLAMDQIEALSLEGLRIQSGMSDEDAPSDITAQ-----SIGDISA 895
Query: 821 NMRRYPT--LEKVTKLQ-LEDSGEIYNDVDGLMGLSLTLDQWLRLDSGII-DADQNLGKM 876
+ LE LQ L+ + +D DGLMGLSLTLD+W++LDSG I D D+ +
Sbjct: 896 FQGKSGCVGLEGAAGLQLLDIKDDGDDDDDGLMGLSLTLDEWMKLDSGDIGDEDEINERT 955
Query: 877 LKILKAHHCKIPEIDDDRLKNAVDEVKTCGEKHGLLGNHITVAFMIQLRDPLRNYEPVGV 936
KIL AHH P + G K GLLGN TVA M+QLRDPLRNYEPVG
Sbjct: 956 SKILAAHHAN-PLNFIRKGSKGEKRKGKKGRKCGLLGNTFTVALMVQLRDPLRNYEPVGA 1014
Query: 937 PMLVLTHVER 946
PML L VER
Sbjct: 1015 PMLSLIQVER 1024
Score = 218 bits (555), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 132/348 (37%), Positives = 188/348 (54%), Gaps = 40/348 (11%)
Query: 26 EQLLHDIEEISKALYLHNTPFKASFSSVDNRSKSAGKTRVSKRQVDSTTRFRREDLF-PR 84
++LL ++E+IS+ALY++ P R AG + T R +L P+
Sbjct: 13 QKLLKEVEKISEALYVNKNP----------RGSVAGSNKTP------TKPLSRSNLAEPK 56
Query: 85 DKKFLSSAWIWKKPLKALAHVGHQKFNCCFNLHVHSIEGLPLNFNGISLSVNWKRKSSIL 144
+KK S W W PL+A+ HV +++FNCCF+ VHSIEGLP F +SL+V+WKR+ L
Sbjct: 57 EKK---SFWNW--PLRAINHVRNRRFNCCFSAQVHSIEGLPPIFQDLSLTVHWKRRDESL 111
Query: 145 RTRPARVLKGAVEFNETLTHRCSIYGGRTGSGHALKYESRRFLIYASIVGAPEHDLGDHH 204
TRPA+V G EF + LTH CS+YG R+G H+ KYE++ FL+Y S+VG+PE DLG H
Sbjct: 112 STRPAKVSNGRAEFKDKLTHTCSVYGSRSGPHHSAKYEAKHFLLYVSLVGSPEIDLGKHR 171
Query: 205 VDVTRILPLTLEELLGDKRSGAWSTNFRLAGKALGASLNVSFSYQVM-----XXXXXXXX 259
+D+T++LPLTLEEL +K SG WST F+L+GKA GA+L++SF Y V+
Sbjct: 172 MDLTKLLPLTLEELQDEKSSGKWSTTFQLSGKANGATLSMSFGYTVVGDTRNPASSGSTQ 231
Query: 260 XXXXXXXXXXXXXVMGCTPS-NREVKLWQGGS----FPHDVYNGSVLPSHSNGDMLP--H 312
G T + + + L G S + H + N P N + + H
Sbjct: 232 NFRSSSNVKQTSNNTGLTRAISAKSSLGNGKSASRRYDHSIVNRESHPLSQNMEEIKDLH 291
Query: 313 EALLKSGFSISKSMTILYQKLDEGNFHTSESEDSE------HLEPLKS 354
E L + S+ LYQK DE + E H+EP++S
Sbjct: 292 EILPAVQSDLGSSVNTLYQKFDEEKVDPANESQFEFDVVTKHIEPVES 339
>AT1G42550.1 | Symbols: PMI1 | plastid movement impaired1 |
chr1:15977131-15979734 FORWARD LENGTH=843
Length = 843
Score = 106 bits (264), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 55/164 (33%), Positives = 91/164 (55%), Gaps = 6/164 (3%)
Query: 93 WIWKKPLKALAHVGHQKFNCCFNLHVHSIEGLPLNFNGISLSVNWKRKSS---ILRTRPA 149
W WK P++ L +G QK +C ++ V + + LP + NG+ L V ++K + ++T P
Sbjct: 123 WNWK-PIRGLVRIGMQKLSCLLSVEVVAAQNLPASMNGLRLGVCVRKKETKDGAVQTMPC 181
Query: 150 RVLKGAVEFNETLTHRCSI-YGGRTGSGHALKYESRRFLIYASIVGAPEHDLGDHHVDVT 208
RV +G+ +F ETL +C + Y G G K+E+R FL Y V A E + G H VD++
Sbjct: 182 RVSQGSADFEETLFIKCHVYYSPANGKGSPAKFEARPFLFYLFAVDAKELEFGRHVVDLS 241
Query: 209 RILPLTLEEL-LGDKRSGAWSTNFRLAGKALGASLNVSFSYQVM 251
++ ++E++ R W N+ L+GKA G L + +Q+M
Sbjct: 242 ELIQESVEKMNYEGARVRQWDMNWGLSGKAKGGELALKLGFQIM 285
Score = 61.2 bits (147), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 53/189 (28%), Positives = 87/189 (46%), Gaps = 21/189 (11%)
Query: 634 DNEPSMLPSIGQGLGSFVQTRGGGFLRSMRP-SLFVNAKNCGNLIIQASNP-VVLPAKMG 691
D + L +G+G+G VQTR GG+L SM P V K+ L++Q S VVLP
Sbjct: 510 DESENYLSDLGKGIGCVVQTRDGGYLVSMNPFDTVVMRKDTPKLVMQISKQIVVLPEAGP 569
Query: 692 NDILEILLHVASSGVEDLCDHIYKLMPLKDITGKSIKHIL------------RDHRTPKG 739
E+ +A SG E+L I LM + ++ GK+ + + R
Sbjct: 570 ATGFELFHRMAGSG-EELESKISSLMAIDELMGKTGEQVAFEGIASAIIQGRNKERANTS 628
Query: 740 ALQRQGSWEDLFEECPQGYLTDESMGLDFV---PL---EAIAPMTIDKIEALLIEGLRTQ 793
A + + + + G G+ V PL E + +++ K+E +++EGL+ Q
Sbjct: 629 AARTVAAVKTMANAMSSGRRERIMTGIWNVEENPLTSAEEVLAVSLQKLEEMVVEGLKIQ 688
Query: 794 YDMSNEQAP 802
DM +++AP
Sbjct: 689 ADMVDDEAP 697