Miyakogusa Predicted Gene
- Lj1g3v3740580.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v3740580.1 Non Characterized Hit- tr|I1MLP6|I1MLP6_SOYBN
Uncharacterized protein OS=Glycine max PE=4 SV=1,90.2,1e-18,
,CUFF.31129.1
(384 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                   Score     E
Sequences producing significant alignments:                       (bits)   Value
AT3G01720.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   505   e-143
AT5G25265.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...    55   1e-07
AT5G13500.3 | Symbols: | unknown protein; FUNCTIONS IN: molecul...    54   1e-07
AT5G13500.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul...    54   1e-07
AT5G13500.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...    54   1e-07
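The hit summary lines above end with a bit score and an E-value, and this legacy report format abbreviates the strongest E-value as "e-143" rather than "1e-143", which Python's float() rejects. A minimal sketch of reading one summary line, assuming the last two whitespace-separated fields are always the score and the E-value; the function names parse_evalue and parse_hit_line are hypothetical and not part of BLAST:

```python
def parse_evalue(text):
    """Convert an E-value field to float, accepting the 'e-143' shorthand."""
    # Assumption: a leading 'e' means the mantissa 1 was omitted.
    if text.startswith("e"):
        text = "1" + text
    return float(text)


def parse_hit_line(line):
    """Split one hit summary line into (subject id, bit score, E-value)."""
    # The description may contain spaces, so split from the right.
    description, bits, evalue = line.rsplit(None, 2)
    subject_id = description.split()[0]
    return subject_id, float(bits), parse_evalue(evalue)


if __name__ == "__main__":
    line = ("AT3G01720.1 | Symbols: | unknown protein; "
            "FUNCTIONS IN: molecul... 505 e-143")
    print(parse_hit_line(line))
```

Splitting from the right keeps the truncated description intact, so the same function works whether or not the columns are padded with spaces.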
>AT3G01720.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: 24 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G25265.1);
Has 374 Blast hits to 211 proteins in 23 species: Archae
- 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 316;
Viruses - 0; Other Eukaryotes - 58 (source: NCBI BLink).
| chr3:262412-265608 REVERSE LENGTH=802
Length = 802
Score = 505 bits (1300), Expect = e-143, Method: Compositional matrix adjust.
Identities = 248/384 (64%), Positives = 290/384 (75%), Gaps = 7/384 (1%)
Query: 1 MHSFHLRGQPGNITRLLSCSHEDLKLYKGHSLAPTHYVPSMSRHPLTGDWYPAINKPAAV 60
MHSF GQPGNITRLLSC+ E LK YKGH LAPTHYVPSMSRHPLTGDWYPAINKPAAV
Sbjct: 414 MHSFRQSGQPGNITRLLSCTDEALKNYKGHDLAPTHYVPSMSRHPLTGDWYPAINKPAAV 473
Query: 61 LHWLNHANTDAEFIVILDADMILRGPITPWEFKAERGRPVSTPYDYLIGWGNELAKLHTS 120
+HWL+H N DAE++VILDADMILRGPITPWEFKA RGRPVSTPYDYLIG N+LA+LHT
Sbjct: 474 VHWLHHTNIDAEYVVILDADMILRGPITPWEFKAARGRPVSTPYDYLIGCDNDLARLHTR 533
Query: 121 HPDACDKVGGVIIMHIDDLRKFALLWLHKTEEVRADRAHFARNITGDIYESGWISEMYGY 180
+P+ACDKVGGVIIMHI+DLRKFA+ WL KT+EVRAD+ H+ + +TGDIYESGWISEMYGY
Sbjct: 534 NPEACDKVGGVIIMHIEDLRKFAMYWLLKTQEVRADKEHYGKELTGDIYESGWISEMYGY 593
Query: 181 SFGAAELKLRHTINGEIMTYPGYVHEPGVKYRVFHYGLRFSVGNWSFNKAEWRDVDLVNR 240
SFGAAEL LRH+IN EIM YPGYV EPG YRVFHYGL F VGNWSF+KA WR+ DL+N+
Sbjct: 594 SFGAAELNLRHSINKEIMIYPGYVPEPGADYRVFHYGLEFKVGNWSFDKANWRNTDLINK 653
Query: 241 CWSKFPEPPDPSTLDHDDKDNFQRNLLSIECVKTLNEALHLHHERKECPRAGSLSTSKGD 300
CW+KFP+PP PS + D D QR+LLSIEC + LNEAL LHH+R+ CP GS ST K
Sbjct: 654 CWAKFPDPPSPSAVHQTDNDLRQRDLLSIECGQKLNEALFLHHKRRNCPEPGSESTEKIS 713
Query: 301 KIEEFGDFNGKFDSKNNHMSANDSEGLTTVPKDRIGIPSSFRFWVIFLCSFSGFGFLVII 360
+ G+ K +D ++ + G S+ + WVI L SG GFLV++
Sbjct: 714 VSRKVGNIETK------QTQGSDETKESSGSSESEGRFSTLKLWVIALWLISGVGFLVVM 767
Query: 361 FVVHS-GHKRRGMKMKHLRSRRRN 383
+V S R + K R++RR
Sbjct: 768 LLVFSTRRGRGTTRGKGYRNKRRT 791
Score = 345 bits (885), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 170/294 (57%), Positives = 206/294 (70%), Gaps = 4/294 (1%)
Query: 1 MHSFHLRGQPGNITRLLSCSHEDLKLYKGHSLAPTHYVPSMSRHPLTGDWYPAINKPAAV 60
MHSF GQPG ITRLLSC+ + K Y+G +LAPT VPS SRHP TGDWYPAINKP V
Sbjct: 50 MHSFLKSGQPGPITRLLSCTDDQKKTYRGMNLAPTFEVPSWSRHPKTGDWYPAINKPVGV 109
Query: 61 LHWLNHANTD--AEFIVILDADMILRGPITPWEFKAERGRPVSTPYDYLIGWGNELAKLH 118
L+WL H+ +++VILDADMI+RGPI PWE AERGRP + Y YL+G N L +LH
Sbjct: 110 LYWLQHSEEAKHVDWVVILDADMIIRGPIIPWELGAERGRPFAAHYGYLVGCDNLLVRLH 169
Query: 119 TSHPDACDKVGGVIIMHIDDLRKFALLWLHKTEEVRADRAHFARNITGDIYESGWISEMY 178
T HP+ CDKVGG++ MHIDDLR A LWL KTE+VR D AH+ N+TGDIY GWISEMY
Sbjct: 170 TKHPELCDKVGGLLAMHIDDLRVLAPLWLSKTEDVRQDTAHWTTNLTGDIYGKGWISEMY 229
Query: 179 GYSFGAAELKLRHTINGEIMTYPGYVHEPGVKYRVFHYGLRFSVGNWSFNKAEWRDVDLV 238
GYSFGAAE L+H IN ++M YPGYV GV+ + HYGL FS+GNWSF K + + ++V
Sbjct: 230 GYSFGAAEAGLKHKINDDLMIYPGYVPREGVEPVLMHYGLPFSIGNWSFTKLDHHEDNIV 289
Query: 239 NRCWSKFPEPPDPSTLDHDDKDNFQRN--LLSIECVKTLNEALHLHHERKECPR 290
C FPEPP P + + D +R +LS+EC+ TLNE L L H CP+
Sbjct: 290 YDCNRLFPEPPYPREVKIMEPDPSKRRGLILSLECMNTLNEGLILRHAENGCPK 343
>AT5G25265.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: plasma membrane,
membrane; EXPRESSED IN: cultured cell, leaf; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT2G25260.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr5:8754794-8756855 REVERSE LENGTH=366
Length = 366
Score = 54.7 bits (130), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 56/224 (25%), Positives = 99/224 (44%), Gaps = 36/224 (16%)
Query: 30 HSLAPTHY---VPSMSRHPLTG---DWYPAINKPAAVLHWLNHANTDAEFIVILDADMIL 83
HS P Y +P+ PL Y +N+P A + WL + ++I++ + D I+
Sbjct: 114 HSGKPDQYMDEIPTFVAQPLPSGMDQGYVVLNRPWAFVQWLQQTDIKEDYILMSEPDHII 173
Query: 84 RGPITPWEFKAERGRPVSTPYDYLIGWGNELAKLHTSHPDA------CDKVGGV-IIMHI 136
PI A+ G + P+ Y+ E L +P+ D +G +I+
Sbjct: 174 VKPI---PNLAKDGLGAAFPFFYIEPKKYEKV-LRKYYPEVRGPVTNIDPIGNSPVIVGK 229
Query: 137 DDLRKFALLWLHKT----EEVRADRAHFARNITGDIYESGWISEMYGYSFGAAELKLRHT 192
D L+K A W++ + ++ AD+A GW+ EMY Y+ +A + +
Sbjct: 230 DALKKIAPTWMNVSLAMKKDPEADKAF------------GWVLEMYAYAVSSALHGVSNI 277
Query: 193 INGEIMTYPGYVHEPGVKYRV-FHYGLRFSV-GNWSFNK-AEWR 233
++ + M P + E G KY + + YG + + G ++ K EWR
Sbjct: 278 LHKDFMIQPPWDIEVGDKYIIHYTYGCDYDMKGKLTYGKIGEWR 321
>AT5G13500.3 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: 22 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G25265.1);
Has 35333 Blast hits to 34131 proteins in 2444 species:
Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi -
991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610
(source: NCBI BLink). | chr5:4338676-4340827 FORWARD
LENGTH=358
Length = 358
Score = 54.3 bits (129), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 50/232 (21%), Positives = 100/232 (43%), Gaps = 26/232 (11%)
Query: 11 GNITRLLSCSHEDLKLYKGHSLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHANTD 70
G TR+L + D + + + P + R Y +N+P A + WL A
Sbjct: 98 GGFTRILHSGNSDNLMDEIPTFVVDPLPPGLDRG------YVVLNRPWAFVQWLERATIK 151
Query: 71 AEFIVILDADMILRGPITPWEFKAERGRPVSTPYDYLI--GWGNELAKLHTSHPDACDKV 128
+++++ + D + + P A G P + P+ Y+ + N + K + + +
Sbjct: 152 EDYVLMAEPDHVF---VNPLPNLAVGGFPAAFPFFYITPEKYENIVRKYYPAEMGPVTNI 208
Query: 129 GGV----IIMHIDDLRKFALLWLHKTEEVRADRAHFARNITGDIYESGWISEMYGYSFGA 184
+ +I+ + L K A W++ + ++ D T + GW+ EMYGY+ +
Sbjct: 209 DPIGNSPVIISKESLEKIAPTWMNVSLTMKNDPE------TDKAF--GWVLEMYGYAIAS 260
Query: 185 AELKLRHTINGEIMTYPGYVHEPGVKYRV-FHYGLRFSV-GNWSFNK-AEWR 233
A +RH + + M P + K+ + + YG +++ G ++ K EWR
Sbjct: 261 AIHGVRHILRKDFMLQPPWDLSTKGKFIIHYTYGCDYNMKGELTYGKIGEWR 312
>AT5G13500.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: 22 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G25265.1);
Has 228 Blast hits to 200 proteins in 21 species: Archae
- 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 213;
Viruses - 0; Other Eukaryotes - 15 (source: NCBI BLink).
| chr5:4338676-4340827 FORWARD LENGTH=358
Length = 358
Score = 54.3 bits (129), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 50/232 (21%), Positives = 100/232 (43%), Gaps = 26/232 (11%)
Query: 11 GNITRLLSCSHEDLKLYKGHSLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHANTD 70
G TR+L + D + + + P + R Y +N+P A + WL A
Sbjct: 98 GGFTRILHSGNSDNLMDEIPTFVVDPLPPGLDRG------YVVLNRPWAFVQWLERATIK 151
Query: 71 AEFIVILDADMILRGPITPWEFKAERGRPVSTPYDYLI--GWGNELAKLHTSHPDACDKV 128
+++++ + D + + P A G P + P+ Y+ + N + K + + +
Sbjct: 152 EDYVLMAEPDHVF---VNPLPNLAVGGFPAAFPFFYITPEKYENIVRKYYPAEMGPVTNI 208
Query: 129 GGV----IIMHIDDLRKFALLWLHKTEEVRADRAHFARNITGDIYESGWISEMYGYSFGA 184
+ +I+ + L K A W++ + ++ D T + GW+ EMYGY+ +
Sbjct: 209 DPIGNSPVIISKESLEKIAPTWMNVSLTMKNDPE------TDKAF--GWVLEMYGYAIAS 260
Query: 185 AELKLRHTINGEIMTYPGYVHEPGVKYRV-FHYGLRFSV-GNWSFNK-AEWR 233
A +RH + + M P + K+ + + YG +++ G ++ K EWR
Sbjct: 261 AIHGVRHILRKDFMLQPPWDLSTKGKFIIHYTYGCDYNMKGELTYGKIGEWR 312
>AT5G13500.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: 22 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G25265.1);
Has 1807 Blast hits to 1807 proteins in 277 species:
Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347;
Plants - 385; Viruses - 0; Other Eukaryotes - 339
(source: NCBI BLink). | chr5:4338676-4340827 FORWARD
LENGTH=358
Length = 358
Score = 54.3 bits (129), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 50/232 (21%), Positives = 100/232 (43%), Gaps = 26/232 (11%)
Query: 11 GNITRLLSCSHEDLKLYKGHSLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHANTD 70
G TR+L + D + + + P + R Y +N+P A + WL A
Sbjct: 98 GGFTRILHSGNSDNLMDEIPTFVVDPLPPGLDRG------YVVLNRPWAFVQWLERATIK 151
Query: 71 AEFIVILDADMILRGPITPWEFKAERGRPVSTPYDYLI--GWGNELAKLHTSHPDACDKV 128
+++++ + D + + P A G P + P+ Y+ + N + K + + +
Sbjct: 152 EDYVLMAEPDHVF---VNPLPNLAVGGFPAAFPFFYITPEKYENIVRKYYPAEMGPVTNI 208
Query: 129 GGV----IIMHIDDLRKFALLWLHKTEEVRADRAHFARNITGDIYESGWISEMYGYSFGA 184
+ +I+ + L K A W++ + ++ D T + GW+ EMYGY+ +
Sbjct: 209 DPIGNSPVIISKESLEKIAPTWMNVSLTMKNDPE------TDKAF--GWVLEMYGYAIAS 260
Query: 185 AELKLRHTINGEIMTYPGYVHEPGVKYRV-FHYGLRFSV-GNWSFNK-AEWR 233
A +RH + + M P + K+ + + YG +++ G ++ K EWR
Sbjct: 261 AIHGVRHILRKDFMLQPPWDLSTKGKFIIHYTYGCDYNMKGELTYGKIGEWR 312