Miyakogusa Predicted Gene
- Lj5g3v1326740.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v1326740.1 Non Characterized Hit- tr|I1MJ02|I1MJ02_SOYBN
Uncharacterized protein OS=Glycine max PE=4
SV=1,64,3e-18,seg,NULL,CUFF.55533.1
(446 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                  Score      E
Sequences producing significant alignments:                       (bits)   Value
AT3G04550.1 | Symbols: | unknown protein; INVOLVED IN: biologic...   350   9e-97
AT5G28500.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   340   9e-94
AT5G28500.2 | Symbols: | unknown protein; BEST Arabidopsis thal...   221   1e-57
>AT3G04550.1 | Symbols: | unknown protein; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast
stroma, chloroplast; EXPRESSED IN: 22 plant structures;
EXPRESSED DURING: 13 growth stages; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G28500.1); Has 110 Blast hits to 110 proteins
in 51 species: Archae - 0; Bacteria - 67; Metazoa - 1;
Fungi - 0; Plants - 41; Viruses - 0; Other Eukaryotes -
1 (source: NCBI BLink). | chr3:1225961-1227310 FORWARD
LENGTH=449
Length = 449
Score = 350 bits (899), Expect = 9e-97, Method: Compositional matrix adjust.
Identities = 198/384 (51%), Positives = 242/384 (63%), Gaps = 22/384 (5%)
Query: 70  TLDIPGQIDILANRLGVWHEYAPLISSHIREGFTPPTIEELTGITGVEQNRFIVAAQVRD 129
           +LD  G+I+ILA R+ +W EYAPLISS   +GFTPPTIEELTGI+ +EQNR IV AQVRD
Sbjct: 81  SLDSAGKIEILAGRMALWFEYAPLISSLYTDGFTPPTIEELTGISSIEQNRLIVGAQVRD 140

Query: 130 SLLQSNTDPDLVSFFDNGGAELLYQIXXXXXXXXXXXXXFIVENKSDGKGANELARSMKD 189
           S+LQS  +P+L+S FD GGAELLY+I             FI++   D KGA +LAR++KD
Sbjct: 141 SILQSIHEPELISAFDTGGAELLYEIRLLSTTQRVAAATFIIDRNIDSKGAQDLARAIKD 200

Query: 190 FPSRRGEKGWESFDYNLPGDCLSFMYYRLSREYNRNP-EQRDSVLEQALRVAESEKARDA 248
           +P+RRG+ GW  FDYNLPGDCLSF+YYR SRE N+NP +QR S+L QAL VAESEKA++
Sbjct: 201 YPNRRGDVGWLDFDYNLPGDCLSFLYYRQSRE-NKNPSDQRTSMLLQALGVAESEKAKNR 259

Query: 249 IQKEL----KXXXXXXXXXXXXXXXXXXXXXXRLKIGEXXXXXXXXXLPVCNADEGGDAI 304
           +  EL    +                      RLK GE         LPVC A+EG   I
Sbjct: 260 LNTELYGDKEAEKEKEKKKKEEEVKAIRIPVVRLKFGEVAEATSVVVLPVCKAEEGEKKI 319

Query: 305 LEAPSECRTXXXXXXXXXXXXXXXXXXXWEKWVVLPGWEPLVELGKGCVVVSFVDAR-VL 363
           LEAP E                      W++WVVLP W P+  +GKG V VSF D R VL
Sbjct: 320 LEAPMEI-------IAGGDFKVVEAEKGWKRWVVLPSWNPVAAIGKGGVAVSFRDDRKVL 372

Query: 364 PWKANRWYKEEPILVVADRSKREVGADDGFYLVKVEDDGVGLKVERGLGLKERGVSESLG 423
           PW      KEEP+LVVADR +  V ADDG+YLV  E+   GLK+E+G  LK R V ESLG
Sbjct: 373 PWDG----KEEPLLVVADRVRNVVEADDGYYLVVAEN---GLKLEKGSDLKAREVKESLG 425

Query: 424 SVILVVRPPREEDDG-QLSDEDWD 446
            V+LVVRPPRE+DD  Q S ++WD
Sbjct: 426 MVVLVVRPPREDDDDWQTSHQNWD 449
>AT5G28500.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast
stroma, chloroplast; EXPRESSED IN: 23 plant structures;
EXPRESSED DURING: 13 growth stages; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G04550.1); Has 109 Blast hits to 109 proteins
in 49 species: Archae - 0; Bacteria - 67; Metazoa - 0;
Fungi - 0; Plants - 41; Viruses - 0; Other Eukaryotes -
1 (source: NCBI BLink). | chr5:10477810-10479114 FORWARD
LENGTH=434
Length = 434
Score = 340 bits (873), Expect = 9e-94, Method: Compositional matrix adjust.
Identities = 190/379 (50%), Positives = 241/379 (63%), Gaps = 17/379 (4%)
Query: 70  TLDIPGQIDILANRLGVWHEYAPLISSHIREGFTPPTIEELTGITGVEQNRFIVAAQVRD 129
           +LD  G+I++LA+RLG+W EYAPLISS   EGFTPP+IEELTGI+GVEQN  IV AQVRD
Sbjct: 71  SLDTAGKIEVLADRLGLWFEYAPLISSLYTEGFTPPSIEELTGISGVEQNSLIVGAQVRD 130

Query: 130 SLLQSNTDPDLVSFFDNGGAELLYQIXXXXXXXXXXXXXFIVENKSDGKGANELARSMKD 189
           SL+QS   P+L++ FD  GAELLY+I             +IV++  D KGA +LAR++KD
Sbjct: 131 SLVQSGAKPELIAAFDTNGAELLYEIRLLNTTQRVAAAEYIVDHGFDTKGAGDLARAIKD 190

Query: 190 FPSRRGEKGWESFDYNLPGDCLSFMYYRLSREYNRNPEQRDSVLEQALRVAESEKARDAI 249
           FP RRG+ G   FDYNLPGDCLSFM YR SRE+    E R ++LEQAL  A +EKA+ A+
Sbjct: 191 FPHRRGDVGLGDFDYNLPGDCLSFMLYRKSREHRSPSEIRTTLLEQALETAVTEKAKKAV 250

Query: 250 QKELKXXXXXXXXXXXXXXXXXXXXXXRLKIGEXXXXXXXXXLPVCNADEGGDAILEAPS 309
            +EL                       RL+ GE         LPVC A+EG + +LEAP
Sbjct: 251 LRELH-GESEEERVKEEEIKIIRVPVVRLRFGEVAGASSVVVLPVCKAEEGEEKLLEAPM 309

Query: 310 ECRTXXXXXXXXXXXXXXXXXXXWEKWVVLPGWEPLVELGKGCVVVSFVDAR-VLPWKAN 368
           E  +                   W +WVVLPGW+P+V + KG V VSF D R VLPW
Sbjct: 310 EFES-------GGEFGVVEAEKDWSRWVVLPGWDPVVAVRKG-VAVSFSDDREVLPWNG- 360

Query: 369 RWYKEEPILVVADRSKREVGADDGFYLVKVEDDGVGLKVERGLGLKERGVSESLGSVILV 428
              K E I+VV DR K+ V AD+G+Y + V D   G+K++RGL LKE+GV+ESLG V+LV
Sbjct: 361 ---KGEAIMVVIDREKKTVEADNGYYYLVVADG--GMKLDRGLVLKEKGVNESLGMVVLV 415

Query: 429 VRPPREEDDG-QLSDEDWD 446
           VRPPR++DD  Q++DEDWD
Sbjct: 416 VRPPRDDDDEWQINDEDWD 434
>AT5G28500.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G04550.1). | chr5:10477810-10478945 FORWARD
LENGTH=269
Length = 269
Score = 221 bits (562), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 107/185 (57%), Positives = 134/185 (72%)
Query: 70  TLDIPGQIDILANRLGVWHEYAPLISSHIREGFTPPTIEELTGITGVEQNRFIVAAQVRD 129
           +LD  G+I++LA+RLG+W EYAPLISS   EGFTPP+IEELTGI+GVEQN  IV AQVRD
Sbjct: 71  SLDTAGKIEVLADRLGLWFEYAPLISSLYTEGFTPPSIEELTGISGVEQNSLIVGAQVRD 130

Query: 130 SLLQSNTDPDLVSFFDNGGAELLYQIXXXXXXXXXXXXXFIVENKSDGKGANELARSMKD 189
           SL+QS   P+L++ FD  GAELLY+I             +IV++  D KGA +LAR++KD
Sbjct: 131 SLVQSGAKPELIAAFDTNGAELLYEIRLLNTTQRVAAAEYIVDHGFDTKGAGDLARAIKD 190

Query: 190 FPSRRGEKGWESFDYNLPGDCLSFMYYRLSREYNRNPEQRDSVLEQALRVAESEKARDAI 249
           FP RRG+ G   FDYNLPGDCLSFM YR SRE+    E R ++LEQAL  A +EKA+ A+
Sbjct: 191 FPHRRGDVGLGDFDYNLPGDCLSFMLYRKSREHRSPSEIRTTLLEQALETAVTEKAKKAV 250

Query: 250 QKELK 254
            +EL
Sbjct: 251 LRELH 255