Miyakogusa Predicted Gene
- Lj2g3v1366060.2
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj2g3v1366060.2 Non Chatacterized Hit- tr|I1J5F4|I1J5F4_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.6409
PE=,95.85,0,SUBFAMILY NOT NAMED,NULL; FAMILY NOT
NAMED,NULL,CUFF.36864.2
(338 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G70160.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 617 e-177
AT4G27020.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 538 e-153
AT5G54870.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 528 e-150
>AT1G70160.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: 24 plant structures; EXPRESSED
DURING: 15 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT4G27020.1);
Has 108 Blast hits to 108 proteins in 20 species: Archae
- 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 89;
Viruses - 0; Other Eukaryotes - 19 (source: NCBI BLink).
| chr1:26420159-26422345 FORWARD LENGTH=523
Length = 523
Score = 617 bits (1592), Expect = e-177, Method: Compositional matrix adjust.
Identities = 291/338 (86%), Positives = 311/338 (92%)
Query: 1 MPSGMLGTLLSLVDVLPLFSNTAWGQNANLDFLKKHMGATFEKRSQPWRANIDPADVHSG 60
MPSGMLGTLLSL+DVLPLFSNTAWGQNANL FL KHMGATFEKRSQPWR+ I+P DVHSG
Sbjct: 163 MPSGMLGTLLSLIDVLPLFSNTAWGQNANLAFLTKHMGATFEKRSQPWRSMINPEDVHSG 222
Query: 61 DFLAVSKIRGRWGGFETLEKWVTGAFAGHTAVCLKDEMGNLWVGESGHENEKGEEIIVVI 120
DFLAVSKIRGRWGGFETLEKWVTGAFAGHTAVCLKD++GNLWVGESGHENEKGEEIIVVI
Sbjct: 223 DFLAVSKIRGRWGGFETLEKWVTGAFAGHTAVCLKDDLGNLWVGESGHENEKGEEIIVVI 282
Query: 121 PWHEWWELALKDGSNPQIALLPLHPELRAKFNSTAAWEYARSMSGKPYGYHNMIFSWIDT 180
PW EWWEL LKD SNPQ+ALLPLHP++RAKFN+TAAWEYARSM GKPYGYHNMIFSWIDT
Sbjct: 283 PWDEWWELTLKDNSNPQVALLPLHPDIRAKFNNTAAWEYARSMLGKPYGYHNMIFSWIDT 342
Query: 181 AADNYPPPLDAHLVTSVMSMWTRMQPAYAANMWNEALNKRLGTEGLDLHDILVETEKRGI 240
DNYPPPLDAHLV SVMSMWTR+QPAYAANMWNEALNKRLGTE LDL+ IL ET +RG+
Sbjct: 343 LGDNYPPPLDAHLVISVMSMWTRVQPAYAANMWNEALNKRLGTEDLDLYGILEETARRGM 402
Query: 241 TFDELLTIPEQDEWVYSDGKSTTCVAFILSMYKEAGIFGPISSSIQVTEFTIRDAYMLRI 300
+FDELLTIPEQDEWVYSDGKSTTCVAFIL+MYK AGIF P++ IQVTEFTIRDAY L++
Sbjct: 403 SFDELLTIPEQDEWVYSDGKSTTCVAFILAMYKAAGIFDPLADHIQVTEFTIRDAYTLKL 462
Query: 301 FEDNQTRLPRWCNNENDGLPFCQILGEYRMELPGYNTL 338
FE NQTRLP WCN E L FCQILGEYRMELPGYNT+
Sbjct: 463 FESNQTRLPSWCNTEEGKLDFCQILGEYRMELPGYNTI 500
>AT4G27020.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; LOCATED IN: vacuole; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT5G54870.1); Has 1807 Blast hits to 1807 proteins
in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
- 339 (source: NCBI BLink). | chr4:13568604-13571381
REVERSE LENGTH=523
Length = 523
Score = 538 bits (1386), Expect = e-153, Method: Compositional matrix adjust.
Identities = 248/340 (72%), Positives = 290/340 (85%), Gaps = 2/340 (0%)
Query: 1 MPSGMLGTLLSLVDVLPLFSNTAWGQNANLDFLKKHMGATFEKRSQPWRANIDPADVHSG 60
M +GMLGTL +L DV PLF+NT WG+N+N+ FLK HMGA F R +PW NI ++HSG
Sbjct: 161 MEAGMLGTLRALWDVFPLFTNTGWGENSNIAFLKNHMGANFYPRPKPWVTNITTDEIHSG 220
Query: 61 DFLAVSKIRGRWGGFETLEKWVTGAFAGHTAVCLKDEMGNLWVGESGHENEKGEEIIVVI 120
D LA+SKIRGRWGGFETLEKWV+GA+AGHTAVCL+D G LWVGESG+ENEKGE++I ++
Sbjct: 221 DLLAISKIRGRWGGFETLEKWVSGAYAGHTAVCLRDSEGKLWVGESGNENEKGEDVIAIL 280
Query: 121 PWHEWWEL-ALKDGSNPQIALLPLHPELRAKFNSTAAWEYARSMSGKPYGYHNMIFSWID 179
PW EWWE KD SNP IALLPLHP+ RAKFN TAAWEYARSM GKPYGYHN+IFSWID
Sbjct: 281 PWEEWWEFEQTKDDSNPHIALLPLHPDYRAKFNVTAAWEYARSMDGKPYGYHNLIFSWID 340
Query: 180 TAADNYPPPLDAHLVTSVMSMWTRMQPAYAANMWNEALNKRLGTEGLDLHDILVETEKRG 239
T + NYPPPLDA LV SVM++W+++QP YAANMWNEALNKRLGTEGLDL D+LVE EKRG
Sbjct: 341 TISGNYPPPLDAQLVASVMTVWSKIQPDYAANMWNEALNKRLGTEGLDLPDVLVEVEKRG 400
Query: 240 ITFDELLTIPEQDEWVYSDGKSTTCVAFILSMYKEAGIFGPISSSIQVTEFTIRDAYMLR 299
+FDELL +PEQD+W+YSDGKST+C+AFIL MYKEAG+F PISSSIQVTEFTI+DAYML+
Sbjct: 401 SSFDELLAVPEQDDWIYSDGKSTSCIAFILEMYKEAGLFDPISSSIQVTEFTIKDAYMLK 460
Query: 300 IFEDNQTRLPRWCN-NENDGLPFCQILGEYRMELPGYNTL 338
FE N +R P+WCN N+ LP+CQILG+YRMELPGYNT+
Sbjct: 461 FFESNASRFPKWCNDNDVVKLPYCQILGKYRMELPGYNTM 500
>AT5G54870.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: vacuole;
EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT4G27020.1); Has 1807 Blast
hits to 1807 proteins in 277 species: Archae - 0;
Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385;
Viruses - 0; Other Eukaryotes - 339 (source: NCBI
BLink). | chr5:22289149-22291604 FORWARD LENGTH=531
Length = 531
Score = 528 bits (1360), Expect = e-150, Method: Compositional matrix adjust.
Identities = 241/340 (70%), Positives = 291/340 (85%), Gaps = 2/340 (0%)
Query: 1 MPSGMLGTLLSLVDVLPLFSNTAWGQNANLDFLKKHMGATFEKRSQPWRANIDPADVHSG 60
M +GMLGTL +L DV PLFSNT WG+++NL FL+KHMGA FE R +PW N+ + SG
Sbjct: 169 MHAGMLGTLQALWDVFPLFSNTGWGESSNLAFLEKHMGANFEPRPEPWVTNVTTDQIQSG 228
Query: 61 DFLAVSKIRGRWGGFETLEKWVTGAFAGHTAVCLKDEMGNLWVGESGHENEKGEEIIVVI 120
D LA+SKIRGRWGGFETLEKWV+GA+AGH+AV L+D G LWVGESG+EN+KGE++I ++
Sbjct: 229 DLLAISKIRGRWGGFETLEKWVSGAYAGHSAVALRDSEGKLWVGESGNENDKGEDVIAIL 288
Query: 121 PWHEWWEL-ALKDGSNPQIALLPLHPELRAKFNSTAAWEYARSMSGKPYGYHNMIFSWID 179
PW EWW KD SNPQIALLPLHP++RAKF+ AAW+YARSM GKPYGYHN+IFSWID
Sbjct: 289 PWEEWWAFEQTKDDSNPQIALLPLHPDVRAKFDVAAAWKYARSMEGKPYGYHNLIFSWID 348
Query: 180 TAADNYPPPLDAHLVTSVMSMWTRMQPAYAANMWNEALNKRLGTEGLDLHDILVETEKRG 239
T ++NYPPPLDAHLV S M++W++MQP YAANMWNEALNKRLGTEGLDL D+LVE EKRG
Sbjct: 349 TVSENYPPPLDAHLVASFMTVWSQMQPEYAANMWNEALNKRLGTEGLDLSDVLVEVEKRG 408
Query: 240 ITFDELLTIPEQDEWVYSDGKSTTCVAFILSMYKEAGIFGPISSSIQVTEFTIRDAYMLR 299
+FD+LL +PE D+W+YSDGKST+C+AFIL MYKEAG+FGP++SSIQVTEFTI+DAYML
Sbjct: 409 SSFDKLLAVPELDDWIYSDGKSTSCIAFILEMYKEAGLFGPLASSIQVTEFTIKDAYMLN 468
Query: 300 IFEDNQTRLPRWCN-NENDGLPFCQILGEYRMELPGYNTL 338
FE+N +RLP WCN N++ LP+CQILG+YRMELPGYNT+
Sbjct: 469 FFENNASRLPTWCNDNDSVKLPYCQILGKYRMELPGYNTM 508