Miyakogusa Predicted Gene
- Lj3g3v0966700.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v0966700.1 Non Characterized Hit- tr|I1N1C4|I1N1C4_SOYBN
Uncharacterized protein OS=Glycine max PE=4 SV=1,94.18,0,SUBFAMILY NOT
NAMED,NULL; FAMILY NOT NAMED,NULL; seg,NULL,CUFF.41964.1
(361 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

AT1G70160.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   593   e-170
AT4G27020.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   516   e-147
AT5G54870.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   514   e-146
>AT1G70160.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: 24 plant structures; EXPRESSED
DURING: 15 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT4G27020.1);
Has 108 Blast hits to 108 proteins in 20 species: Archae
- 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 89;
Viruses - 0; Other Eukaryotes - 19 (source: NCBI BLink).
| chr1:26420159-26422345 FORWARD LENGTH=523
Length = 523
Score = 593 bits (1529), Expect = e-170, Method: Compositional matrix adjust.
Identities = 281/361 (77%), Positives = 307/361 (85%)
Query: 1 MPSGMLGTLLSLIDVLPLFANNAWGQKANLDFLKKHMGATFEKRSQPWRATIDPADVHSG 60
MPSGMLGTLLSLIDVLPLF+N AWGQ ANL FL KHMGATFEKRSQPWR+ I+P DVHSG
Sbjct: 163 MPSGMLGTLLSLIDVLPLFSNTAWGQNANLAFLTKHMGATFEKRSQPWRSMINPEDVHSG 222
Query: 61 DFLAVSKIRGRWGGFETLEKWVTGSFAGHTAVCLKDDMGNLWXXXXXXXXXXXXXXXXXX 120
DFLAVSKIRGRWGGFETLEKWVTG+FAGHTAVCLKDD+GNLW
Sbjct: 223 DFLAVSKIRGRWGGFETLEKWVTGAFAGHTAVCLKDDLGNLWVGESGHENEKGEEIIVVI 282
Query: 121 XXXXXXXLALKDSSNPQIALLPLHPDLRAKFNSTAAWEYARSMSGKPYGYHNMIFSWIDT 180
L LKD+SNPQ+ALLPLHPD+RAKFN+TAAWEYARSM GKPYGYHNMIFSWIDT
Sbjct: 283 PWDEWWELTLKDNSNPQVALLPLHPDIRAKFNNTAAWEYARSMLGKPYGYHNMIFSWIDT 342
Query: 181 VADNYPPPLDAHLVISVMSMWTRLQPAYSANMWNEALNKRLGTEGLDLHDIIVETEKRGI 240
+ DNYPPPLDAHLVISVMSMWTR+QPAY+ANMWNEALNKRLGTE LDL+ I+ ET +RG+
Sbjct: 343 LGDNYPPPLDAHLVISVMSMWTRVQPAYAANMWNEALNKRLGTEDLDLYGILEETARRGM 402
Query: 241 PFDQLLTIPEQDEWVYSDGKSTTCVAFILSMYKEAGIFGPFSSSIQVTEFTIRDAYMLRL 300
FD+LLTIPEQDEWVYSDGKSTTCVAFIL+MYK AGIF P + IQVTEFTIRDAY L+L
Sbjct: 403 SFDELLTIPEQDEWVYSDGKSTTCVAFILAMYKAAGIFDPLADHIQVTEFTIRDAYTLKL 462
Query: 301 FEDNQTRLPSWCNNENDRLPFCQILGEYRMELPGYNTLEPYAHMNEYCPSLPPTYDRPSR 360
FE NQTRLPSWCN E +L FCQILGEYRMELPGYNT+ PY +MN+ CPSLPP Y+RPS+
Sbjct: 463 FESNQTRLPSWCNTEEGKLDFCQILGEYRMELPGYNTIYPYPNMNQNCPSLPPNYERPSK 522
Query: 361 C 361
C
Sbjct: 523 C 523
>AT4G27020.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; LOCATED IN: vacuole; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT5G54870.1); Has 1807 Blast hits to 1807 proteins
in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
- 339 (source: NCBI BLink). | chr4:13568604-13571381
REVERSE LENGTH=523
Length = 523
Score = 516 bits (1329), Expect = e-147, Method: Compositional matrix adjust.
Identities = 240/363 (66%), Positives = 280/363 (77%), Gaps = 2/363 (0%)
Query: 1 MPSGMLGTLLSLIDVLPLFANNAWGQKANLDFLKKHMGATFEKRSQPWRATIDPADVHSG 60
M +GMLGTL +L DV PLF N WG+ +N+ FLK HMGA F R +PW I ++HSG
Sbjct: 161 MEAGMLGTLRALWDVFPLFTNTGWGENSNIAFLKNHMGANFYPRPKPWVTNITTDEIHSG 220
Query: 61 DFLAVSKIRGRWGGFETLEKWVTGSFAGHTAVCLKDDMGNLWXXXXXXXXXXXXXXXXXX 120
D LA+SKIRGRWGGFETLEKWV+G++AGHTAVCL+D G LW
Sbjct: 221 DLLAISKIRGRWGGFETLEKWVSGAYAGHTAVCLRDSEGKLWVGESGNENEKGEDVIAIL 280
Query: 121 XXXXXXXL-ALKDSSNPQIALLPLHPDLRAKFNSTAAWEYARSMSGKPYGYHNMIFSWID 179
KD SNP IALLPLHPD RAKFN TAAWEYARSM GKPYGYHN+IFSWID
Sbjct: 281 PWEEWWEFEQTKDDSNPHIALLPLHPDYRAKFNVTAAWEYARSMDGKPYGYHNLIFSWID 340
Query: 180 TVADNYPPPLDAHLVISVMSMWTRLQPAYSANMWNEALNKRLGTEGLDLHDIIVETEKRG 239
T++ NYPPPLDA LV SVM++W+++QP Y+ANMWNEALNKRLGTEGLDL D++VE EKRG
Sbjct: 341 TISGNYPPPLDAQLVASVMTVWSKIQPDYAANMWNEALNKRLGTEGLDLPDVLVEVEKRG 400
Query: 240 IPFDQLLTIPEQDEWVYSDGKSTTCVAFILSMYKEAGIFGPFSSSIQVTEFTIRDAYMLR 299
FD+LL +PEQD+W+YSDGKST+C+AFIL MYKEAG+F P SSSIQVTEFTI+DAYML+
Sbjct: 401 SSFDELLAVPEQDDWIYSDGKSTSCIAFILEMYKEAGLFDPISSSIQVTEFTIKDAYMLK 460
Query: 300 LFEDNQTRLPSWCN-NENDRLPFCQILGEYRMELPGYNTLEPYAHMNEYCPSLPPTYDRP 358
FE N +R P WCN N+ +LP+CQILG+YRMELPGYNT+EPY HMNE+CPSLPP Y RP
Sbjct: 461 FFESNASRFPKWCNDNDVVKLPYCQILGKYRMELPGYNTMEPYPHMNEHCPSLPPKYHRP 520
Query: 359 SRC 361
C
Sbjct: 521 KNC 523
>AT5G54870.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: vacuole;
EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT4G27020.1); Has 1807 Blast
hits to 1807 proteins in 277 species: Archae - 0;
Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385;
Viruses - 0; Other Eukaryotes - 339 (source: NCBI
BLink). | chr5:22289149-22291604 FORWARD LENGTH=531
Length = 531
Score = 514 bits (1323), Expect = e-146, Method: Compositional matrix adjust.
Identities = 236/363 (65%), Positives = 286/363 (78%), Gaps = 2/363 (0%)
Query: 1 MPSGMLGTLLSLIDVLPLFANNAWGQKANLDFLKKHMGATFEKRSQPWRATIDPADVHSG 60
M +GMLGTL +L DV PLF+N WG+ +NL FL+KHMGA FE R +PW + + SG
Sbjct: 169 MHAGMLGTLQALWDVFPLFSNTGWGESSNLAFLEKHMGANFEPRPEPWVTNVTTDQIQSG 228
Query: 61 DFLAVSKIRGRWGGFETLEKWVTGSFAGHTAVCLKDDMGNLWXXXXXXXXXXXXXXXXXX 120
D LA+SKIRGRWGGFETLEKWV+G++AGH+AV L+D G LW
Sbjct: 229 DLLAISKIRGRWGGFETLEKWVSGAYAGHSAVALRDSEGKLWVGESGNENDKGEDVIAIL 288
Query: 121 XXXXXXXL-ALKDSSNPQIALLPLHPDLRAKFNSTAAWEYARSMSGKPYGYHNMIFSWID 179
KD SNPQIALLPLHPD+RAKF+ AAW+YARSM GKPYGYHN+IFSWID
Sbjct: 289 PWEEWWAFEQTKDDSNPQIALLPLHPDVRAKFDVAAAWKYARSMEGKPYGYHNLIFSWID 348
Query: 180 TVADNYPPPLDAHLVISVMSMWTRLQPAYSANMWNEALNKRLGTEGLDLHDIIVETEKRG 239
TV++NYPPPLDAHLV S M++W+++QP Y+ANMWNEALNKRLGTEGLDL D++VE EKRG
Sbjct: 349 TVSENYPPPLDAHLVASFMTVWSQMQPEYAANMWNEALNKRLGTEGLDLSDVLVEVEKRG 408
Query: 240 IPFDQLLTIPEQDEWVYSDGKSTTCVAFILSMYKEAGIFGPFSSSIQVTEFTIRDAYMLR 299
FD+LL +PE D+W+YSDGKST+C+AFIL MYKEAG+FGP +SSIQVTEFTI+DAYML
Sbjct: 409 SSFDKLLAVPELDDWIYSDGKSTSCIAFILEMYKEAGLFGPLASSIQVTEFTIKDAYMLN 468
Query: 300 LFEDNQTRLPSWCN-NENDRLPFCQILGEYRMELPGYNTLEPYAHMNEYCPSLPPTYDRP 358
FE+N +RLP+WCN N++ +LP+CQILG+YRMELPGYNT+EPY+HMNE CP+LPP Y+RP
Sbjct: 469 FFENNASRLPTWCNDNDSVKLPYCQILGKYRMELPGYNTMEPYSHMNEQCPTLPPKYNRP 528
Query: 359 SRC 361
C
Sbjct: 529 DNC 531
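
The percentages on the "Identities = ..." lines above can be rechecked from the raw counts. The following is a minimal sketch, not part of the BLAST output: the names `STAT_RE`, `parse_stats`, and `truncated_percent` are hypothetical helpers introduced here, and the truncating (rather than rounding) percentage is an assumption that happens to agree with every value printed in this report (for example 286/363 is shown as 78%).

```python
import re

# Matches statistics such as "Identities = 281/361 (77%)" in a BLAST report line.
STAT_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Return {name: (count, aligned_length, reported_percent)} for one line."""
    return {
        name: (int(count), int(total), int(pct))
        for name, count, total, pct in STAT_RE.findall(line)
    }


def truncated_percent(count, total):
    """Integer percentage, truncated toward zero (assumed report behaviour)."""
    return count * 100 // total


# Statistics line copied from the AT4G27020.1 alignment in this report.
line = "Identities = 240/363 (66%), Positives = 280/363 (77%), Gaps = 2/363 (0%)"

for name, (count, total, reported) in parse_stats(line).items():
    recomputed = truncated_percent(count, total)
    status = "ok" if recomputed == reported else "mismatch"
    print(f"{name}: {count}/{total} reported {reported}% recomputed {recomputed}% {status}")
```

The same check can be run on the other two statistics lines by replacing `line`; lines without a Gaps field simply yield two entries.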