Miyakogusa Predicted Gene
- Lj0g3v0101359.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0101359.1 Non Characterized Hit- tr|D7KKP9|D7KKP9_ARALL
Putative uncharacterized protein OS=Arabidopsis
lyrata,30.77,1e-18,GUB_WAK_bind,Wall-associated receptor kinase
galacturonan-binding domain; seg,NULL,CUFF.5690.1
(313 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                    Score     E
Sequences producing significant alignments:                        (bits)  Value
AT5G50290.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...    367  e-102
AT3G17350.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...    134  9e-32
AT1G10380.1 | Symbols: | Putative membrane lipoprotein | chr1:3...    100  1e-21
AT1G11915.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...     89  6e-18
>AT5G50290.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT3G17350.1); Has 300 Blast hits
to 300 proteins in 14 species: Archae - 0; Bacteria - 0;
Metazoa - 0; Fungi - 0; Plants - 300; Viruses - 0; Other
Eukaryotes - 0 (source: NCBI BLink). |
chr5:20461814-20463896 FORWARD LENGTH=303
Length = 303
Score = 367 bits (942), Expect = e-102, Method: Compositional matrix adjust.
Identities = 176/296 (59%), Positives = 214/296 (72%), Gaps = 8/296 (2%)
Query: 15 HLIFLSALLITSCSSTQANSCRTYCGNITVDYPFATQYGCGHPGFRDLLFCINDVLMLHI 74
++ L +T ++CR+YCGNITVDYPF + GCGHPG+RDLLFC+NDVLM HI
Sbjct: 2 KILILILSFVTLFEICVVDACRSYCGNITVDYPFGIRNGCGHPGYRDLLFCMNDVLMFHI 61
Query: 75 ASGSYRVLEIDYAYQALTLHEPHMSTCEDLVLGAKGNGFALEPWRAAYMNPTPDNVFMLI 134
+SGSYRVL+IDYAYQ++TLH+PHMS CE +VLG KGNGF E WR Y NPT DNVFMLI
Sbjct: 62 SSGSYRVLDIDYAYQSITLHDPHMSNCETIVLGGKGNGFEAEDWRTPYFNPTSDNVFMLI 121
Query: 135 ACSPRSPLFQGFPGKHLPCRNVSGMGCEDYYGCPAWDMLGHKRXXXXXXXXXXXXPPECC 194
CSP+SP+FQGFP K +PCRN+SGM CE+Y CPAWDM+G+++ PP CC
Sbjct: 122 GCSPKSPIFQGFPEKKVPCRNISGMSCEEYMSCPAWDMVGYRQ----PGIHSGSGPPMCC 177
Query: 195 AVPYESIKGINLKHLDCEGYSSAYSVAPLRVDGPGDWAYGIRVRYSVQGSDEFCGACEAT 254
V +ES+K INL L+CEGYSSAY++APL++ GP DWAYGIRV+Y +QGSD FC AC AT
Sbjct: 178 GVGFESVKAINLSKLECEGYSSAYNLAPLKLRGPSDWAYGIRVKYELQGSDAFCRACVAT 237
Query: 255 GGSCGY---GSDGIRQVCMCGNFNSTSNCDSVGISAGA-RPVRVKMSAGLLACMLA 306
G+CGY G+R VCMC N NST+NCDSV GA VR K L+ +A
Sbjct: 238 SGTCGYEPADGGGLRHVCMCDNHNSTTNCDSVISPTGASSSVRPKAIGSLIIYFIA 293
>AT3G17350.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: 19 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G50290.1);
Has 203 Blast hits to 203 proteins in 13 species: Archae
- 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 203;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
| chr3:5934111-5935276 FORWARD LENGTH=301
Length = 301
Score = 134 bits (336), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 82/252 (32%), Positives = 126/252 (50%), Gaps = 14/252 (5%)
Query: 32 ANSCRTYCGNITVDYPFATQYGCGHPGFRDLLFCINDVLMLHIASGSYRVLEIDYAYQAL 91
A SCRT CGNI ++YPF GCG P +R + C D L SGSY+V IDY + +
Sbjct: 28 ATSCRTLCGNIPINYPFGIDGGCGSPQYRGMFNCSTD-LYFTTPSGSYKVQSIDYEKKTM 86
Query: 92 TLHEPHMSTCEDLVLGAKGNGFALEPWRAAYMNPTPDNVFMLIACSPRSPLFQGFPGKHL 151
+ +P MSTC L + F + + + P+ D VF L CS SP+ + ++L
Sbjct: 87 VIFDPAMSTCSIL---QPHHDFKMADIQNTLIRPSYDTVFALFNCSNDSPVHNRY--RNL 141
Query: 152 PCRNVSGMGCEDYY-GCPAWDMLGHKRXXXXXXXXXXXXPPECCAVPYESIKGINLKHLD 210
C N +G C++ Y C ++ + P CC Y++++ +++ LD
Sbjct: 142 -CFNAAGHSCDELYSSCTSFRIF--NTTSPYGNNSTVHTTPYCCFTNYDTVRVMSMNILD 198
Query: 211 CEGYSSAYSVAPLRVDGPGDWAYGIRVRYSVQGSDEFCGACEATGGSCGYGSDGIRQVCM 270
C Y++ +R GP DW+YGI + YSV ++ C C +GG+CG+ ++ +C
Sbjct: 199 CSHYTTVIDNGKMRGVGPLDWSYGIELSYSV--TEIGCDRCRKSGGTCGFDAETEIFLCQ 256
Query: 271 C--GNFNSTSNC 280
C N N T C
Sbjct: 257 CSGSNNNPTREC 268
>AT1G10380.1 | Symbols: | Putative membrane lipoprotein |
chr1:3400706-3402110 FORWARD LENGTH=305
Length = 305
Score = 100 bits (249), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 81/261 (31%), Positives = 119/261 (45%), Gaps = 30/261 (11%)
Query: 33 NSCRTYCGNITVDYPFATQYGCGHPGFRDLLFCINDVLMLHIAS--GSYRVLEIDYAYQA 90
+C+ CG I + YP T GCG P F + C D L + + GSY + +DYA Q
Sbjct: 27 QACQKTCGQIPIKYPLGTGSGCGDPRFTRYITCDPDQQTLTLTTHTGSYPITSVDYAKQE 86
Query: 91 LTLHEPHMSTCEDLVLGAKGNGFALEPWRAAYMNPTPDNVFMLIACS-PRSPLFQGFP-- 147
+ + +P MSTC +GF L+ W A + + D VF L+ CS SP+F
Sbjct: 87 IYVTDPSMSTC---ACTRPSHGFGLD-WDAPF-SFHDDTVFTLLDCSVDESPVFTPLSNG 141
Query: 148 -GKHLPCRNVSGMGCEDYY-GCPAWDMLGHKRXXXXXXXXXXXXPPECCAVPYESIKG-- 203
G+ C S C Y C A ++ + C VP +
Sbjct: 142 SGRVSLCDRQSSSICTFLYSNCRAISLINLQVSTC------------CVYVPLDLGPSFE 189
Query: 204 INLKHLDCEGYSSAYSVAPLRVDGPGDWAYGIRVRYSVQGSDEF---CGACEATGGSCGY 260
++L L C YS Y++ P + P +W YGI ++Y DE+ CG+CE + G+CG+
Sbjct: 190 MDLNKLKCSSYSGFYNLGPGQESHPENWNYGIALKYKFNVFDEYPGVCGSCERSNGACGF 249
Query: 261 GSDGIRQVCMC-GNFNSTSNC 280
+ VC C G N+TS+C
Sbjct: 250 NTQSSSFVCNCPGGINTTSDC 270
>AT1G11915.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: root; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT3G17350.1);
Has 261 Blast hits to 261 proteins in 13 species: Archae
- 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 261;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
| chr1:4021830-4023084 FORWARD LENGTH=329
Length = 329
Score = 88.6 bits (218), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 81/296 (27%), Positives = 131/296 (44%), Gaps = 37/296 (12%)
Query: 13 HYHLIFLSALLI----TSCSSTQANSCRTYCGNITVDYPFATQYGCGHPGFRDLLFCIND 68
H ++IF S L+ +S +S+Q+N CR+ CGNI ++YPF+ GCG P +R +L C ++
Sbjct: 5 HSYIIFFSLLMTILLQSSTTSSQSNLCRSSCGNIPINYPFSIDDGCGSPYYRHMLICSDN 64
Query: 69 --VLMLHIASGSYRVLEIDYAYQALTLHEPHMSTCEDLVLGAKGNGFALEPWRAAYMNPT 126
L L SG Y V I Y+ L + +P M C+D F+++ + + +
Sbjct: 65 DTKLELRTPSGKYPVKSISYSDPHLLVSDPFMWNCQDRDNFRPTRSFSID--SSTHFTVS 122
Query: 127 PDNVFMLIACSPR------SPLF-QGFPGKHLPCRNVSGMGCEDYYGCPAWDMLGHKRXX 179
P N ++ C+ PLF + FP + + S C C + LG +
Sbjct: 123 PQNDYLFFNCNTDKVIVEPKPLFCERFPDRCDSSCDSSSYLCRHLPECGS--ALGSRV-- 178
Query: 180 XXXXXXXXXXPPECCAVPYESIKGINLKHLDCEGYSSAYSVAPLRVDGPGDW--AYGIRV 237
CC+ ++ + + L DC Y+S Y + + P D YGIRV
Sbjct: 179 ------------SCCSYYPKATQSLRLMLQDCATYTSVYWRSTGVENAPYDQFPEYGIRV 226
Query: 238 RYSVQGSDEFCGACEAT---GGSCGYGSDGIRQVCMCGNFNSTSNCDSVGISAGAR 290
Y + + C C+ T GG CG+ + +C+C N T+ C + R
Sbjct: 227 DYEFPVTMK-CLLCQETTKGGGVCGFNTRTRDFLCLCKQGNVTTYCKDPSLVNHKR 281