Miyakogusa Predicted Gene
- Lj1g3v2940700.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v2940700.1 Non Chatacterized Hit- tr|K4CNI8|K4CNI8_SOLLC
Uncharacterized protein OS=Solanum lycopersicum
GN=Sol,42.62,0.000000000000003,DUF761,Protein of unknown function
DUF761, plant; seg,NULL,CUFF.29718.1
(1074 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT3G60380.1 | Symbols: | FUNCTIONS IN: molecular_function unkno... 131 2e-30
AT4G16790.1 | Symbols: | hydroxyproline-rich glycoprotein famil... 63 9e-10
>AT3G60380.1 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 24 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is:
hydroxyproline-rich glycoprotein family protein
(TAIR:AT4G16790.1); Has 6102 Blast hits to 3981 proteins
in 424 species: Archae - 6; Bacteria - 372; Metazoa -
2603; Fungi - 655; Plants - 291; Viruses - 28; Other
Eukaryotes - 2147 (source: NCBI BLink). |
chr3:22316913-22319144 REVERSE LENGTH=743
Length = 743
Score = 131 bits (330), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 100/279 (35%), Positives = 134/279 (48%), Gaps = 47/279 (16%)
Query: 48 SQAPEFVSQTALTKFWELLHLLFIGIAVTYGLFSRRNAELDSHVETHSSDGSSPSYVSKM 107
SQAP+FV +T LTKFWEL+HLLF+GIAV YGLFSRRN E + D SS SYVS++
Sbjct: 51 SQAPDFVGETVLTKFWELIHLLFVGIAVAYGLFSRRNVESAVDLRMTRVDESSLSYVSRI 110
Query: 108 FPGSALFGDG-GENSSGFDEKRVMHCWDPPQNYDGEQPGGVCSNVGGTVGVFDEQYKPQL 166
F S++F + +NS F + R E S VG + +
Sbjct: 111 FQVSSVFDEEFDDNSCEFVDVR-----------SDESVSARASVVGKSESFV---VESGE 156
Query: 167 PIPEDNFGFPFRYDGNGTNVVQAWNSEYYHSEPVVVVAQPYVSAGESGEVVGHKPLGLPV 226
FG TN V+AWNS+Y+ + VVVA+P + G G VV H+PLGLP+
Sbjct: 157 LEESSEFG--------ETNEVRAWNSQYFQGKSKVVVARP--AYGLDGHVV-HQPLGLPI 205
Query: 227 RSLRSVAREGDGPNFINEXXXXXXXXXXXXXXXXXXXNREFGDLGPSNLEKQFNDAAAVG 286
R LRS R+ N E L N F++ A
Sbjct: 206 RRLRSSLRDN-----------AALQDKSFADSCDGAVNAEAESLLADNF---FDEVLA-- 249
Query: 287 GSASPIPWRSRGGKMEREKSYGNVTHPSHFRPLSFDEAV 325
ASP+PW++R M G+ +PS+F+P+S DE +
Sbjct: 250 APASPVPWQARPEMM----GIGD-NYPSNFQPISVDETL 283
>AT4G16790.1 | Symbols: | hydroxyproline-rich glycoprotein family
protein | chr4:9451747-9453168 REVERSE LENGTH=473
Length = 473
Score = 63.2 bits (152), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 36/78 (46%), Positives = 46/78 (58%), Gaps = 11/78 (14%)
Query: 48 SQAPEFVSQTALTKFWELLHLLFIGIAVTYGLFSRRNAELDSHVETHSSD--------GS 99
SQ PE +QT L ELLHL+F+GIAV+YGLFSRRN + T +SD +
Sbjct: 47 SQTPELANQTRLL---ELLHLVFVGIAVSYGLFSRRNYDGGGGGGTSNSDHNKADHSNNN 103
Query: 100 SPSYVSKMFPGSALFGDG 117
S SYV K+ S++F G
Sbjct: 104 SHSYVPKILEVSSVFNVG 121