Miyakogusa Predicted Gene
- Lj0g3v0036229.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0036229.1 tr|Q7XC52|Q7XC52_ORYSJ Expressed protein OS=Oryza
sativa subsp. japonica GN=OSJNBb0089A17.6 PE=4
SV=,30.42,4e-18,seg,NULL; coiled-coil,NULL,gene.g2551.t1.1
(506 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT4G16790.1 | Symbols: | hydroxyproline-rich glycoprotein famil... 110 3e-24
AT3G60380.1 | Symbols: | FUNCTIONS IN: molecular_function unkno... 87 3e-17
>AT4G16790.1 | Symbols: | hydroxyproline-rich glycoprotein family
protein | chr4:9451747-9453168 REVERSE LENGTH=473
Length = 473
Score = 110 bits (274), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 72/173 (41%), Positives = 96/173 (55%), Gaps = 24/173 (13%)
Query: 18 NKED--PNKFYHHFLYKAAIVLIFFVILPLFPSQAPEFINQSLFARNWEFLHLLFVGIAI 75
NKED P KFY F++KA I+ + ++P+F SQ PE NQ+ R E LHL+FVGIA+
Sbjct: 15 NKEDQNPRKFYSRFIFKALILTVLCAVVPVFLSQTPELANQT---RLLELLHLVFVGIAV 71
Query: 76 SYGLFSRRNNE------TEKENNSKFD----SAQSLVSKFLQVSSFF---EDDAESENPS 122
SYGLFSRRN + T +++K D ++ S V K L+VSS F + +
Sbjct: 72 SYGLFSRRNYDGGGGGGTSNSDHNKADHSNNNSHSYVPKILEVSSVFNVGHESESEPSDD 131
Query: 123 ESDETTKIYTWSNQHHRNEP------VIVVAKQRNEKPLLLPVRSLKSRLVDD 169
S + K TW N++H P V V+ + EKPLLLPVRSL V D
Sbjct: 132 SSGDQRKFQTWKNKYHMKIPEVETRFVDRVSSENREKPLLLPVRSLNYSRVSD 184
>AT3G60380.1 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 24 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is:
hydroxyproline-rich glycoprotein family protein
(TAIR:AT4G16790.1); Has 6102 Blast hits to 3981 proteins
in 424 species: Archae - 6; Bacteria - 372; Metazoa -
2603; Fungi - 655; Plants - 291; Viruses - 28; Other
Eukaryotes - 2147 (source: NCBI BLink). |
chr3:22316913-22319144 REVERSE LENGTH=743
Length = 743
Score = 87.0 bits (214), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 91/316 (28%), Positives = 135/316 (42%), Gaps = 79/316 (25%)
Query: 48 SQAPEFINQSLFARNWEFLHLLFVGIAISYGLFSRRNNETEKE-NNSKFD-SAQSLVSKF 105
SQAP+F+ +++ + WE +HLLFVGIA++YGLFSRRN E+ + ++ D S+ S VS+
Sbjct: 51 SQAPDFVGETVLTKFWELIHLLFVGIAVAYGLFSRRNVESAVDLRMTRVDESSLSYVSRI 110
Query: 106 LQVSSFFE---DDAESE-----------------NPSES--------------DETTKIY 131
QVSS F+ DD E SES ET ++
Sbjct: 111 FQVSSVFDEEFDDNSCEFVDVRSDESVSARASVVGKSESFVVESGELEESSEFGETNEVR 170
Query: 132 TWSNQHHRNEPVIVVAK-------QRNEKPLLLPVRSLKSRLVDDPEAAESCTEPFSVSR 184
W++Q+ + + +VVA+ +PL LP+R L+S L R
Sbjct: 171 AWNSQYFQGKSKVVVARPAYGLDGHVVHQPLGLPIRRLRSSL-----------------R 213
Query: 185 SNSRTGSKRFSSNLNRARNAEVEGPGSTXXXXXXXXXXXXXLPSPIPWRSRSGKMEPKQE 244
N+ K F+ + + A NAE E + SP+PW++R M
Sbjct: 214 DNAALQDKSFADSCDGAVNAEAE----SLLADNFFDEVLAAPASPVPWQARPEMMGIGDN 269
Query: 245 VFDAPAPSSAFAELASKPSMEESEINKVESRSVKSQTQNXXXXXXXXXXXXTKFTPMASS 304
P S L K S + S SQ QN +F+P S
Sbjct: 270 YPSNFQPISVDETL--KSISSRSTGSSSSQTSYASQNQN-------------RFSPSRSV 314
Query: 305 SSESLAKNTEDLLRKK 320
S+ESL N E+L+++K
Sbjct: 315 SAESLNSNVEELVKEK 330