Miyakogusa Predicted Gene
- Lj2g3v0343880.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj2g3v0343880.1 Non Chatacterized Hit- tr|C5X6M2|C5X6M2_SORBI
Putative uncharacterized protein Sb02g012570
OS=Sorghu,43.9,1e-16,SUBFAMILY NOT NAMED,NULL; FAMILY NOT NAMED,NULL;
seg,NULL,CUFF.34504.1
(469 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G52430.1 | Symbols: | hydroxyproline-rich glycoprotein famil... 287 1e-77
AT4G25620.1 | Symbols: | hydroxyproline-rich glycoprotein famil... 283 2e-76
AT1G63720.1 | Symbols: | BEST Arabidopsis thaliana protein matc... 134 1e-31
AT1G76660.1 | Symbols: | FUNCTIONS IN: molecular_function unkno... 111 1e-24
>AT5G52430.1 | Symbols: | hydroxyproline-rich glycoprotein family
protein | chr5:21283093-21285045 REVERSE LENGTH=438
Length = 438
Score = 287 bits (734), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 207/503 (41%), Positives = 258/503 (51%), Gaps = 99/503 (19%)
Query: 1 MMGSLNNSVDTVNXXXXXXXXXESRVQPAAVPKKRWXXXXXXXXXXXXQKSSKRIGHXXX 60
M +NNSV+TVN ESRVQP++ K RW QK++KRIG+
Sbjct: 1 MRNVVNNSVETVNAAATAIVTAESRVQPSSSQKGRWGKCWSLYSCFGTQKNNKRIGNAVL 60
Query: 61 XXXXXXXXXXXXXXXXQNPSTSILMPFIXXXXXXXXFLQSDPPSATHSPAGLLSLTSLAA 120
ST++++PFI FLQSDP S +HSP G LSLTS
Sbjct: 61 VPEPVTSGVPVVTVQNSATSTTVVLPFIAPPSSPASFLQSDPSSVSHSPVGPLSLTS--- 117
Query: 121 NAYXXXXXXXIFTIGPYAYETQLVSPPVFSNFTTEPSTASFTPPPE-SVQLTTPSSPEVP 179
N + +FT+GPYA ETQ V+PPVFS F TEPSTA +TPPPE SV +TTPSSPEVP
Sbjct: 118 NTFSPKEPQSVFTVGPYANETQPVTPPVFSAFITEPSTAPYTPPPESSVHITTPSSPEVP 177
Query: 180 FAQLLASSLDRARKSNGS---QKFALYNYEFQPYQQYPGSP-GGQLISPGSAFSTSGTST 235
FAQLL SSL+ R+ + S QKF+ +YEF+ Q PGSP GG LISPGS S SGTS+
Sbjct: 178 FAQLLTSSLELTRRDSTSGMNQKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSS 237
Query: 236 PFPDRRP-------------------TRKWSSRMGSGSLTPESAGQGSRLGSGSLTPNGV 276
P+P + P RKW SR GSGS+TP G GS L SG+LTPNG
Sbjct: 238 PYPGKSPMVEFRIGEPPKFLGFEHFTARKWGSRFGSGSITP--VGHGSGLASGALTPNG- 294
Query: 277 GLASRLGSGCVTPDGLGQDSRLGSGSLTPDGAGPSSQDRISVQNQFSGEASLANTENGIQ 336
+ SG+LTP+ +QNQ S ASLAN+++G
Sbjct: 295 -------------------PEIVSGNLTPNNT------TWPLQNQISEVASLANSDHG-- 327
Query: 337 SNSTLVDHRVSFELTGEDVARCLANKTGILLRNISRSSQGILAKDPIE---------RDN 387
S + DHRVSFELTGEDVARCLA+K ++RS + D IE R N
Sbjct: 328 SEVMVADHRVSFELTGEDVARCLASK-------LNRSHDRMNNNDRIETEESSSTDIRRN 380
Query: 388 IQRDSSSCCDVCSGETNDKQCCQK-HHSVNSSSKEFNFDSRKGDVSGTAANSSEWWANKK 446
I++ S N++ QK S SSKEF FD+ K +
Sbjct: 381 IEKRSGD-------RENEQHRIQKLSSSSIGSSKEFKFDNTKDE---------------- 417
Query: 447 VVGKESKSANSWAFFPMLQPEIS 469
E + NSW+FFP L+ +S
Sbjct: 418 --NIEKVAGNSWSFFPGLRSGVS 438
>AT4G25620.1 | Symbols: | hydroxyproline-rich glycoprotein family
protein | chr4:13067447-13069296 REVERSE LENGTH=449
Length = 449
Score = 283 bits (723), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 220/495 (44%), Positives = 270/495 (54%), Gaps = 81/495 (16%)
Query: 2 MGSLNNS-VDTVNXXXXXXXXXESRVQPAAVPKKR---WXXXXXXXXXXXXQKSSKRIGH 57
M S+NNS VDTVN ESR QP++V KKR W +K++KRIGH
Sbjct: 1 MRSVNNSSVDTVNAAASAIVSAESRTQPSSVQKKRGSWWSLYWCFGS----KKNNKRIGH 56
Query: 58 XXXXXXXXXXXXXXX-XXXXQNPSTSILMPFIXXXXXXXXFLQSDPPSATHSPAGLLSLT 116
+ STSI MPFI FL S PPSA+H+P L L
Sbjct: 57 AVLVPEPAASGAAVAPVQNSSSNSTSIFMPFIAPPSSPASFLPSGPPSASHTPDPGL-LC 115
Query: 117 SLAANAYXXXXXXXIFTIGPYAYETQLVSPPVFSNFTTEPSTASFTPPPESVQLTTPSSP 176
SL N FTIGPYA+ETQ V+PPVFS FTTEPSTA FT +PSSP
Sbjct: 116 SLTVN-----EPPSAFTIGPYAHETQPVTPPVFSAFTTEPSTAPFT-----PPPESPSSP 165
Query: 177 EVPFAQLLASSLDRARKSNG---SQKFALYNYEFQPYQQYPGSPGGQLISPGSAFSTSGT 233
EVPFAQLL SSL+RAR+++G +QKF+ +YEF+ Q YPGSPGG LISPG SGT
Sbjct: 166 EVPFAQLLTSSLERARRNSGGGMNQKFSAAHYEFKSCQVYPGSPGGNLISPG-----SGT 220
Query: 234 STPFP-------------------DRRPTRKWSSRMGSGSLTPESAGQGSRLGSGSLTPN 274
S+P+P + RKW SR GSGS+TP AGQGSRLGSG+LTP+
Sbjct: 221 SSPYPGKCSIIEFRIGEPPKFLGFEHFTARKWGSRFGSGSITP--AGQGSRLGSGALTPD 278
Query: 275 GVGLASRLGSGCVTPDGLGQDSRLGSGSLTPDGAGPSSQDRISVQNQFSGEASLANTENG 334
G S+L SG VTP+G R+ G+LTP + + +Q S ASLAN+++G
Sbjct: 279 G----SKLTSGVVTPNGAETVIRMSYGNLTP-------LEGSLLDSQISEVASLANSDHG 327
Query: 335 IQSN---STLVDHRVSFELTGEDVARCLANKTGILLRNISRSSQGILAKDPIERDNIQRD 391
+ + +V HRVSFELTGEDVARCLA+K ++RS A R N
Sbjct: 328 SSRHNDEALVVPHRVSFELTGEDVARCLASK-------LNRSGSHEKASGEHLRPN---- 376
Query: 392 SSSCCDVCSGETNDKQCCQKHHSVNSSSKEFNFDSRKGDVSGTAANSSEWWANKKVVGKE 451
CC SGET +Q + S+KEF FDS ++ SEWWAN+KV GK
Sbjct: 377 ---CCKT-SGETESEQSQKLRSFSTGSNKEFKFDSTNEEM--IEKIRSEWWANEKVAGKG 430
Query: 452 SKS-ANSWAFFPMLQ 465
S NSW FFP+L+
Sbjct: 431 DHSPRNSWTFFPVLR 445
>AT1G63720.1 | Symbols: | BEST Arabidopsis thaliana protein match
is: hydroxyproline-rich glycoprotein family protein
(TAIR:AT5G52430.1); Has 490 Blast hits to 394 proteins
in 96 species: Archae - 0; Bacteria - 2; Metazoa - 132;
Fungi - 88; Plants - 175; Viruses - 14; Other Eukaryotes
- 79 (source: NCBI BLink). | chr1:23636122-23637348
REVERSE LENGTH=358
Length = 358
Score = 134 bits (337), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 101/241 (41%), Positives = 123/241 (51%), Gaps = 17/241 (7%)
Query: 6 NNSVDTVNXXXXXXXXXESRV-QPAAVPKKR-WXXXXXXXXXXXXQKSSKRIGHXXXXXX 63
NN DT+N + R+ Q + + KKR W + KRIG+
Sbjct: 8 NNVFDTINAAASAIASSDDRLHQSSPIHKKRKWWNRWSLLKCFGSSRQRKRIGNSVLVPE 67
Query: 64 XXXXXXXXXXXXXQNPSTSIL-MPFIXXXXXXXXFLQSDPPSATHSPAGLLSLTSLAANA 122
+ I +PFI F QS+PPSAT SP G+LS + L N
Sbjct: 68 PVSMSSSNSTTSNSGYRSVITTLPFIAPPSSPASFFQSEPPSATQSPVGILSFSPLPCN- 126
Query: 123 YXXXXXXXIFTIGPYAYETQLVSPPVFSNFTTEPSTASFTPPPE--SVQL--TTPSSPEV 178
IF IGPYA+ETQLVSPPVFS +TTEPS+A TPP + S+ L TTPSSPEV
Sbjct: 127 ----NRPSIFAIGPYAHETQLVSPPVFSTYTTEPSSAPITPPLDDSSIYLTTTTPSSPEV 182
Query: 179 PFAQLLASSLDRARKSNGSQKFALYNYEFQPYQQYPGSPGGQLISPGSAFSTSGTSTPFP 238
PFAQL S + S G + +YEFQ YQ PGSP GQLISP SG ++PFP
Sbjct: 183 PFAQLFNS--NHQTGSYGYKFPMSSSYEFQFYQLPPGSPLGQLISPSPG---SGPTSPFP 237
Query: 239 D 239
D
Sbjct: 238 D 238
>AT1G76660.1 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
plasma membrane; EXPRESSED IN: 22 plant structures;
EXPRESSED DURING: 13 growth stages; BEST Arabidopsis
thaliana protein match is: hydroxyproline-rich
glycoprotein family protein (TAIR:AT5G52430.1); Has 353
Blast hits to 231 proteins in 60 species: Archae - 0;
Bacteria - 6; Metazoa - 57; Fungi - 22; Plants - 125;
Viruses - 4; Other Eukaryotes - 139 (source: NCBI
BLink). | chr1:28769157-28771036 REVERSE LENGTH=431
Length = 431
Score = 111 bits (278), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 71/140 (50%), Positives = 81/140 (57%), Gaps = 9/140 (6%)
Query: 97 FLQSDPPSATHSPAGLLSLTSLAANAYXXXXXXXIFTIGPYAYETQLVSPPVFSNFTTEP 156
F S PS T SP LSL AAN+ ++ GPYA+ETQLVSPPVFS FTTEP
Sbjct: 78 FTNSALPSTTQSPNCYLSL---AANS-PGGPSSSMYATGPYAHETQLVSPPVFSTFTTEP 133
Query: 157 STASFTPPPESVQLTTPSSPEVPFAQLLASSLDRARKSNGSQKFALYNYEFQPYQQYPGS 216
STA FTPPPE +LT PSSP+VP+A+ L SS+D G YN Y YPGS
Sbjct: 134 STAPFTPPPELARLTAPSSPDVPYARFLTSSMDLKNSGKGH-----YNDLQATYSLYPGS 188
Query: 217 PGGQLISPGSAFSTSGTSTP 236
P L SP S S G +P
Sbjct: 189 PASALRSPISRASGDGLLSP 208