Miyakogusa Predicted Gene
- Lj0g3v0160019.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0160019.1 NODE_47867_length_6150_cov_27.376585.path2.1
(1770 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                    Score    E
Sequences producing significant alignments:                        (bits)  Value
AT5G07940.2 | Symbols: | FUNCTIONS IN: molecular_function unkno...    310  6e-84
AT5G07940.3 | Symbols: | FUNCTIONS IN: molecular_function unkno...    310  6e-84
AT5G07940.1 | Symbols: | BEST Arabidopsis thaliana protein matc...    310  6e-84
AT5G07980.1 | Symbols: | dentin sialophosphoprotein-related | c...    305  2e-82
AT5G07970.1 | Symbols: | dentin sialophosphoprotein-related | c...    282  2e-75
AT3G29385.1 | Symbols: | BEST Arabidopsis thaliana protein matc...     99  2e-20
>AT5G07940.2 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: pollen tube;
BEST Arabidopsis thaliana protein match is: dentin
sialophosphoprotein-related (TAIR:AT5G07980.1). |
chr5:2534720-2540086 FORWARD LENGTH=1526
Length = 1526
Score = 310 bits (794), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 216/530 (40%), Positives = 304/530 (57%), Gaps = 46/530 (8%)
Query: 1 MPGNEVGDRVHNFFGQGNLSQGQDH-SLAVDGNWQGLSNNLWVGSQRPPSVPFISNL-NF 58
MPGNE G+++HNFFGQ LSQ Q H S VD +W +N L VG+QR I+NL ++
Sbjct: 1 MPGNEFGEKIHNFFGQEGLSQDQQHQSQVVDRSWSSFNNGL-VGNQRQIDPSLIANLKSY 59
Query: 59 NQQQS-DTEQGYTSSPHFIHGLNITQSNLRPE-SGNQLQNQQRVVNGYLQGQQVFQTRQN 116
N QQS D E+G+ SS + HGLN TQ +R E S + LQ Q++ NGY+ G QT N
Sbjct: 60 NTQQSVDHERGHQSS-NSQHGLNYTQQPIRSEFSRSLLQEHQQLPNGYMHGNLGLQTMPN 118
Query: 117 GANIFGVDTESDRNSLSRGIPLLESQGSGVELYKKSLARNDAAESPVNFDFFGGQQISG- 175
GAN+ G D ES R+ LS ++G EL+ + R + ESPVN+DFFGGQQ S
Sbjct: 119 GANVLGGDVESSRDKLS-------ARGFTPELHNVPM-RLEMGESPVNYDFFGGQQQSNT 170
Query: 176 RYNGMLQPLPRQQSGINEMHLLQQHVVLNQMQELKRQQQYHQL--EPKQQNSITPASSIS 233
+ +GMLQPLPRQQ N+M LL+Q V++ QM E + QQQ + E +Q NS+ ++++
Sbjct: 171 QLSGMLQPLPRQQMTFNDMQLLKQQVMVKQMHEYQMQQQLQKQQLEARQLNSLN-RNAVN 229
Query: 234 NQTIASHSASLINGIPINEASNFIWQPEVIPSNSNWLQGGASPIMHGSSNGLMLSPEQGQ 293
+ + +INGIP+ AS+ +QP+++ N+NW+ G SP + GSS+GLM++PE GQ
Sbjct: 230 GSCASDTQSRMINGIPLQNASSNWFQPDLMTGNTNWMHRGISPAVQGSSSGLMITPEHGQ 289
Query: 294 TMRLMGLVPNQGDQSLYGVPISGSRGTPSMYSHVQADRPAVPQVSIPRQYSHVHGDKSVL 353
+ L+ Q SLYG+P+SG+ + +S VQ +R A P S R YS
Sbjct: 290 S----NLMAQQFGPSLYGMPVSGTNAPQNAFSSVQMNRLAAPHGSANRSYSLT------- 338
Query: 354 QHISANSNSFPAHQYTAFSDQINTNDGTSVSKQSILGKSMFGSTA-HGINSRLNMENLQQ 412
+Q T+F +Q + D + + K++F T+ N+R N EN QQ
Sbjct: 339 ------------NQPTSFLNQGDVQDSQMHPRSTYQEKALFSQTSVPDSNNRPNFENFQQ 386
Query: 413 VSSEQNIVPVQEFNGRQELAGSSETLQNMMVAQTPPSQHLATLDPAEEKILFGSEDSMWD 472
S + + Q+ + E +G +E + Q + LDP EEKILFGS+D++WD
Sbjct: 387 DDSRERNISAQDKFCQMEDSGPAEKSFMKVPENMNALQKSSALDPTEEKILFGSDDNLWD 446
Query: 473 GFGRNSG----GFSMMDGTDNFSEFPSIQSGSWSALMQSAVAETSSSGIG 518
FG ++ G M +D F PS+QSGSWSALMQSAVAET+S G
Sbjct: 447 AFGSSTDMSLQGNLMSSNSDLFDACPSLQSGSWSALMQSAVAETTSDDAG 496
Score = 234 bits (596), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 143/377 (37%), Positives = 202/377 (53%), Gaps = 21/377 (5%)
Query: 1396 GHNTNVTSQEVVGYGQNNAPSASNSNKTISVRRNHSLVNPQMAPSWFEQYGTFKNGKILP 1455
G N ++ + G + S S+ N SVR +H ++PQMAPSW+ QYGTFKNG + P
Sbjct: 1169 GLNNKESANHLPHLGHTVSQSFSSKNHAASVRADHQQISPQMAPSWYSQYGTFKNGLVQP 1228
Query: 1456 MYDVRKMTAAKILDHPFTVPNQSDSLHLQNSVEQVKXXXXXXXXXXXXXTIPASVASENE 1515
M D + T KI + V + D H S +Q P+S +
Sbjct: 1229 MNDTGRFTPLKIGEQSSNVESSVDGTHTVQSCKQC--LMEQMSGSAPGVETPSSDS---- 1282
Query: 1516 HYELSTPPVEHDLLIMRPRKRKSATSELLPWHKELTQGTKRLRDLSEAELVWAQTANRLI 1575
L + L + +P+KRK+ATSEL W+KE+ Q ++RL+ LSEAE+ WA+ NR
Sbjct: 1283 ---LLHGATDKLLKVDKPKKRKTATSELQSWNKEVMQDSQRLKTLSEAEINWARETNRFA 1339
Query: 1576 EKVECTTEVIQDLPAMVKSXXXXXXXXXXXXXXXSPPPAAVLMADVKLHHKSVVYSVSRL 1635
EKVE T +++D P ++S SPPPA V+ ++ V Y+ R
Sbjct: 1340 EKVEFET-LLEDSPP-IRSKRRLIHTTQLMQQLFSPPPARVISLVASSNYDVVAYTAGRA 1397
Query: 1636 TLGEACSSISWSGCDKLLPPGSKNLLPEKNKSSDKVDRCILKVM-DLVDRTSKVEDDILR 1694
LG+ACSS S + PP + N L E+ ++ D+ I K D + RT K+E D
Sbjct: 1398 ALGDACSSSSTDRSEGFSPPNNSNPLSERTENEKISDQYISKAAEDFISRTRKLETDFAG 1457
Query: 1695 LDSRASILDLRVECQDLERFSVINRIAKFHGRGQNDGAETSSSSDASANTQKLPM-KYVT 1753
L++ +I DLRVE QDLE+F+VINR AKFH SSS + + N+ KL + +YVT
Sbjct: 1458 LENGTTIPDLRVEVQDLEKFAVINRFAKFH--------PPSSSMNRTVNSLKLNLQRYVT 1509
Query: 1754 AVPLPRNLPDRVQCLSL 1770
P+P+N+PDRVQCLSL
Sbjct: 1510 IAPMPQNIPDRVQCLSL 1526
Score = 53.9 bits (128), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 86/317 (27%), Positives = 123/317 (38%), Gaps = 66/317 (20%)
Query: 868 RPPLTRKFQYHPMGDLGTEVEPYGNQHVINSQHMPLTHFGGDNGQDQSYLGQSKYGHYDR 927
RP +RKFQYHPMG++ +V Q ++ H+P T G +Q Y GQSK+
Sbjct: 708 RPSTSRKFQYHPMGNI--DVTNESCQEKVS--HLPTTLEQVPVG-NQGYFGQSKFLGQSA 762
Query: 928 SYSEMEKGEKSL-DNNASKIIVPGY----LPKTMNSLDKSFGNYALQRLASPR------- 975
+++G S D N + G P T S D++ + AS R
Sbjct: 763 MNMPIDRGHVSQNDLNCTNEAFNGMGSENSPSTSASADRNVDRCNQVKSASSRQTMLELL 822
Query: 976 -----------------APETESSDGSAVHNQWNXXXXXXGFGLQLAPPTQRLPVVSSRG 1018
PE +S + N GF LQLAPP+Q P +
Sbjct: 823 HKVDQSPDNSSETNVSGIPEANASAEYGGQFRHNQSSASQGFNLQLAPPSQLAPSPDNVQ 882
Query: 1019 LSETVLPTPNVSDTA-DKGHAGLATNQTFPSQEPSHWELKNSISSTTGQIFDKASQYSAL 1077
S L N T +KG G + ++ P W S +T Q
Sbjct: 883 FSRNSLQPLNSFHTGPEKG--GTSQSRFAP------WASNQSYQQSTHQ----------- 923
Query: 1078 GKIP-----QDFTSGFPFSRTHTQNQNVTHLGGQVANTQSANTTLIDSC--VSGNQIDEF 1130
G P + TSGFP+SR + QNQ + VA QSA +DS +S Q+ E
Sbjct: 924 GPFPGILGGSNMTSGFPYSRGYHQNQQMA-----VATRQSAANNSVDSSSELSTPQVKER 978
Query: 1131 CERAQTSQSETASAQDM 1147
E + Q +++Q +
Sbjct: 979 DESSDFDQRMLSASQPL 995
>AT5G07940.3 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: pollen tube;
BEST Arabidopsis thaliana protein match is: dentin
sialophosphoprotein-related (TAIR:AT5G07980.1). |
chr5:2534720-2540086 FORWARD LENGTH=1526
Length = 1526
Score = 310 bits (794), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 216/530 (40%), Positives = 304/530 (57%), Gaps = 46/530 (8%)
Query: 1 MPGNEVGDRVHNFFGQGNLSQGQDH-SLAVDGNWQGLSNNLWVGSQRPPSVPFISNL-NF 58
MPGNE G+++HNFFGQ LSQ Q H S VD +W +N L VG+QR I+NL ++
Sbjct: 1 MPGNEFGEKIHNFFGQEGLSQDQQHQSQVVDRSWSSFNNGL-VGNQRQIDPSLIANLKSY 59
Query: 59 NQQQS-DTEQGYTSSPHFIHGLNITQSNLRPE-SGNQLQNQQRVVNGYLQGQQVFQTRQN 116
N QQS D E+G+ SS + HGLN TQ +R E S + LQ Q++ NGY+ G QT N
Sbjct: 60 NTQQSVDHERGHQSS-NSQHGLNYTQQPIRSEFSRSLLQEHQQLPNGYMHGNLGLQTMPN 118
Query: 117 GANIFGVDTESDRNSLSRGIPLLESQGSGVELYKKSLARNDAAESPVNFDFFGGQQISG- 175
GAN+ G D ES R+ LS ++G EL+ + R + ESPVN+DFFGGQQ S
Sbjct: 119 GANVLGGDVESSRDKLS-------ARGFTPELHNVPM-RLEMGESPVNYDFFGGQQQSNT 170
Query: 176 RYNGMLQPLPRQQSGINEMHLLQQHVVLNQMQELKRQQQYHQL--EPKQQNSITPASSIS 233
+ +GMLQPLPRQQ N+M LL+Q V++ QM E + QQQ + E +Q NS+ ++++
Sbjct: 171 QLSGMLQPLPRQQMTFNDMQLLKQQVMVKQMHEYQMQQQLQKQQLEARQLNSLN-RNAVN 229
Query: 234 NQTIASHSASLINGIPINEASNFIWQPEVIPSNSNWLQGGASPIMHGSSNGLMLSPEQGQ 293
+ + +INGIP+ AS+ +QP+++ N+NW+ G SP + GSS+GLM++PE GQ
Sbjct: 230 GSCASDTQSRMINGIPLQNASSNWFQPDLMTGNTNWMHRGISPAVQGSSSGLMITPEHGQ 289
Query: 294 TMRLMGLVPNQGDQSLYGVPISGSRGTPSMYSHVQADRPAVPQVSIPRQYSHVHGDKSVL 353
+ L+ Q SLYG+P+SG+ + +S VQ +R A P S R YS
Sbjct: 290 S----NLMAQQFGPSLYGMPVSGTNAPQNAFSSVQMNRLAAPHGSANRSYSLT------- 338
Query: 354 QHISANSNSFPAHQYTAFSDQINTNDGTSVSKQSILGKSMFGSTA-HGINSRLNMENLQQ 412
+Q T+F +Q + D + + K++F T+ N+R N EN QQ
Sbjct: 339 ------------NQPTSFLNQGDVQDSQMHPRSTYQEKALFSQTSVPDSNNRPNFENFQQ 386
Query: 413 VSSEQNIVPVQEFNGRQELAGSSETLQNMMVAQTPPSQHLATLDPAEEKILFGSEDSMWD 472
S + + Q+ + E +G +E + Q + LDP EEKILFGS+D++WD
Sbjct: 387 DDSRERNISAQDKFCQMEDSGPAEKSFMKVPENMNALQKSSALDPTEEKILFGSDDNLWD 446
Query: 473 GFGRNSG----GFSMMDGTDNFSEFPSIQSGSWSALMQSAVAETSSSGIG 518
FG ++ G M +D F PS+QSGSWSALMQSAVAET+S G
Sbjct: 447 AFGSSTDMSLQGNLMSSNSDLFDACPSLQSGSWSALMQSAVAETTSDDAG 496
Score = 234 bits (596), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 143/377 (37%), Positives = 202/377 (53%), Gaps = 21/377 (5%)
Query: 1396 GHNTNVTSQEVVGYGQNNAPSASNSNKTISVRRNHSLVNPQMAPSWFEQYGTFKNGKILP 1455
G N ++ + G + S S+ N SVR +H ++PQMAPSW+ QYGTFKNG + P
Sbjct: 1169 GLNNKESANHLPHLGHTVSQSFSSKNHAASVRADHQQISPQMAPSWYSQYGTFKNGLVQP 1228
Query: 1456 MYDVRKMTAAKILDHPFTVPNQSDSLHLQNSVEQVKXXXXXXXXXXXXXTIPASVASENE 1515
M D + T KI + V + D H S +Q P+S +
Sbjct: 1229 MNDTGRFTPLKIGEQSSNVESSVDGTHTVQSCKQC--LMEQMSGSAPGVETPSSDS---- 1282
Query: 1516 HYELSTPPVEHDLLIMRPRKRKSATSELLPWHKELTQGTKRLRDLSEAELVWAQTANRLI 1575
L + L + +P+KRK+ATSEL W+KE+ Q ++RL+ LSEAE+ WA+ NR
Sbjct: 1283 ---LLHGATDKLLKVDKPKKRKTATSELQSWNKEVMQDSQRLKTLSEAEINWARETNRFA 1339
Query: 1576 EKVECTTEVIQDLPAMVKSXXXXXXXXXXXXXXXSPPPAAVLMADVKLHHKSVVYSVSRL 1635
EKVE T +++D P ++S SPPPA V+ ++ V Y+ R
Sbjct: 1340 EKVEFET-LLEDSPP-IRSKRRLIHTTQLMQQLFSPPPARVISLVASSNYDVVAYTAGRA 1397
Query: 1636 TLGEACSSISWSGCDKLLPPGSKNLLPEKNKSSDKVDRCILKVM-DLVDRTSKVEDDILR 1694
LG+ACSS S + PP + N L E+ ++ D+ I K D + RT K+E D
Sbjct: 1398 ALGDACSSSSTDRSEGFSPPNNSNPLSERTENEKISDQYISKAAEDFISRTRKLETDFAG 1457
Query: 1695 LDSRASILDLRVECQDLERFSVINRIAKFHGRGQNDGAETSSSSDASANTQKLPM-KYVT 1753
L++ +I DLRVE QDLE+F+VINR AKFH SSS + + N+ KL + +YVT
Sbjct: 1458 LENGTTIPDLRVEVQDLEKFAVINRFAKFH--------PPSSSMNRTVNSLKLNLQRYVT 1509
Query: 1754 AVPLPRNLPDRVQCLSL 1770
P+P+N+PDRVQCLSL
Sbjct: 1510 IAPMPQNIPDRVQCLSL 1526
Score = 53.9 bits (128), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 86/317 (27%), Positives = 123/317 (38%), Gaps = 66/317 (20%)
Query: 868 RPPLTRKFQYHPMGDLGTEVEPYGNQHVINSQHMPLTHFGGDNGQDQSYLGQSKYGHYDR 927
RP +RKFQYHPMG++ +V Q ++ H+P T G +Q Y GQSK+
Sbjct: 708 RPSTSRKFQYHPMGNI--DVTNESCQEKVS--HLPTTLEQVPVG-NQGYFGQSKFLGQSA 762
Query: 928 SYSEMEKGEKSL-DNNASKIIVPGY----LPKTMNSLDKSFGNYALQRLASPR------- 975
+++G S D N + G P T S D++ + AS R
Sbjct: 763 MNMPIDRGHVSQNDLNCTNEAFNGMGSENSPSTSASADRNVDRCNQVKSASSRQTMLELL 822
Query: 976 -----------------APETESSDGSAVHNQWNXXXXXXGFGLQLAPPTQRLPVVSSRG 1018
PE +S + N GF LQLAPP+Q P +
Sbjct: 823 HKVDQSPDNSSETNVSGIPEANASAEYGGQFRHNQSSASQGFNLQLAPPSQLAPSPDNVQ 882
Query: 1019 LSETVLPTPNVSDTA-DKGHAGLATNQTFPSQEPSHWELKNSISSTTGQIFDKASQYSAL 1077
S L N T +KG G + ++ P W S +T Q
Sbjct: 883 FSRNSLQPLNSFHTGPEKG--GTSQSRFAP------WASNQSYQQSTHQ----------- 923
Query: 1078 GKIP-----QDFTSGFPFSRTHTQNQNVTHLGGQVANTQSANTTLIDSC--VSGNQIDEF 1130
G P + TSGFP+SR + QNQ + VA QSA +DS +S Q+ E
Sbjct: 924 GPFPGILGGSNMTSGFPYSRGYHQNQQMA-----VATRQSAANNSVDSSSELSTPQVKER 978
Query: 1131 CERAQTSQSETASAQDM 1147
E + Q +++Q +
Sbjct: 979 DESSDFDQRMLSASQPL 995
>AT5G07940.1 | Symbols: | BEST Arabidopsis thaliana protein match
is: dentin sialophosphoprotein-related
(TAIR:AT5G07980.1); Has 1906 Blast hits to 1127 proteins
in 203 species: Archae - 2; Bacteria - 210; Metazoa -
401; Fungi - 205; Plants - 136; Viruses - 0; Other
Eukaryotes - 952 (source: NCBI BLink). |
chr5:2534720-2540086 FORWARD LENGTH=1526
Length = 1526
Score = 310 bits (794), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 216/530 (40%), Positives = 304/530 (57%), Gaps = 46/530 (8%)
Query: 1 MPGNEVGDRVHNFFGQGNLSQGQDH-SLAVDGNWQGLSNNLWVGSQRPPSVPFISNL-NF 58
MPGNE G+++HNFFGQ LSQ Q H S VD +W +N L VG+QR I+NL ++
Sbjct: 1 MPGNEFGEKIHNFFGQEGLSQDQQHQSQVVDRSWSSFNNGL-VGNQRQIDPSLIANLKSY 59
Query: 59 NQQQS-DTEQGYTSSPHFIHGLNITQSNLRPE-SGNQLQNQQRVVNGYLQGQQVFQTRQN 116
N QQS D E+G+ SS + HGLN TQ +R E S + LQ Q++ NGY+ G QT N
Sbjct: 60 NTQQSVDHERGHQSS-NSQHGLNYTQQPIRSEFSRSLLQEHQQLPNGYMHGNLGLQTMPN 118
Query: 117 GANIFGVDTESDRNSLSRGIPLLESQGSGVELYKKSLARNDAAESPVNFDFFGGQQISG- 175
GAN+ G D ES R+ LS ++G EL+ + R + ESPVN+DFFGGQQ S
Sbjct: 119 GANVLGGDVESSRDKLS-------ARGFTPELHNVPM-RLEMGESPVNYDFFGGQQQSNT 170
Query: 176 RYNGMLQPLPRQQSGINEMHLLQQHVVLNQMQELKRQQQYHQL--EPKQQNSITPASSIS 233
+ +GMLQPLPRQQ N+M LL+Q V++ QM E + QQQ + E +Q NS+ ++++
Sbjct: 171 QLSGMLQPLPRQQMTFNDMQLLKQQVMVKQMHEYQMQQQLQKQQLEARQLNSLN-RNAVN 229
Query: 234 NQTIASHSASLINGIPINEASNFIWQPEVIPSNSNWLQGGASPIMHGSSNGLMLSPEQGQ 293
+ + +INGIP+ AS+ +QP+++ N+NW+ G SP + GSS+GLM++PE GQ
Sbjct: 230 GSCASDTQSRMINGIPLQNASSNWFQPDLMTGNTNWMHRGISPAVQGSSSGLMITPEHGQ 289
Query: 294 TMRLMGLVPNQGDQSLYGVPISGSRGTPSMYSHVQADRPAVPQVSIPRQYSHVHGDKSVL 353
+ L+ Q SLYG+P+SG+ + +S VQ +R A P S R YS
Sbjct: 290 S----NLMAQQFGPSLYGMPVSGTNAPQNAFSSVQMNRLAAPHGSANRSYSLT------- 338
Query: 354 QHISANSNSFPAHQYTAFSDQINTNDGTSVSKQSILGKSMFGSTA-HGINSRLNMENLQQ 412
+Q T+F +Q + D + + K++F T+ N+R N EN QQ
Sbjct: 339 ------------NQPTSFLNQGDVQDSQMHPRSTYQEKALFSQTSVPDSNNRPNFENFQQ 386
Query: 413 VSSEQNIVPVQEFNGRQELAGSSETLQNMMVAQTPPSQHLATLDPAEEKILFGSEDSMWD 472
S + + Q+ + E +G +E + Q + LDP EEKILFGS+D++WD
Sbjct: 387 DDSRERNISAQDKFCQMEDSGPAEKSFMKVPENMNALQKSSALDPTEEKILFGSDDNLWD 446
Query: 473 GFGRNSG----GFSMMDGTDNFSEFPSIQSGSWSALMQSAVAETSSSGIG 518
FG ++ G M +D F PS+QSGSWSALMQSAVAET+S G
Sbjct: 447 AFGSSTDMSLQGNLMSSNSDLFDACPSLQSGSWSALMQSAVAETTSDDAG 496
Score = 234 bits (596), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 143/377 (37%), Positives = 202/377 (53%), Gaps = 21/377 (5%)
Query: 1396 GHNTNVTSQEVVGYGQNNAPSASNSNKTISVRRNHSLVNPQMAPSWFEQYGTFKNGKILP 1455
G N ++ + G + S S+ N SVR +H ++PQMAPSW+ QYGTFKNG + P
Sbjct: 1169 GLNNKESANHLPHLGHTVSQSFSSKNHAASVRADHQQISPQMAPSWYSQYGTFKNGLVQP 1228
Query: 1456 MYDVRKMTAAKILDHPFTVPNQSDSLHLQNSVEQVKXXXXXXXXXXXXXTIPASVASENE 1515
M D + T KI + V + D H S +Q P+S +
Sbjct: 1229 MNDTGRFTPLKIGEQSSNVESSVDGTHTVQSCKQC--LMEQMSGSAPGVETPSSDS---- 1282
Query: 1516 HYELSTPPVEHDLLIMRPRKRKSATSELLPWHKELTQGTKRLRDLSEAELVWAQTANRLI 1575
L + L + +P+KRK+ATSEL W+KE+ Q ++RL+ LSEAE+ WA+ NR
Sbjct: 1283 ---LLHGATDKLLKVDKPKKRKTATSELQSWNKEVMQDSQRLKTLSEAEINWARETNRFA 1339
Query: 1576 EKVECTTEVIQDLPAMVKSXXXXXXXXXXXXXXXSPPPAAVLMADVKLHHKSVVYSVSRL 1635
EKVE T +++D P ++S SPPPA V+ ++ V Y+ R
Sbjct: 1340 EKVEFET-LLEDSPP-IRSKRRLIHTTQLMQQLFSPPPARVISLVASSNYDVVAYTAGRA 1397
Query: 1636 TLGEACSSISWSGCDKLLPPGSKNLLPEKNKSSDKVDRCILKVM-DLVDRTSKVEDDILR 1694
LG+ACSS S + PP + N L E+ ++ D+ I K D + RT K+E D
Sbjct: 1398 ALGDACSSSSTDRSEGFSPPNNSNPLSERTENEKISDQYISKAAEDFISRTRKLETDFAG 1457
Query: 1695 LDSRASILDLRVECQDLERFSVINRIAKFHGRGQNDGAETSSSSDASANTQKLPM-KYVT 1753
L++ +I DLRVE QDLE+F+VINR AKFH SSS + + N+ KL + +YVT
Sbjct: 1458 LENGTTIPDLRVEVQDLEKFAVINRFAKFH--------PPSSSMNRTVNSLKLNLQRYVT 1509
Query: 1754 AVPLPRNLPDRVQCLSL 1770
P+P+N+PDRVQCLSL
Sbjct: 1510 IAPMPQNIPDRVQCLSL 1526
Score = 53.9 bits (128), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 86/317 (27%), Positives = 123/317 (38%), Gaps = 66/317 (20%)
Query: 868 RPPLTRKFQYHPMGDLGTEVEPYGNQHVINSQHMPLTHFGGDNGQDQSYLGQSKYGHYDR 927
RP +RKFQYHPMG++ +V Q ++ H+P T G +Q Y GQSK+
Sbjct: 708 RPSTSRKFQYHPMGNI--DVTNESCQEKVS--HLPTTLEQVPVG-NQGYFGQSKFLGQSA 762
Query: 928 SYSEMEKGEKSL-DNNASKIIVPGY----LPKTMNSLDKSFGNYALQRLASPR------- 975
+++G S D N + G P T S D++ + AS R
Sbjct: 763 MNMPIDRGHVSQNDLNCTNEAFNGMGSENSPSTSASADRNVDRCNQVKSASSRQTMLELL 822
Query: 976 -----------------APETESSDGSAVHNQWNXXXXXXGFGLQLAPPTQRLPVVSSRG 1018
PE +S + N GF LQLAPP+Q P +
Sbjct: 823 HKVDQSPDNSSETNVSGIPEANASAEYGGQFRHNQSSASQGFNLQLAPPSQLAPSPDNVQ 882
Query: 1019 LSETVLPTPNVSDTA-DKGHAGLATNQTFPSQEPSHWELKNSISSTTGQIFDKASQYSAL 1077
S L N T +KG G + ++ P W S +T Q
Sbjct: 883 FSRNSLQPLNSFHTGPEKG--GTSQSRFAP------WASNQSYQQSTHQ----------- 923
Query: 1078 GKIP-----QDFTSGFPFSRTHTQNQNVTHLGGQVANTQSANTTLIDSC--VSGNQIDEF 1130
G P + TSGFP+SR + QNQ + VA QSA +DS +S Q+ E
Sbjct: 924 GPFPGILGGSNMTSGFPYSRGYHQNQQMA-----VATRQSAANNSVDSSSELSTPQVKER 978
Query: 1131 CERAQTSQSETASAQDM 1147
E + Q +++Q +
Sbjct: 979 DESSDFDQRMLSASQPL 995
>AT5G07980.1 | Symbols: | dentin sialophosphoprotein-related |
chr5:2549432-2554669 REVERSE LENGTH=1501
Length = 1501
Score = 305 bits (781), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 216/535 (40%), Positives = 298/535 (55%), Gaps = 47/535 (8%)
Query: 1 MPGNEVGDRVHNFFGQGNLSQGQDHSLAVDGNWQGLSNNLWVGSQRPPSVPFISNLN--F 58
MPGNE G+R HNFFGQ LSQ Q S VDG+W SN L VG+QR ++L
Sbjct: 1 MPGNEFGERTHNFFGQEGLSQDQHQSQVVDGSWSSFSNGL-VGNQRQIDPSLTADLKSYR 59
Query: 59 NQQQSDTEQGYTSSPHFIHGLNITQSNLRPE-SGNQLQNQQRVVNGYLQGQQVFQTRQNG 117
QQ D E+G +S+ HGLN TQ +R E S + LQ Q+ NGY+ G QT N
Sbjct: 60 TQQPVDPERGQSSNSQ--HGLNFTQQPMRSEYSRSVLQEPQQPTNGYMHGNLGLQTMPNE 117
Query: 118 ANIFGVDTESDRNSLSRGIPLLESQGSGVELYKKSLARNDAAESPVNFDFFGGQQISG-R 176
AN+ G+D ES R+ LS +G +L+K R + ESPVN+DFFGGQQ S +
Sbjct: 118 ANVLGMDVESSRDKLSE-------RGFTPDLHKIP-TRFEMGESPVNYDFFGGQQQSNTQ 169
Query: 177 YNGMLQPLPRQQSGINEMHLLQQHVVLNQMQELKRQQQYH--QLEPKQQNSITPASSISN 234
GMLQPLPRQQ N+M LL+Q V++ QM E + QQQ +LE +Q NS+ ++++
Sbjct: 170 LPGMLQPLPRQQVSFNDMQLLKQQVMVKQMHEYQMQQQLQKQRLEARQLNSLN-RNAVNG 228
Query: 235 QTIASHSASLINGIPINEASNFIWQPEVIPSNSNWLQGGASPIMHGSSNGLMLSPEQGQT 294
++ + + +INGIP+ AS+ QP+++ N+NW+ G SP + GSS+GLM++P+ GQ
Sbjct: 229 SCVSDNQSHMINGIPLQNASSNWLQPDLMTGNTNWMHRGISPAVQGSSSGLMITPDHGQA 288
Query: 295 MRLMGLVPNQGDQSLYGVPISGSRGTPSMYSHVQADRPAVPQVSIPRQYSHVHGDKSVLQ 354
L+ Q + SLYG+P+SG+ + +S Q +R A Q
Sbjct: 289 ----NLMAQQFEPSLYGMPVSGTNAPHNAFSSSQMNRLAA-------------------Q 325
Query: 355 HISANSNSFPAHQYTAFSDQINTNDGTSVSKQSILGKSMFGSTA-HGINSRLNMENLQQV 413
H SAN S +Q T+F +Q + D + + + K +F T+ NS N E+LQ+
Sbjct: 326 HGSANRTSSVTNQPTSFLNQGDVQDSHMLPRSTYPEKLLFSQTSVPSSNSMPNFESLQED 385
Query: 414 SSEQNIVPVQEFNGRQELAGSSETLQNMMVAQTPPSQHLATLDPAEEKILFGSEDSMWDG 473
S + + VQ G+ E +G SE Q LDP EEKILFGS+D++W+
Sbjct: 386 DSRERNISVQAKFGQMEGSGPSEQSFIKAPENINALQKSTALDPTEEKILFGSDDNLWEA 445
Query: 474 FGRNSG----GFSMMDGTDNFSEFPSIQSGSWSALMQSAVAETSSSGIGGQEEWS 524
FG ++ G M +D F PS+QSGSWSALMQSAVAETSS G EW+
Sbjct: 446 FGNSTDMSLTGNLMSSSSDLFDGCPSLQSGSWSALMQSAVAETSSDD-AGVHEWA 499
Score = 250 bits (639), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 170/473 (35%), Positives = 241/473 (50%), Gaps = 59/473 (12%)
Query: 1307 QAFGQSLRPNNV----LNHNSPLLGQVQSMRNVGIDPSNRDAKRM---KVSDNLVDKQQV 1359
Q+FG+SL NN L H+ + G + DA +M +V ++ +D Q+V
Sbjct: 1079 QSFGRSLPSNNFPKDSLRHDEQMAGSGEG-----------DAPKMTVKRVENSAIDPQKV 1127
Query: 1360 DSNHGYDNAVKDVSENKSSILSSDPSMTSFLSKLHDGHNTNVTSQEVVGYGQNNAPSASN 1419
A K ++ S SD S + DG N + + +GQN S S
Sbjct: 1128 --------APKGEQQSPSK---SD-------SLVRDGLNHRESVNHMPYFGQNVTQSFST 1169
Query: 1420 SNKTISVRRNHSLVNPQMAPSWFEQYGTFKNGKILPMYDVRKMTAAKILDHPFTVPNQSD 1479
N + SV +H ++PQMAPSW+ QYGTFKNG + P+ D + T KI + V + D
Sbjct: 1170 KNHSASVGADHQQISPQMAPSWYSQYGTFKNGLVQPVNDTGRFTPLKIGEQSSNVGSSVD 1229
Query: 1480 SLHLQNSVEQVKXXXXXXXXXXXXXTIPASVASENEHYELSTPPVEHDLLIMRPRKRKSA 1539
H V+ T+ A + S L E L + +P+KRK+A
Sbjct: 1230 GTH------SVQLSQHFKMQQMSGSTLGAEIPSSE---SLPHGATEQLLKVNKPKKRKTA 1280
Query: 1540 TSELLPWHKELTQGTKRLRDLSEAELVWAQTANRLIEKVECTTEVIQDLPAMVKSXXXXX 1599
TSEL+PW+KE+ QG +RL+ L EAE+ WA+ NR EKVE T +++D P +KS
Sbjct: 1281 TSELIPWNKEVMQGHQRLKTLGEAEVDWARATNRFAEKVEFET-LLEDSPP-IKSKRRLV 1338
Query: 1600 XXXXXXXXXXSPPPAAVLMADVKLHHKSVVYSVSRLTLGEACSSISWSGCDKLLPPGSKN 1659
SPPPA V+ +++ V Y+ +R LG+ACSS S + PP N
Sbjct: 1339 YTTQLMQQLCSPPPARVISLVASSNYEFVAYTAARGALGDACSSSSTDRSEGFWPPNISN 1398
Query: 1660 LLPEKNKSSDKVDRCILKVM-DLVDRTSKVEDDILRLDSRASILDLRVECQDLERFSVIN 1718
L E+ K+ D+ I K D + RT K+E D RL++ +I DLRVE QDLE+F+VIN
Sbjct: 1399 PLSERTKTEKISDQYISKAAEDFISRTRKLETDFARLENGTTIPDLRVEVQDLEKFAVIN 1458
Query: 1719 RIAKFHGRGQNDGAETSSSSDASANTQKL-PMKYVTAVPLPRNLPDRVQCLSL 1770
R AKFH S D + N+ ++ P +YVT P+P+N+PDRVQCLSL
Sbjct: 1459 RFAKFH----------PPSMDRTLNSVRINPQRYVTVAPMPQNIPDRVQCLSL 1501
>AT5G07970.1 | Symbols: | dentin sialophosphoprotein-related |
chr5:2544126-2547916 REVERSE LENGTH=1097
Length = 1097
Score = 282 bits (721), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 212/571 (37%), Positives = 294/571 (51%), Gaps = 81/571 (14%)
Query: 1 MPGNEVGDRVHNFFGQGNLSQGQDHSLAVDGNWQGLSNNLWVGSQRPPSVPFISNLN-FN 59
MPGNE G+R+HNFFGQ LSQ A DG+W G N L V +QR I+NL ++
Sbjct: 1 MPGNEYGERIHNFFGQEGLSQDSHQPQAGDGSWSGFRNGL-VSNQRQIDPSLIANLKTYS 59
Query: 60 QQQS-DTEQGYTSSPHFIHGLNITQSNLRPE-SGNQLQNQQRVVNGYLQGQQVFQTRQNG 117
QQS D E+G +S+ HGLN Q +R + S + L+ Q+ GY+ G + Q N
Sbjct: 60 TQQSVDPERGQSSNSQ--HGLNFAQQPMRSDYSRSVLREHQQSTTGYMHGNLMLQASPNE 117
Query: 118 ANIFGVDTESDRNSLSRGIPLLESQGSGVELYK-KSLARNDAAESPVNFDFFGGQQ-ISG 175
+ GVD ES R+ LS GSG L + K+ R D ESPVN+DFFGGQQ ++
Sbjct: 118 GSFVGVDVESSRDRLS---------GSGFTLDRHKTPMRFDMGESPVNYDFFGGQQQLNN 168
Query: 176 RYNGMLQPLPRQQSGINEMHLLQQHVVLNQMQELKRQQ-------QYHQLEPKQQNSITP 228
+ GM+QP PRQQ N+M LL+QH + QM E + QQ + QL N++
Sbjct: 169 QLPGMIQPFPRQQMTFNDMQLLKQHAMAKQMHEYQIQQQLQKQQLEARQLNSLHSNAVNG 228
Query: 229 ASSISNQTIASHSASLINGIPINEASNFIWQPEVIPSNSNWLQGGASPIMHGSSNGLMLS 288
+ S NQ+ S I+G+P+ +ASN QP+++ N+NW+ G SPI+ SS+GL+++
Sbjct: 229 SLSSDNQSHPS-----ISGVPLQDASNNWLQPDLMTGNTNWMHRGISPIVQSSSSGLVIT 283
Query: 289 PEQGQTMRLMGLVPNQGDQSLYGVPISGSRGTPSMYSHVQADRPAVPQVSIPRQYSHVHG 348
PE G L+ Q + SLYG+P+ G+ + +S Q A
Sbjct: 284 PEHGHA----NLMAQQFETSLYGMPVGGTDAPQNAFSSFQMKMLAA-------------- 325
Query: 349 DKSVLQHISANSNSFPAHQYTAFSDQINTNDGTSVSKQSILGKSMFGSTAHGINSRLNME 408
QH SAN +S +Q T+F +N +D + + + + G N R N E
Sbjct: 326 -----QHGSANMSSSLTNQPTSF---LNQSDSHMLPRSTYQENLYSHISVPGSNDRPNFE 377
Query: 409 NLQQVSSEQNIVPVQEFNGRQELAGSSETLQNMMVAQTPPSQHLATLDPAEEKILFGSED 468
+ QQ +S Q + QE G+ + +G SE + Q TLDP EEKILFGS+D
Sbjct: 378 SFQQDNSGQQNISGQEEFGQMDGSGLSEKSFMKVPENINTLQKSTTLDPTEEKILFGSDD 437
Query: 469 SMWDGFGRNSG----GFSMMDGTDNFSEFPSIQSGSWSALMQSAVAETSSSGIGGQEEWS 524
++W+ FG ++ G M +D F PS+QSGSWSALMQSAVAET+S G EW
Sbjct: 438 NLWEAFGNSTDMSLTGNLMSSSSDLFDACPSLQSGSWSALMQSAVAETASDD-AGVHEWG 496
Query: 525 GLSFQNIGPSSGNEHPSTTDSRKQQSAWADN 555
KQQS WA+N
Sbjct: 497 S---------------------KQQSVWANN 506
Score = 205 bits (521), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 131/379 (34%), Positives = 188/379 (49%), Gaps = 12/379 (3%)
Query: 1393 LHDGHNTNVTSQEVVGYGQNNAPSASNSNKTISVRRNHSLVNPQMAPSWFEQYGTFKNGK 1452
+ DG N ++ ++ +G + S N N +S +H ++PQ+APS F QY FKNG
Sbjct: 730 IRDGLNHKDSANCMLQFGPTISQSFFNKNHAVSAGSDHQQISPQIAPSRFSQYEAFKNGL 789
Query: 1453 ILPMYDVRKMTAAKILDHPFTVPNQSDSLHLQNSVEQVKXXXXXXXXXXXXXTIPASVAS 1512
+ P+ D + T KI + + N D LH S +Q+ +
Sbjct: 790 VQPVNDTGRFTLLKIGERYSNLGNSDDGLHSVQSSKQLNTADPGYIVHMQQISGSTPGVE 849
Query: 1513 ENEHYELSTPPVEHDLLIMRPRKRKSATSELLPWHKELTQGTKRLRDLSEAELVWAQTAN 1572
L + L + +P+KRK+ TSELL W KE+ Q +RL+ L EAE+ WA+ N
Sbjct: 850 TLSSASLPCGATDQLLKVYKPKKRKNVTSELLSWSKEVMQRPQRLKTLGEAEVDWARATN 909
Query: 1573 RLIEKVECTTEVIQDLPAMVKSXXXXXXXXXXXXXXXSPPPAAVLMADVKLHHKSVVYSV 1632
R EKVE T +++D P ++S P P V + ++ V YS
Sbjct: 910 RFAEKVEFAT-LLEDGPP-IRSKRRLIYTTQLMQQLFRPLPGRV--KSLVTSYEFVAYSA 965
Query: 1633 SRLTLGEACSSISWSGCDKLLPPGSKNLLPEKNKSSDKVDRCILKVM-DLVDRTSKVEDD 1691
+R LG+ACSS S + L + N L E+ ++ D+ I K D + RT K+E D
Sbjct: 966 ARAALGDACSSTSTDRIEGFLLQNNLNPLSERTETEKMSDQYISKAAEDFISRTKKLETD 1025
Query: 1692 ILRLDSRASILDLRVECQDLERFSVINRIAKFHGRGQNDGAETSSSSDASANTQKLPMKY 1751
L+ +I DLRVE QDLERF+VINR A FH Q+ + S S N P +Y
Sbjct: 1026 FAGLEKGTTITDLRVEVQDLERFAVINRFASFH---QSSSSMDRSVSSLRLN----PQRY 1078
Query: 1752 VTAVPLPRNLPDRVQCLSL 1770
VT P+PR++PDRVQCLS
Sbjct: 1079 VTVAPVPRHIPDRVQCLSF 1097
>AT3G29385.1 | Symbols: | BEST Arabidopsis thaliana protein match is:
dentin sialophosphoprotein-related (TAIR:AT5G07980.1);
Has 74 Blast hits to 74 proteins in 11 species: Archae -
0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 74;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr3:11284395-11285402 REVERSE LENGTH=218
Length = 218
Score = 99.4 bits (246), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 78/238 (32%), Positives = 108/238 (45%), Gaps = 27/238 (11%)
Query: 1534 RKRKSATSELLPWHKELTQGTKRLRDLSEAELVWAQTANRLIEKVECTTEVIQDLPAMVK 1593
+KRKS+T PWHK QG++ ++ AE W N L EKV+ T E I
Sbjct: 7 KKRKSSTFLQSPWHKVYLQGSELCHNIRIAEQEWNLATNTLSEKVD-TNEAISP------ 59
Query: 1594 SXXXXXXXXXXXXXXXSPPPAAVLMAD-VKLHHKSVVYSVSRLTLGEACSSISWSGCDKL 1652
S P P V + D L+++ V+Y VSR+ L +CS S DK
Sbjct: 60 SKRRLLSSTHLMQQLLQPAPTFVFLGDNAALNYEIVLYYVSRINLANSCSLKCRSDLDK- 118
Query: 1653 LPPGSKNLLPEKNKSSDKVDRCILKVMDLVDRTSKVEDDILRLDSRASILDLRVECQDLE 1712
S N K S+ +L V ++ K+E + L+ SILD+ E QDLE
Sbjct: 119 ----SINRQTSKTASNQDQQHSLL-VNAFNEKIQKLESNFQSLERTTSILDIIFEIQDLE 173
Query: 1713 RFSVINRIAKFHGRGQNDGAETSSSSDASANTQKLPMKYVTAVPLPRNLPDRVQCLSL 1770
RFS+IN + KFH R A + +P KY A+ +P NLP+ + CL L
Sbjct: 174 RFSMINHLGKFHNR-------------AKTFKRLIPHKYAVAIQMPMNLPEPLHCLPL 218