Miyakogusa Predicted Gene
- Lj1g3v3329900.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v3329900.1 Non Characterized Hit- tr|D8S3L3|D8S3L3_SELML
Putative uncharacterized protein OS=Selaginella
moelle,36.09,3e-17,BAR/IMD domain-like,NULL; coiled-coil,NULL;
seg,NULL; FAMILY NOT NAMED,NULL,CUFF.30441.1
(498 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                     Score  E
Sequences producing significant alignments:                         (bits)  Value

AT2G33490.1 | Symbols: | hydroxyproline-rich glycoprotein famil...     322  3e-88
AT3G26910.2 | Symbols: | hydroxyproline-rich glycoprotein famil...     220  2e-57
AT3G26910.1 | Symbols: | hydroxyproline-rich glycoprotein famil...     220  2e-57
AT5G41100.2 | Symbols: | FUNCTIONS IN: molecular_function unkno...     218  9e-57
AT5G41100.1 | Symbols: | FUNCTIONS IN: molecular_function unkno...     218  1e-56
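
The E values in the summary above can be roughly cross-checked from the bit scores with the standard relation E ≈ m · n · 2^(-S'), where S' is the bit score, m the query length and n the database size in letters. The sketch below is only an order-of-magnitude estimate: it assumes the raw lengths printed in this report (498 and 14,482,855) rather than the effective lengths BLAST uses internally, and it works from the rounded bit scores. The function name `approx_evalue` is hypothetical.

```python
# Rough estimate of a BLAST E value from a bit score: E ~ m * n * 2**(-S').
# Assumption: raw query/database lengths are used instead of BLAST's
# effective (edge-corrected) lengths, so results only match approximately.

QUERY_LEN = 498            # "(498 letters)" in this report
DB_LETTERS = 14_482_855    # "14,482,855 total letters" in this report


def approx_evalue(bit_score, query_len=QUERY_LEN, db_letters=DB_LETTERS):
    """Return the approximate expected number of chance hits."""
    return query_len * db_letters * 2.0 ** (-bit_score)


if __name__ == "__main__":
    # Bit scores and reported E values copied from the summary table.
    hits = [
        ("AT2G33490.1", 322, "3e-88"),
        ("AT3G26910.2", 220, "2e-57"),
        ("AT5G41100.2", 218, "9e-57"),
    ]
    for name, bits, reported in hits:
        print(f"{name}: estimated {approx_evalue(bits):.1e}, reported {reported}")
```

The estimates land within about one order of magnitude of the reported values; the remaining gap is expected given the length correction and score rounding noted above.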
>AT2G33490.1 | Symbols: | hydroxyproline-rich glycoprotein family
protein | chr2:14183552-14187666 FORWARD LENGTH=623
Length = 623
Score = 322 bits (826), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 220/494 (44%), Positives = 277/494 (56%), Gaps = 31/494 (6%)
Query: 1 MKRQCDEKRDVYEYMATRFRERGRSKGGKTETFSLQQLQTARDEYDEEATLFVFRLKSLK 60
M+R CDEKR+VYE M TR RE+GRSKGGK ETFS QQLQ A D+Y+ E TLFVFRLKSLK
Sbjct: 132 MQRLCDEKRNVYEGMLTRQREKGRSKGGKGETFSPQQLQEAHDDYENETTLFVFRLKSLK 191
Query: 61 QGQSRSLLTQAARHHAAQLCFFKKAVKSLETVEPHVKSVTEQQHIDYHFXXXXXXXXXXX 120
QGQ+RSLLTQAARHHAAQLCFFKKA+ SLE V+PHV+ VTE QHIDYHF
Sbjct: 192 QGQTRSLLTQAARHHAAQLCFFKKALSSLEEVDPHVQMVTESQHIDYHF-SGLEDDDGDD 250
Query: 121 XXXXXXXXXXXXXXXXXXSFDYGQIEQEQDV-STSRNSMELDQVELTLPRGSTAEAAKEN 179
SF+Y +++QD S++ S EL ++T P+ A+EN
Sbjct: 251 EIENNENDGSEVHDDGELSFEYRVNDKDQDADSSAGGSSELGNSDITFPQIGGPYTAQEN 310
Query: 180 LDKLQRNLFSFR--VRTGSQSAPLFADNKPD-SSEKLRQMRPSLSRKFSSYVLPTPVDAK 236
+ R SFR VR SQSAPLF +N+ SEKL +MR +L+RKF++Y LPTPV+
Sbjct: 311 EEGNYRKSHSFRRDVRAVSQSAPLFPENRTTPPSEKLLRMRSTLTRKFNTYALPTPVETT 370
Query: 237 SSISSGSNNPKPSKMQTNSNEATT-NLWHSSPLEQKKHETIRDEFSSPTVRNAQSVLKES 295
S SS ++ + +N +A T +W+SSPLE + + S V + VL+ES
Sbjct: 371 RSPSSTTSPGHKNVGSSNPTKAITKQIWYSSPLETRGPAKVS---SRSMVALKEQVLRES 427
Query: 296 NSNTATTRLPPPLVDGLLSSNHDYVSAYSKKIKRHAFSGPLTSNPMPTRPVSVDSVQMFX 355
N NT+ RLPPPL DGLL S +KR +FSGPLTS P+P +P+S S
Sbjct: 428 NKNTS--RLPPPLADGLLFSRLG-------TLKRRSFSGPLTSKPLPNKPLSTTS---HL 475
Query: 356 XXXXXXXXXXXXXXXXXXXXXXXXTIVSSPKISELHELPRPPTNFPSNSRLLGLVGYSGP 415
T VS+PKISELHELPRPP S+++ +GYS P
Sbjct: 476 YSGPIPRNPVSKLPKVSSSPTASPTFVSTPKISELHELPRPPPR--SSTKSSRELGYSAP 533
Query: 416 LVPRGQKVSAPNNLVXXXXXXXXXXXXQAMARSFSIPSSGARVTXXXXXXXXXXXXXXXX 475
LV R Q +S P L+ A+ RSFSIP+S R +
Sbjct: 534 LVSRSQLLSKP--LITNSASPLPIPP--AITRSFSIPTSNLRAS----DLDMSKTSLGTK 585
Query: 476 XXDIASPPLTPIAL 489
SPPLTP++L
Sbjct: 586 KLGTPSPPLTPMSL 599
>AT3G26910.2 | Symbols: | hydroxyproline-rich glycoprotein family
protein | chr3:9915304-9918511 REVERSE LENGTH=614
Length = 614
Score = 220 bits (561), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 179/516 (34%), Positives = 243/516 (47%), Gaps = 69/516 (13%)
Query: 1 MKRQCDEKRDVYEYMATRFRERGRSKGGKTETFSLQQLQTARDEYDEEATLFVFRLKSLK 60
MK+QCD KR+VYE + +E+GR K K E + + A E+ +EAT+ +FRLKSLK
Sbjct: 134 MKQQCDGKRNVYEM--SLVKEKGRPKSSKGERHIPPESRPAYSEFHDEATMCIFRLKSLK 191
Query: 61 QGQSRSLLTQAARHHAAQLCFFKKAVKSLETVEPHVKSVTEQQHIDYHFXXXXXXXXXXX 120
+GQ+RSLL QA RHH AQ+ F +KSLE VE HVK E+QHID
Sbjct: 192 EGQARSLLIQAVRHHTAQMRLFHTGLKSLEAVERHVKVAVEKQHIDCDLSVHGNEMEASE 251
Query: 121 XXXXXXXXXXXXXXXXXXSFDYGQIEQEQDVST--SRNSMELDQVELTLPRGSTAEAAKE 178
SFDY EQ+ + S+ + + ++D +L+ PR ST A
Sbjct: 252 DDDDDGRYMNREGEL---SFDYRTNEQKVEASSLSTPWATKMDDTDLSFPRPSTTRPAAV 308
Query: 179 NLDKLQRNLFSFRVR-TGSQSAPLFADNKPDSSEKLRQMRPSLSRKFSSYVLPTPVDAK- 236
N D + S R + S SAPLF + KPD SE+LRQ PS F++YVLPTP D++
Sbjct: 309 NADHREEYPVSTRDKYLSSHSAPLFPEKKPDVSERLRQANPS----FNAYVLPTPNDSRY 364
Query: 237 SSISSGSNNPKPSKMQTNSNEATTNLWHSSPLEQKKHETIRDEFSSPTVRNAQSVLKESN 296
S S + NP+P +N + N+WHSSPLE K + +D K++
Sbjct: 365 SKPVSQALNPRP------TNHSAGNIWHSSPLEPIK--SGKDG-------------KDAE 403
Query: 297 SNTATTRLPPPLVDGLLSSNHDYVSAYSKKIKRHAFSGPLTSNPMPTRPVSVDSVQMFXX 356
SN+ RLP P S D + RHAFSGPL P T+P+++
Sbjct: 404 SNSFYGRLPRP-------STTDTHHHQQQAAGRHAFSGPL--RPSSTKPITMADSYSGAF 454
Query: 357 XXXXXXXXXXXXXXXXXXXXXXXTI----VSSPKISELHELPRPPTNF---PSNSRLLGL 409
T SSP+++ELHELPRPP +F P ++ GL
Sbjct: 455 CPLPTPPVLQSHPHSSSSPRVSPTASPPPASSPRLNELHELPRPPGHFAPPPRRAKSPGL 514
Query: 410 VGYSGPLVPRGQK-------VSAPNNLVXXXXXXXXXXXXQAMARSFSIPSSGARVTXXX 462
VG+S PL Q+ V + N+V + RS+SIPS RV
Sbjct: 515 VGHSAPLTAWNQERSTVTVAVPSATNIV----ASPLPVPPLVVPRSYSIPSRNQRVVSQR 570
Query: 463 XXXXXXXXXXXXXXXDIASPPLTPIALSNSRPSSDG 498
+ASPPLTP++LS P + G
Sbjct: 571 LVERRDDI--------VASPPLTPMSLSRPLPQATG 598
>AT3G26910.1 | Symbols: | hydroxyproline-rich glycoprotein family
protein | chr3:9915338-9918511 REVERSE LENGTH=608
Length = 608
Score = 220 bits (560), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 179/516 (34%), Positives = 243/516 (47%), Gaps = 69/516 (13%)
Query: 1 MKRQCDEKRDVYEYMATRFRERGRSKGGKTETFSLQQLQTARDEYDEEATLFVFRLKSLK 60
MK+QCD KR+VYE + +E+GR K K E + + A E+ +EAT+ +FRLKSLK
Sbjct: 134 MKQQCDGKRNVYEM--SLVKEKGRPKSSKGERHIPPESRPAYSEFHDEATMCIFRLKSLK 191
Query: 61 QGQSRSLLTQAARHHAAQLCFFKKAVKSLETVEPHVKSVTEQQHIDYHFXXXXXXXXXXX 120
+GQ+RSLL QA RHH AQ+ F +KSLE VE HVK E+QHID
Sbjct: 192 EGQARSLLIQAVRHHTAQMRLFHTGLKSLEAVERHVKVAVEKQHIDCDLSVHGNEMEASE 251
Query: 121 XXXXXXXXXXXXXXXXXXSFDYGQIEQEQDVST--SRNSMELDQVELTLPRGSTAEAAKE 178
SFDY EQ+ + S+ + + ++D +L+ PR ST A
Sbjct: 252 DDDDDGRYMNREGEL---SFDYRTNEQKVEASSLSTPWATKMDDTDLSFPRPSTTRPAAV 308
Query: 179 NLDKLQRNLFSFRVR-TGSQSAPLFADNKPDSSEKLRQMRPSLSRKFSSYVLPTPVDAK- 236
N D + S R + S SAPLF + KPD SE+LRQ PS F++YVLPTP D++
Sbjct: 309 NADHREEYPVSTRDKYLSSHSAPLFPEKKPDVSERLRQANPS----FNAYVLPTPNDSRY 364
Query: 237 SSISSGSNNPKPSKMQTNSNEATTNLWHSSPLEQKKHETIRDEFSSPTVRNAQSVLKESN 296
S S + NP+P +N + N+WHSSPLE K + +D K++
Sbjct: 365 SKPVSQALNPRP------TNHSAGNIWHSSPLEPIK--SGKDG-------------KDAE 403
Query: 297 SNTATTRLPPPLVDGLLSSNHDYVSAYSKKIKRHAFSGPLTSNPMPTRPVSVDSVQMFXX 356
SN+ RLP P S D + RHAFSGPL P T+P+++
Sbjct: 404 SNSFYGRLPRP-------STTDTHHHQQQAAGRHAFSGPL--RPSSTKPITMADSYSGAF 454
Query: 357 XXXXXXXXXXXXXXXXXXXXXXXTI----VSSPKISELHELPRPPTNF---PSNSRLLGL 409
T SSP+++ELHELPRPP +F P ++ GL
Sbjct: 455 CPLPTPPVLQSHPHSSSSPRVSPTASPPPASSPRLNELHELPRPPGHFAPPPRRAKSPGL 514
Query: 410 VGYSGPLVPRGQK-------VSAPNNLVXXXXXXXXXXXXQAMARSFSIPSSGARVTXXX 462
VG+S PL Q+ V + N+V + RS+SIPS RV
Sbjct: 515 VGHSAPLTAWNQERSTVTVAVPSATNIV----ASPLPVPPLVVPRSYSIPSRNQRVVSQR 570
Query: 463 XXXXXXXXXXXXXXXDIASPPLTPIALSNSRPSSDG 498
+ASPPLTP++LS P + G
Sbjct: 571 LVERRDDI--------VASPPLTPMSLSRPLPQATG 598
>AT5G41100.2 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
plasma membrane; EXPRESSED IN: 23 plant structures;
EXPRESSED DURING: 13 growth stages; BEST Arabidopsis
thaliana protein match is: hydroxyproline-rich
glycoprotein family protein (TAIR:AT3G26910.2); Has 1497
Blast hits to 1191 proteins in 214 species: Archae - 4;
Bacteria - 102; Metazoa - 485; Fungi - 316; Plants -
187; Viruses - 37; Other Eukaryotes - 366 (source: NCBI
BLink). | chr5:16447429-16450686 FORWARD LENGTH=582
Length = 582
Score = 218 bits (554), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 184/507 (36%), Positives = 241/507 (47%), Gaps = 95/507 (18%)
Query: 1 MKRQCDEKRDVYEYMAT-RFRERGRSKGGKTETFSLQQLQTARDEYDEEATLFVFRLKSL 59
MK+QC+EKRDV ++M +++ + KG K E +QL+TARDE +EATL +FRLKSL
Sbjct: 133 MKQQCEEKRDVVKHMLMEHVKDKVQVKGTKGERLIRRQLETARDELQDEATLCIFRLKSL 192
Query: 60 KQGQSRSLLTQAARHHAAQLCFFKKAVKSLETVEPHVKSVTEQQHIDYHFXXXXXXXXXX 119
K+GQ+RSLLTQAARHH AQ+ F +KSLE VE HV+ ++QHID
Sbjct: 193 KEGQARSLLTQAARHHTAQMHMFFAGLKSLEAVEQHVRIAADRQHIDCVL---SDPGNEM 249
Query: 120 XXXXXXXXXXXXXXXXXXXSFDYGQIEQEQDV-STSRNSMELDQVELTLPRGSTAEAAKE 178
SFDY EQ +V ST SM++D +L+ R S A +A
Sbjct: 250 DCSEDNDDDDRLVNRDGELSFDYITSEQRVEVISTPHGSMKMDDTDLSFQRPSPAGSATV 309
Query: 179 NLDKLQRNLFSFR-VRTGSQSAPLFADNKPDSSEK-LRQMRPSLSRKFSSYVLPTPVDAK 236
N D + + S R RT S SAPLF D K D +++ +RQM PS ++Y+LPTPVD+K
Sbjct: 310 NADPREEHSVSNRDRRTSSHSAPLFPDKKADLADRSMRQMTPSA----NAYILPTPVDSK 365
Query: 237 SSISSGSNNPKPSKMQTNSNEATTNLWHSSPLEQKKHETIRDEFSSPTVRNAQSVLKESN 296
SS P +K T +N + NLWHSSPLE K + K++
Sbjct: 366 SS-------PIFTKPVTQTNH-SANLWHSSPLEPIK-----------------TAHKDAE 400
Query: 297 SNTATTRLPPPLVDGLLSSNHDYVSAYSKKIKRHAFSGPL--TSNPMPTRPVSVDSVQMF 354
SN +RLP P HAFSGPL +S +P PV+V +
Sbjct: 401 SNL-YSRLPRP--------------------SEHAFSGPLKPSSTRLPV-PVAVQA---- 434
Query: 355 XXXXXXXXXXXXXXXXXXXXXXXXXTIVSSPKISELHELPRPPTNF--PSNSRLLGLVGY 412
+ SSP+I+ELHELPRPP F P S+ GLVG+
Sbjct: 435 ------------QSSSPRISPTASPPLASSPRINELHELPRPPGQFAPPRRSKSPGLVGH 482
Query: 413 SGPLVPRGQK---VSAPNNLVXXXXXXXXXXXXQAMARSFSIPSSGARVTXXXXXXXXXX 469
S PL Q+ V N+V + RS+SIPS R
Sbjct: 483 SAPLTAWNQERSNVVVSTNIV----ASPLPVPPLVVPRSYSIPSRNQRAMAQQPLPERNQ 538
Query: 470 XXXXXXXXDIASP---PLTPIALSNSR 493
+ASP PLTP +L N R
Sbjct: 539 NR-------VASPPPLPLTPASLMNLR 558
>AT5G41100.1 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
plasma membrane; EXPRESSED IN: 23 plant structures;
EXPRESSED DURING: 13 growth stages; BEST Arabidopsis
thaliana protein match is: hydroxyproline-rich
glycoprotein family protein (TAIR:AT3G26910.2); Has 1503
Blast hits to 1197 proteins in 220 species: Archae - 4;
Bacteria - 108; Metazoa - 481; Fungi - 318; Plants -
186; Viruses - 39; Other Eukaryotes - 367 (source: NCBI
BLink). | chr5:16447429-16450610 FORWARD LENGTH=586
Length = 586
Score = 218 bits (554), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 184/507 (36%), Positives = 241/507 (47%), Gaps = 95/507 (18%)
Query: 1 MKRQCDEKRDVYEYMAT-RFRERGRSKGGKTETFSLQQLQTARDEYDEEATLFVFRLKSL 59
MK+QC+EKRDV ++M +++ + KG K E +QL+TARDE +EATL +FRLKSL
Sbjct: 133 MKQQCEEKRDVVKHMLMEHVKDKVQVKGTKGERLIRRQLETARDELQDEATLCIFRLKSL 192
Query: 60 KQGQSRSLLTQAARHHAAQLCFFKKAVKSLETVEPHVKSVTEQQHIDYHFXXXXXXXXXX 119
K+GQ+RSLLTQAARHH AQ+ F +KSLE VE HV+ ++QHID
Sbjct: 193 KEGQARSLLTQAARHHTAQMHMFFAGLKSLEAVEQHVRIAADRQHIDCVL---SDPGNEM 249
Query: 120 XXXXXXXXXXXXXXXXXXXSFDYGQIEQEQDV-STSRNSMELDQVELTLPRGSTAEAAKE 178
SFDY EQ +V ST SM++D +L+ R S A +A
Sbjct: 250 DCSEDNDDDDRLVNRDGELSFDYITSEQRVEVISTPHGSMKMDDTDLSFQRPSPAGSATV 309
Query: 179 NLDKLQRNLFSFR-VRTGSQSAPLFADNKPDSSEK-LRQMRPSLSRKFSSYVLPTPVDAK 236
N D + + S R RT S SAPLF D K D +++ +RQM PS ++Y+LPTPVD+K
Sbjct: 310 NADPREEHSVSNRDRRTSSHSAPLFPDKKADLADRSMRQMTPSA----NAYILPTPVDSK 365
Query: 237 SSISSGSNNPKPSKMQTNSNEATTNLWHSSPLEQKKHETIRDEFSSPTVRNAQSVLKESN 296
SS P +K T +N + NLWHSSPLE K + K++
Sbjct: 366 SS-------PIFTKPVTQTNH-SANLWHSSPLEPIK-----------------TAHKDAE 400
Query: 297 SNTATTRLPPPLVDGLLSSNHDYVSAYSKKIKRHAFSGPL--TSNPMPTRPVSVDSVQMF 354
SN +RLP P HAFSGPL +S +P PV+V +
Sbjct: 401 SNL-YSRLPRP--------------------SEHAFSGPLKPSSTRLPV-PVAVQA---- 434
Query: 355 XXXXXXXXXXXXXXXXXXXXXXXXXTIVSSPKISELHELPRPPTNF--PSNSRLLGLVGY 412
+ SSP+I+ELHELPRPP F P S+ GLVG+
Sbjct: 435 ------------QSSSPRISPTASPPLASSPRINELHELPRPPGQFAPPRRSKSPGLVGH 482
Query: 413 SGPLVPRGQK---VSAPNNLVXXXXXXXXXXXXQAMARSFSIPSSGARVTXXXXXXXXXX 469
S PL Q+ V N+V + RS+SIPS R
Sbjct: 483 SAPLTAWNQERSNVVVSTNIV----ASPLPVPPLVVPRSYSIPSRNQRAMAQQPLPERNQ 538
Query: 470 XXXXXXXXDIASP---PLTPIALSNSR 493
+ASP PLTP +L N R
Sbjct: 539 NR-------VASPPPLPLTPASLMNLR 558
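
The per-alignment statistics lines in this report ("Score = ...", "Identities = ...") follow a regular layout, so they can be pulled out with two regular expressions. The following is a minimal sketch, not an official parser: the names `parse_hsp_stats` and `truncated_percent` are hypothetical, and the patterns assume exactly the line layout seen in this report. The percentages printed here are consistent with integer truncation (for example 179/516 is shown as 34%), which the second helper reproduces.

```python
import re

# Patterns written against the statistics lines as they appear in this report.
SCORE_RE = re.compile(r"Score = (\d+) bits \((\d+)\), Expect = (\S+?),")
FRACTION_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_hsp_stats(score_line, identity_line):
    """Extract bit score, raw score, E value and the three count fractions."""
    match = SCORE_RE.search(score_line)
    if match is None:
        raise ValueError("not a BLAST score line: %r" % score_line)
    stats = {
        "bits": int(match.group(1)),
        "raw_score": int(match.group(2)),
        "expect": float(match.group(3)),
    }
    # Each fraction is stored as (count, alignment_length, reported_percent).
    for name, num, den, pct in FRACTION_RE.findall(identity_line):
        stats[name.lower()] = (int(num), int(den), int(pct))
    return stats


def truncated_percent(count, length):
    """Integer percentage rounded down, matching the values in this report."""
    return count * 100 // length


if __name__ == "__main__":
    # Lines copied from the first alignment (AT2G33490.1).
    stats = parse_hsp_stats(
        "Score = 322 bits (826), Expect = 3e-88, Method: Compositional matrix adjust.",
        "Identities = 220/494 (44%), Positives = 277/494 (56%), Gaps = 31/494 (6%)",
    )
    print(stats)
    for key in ("identities", "positives", "gaps"):
        count, length, reported = stats[key]
        print(key, truncated_percent(count, length), "vs reported", reported)
```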