Miyakogusa Predicted Gene
- Lj2g3v0126210.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj2g3v0126210.1 tr|Q9MB08|Q9MB08_HELAN Multicystatin
OS=Helianthus annuus GN=smc PE=2
SV=1,56.32,4e-17,Cystatin/monellin,NULL; Cystatin-like
domain,Proteinase inhibitor I25, cystatin;
CYSTATIN,Proteinase,NODE_17764_length_320_cov_1150.887451.path1.1
(109 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value
AT3G12490.1 | Symbols: ATCYSB, ATCYS6, CYSB | cystatin B | chr3:...   153   3e-38
AT3G12490.2 | Symbols: ATCYSB, ATCYS6, CYSB | cystatin B | chr3:...   152   4e-38
AT5G05110.1 | Symbols: | Cystatin/monellin family protein | chr...    128   9e-31
AT2G40880.1 | Symbols: FL3-27, ATCYSA, CYSA | cystatin A | chr2:...   126   3e-30
AT5G12140.1 | Symbols: ATCYS1, CYS1 | cystatin-1 | chr5:3923295-...   102   5e-23
AT4G16500.1 | Symbols: | Cystatin/monellin superfamily protein ...     57   4e-09
AT5G47550.1 | Symbols: | Cystatin/monellin superfamily protein ...     55   1e-08
AT2G31980.1 | Symbols: AtCYS2, CYS2 | PHYTOCYSTATIN 2 | chr2:136...    52   7e-08
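Each row of the summary table above ends with two whitespace-separated fields, the bit score and the E-value. A minimal Python sketch for extracting them, assuming every hit row keeps that layout and contains a `|` in its description; `parse_hit_rows` is a hypothetical helper name, not part of BLAST:

```python
def parse_hit_rows(lines):
    """Return (hit_id, bit_score, e_value) tuples from summary-table rows."""
    hits = []
    for line in lines:
        # Split off the last two fields; everything before them is the description.
        parts = line.rsplit(None, 2)
        if len(parts) != 3 or "|" not in parts[0]:
            continue  # header lines and anything that is not a hit row
        score_text, evalue_text = parts[1], parts[2]
        # Some legacy BLAST reports abbreviate 1e-100 as "e-100".
        if evalue_text.startswith("e"):
            evalue_text = "1" + evalue_text
        try:
            score, evalue = float(score_text), float(evalue_text)
        except ValueError:
            continue
        hits.append((parts[0].split()[0], score, evalue))
    return hits


rows = [
    "Sequences producing significant alignments: (bits) Value",
    "AT3G12490.1 | Symbols: ATCYSB, ATCYS6, CYSB | cystatin B | chr3:... 153 3e-38",
    "AT2G31980.1 | Symbols: AtCYS2, CYS2 | PHYTOCYSTATIN 2 | chr2:136... 52 7e-08",
]
hits = parse_hit_rows(rows)
```

Because the split is on runs of whitespace, the sketch does not depend on how the columns are padded.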
>AT3G12490.1 | Symbols: ATCYSB, ATCYS6, CYSB | cystatin B |
chr3:3960523-3961777 REVERSE LENGTH=201
Length = 201
Score = 153 bits (386), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 73/105 (69%), Positives = 88/105 (83%), Gaps = 1/105 (0%)
Query: 5   LGGIRDSQGFQNSLEIDALARFAVDEHNNKQNALLEFGRVVKAQEQVVAGSMHHLTLEAI 64
           +GG+ D    QNS E+++LARFAVDEHN K+NALLEF RVVKA+EQVVAG++HHLTLE +
Sbjct: 4   VGGVGDVPANQNSGEVESLARFAVDEHNKKENALLEFARVVKAKEQVVAGTLHHLTLEIL 63

Query: 65  DGGEKKIYEAKVWVKSWLNFKELQEFKEVAGDAPLFTTSDLGVKK 109
           + G+KK+YEAKVWVK WLNFKELQEFK  A DAP  T+SDLG K+
Sbjct: 64  EAGQKKLYEAKVWVKPWLNFKELQEFKP-ASDAPAITSSDLGCKQ 107
>AT3G12490.2 | Symbols: ATCYSB, ATCYS6, CYSB | cystatin B |
chr3:3960523-3961876 REVERSE LENGTH=234
Length = 234
Score = 152 bits (385), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 73/105 (69%), Positives = 88/105 (83%), Gaps = 1/105 (0%)
Query: 5   LGGIRDSQGFQNSLEIDALARFAVDEHNNKQNALLEFGRVVKAQEQVVAGSMHHLTLEAI 64
           +GG+ D    QNS E+++LARFAVDEHN K+NALLEF RVVKA+EQVVAG++HHLTLE +
Sbjct: 37  VGGVGDVPANQNSGEVESLARFAVDEHNKKENALLEFARVVKAKEQVVAGTLHHLTLEIL 96

Query: 65  DGGEKKIYEAKVWVKSWLNFKELQEFKEVAGDAPLFTTSDLGVKK 109
           + G+KK+YEAKVWVK WLNFKELQEFK  A DAP  T+SDLG K+
Sbjct: 97  EAGQKKLYEAKVWVKPWLNFKELQEFKP-ASDAPAITSSDLGCKQ 140
>AT5G05110.1 | Symbols: | Cystatin/monellin family protein |
chr5:1507615-1508765 REVERSE LENGTH=232
Length = 232
Score = 128 bits (321), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 65/106 (61%), Positives = 78/106 (73%), Gaps = 4/106 (3%)
Query: 4 KLGGIRDSQG-FQNSLEIDALARFAVDEHNNKQNALLEFGRVVKAQEQVVAGSMHHLTLE 62
KLGG DS+ + EID +A FAV EHN ++NA+LE RV+KA EQVVAG ++ LTLE
Sbjct: 44 KLGGFSDSKNDWNGGKEIDDIALFAVQEHNRRENAVLELARVLKATEQVVAGKLYRLTLE 103
Query: 63 AIDGGEKKIYEAKVWVKSWLNFKELQEFKEVAGDAPLFTTSDLGVK 108
I+ GEKKIYEAKVWVK W+NFK+LQEFK + P FT SDLG K
Sbjct: 104 VIEAGEKKIYEAKVWVKPWMNFKQLQEFKNI---IPSFTISDLGFK 146
>AT2G40880.1 | Symbols: FL3-27, ATCYSA, CYSA | cystatin A |
chr2:17057463-17057930 FORWARD LENGTH=125
Length = 125
Score = 126 bits (317), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 58/88 (65%), Positives = 75/88 (85%)
Query: 5   LGGIRDSQGFQNSLEIDALARFAVDEHNNKQNALLEFGRVVKAQEQVVAGSMHHLTLEAI 64
           LGG+ D +G QNS EI++LARFA+ EHN +QN +LEF ++VKA+EQVVAG+M+HLTLEA
Sbjct: 35  LGGVHDLRGNQNSGEIESLARFAIQEHNKQQNKILEFKKIVKAREQVVAGTMYHLTLEAK 94

Query: 65  DGGEKKIYEAKVWVKSWLNFKELQEFKE 92
           +G + K +EAKVWVK W+NFK+LQEFKE
Sbjct: 95  EGDQTKNFEAKVWVKPWMNFKQLQEFKE 122
>AT5G12140.1 | Symbols: ATCYS1, CYS1 | cystatin-1 |
chr5:3923295-3923936 REVERSE LENGTH=101
Length = 101
Score = 102 bits (254), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 47/91 (51%), Positives = 65/91 (71%)
Query: 3   TKLGGIRDSQGFQNSLEIDALARFAVDEHNNKQNALLEFGRVVKAQEQVVAGSMHHLTLE 62
           T +GG+RD     N L++++LARFAVDEHN  +N  LE+ R++ A+ QVVAG+MHHLT+E
Sbjct: 8   TIVGGVRDIDANANDLQVESLARFAVDEHNKNENLTLEYKRLLGAKTQVVAGTMHHLTVE 67

Query: 63  AIDGGEKKIYEAKVWVKSWLNFKELQEFKEV 93
             DG   K+YEAKV  K+W N K+L+ F  +
Sbjct: 68  VADGETNKVYEAKVLEKAWENLKQLESFNHL 98
>AT4G16500.1 | Symbols: | Cystatin/monellin superfamily protein |
chr4:9301530-9301883 REVERSE LENGTH=117
Length = 117
Score = 56.6 bits (135), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 32/74 (43%), Positives = 48/74 (64%), Gaps = 1/74 (1%)
Query: 19 EIDALARFAVDEHNNKQNALLEFGRVVKAQEQVVAGSMHHLTLEAIDGGEK-KIYEAKVW 77
++ A+A++A++EHN + L F +VV+ QVV+G+ + L + A DGG K K YEA V
Sbjct: 42 DVVAVAKYAIEEHNKESKEKLVFVKVVEGTTQVVSGTKYDLKIAAKDGGGKIKNYEAVVV 101
Query: 78 VKSWLNFKELQEFK 91
K WL+ K L+ FK
Sbjct: 102 EKLWLHSKSLESFK 115
>AT5G47550.1 | Symbols: | Cystatin/monellin superfamily protein |
chr5:19286596-19286964 REVERSE LENGTH=122
Length = 122
Score = 54.7 bits (130), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 41/71 (57%), Gaps = 1/71 (1%)
Query: 23 LARFAVDEHNNKQNALLEFGRVVKAQEQVVAGSMHHLTLEAIDG-GEKKIYEAKVWVKSW 81
+ FAV E+N + + L+F VV + QVV+G+ + L + A DG G K Y A VW K W
Sbjct: 45 IGEFAVSEYNKRSESGLKFETVVSGETQVVSGTNYRLKVAANDGDGVSKNYLAIVWDKPW 104
Query: 82 LNFKELQEFKE 92
+ F+ L F+
Sbjct: 105 MKFRNLTSFEP 115
>AT2G31980.1 | Symbols: AtCYS2, CYS2 | PHYTOCYSTATIN 2 |
chr2:13609246-13609770 REVERSE LENGTH=147
Length = 147
Score = 52.4 bits (124), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 58/114 (50%), Gaps = 20/114 (17%)
Query: 5 LGGIRDSQGFQNSLEIDALARFAVDEHN----NKQNAL-------------LEFGRVVKA 47
LGG + + EI L R+ V++ N N+Q + L+F RVV A
Sbjct: 36 LGGKSGVPNIRTNREIQQLGRYCVEQFNQQAQNEQGNIGSIAKTDTAISNPLQFSRVVSA 95
Query: 48 QEQVVAGSMHHLTLEAI-DGGEKKIYEAKVWVKSWLNFKELQEFKEVAGDAPLF 100
Q+QVVAG ++L +E G +++++ V ++ WL+ K+L F V +P++
Sbjct: 96 QKQVVAGLKYYLRIEVTQPNGSTRMFDSVVVIQPWLHSKQLLGFTPVV--SPVY 147
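The percentages printed on the "Identities = ..." lines of this report are consistent with truncating, not rounding, the underlying ratio (for example 73/105 is about 69.5% and is shown as 69%). A minimal Python sketch that recomputes them from one statistics line, assuming the `name = a/b` layout shown above; `alignment_percentages` is a hypothetical helper name:

```python
import re

# Captures the name and the "numerator/denominator" pair of each statistic.
STATS = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+)")


def alignment_percentages(stats_line):
    """Return truncated integer percentages for each statistic on the line."""
    return {
        name: 100 * int(num) // int(den)  # floor division mirrors the truncation seen in the report
        for name, num, den in STATS.findall(stats_line)
    }


line = "Identities = 73/105 (69%), Positives = 88/105 (83%), Gaps = 1/105 (0%)"
pct = alignment_percentages(line)
```

Lines that omit the Gaps field, as for the AT2G40880.1 and AT5G12140.1 hits, simply yield a dictionary without that key.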