Miyakogusa Predicted Gene
- Lj0g3v0188049.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0188049.1 Non Chatacterized Hit- tr|I3SN90|I3SN90_LOTJA
Uncharacterized protein OS=Lotus japonicus PE=2
SV=1,99.44,0,Cupin,Cupin 1; RmlC-like cupins,RmlC-like cupin domain;
11SGLOBULIN,11-S seed storage protein, plant,CUFF.11986.1
(356 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT2G28680.1 | Symbols: | RmlC-like cupins superfamily protein |... 549 e-157
AT1G07750.1 | Symbols: | RmlC-like cupins superfamily protein |... 542 e-154
AT1G03890.1 | Symbols: | RmlC-like cupins superfamily protein |... 114 7e-26
AT1G03880.1 | Symbols: CRU2, CRB | cruciferin 2 | chr1:985786-98... 92 5e-19
AT5G44120.3 | Symbols: CRA1, ATCRA1, CRU1 | RmlC-like cupins sup... 84 1e-16
AT5G44120.2 | Symbols: CRA1, ATCRA1, CRU1 | RmlC-like cupins sup... 70 2e-12
AT4G28520.1 | Symbols: CRU3, CRC | cruciferin 3 | chr4:14087596-... 61 1e-09
AT4G28520.3 | Symbols: CRU3, CRC | cruciferin 3 | chr4:14087596-... 55 1e-07
AT3G22640.1 | Symbols: PAP85 | cupin family protein | chr3:80119... 50 3e-06
>AT2G28680.1 | Symbols: | RmlC-like cupins superfamily protein |
chr2:12303118-12304747 REVERSE LENGTH=356
Length = 356
Score = 549 bits (1415), Expect = e-157, Method: Compositional matrix adjust.
Identities = 268/354 (75%), Positives = 297/354 (83%)
Query: 1 MAIDLSPQLAKKLYGGNGGSYHAWSPSELPMLREGNIGASKLALQKNGFALPRYSDSSKV 60
M +DLSP+L KK+YGG+GGSY AW P ELPMLR+GNIGASKLAL+K G ALPRYSDS KV
Sbjct: 1 MELDLSPRLPKKVYGGDGGSYFAWCPEELPMLRDGNIGASKLALEKYGLALPRYSDSPKV 60
Query: 61 AYVLQGTGVVGIVLPESEEKVVAIKKGDALALPFGVVTWWYNKEDTELVVLFLGDTSKAH 120
AYVLQG G GIVLPE EEKV+AIKKGD++ALPFGVVTWW+N EDTELVVLFLG+T K H
Sbjct: 61 AYVLQGAGTAGIVLPEKEEKVIAIKKGDSIALPFGVVTWWFNNEDTELVVLFLGETHKGH 120
Query: 121 KAGEFTDFYLTGSNGIFTGFSTEFVGRAWDLKENEVKTLVGNQSAKGIVKLEGNITIPEP 180
KAG+FTDFYLTGSNGIFTGFSTEFVGRAWDL E VK LVG+Q+ GIVK++ ++ +PEP
Sbjct: 121 KAGQFTDFYLTGSNGIFTGFSTEFVGRAWDLDETTVKKLVGSQTGNGIVKVDASLKMPEP 180
Query: 181 KPDHRDGMALNCLEAALDVDIKDGGRVVVLNTKNLPLVGEVGLGADLVRLDGGAMCSPGF 240
K R G LNCLEA LDVDIKDGGRVVVLNTKNLPLVGEVG GADLVR+DG +MCSPGF
Sbjct: 181 KKGDRKGFVLNCLEAPLDVDIKDGGRVVVLNTKNLPLVGEVGFGADLVRIDGHSMCSPGF 240
Query: 241 SCDSALQVTYIXXXXXXXXXXXXXXKRVLETTVKAGNLFIVPRFYVVSKIADPEGLEWFS 300
SCDSALQVTYI KRVLET VKAG LFIVPRF+VVSKIAD +GL WFS
Sbjct: 241 SCDSALQVTYIVGGSGRVQIVGADGKRVLETHVKAGVLFIVPRFFVVSKIADSDGLSWFS 300
Query: 301 IISTPNPIFTHLAGSSSVWKALSPTVLQAAFNVDSGVEQLFRSKRTADAIFFPP 354
I++TP+PIFTHLAG +SVWKALSP VLQAAF VD VE+ FRSKRT+DAIFF P
Sbjct: 301 IVTTPDPIFTHLAGRTSVWKALSPEVLQAAFKVDPEVEKAFRSKRTSDAIFFSP 354
>AT1G07750.1 | Symbols: | RmlC-like cupins superfamily protein |
chr1:2404300-2405863 REVERSE LENGTH=356
Length = 356
Score = 542 bits (1396), Expect = e-154, Method: Compositional matrix adjust.
Identities = 257/354 (72%), Positives = 300/354 (84%)
Query: 1 MAIDLSPQLAKKLYGGNGGSYHAWSPSELPMLREGNIGASKLALQKNGFALPRYSDSSKV 60
M +DL+P+L KK+YGG+GGSY AW P ELPML++GNIGA+KLAL+KNGFA+PRYSDSSKV
Sbjct: 1 MELDLTPKLPKKVYGGDGGSYSAWCPEELPMLKQGNIGAAKLALEKNGFAVPRYSDSSKV 60
Query: 61 AYVLQGTGVVGIVLPESEEKVVAIKKGDALALPFGVVTWWYNKEDTELVVLFLGDTSKAH 120
AYVLQG+G GIVLPE EEKV+AIK+GD++ALPFGVVTWW+N ED ELV+LFLG+T K H
Sbjct: 61 AYVLQGSGTAGIVLPEKEEKVIAIKQGDSIALPFGVVTWWFNNEDPELVILFLGETHKGH 120
Query: 121 KAGEFTDFYLTGSNGIFTGFSTEFVGRAWDLKENEVKTLVGNQSAKGIVKLEGNITIPEP 180
KAG+FT+FYLTG+NGIFTGFSTEFVGRAWDL EN VK LVG+Q+ GIVKL+ +P+P
Sbjct: 121 KAGQFTEFYLTGTNGIFTGFSTEFVGRAWDLDENTVKKLVGSQTGNGIVKLDAGFKMPQP 180
Query: 181 KPDHRDGMALNCLEAALDVDIKDGGRVVVLNTKNLPLVGEVGLGADLVRLDGGAMCSPGF 240
K ++R G LNCLEA LDVDIKDGGRVVVLNTKNLPLVGEVG GADLVR+D +MCSPGF
Sbjct: 181 KEENRAGFVLNCLEAPLDVDIKDGGRVVVLNTKNLPLVGEVGFGADLVRIDAHSMCSPGF 240
Query: 241 SCDSALQVTYIXXXXXXXXXXXXXXKRVLETTVKAGNLFIVPRFYVVSKIADPEGLEWFS 300
SCDSALQVTYI KRVLET +KAG+LFIVPRF+VVSKIAD +G+ WFS
Sbjct: 241 SCDSALQVTYIVGGSGRVQVVGGDGKRVLETHIKAGSLFIVPRFFVVSKIADADGMSWFS 300
Query: 301 IISTPNPIFTHLAGSSSVWKALSPTVLQAAFNVDSGVEQLFRSKRTADAIFFPP 354
I++TP+PIFTHLAG++SVWK+LSP VLQAAF V VE+ FRS RT+ AIFFPP
Sbjct: 301 IVTTPDPIFTHLAGNTSVWKSLSPEVLQAAFKVAPEVEKSFRSTRTSSAIFFPP 354
>AT1G03890.1 | Symbols: | RmlC-like cupins superfamily protein |
chr1:989250-990908 FORWARD LENGTH=451
Length = 451
Score = 114 bits (286), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 99/389 (25%), Positives = 164/389 (42%), Gaps = 64/389 (16%)
Query: 5 LSPQLAKKLYGGNGGSYHAWSPSELPMLREGNIGASKLALQKNGFALPRYSDSSKVAYVL 64
L+P A K G + SP LR + +++ LQ N LP + +AYV+
Sbjct: 44 LAPAQATKFEAGQMEVWDHMSPE----LRCAGVTVARITLQPNSIFLPAFFSPPALAYVV 99
Query: 65 QGTGVVGIV---LPES-----------------------EEKVVAIKKGDALALPFGVVT 98
QG GV+G + PE+ +K+ ++GD A GV
Sbjct: 100 QGEGVMGTIASGCPETFAEVEGSSGRGGGGDPGRRFEDMHQKLENFRRGDVFASLAGVSQ 159
Query: 99 WWYNKEDTELVVLFLGD-TSKAHKAGEFTD-FYLTGS--------------NGIFTGFST 142
WWYN+ D++ V++ + D T++ ++ + F L GS N F+GF
Sbjct: 160 WWYNRGDSDAVIVIVLDVTNRENQLDQVPRMFQLAGSRTQEEEQPLTWPSGNNAFSGFDP 219
Query: 143 EFVGRAWDLKENEVKTLVGNQSAKG-IVKLEG--NITIPEPKPDHRDGMALNCLE----A 195
+ A+ + K L + +G I++ G + IP P+ +DG+A E A
Sbjct: 220 NIIAEAFKINIETAKQLQNQKDNRGNIIRANGPLHFVIPPPREWQQDGIANGIEETYCTA 279
Query: 196 ALDVDIKD----------GGRVVVLNTKNLPLVGEVGLGADLVRLDGGAMCSPGFSCDSA 245
+ +I D GR+ LN+ NLP++ V L A L G M P ++ + A
Sbjct: 280 KIHENIDDPERSDHFSTRAGRISTLNSLNLPVLRLVRLNALRGYLYSGGMVLPQWTAN-A 338
Query: 246 LQVTYIXXXXXXXXXXXXXXKRVLETTVKAGNLFIVPRFYVVSKIADPEGLEWFSIISTP 305
V Y+ + V V G + ++P+ + VSK A G EW S +
Sbjct: 339 HTVLYVTGGQAKIQVVDDNGQSVFNEQVGQGQIIVIPQGFAVSKTAGETGFEWISFKTND 398
Query: 306 NPIFTHLAGSSSVWKALSPTVLQAAFNVD 334
N L+G +S +A+ V++A++ V+
Sbjct: 399 NAYINTLSGQTSYLRAVPVDVIKASYGVN 427
>AT1G03880.1 | Symbols: CRU2, CRB | cruciferin 2 |
chr1:985786-987916 FORWARD LENGTH=455
Length = 455
Score = 92.0 bits (227), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 84/380 (22%), Positives = 146/380 (38%), Gaps = 66/380 (17%)
Query: 18 GGSYHAWSPSELPMLREGNIGASKLALQKNGFALPRYSDSSKVAYVLQGTGVVGIVLP-- 75
GG W P LR + ++ G LP + ++ K+ +V+ G G++G V+P
Sbjct: 48 GGRIEVWD-HHAPQLRCSGFAFERFVIEPQGLFLPTFLNAGKLTFVVHGRGLMGRVIPGC 106
Query: 76 -----------------------ESEEKVVAIKKGDALALPFGVVTWWYNKEDTELVVLF 112
+ +KV ++ GD +A P GV W+YN + L+++
Sbjct: 107 AETFMESPVFGEGQGQGQSQGFRDMHQKVEHLRCGDTIATPSGVAQWFYNNGNEPLILVA 166
Query: 113 LGD--TSKAHKAGEFTDFYLTGSN----------------GIFTGFSTEFVGRAWDLKEN 154
D +++ F + G+N IF GF+ E + +A+ +
Sbjct: 167 AADLASNQNQLDRNLRPFLIAGNNPQGQEWLQGRKQQKQNNIFNGFAPEILAQAFKINVE 226
Query: 155 EVKTLVGNQSAKG-IVKLEGNITIPEPKPDHRDG---------------MALNCLE---- 194
+ L Q +G IVK+ G + P +G + C E
Sbjct: 227 TAQQLQNQQDNRGNIVKVNGPFGVIRPPLRRGEGGQQPHEIANGLEETLCTMRCTENLDD 286
Query: 195 -AALDVDIKDGGRVVVLNTKNLPLVGEVGLGADLVRLDGGAMCSPGFSCDSALQVTYIXX 253
+ DV G + LN+ NLP++ + L A + AM P ++ + A Y+
Sbjct: 287 PSDADVYKPSLGYISTLNSYNLPILRLLRLSALRGSIRKNAMVLPQWNVN-ANAALYVTN 345
Query: 254 XXXXXXXXXXXXKRVLETTVKAGNLFIVPRFYVVSKIADPEGLEWFSIISTPNPIFTHLA 313
+RV + + +G L +VP+ + V K A E EW + N LA
Sbjct: 346 GKAHIQMVNDNGERVFDQEISSGQLLVVPQGFSVMKHAIGEQFEWIEFKTNENAQVNTLA 405
Query: 314 GSSSVWKALSPTVLQAAFNV 333
G +SV + L V+ + +
Sbjct: 406 GRTSVMRGLPLEVITNGYQI 425
>AT5G44120.3 | Symbols: CRA1, ATCRA1, CRU1 | RmlC-like cupins
superfamily protein | chr5:17756460-17758246 REVERSE
LENGTH=472
Length = 472
Score = 84.0 bits (206), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 83/387 (21%), Positives = 145/387 (37%), Gaps = 73/387 (18%)
Query: 18 GGSYHAWSPSELPMLREGNIGASKLALQKNGFALPRYSDSSKVAYVLQGTGVVGIVLP-- 75
G W P LR + ++ ++ G LP + +++K+++V +G G++G V+P
Sbjct: 54 AGRIEVWD-HHAPQLRCSGVSFARYIIESKGLYLPSFFNTAKLSFVAKGRGLMGKVIPGC 112
Query: 76 ------------------------ESEEKVVAIKKGDALALPFGVVTWWYNKEDTELVVL 111
+ +KV I+ GD +A GV W+YN LV++
Sbjct: 113 AETFQDSSEFQPRFEGQGQSQRFRDMHQKVEHIRSGDTIATTPGVAQWFYNDGQEPLVIV 172
Query: 112 FLGDTSKAHKAGEFT--DFYLTGSN----------------GIFTGFSTEFVGRAWDLKE 153
+ D + + FYL G+N IF GF E + +A +
Sbjct: 173 SVFDLASHQNQLDRNPRPFYLAGNNPQGQVWLQGREQQPQKNIFNGFGPEVIAQALKIDL 232
Query: 154 NEVKTLVGNQSAKG-IVKLEGNITIPEP-----KPD-----------HRDGMALNCLEAA 196
+ L +G IV+++G + P +P H +G+ A
Sbjct: 233 QTAQQLQNQDDNRGNIVRVQGPFGVIRPPLRGQRPQEEEEEEGRHGRHGNGLEETICSAR 292
Query: 197 LDVDIKDGGR----------VVVLNTKNLPLVGEVGLGADLVRLDGGAMCSPGFSCDSAL 246
++ D R + LN+ +LP++ + L A + AM P ++ + A
Sbjct: 293 CTDNLDDPSRADVYKPQLGYISTLNSYDLPILRFIRLSALRGSIRQNAMVLPQWNAN-AN 351
Query: 247 QVTYIXXXXXXXXXXXXXXKRVLETTVKAGNLFIVPRFYVVSKIADPEGLEWFSIISTPN 306
+ Y+ RV + V G L VP+ + V K A +W + N
Sbjct: 352 AILYVTDGEAQIQIVNDNGNRVFDGQVSQGQLIAVPQGFSVVKRATSNRFQWVEFKTNAN 411
Query: 307 PIFTHLAGSSSVWKALSPTVLQAAFNV 333
LAG +SV + L V+ F +
Sbjct: 412 AQINTLAGRTSVLRGLPLEVITNGFQI 438
>AT5G44120.2 | Symbols: CRA1, ATCRA1, CRU1 | RmlC-like cupins
superfamily protein | chr5:17756460-17757811 REVERSE
LENGTH=368
Length = 368
Score = 70.1 bits (170), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 72/325 (22%), Positives = 121/325 (37%), Gaps = 47/325 (14%)
Query: 54 YSDSSKVAYVLQGTGVVGIVLPESEEKVVAIKKGDALALPFGVVTWWYNKEDTELVVLFL 113
+ DSS+ +G G + +KV I+ GD +A GV W+YN LV++ +
Sbjct: 12 FQDSSEFQPRFEGQGQ-SQRFRDMHQKVEHIRSGDTIATTPGVAQWFYNDGQEPLVIVSV 70
Query: 114 GDTSKAHKAGEFTD--FYLTGSN----------------GIFTGFSTEFVGRAWDLKENE 155
D + + FYL G+N IF GF E + +A +
Sbjct: 71 FDLASHQNQLDRNPRPFYLAGNNPQGQVWLQGREQQPQKNIFNGFGPEVIAQALKIDLQT 130
Query: 156 VKTLVGNQSAKG-IVKLEGNITIPEP-----KPDHR-----------DGMALNCLEAALD 198
+ L +G IV+++G + P +P +G+ A
Sbjct: 131 AQQLQNQDDNRGNIVRVQGPFGVIRPPLRGQRPQEEEEEEGRHGRHGNGLEETICSARCT 190
Query: 199 VDIKDGGR----------VVVLNTKNLPLVGEVGLGADLVRLDGGAMCSPGFSCDSALQV 248
++ D R + LN+ +LP++ + L A + AM P ++ + A +
Sbjct: 191 DNLDDPSRADVYKPQLGYISTLNSYDLPILRFIRLSALRGSIRQNAMVLPQWNAN-ANAI 249
Query: 249 TYIXXXXXXXXXXXXXXKRVLETTVKAGNLFIVPRFYVVSKIADPEGLEWFSIISTPNPI 308
Y+ RV + V G L VP+ + V K A +W + N
Sbjct: 250 LYVTDGEAQIQIVNDNGNRVFDGQVSQGQLIAVPQGFSVVKRATSNRFQWVEFKTNANAQ 309
Query: 309 FTHLAGSSSVWKALSPTVLQAAFNV 333
LAG +SV + L V+ F +
Sbjct: 310 INTLAGRTSVLRGLPLEVITNGFQI 334
>AT4G28520.1 | Symbols: CRU3, CRC | cruciferin 3 |
chr4:14087596-14089617 FORWARD LENGTH=524
Length = 524
Score = 61.2 bits (147), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 67/298 (22%), Positives = 117/298 (39%), Gaps = 43/298 (14%)
Query: 78 EEKVVAIKKGDALALPFGVVTWWYNKEDTELVVLFLGDTSKAHKAGEFTD--FYLT---- 131
+KV +++GD A G W YN + LV++ L D + + F+L
Sbjct: 193 HQKVEHVRRGDVFANTPGSAHWIYNSGEQPLVIIALLDIANYQNQLDRNPRVFHLAGNNQ 252
Query: 132 -----------GSNGIFTGFSTEFVGRAWDLKENEVKTLVGNQSAKG-IVKLEGNITIPE 179
+++GF + + +A + + L Q ++G IV+++G +
Sbjct: 253 QGGFGGSQQQQEQKNLWSGFDAQVIAQALKIDVQLAQQLQNQQDSRGNIVRVKGPFQVVR 312
Query: 180 P---KP------DHRDGMALNCLE---------------AALDVDIKDGGRVVVLNTKNL 215
P +P H N LE A DV GRV +N+ L
Sbjct: 313 PPLRQPYESEEWRHPRSPQGNGLEETICSMRSHENIDDPARADVYKPSLGRVTSVNSYTL 372
Query: 216 PLVGEVGLGADLVRLDGGAMCSPGFSCDSALQVTYIXXXXXXXXXXXXXXKRVLETTVKA 275
P++ V L A L G AM P ++ + A ++ Y + VL+ V+
Sbjct: 373 PILEYVRLSATRGVLQGNAMVLPKYNMN-ANEILYCTGGQGRIQVVNDNGQNVLDQQVQK 431
Query: 276 GNLFIVPRFYVVSKIADPEGLEWFSIISTPNPIFTHLAGSSSVWKALSPTVLQAAFNV 333
G L ++P+ + + EW S + N + + LAG +S+ +AL V+ F +
Sbjct: 432 GQLVVIPQGFAYVVQSHGNKFEWISFKTNENAMISTLAGRTSLLRALPLEVISNGFQI 489
>AT4G28520.3 | Symbols: CRU3, CRC | cruciferin 3 |
chr4:14087596-14089617 FORWARD LENGTH=453
Length = 453
Score = 54.7 bits (130), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 38/149 (25%), Positives = 66/149 (44%), Gaps = 3/149 (2%)
Query: 187 GMALNCLEAALDVDIKDG--GRVVVLNTKNLPLVGEVGLGADLVRLDGGAMCSPGFSCDS 244
G + AL +D+ GRV +N+ LP++ V L A L G AM P ++ +
Sbjct: 271 GFDAQVIAQALKIDVYKPSLGRVTSVNSYTLPILEYVRLSATRGVLQGNAMVLPKYNMN- 329
Query: 245 ALQVTYIXXXXXXXXXXXXXXKRVLETTVKAGNLFIVPRFYVVSKIADPEGLEWFSIIST 304
A ++ Y + VL+ V+ G L ++P+ + + EW S +
Sbjct: 330 ANEILYCTGGQGRIQVVNDNGQNVLDQQVQKGQLVVIPQGFAYVVQSHGNKFEWISFKTN 389
Query: 305 PNPIFTHLAGSSSVWKALSPTVLQAAFNV 333
N + + LAG +S+ +AL V+ F +
Sbjct: 390 ENAMISTLAGRTSLLRALPLEVISNGFQI 418
>AT3G22640.1 | Symbols: PAP85 | cupin family protein |
chr3:8011902-8013883 REVERSE LENGTH=486
Length = 486
Score = 49.7 bits (117), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 42/149 (28%), Positives = 70/149 (46%), Gaps = 11/149 (7%)
Query: 30 PMLREG--NIGASKLALQKNGFALPRYSDSSKVAYVLQGTGVVGIVLPESEEKVVAIKKG 87
P L G N S + ++ F +P + D+ V VLQG GV+ V +++E I KG
Sbjct: 93 PALFRGIENYRFSLVEMEPTTFFVPHHLDADAVFIVLQGKGVIEFVTDKTKES-FHITKG 151
Query: 88 DALALPFGVVTWWYNKEDTELVVLFLGD-TSKAHKAGEFTDFYLTGS---NGIFTGFSTE 143
D + +P GV + N T V L L T + G + D++ S F GF+ E
Sbjct: 152 DVVRIPSGVTNFITNTNQT--VPLRLAQITVPVNNPGNYKDYFPAASQFQQSYFNGFTKE 209
Query: 144 FVGRAWDLKENEVKTLV--GNQSAKGIVK 170
+ ++++ E + LV + +GI++
Sbjct: 210 VLSTSFNVPEELLGRLVTRSKEIGQGIIR 238