Miyakogusa Predicted Gene
- Lj1g3v0281600.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v0281600.1 Non Characterized Hit- tr|I1KC72|I1KC72_SOYBN
Uncharacterized protein OS=Glycine max PE=4
SV=1,51.4,2e-17,ULP_PROTEASE,Peptidase C48, SUMO/Sentrin/Ubl1;
seg,NULL; Cysteine proteinases,NULL; Peptidase_C48,Pe,CUFF.25483.1
(284 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                     Score     E
Sequences producing significant alignments:                         (bits)   Value
AT3G48480.1 | Symbols: | Cysteine proteinases superfamily prote...     211   5e-55
AT4G33620.1 | Symbols: | Cysteine proteinases superfamily prote...      73   2e-13
AT1G60220.1 | Symbols: OTS1, ULP1D | UB-like protease 1D | chr1:...     72   4e-13
AT1G09730.1 | Symbols: | Cysteine proteinases superfamily prote...      69   3e-12
AT1G09730.2 | Symbols: | Cysteine proteinases superfamily prote...      69   3e-12
AT1G10570.1 | Symbols: OTS2, ULP1C | Cysteine proteinases superf...     61   9e-10
AT1G10570.2 | Symbols: OTS2, ULP1C | Cysteine proteinases superf...     61   9e-10
>AT3G48480.1 | Symbols: | Cysteine proteinases superfamily protein
| chr3:17957326-17959062 REVERSE LENGTH=298
Length = 298
Score = 211 bits (537), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 105/231 (45%), Positives = 150/231 (64%), Gaps = 6/231 (2%)
Query: 58 SLPSYLSDIPRRPRTXXXXXXAE-EALPRPKLDSGVFDNNLVKIWNSFSEDKRKPFAYFD 116
+ P + + RR R+ ++ R KL+S F L +W FS++K+ F Y D
Sbjct: 67 TAPCFYDECTRRGRSERRIKCKYLDSKLRKKLNSKAFVGYLEDVWRGFSDEKKNSFVYLD 126
Query: 117 SLWFSLYRAASS--KDKVLTWIKKEHIFSKAYVFVPIVCWGHWSLLIFCHFGESLQSTTR 174
LWFS+Y++ + + V +K + IFSK YVF+PIV W HW+LLIFC+FGE L S
Sbjct: 127 CLWFSMYKSENHNIRSSVFDSVKTKQIFSKKYVFLPIVYWSHWTLLIFCNFGEDLDSD-- 184
Query: 175 SRCMLLLDSLEMVNP-RRLEPDIRRFVVDIYKAWDRPETKNLIYQIPLLVPKVPQQRDGN 233
CML LDSL+ + +RLEPDIR+FV+DIY+A R E +L+ +IP VP VPQQ +
Sbjct: 185 KTCMLFLDSLQTTDSSQRLEPDIRKFVLDIYRAEGRTEDSSLVDEIPFYVPMVPQQTNDV 244
Query: 234 ECGNFVLYFINLFLRCAPENFSMGGYPYFMKKDWFTFEDFDRFCERLYSLN 284
ECG+FVLY+I+ F+ APENF++ PYF+K+DWF+ +D ++FC+ L+SL
Sbjct: 245 ECGSFVLYYIHRFIEDAPENFNVEDMPYFLKEDWFSHKDLEKFCDELHSLG 295
>AT4G33620.1 | Symbols: | Cysteine proteinases superfamily protein
| chr4:16147692-16152853 FORWARD LENGTH=783
Length = 783
Score = 73.2 bits (178), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 47/164 (28%), Positives = 81/164 (49%), Gaps = 34/164 (20%)
Query: 131 KVLTWIKKEHIFSKAYVFVPIVCWGHWSLLIFCHFGE-------------SLQSTTRSRC 177
+V W K +F K Y+F+PI C HWSL+I CH GE +++ R C
Sbjct: 375 RVQKWTKNVDLFEKDYIFIPINCSFHWSLVIICHPGELVPSHVNFHSFDDEVENPQRVPC 434
Query: 178 MLLLDSLEMVNPRRLEPDIRRFVVDIYKAWDRPE-------TKNLIYQIPLLVP---KVP 227
+L LDS++ + L ++I+ ++ R E T N + P + ++P
Sbjct: 435 ILHLDSIKGSHKGGL--------INIFPSYLREEWKARHENTTNDSSRAPNMQSISLELP 486
Query: 228 QQRDGNECGNFVLYFINLFLRCAPENFS---MGGYPYFMKKDWF 268
QQ + +CG F+L++++LF+ AP F+ + F+ ++WF
Sbjct: 487 QQENSFDCGLFLLHYLDLFVAQAPAKFNPSLISRSANFLTRNWF 530
>AT1G60220.1 | Symbols: OTS1, ULP1D | UB-like protease 1D |
chr1:22208332-22211910 FORWARD LENGTH=584
Length = 584
Score = 72.4 bits (176), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 50/145 (34%), Positives = 76/145 (52%), Gaps = 15/145 (10%)
Query: 135 WIKKEHIFSKAYVFVPIVCWGHWSLLIFCHFGESLQSTTRSRCMLLLDSLEMVNPRRLEP 194
W K +F KAY+F+PI HWSL+I C + +S +L LDSL + + + +
Sbjct: 417 WWKGIDLFRKAYIFIPIHEDLHWSLVIVCIPDKKDESGL---TILHLDSLGLHSRKSIVE 473
Query: 195 DIRRFVVDIYK-------AWDRPET----KNLIYQIPLLVPKVPQQRDGNECGNFVLYFI 243
+++RF+ D + + D P + KNL +I V +VPQQ++ +CG FVL+FI
Sbjct: 474 NVKRFLKDEWNYLNQDDYSLDLPISEKVWKNLPRRISEAVVQVPQQKNDFDCGPFVLFFI 533
Query: 244 NLFLRCAPENFSMGGYPYFMKKDWF 268
F+ AP+ F KK WF
Sbjct: 534 KRFIEEAPQRLKRKDLGMFDKK-WF 557
>AT1G09730.1 | Symbols: | Cysteine proteinases superfamily protein
| chr1:3148017-3154236 REVERSE LENGTH=963
Length = 963
Score = 69.3 bits (168), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 80/151 (52%), Gaps = 14/151 (9%)
Query: 131 KVLTWIKKEHIFSKAYVFVPIVCWGHWSLLIFCHFGE-------SLQSTTRSRCMLLLDS 183
+V W +K +F K Y+FVP+ HWSL++ CH GE L + + C+L +DS
Sbjct: 496 RVRKWTRKVDMFGKDYIFVPVNYNLHWSLIVICHPGEVANRTDLDLDDSKKVPCILHMDS 555
Query: 184 LEMVNPRRLEPDIRRFVVDIYKAWDRPETKNL---IYQIPLLVPKVPQQRDGNECGNFVL 240
++ + L+ ++ ++ + +K + + ++ + + ++PQQ + +CG F+L
Sbjct: 556 IKGSHA-GLKNLVQTYLCEEWKERHKETSDDISSRFMNLRFVSLELPQQENSFDCGLFLL 614
Query: 241 YFINLFLRCAPENFS---MGGYPYFMKKDWF 268
+++ LFL AP NFS + F+ +WF
Sbjct: 615 HYLELFLAEAPLNFSPFKIYNASNFLYLNWF 645
>AT1G09730.2 | Symbols: | Cysteine proteinases superfamily protein
| chr1:3148017-3154236 REVERSE LENGTH=931
Length = 931
Score = 69.3 bits (168), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 80/151 (52%), Gaps = 14/151 (9%)
Query: 131 KVLTWIKKEHIFSKAYVFVPIVCWGHWSLLIFCHFGE-------SLQSTTRSRCMLLLDS 183
+V W +K +F K Y+FVP+ HWSL++ CH GE L + + C+L +DS
Sbjct: 464 RVRKWTRKVDMFGKDYIFVPVNYNLHWSLIVICHPGEVANRTDLDLDDSKKVPCILHMDS 523
Query: 184 LEMVNPRRLEPDIRRFVVDIYKAWDRPETKNL---IYQIPLLVPKVPQQRDGNECGNFVL 240
++ + L+ ++ ++ + +K + + ++ + + ++PQQ + +CG F+L
Sbjct: 524 IKGSHA-GLKNLVQTYLCEEWKERHKETSDDISSRFMNLRFVSLELPQQENSFDCGLFLL 582
Query: 241 YFINLFLRCAPENFS---MGGYPYFMKKDWF 268
+++ LFL AP NFS + F+ +WF
Sbjct: 583 HYLELFLAEAPLNFSPFKIYNASNFLYLNWF 613
>AT1G10570.1 | Symbols: OTS2, ULP1C | Cysteine proteinases
superfamily protein | chr1:3487639-3491102 FORWARD
LENGTH=571
Length = 571
Score = 60.8 bits (146), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 56/190 (29%), Positives = 92/190 (48%), Gaps = 26/190 (13%)
Query: 104 FSEDKRKPFAYFDSLWF--SLYRAASSKD--------KVLTWIKKEHIFSKAYVFVPIVC 153
FS DK +F + +F L A S K K W K +F K+Y+F+PI
Sbjct: 364 FSADKTAANCHFFNTFFYKKLTEAVSYKGNDRDAYFVKFRRWWKGFDLFCKSYIFIPIHE 423
Query: 154 WGHWSLLIFCHFGESLQSTTRSRCMLLLDSLEMVNPRRLE-PDIRRFVVDIYK------A 206
HWSL+I C + +S ++ LDSL + +PR L +++RF+ + +
Sbjct: 424 DLHWSLVIICIPDKEDESGL---TIIHLDSLGL-HPRNLIFNNVKRFLREEWNYLNQDAP 479
Query: 207 WDRPETKNLIYQIPLLVP----KVPQQRDGNECGNFVLYFINLFLRCAPENFSMGGYPYF 262
D P + + +P ++ +VPQQ++ +CG F+L+FI F+ AP+ ++
Sbjct: 480 LDLPISAKVWRDLPNMINEAEVQVPQQKNDFDCGLFLLFFIRRFIEEAPQRLTLQDLKMI 539
Query: 263 MKKDWFTFED 272
KK WF E+
Sbjct: 540 HKK-WFKPEE 548
>AT1G10570.2 | Symbols: OTS2, ULP1C | Cysteine proteinases
superfamily protein | chr1:3487639-3491102 FORWARD
LENGTH=570
Length = 570
Score = 60.8 bits (146), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 56/190 (29%), Positives = 92/190 (48%), Gaps = 26/190 (13%)
Query: 104 FSEDKRKPFAYFDSLWF--SLYRAASSKD--------KVLTWIKKEHIFSKAYVFVPIVC 153
FS DK +F + +F L A S K K W K +F K+Y+F+PI
Sbjct: 363 FSADKTAANCHFFNTFFYKKLTEAVSYKGNDRDAYFVKFRRWWKGFDLFCKSYIFIPIHE 422
Query: 154 WGHWSLLIFCHFGESLQSTTRSRCMLLLDSLEMVNPRRLE-PDIRRFVVDIYK------A 206
HWSL+I C + +S ++ LDSL + +PR L +++RF+ + +
Sbjct: 423 DLHWSLVIICIPDKEDESGL---TIIHLDSLGL-HPRNLIFNNVKRFLREEWNYLNQDAP 478
Query: 207 WDRPETKNLIYQIPLLVP----KVPQQRDGNECGNFVLYFINLFLRCAPENFSMGGYPYF 262
D P + + +P ++ +VPQQ++ +CG F+L+FI F+ AP+ ++
Sbjct: 479 LDLPISAKVWRDLPNMINEAEVQVPQQKNDFDCGLFLLFFIRRFIEEAPQRLTLQDLKMI 538
Query: 263 MKKDWFTFED 272
KK WF E+
Sbjct: 539 HKK-WFKPEE 547