Miyakogusa Predicted Gene
- Lj6g3v1449680.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj6g3v1449680.1 Non Chatacterized Hit- tr|I1MRJ1|I1MRJ1_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.42000
PE,61.43,0,seg,NULL; ULP_PROTEASE,Peptidase C48, SUMO/Sentrin/Ubl1;
Peptidase_C48,Peptidase C48, SUMO/Sentrin/U,CUFF.59767.1
(494 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT4G15880.1 | Symbols: ESD4, ATESD4 | Cysteine proteinases super... 448 e-126
AT3G06910.1 | Symbols: ULP1A, ELS1, AtULP1a | UB-like protease 1... 375 e-104
AT4G00690.1 | Symbols: ULP1B | UB-like protease 1B | chr4:281313... 326 2e-89
AT1G60220.1 | Symbols: OTS1, ULP1D | UB-like protease 1D | chr1:... 92 1e-18
AT1G10570.2 | Symbols: OTS2, ULP1C | Cysteine proteinases superf... 92 1e-18
AT1G10570.1 | Symbols: OTS2, ULP1C | Cysteine proteinases superf... 91 1e-18
AT1G09730.1 | Symbols: | Cysteine proteinases superfamily prote... 69 6e-12
AT1G09730.2 | Symbols: | Cysteine proteinases superfamily prote... 69 6e-12
AT4G33620.1 | Symbols: | Cysteine proteinases superfamily prote... 64 2e-10
>AT4G15880.1 | Symbols: ESD4, ATESD4 | Cysteine proteinases
superfamily protein | chr4:9012769-9015797 FORWARD
LENGTH=489
Length = 489
Score = 448 bits (1153), Expect = e-126, Method: Compositional matrix adjust.
Identities = 240/501 (47%), Positives = 313/501 (62%), Gaps = 19/501 (3%)
Query: 1 MGVMTSHRKRPEECMNV--NHSXXXXXXXXXDSQRKRPRLLNSMSRPAPTP---NGIVSR 55
MG + +RKR +E N S K+ R +MS + P N +SR
Sbjct: 1 MGAVAINRKRSDESFNFINQQSTNPLRNSPYFQASKKRRFSFAMSEDSGKPASSNPTISR 60
Query: 56 ISRYPDAKPPLIREVHAPCRNRKFDLVRRDLAEXXXXXXXXXXVDVKGSVLFAAYEEARR 115
ISRYPDAK PL RE+HAP R ++R A+ + + Y++A+R
Sbjct: 61 ISRYPDAKAPLRREIHAPSRG----ILRYGKAKSNDY------CEKDANFFVRKYDDAKR 110
Query: 116 SALAEVRYLPSKGKEVIDVDAXXXXXXXXXXXXXXXXXXXHDLEAKXXXXXXXXXXXQST 175
SAL +R++ +KGK+ +D+ D +
Sbjct: 111 SALEALRFV-NKGKDFVDLGDEVEKEEVVSDDSSVQAIEVIDCDDDEEKKNLQPSFSSGV 169
Query: 176 SSPDSALTNKGDNSLXXXXXXXXXXXXEIDFSSVRAYKKLIESADRRDDRLKRLEFEITL 235
+ + +++ + D SS+ AY+KL++SA++R+ +L+ L FEI L
Sbjct: 170 TDVKKGENFRVEDTSMMLDSLSLDRDVDNDASSLEAYRKLMQSAEKRNSKLEALGFEIVL 229
Query: 236 NEKRRETFNLLRPKKELVEV-VPKEPFVPLTKEEENEVARAFYT-NRKKILISHENSNIE 293
NEK+ RPK V VP+EPF+PLT++EE EV RAF NR+K+L +HENSNI+
Sbjct: 230 NEKKLSLLRQSRPKTVEKRVEVPREPFIPLTEDEEAEVYRAFSGRNRRKVLATHENSNID 289
Query: 294 ISGEKFQCLGPAAWLNDEVINLYLELLKEREQREPQKYLKCHFFNTFFYKKLIGGRNGYD 353
I+GE QCL P+AWLNDEVIN+YLELLKERE REP+KYLKCH+FNTFFYKKL+ +GY+
Sbjct: 290 ITGEVLQCLTPSAWLNDEVINVYLELLKERETREPKKYLKCHYFNTFFYKKLVSD-SGYN 348
Query: 354 FKSVRRWTTQRKLGYGLLECDKIFVPIHREIHWCLAVINKKDKKFQYLDSLKGRDSHVLN 413
FK+VRRWTTQRKLGY L++CD IFVPIHR +HW LAVIN ++ K YLDSL G D +LN
Sbjct: 349 FKAVRRWTTQRKLGYALIDCDMIFVPIHRGVHWTLAVINNRESKLLYLDSLNGVDPMILN 408
Query: 414 VLAKYFVDEVKDKTGKDIDISDWEKEFVEDLPEQKNGYDCGVFMIKYADFYSRGLGLCFK 473
LAKY DE +K+GK ID + W+ EFVEDLP+QKNGYDCG+FM+KY DF+SRGLGLCF
Sbjct: 409 ALAKYMGDEANEKSGKKIDANSWDMEFVEDLPQQKNGYDCGMFMLKYIDFFSRGLGLCFS 468
Query: 474 QEHMPYFRVRTAKELLRLRAD 494
QEHMPYFR+RTAKE+LRLRAD
Sbjct: 469 QEHMPYFRLRTAKEILRLRAD 489
>AT3G06910.1 | Symbols: ULP1A, ELS1, AtULP1a | UB-like protease 1A |
chr3:2178905-2181188 REVERSE LENGTH=502
Length = 502
Score = 375 bits (963), Expect = e-104, Method: Compositional matrix adjust.
Identities = 204/446 (45%), Positives = 267/446 (59%), Gaps = 19/446 (4%)
Query: 56 ISRYPDAKPPLIREVHAPCRNRKFDLVRRDLAEXXXXXXXXXXVDVKGSVLFAAYEEARR 115
I RYP+ K L R+VHAP R L +V G+ L + +R
Sbjct: 69 IYRYPEVKSSLRRQVHAPVR---------ILNSGRDRSTRQGSGNVLGTFLTRNNDMWKR 119
Query: 116 SAL-AEVRYLPSKGKEVIDVDAXXXXXXXXXXXXXXXXXXXHDLEAKXXXXXXXXXXX-- 172
+AL + +RY +EVIDVD +E
Sbjct: 120 NALDSSLRY--RTDREVIDVDDELGDVEMISDDTSREGVENVAMEVDEVEEKAEMGNGLF 177
Query: 173 QSTSSPDSALTNKGDNSLXXXXXXXXXXXXEIDFSSVRAYKKLIESADRRDDRLKRLEFE 232
+S + G+ S D +S AY+K++ESA R +LK F
Sbjct: 178 SEVASLKNGSLRVGECS-KANSSSLVVNRPVTDVTSFEAYRKVLESAVNRTSKLKDRGFV 236
Query: 233 ITLNEKRRETFNLL----RPKKELVEVVPKEPFVPLTKEEENEVARAFYTNRKKILISHE 288
E+ R L R +E VEVV +E FVPL++EEE V RAF N IL++H+
Sbjct: 237 DFFKERGRALLRSLSSFWRQDEEPVEVVQREAFVPLSREEETAVRRAFSANDSNILVTHK 296
Query: 289 NSNIEISGEKFQCLGPAAWLNDEVINLYLELLKEREQREPQKYLKCHFFNTFFYKKLIGG 348
NSNI+I+G+ +CL P WLNDEVINLY+ LLKERE REP+K+LKCHFFNTFF+ KL+
Sbjct: 297 NSNIDITGKILRCLKPGKWLNDEVINLYMVLLKEREAREPKKFLKCHFFNTFFFTKLVNS 356
Query: 349 RNGYDFKSVRRWTTQRKLGYGLLECDKIFVPIHREIHWCLAVINKKDKKFQYLDSLKGRD 408
GY++ +VRRWT+ ++LGY L +CDKIF+PIH IHW LAVIN KD+KFQYLDS KGR+
Sbjct: 357 ATGYNYGAVRRWTSMKRLGYHLKDCDKIFIPIHMNIHWTLAVINIKDQKFQYLDSFKGRE 416
Query: 409 SHVLNVLAKYFVDEVKDKTGKDIDISDWEKEFVEDLPEQKNGYDCGVFMIKYADFYSRGL 468
+L+ LA+YFVDEV+DK+ D+D+S W +EFV+DLP Q+NG+DCG+FM+KY DFYSRGL
Sbjct: 417 PKILDALARYFVDEVRDKSEVDLDVSRWRQEFVQDLPMQRNGFDCGMFMVKYIDFYSRGL 476
Query: 469 GLCFKQEHMPYFRVRTAKELLRLRAD 494
LCF QE MPYFR RTAKE+L+L+A+
Sbjct: 477 DLCFTQEQMPYFRARTAKEILQLKAE 502
>AT4G00690.1 | Symbols: ULP1B | UB-like protease 1B |
chr4:281313-283129 FORWARD LENGTH=348
Length = 348
Score = 326 bits (836), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 163/278 (58%), Positives = 202/278 (72%), Gaps = 12/278 (4%)
Query: 228 RLEFEITLNEKRRETFNLLRPKKELVEVVP--KEPFVPLTKEEENEVARAFYT-NRKKIL 284
R+E + N ++ N L E ++V EPF PL +EE V A NRKKIL
Sbjct: 72 RIEEHSSSNRGFKDGNNFLVSSTETKKLVAFCGEPFQPLNEEEVALVNSALSKRNRKKIL 131
Query: 285 ISHENSNIEISGEKFQCLGPAAWLNDEVINLYLELLKEREQREPQKYLKCHFFNTFFYKK 344
+SH+NSNI+ISGE QCL P WLND+V NLYLELLKER+ R+PQKY KCHFFNTFFY K
Sbjct: 132 VSHKNSNIDISGETLQCLRPNQWLNDDVTNLYLELLKERQTRDPQKYFKCHFFNTFFYVK 191
Query: 345 LIGGRNGYDFKSVRRWTTQRKLGYGLLECDKIFVPIHREIHWCLAVINKKDKKFQYLDSL 404
L+ G +GY++K+V RWTT+RKLGY L++CD IFVPIH +IHW L VIN +++KF YLDSL
Sbjct: 192 LVSG-SGYNYKAVSRWTTKRKLGYDLIDCDIIFVPIHIDIHWTLGVINNRERKFVYLDSL 250
Query: 405 -KGRDSHVLNVLAKYFVDEVKDKTGKDIDISDWEKEFVEDLPEQKNGYDCGVFMIKYADF 463
G +LN +AKY VDEVK K+ K+ID+S W E+VE+ P+Q+NGYDCG+FM+KY DF
Sbjct: 251 FTGVGHTILNAMAKYLVDEVKQKSQKNIDVSSWGMEYVEERPQQQNGYDCGMFMLKYIDF 310
Query: 464 YSRGLGLCFKQ-------EHMPYFRVRTAKELLRLRAD 494
YSRGL L F Q + MPYFR+RTAKE+LRLRAD
Sbjct: 311 YSRGLSLQFSQVIRDVIKKDMPYFRLRTAKEILRLRAD 348
>AT1G60220.1 | Symbols: OTS1, ULP1D | UB-like protease 1D |
chr1:22208332-22211910 FORWARD LENGTH=584
Length = 584
Score = 91.7 bits (226), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 64/193 (33%), Positives = 94/193 (48%), Gaps = 31/193 (16%)
Query: 292 IEISGEKFQCLGPAAWLNDEVINLYLELLKEREQREPQKYLKCHFFNTFFYKKLI----- 346
+++ + +CL P +L V+N Y+ L+++ Q CHFFNT+FYKKL
Sbjct: 343 VQVCLKDLECLAPREYLTSPVMNFYMRFLQQQISSSNQISADCHFFNTYFYKKLSDAVTY 402
Query: 347 -GGRNGYDFKSVRRWTTQRKLGYGLLECDKIFVPIHREIHWCLAVINKKDKKFQ------ 399
G F RRW G L IF+PIH ++HW L ++ DKK +
Sbjct: 403 KGNDKDAFFVRFRRWWK----GIDLFRKAYIFIPIHEDLHWSLVIVCIPDKKDESGLTIL 458
Query: 400 YLDS--LKGRDSHVLNVLAKYFVDEV----KDKTGKDIDISD--WE------KEFVEDLP 445
+LDS L R S V NV ++ DE +D D+ IS+ W+ E V +P
Sbjct: 459 HLDSLGLHSRKSIVENV-KRFLKDEWNYLNQDDYSLDLPISEKVWKNLPRRISEAVVQVP 517
Query: 446 EQKNGYDCGVFMI 458
+QKN +DCG F++
Sbjct: 518 QQKNDFDCGPFVL 530
>AT1G10570.2 | Symbols: OTS2, ULP1C | Cysteine proteinases
superfamily protein | chr1:3487639-3491102 FORWARD
LENGTH=570
Length = 570
Score = 91.7 bits (226), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 66/230 (28%), Positives = 104/230 (45%), Gaps = 38/230 (16%)
Query: 259 EPFVPLTKEEENEVARA-----FYTNRKKILISHENSNIEISGEKFQCLGPAAWLNDEVI 313
EP P+ EE E+ +Y + + S +++S + +CL P +L VI
Sbjct: 295 EPLSPMVVEEACELPEGLPEDIYYPSSDQ---SDGRDLVQVSLKDLKCLSPGEYLTSPVI 351
Query: 314 NLYLELLKEREQREPQKYLKCHFFNTFFYKKLI-------GGRNGYDFKSVRRWTTQRKL 366
N Y+ ++ + CHFFNTFFYKKL R+ Y F RRW
Sbjct: 352 NFYIRYVQHHVFSADKTAANCHFFNTFFYKKLTEAVSYKGNDRDAY-FVKFRRWWK---- 406
Query: 367 GYGLLECDKIFVPIHREIHWCLAVINKKDKKFQ------YLDSLKGRDSHVLNVLAKYFV 420
G+ L IF+PIH ++HW L +I DK+ + +LDSL +++ K F+
Sbjct: 407 GFDLFCKSYIFIPIHEDLHWSLVIICIPDKEDESGLTIIHLDSLGLHPRNLIFNNVKRFL 466
Query: 421 DEVKDKTGKDI------------DISDWEKEFVEDLPEQKNGYDCGVFMI 458
E + +D D+ + E +P+QKN +DCG+F++
Sbjct: 467 REEWNYLNQDAPLDLPISAKVWRDLPNMINEAEVQVPQQKNDFDCGLFLL 516
>AT1G10570.1 | Symbols: OTS2, ULP1C | Cysteine proteinases
superfamily protein | chr1:3487639-3491102 FORWARD
LENGTH=571
Length = 571
Score = 91.3 bits (225), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 66/230 (28%), Positives = 104/230 (45%), Gaps = 38/230 (16%)
Query: 259 EPFVPLTKEEENEVARA-----FYTNRKKILISHENSNIEISGEKFQCLGPAAWLNDEVI 313
EP P+ EE E+ +Y + + S +++S + +CL P +L VI
Sbjct: 296 EPLSPMVVEEACELPEGLPEDIYYPSSDQ---SDGRDLVQVSLKDLKCLSPGEYLTSPVI 352
Query: 314 NLYLELLKEREQREPQKYLKCHFFNTFFYKKLI-------GGRNGYDFKSVRRWTTQRKL 366
N Y+ ++ + CHFFNTFFYKKL R+ Y F RRW
Sbjct: 353 NFYIRYVQHHVFSADKTAANCHFFNTFFYKKLTEAVSYKGNDRDAY-FVKFRRWWK---- 407
Query: 367 GYGLLECDKIFVPIHREIHWCLAVINKKDKKFQ------YLDSLKGRDSHVLNVLAKYFV 420
G+ L IF+PIH ++HW L +I DK+ + +LDSL +++ K F+
Sbjct: 408 GFDLFCKSYIFIPIHEDLHWSLVIICIPDKEDESGLTIIHLDSLGLHPRNLIFNNVKRFL 467
Query: 421 DEVKDKTGKDI------------DISDWEKEFVEDLPEQKNGYDCGVFMI 458
E + +D D+ + E +P+QKN +DCG+F++
Sbjct: 468 REEWNYLNQDAPLDLPISAKVWRDLPNMINEAEVQVPQQKNDFDCGLFLL 517
>AT1G09730.1 | Symbols: | Cysteine proteinases superfamily protein
| chr1:3148017-3154236 REVERSE LENGTH=963
Length = 963
Score = 69.3 bits (168), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 60/229 (26%), Positives = 99/229 (43%), Gaps = 39/229 (17%)
Query: 292 IEISGEKFQCLGPAAWLNDEVINLYLELLKEREQREPQKYLKCHFFNTFFY--------K 343
+ I + L P ++ND +I+ Y+ LK + Q E + F
Sbjct: 426 VSICKRDVELLQPETFVNDTIIDFYINYLKNQIQTEEKHRFHFFNSFFFRKLADLDKDPS 485
Query: 344 KLIGGRNGYDFKSVRRWTTQRKLGYGLLECDKIFVPIHREIHWCLAVI------------ 391
+ G+ F VR+WT RK+ + D IFVP++ +HW L VI
Sbjct: 486 SIADGKAA--FLRVRKWT--RKVD--MFGKDYIFVPVNYNLHWSLIVICHPGEVANRTDL 539
Query: 392 ----NKKDKKFQYLDSLKGRDSHVLNVLAKYFVDEVKDK---TGKDIDISDWEKEFVE-D 443
+KK ++DS+KG + + N++ Y +E K++ T DI FV +
Sbjct: 540 DLDDSKKVPCILHMDSIKGSHAGLKNLVQTYLCEEWKERHKETSDDISSRFMNLRFVSLE 599
Query: 444 LPEQKNGYDCGVFMIKYADFYSRGLGLCFKQEHMPYFRVRTAKELLRLR 492
LP+Q+N +DCG+F++ Y + + L F F++ A L L
Sbjct: 600 LPQQENSFDCGLFLLHYLELFLAEAPLNFSP-----FKIYNASNFLYLN 643
>AT1G09730.2 | Symbols: | Cysteine proteinases superfamily protein
| chr1:3148017-3154236 REVERSE LENGTH=931
Length = 931
Score = 69.3 bits (168), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 60/229 (26%), Positives = 99/229 (43%), Gaps = 39/229 (17%)
Query: 292 IEISGEKFQCLGPAAWLNDEVINLYLELLKEREQREPQKYLKCHFFNTFFY--------K 343
+ I + L P ++ND +I+ Y+ LK + Q E + F
Sbjct: 394 VSICKRDVELLQPETFVNDTIIDFYINYLKNQIQTEEKHRFHFFNSFFFRKLADLDKDPS 453
Query: 344 KLIGGRNGYDFKSVRRWTTQRKLGYGLLECDKIFVPIHREIHWCLAVI------------ 391
+ G+ F VR+WT RK+ + D IFVP++ +HW L VI
Sbjct: 454 SIADGKAA--FLRVRKWT--RKVD--MFGKDYIFVPVNYNLHWSLIVICHPGEVANRTDL 507
Query: 392 ----NKKDKKFQYLDSLKGRDSHVLNVLAKYFVDEVKDK---TGKDIDISDWEKEFVE-D 443
+KK ++DS+KG + + N++ Y +E K++ T DI FV +
Sbjct: 508 DLDDSKKVPCILHMDSIKGSHAGLKNLVQTYLCEEWKERHKETSDDISSRFMNLRFVSLE 567
Query: 444 LPEQKNGYDCGVFMIKYADFYSRGLGLCFKQEHMPYFRVRTAKELLRLR 492
LP+Q+N +DCG+F++ Y + + L F F++ A L L
Sbjct: 568 LPQQENSFDCGLFLLHYLELFLAEAPLNFSP-----FKIYNASNFLYLN 611
>AT4G33620.1 | Symbols: | Cysteine proteinases superfamily protein
| chr4:16147692-16152853 FORWARD LENGTH=783
Length = 783
Score = 63.9 bits (154), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 60/206 (29%), Positives = 99/206 (48%), Gaps = 48/206 (23%)
Query: 297 EKFQCLGPAAWLNDEVINLYLELLKEREQREPQKYLKCHFFNTFFYKKL----------I 346
+ + L P ++ND +I+ Y++ LK R P++ + HFFN FF++KL
Sbjct: 310 QDIELLKPRRFINDTIIDFYIKYLKNR--ISPKERGRFHFFNCFFFRKLANLDKGTPSTC 367
Query: 347 GGRNGYDFKSVRRWTTQRKLGYGLLECDKIFVPIHREIHWCLAVI-------------NK 393
GGR Y + V++WT L E D IF+PI+ HW L +I +
Sbjct: 368 GGREAY--QRVQKWTK----NVDLFEKDYIFIPINCSFHWSLVIICHPGELVPSHVNFHS 421
Query: 394 KDKKFQ---------YLDSLKGRDSH---VLNVLAKYFVDEVK---DKTGKDIDISDWEK 438
D + + +LDS+KG SH ++N+ Y +E K + T D + +
Sbjct: 422 FDDEVENPQRVPCILHLDSIKG--SHKGGLINIFPSYLREEWKARHENTTNDSSRAPNMQ 479
Query: 439 EFVEDLPEQKNGYDCGVFMIKYADFY 464
+LP+Q+N +DCG+F++ Y D +
Sbjct: 480 SISLELPQQENSFDCGLFLLHYLDLF 505