Miyakogusa Predicted Gene
- Lj0g3v0287359.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0287359.1 Non Characterized Hit- tr|C5Z2Z5|C5Z2Z5_SORBI
Putative uncharacterized protein Sb10g016910
OS=Sorghu,53.41,1e-18,ULP_PROTEASE,Peptidase C48, SUMO/Sentrin/Ubl1;
Cysteine proteinases,NULL; seg,NULL; SENTRIN/SUMO-SPE,CUFF.19208.1
(322 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                      Score    E
Sequences producing significant alignments:                          (bits)  Value
AT1G60220.1 | Symbols: OTS1, ULP1D | UB-like protease 1D | chr1:...     347  9e-96
AT1G10570.2 | Symbols: OTS2, ULP1C | Cysteine proteinases superf...     293  8e-80
AT1G10570.1 | Symbols: OTS2, ULP1C | Cysteine proteinases superf...     293  8e-80
AT1G09730.1 | Symbols: | Cysteine proteinases superfamily prote...      132  3e-31
AT1G09730.2 | Symbols: | Cysteine proteinases superfamily prote...      132  3e-31
AT4G33620.1 | Symbols: | Cysteine proteinases superfamily prote...      105  3e-23
AT4G15880.1 | Symbols: ESD4, ATESD4 | Cysteine proteinases super...      77  2e-14
AT3G48480.1 | Symbols: | Cysteine proteinases superfamily prote...       75  8e-14
AT3G06910.1 | Symbols: ULP1A, ELS1, AtULP1a | UB-like protease 1...      75  8e-14
AT4G00690.1 | Symbols: ULP1B | UB-like protease 1B | chr4:281313...      74  1e-13
>AT1G60220.1 | Symbols: OTS1, ULP1D | UB-like protease 1D |
chr1:22208332-22211910 FORWARD LENGTH=584
Length = 584
Score = 347 bits (889), Expect = 9e-96, Method: Compositional matrix adjust.
Identities = 158/286 (55%), Positives = 213/286 (74%), Gaps = 1/286 (0%)
Query: 3 SDGSRSRNGQPIVLXXXXXXXXEPLILENTENKLSEYLKEAKIYFPSRDDPECVEICYKD 62
S SR R + +P + +L E L+E I +P+RDDP V++C KD
Sbjct: 291 SQSSRRRKKSEDTVINVDEEEAQPSTVAEQAAELPEGLQE-DICYPTRDDPHFVQVCLKD 349
Query: 63 TDCLAPEGYLSSTIMNFYIRYLQQQASLTNSSLSDYHFFNTYFYKKLKEAVSCKQSDRET 122
+CLAP YL+S +MNFY+R+LQQQ S +N +D HFFNTYFYKKL +AV+ K +D++
Sbjct: 350 LECLAPREYLTSPVMNFYMRFLQQQISSSNQISADCHFFNTYFYKKLSDAVTYKGNDKDA 409
Query: 123 IFVKFRRWWKGVNIFQKAYVLIPIHEDLHWSLIIICIPDKGDESGPIILHLDSLGLHSSQ 182
FV+FRRWWKG+++F+KAY+ IPIHEDLHWSL+I+CIPDK DESG ILHLDSLGLHS +
Sbjct: 410 FFVRFRRWWKGIDLFRKAYIFIPIHEDLHWSLVIVCIPDKKDESGLTILHLDSLGLHSRK 469
Query: 183 SVFDNIKSYLIEEKKYMDRDCVYSDVSIADRIWKCLSRRIEAQVITVPQQKNEYDCGLFV 242
S+ +N+K +L +E Y+++D D+ I++++WK L RRI V+ VPQQKN++DCG FV
Sbjct: 470 SIVENVKRFLKDEWNYLNQDDYSLDLPISEKVWKNLPRRISEAVVQVPQQKNDFDCGPFV 529
Query: 243 LYFIKRFMEEAPERLKKKDLDMFSKRWFRPEEASSLRVKIKKLLIE 288
L+FIKRF+EEAP+RLK+KDL MF K+WFRP+EAS+LR+KI+ LIE
Sbjct: 530 LFFIKRFIEEAPQRLKRKDLGMFDKKWFRPDEASALRIKIRNTLIE 575
>AT1G10570.2 | Symbols: OTS2, ULP1C | Cysteine proteinases
superfamily protein | chr1:3487639-3491102 FORWARD
LENGTH=570
Length = 570
Score = 293 bits (751), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 139/266 (52%), Positives = 193/266 (72%), Gaps = 6/266 (2%)
Query: 26 PLILENTENKLSEYLKEAKIYFPSRDDPE---CVEICYKDTDCLAPEGYLSSTIMNFYIR 82
P+++E +L E L E IY+PS D + V++ KD CL+P YL+S ++NFYIR
Sbjct: 299 PMVVEEA-CELPEGLPE-DIYYPSSDQSDGRDLVQVSLKDLKCLSPGEYLTSPVINFYIR 356
Query: 83 YLQQQASLTNSSLSDYHFFNTYFYKKLKEAVSCKQSDRETIFVKFRRWWKGVNIFQKAYV 142
Y+Q + + ++ HFFNT+FYKKL EAVS K +DR+ FVKFRRWWKG ++F K+Y+
Sbjct: 357 YVQHHVFSADKTAANCHFFNTFFYKKLTEAVSYKGNDRDAYFVKFRRWWKGFDLFCKSYI 416
Query: 143 LIPIHEDLHWSLIIICIPDKGDESGPIILHLDSLGLHSSQSVFDNIKSYLIEEKKYMDRD 202
IPIHEDLHWSL+IICIPDK DESG I+HLDSLGLH +F+N+K +L EE Y+++D
Sbjct: 417 FIPIHEDLHWSLVIICIPDKEDESGLTIIHLDSLGLHPRNLIFNNVKRFLREEWNYLNQD 476
Query: 203 CVYSDVSIADRIWKCLSRRIEAQVITVPQQKNEYDCGLFVLYFIKRFMEEAPERLKKKDL 262
D+ I+ ++W+ L I + VPQQKN++DCGLF+L+FI+RF+EEAP+RL +DL
Sbjct: 477 APL-DLPISAKVWRDLPNMINEAEVQVPQQKNDFDCGLFLLFFIRRFIEEAPQRLTLQDL 535
Query: 263 DMFSKRWFRPEEASSLRVKIKKLLIE 288
M K+WF+PEEAS+LR+KI +L++
Sbjct: 536 KMIHKKWFKPEEASALRIKIWNILVD 561
>AT1G10570.1 | Symbols: OTS2, ULP1C | Cysteine proteinases
superfamily protein | chr1:3487639-3491102 FORWARD
LENGTH=571
Length = 571
Score = 293 bits (751), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 139/266 (52%), Positives = 193/266 (72%), Gaps = 6/266 (2%)
Query: 26 PLILENTENKLSEYLKEAKIYFPSRDDPE---CVEICYKDTDCLAPEGYLSSTIMNFYIR 82
P+++E +L E L E IY+PS D + V++ KD CL+P YL+S ++NFYIR
Sbjct: 300 PMVVEEA-CELPEGLPE-DIYYPSSDQSDGRDLVQVSLKDLKCLSPGEYLTSPVINFYIR 357
Query: 83 YLQQQASLTNSSLSDYHFFNTYFYKKLKEAVSCKQSDRETIFVKFRRWWKGVNIFQKAYV 142
Y+Q + + ++ HFFNT+FYKKL EAVS K +DR+ FVKFRRWWKG ++F K+Y+
Sbjct: 358 YVQHHVFSADKTAANCHFFNTFFYKKLTEAVSYKGNDRDAYFVKFRRWWKGFDLFCKSYI 417
Query: 143 LIPIHEDLHWSLIIICIPDKGDESGPIILHLDSLGLHSSQSVFDNIKSYLIEEKKYMDRD 202
IPIHEDLHWSL+IICIPDK DESG I+HLDSLGLH +F+N+K +L EE Y+++D
Sbjct: 418 FIPIHEDLHWSLVIICIPDKEDESGLTIIHLDSLGLHPRNLIFNNVKRFLREEWNYLNQD 477
Query: 203 CVYSDVSIADRIWKCLSRRIEAQVITVPQQKNEYDCGLFVLYFIKRFMEEAPERLKKKDL 262
D+ I+ ++W+ L I + VPQQKN++DCGLF+L+FI+RF+EEAP+RL +DL
Sbjct: 478 APL-DLPISAKVWRDLPNMINEAEVQVPQQKNDFDCGLFLLFFIRRFIEEAPQRLTLQDL 536
Query: 263 DMFSKRWFRPEEASSLRVKIKKLLIE 288
M K+WF+PEEAS+LR+KI +L++
Sbjct: 537 KMIHKKWFKPEEASALRIKIWNILVD 562
>AT1G09730.1 | Symbols: | Cysteine proteinases superfamily protein
| chr1:3148017-3154236 REVERSE LENGTH=963
Length = 963
Score = 132 bits (333), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 88/281 (31%), Positives = 144/281 (51%), Gaps = 40/281 (14%)
Query: 40 LKEAKIYFPSRD-----------DPECVEICYKDTDCLAPEGYLSSTIMNFYIRYLQQQA 88
L + K YFPS D DP+ V IC +D + L PE +++ TI++FYI YL+ Q
Sbjct: 399 LNQQKRYFPSFDEPFEDVVYPKGDPDAVSICKRDVELLQPETFVNDTIIDFYINYLKNQI 458
Query: 89 SLTNSSLSDYHFFNTYFYKKLKEAVSCKQSDRETIFVKFRRWWKGVNIFQKAYVLIPIHE 148
+ + + +D + F++ R+W + V++F K Y+ +P++
Sbjct: 459 QTEEKHRFHFFNSFFFRKLADLDKDPSSIADGKAAFLRVRKWTRKVDMFGKDYIFVPVNY 518
Query: 149 DLHWSLIIICIPDKG--------DESG--PIILHLDSL-GLHSSQSVFDNIKSYLIEEKK 197
+LHWSLI+IC P + D+S P ILH+DS+ G H+ +++YL EE K
Sbjct: 519 NLHWSLIVICHPGEVANRTDLDLDDSKKVPCILHMDSIKGSHAGLKNL--VQTYLCEEWK 576
Query: 198 YMDRDCVYSDVSIADRIWKCLSRRIEAQVIT--VPQQKNEYDCGLFVLYFIKRFMEEAPE 255
++ D+S SR + + ++ +PQQ+N +DCGLF+L++++ F+ EAP
Sbjct: 577 ERHKE-TSDDIS---------SRFMNLRFVSLELPQQENSFDCGLFLLHYLELFLAEAPL 626
Query: 256 RLKKKDL----DMFSKRWFRPEEASSLRVKIKKLLIEELQN 292
+ + WF P EAS R I+KL+ E L+N
Sbjct: 627 NFSPFKIYNASNFLYLNWFPPAEASLKRTLIQKLIFELLEN 667
>AT1G09730.2 | Symbols: | Cysteine proteinases superfamily protein
| chr1:3148017-3154236 REVERSE LENGTH=931
Length = 931
Score = 132 bits (332), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 88/281 (31%), Positives = 144/281 (51%), Gaps = 40/281 (14%)
Query: 40 LKEAKIYFPSRD-----------DPECVEICYKDTDCLAPEGYLSSTIMNFYIRYLQQQA 88
L + K YFPS D DP+ V IC +D + L PE +++ TI++FYI YL+ Q
Sbjct: 367 LNQQKRYFPSFDEPFEDVVYPKGDPDAVSICKRDVELLQPETFVNDTIIDFYINYLKNQI 426
Query: 89 SLTNSSLSDYHFFNTYFYKKLKEAVSCKQSDRETIFVKFRRWWKGVNIFQKAYVLIPIHE 148
+ + + +D + F++ R+W + V++F K Y+ +P++
Sbjct: 427 QTEEKHRFHFFNSFFFRKLADLDKDPSSIADGKAAFLRVRKWTRKVDMFGKDYIFVPVNY 486
Query: 149 DLHWSLIIICIPDKG--------DESG--PIILHLDSL-GLHSSQSVFDNIKSYLIEEKK 197
+LHWSLI+IC P + D+S P ILH+DS+ G H+ +++YL EE K
Sbjct: 487 NLHWSLIVICHPGEVANRTDLDLDDSKKVPCILHMDSIKGSHAGLKNL--VQTYLCEEWK 544
Query: 198 YMDRDCVYSDVSIADRIWKCLSRRIEAQVIT--VPQQKNEYDCGLFVLYFIKRFMEEAPE 255
++ D+S SR + + ++ +PQQ+N +DCGLF+L++++ F+ EAP
Sbjct: 545 ERHKE-TSDDIS---------SRFMNLRFVSLELPQQENSFDCGLFLLHYLELFLAEAPL 594
Query: 256 RLKKKDL----DMFSKRWFRPEEASSLRVKIKKLLIEELQN 292
+ + WF P EAS R I+KL+ E L+N
Sbjct: 595 NFSPFKIYNASNFLYLNWFPPAEASLKRTLIQKLIFELLEN 635
>AT4G33620.1 | Symbols: | Cysteine proteinases superfamily protein
| chr4:16147692-16152853 FORWARD LENGTH=783
Length = 783
Score = 105 bits (263), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 73/256 (28%), Positives = 126/256 (49%), Gaps = 31/256 (12%)
Query: 52 DPECVEICYKDTDCLAPEGYLSSTIMNFYIRYLQQQASLTNSS-LSDYHFFNTYFYKKLK 110
+P+ V + +D + L P +++ TI++FYI+YL+ + S ++ F L
Sbjct: 301 EPDAVVVRKQDIELLKPRRFINDTIIDFYIKYLKNRISPKERGRFHFFNCFFFRKLANLD 360
Query: 111 EAVSCKQSDRETIFVKFRRWWKGVNIFQKAYVLIPIHEDLHWSLIIICIPD--------- 161
+ RE + + ++W K V++F+K Y+ IPI+ HWSL+IIC P
Sbjct: 361 KGTPSTCGGREA-YQRVQKWTKNVDLFEKDYIFIPINCSFHWSLVIICHPGELVPSHVNF 419
Query: 162 -------KGDESGPIILHLDSLGLHSSQSVFDNIKSYLIEEKKYMDRDCVYSDVSIADRI 214
+ + P ILHLDS+ + + SYL EE K + +D S A
Sbjct: 420 HSFDDEVENPQRVPCILHLDSIKGSHKGGLINIFPSYLREEWKARHENTT-NDSSRA--- 475
Query: 215 WKCLSRRIEAQVITVPQQKNEYDCGLFVLYFIKRFMEEAPER----LKKKDLDMFSKRWF 270
+++ + +PQQ+N +DCGLF+L+++ F+ +AP + L + + ++ WF
Sbjct: 476 -----PNMQSISLELPQQENSFDCGLFLLHYLDLFVAQAPAKFNPSLISRSANFLTRNWF 530
Query: 271 RPEEASSLRVKIKKLL 286
+EAS R I +LL
Sbjct: 531 PAKEASLKRRNILELL 546
>AT4G15880.1 | Symbols: ESD4, ATESD4 | Cysteine proteinases
superfamily protein | chr4:9012769-9015797 FORWARD
LENGTH=489
Length = 489
Score = 76.6 bits (187), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 55/198 (27%), Positives = 87/198 (43%), Gaps = 30/198 (15%)
Query: 56 VEICYKDTDCLAPEGYLSSTIMNFYIRYLQQQASLTNSSLSDYHFFNTYFYKKLKEAVSC 115
++I + CL P +L+ ++N Y+ L+++ + H+FNT+FYKKL
Sbjct: 288 IDITGEVLQCLTPSAWLNDEVINVYLELLKERETREPKKYLKCHYFNTFFYKKL------ 341
Query: 116 KQSDRETIFVKFRRWWK----GVNIFQKAYVLIPIHEDLHWSLIIICIPDKGDESGPIIL 171
SD F RRW G + + +PIH +HW+L +I + +L
Sbjct: 342 -VSDSGYNFKAVRRWTTQRKLGYALIDCDMIFVPIHRGVHWTLAVI------NNRESKLL 394
Query: 172 HLDSLGLHSSQSVFDNIKSYLIEEKKYMDRDCVYSDVSIADRIWKCLSRRIEAQVITVPQ 231
+LDSL + + + Y+ +E S I W V +PQ
Sbjct: 395 YLDSLN-GVDPMILNALAKYMGDEANEK------SGKKIDANSWDM------EFVEDLPQ 441
Query: 232 QKNEYDCGLFVLYFIKRF 249
QKN YDCG+F+L +I F
Sbjct: 442 QKNGYDCGMFMLKYIDFF 459
>AT3G48480.1 | Symbols: | Cysteine proteinases superfamily protein
| chr3:17957326-17959062 REVERSE LENGTH=298
Length = 298
Score = 74.7 bits (182), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 52/142 (36%), Positives = 73/142 (51%), Gaps = 14/142 (9%)
Query: 132 KGVNIFQKAYVLIPIHEDLHWSLIIIC-IPDKGDESGPIILHLDSL-GLHSSQSVFDNIK 189
K IF K YV +PI HW+L+I C + D +L LDSL SSQ + +I+
Sbjct: 149 KTKQIFSKKYVFLPIVYWSHWTLLIFCNFGEDLDSDKTCMLFLDSLQTTDSSQRLEPDIR 208
Query: 190 SYLIEEKKYMDRDCVYSDVSIADRIWKCLSRRIEAQVITVPQQKNEYDCGLFVLYFIKRF 249
++++ + R D S+ D I V VPQQ N+ +CG FVLY+I RF
Sbjct: 209 KFVLDIYRAEGRT---EDSSLVDEI--------PFYVPMVPQQTNDVECGSFVLYYIHRF 257
Query: 250 MEEAPERLKKKDLDMFSKR-WF 270
+E+APE +D+ F K WF
Sbjct: 258 IEDAPENFNVEDMPYFLKEDWF 279
>AT3G06910.1 | Symbols: ULP1A, ELS1, AtULP1a | UB-like protease 1A |
chr3:2178905-2181188 REVERSE LENGTH=502
Length = 502
Score = 74.7 bits (182), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 52/195 (26%), Positives = 93/195 (47%), Gaps = 29/195 (14%)
Query: 56 VEICYKDTDCLAPEGYLSSTIMNFYIRYLQQQASLTNSSLSDYHFFNTYFYKKLKEAVSC 115
++I K CL P +L+ ++N Y+ L+++ + HFFNT+F+ KL + +
Sbjct: 300 IDITGKILRCLKPGKWLNDEVINLYMVLLKEREAREPKKFLKCHFFNTFFFTKLVNSATG 359
Query: 116 KQSDRETIFVKFRRWWK----GVNIFQKAYVLIPIHEDLHWSLIIICIPDKGDESGPIIL 171
+ RRW G ++ + IPIH ++HW+L +I I D+
Sbjct: 360 YN------YGAVRRWTSMKRLGYHLKDCDKIFIPIHMNIHWTLAVINIKDQK------FQ 407
Query: 172 HLDSLGLHSSQSVFDNIKSYLIEEKKYMDRDCVYSDVSIADRIWKCLSRRIEAQVITVPQ 231
+LDS + + D + Y ++E + D+ V DVS W+ + V +P
Sbjct: 408 YLDSFKGREPK-ILDALARYFVDEVR--DKSEVDLDVS----RWR------QEFVQDLPM 454
Query: 232 QKNEYDCGLFVLYFI 246
Q+N +DCG+F++ +I
Sbjct: 455 QRNGFDCGMFMVKYI 469
>AT4G00690.1 | Symbols: ULP1B | UB-like protease 1B |
chr4:281313-283129 FORWARD LENGTH=348
Length = 348
Score = 74.3 bits (181), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/193 (27%), Positives = 90/193 (46%), Gaps = 25/193 (12%)
Query: 56 VEICYKDTDCLAPEGYLSSTIMNFYIRYLQQQASLTNSSLSDYHFFNTYFYKKL--KEAV 113
++I + CL P +L+ + N Y+ L+++ + HFFNT+FY KL
Sbjct: 139 IDISGETLQCLRPNQWLNDDVTNLYLELLKERQTRDPQKYFKCHFFNTFFYVKLVSGSGY 198
Query: 114 SCKQSDRETIFVKFRRWWKGVNIFQKAYVLIPIHEDLHWSLIIICIPDKGDESGPIILHL 173
+ K R T K G ++ + +PIH D+HW+L +I ++ ++L
Sbjct: 199 NYKAVSRWTTKRKL-----GYDLIDCDIIFVPIHIDIHWTLGVINNRERK------FVYL 247
Query: 174 DSLGLHSSQSVFDNIKSYLIEEKKYMDRDCVYSDVSIADRIWKCLSRRIEAQVITVPQQK 233
DSL ++ + + YL++E K + + DVS W V PQQ+
Sbjct: 248 DSLFTGVGHTILNAMAKYLVDEVKQKSQKNI--DVS----SWGM------EYVEERPQQQ 295
Query: 234 NEYDCGLFVLYFI 246
N YDCG+F+L +I
Sbjct: 296 NGYDCGMFMLKYI 308