
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC147012.13 + phase: 0 /pseudo
(459 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAM10110.1| unknown protein [Arabidopsis thaliana] gi|1442339... 242 2e-62
emb|CAE46910.1| SUMO protease [Arabidopsis thaliana] 237 7e-61
emb|CAB45994.1| hypothetical protein [Arabidopsis thaliana] gi|7... 233 1e-59
pir||E71424 hypothetical protein - Arabidopsis thaliana 233 1e-59
emb|CAA72071.1| G14587-5 [Arabidopsis thaliana] gi|2326350|emb|C... 229 1e-58
gb|AAF26995.1| hypothetical protein [Arabidopsis thaliana] 174 4e-42
ref|NP_187347.2| Ulp1 protease family protein [Arabidopsis thali... 174 4e-42
dbj|BAC42191.1| unknown protein [Arabidopsis thaliana] 172 2e-41
ref|NP_918865.1| P0458A05.23 [Oryza sativa (japonica cultivar-gr... 156 1e-36
gb|AAR06297.1| putative sentrin-specific protease [Oryza sativa ... 113 1e-23
emb|CAB80878.1| hypothetical protein [Arabidopsis thaliana] gi|3... 107 8e-22
ref|NP_191978.2| Ulp1 protease family protein [Arabidopsis thali... 107 8e-22
ref|XP_629677.1| hypothetical protein DDB0184304 [Dictyostelium ... 73 2e-11
emb|CAG07397.1| unnamed protein product [Tetraodon nigroviridis] 72 3e-11
ref|NP_701689.1| hypothetical protein PFL1635w [Plasmodium falci... 66 3e-09
gb|EAK88003.1| ULP1 like chllamydin domain containing protease, ... 63 2e-08
ref|XP_139402.3| PREDICTED: similar to sentrin 15 [Mus musculus] 59 3e-07
ref|NP_001016222.1| hypothetical protein LOC548976 [Xenopus trop... 58 7e-07
gb|AAD39580.1| T10O24.20 [Arabidopsis thaliana] 57 9e-07
ref|NP_015305.1| Ubl (ubiquitin-like protein)-specific protease ... 57 2e-06
>gb|AAM10110.1| unknown protein [Arabidopsis thaliana] gi|14423394|gb|AAK62379.1|
Unknown protein [Arabidopsis thaliana]
gi|18414542|ref|NP_567478.1| Ulp1 protease family
protein [Arabidopsis thaliana]
Length = 489
Score = 242 bits (617), Expect = 2e-62
Identities = 151/339 (44%), Positives = 201/339 (58%), Gaps = 26/339 (7%)
Query: 14 HWLSVDHNRPSPNSPTF---KRQKLSTTMSNNTPPPLTTNATVFRISRYPKTKPPLRREI 70
++++ P NSP F K+++ S MS ++ P ++N T+ RISRYP K PLRREI
Sbjct: 16 NFINQQSTNPLRNSPYFQASKKRRFSFAMSEDSGKPASSNPTISRISRYPDAKAPLRREI 75
Query: 71 HAPCSSRPRKFDLSTNLKLTESSNLMASDTTDELMEKYEHAKHAALATISFREKGK---- 126
HAP R +N + +N + KY+ AK +AL + F KGK
Sbjct: 76 HAPSRGILRYGKAKSNDYCEKDANFF--------VRKYDDAKRSALEALRFVNKGKDFVD 127
Query: 127 VKDNI-KEVVVLKDSNVEDVEEFLSVVKDHKLKENGLVCLDDKVEVVNGGIQQQSTSSMV 185
+ D + KE VV DS+V+ +E + D + K+N V V G +
Sbjct: 128 LGDEVEKEEVVSDDSSVQAIE--VIDCDDDEEKKNLQPSFSSGVTDVKKG----ENFRVE 181
Query: 186 DAAKMLDNLSLS--TKNDLSGVCAYKKLLEAVDKRGDTLGRLKFEIQLNEKRKSAFDLLR 243
D + MLD+LSL ND S + AY+KL+++ +KR L L FEI LNEK+ S R
Sbjct: 182 DTSMMLDSLSLDRDVDNDASSLEAYRKLMQSAEKRNSKLEALGFEIVLNEKKLSLLRQSR 241
Query: 244 PKK-ELVEEVPREPFVPLTKEEEVEVTRAFSA-NQNKVLVNHEKSNIEISGKMFRCLRPG 301
PK E EVPREPF+PLT++EE EV RAFS N+ KVL HE SNI+I+G++ +CL P
Sbjct: 242 PKTVEKRVEVPREPFIPLTEDEEAEVYRAFSGRNRRKVLATHENSNIDITGEVLQCLTPS 301
Query: 302 EWLNDEVINLYLELLKERERREPQKFLKCHFFNTFFYKK 340
WLNDEVIN+YLELLKERE REP+K+LKCH+FNTFFYKK
Sbjct: 302 AWLNDEVINVYLELLKERETREPKKYLKCHYFNTFFYKK 340
Score = 69.3 bits (168), Expect = 2e-10
Identities = 36/73 (49%), Positives = 44/73 (59%), Gaps = 21/73 (28%)
Query: 408 DEVKDKTGEDVDISSWETEFVEDLPEQMNG---------------------FVIEHMPYF 446
DE +K+G+ +D +SW+ EFVEDLP+Q NG F EHMPYF
Sbjct: 416 DEANEKSGKKIDANSWDMEFVEDLPQQKNGYDCGMFMLKYIDFFSRGLGLCFSQEHMPYF 475
Query: 447 RLRTAKEILRLKA 459
RLRTAKEILRL+A
Sbjct: 476 RLRTAKEILRLRA 488
>emb|CAE46910.1| SUMO protease [Arabidopsis thaliana]
Length = 489
Score = 237 bits (604), Expect = 7e-61
Identities = 150/339 (44%), Positives = 199/339 (58%), Gaps = 26/339 (7%)
Query: 14 HWLSVDHNRPSPNSPTF---KRQKLSTTMSNNTPPPLTTNATVFRISRYPKTKPPLRREI 70
++++ P NSP F K+++ S MS ++ P ++N T+ RISRYP K PLRREI
Sbjct: 16 NFINQQSTNPLRNSPYFQASKKRRFSFAMSEDSGKPASSNPTISRISRYPDAKAPLRREI 75
Query: 71 HAPCSSRPRKFDLSTNLKLTESSNLMASDTTDELMEKYEHAKHAALATISFREKGK---- 126
HAP R +N + +N + KY+ AK +AL + F KGK
Sbjct: 76 HAPSRGILRYGKAKSNDYCEKDANFF--------VRKYDDAKRSALEALRFVNKGKDFVD 127
Query: 127 VKDNI-KEVVVLKDSNVEDVEEFLSVVKDHKLKENGLVCLDDKVEVVNGGIQQQSTSSMV 185
+ D + KE VV DS+V+ +E + D + K+N V V G +
Sbjct: 128 LGDEVEKEEVVSDDSSVQAIE--VIDCDDDEEKKNLQPSFSSGVTDVKKG----ENFRVE 181
Query: 186 DAAKMLDNLSLS--TKNDLSGVCAYKKLLEAVDKRGDTLGRLKFEIQLNEKRKSAFDLLR 243
D + MLD+LSL ND S + AY+KL+++ +KR L L FEI LNEK+ S R
Sbjct: 182 DTSMMLDSLSLDRDVDNDASSLEAYRKLMQSAEKRNSKLEALGFEIVLNEKKLSLLRQSR 241
Query: 244 PKK-ELVEEVPREPFVPLTKEEEVEVTRAFSA-NQNKVLVNHEKSNIEISGKMFRCLRPG 301
PK E EVPREPF+PLT++EE EV RAFS N+ KVL HE SNI+I+G++ +CL P
Sbjct: 242 PKTVEKRVEVPREPFIPLTEDEEAEVYRAFSGRNRRKVLATHENSNIDITGEVLQCLTPS 301
Query: 302 EWLNDEVINLYLELLKERERREPQKFLKCHFFNTFFYKK 340
WLNDEVIN+YLELLKERE REP K+LKC +FNTFFYKK
Sbjct: 302 AWLNDEVINVYLELLKERETREPPKYLKCLYFNTFFYKK 340
Score = 69.3 bits (168), Expect = 2e-10
Identities = 36/73 (49%), Positives = 44/73 (59%), Gaps = 21/73 (28%)
Query: 408 DEVKDKTGEDVDISSWETEFVEDLPEQMNG---------------------FVIEHMPYF 446
DE +K+G+ +D +SW+ EFVEDLP+Q NG F EHMPYF
Sbjct: 416 DEANEKSGKKIDANSWDMEFVEDLPQQKNGYDCGMFMLKYIDFFSRGLGLCFSQEHMPYF 475
Query: 447 RLRTAKEILRLKA 459
RLRTAKEILRL+A
Sbjct: 476 RLRTAKEILRLRA 488
>emb|CAB45994.1| hypothetical protein [Arabidopsis thaliana]
gi|7268336|emb|CAB78630.1| hypothetical protein
[Arabidopsis thaliana] gi|25371728|pir||H85175
hypothetical protein AT4g15880 [imported] - Arabidopsis
thaliana
Length = 424
Score = 233 bits (593), Expect = 1e-59
Identities = 144/311 (46%), Positives = 187/311 (59%), Gaps = 23/311 (7%)
Query: 39 MSNNTPPPLTTNATVFRISRYPKTKPPLRREIHAPCSSRPRKFDLSTNLKLTESSNLMAS 98
MS ++ P ++N T+ RISRYP K PLRREIHAP R +N + +N
Sbjct: 1 MSEDSGKPASSNPTISRISRYPDAKAPLRREIHAPSRGILRYGKAKSNDYCEKDANFF-- 58
Query: 99 DTTDELMEKYEHAKHAALATISFREKGK----VKDNI-KEVVVLKDSNVEDVEEFLSVVK 153
+ KY+ AK +AL + F KGK + D + KE VV DS+V+ +E +
Sbjct: 59 ------VRKYDDAKRSALEALRFVNKGKDFVDLGDEVEKEEVVSDDSSVQAIE--VIDCD 110
Query: 154 DHKLKENGLVCLDDKVEVVNGGIQQQSTSSMVDAAKMLDNLSLS--TKNDLSGVCAYKKL 211
D + K+N V V G + D + MLD+LSL ND S + AY+KL
Sbjct: 111 DDEEKKNLQPSFSSGVTDVKKG----ENFRVEDTSMMLDSLSLDRDVDNDASSLEAYRKL 166
Query: 212 LEAVDKRGDTLGRLKFEIQLNEKRKSAFDLLRPKK-ELVEEVPREPFVPLTKEEEVEVTR 270
+++ +KR L L FEI LNEK+ S RPK E EVPREPF+PLT++EE EV R
Sbjct: 167 MQSAEKRNSKLEALGFEIVLNEKKLSLLRQSRPKTVEKRVEVPREPFIPLTEDEEAEVYR 226
Query: 271 AFSA-NQNKVLVNHEKSNIEISGKMFRCLRPGEWLNDEVINLYLELLKERERREPQKFLK 329
AFS N+ KVL HE SNI+I+G++ +CL P WLNDEVIN+YLELLKERE REP+K+LK
Sbjct: 227 AFSGRNRRKVLATHENSNIDITGEVLQCLTPSAWLNDEVINVYLELLKERETREPKKYLK 286
Query: 330 CHFFNTFFYKK 340
CH+FNTFFYKK
Sbjct: 287 CHYFNTFFYKK 297
Score = 47.8 bits (112), Expect = 7e-04
Identities = 18/32 (56%), Positives = 26/32 (81%)
Query: 408 DEVKDKTGEDVDISSWETEFVEDLPEQMNGFV 439
DE +K+G+ +D +SW+ EFVEDLP+Q NG+V
Sbjct: 373 DEANEKSGKKIDANSWDMEFVEDLPQQKNGYV 404
>pir||E71424 hypothetical protein - Arabidopsis thaliana
Length = 398
Score = 233 bits (593), Expect = 1e-59
Identities = 144/311 (46%), Positives = 187/311 (59%), Gaps = 23/311 (7%)
Query: 39 MSNNTPPPLTTNATVFRISRYPKTKPPLRREIHAPCSSRPRKFDLSTNLKLTESSNLMAS 98
MS ++ P ++N T+ RISRYP K PLRREIHAP R +N + +N
Sbjct: 1 MSEDSGKPASSNPTISRISRYPDAKAPLRREIHAPSRGILRYGKAKSNDYCEKDANFF-- 58
Query: 99 DTTDELMEKYEHAKHAALATISFREKGK----VKDNI-KEVVVLKDSNVEDVEEFLSVVK 153
+ KY+ AK +AL + F KGK + D + KE VV DS+V+ +E +
Sbjct: 59 ------VRKYDDAKRSALEALRFVNKGKDFVDLGDEVEKEEVVSDDSSVQAIE--VIDCD 110
Query: 154 DHKLKENGLVCLDDKVEVVNGGIQQQSTSSMVDAAKMLDNLSLS--TKNDLSGVCAYKKL 211
D + K+N V V G + D + MLD+LSL ND S + AY+KL
Sbjct: 111 DDEEKKNLQPSFSSGVTDVKKG----ENFRVEDTSMMLDSLSLDRDVDNDASSLEAYRKL 166
Query: 212 LEAVDKRGDTLGRLKFEIQLNEKRKSAFDLLRPKK-ELVEEVPREPFVPLTKEEEVEVTR 270
+++ +KR L L FEI LNEK+ S RPK E EVPREPF+PLT++EE EV R
Sbjct: 167 MQSAEKRNSKLEALGFEIVLNEKKLSLLRQSRPKTVEKRVEVPREPFIPLTEDEEAEVYR 226
Query: 271 AFSA-NQNKVLVNHEKSNIEISGKMFRCLRPGEWLNDEVINLYLELLKERERREPQKFLK 329
AFS N+ KVL HE SNI+I+G++ +CL P WLNDEVIN+YLELLKERE REP+K+LK
Sbjct: 227 AFSGRNRRKVLATHENSNIDITGEVLQCLTPSAWLNDEVINVYLELLKERETREPKKYLK 286
Query: 330 CHFFNTFFYKK 340
CH+FNTFFYKK
Sbjct: 287 CHYFNTFFYKK 297
Score = 38.9 bits (89), Expect = 0.34
Identities = 14/26 (53%), Positives = 21/26 (79%)
Query: 408 DEVKDKTGEDVDISSWETEFVEDLPE 433
DE +K+G+ +D +SW+ EFVEDLP+
Sbjct: 373 DEANEKSGKKIDANSWDMEFVEDLPQ 398
>emb|CAA72071.1| G14587-5 [Arabidopsis thaliana] gi|2326350|emb|CAA72042.1|
hypothetical protein [Arabidopsis thaliana]
Length = 396
Score = 229 bits (585), Expect = 1e-58
Identities = 142/304 (46%), Positives = 183/304 (59%), Gaps = 23/304 (7%)
Query: 46 PLTTNATVFRISRYPKTKPPLRREIHAPCSSRPRKFDLSTNLKLTESSNLMASDTTDELM 105
P ++N T+ RISRYP K PLRREIHAP R +N + +N +
Sbjct: 6 PASSNPTISRISRYPDAKAPLRREIHAPSRGILRYGKAKSNDYCEKDANFF--------V 57
Query: 106 EKYEHAKHAALATISFREKGK----VKDNI-KEVVVLKDSNVEDVEEFLSVVKDHKLKEN 160
KY+ AK +AL + F KGK + D + KE VV DS+V+ +E + D + K+N
Sbjct: 58 RKYDDAKRSALEALRFVNKGKDFVDLGDEVEKEEVVSDDSSVQAIE--VIDCDDDEEKKN 115
Query: 161 GLVCLDDKVEVVNGGIQQQSTSSMVDAAKMLDNLSLS--TKNDLSGVCAYKKLLEAVDKR 218
V V G + D + MLD+LSL ND S + AY+KL+++ +KR
Sbjct: 116 LQPSFSSGVTDVKKG----ENFRVEDTSMMLDSLSLDRDVDNDASSLEAYRKLMQSAEKR 171
Query: 219 GDTLGRLKFEIQLNEKRKSAFDLLRPKK-ELVEEVPREPFVPLTKEEEVEVTRAFSA-NQ 276
L L FEI LNEK+ S RPK E EVPREPF+PLT++EE EV RAFS N+
Sbjct: 172 NSKLEALGFEIVLNEKKLSLLRQSRPKTVEKRVEVPREPFIPLTEDEEAEVYRAFSGRNR 231
Query: 277 NKVLVNHEKSNIEISGKMFRCLRPGEWLNDEVINLYLELLKERERREPQKFLKCHFFNTF 336
KVL HE SNI+I+G++ +CL P WLNDEVIN+YLELLKERE REP+K+LKCH+FNTF
Sbjct: 232 RKVLATHENSNIDITGEVLQCLTPSAWLNDEVINVYLELLKERETREPKKYLKCHYFNTF 291
Query: 337 FYKK 340
FYKK
Sbjct: 292 FYKK 295
Score = 38.9 bits (89), Expect = 0.34
Identities = 14/26 (53%), Positives = 21/26 (79%)
Query: 408 DEVKDKTGEDVDISSWETEFVEDLPE 433
DE +K+G+ +D +SW+ EFVEDLP+
Sbjct: 371 DEANEKSGKKIDANSWDMEFVEDLPQ 396
>gb|AAF26995.1| hypothetical protein [Arabidopsis thaliana]
Length = 478
Score = 174 bits (442), Expect = 4e-42
Identities = 114/293 (38%), Positives = 164/293 (55%), Gaps = 17/293 (5%)
Query: 56 ISRYPKTKPPLRREIHAPCSSRPRKFDLSTNLKLTESSNLMASDTTDELMEKYEHAKHAA 115
I RYP+ K LRR++HAP D ST S N++ + T +A ++
Sbjct: 45 IYRYPEVKSSLRRQVHAPVRILNSGRDRSTR---QGSGNVLGTFLTRNNDMWKRNALDSS 101
Query: 116 LATISFREKGKVKDNIKEVVVLKDSN----VEDVEEFLSVVKDHKLKENGLVCLDDKVEV 171
L + RE V D + +V ++ D VE+V + V++ NGL +V
Sbjct: 102 LRYRTDREVIDVDDELGDVEMISDDTSREGVENVAMEVDEVEEKAEMGNGLF---SEVAS 158
Query: 172 VNGGIQQQSTSSMVDAAKMLDNLSLSTKNDLSGVCAYKKLLEAVDKRGDTLGRLKFEIQL 231
+ G + S +++ ++ N ++ D++ AY+K+LE+ R L F
Sbjct: 159 LKNGSLRVGECSKANSSSLVVNRPVT---DVTSFEAYRKVLESAVNRTSKLKDRGFVDFF 215
Query: 232 NEKRKSAFDLL----RPKKELVEEVPREPFVPLTKEEEVEVTRAFSANQNKVLVNHEKSN 287
E+ ++ L R +E VE V RE FVPL++EEE V RAFSAN + +LV H+ SN
Sbjct: 216 KERGRALLRSLSSFWRQDEEPVEVVQREAFVPLSREEETAVRRAFSANDSNILVTHKNSN 275
Query: 288 IEISGKMFRCLRPGEWLNDEVINLYLELLKERERREPQKFLKCHFFNTFFYKK 340
I+I+GK+ RCL+PG+WLNDEVINLY+ LLKERE REP+KFLKCHFFNTFF+ K
Sbjct: 276 IDITGKILRCLKPGKWLNDEVINLYMVLLKEREAREPKKFLKCHFFNTFFFTK 328
Score = 63.5 bits (153), Expect = 1e-08
Identities = 35/73 (47%), Positives = 41/73 (55%), Gaps = 21/73 (28%)
Query: 408 DEVKDKTGEDVDISSWETEFVEDLPEQMNG---------------------FVIEHMPYF 446
DEV+DK+ D+D+S W EFV+DLP Q NG F E MPYF
Sbjct: 405 DEVRDKSEVDLDVSRWRQEFVQDLPMQRNGFDCGMFMVKYIDFYSRGLDLCFTQEQMPYF 464
Query: 447 RLRTAKEILRLKA 459
R RTAKEIL+LKA
Sbjct: 465 RARTAKEILQLKA 477
>ref|NP_187347.2| Ulp1 protease family protein [Arabidopsis thaliana]
Length = 502
Score = 174 bits (442), Expect = 4e-42
Identities = 114/293 (38%), Positives = 164/293 (55%), Gaps = 17/293 (5%)
Query: 56 ISRYPKTKPPLRREIHAPCSSRPRKFDLSTNLKLTESSNLMASDTTDELMEKYEHAKHAA 115
I RYP+ K LRR++HAP D ST S N++ + T +A ++
Sbjct: 69 IYRYPEVKSSLRRQVHAPVRILNSGRDRSTR---QGSGNVLGTFLTRNNDMWKRNALDSS 125
Query: 116 LATISFREKGKVKDNIKEVVVLKDSN----VEDVEEFLSVVKDHKLKENGLVCLDDKVEV 171
L + RE V D + +V ++ D VE+V + V++ NGL +V
Sbjct: 126 LRYRTDREVIDVDDELGDVEMISDDTSREGVENVAMEVDEVEEKAEMGNGLF---SEVAS 182
Query: 172 VNGGIQQQSTSSMVDAAKMLDNLSLSTKNDLSGVCAYKKLLEAVDKRGDTLGRLKFEIQL 231
+ G + S +++ ++ N ++ D++ AY+K+LE+ R L F
Sbjct: 183 LKNGSLRVGECSKANSSSLVVNRPVT---DVTSFEAYRKVLESAVNRTSKLKDRGFVDFF 239
Query: 232 NEKRKSAFDLL----RPKKELVEEVPREPFVPLTKEEEVEVTRAFSANQNKVLVNHEKSN 287
E+ ++ L R +E VE V RE FVPL++EEE V RAFSAN + +LV H+ SN
Sbjct: 240 KERGRALLRSLSSFWRQDEEPVEVVQREAFVPLSREEETAVRRAFSANDSNILVTHKNSN 299
Query: 288 IEISGKMFRCLRPGEWLNDEVINLYLELLKERERREPQKFLKCHFFNTFFYKK 340
I+I+GK+ RCL+PG+WLNDEVINLY+ LLKERE REP+KFLKCHFFNTFF+ K
Sbjct: 300 IDITGKILRCLKPGKWLNDEVINLYMVLLKEREAREPKKFLKCHFFNTFFFTK 352
Score = 63.5 bits (153), Expect = 1e-08
Identities = 35/73 (47%), Positives = 41/73 (55%), Gaps = 21/73 (28%)
Query: 408 DEVKDKTGEDVDISSWETEFVEDLPEQMNG---------------------FVIEHMPYF 446
DEV+DK+ D+D+S W EFV+DLP Q NG F E MPYF
Sbjct: 429 DEVRDKSEVDLDVSRWRQEFVQDLPMQRNGFDCGMFMVKYIDFYSRGLDLCFTQEQMPYF 488
Query: 447 RLRTAKEILRLKA 459
R RTAKEIL+LKA
Sbjct: 489 RARTAKEILQLKA 501
>dbj|BAC42191.1| unknown protein [Arabidopsis thaliana]
Length = 502
Score = 172 bits (436), Expect = 2e-41
Identities = 113/293 (38%), Positives = 163/293 (55%), Gaps = 17/293 (5%)
Query: 56 ISRYPKTKPPLRREIHAPCSSRPRKFDLSTNLKLTESSNLMASDTTDELMEKYEHAKHAA 115
I RYP+ K LRR++HAP D ST S N++ + T +A ++
Sbjct: 69 IYRYPEVKSSLRRQVHAPVRILNSGRDRSTR---QGSGNVLGTFLTRNNDMWKRNALDSS 125
Query: 116 LATISFREKGKVKDNIKEVVVLKDSN----VEDVEEFLSVVKDHKLKENGLVCLDDKVEV 171
L + RE V D + +V ++ D VE+V + V++ NGL +V
Sbjct: 126 LRYRTDREVIDVDDELGDVEMISDDTSREGVENVAMEVDEVEEKAEMGNGLF---SEVAS 182
Query: 172 VNGGIQQQSTSSMVDAAKMLDNLSLSTKNDLSGVCAYKKLLEAVDKRGDTLGRLKFEIQL 231
+ G + S +++ ++ N ++ D++ AY+K+LE+ R L F
Sbjct: 183 LKNGSLRVGECSKANSSSLVVNRPVT---DVTSFEAYRKVLESAVNRTSKLKDRGFVDFF 239
Query: 232 NEKRKSAFDLL----RPKKELVEEVPREPFVPLTKEEEVEVTRAFSANQNKVLVNHEKSN 287
E+ ++ L R +E VE V RE FVPL++EEE V RAFSAN + +LV H+ SN
Sbjct: 240 KERGRALLRSLSSFWRQDEEPVEVVQREAFVPLSREEETAVRRAFSANDSNILVTHKNSN 299
Query: 288 IEISGKMFRCLRPGEWLNDEVINLYLELLKERERREPQKFLKCHFFNTFFYKK 340
I+I+GK+ RCL+PG+WLNDEVINLY+ LLKERE REP+KFLKCHFFNTF + K
Sbjct: 300 IDITGKILRCLKPGKWLNDEVINLYMVLLKEREAREPKKFLKCHFFNTFIFTK 352
Score = 63.5 bits (153), Expect = 1e-08
Identities = 35/73 (47%), Positives = 41/73 (55%), Gaps = 21/73 (28%)
Query: 408 DEVKDKTGEDVDISSWETEFVEDLPEQMNG---------------------FVIEHMPYF 446
DEV+DK+ D+D+S W EFV+DLP Q NG F E MPYF
Sbjct: 429 DEVRDKSEVDLDVSRWRQEFVQDLPMQRNGFDCGMFMVKYIDFYSRGLDLCFTQEQMPYF 488
Query: 447 RLRTAKEILRLKA 459
R RTAKEIL+LKA
Sbjct: 489 RARTAKEILQLKA 501
>ref|NP_918865.1| P0458A05.23 [Oryza sativa (japonica cultivar-group)]
gi|15408679|dbj|BAB64088.1| putative SUMO protease
[Oryza sativa (japonica cultivar-group)]
gi|11875201|dbj|BAB19414.1| putative SUMO protease
[Oryza sativa (japonica cultivar-group)]
Length = 497
Score = 156 bits (394), Expect = 1e-36
Identities = 120/299 (40%), Positives = 154/299 (51%), Gaps = 63/299 (21%)
Query: 208 YKKLLEAVDKRGDT-LGRLKFEIQLNEKRKSAFDLLRPKKELVEEVPR-------EPFVP 259
YK+L EA +R D L L+FE++L EK + + +L + +PR EPFVP
Sbjct: 214 YKELYEASSRRRDAKLRTLEFEVRLAEKGRLGLE------QLADVLPRFGPRKEDEPFVP 267
Query: 260 LTKEEEVEVTRAFSA-NQNKVLVNHEKSNIEISGKMFRCLRPGEWLNDEVINLYLELLKE 318
T E+E V A N+ + LV HE SNI I+ + +CL EWLNDEVINLYLELLKE
Sbjct: 268 FTDEDEDSVDHALGGRNRRERLVVHESSNIVITRETLQCLNETEWLNDEVINLYLELLKE 327
Query: 319 RERREPQKFLKCHFFNTFFYKKQECL*LQISEEMDYSKEIGIWPT*M**NICTYS--PGD 376
RE REP KFLKCHFFNTFFYKK I+ DY K + W T YS D
Sbjct: 328 RELREPNKFLKCHFFNTFFYKK------LITGGYDY-KSVRRWTT---KRKLGYSLLECD 377
Query: 377 PLVLSCYQ*EGKEISVS*FTERN*H--RSAGS-------------ADEVKDKTGEDVDIS 421
+ + ++ ++V ++ S GS DEVKDK+G+ +D
Sbjct: 378 KIFVPIHKEVHWCLAVINIRDKKFQFLDSLGSMDMKALRTLARYLVDEVKDKSGQHIDAL 437
Query: 422 SWETEFVEDLPEQMNG---------------------FVIEHMPYFRLRTAKEILRLKA 459
SW+ E V++LP Q NG F +HM YFR RTAKEIL L+A
Sbjct: 438 SWKQEGVKNLPLQENGWDCGMFMLKYIDFYSRDMGLTFGQKHMHYFRKRTAKEILNLRA 496
>gb|AAR06297.1| putative sentrin-specific protease [Oryza sativa (japonica
cultivar-group)] gi|50916459|ref|XP_468626.1| putative
sentrin-specific protease [Oryza sativa (japonica
cultivar-group)]
Length = 397
Score = 113 bits (283), Expect = 1e-23
Identities = 61/113 (53%), Positives = 74/113 (64%), Gaps = 4/113 (3%)
Query: 235 RKSAFDLLR--PKKELVEEVPREPFVPLTKEEEVEVTRAF--SANQNKVLVNHEKSNIEI 290
RKS+ D + P + ++ E F PLT EEE EV S K++V H SNI+I
Sbjct: 138 RKSSSDRGKAPPADDTAKQDLCEVFTPLTNEEESEVNNILYGSDQSKKIIVMHGPSNIDI 197
Query: 291 SGKMFRCLRPGEWLNDEVINLYLELLKERERREPQKFLKCHFFNTFFYKKQEC 343
+ + CLR WLNDEVINLYLELLKER +REP++FLKCHFFNTFFYKK C
Sbjct: 198 TKEKIWCLRTCNWLNDEVINLYLELLKERAQREPKRFLKCHFFNTFFYKKLAC 250
Score = 47.4 bits (111), Expect = 0.001
Identities = 30/73 (41%), Positives = 34/73 (46%), Gaps = 21/73 (28%)
Query: 408 DEVKDKTGEDVDISSWETEFVEDLPEQMNG---------------------FVIEHMPYF 446
DE+ DK+ VD SSW + P Q NG F EHM YF
Sbjct: 324 DELNDKSNIQVDTSSWLKISSDSCPLQQNGWDCGMFMLKFIDFHSRGIGLCFTQEHMDYF 383
Query: 447 RLRTAKEILRLKA 459
R RTAKEILRL+A
Sbjct: 384 RKRTAKEILRLRA 396
>emb|CAB80878.1| hypothetical protein [Arabidopsis thaliana]
gi|3047118|gb|AAC13629.1| F6N23.7 gene product
[Arabidopsis thaliana] gi|7486563|pir||T01233
hypothetical protein F6N23.7 - Arabidopsis thaliana
Length = 233
Score = 107 bits (267), Expect = 8e-22
Identities = 45/66 (68%), Positives = 57/66 (86%)
Query: 275 NQNKVLVNHEKSNIEISGKMFRCLRPGEWLNDEVINLYLELLKERERREPQKFLKCHFFN 334
N+ K+LV+H+ SNI+ISG+ +CLRP +WLND+V NLYLELLKER+ R+PQK+ KCHFFN
Sbjct: 20 NRKKILVSHKNSNIDISGETLQCLRPNQWLNDDVTNLYLELLKERQTRDPQKYFKCHFFN 79
Query: 335 TFFYKK 340
TFFY K
Sbjct: 80 TFFYVK 85
Score = 59.3 bits (142), Expect = 2e-07
Identities = 32/80 (40%), Positives = 44/80 (55%), Gaps = 28/80 (35%)
Query: 408 DEVKDKTGEDVDISSWETEFVEDLPEQMNGF----------------------------V 439
DEVK K+ +++D+SSW E+VE+ P+Q NG+ +
Sbjct: 153 DEVKQKSQKNIDVSSWGMEYVEERPQQQNGYDCGMFMLKYIDFYSRGLSLQFSQVIRDVI 212
Query: 440 IEHMPYFRLRTAKEILRLKA 459
+ MPYFRLRTAKEILRL+A
Sbjct: 213 KKDMPYFRLRTAKEILRLRA 232
>ref|NP_191978.2| Ulp1 protease family protein [Arabidopsis thaliana]
Length = 242
Score = 107 bits (267), Expect = 8e-22
Identities = 45/66 (68%), Positives = 57/66 (86%)
Query: 275 NQNKVLVNHEKSNIEISGKMFRCLRPGEWLNDEVINLYLELLKERERREPQKFLKCHFFN 334
N+ K+LV+H+ SNI+ISG+ +CLRP +WLND+V NLYLELLKER+ R+PQK+ KCHFFN
Sbjct: 20 NRKKILVSHKNSNIDISGETLQCLRPNQWLNDDVTNLYLELLKERQTRDPQKYFKCHFFN 79
Query: 335 TFFYKK 340
TFFY K
Sbjct: 80 TFFYVK 85
Score = 59.3 bits (142), Expect = 2e-07
Identities = 32/80 (40%), Positives = 44/80 (55%), Gaps = 28/80 (35%)
Query: 408 DEVKDKTGEDVDISSWETEFVEDLPEQMNGF----------------------------V 439
DEVK K+ +++D+SSW E+VE+ P+Q NG+ +
Sbjct: 162 DEVKQKSQKNIDVSSWGMEYVEERPQQQNGYDCGMFMLKYIDFYSRGLSLQFSQVIRDVI 221
Query: 440 IEHMPYFRLRTAKEILRLKA 459
+ MPYFRLRTAKEILRL+A
Sbjct: 222 KKDMPYFRLRTAKEILRLRA 241
>ref|XP_629677.1| hypothetical protein DDB0184304 [Dictyostelium discoideum]
gi|60463074|gb|EAL61269.1| hypothetical protein
DDB0184304 [Dictyostelium discoideum]
Length = 769
Score = 72.8 bits (177), Expect = 2e-11
Identities = 46/157 (29%), Positives = 79/157 (50%), Gaps = 23/157 (14%)
Query: 207 AYKKLLEAVDKRGDTLGRLKFEIQLNEKRKSAFDLLRPKKELVEEVPREPFVPLTKEEEV 266
+++ L++ +++ D++ R K E Q+ ++R L K E + + + T E+ +
Sbjct: 465 SFESLIDQIEQNDDSIFRNKMEKQIQKERDQEIGSLLQKLESLSSSSNKYNLFKTIEQRM 524
Query: 267 EVT-------RAFSANQNKVLVNHEKSN-----------IEISGKMFRCLRPGEWLNDEV 308
E+ R + +N+++ + K+ E+ R L PG+WLNDEV
Sbjct: 525 EMEKMKKPTLRQLTQQENQIIDDIFKNGRPDDMISELPLAEVRRSDVRLLSPGKWLNDEV 584
Query: 309 INLYLELLKERERREPQ-----KFLKCHFFNTFFYKK 340
IN Y+E+LK R+ + + F KCHFFNTFFY K
Sbjct: 585 INFYMEVLKIRDAEKKKISGNNSFPKCHFFNTFFYPK 621
>emb|CAG07397.1| unnamed protein product [Tetraodon nigroviridis]
Length = 314
Score = 72.4 bits (176), Expect = 3e-11
Identities = 76/258 (29%), Positives = 114/258 (43%), Gaps = 69/258 (26%)
Query: 245 KKELVEEVPREPFVPLTKEEEVEVTRAFSA-NQNKVLVNHEKSNIEISGKMFRCLRPGEW 303
+ +++EE P P LT+E EVEV RA N ++VL E + ++ K + L W
Sbjct: 80 ESKVLEETPEFP--DLTEEMEVEVNRALKGGNCHEVL--SEGFGLSLTRKDLQTLSNLNW 135
Query: 304 LNDEVINLYLELLKERERREPQKFLKCHFFNTFFYKKQE----CL*LQISEEMD-YSKEI 358
LNDEVIN Y+ LL ER ++ F + FNTFFY K C + +++ D +SK+I
Sbjct: 136 LNDEVINFYMNLLMERSQK--PNFPSVNAFNTFFYPKLRKSGYCAVRRWTKKTDIFSKDI 193
Query: 359 GIWPT*M**NICTYSPGDPLVLSCYQ*EGKEISVS*FTERN--*HRSAGS---------- 406
+ P + + C +SV F +R+ + S G
Sbjct: 194 LLVPIHLGVHWC-------------------LSVVDFRKRSIMYYDSMGGKNDEACRVLL 234
Query: 407 ---ADEVKDKTGEDVDISSWETEFVE--DLPEQMNG---------------------FVI 440
+E KDK G+++D + W E ++P+QMNG F
Sbjct: 235 EYLKEESKDKKGKEMDTTGWILHSKERHEIPQQMNGSDCGMFTCKYAEYITKEKPIKFTQ 294
Query: 441 EHMPYFRLRTAKEILRLK 458
HMPYFR R E++ K
Sbjct: 295 RHMPYFRRRMVWELVNRK 312
>ref|NP_701689.1| hypothetical protein PFL1635w [Plasmodium falciparum 3D7]
gi|23496860|gb|AAN36413.1| hypothetical protein PFL1635w
[Plasmodium falciparum 3D7]
Length = 1016
Score = 65.9 bits (159), Expect = 3e-09
Identities = 98/398 (24%), Positives = 165/398 (40%), Gaps = 73/398 (18%)
Query: 84 STNLKLTESSNLMASDTTDELMEKYEHAKHAALATISFREKGKVKDNIKEVVVLKDSNVE 143
STNL E N A D L + K+ SF ++ D K V K E
Sbjct: 615 STNLNKEEVFNEEAIKDNDILKKLRYEYKNLINIIDSFIDETLNYDLDKNSPVSKLHLDE 674
Query: 144 DVEEFLSVVKDHKLKENGLVCLDDKVEVVNGGIQQQSTSSMVDAAKMLDNLSLSTKNDLS 203
++ + KDH L +D EV N I +SM K +++ S KN
Sbjct: 675 KNDKMCNESKDHFLNNKNNFKDNDADEVYNN-IDDDYKNSM----KFIEDTSSEDKNKYV 729
Query: 204 GVCAYKK--LLEAVDKRGDTLGRLKFEIQLNEKRKSAFDLLRPKKELVEEVPREPFVPLT 261
+ Y + L+EA++K +++++K+K L++ K++ E++ ++ + T
Sbjct: 730 -ILKYDEDSLIEALEK-----------LRIDKKKKDK--LIKLKEKYPEDIEKDAYDDET 775
Query: 262 KEEEVEVTRAFSAN----------------QNKVLVNHEKSNIEISGKMFRCLRPGEWLN 305
K+++++ F + +N+VL+ EK N+ + +CL WLN
Sbjct: 776 KKKKIDKNIFFKCSKKEYYEKAIIILNEKIENRVLI--EKFNVPLLYSQIKCLIDTRWLN 833
Query: 306 DEVINLYLELLKERERREPQK-----FLKCHFFNTFFYKKQECL*LQISEEMDYSKEIGI 360
DEVIN YL +L+E + + K F+TFF++ L + +YSK +
Sbjct: 834 DEVINFYLSMLQEYNEQHTKNNSLTFIPKIFTFSTFFFQS-----LNFNGSYNYSK-VSR 887
Query: 361 WPT*M**NICTYS--------PGDPLVLSCYQ*EGKEI--------SVS*FTERN*HRSA 404
W +I ++ G+ L + K+I S F E +
Sbjct: 888 WTKRKQVDIFSFDLILIPLHVGGNHWTLGSIHMKDKKICLYDSLNGSNKKFFE---YMRR 944
Query: 405 GSADEVKDKTGEDVDISSW----ETEFVEDLPEQMNGF 438
DE+KDK +D+DIS W E + +P Q NG+
Sbjct: 945 YIVDEMKDKKQKDLDISLWTYSKEGVSEKGIPHQENGY 982
>gb|EAK88003.1| ULP1 like chllamydin domain containing protease, transcripts
identified by EST [Cryptosporidium parvum]
gi|66358078|ref|XP_626217.1| ULP1 like chllamydin domain
containing protease, transcripts identified by EST
[Cryptosporidium parvum]
Length = 456
Score = 63.2 bits (152), Expect = 2e-08
Identities = 52/177 (29%), Positives = 89/177 (49%), Gaps = 23/177 (12%)
Query: 191 LDNLSLSTKNDLSGVCAYKKLLEAVDKRGDTLGRLKFEIQLNEKRKSAFDLLRPKKELVE 250
L++L+L +N +KKL+ +D +FE Q ++++ F L K
Sbjct: 153 LNSLALCERNLEDKRKEFKKLVYGLDDS-------QFEEQESKEKDELFVSLGVIKY--- 202
Query: 251 EVPREPFVPLTKEEEVEVTRAFS---ANQNKVLVNHEKSNIEISGKMFRCLRPGEWLNDE 307
+ P E +EE+ R++ +N+ +++ + KSNIE++ + +CLR +WLNDE
Sbjct: 203 KYPIE-----CSDEELNKARSYLNSLSNRGQIVAINYKSNIELTIDLIQCLRSQQWLNDE 257
Query: 308 VINLYLELLKERERREPQKFL--KCHFFNTFFYKKQECL*LQISEEMDYS-KEIGIW 361
+IN Y +L+ER R+ K +N+FFY K C Q ++E Y K + W
Sbjct: 258 LINFYFSMLQERNDRQTSNGFKPKVWLWNSFFYTKLTC--DQSNDETGYCYKNVSRW 312
>ref|XP_139402.3| PREDICTED: similar to sentrin 15 [Mus musculus]
Length = 625
Score = 58.9 bits (141), Expect = 3e-07
Identities = 41/124 (33%), Positives = 64/124 (51%), Gaps = 13/124 (10%)
Query: 218 RGDTLGRLKFEIQLNEKRKSAFDLLRPKKELVEEVPREPFVPLTKEEEVEVTRAFSAN-Q 276
RGD+L K + EK+ P + + + + +T++ E E+ A Q
Sbjct: 370 RGDSLLPNKMAVLAAEKK--------PPTDQEKGREMDQILDITEDMEKEIENALGPGPQ 421
Query: 277 NKVLVNHEKSNIEISGKMFRCLRPGEWLNDEVINLYLELLKERERREPQKFLKCHFFNTF 336
+++L++ K ++IS + L G+WLNDEVIN Y+ LL ER E Q + H F+TF
Sbjct: 422 DEILISRFK--LQISRGDIQTLENGQWLNDEVINFYMNLLV--ERNENQGYPALHVFSTF 477
Query: 337 FYKK 340
FY K
Sbjct: 478 FYPK 481
>ref|NP_001016222.1| hypothetical protein LOC548976 [Xenopus tropicalis]
Length = 511
Score = 57.8 bits (138), Expect = 7e-07
Identities = 37/98 (37%), Positives = 54/98 (54%), Gaps = 5/98 (5%)
Query: 257 FVPLTKEEEVEVTRA-FSANQNKVLVNHEKSNIEISGKMFRCLRPGEWLNDEVINLYLEL 315
F LT+ E E+ RA F +Q++ L E + I+ K L WLNDE+IN Y+ L
Sbjct: 390 FPELTEAMEREIKRALFGGSQDQTL--SEGYRLTITRKDIMTLHSLNWLNDEIINFYMNL 447
Query: 316 LKERERREPQKFLKCHFFNTFFYKKQECL*LQISEEMD 353
+ ER +R + K H FNTFF+ + + + SE+MD
Sbjct: 448 IMERSKR--KGLPKVHAFNTFFFYQVKICWISSSEKMD 483
>gb|AAD39580.1| T10O24.20 [Arabidopsis thaliana]
Length = 582
Score = 57.4 bits (137), Expect = 9e-07
Identities = 29/86 (33%), Positives = 43/86 (49%), Gaps = 8/86 (9%)
Query: 255 EPFVPLTKEEEVEVTRAFSANQNKVLVNHEKSNIEISGKMFRCLRPGEWLNDEVINLYLE 314
EP P+ EE E+ ++ + +++S K +CL PGE+L VIN Y+
Sbjct: 319 EPLSPMVVEEACELPEGDQSDGRDL--------VQVSLKDLKCLSPGEYLTSPVINFYIR 370
Query: 315 LLKERERREPQKFLKCHFFNTFFYKK 340
++ + CHFFNTFFYKK
Sbjct: 371 YVQHHVFSADKTAANCHFFNTFFYKK 396
>ref|NP_015305.1| Ubl (ubiquitin-like protein)-specific protease that cleaves Smt3p
protein conjugates; specifically required for cell cycle
progression; associates with nucleoporins and may
interact with septin rings during telophase; Ulp1p
[Saccharomyces cerevisiae]
gi|17380332|sp|Q02724|ULP1_YEAST Ubiquitin-like-specific
protease 1 gi|1039457|gb|AAB68167.1| Lpb11p
Length = 621
Score = 56.6 bits (135), Expect = 2e-06
Identities = 62/278 (22%), Positives = 126/278 (45%), Gaps = 59/278 (21%)
Query: 97 ASDTTDELMEKYEHAKHAALATISFREKGKVKDNIKEVVVLKDSNVEDVE---------- 146
+ + T + Y+ ++ T R+K K NI ++ + E+V
Sbjct: 222 SENNTSDQKNSYDRRQYG---TAFIRKKKVAKQNINNTKLVSRAQSEEVTYLRQIFNGEY 278
Query: 147 EFLSVVKDHKLKENGLVCLDDKVEVVNGGIQQQSTSSMVDAAKMLDNL-------SLSTK 199
+ ++K+ + ++ L+ +D + + G+++ S++D + + + L T+
Sbjct: 279 KVPKILKEERERQLKLMDMDKEKDT---GLKK----SIIDLTEKIKTILIENNKNRLQTR 331
Query: 200 NDLSGVCAY---KKLLEAVDKRGDTLG-RLKFEIQ--------------LNEKRKSAFDL 241
N+ + KK+ K D L +LKF+ LNE++K DL
Sbjct: 332 NENDDDLVFVKEKKISSLERKHKDYLNQKLKFDRSILEFEKDFKRYNEILNERKKIQEDL 391
Query: 242 LRPKKELVEEVPREPFVP-LTKEEEVEVTRAFSANQNKVLVNHEKSNIEISGKMFRCLRP 300
+ K++L ++ VP L ++++ +V +A ++ +N L+N + NIEI+ + F+ L P
Sbjct: 392 KKKKEQLA----KKKLVPELNEKDDDQVQKALASRENTQLMNRD--NIEITVRDFKTLAP 445
Query: 301 GEWLNDEVINLYLELLKERERREPQKFLKCHFFNTFFY 338
WLND +I +++ + E+ P FN+FFY
Sbjct: 446 RRWLNDTIIEFFMKYI---EKSTPNTVA----FNSFFY 476
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.324 0.138 0.410
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 752,848,144
Number of Sequences: 2540612
Number of extensions: 30935751
Number of successful extensions: 127604
Number of sequences better than 10.0: 404
Number of HSP's better than 10.0 without gapping: 74
Number of HSP's successfully gapped in prelim test: 333
Number of HSP's that attempted gapping in prelim test: 127121
Number of HSP's gapped (non-prelim): 618
length of query: 459
length of database: 863,360,394
effective HSP length: 131
effective length of query: 328
effective length of database: 530,540,222
effective search space: 174017192816
effective search space used: 174017192816
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 77 (34.3 bits)
Medicago: description of AC147012.13