Miyakogusa Predicted Gene

Lj0g3v0287359.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0287359.1 Non Characterized Hit- tr|C5Z2Z5|C5Z2Z5_SORBI
Putative uncharacterized protein Sb10g016910
OS=Sorghu,53.41,1e-18,ULP_PROTEASE,Peptidase C48, SUMO/Sentrin/Ubl1;
Cysteine proteinases,NULL; seg,NULL; SENTRIN/SUMO-SPE,CUFF.19208.1
         (322 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G60220.1 | Symbols: OTS1, ULP1D | UB-like protease 1D | chr1:...   347   9e-96
AT1G10570.2 | Symbols: OTS2, ULP1C | Cysteine proteinases superf...   293   8e-80
AT1G10570.1 | Symbols: OTS2, ULP1C | Cysteine proteinases superf...   293   8e-80
AT1G09730.1 | Symbols:  | Cysteine proteinases superfamily prote...   132   3e-31
AT1G09730.2 | Symbols:  | Cysteine proteinases superfamily prote...   132   3e-31
AT4G33620.1 | Symbols:  | Cysteine proteinases superfamily prote...   105   3e-23
AT4G15880.1 | Symbols: ESD4, ATESD4 | Cysteine proteinases super...    77   2e-14
AT3G48480.1 | Symbols:  | Cysteine proteinases superfamily prote...    75   8e-14
AT3G06910.1 | Symbols: ULP1A, ELS1, AtULP1a | UB-like protease 1...    75   8e-14
AT4G00690.1 | Symbols: ULP1B | UB-like protease 1B | chr4:281313...    74   1e-13

>AT1G60220.1 | Symbols: OTS1, ULP1D | UB-like protease 1D |
           chr1:22208332-22211910 FORWARD LENGTH=584
          Length = 584

 Score =  347 bits (889), Expect = 9e-96,   Method: Compositional matrix adjust.
 Identities = 158/286 (55%), Positives = 213/286 (74%), Gaps = 1/286 (0%)

Query: 3   SDGSRSRNGQPIVLXXXXXXXXEPLILENTENKLSEYLKEAKIYFPSRDDPECVEICYKD 62
           S  SR R      +        +P  +     +L E L+E  I +P+RDDP  V++C KD
Sbjct: 291 SQSSRRRKKSEDTVINVDEEEAQPSTVAEQAAELPEGLQE-DICYPTRDDPHFVQVCLKD 349

Query: 63  TDCLAPEGYLSSTIMNFYIRYLQQQASLTNSSLSDYHFFNTYFYKKLKEAVSCKQSDRET 122
            +CLAP  YL+S +MNFY+R+LQQQ S +N   +D HFFNTYFYKKL +AV+ K +D++ 
Sbjct: 350 LECLAPREYLTSPVMNFYMRFLQQQISSSNQISADCHFFNTYFYKKLSDAVTYKGNDKDA 409

Query: 123 IFVKFRRWWKGVNIFQKAYVLIPIHEDLHWSLIIICIPDKGDESGPIILHLDSLGLHSSQ 182
            FV+FRRWWKG+++F+KAY+ IPIHEDLHWSL+I+CIPDK DESG  ILHLDSLGLHS +
Sbjct: 410 FFVRFRRWWKGIDLFRKAYIFIPIHEDLHWSLVIVCIPDKKDESGLTILHLDSLGLHSRK 469

Query: 183 SVFDNIKSYLIEEKKYMDRDCVYSDVSIADRIWKCLSRRIEAQVITVPQQKNEYDCGLFV 242
           S+ +N+K +L +E  Y+++D    D+ I++++WK L RRI   V+ VPQQKN++DCG FV
Sbjct: 470 SIVENVKRFLKDEWNYLNQDDYSLDLPISEKVWKNLPRRISEAVVQVPQQKNDFDCGPFV 529

Query: 243 LYFIKRFMEEAPERLKKKDLDMFSKRWFRPEEASSLRVKIKKLLIE 288
           L+FIKRF+EEAP+RLK+KDL MF K+WFRP+EAS+LR+KI+  LIE
Sbjct: 530 LFFIKRFIEEAPQRLKRKDLGMFDKKWFRPDEASALRIKIRNTLIE 575


>AT1G10570.2 | Symbols: OTS2, ULP1C | Cysteine proteinases
           superfamily protein | chr1:3487639-3491102 FORWARD
           LENGTH=570
          Length = 570

 Score =  293 bits (751), Expect = 8e-80,   Method: Compositional matrix adjust.
 Identities = 139/266 (52%), Positives = 193/266 (72%), Gaps = 6/266 (2%)

Query: 26  PLILENTENKLSEYLKEAKIYFPSRDDPE---CVEICYKDTDCLAPEGYLSSTIMNFYIR 82
           P+++E    +L E L E  IY+PS D  +    V++  KD  CL+P  YL+S ++NFYIR
Sbjct: 299 PMVVEEA-CELPEGLPE-DIYYPSSDQSDGRDLVQVSLKDLKCLSPGEYLTSPVINFYIR 356

Query: 83  YLQQQASLTNSSLSDYHFFNTYFYKKLKEAVSCKQSDRETIFVKFRRWWKGVNIFQKAYV 142
           Y+Q      + + ++ HFFNT+FYKKL EAVS K +DR+  FVKFRRWWKG ++F K+Y+
Sbjct: 357 YVQHHVFSADKTAANCHFFNTFFYKKLTEAVSYKGNDRDAYFVKFRRWWKGFDLFCKSYI 416

Query: 143 LIPIHEDLHWSLIIICIPDKGDESGPIILHLDSLGLHSSQSVFDNIKSYLIEEKKYMDRD 202
            IPIHEDLHWSL+IICIPDK DESG  I+HLDSLGLH    +F+N+K +L EE  Y+++D
Sbjct: 417 FIPIHEDLHWSLVIICIPDKEDESGLTIIHLDSLGLHPRNLIFNNVKRFLREEWNYLNQD 476

Query: 203 CVYSDVSIADRIWKCLSRRIEAQVITVPQQKNEYDCGLFVLYFIKRFMEEAPERLKKKDL 262
               D+ I+ ++W+ L   I    + VPQQKN++DCGLF+L+FI+RF+EEAP+RL  +DL
Sbjct: 477 APL-DLPISAKVWRDLPNMINEAEVQVPQQKNDFDCGLFLLFFIRRFIEEAPQRLTLQDL 535

Query: 263 DMFSKRWFRPEEASSLRVKIKKLLIE 288
            M  K+WF+PEEAS+LR+KI  +L++
Sbjct: 536 KMIHKKWFKPEEASALRIKIWNILVD 561


>AT1G10570.1 | Symbols: OTS2, ULP1C | Cysteine proteinases
           superfamily protein | chr1:3487639-3491102 FORWARD
           LENGTH=571
          Length = 571

 Score =  293 bits (751), Expect = 8e-80,   Method: Compositional matrix adjust.
 Identities = 139/266 (52%), Positives = 193/266 (72%), Gaps = 6/266 (2%)

Query: 26  PLILENTENKLSEYLKEAKIYFPSRDDPE---CVEICYKDTDCLAPEGYLSSTIMNFYIR 82
           P+++E    +L E L E  IY+PS D  +    V++  KD  CL+P  YL+S ++NFYIR
Sbjct: 300 PMVVEEA-CELPEGLPE-DIYYPSSDQSDGRDLVQVSLKDLKCLSPGEYLTSPVINFYIR 357

Query: 83  YLQQQASLTNSSLSDYHFFNTYFYKKLKEAVSCKQSDRETIFVKFRRWWKGVNIFQKAYV 142
           Y+Q      + + ++ HFFNT+FYKKL EAVS K +DR+  FVKFRRWWKG ++F K+Y+
Sbjct: 358 YVQHHVFSADKTAANCHFFNTFFYKKLTEAVSYKGNDRDAYFVKFRRWWKGFDLFCKSYI 417

Query: 143 LIPIHEDLHWSLIIICIPDKGDESGPIILHLDSLGLHSSQSVFDNIKSYLIEEKKYMDRD 202
            IPIHEDLHWSL+IICIPDK DESG  I+HLDSLGLH    +F+N+K +L EE  Y+++D
Sbjct: 418 FIPIHEDLHWSLVIICIPDKEDESGLTIIHLDSLGLHPRNLIFNNVKRFLREEWNYLNQD 477

Query: 203 CVYSDVSIADRIWKCLSRRIEAQVITVPQQKNEYDCGLFVLYFIKRFMEEAPERLKKKDL 262
               D+ I+ ++W+ L   I    + VPQQKN++DCGLF+L+FI+RF+EEAP+RL  +DL
Sbjct: 478 APL-DLPISAKVWRDLPNMINEAEVQVPQQKNDFDCGLFLLFFIRRFIEEAPQRLTLQDL 536

Query: 263 DMFSKRWFRPEEASSLRVKIKKLLIE 288
            M  K+WF+PEEAS+LR+KI  +L++
Sbjct: 537 KMIHKKWFKPEEASALRIKIWNILVD 562


>AT1G09730.1 | Symbols:  | Cysteine proteinases superfamily protein
           | chr1:3148017-3154236 REVERSE LENGTH=963
          Length = 963

 Score =  132 bits (333), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 88/281 (31%), Positives = 144/281 (51%), Gaps = 40/281 (14%)

Query: 40  LKEAKIYFPSRD-----------DPECVEICYKDTDCLAPEGYLSSTIMNFYIRYLQQQA 88
           L + K YFPS D           DP+ V IC +D + L PE +++ TI++FYI YL+ Q 
Sbjct: 399 LNQQKRYFPSFDEPFEDVVYPKGDPDAVSICKRDVELLQPETFVNDTIIDFYINYLKNQI 458

Query: 89  SLTNSSLSDYHFFNTYFYKKLKEAVSCKQSDRETIFVKFRRWWKGVNIFQKAYVLIPIHE 148
                    +     +      +      +D +  F++ R+W + V++F K Y+ +P++ 
Sbjct: 459 QTEEKHRFHFFNSFFFRKLADLDKDPSSIADGKAAFLRVRKWTRKVDMFGKDYIFVPVNY 518

Query: 149 DLHWSLIIICIPDKG--------DESG--PIILHLDSL-GLHSSQSVFDNIKSYLIEEKK 197
           +LHWSLI+IC P +         D+S   P ILH+DS+ G H+       +++YL EE K
Sbjct: 519 NLHWSLIVICHPGEVANRTDLDLDDSKKVPCILHMDSIKGSHAGLKNL--VQTYLCEEWK 576

Query: 198 YMDRDCVYSDVSIADRIWKCLSRRIEAQVIT--VPQQKNEYDCGLFVLYFIKRFMEEAPE 255
              ++    D+S         SR +  + ++  +PQQ+N +DCGLF+L++++ F+ EAP 
Sbjct: 577 ERHKE-TSDDIS---------SRFMNLRFVSLELPQQENSFDCGLFLLHYLELFLAEAPL 626

Query: 256 RLKKKDL----DMFSKRWFRPEEASSLRVKIKKLLIEELQN 292
                 +    +     WF P EAS  R  I+KL+ E L+N
Sbjct: 627 NFSPFKIYNASNFLYLNWFPPAEASLKRTLIQKLIFELLEN 667


>AT1G09730.2 | Symbols:  | Cysteine proteinases superfamily protein
           | chr1:3148017-3154236 REVERSE LENGTH=931
          Length = 931

 Score =  132 bits (332), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 88/281 (31%), Positives = 144/281 (51%), Gaps = 40/281 (14%)

Query: 40  LKEAKIYFPSRD-----------DPECVEICYKDTDCLAPEGYLSSTIMNFYIRYLQQQA 88
           L + K YFPS D           DP+ V IC +D + L PE +++ TI++FYI YL+ Q 
Sbjct: 367 LNQQKRYFPSFDEPFEDVVYPKGDPDAVSICKRDVELLQPETFVNDTIIDFYINYLKNQI 426

Query: 89  SLTNSSLSDYHFFNTYFYKKLKEAVSCKQSDRETIFVKFRRWWKGVNIFQKAYVLIPIHE 148
                    +     +      +      +D +  F++ R+W + V++F K Y+ +P++ 
Sbjct: 427 QTEEKHRFHFFNSFFFRKLADLDKDPSSIADGKAAFLRVRKWTRKVDMFGKDYIFVPVNY 486

Query: 149 DLHWSLIIICIPDKG--------DESG--PIILHLDSL-GLHSSQSVFDNIKSYLIEEKK 197
           +LHWSLI+IC P +         D+S   P ILH+DS+ G H+       +++YL EE K
Sbjct: 487 NLHWSLIVICHPGEVANRTDLDLDDSKKVPCILHMDSIKGSHAGLKNL--VQTYLCEEWK 544

Query: 198 YMDRDCVYSDVSIADRIWKCLSRRIEAQVIT--VPQQKNEYDCGLFVLYFIKRFMEEAPE 255
              ++    D+S         SR +  + ++  +PQQ+N +DCGLF+L++++ F+ EAP 
Sbjct: 545 ERHKE-TSDDIS---------SRFMNLRFVSLELPQQENSFDCGLFLLHYLELFLAEAPL 594

Query: 256 RLKKKDL----DMFSKRWFRPEEASSLRVKIKKLLIEELQN 292
                 +    +     WF P EAS  R  I+KL+ E L+N
Sbjct: 595 NFSPFKIYNASNFLYLNWFPPAEASLKRTLIQKLIFELLEN 635


>AT4G33620.1 | Symbols:  | Cysteine proteinases superfamily protein
           | chr4:16147692-16152853 FORWARD LENGTH=783
          Length = 783

 Score =  105 bits (263), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 73/256 (28%), Positives = 126/256 (49%), Gaps = 31/256 (12%)

Query: 52  DPECVEICYKDTDCLAPEGYLSSTIMNFYIRYLQQQASLTNSS-LSDYHFFNTYFYKKLK 110
           +P+ V +  +D + L P  +++ TI++FYI+YL+ + S         ++ F       L 
Sbjct: 301 EPDAVVVRKQDIELLKPRRFINDTIIDFYIKYLKNRISPKERGRFHFFNCFFFRKLANLD 360

Query: 111 EAVSCKQSDRETIFVKFRRWWKGVNIFQKAYVLIPIHEDLHWSLIIICIPD--------- 161
           +        RE  + + ++W K V++F+K Y+ IPI+   HWSL+IIC P          
Sbjct: 361 KGTPSTCGGREA-YQRVQKWTKNVDLFEKDYIFIPINCSFHWSLVIICHPGELVPSHVNF 419

Query: 162 -------KGDESGPIILHLDSLGLHSSQSVFDNIKSYLIEEKKYMDRDCVYSDVSIADRI 214
                  +  +  P ILHLDS+       + +   SYL EE K    +   +D S A   
Sbjct: 420 HSFDDEVENPQRVPCILHLDSIKGSHKGGLINIFPSYLREEWKARHENTT-NDSSRA--- 475

Query: 215 WKCLSRRIEAQVITVPQQKNEYDCGLFVLYFIKRFMEEAPER----LKKKDLDMFSKRWF 270
                  +++  + +PQQ+N +DCGLF+L+++  F+ +AP +    L  +  +  ++ WF
Sbjct: 476 -----PNMQSISLELPQQENSFDCGLFLLHYLDLFVAQAPAKFNPSLISRSANFLTRNWF 530

Query: 271 RPEEASSLRVKIKKLL 286
             +EAS  R  I +LL
Sbjct: 531 PAKEASLKRRNILELL 546


>AT4G15880.1 | Symbols: ESD4, ATESD4 | Cysteine proteinases
           superfamily protein | chr4:9012769-9015797 FORWARD
           LENGTH=489
          Length = 489

 Score = 76.6 bits (187), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 55/198 (27%), Positives = 87/198 (43%), Gaps = 30/198 (15%)

Query: 56  VEICYKDTDCLAPEGYLSSTIMNFYIRYLQQQASLTNSSLSDYHFFNTYFYKKLKEAVSC 115
           ++I  +   CL P  +L+  ++N Y+  L+++ +         H+FNT+FYKKL      
Sbjct: 288 IDITGEVLQCLTPSAWLNDEVINVYLELLKERETREPKKYLKCHYFNTFFYKKL------ 341

Query: 116 KQSDRETIFVKFRRWWK----GVNIFQKAYVLIPIHEDLHWSLIIICIPDKGDESGPIIL 171
             SD    F   RRW      G  +     + +PIH  +HW+L +I      +     +L
Sbjct: 342 -VSDSGYNFKAVRRWTTQRKLGYALIDCDMIFVPIHRGVHWTLAVI------NNRESKLL 394

Query: 172 HLDSLGLHSSQSVFDNIKSYLIEEKKYMDRDCVYSDVSIADRIWKCLSRRIEAQVITVPQ 231
           +LDSL       + + +  Y+ +E          S   I    W          V  +PQ
Sbjct: 395 YLDSLN-GVDPMILNALAKYMGDEANEK------SGKKIDANSWDM------EFVEDLPQ 441

Query: 232 QKNEYDCGLFVLYFIKRF 249
           QKN YDCG+F+L +I  F
Sbjct: 442 QKNGYDCGMFMLKYIDFF 459


>AT3G48480.1 | Symbols:  | Cysteine proteinases superfamily protein
           | chr3:17957326-17959062 REVERSE LENGTH=298
          Length = 298

 Score = 74.7 bits (182), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 52/142 (36%), Positives = 73/142 (51%), Gaps = 14/142 (9%)

Query: 132 KGVNIFQKAYVLIPIHEDLHWSLIIIC-IPDKGDESGPIILHLDSL-GLHSSQSVFDNIK 189
           K   IF K YV +PI    HW+L+I C   +  D     +L LDSL    SSQ +  +I+
Sbjct: 149 KTKQIFSKKYVFLPIVYWSHWTLLIFCNFGEDLDSDKTCMLFLDSLQTTDSSQRLEPDIR 208

Query: 190 SYLIEEKKYMDRDCVYSDVSIADRIWKCLSRRIEAQVITVPQQKNEYDCGLFVLYFIKRF 249
            ++++  +   R     D S+ D I           V  VPQQ N+ +CG FVLY+I RF
Sbjct: 209 KFVLDIYRAEGRT---EDSSLVDEI--------PFYVPMVPQQTNDVECGSFVLYYIHRF 257

Query: 250 MEEAPERLKKKDLDMFSKR-WF 270
           +E+APE    +D+  F K  WF
Sbjct: 258 IEDAPENFNVEDMPYFLKEDWF 279


>AT3G06910.1 | Symbols: ULP1A, ELS1, AtULP1a | UB-like protease 1A |
           chr3:2178905-2181188 REVERSE LENGTH=502
          Length = 502

 Score = 74.7 bits (182), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 52/195 (26%), Positives = 93/195 (47%), Gaps = 29/195 (14%)

Query: 56  VEICYKDTDCLAPEGYLSSTIMNFYIRYLQQQASLTNSSLSDYHFFNTYFYKKLKEAVSC 115
           ++I  K   CL P  +L+  ++N Y+  L+++ +         HFFNT+F+ KL  + + 
Sbjct: 300 IDITGKILRCLKPGKWLNDEVINLYMVLLKEREAREPKKFLKCHFFNTFFFTKLVNSATG 359

Query: 116 KQSDRETIFVKFRRWWK----GVNIFQKAYVLIPIHEDLHWSLIIICIPDKGDESGPIIL 171
                   +   RRW      G ++     + IPIH ++HW+L +I I D+         
Sbjct: 360 YN------YGAVRRWTSMKRLGYHLKDCDKIFIPIHMNIHWTLAVINIKDQK------FQ 407

Query: 172 HLDSLGLHSSQSVFDNIKSYLIEEKKYMDRDCVYSDVSIADRIWKCLSRRIEAQVITVPQ 231
           +LDS      + + D +  Y ++E +  D+  V  DVS     W+      +  V  +P 
Sbjct: 408 YLDSFKGREPK-ILDALARYFVDEVR--DKSEVDLDVS----RWR------QEFVQDLPM 454

Query: 232 QKNEYDCGLFVLYFI 246
           Q+N +DCG+F++ +I
Sbjct: 455 QRNGFDCGMFMVKYI 469


>AT4G00690.1 | Symbols: ULP1B | UB-like protease 1B |
           chr4:281313-283129 FORWARD LENGTH=348
          Length = 348

 Score = 74.3 bits (181), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 54/193 (27%), Positives = 90/193 (46%), Gaps = 25/193 (12%)

Query: 56  VEICYKDTDCLAPEGYLSSTIMNFYIRYLQQQASLTNSSLSDYHFFNTYFYKKL--KEAV 113
           ++I  +   CL P  +L+  + N Y+  L+++ +         HFFNT+FY KL      
Sbjct: 139 IDISGETLQCLRPNQWLNDDVTNLYLELLKERQTRDPQKYFKCHFFNTFFYVKLVSGSGY 198

Query: 114 SCKQSDRETIFVKFRRWWKGVNIFQKAYVLIPIHEDLHWSLIIICIPDKGDESGPIILHL 173
           + K   R T   K      G ++     + +PIH D+HW+L +I   ++        ++L
Sbjct: 199 NYKAVSRWTTKRKL-----GYDLIDCDIIFVPIHIDIHWTLGVINNRERK------FVYL 247

Query: 174 DSLGLHSSQSVFDNIKSYLIEEKKYMDRDCVYSDVSIADRIWKCLSRRIEAQVITVPQQK 233
           DSL      ++ + +  YL++E K   +  +  DVS     W          V   PQQ+
Sbjct: 248 DSLFTGVGHTILNAMAKYLVDEVKQKSQKNI--DVS----SWGM------EYVEERPQQQ 295

Query: 234 NEYDCGLFVLYFI 246
           N YDCG+F+L +I
Sbjct: 296 NGYDCGMFMLKYI 308