Miyakogusa Predicted Gene

Lj0g3v0277809.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0277809.1 Non Chatacterized Hit- tr|I3S995|I3S995_LOTJA
Uncharacterized protein OS=Lotus japonicus PE=2
SV=1,96.49,0,OTU,Ovarian tumour, otubain; SUBFAMILY NOT NAMED,NULL;
OTU DOMAIN CONTAINING PROTEIN,NULL; Cysteine ,CUFF.18453.1
         (228 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT3G02070.1 | Symbols:  | Cysteine proteinases superfamily prote...   342   1e-94
AT3G22260.1 | Symbols:  | Cysteine proteinases superfamily prote...   256   1e-68
AT3G22260.3 | Symbols:  | Cysteine proteinases superfamily prote...   250   6e-67
AT3G22260.2 | Symbols:  | Cysteine proteinases superfamily prote...   250   6e-67
AT5G04250.2 | Symbols:  | Cysteine proteinases superfamily prote...   226   1e-59
AT5G04250.1 | Symbols:  | Cysteine proteinases superfamily prote...   226   1e-59
AT5G03330.2 | Symbols:  | Cysteine proteinases superfamily prote...   207   3e-54
AT5G03330.1 | Symbols:  | Cysteine proteinases superfamily prote...   207   3e-54
AT2G39320.1 | Symbols:  | Cysteine proteinases superfamily prote...    85   4e-17
AT5G67170.2 | Symbols:  | SEC-C motif-containing protein / OTU-l...    58   5e-09
AT5G67170.1 | Symbols:  | SEC-C motif-containing protein / OTU-l...    58   6e-09

>AT3G02070.1 | Symbols:  | Cysteine proteinases superfamily protein
           | chr3:361368-363132 FORWARD LENGTH=219
          Length = 219

 Score =  342 bits (877), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 153/205 (74%), Positives = 180/205 (87%)

Query: 24  DIEDDRMIALVLSEEYAKLDGGVGRRLSKLEPVAHVPRINSFIPTISDASMDHQRLLQRL 83
           D EDDRMIA +LSEEY+KLDG VGRRLS L PV HVPRIN +IP ++DA++DHQRLLQRL
Sbjct: 15  DTEDDRMIAFMLSEEYSKLDGAVGRRLSNLAPVPHVPRINCYIPNLNDATLDHQRLLQRL 74

Query: 84  NIYGLCEVRVSGDGNCQFRALSDQLYRSPEHHKHVRKEIVRQLKDHRSLYECYVPMXXXX 143
           N+YGLCE++VSGDGNCQFRALSDQLYRSPE+HK VR+E+V+QLK+ RS+YE YVPM    
Sbjct: 75  NVYGLCELKVSGDGNCQFRALSDQLYRSPEYHKQVRREVVKQLKECRSMYESYVPMKYKR 134

Query: 144 XXXXMAKLGEWGDHVTLQAASDKFAAKICLLTSFRDTCFIEIMPLYQAPQREIWLSFWSE 203
               M K GEWGDH+TLQAA+D+FAAKICLLTSFRDTCFIEI+P YQAP+  +WLSFWSE
Sbjct: 135 YYKKMGKFGEWGDHITLQAAADRFAAKICLLTSFRDTCFIEIIPQYQAPKGVLWLSFWSE 194

Query: 204 VHYNSLYEVRDAPIQHKPKKKHWLF 228
           VHYNSLY+++ AP+QHKPK+KHWLF
Sbjct: 195 VHYNSLYDIQAAPVQHKPKRKHWLF 219


>AT3G22260.1 | Symbols:  | Cysteine proteinases superfamily protein
           | chr3:7871489-7873393 FORWARD LENGTH=240
          Length = 240

 Score =  256 bits (653), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 120/206 (58%), Positives = 154/206 (74%), Gaps = 2/206 (0%)

Query: 24  DIEDDRMIALVLSE-EYAKLDGGVGRRLSKLEPVAHVPRINSFIPTISDASMDHQRLLQR 82
           D +DD+ IA +L+E E  + +G +G+RLS L+ + H PR+N  IP I+DA++DH+ L  R
Sbjct: 36  DTDDDQTIARILAEDESLRREGKLGKRLSHLDSIPHTPRVNREIPDINDATLDHELLSGR 95

Query: 83  LNIYGLCEVRVSGDGNCQFRALSDQLYRSPEHHKHVRKEIVRQLKDHRSLYECYVPMXXX 142
           L  YGL E+++ GDGNCQFRAL+DQL+R+ ++HKHVRK +V+QLK  R LYE YVPM   
Sbjct: 96  LATYGLAELQMEGDGNCQFRALADQLFRNADYHKHVRKHVVKQLKQQRKLYEEYVPMKYR 155

Query: 143 XXXXXMAKLGEWGDHVTLQAASDKFAAKICLLTSFRDTCFIEIMPLYQAPQREIWLSFWS 202
                M K GEWGDHVTLQAA+D+F AKICL+TSFRD  +IEI+P  + P RE WLSFWS
Sbjct: 156 HYTRKMKKHGEWGDHVTLQAAADRFEAKICLVTSFRDQSYIEILPHNKNPLREAWLSFWS 215

Query: 203 EVHYNSLYEVRDAPIQHKPKKKHWLF 228
           EVHYNSLY   D P + KP++KHWLF
Sbjct: 216 EVHYNSLYANGDVPTR-KPRRKHWLF 240


>AT3G22260.3 | Symbols:  | Cysteine proteinases superfamily protein
           | chr3:7871489-7873393 FORWARD LENGTH=245
          Length = 245

 Score =  250 bits (638), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 120/211 (56%), Positives = 155/211 (73%), Gaps = 7/211 (3%)

Query: 24  DIEDDRMIALVLSE-EYAKLDGGVGRRLSKLEPVAHVPRINSFIPTISDASMDHQRLLQR 82
           D +DD+ IA +L+E E  + +G +G+RLS L+ + H PR+N  IP I+DA++DH+ L  R
Sbjct: 36  DTDDDQTIARILAEDESLRREGKLGKRLSHLDSIPHTPRVNREIPDINDATLDHELLSGR 95

Query: 83  LNIYGLCEVRVSGDGNCQFRALSDQLYRSPEHHKHVRKEIVRQLKDHRSLYECYVPMXXX 142
           L  YGL E+++ GDGNCQFRAL+DQL+R+ ++HKHVRK +V+QLK  R LYE YVPM   
Sbjct: 96  LATYGLAELQMEGDGNCQFRALADQLFRNADYHKHVRKHVVKQLKQQRKLYEEYVPMKYR 155

Query: 143 XXXXXMAKLGEWGDHVTLQAASDKFAAKICLLTSFRDTCFIEIMPLYQAPQREIWLSFWS 202
                M K GEWGDHVTLQAA+D+F AKICL+TSFRD  +IEI+P  + P RE WLSFWS
Sbjct: 156 HYTRKMKKHGEWGDHVTLQAAADRFEAKICLVTSFRDQSYIEILPHNKNPLREAWLSFWS 215

Query: 203 EVHYNSLY-----EVRDAPIQHKPKKKHWLF 228
           EVHYNSLY      + D P + KP++KHWLF
Sbjct: 216 EVHYNSLYANGVLALPDVPTR-KPRRKHWLF 245


>AT3G22260.2 | Symbols:  | Cysteine proteinases superfamily protein
           | chr3:7871489-7873393 FORWARD LENGTH=245
          Length = 245

 Score =  250 bits (638), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 120/211 (56%), Positives = 155/211 (73%), Gaps = 7/211 (3%)

Query: 24  DIEDDRMIALVLSE-EYAKLDGGVGRRLSKLEPVAHVPRINSFIPTISDASMDHQRLLQR 82
           D +DD+ IA +L+E E  + +G +G+RLS L+ + H PR+N  IP I+DA++DH+ L  R
Sbjct: 36  DTDDDQTIARILAEDESLRREGKLGKRLSHLDSIPHTPRVNREIPDINDATLDHELLSGR 95

Query: 83  LNIYGLCEVRVSGDGNCQFRALSDQLYRSPEHHKHVRKEIVRQLKDHRSLYECYVPMXXX 142
           L  YGL E+++ GDGNCQFRAL+DQL+R+ ++HKHVRK +V+QLK  R LYE YVPM   
Sbjct: 96  LATYGLAELQMEGDGNCQFRALADQLFRNADYHKHVRKHVVKQLKQQRKLYEEYVPMKYR 155

Query: 143 XXXXXMAKLGEWGDHVTLQAASDKFAAKICLLTSFRDTCFIEIMPLYQAPQREIWLSFWS 202
                M K GEWGDHVTLQAA+D+F AKICL+TSFRD  +IEI+P  + P RE WLSFWS
Sbjct: 156 HYTRKMKKHGEWGDHVTLQAAADRFEAKICLVTSFRDQSYIEILPHNKNPLREAWLSFWS 215

Query: 203 EVHYNSLY-----EVRDAPIQHKPKKKHWLF 228
           EVHYNSLY      + D P + KP++KHWLF
Sbjct: 216 EVHYNSLYANGVLALPDVPTR-KPRRKHWLF 245


>AT5G04250.2 | Symbols:  | Cysteine proteinases superfamily protein
           | chr5:1176397-1178492 FORWARD LENGTH=345
          Length = 345

 Score =  226 bits (576), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 106/204 (51%), Positives = 146/204 (71%), Gaps = 4/204 (1%)

Query: 26  EDDRMIALVLSEEYAKLDGGVGRRLSKLEPVAHVPRINSFIPTISDASMDHQRLLQRLNI 85
           +DD + ++ + EE       VG+RL+++ P+AHVP+IN  +P+  +   DH+RL QRL +
Sbjct: 145 DDDSVCSVEIEEESWSE---VGKRLNQMIPIAHVPKINGELPSEDEQISDHERLFQRLQL 201

Query: 86  YGLCEVRVSGDGNCQFRALSDQLYRSPEHHKHVRKEIVRQLKDHRSLYECYVPMXXXXXX 145
           YGL E ++ GDGNCQFR+LSDQLYRSPEHH  VR+++V QL  +R +YE YVPM      
Sbjct: 202 YGLVENKIEGDGNCQFRSLSDQLYRSPEHHNFVREQVVNQLAYNREIYEGYVPMAYNDYL 261

Query: 146 XXMAKLGEWGDHVTLQAASDKFAAKICLLTSFRDTCFIEIMPLYQAPQREIWLSFWSEVH 205
             M + GEWGDHVTLQAA+D F  ++ ++TSF+DTC+IEI+P +Q   R I LSFW+EVH
Sbjct: 262 KAMKRNGEWGDHVTLQAAADLFGVRMFVITSFKDTCYIEILPHFQKSNRLICLSFWAEVH 321

Query: 206 YNSLYEVRDAPI-QHKPKKKHWLF 228
           YNS+Y   + PI + K KKK+W+F
Sbjct: 322 YNSIYPEGELPIPEGKKKKKYWVF 345


>AT5G04250.1 | Symbols:  | Cysteine proteinases superfamily protein
           | chr5:1176397-1178492 FORWARD LENGTH=345
          Length = 345

 Score =  226 bits (576), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 106/204 (51%), Positives = 146/204 (71%), Gaps = 4/204 (1%)

Query: 26  EDDRMIALVLSEEYAKLDGGVGRRLSKLEPVAHVPRINSFIPTISDASMDHQRLLQRLNI 85
           +DD + ++ + EE       VG+RL+++ P+AHVP+IN  +P+  +   DH+RL QRL +
Sbjct: 145 DDDSVCSVEIEEESWSE---VGKRLNQMIPIAHVPKINGELPSEDEQISDHERLFQRLQL 201

Query: 86  YGLCEVRVSGDGNCQFRALSDQLYRSPEHHKHVRKEIVRQLKDHRSLYECYVPMXXXXXX 145
           YGL E ++ GDGNCQFR+LSDQLYRSPEHH  VR+++V QL  +R +YE YVPM      
Sbjct: 202 YGLVENKIEGDGNCQFRSLSDQLYRSPEHHNFVREQVVNQLAYNREIYEGYVPMAYNDYL 261

Query: 146 XXMAKLGEWGDHVTLQAASDKFAAKICLLTSFRDTCFIEIMPLYQAPQREIWLSFWSEVH 205
             M + GEWGDHVTLQAA+D F  ++ ++TSF+DTC+IEI+P +Q   R I LSFW+EVH
Sbjct: 262 KAMKRNGEWGDHVTLQAAADLFGVRMFVITSFKDTCYIEILPHFQKSNRLICLSFWAEVH 321

Query: 206 YNSLYEVRDAPI-QHKPKKKHWLF 228
           YNS+Y   + PI + K KKK+W+F
Sbjct: 322 YNSIYPEGELPIPEGKKKKKYWVF 345


>AT5G03330.2 | Symbols:  | Cysteine proteinases superfamily protein
           | chr5:807728-809608 FORWARD LENGTH=356
          Length = 356

 Score =  207 bits (528), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 94/184 (51%), Positives = 129/184 (70%)

Query: 43  DGGVGRRLSKLEPVAHVPRINSFIPTISDASMDHQRLLQRLNIYGLCEVRVSGDGNCQFR 102
           DG  GRRL+++ P+ ++P+IN  IP   +A  DH+RL  RL ++   EV+V GDGNCQFR
Sbjct: 168 DGEFGRRLNQMVPIPYIPKINGEIPPEEEAVSDHERLRNRLEMFDFTEVKVPGDGNCQFR 227

Query: 103 ALSDQLYRSPEHHKHVRKEIVRQLKDHRSLYECYVPMXXXXXXXXMAKLGEWGDHVTLQA 162
           AL+DQLY++ + HKHVR++IV+QLK     Y+ YVPM        M++ GEWGDHVTLQA
Sbjct: 228 ALADQLYKTADRHKHVRRQIVKQLKSRPDSYQGYVPMDFSDYLRKMSRSGEWGDHVTLQA 287

Query: 163 ASDKFAAKICLLTSFRDTCFIEIMPLYQAPQREIWLSFWSEVHYNSLYEVRDAPIQHKPK 222
           A+D +  KI +LTSF+DTC+IEI+P  Q  +  I+LSFW+EVHYN++Y  RD       +
Sbjct: 288 AADAYRVKIVVLTSFKDTCYIEILPTSQESKGVIFLSFWAEVHYNAIYLNRDTSETELQR 347

Query: 223 KKHW 226
           K+ W
Sbjct: 348 KRKW 351


>AT5G03330.1 | Symbols:  | Cysteine proteinases superfamily protein
           | chr5:807728-809608 FORWARD LENGTH=356
          Length = 356

 Score =  207 bits (528), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 94/184 (51%), Positives = 129/184 (70%)

Query: 43  DGGVGRRLSKLEPVAHVPRINSFIPTISDASMDHQRLLQRLNIYGLCEVRVSGDGNCQFR 102
           DG  GRRL+++ P+ ++P+IN  IP   +A  DH+RL  RL ++   EV+V GDGNCQFR
Sbjct: 168 DGEFGRRLNQMVPIPYIPKINGEIPPEEEAVSDHERLRNRLEMFDFTEVKVPGDGNCQFR 227

Query: 103 ALSDQLYRSPEHHKHVRKEIVRQLKDHRSLYECYVPMXXXXXXXXMAKLGEWGDHVTLQA 162
           AL+DQLY++ + HKHVR++IV+QLK     Y+ YVPM        M++ GEWGDHVTLQA
Sbjct: 228 ALADQLYKTADRHKHVRRQIVKQLKSRPDSYQGYVPMDFSDYLRKMSRSGEWGDHVTLQA 287

Query: 163 ASDKFAAKICLLTSFRDTCFIEIMPLYQAPQREIWLSFWSEVHYNSLYEVRDAPIQHKPK 222
           A+D +  KI +LTSF+DTC+IEI+P  Q  +  I+LSFW+EVHYN++Y  RD       +
Sbjct: 288 AADAYRVKIVVLTSFKDTCYIEILPTSQESKGVIFLSFWAEVHYNAIYLNRDTSETELQR 347

Query: 223 KKHW 226
           K+ W
Sbjct: 348 KRKW 351


>AT2G39320.1 | Symbols:  | Cysteine proteinases superfamily protein
           | chr2:16417592-16418517 REVERSE LENGTH=189
          Length = 189

 Score = 84.7 bits (208), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 44/120 (36%), Positives = 70/120 (58%), Gaps = 20/120 (16%)

Query: 93  VSGDGNCQFRALSDQLYRSPEHHKHVRKEIVRQLKDHRSLYECYVPMXXXXXXXXMAKLG 152
           +  DGNCQFRAL+DQLY++ + H+ VR+EIV+Q                      ++   
Sbjct: 2   MKSDGNCQFRALADQLYQNSDCHELVRQEIVKQ-------------------NMSLSTNS 42

Query: 153 EWGDHVTLQAASDKFAAKICLLTSFRDTCFIEIMPLYQA-PQREIWLSFWSEVHYNSLYE 211
           +WGD VTL+ A+D +  KI L+TS +   F+E +P  Q  P + I +S+ + +H+NS+Y+
Sbjct: 43  QWGDEVTLRVAADVYQVKIILITSIKLIPFMEFLPKSQKEPDKVIHMSYLAGIHFNSIYK 102


>AT5G67170.2 | Symbols:  | SEC-C motif-containing protein / OTU-like
           cysteine protease family protein |
           chr5:26799851-26801763 FORWARD LENGTH=374
          Length = 374

 Score = 57.8 bits (138), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 44/152 (28%), Positives = 69/152 (45%), Gaps = 22/152 (14%)

Query: 75  DHQRLLQRLNIYGLCEVRVSGDGNCQFRALSDQLYRSPEHHKHVRKEIVRQLKDHRSLYE 134
           D  +   +L+  GL  ++V+ DGNC FRA++DQL  + + H   R  IV  +  +R ++E
Sbjct: 23  DLSQFRAQLDALGLKIIQVTADGNCFFRAIADQLEGNEDEHNKYRNMIVLYIVKNREMFE 82

Query: 135 CYVP--MXXXXXXXXMAKLGEWGDHVTLQAASDKFAAKICL---------LTSFRDTCFI 183
            ++   +        M   G W  ++ LQAAS    + IC+         + +F DT   
Sbjct: 83  PFIEDDVPFEDYCKTMDDDGTWAGNMELQAASLVTRSNICIHRNMSPRWYIRNFEDT--- 139

Query: 184 EIMPLYQAPQREIWLSFWSEVHYNSLYEVRDA 215
                     R I LS+    HYNS+    DA
Sbjct: 140 --------RTRMIHLSYHDGEHYNSVRSKEDA 163


>AT5G67170.1 | Symbols:  | SEC-C motif-containing protein / OTU-like
           cysteine protease family protein |
           chr5:26799851-26801763 FORWARD LENGTH=375
          Length = 375

 Score = 57.8 bits (138), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 44/152 (28%), Positives = 69/152 (45%), Gaps = 22/152 (14%)

Query: 75  DHQRLLQRLNIYGLCEVRVSGDGNCQFRALSDQLYRSPEHHKHVRKEIVRQLKDHRSLYE 134
           D  +   +L+  GL  ++V+ DGNC FRA++DQL  + + H   R  IV  +  +R ++E
Sbjct: 24  DLSQFRAQLDALGLKIIQVTADGNCFFRAIADQLEGNEDEHNKYRNMIVLYIVKNREMFE 83

Query: 135 CYVP--MXXXXXXXXMAKLGEWGDHVTLQAASDKFAAKICL---------LTSFRDTCFI 183
            ++   +        M   G W  ++ LQAAS    + IC+         + +F DT   
Sbjct: 84  PFIEDDVPFEDYCKTMDDDGTWAGNMELQAASLVTRSNICIHRNMSPRWYIRNFEDT--- 140

Query: 184 EIMPLYQAPQREIWLSFWSEVHYNSLYEVRDA 215
                     R I LS+    HYNS+    DA
Sbjct: 141 --------RTRMIHLSYHDGEHYNSVRSKEDA 164