Jatropha Genome Database

Jcr4S05943.60
Show Alignment: 
BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Jcr4S05943.60
         (294 letters)

Database: Medicago_aa3.5 
           47,529 sequences; 14,043,872 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

IMGA|Medtr6g083830.1 Unknown Protein (AHRD V1); contains Interpr...   383   e-107
IMGA|Medtr5g055480.1 Unknown Protein (AHRD V1); contains Interpr...   383   e-107
IMGA|Medtr5g006920.1 Unknown Protein (AHRD V1); contains Interpr...   358   2e-99
IMGA|Medtr3g104840.1 Unknown Protein (AHRD V1); contains Interpr...   328   2e-90
IMGA|Medtr3g108740.1 cDNA clone J023006H21 full insert sequence ...   214   4e-56
IMGA|Medtr4g127550.1 cDNA clone J023006H21 full insert sequence ...   208   3e-54
IMGA|Medtr2g037600.1 Unknown Protein (AHRD V1); contains Interpr...   194   4e-50
IMGA|Medtr2g037620.1 Unknown Protein (AHRD V1); contains Interpr...   187   5e-48
IMGA|Medtr8g094180.1 BC10 protein (AHRD V1 ***- Q65XS5_ORYSJ); c...   171   5e-43
IMGA|Medtr1g041250.1 Unknown Protein (AHRD V1); contains Interpr...   144   5e-35
IMGA|Medtr1g031060.1 Unknown Protein (AHRD V1); contains Interpr...   144   5e-35
IMGA|Medtr3g107210.1 Unknown Protein (AHRD V1); contains Interpr...   138   3e-33
IMGA|Medtr3g106930.1 Unknown Protein (AHRD V1); contains Interpr...   138   3e-33
IMGA|Medtr8g094310.1 BC10 protein (AHRD V1 ***- Q65XS5_ORYSJ); c...   133   1e-31
IMGA|Medtr2g087610.1 AT5G22070 protein (Fragment) (AHRD V1 ***- ...   124   6e-29
IMGA|Medtr1g031080.1 Unknown Protein (AHRD V1); contains Interpr...    95   4e-20
IMGA|Medtr1g041270.1 Unknown Protein (AHRD V1); contains Interpr...    94   1e-19

>IMGA|Medtr6g083830.1 Unknown Protein (AHRD V1); contains Interpro
           domain(s)  IPR011527  ABC transporter, transmembrane
           region, type 1 chr06_pseudomolecule_IMGAG_V3.5
           19374577-19371067 E EGN_Mt100125 20100825
          Length = 418

 Score =  383 bits (983), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 182/295 (61%), Positives = 225/295 (76%), Gaps = 1/295 (0%)

Query: 1   MADKELLWRASMVARVSEFPFKLTPKVAFLFLTKGSLPLAPLWELFFKGHEGLYSIYVHY 60
           M ++EL WRASM+  + + PFK  PKVAF+FLTKG + LAPLWE FFKG+EGLYSIY+H 
Sbjct: 123 MNEEELFWRASMIPMIHKPPFKQIPKVAFMFLTKGHVLLAPLWEKFFKGNEGLYSIYIHP 182

Query: 61  SPSFNGTV-PVDSVFYGRRIPSKETRWGEFTIVXXXXXXXXXXXXDFSNQYFILLSESCI 119
           +PSFN TV    SVF+GRRIPSKE +WGE +++            DFSNQ F+LLSESCI
Sbjct: 183 NPSFNETVYDQSSVFHGRRIPSKEVKWGENSMIEAERRLLANALLDFSNQRFVLLSESCI 242

Query: 120 PLFNFSTIYNYLMGSEKSFVQSFDLRGPDGRYRYNPRMSPTIMLDQWRKGSQWFQMKRNI 179
           PLFNFSTIY YLM SEK+FV+++DL G  GR RYN +MSP I L QWRKGSQWFQ+ R++
Sbjct: 243 PLFNFSTIYTYLMNSEKTFVEAYDLEGAVGRGRYNYKMSPLIKLSQWRKGSQWFQIDRSL 302

Query: 180 AIEVVSDRKYFPVFQRFCKGYCYADEHYLPTFVGIKFLKKTSNRTLTWIDWPKRGPHPSH 239
           A+ +VSD+ YF +F+ +C   CY+DEHY+PT V IKF K+ SNRTLTW+DW K GPHPS 
Sbjct: 303 ALHIVSDKLYFSMFKNYCDPPCYSDEHYMPTMVSIKFWKRNSNRTLTWVDWSKGGPHPSK 362

Query: 240 YGRMDVTVNFLESLRNKGPCDYNGKENSICFLFARKFVPNALPRLLRFAPKLMKF 294
           + R  +T++FLE LR    C+YNGK  ++C LFARKF P+AL RLLRFAPKLM+F
Sbjct: 363 FFRQHLTIDFLERLRFGSTCEYNGKTINVCHLFARKFTPHALDRLLRFAPKLMQF 417


>IMGA|Medtr5g055480.1 Unknown Protein (AHRD V1); contains Interpro
           domain(s)  IPR011009  Protein kinase-like
           chr05_pseudomolecule_IMGAG_V3.5 22301709-22299117 E
           EGN_Mt100125 20100825
          Length = 390

 Score =  383 bits (983), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 174/294 (59%), Positives = 222/294 (75%), Gaps = 1/294 (0%)

Query: 1   MADKELLWRASMVARVSEFPFKLTPKVAFLFLTKGSLPLAPLWELFFKGHEGLYSIYVHY 60
           ++D+ELLWRAS+  +++E+PF   PKVAFLFL +G +PLAPLWE FFKGH+G YSIYVH 
Sbjct: 98  LSDEELLWRASLSPKINEYPFDRVPKVAFLFLVRGPVPLAPLWEKFFKGHKGYYSIYVHS 157

Query: 61  SPSFNGTVPVDSVFYGRRIPSKETRWGEFTIVXXXXXXXXXXXXDFSNQYFILLSESCIP 120
           +PS+NG+     VF+GRRIPSK+  WG+F ++            DFSNQ F+L+SESCIP
Sbjct: 158 NPSYNGSEVESPVFHGRRIPSKKVEWGKFNMIEAERRLLANALLDFSNQRFVLISESCIP 217

Query: 121 LFNFSTIYNYLMGSEKSFVQSFDLRGPDGRYRYNPRMSPTIMLDQWRKGSQWFQMKRNIA 180
           LFNFST+Y+YLM S KS+V ++D     GR RY  +MSPTI L +WRKGSQWF+M RN+A
Sbjct: 218 LFNFSTVYSYLMNSTKSYVMAYDQASSVGRGRYRIKMSPTIKLREWRKGSQWFEMDRNLA 277

Query: 181 IEVVSDRKYFPVFQRFCKGYCYADEHYLPTFVGIKFLKKTSNRTLTWIDWPKRGPHPSHY 240
           +EV+SDR Y+PVF ++C G CYADEHYLPT V IKF K  +NR+LTW+DW K GPHP  Y
Sbjct: 278 LEVISDRTYYPVFGKYCNGSCYADEHYLPTLVSIKFWKSNTNRSLTWVDWSKGGPHPVKY 337

Query: 241 GRMDVTVNFLESLRNKGPCDYNGKENSICFLFARKFVPNALPRLLRFAPKLMKF 294
            R +VT  FLE+LRN+  C YNG   ++C+LFARKF+P +L RL+RFAPK+M  
Sbjct: 338 VRPEVTCEFLENLRNQ-TCKYNGNSTNVCYLFARKFLPTSLTRLMRFAPKVMHL 390


>IMGA|Medtr5g006920.1 Unknown Protein (AHRD V1); contains Interpro
           domain(s)  IPR011009  Protein kinase-like
           chr05_pseudomolecule_IMGAG_V3.5 898735-900842 H
           EGN_Mt100125 20100825
          Length = 422

 Score =  358 bits (920), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 173/295 (58%), Positives = 214/295 (72%), Gaps = 1/295 (0%)

Query: 1   MADKELLWRASMVARVSEFPFKLTPKVAFLFLTKGSLPLAPLWELFFKGHEGLYSIYVHY 60
           M + EL WRAS+   + + PFK TPKVAF+FLTKG + LAPLWE FFKG+EGLYSIYVH 
Sbjct: 127 MNEDELFWRASLAPMIHKTPFKQTPKVAFMFLTKGPVLLAPLWEKFFKGNEGLYSIYVHP 186

Query: 61  SPSFNGTVPVDS-VFYGRRIPSKETRWGEFTIVXXXXXXXXXXXXDFSNQYFILLSESCI 119
           SPSFN TV   S VF+GRRIPSK+ +WGE +++            DFSNQ F+LLSE CI
Sbjct: 187 SPSFNETVYNQSLVFHGRRIPSKKVKWGENSMIEAERRLLANALLDFSNQRFVLLSEHCI 246

Query: 120 PLFNFSTIYNYLMGSEKSFVQSFDLRGPDGRYRYNPRMSPTIMLDQWRKGSQWFQMKRNI 179
           PLFNF TIY YLM S+++FV++ D+ G  GR RYN RM P I L QWRKG+QWFQ+ R +
Sbjct: 247 PLFNFFTIYTYLMKSKQTFVEANDIPGRVGRVRYNRRMCPLIQLSQWRKGAQWFQIDRYL 306

Query: 180 AIEVVSDRKYFPVFQRFCKGYCYADEHYLPTFVGIKFLKKTSNRTLTWIDWPKRGPHPSH 239
           A+ +VSD+ YF +F+++C   C +DEHYLPT V IKF K+ SNRTLTW+DW K G HP+ 
Sbjct: 307 AVRIVSDKPYFSMFKKYCHPRCISDEHYLPTLVSIKFWKRNSNRTLTWVDWSKGGAHPAK 366

Query: 240 YGRMDVTVNFLESLRNKGPCDYNGKENSICFLFARKFVPNALPRLLRFAPKLMKF 294
           +   DVT++FLE LR    C+YNGK  ++C LFARKF   AL  LL FAPKLM+F
Sbjct: 367 FSSKDVTIDFLERLRFGSTCEYNGKTTNVCHLFARKFGTQALDGLLTFAPKLMQF 421


>IMGA|Medtr3g104840.1 Unknown Protein (AHRD V1); contains Interpro
           domain(s)  IPR003035  Plant regulator RWP-RK
           chr03_pseudomolecule_IMGAG_V3.5 36857274-36855481 E
           EGN_Mt100125 20100825
          Length = 394

 Score =  328 bits (841), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 159/295 (53%), Positives = 206/295 (69%), Gaps = 19/295 (6%)

Query: 1   MADKELLWRASMVARVSEFPFKLTPKVAFLFLTKGSLPLAPLWELFFKGHEGLYSIYVHY 60
           M D EL  R S+++ + E PF  TPK+AF+FLTKG + LAP WE FFKG+EG+YSIY+H 
Sbjct: 117 MNDDELFRRTSLISMIHEPPFNQTPKIAFMFLTKGPVLLAPFWEKFFKGNEGMYSIYIHP 176

Query: 61  SPSFNGTVPVD-SVFYGRRIPSKETRWGEFTIVXXXXXXXXXXXXDFSNQYFILLSESCI 119
           SPSFN TV  + SVF+GRRIPSKE +WGE +++            DFSNQ F+LLSESCI
Sbjct: 177 SPSFNQTVYNERSVFHGRRIPSKEVKWGETSMIEAERRLLANALLDFSNQRFVLLSESCI 236

Query: 120 PLFNFSTIYNYLMGSEKSFVQSFDLRGPDGRYRYNPRMSPTIMLDQWRKGSQWFQMKRNI 179
           PLFNFSTIY YLM S ++FV++ +++                   QW+KGSQWFQ+ R +
Sbjct: 237 PLFNFSTIYTYLMNSNETFVEANEIKN-----------------SQWKKGSQWFQIDRYL 279

Query: 180 AIEVVSDRKYFPVFQRFCKGYCYADEHYLPTFVGIKFLKKTSNRTLTWIDWPKRGPHPSH 239
            + +VSD+ YF +F+++C   CY+DEHYLPTF+  +F K+ SNRTLTW+DW K GPHPS 
Sbjct: 280 GLHIVSDKTYFSMFKKYCNTPCYSDEHYLPTFISNEFGKRNSNRTLTWVDWSKGGPHPSS 339

Query: 240 YGRMDVTVNFLESLRNKGPCDYNGKENSICFLFARKFVPNALPRLLRFAPKLMKF 294
           +   DVT  FLE LR    C++NG+  SIC LFARKF P+AL  L+R+APKLM+F
Sbjct: 340 FTGKDVTTEFLERLRFGSTCEHNGR-TSICHLFARKFTPHALDILVRYAPKLMQF 393


>IMGA|Medtr3g108740.1 cDNA clone J023006H21 full insert sequence
           (AHRD V1 ***- B7EKK3_ORYSJ); contains Interpro domain(s)
            IPR013865  Protein of unknown function DUF1754,
           eukaryotic chr03_pseudomolecule_IMGAG_V3.5
           38917211-38922166 E EGN_Mt100125 20100825
          Length = 383

 Score =  214 bits (545), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 120/308 (38%), Positives = 170/308 (55%), Gaps = 26/308 (8%)

Query: 1   MADKELLWRASMVARVSEFPFKL-TPKVAFLFLTKGSLPLAPLWELFFKGHEGLYSIYVH 59
           + DKE+  R  +   ++  P +  TPKVAFLF+T G+LP   LW LFF+GH+G +SIYVH
Sbjct: 67  LTDKEIESRVVVKDLLNYVPIQTNTPKVAFLFMTPGTLPFEKLWHLFFQGHDGRFSIYVH 126

Query: 60  YSPSFNGTVPVDSVFYGRRIPSKETRWGEFTIVXXXXXXXXXXXXDFSNQYFILLSESCI 119
              S    V     F GR I S+   WG F ++            D  NQ+F+LLSESCI
Sbjct: 127 --ASREKPVHFSRYFVGREIHSEPVSWGSFAMMEAERRLLANALLDPDNQHFVLLSESCI 184

Query: 120 PLFNFSTIYNYLMGSEKSFVQSFDLRGPDGRYRYNPRMSPTIMLDQWRKGSQWFQMKRNI 179
           P+ +F  +YNYL+ +  SF++ F   GP G  RY   M P + +  +RKGSQWF MKR  
Sbjct: 185 PIRHFEFVYNYLVFTNVSFIECFVDPGPHGNGRYIEHMLPEVEMKDFRKGSQWFSMKRQH 244

Query: 180 AIEVVSDRKYFPVFQRFCK------GYCYADEHYLPTFVGIKFLKKTSNRTLTWIDWPKR 233
           A+ V++D  YF  F+ +C+        CY+DEHYLPT+  +      SNR++T++DW + 
Sbjct: 245 AVIVIADNLYFTKFKYYCRPNMEGGRNCYSDEHYLPTYFNMLDPGGISNRSVTYVDWSEG 304

Query: 234 GPHPSHYGRMDVTVNFLESLRNKG----------------PCDYNGKENSICFLFARKFV 277
             HP  +G   +T   L++L +                  PC +NG +   C+LFARKF 
Sbjct: 305 KWHPRSFGAQHITYKLLKTLTSLNQSPHITSDSKRTVLITPCMWNGSKRP-CYLFARKFY 363

Query: 278 PNALPRLL 285
           P AL +L+
Sbjct: 364 PEALDKLM 371


>IMGA|Medtr4g127550.1 cDNA clone J023006H21 full insert sequence
           (AHRD V1 ***- B7EKK3_ORYSJ); contains Interpro domain(s)
            IPR000648  Oxysterol-binding protein
           chr04_pseudomolecule_IMGAG_V3.5 44353605-44360572 E
           EGN_Mt100125 20100825
          Length = 396

 Score =  208 bits (529), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 115/283 (40%), Positives = 159/283 (56%), Gaps = 24/283 (8%)

Query: 26  KVAFLFLTKGSLPLAPLWELFFKGHEGLYSIYVHYSPSFNGTVPVDSVFYGRRIPSKETR 85
           K+AF+FL+ GSLPL  LW+ FF+GHEG +S+YVH S S    V V   F  R I S +  
Sbjct: 108 KIAFMFLSPGSLPLEKLWDNFFQGHEGKFSVYVHASKS--KPVHVSRYFVNRDIRSGQVV 165

Query: 86  WGEFTIVXXXXXXXXXXXXDFSNQYFILLSESCIPLFNFSTIYNYLMGSEKSFVQSFDLR 145
           WG+ ++V            D  NQ+F+LLS+SC+PL++F  IYNYLM +  S+V  F   
Sbjct: 166 WGKISMVDAERRILATALQDPDNQHFVLLSDSCVPLYHFDYIYNYLMHTNISYVDCFKDP 225

Query: 146 GPDGRYRYNPRMSPTIMLDQWRKGSQWFQMKRNIAIEVVSDRKYFPVFQRFCK-----GY 200
           GP G  RY+ RM P + +  +RKG+QWF MKR  A+ V++D  Y+  F+ +C+       
Sbjct: 226 GPHGNGRYSDRMLPEVEVKDFRKGAQWFSMKRQHAVIVMADYLYYSKFRAYCQPGLEGKN 285

Query: 201 CYADEHYLPTFVGIKFLKKTSNRTLTWIDWPKRGPHPSHYGRMDVTVNFLESLRNKG--- 257
           C ADEHYLPTF  I      +N ++T +DW +R  HP  Y   DVT   L+++ +     
Sbjct: 286 CIADEHYLPTFFQIVDPGGIANWSVTHVDWSERKWHPKSYRDHDVTYELLKNITSVDVSV 345

Query: 258 -------------PCDYNGKENSICFLFARKFVPNALPRLLRF 287
                        PC +NG +   C+LFARKF P  L +LL  
Sbjct: 346 HVTSDEKKEVQSWPCLWNGIQKP-CYLFARKFTPETLDKLLHL 387


>IMGA|Medtr2g037600.1 Unknown Protein (AHRD V1); contains Interpro
           domain(s)  IPR002885  Pentatricopeptide repeat
           chr02_pseudomolecule_IMGAG_V3.5 13702955-13697294 E
           EGN_Mt100125 20100825
          Length = 393

 Score =  194 bits (494), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 112/286 (39%), Positives = 157/286 (54%), Gaps = 25/286 (8%)

Query: 24  TPKVAFLFLTKGSLPLAPLWELFFKGHEGLYSIYVHYSPSFNGTVPVDSVFYGRRIPSKE 83
            PKVAF+FLT GSLP   LW+ FF+GHEG +S+YVH S +    V V   F  R I S +
Sbjct: 104 NPKVAFMFLTPGSLPFEKLWDNFFQGHEGKFSVYVHASQT--KPVHVSRYFVNRDIRSDQ 161

Query: 84  TRWGEFTIVXXXXXXXXXXXXDFSNQYFILLSESCIPLFNFSTIYNYLMGSEKSFVQSFD 143
             WG+ ++V            D +NQ+F+LLS+SC+PL+NF  I++YLM +  SFV  F 
Sbjct: 162 VIWGKMSMVEAERRLLANALQDPNNQHFVLLSDSCVPLYNFDYIFDYLMYTNISFVDCFW 221

Query: 144 LRGPDGRY-RYNPRMSPTIMLDQWRKGSQWFQMKRNIAIEVVSDRKYFPVFQRFCK---- 198
             GP G   RY+  M P + L  +RKG+QWF +KR  A+ V++D  Y+  FQ  C+    
Sbjct: 222 DPGPVGNSGRYSEHMLPEVELKDFRKGAQWFSLKRKHALIVMADHVYYSKFQAHCEPGVD 281

Query: 199 -GYCYADEHYLPTFVGIKFLKKTSNRTLTWIDWPKRGPHPSHYGRMDVTVNFLESLRNKG 257
              C  DEHYLPTF  I      +N ++T +DW ++  HP  Y   D+T   L+++ +  
Sbjct: 282 GKNCIPDEHYLPTFFTIVDPGGIANWSVTHVDWSEQKWHPKSYRAQDITYELLKNITSID 341

Query: 258 ----------------PCDYNGKENSICFLFARKFVPNALPRLLRF 287
                           PC +NG +   C+LFARKF P+    LL+ 
Sbjct: 342 ESVHVTSDEKKEVQIWPCLWNGIQKP-CYLFARKFSPDTEDNLLKL 386


>IMGA|Medtr2g037620.1 Unknown Protein (AHRD V1); contains Interpro
           domain(s)  IPR002885  Pentatricopeptide repeat
           chr02_pseudomolecule_IMGAG_V3.5 13710096-13707762 H
           EGN_Mt100125 20100825
          Length = 400

 Score =  187 bits (476), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 109/301 (36%), Positives = 159/301 (52%), Gaps = 40/301 (13%)

Query: 24  TPKVAFLFLTKGSLPLAPLWELFFKGHEGLYSIYVHYSPSFNGTVPVDSVFYGRRIPSKE 83
            PK+AF+FLT GSLP   LW+ FF+GHEG +S+YVH S +    V V   F  R I S +
Sbjct: 96  NPKIAFMFLTPGSLPFEKLWDNFFQGHEGKFSVYVHASKA--KPVHVSRYFVNRDIRSDQ 153

Query: 84  TRWGEFTIVXXXXXXXXXXXXDFSNQYFILLSESCIPLFNFSTIYNYLMGSEKSFVQSFD 143
             WG+ +IV            D +NQ+F+LLS+SC+PL+NF+ I++YLM ++KSFV SF 
Sbjct: 154 LVWGKMSIVEAERRLLANALQDPNNQHFVLLSDSCVPLYNFNYIFDYLMYTDKSFVDSFR 213

Query: 144 LRGPDGRYRYNPRMSPTIMLDQWRKGSQ----------------WFQMKRNIAIEVVSDR 187
             GP G  RY+  M P + +  +R G+Q                WF +KR  A++V++D 
Sbjct: 214 DPGPVGNGRYSEHMLPEVEIKDFRTGAQGLTEVRAGLKVHRMLIWFSLKRQHAVKVMADH 273

Query: 188 KYFPVFQRFCKGY-----CYADEHYLPTFVGIKFLKKTSNRTLTWIDWPKRGPHPSHYGR 242
            Y+  FQ  C+       C  DEHYLPTF  I      +  ++T++D  ++  HP  Y  
Sbjct: 274 LYYSKFQAQCESCVDGKNCILDEHYLPTFFTIVDPNGIAKWSVTYVDRSEQKRHPKSYRT 333

Query: 243 MDVTVNFLESLRN----------------KGPCDYNGKENSICFLFARKFVPNALPRLLR 286
            D+T   L+++++                +  C +NG     C+LFARKF P     LL+
Sbjct: 334 QDITYELLKNIKSIDESVHVTSDEKKEVQRWTCFWNGFRKP-CYLFARKFSPETEESLLK 392

Query: 287 F 287
            
Sbjct: 393 L 393


>IMGA|Medtr8g094180.1 BC10 protein (AHRD V1 ***- Q65XS5_ORYSJ);
           contains Interpro domain(s)  IPR007246  Gaa1-like, GPI
           transamidase component chr08_pseudomolecule_IMGAG_V3.5
           27125463-27131216 E EGN_Mt100125 20100825
          Length = 363

 Score =  171 bits (433), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 113/308 (36%), Positives = 153/308 (49%), Gaps = 49/308 (15%)

Query: 25  PKVAFLFLTKGSLPLAPLWELFFKGHEGLYSIYVHYSPSF--NGTVPVDSVFYGRRI-PS 81
           PK+AFLF+ +  LPL  +W+ FF+G +  +SI+VH  P F  N      S F  R++  S
Sbjct: 55  PKIAFLFIARNRLPLELVWDAFFRGGDNNFSIFVHPRPGFVLNEATTRSSYFLNRQVNDS 114

Query: 82  KETRWGEFTIVXXXXXXXXXXXXDFSNQYFILLSESCIPLFNFSTIYNYLMGSEKSFVQS 141
            +  WGE +++            D  N  F+ LS+SCIPL+NFS  Y+Y+M +  SFV S
Sbjct: 115 IQIDWGEASMIEAERILLRHALDDPLNDRFVFLSDSCIPLYNFSYTYDYIMSTPTSFVDS 174

Query: 142 F-DLRGPDGRYRYNPRMSPTIMLDQWRKGSQWFQMKRNIAIEVVSDRKYFPVFQRFCKGY 200
           F D +G     RYNP+M P I +  WRKGSQW  + R  A  VV D   FP+FQ+FCK  
Sbjct: 175 FADTKGG----RYNPKMDPVIPVYNWRKGSQWAVLTRKHAKVVVEDDTVFPMFQKFCKKK 230

Query: 201 --------------------CYADEHYLPTFVGIKFLKKT-SNRTLTWIDW--------P 231
                               C  DEHY+ T +  K L+K  + R++T   W         
Sbjct: 231 PLPEFWRDQVIPADTSKIHNCIPDEHYVQTLLAQKDLEKELTRRSVTHTAWDISNSRDRE 290

Query: 232 KRGPHPSHYGRMDVT---VNFLESLRN--------KGPCDYNGKENSICFLFARKFVPNA 280
           +RG HP  Y   D T   + F++ + N        +  C   GK  S CFLFARKF   A
Sbjct: 291 RRGWHPVTYKFSDATPMLIKFIKEIDNIYYETEYRREWCTSKGKP-STCFLFARKFTRTA 349

Query: 281 LPRLLRFA 288
             RLL  +
Sbjct: 350 ALRLLNMS 357


>IMGA|Medtr1g041250.1 Unknown Protein (AHRD V1); contains Interpro
           domain(s)  IPR008974  TRAF-like
           chr01_pseudomolecule_IMGAG_V3.5 11448697-11449785 H
           EGN_Mt100125 20100825
          Length = 362

 Score =  144 bits (364), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 109/292 (37%), Positives = 144/292 (49%), Gaps = 30/292 (10%)

Query: 22  KLTPKVAFLFLTKGSLPLAPLWELFFKGHEGLYSIYVHYSPSFNGTVPVDSVFYGRRIPS 81
           K  PK+AFLFLT  +L  APLWE FF G+  L++IYVH  P+     P   VF  R IPS
Sbjct: 77  KPKPKIAFLFLTNSNLTFAPLWEKFFVGNNHLFNIYVHADPTTYVASP-GGVFQNRFIPS 135

Query: 82  KET-RWGEFTIVXXXXXXXXXXXXDFSNQYFILLSESCIPLFNFSTIYNYLMGSE-KSFV 139
           K T R+    I             D  NQYF L+S+ CIPLF+F  IYNYL  ++ KSF 
Sbjct: 136 KPTKRYSPSLIAAARRLLASALLDDPLNQYFALISQRCIPLFSFQFIYNYLFKNQLKSFA 195

Query: 140 QS--FDLRGP----------DGRYRYNPR----MSPTIMLDQWRKGSQWFQMKRNIAIEV 183
            S  F+L  P          +   RYN R    M P +  + +R GSQ+F + R     V
Sbjct: 196 NSSEFNLLYPSYIEILSEAENLNIRYNARGENVMMPEVPFEDFRVGSQFFILNRKHTKVV 255

Query: 184 VSDRKYFPVFQRFC--KGYCYADEHYLPTFVGIKFLKKTSNRTLTWIDWPKRG-PHPSHY 240
           + D+K +  FQ  C  K YCY +EHY  T + ++ LK  +  TLT ++W      HP  Y
Sbjct: 256 LRDQKLWNKFQIPCTNKYYCYPEEHYFSTLLSMEDLKGCTGFTLTRVNWTGAVYGHPHLY 315

Query: 241 GRMDVTVNFLESLRNKGPCDYNGKENSICFLFARKFVPNALPRLLRFAPKLM 292
              +V+      LR            S  +LFARKF P  L  L+  A  ++
Sbjct: 316 TPAEVSPELFRQLR--------VSNWSYSYLFARKFSPECLAPLMNIADDVI 359


>IMGA|Medtr1g031060.1 Unknown Protein (AHRD V1); contains Interpro
           domain(s)  IPR013626  Pheophorbide a oxygenase
           chr01_pseudomolecule_IMGAG_V3.5 9080809-9081897 H
           EGN_Mt100125 20100825
          Length = 362

 Score =  144 bits (364), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 109/292 (37%), Positives = 144/292 (49%), Gaps = 30/292 (10%)

Query: 22  KLTPKVAFLFLTKGSLPLAPLWELFFKGHEGLYSIYVHYSPSFNGTVPVDSVFYGRRIPS 81
           K  PK+AFLFLT  +L  APLWE FF G+  L++IYVH  P+     P   VF  R IPS
Sbjct: 77  KPKPKIAFLFLTNSNLTFAPLWEKFFVGNNHLFNIYVHADPTTYVASP-GGVFQNRFIPS 135

Query: 82  KET-RWGEFTIVXXXXXXXXXXXXDFSNQYFILLSESCIPLFNFSTIYNYLMGSE-KSFV 139
           K T R+    I             D  NQYF L+S+ CIPLF+F  IYNYL  ++ KSF 
Sbjct: 136 KPTKRYSPSLIAAARRLLASALLDDPLNQYFALISQRCIPLFSFQFIYNYLFKNQLKSFA 195

Query: 140 QS--FDLRGP----------DGRYRYNPR----MSPTIMLDQWRKGSQWFQMKRNIAIEV 183
            S  F+L  P          +   RYN R    M P +  + +R GSQ+F + R     V
Sbjct: 196 NSSEFNLLYPSYIEILSEAENLNIRYNARGENVMMPEVPFEDFRVGSQFFILNRKHTKVV 255

Query: 184 VSDRKYFPVFQRFC--KGYCYADEHYLPTFVGIKFLKKTSNRTLTWIDWPKRG-PHPSHY 240
           + D+K +  FQ  C  K YCY +EHY  T + ++ LK  +  TLT ++W      HP  Y
Sbjct: 256 LRDQKLWNKFQIPCTNKYYCYPEEHYFSTLLSMEDLKGCTGFTLTRVNWTGAVYGHPHLY 315

Query: 241 GRMDVTVNFLESLRNKGPCDYNGKENSICFLFARKFVPNALPRLLRFAPKLM 292
              +V+      LR            S  +LFARKF P  L  L+  A  ++
Sbjct: 316 TPAEVSPELFRQLR--------VSNWSYSYLFARKFSPECLAPLMNIADDVI 359


>IMGA|Medtr3g107210.1 Unknown Protein (AHRD V1); contains Interpro
           domain(s)  IPR006566  FBD-like
           chr03_pseudomolecule_IMGAG_V3.5 38068477-38069610 H
           EGN_Mt100125 20100825
          Length = 377

 Score =  138 bits (348), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 107/304 (35%), Positives = 147/304 (48%), Gaps = 19/304 (6%)

Query: 3   DKELLWRASMVARVSEFPFKLTPKVAFLFLTKGSLPLAPLWELFFKG-HEGLYSIYVHYS 61
           ++ L   A+   R + +P KL    AF+FLT   LP A LWE +F    + LY+IY+H  
Sbjct: 76  EETLFIVANHTKRKATWPRKL----AFMFLTTTPLPFASLWESYFNQIPKKLYNIYIHAD 131

Query: 62  PSFNGTVPVDSVFYGRRIPSKET-RWGEFTIVXXXXXXXXXXXXDFSNQYFILLSESCIP 120
           P+F+   P   VF  R IPSK T R+                  D SN  FILLS SCIP
Sbjct: 132 PTFSYDPPFSGVFSNRIIPSKPTARFSPTLTSAARRLVARALIDDRSNYIFILLSSSCIP 191

Query: 121 LFNFSTIYNYLMGSEKSFVQSFDLRGPDGRYRYNPR----MSPTIMLDQWRKGSQWFQMK 176
           L +F+  Y+ L+ S KSF++  +   P    R+  R    M PT+ ++ +R GSQ++ + 
Sbjct: 192 LHSFNFTYHTLINSNKSFIEILN-NEPSSYDRWAARGEQAMLPTVKIEDFRIGSQFWALT 250

Query: 177 RNIAIEVVSDRKYFPVFQRFC--KGYCYADEHYLPTFVGIKFLKKTSNRTLTWIDWPKRG 234
           R  A  VVSDRK +  F + C     CY +E+Y  T + +   K   + TLT +DW  R 
Sbjct: 251 RKHARLVVSDRKIWSKFNKPCIRLDSCYPEENYFSTLINMWDPKGCVHATLTHVDWEGRD 310

Query: 235 P-HPSHYGRMDVTVNFLESLRNKGP-----CDYNGKENSICFLFARKFVPNALPRLLRFA 288
             HP  Y   +V    + SLR   P      D  G      FLFARKF    L  L   A
Sbjct: 311 DGHPRTYVADEVCPELIWSLRRDRPRYGDDDDNGGWRRRDPFLFARKFSAECLQLLTEIA 370

Query: 289 PKLM 292
             ++
Sbjct: 371 DGVI 374


>IMGA|Medtr3g106930.1 Unknown Protein (AHRD V1); contains Interpro
           domain(s)  IPR008942  ENTH/VHS
           chr03_pseudomolecule_IMGAG_V3.5 37943210-37942077 H
           EGN_Mt100125 20100825
          Length = 377

 Score =  138 bits (348), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 107/304 (35%), Positives = 147/304 (48%), Gaps = 19/304 (6%)

Query: 3   DKELLWRASMVARVSEFPFKLTPKVAFLFLTKGSLPLAPLWELFFKG-HEGLYSIYVHYS 61
           ++ L   A+   R + +P KL    AF+FLT   LP A LWE +F    + LY+IY+H  
Sbjct: 76  EETLFIVANHTKRKATWPRKL----AFMFLTTTPLPFASLWESYFNQIPKKLYNIYIHAD 131

Query: 62  PSFNGTVPVDSVFYGRRIPSKET-RWGEFTIVXXXXXXXXXXXXDFSNQYFILLSESCIP 120
           P+F+   P   VF  R IPSK T R+                  D SN  FILLS SCIP
Sbjct: 132 PTFSYDPPFSGVFSNRIIPSKPTARFSPTLTSAARRLVARALIDDRSNYIFILLSSSCIP 191

Query: 121 LFNFSTIYNYLMGSEKSFVQSFDLRGPDGRYRYNPR----MSPTIMLDQWRKGSQWFQMK 176
           L +F+  Y+ L+ S KSF++  +   P    R+  R    M PT+ ++ +R GSQ++ + 
Sbjct: 192 LHSFNFTYHTLINSNKSFIEILN-NEPSSYDRWAARGEQAMLPTVKIEDFRIGSQFWALT 250

Query: 177 RNIAIEVVSDRKYFPVFQRFC--KGYCYADEHYLPTFVGIKFLKKTSNRTLTWIDWPKRG 234
           R  A  VVSDRK +  F + C     CY +E+Y  T + +   K   + TLT +DW  R 
Sbjct: 251 RKHARLVVSDRKIWSKFNKPCIRLDSCYPEENYFSTLINMWDPKGCVHATLTHVDWEGRD 310

Query: 235 P-HPSHYGRMDVTVNFLESLRNKGP-----CDYNGKENSICFLFARKFVPNALPRLLRFA 288
             HP  Y   +V    + SLR   P      D  G      FLFARKF    L  L   A
Sbjct: 311 DGHPRTYVADEVCPELIWSLRRDRPRYGDDDDNGGWRRRDPFLFARKFSAECLQLLTEIA 370

Query: 289 PKLM 292
             ++
Sbjct: 371 DGVI 374


>IMGA|Medtr8g094310.1 BC10 protein (AHRD V1 ***- Q65XS5_ORYSJ);
           contains Interpro domain(s)  IPR001623  Heat shock
           protein DnaJ, N-terminal chr08_pseudomolecule_IMGAG_V3.5
           27201375-27205743 H EGN_Mt100125 20100825
          Length = 323

 Score =  133 bits (334), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 101/315 (32%), Positives = 141/315 (44%), Gaps = 76/315 (24%)

Query: 45  LFFKGHEGLYSIYVHYSPSF-----------------NGTVPVDSV-----FYGRRIPSK 82
           + F+G +  +SI+VH  P F                 N ++ ++ +      +GRR+  +
Sbjct: 8   MHFQGGDNNFSIFVHPRPGFVLNEATTRSSYFLNRQVNDSIQINELDRNKKLFGRRLWRR 67

Query: 83  ETR---WGEFTIVXXXXXXXXXXXXDFSNQYFILLSESCIPLFNFSTIYNYLMGSEKSFV 139
                 WGE +++            D  N  F+ LS+SCIPL+NFS  Y+Y+M +  SFV
Sbjct: 68  LIHVIDWGEASMIEAERILLRHALDDPLNDRFVFLSDSCIPLYNFSYTYDYIMSTPTSFV 127

Query: 140 QSF-DLRGPDGRYRYNPRMSPTIMLDQWRKGSQWFQMKRNIAIEVVSDRKYFPVFQRFCK 198
            SF D +G     RYNP+M P I +  WRKGSQW  + R  A  VV D   FP+FQ+FCK
Sbjct: 128 DSFADTKGG----RYNPKMDPVIPVYNWRKGSQWAVLTRKHAKVVVEDDTVFPMFQKFCK 183

Query: 199 GY--------------------CYADEHYLPTFVGIKFLKKT-SNRTLTWIDW------- 230
                                 C  DEHY+ T +  K L+K  + R++T   W       
Sbjct: 184 KKPLPEFWRDQVIPADTSKIHNCIPDEHYVQTLLAQKDLEKELTRRSVTHTAWDISNSRD 243

Query: 231 -PKRGPHPSHYGRMDVT---VNFLESLR-------------NKGPCDYNGKENSICFLFA 273
             +RG HP  Y   D T   + F++ L               +  C   GK  S CFLFA
Sbjct: 244 RERRGWHPVTYKFSDATPMLIKFIKGLTCTEIDNIYYETEYRREWCTSKGKP-STCFLFA 302

Query: 274 RKFVPNALPRLLRFA 288
           RKF   A  RLL  +
Sbjct: 303 RKFTRTAALRLLNMS 317


>IMGA|Medtr2g087610.1 AT5G22070 protein (Fragment) (AHRD V1 ***-
           B9DHZ2_ARATH); contains Interpro domain(s)  IPR000184
           Bacterial surface antigen (D15)
           chr02_pseudomolecule_IMGAG_V3.5 27129335-27125959 F
           EGN_Mt100125 20100825
          Length = 368

 Score =  124 bits (311), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 99/297 (33%), Positives = 138/297 (46%), Gaps = 39/297 (13%)

Query: 26  KVAFLFLTKGSLPLAPLWELFFKGH-EGLYSIYVHYSPSFNGTVPVDSVFYG---RRIPS 81
           K+AFLFLT   L   PLW LFF+     L+++YVH  P  N T+   S  Y    + I S
Sbjct: 78  KIAFLFLTNTDLHFTPLWNLFFQTTPSKLFNVYVHSDPRVNLTLLRSSNNYNPIFKFISS 137

Query: 82  KETRWGEFTIVXXXX-XXXXXXXXDFSNQYFILLSESCIPLFNFSTIYNYLMGSE----- 135
           K+T     T++             D SN YFI+LS+ CIPL +F  IY  L  S      
Sbjct: 138 KKTYRASPTLISATRRLLASAILDDASNAYFIVLSQYCIPLHSFDYIYKSLFLSPTFDLT 197

Query: 136 -------------KSFVQSFDLRGPDGRYRYNPR----MSPTIMLDQWRKGSQWFQMKRN 178
                        KSF++  +  GP    RY  R    M P +  +++R GSQ+F + R 
Sbjct: 198 DSESTQFGVRLKYKSFIEIIN-NGPRLWKRYTARGRYAMMPEVPFEKFRVGSQFFTLTRK 256

Query: 179 IAIEVVSDRKYFPVFQRFC--KGYCYADEHYLPTFVGIKFLKKTSNRTLTWIDWPKR-GP 235
            A+ VV DR  +  F+  C     CY +EHY PT + ++     +  TLT ++W      
Sbjct: 257 HALVVVKDRTLWRKFKVPCYRDDECYPEEHYFPTLLSMEDSDGVTGYTLTNVNWTGTVNG 316

Query: 236 HPSHYGRMDVTVNFLESLRNKGPCDYNGKENSICFLFARKFVPNALPRLLRFAPKLM 292
           HP  Y   +V+   +  LR           NS  FLFARKFVP+ L  L+  A  ++
Sbjct: 317 HPHTYQPEEVSPELILRLRKST--------NSESFLFARKFVPDCLEPLMGIAKSVI 365


>IMGA|Medtr1g031080.1 Unknown Protein (AHRD V1); contains Interpro
           domain(s)  IPR013626  Pheophorbide a oxygenase
           chr01_pseudomolecule_IMGAG_V3.5 9085918-9090204 E
           EGN_Mt100125 20100825
          Length = 292

 Score = 95.1 bits (235), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 73/241 (30%), Positives = 109/241 (45%), Gaps = 29/241 (12%)

Query: 73  VFYGRRIPSKETRWGEFTIVXXXXXXXXXXXXDFS-NQYFILLSESCIPLFNFSTIYNYL 131
           VF+ R I SK T+    +++            D   NQYF L+S+ C+PL +F  +YNYL
Sbjct: 57  VFHNRFISSKPTQRASPSLISAARRLLASALLDDPLNQYFALVSQHCVPLLSFRFVYNYL 116

Query: 132 MGSEKSFVQSFD-------------LRGPDGRYRYNPR----MSPTIMLDQWRKGSQWFQ 174
             ++   + SF                 P+   RYN R    M P +  + +R GSQ+F 
Sbjct: 117 FKNQLMSLASFSDFNLLYPSFIEILSEDPNLYERYNARGENVMLPEVPFEDFRVGSQFFI 176

Query: 175 MKRNIAIEVVSDRKYFPVFQRFCKGY--CYADEHYLPTFVGIKFLKKTSNRTLTWIDWPK 232
           + R  A  VV D K +  F+  C     CY +EHY PT + ++ L   +  TLT ++W  
Sbjct: 177 LNRKHAKVVVRDYKLWKKFRIPCVNLDSCYPEEHYFPTLLSMEDLNGCTGFTLTRVNWTG 236

Query: 233 R-GPHPSHYGRMDVTVNFLESLRNKGPCDYNGKENSICFLFARKFVPNALPRLLRFAPKL 291
               HP  Y   +V+   +  LR           +S  +LFARKF P  L  L+  A  +
Sbjct: 237 CWDGHPHLYTPEEVSPELIRQLR--------VSNSSYSYLFARKFSPECLAPLMDIADDV 288

Query: 292 M 292
           +
Sbjct: 289 I 289


>IMGA|Medtr1g041270.1 Unknown Protein (AHRD V1); contains Interpro
           domain(s)  IPR000209  Peptidase S8 and S53, subtilisin,
           kexin, sedolisin chr01_pseudomolecule_IMGAG_V3.5
           11453806-11458092 E EGN_Mt100125 20100825
          Length = 297

 Score = 93.6 bits (231), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 72/234 (30%), Positives = 106/234 (45%), Gaps = 29/234 (12%)

Query: 73  VFYGRRIPSKETRWGEFTIVXXXXXXXXXXXXDFS-NQYFILLSESCIPLFNFSTIYNYL 131
           VF+ R I SK T+    +++            D   NQYF L+S+ C+PL +F  +YNYL
Sbjct: 57  VFHNRFISSKPTQRASPSLISAARRLLASALLDDPLNQYFALVSQHCVPLLSFRFVYNYL 116

Query: 132 MGSEKSFVQSFD-------------LRGPDGRYRYNPR----MSPTIMLDQWRKGSQWFQ 174
             ++   + SF                 P+   RYN R    M P +  + +R GSQ+F 
Sbjct: 117 FKNQLMSLASFSDFNLLYPSFIEILSEDPNLYERYNARGENVMLPEVPFEDFRVGSQFFI 176

Query: 175 MKRNIAIEVVSDRKYFPVFQRFCKGY--CYADEHYLPTFVGIKFLKKTSNRTLTWIDWPK 232
           + R  A  VV D K +  F+  C     CY +EHY PT + ++ L   +  TLT ++W  
Sbjct: 177 LNRKHAKVVVRDYKLWKKFRIPCVNLDSCYPEEHYFPTLLSMEDLNGCTGFTLTRVNWTG 236

Query: 233 R-GPHPSHYGRMDVTVNFLESLRNKGPCDYNGKENSICFLFARKFVPNALPRLL 285
               HP  Y   +V+   +  LR           +S  +LFARKF P  L  L+
Sbjct: 237 CWDGHPHLYTPEEVSPELIRQLR--------VSNSSYSYLFARKFSPECLAPLM 282