PepTune: De Novo Generation of Therapeutic Peptides with Multi-Objective-Guided Discrete Diffusion¶
Conference: ICML 2025
arXiv: 2412.17780
Code: https://huggingface.co/ChatterjeeLab/PepTune
Area: Computational Biology
Keywords: Therapeutic peptide design, discrete diffusion, multi-objective optimization, MCTS, SMILES
TL;DR¶
PepTune combines a Masked Discrete Language Model (MDLM) with a Monte Carlo Tree Search (MCTS) multi-objective-guided strategy within the discrete peptide SMILES space to optimize multiple therapeutic properties (binding affinity, solubility, membrane permeability, etc.) simultaneously, enabling the de novo design of peptide drugs containing non-canonical amino acids and cyclization modifications.
Background & Motivation¶
Background: Peptide therapeutics (such as GLP-1 receptor agonists semaglutide/liraglutide) have achieved milestone successes in treating diseases like diabetes and obesity. Peptides are capable of binding diverse protein surfaces and disrupting protein-protein interactions; since 2000, 33 therapeutic peptides have been approved by the FDA.
Limitations of Prior Work: Designing peptides that simultaneously satisfy multiple conflicting objectives (e.g., binding affinity, solubility, and membrane permeability) remains a major challenge. Existing methods are limited to (1) continuous spaces, (2) unconditional generation, or (3) single-objective guidance. Traditional methods rely on screening random combinatorial libraries of \(10^{12}\) magnitude.
Key Challenge: Therapeutic peptides require non-canonical amino acids (nAAs) and cyclization modifications to improve stability and permeability, but existing deep learning models can only process 20 standard amino acids. Meanwhile, multi-objective guidance in discrete spaces is extremely difficult because gradients cannot be directly computed.
Goal: To build a diffusion model that can perform multi-objective conditional generation in the discrete peptide SMILES space.
Key Insight: (1) Representing peptides using SMILES instead of amino acid sequences to support nAAs and cyclization; (2) Using MCTS rather than gradient guidance to solve the challenge of discrete-space guidance.
Core Idea: MDLM is responsible for exploring valid structures within the discrete peptide space, while MCTS guides the generation process to evolve towards the Pareto-optimal direction across multiple therapeutic targets.
Method¶
Overall Architecture¶
- Input: Target protein sequence + list of therapeutic properties to be optimized
- First Stage: Pre-train PepMDLM (unconditional generator) on 11 million peptide SMILES
- Second Stage: Perform multi-objective conditional guidance on the generation process using an MCTS strategy guided by property classifiers
- Output: A set of Pareto-optimal peptide SMILES + corresponding property scores
Key Designs¶
-
State-dependent Masking Schedule:
- Key Insight: Peptide bonds are the foundational structure of all valid peptides.
- Designing a polynomial masking schedule: \(\alpha_t(\mathbf{x}_0) = 1 - t^w\) for peptide bond tokens, and \(\alpha_t = 1 - t\) for non-peptide bond tokens.
- Peptide bond tokens are masked later in the forward process and unmasked earlier in the reverse process.
- The loss weight of peptide bond tokens in training is scaled up by a factor of \(w\): \(\frac{w}{t} \log \langle \mathbf{x}_0, \mathbf{x}_\theta \rangle\).
- Why: The vast majority of arbitrary SMILES are not valid peptides—letting the model first "build the backbone" before filling in side chains is crucial.
-
Global Sequence Invalidity Loss (Invalid Loss):
- Take the argmax of predicted probabilities to obtain a discrete sequence, then check if it represents a valid peptide.
- Invalid sequences backpropagate gradients using a penalty weighted by softmax probabilities.
- \(\mathcal{L}_{\text{invalid}} = \sum_\ell \text{SM}(x_{\theta,k}^{(\ell)}) \cdot \mathbf{1}[\tilde{\mathbf{x}}_0 \text{ is Invalid}]\)
- Why: Since argmax is non-differentiable, the softmax probability is used as a surrogate to bypass this bottleneck.
-
MCTS Multi-Objective Guidance:
- Selection: Starting from the root node (fully masked), non-dominated child paths are selected based on cumulative rewards.
- Expansion: Sample \(M\) different unmasking schemes using Gumbel noise.
- Rollout: Greedily unmask to a complete sequence and calculate scores for \(K\) objectives using classifiers.
- Backpropagation: The reward vector is backpropagated to update all nodes along the path.
- Reward Definition: \(r_k(\mathbf{x}) = \frac{1}{|\mathcal{P}^*|} \sum_n \mathbb{I}[s_k(\mathbf{x}) \geq s_k(\tilde{\mathbf{x}}_n)]\) (the proportion of designs beaten in the Pareto set).
- Penalty for invalid peptides: Deduct scores proportional to the invalidity ratio across all dimensions.
- Why: Since the discrete space lacks gradients, MCTS achieves gradient-free multi-objective guidance through search and reward feedback.
Loss & Training¶
- Total Loss: \(\mathcal{L} = \mathcal{L}_{\text{NELBO}}^\infty + \mathcal{L}_{\text{invalid}}\)
- RoFormer Backbone: 8 layers, 768 hidden dimensions, 8 attention heads
- Training Data: 11 million peptide SMILES (synthetic data from SmProt + CycloPs)
- PeptideCLM SPE Tokenization: 581 tokens, averaging 4 characters/token
- 8×A6000 GPUs, 1600 GPU hours, AdamW, lr=3e-4
- MCTS: 128 iterations, 50 children/expansion
Key Experimental Results¶
Main Results¶
| Target Protein | Property | Best Docking Score | Baseline/Control |
|---|---|---|---|
| TfR (Blood-brain barrier) | Binding + Solubility + Non-hemolytic | -8.4 kcal/mol | T7 peptide: -8.4 (but PepTune is shorter) |
| GLP-1R (Diabetes) | Binding + Solubility + Non-hemolytic | -7.4 kcal/mol | Semaglutide: -5.7 (longer) |
| GFAP (Alexander disease) | Binding + Permeability + Solubility + Non-hemolytic | -8.5 kcal/mol | No known peptide binder |
| TfR+GLAST (Dual-target) | Dual-target binding + Solubility | TfR: -10.5, GLAST: -9.2 | First dual-target peptide design |
Ablation Study¶
| Configuration | Validity↑ | Diversity | Description |
|---|---|---|---|
| PepMDLM (No Guidance) | 45% (len=100) | 0.705 | Baseline unconditional model |
| PepTune (MCTS-Guided) | 100% (after 20 iter) | 0.677 | MCTS boosts validity to 100% |
| HELM-GPT (Control) | 83.9% | 0.595 | HELM representation, does not support nAAs |
| Without state-dependent mask | ~30% | - | Validity drops significantly |
| Without invalid loss | ~35% | - | Validity drops |
Key Findings¶
- MCTS guidance achieves a 100% valid peptide generation rate after just 20 iterations.
- The generated peptides are rich in non-canonical amino acids (average of 2.94 per peptide) and cyclization structures.
- Multi-objective guidance does not significantly sacrifice diversity (dropping by only 0.03 compared to unconditional generation).
- Dual-target peptide experiments demonstrate that PepTune can optimize binding affinities for two proteins simultaneously.
- The docking score of the GLP-1R binder outperforms semaglutide with a shorter sequence.
Highlights & Insights¶
- First Discrete-Space Multi-Objective Guided Diffusion: The integration of MCTS and masked diffusion is a significant contribution to the field.
- Advantages of SMILES Representation: Supports nAAs and cyclization, greatly expanding the designable space of peptides.
- Physical Intuition of State-Dependent Masking: The generation order of "backbone first, side chains second" aligns well with chemical intuition.
- Clinical Relevance: Case studies on multiple real-world therapeutic targets demonstrate promising prospects for downstream experimental validation.
Limitations & Future Work¶
- Reliance on synthetic peptide data (CycloPs); rare nAAs may increase the difficulty and cost of synthesis.
- The quality of property classifiers is a bottleneck—properties like membrane permeability lack external validation.
- MCTS sampling speed is relatively slow (128 iterations × 50 children × rollout).
- Lack of wet-lab validation (only computational docking has been performed).
Related Work & Insights¶
- MDLM (Sahoo et al.) provides the foundation for masked discrete diffusion.
- Structure-based models like RFpeptides require the target's 3D structure, whereas PepTune only needs the sequence.
- Inspiration: The MCTS guidance strategy can be extended to other discrete biological sequence generation tasks such as protein design and DNA sequence optimization.
Rating¶
- Novelty: ⭐⭐⭐⭐⭐ The combination of MCTS + discrete diffusion + peptide SMILES is highly innovative.
- Experimental Thoroughness: ⭐⭐⭐⭐⭐ 5 single-target + 2 dual-target case studies, with detailed ablation details.
- Writing Quality: ⭐⭐⭐⭐ Rich content but somewhat lengthy.
- Value: ⭐⭐⭐⭐⭐ Significant practical value for drug design.