PyEPO: A PyTorch-based End-to-End Predict-then-Optimize Tool

Learning Framework

Publication

This repository is the official implementation of the paper "PyEPO: A PyTorch-based End-to-End Predict-then-Optimize Library for Linear and Integer Programming", accepted to Mathematical Programming Computation (MPC).

Citation:

@article{tang2024,
  title={PyEPO: a PyTorch-based end-to-end predict-then-optimize library for linear and integer programming},
  author={Tang, Bo and Khalil, Elias B},
  journal={Mathematical Programming Computation},
  issn={1867-2957},
  doi={10.1007/s12532-024-00255-x},
  year={2024},
  month={July},
  publisher={Springer}
}

If you use the CaVE loss, please also cite:

@inproceedings{tang2024cave,
  title={CaVE: A Cone-Aligned Approach for Fast Predict-then-Optimize with Binary Linear Programs},
  author={Tang, Bo and Khalil, Elias B},
  booktitle={Integration of Constraint Programming, Artificial Intelligence, and Operations Research},
  pages={193--210},
  year={2024},
  publisher={Springer}
}

Introduction

PyEPO (PyTorch-based End-to-End Predict-then-Optimize Tool) is an open-source Python library for modeling and solving predict-then-optimize problems with linear objective functions. Its core capability is to build an optimization model with GurobiPy, COPT, Pyomo, Google OR-Tools, MPAX, or any other solver or algorithm, and then embed that model into a neural network for end-to-end training. To this end, PyEPO implements a range of methods as PyTorch autograd modules.

For end-to-end learning on binary linear programs (TSP, CVRP, knapsack, ...), PyEPO ships CaVE [13]. Instead of solving an ILP at every training step, CaVE projects the predicted cost onto the cone of binding-constraint normals at the true optimum. The projection is computed by an interior-point QP solver (Clarabel) with a low iteration cap, and the resulting regret on TSP-scale binary LPs is in line with the CaVE paper. Because the cone projection is far cheaper than a per-instance ILP solve, CaVE trains an order of magnitude faster than SPO+ at this scale.
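
The cone projection described above can be illustrated with a small NumPy sketch. This is a conceptual toy, not PyEPO's implementation: it swaps the interior-point QP solve for plain projected gradient descent, and the names `project_onto_cone` and `cosine`, the cosine-based alignment score, and the toy problem are all assumptions made for illustration.

```python
import numpy as np


def project_onto_cone(cost, normals, iters=500):
    """Project `cost` onto cone{normals.T @ lam : lam >= 0} (hypothetical helper)."""
    gram = normals @ normals.T
    lipschitz = np.linalg.norm(gram, 2)  # largest eigenvalue of the Gram matrix
    lam = np.zeros(normals.shape[0])
    for _ in range(iters):
        # gradient of 0.5 * ||normals.T @ lam - cost||^2 with respect to lam
        grad = normals @ (normals.T @ lam - cost)
        # gradient step, then clip so the multipliers stay nonnegative
        lam = np.maximum(lam - grad / lipschitz, 0.0)
    return normals.T @ lam


def cosine(u, v, eps=1e-12):
    """Cosine similarity with a small constant to avoid division by zero."""
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v) + eps))


# Toy case (assumed): maximize c @ w over the unit square. The optimum
# w* = (1, 1) has binding constraints w0 <= 1 and w1 <= 1, with normals e0, e1,
# so w* is optimal exactly when the cost lies in cone{e0, e1}.
normals = np.eye(2)
pred_cost = np.array([1.0, -2.0])  # outside the cone: would not make w* optimal
proj = project_onto_cone(pred_cost, normals)
alignment_loss = 1.0 - cosine(pred_cost, proj)
print(proj, round(alignment_loss, 4))
```

No combinatorial solve appears anywhere in the loop: the only work is a small nonnegative least-squares problem, which is why this kind of projection can be much cheaper than an ILP per instance.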

PyEPO also integrates MPAX, a JAX solver that runs the first-order PDHG method on GPU. Because both the prediction network and the solver stay on the GPU, MPAX solves a whole mini-batch of instances at once and avoids the GPU-to-CPU transfer that CPU solvers like Gurobi pay at every training step.
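
For readers unfamiliar with PDHG, the basic iteration can be sketched in a few lines of NumPy. This is a textbook version on a standard-form LP and does not use the MPAX API; the function name `pdhg_lp`, the fixed step size, and the toy problem are assumptions for illustration, and a production solver is likely to add refinements that this sketch omits.

```python
import numpy as np


def pdhg_lp(c, A, b, iters=2000):
    """Approximately solve  min c @ x  s.t.  A @ x = b, x >= 0  with plain PDHG."""
    m, n = A.shape
    x, y = np.zeros(n), np.zeros(m)
    # Equal primal and dual steps chosen so that step**2 * ||A||_2**2 < 1.
    step = 0.9 / np.linalg.norm(A, 2)
    for _ in range(iters):
        # Primal step: move along the Lagrangian gradient, project onto x >= 0.
        x_new = np.maximum(x - step * (c - A.T @ y), 0.0)
        # Dual step: uses the extrapolated primal point 2 * x_new - x.
        y = y + step * (b - A @ (2.0 * x_new - x))
        x = x_new
    return x, y


# Toy LP (assumed): minimize x0 + 2 * x1 subject to x0 + x1 = 1, x >= 0.
c = np.array([1.0, 2.0])
A = np.array([[1.0, 1.0]])
b = np.array([1.0])
x_opt, y_opt = pdhg_lp(c, A, b)
print(np.round(x_opt, 4), round(float(c @ x_opt), 4))
```

Each iteration needs only matrix-vector products and an elementwise clip, with no factorization, which is what makes first-order methods of this kind a natural fit for batched execution on a GPU.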

Documentation

The official PyEPO docs can be found at https://khalil-research.github.io/PyEPO.

Slides

Our most recent tutorial was presented at the ACC 2024 conference. You can view the talk slides here.

Tutorial

01 Optimization Model: Build an optimization solver
02 Optimization Dataset: Generate synthetic data and use optDataset
03 Training and Testing: Train and test different approaches
04 CaVE for Binary Linear Programs: Train with the cone-aligned CaVE loss vs SPO+ on TSP
05 2D Knapsack Solution Visualization: Visualize solutions for the knapsack problem
06 Warcraft Shortest Path: Train shortest path models on the Warcraft terrains dataset
07 Real-World Energy Scheduling: Apply PyEPO to real energy data
08 kNN Robust Losses: Use optDatasetKNN for robust losses
09 Solving on MPAX with PDHG: Use MPAX for GPU-accelerated batch solving
10 JAX Frontend: Train any loss in JAX/Flax with jax.grad (MPAX or any solver)

Experiments

To reproduce the experiments in the original paper, please use the code and follow the instructions in this branch. Please note that this branch is a very early version.

Features

- End-to-end gradient surrogates for predict-then-optimize, covering the seven families in the docs:
  - Surrogate losses: convex upper bound on regret (SPO+ [1]) and finite-difference directional gradient (PG [11]).
  - Perturbed methods: Monte-Carlo gradients over random cost perturbations (DPO and PFYL [5] [6], I-MLE [9], AI-MLE [10]).
  - Regularized methods: L2-regularized Frank-Wolfe over the convex hull of feasible solutions (RFWO and RFYL [6]).
  - Black-box methods: informative gradient estimates that replace the solver's zero gradient, namely DBB [3] (interpolation) and NID [4] (signed identity).
  - Cone-aligned estimation: supervise the predicted cost by projecting onto the binding-constraint normals at the true optimum (CaVE [13], binary linear programs only); an order of magnitude faster than SPO+ at TSP scale.
  - Contrastive methods: margin against a cached pool of non-optimal solutions (NCE and CMAP [7]).
  - Learning to rank: rank the true optimum highest among the pool (pointwise / pairwise / listwise LTR [8]).
- Multi-solver backend under a unified optModel API: Gurobi, COPT, Pyomo, Google OR-Tools, and the GPU-native MPAX PDHG solver.
- Symbolic modeling with pyepo.dsl: define an LP, MIP, or QP once with Variable, Parameter, and constraints, then compile it to any backend. The compiled model is a standard optModel, so every loss above works unchanged.
- Parallel solving via a Pathos worker pool to amortize per-instance ILP solves across a mini-batch.
- Solution caching [7]: reuse previously computed optima to skip redundant solver calls in contrastive and ranking training.
- kNN-smoothed targets [12]: replace each label with a neighborhood aggregate for noise-robust regret.
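
The idea behind solution caching can be sketched as follows. This is an illustrative toy rather than PyEPO's implementation: the class `SolutionPool`, its `solve_ratio` argument, and the one-hot toy solver are hypothetical names chosen for this example, and a minimization problem is assumed.

```python
import numpy as np


class SolutionPool:
    """Hypothetical cache of feasible solutions for a minimization problem."""

    def __init__(self, solver, solve_ratio=0.1, seed=0):
        self.solver = solver            # exact solver: cost vector -> solution
        self.solve_ratio = solve_ratio  # probability of calling the exact solver
        self.rng = np.random.default_rng(seed)
        self.pool = []
        self.solver_calls = 0

    def solve(self, cost):
        # Call the exact solver if the pool is empty, or with prob. solve_ratio.
        if not self.pool or self.rng.random() < self.solve_ratio:
            sol = self.solver(cost)
            self.solver_calls += 1
            if not any(np.array_equal(sol, s) for s in self.pool):
                self.pool.append(sol)
            return sol
        # Otherwise return the best cached solution under the given cost.
        objs = np.array(self.pool) @ cost
        return self.pool[int(np.argmin(objs))]


def pick_cheapest(cost):
    """Toy exact solver (assumed): choose exactly one item, the cheapest."""
    sol = np.zeros_like(cost)
    sol[int(np.argmin(cost))] = 1.0
    return sol


cache = SolutionPool(pick_cheapest, solve_ratio=0.0)
first = cache.solve(np.array([1.0, 2.0, 3.0]))   # pool is empty: exact solve
second = cache.solve(np.array([3.0, 2.0, 1.0]))  # served from the pool
print(first, second, cache.solver_calls)
```

The cached answer is only the best solution seen so far, so it can be suboptimal for a new cost vector; the trade-off is fewer solver calls in exchange for an approximate inner solve.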

Installation

Clone and Install from this Repo

You can download PyEPO from our GitHub repository.

git clone -b main --depth 1 https://github.com/khalil-research/PyEPO.git

Then install the package from the cloned directory:

pip install PyEPO/pkg/.

Pip Install

PyEPO is available on PyPI. You can install it with pip by running the following command:

pip install pyepo

Conda Install

PyEPO is also available on Anaconda Cloud. If you prefer to use conda for installation, you can install PyEPO with the following command:

conda install -c pyepo pyepo
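
Whichever route is used, one way to confirm that the package is visible to the current interpreter is to query its installed version with the standard library. The helper name `installed_version` is an assumption for this sketch, and the distribution name `pyepo` is taken from the pip command above.

```python
from importlib import metadata


def installed_version(package="pyepo"):
    """Return the installed version string, or None if the package is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


version = installed_version()
print("pyepo version:", version if version is not None else "not installed")
```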

Dependencies

Sample Code

An end-to-end predict-then-optimize example. The optimization model is defined with pyepo.dsl and compiled to Gurobi; change backend to run the same model on COPT, Pyomo, OR-Tools, or MPAX.

#!/usr/bin/env python
# coding: utf-8

import numpy as np
import pyepo
from pyepo import EPO, dsl
import torch
from torch import nn
from torch.utils.data import DataLoader


# prediction model
class LinearRegression(nn.Module):

    def __init__(self):
        super(LinearRegression, self).__init__()
        self.linear = nn.Linear(num_feat, num_item)

    def forward(self, x):
        out = self.linear(x)
        return out


if __name__ == "__main__":

    # generate data
    num_data = 1000 # number of data
    num_feat = 5 # size of feature
    num_item = 10 # number of items
    weights, x, c = pyepo.data.knapsack.genData(num_data, num_feat, num_item,
                                                dim=3, deg=4, noise_width=0.5, seed=135)

    # optimization model: define symbolically, compile to Gurobi
    items = dsl.Variable(num_item, vtype=EPO.BINARY)
    cost = dsl.Parameter(num_item)
    optmodel = dsl.Problem(dsl.Maximize(cost @ items),
                           [weights @ items <= np.array([7, 8, 9])]).compile(backend="gurobi")

    # init prediction model
    predmodel = LinearRegression()
    # set optimizer
    optimizer = torch.optim.Adam(predmodel.parameters(), lr=1e-2)
    # init SPO+ loss
    spop = pyepo.func.SPOPlus(optmodel, processes=1)

    # build dataset
    dataset = pyepo.data.dataset.optDataset(optmodel, x, c)
    # get data loader
    dataloader = DataLoader(dataset, batch_size=32, shuffle=True)

    # training
    num_epochs = 10
    for epoch in range(num_epochs):
        for data in dataloader:
            x, c, w, z = data
            # forward pass
            cp = predmodel(x)
            loss = spop(cp, c, w, z)
            # backward pass
            optimizer.zero_grad()
            loss.backward()
            optimizer.step()

    # eval
    regret = pyepo.metric.regret(predmodel, optmodel, dataloader)
    print("Regret on Training Set: {:.4f}".format(regret))
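
To make the SPO+ loss used in the training loop less of a black box, here is a brute-force NumPy sketch of the loss and one of its subgradients. It follows the standard minimization form of SPO+ and enumerates a tiny feasible set instead of calling a solver; it is not PyEPO's code, and the helper names (`solve`, `spo_plus`, `regret`) and the toy data are assumptions for illustration.

```python
import numpy as np


def solve(cost, feasible):
    """Brute-force argmin of cost @ w over the rows of `feasible`."""
    return feasible[int(np.argmin(feasible @ cost))]


def spo_plus(pred_cost, true_cost, feasible):
    """SPO+ loss and a subgradient w.r.t. pred_cost (minimization form)."""
    w_true = solve(true_cost, feasible)
    z_true = float(true_cost @ w_true)
    shifted = 2 * pred_cost - true_cost
    w_shift = solve(shifted, feasible)
    # loss = -min_w (2*pred - true) @ w + 2 * pred @ w_true - z_true
    loss = -float(shifted @ w_shift) + 2 * float(pred_cost @ w_true) - z_true
    grad = 2 * (w_true - w_shift)
    return loss, grad


def regret(pred_cost, true_cost, feasible):
    """Extra true cost paid by acting on the prediction instead of the truth."""
    w_pred = solve(pred_cost, feasible)
    w_true = solve(true_cost, feasible)
    return float(true_cost @ (w_pred - w_true))


# Toy feasible set (assumed): pick exactly one of three items.
feasible = np.eye(3)
true_cost = np.array([1.0, 2.0, 3.0])
pred_cost = np.array([3.0, 2.0, 1.0])
loss, grad = spo_plus(pred_cost, true_cost, feasible)
print(loss, grad, regret(pred_cost, true_cost, feasible))
```

In this toy case the loss is no smaller than the regret, and it drops to zero when the prediction equals the true cost, which matches the description of SPO+ as a convex upper bound on regret.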

JAX frontend (`pyepo.func.jax`)

End-to-end training of a shortest-path predictor on a 5x5 grid with the SPO+ loss (Flax + optax):

import jax
import jax.numpy as jnp
import optax
from flax import linen as nn

import pyepo
from pyepo.data.dataset import optDataset
from pyepo.func.jax import SPOPlus

# optimization model: 5x5 grid shortest path (any PyEPO solver works)
grid = (5, 5)
optmodel = pyepo.model.shortestPathModel(grid)

# synthetic data
x, c = pyepo.data.shortestpath.genData(
    num_data=1000, num_features=5, grid=grid, deg=4, noise_width=0.5, seed=135,
)
ds = optDataset(optmodel, x, c)
xj = jnp.asarray(x, jnp.float32)
cj, wj, zj = (jnp.asarray(a, jnp.float32) for a in (ds.costs, ds.sols, ds.objs))

# linear predictor and SPO+ loss
predmodel = nn.Dense(optmodel.num_cost)
params = predmodel.init(jax.random.PRNGKey(0), xj[:1])
spo = SPOPlus(optmodel, reduction="mean")
optimizer = optax.adam(1e-2)
opt_state = optimizer.init(params)

# end-to-end training
for epoch in range(10):
    grads = jax.grad(lambda p: spo(predmodel.apply(p, xj), cj, wj, zj))(params)
    updates, opt_state = optimizer.update(grads, opt_state)
    params = optax.apply_updates(params, updates)
