A Hybrid Learning-to-Optimize Framework for Mixed-Integer Quadratic Programming

Published in L4DC, 2026

This work proposes a learning-to-optimize (L2O) framework for accelerating the solution of parametric MIQP problems by learning structured solution components directly from data. The key idea is to predict high-quality integer decisions using a neural network, while preserving exact continuous optimality by solving a differentiable quadratic programming (QP) layer conditioned on the predicted integers. By explicitly separating discrete and continuous variables, the framework leverages problem structure and improves both feasibility and performance.

To train the model, we introduce a hybrid loss function that combines:

a supervised loss, encouraging predicted integer solutions to match globally optimal ones when labels are available, and
a self-supervised loss, derived directly from the MIQP objective and constraints, promoting feasibility and consistency even without labeled solutions.

This hybrid learning strategy bridges the gap between purely supervised and purely self-supervised approaches. The effectiveness of the proposed method is demonstrated on benchmark MI-MPC problems, where it achieves significant computational speedups while maintaining strong feasibility and near-optimal control performance.

Paper: https://arxiv.org/abs/2511.19383
GitHub Repository: https://github.com/mlab-upenn/L2O-MIQP

BibTeX:

@inproceedings{le2026hybrid,
title=, 
author={Le, Viet-Anh and Xie, Mu and Mangharam, Rahul},
year={2026},
booktitle={8th Annual Learning for Dynamics \& Control Conference},
}

Download Paper arXiv GitHub

Share on

Bluesky Facebook LinkedIn X (formerly Twitter)

Mu Xie

Share on