Learning Optimal Classification Trees Using a Binary Linear Program Formulation

Authors

  • Sicco Verwer, Delft University of Technology
  • Yingqian Zhang, Eindhoven University of Technology

DOI:

https://doi.org/10.1609/aaai.v33i01.33011624

Abstract

We provide a new formulation for the problem of learning the optimal classification tree of a given depth as a binary linear program. A limitation of previously proposed mathematical optimization formulations is that they create constraints and variables for every row in the training data. As a result, the running time of existing integer linear programming (ILP) formulations increases dramatically with the size of the data. In our new binary formulation, we aim to circumvent this problem by making the formulation size largely independent of the training data size. We show experimentally that our formulation achieves better performance than existing formulations on both small and large problem instances, and does so in less running time.

Published

2019-07-17

How to Cite

Verwer, S., & Zhang, Y. (2019). Learning Optimal Classification Trees Using a Binary Linear Program Formulation. Proceedings of the AAAI Conference on Artificial Intelligence, 33(01), 1625-1632. https://doi.org/10.1609/aaai.v33i01.33011624

Section

AAAI Technical Track: Constraint Satisfaction and Optimization