Convex Formulations for Fair Principal Component Analysis

Matt Olfat; Anil Aswani

doi:10.1609/aaai.v33i01.3301663

Authors

Matt Olfat University of California, Berkeley
Anil Aswani University of California, Berkeley

DOI:

https://doi.org/10.1609/aaai.v33i01.3301663

Abstract

Though there is a growing literature on fairness for supervised learning, incorporating fairness into unsupervised learning has been less well-studied. This paper studies fairness in the context of principal component analysis (PCA). We first define fairness for dimensionality reduction, and our definition can be interpreted as saying a reduction is fair if information about a protected class (e.g., race or gender) cannot be inferred from the dimensionality-reduced data points. Next, we develop convex optimization formulations that can improve the fairness (with respect to our definition) of PCA and kernel PCA. These formulations are semidefinite programs, and we demonstrate their effectiveness using several datasets. We conclude by showing how our approach can be used to perform a fair (with respect to age) clustering of health data that may be used to set health insurance rates.

Convex Formulations for Fair Principal Component Analysis

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription