Automatic Bayesian Density Analysis

Antonio Vergari; Alejandro Molina; Robert Peharz; Zoubin Ghahramani; Kristian Kersting; Isabel Valera

doi:10.1609/aaai.v33i01.33015207

Authors

Antonio Vergari Max-Planck Institute for Intelligent Systems
Alejandro Molina TU Darmstadt
Robert Peharz University of Cambridge
Zoubin Ghahramani University of Cambridge
Kristian Kersting TU Darmstadt
Isabel Valera Max Planck Institute for Intelligence Systems

DOI:

https://doi.org/10.1609/aaai.v33i01.33015207

Abstract

Making sense of a dataset in an automatic and unsupervised fashion is a challenging problem in statistics and AI. Classical approaches for exploratory data analysis are usually not flexible enough to deal with the uncertainty inherent to real-world data: they are often restricted to fixed latent interaction models and homogeneous likelihoods; they are sensitive to missing, corrupt and anomalous data; moreover, their expressiveness generally comes at the price of intractable inference. As a result, supervision from statisticians is usually needed to find the right model for the data. However, since domain experts are not necessarily also experts in statistics, we propose Automatic Bayesian Density Analysis (ABDA) to make exploratory data analysis accessible at large. Specifically, ABDA allows for automatic and efficient missing value estimation, statistical data type and likelihood discovery, anomaly detection and dependency structure mining, on top of providing accurate density estimation. Extensive empirical evidence shows that ABDA is a suitable tool for automatic exploratory analysis of mixed continuous and discrete tabular data.

Automatic Bayesian Density Analysis

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription