Understanding and mitigating difficulties in posterior predictive evaluation

May 30, 2024

Abhinav Agrawal

We find that the accuracy of the simple MC PPD estimator suffers as dataset mismatch, latent dimension, or test data size increases. Adaptive importance sampling can help.

Abstract

Predictive posterior densities (PPDs) are of interest in approximate Bayesian inference. Typically, these are estimated by simple Monte Carlo (MC) averages using samples from the approximate posterior. We observe that the signal-to-noise ratio (SNR) of such estimators can be extremely low. An analysis for exact inference reveals that the SNR decays exponentially with increases in (a) the mismatch between training and test data, (b) the dimensionality of the latent space, or (c) the size of the test data relative to the training data. Further analysis extends these results to approximate inference. To remedy the low-SNR problem, we propose replacing simple MC sampling with importance sampling, using a proposal distribution optimized at test time on a variational proxy for the SNR, and demonstrate that this yields greatly improved estimates.
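The effect described in the abstract can be reproduced in a small sketch. The setup below is an illustrative assumption, not the paper's experiments: a conjugate Gaussian model with prior z ~ N(0, I) and likelihood x_i | z ~ N(z, I), chosen because the posterior and the exact PPD are available in closed form. The function names are hypothetical. The importance-sampling proposal here is the closed-form posterior given both training and test data, which is optimal in this model; it stands in for the proposal that the paper optimizes at test time, which this sketch does not implement.

```python
import numpy as np


def log_normal(x, mean, var):
    """Log density of an isotropic Gaussian N(mean, var * I), summed over the last axis."""
    return -0.5 * np.sum((x - mean) ** 2 / var + np.log(2 * np.pi * var), axis=-1)


def posterior(data):
    """Posterior N(mean, var * I) over z, for prior z ~ N(0, I) and x_i | z ~ N(z, I)."""
    n = data.shape[0]
    var = 1.0 / (n + 1.0)
    return var * data.sum(axis=0), var


def log_lik(test, z):
    """log p(D* | z) for each row of z. Shapes: test (m, d), z (K, d) -> (K,)."""
    return log_normal(test[None, :, :], z[:, None, :], 1.0).sum(axis=1)


def log_mean_exp(a):
    """Numerically stable log of the mean of exp(a)."""
    a_max = np.max(a)
    return a_max + np.log(np.mean(np.exp(a - a_max)))


def log_ppd_exact(train, test):
    """Exact log p(D* | D) via Bayes' rule, evaluated at an arbitrary point z = 0."""
    z = np.zeros((1, train.shape[1]))
    mean, var = posterior(train)
    full_mean, full_var = posterior(np.concatenate([train, test]))
    # p(D* | D) = p(D* | z) p(z | D) / p(z | D, D*) holds for every z.
    return (log_lik(test, z) + log_normal(z, mean, var)
            - log_normal(z, full_mean, full_var))[0]


def log_ppd_simple_mc(train, test, num_samples, rng):
    """Simple MC: average p(D* | z_k) over samples z_k drawn from p(z | D)."""
    mean, var = posterior(train)
    z = mean + np.sqrt(var) * rng.standard_normal((num_samples, train.shape[1]))
    return log_mean_exp(log_lik(test, z))


def log_ppd_importance(train, test, num_samples, rng):
    """Importance sampling with proposal q(z) = p(z | D, D*).

    Assumption: this closed-form proposal replaces the learned one; it makes
    every importance weight equal to p(D* | D) in this conjugate model.
    """
    mean, var = posterior(train)
    q_mean, q_var = posterior(np.concatenate([train, test]))
    z = q_mean + np.sqrt(q_var) * rng.standard_normal((num_samples, train.shape[1]))
    log_w = log_lik(test, z) + log_normal(z, mean, var) - log_normal(z, q_mean, q_var)
    return log_mean_exp(log_w)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    d, n, m = 20, 20, 20                          # latent dim, train size, test size
    train = rng.standard_normal((n, d))
    test = 2.0 + rng.standard_normal((m, d))      # test data shifted away from train
    print("exact     :", log_ppd_exact(train, test))
    print("simple MC :", log_ppd_simple_mc(train, test, 1000, rng))
    print("importance:", log_ppd_importance(train, test, 1000, rng))
```

Changing the shift (`2.0`), `d`, or `m` in the script should show how each of the three factors named in the abstract affects the simple MC estimate, while the importance-sampling estimate tracks the exact value.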

Type

Preprint

Publication

arXiv

Last updated on May 30, 2024

Posterior Predictive Evaluation, Importance Sampling, Approximate Inference

Authors

Abhinav Agrawal

PhD in Computer Science
