The aspect Bernoulli model: multiple causes of presences and absences

  • Authors:
  • Ella Bingham; Ata Kabán; Mikael Fortelius

  • Affiliations:
  • University of Helsinki and Helsinki University of Technology, Helsinki Institute for Information Technology, P.O. Box 68, 00014, Helsinki, Finland; University of Birmingham, School of Computer Science, Birmingham, UK; University of Helsinki, Division of Palaeontology, Helsinki, Finland

  • Venue:
  • Pattern Analysis & Applications
  • Year:
  • 2009

Abstract

We present a probabilistic multiple-cause model for the analysis of binary (0–1) data. A distinctive feature of the aspect Bernoulli (AB) model is its ability to automatically detect and distinguish between “true absences” and “false absences” (both coded as 0 in the data) and, similarly, between “true presences” and “false presences” (both coded as 1). This is accomplished by dedicated additive noise components that explicitly account for such non-content-bearing causes. The AB model is thus suitable for noise removal and for explaining data, including the detection of omissions and additions. An important application of AB that we demonstrate is data-driven reasoning about palaeontological recordings. We also report results on recovering corrupted handwritten digit images and on expanding short text documents, and we compare AB with other methods.
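
The abstract does not give the model equations, so the following is only an illustrative sketch of how additive noise components could separate true from false absences and presences. It assumes (this is not stated in the abstract) that each attribute of an observation draws its own cause from observation-specific mixing weights and is then sampled from that cause's Bernoulli parameter. The function names, the toy parameter values, and the three-cause setup are all hypothetical.

```python
import random

def sample_aspect_bernoulli(mixing, bern, rng):
    """Draw one binary vector and the per-attribute cause that generated it.

    mixing: K cause weights for this observation (assumed to sum to 1).
    bern:   K lists of T Bernoulli parameters, bern[k][t] = P(x_t = 1 | cause k).
    """
    n_attrs = len(bern[0])
    x, causes = [], []
    for t in range(n_attrs):
        # Assumption: every attribute picks its own cause, so a single
        # observation can be explained by several causes at once.
        k = rng.choices(range(len(mixing)), weights=mixing)[0]
        causes.append(k)
        x.append(1 if rng.random() < bern[k][t] else 0)
    return x, causes

def presence_probability(mixing, bern, t):
    """Marginal P(x_t = 1): a convex combination of the cause parameters."""
    return sum(w * b[t] for w, b in zip(mixing, bern))

# Hypothetical toy setup: one content-bearing cause plus two noise causes.
CONTENT = [0.9, 0.8, 0.1, 0.2]
ALWAYS_ABSENT = [0.0, 0.0, 0.0, 0.0]   # a 0 drawn from here is a "false absence"
ALWAYS_PRESENT = [1.0, 1.0, 1.0, 1.0]  # a 1 drawn from here is a "false presence"
BERN = [CONTENT, ALWAYS_ABSENT, ALWAYS_PRESENT]
MIXING = [0.7, 0.2, 0.1]

rng = random.Random(0)
x, causes = sample_aspect_bernoulli(MIXING, BERN, rng)
```

In this sketch, a 0 whose cause index is 1 plays the role of a false absence and a 1 whose cause index is 2 plays the role of a false presence. In real data the causes are unobserved and would have to be inferred, which is the task the abstract attributes to the AB model.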