Login

Statistical Models for Co-occurrence Data

Show full item record




Title: Statistical Models for Co-occurrence Data
Author: Hofmann, Thomas; Puzicha, Jan
Issue Date: 1998-02-01
Abstract: Modeling and predicting co-occurrences of events is a fundamental problem of unsupervised learning. In this contribution we develop a statistical framework for analyzing co-occurrence data in a general setting where elementary observations are joint occurrences of pairs of abstract objects from two finite sets. The main challenge for statistical models in this context is to overcome the inherent data sparseness and to estimate the probabilities for pairs which were rarely observed or even unobserved in a given sample set. Moreover, it is often of considerable interest to extract grouping structure or to find a hierarchical data organization. A novel family of mixture models is proposed which explain the observed data by a finite number of shared aspects or clusters. This provides a common framework for statistical inference and structure discovery and also includes several recently proposed models as special cases. Adopting the maximum likelihood principle, EM algorithms are derived to fit the model parameters. We develop improved versions of EM which largely avoid overfitting problems and overcome the inherent locality of EM--based optimization. Among the broad variety of possible applications, e.g., in information retrieval, natural language processing, data mining, and computer vision, we have chosen document retrieval, the statistical analysis of noun/adjective co-occurrence and the unsupervised segmentation of textured images to test and evaluate the proposed algorithms.
URI: http://hdl.handle.net/1721.1/7253
Other Identifiers: AIM-1625
CBCL-159
Series/Report no.: AIM-1625, CBCL-159

Files in this item

Files Size Format View
AIM-1625.pdf 1.464Mb PDF View/Open
AIM-1625.ps 1.827Mb Postscript View/Open

This item appears in the following Collection(s)

Show full item record

Search DSpace@MIT


Advanced Search

Browse

My Account

Links