knowledge-database (beta)

Current group: comp.ai.

JMLR: Learning Hidden Variable Networks: The Information Bottleneck Approach

JMLR: Learning Hidden Variable Networks: The Information Bottleneck Approach  
Redistributed
From:Redistributed
Subject:JMLR: Learning Hidden Variable Networks: The Information Bottleneck Approach
Date:Thu, 20 Jan 2005 18:55:03 GMT
[[Redistributed from JMLR announce]]

~From: elm@cs.umass.edu
~Date: Thu, 13 Jan 2005 09:29:36 -0500
~Subject: [Jmlr-announce] Learning Hidden Variable Networks: The Information Bottleneck Approach

The Journal of Machine Learning Research (www.jmlr.org) is pleased to
announce publication of a new paper:
------------------------------------------------------------------------
-------
Learning Hidden Variable Networks: The Information Bottleneck Approach
Gal Elidan and Nir Friedman
JMLR 6 (Jan): 81--127, 2005

Abstract

A central challenge in learning probabilistic graphical models is
dealing with domains that involve hidden variables. The common approach
for learning model parameters in such domains is the expectation
maximization (EM) algorithm. This algorithm, however, can easily get
trapped in suboptimal local maxima. Learning the model structure is
even more challenging. The structural EM algorithm can adapt the
structure in the presence of hidden variables, but usually performs
poorly without prior knowledge about the cardinality and location of
the hidden variables. In this work, we present a general approach for
learning Bayesian networks with hidden variables that overcomes these
problems. The approach builds on the information bottleneck framework
of Tishby et al. (1999). We start by proving formal correspondence
between the information bottleneck objective and the standard
parametric EM functional. We then use this correspondence to construct
a learning algorithm that combines an information-theoretic smoothing
term with a continuation procedure. Intuitively, the algorithm bypasses
local maxima and achieves superior solutions by following a continuous
path from a solution of, an easy and smooth, target function, to a
solution of the desired likelihood function. As we show, our
algorithmic framework allows learning of the parameters as well as the
structure of a network. In addition, it also allows us to introduce new
hidden variables during model selection and learn their cardinality. We
demonstrate the performance of our procedure on several challenging
real-life data sets.
------------------------------------------------------------------------
------
This paper and previous papers are available electronically at
http://www.jmlr.org in PDF format. The papers of Volumes 1-4 were also
published in hardcopy by MIT Press; please see
http://mitpress.mit.edu/JMLR for details. Volume 5 and subsequent
volumes will be printed in hardcopy by Microtome Publishing. Please see
http://www.mtome.com/Publications/jmlr.html for details and ordering
information.

-Erik G. Learned-Miller
elm@cs.umass.edu


_________________
Erik G. Learned-Miller
University of Massachusetts Amherst
http://www.cs.umass.edu/~elm
(413) 545-2993


_______________________________________________
Jmlr-announce mailing list
Jmlr-announce@lists.csail.mit.edu
http://lists.csail.mit.edu/mailman/listinfo/jmlr-announce

[ comp.ai is moderated. To submit, just post and be patient, or if ]
[ that fails mail your article to , and ]
[ ask your news administrator to fix the problems with your system. ]
   

Copyright © 2006 knowledge-database   -   All rights reserved