
PERCEPTION

Exponential family sparse coding with applications to self-taught learning

Honglak Lee, Rajat Raina, Alex Teichman, Andrew Y. Ng

Year of publication: 2009
Citations: 71

Abstract

Sparse coding is an unsupervised learning algorithm for finding concise, slightly higher-level representations of inputs, and has been successfully applied to self-taught learning, where the goal is to use unlabeled data to help on a supervised learning task, even if the unlabeled data cannot be associated with the labels of the supervised task [Raina et al., 2007]. However, sparse coding uses a Gaussian noise model and a quadratic loss function, and thus performs poorly if applied to binary-valued, integer-valued, or other non-Gaussian data, such as text. Drawing on ideas from generalized linear models (GLMs), we present a generalization of sparse coding to learning with data drawn from any exponential family distribution (such as Bernoulli, Poisson, etc.). This gives a method that we argue is much better suited to model other data types than Gaussian. We present an algorithm for solving the L1-regularized optimization problem defined by this model, and show that it is especially efficient when the optimal solution is sparse. We also show that the new model results in significantly improved self-taught learning performance when applied to text classification and to a robotic perception task.
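To make the L1-regularized problem concrete, the following is a minimal sketch for the Poisson case. It is not the algorithm presented in the paper, which the abstract does not describe. It assumes the standard exponential-family form in which the natural parameter is eta = B s for a fixed basis matrix B, so the per-example objective is sum_i (exp(eta_i) - x_i * eta_i) + beta * ||s||_1, and it solves for the activations s with plain proximal gradient descent (ISTA) as a stand-in solver. The function names, the step size, and the toy data are all illustrative assumptions.

```python
import math


def soft_threshold(v, t):
    """Proximal operator of t * |v|: shrink v toward zero by t."""
    if v > t:
        return v - t
    if v < -t:
        return v + t
    return 0.0


def natural_params(B, s):
    """Compute eta = B s for a basis B given as a list of rows."""
    return [sum(b_ij * s_j for b_ij, s_j in zip(row, s)) for row in B]


def poisson_objective(x, B, s, beta):
    """Negative Poisson log-likelihood (dropping terms constant in s) plus L1 penalty."""
    eta = natural_params(B, s)
    nll = sum(math.exp(e) - x_i * e for x_i, e in zip(x, eta))
    return nll + beta * sum(abs(s_j) for s_j in s)


def poisson_sparse_code(x, B, beta, step=0.05, iters=500):
    """Assumed stand-in solver: fixed-step proximal gradient (ISTA).

    The step size must be small enough for the data at hand; no line
    search is performed in this sketch.
    """
    n_basis = len(B[0])
    s = [0.0] * n_basis
    for _ in range(iters):
        eta = natural_params(B, s)
        # Gradient of the smooth part with respect to eta: exp(eta) - x.
        resid = [math.exp(e) - x_i for x_i, e in zip(x, eta)]
        # Chain rule: gradient with respect to s is B^T resid.
        grad = [sum(B[i][j] * resid[i] for i in range(len(x)))
                for j in range(n_basis)]
        s = [soft_threshold(s[j] - step * grad[j], step * beta)
             for j in range(n_basis)]
    return s


# Toy usage: two count-valued inputs and an identity basis (hypothetical data).
x = [3, 1]
B = [[1.0, 0.0], [0.0, 1.0]]
s = poisson_sparse_code(x, B, beta=0.5)
print(s, poisson_objective(x, B, s, beta=0.5))
```

Swapping the log-partition term exp(eta) for log(1 + exp(eta)) would give the Bernoulli case under the same assumptions. The soft-thresholding step is what sets activations to exactly zero, which is the sparsity the abstract refers to.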

Keywords

Computer science, Artificial intelligence, Exponential family, Neural coding, Machine learning, Data modeling, Gaussian, Pattern recognition (psychology), Algorithm, Mathematics

