2006
Energy-Based Models (EBMs) capture dependencies between variables by associating a scalar energy to each configuration of the variables. Inference consists in clamping the value of observed variables and finding configurations of the remaining variables that minimize the energy. Learning consists in finding an energy function in which observed configurations of the variables are given lower energies than unobserved ones. The EBM approach provides a common theoretical framework for many learning models, including traditional discriminative and generative approaches, as well as graph-transformer networks, conditional random fields, maximum margin Markov networks, and several manifold learning methods.
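To make the inference rule concrete, here is a minimal sketch (not from the paper itself) assuming a hypothetical quadratic energy E(x, y) = ||y - Wx||^2 and a finite set of candidate answers: the observed variables are clamped and the remaining variables are chosen to minimize the energy.

```python
import numpy as np

# Minimal sketch of energy-based inference, assuming a hypothetical
# quadratic energy E(x, y) = ||y - W x||^2 over a finite candidate set.
def energy(W, x, y):
    return float(np.sum((y - W @ x) ** 2))

def infer(W, x, candidates):
    """Clamp the observed input x and return the candidate answer y
    with the smallest energy."""
    return min(candidates, key=lambda y: energy(W, x, y))

rng = np.random.default_rng(0)
W = rng.normal(size=(2, 3))                            # model parameters
x = rng.normal(size=3)                                 # observed (clamped) variables
candidates = [rng.normal(size=2) for _ in range(10)]   # possible answers
y_star = infer(W, x, candidates)                       # lowest-energy configuration
```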
Probabilistic graphical models associate a probability to each configuration of the relevant variables. Energy-based models (EBMs) instead associate an energy to those configurations, eliminating the need for proper normalization of probability distributions. Making a decision (an inference) with an EBM consists in comparing the energies associated with various configurations of the variable to be predicted, and choosing the one with the smallest energy. Such systems must be trained discriminatively to associate low energies to the desired configurations and higher energies to undesired configurations. A wide variety of loss functions can be used for this purpose. We give sufficient conditions that a loss function should satisfy so that its minimization will cause the system to approach the desired behavior. We give many specific examples of suitable loss functions, and show an application to object recognition in images. It is important to note that the energy is the quantity minimized during inference, while the loss is the quantity minimized during learning.
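As one illustration of a suitable loss, the sketch below implements a simple margin (hinge) loss that pushes the energy of the desired answer below that of the most offending incorrect answer; the toy energy function and variable names are assumptions for illustration, not the paper's definitions.

```python
import numpy as np

def energy(w, x, y):
    # Toy energy: negative compatibility score (hypothetical form).
    return -y * float(w @ x)

def hinge_loss(w, x, y_true, y_candidates, margin=1.0):
    """Margin-based loss: zero once the desired answer's energy is below
    the most offending incorrect answer's energy by at least `margin`."""
    e_true = energy(w, x, y_true)
    e_bad = min(energy(w, x, y) for y in y_candidates if y != y_true)
    return max(0.0, margin + e_true - e_bad)

w = np.array([0.5, -0.2])
x = np.array([1.0, 2.0])
print(hinge_loss(w, x, y_true=+1, y_candidates=[-1, +1]))
```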
We introduce a view of unsupervised learning that integrates probabilistic and nonprobabilistic methods for clustering, dimensionality reduction, and feature extraction in a unified framework. In this framework, an energy function associates low energies to input points that are similar to training samples, and high energies to unobserved points. Learning consists in minimizing the energies of training samples while ensuring that the energies of unobserved ones are higher. Some traditional methods construct the architecture so that only a small number of points can have low energy, while other methods explicitly "pull up" on the energies of unobserved points. In probabilistic methods, the energies of unobserved points are pulled up by minimizing the log partition function, an expensive and sometimes intractable process. We explore different and more efficient methods using an energy-based approach. In particular, we show that a simple solution is to restrict the amount of information contained in the codes that represent the data. We demonstrate such a method by training it on natural image patches and by applying it to image denoising.
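A minimal sketch of the "restrict the code's information content" idea, assuming a hypothetical sparse-coding style energy (squared reconstruction error plus an L1 penalty on the code); this only illustrates the principle, not the paper's exact model.

```python
import numpy as np

def code_energy(x, z, D, sparsity=0.1):
    """Reconstruction energy plus an L1 penalty that limits the
    information content of the code z (hypothetical sparse-coding
    style formulation)."""
    return float(np.sum((x - D @ z) ** 2) + sparsity * np.sum(np.abs(z)))

def encode(x, D, steps=200, sparsity=0.1):
    """Find a low-energy sparse code by subgradient descent."""
    z = np.zeros(D.shape[1])
    lr = 1.0 / (2.0 * np.linalg.norm(D, 2) ** 2 + 1e-8)  # stable step size
    for _ in range(steps):
        grad = -2.0 * D.T @ (x - D @ z) + sparsity * np.sign(z)
        z -= lr * grad
    return z

rng = np.random.default_rng(0)
D = rng.normal(size=(16, 32))       # overcomplete dictionary
x = rng.normal(size=16)             # an "image patch"
z = encode(x, D)
print(code_energy(x, z, D))
```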
The Machine Learning and Pattern Recognition communities are facing two challenges: solving the normalization problem, and solving the deep learning problem.
ArXiv, 2020
This paper employs a formal connection between machine learning and thermodynamics to characterize the quality of learnt representations for transfer learning. We discuss how information-theoretic functionals such as the rate, distortion, and classification loss of a model lie on a convex, so-called equilibrium surface. We prescribe dynamical processes to traverse this surface under constraints, e.g., an iso-classification process that trades off rate and distortion to keep the classification loss unchanged. We demonstrate how this process can be used for transferring representations from a source dataset to a target dataset while keeping the classification loss constant. Experimental validation of the theoretical results is provided on standard image-classification datasets.
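A hedged sketch of the kind of objective being described: the rate R, distortion D, and classification loss C can be combined into a Lagrangian whose minimizers trace out an equilibrium surface as the multipliers vary, and an iso-classification process moves along that surface while holding C fixed. The symbols and the exact functional form below are assumptions for illustration, not the paper's definitions.

```latex
% Hypothetical Lagrangian whose minimizers sweep out the equilibrium
% surface as the multipliers (\lambda, \gamma) vary:
F(\lambda, \gamma) \;=\; \min_{\theta}\; D(\theta) \;+\; \lambda\, R(\theta) \;+\; \gamma\, C(\theta),
\qquad
\text{iso-classification process: move along the surface with } \mathrm{d}C = 0 .
```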
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2012
A score function induced by a generative model of the data can provide a feature vector of a fixed dimension for each data sample. Data samples themselves may be of differing lengths (e.g., speech segments or other sequential data), but as a score function is based on the properties of the data generation process, it produces a fixed-length vector in a highly informative space, typically referred to as "score space." Discriminative classifiers have been shown to achieve higher performance in appropriately chosen score spaces than is achievable by either the corresponding generative likelihood-based classifiers or discriminative classifiers using standard feature extractors. In this paper, we present a novel score space that exploits the free energy associated with a generative model. The resulting free energy score space (FESS) takes into account the latent structure of the data at various levels and can be shown to lead to classification performance that at least matches the performance of the free energy classifier based on the same generative model and the same factorization of the posterior. We also show that in several typical computer vision and computational biology applications the classifiers optimized in FESS outperform the corresponding pure generative approaches, as well as a number of previous approaches combining discriminative and generative models.
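To illustrate the idea of a score space derived from free energy (in a deliberately simplified setting, not the paper's FESS construction), the sketch below computes the per-component addends of the variational free energy of a one-dimensional Gaussian mixture and uses them as a fixed-length feature vector; the mixture and all names are assumptions.

```python
import numpy as np
from scipy.stats import norm

def free_energy_terms(x, weights, means, stds):
    """Per-component terms of the variational free energy of a 1-D
    Gaussian mixture; concatenated, they give a fixed-length feature
    vector for the sample x (illustrative, not the paper's exact FESS)."""
    log_joint = np.log(weights) + norm.logpdf(x, means, stds)
    resp = np.exp(log_joint - np.logaddexp.reduce(log_joint))  # posterior q(k|x)
    # F = sum_k resp_k * (log resp_k - log_joint_k); keep the addends as features
    return resp * (np.log(resp + 1e-12) - log_joint)

weights = np.array([0.5, 0.5])
means = np.array([-1.0, 1.0])
stds = np.array([1.0, 1.0])
features = free_energy_terms(0.3, weights, means, stds)  # fixed-length vector
print(features, features.sum())  # the sum equals -log p(x)
```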
Energy-based learning provides a unified framework to describe supervised and unsupervised training methods for probabilistic and non-probabilistic factor graphs. An energy-based model associates a scalar energy to configurations of inputs, outputs, and latent variables. Inference consists in finding configurations of output and latent variables that minimize the energy. Learning consists in finding parameters that minimize a suitable loss function so that the module produces lower energies for "correct" outputs than for all "incorrect" outputs. Learning machines can be constructed by assembling modules and loss functions. Gradient-based learning procedures are easily implemented through semi-automatic differentiation of complex models constructed by assembling predefined modules. We introduce an open-source and cross-platform C++ library called EBLearn to enable the construction of energy-based learning models. EBLearn is composed of two major components: libidx, an efficient and very flexible multi-dimensional tensor library, and libeblearn, an object-oriented library of trainable modules and learning algorithms. The latter has facilities for models such as convolutional networks, as well as for image processing. It also provides graphical display functions.
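The sketch below illustrates, in generic Python rather than the EBLearn C++ API, the module-plus-loss decomposition the abstract describes: a trainable module and an energy/loss module are assembled, and gradients are propagated backwards through both. All class names are hypothetical.

```python
import numpy as np

# Generic illustration of assembling trainable modules and a loss into an
# energy-based learner; this is NOT the EBLearn API, just a sketch of the
# module/loss decomposition described above.
class Linear:
    def __init__(self, n_in, n_out, rng):
        self.W = rng.normal(scale=0.1, size=(n_out, n_in))
    def forward(self, x):
        self.x = x
        return self.W @ x
    def backward(self, grad_out, lr=0.01):
        grad_in = self.W.T @ grad_out
        self.W -= lr * np.outer(grad_out, self.x)   # gradient step on the module
        return grad_in

class SquaredErrorEnergy:
    """Energy/loss module: E(output, target) = ||output - target||^2."""
    def forward(self, out, target):
        self.diff = out - target
        return float(np.sum(self.diff ** 2))
    def backward(self):
        return 2.0 * self.diff

rng = np.random.default_rng(0)
net, loss = Linear(4, 2, rng), SquaredErrorEnergy()
x, y = rng.normal(size=4), rng.normal(size=2)
for _ in range(100):                       # gradient-based learning loop
    e = loss.forward(net.forward(x), y)
    net.backward(loss.backward())
print(e)                                   # energy decreases over the loop
```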
2013
In this paper we propose an energy-based model (EBM) for selecting subsets of features that are both causally and predictively relevant for classification tasks. The proposed method is tested in the causality challenge, a competition that promotes research on strengthening feature selection by taking causal information about features into account. Under the proposed approach, an energy value is assigned to every configuration of features, and the problem is reduced to that of finding the configuration that minimizes an energy function. We propose an energy function that takes into account causal, predictive, and relevance/correlation information about features. In particular, we introduce potentials that combine the rankings of individual feature selection methods, Markov blanket information, and predictive performance estimates. The configuration with the lowest energy is the one offering the best trade-off between these sources of information. Experimental results ...
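A minimal sketch of the kind of configuration search described (the potentials, weights, and greedy minimizer below are illustrative assumptions, not the paper's exact energy function): each binary configuration of features receives an energy combining ranking, Markov-blanket, and predictive-performance terms, and a simple bit-flip descent looks for the minimizing configuration.

```python
import numpy as np

def subset_energy(s, rank_scores, mb_mask, predictive_fn,
                  w_rank=1.0, w_mb=1.0, w_pred=1.0):
    """Energy of a binary feature configuration s (1 = selected).
    Combines hypothetical potentials: individual feature rankings,
    agreement with a Markov-blanket mask, and an estimated predictive error."""
    rank_pot = -np.sum(rank_scores[s == 1])          # prefer highly ranked features
    mb_pot = np.sum(s != mb_mask)                    # penalize disagreement with the MB
    pred_pot = predictive_fn(s)                      # e.g. cross-validated error estimate
    return w_rank * rank_pot + w_mb * mb_pot + w_pred * pred_pot

def greedy_minimize(d, energy_fn, sweeps=5):
    """Greedy bit-flip descent over feature configurations."""
    s = np.zeros(d, dtype=int)
    for _ in range(sweeps):
        for i in range(d):
            flipped = s.copy(); flipped[i] ^= 1
            if energy_fn(flipped) < energy_fn(s):
                s = flipped
    return s

rng = np.random.default_rng(0)
d = 8
rank_scores = rng.random(d)
mb_mask = (rng.random(d) > 0.5).astype(int)
pred = lambda s: 1.0 / (1.0 + s.sum())               # toy stand-in for a CV error
s_best = greedy_minimize(d, lambda s: subset_energy(s, rank_scores, mb_mask, pred))
```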
2008
Abstract: In this paper we propose an energy-based model (EBM) for selecting subsets of features that are both causally and predictively relevant for classification tasks. The proposed method is tested in the causality challenge, a competition that promotes research on strengthening feature selection by taking causal information about features into account.
2009
Hybrid generative-discriminative techniques and, in particular, generative score-space classification methods have proven to be valuable approaches for tackling difficult object or scene recognition problems. A generative model over the available data for each image class is first learned, providing a relatively comprehensive statistical representation. As a result, meaningful new image features at different levels of the model become available, encoding the degree of fitness of the data with respect to the model at different levels. Such features, defining a score space, are then fed into a discriminative classifier which can exploit the intrinsic separability of the data. In this paper, we present a generative score-space technique which encapsulates the uncertainty present in the generative learning phase, which is usually disregarded by state-of-the-art methods. In particular, we propose the use of variational free energy terms as feature vectors, so that the degree of fitness of the data and the uncertainty over the generative process are included explicitly in the data description. The proposed method is, by construction, superior to pure generative classification, and we experimentally illustrate this on a wide selection of generative models applied to challenging benchmarks in hard computer vision tasks such as scene, object, and shape recognition. In several instances, the proposed approach beats the current state of the art in classification performance, while relying on computationally inexpensive models.
The goal of a generative model is to capture the distribution underlying the data, typically through latent variables. After training, these variables are often used as a new representation, more effective than the original features in a variety of learning tasks. However, the representations constructed by contemporary generative models are usually point-wise deterministic mappings from the original feature space. Thus, even with representations robust to class-specific transformations, statistically driven models trained on them would not be able to generalize when the labeled data is scarce. Inspired by the stochasticity of the synaptic connections in the brain, we introduce Energy-based Stochastic Ensembles. These ensembles can learn non-deterministic representations, i.e., mappings from the feature space to a family of distributions in the latent space. These mappings are encoded in a distribution over a (possibly infinite) collection of models. By conditionally sampling models from the ensemble, we obtain multiple representations for every input example and effectively augment the data. We propose an algorithm similar to contrastive divergence for training stochastic ensembles of restricted Boltzmann machines. Finally, we demonstrate the concept of stochastic representations on a synthetic dataset and also test them in a one-shot learning scenario on MNIST.
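As a rough illustration of non-deterministic representations (not the paper's algorithm), the sketch below samples several weight matrices from a hypothetical Gaussian ensemble over RBM weights and computes one hidden representation per sampled model, yielding multiple representations for the same input.

```python
import numpy as np

def sample_representations(x, W_mean, W_std, n_samples, rng):
    """Draw several latent representations for one input by sampling
    weight matrices from a hypothetical Gaussian ensemble over RBM
    weights and computing the hidden-unit activation probabilities."""
    reps = []
    for _ in range(n_samples):
        W = rng.normal(W_mean, W_std)                # one model from the ensemble
        h = 1.0 / (1.0 + np.exp(-(W @ x)))           # sigmoid hidden activations
        reps.append(h)
    return np.stack(reps)                            # n_samples x n_hidden

rng = np.random.default_rng(0)
W_mean = rng.normal(scale=0.1, size=(16, 8))         # ensemble mean weights
W_std = 0.05 * np.ones_like(W_mean)                  # ensemble spread
x = (rng.random(8) > 0.5).astype(float)              # a binary input example
reps = sample_representations(x, W_mean, W_std, n_samples=5, rng=rng)
```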