Header
Home | Sitemap Set as homepage | Add to favorites
  Search the Site     » Advanced Search
Sections



Gibbs Sampler

by

image

Gibbs Sampler

Let X = (x1,...,xd), where xi is either a scalar or a vector. Suppose that we want to draw samples of X from an underlying density p(X). In the Gibbs sampler, one randomly or systematically chooses a coordinate, say xi, and then updates its value with a new sample xi' drawn from the conditional distribution p(xi|X[-i]), where we define graphics/453fig01.gif. Algorithmically, the Gibbs sampler can be implemented as follows:

Algorithm 8.3: [Random-scan Gibbs sampler] Suppose that the current sample is graphics/453fig02.gif. Then:

  • Select i randomly from the index set {1,...,d} according to a given probability vector (p1,...,pd).

  • Draw graphics/453fig03.gif from the conditional distribution p(graphics/453fig04.gif), and let graphics/453fig05.gif.

Algorithm 8.4: [Systematic-scan Gibbs sampler] Let the current sample be graphics/453fig02.gif. For i = 1,...,d, draw graphics/453fig03.gif from the conditional distribution

Equation 8.14

graphics/08equ014.gif


It is easy to check that every individual conditional update leaves p(·) invariant. Suppose that currently graphics/454fig01.gif. Then graphics/454fig02.gif follows its marginal distribution under p(·). Thus,

Equation 8.15

graphics/08equ015.gif


implying that the joint distribution of (graphics/454fig03.gif) is unchanged at p(·) after one update.

Under regularity conditions, in the steady state, the sequence of vectors ..., X(t-1), X(t), X(t+1),... is a realization of a homogeneous Markov chain with the transition kernel from state X' to state X given by

Equation 8.16

graphics/08equ016.gif


The convergence behavior of the Gibbs sampler is investigated in [71, 134, 137, 278, 436, 473] and general conditions are given for the following two results:

  • The distribution of X(t) converges geometrically to p(X), as t .

  • graphics/454equ03.gif, for any integrable function f.

The Gibbs sampler requires an initial transient period to converge to equilibrium. An initial period of length t0 is known as the burning-in period, and the first t0 samples should always be discarded. Convergence is usually detected in some ad hoc way; some methods for this are found in [472]. One such method is to monitor a sequence of weights that measure the discrepancy between the sampled and desired distribution [472]. The samples generated by the Gibbs sampler are not independent, hence care needs to be taken in assessing the accuracy of such estimators. By grouping variables together (i.e., drawing samples of several elements of X simultaneously), one can usually accelerate the convergence and generate less-correlated data [274, 278]. To reduce the dependence between samples, one can extract every rth sample to be used in the estimation procedure. When r is large, this approach generates almost independent samples.

Other Techniques

A main problem with all the MCMC algorithms is that they may, for various reasons, move very slowly in the configuration space or may become trapped in a local mode. This phenomenon is generally called slow mixing of the chain. When a chain is slow mixing, estimation based on the resulting Monte Carlo samples becomes very inaccurate. Some recent techniques suitable for designing more efficient MCMC samplers include parallel tempering [141], the multiple-try method [279], and evolutionary Monte Carlo [264].


182 times read

Related news

» Distribution System Services
by admin posted on Apr 30,2007
» Origins of VoIP
by admin posted on Aug 23,2007
» Basis-of-Voice-Coding
by admin posted on Oct 24,2006


More Top News
Cisco Wireless Networking
Most Popular
Featured Author