— Ch. 1 · The 1977 Naming Moment —
Expectation–maximization algorithm.
~5 min read · Ch. 1 of 6
Arthur Dempster, Nan Laird, and Donald Rubin published the paper that gave the method its name in 1977. They called it the expectation-maximization algorithm, and their work formalized an iterative process for finding maximum likelihood estimates in statistical models with unobserved latent variables.

Earlier researchers had proposed similar ideas under different names or in specific contexts. Cedric Smith had developed a gene-counting method to estimate allele frequencies well before 1977. H.O. Hartley presented related concepts in 1958 and again in 1977 alongside Hocking. S.K. Ng, Thriyambakam Krishnan, and G.J. McLachlan also contributed ideas in 1977. Rolf Sundberg gave a detailed treatment of the EM method for exponential families in his 1971 thesis, building on collaboration with Per Martin-Löf and Anders Martin-Löf at Stockholm University. The 1977 paper by Dempster, Laird, and Rubin unified these earlier methods into a single framework and sketched a convergence analysis for a wider class of problems than previously covered.
Alternating Steps And Logic
The algorithm proceeds by alternating between two distinct steps until the values converge. The first is the expectation (E) step, which defines a function Q as the expected value of the complete-data log-likelihood, taken with respect to the conditional distribution of the latent variables given the observed data and the current parameter estimates. The second is the maximization (M) step, which finds the parameters that maximize the quantity computed in the E step. These updated parameters then determine the distribution of the latent variables for the next iteration. To start the process, pick arbitrary values for one set of unknowns, use them to estimate the second set, then use those new values to produce a better estimate of the first. The cycle repeats until both sets of values reach a fixed point, at which the derivative of the likelihood is arbitrarily close to zero. That point is a local maximum or a saddle point, not necessarily a global maximum.