Vanek SPECOM 2013 presentation slides

Jan Vanek

19 pages


AI-generated Abstract

This presentation discusses enhancements to Gaussian Mixture Models (GMMs) for speech recognition, focusing on handling data complexities and improving performance on unseen data. Key methods include robust model training, adjustments to covariance matrices, and Monte Carlo simulations to optimize estimates. Tests on real speech data suggest effective strategies for choosing model complexity and for improving performance on unseen data.

