Prof. T. V. Sreenivas

Followers

Following

Co-author

Public Views

University of Surrey

Queen Mary, University of London

Diemo Schwarz

Institut de Recherche et Coordination Acoustique/Musique IRCAM

Gael Richard

Telecom ParisTech

Roland Badeau

Telecom ParisTech

Perfecto Herrera

Pompeu Fabra University

Yi-hsuan Yang

Academia Sinica

Interests

Uploads

Papers by Prof. T. V. Sreenivas

Student's-t mixture model based multi-instrument recognition in polyphonic music

by Harshavardhan Sundar and Prof. T. V. Sreenivas

2013 IEEE International Conference on Acoustics, Speech and Signal Processing, 2013

We address the problem of multi-instrument recognition in polyphonic music signals. Individual in... more We address the problem of multi-instrument recognition in polyphonic music signals. Individual instruments are modeled within a stochastic framework using Student's-t Mixture Models (tMMs). We impose a mixture of these instrument models on the polyphonic signal model. No a priori knowledge is assumed about the number of instruments in the polyphony. The mixture weights are estimated in a latent variable framework from the polyphonic data using an Expectation Maximization (EM) algorithm, derived for the proposed approach. The weights are shown to indicate instrument activity. The output of the algorithm is an Instrument Activity Graph (IAG), using which, it is possible to find out the instruments that are active at a given time. An average F-ratio of 0.75 is obtained for polyphonies containing 2-5 instruments, on a experimental test set of 8 instruments: clarinet, flute, guitar, harp, mandolin, piano, trombone and violin.

Download

A Mixture Model Approach for Formant Tracking and the Robustness of Student's-t Distribution

by Harshavardhan Sundar and Prof. T. V. Sreenivas

IEEE Transactions on Audio, Speech, and Language Processing, 2000

ABSTRACT We address the problem of robust formant tracking in continuous speech in the presence o... more ABSTRACT We address the problem of robust formant tracking in continuous speech in the presence of additive noise. We propose a new approach based on mixture modeling of the formant contours. Our approach consists of two main steps: (i) Computation of a pyknogram based on multiband amplitude-modulation/frequency-modulation (AM/FM) decomposition of the input speech; and (ii) Statistical modeling of the pyknogram using mixture models. We experiment with both Gaussian mixture model (GMM) and Student&#39;s-t mixture model (tMM) and show that the latter is robust with respect to handling outliers in the pyknogram data, parameter selection, accuracy, and smoothness of the estimated formant contours. Experimental results on simulated data as well as noisy speech data show that the proposed tMM-based approach is also robust to additive noise. We present performance comparisons with a recently developed adaptive filterbank technique proposed in the literature and the classical Burg&#39;s spectral estimator technique, which show that the proposed technique is more robust to noise.

Student's-t mixture model based multi-instrument recognition in polyphonic music

by Harshavardhan Sundar and Prof. T. V. Sreenivas

2013 IEEE International Conference on Acoustics, Speech and Signal Processing, 2013

Download

A Mixture Model Approach for Formant Tracking and the Robustness of Student's-t Distribution

by Harshavardhan Sundar and Prof. T. V. Sreenivas

IEEE Transactions on Audio, Speech, and Language Processing, 2000

Prof. T. V. Sreenivas

Uploads

Papers by Prof. T. V. Sreenivas

Log In