Open Access. Powered by Scholars. Published by Universities.®

Signal Processing Commons

Open Access. Powered by Scholars. Published by Universities.®

Marquette University

Dr. Dolittle Project: A Framework for Classification and Understanding of Animal Vocalizations

Speech analysis

Publication Year

Articles 1 - 2 of 2

Full-Text Articles in Signal Processing

Stress And Emotion Classification Using Jitter And Shimmer Features, Xi Li, Jidong Tao, Michael T. Johnson, Joseph Soltis, Anne Savage, Kirsten Leong, John D. Newman Apr 2007

Stress And Emotion Classification Using Jitter And Shimmer Features, Xi Li, Jidong Tao, Michael T. Johnson, Joseph Soltis, Anne Savage, Kirsten Leong, John D. Newman

Dr. Dolittle Project: A Framework for Classification and Understanding of Animal Vocalizations

In this paper, we evaluate the use of appended jitter and shimmer speech features for the classification of human speaking styles and of animal vocalization arousal levels. Jitter and shimmer features are extracted from the fundamental frequency contour and added to baseline spectral features, specifically Mel-frequency cepstral coefficients (MFCCs) for human speech and Greenwood function cepstral coefficients (GFCCs) for animal vocalizations. Hidden Markov models (HMMs) with Gaussian mixture models (GMMs) state distributions are used for classification. The appended jitter and shimmer features result in an increase in classification accuracy for several illustrative datasets, including the SUSAS dataset for human speaking …


Generalized Perceptual Linear Prediction (Gplp) Features For Animal Vocalization Analysis, Patrick J. Clemins, Michael T. Johnson Jul 2006

Generalized Perceptual Linear Prediction (Gplp) Features For Animal Vocalization Analysis, Patrick J. Clemins, Michael T. Johnson

Dr. Dolittle Project: A Framework for Classification and Understanding of Animal Vocalizations

A new feature extraction model, generalized perceptual linear prediction (gPLP), is developed to calculate a set of perceptually relevant features for digital signal analysis of animalvocalizations. The gPLP model is a generalized adaptation of the perceptual linear prediction model, popular in human speech processing, which incorporates perceptual information such as frequency warping and equal loudness normalization into the feature extraction process. Since such perceptual information is available for a number of animal species, this new approach integrates that information into a generalized model to extract perceptually relevant features for a particular species. To illustrate, qualitative and quantitative comparisons are made …