Open Access. Powered by Scholars. Published by Universities.®

Mechanical Engineering Commons

Full-Text Articles in Mechanical Engineering

A Perceptual Evaluation Of Short-Time Fourier Transform Window Duration And Divergence Cost Function On Audio Source Separation Using Non-Negative Matrix Factorization, Ryan J. Miller, May 2020

Audio Engineering Theses

Non-negative matrix factorization (NMF) is an established method of performing audio source separation. Previous studies used NMF with supplementary systems to improve performance, but little has been done to investigate perceptual effects of NMF parameters. The present study aimed to evaluate two NMF parameters for speech enhancement: the short-time Fourier transform (STFT) window duration and divergence cost function. Two experiments were conducted: the first investigated the effect of STFT window duration on target speech intelligibility in a sentence keyword identification task. The second experiment had participants rate residual noise levels present in target speech using three different cost functions: the …
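
As a rough illustration of the factorization the abstract refers to, the sketch below decomposes a non-negative magnitude spectrogram V into spectral templates W and activations H using multiplicative updates. It assumes the generalized Kullback-Leibler divergence as the cost function, which is one common choice and is not confirmed here as one of the cost functions the thesis tested. The function names (nmf_kl, kl_divergence) and all parameters are hypothetical, and the STFT step that would produce V (where the window duration is set) is omitted.

```python
import numpy as np


def nmf_kl(V, rank, n_iter=200, seed=0, eps=1e-10):
    """Factor a non-negative matrix V (freq x frames) as W @ H.

    Illustrative only: uses multiplicative updates for the generalized
    Kullback-Leibler divergence (an assumed cost function).
    """
    rng = np.random.default_rng(seed)
    n_freq, n_frames = V.shape
    # Random non-negative initialization of templates and activations.
    W = rng.random((n_freq, rank)) + eps
    H = rng.random((rank, n_frames)) + eps
    ones = np.ones_like(V)
    for _ in range(n_iter):
        WH = W @ H + eps
        # Update activations, keeping templates fixed.
        H *= (W.T @ (V / WH)) / (W.T @ ones + eps)
        WH = W @ H + eps
        # Update templates, keeping activations fixed.
        W *= ((V / WH) @ H.T) / (ones @ H.T + eps)
    return W, H


def kl_divergence(V, V_hat, eps=1e-10):
    """Generalized KL divergence between V and its approximation V_hat."""
    return float(np.sum(V * np.log((V + eps) / (V_hat + eps)) - V + V_hat))
```

In a separation setting, a subset of the columns of W (and matching rows of H) would be attributed to the target source, and the corresponding partial product would be used to build a mask over the mixture spectrogram. How that assignment is made is outside the scope of this sketch.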


A Perceptual Comparison Of “Black Box” Modeling Algorithms For Nonlinear Audio Systems, Paul G. Mayo, Aug 2018

Audio Engineering Theses

Nonlinear systems identification is a widespread topic of interest, particularly within the audio industry, as these techniques are employed to synthesize black box models of nonlinear audio effects. Given the myriad approaches to black box modeling, questions arise as to whether an “optimal” approach exists, or one that achieves valid subjective results as a model with minimal computational expense. This thesis uses ABX listening tests to compare black box models of three hardware audio effects using two popular nonlinear implementations, along with two proposed modified implementations. Models were constructed in the Hammerstein form using sine sweeps and a novel measurement …
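
For readers unfamiliar with the Hammerstein form mentioned above, the sketch below shows its general shape: a memoryless nonlinearity followed by a linear filter. It assumes the parallel polynomial variant, in which the input is raised to successive powers and each power is filtered by its own impulse response. This is a generic illustration, not the implementations compared in the thesis, and the function name hammerstein and the kernels argument are hypothetical.

```python
import numpy as np


def hammerstein(x, kernels):
    """Apply a parallel polynomial Hammerstein model to signal x.

    kernels[n-1] is the impulse response applied to x**n (assumed form).
    The output is truncated to the length of the input.
    """
    x = np.asarray(x, dtype=float)
    y = np.zeros(len(x))
    for order, h in enumerate(kernels, start=1):
        # Static nonlinearity (x**order) followed by a linear FIR filter.
        y += np.convolve(x ** order, np.asarray(h, dtype=float))[: len(x)]
    return y
```

In a measurement workflow, the per-order kernels would be estimated from the device under test, for example from its response to a sine sweep; that estimation step is not shown here.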