Open Access. Powered by Scholars. Published by Universities.®

Signal Processing Commons

Open Access. Powered by Scholars. Published by Universities.®

PDF

SelectedWorks

Discipline
Keyword
Publication Year
Publication

Articles 31 - 60 of 61

Full-Text Articles in Signal Processing

Detection Of Speech Under Physical Stress: Model Development, Sensor Selection, And Feature Fusion, Sanjay A. Patil, John Hl Hansen Sep 2008

Detection Of Speech Under Physical Stress: Model Development, Sensor Selection, And Feature Fusion, Sanjay A. Patil, John Hl Hansen

Sanjay A. Patil

No abstract provided.


Quranic Verse Recitation Recognition Module For Support In J-Qaf Learning: A Review, Noor Jamaliah Ibrahim, Zaidi Razak, Zulkifli Mohd Yusoff, Mohd Yamani Idna Idris, Emran Mohd Tamil, Noorzaily Mohamed Noor, Noor Naemah Abdul Rahman Aug 2008

Quranic Verse Recitation Recognition Module For Support In J-Qaf Learning: A Review, Noor Jamaliah Ibrahim, Zaidi Razak, Zulkifli Mohd Yusoff, Mohd Yamani Idna Idris, Emran Mohd Tamil, Noorzaily Mohamed Noor, Noor Naemah Abdul Rahman

Noor Jamaliah Ibrahim

Each person’s voice is different. Thus, the Quran sound, which had been recited by most of recitors will probably tend to differ a lot from one person to another. Although those Quranic sentence were particularly taken from the same verse, but the way of the sentence in Al-Quran been recited or delivered may be different. It may produce the difference sounds for the different recitors. Those same combinations of letters may be pronounced differently due to the use of harakates. This paper seeks to provide a comprehensive review of Quran Arabic verse recitation recognition focusing on the techniques used, the …


Proceedings From Scientific Conference On Green Energy And It, Dr. Erik Dahlquist Mar 2008

Proceedings From Scientific Conference On Green Energy And It, Dr. Erik Dahlquist

Dr. Erik Dahlquist

This conference is part of the annual Energitinget, a national arena for energy in Sweden, with some 2500 participants. The focus with this session is to give a forum for researchers to present scientific results, and also to discuss these with other researchers. It contains papers in the area of Energy and IT as well as Green energy generally


Quranic Verse Recitation Feature Extraction Using Mel-Frequency Cepstral Coefficient (Mfcc), Noor Jamaliah Ibrahim, Zaidi Razak, Emran Mohd Tamil, Mohd Yamani Idna Idris, Zulkifli Mohd Yusoff Mar 2008

Quranic Verse Recitation Feature Extraction Using Mel-Frequency Cepstral Coefficient (Mfcc), Noor Jamaliah Ibrahim, Zaidi Razak, Emran Mohd Tamil, Mohd Yamani Idna Idris, Zulkifli Mohd Yusoff

Noor Jamaliah Ibrahim

Each person’s voice is different. Thus, the Quran sound, which had been recited by most of recitors will probably tend to differ a lot from one person to another. Although those Quranic sentence were particularly taken from the same verse, but the way of the sentence in Al-Quran been recited or delivered may be different. It may produce the difference sounds for the different recitors. Those same combinations of letters may be pronounced differently due to the use of harakates. This paper explores the viability of Mel-Frequency Cepstral Coefficient (MFCC) technique to extract features from Quranic verse recitation. Features extraction …


Use Of Modeling And Simulation In Pulp And, Dr. Erik Dahlquist Jan 2008

Use Of Modeling And Simulation In Pulp And, Dr. Erik Dahlquist

Dr. Erik Dahlquist

The book is a handbook for operators and process engineers in primarily pulp and paper industry, but also other process industries about how to utilise simulation as a tool for enhanced process operations. The book has been written as part of a EU COST action on Process Simulation, with 14 countries and 50 researchers and process industry representatives involved.


Trade-Off Investigation Between Diversity And Spatial Multiplexing In Practical Systems Using Numerical Results, Neda Adib, Hafez Hadinejad Mahram Jan 2008

Trade-Off Investigation Between Diversity And Spatial Multiplexing In Practical Systems Using Numerical Results, Neda Adib, Hafez Hadinejad Mahram

Neda Adib

Using multiple antennas in wireless communication systems proposes some degrees of freedom in system design, these degrees of freedom can be applied to improve the system reliability by using Space Time Block Coding (STBC) which provides diversity gain. Degrees of freedom also can be applied to increase the system capacity using Spatial Multiplexing (SM) methods. It is possible to achieve a trade-off between diversity gain and Spatial multiplexing gain. Selecting the method which provides best efficiency for a specific system is an important issue. In this paper we will show which space-time architecture achieves the best efficiency for a specific …


Engineering: Beyond Ears In Pre-College Years, Uchechukwu O. Ofoegbu, Ananth N. Iyer, John Helferty, Joseph Fischgrund Jun 2007

Engineering: Beyond Ears In Pre-College Years, Uchechukwu O. Ofoegbu, Ananth N. Iyer, John Helferty, Joseph Fischgrund

Ananth N Iyer

A 12-week program was developed in which electrical engineering concepts, in form of robotics projects, are taught to students at a secondary educational institution for the deaf and hearing impaired. The robotics course was originally designed for, and has been taught for about a decade to freshmen at the Temple University college of Engineering. The objectives of this project range from eliminating existing boundaries of engineering education to increasing the anticipation of success amongst the physically impaired. A prior breakthrough in the extension of engineering education beyond assumed “limits” was achieved when a young man who was both sight and …


Speaker Recognition In Adverse Conditions, Ananth N. Iyer, Uchechukwu O. Ofoegbu, Robert E. Yantorno, Stanley J. Wenndt Mar 2007

Speaker Recognition In Adverse Conditions, Ananth N. Iyer, Uchechukwu O. Ofoegbu, Robert E. Yantorno, Stanley J. Wenndt

Ananth N Iyer

Recognizing speakers from their voices is a challenging area of research with several practical applications. Presently speaker verification (SV) systems achieve a high level of accuracy under ideal conditions such as, when there is ample data to build speaker models and when speaker verification is performed in the presence of little or no interference. In general, these systems assume that the features extracted from the data follow a particular parametric probability density function (pdf), i.e., Gaussian or a mixture of Gaussians; where a form of the pdf is imposed on the speech data rather than determining the underlying structure of …


Unsupervised Indexing Of Noisy Conversations With Short Speaker Utterances, Uchechukwu O. Ofoegbu, Ananth N. Iyer, Robert E. Yantorno, Stanley J. Wenndt Mar 2007

Unsupervised Indexing Of Noisy Conversations With Short Speaker Utterances, Uchechukwu O. Ofoegbu, Ananth N. Iyer, Robert E. Yantorno, Stanley J. Wenndt

Ananth N Iyer

Two speaker indexing system for conversations are presented in this paper. The first method involves indexing two-speaker conversations. In this method, two reference models are judiciously chosen from the conversation such that they represent the two different speakers. Models are then matched to the reference speakers using distance-based comparisons. The second technique is based on first determining the number of participants in the conversation using a speaker count method termed the “Residual Ratio Algorithm” (RRA), and then indexing based on this count. The RRA involves an elimination process in which speech segments matching a chosen set of reference models are …


Ut-Scope: Speech Under Lombard Effect And Cognitive Stress, Ayako Ikeno, Vaishnavi Varadarajan, Sanjay A. Patil, John Hl Hansen Mar 2007

Ut-Scope: Speech Under Lombard Effect And Cognitive Stress, Ayako Ikeno, Vaishnavi Varadarajan, Sanjay A. Patil, John Hl Hansen

Sanjay A. Patil

This paper presents UT-scope data base, and automatic and perceptual an evaluation of Lombard speech in in-set speaker recognition.


Speech Under Stress: Analysis, Modeling And Recognition, Sanjay A. Patil, John Hl Hansen Jan 2007

Speech Under Stress: Analysis, Modeling And Recognition, Sanjay A. Patil, John Hl Hansen

Sanjay A. Patil

In this chatpter, we consider a range of issues associated with analysis, modeling, and recognition of speech under stress.


Blind Speaker Clustering, Ananth N. Iyer, Uchechukwu O. Ofoegbu, Robert E. Yantorno, Brett Y. Smolenski Dec 2006

Blind Speaker Clustering, Ananth N. Iyer, Uchechukwu O. Ofoegbu, Robert E. Yantorno, Brett Y. Smolenski

Ananth N Iyer

A novel approach to performing speaker clustering in telephone conversations is presented in this paper. The method is based on a simple observation that the distance between populations of feature vectors extracted from different speakers is greater than a preset threshold. This observation is incorporated into the clustering problem by the formulation of a constrained optimization problem. A modified c-means algorithm is designed to solve the optimization problem. Another key aspect in speaker clustering is to determine the number of clusters, which is either assumed or expected as an input in traditional methods. The proposed method does not require such …


Generic Modeling Applied To Speaker Count, Ananth N. Iyer, Uchechukwu O. Ofoegbu, Robert E. Yantorno, Brett Y. Smolenski Dec 2006

Generic Modeling Applied To Speaker Count, Ananth N. Iyer, Uchechukwu O. Ofoegbu, Robert E. Yantorno, Brett Y. Smolenski

Ananth N Iyer

The problem of determing the number of speakers participating in a conversation and building their models in short conversations, within an unknown group of speakers, is addressed in this paper. The lack of information about the number of speakers and the unavailability of sufficient data present a challenging task of efficiently estimating the speaker model parameters. The proposed method uses a novel generic speaker identification (GSID) system as a guide in the model building process. The GSID system is designed performing speaker identification where the speaker associated with the test data may not be enrolled. The models in the GSID …


Detection Of A Third Speaker In Telephone Conversations, Uchechukwu O. Ofoegbu, Ananth N. Iyer, Robert E. Yantorno, Stanley J. Wenndt Sep 2006

Detection Of A Third Speaker In Telephone Conversations, Uchechukwu O. Ofoegbu, Ananth N. Iyer, Robert E. Yantorno, Stanley J. Wenndt

Ananth N Iyer

Differentiating speakers participating in telephone conversations is a challenging task in speech processing because only short consecutive utterances can be examined for each speaker. Research has shown that, given only brief utterances (1 second or less), humans can recognize speakers with an accuracy of about 54% on average. The task becomes even more challenging when no information about the speakers is known a priori. In this paper, a technique for determining whether there are two or three speakers participating in a telephone conversation is presented. This approach assumes no knowledge or information about any of the participating speakers. The technique …


A Novel Approach To Automated Source Separation In Multispeaker Environment, Robert M. Nickel, Ananth N. Iyer May 2006

A Novel Approach To Automated Source Separation In Multispeaker Environment, Robert M. Nickel, Ananth N. Iyer

Ananth N Iyer

We are proposing a new approach to the solution of the cocktail party problem (CPP). The goal of the CPP is to isolate the speech signals of individuals who are concurrently talking while being recorded with a properly positioned microphone array. The new approach provides a powerful yet simple alternative to commonly used methods for the separation of speakers. It is based on the observation that the estimation of the signal transfer matrix between speakers and microphones is significantly simplified if one can assure that during certain periods of the conversation only one speaker is active while all other speakers …


Emotion Detection From Infant Facial Expressions And Cries, Pritam Pal, Ananth N. Iyer, Robert E. Yantorno May 2006

Emotion Detection From Infant Facial Expressions And Cries, Pritam Pal, Ananth N. Iyer, Robert E. Yantorno

Ananth N Iyer

A new system for translating the infant cries from its facial image and cry sounds is presented in this paper. The system is designed to analyze the facial image and sound of the crying infant to derive the reason why the infant is crying. The image and the sound represent the same cry event. The image processing module determines the state of certain facial features, certain combinations of which determine the reason for crying. The sound processing module analyzes the data for the fundamental frequency and the first two formants and uses k-means clustering to determine the reason of the …


Estimation Of Lyapunov Spectra From A Time Series, S. Srinivasan, Sanjay A. Patil, Saurabh Prasad, G. Lazarou, Joseph Picone Mar 2006

Estimation Of Lyapunov Spectra From A Time Series, S. Srinivasan, Sanjay A. Patil, Saurabh Prasad, G. Lazarou, Joseph Picone

Sanjay A. Patil

No abstract provided.


Sequential State-Space Filters For Speech Enhancement, Sanjay A. Patil, Ryan Irwin, Sundar Srinivasan, Saurabh Prasad, G. Lazarou, Joseph Picone Mar 2006

Sequential State-Space Filters For Speech Enhancement, Sanjay A. Patil, Ryan Irwin, Sundar Srinivasan, Saurabh Prasad, G. Lazarou, Joseph Picone

Sanjay A. Patil

No abstract provided.


Nist Sre Speaker Recognition Evaluation Workshop 2006, Sanjay Patil, Vinod Prakash, Pongtep Angkitatrakul, John Hansen, Wooil Kim, Syed Moosa Jan 2006

Nist Sre Speaker Recognition Evaluation Workshop 2006, Sanjay Patil, Vinod Prakash, Pongtep Angkitatrakul, John Hansen, Wooil Kim, Syed Moosa

Sanjay A. Patil

No abstract provided.


Design Of Stable Narrow Band-Pass Filter Using Multi-Stage Biquad Topology, Raman Attri Sep 2005

Design Of Stable Narrow Band-Pass Filter Using Multi-Stage Biquad Topology, Raman Attri

Raman K. Attri

Band pass filtering techniques have been a challenging task due to requirement of keeping Quality factor, gain and mid-frequency of the filter independent of each other. Other most important aspect is keeping the filter stable, keeping mid-frequency immune to circuit component tolerances and to achieve the mid-frequency at the accurate value. The Biquad family of topologies are typically used for a stabilized filtering application, however the design requirements on Biquad topology for low frequency application is still a challenge. The requirements become more stringent for bandwidth curve, roll-off curve and preciseness of frequency filtering as we move down to low …


Practical Design Evaluation Of Extremely Narrow Band-Pass Filter Topologies, Raman K. Attri Sep 2005

Practical Design Evaluation Of Extremely Narrow Band-Pass Filter Topologies, Raman K. Attri

Raman K. Attri

Narrow Band pass filtering techniques have been a challenging task since the inception of audio and telecommunication applications. The challenge involves keeping Quality factor, gain and mid-frequency of the filter independent of each other. Other most important aspect is keeping the filter stable, keeping mid-frequency immune to circuit component tolerances and to achieve the mid-frequency at the accurate value. The requirements turns more stringent when working with low frequency Narrow band-pass filters where the shift in few Hz would cause great frequency errors. The selection of right topology for best performance is the key top successful design. This paper objectively …


Evaluation Of Single Op-Amp Topologies For Extremely Narrow Band-Pass Filter Design, Raman Attri Sep 2005

Evaluation Of Single Op-Amp Topologies For Extremely Narrow Band-Pass Filter Design, Raman Attri

Raman K. Attri

Narrow Band pass filtering techniques have been a challenging task since the inception of audio and telecommunication applications. The challenge involves keeping Quality factor, gain and mid-frequency of the filter independent of each other. Other most important aspect is keeping the filter stable, keeping mid-frequency immune to circuit component tolerances and to achieve the mid-frequency at the accurate value. The requirements turns more stringent when working with low frequency Narrow band-pass filters where even the shift in few Hz would cause great frequency errors. The selection of right topology for best performance is the key to successful design. This paper …


Speaker Identification Using Usable Speech Concept, Ananth N. Iyer, Brett Y. Smolenski, Robert E. Yantorno, Jashmin K. Shah, Edward J. Cupples, Stanley J. Wenndt Sep 2004

Speaker Identification Using Usable Speech Concept, Ananth N. Iyer, Brett Y. Smolenski, Robert E. Yantorno, Jashmin K. Shah, Edward J. Cupples, Stanley J. Wenndt

Ananth N Iyer

Most signal processing involves processing a signal without concern for the quality or information content of that signal. In speech processing, speech is processed on a frame-by-frame basis, usually only with concern that the frame is either speech or silence. However, knowing how reliable the information is in a frame of speech can be very important and useful. This is where usable speech detection and extraction can play a very important role. The usable speech frames can be defined as frames of speech that contain higher information content compared to unusable frames with reference to a particular application. We have …


Robust Speaker Verification With Principal Pitch Components, Robert M. Nickel, Sachin P. Oswal, Ananth N. Iyer Sep 2004

Robust Speaker Verification With Principal Pitch Components, Robert M. Nickel, Sachin P. Oswal, Ananth N. Iyer

Ananth N Iyer

We are presenting a new method that improves the accuracy of text dependent speaker identification systems. The new method exploits a set of novel speech features that is derived from a principal component analysis (PC) of voiced speech segments. The new PC features are only weakly correlated with the corresponding cepstral features. A distance measure that combines both, cepstral and PC pitch features provides a discriminative power that cannot be achieved with cepstral features alone. It is well known that the discriminative power of cepstral features declines if the dimensionality of the feature space is increased beyond its optimal value. …


Sequential K-Nn Pattern Recognition For Usable Speech Classification, Jashmin K. Shah, Brett Y. Smolenski, Robert E. Yantorno, Ananth N. Iyer Sep 2004

Sequential K-Nn Pattern Recognition For Usable Speech Classification, Jashmin K. Shah, Brett Y. Smolenski, Robert E. Yantorno, Ananth N. Iyer

Ananth N Iyer

The accuracy of speech processing techniques degrades when operating in a co-channel environment. Co-channel speech occurs when more than one person is talking at the same time. The idea of usable speech segmentation is to identify and extract those portions of co-channel speech that are minimally degraded but still useful for speech processing application such as speaker identification. Usable speech measures are features that are extracted from the co-channel signal to distinguish between usable and unusable speech. In this paper, a new usable speech extraction technique is presented. The new method extracts features recursively and variable length segmentation is performed …


Usable Speech Detection Using A Context Dependent Gaussian Mixture Model Classifier, Robert E. Yantorno, Brett Y. Smolenski, Ananth N. Iyer, Jashmin K. Shah May 2004

Usable Speech Detection Using A Context Dependent Gaussian Mixture Model Classifier, Robert E. Yantorno, Brett Y. Smolenski, Ananth N. Iyer, Jashmin K. Shah

Ananth N Iyer

Speech that is corrupted by nonstationary interference, but contains segments that are still usable for applications such as speaker identification or speech recognition, is referred to as "usable" speech. A common example of nonstationary interference occurs when there is more than one person talking at the same time, which is known as co-channel speech. In general the above speech processing applications do not work in co-channel environments; however, they can work on the extracted usable segments. Unfortunately, currently available usable speech measures only detect about 75% of the total available usable speech. The first reason for this high error stems …


Structural Usable Speech Measure Using Lpc Residual, Ananth N. Iyer, Melinda Gleiter, Brett Y. Smolenski, Robert E. Yantorno Dec 2003

Structural Usable Speech Measure Using Lpc Residual, Ananth N. Iyer, Melinda Gleiter, Brett Y. Smolenski, Robert E. Yantorno

Ananth N Iyer

In an operational environment speech is degraded by many kinds of interferences. The operation of many speech processing techniques are plagued by such interferences. Usable speech extraction is a novel concept of processing degraded speech data. The idea of usable speech is to identify and extract portions of degraded speech that are considered useful for various speech processing systems. The performance reduction of speaker identification systems under degraded conditions and use of usable speech concept to improve the performance has been demonstrated in previous work. A new usable speech measure, based on the structure of Linear Predictive Coding (LPC) residual …


Design Approach To Use Platinum Rtd Sensor In Snow Temperature Measurements, Raman Attri Jan 2001

Design Approach To Use Platinum Rtd Sensor In Snow Temperature Measurements, Raman Attri

Raman K. Attri

The snow temperature measurement is a very critical area of hydrological instrumentation. The measured data is used to assess the run-off water, snow pack strength and snow avalanche. The hydrological and snow avalanche forecast models require that temperature be measured at different points in the snow pack, above the snow pack and below the snow pack. A state-of-the-art multi-point snow temperature measurement system has been designed and developed for this purpose. The type of the temperature sensor used is a critical aspect of this physical parameter instrumentation. After lot of study of various aspects of different temperature sensing elements, the …


Design Of An Instrumentation System To Record Distribution Profile Of Snow Layer Temperature For Modelling Of Snow Avalanche Forecast, Raman Attri Aug 1999

Design Of An Instrumentation System To Record Distribution Profile Of Snow Layer Temperature For Modelling Of Snow Avalanche Forecast, Raman Attri

Raman K. Attri

The measurement of snow hydrological parameters is extremely important in developing a model for the predication of Snow avalanche as well as Snowmelt water in the rivers. When direct measurement of these parameters is practically difficult, its dependence on snow temperature is used to develop snow cover models. A robust model for avalanche forecasting requires a sophisticated instrumentation system which can measure the required temperature parameters at right data points within snow pack. A Snow Temperature Profile Sensing System along with Surface temperature Sensor has been designed to measure Snow temperature gradient, temperature distributions, and average temperature of snow pack, …


Implementation Of Linear Array Of Ultrasonic Transmitter-Receiver Transducers For Detection Of Non-Smooth Porous Surface, Raman K. Attri, Swaranjit Singh Jul 1999

Implementation Of Linear Array Of Ultrasonic Transmitter-Receiver Transducers For Detection Of Non-Smooth Porous Surface, Raman K. Attri, Swaranjit Singh

Raman K. Attri

Level measurements, thickness measurement or remote surface detection using ultrasonic pulse transit method require that the target surface be at 90O to the incident beam so that reflected beam comes back at 180O angel to effectively use this method. This is perfectly true in case of flat, solid surface at right angle to the incident beam. But surface irregularities of a porous, non-smooth, uneven material such as snow cause penetration of incident wave into the surface, absorption of the incident energy, scatter of energy in many directions and further attenuation of reflected signal making it difficult to detect the reflected …