Automatic Speech Recognition System

Finlogy Publication

Automatic Speech Recognition System

Finlogy Publication

Sign up for access to the world's latest research

checkGet notified about relevant papers

checkSave papers to use in your research

checkJoin the discussion with peers

checkTrack your impact

Abstract

Speech recognition is one of the next generation technologies for human-computer interaction. Speech recognition has been researched since the late 1950s but due to its computational complexity and limited computing capabilities of the last few decades, its progress has been impeded. In laboratory settings automatic speech recognition systems (ASR) have achieved high levels of recognition accuracies, which tend to degrade in real world environments. This paper analyses the basics of the speech recognition system. Major problems faced by ASR in real world environments have been discussed with major focus on the techniques. These technique used in the development of noise robust ASR .

Figures (7)

Most commercial companies claim that recognition software can achieve between 98% to 99% accuracy if operated under optimal conditions. “Optimal conditions’ usually assume that users: have speech characteristics which match the training data, can achieve proper speaker adaptation, and work in a clean noise environment.

Imperial Journal of Interdisciplinary Research (WIR) Vol-2, Issue-3 , 2016 ISSN : 2454-1362 , http://www.onlinejournal.in

LPC coefficients can be estimated by applying some procedures on the speech signal. These procedures started with applying autocorrelation on the windowed frames. Every windowed frame is auto correlated by pth order by applying the MATLAB code bellow:

Fig.4. : Flowchart of LPC method. on every column of the matrix, and then LPC function of 12 order is applied on every column also, now every column has 12 coefficients, and finally the coefficients are rearranged in a one column to use them as input to the neural network in the classification phase. The number of coefficients in this thesis was 420 for LPC method.

Imperial Journal of Interdisciplinary Research (IJIR) Vol-2, Issue-3 , 2016 ISSN : 2454-1362 , http://www.onlinejournal.in

Fig.6. : Flowchart of spectrogram method.

Key takeaways

The goal of an ASR system is to accurately and efficiently convert a speech signal into a text message transcription of the spoken words independent of the speaker, environment or the device used to record the speech (i.e. the microphone).
The input of the system is the speech signal.
LPC coefficients can be estimated by applying some procedures on the speech signal.
Spectrogram of a speech signal can be derived by taking a Fast Fourier Transform (FFT) for each frame of the speech signal.Then the rotation of plot diagram implemented to fix vertical axis as frequency and horizontal axis as amplitude.
Speech Recognition is a special case of pattern recognition.

Related papers

A Review on: Speech Recognition System

vaishali bhimte

This paper presents a brief survey on Speech recognition and discusses major themes and advances. Automatic speech recognition uses the process and related technology for converting speech signals into a sequence of words or other linguistic units by means of an algorithm implemented as a computer program. After years of research and development the accuracy of automatic speech recognition remains one of the important research challenges. Speech understanding systems presently are capable of understanding speech input for vocabularies of thousands of words in operational environments. Speech Recognition offers greater freedom to employ the physically handicapped in several applications like manufacturing processes, medicine and telephone network. The objective of this review paper is to summarize and compare some of the well known methods used in various stages of speech recognition system.

Log In

Automatic Speech Recognition System

Sign up for access to the world's latest research

Abstract

Key takeaways

Related papers

Related papers