Academia.eduAcademia.edu

Automatic Speech Recognition System

Abstract

Speech recognition is one of the next generation technologies for human-computer interaction. Speech recognition has been researched since the late 1950s but due to its computational complexity and limited computing capabilities of the last few decades, its progress has been impeded. In laboratory settings automatic speech recognition systems (ASR) have achieved high levels of recognition accuracies, which tend to degrade in real world environments. This paper analyses the basics of the speech recognition system. Major problems faced by ASR in real world environments have been discussed with major focus on the techniques. These technique used in the development of noise robust ASR .

Key takeaways

  • The goal of an ASR system is to accurately and efficiently convert a speech signal into a text message transcription of the spoken words independent of the speaker, environment or the device used to record the speech (i.e. the microphone).
  • The input of the system is the speech signal.
  • LPC coefficients can be estimated by applying some procedures on the speech signal.
  • Spectrogram of a speech signal can be derived by taking a Fast Fourier Transform (FFT) for each frame of the speech signal.Then the rotation of plot diagram implemented to fix vertical axis as frequency and horizontal axis as amplitude.
  • Speech Recognition is a special case of pattern recognition.