Academia.eduAcademia.edu

Speech Recognition: How it Works and How it's Used

Abstract

Speech recognition has always been an accepted and expected feature of futuristic computer systems. From HAL in 2001: A Space Odyssey ("Open the pod bay doors, HAL." "I'm sorry Dave, but I'm afraid I can't do that.") to Lieutenant Commander Data in Star Trek: The Next Generation, authors who write of the future anticipate the day when computers can understand, interpret and follow the spoken words of humans.

Key takeaways

  • In general, the speech recognizer takes the digitized samples of the word, and runs them through a preprocessing stage [5].
  • The natural alternative to HMMs in speech recognition is a neural network.
  • An excellent example of a neural network-based recognizer is a speech recognition engine developed at the Oregon Graduate Institute [12], designed for speech recognition over the telephone system.
  • When in use, a neural network takes as inputs the templates, one at a time, and the spoken word, and outputs whether or not they match.
  • Capability 3 is implemented by a time limit for progressing to the next word in the text; the intervenor assumes that the reader is stuck on this word if the time limit is exceeded without a previously unread word appearing in the recognizer output.