Academia.edu no longer supports Internet Explorer.
To browse Academia.edu and the wider internet faster and more securely, please take a few seconds to upgrade your browser.
Speech recognition is one of the next generation technologies for human-computer interaction. Speech recognition has been researched since the late 1950s but due to its computational complexity and limited computing capabilities of the last few decades, its progress has been impeded. In laboratory settings automatic speech recognition systems (ASR) have achieved high levels of recognition accuracies, which tend to degrade in real world environments. This paper analyses the basics of the speech recognition system. Major problems faced by ASR in real world environments have been discussed with major focus on the techniques. These technique used in the development of noise robust ASR .
This paper presents a brief survey on Speech recognition and discusses major themes and advances. Automatic speech recognition uses the process and related technology for converting speech signals into a sequence of words or other linguistic units by means of an algorithm implemented as a computer program. After years of research and development the accuracy of automatic speech recognition remains one of the important research challenges. Speech understanding systems presently are capable of understanding speech input for vocabularies of thousands of words in operational environments. Speech Recognition offers greater freedom to employ the physically handicapped in several applications like manufacturing processes, medicine and telephone network. The objective of this review paper is to summarize and compare some of the well known methods used in various stages of speech recognition system.
International Journal of Computer Applications, 2015
To be able to control devices by voice has always intrigued mankind. Today after intense research, Speech Recognition System, have made a niche for themselves and can be seen in many walks of life. The accuracy of Speech Recognition Systems remains one of the most important research challenges e.g. noise, speaker variability, language variability, vocabulary size and domain. The design of speech recognition system requires careful attentions to the challenges such as various types of Speech Classes and Speech Representation, Speech Preprocessing stages, Feature Extraction techniques, Database and Performance evaluation. This paper presents the advances made as well as highlights the pressing problems for a speech recognition system. The paper also classifies the system into Front End and Back End for better understanding and representation of speech recognition system in each part.
International Journal of Computer Applications, 2016
Speech is the most prominent & primary mode of Communication among human beings. Now-a-days Speech also has potential of being important mode of interaction with computers. This paper gives an overview of Automatic Speech Recognition System, Classification of Speech Recognition System and also includes overview of the steps followed for developing the Speech Recognition System in stages. This paper also helps in choosing the tool and technique along with their relative merits & demerits. A comparative study of different techniques is also included in this paper.
Speech feature extraction has been a key focus in robust speech recognition research. Selecting proper features is the key of effective system performance. Robustness to additive noise remains a large unsolved problem in automatic speech recognition research today. One of the environmental changes that have a large impact on the performance of current ASR systems is background noise. There are several approaches that one can take to improve ASR systems robustness [7, 10] to changes in background noise. One of these approaches is to address the problem at the feature extraction stage of the system. That is, to use a speech feature extraction algorithm that produces features that are as invariant as possible to background noise changes, while simultaneously capturing the salient speech information. Many feature extraction algorithms have been proposed that are designed specifically to have a low sensitivity to background noise. In this paper we are presenting some feature extraction algorithm developed for noisy environment.
International Journal of Computer Applications, 2012
This paper attempts to describe a literature review of Automatic Speech Recognition. It discusses past years advances made so as to provide progress that has been accomplished in this area of research. One of the important challenges for researchers is ASR accuracy. The Speech recognition System focuses on difficulties with ASR, basic building blocks of speech processing, feature extraction, speech recognition and performance evaluation. The main objective of the review paper is to bring to light the progress made for ASRs of different languages and the technological viewpoint of ASR in different countries and to compare and contrast the techniques used in various stages of Speech recognition and identify research topic in this challenging field. We are not presenting exhaustive descriptions of systems or mathematical formulations but rather, we are presenting distinctive and novel features of selected systems and their relative merits and demerits.
The most important & obvious mode of exchanging information among the human beings is the voice. Human can instruct machine using speech, thus education industry, military and medical sectors, uses this technique. However recognition of speech is not the new area, researchers are engaged for accurateness in voice recognition system, from last few decades. The sketch of that system care considerable challenges like set of speech, mode of speech, word list, transducers, illness and medium; because of all this necessity the component of noise in automatic speech recognition is at a great distant. Many researchers have put their efforts to sort out above challenges. This paper provide brief summary of the latest work in the field of speech recognition and through some light on virtuous and pandemonium databases of pieces of voice.
Computing
The paper highlights a brief study on speech recognition technology, describing the various processing stages and results and also some primary applications as well. Following this review some of the vital strengths and speech processing steps will also discuss.
Collegium of Economic Analysis Annals, 2018
Speech is one of the easiest ways allowing communication between people and machines. Speech recognition technology makes everyday life easier: it is widely used in mobile phones, computers, tablets, cars, etc. However, the quality of automatic speech recognition is affected by many factors, therefore, so much effort is put into improving the performance of speech recognition systems. The aim of this paper is to present the current state of development of speech recognition systems and to examine their drawbacks and limitations. The paper discusses the current classification, construction and functioning of speech recognition systems, which gives an insight into the speech-to-text software implemented so far. The analysis of disadvantages and limitations of speech recognition systems has allowed identifying the weak points of these systems. Problems that are to be solved in the near future indicate the direction of further development of speech recognition systems.
Acta Informatica Malaysia
Speech Recognition Systems now-a-days use many interdisciplinary technologies ranging from Pattern Recognition, Signal Processing, Natural Language Processing implementing to unified statistical framework. Such systems find a wide area of applications in areas like signal processing problems and many more. The objective of this paper is to present the concepts about Speech Recognition Systems starting from the evolution to the advancements that have now been adapted to the Speech Recognition Systems to make them more robust and accurate. This paper has the detailed study of the mechanism, the challenges and the tools to overcome those challenges with a concluding note that would ensure that with the advancements of the technologies, this world is surely going to experience revolutionary changes in the near future.
This paper takes a tour of speech recognition system which includes it's evaluation and accuracy of system and discuss the structure of utterance that uses the vocal tract to make the utterance. Dynamic warping with its neural approach to convert the speech into text. This paper also explains the basic working of speech recognition system with elaboration of it's techniques.
This paper reviews some of various research carried out over the last decade in the area of Automatic Speech Recognition (ASR) and discusses the major themes and advance made in the last decade of research, in order to show the outlook of technology and an appreciation of the fundamental progress that has been achieved in this weighty area of speech communication. Over period of research and development, the accuracy of automatic speech recognition remains one of the important research challenges such as variation of the context, environmental condition, speaker's variation and poor-quality audio. The design of speech recognition requires careful attention to the following issue: Definition of various types of speech classes, speech representation, techniques, database and performance evaluation. The history, challenges of speech recognition system and various techniques to solve these challenges constructed by various research works have been presented in a chronological order. The objective of this paper is to compare and summarize well know approaches used in various steps of speech recognition system.
It is a well known fact that, speech recognition systems perform well when the system is used in conditions similar to the one used to train the acoustic models. However, mismatches degrade the performance. In adverse environment, it is very difficult to predict the category of noise in advance in case of real world environmental noise and difficult to achieve environmental robustness. After doing rigorous experimental study it is observed that, a unique method is not available that will clean the noisy speech as well as preserve the quality which have been corrupted by real natural environmental (mixed) noise. It is also observed that only back-end techniques are not sufficient to improve the performance of a speech recognition system. It is necessary to implement performance improvement techniques at every step of back-end as well as front-end of the Automatic Speech Recognition (ASR) model. Current recognition systems solve this problem using a technique called adaptation. This study presents an experimental study that aims two points, first is to implement the hybrid method that will take care of clarifying the speech signal as much as possible with all combinations of filters and enhancement techniques. The second point is to develop a method for training all categories of noise that can adapt the acoustic models for a new environment that will help to improve the performance of the speech recognizer under real world environmental mismatched conditions. This experiment confirms that hybrid adaptation methods improve the ASR performance on both levels, (Signal-to-Noise Ratio) SNR improvement as well as word recognition accuracy in real world noisy environment.
VOLUME-8 ISSUE-10, AUGUST 2019, REGULAR ISSUE, 2019
In order to make fast communication between human and machine, speech recognition system are used. Number of speech recognition systems have been developed by various researchers. For example speech recognition, speaker verification and speaker recognition. The basic stages of speech recognition system are pre-processing, feature extraction and feature selection and classification. Numerous works have been done for improvement of all these stages to get accurate and better results. In this paper the main focus is given to addition of machine learning in speech recognition system. This paper covers architecture of ASR that helps in getting idea about basic stages of speech recognition system. Then focus is given to the use of machine learning in ASR. The work done by various researchers using Support vector machine and artificial neural network is also covered in a section of the paper. Along with this review is presented on work done using SVM, ELM, ANN, Naive Bayes and kNN classifi...
Noise in everyday acoustic environments such as cars, traffic environments, and cafeterias remains one of the main challenges in automatic speech recognition (ASR). As a reserch theme, it has received wide attention in conferences and scientific journals focused on speech technology. This article collection reviews both the classic and novel approaches suggested for noise robust ASR. The articles are literature reviews written for the spring 2009 seminar course on noise robust automatic speech recognition (course code T-61.6060) held at TKK.
Speech recognition is the next big step that the technology needs to take for general users. An Automatic Speech Recognition (ASR) will play a major role in focusing new technology to users. Applications of ASR are speech to text conversion, voice input in aircraft, data entry, voice user interfaces such as voice dialing. Speech recognition involves extracting features from the input signal and classifying them to classes using pattern matching model. This can be done using feature extraction method. This paper involves a general study of automatic speech recognition and various methods to generate an ASR system. General techniques that can be used to implement an ASR includes artificial neural networks, Hidden Markov model, acoustic – phonetic approach
2017
Speech is the most natural form of communication and interaction between humans. In computer system, the text and symbols are the most common form of translation. Speech Recognition is an important application that enables interaction of human being with machines. The various stages in the speech recognition system are pre-processing, segmentation of speech signal, feature extraction of speech and recognition stage. Among many speech recognition systems, continuous speech recognition system is very important and most popular system. This paper presents the basic idea of speech recognition, proposed types of speech recognition techniques, issues in speech recognition, different useful approaches for noise filtering, features extraction of the speech signal with its advantages and disadvantages and various pattern matching approaches for recognizing the speech of the speakers. Now day’s research in speech recognition system is motivated for ASR system with a large vocabulary that supp...
A Study on Automatic Speech Recognition, 2019
Speech is an easy and usable technique of communication between humans, but nowadays humans are not limited to connecting to each other but even to the different machines in our lives. The most important is the computer. So, this communication technique can be used between computers and humans. This interaction is done through interfaces, this area called Human Computer Interaction (HCI). This paper gives an overview of the main definitions of Automatic Speech Recognition (ASR) which is an important domain of artificial intelligence and which should be taken into account during any related research (Type of speech, vocabulary size... etc.). It also gives a summary of important research relevant to speech processing in the few last years, with a general idea of our proposal that could be considered as a contribution in this area of research and by giving a conclusion referring to certain enhancements that could be in the future works.
Language is the most important means of communication and speech is its main medium. In human to machine interface, speech signal is transformed into analog and digital wave form which can be understood by machine. Speech technologies are vastly used and has unlimited uses. These technologies enable machines to respond correctly and reliably to human voices, and provide useful and valuable services .This paper gives an overview of the speech recognition process, its basic model, and its application, approaches and also discuss comparative study of different approaches which are used for speech recognition system. These papers also give an overview of different techniques of speech recognition system to summarize some of the well known methods used in various stages of speech recognition system.
International Journal of Computer Applications, 2016
Speech is the nature's gift to the human being which contributes towards the intelligence and discrimination from rest of the animal kingdom. Taking into consideration technological aspects, speech recognition is the buzzword today, as communication and hands free computing evolving day by day. Speech is a very important mode of the communication and interaction with the digital computer. Speech recognition along with the wide range of applicability in domain of computer science, medical science, psychology, sports, neurology has many challenges while developing. Developing real time speech recognizer may hurdle from adverse environment to anatomy of the human body. It also involves linguistic aspects too. This paper explores various challenges in developing a robust ASR system.
International journal of advance engineering & research development, 2014
Speech interface to computer is the next big step that the technology needs to take for general users. Automatic speech recognition (ASR) will play an important role in taking technology to the people. There are numerous applications of speech recognition such as direct voice input in aircraft, data entry, speech-to-text processing, voice user interfaces such as voice dialing. ASR system can be divided into two different parts, namely feature extraction and feature recognition. This paper provides an overview for Speech recognition where Acoustic modeling techniques, Feature extraction techniques for Speech recognition are briefly discussed.
Loading Preview
Sorry, preview is currently unavailable. You can download the paper by clicking the button above.