Voice Assistant Using Python Report SUBMITTED
Voice Assistant Using Python Report SUBMITTED
Submitted by:
DORESWAMY S B (2022BCSE07AED956)
MANASVI S (2022BCSE07AED957)
POORNIMA (2022BCSE07AED953)
NITYA (2022BCSE07AED958)
DARSHAN (2022BCSE07AED954)
BACHELOR OF TECHNOLOGY
IN
1|Page
Department of Computer Science and Engineering
Alliance College of Engineering and Design
Alliance University
Chikkahagade Cross, Chandapura-Anekal Main Road, Bangalore-562106
CERTIFICATE
This is to certify that the Design Project – 1 work entitled “VOICE ASSISTANT USING
“PYTHON” is the bonafide work done by Mr. DORESWAMY S B
(2022BCSE07AED956), Ms. MANASVI S (2022BCSE07AED957), Ms. POORNIMA
(2022BCSE07AED953), Ms. NITYA (2022BCSE07AED958), Mr. DARSHAN
(2022BCSE07AED954) submitted in partial fulfillment of the requirements for the Award
of the Degree Bachelor of Technology in Computer Science and Engineering during the
year 2022- 2026.
External Examiners:
1. Name: Signature
2. Name: Signature
2|Page
APPENDIX 3
TABLE OF CONTENTS
3|Page
ABSTRACT
In today’s generation, where time is the important, and the demand for technology to simplify
and streamline our daily tasks has never been greater. One such application that gained
significant traction is voice assistant using phyton (VPA).
These applications are powered by artificial intelligence, it has an aim to help in various fields
and allow individuals to focus on more needful matters.
VPA has an ability to complete a wide range of tasks based on user input, typically through
voice commands. From searching for information of movies to writing notes and sending
messages and providing locations VPA place major role in changing all of these (based on
our digital lives).
In response to this growing demand, the development of this application using programing
languages like phyton has brought a much more gain in the area of research and
implementation.
By identifying the power of speech recognition and natural language processing these VPA’s
provide a user-friendly experience and help users to interact with their device with no effort.
Here this VPA’s gives knowledge about the languages power like by using phyton we can be
easily able to do the voice recognition and bring it up in very easy manner. Coming to phyton
it does has functionalities such as information retrieval, task management, and device control.
Trought the combination of tragic algorithms and user-friendly interfaces, this voice assistant
plays a productive and efficient role in our daily lives.
Voice-activated virtual assistants have become popular due to their easy way of using. These
applications powered by software programs have very interesting advantages, they carry out
specific wake words and carry out a wide range of tasks from sending messages on WhatsApp
to drafting emails.
This generation is characterized by the presence of smartphones and desktops, our lives have
become fast-paced and demanding. Keeping the word activity aside, this type of application
is making our lives easier by simplifying typing etc.
We just wanted to know more about the voice assistant so here, we just aimed on exploring
the design and development of intelligent voice personal assistants using phyton, highlighting
their potential to simplify task, improve productivity and enhance user experience in today’s
digital age.
History
The history of voice assistants using Phyton application is a basement to the evolution of
technology and the growing demand for human and computer interactions While phyton is
popular language for various applications, it’s flexibility and ease of use have made it
particularly well-suited for developing VPA.
The journey began with early experiments in speech recognition and production, where
phyton libraries such as Speech recognition were used to make it as the basic voice
interactions with computer. These early implementations made a path to build complex
voice assistants.
Where machine learning is getting advanced these days, Phyton frameworks like Tensor
Flow and PY Torch became instrumental in training and deploying voice recognition
models. These models get paired with Phyton-based speech synthesis libraires like GTTS
(Google text-to-speech), enabled voice assistants to understand respond to user commands
with increasing accuracy.
The rise of smart speakers and virtual assistants from tech giants like Google, Amazon, and
Apple. The demand for voice-enabled applications gained popularity. Phyton’s broad
information provided developers to create integrations and skills for these platforms,
expanding the capabilities of voice assistants to encompass a wide range of tasks, form home
autorotation to productivity tools
Today, Phyton continues to be the first application of voice assistant development, providing
innovative applications that transfer cutting-edge technologies like natural language
understanding, machine learning and cloud computing. As this field continues to evolve,
Phyton’s role in building the voice assistant will be difficult.
Looking ahead, Phyton is able to continue shaping the future of voice assistant technology.
With ongoing advantages on machine learning, NLP, and cloud computing, Phyton based
voice assistants are expected to become even more intelligent, responsive and integrated in
our daily lives.
5|Page
CHAPTER 1
INTRODUCTION
A Voice Assistant application is a very helpful application which makes our life very easy by
converting complex things into simplified ways. So, the point comes how does it function?
So, the answer is by processing spoken commands from users it does its actions. When a user
speaks into the device’s microphone, the voice assistant employs search recognition
technology to convert the spoken words into text.
Once the user’s intent is discerned, the voice assistant determines the appropriate action to
fulfill the request, whether it’s retrying information from the web, controlling smart home
devices, or performing other tasks
VPA has an ability to complete a wide range of tasks based on user input, typically through
voice commands. From searching for information of movies to writing notes and sending
messages and providing locations VPA place major role in changing all of these
A Voice assistant application operates by understanding and responding to spoken commands
by user. When user speaks into the microphone of their device, the voice assistant uses the
advanced speech recognition technology to accurately transcribe the spoken words to text
And here we just wanted slight change and thought of bringing a voice detector like which
recognizes or voice and tell our mood swing and bring up the suitable songs or some activity
to make up the mood better, if we are in mood of happy there comes the motivation which
keeps us moving forward like this.
So here we just thought of this project, which will be helpful for the people who are suffering
from their mental illness, where they can’t tell their problems to the human beings, can have
it and I'm not sure about making the person complete fine, but yeah suffering alone will be
degraded because this app will be giving the enough support for their stability for upcoming
days.
We just looked at the applications which were developed by developers, so we can bring our
own idea as I informed above, we thought of making a brief app like collecting data of 50
mood swings and bringing the correct solution for this. We discussed it with our mentor too
and it is in the process.
In this generation this application brings happiness to everyone, including the child, teen
youths, everyone. So, Voice assistant app makes up the major role and even the feelings
detector would help we guess.
6|Page
CHAPTER 2
REVIEW OF LITREATURE
Author Name,
Journal, Research Performance Advant Disadvant
S.No
Publication Methodology Metrics ages ages
Year
8|Page
4
Divisha Pandey Started by This includes It provides rich Recourses
Researching both speech Ecosystem; it has consumption,
Afra Ali existing voice recognition a vast ecosystem Phytons dynamic
assistant accuracy library and typing and
Pub year:2022 mythologies. We (transcribing framework for automatic memory
studied this paper spoken words natural language management can
deeply and brought correctly) processing. lead to higher
out these ideas. Higher resource of
accuracy leads consumption .
to more.
5. Anjali Fahal, The main goal of We use python Python offers a Python, while easy
Trupti Kanade Artificial as a wide range of use and flexible,
intelligence (AI) is programming libraries and might not be the
Pub year:2021 the realization of language Frameworks for fastest language for
International natural dialogue because it have natural language CPU-intensive tasks.
Research Journal between humans a major processing(NLP) However, for most
of Modernization and machines. libraries which and other Ai personal assistant
in Engineering There are many IT is use to related tasks. Applications.
companies have execute
used the dialogue commands. By
systems using python
technology to Installer
establish various Packages our
kinds of Virtual personal virtual
Personal
9|Page
6. Neha Prasad, In a world with In the following The Integration Reliance on
Pranaya Kodava ever increasing paper, we of chatbot and Internet Connection:
Pub year:2020 needs for comfort, propose a Both chatbots and
ITM Web of human race is Home Voice assistant voice assistants
Conferences relying more and Automation technology require a stable
32,01002,2020 more on system that allows for hands- internet connection
technological uses a free and natural to function properly.
advancements to wholesome language control Any disruption in
find solutions to blending of of smart home the internet service
their problems. some devices. could affect the
technologies usability of home
like Internet of automation. Privacy
Things. Concerns: Voice
assistants often raise
privacy concerns as
they continuously
listen for trigger
words. Users may be
hesitant to have
devices listening to
their conversations
at all times.
10 | P a g e
2.2.1 OBSERVATIONS FROM LITERATURE REVIEW
• Brief Description of observations made from the papers.
Response Time: Evaluating the speed at which the voice assistant processes voice inputs
and provides responses. Faster response times generally lead to a more seamless user
experience, while delays can be frustrating.
Error Handling: Noting how the voice assistant handles errors or misunderstandings.
This includes observing whether it provides helpful prompts or clarifications when it
doesn't understand a command, and how it handles situations where it cannot fulfill a
request.
Integration with Other Services: If the voice assistant interacts with external services or
APIs, observing how well it integrates with those services and the accuracy of the
information it retrieves or actions it performs.
User Interface: If the voice assistant has a graphical user interface (GUI) or other means
of interaction beyond voice, assessing the usability and intuitiveness of these interfaces.
Feedback Mechanisms: Examining how the voice assistant provides feedback to the
user, such as through spoken responses, visual cues, or notifications.
Customization and Extensibility: If the voice assistant allows for customization or the
addition of new features, observing how easy it is to extend its functionality and tailor
it to specific needs or preferences.
11 | P a g e
2.2.2 LIMITATIONS OF LITERATURE REVIEW
1.The limitations we observed from the paper include dialectical variability which
affects the performance. the higher accuracy has been holding by male spoken digits
compared to female spoken digits in gender – based experiments. Including that, this
paper study mentions the major requirement of algorithm of a specific size and
recording the circumstances which is recorded in speech, which is one of the limitations
in strategy. The Paper we referred highlights the importance of integrating machine
learning and deep learning to grace the performance of voice assistants and control
various tasks which is efficient.
2.Though only limitations do not bring up the solution, as we should take up any of the
solution, we call it has possibilities of overcoming on the limitations. To overcome the
limitations which are given in the paper, firstly we should focus on integrating machine
learning and deep learning techniques to build the performance of voice assistants and
control the tasks efficiently. Adding to it, Addressing the di electrical
3.Variability by considering the difference in accuracy between female and male spoken
digits in gender-based experiments could have brought up to overcome from limitations
Possibilities of overcoming on the limitations in this proposed work
12 | P a g e
CHAPTER 3
In today's age of technology, there is a great value for quick and brilliant voice
assistants that can make the task easy and give a productive solution. However,
improving a voice assistant in Python has few challenges, which includes natural
language understanding, speech recognition, context awareness, and closest possible
co-ordination with various applications. The aim of this project is to design and
implement a advanced voice assistant using Python that can understand and respond
to user commands and perform tasks such as scheduling appointments, collecting
information from the web, controlling smart home devices, and more, all while
assuring user privacy and data security
In the present internet era, in person and business are much reliant on technology to
do daily tasks and improve productivity. As part of this digital evolution, there is
increasing value for voice assistants that can co-ordinately integrate into various
applications. However, developing an improved voice assistant in python presents
multifaced challenge is a challenging task for the developers.
Developing a voice assistant using python must also include some tasks that can be
easily done by it for example: setting a reminder, alarm, playing music, and many
more. It must be able to understand the languages that are used by the individual.
And the aim is to make is easy for the uneducated people to use the technology and
help to solve their queries.
13 | P a g e
3.2 PROPOSED RESEARCH METHODOLOGY
1. Computers/Laptops
2. Memory
3. Microphones
4. Speakers
5. Sound Card
6. Internet Connection.
14 | P a g e
REFERENCES (MIN. OF 10 PAPERS)
1.M. Bapat, H. Gune, and P. Bhattacharyya, “A paradigm-based finite state morphological
analyzer for Marathi,” in Proceedings of the 1st Workshop on South and Southeast Asian
Natural Language Processing (WSSANLP), pp. 26–34, 2010.
7.G. Muhammad, Y. Alotaibi, M. N. Huda, et al., pronunciation variation for asr: A survey
of the “Automatic speech recognition for digits, literature, Speech Communication, vol. 29,
no. in Computers and Information Technology, 2009.2, pp. 225–246, 1999.
8. S. R. Eddy, “Hidden Markov models,” Current opinion in structural biology, vol. 6, no. 3,
pp. 361–365, 1996.
9. Excellent style manual for science writers is “Speech recognition with flat direct models,”
IEEE Journal of Selected Topics in Signal Processing, 2010.
10. Srivastava S., Prakash S. (2020) Security Enhancement of IoT Based Smart Home Using
Hybrid Technique. In: Bhattacharjee A., Borgohain S., Soni B., Verma G., Gao XZ. (eds)
Machine Learning, Image Processing, Network Security and Data Sciences. MIND 2020.
Communications in Computer and Information Science, vol 1241. Springer, Singapore.
https://doi.org/10.1007/978-981-15-6318-8_44
15 | P a g e
APPENDIX: PLAGIARISM REPORT
16 | P a g e