1.1 Background To The Study: Chapter One: Introduction
Voice is the most basic, common and efficient means by which people
interact with each other. Today, speech technologies are commonly available for a limited but
interesting range of tasks. Some of these tasks include voice recognition as a biometric,
correctly and reliably to human voices and provide useful and valuable services to the users.
Just as communicating with a computer by voice is faster than using a peripheral device such
as a keyboard, people will also prefer this method to ease communication among
among human beings is primarily based on spoken words. Therefore, it is common for people
Speech-to-text software, also known as Automated Speech Recognition (ASR) software, does
exactly what the name implies. It uses speech recognition technology to identify patterns in
sound waves and matches them to phonemes in order to translate them into text. This software has
existed in some form since the early 1950s. It is an ever-evolving technology
that has gradually become part of our everyday lives. As a result, automated speech
recognition software is becoming more affordable, better and more easily accessible. While some
use it for automated dictation, it can also allow quick and easy control of digital web
applications. Among the many reasons why speech recognition software is
preferable, it is worth narrowing the focus to the lecture halls in our schools today, where it
could make learning and teaching easier and more fun, given the different hearing abilities
among us. When a lecturer's speech is transcribed during an ongoing lecture, it will
be easily accessible without anyone having to write down every spoken word, which is a boring and
In society, every individual, including animals, interacts with others and, in one way or
another, tries to convey vital information to someone else. The recipient of the
message may get the exact or full message the sender is trying to convey, may get only a
partial idea, or sometimes cannot understand the message at all, depending on how fast or slowly
the message is conveyed. Also, considering
This thesis considers an overview of voice
day interactions.
The aim of this study is to design and develop working voice recognition software for
speech-to-text conversion, to help make learning in schools easier, more fun and more effective.
This application is software that can be used for speech recognition by converting
text, manipulate the text, and format the text using the same commands.
To develop a model that will compare the wave data with phoneme database and
The programming language chosen for this project is Python. Tools such as PyQt5, Tkinter
and CMU Sphinx will be used alongside the Google Speech Engine and the Visual Studio Code IDE,
and the Agile software development cycle will be adopted to achieve this goal.
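
The objective of comparing wave data with a phoneme database can be illustrated with a small, self-contained Python sketch. It is only a toy under stated assumptions, not the implementation used in this project: pure tones stand in for recorded speech, each frame is reduced to two simple features (RMS energy and zero-crossing rate), and the three-entry "phoneme database" with the labels AA, IY and S is hypothetical. Production engines such as CMU Sphinx rely on richer features and statistical acoustic models rather than this nearest-match lookup.

```python
import math

SAMPLE_RATE = 16000  # samples per second (assumed for this sketch)
FRAME_SIZE = 400     # 25 ms frames at 16 kHz (assumed for this sketch)


def sine_frame(freq_hz, n=FRAME_SIZE):
    """Synthesize one frame of a pure tone, standing in for recorded audio."""
    return [math.sin(2 * math.pi * freq_hz * i / SAMPLE_RATE) for i in range(n)]


def features(frame):
    """Return (RMS energy, zero-crossing rate) for one frame of samples."""
    rms = math.sqrt(sum(s * s for s in frame) / len(frame))
    # A crossing is counted whenever two neighbouring samples differ in sign.
    crossings = sum((a >= 0) != (b >= 0) for a, b in zip(frame, frame[1:]))
    return (rms, crossings / (len(frame) - 1))


# Hypothetical phoneme database: label -> reference feature vector.
# The labels and the tones behind them are illustrative only.
PHONEME_DB = {
    "AA": features(sine_frame(200)),
    "IY": features(sine_frame(800)),
    "S": features(sine_frame(2000)),
}


def closest_phoneme(frame):
    """Return the database label whose reference features are nearest."""
    feats = features(frame)
    return min(PHONEME_DB, key=lambda label: math.dist(feats, PHONEME_DB[label]))


def decode(samples):
    """Label each full frame, then collapse consecutive repeated labels."""
    labels = []
    for start in range(0, len(samples) - FRAME_SIZE + 1, FRAME_SIZE):
        label = closest_phoneme(samples[start:start + FRAME_SIZE])
        if not labels or labels[-1] != label:
            labels.append(label)
    return labels


if __name__ == "__main__":
    signal = sine_frame(210) + sine_frame(210) + sine_frame(1900)
    print(decode(signal))
```

In this sketch, two frames of a 210 Hz tone followed by one frame of a 1900 Hz tone are matched to the nearest database entries, and the repeated label is collapsed so that the decoded sequence contains one label per sound. Nearest-match lookup was chosen only because it makes the comparison step easy to see.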
The software helps both lecturers and students to manage their workload.
The software can operate transparently behind the application, benefiting users who
It produces a standard file that can be used in another speech dictation or any
This application will enable end users to manage their files by restoring, renaming,
Speech can be saved in a suitable format so that the user can replay the recorded speech if
required.
Speech-activated macros enable the user to speak a natural word or phrase rather
This application could also help those with limited mobility and those who are
software.
development, and its applications. The first part describes the speech recognition
process for speech-to-text conversion and its applications in different sectors, with a focus
on tertiary institutions. The second part of this report covers the speech recognition
process, the code for the software and how it works. Lastly, this report concludes with the
different potential uses of the designed application, further improvements and considerations.
iv. The software cannot understand the complexities of jargon and phrases due to
its limited vocabulary.