0% found this document useful (0 votes)
483 views

1.1 Background To The Study: Chapter One: Introduction

The document introduces a study that aims to design and develop a voice recognition software to convert speech to text to make learning in schools more easy and fun. It discusses how speech is a primary form of human communication and voice interfaces are becoming more common. The study aims to access current learning methods in schools and create software that recognizes speech and displays text in real-time to benefit both students and teachers. It outlines the objectives, significance, scope, and limitations of the study.

Uploaded by

vector
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
483 views

1.1 Background To The Study: Chapter One: Introduction

The document introduces a study that aims to design and develop a voice recognition software to convert speech to text to make learning in schools more easy and fun. It discusses how speech is a primary form of human communication and voice interfaces are becoming more common. The study aims to access current learning methods in schools and create software that recognizes speech and displays text in real-time to benefit both students and teachers. It outlines the objectives, significance, scope, and limitations of the study.

Uploaded by

vector
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 4

CHAPTER ONE: INTRODUCTION

1.1 BACKGROUND TO THE STUDY

Voice is the basic, common and efficient form of communication method for people to

interact with each other. Today, speech technologies are commonly available for a limited but

interesting range of tasks. Some of these tasks includes; voice recognition as biometric,

speech-to-text and text-to-speech et-cetera. These technologies enable machines to respond

correctly and reliably to human voices and provide useful and valuable services to the users.

As communication with computer is faster using voice rather than the peripheral device such

as keyboard, so also people will prefer such method to ease communication among

themselves in schools, organizations, hospitals and in the world at large. Communication

among human beings is primarily based on spoken words. Therefore, it is common for people

to expect voice interfaces with the computer system.

1.2 JUSTIFICATION FOR THE STUDY

Speech-to-text software also known as Automated Speech Recognition (ASR) software, does

exactly what the name implies. It uses speech recognition technology to identify patterns in

sound wave and matches them to the phonemes to translate them into text. This software has

been in place in some form or another since early 1950s. It is an ever evolving technology

that has gradually become part of our everyday lives. As a result, automated speech

recognition software is becoming more affordable, better and easily accessible. While others

use it for automated dictation, it can also allow quick and easy control of digital web

application. With all of the innumerable reasons why speech recognition software is

preferable, it is also good to narrow it down to the lecture halls in our schools today which
could make learning and teaching more easier and fun considering the different abilities we

have in hearing. When a lecturer’s speech is translated in an on going lecture period, it will

be easily accessible without having to dictate every spoken word which is a boring and

rigorous format of learning.

1.3 STATEMENT OF THE PROBLEM

In the society, every individual including animals interacts with each other and in one way or

the other, tries to convey vital information from one person to another. The recipient of the

message, may get the exact or full message the sender is trying to convey, or may get the

partial idea or sometimes cannot understand the message at all depending on how fast or slow

the message conveyed may be. Also, considering This thesis, considers an overview of voice

recognition system:speech-to-text, software development and its application in our day-to-

day interactions.

1.4 AIM AND OBJECTIVES OF THE STUDY

The aim of this study is to design and develop a working voice recognition software: speech-

to-text conversion, to help in making learning in schools more easy, fun and enhanced.

Some of the specific objectives are:

 To access the current or the traditional way of communications/learning in schools.

 To design and develop an automated speech recognition software.

 This application is a software that can be used for speech recognition by converting

the voice to text.


 To design an interactive user friendly text editor which allows the user to enter the

text, manipulate the text, format the text using the same commands.

 To develop a model that will compare the wave data with phoneme database and

displaying the sentences on the screen.

 To test the developed automated speech recognition software.

 To implement the designed speech recognition system

1.5 METHOD OF ACHIEVING THE OBJECTIVES

The programming language of choice to be used on this project is python and tools like

PyQT5, Tkinter, CMU Sphinx, alongside Google Speech Engine, Visual Studio Code IDE and

also the Agile Software Development Cycle will be adopted to achieve this goal.

1.6 SIGNIFICANCE OF THE STUDY

 The software helps both lecturers and students to manage their workload.

 The software can operate transparently behind the application, benefiting users who

are unfamiliar with speech recognition system.

 It prepares such a standard file that can be used in another speech dictation or any

common text editors.

 This application will enable end users to manage their files by restoring, renaming,

saving, backing up and deleting.

 Speech can be saved in a right format so that the user can replay recorded speech if

need be for any corrections.


 This application can switch between dictations made3 without any extra efforts

required.

 Speech activated macros enables the user to speak a natural word or phrase rather

than use the keyboard to activate a macro.

 This application could also help those with limited mobility and those who are

physically challenged (hearing impaired people) through the use of audio-visual

software.

1.7 SCOPE OF THE STUDY

This thesis report considers an overview of speech recognition technology software

development, and its applications. The first part deals with the descriptions of speech

recognition process from text-to-speech, and its applications in different sectors but our focus

is the tertiary institutions. The second part of this report covers the speech recognition

process, the code for the software and its working. Lastly, this report concludes at the

different potential uses of the designed application, further improvements and considerations.

1.8 LIMITATION OF THE STUDY

Some of the limitations of the thesis are as follows;

i. Lack of maximum accuracy and misinterpretation of the spoken words.

ii. Time cost and productivity in training and set-up.

iii. Background noise interference.

iv. The software cannot understand the complexities of jargon and phrases due to

limited vocabulary.

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy