An Intelligent Web-Based Voice Chat Bot
ABSTRACT
Speech recognition is technology that can recognize spoken words, which can then
Microsoft, Google and Apple — five of the world’s top tech companies — are
already offering this feature on various devices through services like Google
words from phonemes and contextually analyzing the words to ensure the correct
spelling of words that sound the same. Approach: Studying the possibility of
the sound signals and neural networks of irregular users. Fixed weights are
trained on those forms first and then the system gives the output match
for each of these formats and high speed. The proposed neural network
Speech recognition or speech to text includes capturing and digitizing the sound
from phonemes and contextually analyzing the words to ensure the correct spelling
of words that sound the same. Approach: Studying the possibility of designing a
neural network in which the system is able to distinguish the sound signals and
neural networks of irregular users. Fixed weights are trained on those forms first
and then the system gives the output match for each of these formats and high
modulated techniques.
Most work to date in the field of IVR systems has focused primarily on input
mechanisms based on the keyboard or touch pad. In such cases it is tedious to
type the input command every time. This way of providing input to the computer
system can be improved by accepting direct speech input instead of typed text.
This enables faster interaction between the system and the user, which increases
overall customer satisfaction. It also speeds up access to information from the
system.
INTRODUCTION
Using voice commands has become pretty ubiquitous nowadays, as more mobile
phone users use voice assistants such as Siri and Cortana, and as devices such as
Amazon Echo and Google Home have been invading our living rooms. These
systems are built with speech recognition software that allows their users to issue
voice commands. Now, our web browsers will become familiar with the Web
Speech API, which allows users to integrate voice data into web apps.
With the current state of web apps, we can rely on various UI elements to interact
with users. With the Web Speech API, we can develop rich web applications with
natural user interactions and minimal visual interface, using voice commands. This
enables countless use cases for richer web applications. Moreover, the API can
make web apps accessible, helping people with physical or cognitive disabilities or
This project creates an artificial intelligence (AI) voice chat interface in the
browser. The app will listen to the user’s voice and reply with a synthetic voice.
OBJECTIVES
The prime objective of the proposed project is to design and build a system
that a basic user can interact with through voice commands, i.e. a system that
is capable of recognizing isolated spoken words and processing the request to
forward the given task. The
recognition.
attribute.
Using voice is introduced; creating and catering for a more personal and
convenient experience. The process of an online chat system would follow a client
server approach which acquires the signal and streams it to a server. The input
voice is then processed and a response is generated. This process places a large
part process of capturing and analysis of an input signal [3]. While the client
utilizes the operating system for an input mechanism to acquire a signal, it is for
the client to interpret the signal. This process can alleviate processing from the
server and allow the server to generate responses faster than when it has more
into two categories: data retrieval and information output. The core focus of this
to the request, factual and personal. This requires aspects of news and an intelligent
platform that attempts to simulate typed conversation, with the goal of at least
temporarily fooling the human into thinking they were speaking to another person.
given topic using natural language. To date, several chat-bots have been
deployed on the internet for the purposes of education, customer service,
supervision, entertainment, etc. Well-known existing chat-bots are ALICE, Siri and
Ok Google. AI-based chat-bots are popular because they are lightweight, easy
application for college purposes, which will provide all the information related to
college and student queries. First, the bot analyzes the message that the user sends
to the chat-bot program; it then matches a reply from the MySQL database, and the
answer is formulated and sent back to the user. Students must select a category
from a drop-down list with options such as admission, faculty
details, syllabus, exams, etc. This removes the need for students to enquire directly
at the college. If a new applicant enquires about admission or the particulars of any
section of the college, the bot will help to answer the applicant's enquiry.
The chat-bots that are currently live in the market use text, voice and
emotion intelligence as input. In this paper, we have used text as the user input.
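The category-then-match flow described above can be sketched roughly as follows. This is only an illustration under stated assumptions: the paper stores replies in MySQL, whereas the sketch uses an in-memory SQLite database so that it runs without a server, and the table name `responses`, its columns, and the sample answers are all hypothetical.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory table of canned replies (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE responses (category TEXT, keyword TEXT, answer TEXT)"
    )
    # Placeholder rows for illustration only; not data from the paper.
    conn.executemany(
        "INSERT INTO responses VALUES (?, ?, ?)",
        [
            ("admission", "deadline", "Admission deadlines are on the notice board."),
            ("exams", "timetable", "The exam timetable is published by the exam cell."),
        ],
    )
    return conn


def find_reply(conn, category, message):
    """Return the first stored answer whose keyword occurs in the message."""
    words = message.lower().split()
    rows = conn.execute(
        "SELECT keyword, answer FROM responses WHERE category = ?", (category,)
    )
    for keyword, answer in rows:
        if keyword in words:
            return answer
    # Fallback when nothing in the selected category matches.
    return "Sorry, I do not have an answer for that yet."


conn = build_demo_db()
print(find_reply(conn, "exams", "When is the exam timetable out"))
```

Restricting the lookup to the selected category first keeps the keyword match small and avoids answering an admission question with an exam reply.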
If the present proposes need to be improved, we have to provide some options. For
the same, we restart from the basics. There is always need to rethink about the
machineries do well on this portion. They can help carry out even complex
The web service processes all received queries using the response generation
system), which makes use of a data repository. The data repository is updated by
IBM has had a prominent role within speech recognition since its inception,
different words, advancing the initial work from Bell Labs from the 1950s.
However, IBM didn’t stop there, but continued to innovate over the years,
METHODOLOGY
Artificial Intelligence
consciousness and emotionality. The distinction between the former and the latter
textbooks define the field as the study of "intelligent agents": any device that
perceives its environment and takes actions that maximize its chance of
often used to describe machines (or computers) that mimic "cognitive" functions
that humans associate with the human mind, such as "learning" and "problem
solving".
"intelligence" are often removed from the definition of AI, a phenomenon known
as the AI effect. A quip in Tesler's Theorem says "AI is whatever hasn't been done
attracted widespread global attention. For most of its history, AI research has been
divided into sub-fields that often fail to communicate with each other. These sub-
perception and the ability to move and manipulate objects. General intelligence is
computational intelligence, and traditional symbolic AI. Many tools are used in AI,
networks, and methods based on statistics, probability and economics. The AI field
The field was founded on the assumption that human intelligence "can be so
precisely described that a machine can be made to simulate it". This raises
philosophical arguments about the mind and the ethics of creating artificial beings
endowed with human-like intelligence. These issues have been explored by myth,
Speech Recognition
process human speech into a written format. While it’s commonly confused with
verbal format to a text one whereas voice recognition just seeks to identify an
Many speech recognition applications and devices are available, but the more
advanced solutions use AI and machine learning. They integrate grammar, syntax,
structure, and composition of audio and voice signals to understand and process
human speech. Ideally, they learn as they go — evolving responses with each
interaction.
The best kind of systems also allow organizations to customize and adapt the
Acoustics training: Attend to the acoustical side of the business. Train the
center) and speaker styles (like voice pitch, volume and pace).
making inroads in several areas, the better to improve human and machine
interaction.
Speech recognition algorithms
Speech recognition technology is evaluated on its accuracy rate, i.e. word error rate
(WER), and speed. A number of factors can impact word error rate, such as
parity – meaning an error rate on par with that of two humans speaking – has long
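As a rough illustration of the metric named above, word error rate is commonly computed as the word-level edit distance (substitutions, deletions and insertions) divided by the number of words in the reference transcript. The function name and the sample sentences below are illustrative assumptions, not part of the system described in this paper.

```python
def word_error_rate(reference, hypothesis):
    """WER = (substitutions + deletions + insertions) / reference word count."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference transcript must not be empty")

    # prev[j] holds the edit distance between the reference words seen so
    # far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        cur = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            cur.append(
                min(
                    prev[j] + 1,         # deletion of a reference word
                    cur[j - 1] + 1,      # insertion of a hypothesis word
                    prev[j - 1] + cost,  # match or substitution
                )
            )
        prev = cur
    return prev[-1] / len(ref)


print(word_error_rate("order the pizza", "order a pizza"))
```

Because insertions are counted, the value can exceed 1.0 when the hypothesis is much longer than the reference.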
Various algorithms and computation techniques are used to recognize speech into
text and improve the accuracy of transcription. Below are brief explanations of
Markov chain model, which stipulates that the probability of a given state
hinges on the current state, not its prior states. While a Markov chain
model is useful for observable events, such as text inputs, hidden Markov
sentences, etc.—in the sequence. These labels create a mapping with the
sequence.
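The Markov property mentioned above, where the next state depends only on the current state, can be illustrated with a small sketch. The two states and their transition probabilities are invented for the example and are not taken from any trained model.

```python
# Hypothetical transition table: TRANSITIONS[current][next] = probability.
TRANSITIONS = {
    "silence": {"silence": 0.5, "speech": 0.5},
    "speech": {"silence": 0.25, "speech": 0.75},
}


def sequence_probability(states, transitions):
    """Probability of a state sequence, given that it starts in states[0].

    Each step uses only the current state to look up the next-state
    probability, which is exactly the Markov assumption.
    """
    prob = 1.0
    for current, nxt in zip(states, states[1:]):
        prob *= transitions[current][nxt]
    return prob


print(sequence_probability(["silence", "speech", "speech"], TRANSITIONS))
```

A hidden Markov model adds a second layer on top of this: the states themselves are not observed, only outputs emitted from them.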
For example, “order the pizza” is a trigram or 3-gram and “please order the
weights, a bias (or threshold) and an output. If that output value exceeds a
given threshold, it “fires” or activates the node, passing data to the next
layer in the network. Neural networks learn this mapping function through
accurate and can accept more data, this comes at a performance efficiency
models.
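The behaviour of a single node described above (weighted inputs plus a bias, compared against a threshold) can be sketched as follows. The weights, bias and threshold values are arbitrary assumptions chosen for illustration; a real network would learn them during training.

```python
def node_output(inputs, weights, bias):
    """Weighted sum of the inputs plus the bias term."""
    return sum(x * w for x, w in zip(inputs, weights)) + bias


def fires(inputs, weights, bias, threshold=0.0):
    """A node 'fires' when its output exceeds the given threshold."""
    return node_output(inputs, weights, bias) > threshold


# Hypothetical node with two inputs.
WEIGHTS = [0.5, 0.5]
BIAS = -0.25

print(fires([1.0, 0.0], WEIGHTS, BIAS))  # output 0.25 exceeds 0.0
print(fires([0.0, 0.0], WEIGHTS, BIAS))  # output -0.25 does not
```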
technology today, helping businesses and consumers save time and even lives.
daily lives, particularly on our mobile devices. We use voice commands to access
them through our smartphones, such as through Google Assistant or Apple’s Siri,
for tasks, such as voice search, or through our speakers, via Amazon’s Alexa or
Microsoft’s Cortana, to play music. They’ll only continue to integrate into the
Healthcare: Doctors and nurses leverage dictation applications to capture and log
help a call center transcribe thousands of phone calls between customers and
agents to identify common call patterns and issues. Cognitive bots can also talk to
people via a webpage, answering common queries and solving basic requests
without needing to wait for a contact center agent to be available. In both instances
speech recognition systems help reduce time to resolution for consumer issues.
Security: As technology integrates into our daily lives, security protocols are an
PYTHON
Python is an interpreted, high-level, general-purpose programming
language. Created by Guido van Rossum and first released in 1991, Python design
programmers write clear, logical code for small and large-scale projects. Python is
IMPLEMENTATION
A framework "is a code library that makes a developer's life easier when building
Flask is based on the Werkzeug WSGI toolkit and Jinja2 template engine. Both are
Pocoo projects. It is classified as a microframework because it does not require
functions. However, Flask supports extensions that can add application features as
technologies and several common framework related tools. Applications that use
WSGI
Web Server Gateway Interface (WSGI) has been adopted as a standard for Python
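To make the WSGI contract concrete, the following minimal sketch shows the callable that every WSGI application exposes: it receives the request environment and a `start_response` callback and returns an iterable of bytes. It uses only the standard library; the helper `call_app` and the response text are illustrative assumptions, and in this project Flask (via Werkzeug) supplies this callable on the developer's behalf.

```python
from wsgiref.util import setup_testing_defaults


def application(environ, start_response):
    """A bare WSGI application: report the status, then return the body."""
    body = b"Hello from a WSGI app"
    headers = [
        ("Content-Type", "text/plain"),
        ("Content-Length", str(len(body))),
    ]
    start_response("200 OK", headers)
    return [body]


def call_app(app, path="/"):
    """Invoke a WSGI app directly, without a web server (hypothetical helper)."""
    environ = {}
    setup_testing_defaults(environ)  # fills in the required WSGI keys
    environ["PATH_INFO"] = path
    captured = {}

    def start_response(status, headers, exc_info=None):
        captured["status"] = status
        captured["headers"] = headers

    body = b"".join(app(environ, start_response))
    return captured["status"], body


print(call_app(application))
```

Any WSGI-compliant server can host `application` unchanged, which is the point of having a common interface between servers and frameworks.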
Werkzeug
utility functions. This enables building a web framework on top of it. The Flask
Jinja2
combines a template with a certain data source to render dynamic web pages.
Flask is often referred to as a micro framework. It aims to keep the core of an
application simple yet extensible. Flask does not have a built-in abstraction layer for
database handling, nor does it have form validation support. Instead, Flask
supports extensions that add such functionality to the application. Some of the
To build the web app, we’re going to take three major steps:
1. Use the Web Speech API’s SpeechRecognition interface to listen to the user’s
voice.
Data-flow diagrams (DFDs) model a perspective of the system that is most readily
understood by users – the flow of information through the system and the activities
non-specialist users alike. The models enable software engineers, customers and
requirements. Although this means that our customers are required to understand
the modeling techniques and constructs, in data-flow modeling only a limited set of
constructs are used, and the rules applied are designed to be simple and easy to
follow. These same rules and constructs apply to all data-flow diagrams (i.e., for
each of the different software process activities in which DFDs can be used).
Data-flow diagrams provide a very important tool for software engineering, for a
number of reasons:
• The system scope and boundaries are clearly indicated on the diagrams (more
will be described about the boundaries of systems and each DFD later in this
chapter).
• Processes
• Data-flows
• Data stores
• External entities
The rest of this section describes each of the four elements of DFDs, in terms of
their purpose, how the element is notated and the rules associated with how the
different notations exist for depicting these elements, although it is only the shape
of the symbols which vary in each case, not the underlying logic. This unit uses the
As data-flow diagrams are not a part of the UML specification, ArgoUML and
Umbrello do not support their creation. However, Dia is free software available for
Processes
Purpose
Processes are the essential activities, carried out within the system boundary, that
use information. A process is represented in the model only where the information
which provides the input into the activity is manipulated or transformed in some
way, so that the data-flowing out of the process is changed compared to that which
flowed in. The activity may involve capturing information about something that
customer's address for example. It may require calculations to be carried out, such
as the quantity left in stock following the allocation of stock items to a customer's
Notation
Processes are depicted with a box, divided into three parts.
[Figure: The notation for a process.]
The top left-hand box contains the process number. This
is simply for identification and reference purposes, and does not in any way imply
priority and sequence. The main part of the box is used to describe the process
itself, giving the processing performed on the data it receives. The smaller
rectangular box at the bottom of the process is used in the Current Physical Data-
Flow Diagram to indicate the location where the processing takes place. This may
be the physical location — the Customer Services Department or the Stock Room,
for example. However, it is more often used to denote the staff role responsible for
Rules
followed by a pithy and meaningful description of the object of the activity. Create
• Processes may not act as data sources or sinks. Data flowing into a process must
have some corresponding output, which is directly related to it. Similarly, data-
flowing out of a process must have some corresponding input to which it is directly
related.
• Normally only processes that transform system data are shown on data-flow
flow to the data store (and not the initial retrieval from the data store) is shown on
the diagram.
another process, only the flow from the data store to the process is shown on the
diagram.
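The rule that a process may not act as a data source or sink lends itself to a mechanical check. The sketch below is one possible way to express it, assuming a diagram is held as a simple list of flows; the `Flow` class, the process names and the flow labels "Job Entry" and "Stock Request" are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Flow:
    """One directed data-flow between two named diagram elements."""
    source: str
    target: str
    label: str


def unbalanced_processes(processes, flows):
    """Map each process that lacks an input or an output to what is missing."""
    problems = {}
    for name in processes:
        missing = []
        if not any(f.target == name for f in flows):
            missing.append("input")   # process would act as a data source
        if not any(f.source == name for f in flows):
            missing.append("output")  # process would act as a data sink
        if missing:
            problems[name] = missing
    return problems


PROCESSES = ["1 Record Call", "2 Allocate Stock"]
FLOWS = [
    Flow("Customer", "1 Record Call", "Service Call Details"),
    Flow("1 Record Call", "D1 Job Book", "Job Entry"),
    Flow("1 Record Call", "2 Allocate Stock", "Stock Request"),
]

print(unbalanced_processes(PROCESSES, FLOWS))
```

Here the second process receives data but produces none, so the check reports it as missing an output.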
Data-flows
Purpose
the data-flow diagram. Data-flows are used to model the flow of information into
the system, out of the system, and between elements within the system.
external entities, which is, strictly speaking, outside of the system boundaries.
Notation
A data-flow is depicted on the diagram as a directed line drawn between the
source and recipient of the data-flow, with the arrow depicting the direction of
flow.
[Figure: Notation for a data-flow.]
The directed line is labelled with the data-flow name,
which briefly describes the information contained in the flow. This could be a
Maintenance Contract, Service Call Details, Purchase Order, and so on. Data-flows
between external entities are depicted by dashed, rather than unbroken, lines.
Data stores
Purpose
A data store is a place where data is stored and retrieved within the system. This
may be a file (the Customer Contracts file, for example), a catalogue or reference
list (Options Lists, for example), a log book such as the Job Book, and so on.
Notation
reference prefixed by a letter. The main area of the rectangle is labelled with the
name of the data store. Brief names are chosen to reflect the content of the data
store.
Rules
• One convention that could be used is to determine the letter identifying a data
• Data stores may not act as data sources or sinks. Data flowing into a data store
provide input data flows to receive output flows from a number of processes.
For the sake of clarity and to avoid crisscrossing of data-flows in the data-flow
diagram, a single data store may be included in the diagram at more than one point.
Where the depiction of a data store is repeated in this way, this is signified by
drawing a second vertical line along the left-hand edge of the rectangle for each
External entities
Purpose
External entities are entities outside of the system boundary which interact with the
system, in that they send information into the system or receive information from
and Supplier in our running example; or just external to the application area where
users' activities are not directly supported by the system under investigation.
Accounts and Engineering are shown as external entities as they are recipients of
information from the system. Sales also provide input to the system. External
entities are often referred to as sources and sinks. All information represented
within the system is sourced initially from an external entity. Data can leave the
External entities are represented on the diagram as ovals drawn outside of the
Names consist of a singular noun describing the role of the entity. Above the label,
Rules
• Each external entity must communicate with the system in some way, thus there
is always a data-flow between an external entity and a process within the system.
• External entities may provide and receive data from a number of processes. It
may be appropriate, for the sake of clarity and to avoid crisscrossing of data flows,
to depict the same external entity at a number of points on the diagram. Where this
is the case, a line is drawn across the left corner of the ellipse, for each occurrence
of the external entity on the diagram. Customer is duplicated in this way in our
example.
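The external-entity rules above can also be checked mechanically. The following sketch assumes flows are stored as (source, target) pairs; the function names and the process name "1 Record Call" are hypothetical. It flags any entity with no flow to or from a process, and picks the dashed line style that the notation uses for flows between two external entities.

```python
def unconnected_entities(entities, processes, flows):
    """Return external entities with no data-flow to or from any process."""
    missing = []
    for entity in entities:
        linked = any(
            (src == entity and dst in processes)
            or (dst == entity and src in processes)
            for src, dst in flows
        )
        if not linked:
            missing.append(entity)
    return missing


def line_style(flow, entities):
    """Flows between two external entities are dashed; all others are solid."""
    src, dst = flow
    return "dashed" if src in entities and dst in entities else "solid"


ENTITIES = ["Customer", "Supplier", "Accounts"]
PROCESSES = ["1 Record Call"]
FLOWS = [
    ("Customer", "1 Record Call"),
    ("1 Record Call", "Accounts"),
    ("Customer", "Supplier"),
]

print(unconnected_entities(ENTITIES, PROCESSES, FLOWS))
print([line_style(f, ENTITIES) for f in FLOWS])
```

In this example Supplier only exchanges data with Customer, so it breaks the rule that every external entity must communicate with a process inside the system.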
EXPERIMENTAL RESULTS
create a website and implemented speech recognition and responses through the
website.
SYSTEM REQUIREMENTS
Windows 10
CONCLUSION
Interaction with the website. We created a website using the Flask framework. The
chat-bot has a stored set of responses, takes dynamic user input as speech into