0% found this document useful (0 votes)

54 views17 pages

Automatic Speech Recognition

Automatic speech recognition is the task of getting a computer to understand spoken language by either reacting appropriately or converting speech to text. Humans do this through the ear and brain processing sound waves produced during articulation. Computers do it by digitizing the acoustic signal, analyzing it acoustically, matching it to a phoneme dictionary using a language model. Multilingual speech recognition systems use techniques like universal speech models, language identification classifiers, and monolingual speech recognizers with dynamic confidence scoring to recognize multiple languages. The end-to-end multilingual ASR system has client, frontend, and backend components including an LID backend, speech recognizer backend, web search backend, and voice synthesizer backend. HMM-

Uploaded by

Mayank Kulkarni

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

54 views17 pages

Automatic Speech Recognition

Uploaded by

Mayank Kulkarni

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 17

Automatic Speech Recognition

What is the task?

Getting a computer to understand spoken

language
By understand we might mean

React appropriately
Convert the input speech into another medium,
e.g. text

How do humans do it?

Articulation produces
sound waves which
the ear conveys to the brain
for processing
3

How computers do it?

Acoustic waveform

Acoustic signal

Digitization
Acoustic analysis of the speech
signal
Phoneme dictionary
Language model

Speech recognition

Multilingual Architecture

Multilingual speakers already out-number

monolingual speakers.
The capacity to transparently recognize multiple
spoken languages is a desirable feature of ASR
systems.
eg. OK GOOGLE, SIRI

Multilingual Techniques

Universal Speech Model

Language Identification (LID) classifiers

Monolingual speech recognizers decode along

with LID (Confidence Score)
Dynamic confidence score and LID decision

ASR Multilingual Design

The end-to-end multilingual speech recognition system consists of the

following components:
1. Client
2. Frontend
-Recognize
-Recognize+Search+Synthesis
-Multi-recognize+Search+Synthesis
3. Backend
-LID Backend
-Speech Recognizer Backend
-Web Search Backend
-Voice Synthesizer Backend
9

Multirecognizer Module

Representation of Speech & Speech

Signal

Grammar & Syntax

-How the occurrence of words in sequence is governed

Lexicon or Dictionary

- How a word is supposed to be pronounced as a

sequence of unitary sounds

Acoustic-phonetics

-How a unitary sound and/or a sequence of unitary sounds

are supposed to be produced with the articulatory
apparatus
12

THE HIDDEN MAROV MODEL

The input audio waveform from a microphone is converted into a sequence of

fixed size acoustic vectors Y 1: T = y 1. . . y T in a process called feature
extraction[3]. The decoder then attempts to find the sequence of words w 1: L =
w 1. . . w L which is most likely to have generated Y, i.e. the decoder tries to
find,
w = arg max {P (w|Y)}.
However, since P (w|Y) is difficult to model directly, Bayes Rule is used
to transform above equation into the equivalent problem of finding:
w = arg max {p(Y |w) P (w)}

Arcgitecture of HMM Based

Recognizer

The overall recognition system of speech recognition using HMM includes :

Feature Analysis

Unit Matching System

Lexical Decoding

Syntactic analysis

Semantic Analysis

Phoneme and Topologies

Composite HMM for Vertibri Recogition (Pronunciation Dictionary)

Speech Recognition Seminar Report
87% (97)
Speech Recognition Seminar Report
32 pages
Artificial Intelligence-For Speech Recognition
100% (3)
Artificial Intelligence-For Speech Recognition
13 pages
Lecture 9 - Speech Recognition
No ratings yet
Lecture 9 - Speech Recognition
65 pages
Speechrecognitionfinalpresentation 141124072610 Conversion Gate01
No ratings yet
Speechrecognitionfinalpresentation 141124072610 Conversion Gate01
30 pages
Automatic Speech Recognition (ASR) : Omar Khalil Gómez - Università Di Pisa
100% (1)
Automatic Speech Recognition (ASR) : Omar Khalil Gómez - Università Di Pisa
65 pages
Speech Recognition1
100% (1)
Speech Recognition1
39 pages
Automatic Speech Recognition
No ratings yet
Automatic Speech Recognition
35 pages
Xiao Guest Lecture ASR
No ratings yet
Xiao Guest Lecture ASR
39 pages
A Review On Different Approaches For Speech - Recognition System
No ratings yet
A Review On Different Approaches For Speech - Recognition System
6 pages
Unit 5 UA
No ratings yet
Unit 5 UA
19 pages
Speech Recognition UTHM
No ratings yet
Speech Recognition UTHM
30 pages
Lectures 1 Rabiner Speech Processing
No ratings yet
Lectures 1 Rabiner Speech Processing
77 pages
3MCA67 Speech Recognition
No ratings yet
3MCA67 Speech Recognition
14 pages
Speech Recognition Application
No ratings yet
Speech Recognition Application
13 pages
(IJCST-V4I2P62) :Dr.V.Ajantha Devi, Ms.V.Suganya
No ratings yet
(IJCST-V4I2P62) :Dr.V.Ajantha Devi, Ms.V.Suganya
6 pages
Feature Extraction Using PCA
No ratings yet
Feature Extraction Using PCA
36 pages
Artificial Intelligence For Speech Recognition
No ratings yet
Artificial Intelligence For Speech Recognition
13 pages
Speech Recognition1
No ratings yet
Speech Recognition1
24 pages
HG3052 SpeechSynthesisAndRecognition Lecture 10 Update2019-20
No ratings yet
HG3052 SpeechSynthesisAndRecognition Lecture 10 Update2019-20
49 pages
Speech Recognition Seminar
No ratings yet
Speech Recognition Seminar
19 pages
Hidden Markov Model and Persian Speech Recognition
No ratings yet
Hidden Markov Model and Persian Speech Recognition
9 pages
Speech Recognition Report
100% (1)
Speech Recognition Report
20 pages
Automatic Speech Recognition
No ratings yet
Automatic Speech Recognition
34 pages
Automatic Speech Recognition: 2.1 Relevant Keywords From Probability Theory and Statistics
No ratings yet
Automatic Speech Recognition: 2.1 Relevant Keywords From Probability Theory and Statistics
14 pages
14-Speech Recognition
No ratings yet
14-Speech Recognition
11 pages
Final Slide
No ratings yet
Final Slide
18 pages
Minor Project123
No ratings yet
Minor Project123
40 pages
Presentation On Speech Recognition
No ratings yet
Presentation On Speech Recognition
11 pages
A Speaker Independent Continuous Speech Recognizer For Amharic
No ratings yet
A Speaker Independent Continuous Speech Recognizer For Amharic
5 pages
Vivek Kumar - 1613112052
No ratings yet
Vivek Kumar - 1613112052
7 pages
Speech Recognition: BY Charu Joshi
100% (2)
Speech Recognition: BY Charu Joshi
26 pages
IT Report-1
No ratings yet
IT Report-1
14 pages
Asr01 Intro
No ratings yet
Asr01 Intro
43 pages
A Seminar Report On: R. H. Sapat College of Engineering, Management Studies and Research
No ratings yet
A Seminar Report On: R. H. Sapat College of Engineering, Management Studies and Research
32 pages
Speech Recognition
No ratings yet
Speech Recognition
4 pages
Comparative Analysis of Automatic Speech Recognition Techniques
No ratings yet
Comparative Analysis of Automatic Speech Recognition Techniques
8 pages
Lecture1 PDF
No ratings yet
Lecture1 PDF
28 pages
SPEECH
100% (1)
SPEECH
17 pages
Assignment Submission Speech Recognition System Architectural Design
No ratings yet
Assignment Submission Speech Recognition System Architectural Design
5 pages
SPEECH RECOGNITION SYSTEM Final
No ratings yet
SPEECH RECOGNITION SYSTEM Final
16 pages
Term Paper ECE-300 Topic: - Speech Recognition
No ratings yet
Term Paper ECE-300 Topic: - Speech Recognition
14 pages
Phases of Speech Recognition
No ratings yet
Phases of Speech Recognition
2 pages
Build Automatic Speech Recognition System: Bachelor of Technology
No ratings yet
Build Automatic Speech Recognition System: Bachelor of Technology
25 pages
Ann LA2 Project
No ratings yet
Ann LA2 Project
23 pages
Design and Implementation
No ratings yet
Design and Implementation
74 pages
Speech Recognition Technology
No ratings yet
Speech Recognition Technology
23 pages
Speech Recognition: BY Charu Joshi
No ratings yet
Speech Recognition: BY Charu Joshi
26 pages
Speech Recognition As Emerging Revolutionary Technology
No ratings yet
Speech Recognition As Emerging Revolutionary Technology
4 pages
A Study On Automatic Speech Recognition
100% (1)
A Study On Automatic Speech Recognition
2 pages
NLP 1.3.1 - Speed Recogmnition
No ratings yet
NLP 1.3.1 - Speed Recogmnition
20 pages
Speech Recognition
No ratings yet
Speech Recognition
4 pages
Speech Technology
No ratings yet
Speech Technology
5 pages
Automatic Speech Recognition
No ratings yet
Automatic Speech Recognition
9 pages
Speech Technology Overview
No ratings yet
Speech Technology Overview
15 pages
Redaction HTK Amazigh Speech
No ratings yet
Redaction HTK Amazigh Speech
15 pages
Review of Feature Extraction Techniques in Automatic Speech Recognition
100% (1)
Review of Feature Extraction Techniques in Automatic Speech Recognition
6 pages
Visual Word: Unlocking the Power of Image Understanding
From Everand
Visual Word: Unlocking the Power of Image Understanding
Fouad Sabry
No ratings yet
Audio Visual Speech Recognition: Advancements, Applications, and Insights
From Everand
Audio Visual Speech Recognition: Advancements, Applications, and Insights
Fouad Sabry
No ratings yet
Speech Recognition: Fundamentals and Applications
From Everand
Speech Recognition: Fundamentals and Applications
Fouad Sabry
No ratings yet
Silent Speech Interface: Fundamentals and Applications
From Everand
Silent Speech Interface: Fundamentals and Applications
Fouad Sabry
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Automatic Speech Recognition

Uploaded by

Automatic Speech Recognition

Uploaded by

Automatic Speech Recognition

What is the task?

Getting a computer to understand spoken

How do humans do it?

How computers do it?

Multilingual speakers already out-number

Universal Speech Model

Language Identification (LID) classifiers

Monolingual speech recognizers decode along

ASR Multilingual Design

The end-to-end multilingual speech recognition system consists of the

Representation of Speech & Speech

Grammar & Syntax

-How the occurrence of words in sequence is governed

- How a word is supposed to be pronounced as a

-How a unitary sound and/or a sequence of unitary sounds

THE HIDDEN MAROV MODEL

The input audio waveform from a microphone is converted into a sequence of

Arcgitecture of HMM Based

The overall recognition system of speech recognition using HMM includes :

Unit Matching System

Phoneme and Topologies

Composite HMM for Vertibri Recogition (Pronunciation Dictionary)

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.