
Hidden Markov Models I

Recognition of isolated words

Recommended texts:
Spoken Language Processing, Chapter 8
Speech and Language Processing, Appendix A
Statistical recognition
• I. Recognition of isolated words
– Training: creation of a model for each word.
– Recognition: determine the model that best matches the utterance

• II. Large Vocabulary Speech Recognition (LVSR)


– Subword units (phones) in context, e.g., m-a+r
– Phonetic dictionary
– Language model P(house | the white)
Isolated words. General scheme

[Block diagram: input speech → Parameterizer. During training, the parameterized speech is used to build one model per word (Model W1, Model W2, ..., Model WN). During recognition, a Classifier compares the utterance with all word models and a Decision stage outputs the recognized speech.]
Hidden Markov Model (HMM)
[Diagram: three-state HMM with self-transitions T11, T22, T33, forward transitions T12, T23, and a skip transition T13.]

• Generic representation of a statistical model for processes that
generate time series. The HMM is a sequence model.
• The “segments” in the time series are referred to as states: the
process passes through these states to generate the time series
• The entire structure may be viewed as a generalization of DTW
models
Bhiksha Raj and Rita Singh (CMU)
Hidden Markov Models

• A Hidden Markov Model consists of two components
– A state/transition backbone that specifies how many states there
are, and how they can follow one another
– A set of probability distributions, one for each state, which
specifies the distribution of all vectors in that state

[Diagram: the Markov chain (state/transition backbone) combined with the per-state data distributions forms the HMM.]
HMM as a statistical model
• An HMM is a statistical model for a time-varying process
• The process is always in one of a countable number of states at
any time

• When the process visits any state, it generates an
observation by a random draw from a distribution associated
with that state

• The process constantly moves from state to state. The
probability that the process will move to any state is
determined solely by the current state
– i.e. the dynamics of the process are Markovian

• The entire model represents a probability distribution over the
sequence of observations
– It has a specific probability of generating any particular sequence
– The probabilities of all possible observation sequences sum to 1
How an HMM models a process

[Diagram: the HMM assumed to be generating the data; a state sequence is drawn from the Markov chain, each state emits from its state distribution, and this produces the observation sequence.]
HMMs are abstractions
• The states are not directly observed
– Here states of the process are analogous to configurations of the vocal tract that
produces the signal
– We only hear the speech; we do not see the vocal tract
– i.e. the states are hidden

• The interpretation of states is not always obvious
– The vocal tract actually goes through a continuum of configurations
– The model represents all of these using only a fixed number of states

• The model abstracts the process that generates the data
– The system goes through a finite number of states
– When in any state it can either remain at that state, or go to another with some
probability
– When in any state it generates observations according to a distribution associated with
that state


HMM Parameters

[Diagram: three-state HMM with self-loop probabilities 0.6, 0.7 and 0.5, forward transitions 0.4 and 0.3, and a transition 0.5 back to the first state.]

• The topology of the HMM
– No. of states and allowed transitions
– E.g. here we have 3 states and cannot go from the blue state to the red
• The transition probabilities
– Often represented as a matrix, as here:
  $T = \begin{pmatrix} 0.6 & 0.4 & 0 \\ 0 & 0.7 & 0.3 \\ 0.5 & 0 & 0.5 \end{pmatrix}$
– $T_{ij}$ is the probability that when in state i, the process will move to j
• The probability of beginning at a particular state
• The state output distributions
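As a concrete illustration of these parameters, here is a minimal NumPy sketch; the transition matrix is the one from the slide, while the uniform initial-state distribution is an assumption added for illustration:

```python
import numpy as np

# Transition matrix T from the slide: row i gives P(next state = j | current state = i).
T = np.array([
    [0.6, 0.4, 0.0],   # from state 1: stay (0.6) or move to state 2 (0.4)
    [0.0, 0.7, 0.3],   # from state 2: stay (0.7) or move to state 3 (0.3)
    [0.5, 0.0, 0.5],   # from state 3: stay (0.5) or go back to state 1 (0.5)
])

# Initial state probabilities (assumed uniform here; the slide only says they exist).
pi = np.full(3, 1.0 / 3.0)

# Each row of T and the vector pi must sum to 1.
assert np.allclose(T.sum(axis=1), 1.0) and np.isclose(pi.sum(), 1.0)
```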


HMM state output distributions
• The state output distribution represents the distribution of data produced from
any state. We can have:
• Discrete probabilities (DHMM), e.g., Vector Quantization (k-means)
• Continuous probabilities, e.g., Gaussian Mixture Model (GMM), DNN

Bhiksha Raj and Rita Singh (CMU)
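As an illustration of a continuous output distribution, here is a minimal sketch of evaluating the state output likelihood with a diagonal-covariance GMM; the mixture size and all parameter values are made-up assumptions:

```python
import numpy as np

def gmm_log_likelihood(o, weights, means, variances):
    """Log b_j(o) for one state whose output distribution is a
    diagonal-covariance Gaussian mixture: sum_m w_m N(o; mu_m, sigma_m^2)."""
    # Per-mixture log N(o; mu_m, diag(sigma_m^2))
    log_norm = -0.5 * (np.log(2 * np.pi * variances) + (o - means) ** 2 / variances).sum(axis=1)
    # Log-sum-exp over the mixture components, weighted by w_m
    return np.logaddexp.reduce(np.log(weights) + log_norm)

# Toy example: 2 mixture components over 3-dimensional feature vectors (values made up).
weights = np.array([0.4, 0.6])
means = np.array([[0.0, 1.0, -1.0], [2.0, 0.0, 0.5]])
variances = np.ones((2, 3))
print(gmm_log_likelihood(np.array([0.5, 0.5, 0.0]), weights, means, variances))
```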


Discrete Hidden Markov models

[Diagram: three-state ergodic HMM with transition probabilities a11, a12, a13, a21, a22, a23, a31, a32, a33 and output distributions b1(ot), b2(ot), b3(ot).]

• N states {1...N}
• At each time instant t, the system is in a specific state $q_t$
• At equally spaced time intervals, the system may change its state with probability $a_{ij}$:
  $a_{ij} = P(q_t = j \mid q_{t-1} = i), \quad a_{ij} \ge 0$
• Every time a transition occurs, the system generates an observation from a
finite alphabet that depends on the state to which it has moved:
  $b_j(O_t) = P(O_t \mid q_t = j)$
• There is no longer a one-to-one correspondence between the observation
sequence and the state sequence, so you cannot uniquely determine the
state sequence from a given observation sequence; i.e., the state sequence is
not observable and therefore hidden.
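A minimal sketch of this generative process for a discrete HMM is shown below; the parameter values are toy assumptions, not from the slides:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy discrete HMM: 3 states, alphabet of 4 symbols (all values made up).
A  = np.array([[0.6, 0.4, 0.0],
               [0.0, 0.7, 0.3],
               [0.5, 0.0, 0.5]])          # A[i, j] = P(q_t = j | q_{t-1} = i)
B  = np.array([[0.7, 0.1, 0.1, 0.1],
               [0.1, 0.7, 0.1, 0.1],
               [0.1, 0.1, 0.4, 0.4]])     # B[j, k] = P(o_t = k | q_t = j)
pi = np.array([1.0, 0.0, 0.0])            # start in state 0

def sample(T=10):
    """Generate a (state sequence, observation sequence) pair of length T."""
    states, obs = [], []
    q = rng.choice(3, p=pi)
    for _ in range(T):
        obs.append(rng.choice(4, p=B[q]))   # emit from the current state
        states.append(q)
        q = rng.choice(3, p=A[q])           # move according to the Markov chain
    return states, obs

print(sample())
```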
Types of models

• Ergodic: used e.g. for phonotactic recognition.
[Diagram: three-state fully connected HMM with transitions a11, a12, a13, a21, a22, a23, a31, a32, a33.]

• Left-right (or Bakis): used for modeling phonetic units.
[Diagram: n-state left-right HMM with self-loops a11, a22, a33, a44, ..., ann and forward transitions a12, a23, a34, ..., a(n-1)n.]
Problems to be solved
• Evaluation: Given a sequence of observations O = o1 o2...oT and a model λ,
what is the probability P(O|λ) that the model generates the observations?
• Decoding: Given a sequence of observations O = o1 o2...oT and a model λ,
find the optimum sequence of states Q = q1,...,qT
• Training: Given the model λ = (π, A, B) and a set of training sequences,
how to adjust the model parameters λ to maximize the probability P(O|λ)?
Evaluation
Given the sequence O = o1 o2...oT and the model
λ = (π, A, B), calculate P(O)
Solution: Let's suppose a sequence of states
Q = q1 q2...qT that has created the observations.

$P(Q) = \pi_{q_1} a_{q_1 q_2} \cdots a_{q_{T-1} q_T}$

$P(O, Q) = \pi_{q_1} b_{q_1}(o_1)\, a_{q_1 q_2} b_{q_2}(o_2) \cdots a_{q_{T-1} q_T} b_{q_T}(o_T)$

$P(O) = \sum_{\text{all } Q} \pi_{q_1} b_{q_1}(o_1)\, a_{q_1 q_2} b_{q_2}(o_2) \cdots a_{q_{T-1} q_T} b_{q_T}(o_T)$
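A direct, brute-force evaluation of this sum, enumerating every possible state sequence Q, can be sketched as follows (the toy parameters are assumptions); it is only feasible for tiny N and T, which motivates the efficient solution on the next slide:

```python
import itertools
import numpy as np

# Toy discrete model: A transitions, B discrete outputs, pi initial probabilities.
A  = np.array([[0.6, 0.4], [0.3, 0.7]])
B  = np.array([[0.9, 0.1], [0.2, 0.8]])
pi = np.array([0.5, 0.5])
O  = [0, 1, 1, 0]                      # an observation sequence of length T = 4
N, T = len(pi), len(O)

# P(O) = sum over all N^T state sequences Q of P(Q) * P(O | Q)
p_O = 0.0
for Q in itertools.product(range(N), repeat=T):
    p = pi[Q[0]] * B[Q[0], O[0]]
    for t in range(1, T):
        p *= A[Q[t-1], Q[t]] * B[Q[t], O[t]]
    p_O += p
print(p_O)
```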
Efficient solution
• The direct method requires on the order of 2T·N^T operations
– 1 s of speech (100 observations) and 5 states: ≈10^72 calculations

• Forward-backward algorithm
– Accumulates operations over sequences with shared partial paths
– #operations: N^2·T = 2500 operations
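A minimal sketch of the forward pass (the α recursion) for a discrete HMM follows; the parameters are the same kind of toy assumption as above, so the result can be checked against the brute-force sum:

```python
import numpy as np

def forward_prob(O, A, B, pi):
    """P(O | lambda) via the forward algorithm: alpha[t, j] = P(o_1..o_t, q_t = j)."""
    T, N = len(O), len(pi)
    alpha = np.zeros((T, N))
    alpha[0] = pi * B[:, O[0]]                      # initialization
    for t in range(1, T):
        alpha[t] = (alpha[t-1] @ A) * B[:, O[t]]    # induction: sum over previous states
    return alpha[-1].sum()                          # termination: sum over final states

# Toy check against the brute-force enumeration above (same made-up parameters).
A  = np.array([[0.6, 0.4], [0.3, 0.7]])
B  = np.array([[0.9, 0.1], [0.2, 0.8]])
pi = np.array([0.5, 0.5])
print(forward_prob([0, 1, 1, 0], A, B, pi))
```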
Most likely sequence of states

• Given a sequence of observations, find, at every instant t, the most likely
sequence of states up to that instant

• Viterbi’s algorithm

• #operations: N^2·T (multiplications or additions)


Viterbi’s algorithm
Sequence of states with the highest probability:

$\delta_t(j) = \max_{q_1, q_2, \ldots, q_{t-1}} P(q_1 q_2 \ldots q_{t-1}, q_t = j, o_1 o_2 \ldots o_t)$

Initialization: $\delta_1(i) = \pi_i b_i(o_1)$, $\psi_1(i) = 0$, $1 \le i \le N$

Recursion: $\delta_t(j) = \max_{1 \le i \le N} [\delta_{t-1}(i)\, a_{ij}]\, b_j(o_t)$, $2 \le t \le T$, $1 \le j \le N$

$\psi_t(j) = \arg\max_{1 \le i \le N} [\delta_{t-1}(i)\, a_{ij}]$

Ending: $P^* = \max_{1 \le i \le N} [\delta_T(i)]$, $q_T^* = \arg\max_{1 \le i \le N} [\delta_T(i)]$

Sequence (backtracking): $q_t^* = \psi_{t+1}(q_{t+1}^*)$, $t = T-1, T-2, \ldots, 1$
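The same recursion as a minimal sketch for a discrete HMM; the toy parameters are assumptions, and the indices are 0-based here whereas the slide uses 1-based notation:

```python
import numpy as np

def viterbi(O, A, B, pi):
    """Most likely state sequence and its probability for a discrete HMM."""
    T, N = len(O), len(pi)
    delta = np.zeros((T, N))             # delta[t, j] = best score ending in state j at time t
    psi   = np.zeros((T, N), dtype=int)  # psi[t, j]  = best predecessor of state j at time t
    delta[0] = pi * B[:, O[0]]           # initialization
    for t in range(1, T):
        scores = delta[t-1][:, None] * A            # scores[i, j] = delta[t-1, i] * a_ij
        psi[t] = scores.argmax(axis=0)              # best previous state for each j
        delta[t] = scores.max(axis=0) * B[:, O[t]]  # recursion
    # Termination and backtracking
    q = np.zeros(T, dtype=int)
    q[-1] = delta[-1].argmax()
    for t in range(T - 2, -1, -1):
        q[t] = psi[t + 1, q[t + 1]]
    return q, delta[-1].max()

A  = np.array([[0.6, 0.4], [0.3, 0.7]])
B  = np.array([[0.9, 0.1], [0.2, 0.8]])
pi = np.array([0.5, 0.5])
print(viterbi([0, 1, 1, 0], A, B, pi))
```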
Initialization (trellis view)

[Trellis diagram: states 1–4 on the vertical axis, observation times 1, 2, ..., T on the horizontal axis.]

Initialization: for every state j, we calculate the probability $\pi_j b_j(O_1)$ of generating the first observation.
Recursion (trellis view)

[Trellis diagram: at time t, state 2 receives candidate scores $\delta_{t-1}(i)\, a_{i2}$ from states i = 1...4, and the best one is multiplied by $b_2(O_t)$ to give $\delta_t(2)$.]

At time t, we calculate the highest probability of arriving at state n from the other states and generating the observation $O_t$. In $\psi_t(n)$ we note down the state from which we achieve the highest probability.
Ending: backtracking (trellis view)

[Trellis diagram: states 1–4 on the vertical axis, times 1, 2, ..., T on the horizontal axis; the best final state at time T is selected and the stored back-pointers are followed back to time 1.]

Sequence (backtracking): $q_t^* = \psi_{t+1}(q_{t+1}^*)$, $t = T-1, T-2, \ldots, 1$
Training or parameter estimation

• Given the sequence O = o1 o2...oT, adjust the model
λ = (π, A, B) to maximize P(O|λ)
• Iterative solution: Baum-Welch
– start from an initial model λ
– pass the training sequence and re-estimate the model
parameters, obtaining λ̄
– $P(O \mid \bar{\lambda}) \ge P(O \mid \lambda)$
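For reference, the standard Baum-Welch re-estimation step (not written out on these slides) uses the state posteriors $\gamma_t(i) = P(q_t = i \mid O, \lambda)$ and transition posteriors $\xi_t(i,j) = P(q_t = i, q_{t+1} = j \mid O, \lambda)$, both obtained with the forward-backward algorithm, and re-estimates a discrete model as:

$\bar{\pi}_i = \gamma_1(i), \qquad \bar{a}_{ij} = \dfrac{\sum_{t=1}^{T-1} \xi_t(i,j)}{\sum_{t=1}^{T-1} \gamma_t(i)}, \qquad \bar{b}_j(k) = \dfrac{\sum_{t:\, o_t = v_k} \gamma_t(j)}{\sum_{t=1}^{T} \gamma_t(j)}$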
Initialization
• Manual
• Automatic
– Uniform matrix A
– Segmentation of the training sequences into the N states
– Grouping of the spectra in every state
– Estimation of B
– Iterate until convergence
• Iterating with the Viterbi algorithm to segment the training sequences into states
improves the initialization (see the sketch below)
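A minimal sketch of the automatic ("flat-start") initialization for a discrete HMM: a uniform transition matrix, uniform segmentation of each training sequence into the N states, and per-state symbol counts to estimate B. The function name, the add-one smoothing, and the assumption that all transitions are allowed are illustrative choices, not from the slides:

```python
import numpy as np

def flat_start_init(train_seqs, N, K):
    """Initialize a discrete HMM with N states and a K-symbol alphabet."""
    # Uniform transition matrix A (here assuming all transitions are allowed).
    A = np.full((N, N), 1.0 / N)
    # Estimate B by uniformly segmenting each sequence into N equal chunks
    # and counting which symbols fall into each state's chunk.
    counts = np.ones((N, K))                 # add-one smoothing to avoid zeros
    for seq in train_seqs:
        bounds = np.linspace(0, len(seq), N + 1).astype(int)
        for j in range(N):
            for o in seq[bounds[j]:bounds[j + 1]]:
                counts[j, o] += 1
    B = counts / counts.sum(axis=1, keepdims=True)
    pi = np.full(N, 1.0 / N)
    return pi, A, B

# Toy usage: two short symbol sequences, 3 states, alphabet of 4 symbols.
print(flat_start_init([[0, 0, 1, 2, 3, 3], [0, 1, 1, 2, 2, 3]], N=3, K=4))
```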
Isolated words. General scheme

[Block diagram: input speech → Parameterizer → O = o1 o2...oT, which is scored against every word model HMM λ0, λ1, ..., λM, producing P(O|λ0), P(O|λ1), ..., P(O|λM); the word whose model gives the highest score is chosen.]

• Each HMM is trained with its own set of recordings of isolated words.
• Each test recording contains a single isolated word.
• P(O|λi) can be computed with the forward algorithm. In practice, the Viterbi
algorithm ( max_Q P(Q,O|λi) ) is faster and gives the same accuracy.
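A minimal sketch of this decision rule; word_models, the word labels, and score_fn are placeholders, with score_fn standing for a scoring function such as the forward_prob sketched earlier:

```python
import numpy as np

def recognize(O, word_models, score_fn):
    """Pick the word whose HMM gives the highest score for observation sequence O.

    word_models maps a word label to its (pi, A, B) parameters; score_fn is a
    scoring function such as forward_prob from the earlier sketch."""
    best_word, best_score = None, -np.inf
    for word, (pi, A, B) in word_models.items():
        score = score_fn(O, A, B, pi)
        if score > best_score:
            best_word, best_score = word, score
    return best_word, best_score

# Usage (assuming forward_prob from the earlier sketch and trained word models):
# word, score = recognize(O, {"yes": (pi_yes, A_yes, B_yes), "no": (pi_no, A_no, B_no)}, forward_prob)
```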
