Artificial Intelligence-An Introduction: Department of Computer Science & Engineering

Uploaded by

Aravali GF


Artificial Intelligence: An Introduction

Department of Computer Science & Engineering

What is AI?

o Artificial Intelligence is composed of two words, Artificial and Intelligence, where Artificial means "man-made" and Intelligence means "thinking power"; hence AI means "a man-made thinking power."

o Artificial Intelligence exists when a machine has human-like skills such as learning, reasoning, and problem solving.

History of AI

Does AI have applications?
• Autonomous planning and scheduling of tasks aboard a spacecraft

• Beating Garry Kasparov in a chess match

• Steering a driverless car

• Understanding language

• Robotic assistants in surgery

• Monitoring trades in the stock market to detect insider trading

Applications

Goals of AI
Problem solving
o Problem-solving agents: In Artificial Intelligence, search techniques are universal problem-solving methods. Rational agents, or problem-solving agents, mostly use these search strategies or algorithms to solve a specific problem and provide the best result.
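
As a minimal sketch of one such search strategy, the following uses breadth-first search to find a path with the fewest steps between two states. The state graph, the state names, and the function name `breadth_first_search` are illustrative assumptions and do not come from the slides.

```python
from collections import deque


def breadth_first_search(graph, start, goal):
    """Return a path with the fewest steps from start to goal, or None."""
    frontier = deque([[start]])  # queue of partial paths still to extend
    visited = {start}            # states already placed on the frontier
    while frontier:
        path = frontier.popleft()
        state = path[-1]
        if state == goal:
            return path
        for neighbour in graph.get(state, []):
            if neighbour not in visited:
                visited.add(neighbour)
                frontier.append(path + [neighbour])
    return None


# Hypothetical state space: each key is a state, each value lists the
# states reachable from it in one step.
example_graph = {
    "A": ["B", "C"],
    "B": ["D"],
    "C": ["D", "E"],
    "D": ["F"],
    "E": ["F"],
    "F": [],
}

path = breadth_first_search(example_graph, "A", "F")
```

Breadth-first search is only one option; which strategy gives "the best result" depends on the problem and on how step costs are defined.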

An Agent
• "Anything" that can gather information about its environment and take action based on that information.
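
The definition above can be illustrated with a very small agent that maps what it senses to an action. This is only a sketch: the thermostat scenario, the class name `ThermostatAgent`, and its parameters are hypothetical and chosen for illustration.

```python
class ThermostatAgent:
    """Hypothetical simple agent: senses a temperature and picks an action."""

    def __init__(self, target, tolerance=1.0):
        self.target = target        # desired temperature
        self.tolerance = tolerance  # band around the target where nothing is done

    def act(self, percept):
        """Map the current percept (a temperature reading) to an action."""
        if percept < self.target - self.tolerance:
            return "heat"
        if percept > self.target + self.tolerance:
            return "cool"
        return "idle"


agent = ThermostatAgent(target=21.0)
actions = [agent.act(reading) for reading in (17.5, 21.3, 25.0)]
```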

Components of a Basic Speech Recognition System
A speech-capturing device: It consists of a microphone, which converts sound waves into electrical signals, and an analog-to-digital converter, which samples and digitizes the analog signals to obtain discrete data that the computer can process.
A digital signal module or processor: It processes the raw speech signal, for example by converting it to the frequency domain and retaining only the required information.
Preprocessed signal storage: The preprocessed speech is stored in memory for the later stages of speech recognition.
Reference speech patterns: The system holds predefined speech patterns or templates in memory, to be used as references for matching.
Pattern-matching algorithm: The unknown speech signal is compared with the reference speech patterns to determine the actual words or pattern of words.
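
The pattern-matching component can be sketched as a nearest-template lookup. This is an assumption-laden illustration: the feature vectors are made-up numbers, Euclidean distance is just one possible similarity measure, and the names `match_pattern` and `reference_patterns` are hypothetical.

```python
import math


def euclidean_distance(a, b):
    """Straight-line distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def match_pattern(unknown, templates):
    """Return the label of the stored template closest to the unknown features."""
    return min(templates, key=lambda label: euclidean_distance(unknown, templates[label]))


# Made-up reference patterns: word label -> feature vector.
reference_patterns = {
    "yes": [0.9, 0.1, 0.3],
    "no": [0.2, 0.8, 0.5],
    "stop": [0.4, 0.4, 0.9],
}

best = match_pattern([0.85, 0.2, 0.25], reference_patterns)
```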
Working of the System
Speech can be seen as an acoustic waveform, i.e., a signal carrying message information. The microphone converts this acoustic waveform into an analog electrical signal. The analog-to-digital converter then turns the analog signal into digital samples by taking precise measurements of the wave at discrete intervals.
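
The sampling and digitizing step can be sketched as follows. The 440 Hz test tone, the 16-bit depth, and the function name `sample_and_quantize` are assumptions made for illustration; a real converter works on the electrical signal in hardware.

```python
import math


def sample_and_quantize(signal, duration_s, sample_rate_hz, bits):
    """Measure signal(t) at regular intervals and round each value to an integer level.

    The signal is assumed to return amplitudes in the range [-1, 1].
    """
    max_level = 2 ** (bits - 1) - 1                 # e.g. 32767 for 16 bits
    n_samples = round(duration_s * sample_rate_hz)  # number of measurements
    samples = []
    for n in range(n_samples):
        t = n / sample_rate_hz                      # time of the n-th measurement
        value = max(-1.0, min(1.0, signal(t)))      # clip to the valid range
        samples.append(round(value * max_level))
    return samples


def tone(t):
    """Hypothetical analog input: a pure 440 Hz sine wave."""
    return math.sin(2 * math.pi * 440 * t)


digital = sample_and_quantize(tone, duration_s=0.01, sample_rate_hz=16000, bits=16)
```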

The digitized signal consists of a stream of samples taken 16,000 times per second. In this form it is not suitable for the actual speech recognition process, because patterns cannot easily be located. To extract the relevant information, the signal is converted from the time domain to the frequency domain.

This is done by the digital signal processor using the fast Fourier transform (FFT). The digital signal is analyzed in segments of 1/100th of a second, and the frequency spectrum of each segment is computed. In other words, the digitized signal is split into short segments, each described by its frequency amplitudes.
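
The segmentation and spectrum computation described above can be sketched in plain Python. At 16,000 samples per second, 1/100th of a second is 160 samples. For self-containment this sketch uses a naive discrete Fourier transform rather than an actual FFT (an FFT produces the same values, only faster); the 1000 Hz test tone and all function names are assumptions.

```python
import cmath
import math

SAMPLE_RATE_HZ = 16000
FRAME_LEN = SAMPLE_RATE_HZ // 100  # 160 samples = 1/100th of a second


def split_into_frames(samples, frame_len=FRAME_LEN):
    """Cut the signal into consecutive, non-overlapping frames."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, frame_len)]


def magnitude_spectrum(frame):
    """Naive DFT magnitudes for frequency bins 0..N/2."""
    n = len(frame)
    return [abs(sum(x * cmath.exp(-2j * math.pi * k * t / n)
                    for t, x in enumerate(frame)))
            for k in range(n // 2 + 1)]


def dominant_frequency(frame, sample_rate_hz=SAMPLE_RATE_HZ):
    """Frequency in Hz of the strongest bin in the frame's spectrum."""
    spectrum = magnitude_spectrum(frame)
    peak_bin = max(range(len(spectrum)), key=spectrum.__getitem__)
    return peak_bin * sample_rate_hz / len(frame)


# Hypothetical input: 320 samples (two frames) of a 1000 Hz tone.
signal = [math.sin(2 * math.pi * 1000 * n / SAMPLE_RATE_HZ) for n in range(320)]
frames = split_into_frames(signal)
peak_hz = dominant_frequency(frames[0])
```

With 160-sample frames, neighbouring frequency bins are 100 Hz apart, which is the resolution this segment length gives.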

Each segment, or frequency graph, represents one of the different sounds made by human beings. The computer matches the unknown segments against the stored phonemes of the particular language.
Factors on which a Speech Recognition System Depends
The speech recognition system depends on the following factors:

Isolated Words: Consecutive words must be separated by a pause, because words spoken continuously can overlap, making it difficult for the system to tell where a word starts or ends.

Single Speaker: Several speakers giving speech input at the same time cause overlapping signals and interruptions. Most of the speech recognition systems in use are speaker-dependent systems.

Vocabulary size: Languages with a large vocabulary are harder to handle in pattern matching than those with a small vocabulary, because ambiguous words are less likely in the latter.
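
To illustrate why pauses help with the isolated-words factor, the sketch below finds word boundaries by looking for runs of quiet frames. The per-frame energy values, the threshold, and the function name `split_on_silence` are made up for illustration.

```python
def split_on_silence(frame_energies, threshold):
    """Return (start, end) frame index pairs for runs louder than threshold.

    The end index is exclusive, as in Python slicing.
    """
    words = []
    start = None
    for i, energy in enumerate(frame_energies):
        if energy > threshold and start is None:
            start = i                   # a word begins
        elif energy <= threshold and start is not None:
            words.append((start, i))    # silence reached: the word ends
            start = None
    if start is not None:               # the signal ended in the middle of a word
        words.append((start, len(frame_energies)))
    return words


# Hypothetical energies: silence, one word, a pause, a second word, silence.
energies = [0.0, 0.1, 0.9, 0.8, 0.7, 0.05, 0.0, 0.6, 0.9, 0.1]
boundaries = split_on_silence(energies, threshold=0.2)
```

Without the pause at frames 5 and 6, the two words would merge into a single run and the boundary between them would be lost.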
Components of ASR

LEXICON MODEL, ACOUSTIC MODEL, & LANGUAGE MODEL

Lexicon
The lexicon is the primary step in decoding speech. Creating a comprehensive lexical design for an ASR system
involves including the fundamental elements of both spoken language (the audio input the ASR system receives)
and written vocabulary (the text the system sends out).

Acoustic Model
Acoustic modeling involves separating an audio signal into small time frames. Acoustic models analyze each frame and estimate the probability that each phoneme is being spoken in that section of audio. Simply put, acoustic models aim to predict which sound is spoken in each frame.
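
The idea of per-frame phoneme probabilities can be sketched as follows. The raw scores are invented numbers standing in for the output of a trained model, the phoneme labels are hypothetical, and softmax is used here only as one common way to turn scores into probabilities.

```python
import math


def softmax(scores):
    """Convert raw scores into probabilities that sum to 1."""
    peak = max(scores.values())  # subtract the maximum for numerical stability
    exps = {label: math.exp(value - peak) for label, value in scores.items()}
    total = sum(exps.values())
    return {label: e / total for label, e in exps.items()}


def most_likely_phonemes(frame_scores):
    """Pick the most probable phoneme for each frame."""
    result = []
    for scores in frame_scores:
        probs = softmax(scores)
        result.append(max(probs, key=probs.get))
    return result


# Made-up scores for three frames of audio.
frame_scores = [
    {"k": 2.0, "ae": 0.1, "t": 0.3},
    {"k": 0.2, "ae": 1.7, "t": 0.4},
    {"k": 0.1, "ae": 0.3, "t": 2.2},
]

phonemes = most_likely_phonemes(frame_scores)
```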

Language Model

Today’s ASR systems employ natural language processing (NLP) to help computers understand the context of what
a speaker says. Language models recognize the intent of spoken phrases and use that knowledge to compose word
sequences. They operate in a similar way to acoustic models by using deep neural networks trained on text data to
estimate the probability of which word comes next in a phrase.
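
Estimating which word comes next can be shown with a much simpler stand-in than a deep neural network. The sketch below is a count-based bigram model, not the neural approach described above; the tiny corpus and the function names are assumptions for illustration.

```python
from collections import Counter, defaultdict


def train_bigram_model(sentences):
    """Count how often each word follows each other word."""
    counts = defaultdict(Counter)
    for sentence in sentences:
        words = sentence.lower().split()
        for prev, nxt in zip(words, words[1:]):
            counts[prev][nxt] += 1
    return counts


def next_word_probability(model, prev, nxt):
    """Estimate P(nxt | prev) from the counts; 0.0 if prev was never seen."""
    total = sum(model[prev].values())
    return model[prev][nxt] / total if total else 0.0


# Hypothetical training text.
corpus = [
    "turn on the light",
    "turn off the light",
    "turn on the radio",
]

model = train_bigram_model(corpus)
p_on = next_word_probability(model, "turn", "on")
```

In this corpus "on" follows "turn" in two of three sentences, so the model prefers "turn on" over "turn off" when the audio is ambiguous.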

Together, the lexicon, acoustic model, and language model enable ASR systems to make close-to-accurate
predictions about the words and sentences in an audio input.
How Does ASR Work?
In the simplest terms, speech recognition occurs when a computer receives audio input from a
person speaking, processes that input by breaking down the various components of speech, and then
transcribes that speech to text.

Some ASR systems are speaker-dependent and must be trained to recognize particular words and speech patterns. These are essentially the voice-recognition systems used in your smart devices. You need to say specific words and phrases into your phone before the ASR-powered voice assistant starts working, so that it can learn to identify your voice.

Other ASR systems are speaker-independent. These systems do not require any training by the user and can recognize spoken words regardless of the speaker. Speaker-independent systems are practical solutions for business applications like interactive voice response (IVR).
ASR Use Cases
From speech recognition’s mid-twentieth-century origins to its multi-industry applications today, the use cases for ASR technology are far-reaching. ASR has made it out of computer science laboratories and is now integrated into our everyday lives.

• Voice Assistants: According to a 2020 survey conducted by NPR and Edison Research, 63% of respondents said they use a voice assistant. The ability to use voice commands to help complete tasks like opening mobile apps, sending a text message, or searching the web affords users a greater level of convenience.

• Language Learning: For people engaged in self-guided language study, apps using speech-recognition tools bring them a step closer to a comprehensive learning experience. Apps like Busuu and Babbel use ASR technology to help students practice their pronunciation and accents in their target languages. Using these apps, a student speaks into their phone or computer in the target language. The ASR software listens to that voice input and analyzes it. If the input matches what the system identifies as the correct pronunciation, it informs the learner; if it does not match, it informs the student of the mispronunciation.

• Transcription Services: One of the first widespread use cases of ASR was the simple transcription of speech. Speech-to-text services offer a level of convenience in many contexts and open the door to improved audio and video accessibility. Health care practitioners use dictation products like Dragon NaturallySpeaking to help them take hands-free notes while attending to patients. ASR captioning also allows for real-time transcription of live video, which lets a broader audience access the media.

• Call Centers: ASR is crucial for automating processes in businesses with extensive customer support demands. With an influx of callers, companies need a way to handle a vast amount of customer communication efficiently. ASR technology is one of the main mechanisms involved in smart IVR, a system that automates routine inbound communications as well as large-scale outbound call campaigns.
Challenges & Issues in ASR

• Imprecision and false interpretations

• Time and lack of efficiency
• Accents and local differences
• Background noise and loud environments
• Privacy and data security
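
Imprecision in a transcript is often quantified with the word error rate (WER): the number of word substitutions, deletions, and insertions needed to turn the system's output into the reference text, divided by the number of reference words. The sketch below is one straightforward way to compute it; the example sentences and the function name `word_error_rate` are assumptions for illustration.

```python
def word_error_rate(reference, hypothesis):
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # previous[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    previous = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        current = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution or match
        previous = current
    return previous[-1] / len(ref)


wer = word_error_rate("turn on the light", "turn on a light")
```

Here one of four reference words is wrong, giving a WER of 0.25.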

Aravali College of Engineering And Management
Jasana, Tigoan Road, Neharpar, Faridabad, Delhi NCR
Toll Free Number : 91- 8527538785
Website : www.acem.edu.in
