0% found this document useful (0 votes)
11 views22 pages

Major Presentation

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
11 views22 pages

Major Presentation

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 22

CAPSTONE

PROJECT Phase II
Batch- 70
Team Members Roll numbers
MADURI RAM CHARAN TEJA 2003A52026

SUHAAS SANGA 2003A52132

RENUKUNTLA DHANUSH 2003A52053

KOTHAPALLY PREM SAI 2003A52052

GURRAPU ADITYA KRISHNA 2003A52085

- Under the Guidance of Dr. R. Vijaya Prakash


Project
Title

Gesture Controlled Presentation


System with Speech Recognition
and Webcam Interaction
Problem Identification

• Most of the people while giving presentation they are


using highlighters or digital pen to write the things,
people are using remotes to move the slides instead of
that we can just control the presentation by hand
gestures.
Literature Review
• In the realm of academia and research, presenting complex information
from research papers in an engaging and accessible manner is of
paramount importance. Hand gesture-controlled presentations, powered by
hand-dedicated models in OpenCV, offer a novel approach to enhance the
presentation of research papers. As a litterateur, I am intrigued by the
potential of this technology to transform the dissemination of knowledge,
making it more interactive, captivating, and immersive. This review aims to
explore the advantages, challenges, and implications of using hand gesture-
controlled presentations for research paper communication.
Literature Review

• Hand gesture-controlled presentations, powered by hand-


dedicated models in OpenCV, represent a captivating and
transformative approach to presenting research papers. As a
literateur, I am intrigued by the enhanced interactivity this
technology offers. It has the potential to the way research
findings are communicated, breaking down barriers between
researchers and audiences. As this technology advances, I
eagerly anticipate its impact on scholarly communication and
the advancement of human understanding.
Objective of the problem

• We are thinking to build a program where we can control


the presentation by our hand gestures.
• We can define the gestures dynamically so that we can
perform the operations in the presentation like to move to
the next slide and previous slide.
• We can also write and delete on the slides. We can even
point out the features or content on the slide that we are
explaining only by using hand gestures.
Proposed Plan

• The proposed plan for Capstone Project aims to develop


an efficient idea where we can control the presentation
using HandDetector, HandTracking Module, OpenCV
concepts.
• We can even detect the gestures clearly based that we can
move to left and write, draw the things and delete the
things.
• This project will help people to work and present the things
in an easy and efficient manner.
Proposed Plan
● PNG IMAGES of ppt (input).

● Uploading the pngs into the folder.

● Arranging the pngs in order.

● Folder path to the program.

● Program starts detecting the hand gestures and movie according to that.
STEP BY STEP WORKING :
Draw or Write Gesture Delete Gesture
Terminate or Exit Gesture Pointer Gesture
Next Slide Gesture Previous Slide Gesture
Speech command Control

Next slide

Previous slide

Go to required slide

Terminate the presentation

Delete All

Delete
Proposed Approach/Algorithm/model

● The algorithm we are using this project is:


● from cvzone.HandTrackingModule import
HandDetector
● Speech Recognition Module.
● speech_recognition Module
● My using HandTracking module we will identify the
no.of fingers up or down based on that we will perform
accurate actions on the presentation slides.
● We use list concept to execute this project complete
and it makes us to sorted the presentation slides.
Speech Recognition Module

● Audio Detection Range: Speech recognition


modules can typically detect and transcribe audio
signals within a wide range of frequencies, covering
the audible spectrum of human speech, which is
roughly 20 Hz to 20 kHz.
● Internal Algorithm: The internal algorithm of
speech recognition modules often involves complex
techniques such as feature extraction using Mel-
frequency cepstral coefficients (MFCCs), acoustic
modeling with Hidden Markov Models (HMMs) or
deep neural networks (DNNs), language modeling
with N-gram models or neural networks, and
decoding using algorithms like the Viterbi algorithm
or beam search.
● Steps of the Algorithm:
● Feature Extraction: Convert raw audio signals into feature
vectors representing acoustic characteristics.
● Acoustic Modeling: Create models to map features to phonemes
or words using statistical methods or neural networks.
● Language Modeling: Incorporate language models to predict the
likelihood of word sequences given the acoustic features.
● Decoding: Determine the most probable word sequence using
algorithms like Viterbi or beam search.
● Integration: Integrate with external services or APIs for improved
accuracy or additional functionality.
Input data/Tool Used:


Input data In this project is ppt slides in Png format in a folder.

Then in the next step the images will be arranged in order and stored in
folder .

The presentation is ready.

TOOL USED:

Python

The images will be will arranged in order use the python.
Result
Conclusion

● The primary objective of this project is less time to


detect and train with more accurate.
● In this project by using opencv modules we are
able to achieve less train and more accurate
model.
● By using HandTracking Module and Speech
Recognition module we outwork other models.
● This project completely work on gesture detect
and action perform method.
Future Scope:


Improve the long distance gesture
recognition.

Develop overall improvement of
application.

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy