BDHXB
BDHXB
research object. Speech recognition allows the machine to turn the speech
signal into text or commands through the process of identification and
understanding, and also makes the function of natural voice communication.
Speech recognition involves many fields of physiology, psychology, linguistics,
computer science and signal processing, and is even related to the person's
body language, and its ultimate goal is to achieve natural language
communication between man and machine. The speech recognition technology
is gradually becoming the key technology of the IT man-machine interface. The
paper describes the development of speech recognition technology and its
basic principles, methods, reviewed the classification of speech recognition
systems and voice recognition technology, analyzed the problems faced by the
speech recognition.
Introducing Our Roles
SpeakNote Till Now
Team Members Current
Introduction Progress
Conclusion
Automated Notes Maker From Audio Recordings
Transcribing lectures or speeches
Accessibility for the hearing impaired
Dictation
Improved productivity
Language learning
Translation
Our Team ( Group-09 )
We as an innovative and creative-
thinking group of four developers
and coders have discussed and
Name of College Roll
decided to make our project on this
Member Number topic, that is, “Automated Notes
Maker from Audio Recordings.”
Rajdeep 037
Chowdhury
(EEE 6th
semester)
Rounak Dey 012
(EEE 6th
semester)
Prabhat Jha 038
(EEE 6th
semester)
Sakshi Jha 011
(EEE 6th
semester)
Assistant Professor at Academy of Technology
Our project guide, Prof. Abhijit Patra, shed light upon us
about the nature and path of learning for the project and
the execution of the concepts.
He helped us in getting the right materials from the internet
to study and do our research on to make this project possible.
He gave us the correct motivation for this project. He
reviewed our process and the code and the overall look of
the after periodic intervals of time and told us our mistakes
and areas we can improve on.
The technologies,
languages and third party GitHub or
websites used in building
HTML5
Vercel VS Code
and deploying this The website’s “skeleton” We are suing the standard
is made with HTML5. Code editor for writing all the
website are given as
code and managing the files.
follows: CSS3
CSS3 HTML5 GitHub
The website’s styling ,i.e., We will deploy the project files
UI/UX is done with CSS3. on GitHub platform in a separate
repository.
ReactJs Node.js
We are planning to further build the JS API We are coding the functionalities
project and integrate ReactJs and of the website in JavaScript through
divide the whole page into components. Node.js framework.
VS Code
Speech Recognition API Editor Hosting Website
We have used Web Speech Hosting website to get our
API's Speech Recognition for personalized domain name like
Converting speech to text. Hostinger (www.hostinger.in),
or some other provider.
The detailed development process using HTML5, CSS3,
JavaScript, and API is given as follows
• We did the coding of the main logic and all functionalities of the website
by implementing them in JavaScript and Reactjs. We did the adding of all
the speech to text functionalities of the website by using the API.
• We took care of all the modules and the documentation related to the
project that need to be studied and used in our project concept.
• We did the of designing (UI/UX) and styling of the website on Figma and
developing the CSS part of the coding.
Current Progress
Pro Plan
Regular Plan Advanced Plan
199
For 1 month subscription
499
for 3 months subscription
999
for 6 months subscription
1. www.geeksforgeeks.org
2. www.wikipedia.org
3. www.w3schools.com