0% found this document useful (0 votes)
18 views3 pages

Paper 5728

The document discusses the development of a text-to-speech conversion application using Python, aimed at assisting individuals with visual and learning disabilities by converting text from various formats into audio files. It highlights the use of the gTTS API and various Python libraries to facilitate this conversion process. The paper also emphasizes the importance of making such technology accessible and plans for future enhancements, including support for localized languages.

Uploaded by

brightmafunga6
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
18 views3 pages

Paper 5728

The document discusses the development of a text-to-speech conversion application using Python, aimed at assisting individuals with visual and learning disabilities by converting text from various formats into audio files. It highlights the use of the gTTS API and various Python libraries to facilitate this conversion process. The paper also emphasizes the importance of making such technology accessible and plans for future enhancements, including support for localized languages.

Uploaded by

brightmafunga6
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

ISSN (Online) 2581-9429

IJARSCT
International Journal of Advanced Research in Science, Communication and Technology (IJARSCT)

Volume 2, Issue 1, July 2022


Impact Factor 6.252

Text to Speech Conversion using Python


Sonali Wangikar1, Anjali Bobade2, Namrata Pawar3, Akshay Dhanmani4
Professor, Department of E&TC, Dr. Daulatrao Aher College of Engineering, Karad, India1
Students, Department of E&TC, Dr. Daulatrao Aher College of Engineering, Karad, India2,3,4

Abstract: Text to speech conversion means it is the application that converts text int audio file by
extracting and processing that text. The PDF file is convert this processed text into audio presentations of
text so we are developed a text to audio in the form of application converts input text into audio and then
it can be saved as mp3.file. This application is so useful for people who have visual disabilities and this
text to audio is easier for them.

Keywords: Toothpaste, Trikatu, Excudability, Foamility, Clove oil

I. INTRODUCTION
Text to speech conversion it is a conversion of text in the form of PDF file merge JPG PNG Extra file is converted into
audio file text to speech it it is a technology which can help to people who have learning disabilities and you have some
problems to read out the large text file this text to speech converter usually generate audio data as the output of system
text to speech converter it is a technology which is firstly scan the English test that are in the PDF for image and exchange
it into voice was found this system helps to people at various places for example in educational applications transfer
application or any other places that takes to scan from newspaper boobs image of any text captured by mobile audio is
finally created with the converter by the information getting from various place.

II. OBJECTIVE
People with learning disabilities significantly boost of the reading ability easy wait world improve the literacy problem
it help to focus on their studies to students

III. MOTIVATION
In this system our main goal is to help people with virtual learning disabilities and introduce people on new technology
for their day-to-day life it includes collecting the data in the form of PDF text or image text and then convert into a simple
audio from it helps to reduce time of reading it also helps to students the focusing on the content and compass
compassionate about that result this system is raising the components of the students does we develop the system to
introduce a new learning method to people in their life

IV. BLOCK DIAGRAM

Copyright to IJARSCT DOI 10.48175/IJARSCT-5728 599


www.ijarsct.co.in
ISSN (Online) 2581-9429
IJARSCT
International Journal of Advanced Research in Science, Communication and Technology (IJARSCT)

Volume 2, Issue 1, July 2022


Impact Factor 6.252
4.1 Block Diagram Description
This ia block diagram of Text to speech converter using python.There are different APIs available to translate text to
speech in Python programming language. One of the technology to convert Google Text to Speech is API which is
commonly known as the gTTS API. This gTTS API is handily to use tool which converts the given text into audio file
which can be saved as a .mp3 file..In above block diagram we gives image file or pdf files to the computer system as a
input. Then computer system convert them into audio files using some python programming and python libraries. Such
as Pypdf, pdfminer, gTTS, openCV, web-browser, PyQt5.

V. FLOW CHART

Copyright to IJARSCT DOI 10.48175/IJARSCT-5728 600


www.ijarsct.co.in
ISSN (Online) 2581-9429
IJARSCT
International Journal of Advanced Research in Science, Communication and Technology (IJARSCT)

Volume 2, Issue 1, July 2022


Impact Factor 6.252
VI. USED SOFTWARE/PROGRAMMING LANGUAGE
6.1 Python Programming Language
Python programming language is dynamically typed and high level general purpose language it supports various
programming paradigm include structure and object oriented and functional programming Python is simple or easy to
learn Syntax and readability introduced cost of the maintenance in 2022 python 3.10 version is released it meant to be
easy readable language it serves a scripting language for web application library such as pypdf numpy scipy allowed the
effective uses of python in scientific computing the Python is commonly used in artificial intelligence(AI) projects on
machine learning(ML) project with the help of various libraries and these technologies are spreading across multiple
industries and they have equally dominate by python.

VII. RESULT

VIII. CONCLUSION
Text to speech converter is rapidly growing aspect of computer technology and increasingly played important role in the
way we interconnect with the system and interfaces across various platforms to identify various operations and processes
involving into text to speech converter we also developed a very simple and easy graphical user interface which allows
the user to type his or her text provided in the text fields in application another area of further work is the implementation
of text to speech system on other platforms to increase functionality and added advantages in future we plan to make
efforts to create engines for localised languages to make Text-to-Speech system more accessible to a wider range

REFERENCES
[1]. Lemmetty, S., 1999. Review of Speech Syn1thesis Technology. Masters Dissertation, Helsinki University Of
Technology.
[2]. Dutoit, T., 1993. High quality text-to-speech synthesis of The French language. Doctoral dissertation, Faculte
Polytechnique de Mons.
[3]. Suendermann, D., Höge, H., and Black, A., 2010.Challenges in Speech Synthesis. Chen, F., Jokinen, K.,(eds.),
Speech Technology, Springer Science + Business Media LLC.
[4]. Allen, J., Hunnicutt, M. S., Klatt D., 1987. From Text to Speech: The MI Talk system. Cambridge University
Press.
[5]. Rubin, P., Baer, T., and Mermelstein, P., 1981. An Articulatory synthesizer for perceptual research. Journal of
the Acoustical Society of America 70: 321–328.
[6]. Van Santen, J.P.H., Sproat, R. W., Olive, J.P., and Hirschberg, J., 1997. Progress in Speech Synthesis Springer.

Copyright to IJARSCT DOI 10.48175/IJARSCT-5728 601


www.ijarsct.co.in

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy