Voice Recognition With Neural Networks, Type-2 Fuzzy Logic and Genetic Algorithms
Voice Recognition With Neural Networks, Type-2 Fuzzy Logic and Genetic Algorithms
Abstract we describe in this paper the use of neural there are methods in which a small set of words, such as
networks, fuzzy logic and genetic algorithms for voice digits, are used as key words and each user is prompted to
recognition. In particular, we consider the case of speaker utter a given sequence of key words that is randomly chosen
recognition by analyzing the sound signals with the help of every time the system is used. Yet even this method is not
intelligent techniques, such as the neural networks and fuzzy completely reliable, since it can be deceived with advanced
systems. We use the neural networks for analyzing the sound electronic recording equipment that can reproduce key words
signal of an unknown speaker, and after this first step, a set of in a requested order. Therefore, a text-prompted speaker
type-2 fuzzy rules is used for decision making. We need to use recognition method has recently been proposed .
fuzzy logic due to the uncertainty of the decision process. We
also use genetic algorithms to optimize the architecture of the
neural networks. We illustrate our approach with a sample of
sound signals from real speakers in our institution.
I. INTRODUCTION
Speaker recognition, which can be classified into
identification and verification, is the process of automatically
recognizing who is speaking on the basis of individual
information included in speech waves. This technique makes
(a) Speaker identification
it possible to use the speaker's voice to verify their identity
and control access to services such as voice dialling, banking
by telephone, telephone shopping, database access services,
information services, voice mail, security control for
confidential information areas, and remote access to
computers [10].
III. VOICE CAPTURING AND PROCESSING Fig. 4. Main window of the computer program for processing
The first step for achieving voice recognition is to capture the signals.
the sound signal of the voice. We use a standard microphone
for capturing the voice signal. After this, we use the sound
We also show in Figure 5 the use of the Fast Fourier
recorder of the Windows operating system to record the
Transform (FFT) to obtain the spectral analysis of the word
sounds that belong to the database for the voices of different
"way" in Spanish.
persons. A fixed time of recording is established to have
homogeneity in the signals. We show in Figure 3 the sound
signal recorder used in the experiments.
in this section our modular neural network approach with the use
of type-2 fuzzy logic in the integration of results .
VII. REFERENCES
[1] O. Castillo, O. and P. Melin, "A New Approach for Plant Monitoring using
Type-2 Fuzzy Logic and Fractal Theory", International Journal of
General Systems, Taylor and Francis, Vol. 33, 2004, pp. 305-319.
[9] N.N Karnik, and J.M. Mendel, An Introduction to Type-2 Fuzzy Logic
Systems, Technical Report, University of Southern California, 1998.
[13] P. Melin, and O. Castillo, A New Method for Adaptive Control of Non-
Linear Plants Using Type-2 Fuzzy Logic and Neural Networks,
International Journal of General Systems, Taylor and Francis, Vo