
CALL FOR YOUR SYMPHONY

PROJECT REPORT

Submitted for the course: Signal Analysis And Processing (ECE1018)

By
YOGESH KAUSHIK 16BIS0149
ANSHUL RANJAN MODI 16BIS0141
SINGAM MEGHANA 16BIS0139

Slot: A1
Name of faculty: Dr. S. KALAIVANI
Dr. CHRISTOPHER CLEMENT J

SCHOOL OF ELECTRONICS ENGINEERING

November, 2017

CERTIFICATE

This is to certify that the project work entitled "Call For Your Symphony" that is being
submitted by Yogesh Kaushik, Anshul Ranjan Modi, and Singam Meghana for Signal Analysis
And Processing (ECE1018) is a record of bonafide work done under my supervision. The
contents of this project work, in full or in parts, have neither been taken from any other source
nor been submitted for any other CAL course.

Place : Vellore

Date : 03/11/2017

Signature of Students:

YOGESH KAUSHIK

ANSHUL RANJAN MODI

SINGAM MEGHANA

Signature of Faculty:

Dr. S. KALAIVANI

Dr. CHRISTOPHER CLEMENT J

ACKNOWLEDGEMENTS

The members of the group would like to acknowledge all those who have helped with the
completion of this project.

First of all, we would like to express our deepest gratitude to Dr. Kalaivani S and
Dr. Christopher Clement J, our teachers, for their valuable advice, continual support, suggestions, and
patience during our study. We would also like to thank our Dean, Dr. Elizabeth Rufus, for
giving us an opportunity to carry out our studies at the University.

Our special thanks are extended to the lab assistants who helped us. Finally, our special thanks
are also due to all our friends for their academic and moral support, and furthermore for their
helpful assistance during the data collection throughout the study.

Signature:

Yogesh Kaushik

16BIS0149

Anshul Ranjan Modi

16BIS0141

Singam Meghana

16BIS0139

ABSTRACT

Technology serves mankind with the ripened fruits of its advancement; the objective of our
paper, likewise, is to aid mankind by easing the initiation of a task, using advancement in
technology as the main weapon to reduce the complexity of the tasks at hand. In this paper we
have developed an algorithm for playing a desired track by just calling out its name.
Cross-correlation* has aided us in making this algorithm; all of our work is done using
MATLAB. Basic transducers** are used to give physical form to our work.

* Measure of the similarity of two signals

** Microphone & speakers

1. Introduction:
The algorithm we created can be implemented to provide a smart environment that eases the
workload of the user. On the frontend, the user calls out the name of a song, and the tool we
used processes the user's request and plays the desired track. The backend, by contrast, is
much more complex.

We first generate the songs line by line using the basic musical notes, which are stored
in a library. Their names are then stored as the predefined voice models, which will later be
used during speech correlation. The main idea behind this project is speech recognition: the
test voice input should match one of the predefined voice models, which results in the
generation of the desired output. [6]

Fig (1) depicts the overview of the algorithm used (basically the interaction between the client
and the interface), which is implemented in MATLAB.

Fig (1): overview of algorithm

2. Methodology:
A: Generation of songs

Every musical note has a particular frequency. By using this frequency we can generate the
musical notes. Sinusoidal functions are incorporated in our algorithm to generate these basic
notes.

F(x) = sin(2πfx),  0 ≤ x ≤ range   (1)

{F(x) is the note; f is the particular frequency; range is the interval over which the
function is defined}

Then the song is generated line by line as is done while playing a piano, picking up right notes in
a particular order to generate the desired melody.

The generated tracks are stored under a library which can later be accessed during the speech
correlation. The above stated method is used for the generation of the desired number of songs.
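As an illustration, this note-and-concatenate scheme can be sketched in NumPy (a Python analogue of the MATLAB code given later in this report; variable names are illustrative):

```python
import numpy as np

FS = 8000  # samples per second (the 0.000125 s step used in our MATLAB code)

def note(freq_hz, duration_s=0.5):
    """One musical note: a sinusoid of the given frequency."""
    n = int(round(duration_s * FS)) + 1   # MATLAB's 0:0.000125:0.5 is inclusive
    t = np.arange(n) / FS
    return np.sin(2 * np.pi * freq_hz * t)

# A few of the note frequencies (in Hz) used in our implementation
a = note(440.0)    # A4
e = note(659.26)   # E5
f = note(739.99)   # F#5

# A song line is simply the chosen notes concatenated in order
line1 = np.concatenate([a, a, e, e, f, f, e, e])
```

Concatenating such lines in the right order yields the full track, exactly as a pianist strings notes into a melody.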

For storing the tracks in an accessible format we use the wavwrite* and audiowrite** MATLAB
functions. The link between the stored tracks and the input test voice sample is the set of
predefined voice models; to record these we use the audiorecorder*** function of MATLAB,
which gathers voice input from the user by means of the transducer (microphone). [7]

* Writes data to 8-, 16-, 24-, and 32-bit .wav files

** Writes a matrix of audio data

*** Records audio from an input device, such as a microphone connected to your system
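The role that wavwrite/audiowrite play here (storing a generated track in an accessible format) can be sketched with Python's standard wave module; the file name and scaling are illustrative:

```python
import wave
import numpy as np

FS = 8192  # playback sample rate used in our code

def save_track(path, samples):
    """Write a float array in [-1, 1] to a 16-bit mono .wav file."""
    pcm = (np.clip(samples, -1.0, 1.0) * 32767).astype(np.int16)
    with wave.open(path, "wb") as w:
        w.setnchannels(1)   # mono
        w.setsampwidth(2)   # 16-bit samples
        w.setframerate(FS)
        w.writeframes(pcm.tobytes())

tone = np.sin(2 * np.pi * 440 * np.arange(FS) / FS)  # one second of A4
save_track("one.wav", tone)
```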

B: Speech recognition using Correlation:

In the field of signals, the measure of the resemblance of two functions as a function of the
displacement of one relative to the other is called cross-correlation. This process has many
applications in fields such as neurophysiology, averaging, and pattern recognition.

The general mathematical expression of cross-correlation is given in (2):

(x1 ⋆ x2)(τ) = ∫ x1(t) x2(t + τ) dt   (2)

{x1 denotes the first function; x2 denotes the second function; t is time; τ is the displacement (lag)}
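In discrete time the integral becomes a sum, which NumPy's correlate computes directly (a stand-in for MATLAB's xcorr). A toy example with a short reference signal and a delayed copy of it:

```python
import numpy as np

# A short reference signal and a copy of it delayed by 3 samples
ref = np.array([0., 1., 2., 1., 0.])
delayed = np.concatenate([np.zeros(3), ref])

# Full cross-correlation: one value per possible displacement tau
xc = np.correlate(delayed, ref, mode="full")

# The peak of the cross-correlation sits at the displacement where the
# two signals line up best -- here, a lag of 3 samples
lag = int(np.argmax(xc)) - (len(ref) - 1)
```

The height of that peak is what we later use as a similarity score: the closer the two signals resemble each other, the larger the maximum of their cross-correlation.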

Implementation:

We use the cross-correlation method to determine the result of the project. One by one, we
load the predefined voice models stored in the library and correlate each with the input. The
MATLAB function that serves this need is wavread* (audioread in newer releases); we use it to
read the predefined voice models from the source library, for example:
{y1=wavread('one.wav');
one.wav is the name of a track in the library}

A correlation score is generated for each voice model by cross-correlating it with the test
voice input and taking the peak value with the max** function. The model with the largest
score identifies the desired output; the corresponding song is then read from the source
library with wavread and played, giving the user the desired output. [3][4][5]

The comparison is done using simple conditional statements. If the desired match is found, the
algorithm calls the desired track from the source library and plays it; otherwise, an error
sound is played. [1][2]
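This selection step can be sketched as follows (a NumPy analogue of the conditional chain in our MATLAB code; the tiny stand-in models and the threshold value are illustrative):

```python
import numpy as np

def best_match(test_voice, voice_models, threshold=80.0):
    """Score each stored voice model by its peak cross-correlation with
    the test input; return the index of the best-scoring model, or None
    when every score falls below the rejection threshold (the case in
    which the error sound is played)."""
    scores = [np.correlate(test_voice, model, mode="full").max()
              for model in voice_models]
    best = int(np.argmax(scores))
    return best if scores[best] > threshold else None

# Tiny stand-ins for the recorded voice models
models = [np.array([1., 0., 0.]), np.array([0., 0., 5.])]
```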
The figures depicted below show the working of the algorithm. Fig (2) is the test voice input; Fig
(3) is the cross-correlation of the test voice input with one of the predefined voice models; Fig (4)
is the song being played.

Fig (2): test voice input

Fig (3): cross correlation result

Fig (4): track being played

* Reads a Microsoft WAVE (.wav) sound file

** Returns the largest element

Fs=8192; % sampling frequency used for playback

a=sin(2*pi*440*(0:0.000125:0.5)); % each note is a 0.5 s sinusoid at its pitch frequency

b=sin(2*pi*493.88*(0:0.000125:0.5));

c=sin(2*pi*554.37*(0:0.000125:0.5));

d=sin(2*pi*587.33*(0:0.000125:0.5));

e=sin(2*pi*659.26*(0:0.000125:0.5));

f=sin(2*pi*739.99*(0:0.000125:0.5));

g = sin(2*pi*195.99*(0:0.000125:0.5));

line1=[a,a,e,e,f,f,e,e,];

line2=[d,d,c,c,b,b,a,a,];

line3=[e,e,d,d,c,c,b,b];

song1=[line1,line2,line3,line3,line1,line2];

ln1=[c,d,e,e,e,e,e,e,e,e,e,d,e,f];

ln2=[e,e,e,d,d,b,b,d,c];

ln3=[c,g,g,g,g,f,a,g];

ln4=[f,f,f,f,f,f,e,d,f];

song2=[ln1,ln2,ln3,ln4];

l1=[e,e,e,e,e,e,e,g,c,d];

l2=[e,f,f,f,f,f,e,e,e,e];

l3=[e,d,d,e,d,g,e,e,e,e,e,e];

l4=[e,g,c,d,e,f,f,f,f];

l5=[f,e,e,e,e,g,g,f,d,c];

song3=[l1,l2,l3,l4,l5];

lnn1=[e,d,c,d,e,e,e];

lnn2=[d,d,d,e,g,g];

lnn3=[e,d,d,e,d,c];

song4=[lnn1,lnn2,lnn1,lnn3];

lnnn1=[b,a,b,b,a,b,g,a,a];

lnnn2=[g,d,a,b,g,d,a,b];

lnnn3=[c,b,c,c,b,a,g,a,g,a];

lnnn4=[g,d,a,b,g,d,a,b];

lnnn5=[c,b,c,c,b,a,g,a,g,a];

song5=[lnnn1,lnnn2,lnnn3,lnnn4,lnnn5];

recObj = audiorecorder;

disp('Start');

recordblocking(recObj, 2);

disp('end');

Obj=getaudiodata(recObj);

%Speech Recognition Using Correlation Method

%Write Following Command On Command Window

%speechrecognition('test.wav')

voice=Obj;

x=voice;

x=x';

x=x(1,:);

x=x';

y1=audioread('one.wav');

y1=y1';

y1=y1(1,:);

y1=y1';

z1=xcorr(x,y1);

m1=max(z1);

l1=length(z1);

t1=-((l1-1)/2):1:((l1-1)/2);

t1=t1';

%subplot(3,2,1);

plot(t1,z1);

y2=audioread('two.wav');

y2=y2';

y2=y2(1,:);

y2=y2';

z2=xcorr(x,y2);

m2=max(z2);

l2=length(z2);

t2=-((l2-1)/2):1:((l2-1)/2);

t2=t2';

%subplot(3,2,2);

figure

plot(t2,z2);

y3=audioread('three.wav');

y3=y3';

y3=y3(1,:);

y3=y3';

z3=xcorr(x,y3);

m3=max(z3);

l3=length(z3);

t3=-((l3-1)/2):1:((l3-1)/2);

t3=t3';

%subplot(3,2,3);

figure

plot(t3,z3);

y4=audioread('four.wav');

y4=y4';

y4=y4(1,:);

y4=y4';

z4=xcorr(x,y4);

m4=max(z4);

l4=length(z4);

t4=-((l4-1)/2):1:((l4-1)/2);

t4=t4';

%subplot(3,2,4);

figure

plot(t4,z4);

y5=audioread('five.wav');

y5=y5';

y5=y5(1,:);

y5=y5';

z5=xcorr(x,y5);

m5=max(z5);

l5=length(z5);

t5=-((l5-1)/2):1:((l5-1)/2);

t5=t5';

%subplot(3,2,5);

figure

plot(t5,z5);

m6=80; % rejection threshold: if no correlation peak exceeds this, the error sound is played

a=[m1 m2 m3 m4 m5 m6];

m=max(a);

if m<=m1

sound(song1,Fs);

elseif m<=m2

sound(song2,Fs);

elseif m<=m3

sound(song3,Fs);

elseif m<=m4

sound(song4,Fs);

elseif m<=m5

sound(song5,Fs);

else

soundsc(audioread('denied.wav'),8192)

end

3. APPLICATIONS:
The sources of entertainment for differently abled people are very limited, and this paper
focuses on their betterment in a simple but effective way. Many surveys have been conducted
in which differently abled people have shared their problems. This project simplifies playing
music with the help of our automated music system: the user only has to use their voice to
play the music, instead of playing it manually.

Our project can also be applied to the audio systems used in cars; futuristic cars are expected
to be enabled with smart features such as voice-command controls, and our project aids in
exactly that. It not only serves as a futuristic wizard, but also helps society as an aid to the
differently abled.

CONCLUSION:
This paper gives a brief description of a simple and efficient voice-recognition method for
extracting the song stored in the library under a particular voice model. The main
area of concern was the development of the algorithm. Our algorithm is efficient and caters
to the promises made in the paper. We have successfully created and tested the algorithm and
hope that it will be used in the days to come as a tech-wizard.

Fig (5) gives the procedural methodology of our work plan incorporated in the paper.

REFERENCES
[1]. X. D. Huang and K. F. Lee, Phoneme classification using semi-continuous hidden Markov
models, IEEE Trans. on Signal Processing, 40(5):1962-1967, May 1992.

[2]. A. Acero, Acoustical and environmental robustness in automatic speech recognition, Kluwer
Academic Pubs., 1993.

[3]. L. R. Rabiner and R. W. Schafer, Digital processing of speech signals, Prentice Hall, 1978.

[4]. F. Jelinek, "Continuous speech recognition by statistical methods," IEEE Proceedings
64:4 (1976): 532-556.

[5]. S. Young, Review of large vocabulary continuous speech recognition, IEEE Signal
Processing Magazine, pp. 45-57, September 1996.

[6]. L. R. Rabiner and B. H. Juang, Fundamentals of speech recognition, Prentice Hall, 1993.

[7]. Samudravijaya K., "Speech and speaker recognition: A tutorial."

[8]. S. Young, The general use of tying in phoneme-based HMM speech recognizers,
Proceedings of ICASSP 1992.

[9]. http://www.wikipedia.org

[10]. http://www.google.co.in
