0% found this document useful (0 votes)

535 views15 pages

Music Genre Classification Slides

The document discusses using machine learning techniques for music genre classification. It explores using convolutional neural networks on MEL spectrograms of audio clips to classify songs into 7 genres. A pre-trained VGG-16 model is used for transfer learning. Additional spectral features are also extracted from the audio. Different classifiers like logistic regression, random forests, and SVMs are trained and evaluated. Results show ensembling classifiers improves performance over a single model. Frequency domain features perform better than time domain features. The confusion matrix reveals some genres like rock are predicted accurately while others like rhythm and blues see more misclassifications.

Uploaded by

Manoj Kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

535 views15 pages

Music Genre Classification Slides

Uploaded by

Manoj Kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 15

Music Genre Classification using Machine

Learning Techniques

CS 698 - Computational Audio

Hareesh Bahuleyan
Problem Statement
● Music genres are a way to classify
music based on rhythmic structure,
harmonic content and
instrumentation

● Automatically recognition
○ Organize digital libraries
○ Provide recommendations
Data
Google Audio Set
● 2.1 Million audio samples (of 10 seconds)
● 527 classes of sounds
● Selected 7 labels

● Not the actual audio, just the YouTubeIDs,

start and end times
● 880 KB per wav file,
● Approximately 34 GB data
Convolutional Neural
Networks
MEL Spectrograms
● 2D colormap representation of the signal
● STFT: Window size = 2048, Hop size = 512, Hann window function, Number
of MEL bins = 96
CNN - Image Classification
● Consider spectrogram as an image and train a CNN classifier
● Matrix of pixel values - 3 channel RGB input
Convolution Block
Convolution Pooling Non-Linear Activation

Source: http://www.wildml.com/2015/11/understanding-convolutional-neural-networks-for-nlp/
VGG-16

● Transfer Learning
○ Weights of conv base are fixed
● Fine Tuning
○ Both conv base and feed-forward network are trainable
Feature Engineering
Approaches
Feature Extraction

Time Domain Frequency Domain Classifiers

1. Mean 1. MEL Frequency Cepstral 1. Logistic Regression
Coefficients (MFCCs)
2. Variance 2. Random Forest
2. Chroma Features
3. Skewness 3. Support Vector
3. Spectral Centroid Machines
4. Kurtosis
4. Spectral Band-widths 4. Extreme Gradient
5. Zero Crossing Rate Boosting
5. Spectral Contrast
6. Root Mean Square
Energy 6. Spectral Roll-offs

7. Tempo

● Total Number of Features = 97

Spectral Features
● Spectral Centroid

● Spectral Band-width

● Spectral Contrast
○ Divide spectrum into frequency bands
○ Maximum magnitude - Minimum magnitude in each band
● Spectral Roll-off
○ Frequency below which 85% of the total energy in the spectrum lies
● Chroma Features
○ 12-element feature vector
○ Indicates how much energy of each pitch class, {C, C#, D, D#, E, ..., B}
Results
Comparison of Models
● Metrics: Accuracy | F-score | AUC

Baseline uses flatten

vector of pixels

Ensembling classifiers
is beneficial
Feature Importance Study
Keep only
most
important
top N
features

Time domain vs.

Frequency domain
Confusion Matrix

Good at predicting some

classes. Eg: Rock

Many mis-classifications
for Rhythm blues, Pop
genre

Classes are also

unbalanced
Thank You

Music Genre Classification Project Report PDF
0% (2)
Music Genre Classification Project Report PDF
29 pages
Hadoop: The Definitive Guide Unit 2 Part 2: Hadoop I/O
No ratings yet
Hadoop: The Definitive Guide Unit 2 Part 2: Hadoop I/O
26 pages
Unit 1 - Analog Communication - WWW - Rgpvnotes.in
No ratings yet
Unit 1 - Analog Communication - WWW - Rgpvnotes.in
12 pages
Continuous Time Fourier Transform - Practice Sheet 01
No ratings yet
Continuous Time Fourier Transform - Practice Sheet 01
8 pages
Data Structures - Map ADT
No ratings yet
Data Structures - Map ADT
7 pages
Problem Statement TCS
No ratings yet
Problem Statement TCS
23 pages
Music Genre Classification Using Machine Learning
No ratings yet
Music Genre Classification Using Machine Learning
3 pages
Spread Spectram-NPTEL
No ratings yet
Spread Spectram-NPTEL
12 pages
Abstract Classes
No ratings yet
Abstract Classes
5 pages
Major
No ratings yet
Major
15 pages
Methods For CMB Map Analysis
No ratings yet
Methods For CMB Map Analysis
13 pages
Software Prototyping
No ratings yet
Software Prototyping
13 pages
Effect of No Gaussian Turbulence On Extreme Buffeting Response of Sea Crossing Bridge
No ratings yet
Effect of No Gaussian Turbulence On Extreme Buffeting Response of Sea Crossing Bridge
19 pages
ch5 PDF
No ratings yet
ch5 PDF
25 pages
Chapter 13
No ratings yet
Chapter 13
54 pages
CM3022 How To Set Up
No ratings yet
CM3022 How To Set Up
4 pages
Write A Program To Implement The First Pattern Matching Algorithm
No ratings yet
Write A Program To Implement The First Pattern Matching Algorithm
5 pages
Time and Frequency Domain Analysis of Signals A Review IJERTV9IS120127
No ratings yet
Time and Frequency Domain Analysis of Signals A Review IJERTV9IS120127
6 pages
Cs 230 Final Project Paper
No ratings yet
Cs 230 Final Project Paper
6 pages
Chapter 12
No ratings yet
Chapter 12
27 pages
JNTUA Signals and Systems - PPT Notes - R20
No ratings yet
JNTUA Signals and Systems - PPT Notes - R20
177 pages
Cedar
100% (1)
Cedar
13 pages
Machine Learning and Deep Learning - Fundamentals and Applications
No ratings yet
Machine Learning and Deep Learning - Fundamentals and Applications
1 page
Application of Ai-Based Welding Process Monitoring For Quality Control in Pipe Production
No ratings yet
Application of Ai-Based Welding Process Monitoring For Quality Control in Pipe Production
6 pages
Digital To Analog Conversion in High Resolution Audio - v2
No ratings yet
Digital To Analog Conversion in High Resolution Audio - v2
180 pages
Automatic Genre Classification of Music Content: (A Survey)
No ratings yet
Automatic Genre Classification of Music Content: (A Survey)
28 pages
Atmospheric Turbulence Simulation Techniques With Application To Flight Analysis
No ratings yet
Atmospheric Turbulence Simulation Techniques With Application To Flight Analysis
172 pages
Music Genre Classification With ResNet and
No ratings yet
Music Genre Classification With ResNet and
17 pages
Spectral Content of NRZ Test Patterns
No ratings yet
Spectral Content of NRZ Test Patterns
5 pages
Chapter - 1: 1.1 Introduction To Music Genre Classification
No ratings yet
Chapter - 1: 1.1 Introduction To Music Genre Classification
57 pages
Remotesensing 05 05550 v2
No ratings yet
Remotesensing 05 05550 v2
22 pages
Cylinder & Plate
No ratings yet
Cylinder & Plate
11 pages
EEG Labview
No ratings yet
EEG Labview
6 pages
Detecting Malicious Facebook Applications
No ratings yet
Detecting Malicious Facebook Applications
19 pages
Bio Data PDF
No ratings yet
Bio Data PDF
1 page
Accuracy Prediction Using Machine Learning Techniques For Patient Liver Disease
100% (1)
Accuracy Prediction Using Machine Learning Techniques For Patient Liver Disease
15 pages
Chapter-1-Overview of Communication System
No ratings yet
Chapter-1-Overview of Communication System
58 pages
Depth of Anaesthesia Monitoring: What's Available, What's Validated and What's Next?
No ratings yet
Depth of Anaesthesia Monitoring: What's Available, What's Validated and What's Next?
10 pages
Cardiac Report
No ratings yet
Cardiac Report
71 pages
Multiple-Choice-Questions Lecture 3
No ratings yet
Multiple-Choice-Questions Lecture 3
2 pages
Stoyanov et al. - 2011 - Pink Noise, 1f^α Noise, and Their Effect on Solut
No ratings yet
Stoyanov et al. - 2011 - Pink Noise, 1f^α Noise, and Their Effect on Solut
22 pages
Pacman
No ratings yet
Pacman
63 pages
Project
No ratings yet
Project
25 pages
DSP Lab Manual
No ratings yet
DSP Lab Manual
95 pages
Muzic Genre Classification
No ratings yet
Muzic Genre Classification
4 pages
Music Genre Classification Using Machine Learning Techniques: April 2018
No ratings yet
Music Genre Classification Using Machine Learning Techniques: April 2018
13 pages
Intelligent Diagnosis of Cardiac Disease Prediction Using Machine Learning
No ratings yet
Intelligent Diagnosis of Cardiac Disease Prediction Using Machine Learning
17 pages
Disease Detection Using Deep Learning: Sourabh Patil (1545) Ashish Singh Rao (1505)
No ratings yet
Disease Detection Using Deep Learning: Sourabh Patil (1545) Ashish Singh Rao (1505)
11 pages
Seminar Final Report
No ratings yet
Seminar Final Report
26 pages
Cross Power Spectral Density: Z T Which Is Sum of Two Real Jointly WSS Random Processes
No ratings yet
Cross Power Spectral Density: Z T Which Is Sum of Two Real Jointly WSS Random Processes
8 pages
Day 4 Session 1 PDF
No ratings yet
Day 4 Session 1 PDF
85 pages
Session Plan
No ratings yet
Session Plan
1 page
Prediction of COVID-19 Using Machine Learning Techniques: Durga Mahesh Matta Meet Kumar Saraf
No ratings yet
Prediction of COVID-19 Using Machine Learning Techniques: Durga Mahesh Matta Meet Kumar Saraf
52 pages
A Comparison of Time Series Models To Predict COVID-19 Cases
No ratings yet
A Comparison of Time Series Models To Predict COVID-19 Cases
31 pages
Literaturereview Proposed System Cloud Security System Model Development Modules Experimental Results Advantages Conclusion References
No ratings yet
Literaturereview Proposed System Cloud Security System Model Development Modules Experimental Results Advantages Conclusion References
17 pages
Data Structure Module 5
No ratings yet
Data Structure Module 5
22 pages
Bangladeshi Flower Identification Using Computer Vision and Machine Learning Techniques
100% (1)
Bangladeshi Flower Identification Using Computer Vision and Machine Learning Techniques
16 pages
Oto 01010
No ratings yet
Oto 01010
82 pages
Random Breaking Waves Horizontal Seabed 2 Hans Peter Riedel. & Anthony Paul Byrne
No ratings yet
Random Breaking Waves Horizontal Seabed 2 Hans Peter Riedel. & Anthony Paul Byrne
6 pages
Dproject
No ratings yet
Dproject
10 pages
Adri Ludick
No ratings yet
Adri Ludick
17 pages
Cybercrime On Social Media PDF
No ratings yet
Cybercrime On Social Media PDF
4 pages
Vibration Characteristics of Diesel Driven Emergency Fire Pump - Clarke
No ratings yet
Vibration Characteristics of Diesel Driven Emergency Fire Pump - Clarke
7 pages
dsp26 28 2024
No ratings yet
dsp26 28 2024
31 pages
Using Short-Time Fourier Transform in Machinery
No ratings yet
Using Short-Time Fourier Transform in Machinery
8 pages
DAA Question Bank
No ratings yet
DAA Question Bank
10 pages
DAA Unit 3 Full Notes
No ratings yet
DAA Unit 3 Full Notes
69 pages
LP 4 Lab Manual
No ratings yet
LP 4 Lab Manual
52 pages
Type - 0 Grammar: Type-0 Grammars Generate Recursively Enumerable Languages. The
No ratings yet
Type - 0 Grammar: Type-0 Grammars Generate Recursively Enumerable Languages. The
4 pages
Sem-5 Mini Project
100% (1)
Sem-5 Mini Project
12 pages
Aqwa Fer
No ratings yet
Aqwa Fer
128 pages
Guideline For Offshore Structural Reliability Analysis (Aplication To Tension Leg Platforms) DNV
No ratings yet
Guideline For Offshore Structural Reliability Analysis (Aplication To Tension Leg Platforms) DNV
66 pages
TreeMap Program
No ratings yet
TreeMap Program
4 pages
Solutions For Java Practice Coding
No ratings yet
Solutions For Java Practice Coding
47 pages
AI Experiment No 1 Study of Prolog
No ratings yet
AI Experiment No 1 Study of Prolog
7 pages
Project Report GitHub
No ratings yet
Project Report GitHub
32 pages
Distributed File System
No ratings yet
Distributed File System
49 pages
Disease Prediction Using Deep Learning
No ratings yet
Disease Prediction Using Deep Learning
25 pages
Nonideal Effects in SC - Rev - 2
No ratings yet
Nonideal Effects in SC - Rev - 2
58 pages
Associative Memory Neural Networks
100% (1)
Associative Memory Neural Networks
35 pages
Synopsis of Music Player
No ratings yet
Synopsis of Music Player
7 pages
MC9223-Design and Analysis of Algorithm Unit-I - Introduction
No ratings yet
MC9223-Design and Analysis of Algorithm Unit-I - Introduction
35 pages
Chapter Three
No ratings yet
Chapter Three
37 pages
Synopsis of Spotify Clone (1) - 1
No ratings yet
Synopsis of Spotify Clone (1) - 1
11 pages
UNIT-III Support Vector Machines
No ratings yet
UNIT-III Support Vector Machines
43 pages
AI Project Report: By: Neha Kalra (17csu122) and Prerna Pathak (17csu143)
No ratings yet
AI Project Report: By: Neha Kalra (17csu122) and Prerna Pathak (17csu143)
22 pages
AI Project Report: By: Neha Kalra (17csu122) and Prerna Pathak (17csu143)
No ratings yet
AI Project Report: By: Neha Kalra (17csu122) and Prerna Pathak (17csu143)
22 pages
Parallel Algorithm Lecture Notes
No ratings yet
Parallel Algorithm Lecture Notes
28 pages
S1 CS - U4 Data Ranges - Frequencies - Shifting
No ratings yet
S1 CS - U4 Data Ranges - Frequencies - Shifting
24 pages
BCA Multimedia Viva
No ratings yet
BCA Multimedia Viva
4 pages
Unit IV PUSHDOWN AUTOMATA AND POST MACHINES
No ratings yet
Unit IV PUSHDOWN AUTOMATA AND POST MACHINES
247 pages
DSAD Dynamic Hashing
No ratings yet
DSAD Dynamic Hashing
79 pages
Chomsky Classification of Grammars
No ratings yet
Chomsky Classification of Grammars
3 pages
Final Assesment All-In-One-Domain
No ratings yet
Final Assesment All-In-One-Domain
38 pages
SPM 3-I Couse File Format
No ratings yet
SPM 3-I Couse File Format
18 pages
Formal Language and Automata Theory: UNIT-1
No ratings yet
Formal Language and Automata Theory: UNIT-1
38 pages
Module 3 OOMD
No ratings yet
Module 3 OOMD
28 pages
(XXXX) 2 Marks (XXXX) : - Class B Extends A
No ratings yet
(XXXX) 2 Marks (XXXX) : - Class B Extends A
11 pages
Cs2358 Internet Programming Lab Anna University Syllabus
No ratings yet
Cs2358 Internet Programming Lab Anna University Syllabus
12 pages
Remove Left Factoring
100% (1)
Remove Left Factoring
2 pages
Report 20220209 Talent Matcht Prueba 1 Backend C.buitron Outlook - Com77911720767
No ratings yet
Report 20220209 Talent Matcht Prueba 1 Backend C.buitron Outlook - Com77911720767
10 pages
Software Testing Methodologies: Unit 1
No ratings yet
Software Testing Methodologies: Unit 1
16 pages
Computer Algoritham For Chennai Univarsity Unit5
No ratings yet
Computer Algoritham For Chennai Univarsity Unit5
11 pages
The Database System Environment
100% (1)
The Database System Environment
2 pages
NLP Asgn2
No ratings yet
NLP Asgn2
7 pages
Internship Report File
No ratings yet
Internship Report File
35 pages
CC Solution Set-1
No ratings yet
CC Solution Set-1
10 pages
DWDM Online Bits
No ratings yet
DWDM Online Bits
3 pages
Anna University OOPS Question Bank Unit 2
100% (1)
Anna University OOPS Question Bank Unit 2
6 pages
Job Recommender Java Spring Boot
No ratings yet
Job Recommender Java Spring Boot
21 pages
PHP Variables
No ratings yet
PHP Variables
4 pages
Software Quality: Robert Hughes and Mike Cotterell
No ratings yet
Software Quality: Robert Hughes and Mike Cotterell
46 pages
DAA 2020 Week 02 Assignment 01
No ratings yet
DAA 2020 Week 02 Assignment 01
3 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Music Genre Classification Slides

Uploaded by

Music Genre Classification Slides

Uploaded by

Music Genre Classification using Machine

CS 698 - Computational Audio

● Not the actual audio, just the YouTubeIDs,

Time Domain Frequency Domain Classifiers

● Total Number of Features = 97

Baseline uses flatten

Time domain vs.

Good at predicting some

Classes are also

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.