Some Basic Audio Features

Use the provided m-files for computing the features of an audio classification problem
14K download
Aggiornato 18 mar 2014

Visualizza la licenza

---------------------------------------
Theodoros Giannakopoulos
http:/www.di.uoa.gr/~tyiannak
---------------------------------------
Feature extraction (as in most pattern recognition problems) is maybe the most important step in audio classification tasks. The provided Matlab code computes some of the basic audio features for groups of sounds stored in WAV files. Furthermore, a simple class separability measure, based on feature histograms is used for measuring the ability of each feature to be used for classifying the given classes. Therefore, you can use the provided m-files for computing the features of an audio classification problem (i.e. specific audio classes) and understanding "how good" those features are for the specific classification task.

The features are calculated in a two-step way:

In particular, the following audio features and respective statistics are extracted for each audio segment:
Features Statistics
Energy Entropy Standard Deviation (std)
Signal Energy Std by Mean (average) Ratio
Zero Crossing Rate Std
Spectral Rolloff Std
Spectral Centroid Std
Spectral Flux Std by Mean Ratio

In order to compute the 6 feature statistics for a specific .wav file, you can use the computeAllStatistics(fileName, win, step).

After the features are calculated,

a) the histograms of each feature for all classes are estimated

b) a simple algorithm is used for estimating the separability of the audio classes. In other words, a measure that describes how "easily" the features will be classified. In the case of a multi-class classification problem, the measure is calculated for EACH CLASS opposed to ALL OTHER CLASSES, i.e. a measure value FOR EACH CLASS is computed. The algorithm is described in detail in http://www.mathworks.com/matlabcentral/fileexchange/loadFile.do?objectId=18791&objectType=FILE#.

EXAMPLE:

The main function of this demo is computeFeaturesDirectory(). The only recuired argument is a cell array with the names of the directories in which the .wav files of the respective classes are stored. For example, suppose you have three folders named MUSIC, SPEECH and NOISE, each one containing wav files with relevant audio content (i.e. wav files of segments containing music, speech and noise). In order to compute the audio features of those files simply write:

>> F = computeFeaturesDirectory({'music','speech','noise'});

Theodoros Giannakopoulos
http:/www.di.uoa.gr/~tyiannak

* * * * * * * * * * * * * * * * * * * *
IMPORTANT UPDATE:
More advanced audio analysis methods can be found in the MATLAB AUDIO ANALYSIS LIBRARY (http://www.mathworks.com/matlabcentral/fileexchange/45831-matlab-audio-analysis-library) which is part of the book INTRODUCTION TO AUDIO ANALYSIS: A MATLAB APPROACH (http://www.amazon.com/Introduction-Audio-Analysis-MATLAB%C2%AE-Approach/dp/0080993885)
* * * * * * * * * * * * * * * * * * * *

Cita come

Theodoros Giannakopoulos (2024). Some Basic Audio Features (https://www.mathworks.com/matlabcentral/fileexchange/19236-some-basic-audio-features), MATLAB Central File Exchange. Recuperato .

Compatibilità della release di MATLAB
Creato con R2007b
Compatibile con qualsiasi release
Compatibilità della piattaforma
Windows macOS Linux

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!
Versione Pubblicato Note della release
1.1.0.0

Minor changes in description

1.0.0.0

Summary changed