TimbreDistributionExtractor

Overview

Package

Class

Use

Tree

Deprecated

Index

Help

PREV CLASS NEXT CLASS

FRAMES NO FRAMES

SUMMARY: NESTED | FIELD | CONSTR | METHOD

DETAIL: FIELD | CONSTR | METHOD

comirva.audio.extraction
Class TimbreDistributionExtractor

java.lang.Object
  comirva.audio.extraction.AttributeExtractor
      comirva.audio.extraction.AudioFeatureExtractor
          comirva.audio.extraction.TimbreDistributionExtractor

public class TimbreDistributionExtractor
extends AudioFeatureExtractor
extends AudioFeatureExtractor

Timbre Distribution Extractor

Description:

This class supports the extraction of the "Timbre Distribution" summarizing the timbre of an audio stream.

This is done by computing the MFCC for each audio frame(usually between 20ms and 50ms and a 50% overlap). The MFCCs are known to somehow characterize the timbre of such a short audio frame. Then one estimates the distribution of the MFCC vectors using a Gaussian Mixture Model.

The resulting distribution is a model of the song's overall timbre and can be compared to other timbre models. [1] Aucouturier, Pachet, "Improving Timbre Similarity: How high's the sky?" Journal of Negative Results in Speech and Audio Sciences, 1(1), 2004.

See Also:: GaussianMixture, MFCC, TimbreDistribution

Field Summary
`int`	`DEFAULT_NUMBER_COMPONENTS`
`protected MFCC`	`mfcc`
`int`	`minimumStreamLength`
`protected int`	`numberGaussianComponents`
`protected AudioPreProcessor`	`preProcessor`
`protected float`	`sampleRate`
`int`	`skipFinalSeconds`
`int`	`skipIntroSeconds`

Constructor Summary
`TimbreDistributionExtractor()` The default constructor uses 3 gaussian components for modeling the timbre distribution.
`TimbreDistributionExtractor(float sampleRate, int numberGaussianComponents, int skipIntro, int skipEnd, int minimumLength)` This constructor in contrast to the default constructor allows to specify the number of gaussian components used for modeling the timbre distribution.

Method Summary
`Attribute`	`calculate(File input)` This method is used to calculate the timbre distribution for a whole song.
`AttributeExtractor`	`copy()` This method returns a copy of an AttributeExtractor.
`int`	`getAttributeType()` Returns the type of the attribute that the class implementing this interface will return as the result of its extraction process.
`String`	`toString()` Returns the feature extractors name.

Methods inherited from class java.lang.Object
`clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait`

Field Detail

DEFAULT_NUMBER_COMPONENTS

public int DEFAULT_NUMBER_COMPONENTS

skipIntroSeconds

public int skipIntroSeconds

skipFinalSeconds

public int skipFinalSeconds

minimumStreamLength

public int minimumStreamLength

preProcessor

protected AudioPreProcessor preProcessor

mfcc

protected MFCC mfcc

numberGaussianComponents

protected int numberGaussianComponents

sampleRate

protected float sampleRate

Constructor Detail