cp.jku - TV & Radio Analysis

Audio Fingerprinting

Audio Fingerprinting aims at identifying an audio snippet given a large database of reference audio files (e.g. SoundHound, Shazam). Our institute developed an audio fingerprinter that is robust to all kinds of distortions, including severe time stretching and/or pitch shifting.

We offer the following topics for theses and practical works:

Exploring Attack Vectors on (Landmark-Based) Audio Fingerprinting Systems
Find and evaluate methods that change audio in a way that fools audio fingerprinters but is inperceptible to humans.
Developing a Client/Server Architecture and/or a Mobile Application
Make our fingerprinter usable from from the web.
Implement (and/or Improve) Modules of Our Fingerprinter In C for Speedup
For example: A Robust, Scalable Method to Extract Peaks from Spectrograms

Contact: Reinhard Sonnleitner, Gerhard Widmer

Recognising Sound Objects in Audio Streams

Given the gigantic and constantly growing amount of audiovisual material that is archived in digital form, there is a tremendous need in the media industry for methods that automatically extract semantic information from audiovisual data streams. In order to index and effectively search in huge multimedia databases, we need computers that can detect and name all sorts of recognisable sound objects in audio streams. The image to the right pertains to a real-world application, where TV channels are automatically monitored for the presence of music (in the foreground or background), using a music detection algorithm developed by us (in cooperation with the Austrian Research Institute for Artificial Intelligence, Vienna).

Detection of Rap/HipHop Music
Detecting Rap/HipHop challenges music detection systems because the vocals are difficult to distinguish from spoken language (for the computer). The goal is to develop new features and/or methods that overcome this problem.
Automatic Detection and/or Recognition of Commercials and Jingles in TV and Radio
Detection of Specific Musical Instruments
Develop methods that detect the presence of an instrument in the audio.

Contact: Reinhard Sonnleitner, Gerhard Widmer

Speech/Singing Detection and Classification

Determining the Number of Speakers in Audio Files
Classify Spoken/Sung Vocals
- Female or male?
- Single voice or several voices?
- Which language?

Contact: Reinhard Sonnleitner, Gerhard Widmer

last edited by kf on Sep 28, 2015