|
Department of
Computational Perception
|
|
TV & RADIO ANALYSIS
Audio Fingerprinting
Audio Fingerprinting aims at identifying an audio snippet given a large database of reference audio files (e.g. SoundHound, Shazam). Our institute developed an audio fingerprinter that is robust to all kinds of distortions, including severe time stretching and/or pitch shifting. We offer the following topics for theses and practical works:
|
Recognising Sound Objects in Audio Streams
Given the gigantic and constantly growing amount of audiovisual material that
is archived in digital form, there is a tremendous need in the media industry
for methods that automatically extract semantic information from audiovisual
data streams. In order to index and effectively search in huge multimedia
databases, we need computers that can detect and name all sorts of recognisable
sound objects in audio streams. The image to the right pertains to
a real-world application, where TV channels are automatically monitored for the
presence of music (in the foreground or background), using a music detection
algorithm developed by us (in cooperation with the Austrian Research Institute
for Artificial Intelligence, Vienna).
|
Speech/Singing Detection and Classification