Tag Archives: source separation

Journal paper and open dataset for source separation in orchestral music

As part of the PHENICX project, we have recently published our research results on audio source separation, the main research topic of one of our PhD students, Marius Miron.

During this work, we developed a method for orchestral music source separation along with a new dataset: the PHENICX-Anechoic dataset. The methods were integrated into the PHENICX project for tasks such as orchestra focus and instrument enhancement. To our knowledge, this is the first time source separation has been objectively evaluated in such a complex scenario.

This is the complete reference to the paper:

M. Miron, J. Carabias-Orti, J. J. Bosch, E. Gómez and J. Janer, “Score-informed source separation for multi-channel orchestral recordings”, Journal of Electrical and Computer Engineering (2016).

Abstract: This paper proposes a system for score-informed audio source separation for multichannel orchestral recordings. The orchestral music repertoire relies on the existence of scores. Thus, a reliable separation requires a good alignment of the score with the audio of the performance. To that extent, automatic score alignment methods are reliable when allowing a tolerance window around the actual onset and offset. Moreover, several factors increase the difficulty of our task: a high reverberant image, large ensembles having rich polyphony, and a large variety of instruments recorded within a distant-microphone setup. To solve these problems, we design context-specific methods such as the refinement of score-following output in order to obtain a more precise alignment. Moreover, we extend a close-microphone separation framework to deal with the distant-microphone orchestral recordings. Then, we propose the first open evaluation dataset in this musical context, including annotations of the notes played by multiple instruments from an orchestral ensemble. The evaluation aims at analyzing the interactions of important parts of the separation framework on the quality of separation. Results show that we are able to align the original score with the audio of the performance and separate the sources corresponding to the instrument sections.

The PHENICX-Anechoic dataset includes audio and annotations useful for different MIR tasks in the context of symphonic music, such as score-informed source separation, score following, multi-pitch estimation, transcription and instrument detection. The dataset is based on the anechoic recordings described in this paper:

Pätynen, J., Pulkki, V., and Lokki, T., “Anechoic recording system for symphony orchestra,” Acta Acustica united with Acustica, vol. 94, nr. 6, pp. 856-865, November/December 2008.

For more information about the dataset and how to download it, see the PHENICX-Anechoic web page.

Filed under datasets, publications, research

Computational models of symphonic music: challenges and opportunities


This is the title of the keynote speech I gave yesterday at the Mathematics and Computation in Music Conference, which is taking place in London this week. I presented our work in the PHENICX project, which I coordinate, on applying MIR technologies to the symphonic repertoire. This is the abstract:

An orchestral classical concert embraces a wealth of musical information, which may not be easily perceived or understood for general audiences. Current machine listening and visualization technologies can facilitate the appreciation of distinct musical facets, contributing to innovative and more enjoyable concert experiences. This presentation provides an overview of the challenges and opportunities that symphonic music poses for these technologies. We will summarize our current efforts in the improving of state-of-the-art methods for melody extraction, structural analysis, source separation when applied to this particular repertoire. Special emphasis will be given to the combination of symbolic, audio and gestural music descriptors, and to the development of meaningful visualizations designed to be exploited in off-line and live concert situations.

Among other things, I presented the work we carried out for the Exponential Prometheus opening concert of the Singularity Summit Spain, held in Seville on March 12th, 2015.

This is a video of the event, which illustrates our work in the PHENICX project.

It was featured in the Digital Agenda for Europe.

Filed under Uncategorized