Category Archives: research

TISMIR Journal Launch and Call for Papers

I am blogging some news related to a project I have been recently contributing.

It brings us great pleasure to announce the launch of the first issue of TISMIR, the Transactions of the International Society for Music Information Retrievalhttps://transactions.ismir.net/

TISMIR was established to complement the widely cited ISMIR conference proceedings and provide a vehicle for the dissemination of the highest quality and most substantial scientific research in MIR. TISMIR retains the Open Access model of the ISMIR Conference proceedings, providing rapid access, free of charge, to all journal content. In order to encourage reproducibility of the published research papers, we provide facilities for archiving the software and data used in the research. The TISMIR publication model avoids excessive cost to the authors or their institutions, with article charges being less than the ISMIR Conference registration fee.

The first issue contains an editorial introducing the journal, four research papers and one dataset paper:

Editorial: Introducing the Transactions of the International Society for Music Information Retrieval – Simon Dixon,  Emilia Gómez,  Anja Volk

Multimodal Deep Learning for Music Genre Classification – Sergio Oramas,  Francesco Barbieri,  Oriol Nieto,  Xavier Serra

Learning Audio–Sheet Music Correspondences for Cross-Modal Retrieval and Piece Identification – Matthias Dorfer,  Jan Hajič jr.,  Andreas Arzt,  Harald Frostel,  Gerhard Widmer

A New Curated Corpus of Historical Electronic Music: Collation, Data and Research Findings – Nick Collins,  Peter Manning,  Simone Tarsitani

A Case for Reproducibility in MIR: Replication of ‘A Highly Robust Audio Fingerprinting System’ – Joren Six,  Federica Bressan,  Marc Leman

Pop Music Highlighter: Marking the Emotion Keypoints – Yu-Siang Huang,  Szu-Yu Chou,  Yi-Hsuan Yang

Two more papers (one research paper and one overview paper) are in press.

Authors:  We look forward to receiving new submissions to the journal – please see the Call for Papers below.

Best Regards
Simon Dixon, Anja Volk and Emilia Gómez

Editors-in-chief, TISMIR

CALL FOR PAPERS

The ISMIR Board is happy to announce the launch of the Transactions of the International Society for Music Information Retrieval (TISMIR), the open-access journal of our community.

 

TISMIR (http://tismir.ismir.net) publishes novel scientific research in the field of Music Information Retrieval (MIR), an interdisciplinary research area concerned with processing, analysing, organising and accessing music information. We welcome submissions from a wide range of disciplines, including computer science, musicology, cognitive science, library & information science, machine learning, and electrical engineering.

 

TISMIR is established to complement the widely cited ISMIR conference proceedings and provide a vehicle for the dissemination of the highest quality and most substantial scientific research in MIR. TISMIR retains the Open Access model of the ISMIR Conference proceedings, providing rapid access, free of charge, to all journal content. In order to encourage reproducibility of the published research papers, we provide facilities for archiving the software and data used in the research. TISMIR is published in electronic-only format, making it possible to offer very low publication costs to authors’ institutions, while ensuring fully open access content. With this call for papers we invite submissions for the following article types:

Article types

Research articles must describe the outcomes and application of unpublished original research. These should make a substantial contribution to knowledge and understanding in the subject matter and should be supported by relevant experiments.

Overview articles should focus in detail on specific aspects of MIR research. Overview articles will provide a comprehensive review of a broad MIR research problem, a critical evaluation of proposed techniques and/or an analysis of challenges for future research. Papers should critically engage with the relevant body of extant literature.

Datasets should present novel efforts in data gathering and annotation that have a strong potential impact in the way MIR technologies are exploited and evaluated.

 

If the paper extends or combines the authors’ previously published research, it is expected that there is a significant novel contribution in the submission (as a rule of thumb, we would expect at  least 50% of the underlying work – the ideas, concepts, methods, results, analysis and discussion – to be new). In addition, if there is any overlapping textual material, it should be rewritten.

 

Review process

The journal operates a double-blind peer review process.  Review criteria include originality, consideration of previous work, methodology, clarity and reproducibility.

 

Publication frequency

The journal is published online as a continuous volume and issue throughout the year, following an open access policy. Articles are made available as soon as they are ready to ensure that there are no unnecessary delays in getting content publicly available.

 

Editorial team

Editors in Chief

Simon Dixon, School of Electronic Engineering and Computer Science, Queen Mary University of London, United Kingdom

Emilia Gómez, Music Technology Group, Universitat Pompeu Fabra, Barcelona, Spain

Anja Volk, Department of Information and Computing Sciences, Utrecht University, Netherlands

Editorial Board

Juan P. Bello, Department of Music and Performing Arts Professions, & Department of Electrical and Computer Engineering, New York University, United States

Arthur Flexer, Austrian Research Institute for Artificial Intelligence (OFAI), Austria

Fabien Gouyon, Pandora, United States

Xiao Hu, Faculty of Education, Division of Information & Technology Studies, University of Hong Kong

Olivier Lartillot, Department of Musicology, University of Oslo, Norway

Jin Ha Lee, Information School, University of Washington, United States

Meinard Mueller, International Audio Laboratories Erlangen, Germany

Geoffroy Peeters, Sound Analysis/Synthesis Team, UMR STMS IRCAM CNRS, France

Markus Schedl, Department of Computational Perception, Johannes Kepler University Linz, Austria

 

Reviewers: The editorial board counts on reviewers from the ISMIR community, who are crucial to the success of the journal. To become a reviewer, please register here http://tismir.ubiquitypress.com/author/register/reviewer/

Journal Manager

Tim Wakeford, Ubiquity Press, United Kingdom

Contact

tismir@ismir.net

 

Website

http://tismir.ismir.net/

Leave a comment

Filed under publications, research, Uncategorized

Paper and dataset for Choir Singing Analysis, presented at ICMPC-ESCOM

Last week, Helena Cuesta, one of the PhD students I am working with, attended the 15th International Conference on Music Perception and Cognition and 10th triennial conference of the European Society for the Cognitive Sciences of Music in Graz (Austria). She presented the following paper in the poster session, as well as a contribution to the proceedings:

Cuesta, H., Gómez, E., Martorell, A., Loáiciga, F. Analysis of Intonation in Unison Choir Singing.

ICMPC/ESCOM is a very multidisciplinary conference, bringing together people from very different fields related to music such as music psychology, music perception, neuroscience, music theory, or music information retrieval.

The study investigates several expressive characteristics of unison choir singing, focusing on how singers blend together and interact with each other in terms of fundamental frequency dispersion, intonation, and vibrato. They also present an open dataset of choral singing that is available here, and was created in collaboration with the Anton Bruckner Choir (Barcelona).

This is a picture of the recording session. This work is being carried out in the context of two research projects: CASAS and TROMPA.

bruckner.png

 

Leave a comment

Filed under datasets, publications, research

LifeSoundTrack: música, tecnología, recuerdos y alzheimer / music, technology, memories and alzheimer

English version below!

A veces los proyectos más enriquecedores son los que realizas con menos recursos.

En éste proyecto utilizamos las tecnologías de recuperación de la información musical para encontrar la banda sonora de la vida de personas mayores españolas. Ésto ha presentado varios retos de investigación y tecnológicos. En concreto hemos podido observar el sesgo de los sistemas actuales de recomendación musical tanto en que sólo se centran en un repertorio musical popular y sus interfaces y descriptores musicales están pensados para usuarios jóvenes. Esto aplica a:

  • Las taxonomías de géneros musicales.
  • Los idiomas predominantes en las canciones y su etiquetado.
  • Los descriptores y playlists que se utilizan (e.g. música de fiesta).
  • La dificultad para tener los derechos de dar una canción a éstas personas para que la escuchen.
  • La dificultad de tener dispositivos fáciles de usar y dirigidos a éste tipo de personas.

Por tanto casi hemos tenido que empezar desde cero! Éste es el repositorio github que hemos creado (abierto): https://github.com/MTG/lifesoundtrack

Y si alguien quiere obtener su banda sonora puede hacerlo en bandasonoravital.upf.edu 

Atención! El sistema está pensado para personas mayores que han nacido o viven en España, por lo que utilizadlo sobre todo si cumplís éstos requisitos o para alguna persona que conozcáis que los cumpla.

Ha sido un privilegio poder colaborar con la Fundación Pasqual Maragall, La Fundación AVAN y la Escuela La Salud de Sabadell en éste piloto con personas que padecen la enfermedad de Alzheimer. Sobre todo ha sido genial trabajar con Nina, Anna, Carolina, y los chicos de La Salut. Aquí podéis ver un vídeo que me encanta y resume muy bien el proyecto:

Para más información no os perdáis el documental Sense Ficció que emitirá TV3 el día 8 de Mayo por la noche!!!

Más información del proyecto en la web del MTG y de la Fundación Pasqual Maragall.

Y las bandas sonoras se pueden generar en http://bandasonoravital.upf.edu/ 

¡Gracias a todo el equipo! En especial al grupo del MTG: Perfe Herrera, Felipe Navarro, Olga Slizovskaia, por su tiempo en éste proyecto donde no hemos tenido financiación específica.

auriculares

Sometimes the most enriching projects are those that you do with less resources and funding.

In this project we use music information retrieval (MIR) technologies to find the life soundtrack of Spanish elderly people. Starting from a questionnaire where we ask about biographical information and one´s relationship with music, we build a playlist looking at several sources such as spotify or youtube. 

This has presented several research and technological challenges. Specifically, we have been able to observe the bias of current music recommendation systems, as they focus on a popular musical repertoire and their interfaces and musical descriptors are designed for young users. This applies to:

  • The taxonomies of musical genres.
  • The predominant languages ​​in the songs and their labeling.
  • The descriptors and playlists that are used (e.g., what does happy music mean).
  • The difficulty to have the rights to give a song to these people to be heard.
  • The difficulty of having easy-to-use devices aimed at this type of person.

Therefore we almost had to start from scratch! This is the github repository that we have created (open): https://github.com/MTG/lifesoundtrack 

And if someone wants to get their soundtrack you can do it at bandasonoravital.upf.edu
Attention! The system is designed for seniors who have been born or live in Spain, so use it especially if you meet these requirements or for someone you know who complies.

It has been a privilege to be able to collaborate with the Pasqual Maragall Foundation, the AVAN Foundation and the Health School of Sabadell in this pilot with people suffering from Alzheimer’s disease. Above all it has been great to work with Nina, Anna, Carolina, and the guys from La Salut. Here you can see a video that I love and sums up the project very well:

For more information, do not miss the Sense Ficció documentary that will broadcast TV3 on May 8 at night !!!

More information on the project on the MTG website and the Pasqual Maragall Foundation. Thanks to all the team! Especially to the MTG group: Perfe Herrera, Felipe Navarro, Olga Slizovskaia, for their time in this project where we have not had specific funding.

Leave a comment

Filed under outreach, research, software

Interview at ACM SIGMM Records and the need for interdisciplinary research

The March Issue (Vol.10, Issue 1) of ACM SIGMM Records (News for the Multimedia Community) is out and it includes an interview of myself for the interdisciplinary column, kindly chaired by Cynthia Liem and Jochen Huber.

It is awesome already that there is an interdisciplinary column at SIGMM, recognising the challenges and also the potential of interdisciplinary research and insights as a way to have a comprehensive understanding, in this case, of multimedia computing. I was very pleased to ask questions about my experience in the MIR field and about diversity and interdisciplinarity.

You can read the interview and other interesting content here.

Leave a comment

Filed under outreach, press, research, womenTIC

¿Qué impacto tendrá la inteligencia artificial en el comportamiento humano?

Noticia en la web de la UPF sobre el nuevo proyecto HUMAINT

Podéis leerla aquí.

Leave a comment

Filed under press, research

HUMAINT: New research project on Human Behaviour and Machine Intelligence @ Joint Research Centre (EC) – We are hiring!

Postdoc application deadline: OCTOBER 26, 2017

casmihb

I am excited to lead a novel research initiative inside the European Commission’s Joint Research Centre, on the topic of machine learning and human behaviour.

The Joint Research Centre (JRC) is the European Commission’s science and knowledge service which employs scientists to carry out research in order to provide independent scientific advice and support to EU policy. The JRC Centre for Advanced Studies (JRC-CAS) was established to enhance the JRC’s capabilities to meet emerging challenges at the science-policy interface. JRC-CAS is now launching a three year interdisciplinary project to understand the potential impact of machine learning in human behaviour and societal welfare. It will be carried out at JRC centre in Seville, Spain. There will be close collaboration with the Music Technology Group and the Department of Information and Communication Technologies of Universitat Pompeu Fabra  in Barcelona, Spain.

The HUMAINT project will (1) provide a scientific understanding of machine vs human intelligence; (2) analyze the influence of machine learning algorithms into human behaviour (3) investigate to what extent these findings should influence the European regulatory framework. Given my research expertise, music will be an important use case to address.

In the context of this project, three postdoc positions in the area of machine learning and human behaviour are open for appointment from January 1, 2018, at the Joint Research Centre (European Commission) in Seville, Spain. The fully funded positions are available for a period of three years. Particular areas of interests:

  • Fairness, accountability, transparency, explainability of machine learning methods.

  • Social, ethical and economic aspects of artificial intelligence.

  • Human-computer interaction and human-centered machine learning.

  • Digital and behavioural economy.

  • Application domains: music and arts, social networks, health, transport, energy.

We are looking for highly motivated, independent, and outstanding postdoc candidates with a strong background in machine learning and/or human behaviour. An excellent research track record, ability to communicate research results and involvement in community initiatives is expected. Candidates should have EU/EEA citizenship.

The JRC offers an enriching multi-cultural and multi-lingual work environment with lifelong learning and professional development opportunities, and close links to top research organisations and international bodies around the world. Postdoctoral researchers receive a competitive salary and excellent working conditions, and will define their own research agenda inline with the project goals.

JRC-Seville is located in Cartuja 93 scientific and technological park. Seville is the fourth-largest city in Spain. With more than 30 centuries of history (gateway of America for two centuries, main actor in the first circumnavigation of the Earth),  three UNESCO World Heritage Sites, and privileged climate, it combines its historical and touristic character with a consolidated economic development and innovation potential.

We are also open for collaborations with external researchers, as one of our goals is to build an expert network in the topics.

You may obtain further information about the scientific aspects of the positions from Dr. Emilia Gómez (project scientific leader: emilia.gomez@upf.edu, using the subject [humaint]) and at the following web pages http://recruitment.jrc.ec.europa.eu/?site=SVQ&type=AX&category=FGIV and https://ec.europa.eu/jrc/en/working-with-us/jobs.

YOU CAN APPLY HERE BY OCTOBER 26, 2017 (first application round).

By the way, the positions are in Seville, the city I was born, one of the most beautiful places in the world 🙂

Leave a comment

Filed under projects, research

New open-access journal Transactions of ISMIR, open for submissions

I am happy to announce that the International Society for Music Information Retrieval launched the Transactions of the International Society for Music Information Retrieval, the open access journal of the ISMIR society at Ubiquity press. I am serving as Editor-in-Chief, together with Simon Dixon and Anja Volk.

TISMIR publishes novel scientific research in the field of music information retrieval (MIR).

We welcome submissions from a wide range of disciplines: computer science, musicology, cognitive science,  library & information science and electrical engineering.

We currently accept submissions.

View our submission guidelines for more information.

TISMIR

Leave a comment

Filed under publications, research

Special Issue at IEEE Multimedia Magazine

I have been collaborating for a while now on the edition of a Special Issue at IEEE Multimedia Magazine, which gathers state-of-the-art research on multimedia methods and technologies aimed at enriching music performance, production and consumption.

I have had the change to co-edit this issue with my colleagues Cynthia Liem (TU Delft, The Netherlands) and George Tzanetakis (University of Victoria, Canada), and I am very happy with the outcomes.

It is the second time I act as a co-editor for a journal (the first one was at JNMR and related to computational ethnomusicology) and I learnt a lot from the process. Editors have to asure good submissions, good reviews and recommendations, keeping the coherence and theme that we wanted to give as a message to our community. Yes: access, distribution and experiences in music are changing with new technologies. I am very happy with the outcomes!  Check our editorial paper here, and the full issue here.

And I love the design!

Captura de pantalla 2017-03-01 a las 10.45.27.png

 

 

Leave a comment

Filed under publications, research

Journal paper and open dataset for source separation in Orchestra music

As part of the PHENICX project, we have recently published our research results in the task of audio sound source separation, which is the main research topic of one of our PhD students, Marius Miron.

During this work, we developed a method for orchestral music source separation along with a new dataset: the PHENICX-Anechoic dataset. The methods were integrated into the  PHENICX project for tasks as orchestra focus/instrument enhancement. To our knowledge, this is the first time source separation is objectively evaluated in such a complex scenario. 

This is the complete reference to the paper:

M. Miron, J. Carabias-Orti, J. J. Bosch, E. Gómez and J. Janer, “Score-informed source separation for multi-channel orchestral recordings”, Journal of Electrical and Computer Engineering (2016))”

Abstract: This paper proposes a system for score-informed audio source separation for multichannel orchestral recordings. The orchestral music repertoire relies on the existence of scores. Thus, a reliable separation requires a good alignment of the score with the audio of the performance. To that extent, automatic score alignment methods are reliable when allowing a tolerance window around the actual onset and offset. Moreover, several factors increase the difficulty of our task: a high reverberant image, large ensembles having rich polyphony, and a large variety of instruments recorded within a distant-microphone setup. To solve these problems, we design context-specific methods such as the refinement of score-following output in order to obtain a more precise alignment. Moreover, we extend a close-microphone separation framework to deal with the distant-microphone orchestral recordings. Then, we propose the first open evaluation dataset in this musical context, including annotations of the notes played by multiple instruments from an orchestral ensemble. The evaluation aims at analyzing the interactions of important parts of the separation framework on the quality of separation. Results show that we are able to align the original score with the audio of the performance and separate the sources corresponding to the instrument sections.

The PHENICX-Anechoic dataset includes audio and annotations useful for different MIR tasks as score-informed source separation, score following, multi-pitch estimation, transcription or instrument detection, in the context of symphonic music. This dataset is based on the anechoic recordings described in this paper:

Pätynen, J., Pulkki, V., and Lokki, T., “Anechoic recording system for symphony orchestra,” Acta Acustica united with Acustica, vol. 94, nr. 6, pp. 856-865, November/December 2008.

For more information about the dataset and how to download you can access the PHENICX-Anechoic web page.

Leave a comment

Filed under datasets, publications, research

New project on MIR & singing: CASAS

At my lab we are starting a new project where we integrate our expertise in singing voice processing and music information retrieval to generate tools for choir singers.

CASAS (Community-Assisted Singing Analysis and Synthesis) is a project funded by the Ministry of Economy and Competitiveness of the Spanish Government (TIN2015-70816-R), that started in  January 1st 2016 and will end in December 31st 2018.

https://i0.wp.com/mtg.upf.edu/system/files/imagecache/projects_tech_thumbs/projects/Logo.jpgHumans use singing to create identity, express emotion, tell stories, exercise creativity, and connect with each other while singing together. This is demonstrated by the large community of music singers active in choirs and the fact that vocal music makes up an important part of our cultural heritage. Currently, an increasing amount of music resources are becoming digital, and the Web has become an important tool for singers to discover and study music, as a feedback resource and as a way to share their singing performances. The CASAS project has two complementary goals:

  • The first one is to improve state-of-the-art technologies that assist singers in their musical practice. We research on algorithms for singing analysis and synthesis (ex: automatic transcription, description, synthesis, classification and visualization), following a user-centered perspective, and with the goal of making them more robust, scalable and musically meaningful.
  • The second one is to enhance current public-domain vocal music archives and create research data for our target music information retrieval (MIR) tasks. Our project put a special emphasis on choral repertoire in Catalan and Spanish.

We exploit our current methods for Music Information Retrieval and Singing Voice Processing, and we involve a community of singers that use our technologies and provide their evaluations, ground truth data and relevance feedback.

I did my first logo, which is inspired by choirs, audio & “houses”, which is the english translation of “casas”. It will be an amazing project!

Leave a comment

Filed under projects, research