دورية أكاديمية

Tonal representations for music retrieval: from version identification to query-by-humming

التفاصيل البيبلوغرافية
العنوان: Tonal representations for music retrieval: from version identification to query-by-humming
المؤلفون: Salamon, Justin, Serra, Joan, Gómez, Emilia
المساهمون: Consejo Superior de Investigaciones Científicas (España), Ministerio de Educación y Ciencia (España), European Commission, Generalitat de Catalunya
بيانات النشر: Springer
سنة النشر: 2013
المجموعة: Digital.CSIC (Consejo Superior de Investigaciones Científicas / Spanish National Research Council)
مصطلحات موضوعية: Bass line, Music retrieval, Version identification, Query by humming, Music similarity, Cover song detection, Harmony, Melody extraction
الوصف: In this study we compare the use of different music representations for retrieving alternative performances of the same musical piece, a task commonly referred to as version identification. Given the audio signal of a song, we compute descriptors representing its melody, bass line and harmonic progression using state-of-the-art algorithms. These descriptors are then employed to retrieve different versions of the same musical piece using a dynamic programming algorithm based on nonlinear time series analysis. First, we evaluate the accuracy obtained using individual descriptors, and then we examine whether performance can be improved by combining these music representations (i.e. descriptor fusion). Our results show that whilst harmony is the most reliable music representation for version identification, the melody and bass line representations also carry useful information for this task. Furthermore, we show that by combining these tonal representations we can increase version detection accuracy. Finally, we demonstrate how the proposed version identification method can be adapted for the task of query-by-humming. We propose a melody-based retrieval approach, and demonstrate how melody representations extracted from recordings of a cappella singing can be successfully used to retrieve the original song from a collection of polyphonic audio. The current limitations of the proposed approach are discussed in the context of version identification and query-by-humming, and possible solutions and future research directions are proposed. ; This research was funded by Programa de Formación del Profesorado Universitario (FPU) of the Ministerio de Educación de España, Consejo Superior de Investigaciones Científicas (JAEDOC069/2010), Generalitat de Catalunya (2009-SGR-1434) and the European Commission, FP7 (Seventh Framework Programme), ICT-2011.1.5 Networked Media and Search Systems, grant agreement No. 287711. ; Peer Reviewed
نوع الوثيقة: article in journal/newspaper
اللغة: unknown
تدمد: 2192-6611
العلاقة: #PLACEHOLDER_PARENT_METADATA_VALUE#; info:eu-repo/grantAgreement/EC/FP7/287711; Preprint; Sí; International Journal of Multimedia Information Retrieval 2 (1): 45- 58 (2013); http://hdl.handle.net/10261/133865Test; http://dx.doi.org/10.13039/501100003339Test; http://dx.doi.org/10.13039/501100000780Test; http://dx.doi.org/10.13039/501100002809Test
DOI: 10.1007/s13735-012-0026-0
DOI: 10.13039/501100003339
DOI: 10.13039/501100000780
DOI: 10.13039/501100002809
الإتاحة: https://doi.org/10.1007/s13735-012-0026-0Test
https://doi.org/10.13039/501100003339Test
https://doi.org/10.13039/501100000780Test
https://doi.org/10.13039/501100002809Test
http://hdl.handle.net/10261/133865Test
حقوق: open
رقم الانضمام: edsbas.6512B8F4
قاعدة البيانات: BASE
الوصف
تدمد:21926611
DOI:10.1007/s13735-012-0026-0