Playing Technique Classification Based on Deep Collaborative Learning of Variational Auto-Encoder and Gaussian Process

Sih Huei Chen, Yuan Shan Lee, Min Che Hsieh, Jia Ching Wang

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

3 Scopus citations

Abstract

Modeling musical timbre is critical for various music information retrieval (MIR) tasks. This work addresses the task of classifying playing techniques, which involves extremely subtle variations of timbre among different categories. A deep collaborative learning framework is proposed to represent music with greater discriminative power than previously achieved. First, a novel variational autoencoder (VAE) is developed to eliminate the variation of acoustic features within a class. Second, a Gaussian process classifier is jointly learned to distinguish the variations of timbre between classes, which increases the discriminative power of the learned representations. We derive a new lower bound that guides a VAE-based representation. Experiments were conducted on a database of seven classes of guitar playing techniques. The experimental results demonstrated that the proposed method outperforms baselines in terms of the F1-score and accuracy.
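For orientation only, the following is a minimal sketch of the standard VAE evidence lower bound (ELBO) for a diagonal-Gaussian encoder and a unit-variance Gaussian decoder. It is not the new lower bound derived in the paper, which the abstract does not specify, and it omits the Gaussian process classifier entirely. The function names (`gaussian_kl`, `gaussian_log_likelihood`, `standard_elbo`) and the fixed decoder variance `sigma` are illustrative assumptions.

```python
import math

# Assumption: textbook VAE ELBO, not the paper's modified bound.

def gaussian_kl(mu, log_var):
    """Closed-form KL( N(mu, diag(exp(log_var))) || N(0, I) )."""
    return 0.5 * sum(math.exp(lv) + m * m - 1.0 - lv
                     for m, lv in zip(mu, log_var))

def gaussian_log_likelihood(x, x_hat, sigma=1.0):
    """log N(x | x_hat, sigma^2 I): reconstruction term for one sample."""
    d = len(x)
    sq_err = sum((a - b) ** 2 for a, b in zip(x, x_hat))
    return -0.5 * (d * math.log(2.0 * math.pi * sigma ** 2)
                   + sq_err / sigma ** 2)

def standard_elbo(x, x_hat, mu, log_var):
    """Single-sample ELBO estimate: log p(x|z) - KL(q(z|x) || p(z))."""
    return gaussian_log_likelihood(x, x_hat) - gaussian_kl(mu, log_var)
```

In this sketch, `x_hat` stands for the decoder output for one latent sample drawn from the encoder distribution with mean `mu` and log-variance `log_var`. A jointly trained classifier, as described in the abstract, would contribute an additional term to the objective; its form is not reproduced here.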

Original language: English
Title of host publication: 2018 IEEE International Conference on Multimedia and Expo, ICME 2018
Publisher: IEEE Computer Society
ISBN (Electronic): 9781538617373
DOIs
State: Published - 8 Oct 2018
Event: 2018 IEEE International Conference on Multimedia and Expo, ICME 2018 - San Diego, United States
Duration: 23 Jul 2018 → 27 Jul 2018

Publication series

Name: Proceedings - IEEE International Conference on Multimedia and Expo
Volume: 2018-July
ISSN (Print): 1945-7871
ISSN (Electronic): 1945-788X

Conference

Conference: 2018 IEEE International Conference on Multimedia and Expo, ICME 2018
Country/Territory: United States
City: San Diego
Period: 23/07/18 → 27/07/18

Keywords

  • Gaussian process
  • Variational autoencoder
  • collaborative learning
  • playing technique classification
