Bootstrap learning for accurate onset detection |
| |
Authors: | Ning Hu Roger B Dannenberg |
| |
Affiliation: | (1) Google Inc. New York Office, 1440 Broadway, 21st Floor, New York, NY, 10018, United States;(2) Computer Science Department, Carnegie Mellon University, 5000 Forbes Ave, Pittsburgh, PA, 15213, United States |
| |
Abstract: | Supervised learning models have been applied to create good onset detection systems for musical audio signals. However, this
always requires a large set of labeled training examples, and hand-labeling is quite tedious and time consuming. In this paper,
we present a bootstrap learning approach to train an accurate note onset detection model. Audio alignment techniques are first
used to find the correspondence between a symbolic music representation (such as MIDI data) and an acoustic recording. This
alignment provides an initial estimate of note boundaries which can be used to train an onset detector. Once trained, the
detector can be used to refine the initial set of note boundaries and training can be repeated. This iterative training process
eliminates the need for hand-labeled audio. Tests show that this training method can improve an onset detector initially trained
on synthetic data.
Major part of work was done while the first author was at Carnegie Mellon University.
Editor: Gerhard Widmer |
| |
Keywords: | Bootstrap learning Onset detection Audio-to-score alignment |
本文献已被 SpringerLink 等数据库收录! |
|