madmom: a new Python Audio and Music Signal Processing Library
Sprache des Titels:
Deutsch
Original Buchtitel:
Proceedings of the 24th ACM International Conference on Multimedia (ACMMM)
Original Kurzfassung:
In this paper, we present
madmom
, an open-source audio
processing and music information retrieval (MIR) library
written in Python.
madmom
features a concise,
NumPy
-
compatible, object oriented design with simple calling con-
ventions and sensible default values for all parameters, which
facilitates fast prototyping of MIR applications. Prototypes
can be seamlessly converted into callable processing pipelines
through
madmom
's concept of
Processors
, callable objects
that run transparently on multiple cores.
Processors
can
also be serialised, saved, and re-run to allow results to be
easily reproduced anywhere.
Apart from low-level audio processing,
madmom
puts em-
phasis on musically meaningful high-level features. Many
of these incorporate machine learning techniques and
mad-
mom
provides a module that implements some methods
commonly used in MIR such as hidden Markov models and
neural networks. Additionally,
madmom
comes with several
state-of-the-art MIR algorithms for onset detection, beat,
downbeat and meter tracking, tempo estimation, and chord
recognition. These can easily be incorporated into bigger
MIR systems or run as stand-alone programs.