Multichannel high resolution NMF for modelling convolutive mixtures of non-stationary signals in the time-frequency domain

Several probabilistic models involving latent components have been proposed for modelling time-frequency (TF) representations of audio signals such as spectrograms, notably in the nonnegative matrix factorization (NMF) literature. Among them, the recent high resolution NMF (HR-NMF) model is able to take both phases and local correlations in each frequency band into account, and its potential has been illustrated in applications such as source separation and audio inpainting. In this paper, HR-NMF is extended to multichannel signals and to convolutive mixtures. The new model can represent a variety of stationary and non-stationary signals, including autoregressive moving average (ARMA) processes and mixtures of damped sinusoids. A fast variational expectation-maximization (EM) algorithm is proposed to estimate the enhanced model. This algorithm is applied to a stereophonic piano signal, and proves capable of accurately modelling reverberation and restoring missing observations.

Data and Resources

Additional Info

Field Value
Source https://inria.hal.science/hal-00945249
Author Badeau, Roland, Plumbley, Mark. D.
Maintainer CCSD
Last Updated May 6, 2026, 18:32 (UTC)
Created May 6, 2026, 18:32 (UTC)
Identifier Report N°: EECSRR-13-03
Language en
Rights https://about.hal.science/hal-authorisation-v1/
contributor Laboratoire Traitement et Communication de l'Information (LTCI) ; Télécom ParisTech-Institut Mines-Télécom [Paris] (IMT)-Centre National de la Recherche Scientifique (CNRS)
creator Badeau, Roland
date 2013-05-06T00:00:00
harvest_object_id f3738cce-ba46-4420-8efe-cede42af1b4e
harvest_source_id 3374d638-d20b-4672-ba96-a23232d55657
harvest_source_title test moissonnage SELUNE
metadata_modified 2026-01-19T00:00:00
set_spec type:REPORT