Norbert is an implementation of multichannel Wiener filter, that is a very popular way of filtering multichannel audio for several applications, notably speech enhancement and source separation.
This filtering method assumes you have some way of estimating power or magnitude spectrograms for all the audio sources (non-negative) composing a mixture. If you only have a model for some target sources, and not for the rest, you may use norbert.residual_model
to let Norbert create a residual model for you.
Given all source spectrograms and the mixture Time-Frequency representation, this repository can build and apply the filter that is appropriate for separation, by optimally exploiting multichannel information (like in stereo signals). This is done in an iterative procedure called Expectation Maximization, where filtering and re-estimation of the parameters are iterated.
From a beginner's perspective, all you need to do is often to call norbert.wiener
with the mix and your spectrogram estimates. This should handle the rest.
From a more expert perspective, you will find the different ingredients from the EM algorithm as functions in the module as described in the API documentation
pip install norbert
Asssuming a complex spectrogram X
, and a (magnitude) estimate of a target V
to be extracted from the spectrogram, performing the multichannel wiener filter is as simple as this:
X = stft(audio)
V = model(X)
Y = norbert.wiener(V, X)
estimate = istft(Y)
norbert is a community focused project, we therefore encourage the community to submit bug-fixes and requests for technical support through github issues. For more details of how to contribute, please follow our CONTRIBUTING.md
.
Antoine Liutkus, Fabian-Robert Stöter
If you want to cite the Norbert software package, please use the DOI from Zenodo:
MIT