Tom Oviste

ICASSP 2026 — Neural Variable Span Filters for Interpretable Speech Enhancement

Introduction

This webpage is intended as a companion to the 2026 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) paper, Neural Variable Span Filters for Interpretable Speech Enhancement by T. Oviste, P. Mowlaee, J. Badajoz-Davila, J. R. Jensen and M. G. Christensen.

Here, we present audio samples filtered by our proposed Hybrid Variable Span Filter (HVSF). Signals are filtered by the HVSF with various multipliers applied to its μ parameter, to demonstrate the effect of the multiplier on the tradeoff between speech distortion and noise reduction.

Results

Sample 1 (SIR = +0 dB, SNR = -4 dB)

  Spectrogram Audio
Input PNG
Target PNG
Estimate (μ ⨯ 1) PNG
Estimate (μ ⨯ 10) PNG
Estimate (μ ⨯ 0.1) PNG

Sample 2 (SIR = +0 dB, SNR = +0 dB)

  Spectrogram Audio
Input PNG
Target PNG
Estimate (μ ⨯ 1) PNG
Estimate (μ ⨯ 10) PNG
Estimate (μ ⨯ 0.1) PNG

Sample 3 (SIR = -3 dB, SNR = -4 dB)

  Spectrogram Audio
Input PNG
Target PNG
Estimate (μ ⨯ 1) PNG
Estimate (μ ⨯ 10) PNG
Estimate (μ ⨯ 0.1) PNG