Skip to main navigation Skip to search Skip to main content

Fully Quantized Neural Networks for Audio Source Separation

  • Elad Cohen
  • , Hai Victor Habi
  • , Reuven Peretz
  • , Arnon Netzer

Research output: Contribution to journalArticlepeer-review

5 Scopus citations

Abstract

Deep neural networks have shown state-of-the-art results in audio source separation tasks in recent years. However, deploying such networks, especially on edge devices, is challenging due to memory and computation requirements. In this work, we focus on quantization, a leading approach for addressing these challenges. We start with a theoretical and empirical analysis of the signal-to-distortion ratio (SDR) in the presence of quantization noise, which presents a fundamental limitation in audio source separation tasks. These analyses show that quantization noise mainly affects performance when the model produces high SDRs. We empirically validate the theoretical insights and illustrate them on audio source separation models. In addition, the empirical analysis shows a high sensitivity to activations quantization, especially to the network's input and output signals. Following the analysis, we propose Fully Quantized Source Separation (FQSS), a quantization-aware training (QAT) method for audio source separation tasks. FQSS introduces a novel loss function based on knowledge distillation that considers quantization-sensitive samples during training and handles the quantization noise of the input and output signals. We validate the efficiency of our method in both time and frequency domains. Finally, we apply FQSS to several architectures (CNNs, LSTMs, and Transformers) and show negligible degradation compared to the full-precision baseline models.

Original languageEnglish
Pages (from-to)926-933
Number of pages8
JournalIEEE Open Journal of Signal Processing
Volume5
DOIs
StatePublished - 1 Jan 2024
Externally publishedYes

Keywords

  • DNN
  • SDR
  • Source separation
  • compression
  • knowledge distillation
  • quantization

ASJC Scopus subject areas

  • Signal Processing

Fingerprint

Dive into the research topics of 'Fully Quantized Neural Networks for Audio Source Separation'. Together they form a unique fingerprint.

Cite this