Tracks

Table of Contents

  1. Track 1 – Transparency Codecs
  2. Track 2 – Speech Enhancement Codecs

The challenge comprises two tracks detailed below.

Track 1 – Transparency Codecs

Development of low-complexity, low-latency and low-bitrate speech codecs designed to preserve the perceptual transparency of input speech, including under mild noise and reverberation conditions.

Goal: Minimize perceptual speech degradation introduced by speech codecs while meeting the complexity, latency and bitrate constraints detailed under challenge Rules.

Focus Areas

  • Performance in clean conditions

  • Robustness to mild real-world distortions, e.g., background noise, reverberation


Track 2 – Speech Enhancement Codecs

Development of low-complexity, low-latency and low-bitrate enhancement codecs, i.e., codecs that perform denoising and dereverberation in addition to coding and compression, including under challenging acoustic conditions.

Goal: Enable transmission of intelligible and natural-sounding speech, including under challenging acoustic conditions, while meeting the complexity, latency and bitrate constraints detailed under challenge Rules.

Focus Areas

  • Transparency in clean conditions

  • Robustness to everyday distortions

  • Denoising performance in additive noise, including under challenging conditions

  • Dereverberation performance, including under challenging conditions

We welcome end-to-end approaches, as well as modular pipelines, e.g., codecs with enhancement front-ends.
