Tracks
Table of Contents
The challenge comprises two tracks detailed below.
Track 1 – Transparency Codecs
Development of low-complexity, low-latency and low-bitrate speech codecs designed to preserve the perceptual transparency of input speech, including under mild noise and reverberation conditions.
Goal: Minimize perceptual speech degradation introduced by speech codecs while meeting the complexity, latency and bitrate constraints detailed under challenge Rules.
Focus Areas
-
Performance in clean conditions
-
Robustness to mild real-world distortions, e.g., background noise, reverberation
Track 2 – Speech Enhancement Codecs
Development of low-complexity, low-latency and low-bitrate enhancement codecs, i.e., those that in addition to coding and compression perform denoising and dereverberation, including under challenging acoustic conditions.
Goal: Enable transmission of intelligible and natural-sounding speech including under challenging acoustic conditions while meeting the complexity, latency and bitrate constraints detailed under challenge Rules.
Focus Areas
-
Transparency in clean conditions
-
Robustness to everyday distortions
-
Denoising performance in additive noise, including under challenging codnitions
-
Dereverberation performance, including under challenging conditions
We welcome end-to-end approaches, as well as modular pipelines, e.g., codecs with enhancement front-ends.