Meta's New AI Audio Codec EnCodec Compresses Audio 10x Better Than MP3

Meta's New AI Audio Codec EnCodec Compresses Audio 10x Better Than MP3

By Marcus Bennett

November 20, 2024 at 04:50 AM

Meta has introduced an AI-powered audio compression technology called 'EnCodec' that achieves 10x better compression than traditional MP3 format, while maintaining high audio quality.

Person using audio production software

Person using audio production software

EnCodec works through a three-part system:

  • An encoder converts uncompressed audio into a lower frame rate representation
  • A quantizer compresses this representation while preserving essential data
  • A decoder reconstructs the audio in real-time using neural networks

Audio compression comparison graph

Audio compression comparison graph

The system uses discriminators to improve audio quality by playing a "cat-and-mouse game" - the discriminator identifies differences between original and reconstructed samples, while the compression model tries to generate samples that fool these discriminators.

Key achievements:

  • First neural network application for 48 kHz stereo audio compression
  • Exceeds CD quality (44.1 kHz sampling rate)
  • Delivers high-quality voice calls over poor network connections
  • Potential applications in metaverse experiences

While still in research phase, EnCodec demonstrates significant potential for delivering high-quality audio across networks regardless of conditions, particularly beneficial for metaverse applications and voice communications.

Businessman checking phone with charts

Businessman checking phone with charts

Fatboy Slim DJing with outstretched arm

Fatboy Slim DJing with outstretched arm

Related Articles

Previous Articles