Audio experiences are evolving beyond fixed formats, allowing audiences to customize how they engage with shows and live events. Next Generation Audio (NGA) technologies, particularly MPEG-H Audio, empower users to adjust dialogue clarity, select commentary, and enable audio descriptions, all within a single stream.
This article delves into the implementation of cloud-based NGA workflows utilizing MPEG-H Audio, which facilitates immersive and interactive audio delivery at scale.
Adapting to Cloud-Based Workflows
Traditionally, MPEG-H production workflows relied on on-premises servers. However, with the shift towards cloud solutions, the processing of audio and video—including metadata—requires a new approach. This transition has been made possible through collaboration with AWS Partners and technology leaders, enabling a comprehensive MPEG-H production workflow in the cloud.
Transforming Audio Interaction
For years, audio in broadcasting and streaming has been fixed, limiting audience interaction. NGA changes this by introducing object-based audio capabilities, allowing broadcasters to enhance their productions. Next-generation TV standards, such as ATSC 3.0 and DTV+, are incorporating NGA codecs to improve audio quality while optimizing bandwidth usage.
Key Features of MPEG-H Audio
MPEG-H Audio, developed by ISO/IEC MPEG with contributions from Fraunhofer IIS, is tailored for NGA services in broadcast and streaming. Key capabilities include:
- Object-based audio for dynamic sound positioning
- Extensive metadata for personalized audio experiences
- Support for various playback devices, from multi-speaker systems to mobile devices
Workflow Overview
The cloud-based NGA production process involves several key steps:
- Audio Capture: Separate audio sources are captured as discrete PCM signals, enhancing interactivity and immersive playback.
- Metadata Creation: Audio elements are processed to create metadata that defines user interactivity and audio rendering.
- Distribution: The final output is encoded into streaming formats, ready for OTT delivery.
- Playback Rendering: MPEG-H libraries ensure optimized audio playback across devices.
Benefits of Cloud-Based Production
The cloud approach offers significant advantages, including:
- Scalability to accommodate millions of viewers
- Flexibility in resource allocation during peak events
- Cost savings by reducing the need for on-premises hardware
Conclusion
The implementation of MPEG-H Audio workflows on AWS represents a significant advancement in audio production, allowing broadcasters to deliver high-quality, personalized audio experiences. As the industry continues to embrace cloud solutions, the potential for immersive audio will only grow.