Audio Level for RTC Encoded Frames

Category: WebRTC
Type: New or changed feature
Status: Proposed (Chrome Proposed)
Intent stage: None

Summary

This feature consists in exposing to the Web the audio level of an encoded frame transmitted via RTCPeerConnection and exposed using WebRTC Encoded Transform.

Motivation

audioLevel is exposed via other APIs (RTCStats, RTCContributingSources) and supports well-known use cases such as indicating who's talking in a VC application, or detecting silence. Having it as part of frame metadata makes audioLevel detection more accurate and efficient for applications using WebRTC Encode Transform, which work at the frame level. In this case, the application does not need to poll getStats() or getContributingSources() to get access to the audioLevel, and it will know that the audioLevel exactly corresponds to the frame being processed. It also unlocks other potential use cases, such as reducing the redundancy of frames with zero/low audio level.

Standards & signals

Specification: https://w3c.github.io/webrtc-encoded-transform/#dom-rtcencodedaudioframemetadata-audiolevel
Firefox: Positive — The official position has not yet been answered, but Mozilla's representative (and co-chair) in the WebRTC WG approved adding this to the spec. https://github.com/w3c/webrtc-encoded-transform/pull/253#pullrequestreview-2825299429
Safari: Positive — The official position has not yet been answered, but Apple's representative (and co-chair) in the WebRTC WG approved adding this to the spec. https://github.com/w3c/webrtc-encoded-transform/pull/253#pullrequestreview-2844026129
Web developers: No signals
Tracking bug: https://crbug.com/418116079

Explainers: https://github.com/guidou/webrtc-encoded-transform/blob/master/audio_level.md

View on chromestatus.com