Audio Output Devices API

This section specifies additions to the HTMLMediaElement [HTML] when the Audio Output Devices API is supported.

When the HTMLMediaElement constructor is invoked, the user agent MUST add the following initializing step:

Let the element have a [[SinkId]] internal slot, initialized to "".

WebIDLpartial interface HTMLMediaElement {
  [SecureContext] readonly attribute DOMString sinkId;
  [SecureContext] Promise<undefined> setSinkId (DOMString sinkId);
};

sinkId of type DOMString, readonly

This attribute contains the ID of the audio device through which output is being delivered, or the empty string if output is delivered through the user-agent default device. If nonempty, this ID should be equal to the deviceId attribute of one of the MediaDeviceInfo values returned from enumerateDevices().

On getting, the attribute MUST return the value of the [[SinkId]] slot.

setSinkId

Sets the ID of the audio device through which audio output should be rendered if the application is permitted to play out of a given device.

When this method is invoked, the user agent must run the following steps:

Let document be this's relevant global object's associated Document.
If document is not allowed to use the feature identified by "speaker-selection", return a promise rejected with a new DOMException whose name is NotAllowedError.
Let element be the HTMLMediaElement object on which this method was invoked.
Let sinkId be the method's first argument.
If sinkId is equal to element's [[SinkId]], return a promise resolved with undefined.
Let p be a new promise.
Run the following substeps in parallel:
1. If sinkId is not the empty string and does not match any audio output device identified by the result that would be provided by enumerateDevices(), reject p with a new DOMException whose name is NotFoundError and abort these substeps.
2. If sinkId is not the empty string, and the application would not be permitted to play audio through the device identified by sinkId if it weren't the current user agent default device, reject p with a new DOMException whose name is NotAllowedError and abort these substeps.
3. Switch the underlying audio output device for element to the audio device identified by sinkId.
  
  Note
  If this substep is successful and the media element's paused attribute is false, audio MUST stop playing out of the device represented by the element's sinkId attribute and will start playing out of the device identified by sinkId
4. If the preceding substep failed, reject p with a new DOMException whose name is AbortError, and abort these substeps.
5. Queue a task that runs the following steps:
  1. Set element's [[SinkId]] to sinkId.
  2. Resolve p.
Return p.

The audio device identified by a media element's sinkId attribute may become unavailable, for example if it is unplugged.

When the audio device identified by the sinkId attribute is no longer available, the user agent must take no action. For example, if the media element's paused attribute is false when the device identified by the sinkId is no longer available, then playback will continue as normal. In this case, audio will not be rendered because the device to which the media element is attached is unavailable.

The following paragraph is non-normative.

If the application wishes to react to the device change, the application can listen to the devicechange event and query enumerateDevices() for the list of updated devices. If the value of the media element's sinkId attribute is no longer present as the deviceId attribute in the returned list of MediaDeviceInfos, the device is no longer available and the application can choose to react accordingly.

New audio devices may become available to the user agent, or an audio device (identified by a media element's sinkId attribute) that had previously become unavailable may become available again, for example, if it is unplugged and later plugged back in.

In this scenario, the user agent must run the following steps:

Let sinkId be the identifier for the newly available device.
For each media element whose sinkId attribute is equal to sinkId:
1. If the media element's paused attribute is false, start rendering this object's audio out of the device represented by the sinkId attribute.

The following paragraph is non-normative.

If the application wishes to react to the device change, the application can listen to the devicechange event and query enumerateDevices() for the list of updated devices.

This section specifies additions to the MediaDevices when the Audio Output Devices API is supported.

WebIDLpartial interface MediaDevices {
  Promise<MediaDeviceInfo> selectAudioOutput(optional AudioOutputOptions options = {});
};

selectAudioOutput

Prompts the user to select a specific audio output device.

When the selectAudioOutput method is called, the user agent MUST run the following steps:

If this's relevant global object does not have transient activation, return a promise rejected with a DOMException object whose name attribute has the value InvalidStateError.
Let options be the method's first argument.
Let deviceId be options.deviceId.
Let p be a new promise.
Run the following steps in parallel:
1. Let descriptor be a PermissionDescriptor with its name set to "speaker-selection"
2. If descriptor's permission state is "denied", reject p with a new DOMException whose name attribute has the value NotAllowedError, and abort these steps.
3. Probe the user agent for available audio output devices.
4. If there is no audio output device, reject p with a new DOMException whose name attribute has the value NotFoundError and abort these steps.
5. If deviceId is not "" and matches an id previously exposed by selectAudioOutput in an earlier browsing session, the user agent MAY decide, based on its previous decision of whether to persist this id or not for this set of origins, to run the following sub steps:
  1. Let device be the device identified by deviceId, if available.
  2. If device is available, resolve p with either deviceId or a freshly rotated device id for device, and abort the in-parallel steps.
6. Prompt the user to choose an audio output device, with descriptor.
7. If the result of the request is "denied", reject p with a new DOMException whose name attribute has the value NotAllowedError and abort these steps.
8. Let deviceInfo be a new MediaDeviceInfo object to represent the selected audio output device.
9. Add deviceInfo.deviceId to [[explicitlyGrantedAudioOutputDevices]].
10. Resolve p with deviceInfo.
Return p.

Once a device is exposed after a call to selectAudioOutput, it MUST be listed by enumerateDevices() for the current browsing context.

If the promise returned by selectAudioOutput is resolved, then the user agent MUST ensure the document is both immediately allowed to play media in an HTMLMediaElement, and immediately allowed to start an AudioContext, without needing any additional user gesture.

Note

This is imprecise due to the current lack of standardization of autoplay in browsers.

This dictionary describes the options that can be used to obtain access to an audio output device.

WebIDLdictionary AudioOutputOptions {
  DOMString deviceId = "";
};

deviceId of type DOMString, defaulting to "": When the value of this dictionary member is not "", and matches the id previously exposed by selectAudioOutput in an earlier session, the user agent MAY opt to skip prompting the user in favor of resolving with this id or a new rotated id for the same device, assuming that device is currently available.

Note
Applications that wish to rely on user agents supporting persisted device ids must pass these through selectAudioOutput successfully before they will work with setSinkId. The reason for this is that it exposes fingerprinting information, but at the risk of prompting the user if the device is not available or the user agent decides not to honor the device id.

The Audio Output Devices API is a powerful feature that is identified by the name "speaker-selection".

It defines the following types and algorithms:

permission descriptor type

A permission covers access to the device given in the associated DevicePermissionDescriptor descriptor.

If the descriptor does not have a deviceId, its semantic is that it queries for access to all devices of that class. Thus, if a query for the "speaker-selection" powerful feature with no deviceId returns "granted", the client knows that there will not be a permission prompt for any audio output device known to it, if requested using the deviceId option to selectAudioOutput, and if "denied" is returned, it knows that no selectAudioOutput request for an audio output device will succeed.

If a permission state is present for access to some, but not all, audio output devices, a query without the deviceId will return "prompt".

extra permission data type

A list of deviceId values for the devices the user has made a non-default decision on access to.

permission query algorithm

The permission query algorithm runs the following steps:

If permissionDesc.deviceId exists in the extra permission data, set status.state to permissionDesc's permission state and terminate these steps.
Let global be a copy of permissionDesc with the deviceId member removed.
Set status.state to global's permission state.

permission revocation algorithm

This is the result of calling the device permission revocation algorithm passing name and deviceId as arguments. If the descriptor does not have a deviceId, then undefined is passed in place of deviceId.

This specification defines one policy-controlled feature identified by the string "speaker-selection". It has a default allowlist of "self".

Note

A document's permissions policy determines whether any content in that document is allowed to use selectAudioOutput to prompt the user for an audio output device, or allowed to use setSinkId to change the device through which audio output should be rendered, to a non-system-default user-permitted device. For selectAudioOutput this is enforced by the prompt the user to choose algorithm.

Audio Output Devices API

Abstract

Status of This Document

1. Introduction

2. `HTMLMediaElement` Extensions

Attributes

Methods

2.1 Algorithms

2.1.1 Sink no longer available

2.1.2 New sink available

3. `MediaDevices` Extensions

Methods

AudioOutputOptions dictionary

Dictionary `AudioOutputOptions` Members

4. Privacy Considerations

4.3 Permissions Integration

4.4 Permissions Policy Integration

5. Conformance

6. Acknowledgments

A. References

A.1 Normative references

A.2 Informative references

Audio Output Devices API

Abstract

Status of This Document

1. Introduction

2. HTMLMediaElement Extensions

Attributes

Methods

2.1 Algorithms

2.1.1 Sink no longer available

2.1.2 New sink available

3. MediaDevices Extensions

Methods

AudioOutputOptions dictionary

Dictionary AudioOutputOptions Members

4. Privacy Considerations

4.1 Consent

4.2 Obtaining Consent

4.3 Permissions Integration

4.4 Permissions Policy Integration

5. Conformance

6. Acknowledgments

A. References

A.1 Normative references

A.2 Informative references

2. `HTMLMediaElement` Extensions

3. `MediaDevices` Extensions

Dictionary `AudioOutputOptions` Members