1. Introduction
This section is non-normative.
Many users want to continue consuming media while they interact with other content, sites, or applications on their device. A common UI affordance for this type of activity is Picture-in-Picture (PiP), where the video is contained in a separate miniature window that is always on top of other windows. This window stays visible even when the user agent is not visible. Picture-in-Picture is a common platform-level feature among desktop and mobile OSs.
This
specification
extends
HTMLVideoElement
allowing
websites
to
initiate
and
control
this
behavior
by
exposing
the
following
sets
of
properties:
-
Notify the website when it enters and leaves Picture-in-Picture mode.
-
Allow the website to trigger Picture-in-Picture mode via a user gesture on a video element.
-
Allow the website to know the size of the Picture-in-Picture window and notify the website when it changes.
-
Allow the website to exit Picture-in-Picture mode.
-
Allow the website to check if Picture-in-Picture mode can be triggered.
2. Examples
2.1. Add a custom Picture-in-Picture button
< video id = "video" src = "https://example.com/file.mp4" ></ video > < button id = "togglePipButton" ></ button > < script > const video= document. getElementById( "video" ); const togglePipButton= document. getElementById( "togglePipButton" ); // Hide button if Picture-in-Picture is not supported or disabled. togglePipButton. hidden= ! document. pictureInPictureEnabled|| video. disablePictureInPicture; togglePipButton. addEventListener( "click" , async () => { // If there is no element in Picture-in-Picture yet, let's request // Picture-in-Picture for the video, otherwise leave it. try { if ( document. pictureInPictureElement) { await document. exitPictureInPicture(); } else { await video. requestPictureInPicture(); } } catch ( err) { // Video failed to enter/leave Picture-in-Picture mode. } }); </ script >
2.2. Monitor video Picture-in-Picture changes
< video id = "video" src = "https://example.com/file.mp4" ></ video > < script > const video= document. getElementById( "video" ); video. addEventListener( "enterpictureinpicture" , ( event) => { // Video entered Picture-in-Picture mode. const pipWindow= event. pictureInPictureWindow; console. log( `Picture-in-Picture window width: ${ pipWindow. width} ` ); console. log( `Picture-in-Picture window height: ${ pipWindow. height} ` ); }); video. addEventListener( "leavepictureinpicture" , () => { // Video left Picture-in-Picture mode. }); </ script >
2.3. Update video size based on Picture-in-Picture window size changes
< video id = "video" src = "https://example.com/file.mp4" ></ video > < button id = "pipButton" ></ button > < script > const video= document. getElementById( "video" ); const pipButton= document. getElementById( "pipButton" ); pipButton. addEventListener( "click" , async () => { try { await video. requestPictureInPicture(); } catch ( error) { // Video failed to enter Picture-in-Picture mode. } }); video. addEventListener( "enterpictureinpicture" , ( event) => { // Video entered Picture-in-Picture mode. const pipWindow= event. pictureInPictureWindow; updateVideoSize( pipWindow. width, pipWindow. height); pipWindow. addEventListener( "resize" , onPipWindowResize); }); video. addEventListener( "leavepictureinpicture" , ( event) => { // Video left Picture-in-Picture mode. const pipWindow= event. pictureInPictureWindow; pipWindow. removeEventListener( "resize" , onPipWindowResize); }); function onPipWindowResize( event) { // Picture-in-Picture window has been resized. const { width, height} = event. target; updateVideoSize( width, height); } function updateVideoSize( width, height) { // TODO: Update video size based on pip window width and height. } </ script >
3. Concepts
3.1. Internal Slot Definitions
A user agent has:
-
An initiators of active Picture-in-Picture sessions list of zero or more origins , which is initially empty.
Note: In case a user agent supports multiple Picture-in-Picture windows, the list allows duplicates.
An origin is said to have an active Picture-in-Picture session if any of the origins in initiators of active Picture-in-Picture sessions are same origin-domain with origin.
3.2. Request Picture-in-Picture
When the request Picture-in-Picture algorithm with video is invoked, the user agent MUST run the following steps:
-
If Picture-in-Picture support is
false, throw aNotSupportedErrorand abort these steps. -
If the document is not allowed to use the policy-controlled feature named
"picture-in-picture", throw aSecurityErrorand abort these steps. -
If video ’s
readyStateattribute isHAVE_NOTHING, throw aInvalidStateErrorand abort these steps. -
If video has no video track, throw a
InvalidStateErrorand abort these steps. -
If video ’s
disablePictureInPictureis true, the user agent MAY throw anInvalidStateErrorand abort these steps. -
If
pictureInPictureElementisnulland the relevant global object of this does not have transient activation , throw aNotAllowedErrorand abort these steps. -
If video is
pictureInPictureElement, abort these steps. -
Set
pictureInPictureElementto video . -
Let Picture-in-Picture window be a new instance of
PictureInPictureWindowassociated withpictureInPictureElement. -
Append relevant settings object ’s origin to initiators of active Picture-in-Picture sessions .
-
Queue a picture-in-picture task to fire an event named
enterpictureinpictureusingPictureInPictureEventat the video with itsbubblesattribute initialized totrueand itspictureInPictureWindowattribute initialized to Picture-in-Picture window . -
If
pictureInPictureElementis fullscreenElement , it is RECOMMENDED to exit fullscreen .
It is RECOMMENDED that video frames are not rendered in the page and in the Picture-in-Picture window at the same time but if they are, they MUST be kept in sync.
When a video is played in Picture-in-Picture mode, the states SHOULD transition as if it was played inline. That means that the events SHOULD fire at the same time, calling methods SHOULD have the same behaviour, etc. However, the user agent MAY transition out of Picture-in-Picture when the video element enters a state that is considered not compatible with Picture-in-Picture.
Styles applied to video (such as opacity, visibility, transform, etc.) MUST NOT apply in the Picture-in-Picture window. Its aspect ratio is based on the video size.
It is also RECOMMENDED that the Picture-in-Picture window has a maximum and minimum size. For example, it could be restricted to be between a quarter and a half of one dimension of the screen.
3.3. Exit Picture-in-Picture
When the exit Picture-in-Picture algorithm is invoked, the user agent MUST run the following steps:
-
If
pictureInPictureElementisnull, throw aInvalidStateErrorand abort these steps. -
Run the close window algorithm with the Picture-in-Picture window associated with
pictureInPictureElement. -
Queue a picture-in-picture task to fire an event named
leavepictureinpictureusingPictureInPictureEventat the video with itsbubblesattribute initialized totrueand itspictureInPictureWindowattribute initialized to Picture-in-Picture window associated withpictureInPictureElement. -
Unset
pictureInPictureElement. -
Remove one item matching relevant settings object ’s origin from initiators of active Picture-in-Picture sessions .
It is NOT RECOMMENDED that the video playback state changes when the exit Picture-in-Picture algorithm is invoked. The website SHOULD be in control of the experience if it is website initiated. However, the user agent MAY expose Picture-in-Picture window controls that change video playback state (e.g., pause).
As one of the unloading document cleanup steps , run the exit Picture-in-Picture algorithm .
3.4. Disable Picture-in-Picture
Some
pages
may
want
to
disable
Picture-in-Picture
mode
for
a
video
element;
for
example,
they
may
want
to
prevent
the
user
agent
from
suggesting
a
Picture-in-Picture
context
menu
in
some
cases.
To
support
these
use
cases,
a
new
disablePictureInPicture
attribute
is
added
to
the
list
of
content
attributes
for
video
elements.
The
disablePictureInPicture
IDL
attribute
MUST
reflect
the
content
attribute
of
the
same
name.
If
the
disablePictureInPicture
attribute
is
present
on
the
video
element,
the
user
agent
MAY
prevent
the
video
element
from
playing
in
Picture-in-Picture
mode
or
present
any
UI
to
do
so.
When
the
disablePictureInPicture
attribute
is
added
to
a
video
element,
the
user
agent
MAY
run
these
steps:
-
Reject any pending promises returned by the
requestPictureInPicture()method withInvalidStateError. -
If video is
pictureInPictureElement, run the exit Picture-in-Picture algorithm .
3.5. Interaction with Fullscreen
It
is
RECOMMENDED
to
run
the
exit
Picture-in-Picture
algorithm
when
the
pictureInPictureElement
fullscreen
flag
is
set.
3.6. Interaction with Remote Playback
The [Remote-Playback] specification defines a local playback device and a local playback state . For the purpose of Picture-in-Picture, the playback is local and regardless of whether it is played in page or in Picture-in-Picture.
3.7. Interaction with Media Session
The API will have to be used with the [MediaSession] API for customizing the available controls on the Picture-in-Picture window.
3.8. Interaction with Page Visibility
When
pictureInPictureElement
is
set,
the
Picture-in-Picture
window
MUST
be
visible,
even
when
the
Document
is
not
in
focus
or
hidden.
The
user
agent
SHOULD
provide
a
way
for
users
to
manually
close
the
Picture-in-Picture
window.
The Picture-in-Picture window visibility MUST NOT be taken into account by the user agent to determine if the system visibility state of a traversable navigable has changed.
3.9. One Picture-in-Picture window
Operating
systems
with
a
Picture-in-Picture
API
usually
restrict
Picture-in-Picture
mode
to
only
one
window.
Whether
only
one
window
is
allowed
in
Picture-in-Picture
mode
will
be
left
to
the
implementation
and
the
platform.
However,
because
of
the
one
Picture-in-Picture
window
limitation,
the
specification
assumes
that
a
given
Document
can
only
have
one
Picture-in-Picture
window.
What happens when there is a Picture-in-Picture request while a window is already in Picture-in-Picture will be left as an implementation detail: the current Picture-in-Picture window could be closed, the Picture-in-Picture request could be rejected or even two Picture-in-Picture windows could be created. Regardless, the User Agent will have to fire the appropriate events in order to notify the website of the Picture-in-Picture status changes.
4. API
4.1.
Extensions
to
HTMLVideoElement
partial interface HTMLVideoElement { [NewObject ]Promise <PictureInPictureWindow >();requestPictureInPicture attribute EventHandler ;onenterpictureinpicture attribute EventHandler ; [onleavepictureinpicture CEReactions ]attribute boolean ; };disablePictureInPicture
The
requestPictureInPicture()
method,
when
invoked,
MUST
return
a
new
promise
promise
and
run
the
following
steps
in
parallel
:
-
Let video be the video element on which the method was invoked.
-
Run the request Picture-in-Picture algorithm with video .
-
If the previous step threw an exception, queue a picture-in-picture task to reject promise with that exception and abort these steps.
-
Queue a picture-in-picture task to
Resolveresolve promise with the Picture-in-Picture window associated withpictureInPictureElement.
4.2.
Extensions
to
Document
partial interface Document {readonly attribute boolean ; [pictureInPictureEnabled NewObject ]Promise <undefined >(); };exitPictureInPicture
The
pictureInPictureEnabled
attribute’s
getter
must
return
true
if
Picture-in-Picture
support
is
true
and
the
context
object
is
allowed
to
use
the
feature
indicated
by
attribute
name
picture-in-picture
,
and
false
otherwise.
Picture-in-Picture
support
is
false
if
there’s
a
user
preference
that
disables
it
or
a
platform
limitation.
It
is
true
otherwise.
The
exitPictureInPicture()
method,
when
invoked,
MUST
return
a
new
promise
promise
and
run
the
following
steps
in
parallel
:
-
Run the exit Picture-in-Picture algorithm .
-
If the previous step threw an exception, queue a picture-in-picture task to reject promise with that exception and abort these steps.
-
Queue a picture-in-picture task to
Resolveresolve promise .
4.3.
Extension
to
DocumentOrShadowRoot
partial interface mixin DocumentOrShadowRoot {readonly attribute Element ?; };pictureInPictureElement
The
pictureInPictureElement
attribute’s
getter
must
run
these
steps:
-
If the context object is not connected , return
nulland abort these steps. -
Let candidate be the result of retargeting Picture-in-Picture element against the context object .
-
If candidate and the context object are in the same tree , return candidate and abort these steps.
-
Return
null.
4.4.
Interface
PictureInPictureWindow
[Exposed =Window ]interface :PictureInPictureWindow EventTarget {readonly attribute long ;width readonly attribute long ;height attribute EventHandler ; };onresize
A
PictureInPictureWindow
instance
represents
a
Picture-in-Picture
window
associated
with
an
HTMLVideoElement
.
When
instantiated,
an
instance
of
PictureInPictureWindow
has
its
state
set
to
opened
.
When
the
close
window
algorithm
with
an
instance
of
PictureInPictureWindow
is
invoked,
its
state
is
set
to
closed
.
The
width
attribute
MUST
return
the
width
in
CSS
pixels
of
the
Picture-in-Picture
window
associated
with
pictureInPictureElement
if
the
state
is
opened
.
Otherwise,
it
MUST
return
0.
The
height
attribute
MUST
return
the
height
in
CSS
pixels
of
the
Picture-in-Picture
window
associated
with
pictureInPictureElement
if
the
state
is
opened
.
Otherwise,
it
MUST
return
0.
When
the
size
of
the
Picture-in-Picture
window
associated
with
pictureInPictureElement
changes,
the
user
agent
MUST
queue
a
picture-in-picture
task
to
fire
an
event
named
resize
at
pictureInPictureElement
.
4.5. Event types
[Exposed =Window ]interface :PictureInPictureEvent Event {(constructor DOMString ,type PictureInPictureEventInit ); [eventInitDict SameObject ]readonly attribute PictureInPictureWindow ; };pictureInPictureWindow dictionary :PictureInPictureEventInit EventInit {required PictureInPictureWindow ; };pictureInPictureWindow
-
enterpictureinpicture -
Fired on a
HTMLVideoElementwhen it enters Picture-in-Picture. -
leavepictureinpicture -
Fired on a
HTMLVideoElementwhen it leaves Picture-in-Picture mode. -
resize -
Fired on a
PictureInPictureWindowwhen it changes size.
4.6. Task source
The task source for all the tasks queued in this specification is the media element event task source of the video element in question.
When an algorithm queues a picture-in-picture task T , the user agent MUST queue a global task T on the [[media element task source]] using the global object of the current realm .
4.7. CSS pseudo-class
The
:picture-in-picture
pseudo-class
MUST
match
the
Picture-in-Picture
element.
It
is
different
from
the
pictureInPictureElement
as
it
does
NOT
apply
to
the
shadow
host
chain.
5. Security considerations
This section is non-normative.
To
limit
potential
abuse
through
spoofing,
the
API
applies
only
to
HTMLVideoElement
.
User
interaction
with
the
Picture-in-Picture
window
is
intentionally
limited
so
that
the
only
effect
is
on
the
Picture-in-Picture
window
itself
or
the
media
being
played.
5.1. Secure Context
The API is not limited to [SECURE-CONTEXTS] because it exposes a feature to web applications that user agents usually offer natively on all media regardless of the browsing context .
5.2. Permissions Policy
This
specification
defines
a
policy-controlled
feature
named
"picture-in-picture"
that
controls
whether
the
request
Picture-in-Picture
algorithm
may
return
a
SecurityError
and
whether
pictureInPictureEnabled
is
true
or
false
.
The
default
allowlist
for
this
feature
is
*
.
6. Acknowledgments
Thanks to Jennifer Apacible, Zouhir Chahoud, Marcos Cáceres, Philip Jägenstedt, Jeremy Jones, Chris Needham, Jer Noble, Justin Uberti, Yoav Weiss, and Eckhart Wörner for their contributions to this document.