2.0 - Widevine DRM Encoding and Packaging
Version 1.1
Contents
Summary
Contact Us
Introduction to Encoding
Elementary Stream
Codecs
Containers
Container formats
Transmuxing and Transcoding
Video Stream Structure
Group of Pictures (GOP) Structure
Why are GOPs important?
Encoding media
Constant Bitrate (CBR)
Variable Bitrate (VBR)
Aspect Ratio
Using encoding profiles
Common encoding profiles
Best Practices
General recommendations
H264 Encoding Profiles
Example encoding syntax using ffmpeg
Ffmpeg parameters
HEVC Encoding Profiles
Example encoding syntax using ffmpeg
Ffmpeg parameters
VP9 Encoding Profiles
Example encoding syntax using ffmpeg
Ffmpeg parameters
Content Encryption
Encryption Recommendations from Least Secure to Most Secure
Playback Security Levels for Chrome and iOS
Contact Us
For any questions, please contact Widevine through our website: www.widevine.com.
Introduction to Encoding
This section provides a quick layman's understanding of how media is created, the formats it is stored in, common definitions and terminology, and the codecs supported for DASH playback.
Elementary Stream
An elementary stream (ES) is the encoded form of a single type of media as perceived by the user. Every elementary stream contains a single media data type (audio, video, subtitles, or captions). Combining several elementary streams produces a cohesive media playback experience on any given platform.
The content of an elementary stream depends on the data format it holds, which is defined by a codec (coder-decoder) for video or audio. Elementary streams are broken down into frames and encoded by codecs. A frame (or media sample) is typically a still image for video or a few milliseconds of audio, and it contains the information needed to render a specific video or audio scene at that point in time. A collection of frames makes up a complete video or audio clip, similar to a flip book.
In essence, audio and video content is stored as frames that are encoded (compressed) by a codec. Each codec conforms to its own specification, and every codec strives to provide the best media quality for the least amount of resources used (processing, time, efficiency, size).
Codecs
The most common codecs used for video and audio processing today are:
Video: H264 (AVC), HEVC, VP9
Audio: AAC, Vorbis
Some codecs are closed-source commercial products, while others are open-source and free to use, with or without licensing requirements. Typically, codec selection is based on client platform support. DASH presentations support multiple codecs (i.e. mixing and matching different codecs within the same content).
Containers
Media containers are the grouping of one or more elementary streams into a data stream (in this
case, a file). A container is also a format specification that describes elements of the data
streams (timing, structure, and media information) that it holds.
The elementary stream metadata that is added to a container includes (not a complete list; see the inspection example after this list):
● Codec type
● Codec-specific configuration data
● Video height and width
● Video frame rate
● Audio sampling rate
● Audio channels
● Frame timing and ordering information
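As an illustrative aside, these fields can be inspected for a local file with ffprobe (part of the ffmpeg suite); the input name is a placeholder:
ffprobe -hide_banner -show_format -show_streams <source_file>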
Container formats
Here are examples of container formats commonly used for streaming media.
MPEG2-TS (.ts)
● Optimized for transmission over a closed network for broadcast systems.
ISO-BMFF (.mp4)
● Designed as a next-generation container format by Apple and adopted by ISO/IEC.
● There are multiple file extensions allowed for this format, however, we are focused on
MPEG4 (mp4).
● All data within this format is organized into boxes. Each box type represents a different
type of data element contained within. Boxes may also contain other boxes.
● Fragmented MP4 is a variant of ISO-BMFF optimized for streaming.
When creating DASH-compliant media, the specification requires that only a single elementary stream be present per container.
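For example, a multiplexed MP4 holding both audio and video can be split into single-stream files without re-encoding; this is an illustrative sketch using ffmpeg with placeholder file names:
ffmpeg -i <source_file> -map 0:v -c copy video_only.mp4
ffmpeg -i <source_file> -map 0:a -c copy audio_only.mp4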
Transmuxing and Transcoding
Now that we have covered what media files consist of, how are these files created?
Transmuxing is the act of moving elementary streams from one container format to another without manipulating the streams themselves. All audio and video data remains unchanged.
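For example, here is a minimal sketch of a transmux using ffmpeg (file names are placeholders): the elementary streams are copied (-c copy) from an MPEG2-TS container into an MP4 container without re-encoding. Depending on the source, an additional bitstream filter may be required.
ffmpeg -i <source_file>.ts -c copy <output_file>.mp4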
Video Stream Structure
As previously stated, video is represented as a sequence of frames over time, and consecutive frames within a scene tend to be similar to one another.
A GOP:
● is a self-contained, decodable sequence of frames.
● is normally a short sequence of pictures.
● begins with a key frame and ends before the next key frame.
If I-frames are so complete, why is there a need for P- or B-frames? Media consisting of all I-frames would be very large, rendering it impractical for mass consumption. B- and P-frames are a fraction of the size of an I-frame, since they only encode the differences between pictures.
GOP length (or size) is the number of frames in the GOP. A longer GOP is more efficient (a larger grouping of similar pictures) but provides fewer seek or adaptation points.
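As an illustrative calculation (the frame rate and interval are assumptions, not requirements): a 24 fps source with a 3-second key frame interval yields a GOP of 24 x 3 = 72 frames, which is why the ffmpeg examples in this document use:
-keyint_min 72 -g 72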
Encoding media
Media is not uniform. For example, a movie contains many kinds of scenes - action sequences, dialogue, romance and more. The amount of data required to encode a scene depends on how much the frames change from one to the next (complexity). An action sequence requires more data to encode correctly than a scene of a sleeping baby.
There are many considerations for encoding; however, the primary decision comes down to bitrate.
Constant Bitrate (CBR)
CBR refers to content that is encoded at a specific bitrate, uniformly across its entirety.
For a high-complexity scene, a CBR configuration forces the codec to compromise on scene quality whenever the data required to render the scene exceeds the CBR limit. The scene is too complex to store within the allowed amount of information, resulting in graininess, artifacts or skipping.
Variable Bitrate (VBR)
VBR allows the codec to use fewer bits where they are not needed, saving them for more complex scenes.
The basic principle is to set a target bitrate and allow some level of variance over the length of the media, so that the overall target bitrate is still achieved.
What happens when the bitrate exceeds the target for extended periods of time? The end-user device will be unable to display the frames correctly, since it expects X bitrate but receives X+Y bitrate instead. This can show up as stuttering and artifacts (pixelation).
To work around these limitations, a video buffer verifier (VBV) is specified. The VBV manages
VBR variance. It specifies a maximum bitrate over a rolling buffer.
In our tests, for the majority of current devices, the VBV should be set to twice the target bitrate.
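As a sketch of this guidance (the 4 Mbps target and the H264 encoder are illustrative assumptions), the VBV cap can be expressed in ffmpeg by setting -maxrate and -bufsize to twice the target bitrate:
ffmpeg -i <source_file> \
 -c:v libx264 -b:v 4M -maxrate 8M -bufsize 8M \
 <output_file>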
Aspect Ratio
Aside from bitrate, there are other parameters that control how pictures are displayed - most notably the aspect ratio, which preserves the original picture and prevents visual distortion.
There are three distinct aspect ratios - Sample Aspect Ratio (SAR), Pixel Aspect Ratio (PAR) and Display Aspect Ratio (DAR).
The display aspect ratio is the one most commonly understood and referenced when discussing aspect ratios. It reflects what we see in media; terms like 4:3 or 16:9 refer to the display aspect ratio. A common method is to define the display aspect ratio (DAR) and let your encoder software calculate the SAR and PAR accordingly.
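As a minimal sketch (the resolution and ratio are illustrative), ffmpeg's setdar filter pins the display aspect ratio and lets the encoder derive the matching sample aspect ratio:
ffmpeg -i <source_file> \
 -vf "scale=720:480,setdar=16/9" \
 -c:v libx264 <output_file>
Here the 720x480 storage resolution is presented at 16:9 because the encoder writes the corresponding non-square sample aspect ratio into the stream.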
Using encoding profiles
Encoding profiles refer to a set of parameters that generates media with specific properties. For example, encoding profiles for SD and HD content would not be identical, as there would be, at minimum, a change in the display resolution.
These profiles vary per codec. Every codec defines its own specification and features - for example, H264 uses Baseline, Main and High profiles. Even more importantly, encoding profile support varies from device to device. Therefore, it is paramount to ensure that a device is capable of playback for any given encoding profile.
Codec   Profiles
AAC     AAC-LC
        AAC-HE (SBR - spectral band replication)
        AAC-HEv2 (SBR and PS - parametric stereo)
VP9     Profile 0 (8-bit)
        Profile 1 (8-bit)
        Profile 2 (10-bit, 12-bit)
        Profile 3 (10-bit, 12-bit)
Best Practices
This section outlines a series of recommendations and best practices, from encoding to encryption to enabling playback on various client platforms. These recommendations serve as guidance on the design and implementation of EME, CENC and DASH support from Widevine, for both server and client pieces. To ensure maximum playback compatibility across all client platforms, the lowest common denominator for encoding profiles should be used.
The table below illustrates the most common audio and video codec combinations for various video resolutions:
General recommendations
● Use variable bitrate (VBR) tracks with a reasonable video buffer verifier (VBV) value.
○ VBV = 1.5 - 2x target bitrate.
● All files must have closed GOPs and identical IDR frame structure.
● IDR frame separation – approx 3 seconds (lowest value consistent with good image
quality).
● Key frames must be at the same exact intervals across all track types.
● DASH REQUIREMENT - 'moov' atom should immediately follow the 'ftyp' atom.
● ISO-BMFF Chunks should contain no more than 1 second worth of sample data.
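As a sketch of how these recommendations might translate into an H264 encode with ffmpeg (the 720p resolution, 3 Mbps target and 24 fps source are illustrative assumptions, not requirements):
ffmpeg -i <source_file> \
 -vf "scale=1280:trunc(ow/a/2)*2" \
 -c:v libx264 -profile:v main \
 -keyint_min 72 -g 72 -sc_threshold 0 \
 -b:v 3M -maxrate 6M -bufsize 6M \
 -an -movflags +faststart \
 -f mp4 <output_file>
Here -keyint_min and -g fix the IDR interval at 72 frames (3 seconds at 24 fps), -sc_threshold 0 keeps key frames on that exact cadence so tracks stay aligned, and -movflags +faststart places the 'moov' atom at the front of the file.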
Ffmpeg parameters
Parameter              Description
-vf                    The options used here will resize the video to the desired resolution while maintaining the original aspect ratio
-movflags +faststart   Optimizes the output MP4 file format for streaming
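For HEVC, a comparable sketch (again with illustrative resolution and bitrate values; the keyint settings mirror the H264 example above):
ffmpeg -i <source_file> \
 -vf "scale=1920:trunc(ow/a/2)*2" \
 -c:v libx265 -profile:v main \
 -x265-params "keyint=72:min-keyint=72:scenecut=0" \
 -b:v 5M -maxrate 10M -bufsize 10M \
 -an -tag:v hvc1 -movflags +faststart \
 -f mp4 <output_file>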
Ffmpeg parameters
Parameter              Description
-vf                    The options used here will resize the video to the desired resolution while maintaining the original aspect ratio
-movflags +faststart   Optimizes the output MP4 file format for streaming
ffmpeg -i <source_file> \
-vf "scale=1280:trunc(ow/a/2)*2" \
-c:v libvpx-vp9 -keyint_min 72 -g 72 -profile:v 0 \
-threads 4 -tile-columns 6 -frame-parallel 1 \
-speed 1 -auto-alt-ref 1 -lag-in-frames 25 \
-an -minrate 4M -maxrate 4M -bufsize 4M -b:v 4M \
-f webm -dash 1 <output_file>
Ffmpeg parameters
Parameter              Description
-vf                    The options used here will resize the video to the desired resolution while maintaining the original aspect ratio
Content Encryption
Encryption best practices can be distilled into a simple statement: separate keys should be used across different types of content (audio, video, resolution).
Encryption recommendations, from least secure to most secure:
● Separate content key for audio tracks, and a separate content key for each video resolution group (SD, HD, UHD).
● Separate content key for audio tracks, and a separate content key for each video track.
The recommended security level setting for VIDEO tracks is SW_SECURE_DECODE. The only supported security level setting for AUDIO tracks is SW_SECURE_CRYPTO. Security level settings are specified by your license proxy implementation on a per-track basis. The table below provides the recommended security level settings per Chrome platform.
Widevine provides a reference open-source CENC packaging solution - Shaka Packager. The GitHub page provides documentation on how to build and use the packager. We highly recommend joining the GitHub user group to keep up with the latest information and to ask questions.
The following sections cover how to install and use the Shaka Packager.
Install dependencies
The Shaka Packager supports a variety of OSes; this section of the document focuses on Ubuntu Linux (14.04 or higher). For the sake of simplicity, all commands are executed as user=root.
To ensure you have the basic build environment, run the following commands as user=root:
$ git clone https://chromium.googlesource.com/chromium/tools/depot_tools.git
$ export PATH=$PATH:`pwd`/depot_tools
$ mkdir shaka-packager
$ cd shaka-packager
$ gclient config https://www.github.com/google/shaka-packager.git --name=src
$ gclient sync
$ ll src/
6. Build using ninja to create the reference binaries for use. This takes a while, grab a cup
of coffee.
$ ninja -C src/out/Release
$ cd src/out/Release && ls
For ease of use, you can add the Release directory to your PATH or copy the 2 binaries to a
directory in your PATH:
$ export PATH=$PATH:$HOME/shaka-packager/src/out/Release
$ packager --help
$ mpd_generator --help
Use the --dump_stream_info flag to analyze your audio and video streams.
Example
$ packager input=~/llama_h264_main_480p_1000.mp4 --dump_stream_info
[0921/220412:INFO:demuxer.cc(58)] Initialize Demuxer for file
'llama_h264_main_480p_1000.mp4'.
File "llama_h264_main_480p_1000.mp4":
Found 1 stream(s).
Stream [0] type: Video
codec_string: avc1.4d401f
time_scale: 12288
duration: 1843712 (150.0 seconds)
is_encrypted: false
codec: H264
width: 858
height: 482
pixel_aspect_ratio: 3856:3861
trick_play_rate: 0
nalu_length_size: 4
Encrypt content
Shaka Packager allows for single and multi-file encryption. It’s optimal to package all your
content in a single command - encrypt all tracks and generate the MPD.
To encrypt using the Widevine Cloud License Service, you will need:
a. A Content ID - a value that identifies the media that is being packaged.
Provider = widevine_test
IV = d58ce954203b7c9a9a9d467f59839249
KEY = 1ae8ccd0e7985cc0b6203a55855a1034afc252980e970ca90e5202689f947ab9
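A sketch of a single packaging command using these widevine_test credentials is shown below; the input file name, content ID, output names and MPD name are illustrative placeholders, and the key server URL shown is the UAT endpoint commonly used with the widevine_test account - substitute your own production values.
$ packager \
  input=tears.mp4,stream=audio,output=enc_tears_audio.mp4 \
  input=tears.mp4,stream=video,output=enc_tears_video.mp4 \
  --enable_widevine_encryption \
  --key_server_url https://license.uat.widevine.com/cenc/getcontentkey/widevine_test \
  --content_id 7465737420636f6e74656e74206964 \
  --signer widevine_test \
  --aes_signing_key 1ae8ccd0e7985cc0b6203a55855a1034afc252980e970ca90e5202689f947ab9 \
  --aes_signing_iv d58ce954203b7c9a9a9d467f59839249 \
  --mpd_output enc_tears.mpd
This encrypts the audio and video tracks and generates the MPD in a single pass; the resulting MPD resembles the sample below.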
Sample MPD
<cenc:pssh>AAAAaHBzc2gAAAAA7e+LqXnWSs6jyCfc1R0h7QAAAEgIARIQJPZ2phZEUvO+FnPYDjUXwBIQW2j5Y6rUXreKuHqFkSvFSxIQgV87O8FlVBWIa+a4N/dmPxoFY3dpcDEiBRI0VmeIMgA=</cenc:pssh>
</ContentProtection>
<BaseURL>enc_tears_audio.mp4</BaseURL>
<SegmentBase indexRange="1001-1920" timescale="44100">
<Initialization range="0-1000"/>
</SegmentBase>
</Representation>
</AdaptationSet>
<AdaptationSet id="1" contentType="video" width="720" height="300"
frameRate="12288/512" par="12:5">
<Representation id="1" bandwidth="686572" codecs="avc1.42c01e"
mimeType="video/mp4" sar="1:1">
<ContentProtection value="cenc"
schemeIdUri="urn:mpeg:dash:mp4protection:2011"
cenc:default_KID="24f676a6-1644-52f3-be16-73d80e3517c0"/>
<ContentProtection
schemeIdUri="urn:uuid:edef8ba9-79d6-4ace-a3c8-27dcd51d21ed">
<cenc:pssh>AAAAaHBzc2gAAAAA7e+LqXnWSs6jyCfc1R0h7QAAAEgIARIQJPZ2phZEUvO+FnPYDjUXwBIQW2j5Y6rUXreKuHqFkSvFSxIQgV87O8FlVBWIa+a4N/dmPxoFY3dpcDEiBRI0VmeIMgA=</cenc:pssh>
</ContentProtection>
<BaseURL>enc_tears_video.mp4</BaseURL>
<SegmentBase indexRange="1127-1902" timescale="12288">
Content Playback