Convert and distribute your videos with AWS Elemental

Imagine this scenario: You are in possession of some videos that you want to distribute or share.

Your use case could be as simple as needing to share a lower resolution of a video taken on your smartphone or camera. Or your use case could be as complex as sharing production-quality copies of the latest movie you are producing.

For both cases, you can use Serverless Cloud Products to address your needs. AWS Elemental MediaConvert, one of the Media tools from the AWS Elemental family, will allow you to solve your problem.

Or perhaps, you want to distribute your content to viewers around the globe, allowing them to watch on any device? Then you can use another Serverless Cloud Product from the AWS Elemental family: AWS Elemental MediaPackage.

Services used

Video files are complex. You need to worry about image and audio, which means you need to worry about container, codecs, bitrate, pixel aspect ratio, and more.

MediaConvert

MediaConvert allows you to transcode file-based content. This means you can transform a video file into a different format and size.

But MediaConvert is more than just transformation. We won’t go into the details of every feature in this article, but here are some notable ones:

Watermarking
Graphic overlay (static or motion)
Select parts (time or size) of an Input
Rotation
Deinterlacing
And some more…

MediaPackage

AWS Elemental MediaPackage is a Just-In-Time (JIT) media packager for your existing assets. It will generate the relevant manifest for a group of video sources.

It not only allows you to define multiple qualities of the same video but also allows you to add multiple audio sources and different video sources, like camera angles.

For our use case, we are only interested in providing different qualities of the same content and only a single audio track.

MediaPackage uses the files created by MediaConvert and generates, on the fly, a manifest in either HLS or Dash to be consumed by the player.

Simple Use case: Reduce and Convert for mobile sharing

You created an FHD (1920x1080) video with your camera. Your camera creates movies in uncompressed QuickTime format. Most devices won’t be able to read this format unless they have the right codecs installed.

To allow recipients to play your video, you need to convert it to an MP4 container with the H.264 codec (the de facto standard for web distribution). You will also want to reduce the file size to improve the download speed.

Upload your source file to S3
Trigger a MediaConvert job using this file
Convert to an MP4 Container
Convert to H.264 codec
Reduce the size to 1280x720
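
As a rough illustration, the steps above map onto a MediaConvert job definition along the following lines. This is a minimal sketch, not the job used for this article: the `build_simple_job` helper, the S3 URIs, and the role ARN are hypothetical, the 3 Mbps bitrate is an arbitrary placeholder, and only a small subset of the available settings is shown.

```python
def build_simple_job(source_uri, destination_uri, role_arn):
    """Build a request body for one MP4/H.264 output scaled to 1280x720."""
    return {
        "Role": role_arn,  # IAM role MediaConvert assumes (hypothetical ARN)
        "Settings": {
            "Inputs": [
                {
                    "FileInput": source_uri,
                    "AudioSelectors": {
                        "Audio Selector 1": {"DefaultSelection": "DEFAULT"}
                    },
                }
            ],
            "OutputGroups": [
                {
                    "Name": "File Group",
                    "OutputGroupSettings": {
                        "Type": "FILE_GROUP_SETTINGS",
                        "FileGroupSettings": {"Destination": destination_uri},
                    },
                    "Outputs": [
                        {
                            # Convert to an MP4 container
                            "ContainerSettings": {"Container": "MP4"},
                            "VideoDescription": {
                                # Reduce the size to 1280x720
                                "Width": 1280,
                                "Height": 720,
                                # Convert to the H.264 codec
                                "CodecSettings": {
                                    "Codec": "H_264",
                                    "H264Settings": {
                                        "RateControlMode": "CBR",
                                        "Bitrate": 3000000,  # placeholder value
                                    },
                                },
                            },
                            "AudioDescriptions": [
                                {
                                    "AudioSourceName": "Audio Selector 1",
                                    "CodecSettings": {
                                        "Codec": "AAC",
                                        "AacSettings": {
                                            "Bitrate": 160000,
                                            "CodingMode": "CODING_MODE_2_0",
                                            "SampleRate": 48000,
                                        },
                                    },
                                }
                            ],
                        }
                    ],
                }
            ],
        },
    }


job = build_simple_job(
    "s3://example-incoming/video.mov",      # hypothetical source
    "s3://example-outgoing/video-720p",     # hypothetical destination
    "arn:aws:iam::123456789012:role/ExampleMediaConvertRole",
)
```

A dictionary like this could then be handed to the MediaConvert `create_job` call of the AWS SDK, for example `client.create_job(**job)` with boto3.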

Result

Both outputs were created in a single MediaConvert job that took 17 seconds.

More complex use case: Create content for web distribution

In this use case, we want to distribute a 4K (3840x2160), 90-minute movie. We want our viewers to enjoy our content on any type of screen: from small smartphones to big TV screens.

We also want them to enjoy it regardless of their network connectivity.

We need to be bandwidth conscious and not ship more than the viewer can consume.

MP4 container and H.264 Codec is a combination that can be viewed by most media players (smartphones, TV, Set-top boxes, Gaming devices, …). We will use this format to distribute our content.

Screen size consideration

We pre-render our content to adapt to our viewers’ screen size. Reducing the dimensions also allows us to reduce the bandwidth needed.

Unless mentioned otherwise, we will use H.264 and the source frame rate.

SD: 480 x 270, 400kbps, 15fps
SD: 640 x 360, 700kbps
SD: 854 x 480, 1Mbps
HD: 1280 x 720, 3.5Mbps
FHD: 1920 x 1080, 6Mbps
4K: 3840 x 2160, 20Mbps, H.265
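
To get a feel for what these bitrates mean in practice, the following sketch estimates how much data each rendition represents for a 90-minute movie. It is back-of-the-envelope arithmetic only: it treats the listed bitrate as a constant average, counts video only, and ignores audio and container overhead. The `stream_size_gb` helper is a hypothetical name.

```python
MOVIE_SECONDS = 90 * 60  # 90-minute movie

# Video bitrates from the list above, in bits per second
LADDER_BPS = {
    "480x270": 400_000,
    "640x360": 700_000,
    "854x480": 1_000_000,
    "1280x720": 3_500_000,
    "1920x1080": 6_000_000,
    "3840x2160": 20_000_000,
}


def stream_size_gb(bitrate_bps, duration_s):
    """Approximate payload size in gigabytes (10^9 bytes) at a constant bitrate."""
    return bitrate_bps * duration_s / 8 / 1e9


for name, bps in LADDER_BPS.items():
    print(f"{name}: {stream_size_gb(bps, MOVIE_SECONDS):.2f} GB")
```

Under these assumptions, the 6 Mbps FHD rendition comes to about 4 GB and the 20 Mbps 4K rendition to about 13.5 GB, while the smallest rendition stays below 0.3 GB. This is why shipping only what the viewer can consume matters.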

Segmented videos: HLS and Dash

If we distribute a single file, the client would be stuck on a single quality. Without tweaks on the player side, the client would download the whole file, even the parts that aren’t watched.

To allow a streaming-like experience, we will use two distribution formats: HLS (HTTP Live Streaming) and Dash.

Both formats are very similar in concept, but the choice between them depends on the player's OS. For simplification, one could say that HLS is for Apple devices and Dash for others, but in reality, it’s slightly more complex.

The idea behind these formats is to split the movie into little chunks of a few seconds and define the structure through a manifest file.
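
To make the idea of a manifest more concrete, here is a minimal sketch that builds an HLS multivariant (master) playlist by hand. In this pipeline the real manifests are generated by MediaPackage; the `build_master_playlist` helper and the rendition URIs are hypothetical and only illustrate the shape of such a file.

```python
def build_master_playlist(renditions):
    """Return an HLS multivariant playlist listing one variant per rendition."""
    lines = ["#EXTM3U"]
    for r in renditions:
        # Each variant is announced with its bandwidth and resolution,
        # followed by the URI of that rendition's own media playlist.
        lines.append(
            f"#EXT-X-STREAM-INF:BANDWIDTH={r['bitrate']},"
            f"RESOLUTION={r['width']}x{r['height']}"
        )
        lines.append(r["uri"])
    return "\n".join(lines) + "\n"


playlist = build_master_playlist(
    [
        {"bitrate": 6000000, "width": 1920, "height": 1080, "uri": "1080p/index.m3u8"},
        {"bitrate": 3500000, "width": 1280, "height": 720, "uri": "720p/index.m3u8"},
        {"bitrate": 1000000, "width": 854, "height": 480, "uri": "480p/index.m3u8"},
    ]
)
print(playlist)
```

Each referenced media playlist would in turn list the individual segments of a few seconds for that quality.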

By doing this, the player will download little chunks of content in the dimensions appropriate for the screen and the available bandwidth. By downloading only a few segments ahead of the current timestamp, it will use only the bandwidth needed for what is really watched and adapt playback to the current conditions of the player (phone rotated, window resized, bandwidth alterations, …).
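
The selection logic a player applies can be sketched as follows. This is a deliberately simplified heuristic, not the algorithm of any specific player: the `pick_rendition` function, its `safety` margin of 0.8, and the measurement inputs are all assumptions made for illustration.

```python
# Renditions from the list above: (height in pixels, bitrate in bits per second)
LADDER = [
    {"height": 270, "bitrate": 400_000},
    {"height": 360, "bitrate": 700_000},
    {"height": 480, "bitrate": 1_000_000},
    {"height": 720, "bitrate": 3_500_000},
    {"height": 1080, "bitrate": 6_000_000},
    {"height": 2160, "bitrate": 20_000_000},
]


def pick_rendition(renditions, bandwidth_bps, screen_height, safety=0.8):
    """Pick the best rendition that fits both the screen and the bandwidth.

    Only a fraction (`safety`) of the measured bandwidth is used so that
    small throughput drops do not immediately cause buffering.
    """
    budget = bandwidth_bps * safety
    candidates = [
        r for r in renditions
        if r["bitrate"] <= budget and r["height"] <= screen_height
    ]
    if not candidates:
        # Nothing fits: fall back to the lightest rendition rather than stall.
        return min(renditions, key=lambda r: r["bitrate"])
    return max(candidates, key=lambda r: r["bitrate"])


# Re-evaluated before each segment download, so quality follows conditions.
choice = pick_rendition(LADDER, bandwidth_bps=5_000_000, screen_height=1080)
print(choice)
```

Because the decision is repeated for every segment, a change in bandwidth or window size only affects the segments that have not been downloaded yet.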

By leveraging HTTP as a delivery mechanism, we not only rely on a universally approved protocol but also allow caching at the CDN level, improving distribution around the globe.

Generate HLS and Dash Segments

MediaConvert can generate segmented videos and store them in S3. But with everything generated statically once, you lose flexibility. There are multiple reasons you may want a more dynamic way to generate your manifests:

Removing some renditions based on client attributes: paid tiers, legal constraints in some countries
Ordering of renditions: Improve start time for some players
Cost of storing never accessed renditions
DRM protection
Additional audio or video tracks
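
The first two reasons can be illustrated with a small sketch that decides, per request, which renditions a manifest should expose and in which order. The tier names, the height limits, and the `renditions_for_client` helper are hypothetical; in the solution described here, this kind of filtering is configured in MediaPackage rather than hand-coded.

```python
# Hypothetical mapping of subscription tier to maximum allowed video height
TIER_MAX_HEIGHT = {"free": 720, "premium": 2160}

RENDITIONS = [
    {"height": 2160, "bitrate": 20_000_000},
    {"height": 1080, "bitrate": 6_000_000},
    {"height": 720, "bitrate": 3_500_000},
    {"height": 480, "bitrate": 1_000_000},
    {"height": 360, "bitrate": 700_000},
    {"height": 270, "bitrate": 400_000},
]


def renditions_for_client(renditions, tier, lowest_first=False):
    """Filter renditions by tier and order them for the manifest.

    Listing the lowest bitrate first can shorten start-up time on players
    that begin playback with the first variant in the manifest.
    """
    max_height = TIER_MAX_HEIGHT[tier]
    allowed = [r for r in renditions if r["height"] <= max_height]
    return sorted(allowed, key=lambda r: r["bitrate"], reverse=not lowest_first)


print([r["height"] for r in renditions_for_client(RENDITIONS, "free")])
print([r["height"] for r in renditions_for_client(RENDITIONS, "premium", lowest_first=True)])
```

With statically generated manifests, each of these combinations would have to be rendered and stored ahead of time.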

The solution

Source Code: Github - Serverless-Guru - Templates SLS-Mediapipeline

Configuration

MediaConvert outputs

Audio: AAC, 160kbps, 48kHz

  
"AudioDescriptions": [
    {
        "AudioSourceName": "Audio Selector 1",
        "AudioType": 0,
        "AudioTypeControl": "FOLLOW_INPUT",
        "Codec": "AAC",
        "CodecSettings": {
            "AacSettings": {
                "AudioDescriptionBroadcasterMix": "NORMAL",
                "Bitrate": 160000,
                "CodecProfile": "LC",
                "CodingMode": "CODING_MODE_2_0",
                "RateControlMode": "CBR",
                "RawFormat": "NONE",
                "SampleRate": 48000,
                "Specification": "MPEG4"
            }
        },
        "LanguageCodeControl": "FOLLOW_INPUT"
    }
]

Video: MP4, H.264/H.265, Quality-based bitrate

Bitrate and dimensions are replaced for each output quality.

The GopSize of 2 seconds is important: it makes it possible to “cut” the MP4 on keyframes at every multiple of 2 seconds, providing a fast and safe way to generate segments from the source file.

  
{
    "ContainerSettings": {
        "Container": "MP4",
        "Mp4Settings": {
            "CslgAtom": "INCLUDE",
            "FreeSpaceBox": "EXCLUDE",
            "MoovPlacement": "PROGRESSIVE_DOWNLOAD"
        }
    },
    "VideoDescription": {
        "AfdSignaling": "NONE",
        "AntiAlias": "ENABLED",
        "Height": 1080,
        "Width": 1920,
        "CodecSettings": {
            "Codec": "H_264",
            "H264Settings": {
                "AdaptiveQuantization": "HIGH",
                "CodecLevel": "LEVEL_4_2",
                "CodecProfile": "HIGH",
                "EntropyEncoding": "CABAC",
                "FieldEncoding": "PAFF",
                "FlickerAdaptiveQuantization": "ENABLED",
                "FramerateControl": "SPECIFIED",
                "FramerateConversionAlgorithm": "DUPLICATE_DROP",
                "FramerateDenominator": 1001,
                "FramerateNumerator": 30000,
                "GopBReference": "DISABLED",
                "GopClosedCadence": 1,
                "GopSize": 2,
                "GopSizeUnits": "SECONDS",
                "HrdBufferInitialFillPercentage": 90,
                "HrdBufferSize": 12000000,
                "InterlaceMode": "PROGRESSIVE",
                "MaxBitrate": 6000000,
                "MinIInterval": 0,
                "NumberBFramesBetweenReferenceFrames": 1,
                "NumberReferenceFrames": 3,
                "ParControl": "SPECIFIED",
                "ParDenominator": 1,
                "ParNumerator": 1,
                "QualityTuningLevel": "SINGLE_PASS_HQ",
                "QvbrSettings": {
                    "QvbrQualityLevel": 8
                },
                "RateControlMode": "QVBR",
                "RepeatPps": "DISABLED",
                "SceneChangeDetect": "ENABLED",
                "Slices": 1,
                "SlowPal": "DISABLED",
                "Softness": 0,
                "SpatialAdaptiveQuantization": "ENABLED",
                "Syntax": "DEFAULT",
                "Telecine": "NONE",
                "TemporalAdaptiveQuantization": "ENABLED",
                "UnregisteredSeiTimecode": "DISABLED"
            }
        },
        "ColorMetadata": "INSERT",
        "DropFrameTimecode": "ENABLED",
        "RespondToAfd": "NONE",
        "ScalingBehavior": "STRETCH_TO_OUTPUT",
        "Sharpness": 50,
        "TimecodeInsertion": "DISABLED"
    }
}
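
The role of the 2-second GOP in the settings above can be checked with a little arithmetic. The sketch below assumes the frame rate from the configuration (30000/1001) and the segment durations of 6 and 2 seconds used in this article's packaging configuration; the helper names are hypothetical.

```python
from fractions import Fraction


def frames_per_gop(framerate_numerator, framerate_denominator, gop_seconds):
    """Number of frames covered by one GOP, as an exact fraction."""
    return Fraction(framerate_numerator, framerate_denominator) * gop_seconds


def gops_per_segment(segment_seconds, gop_seconds):
    """How many whole GOPs fit in one segment.

    Raises ValueError when the segment duration is not a multiple of the
    GOP duration, because the cut would then not land on a keyframe.
    """
    count, remainder = divmod(segment_seconds, gop_seconds)
    if remainder:
        raise ValueError(
            f"{segment_seconds}s segments do not align with {gop_seconds}s GOPs"
        )
    return count


print(float(frames_per_gop(30000, 1001, 2)))  # roughly 59.94 frames per GOP
print(gops_per_segment(6, 2))  # GOPs in a 6-second segment
print(gops_per_segment(2, 2))  # GOPs in a 2-second segment
```

Both segment durations are multiples of 2 seconds, so every segment boundary coincides with a keyframe. A 5-second segment, by contrast, would not align.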

  
[
  {
    "Width": 3840,
    "Height": 2160,
    "Bitrate": 20000000,
    "Profile": "MAIN-MAIN",
    "Level": "AUTO",
    "Codec": "H_265"
  },
  {
    "Width": 1920,
    "Height": 1080,
    "Bitrate": 6000000,
    "Profile": "HIGH",
    "Level": "LEVEL_4_2"
  },
  {
    "Width": 1280,
    "Height": 720,
    "Bitrate": 3500000,
    "Profile": "HIGH",
    "Level": "LEVEL_4_2"
  },
  {
    "Width": 854,
    "Height": 480,
    "Bitrate": 1000000,
    "Profile": "MAIN",
    "Level": "LEVEL_3_1"
  },
  {
    "Width": 640,
    "Height": 360,
    "Bitrate": 700000,
    "Profile": "MAIN",
    "Level": "LEVEL_3_1"
  },
  {
    "Width": 480,
    "Height": 270,
    "Bitrate": 400000,
    "FramerateNumerator": 15000,
    "Profile": "MAIN",
    "Level": "LEVEL_3_1"
  }
]
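
Replacing bitrate and dimensions for each output quality could look roughly like the sketch below. It is not the code from the linked repository: the `build_output` helper is hypothetical, the template is a shortened version of the one above, and two mappings are assumptions, namely that the list's "Bitrate" feeds `MaxBitrate` (the template uses QVBR) and that `HrdBufferSize` stays at twice the bitrate, as in the template. The H.265 entry would need its own settings block and is left out.

```python
import copy

# Shortened version of the video output template shown above
TEMPLATE = {
    "ContainerSettings": {"Container": "MP4"},
    "VideoDescription": {
        "Width": 1920,
        "Height": 1080,
        "CodecSettings": {
            "Codec": "H_264",
            "H264Settings": {
                "CodecLevel": "LEVEL_4_2",
                "CodecProfile": "HIGH",
                "FramerateDenominator": 1001,
                "FramerateNumerator": 30000,
                "HrdBufferSize": 12000000,
                "MaxBitrate": 6000000,
                "RateControlMode": "QVBR",
            },
        },
    },
}


def build_output(template, rendition):
    """Return a copy of the template adjusted for one H.264 rendition."""
    output = copy.deepcopy(template)  # never mutate the shared template
    video = output["VideoDescription"]
    video["Width"] = rendition["Width"]
    video["Height"] = rendition["Height"]

    h264 = video["CodecSettings"]["H264Settings"]
    h264["MaxBitrate"] = rendition["Bitrate"]         # assumption: Bitrate -> MaxBitrate
    h264["HrdBufferSize"] = 2 * rendition["Bitrate"]  # assumption: keep the 2x ratio
    h264["CodecProfile"] = rendition["Profile"]
    h264["CodecLevel"] = rendition["Level"]
    if "FramerateNumerator" in rendition:
        h264["FramerateNumerator"] = rendition["FramerateNumerator"]
    return output


small = build_output(
    TEMPLATE,
    {
        "Width": 480,
        "Height": 270,
        "Bitrate": 400000,
        "FramerateNumerator": 15000,
        "Profile": "MAIN",
        "Level": "LEVEL_3_1",
    },
)
```

Working on a deep copy keeps the template reusable, so the same function can be applied to every entry of the list in turn.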

MediaPackage packaging groups

HLS

For HLS, we create segments of 6 seconds (industry standard)

  
Type: AWS::MediaPackage::PackagingConfiguration
Properties:
  Id: hls
  PackagingGroupId: !Ref MediaPackagePackagingGroup
  HlsPackage:
    HlsManifests:
      - ManifestName: index
        IncludeIFrameOnlyStream: true
        StreamSelection:
          StreamOrder: VIDEO_BITRATE_DESCENDING
    SegmentDurationSeconds: 6
    UseAudioRenditionGroup: false

Dash

For Dash, we create segments of 2 seconds (industry standard)

  
Type: AWS::MediaPackage::PackagingConfiguration
Properties:
  Id: dash
  PackagingGroupId: !Ref MediaPackagePackagingGroup
  DashPackage:
    DashManifests:
      - ManifestName: index
        ManifestLayout: FULL
        MinBufferTimeSeconds: 30
        StreamSelection:
          StreamOrder: VIDEO_BITRATE_DESCENDING
    SegmentDurationSeconds: 2
    SegmentTemplateFormat: NUMBER_WITH_TIMELINE

Incoming Bucket

Source files are uploaded to S3
A rule on EventBridge listens to “Object Created” events and triggers the execution of the “Conversion” Step Function.
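
A minimal sketch of this trigger, assuming the bucket has EventBridge notifications enabled: the event pattern below matches S3 “Object Created” events for one bucket, and the helper extracts the source location from such an event. The bucket name and the `source_uri_from_event` function are hypothetical.

```python
# Event pattern for the EventBridge rule (bucket name is a placeholder)
EVENT_PATTERN = {
    "source": ["aws.s3"],
    "detail-type": ["Object Created"],
    "detail": {"bucket": {"name": ["example-incoming-bucket"]}},
}


def source_uri_from_event(event):
    """Build the S3 URI of the uploaded file from an S3 EventBridge event."""
    detail = event["detail"]
    return f"s3://{detail['bucket']['name']}/{detail['object']['key']}"


# Shortened example of the event delivered to the rule's target
sample_event = {
    "source": "aws.s3",
    "detail-type": "Object Created",
    "detail": {
        "bucket": {"name": "example-incoming-bucket"},
        "object": {"key": "uploads/my_movie.mov", "size": 26214400},
    },
}

print(source_uri_from_event(sample_event))
```

The resulting URI is what the first step of the Step Function needs in order to locate and analyze the source file.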

Conversion Step Function

Using a Lambda function and https://ffmpeg.org/, the source file is analyzed with ffprobe:
  Bitrate
  Dimensions
  Codecs
The file name is parsed to provide a movie name
  A naming convention could be used to interact with IMDb to gather more information
An entry is created in DynamoDB
The video is sent to Rekognition to extract labels, persons, … (not implemented for this article)
A Lambda function builds the outputs based on the source file definition and triggers MediaConvert
  We don’t create a rendition bigger than the source
  We don’t create renditions with a higher bitrate than the source
  Generate video stills to be used as covers
An EventBridge rule listens for completed MediaConvert jobs and triggers the “Packaging” Step Function
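
The analysis and the two rendition rules above can be sketched as follows. This is an illustration under assumptions, not the repository's code: it presumes an `ffprobe` binary is available to the function, and the helper names (`run_ffprobe`, `summarize`, `filter_ladder`) as well as the sample values are hypothetical.

```python
import json
import subprocess


def run_ffprobe(path):
    """Run ffprobe on a local file and return its JSON report as a dict."""
    result = subprocess.run(
        ["ffprobe", "-v", "error", "-print_format", "json",
         "-show_format", "-show_streams", path],
        capture_output=True, text=True, check=True,
    )
    return json.loads(result.stdout)


def summarize(probe):
    """Extract bitrate, dimensions, and codec from an ffprobe report."""
    video = next(s for s in probe["streams"] if s["codec_type"] == "video")
    return {
        "width": int(video["width"]),
        "height": int(video["height"]),
        "codec": video["codec_name"],
        # ffprobe reports bit rates as strings
        "bitrate": int(probe["format"]["bit_rate"]),
    }


def filter_ladder(ladder, source):
    """Keep only renditions that are no larger and no heavier than the source."""
    return [
        r for r in ladder
        if r["Width"] <= source["width"]
        and r["Height"] <= source["height"]
        and r["Bitrate"] <= source["bitrate"]
    ]


# Made-up report for an FHD source, in the shape ffprobe produces
sample_probe = {
    "streams": [
        {"codec_type": "video", "codec_name": "h264", "width": 1920, "height": 1080},
        {"codec_type": "audio", "codec_name": "aac"},
    ],
    "format": {"bit_rate": "18000000"},
}
ladder = [
    {"Width": 3840, "Height": 2160, "Bitrate": 20000000},
    {"Width": 1920, "Height": 1080, "Bitrate": 6000000},
    {"Width": 1280, "Height": 720, "Bitrate": 3500000},
]
source = summarize(sample_probe)
print(filter_ladder(ladder, source))
```

For an FHD source, the 4K entry is dropped and only renditions at or below the source remain, which avoids paying for upscaled outputs that add no quality.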

Packaging Step Function

Store still information in DynamoDB
  Images can be used by a CRM as video covers
Store rendition information in DynamoDB
  These files can be used for download (offline viewing)
A Lambda function creates a SMIL manifest
A Lambda function creates a package using the manifest and the pre-defined HLS and Dash outputs
The URLs are stored in DynamoDB
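
The SMIL manifest is a small XML file that lists the renditions belonging to one asset. The sketch below shows one way to generate it. The `build_smil` helper and the file names are hypothetical, and the element layout (`smil`, `body`, `switch`, `video` with a `name` attribute) is an assumption to verify against the MediaPackage VOD documentation before relying on it.

```python
import xml.etree.ElementTree as ET


def build_smil(video_files):
    """Return a SMIL document listing one <video> entry per rendition file."""
    smil = ET.Element("smil")
    body = ET.SubElement(smil, "body")
    switch = ET.SubElement(body, "switch")
    for name in video_files:
        # File names are relative to the location of the manifest itself.
        ET.SubElement(switch, "video", {"name": name})
    return (
        '<?xml version="1.0" encoding="utf-8"?>\n'
        + ET.tostring(smil, encoding="unicode")
    )


manifest = build_smil(
    ["my_movie_1080.mp4", "my_movie_720.mp4", "my_movie_480.mp4"]
)
print(manifest)
```

The manifest would be stored next to the MP4 files in S3, and its location passed to MediaPackage when the asset is created.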

Consumption

The video metadata and sources can be provided to the client via an API (not implemented for this article).
The client accesses the video content directly from the HLS or Dash URL served via CloudFront.

Result in action

To showcase the solution, we used a simple source file shot on a smartphone: 11 seconds of FHD (1920x1080) video, 25MB in size.

All we needed to do to produce ready-to-consume content was to upload this file to S3. Our Serverless solution took care of all the underlying steps.

MP4

360p

Only YouTube videos can be embedded in this blog. YouTube re-encodes the content. Watch the original here.

Stills

https://d2bv705w0inzgj.cloudfront.net/bae61c4a-5ae1-43f2-bbf3-b22bd6fb20a2/frames/estoril_classics_2022.0000003.jpg

https://d2bv705w0inzgj.cloudfront.net/bae61c4a-5ae1-43f2-bbf3-b22bd6fb20a2/frames/estoril_classics_2022.0000005.jpg

https://d2bv705w0inzgj.cloudfront.net/bae61c4a-5ae1-43f2-bbf3-b22bd6fb20a2/frames/estoril_classics_2022.0000006.jpg

Adaptive bitrates

HLS

Access the resulting content on the hls-js demo page.

Dash

Access the resulting content on the dashif demo page.

What did we achieve?

We created a fully Serverless pipeline that transforms a source video provided in any format into a group of formats ready to be consumed by any smartphone or browser anywhere in the world. Even viewers with limited bandwidth can enjoy our content without buffering; they will, however, have to settle for lower quality.

By leveraging Serverless solutions, we get all the known benefits of serverless:

Cost is kept to a minimum by paying only for what we use:
Video Conversion: Pay-per-movie conversion
Video Packaging: Pay per movie consumption
CDN: Pay per movie consumption
S3 Storage: Pay for stored content
DynamoDB: Pay for stored data
No servers to provision or maintain
The services scale with usage automatically

Extending the solution

With the solution we built, we barely scratched the surface of what can be done. There are several add-ons that can be built on top of this solution:

Provide DRM to protect your content
Use multiple manifests to allow high definition to selected viewers
Monetize your content with in-stream ads by leveraging AWS Elemental MediaTailor
Create live feeds from your VOD assets
And some more…

Daniel Muller

Senior Serverless Developer at ServerlessGuru

Daniel has developed multiple large-scale serverless applications for OTT, BigData, and MarTech and is currently a Sr Serverless Dev.
