SlideShare a Scribd company logo
CONTEXT-AWARE ENCODING AND 5G
YURIY REZNIK | YREZNIK@BRIGHTCOVE.COM
© B r i g h t c o v e I n c . A l l R i g h t s R e s e r v e d .
DASH-IF WORKSHOP ON MEDIA STREAMING MEETS 5G, DECEMBER 9-10, 2019, PORTLAND, OR
© Brightcove Inc. All Rights Reserved. 2
OUTLINE
• Brief history of context-aware encoding & variants
• How it works
• Types of CAE technologies today
• CAE and standards
◦ CAE technologies that are fully compatible with existing standards & players
◦ CAE technologies that may need extensions
• Discussion
◦ CAE and 5G
• is there any overlap?
• what type of information from 5G network layer could be useful for CAE
◦ Shall there be extension of a standard?
• E.g. for per-scene ladder signaling?
© Brightcove Inc. All Rights Reserved. 3
BRIEF HISTORY OF CAE
• Early 1990s: H.261, H.263, MPEG-1/2
◦ “fixed QP” regime – the grandfather of everything “Constant Quality”
• Late 1990s-early 2000s: RealVideo 8-10
◦ “RMVB” – heavy VBR encoding regime, optimized for downloads, still in use in Asia
• Early 2010s:
◦ British Telecom “Quality-driven streaming” (Mike Nilsson, et al, June 2012)
◦ InterDigital “Quality-based streaming” proposal to MPEG-DASH (Y. Reznik et al, m25996, July 2012)
◦ Intel labs “Quality-aware streaming” (Yiting Liao, et al, 2013)
◦ “Capped-CRF” – approaches – multiple sources, 2013+
◦ Beamr “Optimizer” – second pass encoding with adjusted targets, 2013+
◦ MediaMelon “QBR streaming”, 2014
• Late 2010s:
◦ Netflix “Per-Title Encoding” blog post, Dec. 2015 – ladder of resolutions and rates according to content
◦ Brightcove “Context-Aware Encoding”, Oct. 2016 – ladder design as end-to-end optimization problem
◦ Netflix “per-scene encoding”, 2018 – same as per-title, but on scene basis
◦ Content- and context-aware solutions from Harmonic, Elemental, Ateme, Bitmovin, EpicLabs, Mux, etc.
© Brightcove Inc. All Rights Reserved. 4
STATIC ABR ENCODING PROFILES
• Define sets of encoding parameters for each rendition:
◦ Resolutions, Bitrates, Codec constraints, etc.
◦ Same for all content, networks, user devices & usage patterns, etc.
• Some examples of ABR profiles used in practice:
RealVideo (1998): Apple HLS guidelines (2018): Brightcove VideoCloud (2013-2016):
video
bitrate
decoder
bitrate
cap
decoder
buffer
size
max
frame
rate
width height
h264
profile
450 771 1028 30 480 270 baseline
700 1194 1592 30 640 360 baseline
900 1494 1992 30 640 360 main
1200 1944 2592 30 960 540 main
1700 2742 3656 30 960 540 main
2500 3942 5256 30 1280 720 main
3500 5442 7256 30 1920 1080 high
3800 6192 8256 30 1920 1080 high
© Brightcove Inc. All Rights Reserved. 5
WHY STATIC ABR PROFILES ARE BAD?
• Static encoding profiles are not accounting for:
◦ differences in video complexity: differences in networks: differences in devices & user preferences:
• A better approach is to design encoding profiles dynamically, accounting for characteristics of
◦ content → content-aware encoding (aka per-title encoding)
◦ network → network-aware encoding
◦ full context (content + network + user statistics) → context-aware encoding
Source: Netflix, 2015 Source: Brightcove VideoCloud analytics, 2019 Source: Brightcove VideoCloud analytics, 2019
© Brightcove Inc. All Rights Reserved. 6
CAE APPROACHES: “CONTENT-AWARE VBR ENCODING”
• Basically, most encoders can be configure to operate either in
◦ “CBR” mode => reduces variation of bitrates: “VBR” mode => reduces variation of quality:
• CBR is required for cable & broadcast (c.f. SCTE 128)
• VBR (with some reasonable constraints) is working reasonably well for OTT
◦ Apple HLS constraints (2018):
• Live: max bitrate < 110% of target
• VOD: max bitrate < 200% of target (in practice it is better to limit it to about 150%)
• Reasons for constraints: minimize client’s mis-predictions, likelihood of buffering, issues with analytics, etc.
© Brightcove Inc. All Rights Reserved. 7
CAE APPROACHES: “PER-TITLE” AND “PER-SCENE ENCODING”
• Primary idea: design ABR encoding profiles individually for each video sequence (or scene within a sequence)
• Secondary idea: place ladder points such that they belong to the convex hull
• Notes:
◦ Netflix “convex hull” argument provides a method for finding best resolutions for any given target bitrate, but
• it does not, say how such bitrates should be placed, or how many of them are needed!
• it constrains the problem, but it does not show how to solve it exactly!
© Brightcove Inc. All Rights Reserved. 8
CAE APPROACHES: “CONTEXT-AWARE ENCODING”
• Example deployment architecture:
• Context Aware Encoding (CAE) is basically an
◦ ABR encoding profile generator that considers:
• properties of content and
• properties of networks and devices used to receive content
Operator
Transcoder
Players
Dynamic transmux system
Playbac
k API
Device
detection
Media Files
+
Metadata
Content
URLs
Rules
Engine
Rules
API
API, orchestrator
Delivery
preferences
Media
CDNs
Manifest
CDNs
Analytics engine
Transcoders
CAE profile
generators
Media
files
Job
requests
JIT packagers +
SSAI
Manifest
generators
Collect and process network & usage statistic for
all actively used devices
© Brightcove Inc. All Rights Reserved. 9
CONTEXT-AWARE ENCODING: THE PRINCIPLE
• Quality-rate function 𝑄(𝑅): Quality delivered by streaming client: Probabilities of loading of each stream:
• Average quality for a given ladder of rates 𝑅1, …, 𝑅𝑛, quality-rate function 𝑄(𝑅), and network density 𝑝(𝑅):
𝑄 𝑅1, …, 𝑅𝑛, 𝑝 = 𝑄 𝑅1 න
𝑅1
𝑅2
𝑝(𝑅)𝑑𝑅 + 𝑄 𝑅2 න
𝑅2
𝑅3
𝑝 𝑅 𝑑𝑅 + … + 𝑄 𝑅𝑛 න
𝑅𝑛
𝑅max
𝑝(𝑅)𝑑𝑅
• A quality-optimal ladder is a set of rates ෠
𝑅1, …, ෠
𝑅𝑛, such that:
𝑄 ෠
𝑅1, …, ෠
𝑅𝑛, 𝑝 = max
𝑅min<𝑅1≤⋯≤𝑅𝑛 <𝑅max
𝑅1≤𝑅1,max
𝑄 𝑅1, …,𝑅𝑛, 𝑝
© Brightcove Inc. All Rights Reserved. 10
CONTEXT-AWARE ENCODING: EXAMPLE INPUTS
• Content: Networks:
• Quality-rate models: Network models:
Based on data from:
J. Karlsson, and M. Riback.
Initial field performance
measurements of LTE,
Ericsson review, 3, 2008.
𝑄 𝑅 =
𝑅𝛽
𝛼𝛽 + 𝑅𝛽
𝑝 𝑅 = 𝛼 𝒩
𝜇1,𝜎1
𝑅 + 1 − 𝛼 𝒩
𝜇2,𝜎2
𝑅
Content α β
Easy 0.0555 0.8550
Medium 0.0724 0.8016
Complex 0.1015 0.7364
Network α μ1 σ1 μ2 σ2
Network 1 0.584 0.996 0.564 2.554 1.165
Network 2 0.584 1.992 1.129 5.108 2.331
Resolution=720p25
Codec=H.264/Main
Quality metric=SSIM
3 video sequences:
“Easy”, “Medium”,
“Complex”
© Brightcove Inc. All Rights Reserved. 11
CONTEXT-AWARE ENCODING: EXAMPLE RESULTS
Optimal ladders for Network 1: Optimal ladders for Network 2:
where:
◦ 𝑄𝑛 = quality at top rendition [SSIM]
◦ ത
𝑄 = average quality [SSIM]
◦ 𝜉 = gap to average quality achievable with infinite number of renditions [%]
• Key observation:
◦ optimal profiles designed for different sources and networks are different!
Content N Ladder bitrates [kbps] 𝑸𝒏
ഥ
𝑸 𝝃 [%]
Easy
2 138, 803 0.909 0.867 6.58
3 100, 512, 1209 0.931 0.888 4.35
4 100, 411, 866, 1645 0.946 0.897 3.34
5 100, 349, 694, 1155, 2087 0.955 0.902 2.76
Medium
2 175, 854 0.881 0.830 7.98
3 100, 518, 1219 0.906 0.854 5.31
4 100, 416, 876, 1663 0.924 0.866 4.00
5 100, 354, 701, 1165, 2104 0.936 0.873 3.25
Complex
2 234, 931 0.825 0.769 10.2
3 145, 590, 1304 0.867 0.797 6.96
4 102, 431, 898, 1704 0.888 0.812 5.22
5 100, 363, 716, 1183, 2134 0.904 0.821 4.16
Content N Ladder bitrates [kbps] 𝑸𝒏
ഥ
𝑸 𝝃 [%]
Easy
2 232, 1457 0.940 0.906 5.14
3 116, 811, 2124 0.955 0.924 3.27
4 100, 589, 1421, 2803 0.964 0.932 2.40
5 100, 486, 1107, 1974, 3577 0.971 0.937 1.92
Medium
2 293, 1549 0.920 0.878 6.23
3 158, 893, 2216 0.939 0.899 4.04
4 100, 601, 1438, 2828 0.949 0.909 2.97
5 100, 495, 1123, 1995, 3615 0.958 0.915 2.35
Complex
2 391, 1685 0.887 0.833 7.98
3 232, 1018, 2358 0.910 0.857 5.29
4 156, 712, 1569, 3001 0.924 0.869 3.94
5 114, 537, 1179, 2060, 3727 0.935 0.877 3.11
© Brightcove Inc. All Rights Reserved. 12
CONTEXT-AWARE ENCODING: HOW MANY STREAMS ARE NEEDED?
• There are two natural limits:
(1) Set limit for quality at top rendition: (2) Set limit for quality gap:
This shows that “easy” content can be This provides effective bound on the number of renditions
encoded with much fewer renditions! for “complex” content as well.
• This way, the problem of design of optimal profiles for single codec case is now fully defined:
◦ we know how to choose rates & number of streams
◦ best choices of resolutions follow by applying resolution-specific quality-rate functions
© Brightcove Inc. All Rights Reserved. 13
CAE APPROACHES: “QUALITY-BASED STREAMING”
• Architecture:
• Pros:
◦ Allows per-segment adaptation
◦ Allows clients to use advanced user- and context-aware adaptation strategies
• Cons:
◦ Requires modifications of the standard
◦ Quality-adaptive work in MPEG-DASH has not provided exact mechanism for enabling it
Encoder
Stream 1
Stream 2
Stream N
Quality data
Manifest
Origin CDN Client
Implements
decision logic
based on quality
metadata
...
© Brightcove Inc. All Rights Reserved. 14
CAE TYPES: SUMMARY
CAE type Example solutions What it affects Impact on standards
VBR encoding x264/x265 “capped CRF”,
Beamr CABR, Harmonic EyeQ,
Elemental QVBR
Encoders
Players need to be tested to
operate reliably under VBR
streams
Clarifications on the extent of VBR
variation allowed may be useful.
Per-title
encoding
Netflix per-title, Ateme CAE,
Bitmovin’ per-title, Cambria, etc.
Encoders only
Streams can be CBR or VBR
None
Context-aware
encoding
Brightcove CAE,
EpicLabs LightFlow,
Mux “audience-aware encoding”
Encoders only
Streams can be CBR or VBR
None
Per-scene
encoding
Netflix per-scene encoding Encoders, players Needs seamless multi-period
option (ability to switch to new
manifest on a per-scene basis)
Quality-based
streaming
MediaMelon QBR,
Bitmovin’ per-scene adaptation
Encoders, players Needs exact means of signaling of
quality annotations and definition
of anticipated client behavior (both
in cases of quality-aware and
legacy clients).
© Brightcove Inc. All Rights Reserved. 15
DISCUSSION TOPICS: CAE AND 5G
Main Questions:
• CAE / ABR ladder design and 5G:
◦ is there an overlap?
◦ if networks are improving, do we still need ABR?
• 5G network characteristics and their impact on streaming:
◦ are there any significant differences in shape of network throughput CDF in 5G vs older networks?
◦ is there any way clients can be advised by the core network about current load and hence shape of network
bandwidth PDFs and other relevant statistics?
© Brightcove Inc. All Rights Reserved. 16
DISCUSSION TOPICS: CAE AND STANDARDS
Two CAE architectures likely need standards support:
• Per-scene encoding:
◦ this requires clients to be able to adapt to a new encoding manifest provided on a per-scene basis
◦ what is needed is basically a “seamless multi-period” option in DASH
• could be constrained to: same codec, same number of streams, but bitrates will definitely be different
• Quality-driven streaming
◦ this needs exact means of signaling of quality annotations
• MPEG-B “carriage of timed metadata” spec is a good start, but its use for the purpose is not defined
anywhere
◦ what also needs to be understood and enabled is backwards-compatible regime of operation
• If new clients know nothing about quality metadata, they must be able to deliver same content as
reliably as new players, but perhaps less efficiently.
THANK YOU!
YURIY REZNIK | YREZNIK@BRIGHTCOVE.COM
© B r i g h t c o v e I n c . A l l R i g h t s R e s e r v e d .
17

More Related Content

PDF
Seattle Video Tech Meetup August 2019: Optimal Multi-codec Streaming
PDF
HTTP Adaptive Streaming – Where Is It Heading?
PDF
LwTE: Light-weight Transcoding at the Edge
PDF
HTTP Adaptive Streaming – Quo Vadis?
PDF
Online Bitrate ladder prediction for Adaptive VVC Streaming
PDF
PDF
HTTP Adaptive Streaming – Quo Vadis? (2023)
PDF
Decoding Complexity-Rate-Quality Pareto-Front for Adaptive VVC Streaming
Seattle Video Tech Meetup August 2019: Optimal Multi-codec Streaming
HTTP Adaptive Streaming – Where Is It Heading?
LwTE: Light-weight Transcoding at the Edge
HTTP Adaptive Streaming – Quo Vadis?
Online Bitrate ladder prediction for Adaptive VVC Streaming
HTTP Adaptive Streaming – Quo Vadis? (2023)
Decoding Complexity-Rate-Quality Pareto-Front for Adaptive VVC Streaming

Similar to 03-Reznik-DASH-IF-workshop-2019-CAE.pdf (20)

PPTX
Mission Impossible: Boosting Video Streaming Quality While Reducing Bandwidth
PDF
HTTP Adaptive Streaming – Quo Vadis (2024)
PDF
On ABR Streaming and CDN Performance
PDF
Doctoral Symposium presentation.pdf
PDF
Content_adaptive_video_coding_for_HTTP_Adaptive_Streaming.pdf
PDF
Content-adaptive Video Coding for HTTP Adaptive Streaming
PPT
Barcelona keynote web
PDF
ICME 2016 - High Efficiency Video Coding - Coding Tools and Specification: HE...
PDF
Video streaming using light-weight transcoding and in-network intelligence
PDF
LiveVBR presentation at VQEG NORM.pdf
PDF
HEVC overview main
PPTX
Video Standards - Key Insights
PDF
International Journal of Computational Engineering Research(IJCER)
PDF
Adaptive Delivery of Live Video Stream: Infrastructure cost vs. QoE
PDF
Serverless Media Workflow
PDF
Video Coding for Large-Scale HTTP Adaptive Streaming Deployments: State of th...
PDF
Energy-efficient Adaptive Video Streaming with Latency-Aware Dynamic Resoluti...
PPTX
21-17-0034-00-0000-presentation-of-network-requirement-according-to-compressi...
PDF
ADVANCES IN CHANNEL-ADAPTIVE VIDEO STREAMING
PPT
Slide
Mission Impossible: Boosting Video Streaming Quality While Reducing Bandwidth
HTTP Adaptive Streaming – Quo Vadis (2024)
On ABR Streaming and CDN Performance
Doctoral Symposium presentation.pdf
Content_adaptive_video_coding_for_HTTP_Adaptive_Streaming.pdf
Content-adaptive Video Coding for HTTP Adaptive Streaming
Barcelona keynote web
ICME 2016 - High Efficiency Video Coding - Coding Tools and Specification: HE...
Video streaming using light-weight transcoding and in-network intelligence
LiveVBR presentation at VQEG NORM.pdf
HEVC overview main
Video Standards - Key Insights
International Journal of Computational Engineering Research(IJCER)
Adaptive Delivery of Live Video Stream: Infrastructure cost vs. QoE
Serverless Media Workflow
Video Coding for Large-Scale HTTP Adaptive Streaming Deployments: State of th...
Energy-efficient Adaptive Video Streaming with Latency-Aware Dynamic Resoluti...
21-17-0034-00-0000-presentation-of-network-requirement-according-to-compressi...
ADVANCES IN CHANNEL-ADAPTIVE VIDEO STREAMING
Slide
Ad

More from JunZhao68 (20)

PDF
语法专题3-状语从句.pdf 英语语法基础部分,涉及到状语从句部分的内容来米爱上
PDF
愛小孩的歐拉一 兼論 108 數學課綱.pdf for 欧拉&数论相关课程描述啊
PDF
svd15_86.pdf for SVD study and revosited
PDF
Quadra-T1-T2-T4_TechSpec.pdf for netint VPA
PDF
Python Advanced Course - part III.pdf for Python
PDF
Python Advanced Course - part I.pdf for Python
PDF
3 - Intro to SVE.pdf for intro ARM SVE part
PDF
pytorch-cheatsheet.pdf for ML study with pythroch
PDF
Vocabulary Cards for AI and KIDs MIT.pdf
PDF
how CNN works for tech Every parts introductions.pdf
PDF
eics22-slides for researchers need when implementing novel imteraction tech
PDF
Netflix-talk for live video streaming tech
PPTX
Linear system 1_linear in linear algebra.pptx
PDF
GDC2012 JMV Rotations with jim van verth
PDF
1-MIV-tutorial-part-1.pdf
PDF
GOP-Size_report_11_16.pdf
PDF
02-VariableLengthCodes_pres.pdf
PDF
MHV-Presentation-Forman (1).pdf
PDF
CODA_presentation.pdf
PDF
http3-quic-streaming-2020-200121234036.pdf
语法专题3-状语从句.pdf 英语语法基础部分,涉及到状语从句部分的内容来米爱上
愛小孩的歐拉一 兼論 108 數學課綱.pdf for 欧拉&数论相关课程描述啊
svd15_86.pdf for SVD study and revosited
Quadra-T1-T2-T4_TechSpec.pdf for netint VPA
Python Advanced Course - part III.pdf for Python
Python Advanced Course - part I.pdf for Python
3 - Intro to SVE.pdf for intro ARM SVE part
pytorch-cheatsheet.pdf for ML study with pythroch
Vocabulary Cards for AI and KIDs MIT.pdf
how CNN works for tech Every parts introductions.pdf
eics22-slides for researchers need when implementing novel imteraction tech
Netflix-talk for live video streaming tech
Linear system 1_linear in linear algebra.pptx
GDC2012 JMV Rotations with jim van verth
1-MIV-tutorial-part-1.pdf
GOP-Size_report_11_16.pdf
02-VariableLengthCodes_pres.pdf
MHV-Presentation-Forman (1).pdf
CODA_presentation.pdf
http3-quic-streaming-2020-200121234036.pdf
Ad

Recently uploaded (20)

PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPTX
SOPHOS-XG Firewall Administrator PPT.pptx
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PDF
Network Security Unit 5.pdf for BCA BBA.
PPTX
MYSQL Presentation for SQL database connectivity
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Machine learning based COVID-19 study performance prediction
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Approach and Philosophy of On baking technology
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Encapsulation theory and applications.pdf
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Spectral efficient network and resource selection model in 5G networks
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
MIND Revenue Release Quarter 2 2025 Press Release
NewMind AI Weekly Chronicles - August'25-Week II
“AI and Expert System Decision Support & Business Intelligence Systems”
SOPHOS-XG Firewall Administrator PPT.pptx
Group 1 Presentation -Planning and Decision Making .pptx
Network Security Unit 5.pdf for BCA BBA.
MYSQL Presentation for SQL database connectivity
Digital-Transformation-Roadmap-for-Companies.pptx
Machine learning based COVID-19 study performance prediction
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
Building Integrated photovoltaic BIPV_UPV.pdf
Approach and Philosophy of On baking technology
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Encapsulation_ Review paper, used for researhc scholars
Encapsulation theory and applications.pdf
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Dropbox Q2 2025 Financial Results & Investor Presentation

03-Reznik-DASH-IF-workshop-2019-CAE.pdf

  • 1. CONTEXT-AWARE ENCODING AND 5G YURIY REZNIK | YREZNIK@BRIGHTCOVE.COM © B r i g h t c o v e I n c . A l l R i g h t s R e s e r v e d . DASH-IF WORKSHOP ON MEDIA STREAMING MEETS 5G, DECEMBER 9-10, 2019, PORTLAND, OR
  • 2. © Brightcove Inc. All Rights Reserved. 2 OUTLINE • Brief history of context-aware encoding & variants • How it works • Types of CAE technologies today • CAE and standards ◦ CAE technologies that are fully compatible with existing standards & players ◦ CAE technologies that may need extensions • Discussion ◦ CAE and 5G • is there any overlap? • what type of information from 5G network layer could be useful for CAE ◦ Shall there be extension of a standard? • E.g. for per-scene ladder signaling?
  • 3. © Brightcove Inc. All Rights Reserved. 3 BRIEF HISTORY OF CAE • Early 1990s: H.261, H.263, MPEG-1/2 ◦ “fixed QP” regime – the grandfather of everything “Constant Quality” • Late 1990s-early 2000s: RealVideo 8-10 ◦ “RMVB” – heavy VBR encoding regime, optimized for downloads, still in use in Asia • Early 2010s: ◦ British Telecom “Quality-driven streaming” (Mike Nilsson, et al, June 2012) ◦ InterDigital “Quality-based streaming” proposal to MPEG-DASH (Y. Reznik et al, m25996, July 2012) ◦ Intel labs “Quality-aware streaming” (Yiting Liao, et al, 2013) ◦ “Capped-CRF” – approaches – multiple sources, 2013+ ◦ Beamr “Optimizer” – second pass encoding with adjusted targets, 2013+ ◦ MediaMelon “QBR streaming”, 2014 • Late 2010s: ◦ Netflix “Per-Title Encoding” blog post, Dec. 2015 – ladder of resolutions and rates according to content ◦ Brightcove “Context-Aware Encoding”, Oct. 2016 – ladder design as end-to-end optimization problem ◦ Netflix “per-scene encoding”, 2018 – same as per-title, but on scene basis ◦ Content- and context-aware solutions from Harmonic, Elemental, Ateme, Bitmovin, EpicLabs, Mux, etc.
  • 4. © Brightcove Inc. All Rights Reserved. 4 STATIC ABR ENCODING PROFILES • Define sets of encoding parameters for each rendition: ◦ Resolutions, Bitrates, Codec constraints, etc. ◦ Same for all content, networks, user devices & usage patterns, etc. • Some examples of ABR profiles used in practice: RealVideo (1998): Apple HLS guidelines (2018): Brightcove VideoCloud (2013-2016): video bitrate decoder bitrate cap decoder buffer size max frame rate width height h264 profile 450 771 1028 30 480 270 baseline 700 1194 1592 30 640 360 baseline 900 1494 1992 30 640 360 main 1200 1944 2592 30 960 540 main 1700 2742 3656 30 960 540 main 2500 3942 5256 30 1280 720 main 3500 5442 7256 30 1920 1080 high 3800 6192 8256 30 1920 1080 high
  • 5. © Brightcove Inc. All Rights Reserved. 5 WHY STATIC ABR PROFILES ARE BAD? • Static encoding profiles are not accounting for: ◦ differences in video complexity: differences in networks: differences in devices & user preferences: • A better approach is to design encoding profiles dynamically, accounting for characteristics of ◦ content → content-aware encoding (aka per-title encoding) ◦ network → network-aware encoding ◦ full context (content + network + user statistics) → context-aware encoding Source: Netflix, 2015 Source: Brightcove VideoCloud analytics, 2019 Source: Brightcove VideoCloud analytics, 2019
  • 6. © Brightcove Inc. All Rights Reserved. 6 CAE APPROACHES: “CONTENT-AWARE VBR ENCODING” • Basically, most encoders can be configure to operate either in ◦ “CBR” mode => reduces variation of bitrates: “VBR” mode => reduces variation of quality: • CBR is required for cable & broadcast (c.f. SCTE 128) • VBR (with some reasonable constraints) is working reasonably well for OTT ◦ Apple HLS constraints (2018): • Live: max bitrate < 110% of target • VOD: max bitrate < 200% of target (in practice it is better to limit it to about 150%) • Reasons for constraints: minimize client’s mis-predictions, likelihood of buffering, issues with analytics, etc.
  • 7. © Brightcove Inc. All Rights Reserved. 7 CAE APPROACHES: “PER-TITLE” AND “PER-SCENE ENCODING” • Primary idea: design ABR encoding profiles individually for each video sequence (or scene within a sequence) • Secondary idea: place ladder points such that they belong to the convex hull • Notes: ◦ Netflix “convex hull” argument provides a method for finding best resolutions for any given target bitrate, but • it does not, say how such bitrates should be placed, or how many of them are needed! • it constrains the problem, but it does not show how to solve it exactly!
  • 8. © Brightcove Inc. All Rights Reserved. 8 CAE APPROACHES: “CONTEXT-AWARE ENCODING” • Example deployment architecture: • Context Aware Encoding (CAE) is basically an ◦ ABR encoding profile generator that considers: • properties of content and • properties of networks and devices used to receive content Operator Transcoder Players Dynamic transmux system Playbac k API Device detection Media Files + Metadata Content URLs Rules Engine Rules API API, orchestrator Delivery preferences Media CDNs Manifest CDNs Analytics engine Transcoders CAE profile generators Media files Job requests JIT packagers + SSAI Manifest generators Collect and process network & usage statistic for all actively used devices
  • 9. © Brightcove Inc. All Rights Reserved. 9 CONTEXT-AWARE ENCODING: THE PRINCIPLE • Quality-rate function 𝑄(𝑅): Quality delivered by streaming client: Probabilities of loading of each stream: • Average quality for a given ladder of rates 𝑅1, …, 𝑅𝑛, quality-rate function 𝑄(𝑅), and network density 𝑝(𝑅): 𝑄 𝑅1, …, 𝑅𝑛, 𝑝 = 𝑄 𝑅1 න 𝑅1 𝑅2 𝑝(𝑅)𝑑𝑅 + 𝑄 𝑅2 න 𝑅2 𝑅3 𝑝 𝑅 𝑑𝑅 + … + 𝑄 𝑅𝑛 න 𝑅𝑛 𝑅max 𝑝(𝑅)𝑑𝑅 • A quality-optimal ladder is a set of rates ෠ 𝑅1, …, ෠ 𝑅𝑛, such that: 𝑄 ෠ 𝑅1, …, ෠ 𝑅𝑛, 𝑝 = max 𝑅min<𝑅1≤⋯≤𝑅𝑛 <𝑅max 𝑅1≤𝑅1,max 𝑄 𝑅1, …,𝑅𝑛, 𝑝
  • 10. © Brightcove Inc. All Rights Reserved. 10 CONTEXT-AWARE ENCODING: EXAMPLE INPUTS • Content: Networks: • Quality-rate models: Network models: Based on data from: J. Karlsson, and M. Riback. Initial field performance measurements of LTE, Ericsson review, 3, 2008. 𝑄 𝑅 = 𝑅𝛽 𝛼𝛽 + 𝑅𝛽 𝑝 𝑅 = 𝛼 𝒩 𝜇1,𝜎1 𝑅 + 1 − 𝛼 𝒩 𝜇2,𝜎2 𝑅 Content α β Easy 0.0555 0.8550 Medium 0.0724 0.8016 Complex 0.1015 0.7364 Network α μ1 σ1 μ2 σ2 Network 1 0.584 0.996 0.564 2.554 1.165 Network 2 0.584 1.992 1.129 5.108 2.331 Resolution=720p25 Codec=H.264/Main Quality metric=SSIM 3 video sequences: “Easy”, “Medium”, “Complex”
  • 11. © Brightcove Inc. All Rights Reserved. 11 CONTEXT-AWARE ENCODING: EXAMPLE RESULTS Optimal ladders for Network 1: Optimal ladders for Network 2: where: ◦ 𝑄𝑛 = quality at top rendition [SSIM] ◦ ത 𝑄 = average quality [SSIM] ◦ 𝜉 = gap to average quality achievable with infinite number of renditions [%] • Key observation: ◦ optimal profiles designed for different sources and networks are different! Content N Ladder bitrates [kbps] 𝑸𝒏 ഥ 𝑸 𝝃 [%] Easy 2 138, 803 0.909 0.867 6.58 3 100, 512, 1209 0.931 0.888 4.35 4 100, 411, 866, 1645 0.946 0.897 3.34 5 100, 349, 694, 1155, 2087 0.955 0.902 2.76 Medium 2 175, 854 0.881 0.830 7.98 3 100, 518, 1219 0.906 0.854 5.31 4 100, 416, 876, 1663 0.924 0.866 4.00 5 100, 354, 701, 1165, 2104 0.936 0.873 3.25 Complex 2 234, 931 0.825 0.769 10.2 3 145, 590, 1304 0.867 0.797 6.96 4 102, 431, 898, 1704 0.888 0.812 5.22 5 100, 363, 716, 1183, 2134 0.904 0.821 4.16 Content N Ladder bitrates [kbps] 𝑸𝒏 ഥ 𝑸 𝝃 [%] Easy 2 232, 1457 0.940 0.906 5.14 3 116, 811, 2124 0.955 0.924 3.27 4 100, 589, 1421, 2803 0.964 0.932 2.40 5 100, 486, 1107, 1974, 3577 0.971 0.937 1.92 Medium 2 293, 1549 0.920 0.878 6.23 3 158, 893, 2216 0.939 0.899 4.04 4 100, 601, 1438, 2828 0.949 0.909 2.97 5 100, 495, 1123, 1995, 3615 0.958 0.915 2.35 Complex 2 391, 1685 0.887 0.833 7.98 3 232, 1018, 2358 0.910 0.857 5.29 4 156, 712, 1569, 3001 0.924 0.869 3.94 5 114, 537, 1179, 2060, 3727 0.935 0.877 3.11
  • 12. © Brightcove Inc. All Rights Reserved. 12 CONTEXT-AWARE ENCODING: HOW MANY STREAMS ARE NEEDED? • There are two natural limits: (1) Set limit for quality at top rendition: (2) Set limit for quality gap: This shows that “easy” content can be This provides effective bound on the number of renditions encoded with much fewer renditions! for “complex” content as well. • This way, the problem of design of optimal profiles for single codec case is now fully defined: ◦ we know how to choose rates & number of streams ◦ best choices of resolutions follow by applying resolution-specific quality-rate functions
  • 13. © Brightcove Inc. All Rights Reserved. 13 CAE APPROACHES: “QUALITY-BASED STREAMING” • Architecture: • Pros: ◦ Allows per-segment adaptation ◦ Allows clients to use advanced user- and context-aware adaptation strategies • Cons: ◦ Requires modifications of the standard ◦ Quality-adaptive work in MPEG-DASH has not provided exact mechanism for enabling it Encoder Stream 1 Stream 2 Stream N Quality data Manifest Origin CDN Client Implements decision logic based on quality metadata ...
  • 14. © Brightcove Inc. All Rights Reserved. 14 CAE TYPES: SUMMARY CAE type Example solutions What it affects Impact on standards VBR encoding x264/x265 “capped CRF”, Beamr CABR, Harmonic EyeQ, Elemental QVBR Encoders Players need to be tested to operate reliably under VBR streams Clarifications on the extent of VBR variation allowed may be useful. Per-title encoding Netflix per-title, Ateme CAE, Bitmovin’ per-title, Cambria, etc. Encoders only Streams can be CBR or VBR None Context-aware encoding Brightcove CAE, EpicLabs LightFlow, Mux “audience-aware encoding” Encoders only Streams can be CBR or VBR None Per-scene encoding Netflix per-scene encoding Encoders, players Needs seamless multi-period option (ability to switch to new manifest on a per-scene basis) Quality-based streaming MediaMelon QBR, Bitmovin’ per-scene adaptation Encoders, players Needs exact means of signaling of quality annotations and definition of anticipated client behavior (both in cases of quality-aware and legacy clients).
  • 15. © Brightcove Inc. All Rights Reserved. 15 DISCUSSION TOPICS: CAE AND 5G Main Questions: • CAE / ABR ladder design and 5G: ◦ is there an overlap? ◦ if networks are improving, do we still need ABR? • 5G network characteristics and their impact on streaming: ◦ are there any significant differences in shape of network throughput CDF in 5G vs older networks? ◦ is there any way clients can be advised by the core network about current load and hence shape of network bandwidth PDFs and other relevant statistics?
  • 16. © Brightcove Inc. All Rights Reserved. 16 DISCUSSION TOPICS: CAE AND STANDARDS Two CAE architectures likely need standards support: • Per-scene encoding: ◦ this requires clients to be able to adapt to a new encoding manifest provided on a per-scene basis ◦ what is needed is basically a “seamless multi-period” option in DASH • could be constrained to: same codec, same number of streams, but bitrates will definitely be different • Quality-driven streaming ◦ this needs exact means of signaling of quality annotations • MPEG-B “carriage of timed metadata” spec is a good start, but its use for the purpose is not defined anywhere ◦ what also needs to be understood and enabled is backwards-compatible regime of operation • If new clients know nothing about quality metadata, they must be able to deliver same content as reliably as new players, but perhaps less efficiently.
  • 17. THANK YOU! YURIY REZNIK | YREZNIK@BRIGHTCOVE.COM © B r i g h t c o v e I n c . A l l R i g h t s R e s e r v e d . 17