Joint Backward and Forward Temporal Masking for Perceptually Optimized x265-Based Video Coding

Dan Grois, Alex Giladi, Praveen Kumar Karadugattu, Niranjankumar Balasubramanian

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

There is a strong and ever-growing demand for higher-resolution video content, such as UltraHD, which requires significantly higher bitrates. Providing such content at scale is a challenge due to limitations of the available last-mile bandwidth and content delivery network (CDN) storage and egress capacity. Lower bitrates are often considered an answer. This way, the high-resolution video content is often compressed with visually perceptible coding artifacts, thereby leading to an inferior user experience. Improved compression efficiency is thus the obvious solution for improving the user experience. However, in order to realize the gain from such efficiency improvements in a large-scale deployment, such improvements need to be applicable to an already deployed ecosystem such as set-top boxes or mobile devices, and SmartTVs, and to have a reasonably low computational complexity. This work proposes a low-complexity joint backward and forward temporal masking for reducing bitrate without perceptibly affecting visual quality. This is achieved by introducing a novel low-complexity scenecut-aware adaptive frame-level quantization framework, which considers temporal distances between frames and the closest scenecuts. The proposed framework has been implemented in the popular x265 open-source HEVC encoder. With that said, the framework is codec-independent and can be applied to other encoders and video coding standards. Different backward and forward masking time periods and quantizer behaviors are investigated to determine exact time periods for which temporal masking does not substantially impact video quality, as perceived by the human visual system (HVS). Extensive subjective quality assessments have been carried out for evaluating the benefits and advantages of the proposed scenecut-aware adaptive quantization framework. The subjective results showed significant bitrate savings of up to about 26%, while maintaining substantially the same perceived visual quality.

Original languageEnglish
Title of host publicationApplications of Digital Image Processing XLV
EditorsAndrew G. Tescher, Touradj Ebrahimi
PublisherSPIE
ISBN (Electronic)9781510654365
DOIs
StatePublished - 1 Jan 2022
Externally publishedYes
EventApplications of Digital Image Processing XLV 2022 - San Diego, United States
Duration: 22 Aug 202224 Aug 2022

Publication series

NameProceedings of SPIE - The International Society for Optical Engineering
Volume12226
ISSN (Print)0277-786X
ISSN (Electronic)1996-756X

Conference

ConferenceApplications of Digital Image Processing XLV 2022
Country/TerritoryUnited States
CitySan Diego
Period22/08/2224/08/22

Keywords

  • H.265
  • HEVC
  • perceptual video quality
  • quality metrics
  • subjective assessment
  • video coding
  • x265

ASJC Scopus subject areas

  • Electronic, Optical and Magnetic Materials
  • Condensed Matter Physics
  • Computer Science Applications
  • Applied Mathematics
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Joint Backward and Forward Temporal Masking for Perceptually Optimized x265-Based Video Coding'. Together they form a unique fingerprint.

Cite this