Investigating the Value of Subtitles for Improved Movie Recommendations

Sagi Eden, Amit Livne, Oren Sar Shalom, Bracha Shapira, Dietmar Jannach

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

Abstract

Collaborative filtering (CF) is a highly effective recommendation approach based on preference patterns observed in user-item interaction data. Since pure collaborative methods have certain limitations, e.g., when the data is sparse, hybrid approaches are a common solution, as they combine collaborative information with side information (SI) about the items. In this work, we explore the value of subtitle information for the problem of movie recommendation. Unlike previously explored types of movie SI, e.g., titles or synopses, subtitles are not only longer but also contain unique information that may help us predict more accurately whether a user will enjoy a movie. To assess the usefulness of subtitles, we propose a technical framework named SubtitleCF that combines user and item embeddings derived from interaction data and SI. The subtitles may be embedded in different ways, e.g., with Latent Dirichlet Allocation (LDA) or neural techniques. Computational experiments with a framework instantiation that relies on Bayesian Personalized Ranking (BPR) as an industry-strength method for item ranking and on different text embedding methods demonstrate the value of subtitles in terms of prediction accuracy and coverage. Moreover, a user study (N=247) reveals that the information contained in subtitles can be leveraged to improve the decision-making processes of users.
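
The abstract does not state how SubtitleCF fuses collaborative and subtitle embeddings, so the following is only a minimal sketch of the general idea, not the authors' architecture. It assumes that an item vector is the sum of a learned collaborative embedding and a linear projection of a fixed subtitle vector, and it trains with the standard BPR pairwise loss using plain SGD. The class name `HybridBPR`, the variable `subtitle_vecs`, and all hyperparameters are hypothetical; the subtitle vectors are random stand-ins for, say, LDA topic mixtures.

```python
import numpy as np


class HybridBPR:
    """Toy BPR model. Assumed design (not from the paper): item vector =
    collaborative embedding + linear projection of a fixed subtitle vector."""

    def __init__(self, n_users, subtitle_vecs, dim=8, seed=0):
        rng = np.random.default_rng(seed)
        n_items, text_dim = subtitle_vecs.shape
        self.S = subtitle_vecs                           # fixed subtitle features
        self.U = 0.1 * rng.normal(size=(n_users, dim))   # user embeddings
        self.V = 0.1 * rng.normal(size=(n_items, dim))   # collaborative item embeddings
        self.W = 0.1 * rng.normal(size=(text_dim, dim))  # text-to-latent projection

    def item_vec(self, i):
        return self.V[i] + self.S[i] @ self.W

    def score(self, u, i):
        return float(self.U[u] @ self.item_vec(i))

    def step(self, u, i, j, lr=0.05, reg=0.01):
        """One SGD step on -log(sigmoid(s_ui - s_uj)) for a triple
        (user, preferred item, other item). Returns the loss before the update."""
        x = self.score(u, i) - self.score(u, j)
        loss = float(np.log1p(np.exp(-x)))
        g = -1.0 / (1.0 + np.exp(x))  # derivative of the loss with respect to x
        u_old = self.U[u].copy()
        diff = self.item_vec(i) - self.item_vec(j)
        self.U[u] -= lr * (g * diff + reg * self.U[u])
        self.V[i] -= lr * (g * u_old + reg * self.V[i])
        self.V[j] -= lr * (-g * u_old + reg * self.V[j])
        self.W -= lr * (g * np.outer(self.S[i] - self.S[j], u_old) + reg * self.W)
        return loss


# Toy data: 3 users, 6 items, 5-dimensional stand-in subtitle vectors.
subtitle_vecs = np.random.default_rng(1).random(size=(6, 5))
model = HybridBPR(n_users=3, subtitle_vecs=subtitle_vecs)

# Each triple says: user u prefers item i over item j.
triples = [(0, 0, 1), (0, 2, 3), (1, 4, 5), (2, 1, 0)]

losses = []
for epoch in range(500):
    epoch_loss = sum(model.step(u, i, j) for u, i, j in triples)
    losses.append(epoch_loss / len(triples))
```

Because the projection `W` is shared across items, an item with few interactions still receives a non-trivial vector from its subtitle features, which is the usual motivation for hybrid models on sparse data. Summation is only one possible fusion choice; concatenation or a learned gate would fit the same loss.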

Original language: English
Title of host publication: UMAP2022 - Proceedings of the 30th ACM Conference on User Modeling, Adaptation and Personalization
Publisher: Association for Computing Machinery, Inc
Pages: 99-109
Number of pages: 11
ISBN (Electronic): 9781450392075
State: Published - 7 Apr 2022
Event: 30th ACM Conference on User Modeling, Adaptation and Personalization, UMAP2022 - Virtual, Online, Spain
Duration: 4 Jul 2022 → 7 Jul 2022

Publication series

Name: UMAP2022 - Proceedings of the 30th ACM Conference on User Modeling, Adaptation and Personalization

Conference

Conference: 30th ACM Conference on User Modeling, Adaptation and Personalization, UMAP2022
Country/Territory: Spain
City: Virtual, Online
Period: 4/07/22 → 7/07/22

Keywords

  • Hybrid Systems
  • Movie Recommendation
  • Side Information
  • Subtitles

ASJC Scopus subject areas

  • Software
