Skip to main content
Article
Finding Similar Movies: Dataset, Tools, and Methods
HCI-Europe 2018, International Conferences in Central Europe on Human Computer Interaction (2018)
  • Hongkun Leng
  • Caleb De La Cruz Paulino
  • Momina Haider
  • Rui Lu
  • Zhenhui Zhou
  • Ole J Mengshoel
  • Per-Erik Brodin
  • Julien Forgeat
  • Alvin Jude
Abstract
Recommender systems are becoming ubiquitous in online commerce as well as in video-on-demand (VOD) and music streaming services. A popular form of giving recommendations is to base them on a currently selected product (or service), and provide “More Like This,” “Products Similar to This,” or “People Who Bought This also Bought” functionality. These recommendations are based on similarity computations, also known as item-item similarity computations. Currently, such computations are typically implemented by heuristic algorithms, which may not match the perceived item-item similarity of users. In contrast, we study in this paper a crowd-sourced,
data-driven approach to similarity. Specifically, we develop four similarity methods and investigate how user-contributed labels can be used to improve similarity computations to better match user perceptions in movie recommendations. The four methods were tested against the current best baseline method in a user experiment (n = 114) using the MovieLens 20M dataset. Our experiment showed that all our methods beat the benchmark method and the differences were both statistically and practically significant. This paper’s main contributions include user evaluation of similarity methods for movies; user-contributed labels indicating movie similarities; and
code for the annotation tool. These can be found at http://MovieSim.org.
Keywords
  • Recommender Systems,
  • Item-Item Similarity,
  • Crowdsourcing,
  • Supervised Learning,
  • MovieLens
Publication Date
May, 2018
Citation Information
Hongkun Leng, Caleb De La Cruz Paulino, Momina Haider, Rui Lu, et al.. "Finding Similar Movies: Dataset, Tools, and Methods" HCI-Europe 2018, International Conferences in Central Europe on Human Computer Interaction (2018)
Available at: http://works.bepress.com/ole_mengshoel/85/