Open Access. Powered by Scholars. Published by Universities.®

Statistical Models Commons

CMC Senior Theses

Distributed systems

Full-Text Articles in Statistical Models

Scalable Collaborative Filtering Recommendation Algorithms On Apache Spark, Walker Evan Casey Jan 2014

CMC Senior Theses

Recommender systems based on collaborative filtering use information about a user's preferences to make personalized predictions about content, such as topics, people, or products, that the user might find relevant. As the volume of accessible information and the number of active users on the Internet continue to grow, it becomes increasingly difficult to compute recommendations quickly and accurately over a large dataset. In this study, we introduce an algorithmic framework built on top of Apache Spark for parallel computation of the neighborhood-based collaborative filtering problem, which allows the algorithm to scale linearly with a growing number of users. We also investigate several different variants …
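
For readers unfamiliar with the neighborhood-based approach the abstract refers to, the following is a minimal single-machine sketch in plain Python. It is not the thesis's implementation and does not use Apache Spark. The function names (`cosine_similarity`, `predict`), the choice of cosine similarity, the neighborhood size `k`, and the toy `ratings` data are all assumptions made for illustration. Each prediction depends only on the target user's ratings and those of other users, so predictions for different users can be computed independently, which is the property a parallel framework would be expected to exploit.

```python
from math import sqrt


def cosine_similarity(a, b):
    """Cosine similarity between two sparse rating dicts (item -> rating)."""
    shared = set(a) & set(b)
    if not shared:
        return 0.0
    dot = sum(a[i] * b[i] for i in shared)
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b)


def predict(ratings, user, item, k=2):
    """Similarity-weighted mean rating of `item` over the k most similar
    users who have rated it. Returns None when no usable neighbor exists."""
    neighbors = [
        (cosine_similarity(ratings[user], ratings[other]), ratings[other][item])
        for other in ratings
        if other != user and item in ratings[other]
    ]
    # Keep the k neighbors with the highest similarity to the target user.
    top = sorted(neighbors, key=lambda pair: pair[0], reverse=True)[:k]
    total = sum(sim for sim, _ in top)
    if total == 0:
        return None
    return sum(sim * rating for sim, rating in top) / total


# Hypothetical toy data: user -> {item -> rating}.
ratings = {
    "alice": {"a": 5.0, "b": 3.0},
    "bob": {"a": 5.0, "b": 3.0, "c": 4.0},
    "carol": {"a": 1.0, "b": 5.0, "c": 2.0},
}

prediction = predict(ratings, "alice", "c")
```

In this toy data, alice's ratings match bob's more closely than carol's, so the prediction for item "c" is pulled toward bob's rating of 4.0 rather than carol's 2.0.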