data-security's Introduction

Fake Ratings in Recommender Systems

The Netflix Prize challenge inspired data enthusiasts around the globe to find the best collaborative filtering algorithm for predicting movie ratings. Their findings had and still have a lasting impact on the industry and research community. In this work we analyse the shift in performance of two of the central baseline predictors of the winning solution of this challenge when confronted with fake user ratings, an incident regularly taking place in the real world. Our results show that the predictive performance significantly decreases for the respective item targeted by such fake reviews.

Dataset & Overview

The benchmark dataset for this project can be found here in our repository, or be downloaded directly from the Website.

If you want to recreate the analysis you can run the following files in this order:

File	Content
movielens_descriptives.R	Analyze and describe dataset, plot feature distributions
model_selection.ipynb	Hyperparametertuning for kNN and SVD
scenario1.ipynb	Analyze change in prediction for review bombing
scenario2.ipynb	Analyze change in prediction for paid reviews
Fake_Ratings_in_Recommender_Systems.pdf	In-depth report of analysis

Descriptions of the code are given in the respective file.

Software Requirements

In order to run the Python code, you will need the following module versions (or higher):

python = 3.8.5
pandas = 1.1.3
numpy = 1.19.2
scikit-surprise = 1.1.1

In order to run the R code, you will need the following module versions (or higher):

R = 4.0.3
gridExtra = 2.3
dplyr = 1.0.2
readr = 1.4.0
ggplot2 = 3.3.3

Recommend Projects

adamolko / data-security Goto Github PK

data-security's Introduction

Fake Ratings in Recommender Systems

Dataset & Overview

Software Requirements

data-security's People

Contributors

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent