MovieLens 100k with the original SafeOpt algorithm

Hi there!

I'm trying to replicate the SafeOpt result on the MovieLens 100k dataset so I can use it as a baseline for my class project. I followed the procedure described in the paper but could not reproduce the result. Could I get some help? Thank you!

Here are the steps that I followed:

<img width="321" alt="Screen Shot 2021-11-16 at 10 25 11 AM" src="https://user-images.githubusercontent.com/38264919/142043739-54779ff7-3258-4826-a6e4-98f6b7e17093.png">

<img width="320" alt="Screen Shot 2021-11-16 at 10 25 19 AM" src="https://user-images.githubusercontent.com/38264919/142043759-0eea4c49-5d5c-4c19-b26c-cae7e5f1c363.png">

And here is my code: 
<img width="683" alt="Screen Shot 2021-11-16 at 10 36 26 AM" src="https://user-images.githubusercontent.com/38264919/142045292-4ef0cbc9-5b59-4079-86c2-e715de67e641.png">

The problems right now are:

- The algorithm breaks the safety constraint quite quickly (it explores movies that have a 1.0 rating).
- After some iterations, the algorithm tends to get stuck on one or a few high-rating recommendations and stops generating novel results.
- Related to the second point, it explores only a small portion of the reachable set.
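To make these three symptoms easier to compare across runs, here is a minimal sketch of a summary that could be computed from a run log. Everything in it is an assumption for illustration: the function name `summarize_run`, the log format (one picked item index and one observed rating per iteration), the safety threshold value, and the way the reachable set is passed in. None of it comes from the paper or from the SafeOpt code.

```python
from typing import Dict, Optional, Sequence, Set, Union


def summarize_run(
    picked: Sequence[int],
    ratings: Sequence[float],
    safety_threshold: float,
    reachable: Set[int],
) -> Dict[str, Union[int, float, Optional[int]]]:
    """Summarize one exploration run (hypothetical log format).

    picked: item index chosen at each iteration
    ratings: observed rating for each pick, same length as `picked`
    safety_threshold: assumed minimum rating that counts as safe
    reachable: item indices assumed reachable from the safe seed set
    """
    if len(picked) != len(ratings):
        raise ValueError("picked and ratings must have the same length")

    # Iterations whose observed rating fell below the safety threshold.
    violations = [t for t, r in enumerate(ratings) if r < safety_threshold]

    # Distinct items tell us how much the run repeats itself.
    distinct = set(picked)

    return {
        "num_violations": len(violations),
        "first_violation_iter": violations[0] if violations else None,
        "num_distinct": len(distinct),
        # Fraction of iterations that re-picked an already explored item.
        "repeat_fraction": 1 - len(distinct) / len(picked) if picked else 0.0,
        # Share of the assumed reachable set that was actually explored.
        "reachable_coverage": (
            len(distinct & reachable) / len(reachable) if reachable else 0.0
        ),
    }


# Toy run: item 3 is picked three times, item 7 once with a rating of 1.0.
summary = summarize_run(
    picked=[3, 3, 7, 3],
    ratings=[4.0, 4.0, 1.0, 5.0],
    safety_threshold=2.0,  # assumed value, not from the paper
    reachable={1, 3, 7, 9},
)
print(summary)
```

An early `first_violation_iter` corresponds to the first symptom, a high `repeat_fraction` to the second, and a low `reachable_coverage` to the third.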

We experimented with different std and noise variance values but did not see any significant improvement. Do you notice any mistakes in our code? In addition, would it be possible to share the code you used for the evaluation in the paper?

Thank you very much!


