MixUCB (RLC 2025)

This repository contains code for the following paper: "MixUCB: Enhancing Safe Exploration in Contextual Bandits with Human Oversight", by Jinyan Su, Wen Sun, Sarah Dean, Rohan Banerjee, Jiankai Sun, which has been accepted to the Reinforcement Learning Conference (RLC) 2025.

Installation

Create a conda environment using the provided requirements.txt file as follows:

conda create -n mixucb python=3.10
conda activate mixucb
pip install -r requirements.txt
pip install torch==2.4.1 torchvision==0.19.1 torchaudio==2.4.1 --index-url https://download.pytorch.org/whl/cpu

Main experimental script

The following script reproduces the experiments in the paper for the four datasets: synthetic, SPANet, heart disease, and MedNIST. It consists of (1) data generation, (2) running the algorithms, (3) generating plots.

bash run_all.sh

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
raw_data		raw_data
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
generate_multilabel_data.py		generate_multilabel_data.py
plot_tools.py		plot_tools.py
requirements.txt		requirements.txt
run_all.sh		run_all.sh
run_allucb.py		run_allucb.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MixUCB (RLC 2025)

Installation

Main experimental script

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

License

sdean-group/MixUCB

Folders and files

Latest commit

History

Repository files navigation

MixUCB (RLC 2025)

Installation

Main experimental script

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages