Open
Labels
enhancement (New feature or request)
Description
In his thesis, Jannis wrote a logger for DE runs, along with a parser that converts the logs into D4RL-style datasets, so that we can train offline RL algorithms from unifloral on them. This would be a generally useful feature, so we should integrate it. Proposal:
- Add an abstract logger class based on the DE DAC Logger
- Have the Logger hook into the env.reset() and env.step() functions
- Run the dataset parsing immediately after the Logger finishes
- Add an example that uses this for training with unifloral
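To make the proposal concrete, here is a minimal sketch of what the first three points could look like. It is not taken from Jannis's code: the names `AbstractTrajectoryLogger`, `D4RLStyleLogger`, `ToyEnv` and `collect_episode` are hypothetical, the env is assumed to follow the Gymnasium-style API (`reset()` returns `(obs, info)`, `step()` returns a 5-tuple), and the dataset keys follow the usual D4RL field names. Plain lists are used to keep it dependency-free; a real version would probably convert them to arrays.

```python
from __future__ import annotations

from abc import ABC, abstractmethod
from typing import Any, Callable


class AbstractTrajectoryLogger(ABC):
    """Hypothetical base class: wraps an env and records every transition."""

    def __init__(self, env: Any) -> None:
        self.env = env
        self.transitions: list[dict[str, Any]] = []
        self._last_obs: Any = None

    def reset(self, **kwargs: Any) -> tuple[Any, dict]:
        # Assumes a Gymnasium-style reset() returning (obs, info).
        obs, info = self.env.reset(**kwargs)
        self._last_obs = obs
        return obs, info

    def step(self, action: Any) -> tuple[Any, float, bool, bool, dict]:
        # Assumes a Gymnasium-style step() returning a 5-tuple.
        obs, reward, terminated, truncated, info = self.env.step(action)
        self.transitions.append(
            {
                "observations": self._last_obs,
                "actions": action,
                "rewards": reward,
                "next_observations": obs,
                "terminals": terminated,
                "timeouts": truncated,
            }
        )
        self._last_obs = obs
        return obs, reward, terminated, truncated, info

    @abstractmethod
    def parse(self) -> dict[str, list]:
        """Turn the logged transitions into a dataset."""


class D4RLStyleLogger(AbstractTrajectoryLogger):
    KEYS = (
        "observations",
        "actions",
        "rewards",
        "next_observations",
        "terminals",
        "timeouts",
    )

    def parse(self) -> dict[str, list]:
        # One list per field, aligned by transition index.
        return {key: [t[key] for t in self.transitions] for key in self.KEYS}


class ToyEnv:
    """Stand-in env for illustration only; truncates after `horizon` steps."""

    def __init__(self, horizon: int = 3) -> None:
        self.horizon = horizon
        self.t = 0

    def reset(self, seed: int | None = None) -> tuple[list[float], dict]:
        self.t = 0
        return [0.0], {}

    def step(self, action: float) -> tuple[list[float], float, bool, bool, dict]:
        self.t += 1
        truncated = self.t >= self.horizon
        return [float(self.t)], -abs(action), False, truncated, {}


def collect_episode(
    logger: AbstractTrajectoryLogger, policy: Callable[[Any], Any]
) -> dict[str, list]:
    obs, _ = logger.reset()
    done = False
    while not done:
        obs, _, terminated, truncated, _ = logger.step(policy(obs))
        done = terminated or truncated
    # Parsing happens right after the logged run, as proposed.
    return logger.parse()


logger = D4RLStyleLogger(ToyEnv(horizon=3))
dataset = collect_episode(logger, policy=lambda obs: 0.5)
```

Keeping `parse()` abstract would let other output formats reuse the same logging logic; whether the actual DE DAC Logger is structured this way still needs to be checked against the linked code.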
Related: Jannis used detpy as the DE implementation, which might be nice to add as its own env since it has several dynamic baselines (and we already have the datasets). He mentioned it should be doable, but it is not fully trivial for the baselines. This might need investigation.
Code: https://github.com/automl-private/Bsc-JannisKastner-DeDAC/tree/main
Projects
Status
Todo