dataset-aggregation

Here are 5 public repositories matching this topic...

kwk2696 / sb3-jax-haiku

stable-baselines with JAX & Haiku

reinforcement-learning imitation-learning diffusion haiku dataset-aggregation proximal-policy-optimization behavior-cloning jax soft-actor-critic dm-haiku decision-transformers

Updated Jun 20, 2024
Python

aryan-programmer / pcb-fault-detection

Star

Modelling & Training for a AI-Driven PCB Fault Detection project.

Updated Oct 23, 2025
Jupyter Notebook

Hilton-AH / YODO-novel-RL-algorithm

Star

Using DAgger with our MPC treated as the expert, we are able to effectively distill knowledge into relatively simple networks while still being able to retain a large fraction of the performance. (Please see paper for full description).

reinforcement-learning robot-learning model-predictive-control gym-environment dataset-aggregation

Updated Aug 9, 2023
Jupyter Notebook

hartikainen / berkeley-cs294

Star

Berkeley CS 294: Deep Reinforcement Learning

reinforcement-learning deep-learning deep-reinforcement-learning dqn behavioral-cloning a3c cs294 berkeley-reinforcement-learning dataset-aggregation

Updated Jul 19, 2017
Jupyter Notebook

Hilton-AH / Imitation_Learning-Behavioral_Cloning-for-Robot-Learning

Star

Lunar Lander game from OpenAI Gym using behavioral cloning, DAgger methods, and POMDP(Partially-Observable Markov Decision Processes)

reinforcement-learning behavioral-cloning imitation-learning robot-learning dataset-aggregation

Updated Aug 9, 2023
Jupyter Notebook

Improve this page

Add a description, image, and links to the dataset-aggregation topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the dataset-aggregation topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dataset-aggregation

Here are 5 public repositories matching this topic...

kwk2696 / sb3-jax-haiku

aryan-programmer / pcb-fault-detection

Hilton-AH / YODO-novel-RL-algorithm

hartikainen / berkeley-cs294

Hilton-AH / Imitation_Learning-Behavioral_Cloning-for-Robot-Learning

Improve this page

Add this topic to your repo