learning-and-adaptation

Gaussian Process Optimization based Learning for Trajectory Tracking, Okan Koc, Gajamohan Mohanarajah and Andreas Krause

=======================

TODO List for TGP (Tracking with Gaussian Processes):

Implement the saturating cost and compare performance with quadratic cost
Include the trajectory generation algorithm using splines as a new class (which can be later extended)
Include acquisition functions as a subclass of contextual bandits
Find an implemented version of PILCO + reference tracking + nominal model (Multi-Task PILCO, P.Englert)
Learn faster in complicated dynamical mismatches. Things to try:
1. reward shaping,
2. feedback added learning (indirect model learning)
3. conditioning on estimation data
4. using options in a hierarchical bandits setting [can mpc be incorporated to this approach with predetermined/flexible horizon?]
5. parametrize inputs cleverly [Gaussians or time varying linear feedback control structure?]
6. oracles: phasing as in DMP to get smooth approximating trajectories [could parameters be optimized via RKSH norm of cost differences?]

Remarks:

Can one smoothen inputs by penalizing input exploration and still achieve no-regret?
For finite horizon problems, it makes sense to explore progressively towards the end (cautious exploration)
TGP with robust trajectory generation as a point tracking algorithm to compare with PILCO

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
OLD		OLD
documentation		documentation
experiments		experiments
gpml-matlab-v3.1-2010-09-27		gpml-matlab-v3.1-2010-09-27
pilcoV0.9		pilcoV0.9
.gitignore		.gitignore
Controller.m		Controller.m
Estimator.m		Estimator.m
Filter.m		Filter.m
ILC.m		ILC.m
MPC.m		MPC.m
Model.m		Model.m
Quadrotor.m		Quadrotor.m
README		README
README.md		README.md
TGP.m		TGP.m
Trajectory.m		Trajectory.m
episodic_learning.m		episodic_learning.m
figure1.fig		figure1.fig
fix_root.m		fix_root.m
ker_matrix.m		ker_matrix.m
ker_matrix_iter.m		ker_matrix_iter.m
ker_vector.m		ker_vector.m
kernel.m		kernel.m
matlabfrag.m		matlabfrag.m
quad_est.mat		quad_est.mat
quad_exp.mat		quad_exp.mat
quad_feas_con.m		quad_feas_con.m
quad_initialize.m		quad_initialize.m
quad_traj_gen.m		quad_traj_gen.m
transfer_learning.m		transfer_learning.m
trj.mat		trj.mat

Provide feedback