Alpha-Ordered Gradients

Code for "Do Differentible Simulators Give Better Policy Gradients"?

Setup

Add the git repo to your PYTHONPATH. Then test by import alpha_gradient

We provide multiple examples that can be run.

To visualize the per-coordinate bias and variance on simple one-step examples,

BallWithWall example: python3 examples/ball_with_wall/alpha_coordinate_sweep.py
Pivot example: python3 examples/pivot/alpha_coordinate_sweep.py

We include some trajectory optimization examples.

Closed-loop policy optimization examples are:

Finite-Horizon Static-Policy LQR: python3 examples/linear_system/linear_test.py
Tennis: python3 examples/breakout/run_bc_policyopt.py

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
alpha_gradient		alpha_gradient
examples		examples
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
fobg_var_smoothing_sweep.npy		fobg_var_smoothing_sweep.npy
zobg_var_smoothing_sweep.npy		zobg_var_smoothing_sweep.npy