Building a denoising autoencoder in JAX and Haiku for better planning with model-based RL.

Exploring model-based RL and learning about its challenges firsthand.

Using natural language processing to visualize news coverage.

My notes from learning to implement PPO, including trust regions, importance sampling, and other topics.

Applying techniques based on the dynamics of the function being learned to improve performance on complex systems.