recent posts

Using natural language processing to visualize news coverage.

My notes from learning to implement PPO, including trust regions, importance sampling, and other topics.

Applying techniques based on the dynamics of the function being learned to improve performance on complex systems.