tabular-rl

Here are 13 public repositories matching this topic...

Comparative study of Monte Carlo, SARSA, and Q-Learning on custom MiniGrid environments, using a phase-conditioned state representation, one-shot event-based reward shaping, and Optuna hyperparameter search. TD methods reach 100% success; Monte Carlo reaches 61% on the hard environment. RL course HW2, Reichman University.

  • Updated Jun 22, 2026
  • Jupyter Notebook

RL was cheaper. The heuristic was safer. Neither was correct. POLARIS stress-tests operational policies under chaos, demand spikes, and black swan events, asking one question: which policy survives when everything goes wrong? Built with constrained RL, Bayesian modeling, CVaR risk metrics, and a human-in-the-loop governor.

  • Updated Mar 22, 2026
  • Python
