Hello everybody,
when I used Garage's EpsilonGreedyStrategy with Gym environments, I found that sampling is not deterministic, even though I set the seed via deterministic.set_seed(seed).
After some investigation I found that Garage doesn't set any seeds for Gym. Is there a reason for that? As a user, I would expect Garage to handle seeding of the environment for me.
Happy to do a PR in case you feel like this should be added!