acme
A library of reinforcement learning components and agents
In the MOMPO model, I see a lot of 'task', 'objective', and 'reward', but I'm not sure where the objectives are input to this model. Sorry, it's quite a lot...
Hello, could you please let me know whether acme has a seed function to fix the randomness of the TF RL agents? I observed that results vary for each run...
Hello! A tutorial on the proper setup of experiments, saving, logging, and loading would be much appreciated! I run into problems restoring checkpoints. Currently I am using the following setup: -...
I followed the installation instructions, but ran into problems. I am on an AMD machine running Windows with Python 3.9.13. pip install dm-acme[tensorflow] works well. But pip...
Annotation with collections.abc.Mapping does not work with Python 3.8.
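A minimal sketch of one likely cause, assuming the annotation subscripts the class (for example `collections.abc.Mapping[str, float]`): the ABCs in `collections.abc` only became subscriptable in Python 3.9, while `typing.Mapping` (deprecated since 3.9 but still available) works on 3.8 as well. The names `Params`, `total`, and `abc_mapping_is_subscriptable` are hypothetical and not taken from acme.

```python
import collections.abc
import sys
import typing

# typing.Mapping can be subscripted on Python 3.8 and later.
Params = typing.Mapping[str, float]


def total(params: Params) -> float:
    """Sum the values of a mapping annotated with the portable alias."""
    return sum(params.values())


def abc_mapping_is_subscriptable() -> bool:
    """Return True if collections.abc.Mapping[...] works on this interpreter."""
    try:
        collections.abc.Mapping[str, float]
        return True
    except TypeError:
        # Python 3.8 and earlier: the ABC does not support subscripting.
        return False


if __name__ == "__main__":
    print(sys.version_info[:2], abc_mapping_is_subscriptable())
    print(total({"a": 1.0, "b": 2.0}))
```

On a 3.8 interpreter the check should report that the ABC cannot be subscripted, which is the kind of failure the report describes; whether this is the exact annotation in question is an assumption.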
Hi, I have some silly questions about updating the agent. I know the general framework of training is as follows: ``` while True: # Make an initial observation. step =...
The gradient dq_da is currently computed incorrectly. The gradients for each action dimension should be summed instead of averaged, as per https://github.com/deepmind/rlax/blob/master/rlax/_src/policy_gradients_test.py#L55 For example, the D4PG agent also doesn't...
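To illustrate why the reduction matters, here is a small sketch in plain Python. It is not the acme or rlax implementation: `surrogate` and `numerical_grad` are hypothetical helpers, and the numbers are made up. It treats dq_da as a constant, builds the surrogate term dq_da · a, and compares the gradient with respect to the action when the per-dimension terms are summed versus averaged.

```python
def surrogate(dq_da, action, reduce):
    """Surrogate objective: per-dimension products dq_da[d] * action[d], reduced."""
    terms = [g * a for g, a in zip(dq_da, action)]
    total = sum(terms)
    return total if reduce == "sum" else total / len(terms)


def numerical_grad(dq_da, action, reduce, eps=1e-6):
    """Central-difference gradient of the surrogate with respect to the action."""
    grads = []
    for i in range(len(action)):
        up = list(action)
        down = list(action)
        up[i] += eps
        down[i] -= eps
        diff = surrogate(dq_da, up, reduce) - surrogate(dq_da, down, reduce)
        grads.append(diff / (2 * eps))
    return grads


if __name__ == "__main__":
    dq_da = [0.5, -1.0, 2.0, 0.25]   # made-up critic gradient, 4 action dims
    action = [0.1, 0.2, 0.3, 0.4]    # made-up action
    print(numerical_grad(dq_da, action, "sum"))   # approximately dq_da
    print(numerical_grad(dq_da, action, "mean"))  # approximately dq_da / 4
```

With the sum, the gradient reaching each action dimension is dq_da itself; with the mean, it is scaled down by the number of action dimensions, so the effective policy step size would depend on the action dimensionality.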
Ubuntu 20.04 cuda-11.4 4 GPU / 4 CPU core node Setup (from fresh VM): ```shell > apt-get update && apt-get install -y --no-install-recommends \ libgl1-mesa-glx libosmesa6 libglew-dev > pip install...
Hello, could you please let me know whether the acme library offers any hyperparameter search for acme TF agents such as d4pg, dmpo, etc.? Or could you please...
The current pinned version does not work for some of the d4rl locomotion v2 datasets because the metadata types are specified incorrectly.