acme issues

Stupid question: Where do the 'objectives' go in the MOMPO model?

In the MOMPO model, I see a lot of 'task', 'objective', and 'reward', but I'm not sure where the objectives are input to this model. Sorry, it's quite a lot...

MotorCityCobra

Regarding Seed for tf rl agents in acme

3

Hello, Could you please let me know if there is any seed function in acme to fix the TF RL agent randomness? I observed that results vary for each run...

kmukeshreddy

proper way to save and load

2

Hello! a tutorial for proper setup of experiments, saving, logging and loading would be much appreciated! I run into problems restoring checkpoints: Currently I am using the following setup: -...

philippGraf

pip install dm-acme[jax] has problem

1

I followed the instruction of the installation. However, some bad things happened. This is a amd computer, window system and use Python 3.9.13. pip install dm-acme[tensorflow] works well. But pip...

ruyikang

Use typing.Mapping instead of collections.abc.Mapping

Annotation with collections.abc.Mapping does not work with python3.8.

ethanluoyc

Qestion about updating the agent

5

Hi, I have some silly questions about updating the agent. I know the general framework of training is as follow: ``` while True: # Make an initial observation. step =...

ZixuanLiu4869

Fix policy loss gradient in TD3

1

The gradients dq_da is currently incorrect. The gradients for each dimension from the action should be summed instead of averaged as per https://github.com/deepmind/rlax/blob/master/rlax/_src/policy_gradients_test.py#L55 For example, the D4PG agent also doesn't...

ethanluoyc

run_dqn demo fails with distributed training: ValueError: TrajectoryColumns cannot contain any None data references

6

Ubuntu 20.04 cuda-11.4 4 GPU / 4 CPU core node Setup (from fresh VM): ```shell > apt-get update && apt-get install -y --no-install-recommends \ libgl1-mesa-glx libosmesa6 libglew-dev > pip install...

rdevon

Regarding Hyperparameter Search for acme tf agents (d4pg, dmpo)

Hello, Could you please let me know whether there is any hyperparameter search available inside the acme library for acme tf agents like d4pg, dmpo, etc? (or) Could you please...

kmukeshreddy

BUGFIX: Unpin tensorflow_datasets

3

The current pinned version does not work for some of the d4rl locomotion v2 datasets due to the wrong specification in metadata types.

ethanluoyc

acme
acme copied to clipboard

Metadata

Stupid question: Where do the 'objectives' go in the MOMPO model?

Regarding Seed for tf rl agents in acme

proper way to save and load

pip install dm-acme[jax] has problem

Use typing.Mapping instead of collections.abc.Mapping

Qestion about updating the agent

Fix policy loss gradient in TD3

run_dqn demo fails with distributed training: ValueError: TrajectoryColumns cannot contain any None data references

Regarding Hyperparameter Search for acme tf agents (d4pg, dmpo)

BUGFIX: Unpin tensorflow_datasets

← Metadata

Owner

Metadata

acme acme copied to clipboard

Metadata

← Metadata

Owner

Metadata

acme
acme copied to clipboard