
TF-Agents: A reliable, scalable, and easy-to-use TensorFlow library for Contextual Bandits and Reinforcement Learning.

Results: 174 issues

It seems like replacing `utils.mlp_layers` with `EncodingNetwork` in `CriticRnnNetwork` will allow the use of preprocessing combiners for actions and observations. This critic network is used for SAC agents. The `ActorDistributionRnnNetwork`...

Hi, could you please clarify the description of the param max_sequence_length in ReverbAddEpisodeObserver? The description is a little confusing: max_sequence_length: An integer. `max_sequence_length` used to write to the replay...

If I enable tensor numeric checking and initialise a Learner with a PolicySavedModelTrigger, I get the following error: > TypeError: Input 'resource' of 'AssignVariableOp' Op has type float32 that does...

I understand from the [Per-Arm Features tutorial](https://www.tensorflow.org/agents/tutorials/per_arm_bandits_tutorial) that it may be "cumbersome to add" a new action to a policy, but what is the procedure for doing so? For example,...

I'm trying to implement a PPO agent to play LunarLander-v2 with the tf_agents library, as in [this tutorial](https://pylessons.com/LunarLander-v2-PPO/) ([_github repo_](https://github.com/pythonlessons/Reinforcement_Learning/tree/master/LunarLander-v2_PPO)). networks.py

```
from tf_agents.networks import actor_distribution_network, value_network
from...
```

When you get the chance, could you please take a look at this question on Stack Overflow [1]? I tried to find a community discussion forum for TF-Agents but was...

I am getting this error while trying to create a custom environment and feed it to the DQN agent: **ValueError: Only scalar actions are supported now, but action spec is:...

I used [save](https://github.com/tensorflow/agents/blob/c584e7642fbb45df79ab78dc8884248871e14f3e/tf_agents/utils/common.py#L1026) in [tf_agents.utils.common.Checkpointer](https://github.com/tensorflow/agents/blob/c584e7642fbb45df79ab78dc8884248871e14f3e/tf_agents/utils/common.py#L981-L1032) to save the checkpoint file. I would like to inspect the contents of the file (weight values, etc.); how can I...
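A checkpoint written this way can be inspected with TensorFlow's checkpoint utilities. A minimal sketch, using a tiny hand-made checkpoint (a stand-in for one written by `Checkpointer`, which wraps `tf.train.Checkpoint`) and a hypothetical `/tmp` path:

```python
import tensorflow as tf

# Build and save a tiny checkpoint; this stands in for one written by
# tf_agents.utils.common.Checkpointer. Path is illustrative.
v = tf.Variable([1.0, 2.0], name="weights")
ckpt = tf.train.Checkpoint(weights=v)
path = ckpt.save("/tmp/inspect_demo/ckpt")

# List every variable stored in the file, with its shape...
for name, shape in tf.train.list_variables(path):
    print(name, shape)

# ...and read a specific tensor's values back. Object-based checkpoints
# store variables under keys like "<attr>/.ATTRIBUTES/VARIABLE_VALUE".
reader = tf.train.load_checkpoint(path)
values = reader.get_tensor("weights/.ATTRIBUTES/VARIABLE_VALUE")
```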

- OS Platform and Distribution: Google Colab
- TensorFlow installed from: Binary
- TensorFlow version: 2.6.0
- TF Agents version: 0.9.0
- Python version: Python 3.7.12
- Installed using virtualenv?...

`PPOAgent` does not use regularization losses that are defined in the `value_net` and `actor_net`. It should add them to the other losses.
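For context, Keras networks accumulate regularization penalties in their `.losses` property; a fix along the lines suggested would sum these and add them to the agent's loss. A minimal sketch of the mechanism (the network below is illustrative, not the actual `actor_net`/`value_net`):

```python
import tensorflow as tf

# A toy network with an L2 kernel regularizer, standing in for a
# value_net or actor_net defined with regularized layers.
net = tf.keras.Sequential([
    tf.keras.layers.Dense(
        4, input_shape=(3,),
        kernel_regularizer=tf.keras.regularizers.l2(0.01)),
])
_ = net(tf.ones([1, 3]))  # a forward pass builds the layer

# Keras collects each layer's regularization penalty in `net.losses`;
# summing them gives the term the agent currently ignores.
reg_loss = tf.add_n(net.losses)
```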

type:bug
level:p1