agents icon indicating copy to clipboard operation
agents copied to clipboard

TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.

Results 191 agents issues
Sort by recently updated
recently updated
newest added

Tensorflow2 no longer supports the contrib module. Could we replace tf.contrib.distributions with tensorflow_probability.distributions? agents/scripts/network.py ``` # Copyright 2017 The TensorFlow Agents Authors. # # Licensed under the Apache License, Version...

Is work being done to enable tf-agents to be installed on Python 3.12? * Python v3.12.9 was released on 2025-02-04. Related to: * #932 * https://devguide.python.org/versions * https://github.com/tensorflow/agents?tab=readme-ov-file#releases

I have been trying to use TF-Agents, but it is so stuck up on the Training error, I have no idea how to resolve this, I either have broadcast error,...

How reliable TF-Agents are when it comes to code dependencies? I had issues installing with SDL libraries complaining, and then I installed unstable nightly build that complained about typing-extensions compatibility...

Hi, I am looking for something similar to what RLlib does in Tensorflow eco system. Since Rllib decided to ditch TF, and I am not sure about moving to PyTorch,...

Hello everyone, I'm a beginner in Python and while I was writing a Tetris game with the pygame library I encountered the errors figure[i].x += dx AttributeError: 'int' object has...

The following code in `PPOAgent.compute_advantages` ignores value predictions for final observations in the trajectory and instead passes one-before-last values to the `generalized_advantage_estimation` function twice: ```python # Arg value_preds was appended...

I've just about explored every possible angle I can think of on this problem except for my hardware (but i am not sure where I'd begin with this?) and I...

Hello, I am a student research assistant at the Creative Machines Lab at Columbia, contributing to the Smart Building project. During my work training a PPO agent, I encountered a...

I'm implementing a continuous-action space for TF-Agents, where I want the action to be a four-element array with elements: $s \in [0,10]$, $dx \in [-1,1]$, $dy \in [-1,1]$, $dz \in...