agents issues

Results 174 agents issues


Data normalization for Contextual Bandits

Hi, I am trying to train a LinUCB or LinTS policy on my data. The policy does get trained, but it seems to be selecting one particular arm for all...

sj31867

bandits
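When a linear bandit policy keeps picking the same arm, one thing that may be worth checking is whether the context features sit on very different scales. The block below is a minimal sketch of per-feature standardization in plain Python. It is not TF-Agents code, the helper names `fit_standardizer` and `standardize` are hypothetical, and it assumes each context is a flat list of floats.

```python
import math


def fit_standardizer(contexts):
    """Compute per-feature mean and standard deviation from context rows."""
    n = len(contexts)
    dims = len(contexts[0])
    means = [sum(row[j] for row in contexts) / n for j in range(dims)]
    stds = []
    for j in range(dims):
        var = sum((row[j] - means[j]) ** 2 for row in contexts) / n
        # Fall back to 1.0 for constant features to avoid division by zero.
        stds.append(math.sqrt(var) or 1.0)
    return means, stds


def standardize(row, means, stds):
    """Shift and scale one context row using previously fitted statistics."""
    return [(x - m) / s for x, m, s in zip(row, means, stds)]


# Toy contexts (made up): the second feature is ~1000x larger than the first.
contexts = [[1.0, 1000.0], [2.0, 3000.0], [3.0, 2000.0]]
means, stds = fit_standardizer(contexts)
scaled = [standardize(row, means, stds) for row in contexts]
```

The statistics would be fitted once on training contexts and then reused unchanged at serving time, so the policy sees the same scaling in both places.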

Contextual Bandits High training time

I am trying to train LinUCB and LinTS with arm features, but the training time for these models seems to be pretty high, with global features of ~200D and per arm...

sj31867

bandits

Tutorial on Multi objective optimization and Constraint Optimization

Hi, can we have some tutorials on multi-objective optimization and constrained optimization?

sj31867

ValueError: Exception encountered when calling layer "QNetwork" (Issues with Q-Networks)

Hi, I'm training with Q Networks. This is the first time I'll be using Q Networks so I am not sure why I am getting an error. Here's my code...

techGIAN

EnvironmentSteps tf_metric bug with parallel envs

I think there's an error when using the TensorFlow EnvironmentSteps metric. Let's say we're using parallel environments (with 10 envs) and setting collect_steps_per_iteration (to 5) in a DynamicStepDriver. I...

vittorione94

dealing with mjWARN_BADQACC

Hello, this is more of a feature enhancement. For environments that are unstable, i.e. that rarely return a mjWARN_BADQACC, it would be great if, instead of abruptly stopping the training, it would return...

vittorione94

How can I save network.DistributionNetwork?

Dear authors: I want to save a DistributionNetwork object. Is there any solution? I could only think of `pickle`, which directly saves the object, but I didn't find any...

Rejuy

Incorrect formatting for time_step, trajectory when using tf.print

When passing a `TimeStep` or `Trajectory` to `tf.print`, the output presents the objects incorrectly in that some field values get printed in the wrong places. The following code demonstrates this:...

coreyleveen

PolicySavedModelTrigger has no sweeping ability.

I am having an issue with the saved model trigger when using the learner API. Over a long enough time period, the saved model trigger has no limiter for the...

brianorbrain
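The kind of sweeping asked for here could be approximated outside the trigger by pruning old export directories. The block below is a minimal sketch, not part of the TF-Agents API: the helper `sweep_old_dirs` is hypothetical, and it assumes every export directory name ends in an integer step count (for example `policy_000001000`), which may not match the names the trigger actually writes.

```python
import os
import shutil
import tempfile


def sweep_old_dirs(root, keep):
    """Delete all but the `keep` newest subdirectories of `root`.

    Assumption: each subdirectory name ends in "_<integer step>";
    this naming scheme is hypothetical.
    """
    def step_of(name):
        return int(name.rsplit("_", 1)[-1])

    subdirs = [
        d for d in os.listdir(root) if os.path.isdir(os.path.join(root, d))
    ]
    subdirs.sort(key=step_of)
    # Everything except the last `keep` entries is considered stale.
    stale = subdirs[:-keep] if keep > 0 else subdirs
    for name in stale:
        shutil.rmtree(os.path.join(root, name))
    return stale


# Usage on a throwaway directory with four fake exports.
root = tempfile.mkdtemp()
for step in (1000, 2000, 3000, 4000):
    os.makedirs(os.path.join(root, f"policy_{step:09d}"))
removed = sweep_old_dirs(root, keep=2)
remaining = sorted(os.listdir(root))
```

Sorting by the parsed step rather than by modification time keeps the result deterministic even if directories are copied or touched later.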

WARNING:tensorflow:Value in checkpoint could not be found in the restored object: (root).agent._optimizer._variables.1

When using checkpointing from tf_agent, a lot of these warnings are reported, and I cannot disable them.

huiyujie
