Sergio Guadarrama

Results 68 comments of Sergio Guadarrama

What I meant is that one should use the same policy to collect the data that is used for training. Mixing policies between algorithms is not guaranteed to work.

I'm not sure what code you are running, or what you mean by "using greedy policy in training" (since that should only be used for eval). Are you using...

You can try overriding the dtype of the `step_type` in the Policy given to the Driver.
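A minimal sketch of the idea, using plain-Python stand-ins (the real classes are `tf_agents.trajectories.TimeStep` and a `TFPolicy` subclass; the wrapper and names here are hypothetical, just to show where the cast goes):

```python
from dataclasses import dataclass, replace

@dataclass(frozen=True)
class TimeStep:
    # Stand-in for tf_agents.trajectories.TimeStep.
    step_type: object
    reward: float
    discount: float
    observation: object

class StepTypeCastingPolicy:
    """Hypothetical wrapper: casts step_type to the dtype the
    wrapped policy expects before delegating the action call."""
    def __init__(self, wrapped_policy, cast_fn):
        self._wrapped = wrapped_policy
        self._cast = cast_fn

    def action(self, time_step):
        fixed = replace(time_step, step_type=self._cast(time_step.step_type))
        return self._wrapped.action(fixed)

# Toy policy that requires an int step_type.
class ToyPolicy:
    def action(self, time_step):
        assert isinstance(time_step.step_type, int)
        return 0  # constant action

policy = StepTypeCastingPolicy(ToyPolicy(), cast_fn=int)
ts = TimeStep(step_type=1.0, reward=0.0, discount=1.0, observation=None)
print(policy.action(ts))  # the wrapped policy now sees an int step_type
```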

Unfortunately the DynamicDriver has dynamic shapes and doesn't allow jit-compilation; you can compile the Network or the Policy, though.
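For instance, you can wrap just the forward computation in a `tf.function` with `jit_compile=True` while the driver loop stays uncompiled (the `forward` function below is a toy stand-in for a network or policy call):

```python
import tensorflow as tf

# Toy stand-in for a network/policy forward pass; the real call
# would be e.g. network.__call__ or policy.action.
@tf.function(jit_compile=True)  # XLA-compiles only this function
def forward(observation):
    return tf.reduce_sum(observation, axis=-1)

print(forward(tf.ones([2, 3])).numpy())  # [3. 3.]
```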

Take a look at this [example](https://github.com/tensorflow/agents/blob/93c6b1b40e869e27f6bbaaa1d6cc8d24ff367cb9/tf_agents/policies/policy_saver.py#L91):

```
saved_policy = tf.compat.v2.saved_model.load('policy_0')
time_step = ...
while True:
  policy_step = saved_policy.action(time_step)
  time_step = f(policy_step.action)
```

Instead of modifying the encoding network, we recommend using `NestMap` to create networks with multiple inputs. See examples [here](https://github.com/tensorflow/agents/blob/76397a546c1f8bdea1d7690c878fb95e874751a8/tf_agents/networks/nest_map_test.py#L71) and [here](https://github.com/tensorflow/agents/blob/488e5399db40102dae256932f6c69343f6849128/tf_agents/examples/sac/haarnoja18/sac_train_eval.py#L81).

If you want to pass multiple inputs to the same Network (assuming the Network knows how to handle multiple inputs), what you need to do is nest the inputs appropriately...
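As a rough illustration of the nesting idea in plain Python (the network class here is hypothetical; in TF-Agents the nest structure of the observation must match the network's `input_tensor_spec`):

```python
class TwoInputNetwork:
    """Hypothetical network that expects its input as a dict nest
    with keys 'image' and 'state'."""
    def __call__(self, inputs):
        # The network unpacks the nest itself; callers just pass
        # a matching structure instead of positional arguments.
        return sum(inputs['image']) + sum(inputs['state'])

net = TwoInputNetwork()
# Nest the inputs to match the structure the network expects,
# rather than modifying the network to take multiple arguments.
observation = {'image': [1, 2, 3], 'state': [10, 20]}
print(net(observation))  # 36
```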

If you update tensor2tensor to 1.13.1 it should work: that release removed the tf-agents dependency until we fix the issue. See https://github.com/tensorflow/tensor2tensor/commit/a4071d62f510a3b0dace62f9fa78e2f9a60c5c40#diff-2eeaed663bd0d25b7e608891384b7298

Thanks for your contribution! I think it would be great to get this as a PR with a simple example using ParallelEnv. I would probably suggest renaming `UnbatchingObserver` to `BatchedObserverUnbatching`...

Can you make sure that the actors are generating the data that the learner needs? For instance, can you get data by doing:

```
next(learner._experience_iterator)
```