
This repository holds all the code for the site http://www.adventuresinmachinelearning.com

26 issues in adventures-in-ml-code

Hi, there is a bug in `policy_gradient_reinforce_tf2.py` at line 39: `loss = network.train_on_batch(states, discounted_rewards)`. To fix this I made two changes: 1. one-hot encode the actions: `one_hot_encode = np.array([[1 if a==i...
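The one-hot fix in this issue can be sketched as follows. This is a hypothetical reconstruction, not the reporter's exact patch: the `actions`, `num_actions`, and `discounted_rewards` values are stand-ins for what the surrounding training loop would supply.

```python
import numpy as np

# Sample data standing in for one episode's trajectory (assumed shapes):
actions = [0, 2, 1]                            # actions taken at each step
num_actions = 3                                # size of the action space
discounted_rewards = np.array([1.0, 0.5, 0.25])  # discounted returns G_t

# One-hot encode the taken actions, as the issue suggests.
one_hot_actions = np.array(
    [[1.0 if a == i else 0.0 for i in range(num_actions)] for a in actions]
)

# Scale each one-hot row by its return. Passing this as the "target" to a
# network compiled with categorical cross-entropy makes train_on_batch
# minimize -G_t * log pi(a_t | s_t), the REINFORCE loss.
targets = one_hot_actions * discounted_rewards[:, None]
# e.g. loss = network.train_on_batch(states, targets)
```

With this change, `train_on_batch` receives a target of shape `(batch, num_actions)` matching the network's softmax output, instead of a flat reward vector.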

I am getting this error while running the code in `per_duelingq_spaceinv_tf2.py` on Google Colab with TensorFlow 2.3.0: `ValueError Traceback (most recent call last) in...

Hi. The [code](https://github.com/adventuresinML/adventures-in-ml-code/blob/master/policy_gradient_reinforce_tf2.py) is not working at this line: `loss = network.train_on_batch(states, discounted_rewards)`.

`loss = update_network(network, rewards, states, actions, num_actions)` `loss = network.train_on_batch(states, discounted_rewards)`

First of all, thank you for the tutorial [here](https://adventuresinmachinelearning.com/policy-gradient-tensorflow-2/)! I am trying to implement and run the code from the tutorial; however, the results do not converge after 500 steps as...

```
# start in commandline: python keras_lstm.py [-h] [--data_path DATA_PATH] runopt
# 'An integer: 1 to train, 2 to test'
# i.e.: python keras_lstm.py 1
# or: python...
```
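The command-line interface described above can be sketched with `argparse`. This is an illustrative reconstruction of the usage string, not the script's actual code; the `--data_path` default is an assumption.

```python
import argparse

# Build the parser matching: python keras_lstm.py [-h] [--data_path DATA_PATH] runopt
parser = argparse.ArgumentParser()
parser.add_argument("runopt", type=int,
                    help="An integer: 1 to train, 2 to test")
parser.add_argument("--data_path", type=str, default="data",
                    help="path to the training data (assumed default)")

# Simulate: python keras_lstm.py 1 --data_path /tmp/ptb
args = parser.parse_args(["1", "--data_path", "/tmp/ptb"])
# args.runopt == 1, args.data_path == "/tmp/ptb"
```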

``` """ This is the jupyter notebook version of the tutorial with some small fixes. Instead of running it in command line like "python keras_lstm.py 1" with runopt parameter here...

```
ValueError Traceback (most recent call last) in ()
     56
     57     if done:
---> 58         loss = update_network(network, rewards, states, actions, num_actions)
     59         tot_reward = sum(rewards)
     60         print(f"Episode: {episode}, Reward: {tot_reward},...
```

Shouldn't the model be fitted (`keras.fit(...)`) and used for prediction (`keras.predict(state)`) in the double Q-learning (and also the dueling Q-learning) examples? It seems you also forgot to apply the same in...

Can you please provide a tutorial on how to run `tf_word2vec.py` on Hadoop, basically to distribute the workload and then reduce?