Sanketd420

Results 1 issues of Sanketd420

loss = update_network(network, rewards, states, actions, num_actions) loss = network.train_on_batch(states, discounted_rewards)