rlcard icon indicating copy to clipboard operation
rlcard copied to clipboard

How to replace the DMC operating device?

Open JocyeI opened this issue 2 years ago • 5 comments

I have replaced the DMC running device with CUDA, and the network output is all 0. The output of this action is more like a random selection action. May I ask how to avoid the DMC network output result being 0 when I use CUDA training

JocyeI avatar May 23 '23 02:05 JocyeI

tensor([0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.], device='cuda:0', grad_fn=<ReshapeAliasBackward0>)

JocyeI avatar May 23 '23 02:05 JocyeI

@aetheryang

JocyeI avatar May 23 '23 02:05 JocyeI

@Walhalla-Summary That is weird. How did you change it to CUDA?

daochenzha avatar May 23 '23 15:05 daochenzha

That's not the point. When I use cuda training, all the predicted outputs of the neural network are 0 instead of floating point numbers, which makes me feel strange. Is there any way to solve this problem @daochenzha

JocyeI avatar May 24 '23 06:05 JocyeI

@Walhalla-Summary How did you change it to CUDA training? More contexts are needed to reproduce the error

daochenzha avatar Jun 09 '23 19:06 daochenzha