deepirl_chainer
deepirl_chainer copied to clipboard
where is the demonstration data?
If I have to make demonstration, how do I compose the dimension of demonstration? For example, state = [batch, sequence, *observation_shape] action = [batch, sequence, *action_shape]
I trained "halfcheetah-v4" for above shape of demonstration, but gail and airl does not be learnt at all...
I does not know what I mistake...
and For binary cross entropy loss, why softmax function is more stable??
hi, i got some wrong when i run gail or airl, there is the error:
deepirl_chainer-master/irl/airl/airl.py", line 91, in _update_if_dataset_is_ready dataset = self._make_dataset() AttributeError: 'AIRL' object has no attribute '_make_dataset'
Is '_make_dataset' bulit_in function? thank you very much.