ActionCLIP About KLLoss

About KLLoss

Open xk-huang opened this issue 3 years ago • 0 comments

Thanks for your amazing work! The KLLoss in the implementation is divided by feature dims (times batch_size in code), instead of batch size. The docs of PyTorch points that reduction = 'batchmean' aligns with KL math definition. I'm writing to ask the reason for the implementation choice. Thanks in advance.

Feb 22 '22 06:02 xk-huang

ActionCLIP ActionCLIP copied to clipboard

About KLLoss

ActionCLIP
ActionCLIP copied to clipboard