fast_abs_rl Error while loading the pre-trained models during RL training

Open bnaman50 opened this issue 2 years ago • 0 comments

Hello Chen,

Thanks for providing this code. It seems really helpful for my current research.

However, I am having trouble getting this code to work. I have set up the environment as suggested but am still unable to load the pre-trained models (both the extractive and the abstractive model).

The extractive model fails at the line assert ext_meta['net'] == 'ml_rnn_extractor'. In the meta.json file, the value is net: rnn-ext_abs_rl. I am not sure why there is this discrepancy.
For the abstractive model, I get RuntimeError: CUDNN_STATUS_EXECUTION_FAILED at the line self._net = abstractor.to(self._device). I am not sure how to solve this error. I made sure that CUDA is available. It also does not seem to be an out-of-memory (OOM) problem, as some pages suggest, since GPU memory usage never exceeds 1 GB.
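
To make the first mismatch easier to reproduce, here is a minimal sketch of a check that reads the net field from meta.json and compares it with the value the assertion expects. The helper names read_net_type and check_net_type are hypothetical and not part of fast_abs_rl; the only assumption about the layout is that meta.json sits directly inside the model directory.

```python
import json
import os
import tempfile


def read_net_type(model_dir):
    """Return the 'net' field stored in <model_dir>/meta.json."""
    with open(os.path.join(model_dir, "meta.json")) as f:
        meta = json.load(f)
    return meta["net"]


def check_net_type(model_dir, expected):
    """Compare the stored 'net' value with the value the loader asserts on.

    Returns (matches, found) so the caller can report what was found.
    """
    found = read_net_type(model_dir)
    return found == expected, found


if __name__ == "__main__":
    # Throwaway directory that mimics the meta.json described above
    # (hypothetical stand-in for the real pre-trained model directory).
    with tempfile.TemporaryDirectory() as tmp:
        with open(os.path.join(tmp, "meta.json"), "w") as f:
            json.dump({"net": "rnn-ext_abs_rl"}, f)
        matches, found = check_net_type(tmp, "ml_rnn_extractor")
        print(matches, found)
```

Running this against the actual extractor directory instead of the temporary one shows which network type the checkpoint was saved as, without having to hit the assertion during RL training.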

It would be great if you could help me out.

Thanks, Naman

Mar 11 '22 17:03 bnaman50
