Yeming Wen
Not sure about `model.fit`, but for the functional API, returning a RandomVariable works fine in eager mode in my local script.
I did something like `WANDB_MODE=disabled python finetune.py --args` when doing a test run.
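For reference, a minimal sketch of the same idea from inside Python, assuming the script calls `wandb.init` somewhere (the project name below is just a placeholder):

```python
import os

# Disable all wandb logging for this test run; set before wandb is imported/initialized.
os.environ["WANDB_MODE"] = "disabled"

import wandb

# With WANDB_MODE=disabled, init() returns a disabled run and nothing is uploaded.
run = wandb.init(project="finetune-test")  # placeholder project name
```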
Same error here with an A40.
Wondering if you have tried lowering `gradient_accumulation_steps`; with a larger batch size, the accumulation steps can be smaller (see the sketch below).
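To make the trade-off concrete, here is a rough sketch of the relationship; the variable names are just for illustration, not from the repo's config:

```python
# The effective batch size stays the same if one factor goes up and the other comes down.
per_device_batch_size = 2
gradient_accumulation_steps = 16
num_gpus = 1

effective_batch_size = per_device_batch_size * gradient_accumulation_steps * num_gpus
print(effective_batch_size)  # 32

# If the GPU can fit a larger per-device batch, the accumulation steps can be cut:
per_device_batch_size = 8
gradient_accumulation_steps = 4
print(per_device_batch_size * gradient_accumulation_steps * num_gpus)  # still 32
```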
Thanks for the hardware requirement info! It seems to only cover the requirements for inference. I wonder if there is anything for fine-tuning on downstream tasks?
Hi, thanks for the suggestion! I share the same question as the other issue about evaluation being slow. Running `python seq2seq/run_seq2seq.py configs/eval.json` takes 9 hours to complete on a...
I am looking into this but it will take some time for me to figure it out.
Hi, I am trying to measure the generation time of each example. I found the `generate` method wrapper for the SpiderModel in https://github.com/ElementAI/picard/blob/main/seq2seq/utils/picard_model_wrapper.py. However, I couldn't find the code which makes use...
Oh, `generate` is invoked in the Seq2SeqTrainer's `prediction_step` method.
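In case it helps anyone else trying to time per-example generation, a rough sketch of what I had in mind: subclass the trainer and time around `prediction_step`. This assumes the trainer is (or subclasses) the Hugging Face `Seq2SeqTrainer`; the class name and print format here are just placeholders, not code from the repo:

```python
import time

from transformers import Seq2SeqTrainer  # picard may use its own subclass; adjust accordingly


class TimedSeq2SeqTrainer(Seq2SeqTrainer):
    """Wraps prediction_step to record how long each generation step takes."""

    def prediction_step(self, model, inputs, prediction_loss_only, ignore_keys=None):
        start = time.perf_counter()
        outputs = super().prediction_step(
            model, inputs, prediction_loss_only, ignore_keys=ignore_keys
        )
        elapsed = time.perf_counter() - start
        # Wall-clock time of this prediction/generation step (one batch).
        print(f"prediction_step took {elapsed:.2f}s for a batch of {len(inputs['input_ids'])} examples")
        return outputs
```

With `per_device_eval_batch_size=1`, each timed step corresponds to a single example's generation time.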