Pooja Krishnan
Pooja Krishnan
Using structured_outputs, I get ['Output0', 'Output1', 'Output2 '].
> > > > Hello, I am facing the same issue. Originally I had an onnx model with opset 14, then I downgraded it to both 12 and 13 but...
> Thanks for reporting but could you give us a short reproducer as our CI didn't catch any regression here? I run it as a part of fairseq. This test...
Can you point me to the test case that tests the training of the transformer XL model in huggingface? Maybe I can tune the parameters accordingly to reproduce the issue
actually this is still a problem. Can you please try by setting the params d_embed and d_model iwith same value?
hi @awan-10 , Sorry for the delay in response! I was bit confused with the usage of params for deepspeed inference. I was able to get the following code working...
hi, 1) As discussed, I tried saving a deepspeed checkpoint along with state_dict. But when I try to load it via the checkpoint.json file as shown in the tutorial -https://www.deepspeed.ai/tutorials/inference-tutorial/....
Hello, I have one more question wrt to inference on torch checkpoint using deepspeed. Are the kernel injections (set using deepspeed.init_inference(model, replace_with_kernel_inject=True)) applied on torch based checkpoint as well? Can...