Logan Adams
> Hi @loadams, can you help start the workflow? The model checkpoint path has been moved to the persistent storage as suggested. Apologies, I was out but it should be...
> Hi @loadams, I have added the gptj and baichuan7b models to the autotp workflow, can you help start the workflow? Thanks! Done. > Now this workflow is ready for testing autotp...
> Hi @loadams, I see the environment issue should now be fixed. Can you help restart the workflow? Thanks! @delock - yes, apologies that took so long.
> @loadams I ran these two tests in my local environment, and they didn't take long. Can you help run this workflow again to see whether it is reproducible? Thanks!...
> Hi @loadams, I tried running these UTs in my environment and didn't see this timeout. Since the CPU UTs are already covered by the `cpu-torch-latest` workflow, I removed the unit tests in...
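For reference, a minimal sketch of how a couple of unit tests could be timed locally before asking CI to re-run; the test file paths below are hypothetical placeholders, not the actual DeepSpeed test files:

```python
# Minimal sketch: run specific unit tests locally via pytest to check for timeouts.
# The test paths are hypothetical placeholders, not the actual DeepSpeed test files.
import sys
import pytest

if __name__ == "__main__":
    # -x stops at the first failure; -v prints each test as it runs.
    exit_code = pytest.main([
        "-x", "-v",
        "tests/unit/inference/test_autotp_example.py",      # hypothetical path
        "tests/unit/inference/test_checkpoint_example.py",  # hypothetical path
    ])
    sys.exit(exit_code)
```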
cc: @jithunnair-amd and @rraminen - new issue opened because we closed the previous one. Once we merge the ROCm 5.6 update PR, I believe there will still be failing tests,...
Hi @annopackage - can you share a full minimal repro script with us please?
@alvieirajr - were you able to validate that swapping these resolved your issues?
@liuhui0401 - this seems like a CUDA error, or the GPUs are in a bad state. If you power cycle the machine, does nvidia-smi work?
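If it helps, here is a minimal sketch of the kind of check that can confirm whether the GPUs came back in a usable state after a power cycle; it assumes PyTorch is installed and that `nvidia-smi` is on the PATH, and is not DeepSpeed-specific:

```python
# Minimal sketch of a GPU health check after a power cycle.
import subprocess

def check_gpus():
    # 1) Does the driver respond at all?
    result = subprocess.run(["nvidia-smi"], capture_output=True, text=True)
    print(result.stdout or result.stderr)

    # 2) Can CUDA be initialized and used from PyTorch?
    import torch
    if not torch.cuda.is_available():
        print("torch.cuda.is_available() is False -- driver or CUDA problem")
        return
    for i in range(torch.cuda.device_count()):
        x = torch.ones(8, device=f"cuda:{i}")  # tiny allocation to exercise the device
        print(f"GPU {i}: {torch.cuda.get_device_name(i)}, test sum = {x.sum().item()}")

if __name__ == "__main__":
    check_gpus()
```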