Verdi March

Results 25 issues of Verdi March

Need a PyTorch script to probe instances and software stack, then prints the necessary EFA env vars. Use this script to avoid spending hours to debug runtime crash caused by...

stale

Pyxis runtime path cannot be /fsx, otherwise error to run Docker image (directly) on multiple nodes. ```console # NOTE: below works fine for -N1. $ srun -N2 --container-image=alpine grep PRETTY...

Fix notebooks not working here and there. - document how to start required containers - fix deprecated model names - fix `FAIS.load_local()` refuses to load local `.pkl` files. - fix...

**Describe the bug** The logic to repack model artifact (notably used in MXnet) uses a temp dir under `/tmp`. However, on SageMaker notebook instance classic, this partition is limited in...

type: feature request
contributions welcome
component: Inference APIs and Interfaces