amazon-sagemaker-examples icon indicating copy to clipboard operation
amazon-sagemaker-examples copied to clipboard

Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.

Results 317 amazon-sagemaker-examples issues
Sort by recently updated
recently updated
newest added

*Issue #, if available:* 1. The current notebook breaks due to a bug in the inference toolkit, that processes only one responses instead of all, while that is being fixed,...

This change explicitly sets the input name in the model constructor. This change also sets framework version in compile command to avoid getting "ClientError: An error occurred (ValidationException) when calling...

*Issue #, if available:* *Description of changes:* *Testing done:* ## Merge Checklist _Put an `x` in the boxes that apply. You can also fill these out after creating the PR....

*Issue #, if available:* *Description of changes:* *Testing done:* ## Merge Checklist _Put an `x` in the boxes that apply. You can also fill these out after creating the PR....

*Issue #, if available:* *Description of changes:* *Testing done:* ## Merge Checklist _Put an `x` in the boxes that apply. You can also fill these out after creating the PR....

Description of changes: - Update Training Compiler Tensorflow 2.6.3 IOC notebook example - Add Batch tuning example for Tensorflow Testing done: testing on personal sagemaker Natobook - [x] I have...

*Issue #, if available:* Daily CI failed *Description of changes:* There were some missing variables in the JAX estimator class (i.e. framework_class and py_version). So I added the missing variables...

**Link to the notebook** [Add the link to the notebook.](https://github.com/aws/amazon-sagemaker-examples/blob/main/sagemaker_batch_transform/pytorch_mnist_batch_transform/pytorch-mnist-batch-transform.ipynb) **Describe the bug** I refer to the above notebook to test in my sagemaker notebook instance. But got error: UnexpectedStatusException:...

*Issue #, if available:* The example code in the fp16 module in `training/distributed_training/pytorch/model_parallel` is strange when training gpt-j or gpt2 in a distributed environment. In the training code, it is...

*Issue #, if available:* *Description of changes:* Update GPT-J Model training example with Tensor Parallelism using SageMaker Model Parallel Library. Update testing scripts to enable latest features with smp. *Testing...