amazon-sagemaker-examples
Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
*Description of changes:* Heterogeneous Clusters for Amazon SageMaker model training was [announced July 2022](https://aws.amazon.com/about-aws/whats-new/2022/07/announcing-heterogeneous-clusters-amazon-sagemaker-model-training/). We're providing extensive code examples for using the feature with TensorFlow and PyTorch. *Testing done:* Run...
**Link to the notebook** Add the link to the notebook. **What aspects of the notebook can be improved?** **What are your suggestions?**
**Link to the notebook** [Add the link to the notebook.](https://github.com/aws/amazon-sagemaker-examples/blob/main/introduction_to_amazon_algorithms/linear_learner_mnist/linear_learner_mnist_with_file_system_data_source.ipynb) **Describe the bug** When run, the code below fails with an error indicating an invalid value type. ``` linear = sagemaker.estimator.Estimator( container, role,...
*Issue #, if available:* *Description of changes:* New notebook to accompany a blog post and SageMaker Hosting feature launch *Testing done:* ## Merge Checklist _Put an `x` in the boxes...
**Link to the notebook** [xgboost_customer_churn](https://github.com/aws/amazon-sagemaker-examples/blob/main/introduction_to_applying_machine_learning/xgboost_customer_churn/xgboost_customer_churn.ipynb) **Describe the bug** Notebook [xgboost_customer_churn](https://github.com/aws/amazon-sagemaker-examples/blob/main/introduction_to_applying_machine_learning/xgboost_customer_churn/xgboost_customer_churn.ipynb) fails during training, specifically in the cell that starts with `sess = sagemaker.Session()`. This looks like a code incompatibility with the...
**Link to the notebook** [Add the link to the notebook.](https://github.com/aws/amazon-sagemaker-examples/tree/main/introduction_to_amazon_algorithms/seq2seq_translation_en-de) **Describe the bug** When downloading the pretrained model from the AWS S3 bucket, I get an access denied error...
*Description of changes:* Add GPT2 model training script with HuggingFace Trainer and SageMaker Model Parallel ## Merge Checklist - [x] I have read the [CONTRIBUTING](https://github.com/aws/amazon-sagemaker-examples/blob/master/CONTRIBUTING.md) doc and adhered to the...
I tried to build my own TensorFlow algorithm in `train.py` by adapting `mnist-2.py` (available in [amazon-sagemaker-examples/sagemaker-python-sdk/tensorflow_script_mode_training_and_serving/](https://github.com/aws/amazon-sagemaker-examples/tree/master/sagemaker-python-sdk/tensorflow_script_mode_training_and_serving)) and passed it as `entry_point` using the pre-built deep learning image. The training...
I have successfully trained an LDA model on a corpus of ~100k documents. As a next step, I wish to predict the topic distribution associated with each document using batch...
**Link to the notebook** https://github.com/aws/amazon-sagemaker-examples/blob/main/step-functions-data-science-sdk/automate_model_retraining_workflow/automate_model_retraining_workflow.ipynb **Describe the bug** The Glue job does not work; I am receiving this error message: ` failed to execute with exception Internal service error: Invalid...