amazon-sagemaker-examples
amazon-sagemaker-examples copied to clipboard
Example π Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using π§ Amazon SageMaker.
*Issue #, if available:* *Description of changes:* - A new SageMaker Autopilot MLOps pipeline for TimeSeries use-case using CDK. A complete project with a sample data. - Reviewed with Spec....
Corrected typo in model_parameters dictionary key *Issue #, if available:* *Description of changes:* Corrected typo in model_parameters dictionary key. *Testing done:* ## Merge Checklist _Put an `x` in the boxes...
**Extract from the workbook "machine_learning_workflow_abalone.ipynb"** When adding tags in the following estimator : mes_tags = [{'key': 'cart', 'value': 'dataengineering'}] xgb = sagemaker.estimator.Estimator( image_uris.retrieve("xgboost", region, "1.2-1"), sagemaker_execution_role, train_instance_count=1, train_instance_type="ml.m4.4xlarge", train_volume_size=5, output_path=bucket_path...
**Link to the notebook** [Notebook](https://github.com/aws/amazon-sagemaker-examples/blob/main/inference/generativeai/llm-workshop/deploy-V7-lmi/llama2_70b-lmi-trtllm.ipynb) **Describe the bug** ``` A clear and concise description of what the bug is. **To reproduce** import time resp = sm_client.describe_endpoint(EndpointName=endpoint_name) status = resp["EndpointStatus"] print("Status:...
**Link to the notebook** https://github.com/aws/amazon-sagemaker-examples/blob/main/advanced_functionality/pytorch_extend_container_train_deploy_bertopic/BERTtopic_extending_container.ipynb **Describe the bug** When I run estimator.fit() in the "Local training: Fit" section, I get this error: `time="2023-10-11T09:42:45Z" level=warning msg="a network with name sagemaker-local exists...
*Issue #, if available:* *Description of changes:* *Testing done:* ## Merge Checklist _Put an `x` in the boxes that apply. You can also fill these out after creating the PR....
**Describe the bug** I have an image classification model stored in s3 location. If I deploy the model as real-time inference, then I am able to predict any image size,...
*Issue #, if available:* *Description of changes:* *Testing done:* ## Merge Checklist _Put an `x` in the boxes that apply. You can also fill these out after creating the PR....
https://github.com/aws/amazon-sagemaker-examples-community/blob/main/torchserve/inf2/llama2/llama-2-13b.ipynb Could create an endpoint as above for llama 13b base, but it gives a timeout error on container primary for 13b chat. For above, created the neuron artifacts for...
β¦ with Sharded Data Parallelism through a custom SMP Trainer. This example shows you how to use SMP Trainer as a drop-in replacement for HuggingFace Trainer to enable Sharded Data...