amazon-sagemaker-examples icon indicating copy to clipboard operation
amazon-sagemaker-examples copied to clipboard

Example πŸ““ Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.

Results 317 amazon-sagemaker-examples issues
Sort by recently updated
recently updated
newest added

*Issue #, if available:* *Description of changes:* - A new SageMaker Autopilot MLOps pipeline for TimeSeries use-case using CDK. A complete project with a sample data. - Reviewed with Spec....

Corrected typo in model_parameters dictionary key *Issue #, if available:* *Description of changes:* Corrected typo in model_parameters dictionary key. *Testing done:* ## Merge Checklist _Put an `x` in the boxes...

**Extract from the workbook "machine_learning_workflow_abalone.ipynb"** When adding tags in the following estimator : mes_tags = [{'key': 'cart', 'value': 'dataengineering'}] xgb = sagemaker.estimator.Estimator( image_uris.retrieve("xgboost", region, "1.2-1"), sagemaker_execution_role, train_instance_count=1, train_instance_type="ml.m4.4xlarge", train_volume_size=5, output_path=bucket_path...

**Link to the notebook** [Notebook](https://github.com/aws/amazon-sagemaker-examples/blob/main/inference/generativeai/llm-workshop/deploy-V7-lmi/llama2_70b-lmi-trtllm.ipynb) **Describe the bug** ``` A clear and concise description of what the bug is. **To reproduce** import time resp = sm_client.describe_endpoint(EndpointName=endpoint_name) status = resp["EndpointStatus"] print("Status:...

**Link to the notebook** https://github.com/aws/amazon-sagemaker-examples/blob/main/advanced_functionality/pytorch_extend_container_train_deploy_bertopic/BERTtopic_extending_container.ipynb **Describe the bug** When I run estimator.fit() in the "Local training: Fit" section, I get this error: `time="2023-10-11T09:42:45Z" level=warning msg="a network with name sagemaker-local exists...

*Issue #, if available:* *Description of changes:* *Testing done:* ## Merge Checklist _Put an `x` in the boxes that apply. You can also fill these out after creating the PR....

**Describe the bug** I have an image classification model stored in s3 location. If I deploy the model as real-time inference, then I am able to predict any image size,...

*Issue #, if available:* *Description of changes:* *Testing done:* ## Merge Checklist _Put an `x` in the boxes that apply. You can also fill these out after creating the PR....

https://github.com/aws/amazon-sagemaker-examples-community/blob/main/torchserve/inf2/llama2/llama-2-13b.ipynb Could create an endpoint as above for llama 13b base, but it gives a timeout error on container primary for 13b chat. For above, created the neuron artifacts for...

… with Sharded Data Parallelism through a custom SMP Trainer. This example shows you how to use SMP Trainer as a drop-in replacement for HuggingFace Trainer to enable Sharded Data...