
CustomSageMakerEndpoint: Add autoscaling, value for min/max instances

Open airmonitor opened this issue 1 year ago • 2 comments

Describe the feature

The construct currently does not allow specifying the minimum and maximum number of instances. It deploys a single instance with no scaling policy.

Use Case

Under heavy instance load, the endpoint should scale out automatically to meet the new demand.

Proposed Solution

Add two parameters to the construct: max_instances and min_instances.
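
To illustrate what the two parameters could map to, here is a minimal sketch, not the construct's actual API. It assumes the scaling would go through AWS Application Auto Scaling with a target-tracking policy on the predefined SageMakerVariantInvocationsPerInstance metric. The names ScalingConfig, build_autoscaling_requests, and target_invocations_per_instance, the default target value, and the cooldowns are all hypothetical; only min_instances and max_instances come from this request.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ScalingConfig:
    """Hypothetical container for the two proposed parameters."""

    min_instances: int = 1
    max_instances: int = 1
    # Assumed scaling target: average invocations per instance per minute.
    target_invocations_per_instance: float = 100.0

    def __post_init__(self):
        # Assumption: the endpoint always keeps at least one instance.
        if self.min_instances < 1:
            raise ValueError("min_instances must be at least 1")
        if self.max_instances < self.min_instances:
            raise ValueError("max_instances must be >= min_instances")


def build_autoscaling_requests(endpoint_name, variant_name, config):
    """Build request parameters for Application Auto Scaling.

    Returns a pair of dicts shaped for the RegisterScalableTarget and
    PutScalingPolicy API calls. Nothing is sent to AWS here.
    """
    common = {
        "ServiceNamespace": "sagemaker",
        "ResourceId": f"endpoint/{endpoint_name}/variant/{variant_name}",
        "ScalableDimension": "sagemaker:variant:DesiredInstanceCount",
    }
    register_target = {
        **common,
        "MinCapacity": config.min_instances,
        "MaxCapacity": config.max_instances,
    }
    scaling_policy = {
        **common,
        # Hypothetical policy name, chosen only for this example.
        "PolicyName": f"{endpoint_name}-invocations-tracking",
        "PolicyType": "TargetTrackingScaling",
        "TargetTrackingScalingPolicyConfiguration": {
            "TargetValue": config.target_invocations_per_instance,
            "PredefinedMetricSpecification": {
                "PredefinedMetricType": "SageMakerVariantInvocationsPerInstance",
            },
            # Assumed cooldowns in seconds; tune for the workload.
            "ScaleInCooldown": 300,
            "ScaleOutCooldown": 60,
        },
    }
    return register_target, scaling_policy
```

With boto3, the two dicts could be passed to the application-autoscaling client as register_scalable_target(**register_target) and put_scaling_policy(**scaling_policy). Inside the construct, the same values would presumably be expressed as CDK resources instead of direct API calls.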

Other Information

No response

Acknowledgements

  • [ ] I may be able to implement this feature request
  • [ ] This feature might incur a breaking change

airmonitor, May 03 '24 15:05

This issue is now marked as stale because it hasn't seen activity for a while. Add a comment or it will be closed soon. If you wish to exclude this issue from being marked as stale, add the "backlog" label.

github-actions[bot], Jul 07 '24 01:07

Closing this issue as it hasn't seen activity for a while. Please add a comment @mentioning a maintainer to reopen. If you wish to exclude this issue from being marked as stale, add the "backlog" label.

github-actions[bot], Jul 14 '24 01:07