CustomSageMakerEndpoint: Add autoscaling, value for min/max instances
Describe the feature
The current solution doesn't allow to specify the minimum and maximum instance number. It deploys one instance only, without any scaling policy.
Use Case
Due to heavy instance load, the autoscaling group should perform scaling out to meet new demand.
Proposed Solution
two additional parameters max_instances, min_instances
Other Information
No response
Acknowledgements
- [ ] I may be able to implement this feature request
- [ ] This feature might incur a breaking change
This issue is now marked as stale because it hasn't seen activity for a while. Add a comment or it will be closed soon. If you wish to exclude this issue from being marked as stale, add the "backlog" label.
Closing this issue as it hasn't seen activity for a while. Please add a comment @mentioning a maintainer to reopen. If you wish to exclude this issue from being marked as stale, add the "backlog" label.