cortex
cortex copied to clipboard
Production infrastructure for machine learning at scale
#### Description When clustering up, the following warning is emitted: ``` Custom AMI detected for nodegroup cx-ws-cpu-large-2. Please refer to https://github.com/weaveworks/eksctl/issues/3563 for upcoming breaking changes ``` #### Context It seems...
#### Motivation * Reduce latency when multiple requests are required * Stream output from the predictor as it's generated
#### Description This can be configured in the cluster config and updated while the cluster is running. The best way to do this is to create a secret resource that...
#### Description Provide access to the batch metrics e.g.: ``` { "succeeded": 10, "failed": 2 } ``` #### Motivation Enable users to perform specific post-job completion tasks based on the...
### Description Messages in SQS queue are not encrypted at the moment. Consider using SSE to encrypt messages. https://docs.aws.amazon.com/AWSSimpleQueueService/latest/SQSDeveloperGuide/sqs-server-side-encryption.html ### Motivation Encrypt messages at rest (while they are sitting in...
### Description [AWS's docs](https://docs.aws.amazon.com/autoscaling/ec2/userguide/capacity-rebalance.html), [blog 1](https://aws.amazon.com/blogs/compute/proactively-manage-spot-instance-lifecycle-using-the-new-capacity-rebalancing-feature-for-ec2-auto-scaling/), [blog 2](https://aws.amazon.com/about-aws/whats-new/2020/11/amazon-ec2-auto-scaling-ec2-fleet-and-spot-fleet-announce-capacity-rebalancing-for-ec2-spot-instances/) ### Additional context This is supported by eksctl Related to https://github.com/cortexlabs/cortex/issues/2185 (since it is recommended to use the capacity-optimized allocation strategy with...
### Description [Info](https://aws.amazon.com/about-aws/whats-new/2019/08/new-capacity-optimized-allocation-strategy-for-provisioning-amazon-ec2-spot-instances/) (with additional links at the bottom) ### Additional context This is supported by eksctl