cortex icon indicating copy to clipboard operation
cortex copied to clipboard

Production infrastructure for machine learning at scale

Results 121 cortex issues
Sort by recently updated
recently updated
newest added
trafficstars

Hi, We have recently migrated from version 0.35 to 0.42 and have observed that the Prometheus config has changed. We are no longer getting metrics such as kube_pod_status_phase etc. Although...

Is there a way to change the "default" path "/" to add a prefix like "/predict" for AsyncAPI? ``` - name: text-generator kind: AsyncAPI pod: port: 8080 - http: paths:...

enhancement

checklist: - [x] run `make test` and `make lint` - [x] make sure the targeted metrics still make sense - [ ] verify the dashboards

performance

checklist: - [x] run `make test` and `make lint` - [x] test manually (i.e. build/push all images, restart operator, and re-deploy APIs)

#### Motivation Respond to spot instance terminations more gracefully. That is to prevent getting failed requests when the traffic is supposed to migrate from the terminating instance to another one...

enhancement

#### Description Report any error (as it can be seen in `cortex describe `) that might occur due to an API misconfiguration. The Sentry DSN has to be configured by...

enhancement

#### Description Helps reduce the scale-out time. #### Additional https://docs.aws.amazon.com/autoscaling/ec2/userguide/ec2-auto-scaling-warm-pools.html https://medium.com/keikoproj/rapid-auto-scaling-on-eks-part-2-d094b9b2cd62 ##### Notes On the pricing model: > You have the option of keeping instances in the warm pool in...

performance
provisioning