notification-api
notification-api copied to clipboard
Refactor alerts to distinguish between infrastructure failure, capacity, or service limits
Candidates for refactoring:
- [ ] logs-10-celery-error-1-minute-critical: This is currently tracking
"?\"ERROR/Worker\" ?\"ERROR/ForkPoolWorker\" ?\"WorkerLostError\""
found in cloudwatcheks-cluster/application
logs. This is too generic. Identify a way to distinguish between intentional thrown errors (message limits) and legitimate celery failures.