[Performance Tracing] Enable distributed tracing for Alcor
Request
- End goal: Enable distributed tracing for Alcor
- Start with one microservice (e.g. VPC manager), enable API-level HTTP tracing and identify slow HTTP requests
- Try out function-level tracing and compare/combine with Alcor-internal performance trace
- Expand the tracing to all microservices and enable distributed tracing for all customer-facing APIs
Context Distributed Tracing 101: https://tracing.cloudnative101.dev/docs/index.html Distributed Tracing Lab for Java: https://tracing.cloudnative101.dev/docs/lab-jaeger-java.html Performance Tuning with Jaeger tracing: https://www.jaegertracing.io/docs/1.19/performance-tuning/
@haboy52581 You can update this issue with the Spring tracing information so that we can further discuss.
Zipkin vs. Jaeger: https://epsagon.com/observability/zipkin-or-jaeger-the-best-open-source-tools-for-distributed-tracing/
zipkin is more easy to integrate and less code change since dev based on the springboot