service-capacity-modeling Improve C* scaling logic when including EVCache in KV plan

Improve C* scaling logic when including EVCache in KV plan

Open alexsyeo opened this issue 1 year ago • 0 comments

In our current logic (https://github.com/Netflix-Skunkworks/service-capacity-modeling/blob/main/service_capacity_modeling/models/org/netflix/key_value.py#L85), we scale the C* cluster by a factor of 1 - estimated_kv_cache_hit_rate, where estimated_kv_cache_hit_rate is configurable (default 0.8).

Per a previous convo with @jolynch and @szimmer1, we discussed possibly tying in the read/write ratio from the user desires into this calculation.

One toy example:

estimated_cache_hit_rate = extra_model_arguments.get("estimated_cache_hit_rate", 0.8)
estimated_cache_miss_rate = 1 - estimated_cache_hit_rate
rps_interval.scale(min(estimated_cache_miss_rate, max(0.1, 1 - read_write_ratio)))

May 08 '23 17:05 alexsyeo

service-capacity-modeling service-capacity-modeling copied to clipboard

Improve C* scaling logic when including EVCache in KV plan

service-capacity-modeling
service-capacity-modeling copied to clipboard