elasticsearch
elasticsearch copied to clipboard
[ML] Refactoring http settings and adding stats endpoint
This PR adds some comments around the connection eviction and keep-alive strategy documentation for the apache http client implementation. I also increased the connection pool limit to exceed a single route's maximum. I've run into a situation during testing where a single service can lease all the connections in the pool and effectively blocking any other services from leasing a connection.
I also created a stats endpoint at /_inference_stats
. I was hoping to use /_inference/_stats
but theoretically a user could have that as their inference endpoint id 🤷♂️
Right now the endpoint returns stats for the internal apache connection pool. I think this could be useful because it could illuminate whether a cluster has reached the maximum connections in the pool.
TODO figure out how to execute the code on each node and aggregate the results.
@elasticmachine merge upstream
Pinging @elastic/ml-core (Team:ML)