aibrix icon indicating copy to clipboard operation
aibrix copied to clipboard

Pluggable Gateway

Open danehans opened this issue 9 months ago • 5 comments

🚀 Feature Description and Motivation

I am a kgateway maintainer. In the upcoming release, kgateway will support Kubernetes Gateway API Inference Extension. The aibrix Gateway overlaps with Gateway API Inference Extension, specifically the endpoint picker extension. Can the aibrix Gateway component become extensible in the architecture? A spec, conformance tests, etc. can be defined to support aibrix Gateway extensibility. By doing so, aibrix will benefit from wider adoption.

Use Case

As a kgateway user with Gateway API inference Extension, I prefer to integrate with Aibrix instead of having separate Gateways.

Proposed Solution

Develop an interface, spec, conformance tests, etc. to make the aibrix extensible and allow integration with Gateway API inference extension conformant implementations.

danehans avatar Mar 14 '25 16:03 danehans

Agreed here! We worked closely with aibrix contributors before and would love to continue that trend!

@Jeffwan @varungup90 I've looked for an aibrix community meeting in your docs and couldnt find one. Would be happy to pop in to to discuss what gaps we think need to be covered for GIE to be more composable with aibrix. If that sounds good, I can get a doc drafted on the topic.

Thanks! Looking forward to it!

kfswain avatar Mar 14 '25 17:03 kfswain

+1 as an Istio and kgateway maintainer, I would like our users stay with whatever gateways they feel comfortable.

linsun avatar Mar 14 '25 17:03 linsun

I followed the design of GIE in early days, and asked one question like If inference platform has their own model object, will there a conceptual conflict exists?, because according to the design, GIE has its own inferenceModel and inferencePool, but seems no answer. Nowadays, we met the question in our inference platform so we may consider to employ the envoy gateway ourself just because we don't want to have too much model-like objects to our users, which is confusing, and aibrix may introduce the same API as well, see https://github.com/vllm-project/aibrix/pull/299.

TBH, we don't want to reimplement the wheel. Not represent the aibrix community, but our platform.

kerthcet avatar Mar 17 '25 09:03 kerthcet

For AIBrix, our goal is to not restrict users to use specific component or module and be as much extensible. We should schedule a meeting to discuss more on this topic.

varungup90 avatar Mar 17 '25 19:03 varungup90

@varungup90 Any further updates on this? We're evaluating whether we can bring Istio

bjitd avatar May 14 '25 20:05 bjitd

@varungup90 - will this cause issues? We don't want to use another gateway and cause problems on our system when using this!

ericmeadows avatar Jun 29 '25 19:06 ericmeadows

@varungup90 - will this cause issues? We don't want to use another gateway and cause problems on our system when using this!

Current gateway will continue to work as-is, it will be an additional gateway option to select from.

varungup90 avatar Jun 29 '25 21:06 varungup90