kaito icon indicating copy to clipboard operation
kaito copied to clipboard

docs: New conceptual guide for GAIE and KAITO

Open sdesai345 opened this issue 3 months ago • 2 comments

Reason for Change: New conceptual guide to add context and visual diagrams for ./gateway-api-inference-extension.md

sdesai345 avatar Sep 19 '25 20:09 sdesai345

Title

Add conceptual guide for inference traffic routing with GAIE and KAITO


Description

  • Added conceptual guide for inference traffic routing

  • Detailed KAITO integration with Gateway API Inference Extension

  • Explained core concepts and request flow with diagrams

  • Provided advanced use cases and implementation guidance


Changes walkthrough 📝

Relevant files
Documentation
inference-traffic-routing.md
New conceptual guide for inference traffic routing             

website/docs/inference-traffic-routing.md

  • Added new conceptual documentation file
  • Detailed KAITO-GAIE integration architecture
  • Explained core concepts: InferencePool, InferenceModel, Endpoint
    Picker
  • Provided visual diagrams of system architecture and request flow
  • +161/-0 

    Need help?
  • Type /help how to ... in the comments thread for any questions about PR-Agent usage.
  • Check out the documentation for more information.
  • kaito-pr-agent[bot] avatar Sep 19 '25 20:09 kaito-pr-agent[bot]

    PR Reviewer Guide 🔍

    Here are some key observations to aid the review process:

    ⏱️ Estimated effort to review: 2 🔵🔵⚪⚪⚪
    🧪 No relevant tests
    🔒 No security concerns identified
    ⚡ Recommended focus areas for review

    Broken Link

    The link to ./gateway-api-inference-extension.md might be broken since the target file isn't created in this PR.

    Learn how to enable GAIE and get started with sample deployments on your Kubernetes cluster using [Gateway API Inference Extension with KAITO guide](./gateway-api-inference-extension.md).
    
    Diagram Inconsistency

    The Endpoint Picker Pod box uses inconsistent hyphen patterns that may break diagram rendering.

    +------------------------+           +--------------------------------+
    |     InferenceModel     | ------->  |  HTTPRoute (Gateway API)       |
    +------------------------+           |  - Route rules                 |
    
    Formatting Issue

    Extra trailing spaces in the GAIE Controller box may cause misalignment in rendered diagrams.

    | - Model metadata          |  
    

    kaito-pr-agent[bot] avatar Sep 19 '25 20:09 kaito-pr-agent[bot]