guidance-for-scalable-model-inference-and-agentic-ai-on-amazon-eks icon indicating copy to clipboard operation
guidance-for-scalable-model-inference-and-agentic-ai-on-amazon-eks copied to clipboard

Comprehensive, scalable ML inference architecture using Amazon EKS, leveraging Graviton processors for cost-effective CPU-based inference and GPU instances for accelerated inference. Guidance provides...

Results 4 guidance-for-scalable-model-inference-and-agentic-ai-on-amazon-eks issues
Sort by recently updated
recently updated
newest added

### Proposal The 'Important Setup Instructions' section under 'Quick Start Guide' appears to be lacking detailed information that could aid users in successfully setting up the deployment. It would be...

*Issue #, if available:* *Description of changes:* By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

Bumps [transformers](https://github.com/huggingface/transformers) from 4.46.0 to 4.50.0. Release notes Sourced from transformers's releases. Release v4.50.0 New Model Additions Model-based releases Starting with version v4.49.0, we have been doing model-based releases, additionally...

dependencies
python

Bumps [ray](https://github.com/ray-project/ray) from 2.39.0 to 2.43.0. Release notes Sourced from ray's releases. Ray-2.43.0 Highlights This release features new modules in Ray Serve and Ray Data for integration with large language...

dependencies
python