SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.

Results 530 skypilot issues

VPN for AWS Usage: ```bash $ sky launch --cloud aws --env TAILSCALE_KEY= 'python3 -m http.server 8000' # Lookup the private IP inside the VPN # Join Tailscale VPN in Mac...

In the current master (`d0f20abaa58d6da3876c58363fb1390c5d32a7a2`), the meaning of `DEVICE_MEM` in `sky show-gpus` does not seem consistent. For example, in AWS, it represents the total device memory across all GPUs; while in...

Requested by #3347. This shows how to run a torch model server with NVIDIA Triton. Requires torch models to be exportable by `torch.jit`. To run the example: ``` sky launch -c...

From user: > Can whatever's happening in sky.utils.kubernetes.gpu_labeler be automatically applied when new nodes join the k8s cluster? > Capacity changes happen manually right now. It has just happened a...

k8s

TPU v5 was recently added to the catalog, so it might be interesting to see if there is any use case we can support.

enhancement
good first issue

A user requires support for AWS CloudFront or AWS Route 53 in SkyServe. Original conversation: ```text Q: Underneath AWS covers is this ELB? within a AWS VM... Reason I ask - is...

serve

To improve observability, it would be useful to have the spot controller expose a Prometheus endpoint. This would give us better insight into our team's behaviour and usage patterns. ##...

GCP recently made the dynamic workload scheduler feature available in public preview. This system works by letting you submit a request to get N number of GPU VMs and puts...

### Description I encountered an issue with skypilot-nightly where a subprocess call hangs indefinitely. This occurs when trying to execute a task using the sky.exec method in my Python script....

User reported their `podPidsLimit` on the kubelet was set too low, causing very long times during provisioning and going from setup->run. > The process of sky serve up can be...

k8s