k8s-supporting-folding-at-home

add chart to public repo

Open joaocc opened this issue 5 years ago • 7 comments

Hi, thanks for publishing this. Have you thought about making it available in the public Helm chart repo? Thanks

joaocc avatar Mar 14 '20 22:03 joaocc

Hi,

Yes, that is an idea for the future. I first wanted to get this chart out to the public and make it more mature before creating a PR against the public repo :)

Last time I made a change to the public repo, it took some months to get merged, so I created this repo first.

Let's hope that we do not need this chart in a few months.

wind0r avatar Mar 15 '20 12:03 wind0r

We deployed this in a couple of clusters, and it seems to work, but not with GPUs. If the chart had settings to switch between GPU vendors (NVIDIA/ATI/Intel), that would be great, as it would make the chart even more plug-and-play. As for submitting the chart, the merge could be quicker this time. In addition, we can add it to public chart repos, since with Helm 3 things are a bit more distributed now.

joaocc avatar Mar 15 '20 13:03 joaocc

https://chartmuseum.com/ has a link to create a free chart-registry account on Codefresh (https://codefresh.io/codefresh-signup/)

joaocc avatar Mar 15 '20 14:03 joaocc

Wow! That's really cool to hear!

https://github.com/wind0r/k8s-supporting-folding-at-home/issues/1 describes the changes needed to make it work, at least with NVIDIA GPUs. I was able to test it successfully on my workstation, but I do not have a GPU cluster to validate the chart. Also, I still want to read up on the license agreements. :/

I will check how I can publish the helm chart :)

wind0r avatar Mar 15 '20 15:03 wind0r

I think the two things could be handled separately. There are already some Docker images that seem to support GPUs (https://hub.docker.com/r/coreweave/fah-gpu). Using this image should let the chart avoid licensing issues and keep the focus on making the GPUs accessible to the pods, in a way that stays aligned with the image.

joaocc avatar Mar 15 '20 17:03 joaocc

Got this (the Docker image here: https://hub.docker.com/r/coreweave/fah-gpu ) running on a p3 machine on AWS. However, it requires some set-up on the machine: https://medium.com/@houdinisparks/deploying-gpu-containers-on-aws-ec2-with-docker-machine-c79c9062d90 It also requires using --runtime=nvidia instead of the --gpus argument:

sudo docker run --runtime=nvidia --name fah -p 7396:7396 -v "$(pwd)/fah-data:/root" -d coreweave/fah-gpu:7.5.1

There is some information on how to set this up on AWS with EKS here: https://aws.amazon.com/blogs/compute/running-gpu-accelerated-kubernetes-workloads-on-p3-and-p2-ec2-instances-with-amazon-eks/

fmannhardt avatar Mar 16 '20 10:03 fmannhardt

Thx @fmannhardt. Here are some links on GPUs on Kubernetes and AKS:

  • https://kubernetes.io/docs/tasks/manage-gpus/scheduling-gpus/
  • https://docs.microsoft.com/en-us/azure/aks/gpu-cluster
  • https://github.com/Azure/aks-engine/blob/master/docs/topics/gpu.md

joaocc avatar Mar 16 '20 11:03 joaocc
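
For reference, the GPU scheduling approach described in the Kubernetes link above could look roughly like the following Pod manifest. This is only a minimal sketch, not the chart's actual template: the Pod and container names are hypothetical, and it assumes the NVIDIA device plugin is already installed on the cluster so that nodes advertise the nvidia.com/gpu resource. The image, tag, and port are the ones from the docker run example earlier in this thread.

```yaml
# Hypothetical Pod manifest for illustration; not taken from the chart in this repo.
apiVersion: v1
kind: Pod
metadata:
  name: fah-gpu-example            # hypothetical name
spec:
  restartPolicy: OnFailure
  containers:
    - name: fah                    # hypothetical container name
      image: coreweave/fah-gpu:7.5.1   # image and tag mentioned earlier in the thread
      ports:
        - containerPort: 7396      # port published in the docker run example above
      resources:
        limits:
          nvidia.com/gpu: 1        # assumes the NVIDIA device plugin exposes this resource
```

Requesting the GPU under resources.limits lets the scheduler place the Pod only on nodes that have a free GPU, which plays the role that --runtime=nvidia plays in the plain Docker setup. A chart could expose the limit as a value so that CPU-only clusters can leave it unset.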