What's the lowest cost way for enthusiasts to run this model?

Open Librchain opened this issue 1 year ago • 11 comments

Librchain avatar Mar 18 '24 10:03 Librchain

Pay-as-you-go large GPU server

estlio avatar Mar 18 '24 11:03 estlio

Pay-as-you-go large GPU server

like Colab?

Librchain avatar Mar 18 '24 11:03 Librchain

Pay-as-you-go wouldn't be hosted locally and is costly. You have better options:

  • You might as well use an API if going for a non-local install.
  • Rent GPU hardware from local providers or through specialized online services that offer physical hardware for short-term projects.
  • Shared computing platforms allow you to use shared GPU resources for computing tasks. This can be more affordable than dedicated servers.

KristiyanTs avatar Mar 18 '24 11:03 KristiyanTs

Just buy 2x A100 LMAO

cosmic-zip avatar Mar 18 '24 11:03 cosmic-zip

Please close this issue and move to:

https://github.com/xai-org/grok-1/discussions

Reason: #69 #108

trholding avatar Mar 18 '24 12:03 trholding

To answer this question, we first need an official answer to issue https://github.com/xai-org/grok-1/issues/62.

konard avatar Mar 18 '24 17:03 konard

Just buy 2x A100 LMAO

My nodes have 4 and it's CLEARLY not enough for this thing.

surak avatar Mar 19 '24 17:03 surak

@surak What GPUs did you use, and with how much memory? I'm going to try running it on 8 x 32GB V100 or 4 x 64GB Xilinx VU9p, but I'm wondering if that will even be enough.

webnexus-uk avatar Mar 19 '24 21:03 webnexus-uk

@surak What GPUs did you use, and with how much memory? I'm going to try running it on 8 x 32GB V100 or 4 x 64GB Xilinx VU9p, but I'm wondering if that will even be enough.

Most of my compute nodes have 4x A100 40GB. I also had trouble running it on the Grace Hopper 200 with 480GB, but there the problem was different.

surak avatar Mar 19 '24 21:03 surak

I have got my hands on 8 x A100 80GB GPUs. I'll try it out this evening and let you know if it works.
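
For a rough sense of why the smaller setups in this thread fall short, here is a back-of-the-envelope sketch that compares total GPU memory against the size of the weights alone. It assumes Grok-1's publicly stated 314B parameter count. The bytes-per-parameter values (1 for 8-bit weights, 2 for 16-bit) are assumptions, as is the 80GB size for the "2x A100" suggestion, which did not state one. The names `SETUPS`, `weights_gb`, and `fits` are made up for illustration. Activations, KV cache, and framework overhead are ignored, so real requirements will be higher.

```python
# Back-of-the-envelope check: do the model weights alone fit in total GPU memory?
# Ignores activations, KV cache, and framework overhead, so real needs are higher.

GROK1_PARAMS = 314e9  # Grok-1's publicly stated parameter count

# Setups mentioned in this thread: (number of GPUs, GB per GPU).
# "2x A100" did not state a size; 80 GB is assumed here.
SETUPS = {
    "2x A100 80GB (size assumed)": (2, 80),
    "4x A100 40GB": (4, 40),
    "8x V100 32GB": (8, 32),
    "8x A100 80GB": (8, 80),
}


def weights_gb(params: float, bytes_per_param: float) -> float:
    """Memory needed for the weights alone, in decimal GB."""
    return params * bytes_per_param / 1e9


def fits(num_gpus: int, gb_per_gpu: float, bytes_per_param: float) -> bool:
    """True if the weights alone fit in the combined GPU memory."""
    return num_gpus * gb_per_gpu >= weights_gb(GROK1_PARAMS, bytes_per_param)


if __name__ == "__main__":
    # bytes_per_param is an assumption: 1 for 8-bit weights, 2 for 16-bit.
    for bytes_per_param in (1, 2):
        need = weights_gb(GROK1_PARAMS, bytes_per_param)
        print(f"{bytes_per_param} byte(s)/param -> about {need:.0f} GB of weights")
        for name, (n, gb) in SETUPS.items():
            verdict = "fits" if fits(n, gb, bytes_per_param) else "too small"
            print(f"  {name}: {n * gb} GB total, {verdict}")
```

Under these assumptions, only the 8 x A100 80GB setup holds the weights in both cases, and at 2 bytes per parameter it does so with very little headroom (640 GB total against roughly 628 GB of weights).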

webnexus-uk avatar Mar 22 '24 11:03 webnexus-uk

  • Stable Horde is a free crowdsourced distributed cluster for Stable Diffusion: https://github.com/Haidra-Org/AI-Horde https://grafana.aihorde.net/d/decfb2fc-3165-4625-b8cd-c0e94220d5ad/landing-page?orgId=1
  • https://io.net/

Hamguy21 avatar Mar 25 '24 11:03 Hamguy21