Alpaca 7B and 13B on Linux

Open RIAZAHAMMED opened this issue 1 year ago • 12 comments

I have a Linux server and I installed Dalai on it. I made many modifications to the appearance and tried it out. Even with 70GB of RAM it was very slow, though I cannot say it is not working. See the attached image and you will see what is wrong with it. It was a simple question: when will summer start? The response was very funny... at least good for a laugh after 3 hours of processing.

Screenshot 2023-03-25 at 07-30-17 Ashan's GPT

It may need a GPU with better processing power, or I may have to install command-line-only server software. I am thinking through all the options. This will give you an idea about the computer:

Screenshot from 2023-03-25 08-30-30

Any suggestions will be appreciated.

RIAZAHAMMED avatar Mar 25 '23 13:03 RIAZAHAMMED

3 hours?

With an i9 9900 (16-thread CPU), the 7B model would process that amount of text in 1-2 minutes. I also tried 30B, and that amount of text would take around 15 minutes at most.

I have no idea why it is so slow for you.

mirek190 avatar Mar 25 '23 22:03 mirek190

> 3 hours?
>
> With an i9 9900 (16-thread CPU), the 7B model would process that amount of text in 1-2 minutes. I also tried 30B, and that amount of text would take around 15 minutes at most.
>
> I have no idea why it is so slow for you.

I think the graphics are the problem. I have Zorin, so I may have to run a command-line-only server to avoid the graphics altogether.

RIAZAHAMMED avatar Mar 26 '23 01:03 RIAZAHAMMED

But dalai uses the CPU, not the GPU.

mirek190 avatar Mar 26 '23 09:03 mirek190

> But dalai uses the CPU, not the GPU.

That's what I thought too. I used a GPU machine and it runs faster on it. I removed all the dims and reinstalled it and it got a bit better. I am not an expert with servers or hardware stuff.

See what it did for me in 12 minutes:

Screenshot 2023-03-27 at 04-41-31 Ashan's GPT

Not bad, is it? It is not Shakespeare, but it can do stuff. Hehehe.

RIAZAHAMMED avatar Mar 27 '23 09:03 RIAZAHAMMED

That's still extremely slow for a 24-thread CPU. That amount of text with the 7B model should be produced within 20 seconds, and my CPU has only 16 threads. Maybe this Xeon is very slow compared to the i9 9900.

mirek190 avatar Mar 27 '23 11:03 mirek190

> That's still extremely slow for a 24-thread CPU. That amount of text with the 7B model should be produced within 20 seconds, and my CPU has only 16 threads. Maybe this Xeon is very slow compared to the i9 9900.

This thing is old. It was sitting in my storage, so I pulled it out and started it up to see how it goes. Yeah, you are right, I probably need to try a different server.

RIAZAHAMMED avatar Mar 27 '23 13:03 RIAZAHAMMED

This is closely related to the issue I reported in #323. It is the exact problem I am having when using alpaca.7B, and it also happens with the larger models. In that state, the system is pretty useless.

VanHallein avatar Mar 29 '23 14:03 VanHallein

> > That's still extremely slow for a 24-thread CPU. That amount of text with the 7B model should be produced within 20 seconds, and my CPU has only 16 threads. Maybe this Xeon is very slow compared to the i9 9900.
>
> This thing is old. It was sitting in my storage, so I pulled it out and started it up to see how it goes. Yeah, you are right, I probably need to try a different server.

Did you find a solution?

RiccaDS avatar Mar 30 '23 11:03 RiccaDS

Hi, how can I get LLaMA 65B (https://huggingface.co/boboto/LLaMA-65B-HF/tree/main) or the torrent? I have been searching for a while so that I can install and try it. I found the weights, but I am missing the installation files. If anyone can help me I would appreciate it, even by private message. [email protected] .

ADANCE-STACK avatar Apr 04 '23 09:04 ADANCE-STACK

> > > That's still extremely slow for a 24-thread CPU. That amount of text with the 7B model should be produced within 20 seconds, and my CPU has only 16 threads. Maybe this Xeon is very slow compared to the i9 9900.
> >
> > This thing is old. It was sitting in my storage, so I pulled it out and started it up to see how it goes. Yeah, you are right, I probably need to try a different server.
>
> Did you find a solution?

I am out of town through the end of the month, so I am sorry, I cannot work on this now. Once I am back in town I will get back to this. I am going to do a full performance tuning of the server and try again.

RIAZAHAMMED avatar Apr 04 '23 11:04 RIAZAHAMMED

> Hi, how can I get LLaMA 65B (https://huggingface.co/boboto/LLaMA-65B-HF/tree/main) or the torrent? I have been searching for a while so that I can install and try it. I found the weights, but I am missing the installation files. If anyone can help me I would appreciate it, even by private message. [email protected] .

As I am not familiar with your language, I tried translating this. From my understanding, I think you need to follow the instructions for getting dalai installed and then install the models you want.

RIAZAHAMMED avatar Apr 04 '23 11:04 RIAZAHAMMED

I think Dalai doesn't like Xeons. I have the exact same problem: a decent build with plenty of RAM and an 8-core Xeon, nothing fancy but with more power than the average laptop or desktop, and still the output takes forever, even for the simplest questions.

titolindj avatar Apr 05 '23 17:04 titolindj
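
One check nobody in the thread reports running is whether the older Xeons advertise the SIMD extensions that CPU inference code tends to rely on. The sketch below is only a possible diagnostic, not a confirmed cause. It assumes a Linux x86 machine with `/proc/cpuinfo`, and the list of flags (`avx`, `avx2`, `fma`, `f16c`) is an assumption about what matters for the backend, not something stated above. The helper names `parse_cpu_flags` and `missing_simd` are hypothetical.

```python
import os

# Assumed list of SIMD extensions worth checking; not taken from the thread.
SIMD_FLAGS = ("avx", "avx2", "fma", "f16c")


def parse_cpu_flags(cpuinfo_text):
    """Return the feature flags of the first CPU listed in /proc/cpuinfo-style text."""
    for line in cpuinfo_text.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip() == "flags":
            return set(value.split())
    return set()  # no "flags" line found (e.g. non-x86 or empty input)


def missing_simd(flags, wanted=SIMD_FLAGS):
    """Return the wanted SIMD flags that the CPU does not advertise, in order."""
    return [name for name in wanted if name not in flags]


if __name__ == "__main__":
    print("logical CPUs:", os.cpu_count())
    try:
        with open("/proc/cpuinfo") as f:
            flags = parse_cpu_flags(f.read())
    except OSError:
        flags = set()  # file not available; every flag will be reported missing
    print("missing SIMD flags:", missing_simd(flags) or "none")
```

If the script lists `avx2` as missing on the slow machine but not on a fast one, that would be one hardware difference to rule in or out before tuning anything else.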