Cheng Yang

41 comments by Cheng Yang

For me the chat is noticeably faster. Command+K is about the same, so maybe the bottleneck is somewhere else. Maybe you can try the 8B models to be sure?

> > @yangcheng Thanks! But how do we pass our API key? https://console.groq.com/docs/api-keys says: > > > API keys are required for accessing the APIs. > > @spikecodes Set the...

> @ilhamprabuzakys I'm happy to report that the AppImage works decently enough for me, especially if you create a shortcut in your shell to launch it from the command line....

I have a similar error, running on Ubuntu 22.04 ``` (base) ubuntu:~/llm.c$ nvidia-smi Fri Jun 7 06:23:37 2024 +-----------------------------------------------------------------------------------------+ | NVIDIA-SMI 555.42.02 Driver Version: 555.42.02 CUDA Version: 12.5 | |-----------------------------------------+------------------------+----------------------+...

> Fixed by upgrading cuDNN version, previously was on 8.9.2 which broke with above error after upgrade to cuDNN 9.2.0 from 9.1.1, I got new error, which version are you...

Hi, I have the same issue: the response was 402 Payment Required ``` 2022-10-10 02:24:10.372 10848-10891/com.stripe.android.samplestore I/okhttp.OkHttpClient:

OK, I figured out why: the server was out of date. I switched to the version on https://glitch.com/edit/#!/stripe-example-mobile-backend and it works now.

Good to know. I was about to use the encoder for streaming IO as mentioned in https://github.com/bytedance/sonic#streaming-io , with no specific configuration, just writing the HTTP response. Is it recommended to simply...

Happy to be an early tester!