Thiago Padilha
Thiago Padilha
@ggerganov These are the changes I did to `main`: - Moved main I/O loop out of the `main()` function into a `run()` function which can be reused - In the...
> The run() abstraction is not necessary here. The run abstraction is necessary if we want to share the main loop with the tcp server, it is not practical for...
@tkafka I'm still maintaining these changes in my fork, and will keep rebasing for the foreseeable future (might even set up some script to do this semi-automatically in a daily...
@mqy since I started this PR, the files have been restructured multiple times. I will just keep updating the main example to support tcp until there's a better native solution...
Try out the `tcp_server` branch, more details of how it works in #278
> @tarruda This is great, is it going to be merged? I don' t think so, check the discussion in #278 for more context.
@vonjackustc can you change the target branch to `tcp_server`?
@vonjackustc I missed these new extra `printf` statements in one of the recent rebases, just integrated your changes to the `tcp_server` branch, thanks for catching it.
> You can change LLAMA_N_PARTS from { 5120, 2 } to { 5120, 1 } to support quantized alpaca-13b-q4.bin here: https://github.com/antimatter15/alpaca.cpp#getting-started-13b But it would lose compatibility with original llama. Maybe...
@frederik-h thank you, also working on Acer Predator Helios 300 (2021 model)