Max Makarov comments

Results: 284 comments of Max Makarov

[META] Codec improvement and additional support including AV1/H.265, Intel/AMD GPU encoding support

> `nv(cuda)av1enc` is in order for GStreamer 1.26.
>
> https://gitlab.freedesktop.org/gstreamer/gstreamer/-/merge_requests/6754

It's already merged. How can I use AV1 with NVENC today?

Add simple server

How do I make it use all the GPUs in my system? I started it like this:

```
torchrun --nproc_per_node 8 server.py --ckpt_dir /var/llama/65B --tokenizer_path /var/llama/tokenizer.model
```

But it only uses one GPU:

Add simple server

Yes, `example.py` uses all GPUs.

Add simple server

Could you please give an example of an HTTP request?

Add simple server

This request crashes the server:

```bash
curl -X POST http://127.0.0.1:8042/llama/ -H 'Content-Type: application/json' -d '{"prompts":["Hello. How are you?"], "max_gen_len": "256"}'
```

```bash
root@llama:/llama# torchrun --nproc_per_node 8 server.py --ckpt_dir /var/llama/65B --tokenizer_path...
```
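For anyone who prefers to reproduce this from a script, here is a minimal Python sketch of the same request using only the standard library. The URL and JSON body are copied from the curl command; the helper name `build_request` and the constant `LLAMA_URL` are made up for illustration and are not part of the server. The body keeps `max_gen_len` as the string `"256"`, exactly as in the curl call. Passing an integer instead is a one-argument change, though nothing here confirms that this is what triggers the crash.

```python
import json
import urllib.request

# Endpoint taken from the curl command; adjust host and port for your setup.
LLAMA_URL = "http://127.0.0.1:8042/llama/"


def build_request(prompts, max_gen_len, url=LLAMA_URL):
    """Build (but do not send) the POST request from the curl example.

    Hypothetical helper for illustration; not part of server.py.
    """
    payload = {"prompts": list(prompts), "max_gen_len": max_gen_len}
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    # Same body as the curl call, including max_gen_len sent as a string.
    req = build_request(["Hello. How are you?"], "256")
    print(req.get_method(), req.full_url)
    print(req.data.decode("utf-8"))
    # To actually send it (requires the server to be running):
    # with urllib.request.urlopen(req) as resp:
    #     print(resp.read().decode("utf-8"))
```

Building the request separately from sending it makes it easy to inspect the exact bytes that go over the wire before pointing it at a live server.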

[Feature] is any plan to support pytorch 2.0

Any updates?

Speaker Diarization - Extracting speaker embeddings for labels from files with multiple speakers from Cluster Diarizer models

I have the same question.

[CS2] startmovie performance on Linux is poor compared to Windows version of the game

Roadmap / Planned End of Support (October 2025)