Nicolas Patry comments

Results: 978 comments of Nicolas Patry

How to make sure the local tgi server's performance is ok

It doesn't test it per se, since when continuous batching is active many things could be happening at the same time. But every performance number is dominated by the number of...

[Bug]: Severe Errors

Thanks for the fix.

Launcher is not able to run the model even after model is completely downloaded when not connected to internet

This looks like a DNS error, not sure what we can do about it. `--net=host` might help here. If you have the model already locally (which doesn't seem to be...

Launcher is not able to run the model even after model is completely downloaded when not connected to internet

Have you changed the model id to the local folder (within the docker)?

Launcher is not able to run the model even after model is completely downloaded when not connected to internet

Are you sure it's not a network-mounted disk, where killing the network also kills your disk? All the stacktraces you provided suggest the docker is looking for a remote...

Out of memory error while training tokenizer

Hi, what version of tokenizers are you running? The BPE algorithm can be quite memory intensive when the length of the tokens is large, which can be the case in...
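
A rough way to see why long tokens can cost memory, as a minimal sketch and not the `tokenizers` implementation: a BPE trainer typically keeps counts of adjacent symbol pairs, and the number of pair positions it has to track grows with the length of each pre-tokenized word. The helper names `count_pairs` and `pair_positions` are hypothetical, and the link to the actual memory use of the library is an assumption.

```python
from collections import Counter


def count_pairs(words):
    """Count adjacent symbol pairs, starting from single characters."""
    pairs = Counter()
    for word in words:
        symbols = list(word)
        # Every neighbouring pair of symbols is a merge candidate.
        for left, right in zip(symbols, symbols[1:]):
            pairs[(left, right)] += 1
    return pairs


def pair_positions(words):
    """Total number of pair positions a trainer would have to track."""
    return sum(max(len(word) - 1, 0) for word in words)


if __name__ == "__main__":
    short_words = ["the", "cat", "sat"]
    # One very long "word", e.g. text with no whitespace to split on.
    long_words = ["thecatsatonthematandthenlefttheroom"]
    print(pair_positions(short_words), pair_positions(long_words))
```

In this toy model, text that is not split into short words before training produces far more pair positions per word than whitespace-separated text.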

Endpoint interface for inpainting models that require two images.

@pcuenca @mishig25 This seems like a valid use case, wdyt? Any models particularly fit for that? Maybe we should consider some highly diffuser-specific component (akin to...

Endpoint interface for inpainting models that require two images.

> Is the final goal here a richer representation of ControlNet? I'm guessing the final goal is to showcase as best as possible what models can do. There's definitely a...

Endpoint interface for inpainting models that require two images.

> , so it's a 1-to-1 relationship, is that right? Indeed !

[Proposal] Support saving to safetensors

It's really hard to answer in general. It will skip a CPU allocation and create objects directly on the GPU (using the `SAFETENSORS_FAST_GPU=1` environment variable). For CPU the loading part should...
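
As background on why a loader can place tensors without much intermediate work, here is a minimal sketch of the safetensors file layout using only the standard library: an 8-byte little-endian header length, a JSON header giving each tensor's dtype, shape and byte offsets, then the raw data buffer. The function names are hypothetical; this does not use the `safetensors` library and does not touch a GPU.

```python
import json
import os
import struct
import tempfile


def write_tiny_safetensors(path, name, values):
    """Write a single 1-D float32 tensor in the safetensors layout."""
    data = struct.pack(f"<{len(values)}f", *values)
    header = {
        name: {
            "dtype": "F32",
            "shape": [len(values)],
            "data_offsets": [0, len(data)],  # relative to the data buffer
        }
    }
    header_bytes = json.dumps(header, separators=(",", ":")).encode("utf-8")
    with open(path, "wb") as f:
        f.write(struct.pack("<Q", len(header_bytes)))  # u64 header size
        f.write(header_bytes)
        f.write(data)


def read_tensor_bytes(path, name):
    """Read the header, then seek straight to one tensor's bytes."""
    with open(path, "rb") as f:
        (header_len,) = struct.unpack("<Q", f.read(8))
        header = json.loads(f.read(header_len))
        begin, end = header[name]["data_offsets"]
        f.seek(8 + header_len + begin)
        return header[name], f.read(end - begin)


def demo():
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "tiny.safetensors")
        write_tiny_safetensors(path, "weight", [1.0, 2.0, 3.0])
        info, raw = read_tensor_bytes(path, "weight")
        values = list(struct.unpack(f"<{info['shape'][0]}f", raw))
        return info, values
```

Because the header states where each tensor's bytes begin and end, a reader only needs the offsets to find the data for a given tensor; no deserialization step sits between the file and the raw buffer.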