Nicolas Patry comments

Results: 978 comments of Nicolas Patry

AMD hardware support for training and Inference

Not in the very near future. However, we could build bindings around https://www.amd.com/en/graphics/servers-solutions-rocm first in order to start working. I'm not too familiar with the workflow, but I do know there...

Metal Backend not properly loading large models at 16GB of RAM

@bayedieng could you try out https://github.com/huggingface/candle/pull/1523, maybe? I think it's possible for Metal to be slower if there's not enough memory available to run the model; otherwise it...

Metal Backend not properly loading large models at 16GB of RAM

> because both phi 2 and stable diffusion turbo ran faster on metal than accelerate in the past.

The SIMD line is this single one: https://github.com/huggingface/candle/blob/main/candle-metal-kernels/src/lib.rs#L1277 Otherwise it's because of...

model loading inference API

Multiple things are at play that are currently known about:

- Model loading is not really using correct information. `api-inference` doesn't know how to "guess" the model size properly, so...

Have collections of requests

Or the OpenAPI spec. Since community doesn't have too many parameters and has regular output types, it could be the easiest (the transformers API was a nightmare to maintain exhaustively...

Allow choice of similarity function for Sentence Similarity task

Isn't there a setting or a config that could be used within `sentence-transformers` to know which similarity to use? It feels like this shouldn't be passed by users as...

Allow choice of similarity function for Sentence Similarity task

I also remember there was such a configuration, but while writing this pipeline, we were told that cos_sim had basically won. (FYI)
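To illustrate why the choice of similarity function matters at all, here is a minimal sketch in plain Python comparing cosine similarity with a raw dot product. The vectors and the helper names `dot` and `cos_sim` are made up for the example; this is not the `sentence-transformers` implementation.

```python
import math


def dot(a, b):
    """Plain dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def cos_sim(a, b):
    """Cosine similarity: the dot product of the length-normalized vectors."""
    return dot(a, b) / (math.sqrt(dot(a, a)) * math.sqrt(dot(b, b)))


# Hypothetical 2-d "embeddings", chosen only to make the arithmetic easy.
query = [1.0, 0.0]
aligned_doc = [2.0, 0.0]   # same direction as the query, small norm
long_doc = [3.0, 4.0]      # different direction, larger norm

# Cosine similarity prefers the aligned document...
cos_ranking = sorted(
    ["aligned_doc", "long_doc"],
    key=lambda name: cos_sim(query, {"aligned_doc": aligned_doc, "long_doc": long_doc}[name]),
    reverse=True,
)

# ...while the dot product is swayed by vector length and prefers the long one.
dot_ranking = sorted(
    ["aligned_doc", "long_doc"],
    key=lambda name: dot(query, {"aligned_doc": aligned_doc, "long_doc": long_doc}[name]),
    reverse=True,
)
```

With these toy vectors the two functions rank the documents in opposite order, which is why a model's intended similarity would ideally come from its configuration rather than from each user.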

Allow choice of similarity function for Sentence Similarity task

I don't have a super strong opinion against making it a parameter, but I have multiple indicators that make me feel like it's not correct.

- The widgets don't send parameters...

Insert and remove from sys path in generic pipelines

IMO `generic` is not meant to be used very much. If it's tedious to use, that's OK. AFAIK, `generic` is just meant to be used as a demo for...
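For reference, the "insert and remove from sys path" pattern named in the issue title can be sketched as a small context manager. The name `temporary_sys_path` and the example path are hypothetical; this is an assumption about what such a helper could look like, not the code used in the `generic` pipelines.

```python
import sys
from contextlib import contextmanager


@contextmanager
def temporary_sys_path(path):
    """Put `path` at the front of sys.path for the duration of the block."""
    sys.path.insert(0, path)
    try:
        yield
    finally:
        # Remove the entry again, even if the import inside the block failed.
        try:
            sys.path.remove(path)
        except ValueError:
            # Something inside the block already removed it; nothing to do.
            pass


# Hypothetical usage: import user code from a downloaded repository.
snapshot = list(sys.path)
with temporary_sys_path("/tmp/hypothetical_model_repo"):
    inside_first_entry = sys.path[0]
restored = sys.path == snapshot
```

Cleaning up in `finally` keeps one model's directory from leaking into the import path used for the next one.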

Return max_input_length if sequence is too long (NLP tasks)

No, there's not; truncating is the only simple option. The reason is that the API for non-TGI models uses the pipeline, which raises an exception that would need to...
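As a rough illustration of the truncation option mentioned above, here is a sketch that cuts a list of token ids to a maximum length and reports whether anything was dropped. The function `truncate_tokens` and the value `max_input_length = 4` are hypothetical; the real pipelines handle truncation through their tokenizer, not through this helper.

```python
def truncate_tokens(token_ids, max_input_length):
    """Return (kept_ids, was_truncated) for a list of token ids."""
    if max_input_length < 0:
        raise ValueError("max_input_length must be non-negative")
    was_truncated = len(token_ids) > max_input_length
    return token_ids[:max_input_length], was_truncated


# Hypothetical token ids; a real tokenizer would produce these.
short_ids, short_flag = truncate_tokens([101, 7592, 102], max_input_length=4)
long_ids, long_flag = truncate_tokens([101, 7592, 2088, 2003, 2307, 102], max_input_length=4)
```

Returning the flag alongside the ids is one way a caller could learn that the input was too long without relying on an exception.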