
Add Installation Option to Depend Only on ONNX, Excluding New Torch and CUDA Packages

Open bash99 opened this issue 1 year ago • 3 comments

Feature request

An installation option that allows users to install the project with dependencies limited to ONNX only, excluding newer versions of Torch and CUDA, particularly CUDA 12. This option would enable users to run the project in environments with older CUDA versions (e.g., 11.7), pure CPU setups, or other ONNX-compatible platforms like ONNX-DirectML.

Several popular embedding and reranker models already ship ONNX binaries, such as youdao/bce-embedding and jina-embeddings-v2-base-zh.
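As a rough illustration of what such an option could look like (a sketch only; the extras names below are hypothetical, not the project's current package layout):

```bash
# Hypothetical extras split - names are illustrative only.
# ONNX-only install: onnxruntime backend, no torch or CUDA wheels pulled in.
pip install "infinity-emb[onnx]"

# Full install as today: torch plus the current CUDA 12 runtime packages.
pip install "infinity-emb[torch]"
```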

Motivation

Current installation requirements can create barriers for users with legacy hardware or specific configurations. By providing an option to install without the latest Torch and CUDA dependencies, we can enhance accessibility and flexibility, allowing more users to effectively utilize the project in a broader range of environments.

Your contribution

Help test or fix a few simple bugs.

bash99 avatar Aug 09 '24 06:08 bash99

@bash99 I think this issue blocks currently: https://github.com/huggingface/optimum/issues/526

You could make it pytorch+cpu compatible. I am not sure I want to support a legacy CUDA version; I don't think I have the capacity to develop for that and maintain it.
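A minimal sketch of the pytorch+cpu route, assuming the official PyTorch CPU wheel index and the infinity-emb package name:

```bash
# Install CPU-only torch first from the official CPU wheel index,
# then install infinity. If the preinstalled torch satisfies the
# version constraint, pip will not replace it with the default CUDA build.
pip install torch --index-url https://download.pytorch.org/whl/cpu
pip install infinity-emb
```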

michaelfeil avatar Aug 09 '24 17:08 michaelfeil

@michaelfeil I agree that the issue you mentioned is a blocker - however, by removing the dependency on torch you would enable people to choose which framework to use, without forcing us to install torch and its specific dependencies. This is really important from a production standpoint and for the community offering: separating training from inference and keeping deployments small. The extra dependencies cost money, image size, runtime, and startup time.

talw-nym avatar Aug 12 '24 07:08 talw-nym

@talw-nym I hear you, but your point is not actionable here. Optimum requires torch to be installed, and infinity depends on optimum for ONNX inference -> please solve the issue in optimum & I'll pull it in here.
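One way to see the dependency chain described above (assuming both packages are installed in the current environment):

```bash
# Print the declared runtime dependencies of each package.
pip show infinity-emb | grep -i '^requires'
pip show optimum | grep -i '^requires'
```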

michaelfeil avatar Aug 12 '24 17:08 michaelfeil