# Description Currently we don’t support any runtime specific to transformer models. DeepSpeed has implemented a runtime we could use to accelerate transformer models at inference time. # Integration...
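As a rough illustration of the integration, the sketch below wraps a Hugging Face model with DeepSpeed's inference engine via `deepspeed.init_inference`; the model name and the `mp_size`/dtype settings are placeholders, not a tested configuration.

```python
# Sketch: wrapping a Hugging Face transformer with the DeepSpeed
# inference engine (illustrative values, not a tested configuration).
import torch
import deepspeed
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder model for illustration
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

# init_inference swaps supported layers for fused DeepSpeed kernels.
ds_engine = deepspeed.init_inference(
    model,
    mp_size=1,                       # tensor-parallel degree
    dtype=torch.half,                # run the kernels in fp16
    replace_with_kernel_inject=True, # inject optimized transformer kernels
)

inputs = tokenizer("Hello, world", return_tensors="pt").to("cuda")
outputs = ds_engine.module.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```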
# Description FasterTransformer is a library developed by Nvidia specifically for accelerating transformer architectures on Nvidia devices. We should test its performance and implement a conversion framework for converting TF,...
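Since the first step is measuring performance, a minimal latency harness along these lines could be used to compare a stock model against its FasterTransformer counterpart; the helper below is our own sketch, not part of nebullvm or FasterTransformer.

```python
# Sketch: a simple GPU latency benchmark for comparing a baseline model
# against an accelerated variant on identical inputs.
import time
import torch

def benchmark(model, inputs, warmup: int = 5, iters: int = 50) -> float:
    """Return the mean forward-pass latency in milliseconds."""
    with torch.no_grad():
        for _ in range(warmup):      # warm up kernels and caches
            model(**inputs)
        torch.cuda.synchronize()
        start = time.perf_counter()
        for _ in range(iters):
            model(**inputs)
        torch.cuda.synchronize()     # wait for all queued GPU work
    return (time.perf_counter() - start) / iters * 1e3
```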
# Description Currently nebullvm does not support XLA, TensorFlow's built-in compiler, which also allows models to be compiled for Google TPUs. XLA is available for JAX, TF and PyTorch....
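For reference, enabling XLA on a TF model is a one-flag change through `tf.function(jit_compile=True)`; the model and input shapes below are placeholders.

```python
# Sketch: compiling a TensorFlow call with XLA via jit_compile.
import tensorflow as tf

model = tf.keras.applications.ResNet50(weights=None)  # placeholder model

@tf.function(jit_compile=True)  # ask TF to compile this call with XLA
def predict(x):
    return model(x, training=False)

x = tf.random.normal((1, 224, 224, 3))
print(predict(x).shape)  # the first call triggers XLA compilation
```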
# Description OpenAssistant has released on HF the reward models they trained on open-source datasets. Even if they are not tailored to the user's needs, we could leverage them...
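A minimal sketch of scoring a question/answer pair with one of those reward models through plain `transformers`; the model id is assumed from the Hub and worth verifying.

```python
# Sketch: scoring an answer with an OpenAssistant reward model
# (the Hub model id is an assumption, double-check it exists).
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

model_id = "OpenAssistant/reward-model-deberta-v3-large-v2"  # assumed id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSequenceClassification.from_pretrained(model_id)

question = "Explain nuclear fusion in one sentence."
answer = "Fusion merges light nuclei into heavier ones, releasing energy."
inputs = tokenizer(question, answer, return_tensors="pt")
with torch.no_grad():
    score = model(**inputs).logits[0].item()  # higher = preferred answer
print(score)
```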
# Description DeepSpeed supports offloading during training using the ZeRO-Infinity technology. We should add examples of working configuration files for the models we support. # TODO - [ ] Add...
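As a starting point, a ZeRO-Infinity (stage 3 with NVMe offload) configuration could look like the sketch below; the batch size, precision and NVMe paths are placeholders that would need tuning per model and per machine.

```python
# Sketch: writing a minimal ZeRO-Infinity DeepSpeed config file.
import json

ds_config = {
    "train_batch_size": 8,            # placeholder, tune per model
    "fp16": {"enabled": True},
    "zero_optimization": {
        "stage": 3,  # ZeRO-3 partitions params, grads and optimizer state
        "offload_optimizer": {"device": "nvme", "nvme_path": "/local_nvme"},
        "offload_param": {"device": "nvme", "nvme_path": "/local_nvme"},
    },
}

with open("ds_config.json", "w") as f:
    json.dump(ds_config, f, indent=2)
```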
# Description Currently we are supporting the following datasets: - [Stanford Human Preferences Dataset (SHP)](https://huggingface.co/datasets/stanfordnlp/SHP) - [Anthropic RLHF](https://huggingface.co/datasets/Anthropic/hh-rlhf) But we are not using all the information contained in the datasets:...
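Both datasets load directly with the `datasets` library, as in the sketch below; fields beyond the ones noted in the comments are worth double-checking against the dataset cards.

```python
# Sketch: loading the two supported preference datasets.
from datasets import load_dataset

shp = load_dataset("stanfordnlp/SHP", split="train")
hh = load_dataset("Anthropic/hh-rlhf", split="train")

# SHP carries community vote scores per answer; hh-rlhf pairs a chosen
# and a rejected continuation for the same prompt.
print(shp[0].keys())
print(hh[0].keys())
```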
# Description ChatLLaMA currently has neither a playground nor scripts that allow the user to easily run the model for inference. It would be great to add...
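A minimal REPL-style inference script could look like the sketch below; it goes through plain Hugging Face APIs and a placeholder checkpoint path, since chatllama's own loading utilities may differ.

```python
# Sketch: a minimal interactive inference loop (placeholder checkpoint).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

checkpoint = "path/to/trained_actor"  # placeholder checkpoint path
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(checkpoint, torch_dtype=torch.half)
model.eval()

while True:
    prompt = input("user> ")
    inputs = tokenizer(prompt, return_tensors="pt")
    with torch.no_grad():
        out = model.generate(**inputs, max_new_tokens=128, do_sample=True)
    print("model>", tokenizer.decode(out[0], skip_special_tokens=True))
```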
# Description Currently the ChatLLaMA documentation consists of just the README on GitHub. We should align chatllama with the other modules in nebullvm and add the `mkdocs` documentation to `docs.nebuly.com`. All...
# Description Currently, chatllama supports synthetic data generation only from OpenAI’s `davinci-003`, both for conversations and for scores. In order to avoid huge costs while generating data we should...
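One way to open this up is to abstract the completion backend behind a callable, so a cheaper API model or a local open-source model can be swapped in; the helper names below are our own sketch, and the OpenAI call uses the legacy Completion API current at the time of writing.

```python
# Sketch: making the synthetic-data generator backend-agnostic.
import openai  # legacy Completion API, current at the time of writing

def openai_generate(prompt: str, model: str = "text-davinci-003") -> str:
    resp = openai.Completion.create(model=model, prompt=prompt, max_tokens=256)
    return resp["choices"][0]["text"]

def synthesize(prompts, generate_fn=openai_generate):
    """Generate synthetic examples with any backend that maps a prompt
    string to a completion string."""
    return [generate_fn(p) for p in prompts]

# e.g. pass a local Hugging Face text-generation pipeline as generate_fn
# to avoid per-token API costs entirely.
```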
# Description One of the biggest difficulties when selecting and cleaning data for training is estimating the correct amount of data needed to train the model. ChatLLaMA training...