Torsten Scholak

Results 113 comments of Torsten Scholak

@jlamypoirier if this is truly due to missing dependencies then why aren't these specified as such in setup.cfg?

@bigximik can you try https://github.com/NVIDIA/TransformerEngine?tab=readme-ov-file#pip please?

Hey @jlamypoirier, Let's move fast on this. The immediate need is a set of practical examples that show how different dataset configurations achieve useful, real-world outcomes. No deep dives, no...

https://github.com/EleutherAI/lm-evaluation-harness/blob/773dcd7f8fe95ae7ae73c687dec0a4dc1a6174b9/lm_eval/api/model.py#L315 https://github.com/EleutherAI/lm-evaluation-harness/blob/773dcd7f8fe95ae7ae73c687dec0a4dc1a6174b9/lm_eval/models/huggingface.py#L1263

https://github.com/huggingface/transformers/blob/24e311f42b54f5f5fab6efcaa0c82eebd5608ba3/src/transformers/generation/utils.py#L345 https://github.com/huggingface/transformers/blob/24e311f42b54f5f5fab6efcaa0c82eebd5608ba3/src/transformers/generation/utils.py#L2018C5-L2032C50

https://github.com/ServiceNow/Fast-LLM/blob/21182c2d152729d3ed8a6d53024acdf76d468d2f/fast_llm/models/gpt/huggingface.py#L24C18-L41C30

https://huggingface.co/HuggingFaceTB/SmolLM2-135M-Instruct

for those who follow me here: https://github.com/input-output-hk/haskell.nix/blob/aac430c326307a1b93a0d29b33a6249fd1798cb1/test/cabal-simple-prof/default.nix#L7-L16 does the trick

thanks for adding TorchTitan to the list, @jlamypoirier!