Florian Zimmermeister

Results 67 comments of Florian Zimmermeister

https://github.com/huggingface/transformers/commit/34d94094279d2c903d9d8a51a65edb265f22c849#diff-9ba75cc28be7924a2fc43de1d2c8c7779ad597129d33d1af39153951463cd0bc RoPE scaling got merged into the transformers repo

Most interesting is the dynamic NTK-aware RoPE being added. Maybe adding the dynamic version is an option for TGI too?
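For context, here is a minimal sketch of what the dynamic NTK-aware variant does compared to linear scaling. The base adjustment follows the formula used by the dynamic NTK rotary embedding in transformers; the function names and the example numbers (head dimension 128, 4096 trained positions, factor 2.0) are my own assumptions for illustration, not TGI or transformers API.

```python
from typing import List


def rope_inv_freq(dim: int, base: float = 10000.0) -> List[float]:
    """Inverse frequencies for rotary embeddings: one per pair of dimensions."""
    return [1.0 / (base ** (i / dim)) for i in range(0, dim, 2)]


def dynamic_ntk_base(
    base: float,
    dim: int,
    seq_len: int,
    max_position_embeddings: int,
    scaling_factor: float,
) -> float:
    """Rescale the RoPE base only once the sequence exceeds the trained length.

    Within the trained context the base is left untouched, so short prompts
    behave exactly like the unscaled model.
    """
    if seq_len <= max_position_embeddings:
        return base
    scale = (scaling_factor * seq_len / max_position_embeddings) - (scaling_factor - 1)
    return base * scale ** (dim / (dim - 2))


def linear_scaled_position(position: float, scaling_factor: float) -> float:
    """Linear scaling: compress every position index by a fixed factor."""
    return position / scaling_factor


# Hypothetical setup: head dim 128, trained on 4096 positions, factor 2.0.
short_base = dynamic_ntk_base(10000.0, 128, 2048, 4096, 2.0)  # unchanged
long_base = dynamic_ntk_base(10000.0, 128, 8192, 4096, 2.0)   # enlarged
inv_freq = rope_inv_freq(128, long_base)
```

The difference that matters for serving: linear scaling changes positions for every request, while the dynamic version only kicks in for sequences longer than the trained context, so the inverse frequencies have to be recomputed when the running sequence length grows.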

I tried to test using GPTQ weights. On v1.0 everything is fine; with the latest container:
```
File "/opt/conda/bin/text-generation-server", line 8, in <module>
    sys.exit(app())
File "/opt/conda/lib/python3.9/site-packages/text_generation_server/cli.py", line 78, in serve
    server.serve(...
```

flozi00/Llama-2-7b-german-assistant-v2-4bit-autogptq. The only commit that touched that part of the code after the 1.0 release is #738.

Another issue found: the function is defined here https://github.com/huggingface/text-generation-inference/blob/15fc64668f8d3dd407768286e5a0536aeb78c2e1/server/text_generation_server/utils/layers.py#L384 and used here https://github.com/huggingface/text-generation-inference/blob/15fc64668f8d3dd407768286e5a0536aeb78c2e1/server/text_generation_server/utils/layers.py#L486C24-L486C39, but it is not accessible from the other class, so dynamic scaling does not work and raises a "function not defined" error; linear scaling...
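To illustrate the kind of error described above, here is a small standalone sketch. The class and method names are hypothetical stand-ins chosen for the example and are not copied from the TGI source; the point is only that a helper defined inside one class body cannot be called by its bare name from a method of another class, even a subclass.

```python
class RotaryEmbedding:
    @staticmethod
    def _create_inv_freq(dim, base):
        # Helper lives in the class namespace, not at module level.
        return [1.0 / (base ** (i / dim)) for i in range(0, dim, 2)]


class BrokenDynamicRotaryEmbedding(RotaryEmbedding):
    def update(self, dim, base):
        # Bare name lookup searches local, enclosing-function, module and
        # builtin scopes, but never a class body, so this raises NameError.
        return _create_inv_freq(dim, base)


class FixedDynamicRotaryEmbedding(RotaryEmbedding):
    def update(self, dim, base):
        # Qualifying the call with the class (or self) resolves the helper.
        return RotaryEmbedding._create_inv_freq(dim, base)


def raises_name_error(fn, *args):
    """Return True if calling fn(*args) raises NameError."""
    try:
        fn(*args)
    except NameError:
        return True
    return False


broken = raises_name_error(BrokenDynamicRotaryEmbedding().update, 8, 10000.0)
fixed = FixedDynamicRotaryEmbedding().update(8, 10000.0)
```

This kind of bug only surfaces when the dynamic code path actually runs, which would explain why a configuration that never calls the helper from the second class is unaffected.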

The PR looks very promising; I only have some open questions about integration. At the moment there are built-in tools backed by statically specified functions. What's the workflow to...

@domeccleston 👍 +1

https://github.com/xenova/transformers.js Pipelines in js 🤗