Simon M.

Results: 6 comments by Simon M.

I tried every CMake flag I could find to get it to use the 64-bit compiler, but VS always uses MSBuild x86 :/ When I try this command >...

It still (again) does not work for me, because Grouped Query Attention is not implemented:
`tgi-scripts-text-generation-inference-awq-1 | File "/opt/conda/lib/python3.9/asyncio/base_events.py", line 601, in run_f
tgi-scripts-text-generation-inference-awq-1 | self._run_once()
tgi-scripts-text-generation-inference-awq-1 | File "/opt/conda/lib/python3.9/asyncio/base_events.py",...

I tried using Continue with a small team, and it was horrendous to set up, especially since we had to switch models a few times. A way to automatically set...

I will message you on Discord when I find the time. My choice of words was a bit harsh tbh 😅 Maybe I was a bit ill-prepared.

Did you find any solution to this? It's really unintuitive.

We would like to collect this information as well, to judge acceptance vs. GitHub Copilot (personally, ofc I prefer the local Continue approach). Not sure if your "product" is...