Andrew Lapp comments

Results: 203 comments of Andrew Lapp

Implement GQA: Reduce Wallclock by 6% (WIP per YouJiacheng's review)

I see a 2% reduction in runtime with the configuration below, but it complicates the model quite a bit and GQA has absurd compile times. I'll think about this some...

Numpy > 1.26.4

@Mikeriess I don't think it would be a substantial effort. We pinned to numpy 1 due to:
- https://github.com/outlines-dev/outlines/issues/976
- https://github.com/outlines-dev/outlines/pull/977

The imports would need to be fixed. Unless there's...

Numpy > 1.26.4

>Got it. Have had some compatibility issues with outlines and the newest llama.cpp version due to this, so if this is a trivial change that is great news 👍 Is...

Tensorboard Not Displaying

It's working now! Thanks for fixing quickly @Wauplin, great job!

Tensorboard Not Displaying

Hi, I'm seeing this issue again @XciD @severo It seems that:
- Tensorboards aren't being populated with updates when provided new logs, as of ~48 hours ago. [Example](https://huggingface.co/distily/distily_validate_extra_grad_stats/tensorboard)
- ~~**all...

Tensorboard Not Displaying

Appears to be resolved, thanks!

Tensorboard Not Displaying

Re-occurring.

Tensorboard Not Displaying

Here's the hacky script I'm using to render a hub tensorboard locally. `python3 run.py distily/distily_dataset_sweep` It retrieves all tfevent files from a repo, puts them in a temporary directory, and...
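The original script is not shown in this excerpt, so the following is only a rough sketch of the approach the comment describes: list a repo's files, keep the tfevent files, and download them into a temporary directory. The function names `select_tfevent_files`, `download_tfevents`, and `main` are hypothetical. The comment is cut off before its last step, so launching `tensorboard --logdir` on the directory is an assumption here, not the author's confirmed behaviour. It also assumes `huggingface_hub` and `tensorboard` are installed and that the repo is a model repo.

```python
import subprocess
import sys
import tempfile
from pathlib import Path


def select_tfevent_files(repo_files):
    """Keep only paths whose file name looks like a TensorBoard event file."""
    return [f for f in repo_files if "tfevents" in Path(f).name]


def download_tfevents(repo_id, target_dir):
    """Download every tfevents file of a Hub repo into target_dir."""
    # Imported lazily so select_tfevent_files works without huggingface_hub.
    from huggingface_hub import hf_hub_download, list_repo_files

    downloaded = []
    for filename in select_tfevent_files(list_repo_files(repo_id)):
        # local_dir keeps the repo's sub-directory layout, so runs stay separate.
        downloaded.append(hf_hub_download(repo_id, filename, local_dir=target_dir))
    return downloaded


def main(repo_id):
    with tempfile.TemporaryDirectory() as tmp:
        files = download_tfevents(repo_id, tmp)
        print(f"Downloaded {len(files)} event files to {tmp}")
        # Assumption: the truncated final step serves the directory locally.
        subprocess.run(["tensorboard", "--logdir", tmp], check=False)


if __name__ == "__main__":
    if len(sys.argv) == 2:
        main(sys.argv[1])  # e.g. distily/distily_dataset_sweep
    else:
        print("usage: python3 run.py <repo_id>")
```

Saved as `run.py`, this would be invoked the same way as in the comment: `python3 run.py distily/distily_dataset_sweep`. The temporary directory is deleted once TensorBoard exits.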

Tensorboard Not Displaying

The error has been re-occurring since some time in the past few days.

Seems like outlines is somehow changing the distribution (+relative ranking) of output tokens?

>I can't unfortunately share the model weights or lora adapter, but am happy to help replicate this on my end and share any specific diagnostics! Are you able to share...