Costa Huang issues

Results: 96 issues of Costa Huang

-- UPDATE 7/7/2024: after chatting with @lewtun, we'd like to see if vLLM is willing to support https://github.com/vllm-project/vllm/issues/6189 officially before merging this PR as it may cause confusion for the...

Add ppov2 sentiment example (as a replacement to imdb example)

The current PPO has a nice IMDB example that teaches the model to output more positive texts. However, we don't have something similar for PPOv2 yet. This PR introduces something...

Prototype Dataset Processor

This PR attempts to refactor and pull **all tokenization logic** out of the Trainer class. Having a separate tokenization process gives us higher visibility into what's being used in training,...

[Feature]: Precise model device placement

### 🚀 The feature, motivation and pitch

Hi all, I was wondering if it's possible to do precise model device placement. For example, I would like to place the vLLM...

feature request

Support flash attention `flash-attn --no-build-isolation` with `uv sync`

I get the following error even though installing flash attention via `uv add flash-attn --no-build-isolation` succeeded. See https://github.com/astral-sh/uv/issues/6402. Is there any way to honor `no-build-isolation` when running `uv sync`? I...

enhancement

projects

`uv` installation location

Would it be possible to install `uv` to a location other than the default one? We are using a shared filesystem which lives in a different place than `/home`.

help wanted

question

configuration

Costa Huang

Online trainer refactor

[DRAFT] Vllm integration

OLMO + RL

Win rate plot experiment stuff

Remove experiment name