ART issues

Add a 10-second timeout to results_queue.join() to prevent indefinite hangs when lingering results aren't properly consumed. If a timeout occurs, drain any remaining items from the queue to allow training...

benediktstroebl

Always Block when train

3

[train.py] _calculate_logprobs - Processing chunk 2168 to 2176 [train.py] _calculate_logprobs - Processing chunk 2176 to 2184 [train.py] _calculate_logprobs - Processing chunk 2184 to 2192 [train.py] _calculate_logprobs - Processing chunk 2192...

johnson7788

Add experiment tracking via `trackio`

Hi folks! Would you be interested in adding experiment tracking support using `trackio`, a lightweight, free experiment tracking library from Hugging Face? Trackio docs: https://huggingface.co/docs/trackio/en/index It uses the same syntax...

abidlabs

Cannot train the tau-bench datasets on qwen3 model

10

I'm training the qwen3 model using the tau-bench datasets follow to the example source code in dev folder. Unfortunatedly, i've got the error as belowed: ``` import os from dotenv...

aongwachi1

Why Do Results from MCP-Trained Models Differ Greatly Between generate_benchmarks.py and train.py

Why does a model trained via MCP show a significant discrepancy in results when tested using generate_benchmarks.py, compared to the outcomes from train.py? A preliminary investigation indicates that the root...

wm19999

RuntimeError: torch.cuda.MemPool doesn't currently support expandable_segments during vLLM model initialization

3

Description ## Summary When training a LangGraph agent with `openpipe-art[backend,langgraph]`, the process fails at model initialization with the following error: RuntimeError: torch.cuda.MemPool doesn't currently support expandable_segments. The error occurs inside...

abhinav262666

AttributeError: 'LLM' object has no attribute 'engine_step' with Unsloth and vLLM

1

**Description** I am experiencing an AttributeError when attempting to run a script that uses Unsloth with vLLM. The traceback indicates that an LLM object from vLLM is being accessed for...

Gjw2333

uv venv error initializing art backend on skypilot/aws

4

I imagine this boils down to some change in a dependency - but raising here in case others encounter. I have not been able to identify a fix on my...

ecatkins

feat: Add MoE support

bradhilton

ART
ART copied to clipboard

Metadata

feat: Add retries to client.py

Fix deadlock in results_queue.join() during training

Always Block when train

Add experiment tracking via `trackio`

Cannot train the tau-bench datasets on qwen3 model

Why Do Results from MCP-Trained Models Differ Greatly Between generate_benchmarks.py and train.py

RuntimeError: torch.cuda.MemPool doesn't currently support expandable_segments during vLLM model initialization

AttributeError: 'LLM' object has no attribute 'engine_step' with Unsloth and vLLM

uv venv error initializing art backend on skypilot/aws

feat: Add MoE support

← Metadata

Owner

Metadata

ART ART copied to clipboard

Metadata

← Metadata

Owner

Metadata

ART
ART copied to clipboard