UIT.22520577
Results
2
comments of
UIT.22520577
@Zxy-MLlab @karthikv792 Have you fixed the bug yet? I also met the same bug. When evaluating BlocksWorld, the ground truth plan keeps natural names (red/blue/yellow/orange) while the LLM-extracted plan is...
**Command I’m running** ``` python3 plan-bench/llm_plan_pipeline.py \ --task t1 \ --config blocksworld \ --engine qwen3-4B-Instruct-2507 \ --verbose True ``` **Engine wiring (in `llm_utils.py`)** ``` elif engine == "qwen3-4B-Instruct-2507": model_name =...