UIT.22520577

Results 2 comments of UIT.22520577

@Zxy-MLlab @karthikv792 Have you fixed the bug yet? I also met the same bug. When evaluating BlocksWorld, the ground truth plan keeps natural names (red/blue/yellow/orange) while the LLM-extracted plan is...

**Command I’m running** ``` python3 plan-bench/llm_plan_pipeline.py \ --task t1 \ --config blocksworld \ --engine qwen3-4B-Instruct-2507 \ --verbose True ``` **Engine wiring (in `llm_utils.py`)** ``` elif engine == "qwen3-4B-Instruct-2507": model_name =...