Mak Sò
Mak Sò
Test coverage is strong but most integration or system tests use mocks for LLM and memory. Add real end-to-end tests that: - Run full example YAML workflows from CLI, exercising...
Currently, scoring routines for input readiness, safety, and prior/historical success in PathScorer and related components are stubbed with fixed values and TODOs. For v1.0.0, replace these with: - Real input...
Current scoring is still float-based aggregates. For some compliance and auditable use cases, users want to know which criteria were discretely passed or failed by a given candidate path. Add:...