goodai-ltm-benchmark
goodai-ltm-benchmark copied to clipboard
A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you need to evaluate your own agents. See more in the blogpost:
**Hi,** First of all, thank you for your amazing work on this project! It has been incredibly helpful and well-structured. I'm encountering an issue when trying to run runner/run_benchmark.py using...
Added task memories - Task memories are parsed from user messages + above context that is relevant to the message. - If a task is deemed to have been specified...