gladysh

Results 4 issues of gladysh

### What We may need better telemetry on how many input/output/total tokens went into `shortest`. Fortunately, Anthropic API [returns](https://docs.anthropic.com/en/docs/build-with-claude/tool-use#example-simple-tool-definition:~:text=When%20you%20send%20a%20tool%20use%20prompt%2C%20just%20like%20any%20other%20API%20request%2C%20the%20response%20will%20output%20both%20input%20and%20output%20token%20counts%20as%20part%20of%20the%20reported%20usage%20metrics.) token usage for each request. ### Why - Help people better...

### What Speed up test execution by finding ways to addressing issues outlined below. ### Why While working on #179, I have found that screenshooting perhaps takes the most time...

**Issue #124** This PR introduces basic caching mechanism to reduce costs and increase effectiveness of running test suites. **Performance Boost:** Achieves an average speedup of 400%-600%, automations like "Find Lionel...

### What Create a script similar to `nodemon` that would listen to changes and rebuild the project for us on the fly. ### Why Now, in order to test newly...