Boostrix

459 comments by Boostrix

We also keep seeing the LLM not replacing arguments or hallucinating imaginary command names (#1894) - so, it would probably make sense to check if the command exists and if...
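A minimal sketch of what such a check could look like, assuming a plain dictionary as the command registry. The names `KNOWN_COMMANDS` and `validate_command`, and the two example commands, are hypothetical and for illustration only; they are not Auto-GPT's actual API.

```python
from difflib import get_close_matches

# Hypothetical registry: command name -> required argument names.
KNOWN_COMMANDS = {
    "read_file": {"filename"},
    "write_to_file": {"filename", "text"},
}


def validate_command(name, args):
    """Return a list of problems; an empty list means the call looks valid."""
    if name not in KNOWN_COMMANDS:
        # Suggest the closest registered name, if there is one.
        hint = get_close_matches(name, list(KNOWN_COMMANDS), n=1)
        suffix = f" (did you mean '{hint[0]}'?)" if hint else ""
        return [f"unknown command '{name}'{suffix}"]

    problems = []
    missing = KNOWN_COMMANDS[name] - set(args)
    if missing:
        problems.append(f"missing arguments: {sorted(missing)}")

    # Flag template placeholders the LLM left unreplaced, e.g. "<filename>".
    for key, value in args.items():
        if isinstance(value, str) and value.startswith("<") and value.endswith(">"):
            problems.append(f"argument '{key}' still holds placeholder {value}")
    return problems
```

Returning a list of problems rather than raising would let the caller feed the messages back to the LLM so it can retry with a corrected command.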

> - thanks for the information on both comments, I really like the command validator. I'm trying to keep the PRs very small and atomic, so I'm happy to add...

> Really appreciate the collation of the issues though, that's super helpful. Right, I do realize that those related RFEs are out of scope for something as basic as an...

all commands would ideally come with their own pytest-based test, so that people could easily check whether their commands are working for diagnostic purposes - also, even at runtime...
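As a rough illustration of a per-command test, here is a sketch that pairs a command with a pytest-style test function. The `write_to_file` function below is a hypothetical stand-in, not the project's real command; `tmp_path` is pytest's built-in temporary-directory fixture.

```python
from pathlib import Path


def write_to_file(filename: str, text: str) -> str:
    """Hypothetical stand-in for a real command."""
    Path(filename).write_text(text, encoding="utf-8")
    return "File written successfully."


def test_write_to_file_roundtrip(tmp_path):
    # tmp_path is supplied by pytest as a fresh pathlib.Path per test.
    target = tmp_path / "note.txt"
    result = write_to_file(str(target), "hello")
    assert result == "File written successfully."
    assert target.read_text(encoding="utf-8") == "hello"
```

Saved in a file named along the lines of `test_write_to_file.py`, running `pytest` in that directory would collect and run the test.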

> Added command result data class. This will change a little bit on what to do with the result, however no change to any existing commands. FWIW, I have locally...

> we want to create a battery of failing tests that showcase some of Auto-GPT's inconsistent behaviors or bugs. To get this started, one of the lowest-hanging fruits is probably...

the mount path issue was mentioned recently, and it would be a "non-code" fix, and if it works, would solve the problem (of not being able to run) for probably...

while that's a trivial change in theory, it's opening up a whole new can of worms. You will almost certainly want to make this optional by introducing a corresponding new...

I believe someone recently pointed out that there might be a confusing conceptual problem due to the disconnect between what the Python sandbox is and what other shell commands get to...

@valayDave summed up the issue pretty well when he pointed out how the agent gets to see very little surrounding/environmental state and is acting on weird assumptions, especially once commands...