OpenHands
Add eval workflow that triggers remote eval job
End-user friendly description of the problem this fixes or functionality that this introduces
- [ ] Include this change in the Release Notes. If checked, you must provide an end-user friendly description for your change below
Give a summary of what the PR does, explaining any non-trivial design decisions
This workflow triggers a remote eval job that, for now, evaluates a single instance. The goal at this stage is only to get the basic setup working.
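As a rough illustration of what "triggering a remote eval job" could look like from a workflow step, here is a minimal sketch that assumes the remote job listens for a GitHub `repository_dispatch` event. The event type `run-eval`, the payload fields `ref` and `eval_instances`, and the helper names are hypothetical and not taken from this PR; the actual workflow may use a different mechanism.

```shell
#!/usr/bin/env bash
# Hypothetical sketch, not the workflow added by this PR.
# Assumes the remote eval job is started by a repository_dispatch event.
set -euo pipefail

# Build the JSON body for the dispatch request.
# "run-eval", "ref" and "eval_instances" are illustrative names (assumptions).
build_eval_payload() {
  local ref="$1" instances="$2"
  printf '{"event_type":"run-eval","client_payload":{"ref":"%s","eval_instances":%d}}' \
    "$ref" "$instances"
}

# Send the dispatch to the repository that hosts the eval job.
# Requires GITHUB_TOKEN in the environment; defined here but not called below.
trigger_eval() {
  local repo="$1" payload="$2"
  curl -sS -X POST \
    -H "Accept: application/vnd.github+json" \
    -H "Authorization: Bearer ${GITHUB_TOKEN}" \
    "https://api.github.com/repos/${repo}/dispatches" \
    -d "$payload"
}

# Dry run: print the payload for a single-instance evaluation of commit 07983bb.
payload="$(build_eval_payload "07983bb" 1)"
echo "$payload"
```

Separating payload construction from the network call keeps the request body easy to inspect before anything is sent; a real step would call `trigger_eval` with the target repository and a token that has permission to dispatch events.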
Link of any specific issues this addresses
To run this PR locally, use the following command:
docker run -it --rm -p 3000:3000 -v /var/run/docker.sock:/var/run/docker.sock --add-host host.docker.internal:host-gateway -e SANDBOX_RUNTIME_CONTAINER_IMAGE=docker.all-hands.dev/all-hands-ai/runtime:07983bb-nikolaik --name openhands-app-07983bb docker.all-hands.dev/all-hands-ai/openhands:07983bb