Anton Osika

Results 13 issues of Anton Osika

- [ ] Fix Napoleon - [ ] Fix the TODO strings in documentation - [ ] Use a custom domain name for the project - [ ] Rebuild if...

# Chain of thought role playing prompt that can answer e.g. SAT questions well Credit goes to reddit user [oddlyspecificnumber7](https://old.reddit.com/user/oddlyspecificnumber7). [His post](https://old.reddit.com/r/ChatGPT/comments/10bpzjb/chatgpt_scores_80_correct12_out_of_15_on_sample/) where he got 12/15 SAT questions right. I...

We have scripts/benchmark.py. If we run it over more configs and store the results to RESULTS.md we will clearly be able to see what works and what does not. Would...

help wanted
good first issue

Create a step that asks “did it run/work/perfect”?, and store to memory folder. And let the benchmark.py script check that result, and convert it to a markdown table like benchmark/RESULTS.md...

help wanted
good first issue

I think the "catch KeyboardInterrupt" in scripts/benchmark.py is a good pattern for being able to shut down the run, so that it goes to the "ask for input" step

enhancement
good first issue

# Issue Template ## Expected Behavior Version 0.0.4 visible on https://github.com/AntonOsika/gpt-engineer right sidebar ## Current Behavior Version 0.0.3 visible on https://github.com/AntonOsika/gpt-engineer right sidebar ## More info: If one goes to...

In this way, we can also store this to benchmark results. A huge increase in tokens will not be worth a minor improvement in benchmark resultss.

enhancement
good first issue