can-ai-code
can-ai-code copied to clipboard
Self-evaluating interview for AI coders
Hi! Thanks for a useful benchmark. The separation of quants is great. Am I right in thinking that we currently have [only one](https://github.com/the-crypt-keeper/can-ai-code/blob/main/senior/recursion.yaml) question in the senior category? I'm afraid...
### Update: OpenAPI Integration for text-generation-webui This MR updates the evaluation script to use the OpenAI API with the [text-generation-webui](https://github.com/oobabooga/text-generation-webui). **Workflow** - No changes in the preparation steps. - To...
These are very exciting as they are the first (AFAIK) public fine-tuned models of the base `codellama-70b`. The original upload was all messed up as they tried to use the...
Im willing to test a few models and share the results. I've looked at the readme, but couldn't wrap my head around how to benchmark a model. Any help would...
Some interesting and unique capabilities in this one
- [x] Instead of side-by-side view, default to showing 'both' languages by summing their results - [ ] Replace streamlit table with aggrid - [ ] Multi-select with checkboxes -...
Currently #1 on the Big Code Model Leaderboard. CodeFuse-CodeLlama-34B also looks worth a look