can-ai-code issues

Evaluate Qwen/CodeQwen1.5-7B for code completion and FIM objectives

model request

Too few questions for senior category

2

Hi! Thanks for a useful benchmark. The separation of quants is great. Am I right in thinking that we currently have [only one](https://github.com/the-crypt-keeper/can-ai-code/blob/main/senior/recursion.yaml) question in the senior category? I'm afraid...

rodion-m

OpenAPI update for text-generation-webui

1

### Update: OpenAPI Integration for text-generation-webui This MR updates the evaluation script to use the OpenAI API with the [text-generation-webui](https://github.com/oobabooga/text-generation-webui). **Workflow** - No changes in the preparation steps. - To...

kisimoff

Please evaluate `Eurus-70b-nca-fixed` and `Eurus-70b-sft-fixed` (first public fine-tunes of base `codellama-70b`!!!)

5

These are very exciting as they are the first (AFAIK) public fine-tuned models of the base `codellama-70b`. The original upload was all messed up as they tried to use the...

jukofyork

Guide on how to evaluate models

1

Im willing to test a few models and share the results. I've looked at the readme, but couldn't wrap my head around how to benchmark a model. Any help would...

kisimoff

Evalute TechxGenus/Yi-9B-Coder

the-crypt-keeper

model request

Evaluate CohereForAI/c4ai-command-r-v01

Some interesting and unique capabilities in this one

the-crypt-keeper

model request

Refresh Streamlit Results App

- [x] Instead of side-by-side view, default to showing 'both' languages by summing their results - [ ] Replace streamlit table with aggrid - [ ] Multi-select with checkboxes -...

the-crypt-keeper

enhancement

Evaluate codefuse-ai/CodeFuse-DeepSeek-33B

Currently #1 on the Big Code Model Leaderboard. CodeFuse-CodeLlama-34B also looks worth a look

the-crypt-keeper

model request

Evaluate the new mistral-large and other new closed mistral models

the-crypt-keeper

model request

can-ai-code
can-ai-code copied to clipboard

Metadata

Evaluate Qwen/CodeQwen1.5-7B for code completion and FIM objectives

Too few questions for senior category

OpenAPI update for text-generation-webui

Please evaluate `Eurus-70b-nca-fixed` and `Eurus-70b-sft-fixed` (first public fine-tunes of base `codellama-70b`!!!)

Guide on how to evaluate models

Evalute TechxGenus/Yi-9B-Coder

Evaluate CohereForAI/c4ai-command-r-v01

Refresh Streamlit Results App

Evaluate codefuse-ai/CodeFuse-DeepSeek-33B

Evaluate the new mistral-large and other new closed mistral models

← Metadata

Owner

Metadata

can-ai-code can-ai-code copied to clipboard

Metadata

← Metadata

Owner

Metadata

can-ai-code
can-ai-code copied to clipboard