Code Llama for VSCode
Use Code Llama with Visual Studio Code and the Continue extension. A local LLM alternative to GitHub Copilot.
An API which mocks llama.cpp to enable support for Code Llama with the Continue Visual Studio Code extension.
As of the time of writing and to my knowledge, this is the only way to use Code Llama with VSCode locally without signing up for a service or getting an API key. The one exception is Continue with Ollama, but Ollama doesn't support Windows or Linux. Code Llama for VSCode, on the other hand, is completely cross-platform and will run wherever Meta's own codellama code runs.
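In other words, Continue already knows how to talk to a llama.cpp server over HTTP, so a small Flask app can expose a llama.cpp-style endpoint and hand each prompt to Meta's codellama generation code instead. The sketch below only illustrates that idea and is not the actual llamacpp_mock_api.py; the route, port, and JSON field names are assumptions standing in for whatever Continue and llama.cpp really exchange.

# Illustrative sketch only -- not the real llamacpp_mock_api.py.
# The endpoint path, port, and JSON keys below are assumptions.
from flask import Flask, jsonify, request

app = Flask(__name__)

def generate_with_code_llama(prompt: str) -> str:
    # Placeholder: the real script would call Meta's codellama generator
    # (loaded once under torchrun) and return its completion text.
    return "..."

@app.route("/completion", methods=["POST"])
def completion():
    # Accept a llama.cpp-style request body and answer in the same shape,
    # so Continue believes it is talking to a llama.cpp server.
    body = request.get_json(force=True)
    text = generate_with_code_llama(body.get("prompt", ""))
    return jsonify({"content": text})

if __name__ == "__main__":
    app.run(host="127.0.0.1", port=8080)

Continue then points at this local server instead of a hosted API, which is the wiring the steps below set up.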
Now let's get started!
Setup
Prerequisites:
- The Continue extension installed and working in Visual Studio Code.
- Meta's codellama repository set up so you can run Code Llama locally.
After you are able to use both independently, we will glue them together with Code Llama for VSCode.
Steps:
- Move llamacpp_mock_api.py to your codellama folder and install Flask to your environment with pip install flask.
- Run llamacpp_mock_api.py with your Code Llama Instruct torchrun command. For example:
torchrun --nproc_per_node 1 llamacpp_mock_api.py \
--ckpt_dir CodeLlama-7b-Instruct/ \
--tokenizer_path CodeLlama-7b-Instruct/tokenizer.model \
--max_seq_len 512 --max_batch_size 4
- Type /config in VSCode with Continue and make changes to config.py so it looks like this.
Restart VSCode or reload the Continue extension and you should now be able to use Code Llama for VSCode!
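If Continue doesn't respond, it can help to confirm the mock server is reachable before debugging the editor side. The check below uses only the Python standard library; the URL, port, endpoint path, and JSON fields are assumptions and should be adjusted to match whatever llamacpp_mock_api.py reports when it starts.

import json
import urllib.request

# Hypothetical sanity check: POST a prompt to the local mock server.
# Adjust the host/port and path to match your running llamacpp_mock_api.py.
req = urllib.request.Request(
    "http://127.0.0.1:8080/completion",
    data=json.dumps({"prompt": "def fibonacci(n):"}).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req, timeout=60) as resp:
    print(resp.read().decode("utf-8"))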
TODO: Response streaming