
Deploy the large model locally

Open LMJOK opened this issue 1 year ago • 12 comments

Suppose I want to use an open-source large model. What parameters and files do I need to set? I am currently using Ollama to deploy qwen2 locally.

LMJOK avatar Sep 02 '24 05:09 LMJOK

```
usage: launch_scientist.py [-h] [--skip-idea-generation] [--skip-novelty-check]
                           [--experiment EXPERIMENT]
                           [--model {claude-3-5-sonnet-20240620,gpt-4o-2024-05-13,
                                     deepseek-coder-v2-0724,llama3.1-405b,
                                     qwen2bedrock/anthropic.claude-3-sonnet-20240229-v1:0,
                                     bedrock/anthropic.claude-3-5-sonnet-20240620-v1:0,
                                     bedrock/anthropic.claude-3-haiku-20240307-v1:0,
                                     bedrock/anthropic.claude-3-opus-20240229-v1:0vertex_ai/claude-3-opus@20240229,
                                     vertex_ai/claude-3-5-sonnet@20240620,
                                     vertex_ai/claude-3-sonnet@20240229,
                                     vertex_ai/claude-3-haiku@20240307}]
                           [--writeup {latex}] [--parallel PARALLEL] [--improvement]
                           [--gpus GPUS] [--num-ideas NUM_IDEAS]
launch_scientist.py: error: argument --model: invalid choice: 'qwen2'
```
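The error above comes from argparse's `choices` validation: `launch_scientist.py` only accepts model names that appear in its hard-coded list, so a local model has to be added to that list before the parser will accept it. A minimal sketch of the mechanism (the names below are illustrative, not the repo's actual code):

```python
import argparse

# Illustrative subset of the supported-model list; a local model
# must be appended here before argparse will accept it
AVAILABLE_LLMS = [
    "claude-3-5-sonnet-20240620",
    "gpt-4o-2024-05-13",
    "qwen2",  # added so `--model qwen2` is no longer an invalid choice
]

parser = argparse.ArgumentParser(prog="launch_scientist.py")
parser.add_argument("--model", choices=AVAILABLE_LLMS,
                    default="gpt-4o-2024-05-13")

args = parser.parse_args(["--model", "qwen2"])
```

Note that adding the name to the list only gets past argparse; the code that creates the API client still has to know how to route that model.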

LMJOK avatar Sep 02 '24 05:09 LMJOK

If you use Ollama, you can modify the existing OpenAI API code like so: https://ollama.com/blog/openai-compatibility
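To make the idea concrete, here is a hedged, stdlib-only sketch of what "pointing the OpenAI-style code at Ollama" amounts to: Ollama exposes an OpenAI-compatible `/v1/chat/completions` endpoint, by default on port 11434 (the URL and model name below are assumptions about a local setup):

```python
import json
import urllib.request

# Default Ollama endpoint (assumption: local install, default port)
OLLAMA_BASE_URL = "http://localhost:11434/v1"

def build_chat_request(model, messages, temperature=0.7):
    """Build an OpenAI-compatible chat completion request aimed at Ollama."""
    payload = json.dumps({
        "model": model,
        "messages": messages,
        "temperature": temperature,
    }).encode("utf-8")
    return urllib.request.Request(
        f"{OLLAMA_BASE_URL}/chat/completions",
        data=payload,
        headers={
            "Content-Type": "application/json",
            # Ollama ignores the key, but OpenAI-style clients require one
            "Authorization": "Bearer ollama",
        },
    )

req = build_chat_request("qwen2", [{"role": "user", "content": "Hello"}])
# urllib.request.urlopen(req) would send it, given a running Ollama server
```

Equivalently, with the `openai` package you would construct the client with `base_url="http://localhost:11434/v1"` and any non-empty `api_key`, as the linked blog post describes.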

conglu1997 avatar Sep 02 '24 06:09 conglu1997

So I just need to change how launch_scientist.py invokes the API so that it calls Ollama's API instead?

> If you use Ollama, you can modify the existing OpenAI API code like so: https://ollama.com/blog/openai-compatibility

So do I need to replace the OpenAI API calls in the code with Ollama API calls? Do I replace the way all models are called?

LMJOK avatar Sep 02 '24 09:09 LMJOK

> If you use Ollama, you can modify the existing OpenAI API code like so: https://ollama.com/blog/openai-compatibility

I have modified the code, but the model still shows as not supported. Could you advise how to modify it?

Error output:

```
/home/pc/anaconda3/envs/AI-scientist/bin/python /media/iplab/tony/lmj/bigmodol/AI-scientist/AI-Scientist/01launch_scientist.py
Using GPUs: [0, 1]
Using ollama with llama2.
Generating idea 1/50
Iteration 1/3
Failed to generate idea: Model meta-llama/llama-2-instruct not supported.
[... the same "Model meta-llama/llama-2-instruct not supported." failure repeats for ideas 2/50 through 50/50 ...]
Checking novelty of idea 0: adaptive_block_size
Error: Model meta-llama/llama-2-instruct not supported.  [repeated 10 times]
Checking novelty of idea 1: layerwise_learning_rates
Error: Model meta-llama/llama-2-instruct not supported.  [repeated 10 times]
All ideas evaluated.

Process finished with exit code 0
```

LMJOK avatar Sep 03 '24 12:09 LMJOK

You should comment out the try/except so that the real errors show up.

Also, we strongly advise against using Llama 2 models; any model weaker than the original GPT-4 will fail.

conglu1997 avatar Sep 03 '24 16:09 conglu1997

Is it possible to use the recent Ollama DeepSeek-R1 models with AI-Scientist? If so, please provide brief instructions.

slives-lab avatar Feb 06 '25 20:02 slives-lab

Of course; it only needs an edit here: https://github.com/SakanaAI/AI-Scientist/blob/main/ai_scientist/llm.py

conglu1997 avatar Feb 06 '25 20:02 conglu1997

Is the implementation shown below correct, or should it be more specific (e.g. ollama/deepseek-r1:7b)? Also, since ollama/deepseek isn't OpenAI, should the existing OpenAI API code be modified as suggested here: https://ollama.com/blog/openai-compatibility?

```python
AVAILABLE_LLMS = [
    # Existing models...

    # Add local Ollama DeepSeek R1 model
    "ollama/deepseek-r1",
]
```

slives-lab avatar Feb 06 '25 21:02 slives-lab

This should be routable through the OpenAI API, so use whatever model name and OpenAI client creation supports that model.
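As a rough illustration of that routing, one could dispatch on an `ollama/` prefix in the model name, mapping it to the local endpoint and the bare model name that Ollama expects. This is a hypothetical sketch, not the actual code in `ai_scientist/llm.py`; the helper name, URL, and list contents are assumptions:

```python
AVAILABLE_LLMS = [
    "claude-3-5-sonnet-20240620",
    "gpt-4o-2024-05-13",
    "ollama/deepseek-r1",  # hypothetical local entry
]

def resolve_model(model: str):
    """Map a model string to (base_url, api_model_name).

    Hypothetical helper: 'ollama/<name>' routes to the local Ollama
    server's OpenAI-compatible endpoint; everything else goes to OpenAI.
    """
    if model not in AVAILABLE_LLMS:
        raise ValueError(f"Model {model} not supported.")
    if model.startswith("ollama/"):
        return "http://localhost:11434/v1", model.split("/", 1)[1]
    return "https://api.openai.com/v1", model
```

An OpenAI client constructed with the returned `base_url` (and a dummy API key, which Ollama ignores) can then be used unchanged for the `ollama/` models.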

conglu1997 avatar Feb 07 '25 10:02 conglu1997

Using Ollama DeepSeek, python launch_scientist.py launches and proceeds, but I cannot resolve the error:

'Error: Failed to extract JSON from LLM output'

The images are missing, and the PDF shows only the template headers; it is incomplete. Most of the code appears to run to completion. Any ideas on how to fix this? generated_images.png isn't generated, and writeup.tex is 0 KB.
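A "Failed to extract JSON" error usually means the local model wrapped its JSON in markdown fences or surrounding prose that a strict parser rejects. A best-effort extractor along these lines can tolerate that; this is a hypothetical helper sketched for illustration, not the repo's actual parsing code:

```python
import json
import re

def extract_json(text: str):
    """Best-effort extraction of a JSON object from raw LLM output.

    Tries a fenced ```json block first, then falls back to the widest
    brace-delimited span; returns None if nothing parses.
    """
    candidates = []
    # Prefer a fenced ```json ... ``` block if present
    fenced = re.search(r"```(?:json)?\s*(\{.*?\})\s*```", text, re.DOTALL)
    if fenced:
        candidates.append(fenced.group(1))
    # Fall back to the outermost { ... } span in the raw text
    start, end = text.find("{"), text.rfind("}")
    if start != -1 and end > start:
        candidates.append(text[start:end + 1])
    for cand in candidates:
        try:
            return json.loads(cand)
        except json.JSONDecodeError:
            continue
    return None
```

Even with lenient parsing, though, smaller local models often fail to produce the schema the pipeline expects, which matches the maintainers' earlier warning about weak models.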

slives-lab avatar Feb 09 '25 21:02 slives-lab

It would be prudent to integrate the latest ollama/deepseek-R1 models into the working code, to avoid API charges from commercial vendors as well as the privacy concerns that come with them.

slives-lab avatar Feb 11 '25 17:02 slives-lab

I would recommend debugging which part fails in your integration, e.g. by printing debug logs from each LLM call.
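One lightweight way to get those logs without editing every call site is a decorator that records each LLM call's inputs and outputs and re-raises exceptions instead of swallowing them (a sketch under the assumption that LLM calls are plain Python functions; names are illustrative):

```python
import functools
import logging

logging.basicConfig(level=logging.DEBUG)
logger = logging.getLogger("llm_debug")

def log_llm_call(fn):
    """Wrap an LLM-calling function so its inputs, outputs, and failures
    are logged rather than silently swallowed by a broad try/except."""
    @functools.wraps(fn)
    def wrapper(*args, **kwargs):
        logger.debug("LLM call %s args=%r kwargs=%r", fn.__name__, args, kwargs)
        try:
            result = fn(*args, **kwargs)
        except Exception:
            # Re-raise so the real error surfaces in the traceback
            logger.exception("LLM call %s failed", fn.__name__)
            raise
        logger.debug("LLM call %s returned %r", fn.__name__, result)
        return result
    return wrapper
```

Applied to whatever function issues the request, this makes it obvious whether the failure is in the request, the model's response, or the downstream parsing.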

conglu1997 avatar Feb 11 '25 17:02 conglu1997