mPLUG-DocOwl

Does TinyChart inference work on a CPU?

Open matsuobasho opened this issue 1 year ago • 4 comments

I'm running the inference notebook and just changed the device to 'cpu':

import torch
from PIL import Image
from tinychart.model.builder import load_pretrained_model
from tinychart.mm_utils import get_model_name_from_path
from tinychart.eval.run_tiny_chart import inference_model
from tinychart.eval.eval_metric import parse_model_output, evaluate_cmds

def show_image(img_path):
    img = Image.open(img_path).convert('RGB')
    img.show()

# Build the model
model_path = "mPLUG/TinyChart-3B-768"

tokenizer, model, image_processor, context_len = load_pretrained_model(
    model_path, 
    model_base=None,
    model_name=get_model_name_from_path(model_path),
    device="cpu"
)

img_path = "my_image.png"
show_image(img_path)

text = "Create a brief summarization or extract key insights based on the chart image."
response = inference_model([img_path], text, model, tokenizer, image_processor, context_len, conv_mode="phi", max_new_tokens=1024)

I get an AssertionError on the last line: AssertionError: Torch not compiled with CUDA enabled
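
From what I understand, this error is raised whenever a CPU-only PyTorch build is asked to move a tensor or module to a CUDA device, so something in the pipeline must still be calling .cuda() or .to('cuda'). A minimal illustration of the failure mode (plain PyTorch, not TinyChart code):

import torch

x = torch.zeros(1)
# On a CPU-only PyTorch build, any attempt to move data to CUDA raises:
# AssertionError: Torch not compiled with CUDA enabled
x = x.cuda()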

matsuobasho avatar May 08 '24 19:05 matsuobasho

Hi @matsuobasho, TinyChart should be able to run on the CPU in theory, but that may not be supported by this code yet. I will check and update the code this week.
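
In the meantime, a quick sanity check is to confirm where the weights actually landed after loading:

# If this prints "cpu", the checkpoint itself loaded onto the CPU and the
# error likely comes from a later step in the inference pipeline.
print(next(model.parameters()).device)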

zhangliang-04 avatar May 09 '24 02:05 zhangliang-04

@zhangliang-04 thanks for the response, looking forward to the update

matsuobasho avatar May 09 '24 15:05 matsuobasho

@matsuobasho I just updated the code to support CPU inference. However, I did not encounter the error you mentioned. Please pull the new code and try again. If it still does not work, could you provide more information about the error (e.g. the stack trace)?
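
If it helps, one simple way to capture the full stack trace to paste here (standard library only):

import traceback

try:
    response = inference_model([img_path], text, model, tokenizer,
                               image_processor, context_len,
                               conv_mode="phi", max_new_tokens=1024)
except Exception:
    traceback.print_exc()  # prints the complete stack trace to stderr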

zhangliang-04 avatar May 13 '24 09:05 zhangliang-04

@zhangliang-04 thank you for making this update. I incorporated the changes and it seems to be running. However, the inference_model step has now been running for 25 minutes on a 60 KB PNG file.

For reference, I'm using the same type of prompt as in the tutorials:

text = "Create a brief summarization or extract key insights based on the chart image."
response = inference_model([img_path], text, model, tokenizer, image_processor, context_len, conv_mode="phi", max_new_tokens=1024)

I'm running on Windows 10 with 16 GB of RAM and an 11th Gen Intel(R) Core(TM) i7-1165G7 @ 2.80GHz CPU (3 years old).

At this processing speed, TinyChart is not usable for me.
Perhaps something is wrong with the default settings?
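
For what it's worth, these are the generic CPU knobs I plan to try next. I have not verified what load_pretrained_model defaults to on CPU, so this is only a sketch:

import torch

# Let PyTorch use the physical cores explicitly; it sometimes
# undersubscribes on Windows.
torch.set_num_threads(4)

# float16 kernels are slow or emulated on most CPUs; float32 is usually faster.
model = model.float()

# Capping the generation length bounds the worst-case runtime.
response = inference_model([img_path], text, model, tokenizer,
                           image_processor, context_len,
                           conv_mode="phi", max_new_tokens=256)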

matsuobasho avatar May 14 '24 14:05 matsuobasho