EnergonAI

Large-scale model inference.

Results: 43 EnergonAI issues

Hi, is there any `generate` example for OPT models?

**Problem**

```
[root@2e71bfd17f96 inference]# export PYTHONPATH=/workspace/colossal/inference/examples/bert
[root@2e71bfd17f96 inference]# energonai service init --config_file=/workspace/colossal/inference/examples/bert/bert_config.py
Traceback (most recent call last):
  File "/opt/conda/lib/python3.9/site-packages/energonai/kernel/cuda_native/linear_func.py", line 5, in
    energonai_linear = importlib.import_module("energonai_linear_func")
  File "/opt/conda/lib/python3.9/importlib/__init__.py", line 127, in ...
```
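The traceback above shows the compiled CUDA extension `energonai_linear_func` failing to import, which usually means the extension was never built for this environment. A minimal sketch of a defensive import (the `load_linear_kernel` helper is hypothetical, not part of EnergonAI's API) that lets a caller fall back to a pure-PyTorch path instead of crashing:

```python
import importlib
import importlib.util


def load_linear_kernel(name="energonai_linear_func"):
    """Return the compiled extension module if it is installed, else None.

    The module name comes from the traceback above; checking with
    find_spec() first avoids raising ModuleNotFoundError at import time,
    so the caller can choose a fallback implementation.
    """
    if importlib.util.find_spec(name) is None:
        return None
    return importlib.import_module(name)


kernel = load_linear_kernel()
if kernel is None:
    # Fall back to a non-fused implementation, or re-run the
    # extension build for this CUDA/PyTorch combination.
    pass
```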

**Problem** If we run EnergonAI in Docker like:

```
docker run -ti --gpus all --rm --ipc=host -p 8010:8010 ...
```

and then inside the container run:

```
export PYTHONPATH=/workspace/colossal/inference/examples/bert
energonai service init --config_file=/workspace/colossal/inference/examples/bert/bert_config.py
```

The access...
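When a port published with `-p 8010:8010` is unreachable from the host, a common cause is the server inside the container binding to `127.0.0.1` rather than `0.0.0.0`. A small sketch for checking reachability (the `port_open` helper is hypothetical, shown only for debugging):

```python
import socket


def port_open(host, port, timeout=2.0):
    """Return True if a TCP connection to (host, port) succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


# Example: check the published port from the host machine.
# port_open("localhost", 8010) should be True once the server
# is listening on 0.0.0.0 inside the container.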

**Describe the feature:** We plan to introduce automated pipeline parallelism into EnergonAI, so that users only need to specify a few simple arguments to achieve pipeline...
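The core of such an automated scheme is deciding how to split a model's layers into contiguous pipeline stages. A minimal sketch of a balanced partition (this is an illustration of the idea, not EnergonAI's actual partitioning code):

```python
def partition_layers(num_layers, num_stages):
    """Split layer indices into contiguous, nearly equal pipeline stages.

    The first (num_layers % num_stages) stages get one extra layer,
    which keeps stage sizes within one layer of each other.
    """
    base, rem = divmod(num_layers, num_stages)
    stages, start = [], 0
    for s in range(num_stages):
        size = base + (1 if s < rem else 0)
        stages.append(list(range(start, start + size)))
        start += size
    return stages


# A 10-layer model on 3 pipeline stages:
# partition_layers(10, 3) -> [[0, 1, 2, 3], [4, 5, 6], [7, 8, 9]]
```

A real implementation would weight layers by cost (parameter count or profiled latency) rather than treating them as equal, but the interface stays the same: a list of layer groups, one per stage.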

enhancement

I can't find server.sh; how can I run an example now?

I'm trying to use the [OPT 66B](https://huggingface.co/facebook/opt-30b/tree/main) pre-trained model for inference on EnergonAI. After preprocessing the weights with the `preprocessing_ckpt_66b.py` script and starting the OPT server, the service hangs there when...

Does EnergonAI support the GPT model with int8 quantization under model parallelism?
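For context, int8 quantization typically maps each weight tensor to 8-bit integers plus a floating-point scale. A toy sketch of symmetric per-tensor quantization (pure Python, for illustration only, not EnergonAI's kernels):

```python
def quantize_int8(values):
    """Symmetric per-tensor int8 quantization: x ~= q * scale.

    The scale maps the largest magnitude to 127, so every
    quantized value fits in a signed 8-bit range.
    """
    amax = max(abs(v) for v in values)
    scale = amax / 127.0 if amax else 1.0
    q = [max(-127, min(127, round(v / scale))) for v in values]
    return q, scale


def dequantize_int8(q, scale):
    """Recover approximate float values from int8 codes."""
    return [v * scale for v in q]
```

Under model parallelism each rank would hold its own shard of the quantized weights and its own scales, so the scheme composes with tensor splitting; the open question in the issue is whether EnergonAI's fused kernels support this combination.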

Update: I think this is caused by running a VM on Unraid; the Ubuntu kernel being used is not quite standard. When attempting the OPT examples, via either Docker or...

Hi, I want to use num_beams with generate, but PipelineModel doesn't support it. Could you add support for num_beams? Best wishes.
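For reference, `num_beams` controls beam search: instead of greedily extending one sequence, the decoder keeps the top-k partial sequences by cumulative log-probability at each step. A toy sketch of the idea (not EnergonAI's or Hugging Face's implementation; `step_logprobs` is a hypothetical scoring callback):

```python
def beam_search(step_logprobs, num_beams=2, max_len=3):
    """Toy beam search over a token-scoring callback.

    step_logprobs(seq) returns {token: logprob} for the next token
    given the partial sequence seq. The beam keeps the num_beams
    highest-scoring partial sequences at every step.
    """
    beams = [([], 0.0)]  # (sequence, cumulative logprob)
    for _ in range(max_len):
        candidates = []
        for seq, score in beams:
            for tok, lp in step_logprobs(seq).items():
                candidates.append((seq + [tok], score + lp))
        candidates.sort(key=lambda c: c[1], reverse=True)
        beams = candidates[:num_beams]
    return beams[0][0]  # best full sequence


def toy_scorer(seq):
    # "a" is always more likely than "b" in this toy model.
    return {"a": -0.1, "b": -1.0}
```

Supporting this in a pipelined model is harder than in a single process because all beams must advance through every pipeline stage in lockstep, which is presumably why PipelineModel lacks it today.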

Hi, I am very interested in the distributed inference features of Colossal-AI. Since we have pre-trained NLP models from PyTorch or JAX, I wonder whether it is possible, or what should be...