EnergonAI

Large-scale model inference.

Results 43 EnergonAI issues

![image](https://user-images.githubusercontent.com/19889835/229774960-be23598e-5452-46f2-bd05-fc04cd184d6d.png)

Hello, I just want to run inference with a pre-trained model in the terminal, without running an HTTP server. How can I do that?

I can't find `InferenceEngine` in energonai/engine.py, even though the import is written as `from energonai.engine import InferenceEngine`.

(pytorch) root@USER-20211001RA:~/EnergonAI-main/examples/opt# python opt_fastapi.py opt-125m --checkpoint ./restored.pt
/usr/local/anaconda3/envs/pytorch/lib/python3.9/site-packages/torch/library.py:130: UserWarning: Overriding a previously registered kernel for the same operator and the same dispatch key
operator: aten::index.Tensor(Tensor self, Tensor?[] indices) -> Tensor...

- polish code
- add setup script
- upgrade version to 0.0.2

Hello, I have launched OPT-125M inference and am sending requests to the server with Locust. But whatever value I set for max_batch_size, the InferenceEngine always runs with batch_size = 1. How can I...

=========================================================================================
**No pre-built kernel is found, build and load the layernorm kernel during runtime now**
=========================================================================================
**No modifications detected for re-loaded extension module layernorm, skipping build step...**
[W tensorpipe_agent.cpp:682] RPC...

I use Anaconda, Python 3.10, and PyTorch 1.13.1. When I ran the installation command _pip install ._, an error occurred. Part of the error message is: ``` Processing...

Hi, currently only the `linear` example shows a naive use of offload; the other examples, such as `opt`, `bloom`, and `gpt`, have no offload option. I am wondering...

How do I use this demo? Could you provide a detailed example?