
TinyGPT

A tiny GPT-2 inference implementation written from scratch in C++11, based mainly on the picoGPT project.

Accompanying blog post: Write a GPT from scratch (TinyGPT)

Core classes

Tensor: a tensor class with a NumPy-like interface.
Model: the GPT-2 model implementation, written with reference to gpt2_pico.py.
Tokenizer: a BPE tokenizer that follows exactly the same logic as GPT-2's encoder.py.
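
To make the tokenizer entry more concrete, here is a minimal sketch of the core BPE merge loop: repeatedly find the adjacent symbol pair with the lowest merge rank and fuse every occurrence of it. The names bpeMerge, Pair and MergeRanks are hypothetical and chosen for illustration; they are not TinyGPT's actual API, and the byte-to-unicode mapping and regex pre-splitting that encoder.py performs are left out.

```cpp
#include <cstddef>
#include <limits>
#include <map>
#include <string>
#include <utility>
#include <vector>

// Hypothetical types for illustration only; not TinyGPT's real interface.
using Pair = std::pair<std::string, std::string>;
using MergeRanks = std::map<Pair, int>;  // lower rank = higher merge priority

// Repeatedly merge the adjacent pair with the lowest rank until no
// adjacent pair is present in the rank table.
std::vector<std::string> bpeMerge(std::vector<std::string> symbols,
                                  const MergeRanks& ranks) {
  const int kNoRank = std::numeric_limits<int>::max();
  while (symbols.size() > 1) {
    // Find the best-ranked adjacent pair in the current sequence.
    int bestRank = kNoRank;
    Pair best;
    for (std::size_t i = 0; i + 1 < symbols.size(); ++i) {
      MergeRanks::const_iterator it =
          ranks.find(Pair(symbols[i], symbols[i + 1]));
      if (it != ranks.end() && it->second < bestRank) {
        bestRank = it->second;
        best = it->first;
      }
    }
    if (bestRank == kNoRank) break;  // nothing left to merge

    // Fuse every occurrence of the best pair, scanning left to right.
    std::vector<std::string> merged;
    for (std::size_t i = 0; i < symbols.size();) {
      if (i + 1 < symbols.size() && symbols[i] == best.first &&
          symbols[i + 1] == best.second) {
        merged.push_back(best.first + best.second);
        i += 2;
      } else {
        merged.push_back(symbols[i]);
        i += 1;
      }
    }
    symbols.swap(merged);
  }
  return symbols;
}
```

With a toy rank table containing only ("l", "o") and ("lo", "w"), the symbols l, o, w, e, r collapse to low, e, r. A real tokenizer would then look each resulting symbol up in the vocabulary to obtain token ids.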

Build and Run

1. Get the code

git clone --recurse-submodules https://github.com/keith2018/TinyGPT.git

2. Install Intel MKL (Math Kernel Library)

Official website: Intel®-Optimized Math Library for Numerical Computing on CPUs & GPUs

3. Download GPT-2 model file

python3 tools/download_gpt2_model.py

If the download succeeds, you will see the file model_file.data in the directory assets/gpt2.

4. Build and Run

mkdir build
cmake -B ./build -DCMAKE_BUILD_TYPE=Release
cmake --build ./build --config Release

This generates the executable and copies the assets to the directory app/bin. You can then run the demo:

cd app/bin
./TinyGPT_demo
[DEBUG] TIMER TinyGPT::Model::loadModelGPT2: cost: 800 ms
[DEBUG] TIMER TinyGPT::Encoder::getEncoder: cost: 191 ms
INPUT:Alan Turing theorized that computers would one day become
GPT:the most powerful machines on the planet.
INPUT:exit
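
The completion in the session above is produced one token at a time. The following is a minimal sketch of greedy decoding, the strategy used by picoGPT, on which this project is based. It assumes a hypothetical callback, NextLogitsFn, that returns the next-token logits for the ids seen so far. Neither that type nor generateGreedy is part of TinyGPT's real interface.

```cpp
#include <algorithm>
#include <cstddef>
#include <functional>
#include <vector>

// Hypothetical callback: maps the token ids so far to one logit per
// vocabulary entry for the next token. Stands in for the model forward pass.
using NextLogitsFn =
    std::function<std::vector<float>(const std::vector<int>&)>;

// Greedy decoding: at each step append the token with the highest logit.
std::vector<int> generateGreedy(std::vector<int> ids, std::size_t nTokens,
                                const NextLogitsFn& nextLogits) {
  std::vector<int> generated;
  for (std::size_t step = 0; step < nTokens; ++step) {
    std::vector<float> logits = nextLogits(ids);
    if (logits.empty()) break;  // defensive: no vocabulary, stop early
    int next = static_cast<int>(
        std::max_element(logits.begin(), logits.end()) - logits.begin());
    ids.push_back(next);        // the new token becomes part of the context
    generated.push_back(next);  // return only the newly generated tokens
  }
  return generated;
}
```

In a full pipeline the prompt would first be encoded to ids by the tokenizer, and the generated ids decoded back to text afterwards. This sketch recomputes the whole context at every step, which is simple but not the fastest approach.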

Dependencies

GEMM acceleration
- intel-mkl https://www.intel.com/content/www/us/en/developer/tools/oneapi/onemkl.html
JSON parser
- json11 https://github.com/dropbox/json11
Tokenizer regular-expression matching
- re2 https://github.com/google/re2
- abseil-cpp https://github.com/abseil/abseil-cpp
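
For context on the GEMM dependency: GEMM is general matrix multiplication, the operation that dominates transformer inference. The sketch below is a plain reference version with a hypothetical name, naiveGemm, shown only to illustrate what an optimized BLAS routine such as MKL's cblas_sgemm computes far faster. It is not taken from TinyGPT, and which MKL routine the project actually calls is not stated here.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Reference GEMM: C (m x n) = A (m x k) * B (k x n), all row-major.
// Hypothetical helper for illustration; an optimized BLAS replaces this.
std::vector<float> naiveGemm(const std::vector<float>& a,
                             const std::vector<float>& b,
                             std::size_t m, std::size_t k, std::size_t n) {
  assert(a.size() == m * k);
  assert(b.size() == k * n);
  std::vector<float> c(m * n, 0.0f);
  for (std::size_t i = 0; i < m; ++i) {
    for (std::size_t p = 0; p < k; ++p) {
      const float aip = a[i * k + p];
      // Inner loop walks rows of B and C contiguously (cache friendly).
      for (std::size_t j = 0; j < n; ++j) {
        c[i * n + j] += aip * b[p * n + j];
      }
    }
  }
  return c;
}
```

Multiplying the 2x2 matrices [1 2; 3 4] and [5 6; 7 8] with this function gives [19 22; 43 50].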

License

This code is licensed under the MIT License (see LICENSE).
