CTranslate2
Fast inference engine for Transformer models
Convert the model (currently supports OpenNMT-py) and keep it in memory => inference without saving the model to disk. Customize the wrapper to reduce the memory used. TODO: implementations for other converters
Is MPS support on the roadmap? I wanted to use faster-whisper on my Mac, but it only uses the CPU.
Excellent work! Do you support the Qwen-1_8B-Chat model? Looking forward to your reply.
Hi, is there any way to extract the last hidden state (before the lm_head dense layer) of T5 or GPT models? There are some kinds of models that need to take...
I've been examining the encoded output of Whisper, and I see that the results differ depending on whether the same input is sent in as a batch or one by one. I made a...
Operating system: Ubuntu 22.04.2, Python 3.10.6, CTranslate2 3.16. When exporting the bigscience/bloomz model using `ct2-transformers-converter --force --model bigscience/bloomz --output_dir bloomz --quantization float16`, the conversion process works well for other bloom...
This PR adds quantized `Conv1D` inference on top of #1597. With the previous `int8` quantization implementation, this quantized inference couldn't bring any speedup because quantization itself was the bottleneck. To alleviate that,...
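For context on why quantization can dominate the runtime: before an `int8` kernel can run, the float inputs must be scanned for their range, scaled, and rounded, which is itself a full pass over the data. A minimal sketch of symmetric per-tensor `int8` quantization is shown below; this is a simplified illustration of the general technique, not CTranslate2's actual kernel (the `Quantized` struct and function names are hypothetical).

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstdint>
#include <vector>

// Symmetric per-tensor int8 quantization: one scale maps the largest
// absolute value onto the int8 range [-127, 127].
struct Quantized {
  std::vector<int8_t> values;
  float scale;  // multiply by this to recover approximate floats
};

Quantized quantize_int8(const std::vector<float>& x) {
  // First pass: find the dynamic range (this scan is part of the cost).
  float max_abs = 0.f;
  for (float v : x)
    max_abs = std::max(max_abs, std::fabs(v));
  const float scale = max_abs > 0.f ? max_abs / 127.f : 1.f;
  // Second pass: scale and round every element.
  Quantized q{std::vector<int8_t>(x.size()), scale};
  for (std::size_t i = 0; i < x.size(); ++i)
    q.values[i] = static_cast<int8_t>(std::lround(x[i] / scale));
  return q;
}

std::vector<float> dequantize(const Quantized& q) {
  std::vector<float> out(q.values.size());
  for (std::size_t i = 0; i < out.size(); ++i)
    out[i] = q.values[i] * q.scale;
  return out;
}
```

Because both passes touch every input element, a convolution whose arithmetic is cheap relative to its input size can end up spending most of its time here, which matches the bottleneck described in the PR.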
1. initial_prompt: When I convert the official Whisper model to CTranslate2 format, I can use `initial_prompt` normally. When I convert my fine-tuned Whisper model to CTranslate2 format and use `initial_prompt`, I get...
This PR adds armv7 support: * Implement slower generic versions of `div`, `mul_add`, `reduce_add`, `reduce_max`. * Rename `CT2_ARM64_BUILD` to `CT2_ARM_BUILD`. Tested this on a Galaxy S21 with the library built for `armv7`,...
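The generic versions mentioned in the PR might look something like the scalar loops below: plain C++ standing in for the NEON intrinsics used on armv8. The signatures are illustrative (assumed for this sketch), not copied from the CTranslate2 source; only the operation names come from the PR.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>

// Scalar fallbacks for targets without the 64-bit NEON intrinsics.
// Each loop handles one element per iteration instead of a SIMD lane.

// Element-wise division: c[i] = a[i] / b[i].
void div(const float* a, const float* b, float* c, std::size_t n) {
  for (std::size_t i = 0; i < n; ++i)
    c[i] = a[i] / b[i];
}

// Fused multiply-add: c[i] += a[i] * b[i].
void mul_add(const float* a, const float* b, float* c, std::size_t n) {
  for (std::size_t i = 0; i < n; ++i)
    c[i] += a[i] * b[i];
}

// Sum of all elements.
float reduce_add(const float* x, std::size_t n) {
  float sum = 0.f;
  for (std::size_t i = 0; i < n; ++i)
    sum += x[i];
  return sum;
}

// Maximum element (n must be > 0).
float reduce_max(const float* x, std::size_t n) {
  float m = x[0];
  for (std::size_t i = 1; i < n; ++i)
    m = std::max(m, x[i]);
  return m;
}
```

Scalar loops like these are slower than vectorized code but portable, which is the trade-off the PR accepts on armv7.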
Hi, CTranslate2 uses oneDNN. The latest oneDNN versions have [support for AMD GPUs](https://github.com/oneapi-src/oneDNN/tree/master/src/gpu/amd). This [requires Intel oneAPI DPC++](https://developer.codeplay.com/products/oneapi/amd/2023.0.0/guides/get-started-guide-amd). The same approach could [potentially enable NVIDIA GPU](https://developer.codeplay.com/products/oneapi/nvidia/2023.0.0/guides/get-started-guide-nvidia) support too. It would help...