CTranslate2
Fast inference engine for Transformer models
Please help: how can I use CTranslate2 with CUDA 10.2? The problem is that the Nvidia Jetson Nano does not work with CUDA 11 and 12 (installation of CUDA 11 or...
Baichuan2 is a generative model similar to Llama. I found the following two differences: - 1. qkv are merged as W_pack, so I changed the file /ctranslate2/converters/transformers.py ![image](https://github.com/guillaumekln/faster-whisper/assets/122880585/f48aa58b-cc8a-4472-a27e-46ffd887afeb) - 2. rotary...
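The W_pack handling mentioned above can be sketched with NumPy: assuming the merged projection simply stacks the Q, K, and V weights along the output dimension (the dimensions and names here are illustrative, not taken from the actual converter), splitting it back apart looks like:

```python
import numpy as np

# Hypothetical hidden size; Baichuan2's W_pack stacks Q, K, V row-wise,
# so its shape is (3 * hidden, hidden).
hidden = 8
w_pack = np.arange(3 * hidden * hidden, dtype=np.float32).reshape(3 * hidden, hidden)

# Split the merged projection into separate Q, K, V weights, the form a
# CTranslate2 converter would register for the attention layer.
w_q, w_k, w_v = np.split(w_pack, 3, axis=0)

assert w_q.shape == w_k.shape == w_v.shape == (hidden, hidden)
assert np.array_equal(w_q, w_pack[:hidden])
```

Whether the stacking order is actually Q, K, V (rather than interleaved) has to be checked against the original checkpoint before adapting the converter.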
This is a feature request to implement a custom 4D mask for Llama (and possibly any other model), similar to https://github.com/huggingface/transformers/pull/27539
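For context on the request above: a custom 4D additive attention mask has shape (batch, heads, query_len, key_len), with 0 where attention is allowed and a large negative value where it is blocked. A minimal NumPy sketch of building such a mask (illustrative only, not CTranslate2 code):

```python
import numpy as np

batch, heads, q_len, kv_len = 1, 2, 4, 4

# Additive mask: 0.0 = attend, -inf = blocked.
mask = np.zeros((batch, heads, q_len, kv_len), dtype=np.float32)

# Example constraint: causal masking, applied identically to every head.
# A full 4D mask allows this to differ per batch element and per head.
causal = np.triu(np.ones((q_len, kv_len), dtype=bool), k=1)
mask[:, :, causal] = -np.inf
```

The point of the 4D shape is that the last assignment could instead vary per head or per sequence, which a 2D causal mask cannot express.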
Is this related to CTranslate2? The following is copied from [this issue](https://github.com/SYSTRAN/faster-whisper/issues/618). I have made a test of batching in faster-whisper, but the faster-whisper batch encode consumes multiple times as...
From https://github.com/guillaumekln/faster-whisper/issues/65 --- Some CPUs such as ARM Neoverse-N1 (Oracle Cloud free tier) support FP16 computation. It would be nice to have this feature because there could be up to...
I didn't see the documentation being updated regarding recent additions like distil-whisper and Mistral. It'd be nice to have that updated, as well as an example of each, like the...
Hey everyone! I believe I have found a bug in the GEMM operator. To the best of my knowledge, the output shape of the `c` StorageView in the GEMM operator...
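For reference on the shape question raised above: a conventional GEMM computes `c = alpha * a @ b + beta * c`, so multiplying an (m, k) matrix by a (k, n) matrix must yield an (m, n) output. A quick NumPy check of that expectation (this illustrates the convention only, not the CTranslate2 internals):

```python
import numpy as np

m, k, n = 3, 4, 5
a = np.random.rand(m, k).astype(np.float32)
b = np.random.rand(k, n).astype(np.float32)

# The GEMM output buffer is expected to have shape (m, n).
c = a @ b
assert c.shape == (m, n)
```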
Hello, can anyone help me with how to run the core42/jais-13b-chat model with CTranslate2? I ran the conversion script but ran into an error. Script used: ```ct2-transformers-converter --model core42/jais-13b-chat --quantization bfloat16 --output_dir...
I have spent time looking at the documentation but did not manage to find the proper way to get the prediction probabilities of all tokens. Also, how can I get the...
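One way to approach the question above, assuming the CTranslate2 scoring API: `Translator.score_batch` (and `Generator.score_batch`) returns per-token log-probabilities, and `translate_batch(..., return_scores=True)` returns a cumulative score per hypothesis. Only the conversion from log-probabilities to probabilities is sketched here, with stand-in values in place of a real `score_batch` result:

```python
import math

# Stand-in for the per-token log-probabilities that score_batch would
# return for one sequence (the numbers below are illustrative only).
token_log_probs = [-0.105, -2.303, -0.511]

# Convert each log-probability to a probability in [0, 1].
token_probs = [math.exp(lp) for lp in token_log_probs]
```

Note this gives the probability of the tokens that were actually scored; getting the full distribution over the vocabulary at each step is a different question and is not covered by the scoring API sketched here.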