CTranslate2 issues

Results 173 CTranslate2 issues

Likely bug: log_prob is not affected by sampling_temperature

**Context** In language model generation, we use the hyperparameter `sampling_temperature` to adjust the probability distribution of predicting the next token. A smaller `sampling_temperature` sharpens the distribution, whereas a larger `sampling_temperature`...

YuchenLi01
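
The temperature scaling described in the issue above can be sketched in plain Python. This is a minimal illustration of the general technique, not CTranslate2's implementation; the function name `softmax_with_temperature` and the example logits are assumptions made for illustration only.

```python
import math


def softmax_with_temperature(logits, temperature=1.0):
    """Return next-token probabilities after dividing logits by a temperature.

    Hypothetical helper for illustration; not part of the CTranslate2 API.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    # Subtract the maximum before exponentiating for numerical stability.
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


# Made-up logits for a three-token vocabulary.
logits = [2.0, 1.0, 0.1]

sharp = softmax_with_temperature(logits, temperature=0.5)  # sharper
base = softmax_with_temperature(logits, temperature=1.0)   # unchanged
flat = softmax_with_temperature(logits, temperature=2.0)   # flatter

# Log probability of the most likely token under each temperature.
log_probs = [math.log(p[0]) for p in (sharp, base, flat)]
```

Under this sketch, the log probability of a token depends on the temperature, which appears to be the behavior the issue expects `log_prob` to reflect.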

feat: add ruy sgemm implementation

This PR adds an SGEMM implementation using RUY. This was already [mentioned](https://github.com/SYSTRAN/faster-whisper/issues/237) in the `faster-whisper` repository. I implemented it because my experience with BLAS on Android was worse than this, and BLAS...

ebraraktas

Can install CTranslate2 in ppc64?

Hi, can CTranslate2 be installed on ppc64? Regards

jesulo

Output tokens logits

Hello, Unless I'm mistaken, I don't see any option in the translator translate_batch or generate_tokens functions to output the logits/probabilities of the generated tokens. However, this computation must be done...

SimonBenhamou

DirectML Support

Assume that I already followed Microsoft's instructions to [Enable PyTorch with DirectML on Windows](//learn.microsoft.com/en-us/windows/ai/directml/gpu-pytorch-windows) and the DirectML library loads correctly according to MS's example code. If I wanted to use...

gdiaz384

enhancement

Support sequence_bias when decoding

In Hugging Face Transformers, there is a generate option called `sequence_bias` that increases or decreases the logits of user-specified token sequences, using the `SequenceBiasLogitsProcessor`. It would be nice to have such a generation option...

mc-marcocheng
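
The idea behind a sequence bias can be sketched as follows. This is a simplified pure-Python illustration under stated assumptions, not the Hugging Face `SequenceBiasLogitsProcessor` code and not an existing CTranslate2 option: the helper `apply_sequence_bias` and its arguments are hypothetical. It assumes the bias is added to the logit of a sequence's last token whenever the tokens generated so far end with the rest of that sequence.

```python
def apply_sequence_bias(generated, logits, sequence_bias):
    """Return a copy of `logits` with biases applied for the next token.

    generated:     list of token ids produced so far
    logits:        list of next-token scores, indexed by token id
    sequence_bias: dict mapping a tuple of token ids to a float bias
    (Hypothetical helper for illustration only.)
    """
    biased = list(logits)
    for sequence, bias in sequence_bias.items():
        prefix, last = sequence[:-1], sequence[-1]
        if len(prefix) > len(generated):
            continue  # not enough context to complete this sequence
        # A single-token sequence has an empty prefix and always applies.
        if not prefix or tuple(generated[-len(prefix):]) == prefix:
            biased[last] += bias
    return biased


# Made-up example: discourage token 3 and encourage the sequence (1, 2).
bias = {(3,): -10.0, (1, 2): 5.0}
uniform = [0.0] * 5

after_match = apply_sequence_bias([0, 1], uniform, bias)
after_no_match = apply_sequence_bias([0, 0], uniform, bias)
```

In this sketch, token 2 is only boosted when the previous token is 1, while token 3 is penalized at every step.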

Exception when using some T5 model

When I tried some other T5 models that use the T5Tokenizer, e.g. `ct2-transformers-converter --model Rostlab/prot_t5_xl_uniref50 --output_dir ./prot-t5-ct2/`, there was an exception: You're trying to...

Horikitasaku

CANN Backend support

**Introduction** `CANN` (Compute Architecture of Neural Networks), developed by Huawei, is a heterogeneous computing architecture for AI scenarios. It provides multi-layer programming interfaces to help...

3manifold

Memory increase

I used a CTranslate2-quantized version of fastchat-t5 (https://huggingface.co/limcheekin/fastchat-t5-3b-ct2) as the LLM of a question answering system. The QA system is wrapped in a REST API. The model works really well. But an...

AIApprentice101

[Feature] CANN Backend support

`CANN` (Compute Architecture of Neural Networks), developed by Huawei, is a heterogeneous computing architecture for AI scenarios. It provides multi-layer programming interfaces to help users quickly build AI applications and...

3manifold
