Kyrylo Medianovskyi

Results: 1 comment by Kyrylo Medianovskyi

@Sadeghi85 @yugaljain1999 The high-level API does not support batch inference, and llama.cpp has no beam search implementation at the moment. Here is an example with Google MADLAD400...