Devis Lucato

Results: 288 comments by Devis Lucato

> > with batching: is the client sending too many chunks per batch?
>
> I'm still not sure how I can enable batching in Kernel Memory when running as...

Batch embedding generation is ready and released, thanks @alkampfergit (https://github.com/microsoft/kernel-memory/pull/531)! Quick notes:

* batch support added to the OpenAI and Azure OpenAI embedding generators
* batch size is configurable (sketch below). Default for OpenAI...
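For illustration, a minimal sketch of how the batch size could be set when configuring the embedding generator through the builder. The `MaxEmbeddingBatchSize` property name and the default value are assumptions based on the notes above, not verified against the released config classes:

```csharp
using System;
using Microsoft.KernelMemory;

// Minimal sketch, assuming the embedding generator config exposes a
// batch-size property (the name below is an assumption and may differ
// in the released API). With batching enabled, multiple chunks are
// sent per embedding request instead of one request per chunk.
var memory = new KernelMemoryBuilder()
    .WithOpenAI(new OpenAIConfig
    {
        APIKey = Environment.GetEnvironmentVariable("OPENAI_API_KEY"),
        EmbeddingModel = "text-embedding-ada-002",
        MaxEmbeddingBatchSize = 100 // assumption: defaults differ per provider
    })
    .Build<MemoryServerless>();
```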

Work left before closing:

- reproducing 429
- change KM code to surface 429s appropriately (sketch below). E.g. when calling the KM service, if the AI internally returns 429, the KM web service should return...
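To make the second point concrete, here is a hypothetical sketch of what surfacing a 429 could look like in a minimal ASP.NET endpoint. The exception type and route are invented for illustration; they are not KM's actual types:

```csharp
using System;
using System.Net;
using Microsoft.AspNetCore.Builder;
using Microsoft.AspNetCore.Http;

var app = WebApplication.Create();

// Hypothetical sketch: when the call into the AI backend throws a
// throttling error, the web service replies with HTTP 429 instead of
// a generic 500, so the caller knows to back off and retry.
app.MapPost("/ask", async (HttpContext ctx) =>
{
    try
    {
        throw new UpstreamThrottlingException(); // stand-in for the AI call
    }
    catch (UpstreamThrottlingException)
    {
        ctx.Response.StatusCode = (int)HttpStatusCode.TooManyRequests;
        await ctx.Response.WriteAsync("Upstream AI service is throttling; retry later.");
    }
});

app.Run();

// Invented for illustration only; KM defines its own exception types.
public sealed class UpstreamThrottlingException : Exception { }
```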

^^ It's the same policy implemented here: https://github.com/microsoft/kernel-memory/blob/main/extensions/AzureOpenAI/Internals/ClientSequentialRetryPolicy.cs, used for both OpenAI and Azure OpenAI. In case of throttling (429) and 503, KM retries following the delay provided by the remote...
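The gist of that policy, as a standalone sketch rather than the actual ClientSequentialRetryPolicy code: retry sequentially on 429/503, honoring the delay the remote service suggests when one is provided:

```csharp
using System;
using System.Net;
using System.Net.Http;
using System.Threading.Tasks;

static class RetrySketch
{
    // Standalone sketch of the retry idea described above, not the real
    // implementation: on 429 (throttling) or 503, wait for the
    // server-suggested delay (Retry-After header) and retry in sequence.
    public static async Task<HttpResponseMessage> SendWithRetryAsync(
        HttpClient client, Func<HttpRequestMessage> requestFactory, int maxRetries = 3)
    {
        for (var attempt = 0; ; attempt++)
        {
            // A new HttpRequestMessage per attempt: instances are single-use
            var response = await client.SendAsync(requestFactory());
            var retryable = response.StatusCode == HttpStatusCode.TooManyRequests
                            || response.StatusCode == HttpStatusCode.ServiceUnavailable;
            if (!retryable || attempt >= maxRetries) { return response; }

            // Follow the delay provided by the remote service when available
            var delay = response.Headers.RetryAfter?.Delta ?? TimeSpan.FromSeconds(1);
            await Task.Delay(delay);
        }
    }
}
```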

We tried making it async during the initial implementation, but it would affect the speed and complexity of the text chunker, which would need quite a bit of rewriting, and...

IIRC Llama uses SentencePiece; is anything available in that direction?

@glorious-beard thank you! As soon as I get a chance I'll do some tests 👍

Some updates:

* Docker image is available, notes in the main README
* All settings can be set using env vars, following the usual .NET configuration approach (example below); see Service's README...
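On the env vars point: the standard .NET convention maps a double underscore in an environment variable name to the `:` separator in configuration keys. The exact key path below is an assumption based on a typical KM appsettings layout, so check the Service's README for the real keys:

```csharp
using Microsoft.Extensions.Configuration;

// Standard .NET configuration layering: env vars override JSON values.
// Setting KernelMemory__Services__OpenAI__APIKey in the shell (or via
// docker -e) overrides KernelMemory:Services:OpenAI:APIKey from
// appsettings.json. The key path here is an assumption, not verified.
var config = new ConfigurationBuilder()
    .AddJsonFile("appsettings.json", optional: true)
    .AddEnvironmentVariables()
    .Build();

var apiKey = config["KernelMemory:Services:OpenAI:APIKey"];
```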

Update: I started looking into it and made a few changes in #201. This will need some more involved work, revisiting how text is extracted. It's doable, but not...

That's correct. Currently the service uses the same model for questions and summarization. You can use a different handler for summarization though, with your custom settings. Plugging in custom handlers...
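The registration pattern looks roughly like the sketch below. `MySummarizationHandler` is hypothetical (it would implement KM's `IPipelineStepHandler`, whose exact signature varies across versions, so its definition is omitted), and the step names in the import call are from memory; check the KM examples for the current API:

```csharp
using Microsoft.KernelMemory;

// Rough sketch, signatures from memory: register a custom summarization
// handler under its own step name, then run a pipeline that uses it in
// place of (or alongside) the default steps.
var memory = new KernelMemoryBuilder().Build<MemoryServerless>();
memory.Orchestrator.AddHandler<MySummarizationHandler>("my_summarize");

// Default step names ("extract", "partition", ...) are assumptions here;
// the custom step is slotted in where summarization should happen.
await memory.ImportDocumentAsync(
    "report.docx",
    steps: new[] { "extract", "partition", "my_summarize", "gen_embeddings", "save_records" });
```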