Nadav Timor comments

Results 13 comments of


                                            Nadav Timor

Could not find a version that satisfies the requirement lingvo

Same for me. Tried both with `pip` and `pip3`.

Allow choices restriction

You might also find the `HfArgumentParser` relevant: https://github.com/huggingface/transformers/blob/514de24abfd4416aeba6a6455ad5920f57f3567d/src/transformers/hf_argparser.py#L109

> > You might also find the `HfArgumentParser` relevant: https://github.com/huggingface/transformers/blob/514de24abfd4416aeba6a6455ad5920f57f3567d/src/transformers/hf_argparser.py#L109 > > Not really if you have to install full HF package for it... The alternative below doesn't need the...

What is the `max_seq_len` in Mistral?

> What is the `max_seq_len` (or `max_position_embeddings`) of Mistral-7B-v0.1 when training? > > The official code says it is 128_000. (https://github.com/mistralai/mistral-src/blob/147c4e68279b90eb61b19bdea44e16f5539d5a5d/mistral/model.py#L201C69-L201C69) > > The config file in huggingface says it...

Add support for Palm, Claude-2, Cohere, Azure OpenAI Llama2, CodeLlama (100+LLMs) - using LiteLLM

> I do not think we are going to be replacing the official OpenAI API call with yours. @StellaAthena, what are your thoughts on adding @ishaan-jaff's `litellm` alongside the official...

[RFC]: Automate Speculative Decoding

We recently [showed](https://arxiv.org/pdf/2405.04304) that even a relatively simple speculation lookahead controller can speed up the decoding. > Paradoxically, under conditions of high request rates or low speculation accuracy, latency may...

`AIChat` doesn't save messages if provided with an `output_schema` (despite setting `save_messages=True`)

> This is intentional. From the relase notes: https://github.com/minimaxir/simpleaichat/releases/tag/v0.2.0 > > > In all cases, no messages are saved when using schema to prevent unintended behavior. You will have to...

Access Function Calling responses before Pydantic's parsing

> Maybe a parameter to also return the API response? Sounds legit.

Cannot Connect to Pod's Exposed Public IP & Port from Pod within same Region

I also need help with connecting to pods. Connecting via "Basic SSH Terminal" works, but "SSH over exposed TCP" doesn't. I checked the `~/.ssh/authorized_keys` file on the pod, and it...

[Feature] Add suffix decoding speculative algorithm

> > In the comparison here, it seems that the performance of ngram is not yet higher than that of "no spec". Could you provide the specific startup parameters and...