
Official inference library for Mistral models

172 mistral-inference issues

Dear Team, you've done a tremendous job! Thank you for creating a real alternative for French and other European languages. I would like to know whether it is possible to add...

As I understand the current MoeLayer, a gate calculates the weight to be applied to the output of each expert; the top k experts are selected and run on the data,...
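The routing described in this question can be sketched in a few lines. This is a minimal NumPy illustration of top-k mixture-of-experts gating, not the actual `MoeLayer` implementation; the function name, shapes, and the way experts are represented are all assumptions for illustration.

```python
import numpy as np

def moe_forward(x, gate_w, experts, k=2):
    """Hypothetical sketch of top-k MoE routing.

    x:       (d,) input vector
    gate_w:  (n_experts, d) gating weight matrix
    experts: list of callables, each mapping (d,) -> (d,)
    """
    logits = gate_w @ x                        # one gating score per expert
    topk = np.argsort(logits)[-k:]             # indices of the k highest-scoring experts
    weights = np.exp(logits[topk])
    weights /= weights.sum()                   # softmax over the selected experts only
    # Only the chosen experts run on the input; their outputs are weight-averaged.
    return sum(w * experts[i](x) for w, i in zip(weights, topk))
```

The key point the question is getting at: the softmax is renormalized over just the selected top-k scores, and the unselected experts never run.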

Fixed a dead link; it now points to the current documentation.

Actual behavior of https://docs.mistral.ai/usage/guardrailing: ![Screenshot 2023-12-31 at 6 34 14 PM](https://github.com/mistralai/mistral-src/assets/1118615/9805f684-36e3-49f4-9c3f-23278538dee9) It should be updated to https://docs.mistral.ai/platform/guardrailing/: ![image](https://github.com/mistralai/mistral-src/assets/1118615/48754d80-02df-4884-ac62-512630835193)

Hi, I'd like to know whether Mistral is planning to support more languages?

Minor typos in `readme.md` and tutorials

Here is the `SFTTrainer` setup I used for fine-tuning Mistral:

```python
trainer = SFTTrainer(
    model=peft_model,
    train_dataset=data,
    peft_config=peft_config,
    dataset_text_field=" column name",
    max_seq_length=3000,
    tokenizer=tokenizer,
    args=training_arguments,
    packing=packing,
)
trainer.train()
```

I found different...

Hi, I have used the source code here and downloaded the instruct-v0.2 weights from https://docs.mistral.ai/models/. In the source code, I have set `instruct: bool = True` in main.py. I...

I am fine-tuning the Mistral model using the following configuration:

```python
training_arguments = TrainingArguments(
    output_dir=output_dir,
    per_device_train_batch_size=per_device_train_batch_size,
    gradient_accumulation_steps=gradient_accumulation_steps,
    optim=optim,
    save_steps=save_steps,
    logging_strategy="steps",
    logging_steps=10,
    learning_rate=learning_rate,
    weight_decay=weight_decay,
    fp16=fp16,
    bf16=bf16,
    max_grad_norm=max_grad_norm,
    max_steps=13000,
    warmup_ratio=warmup_ratio,
    group_by_length=group_by_length,
    lr_scheduler_type=lr_scheduler_type...
```