mistral.rs
Blazingly fast LLM inference.
This PR implements our first embedding model: nomic-ai/nomic-embed-text-v1!
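For context, an embedding model maps text to a fixed-length vector, and downstream tasks usually compare those vectors by cosine similarity. A minimal, dependency-free sketch of that comparison (the toy vectors below are made up; a real model such as nomic-embed-text-v1 would output much longer vectors):

```python
import math

def cosine_similarity(a, b):
    # Cosine similarity: dot(a, b) / (|a| * |b|)
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Toy 4-dimensional "embeddings"; real text embeddings are
# hundreds of dimensions, but the comparison is the same.
query = [0.1, 0.9, 0.2, 0.0]
doc_a = [0.1, 0.8, 0.3, 0.1]   # close to the query
doc_b = [0.9, 0.0, 0.1, 0.7]   # far from the query

print(cosine_similarity(query, doc_a) > cosine_similarity(query, doc_b))  # True
```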
This PR adds flake support for [Nix](https://nixos.org/).
- [ ] Support for LongRope (this is supported with ISQ in non-GGUF models, though) - The challenge is that the scaling information is not present in the GGUF file...
**Describe the bug** I am not sure if that's a bug. Python 3.10, M1.

```python
from mistralrs import Runner, Which, ChatCompletionRequest, Architecture

runner = Runner(
    Which.Plain(
        model_id="google/gemma-2-9b-it",
        repeat_last_n=64,
        tokenizer_json=None,
        arch=Architecture.Gemma,
    )
...
```
- [x] Loader and model - [ ] ISQ - [ ] AnyMoE - [ ] Device Mapping - [ ] X-LoRA/LoRA - [ ] Adapter activation
[Dolphin Vision 72B](https://huggingface.co/cognitivecomputations/dolphin-vision-72b) is a fine-tune of the base model [Qwen/Qwen2-72B](https://huggingface.co/Qwen/Qwen2-72B) that adds vision. This example uses transformers:

```python
import torch
import transformers
from transformers import AutoModelForCausalLM, AutoTokenizer
from ...
```
**Describe the bug** High CPU use, no GPU use. macOS 14.4.1, MacBook Pro M1 Max, 64 GB.

```
cargo build --example phi3v --release --features metal
```

It takes minutes to execute...
This is a tracking issue for the development of AnyMoE, which will be broken up into several PRs.

- [x] Core functionality, plain models, all APIs: #476
- [x] Support...
This PR adds GPTQ quantization ([paper here](https://arxiv.org/abs/2210.17323)) support. Refs: #418, #448.
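As a rough illustration (not mistral.rs code), GPTQ-style quantization stores weights as low-bit integers with a shared scale and zero-point per group. The sketch below shows only that round-to-nearest storage scheme; the actual GPTQ algorithm additionally uses second-order (Hessian) information to compensate rounding error, which is omitted here:

```python
def quantize_group(weights, bits=4):
    """Round-to-nearest quantization of one weight group with a shared
    scale/zero-point. This is the storage format; GPTQ proper also
    corrects rounding error using second-order statistics."""
    qmax = (1 << bits) - 1              # 15 for 4-bit
    lo, hi = min(weights), max(weights)
    scale = (hi - lo) / qmax or 1.0     # guard against constant groups
    zero = lo
    q = [round((w - zero) / scale) for w in weights]  # ints in [0, qmax]
    dq = [v * scale + zero for v in q]                # dequantized floats
    return q, dq

q, dq = quantize_group([-0.5, -0.1, 0.0, 0.3, 0.5])
print(q)   # [0, 6, 8, 12, 15]
```

The worst-case per-weight error of this scheme is half the group scale, which is why smaller groups (at the cost of more scale metadata) give better accuracy.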
## Introduction

This implementation is based on my work for [candle](https://github.com/huggingface/candle). However, it incorporates some notable differences:

* I have completely removed support for the model format used in the...