
Examples using MLX Swift

128 mlx-swift-examples issues

As the [tech report](https://github.com/OpenBMB/MiniCPM/blob/main/report/MiniCPM_4_Technical_Report.pdf) mentions, it is really fast. An MLX version already exists: https://huggingface.co/mlx-community/MiniCPM4-8B-4bit

Nice release! However, I've run into a couple of small issues with Gemma3; any help, insights, or fixes would be greatly appreciated. Thank you! - First off, I'm puzzled about this...

Currently, the `Generation` enum has three cases: `chunk`, `info`, and `toolCall`. Many newer APIs (such as Ollama's `thinking` property on `Message`) now expose "thinking" content directly in their...
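A minimal sketch of what the request could look like: a fourth case added alongside the three existing ones. The case name, associated value types, and the `render` helper are all assumptions for illustration, not the library's actual API.

```swift
// Hypothetical sketch only: `thinking` and the payload types are assumed,
// mirroring how APIs like Ollama surface reasoning text separately.
enum Generation {
    case chunk(String)     // a piece of generated output text
    case info(String)      // generation metadata (payload type assumed)
    case toolCall(String)  // a tool invocation (payload type assumed)
    case thinking(String)  // proposed: model "thinking" text
}

// A consumer can then route thinking text separately from normal output.
func render(_ generation: Generation) -> String {
    switch generation {
    case .chunk(let text):
        return text
    case .thinking(let text):
        return "<think>\(text)</think>"
    case .info, .toolCall:
        return ""
    }
}
```

One design question this raises is whether `thinking` should be a new case or a flag on `chunk`; a separate case keeps the switch exhaustive, so existing consumers are forced to decide how to handle it.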

Creating a halved version of bge-large using the following Python code (imports added for completeness):

```python
from transformers import AutoModel, AutoTokenizer

hf_model = AutoModel.from_pretrained("BAAI/bge-large-en-v1.5")
hf_model.half()
hf_model.save_pretrained(model_path)
tokenizer = AutoTokenizer.from_pretrained("BAAI/bge-large-en-v1.5")
tokenizer.save_pretrained(tokenizer_path)
```

This seems to work just fine. However, loading this...

While working on the prompt cache together with the quantized KV cache, I've noticed that `maybeQuantizeKVCache` converts the `SimpleKVCache` into a quantized KV cache. However, the cache reference passed to `TokenIterator`...
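The failure mode described above can be illustrated with a reduced sketch. The class names below are stand-ins for the real MLX Swift cache types, and `maybeQuantize` is a simplified stand-in for `maybeQuantizeKVCache`: when a helper replaces a cache object in an array, any caller that captured the old reference keeps writing to the stale cache.

```swift
// Stand-in types; not the actual MLX Swift cache classes.
class SimpleCache {
    var tokens = 0
}
class QuantizedCache: SimpleCache {}

// Replaces each cache with a quantized one; old references become stale.
func maybeQuantize(_ caches: inout [SimpleCache]) {
    caches = caches.map { _ in QuantizedCache() }
}

var caches: [SimpleCache] = [SimpleCache()]
let held = caches[0]   // e.g. the reference a token iterator captured earlier
maybeQuantize(&caches)
held.tokens += 1       // mutates the stale cache, not the quantized replacement
```

After `maybeQuantize` runs, `held` and `caches[0]` are different objects, so updates through the captured reference never reach the quantized cache — which matches the symptom described in the report.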

See https://github.com/ml-explore/mlx-swift-examples/pull/238/files

The gemma3_text model has an `input_embeddings` parameter:

- https://github.com/ml-explore/mlx-lm/blob/main/mlx_lm/models/gemma3_text.py#L224

Per the Python docs on `generate_step`:

```
input_embeddings (mx.array, optional): Input embeddings to use in place of prompt tokens....
```

In the example applications on macOS, we could allow users to select a download directory for the weights. For example, they could pick `~/.cache` to match the Python download directory...
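A rough sketch of one way the apps could persist such a choice, assuming a `UserDefaults` key and a fallback path; the key name and the `~/.cache/huggingface` default are illustrative assumptions, not the example apps' actual settings.

```swift
import Foundation

// Illustrative key; not an actual setting in the example apps.
let downloadDirectoryKey = "modelDownloadDirectory"

// Returns the user-chosen weights directory if one was saved,
// otherwise falls back to the cache location the Python tooling uses.
func downloadDirectory(defaults: UserDefaults = .standard) -> URL {
    if let path = defaults.string(forKey: downloadDirectoryKey) {
        return URL(fileURLWithPath: path, isDirectory: true)
    }
    return FileManager.default.homeDirectoryForCurrentUser
        .appendingPathComponent(".cache/huggingface", isDirectory: true)
}
```

In a real app the directory would be chosen via `NSOpenPanel` (with `canChooseDirectories` enabled) and stored as a security-scoped bookmark rather than a plain path, since sandboxed apps lose access to bare paths across launches.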

I'm on version 2.21.2, but it fails to build with the error "Ambiguous use of 'dictionary'" in Tokenizer.swift. Is there any workaround? Any help is appreciated.