mlx-examples MLX generate text, by default hallucinates more

Issue Description

Problem: When using the MLX generate text command with verbose set to false and the following parameters, the LLMs seem to hallucinate more frequently:

Temperature: 0.1 or 0

Top p: 1
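For context on what these two settings usually control, here is a minimal pure-Python sketch of temperature and top-p (nucleus) sampling. It is not MLX's implementation; the function name `sample_token` and its arguments are hypothetical and chosen only for illustration. It assumes the common convention that a temperature of 0 means greedy decoding and that top p of 1 keeps the whole distribution.

```python
import math
import random


def sample_token(logits, temperature=1.0, top_p=1.0, rng=random):
    """Pick a token index from raw logits (hypothetical helper, not an MLX API)."""
    # Assumption: temperature 0 is treated as greedy decoding (always the top logit).
    if temperature <= 0:
        return max(range(len(logits)), key=lambda i: logits[i])

    # Divide logits by the temperature, then apply a numerically stable softmax.
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]

    # Keep the smallest set of most likely tokens whose cumulative
    # probability reaches top_p. With top_p = 1 every token is kept.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, cumulative = [], 0.0
    for i in order:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:
            break

    # Sample among the kept tokens in proportion to their probabilities.
    weights = [probs[i] for i in kept]
    return rng.choices(kept, weights=weights, k=1)[0]


if __name__ == "__main__":
    logits = [2.0, 1.0, 0.1]
    print(sample_token(logits, temperature=0))
    print(sample_token(logits, temperature=0.1, top_p=1.0))
```

Under these assumptions, a temperature of 0 or 0.1 with top p of 1 makes decoding close to deterministic, so sampling randomness alone is unlikely to explain a large change in output quality.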

Models Used:

Mistral 7B v3

IBM Granite

Mistral 7B v2

Use Case: Text-to-SQL

Method Used: mlx.generate_text

Steps to Reproduce

1. Set up the environment with the specified models.
2. Run the MLX generate text command with verbose set to false.
3. Use a temperature of 0.1 or 0 and top p of 1.
4. Observe the output for hallucinations.

Expected Behavior

The models should generate accurate and coherent SQL queries without hallucinations.

Actual Behavior

The models produce outputs that are factually incorrect or nonsensical, indicating hallucinations.

Additional Information

Dataset: 20,000 examples split for training and validation.

Hardware: 128GB M3 Max

Aug 21 '24 03:08 kishoretvk

Could you share the commands you used and the precise model paths?

> Hardware: 40 GB A100 GPU.

Also, MLX is meant for Apple silicon. Is that a typo, or were you running it on a Linux machine?

Aug 21 '24 04:08 awni

Sorry, my bad. It is an M3 Max with 128 GB of RAM.

Aug 21 '24 16:08 kishoretvk

How about the commands / model paths?

Also, when you say "by default hallucinates more", what are you comparing against?
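One way to make the comparison concrete is to score two runs against the same reference queries. The sketch below is only an illustration: `normalize_sql` and `exact_match_rate` are hypothetical helpers, the sample queries are made up, and exact match after light normalization is a crude baseline that will undercount semantically equivalent SQL.

```python
import re


def normalize_sql(query):
    """Collapse whitespace, drop a trailing semicolon, and lowercase the query.

    Lowercasing also changes string literals, which is acceptable for a
    rough comparison but not for a strict one.
    """
    collapsed = re.sub(r"\s+", " ", query.strip())
    return collapsed.rstrip("; ").lower()


def exact_match_rate(predictions, references):
    """Return the fraction of predictions equal to their reference after normalization."""
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    if not references:
        return 0.0
    hits = sum(
        normalize_sql(p) == normalize_sql(r)
        for p, r in zip(predictions, references)
    )
    return hits / len(references)


if __name__ == "__main__":
    # Made-up example data: one reference and the output of two configurations.
    references = ["SELECT name FROM users WHERE id = 1;"]
    run_a = ["select name  from users where id = 1"]
    run_b = ["SELECT name FROM customers WHERE id = 1;"]
    print("run A:", exact_match_rate(run_a, references))
    print("run B:", exact_match_rate(run_b, references))
```

Reporting a number like this for each configuration (for example verbose on versus off, or MLX versus another runtime) would show what "more" is measured against.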

Aug 21 '24 17:08 awni