llama.cpp
LLM inference in C/C++
Based on: https://github.com/qwopqwop200/GPTQ-for-LLaMa Current status: Something is busted. The output starts out decent, but quickly degrades into gibberish. This doesn't happen with either the original GPTQ-for-LLaMa using the same weights,...
This fixes bug #292 as suggested [here](https://github.com/ggerganov/llama.cpp/issues/292#issuecomment-1476318351).
If sorted maps are not necessary, change std::map to std::unordered_map. std::unordered_map is a hash table, so it should be faster than std::map when storing many items. std::map can be...
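A minimal sketch of the swap described above, using a hypothetical token table (the key and value types are assumptions, not the actual ones in llama.cpp). Both containers share the same lookup interface, so the change is usually just the type name; the difference is that std::map is a balanced tree with O(log n) lookups and sorted iteration, while std::unordered_map is a hash table with average O(1) lookups and no ordering guarantee.

```cpp
#include <map>
#include <string>
#include <unordered_map>

// Hypothetical example: a vocab-style lookup table. If nothing iterates
// the map in sorted key order, std::unordered_map is the cheaper choice.
int lookup_ordered(const std::string & key) {
    std::map<std::string, int> vocab;           // red-black tree, keys sorted
    for (int i = 0; i < 1000; ++i) {
        vocab["tok" + std::to_string(i)] = i;
    }
    return vocab.at(key);
}

int lookup_hashed(const std::string & key) {
    std::unordered_map<std::string, int> vocab; // hash table, unordered
    for (int i = 0; i < 1000; ++i) {
        vocab["tok" + std::to_string(i)] = i;
    }
    return vocab.at(key);                       // same interface as std::map
}
```

Since only the declaration changes, code that relies on sorted iteration (e.g. printing keys in order) is the one place the swap would not be behavior-preserving.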
In interactive mode, every time the model has to respond to user input, it has an increasingly reduced token budget, eventually generating only a few words before stopping. The token...
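The shrinking-budget behavior can be sketched as follows. This is a hypothetical reduction, not the actual llama.cpp loop: the `Session`, `n_predict`, and `n_remain` names are assumptions. The point is that if the remaining budget is shared across turns instead of being reset per response, each reply starts with whatever the previous one left over.

```cpp
// Hypothetical sketch of a per-turn token budget (names are assumed).
struct Session {
    int n_predict = 128; // tokens allowed per response
    int n_remain  = 128; // budget actually consumed by the generation loop
};

// Buggy variant: the budget carries over between turns, so each
// response can be shorter than the previous one.
int generate_buggy(Session & s, int tokens_wanted) {
    int produced = 0;
    while (produced < tokens_wanted && s.n_remain > 0) {
        --s.n_remain; // never replenished between turns
        ++produced;
    }
    return produced;
}

// Fixed variant: reset the budget at the start of every response.
int generate_fixed(Session & s, int tokens_wanted) {
    s.n_remain = s.n_predict;
    return generate_buggy(s, tokens_wanted);
}
```

With the buggy variant, a session with a 128-token budget that produces 100 tokens on turn one can only produce 28 on turn two; the fixed variant grants the full budget on every turn.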
- On older versions the function will silently fail without any ill effects
- Only used when params.use_color == true (--color)
- No windows.h dependency
Some moving around of ANSI color code emissions in recent patches has left us in a situation where RESET codes were getting defensively emitted after every token, resulting in multibyte...
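One way to avoid the per-token RESET problem is to track the current color state and emit an escape sequence only when the state actually changes, so token bytes are written contiguously. This is a hedged sketch, not the actual llama.cpp code: the enum, globals, and function names here are illustrative assumptions.

```cpp
#include <string>

// Hypothetical color-state tracking (names assumed for illustration).
enum class ConsoleColor { Default, Prompt, UserInput };

static ConsoleColor g_color = ConsoleColor::Default;
static std::string  g_out;  // stands in for the terminal output stream

void set_console_color(ConsoleColor color) {
    if (color == g_color) return; // no-op when already in this state
    switch (color) {
        case ConsoleColor::Default:   g_out += "\x1b[0m";         break; // RESET
        case ConsoleColor::Prompt:    g_out += "\x1b[33m";        break; // yellow
        case ConsoleColor::UserInput: g_out += "\x1b[1m\x1b[32m"; break; // bold green
    }
    g_color = color;
}

void emit_token(const std::string & token, ConsoleColor color) {
    set_console_color(color);
    g_out += token; // token bytes stay contiguous: no RESET interleaved,
                    // so a multibyte UTF-8 character split across tokens
                    // is not broken up by escape sequences
}
```

Because consecutive tokens in the same color produce no extra escapes, a character whose UTF-8 bytes are split across two tokens still reaches the terminal as an unbroken byte sequence.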
bit of refactoring per https://github.com/ggerganov/llama.cpp/pull/252
NOTE: I am seeing different outputs when running with these changes. They seem of equal quality, but this isn't something I observed when first testing this out on alpaca.cpp. It's...