llama-dfdx
LLaMa 7b with CUDA acceleration implemented in Rust. Minimal GPU memory needed!
```
ubuntu@instance-20230508-1136:~/repos/llama-dfdx$ ./target/release/llama-dfdx --model llama-7b-hf --disable-cache generate "Why is pi round?"
Detected model folder as LLaMa 7b.
Model size: 13476 MB
13476 MB of model parameters will be held in...
```
Blocking questions:
1. Is it safe to `std::mem::forget` the tensor? Is that all we need to do? What about the tensor's other fields?
2. Is the `Vec::from_raw_parts_mut` usage safe?
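For context on question 2, the general contract (independent of this crate's tensor internals) is that rebuilding a `Vec` from raw parts is only sound when the pointer, length, and capacity all come from a `Vec` of the same element type, and the original is forgotten so the allocation has exactly one owner. A minimal sketch of that pairing:

```rust
fn main() {
    let mut v = vec![1u32, 2, 3];
    let (ptr, len, cap) = (v.as_mut_ptr(), v.len(), v.capacity());

    // Forget the original so its destructor won't free the buffer;
    // ownership is about to be transferred to the rebuilt Vec.
    std::mem::forget(v);

    // SAFETY: ptr/len/cap were just taken from a live Vec<u32> that we
    // forgot above, so exactly one owner frees the allocation.
    let rebuilt = unsafe { Vec::from_raw_parts(ptr, len, cap) };
    assert_eq!(rebuilt, [1, 2, 3]);
}
```

If the tensor type carries other fields that own resources (as question 1 asks), forgetting the whole struct leaks those too, which is why the pairing above forgets only the `Vec` whose buffer is being reused.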
- llama is a generation model, so it can't really be used for chat
- vicuna is a chatbot
- alpaca is an instruction model

If not able to determine the "mode", a user could...
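One way to pick among these three modes is to sniff the model folder name. This is a hypothetical sketch, not llama-dfdx's actual API; the `Mode` enum and `detect_mode` function are illustrative names:

```rust
// Hypothetical sketch: infer a generate/chat/instruct "mode" from the
// model folder name. Not part of llama-dfdx.
#[derive(Debug, PartialEq)]
enum Mode {
    Generate, // plain LLaMa: free-form text generation
    Chat,     // vicuna-style chatbot
    Instruct, // alpaca-style instruction following
}

fn detect_mode(model_folder: &str) -> Option<Mode> {
    let name = model_folder.to_lowercase();
    if name.contains("vicuna") {
        Some(Mode::Chat)
    } else if name.contains("alpaca") {
        Some(Mode::Instruct)
    } else if name.contains("llama") {
        Some(Mode::Generate)
    } else {
        None // unknown: fall back to asking the user
    }
}

fn main() {
    assert_eq!(detect_mode("llama-7b-hf"), Some(Mode::Generate));
    assert_eq!(detect_mode("vicuna-13b-v1.1"), Some(Mode::Chat));
    assert_eq!(detect_mode("gpt2"), None);
}
```

Returning `None` for unrecognized folders leaves room for whatever fallback the truncated sentence above proposes, such as prompting the user.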
Alpaca 7b should have the exact same structure, so as long as you can convert the weights into the same format with `convert.py`, it should be runnable out of the...
Use cases:
1. You can fit the whole model into GPU RAM
2. You can fit part of the model into GPU RAM
3. You need to keep all the model...
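A rough way to reason about case 2 is per-layer: keep as many layers on the GPU as the VRAM budget allows and run the rest from host memory. The 13476 MB total comes from the log above; the 32-layer split and the budgeting function are assumptions for illustration:

```rust
// Sketch: how many transformer layers fit in a VRAM budget?
// The per-layer size and layer count are illustrative assumptions,
// not values read from llama-dfdx.
fn layers_that_fit(vram_mb: u64, mb_per_layer: u64, num_layers: u64) -> u64 {
    (vram_mb / mb_per_layer).min(num_layers)
}

fn main() {
    // LLaMa 7b is ~13476 MB total; assuming 32 layers => ~421 MB each.
    let per_layer = 13476 / 32; // 421 MB
    // With an 8 GB card only part of the model fits (use case 2):
    let on_gpu = layers_that_fit(8192, per_layer, 32);
    println!("{on_gpu} of 32 layers on GPU");
    // A 16 GB card holds everything (use case 1):
    assert_eq!(layers_that_fit(16384, per_layer, 32), 32);
}
```

Case 3 corresponds to a budget too small for even one layer, where weights stream through GPU memory instead of residing there.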