TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Hi, may I ask a simple question: you claim 24K tokens/s with the 1.1B model, which is 56% efficiency. But my CUDA code with pure cuBLAS GEMM calls on a 2048*2048 matrix...
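For reference, a rough back-of-the-envelope utilization check (a sketch, not the project's own accounting; it assumes an A100 bf16 peak of 312 TFLOPS and the common 6*N FLOPs-per-token training approximation):

```python
# Rough model-FLOPs-utilization (MFU) estimate for the 24K tokens/s figure.
# Assumptions (not from the thread): A100 bf16 dense peak of 312 TFLOPS and
# the 6*N FLOPs-per-token approximation for training (weights only).

params = 1.1e9           # TinyLlama parameter count
tokens_per_sec = 24_000  # reported per-GPU training throughput
peak_flops = 312e12      # assumed A100 bf16 peak

flops_per_token = 6 * params                 # forward + backward, weight FLOPs only
achieved = tokens_per_sec * flops_per_token  # FLOPs/s actually sustained
print(f"MFU ~ {achieved / peak_flops:.0%}")  # ~51%; counting attention FLOPs pushes this higher
```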
Hello, is this LLaMA's tokenizer, or did you train one yourselves?
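One quick way to check locally (a minimal sketch; the hub id is an assumption about which checkpoint is meant) is to load the tokenizer and compare it against LLaMA's 32,000-token SentencePiece vocabulary:

```python
# Load the tokenizer from the hub and inspect its class and vocab size.
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T")
print(tok.__class__.__name__, tok.vocab_size)  # a LLaMA-style tokenizer reports 32000
```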
Hi, is it possible to convert these weights into the https://github.com/facebookresearch/llama/tree/main/llama format?
How to train the model using TPUs?
Hi~ Great work! I notice the Chinese version of the README seems to be outdated, e.g., the HF space, news, and the missing intermediate checkpoint download link. So I tried to...
Hi, I found the following strange phenomena when running TinyLlama pretraining. 1. When using multiple GPUs, I got **completely different results** when **running the same code twice**. Further, many...
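For anyone hitting the same thing, below is a minimal sketch of the PyTorch knobs that usually matter for run-to-run reproducibility; whether they fully remove the discrepancy in this pretraining loop is an assumption:

```python
# Seed every RNG and disable non-deterministic kernel selection before training.
import random
import numpy as np
import torch

def seed_everything(seed: int = 1234) -> None:
    random.seed(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)
    torch.cuda.manual_seed_all(seed)

seed_everything()
torch.backends.cudnn.benchmark = False                    # no autotuned, run-dependent kernels
torch.use_deterministic_algorithms(True, warn_only=True)  # warn on ops without deterministic impls
# Note: multi-GPU all-reduce ordering and some fused kernels can still introduce
# small floating-point differences between runs.
```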
I tried to load the model with transformers, `small_model = AutoModelForCausalLM.from_pretrained(approx_model_name, torch_dtype=torch.float16, device_map="auto", trust_remote_code=True)`, but an error occurs: `OSError: Unable to load weights from pytorch checkpoint file for '/mnt/data3/lyk/models/tinyllama-1.1b/pytorch_model.bin' at...
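One way to narrow that OSError down (a sketch using the path from the report above) is to call torch.load on the checkpoint directly, which surfaces the underlying cause, e.g. a truncated download or a torch version mismatch:

```python
# Try to deserialize the checkpoint on its own to see the real error message.
import torch

path = "/mnt/data3/lyk/models/tinyllama-1.1b/pytorch_model.bin"
try:
    state_dict = torch.load(path, map_location="cpu")
    print(f"loaded {len(state_dict)} tensors")
except Exception as exc:  # print whatever torch raises instead of the wrapped OSError
    print(type(exc).__name__, exc)
```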
You are welcome to publish the model and code files to the wisemodel.cn open-source community.
I used `TinyLlama-1.1B-intermediate-step-1431k-3T` for conversation under the FastChat framework. I asked the question "What's your name?" and the answer I got is:
```bash
python -m fastchat.serve.cli --model-path $my_path_to_tiny_llama/tiny_llama/TinyLlama-1.1B-intermediate-step-1431k-3T/
: What's your...
```
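Note that the intermediate-step-1431k-3T checkpoint is a base (not chat-tuned) model, so free-form questions often get odd answers. A minimal transformers sketch to sanity-check generation outside FastChat, keeping the placeholder path from the report:

```python
# Greedy generation directly with transformers, bypassing FastChat's chat wrapper.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder path copied from the report above; replace with the real local path.
path = "$my_path_to_tiny_llama/tiny_llama/TinyLlama-1.1B-intermediate-step-1431k-3T"
tok = AutoTokenizer.from_pretrained(path)
model = AutoModelForCausalLM.from_pretrained(path, torch_dtype=torch.float16, device_map="auto")

inputs = tok("What's your name?", return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=32, do_sample=False)
print(tok.decode(out[0], skip_special_tokens=True))
```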