OLMo issues

Rename

Fixes https://github.com/allenai/OLMo/issues/457 As this is a breaking change, it probably makes sense to release a new version with it

Muennighoff

Olmo / OLMo consistency

7

### 🐛 Describe the bug In the HF code we use OLMo but in training it's Olmo - This creates some inconsistencies when importing from the training modeling file ----...

Muennighoff

type/bug

Fix a bug w.r.t. how local tokenizers are handled

1

`hf_olmo/convert_olmo_to_hf.py` currently crashes if the YAML file in the input checkpoint refers to a local tokenizer (it tries to load the local path from HF). I added a check to...

gahdritz

Set fs_local_rank as global_rank when FS_LOCAL_RANK is not available

1

In scritp `scripts/run_with_environment.sh`，`FS_LOCAL_RANK` is set as `RANK`. ``` export RANK=$SLURM_PROCID export FS_LOCAL_RANK=$SLURM_PROCID ``` If the job is not launched by `scripts/run_with_environment.sh` and all ranks share the same filesystem, every local...

hxdtest

Does not support flash attention 2.0 on transformers.AutoModelForCausalLM.from_pretrained

### 🚀 The feature, motivation and pitch I am using Olmo 7B for RAG for efficient inference on low GPU resources but does not support flash attention 2.0 Here is...

KaifAhmad1

type/feature

Exception raised when passing a config to AutoModelForCausalLM.from_pretrained

### 🐛 Describe the bug model_config = AutoConfig.from_pretrained(pretrained_model_name_or_path=model_name) model = AutoModelForCausalLM.from_pretrained( model_name, config=model_config, cache_dir=cache_dir, local_files_only=False, revision=revision, trust_remote_code="True") ...... File "/opt/homebrew/Caskroom/miniconda/base/envs/whale/lib/python3.11/site-packages/transformers/models/auto/auto_factory.py", line 560, in from_pretrained cls.register(config.__class__, model_class, exist_ok=True) File "/opt/homebrew/Caskroom/miniconda/base/envs/whale/lib/python3.11/site-packages/transformers/models/auto/auto_factory.py", line...

hyxu678

type/bug

Fine-tuning starting from a checkpoint

### ❓ The question I would like to fine-tune OLMo-1B starting from one of its checkpoints with my data. I understand that there are three steps in order to accomplish...

0novanta

type/question

OLMo
OLMo copied to clipboard

Metadata

Rename

Olmo / OLMo consistency

Fix a bug w.r.t. how local tokenizers are handled

Set fs_local_rank as global_rank when FS_LOCAL_RANK is not available

Does not support flash attention 2.0 on transformers.AutoModelForCausalLM.from_pretrained

Exception raised when passing a config to AutoModelForCausalLM.from_pretrained

Fine-tuning starting from a checkpoint

Only rank0 log metrics to console

[Storage Cleaner] Unsharding improvements

Firehose Logging

← Metadata

Owner

Metadata

OLMo OLMo copied to clipboard

Metadata

← Metadata

Owner

Metadata

OLMo
OLMo copied to clipboard