LMOps
General technology for enabling AI capabilities w/ LLMs and MLLMs
I got this error while fine-tuning: File "/mnt/oss-data/xxx/minillm/transformers/src/transformers/generation/utils.py", line 3000, in sample next_tokens = torch.multinomial(probs, num_samples=1).squeeze(1) RuntimeError: probability tensor contains either `inf`, `nan` or element < 0 next_tokens =...
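For context, the failing line is the standard sampling step in the bundled transformers code; this error usually means the logits contained `inf`/`nan` (often from fp16 overflow) before softmax. Below is a minimal sketch of a guard around that step. It is not the MiniLLM or transformers code, and `safe_sample` / `next_token_logits` are illustrative names.

```python
import torch

# Sketch only: guard the sampling step against NaN/Inf probabilities,
# which typically come from fp16 overflow or an unstable loss during fine-tuning.
def safe_sample(logits: torch.Tensor) -> torch.Tensor:
    # Work in float32 so normalization itself cannot overflow.
    logits = logits.float()
    # Replace non-finite logits so softmax cannot produce NaN/Inf probabilities.
    logits = torch.nan_to_num(logits, nan=-1e4, posinf=1e4, neginf=-1e4)
    probs = torch.softmax(logits, dim=-1)
    return torch.multinomial(probs, num_samples=1).squeeze(1)

# Usage (illustrative): next_tokens = safe_sample(next_token_logits)
```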
Hi, I'm using ZeRO with optimizer and parameter offload to run minillm on 2 H100 GPUs on a single node. After the generation evaluation, I get a timeout during...
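If the timeout is raised because other ranks sit at a collective while one rank is still finishing the generation evaluation, one common mitigation (a sketch, not necessarily the MiniLLM fix) is to raise the distributed timeout before engine initialization; the value below is illustrative.

```python
from datetime import timedelta
import deepspeed

# Sketch only: lengthen the NCCL collective timeout so ranks waiting on a
# long generation evaluation do not abort with a watchdog timeout.
deepspeed.init_distributed(dist_backend="nccl", timeout=timedelta(hours=2))
```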
The processed data size is 55G. Are you sure about that size? Could you provide the processed SFT data link and the pre-training data link separately? Thanks for open-sourcing this. 🙏🏻
Module: [EthosBinaryTask](https://github.com/microsoft/LMOps/blob/main/prompt_optimization/tasks.py). Two questions: 1. df = df[(df[1] = 0.7)]: why is this condition used to filter the data? It does not appear in the paper. 2. exs = [{'id': x['index'], 'text':...
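If the intent of that filter is to keep only unambiguous examples so the binary hate/non-hate labels are reliable, the pandas pattern would look roughly like the sketch below. The file path, column index, and thresholds are assumptions for illustration, not the values from tasks.py or the paper.

```python
import pandas as pd

# Sketch only: drop rows with ambiguous annotation scores and keep examples
# that are clearly non-hate or clearly hate, so binary labels are trustworthy.
df = pd.read_csv("ethos_binary.csv", sep=";", header=None)  # hypothetical path
df = df[(df[1] <= 0.3) | (df[1] >= 0.7)]  # thresholds are illustrative
```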
When the following code is executed, an error occurs (the attached screenshot failed to upload): model, optimizer, _, lr_scheduler = deepspeed.initialize( model=model, optimizer=optimizer, args=args, lr_scheduler=lr_scheduler, mpu=None, config_params=ds_config )
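Since the traceback is missing, one way to isolate whether the problem is in the config or in the arguments is to start from a minimal working call. Below is a sketch using the public DeepSpeed API; the placeholder model and config values are illustrative, not MiniLLM's, and it must be launched with the `deepspeed` launcher so the distributed environment is set up.

```python
import deepspeed
import torch

# Placeholder model/optimizer, illustrative config values.
model = torch.nn.Linear(16, 16)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-5)

ds_config = {
    "train_micro_batch_size_per_gpu": 4,
    "gradient_accumulation_steps": 1,
    "fp16": {"enabled": False},
}

model_engine, optimizer, _, lr_scheduler = deepspeed.initialize(
    model=model,
    optimizer=optimizer,
    lr_scheduler=None,
    mpu=None,
    config=ds_config,  # `config_params` is an older alias for this argument
)
```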
Hello, I would like to ask: can I apply MiniLLM to the Phoenix model? What code do I need to modify, and how? Thanks for your help.
``` causal_mask = self.bias[:, :, key_length - query_length : key_length, :key_length] ``` but in Structured Prompting, key_length exceeds max_positions. How can I address this issue? Thank you.
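One common workaround (a sketch, not the Structured Prompting or transformers source) is to build the causal mask dynamically instead of slicing the fixed-size registered `bias` buffer, so it can cover any key_length; the function name below is illustrative.

```python
import torch

# Sketch only: construct the causal mask on the fly when key_length exceeds
# the pre-registered max_positions buffer.
def causal_mask(query_length: int, key_length: int, device=None) -> torch.Tensor:
    # Query row i may attend to key positions 0 .. (key_length - query_length + i).
    full = torch.tril(torch.ones(key_length, key_length, dtype=torch.bool, device=device))
    return full[key_length - query_length : key_length, :key_length].view(
        1, 1, query_length, key_length
    )

# e.g. causal_mask(4, 12) has shape (1, 1, 4, 12)
```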
Bumps [werkzeug](https://github.com/pallets/werkzeug) from 3.0.1 to 3.0.3. Release notes Sourced from werkzeug's releases. 3.0.3 This is the Werkzeug 3.0.3 security release, which fixes security issues and bugs but does not otherwise...
Bumps [jinja2](https://github.com/pallets/jinja) from 2.11.3 to 3.1.4. Release notes Sourced from jinja2's releases. 3.1.4 This is the Jinja 3.1.4 security release, which fixes security issues and bugs but does not otherwise...
Is it LLaMA 1 or LLaMA 2? Thanks.