dsir

Published 20 hours ago • p-lambda

Metadata

DSIR large-scale data selection framework for language model training

Readme
Issues

Results 5 dsir issues

KL reduction calculation

How do you calculate the KL reduction for the dataset feature distribution?

Schopenhauer-loves-Hegel

The code for KL

Hi, can you release the code for computing the KL reduction in Figure 3 of the paper? Thank you very much!

BeachWang

Reproduce experiments for table 4

We kindly request the release of the code for DSIR with a neural importance weight estimator.

MarkDeng1

Reproduction of experiments

Hi, we followed the training pipeline in `experimental` to replicate the DSIR results. However, our average performance reached only 81.05, significantly below the reported benchmark of 82.30. Are there any...

BeachWang

About

DSIR large-scale data selection framework for language model training

data

language-models

large-scale

data-filtering

data-selection

importance-resampling

223

Stars

19

Forks

Watchers

Owner

p-lambda

KL reduction calculation

add chinese language support

The code for KL

Reproduce experiments for table 4

Reproduction of experiments
