data-selection topic
Transformers-Domain-Adaptation
:no_entry: [DEPRECATED] Adapt Transformer-based language models to new text domains
Patron
[ACL 2023] The code for our ACL'23 paper Cold-Start Data Selection for Few-shot Language Model Fine-tuning: A Prompt-Based Uncertainty Propagation Approach
dsir
DSIR large-scale data selection framework for language model training
InstructionGPT-4
InstructionGPT-4
data-selection-survey
This is a collection of research papers for A Survey on Data Selection for Language Models
LESS
[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning
SCAR
[ACL 2025 main] SCAR: Data Selection via Style Consistency-Aware Response Ranking for Efficient Instruction-Tuning of Large Language Models