Miguel de Benito Delgado issues

Results: 72 issues of Miguel de Benito Delgado

Implement usual downstream tasks for IF testing

Ideas:
- Mislabeled data detection
- Active learning / subset selection (compare with random)

enhancement

benchmarking

Rename value.shapley.naive to exact

cleanup

Implement exact methods for LOO

The branch feature/loo (https://github.com/appliedAI-Initiative/pyDVL/compare/develop...feature/loo) implements LOO for linear smoothers, for which a closed-form solution is known. Finish it and add tests.

good first issue

new-method
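
To illustrate the closed form mentioned in the LOO issue above, here is a minimal sketch. It does not reproduce what the feature/loo branch implements; it assumes (ridge) least squares as the linear smoother, and the function names and the `ridge` parameter are hypothetical. For such a smoother with hat matrix H, the leave-one-out prediction residual of point i equals the ordinary residual divided by 1 - H_ii, so no refitting is needed. The brute-force version is included only as a check.

```python
import numpy as np


def loo_residuals_closed_form(X, y, ridge=0.0):
    """Leave-one-out prediction residuals without refitting (hypothetical helper)."""
    d = X.shape[1]
    # Hat matrix H = X (X^T X + ridge * I)^{-1} X^T maps y to the fitted values.
    gram = X.T @ X + ridge * np.eye(d)
    H = X @ np.linalg.solve(gram, X.T)
    residuals = y - H @ y
    # Closed form: e_i^(loo) = e_i / (1 - H_ii)
    return residuals / (1.0 - np.diag(H))


def loo_residuals_brute_force(X, y, ridge=0.0):
    """Same quantity, computed by refitting n times (for verification only)."""
    n, d = X.shape
    out = np.empty(n)
    for i in range(n):
        mask = np.arange(n) != i
        Xi, yi = X[mask], y[mask]
        beta = np.linalg.solve(Xi.T @ Xi + ridge * np.eye(d), Xi.T @ yi)
        out[i] = y[i] - X[i] @ beta
    return out


# Synthetic data, purely for illustration.
rng = np.random.default_rng(0)
X = rng.normal(size=(30, 3))
y = X @ np.array([1.0, -2.0, 0.5]) + 0.1 * rng.normal(size=30)

fast = loo_residuals_closed_form(X, y, ridge=0.1)
slow = loo_residuals_brute_force(X, y, ridge=0.1)
assert np.allclose(fast, slow)
```

A LOO value under a squared-error utility could then be derived from these residuals, but the exact utility definition is left open here since the issue does not specify it.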

Interruptible samplers

#319 introduces `PermutationSampler` but it does not include the possibility of interrupting the sampling within a permutation, as required for TCMS. One possibility would be to make samplers not simple...

enhancement

breaking-change

Implement NTK scorer

Introduced in _Zhaoxuan Wu, Yao Shu, and Bryan Kian Hsiang Low, “[DAVINZ: Data Valuation Using Deep Neural Networks at Initialization](https://proceedings.mlr.press/v162/wu22j.html),” in Proceedings of the 39th International Conference on Machine Learning...

new-method

Implement stratified sampling

* [ ] Implement the optimal strategy described in [1].
* [x] #226
* [ ] Reproduce their results (possibly, but not necessarily including the stratified sampling strategy of [2]),...

good first issue

paper reproduction

Unify interfaces for data valuation

tbd. See:
- [ ] #325
- [ ] #463

enhancement

design-problem

good first issue

Notebook illustrating semi-values

Benchmarking tools

Make plotting functions composable