distributed-learning-contributivity
Simulate collaborative ML scenarios, experiment with multi-partner learning approaches, and measure the respective contributions of different datasets to model performance.
The batch size is clipped here --> https://github.com/SubstraFoundation/distributed-learning-contributivity/blob/b72fa98c0b4db45d368f577d0f6d1a861b1610c2/scenario.py#L584 So if it's clipped, it means that sometimes we don't use all the data in the dataset. But we don't give any...
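A minimal sketch of why clipping can silently drop data. The names and the cap below are assumptions for illustration, not the actual scenario.py code; it assumes the per-partner batch size is derived from the data volume and a fixed batch count, then capped by a constant:

```python
# Hypothetical illustration of the clipping discussed above.
MAX_BATCH_SIZE = 20  # assumed cap, not the real constant

def effective_samples_per_epoch(nb_samples, batch_count):
    """Samples actually seen per epoch once the batch size is clipped."""
    batch_size = min(nb_samples // batch_count, MAX_BATCH_SIZE)
    return batch_size * batch_count  # can be < nb_samples, with no warning

# Example: 10,000 samples split into 20 batches would need a batch size of 500,
# but clipping to 20 means only 20 * 20 = 400 samples are used per epoch.
print(effective_samples_per_epoch(10_000, 20))  # -> 400
```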
The MNIST NN has 1,199,882 parameters, for an accuracy of 0.9893 +/- 0.01 (Keras fit, epochs=12, batch_size=168). For comparison, the CIFAR model has 1,250,858 parameters and reaches 0.7888 +/-...
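For reference, the classic Keras MNIST CNN example has exactly 1,199,882 trainable parameters; a sketch is below, assuming (but not confirming) that this is the architecture behind the figures above:

```python
# Sketch of the classic Keras MNIST CNN; model.summary() reports 1,199,882
# trainable parameters for this architecture.
from tensorflow.keras import layers, models

def build_mnist_cnn(input_shape=(28, 28, 1), num_classes=10):
    model = models.Sequential([
        layers.Conv2D(32, (3, 3), activation="relu", input_shape=input_shape),
        layers.Conv2D(64, (3, 3), activation="relu"),
        layers.MaxPooling2D(pool_size=(2, 2)),
        layers.Dropout(0.25),
        layers.Flatten(),
        layers.Dense(128, activation="relu"),
        layers.Dropout(0.5),
        layers.Dense(num_classes, activation="softmax"),
    ])
    model.compile(optimizer="adam",
                  loss="categorical_crossentropy",
                  metrics=["accuracy"])
    return model
```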
See above: one example, sometimes there is a #, sometimes there isn't. 2021-01-14 17:54:30 | INFO | ### Splitting data among partners: 2021-01-14 17:54:30 | INFO | Train data split: 2021-01-14...
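A small sketch of one way to make the headers consistent, assuming a hypothetical helper wrapped around the standard logging module (the actual project may use a different logging setup):

```python
import logging

logger = logging.getLogger("mplc")  # assumed logger name

def log_section(title):
    """Hypothetical helper: always prefix section headers the same way."""
    logger.info("### %s", title)

def log_detail(message):
    """Regular progress lines keep no prefix."""
    logger.info(message)

# log_section("Splitting data among partners:")
# log_detail("Train data split: ...")
```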
Currently we can set the validation and test sets as local or global. It could be interesting to allow both usages.
@bowni @Thomas-Galtier @arthurPignet What would happen if we removed this threshold? https://github.com/SubstraFoundation/distributed-learning-contributivity/blob/808ee93c8593d3b226d3d1aeaa370c6d9d9689f8/mplc/contributivity.py#L452
To gather all release notes
The repo is now 1.1 GB, which is not okay. It is likely that a dataset has been added somewhere. I will investigate this issue, but any help is welcome for...
When using the ESC50 dataset, we preprocess the data at each instantiation of a dataset. However, the preprocessing is quite long. It would be great to store the preprocessed inputs...
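A minimal sketch of the idea, with hypothetical paths and function names: cache the preprocessed ESC50 inputs on disk so the expensive step runs only once instead of at every dataset instantiation.

```python
# Hypothetical caching helper; CACHE_PATH and preprocess_fn are assumptions.
import os
import numpy as np

CACHE_PATH = "esc50_preprocessed.npz"  # assumed cache location

def load_or_preprocess(preprocess_fn, raw_data_dir):
    """Return preprocessed inputs, reading them from the cache if present."""
    if os.path.exists(CACHE_PATH):
        cached = np.load(CACHE_PATH)
        return cached["x"], cached["y"]
    x, y = preprocess_fn(raw_data_dir)   # the expensive step
    np.savez_compressed(CACHE_PATH, x=x, y=y)
    return x, y
```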
We are used to getting good performance results on the MNIST dataset (often reaching >80% accuracy) independently of the scenario configuration, which allows for a good comparison of the contributivity methods implemented. For CIFAR...