Julien Vignoud
Julien Vignoud
Addresses items listed in #654 #656 #657 Fixes #656: - [x] Fixes text preprocessing, now generation doesn't repeat the last input token and memory leak is fixed - [x] Add...
Here is a list of potential improvements for gpt-tfjs in Disco: - [x] Create a compile method to initialize the optimizer (rather than initializing it when fitDataset is called). This...
Here are some issues with gpt-tfjs I noted while implementing tokenization: - [x] There is a memory leak in the training loop. The memory doesn't grow much (~0.01MB per iteration)...
Benchmark and report some information on what to expect and the row for improvement: - [ ] number of training iterations needed on an example dataset, expected loss/perplexity, example of...
A recent [refactoring](https://github.com/epfml/disco/commit/0892e405036ef46f9f8948be0795dff0bf8089d7) enforced tasks' preprocessing in a lazy and streaming fashion. The preprocessing first defines stateless preprocessing functions (e.g. `resize` and `normalize` for images) and then apply them successively...
#690 should be addressed first. Adding a custom task is currently very convoluted, and the process is not documented enough. For example, the no-code UI for adding a custom task...
After reviewing the DISCO website as a new user, here is a list of things that can be fixed and improved. This issue goes in pair with [#627](https://github.com/epfml/disco/issues/627) which addresses...
This issue addresses the overall web app structure, taking a step back from implementation details to look at the website's objectives and user experience. The following contains reflections and to-dos...
Differential Privacy is implemented but never used. It seems that it was intentionally removed during a code refactoring (in [this PR](https://github.com/epfml/disco/pull/578/files#diff-bb5fd2b79ca040c5cdd1e0c4ec57de0ef4cfe3413a482d6c9b4b35a38b0e93e1), l.179 of decentralized/base.ts) - [ ] Integrate DP again...
Starting a collaborative training on the decentralized MNIST task fails during the first round. There are a few things not working well: * It takes a long time before anything...