Kurt Shuster comments

Results 198 comments of


                                            Kurt Shuster

User speaks twice

mutators might be the way to go here; closing this issue as there's been no activity

List of Usability Changes

- [ ] provide an easy way to step through model forward passes to actually examine outputs of the modules

Assuming we have the following: ``` CHECKPOINT=/path/to/fsdp_sharded_checkpoint/checkpoint_last CONSOLIDATED=/path/to/new_consolidated_checkpoint/ RESHARDED=/path/to/new_resharded_checkpoint/ MP=16 ``` ### Step 0 (Optional, if necessary) [Consolidate the model](https://github.com/facebookresearch/metaseq/blob/bbcedfebb4c35f71cdda1f1a358491f3996a9fc3/metaseq/scripts/consolidate_fsdp_shards.py) from the FSDP shards into one checkpoint: ```bash python consolidate_fsdp_shards.py...

Kurt Shuster

User speaks twice

MemNN broken interactive

List of Usability Changes

DGX 2 with 16 V100

DGX 2 with 16 V100

DGX 2 with 16 V100

Repetition Penalties, Factual Nucleus

Repetition Penalties, Factual Nucleus