Hanlin Tang comments

Results 71 comments of


                                            Hanlin Tang

take resource on both GPU, while cmd line only spec-ed one: --device_id 0

Thanks, we've noticed this internally as well.

Predictions for number of samples < be.bsz

Layer buffers in neon are pre-allocated during model creation for a particular batch size. To run prediction on `samples < be.bsz`, you could either regenerate the model with a new...

Fatal Error in MacOS Sierra when running 'make' after installing dependencies

I am not familiar with spyder IDE, but likely there is a virtualenv setting needed somewhere (or sypder need to be installed in and run from your virtualenv). These may...

Multi task learning use Neon

Hello, You will first want to use the `MergeBroadcast` container (see `examples/image_caption.py` for an example where you have two inputs) to build your model. For the data side, you can...

how convert python train scripts to yaml config file ?

We don't have an automatic way of converting python train scripts to yaml. Also note that yaml support can be limited especially for more complex models with aeon-based dataloaders. Is...

Refactor away `composer.core`

Yes I would love to move away from `core`! A few other suggestions/thoughts: * turn folders -> _.py files whenever we don't actually need them * should we be putting...

Lengthy dataloader restoration with NLP datasets

The fast-fowarding behavior is required for CV, since it needs to replay the random augmentations to set the RNG state properly. We could add in a specific shortcut for NLP...

Focal Loss, Taylor Cross Entropy Loss, SnapMix, Adaptive Gradient Clipping

Thanks @alexriedel1 , for suggesting these. Yes, for the loss functions its just adding them to `loss.py`. We can take care of algorithm-izing them a bit later. For `SnapMix`, see...

Focal Loss, Taylor Cross Entropy Loss, SnapMix, Adaptive Gradient Clipping

@alexriedel1 , just an update -- we implemented adaptive gradient clipping in #924, give it a try/review and let us know if it helps!

Remove `xfail` in `tests/algorithms/algorithm_settings` with a `simple_hf_settings` option

Yes, we should add canonical settings for NLP for each algorithm!