Sheng Zha

Results 163 comments of Sheng Zha

@StrayBird-ATSH yes, let's lift the limit given that the process is parallelized.

Looks like all the crashes on the mxnet side in the detailed log are in the sparse features. The spacy issues are likely due to upstream changes that are not...

@mathephysicist could you also provide some context on what the interface should look like? Any reference doc? @sxjscience a SageMaker training job is different from the notebook.

Indeed. On the other hand, the concern with having a C library is that it complicates the testing setup here. We might not have the luxury of the testing suites of many...

One approach that @eric-haibin-lin took was to include a Makefile and the C code in the example, and ask users to compile it before use.

@pengxin99 this is awesome. Thanks for the contribution! Looks like git wasn't able to do an auto-merge when checking out. Could you try the following for a rebase? ```bash git...

@pengxin99 the problem was that the parameter's get method relies on current_context when grad() is called without an argument. I will post a fix in GluonNLP first and see how to...

@eric-haibin-lin will help drive this PR forward. cc @pengxin99

To publish the benchmark numbers, we need to pick the environment. We've been using EC2, so whoever contributes this may require access to AWS and EC2.