Jiao Wang
You may use the data: hdfs://172.16.0.105:8020/user/root/jwang/wnd_twitter_2
> > You may use the data: hdfs://172.16.0.105:8020/user/root/jwang/wnd_twitter_2 > > I will take a look. Since it's a private repo, would you mind giving me permission? Yes, I already sent...
I changed the loss function and still get this error. Maybe it is an environment problem. Can you try "py37-horovod-tf" on almaren-node-107?
I get this error: terminate called after throwing an instance of 'gloo::IoException' (pid=11655, ip=172.16.0.121) what(): [/tmp/pip-install-x2psu8_w/horovod_8e87f6e8dcad47a6a27653365dfc240d/third_party/gloo/gloo/transport/tcp/unbound_buffer.cc:69] Timed out waiting 30000ms for recv operation to complete
@leonardozcm I can run it in your conda environment. Yes, it is an environment issue. Can you write up your installation and configuration steps and add them to the documentation? @helenlly maybe we need...
We need to persist the table to avoid recomputing it on every iteration. - [x] cache() - [x] uncache()
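To illustrate why persisting helps, here is a minimal pure-Python sketch. `LazyTable` and `expensive_rows` are hypothetical stand-ins invented for this example, not the project's table API. The sketch also materializes eagerly inside `cache()`, whereas Spark's `cache()` defers materialization to the first action.

```python
class LazyTable:
    """Toy stand-in for a lazily evaluated table (hypothetical, not a real API)."""

    def __init__(self, compute):
        self._compute = compute   # zero-argument function that produces the rows
        self._cached = None       # materialized rows once cache() has been called
        self.evaluations = 0      # number of times the pipeline actually ran

    def collect(self):
        # Serve from the cache when present; otherwise recompute from scratch.
        if self._cached is not None:
            return self._cached
        self.evaluations += 1
        return self._compute()

    def cache(self):
        # Materialize once and keep the result (eager here, for simplicity).
        if self._cached is None:
            self.evaluations += 1
            self._cached = self._compute()
        return self

    def uncache(self):
        # Drop the materialized rows so the memory can be reclaimed.
        self._cached = None
        return self


def expensive_rows():
    # Placeholder for an expensive transformation pipeline.
    return [x * x for x in range(5)]


table = LazyTable(expensive_rows)

# Without cache(), every action re-runs the pipeline.
table.collect()
table.collect()
uncached_runs = table.evaluations

# With cache(), the pipeline runs once and later actions reuse the result.
table.cache()
table.collect()
table.collect()
cached_runs = table.evaluations - uncached_runs
```

After `uncache()`, the next `collect()` recomputes the rows, so the pair is best wrapped around the loop that reuses the table.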
> Why do we repartition previously? @jenniew The previous repartition is in `maybe_dataframe_to_xshards()`, which was originally created for the Ray estimator. It avoids an empty partition for a worker when partitions < num...
> > > Why must we have the number of partitions equal to the number of workers? Repartitioning is expensive; if the number of partitions is already larger than...
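One possible reading of the trade-off discussed above is to repartition only when there are fewer partitions than workers. The sketch below encodes that rule as a small helper. `plan_partitions` is a hypothetical name, and the rule itself is an assumption, since the comments are truncated and do not state the final decision.

```python
def plan_partitions(num_partitions: int, num_workers: int) -> int:
    """Return the partition count to use (assumed rule, not the project's code).

    Repartition up to num_workers only when some worker would otherwise get
    no partition; keep the existing layout when there are already enough.
    """
    if num_partitions < 0 or num_workers <= 0:
        raise ValueError("num_partitions must be >= 0 and num_workers must be > 0")
    if num_partitions < num_workers:
        # Too few partitions: at least one worker would sit idle.
        return num_workers
    # Enough partitions already: skip the expensive shuffle.
    return num_partitions


def needs_repartition(num_partitions: int, num_workers: int) -> bool:
    """True when the planned count differs from the current one."""
    return plan_partitions(num_partitions, num_workers) != num_partitions
```

With 2 partitions and 4 workers the helper asks for 4 partitions, while with 16 partitions and 4 workers it leaves the data untouched.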
On SPR, I tested bigdl-llm with the codellama-7b-hf model on transformers 4.36.2. -------------------- Prompt -------------------- Once upon a time -------------------- Output -------------------- Once upon a time Once upon a time,...
I tested on SPR with transformers 4.36.2. -------------------- Prompt -------------------- Once upon a time -------------------- Output -------------------- Once upon a time Once upon a time, there was a king who...