ps-lite
ps-lite copied to clipboard
A lightweight parameter server interface
include/ps/internal/utils.h: 29 template 30 inline V GetEnv(const char *key, V default_val) { 31 const char *val = Environment::Get()->find(key); 32 if (val == nullptr) { 33 return default_val; 34 } else...
How to enable parameter replication in ps-lite?
Hi I am new to ps-lite and I am trying to use ps-lite as part of a university group project. I am using the KVServer and KVWorker. Whenever I deploy...
is there any project demostrating abilities and best practices of ps-lite ? mxnet is great, but too heavy for understanding ps-lite. I am expecting one thousand lines codes which using...
'Bounded Delay' consistency model is mentioned in the paper "Scaling Distributed Machine Learning with the Parameter Server". How to implement it in ps-lite? Can give some tips? Thx.
I tried playing with parameter_server linear example and killing a server process/node hangs the running process. Shouldn't the replicated node take over for the killed server as described in the...
Hey, I've got everything working ! At least - what is in the current repository. The documents talk about gradient optimization implementations here: http://ps-lite.readthedocs.io/en/latest/overview.html Are those intended to be an...
Hi @mli I have a question about this project, according to some your docs, such as the asynchronous sgd, i find that the worker nodes will get the global model...
When building test routines, this appeared: ``` tests/test_kv_app.cc: In function ‘void RunWorker()’: tests/test_kv_app.cc:44:43: error: ‘fabs’ was not declared in this scope res += fabs(rets[i] - vals[i] * repeat); ``` my...
when I start a cluster by dmlc_ssh.py,I find it not use ib0. In my hosts : 10.10.10.4 Server_10_10_10_4 10.10.101.4 Server_10_10_10_4_IB eth0 ip is 10.10.10.4,ib0 ip is 10.10.101.4 In https://github.com/dmlc/ps-lite/blob/master/tracker/tracker.py line...