Xiaoyu Zhai
Xiaoyu Zhai
## Background MongoDB version: 4.2.6 mongoengine version: 0.23.1 pymongo: 3.12.0 In my scenario, I need to connect with `authentication_source` and `authentication_mechanism` and then do some queries. My code goes like...
Followed https://aws-dglke.readthedocs.io/en/latest/dist_train.html, and got an error of `recv` ``` /dgl_workspace/dgl/src/graph/network/tcp_socket.cc:180: recv error: Resource temporarily unavailable terminate called after throwing an instance of 'dmlc::Error' what(): [04:04:56] /dgl_workspace/dgl/src/graph/network/socket_communicator.cc:282: Check failed: tmp !=...
There is an issue that will be raised in distributed training, like ``` Traceback (most recent call last): File "/usr/local/bin/dglke_server", line 33, in sys.exit(load_entry_point('dglke==0.1.0.dev0', 'console_scripts', 'dglke_server')()) File "/usr/local/lib/python3.6/site-packages/dglke-0.1.0.dev0-py3.6.egg/dglke/kvserver.py", line 178,...
## k = 1 `dglke_partition --dataset FB15k -k 1 --data_path /dgl_workspace` will get ``` load dataset.. File not found. Downloading from https://data.dgl.ai/dataset/FB15k.zip Download finished. Unzipping the file... Unzip finished. Reading...
This is Xiaoyu Zhai, from Qihoo 360 AI Infra. Currently, our team is working on DGL Operator to make DGL distributed training easier on Kubernetes. And I am glad to...