QI JUN issues

Results 22 issues of


                                            QI JUN

Support range function in Python

In Python, the [range](https://www.w3schools.com/python/ref_func_range.asp) function returns a sequence of numbers, starting from 0 by default, and increments by 1 (by default), and stops before a specified number. For example: ```...

proposal

Polish batch matmul code

# What does this PR do ? Polish the batch matmul code, make it more pytorchnic

add profiling scripts

#279 Python provides [cProfile](https://docs.python.org/2/library/profile.html#module-cProfile) module to help us profiling python codes. A common usage can be: `python -m cProfile -o train.prof train.py` Since train.prof is a binary file, we have...

polish get started tutorials

- [x] Revise and polish elasticdl local tutorial, set up a mnist training experiment with minikube quickly #1912 - [ ] Add a deployment tutorial, save model when training, and...

user experience

How to use PyTorch dataloader

## 背景介绍 PyTorch的 [Dataset class](https://pytorch.org/tutorials/beginner/data_loading_tutorial.html#dataset-class)定义我们可以发现，PyTorch要求Dataset必须提供 `__len__` 接口和 `__getitem__`接口，这就要求数据集是已知长度的，并且是可以被随机访问的。这里与TensorFlow不同，TensorFlow的Dataset是可以从一个generator创建的，generator只要求用户实现 `__next__`接口即可，并不要求 `__len__` 接口和 `__getitem__` 接口。因此，我们需要提出一种新的思路。 ## 简单的做法 1. worker从master那里拿到一个task 2. worker 使用 recordio_reader提供的接口，把该task包含的record都读到内存中 3. records是一个即知道长度，又可以随机访问的数组，我们可以从这个数组中创建一个 RecordDataset 4. RecordDataset中，每一个record都是string类型的，用户需要提供一个feed函数把string类型转换为数值类型。我们发现，这个feed函数实际上就是...

verify loading checkpoint consistency between Python PS and Go PS

We generate a checkpoint using this script https://github.com/sql-machine-learning/elasticdl/blob/develop/scripts/travis/gen_mnist_checkpoint.py. Then we load the checkpoint to a job with two PS pods. Let's verify the consistency between Python PS and Go PS.

support CategoryCrossing preprocessing layer

```python CategoryCrossing(PreprocessingLayer): """This layer transforms multiple categorical inputs to categorical outputs by Cartesian product, and hash the output if necessary. If any of the inputs is sparse, then all outputs...

data transform

unify preprocess layer in model zoo

I find that there is some keras preprocessing layer in [census_wide_deep_model](https://github.com/sql-machine-learning/elasticdl/blob/develop/model_zoo/census_wide_deep_model/keras_process_layer.py). Could we move these preprocessing layer into elasticdl preprocessing directory? Or if the preprocessing layer has been implemented already,...

How to do optimzation in pserver

In TensorFlow 1.x, or similar computation graph based deep learning framework, we could serialize the model into a protobuf file. And then, we can split the file into two parts,...

parameter server

some refactor work of parameter server

Step 1: - [ ] embedding table Python data structure - [ ] tensor proto message and Python data structure - [ ] a new PS, which binds to a...