Junfan Zhang

Results 152 issues of Junfan Zhang

### What changes were proposed in this pull request? [Feature] Introduce graceful decommission for shuffle servers ### Why are the changes needed? When shuffle servers that need to be decommissioned are included...

### Background When getting the shuffle result, I found that the flame graph shows the `getBlockIdsByPartitionId` method taking too much time. ![reliao_img_1658922962790](https://user-images.githubusercontent.com/8609142/181240913-8d38d12e-c62a-453d-a0df-f89a46e1b71a.png) But I found that the default bitmapNum is 1, and...

### Motivation In the current codebase, it is hard to analyze which step of a remote request costs the most time, because there is no corresponding trace ID on the client and server sides....

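From just reading the code, I think there is a problem here; please correct me if I am wrong. If it is indeed a problem, I will submit a pull request.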

### Why In the estimator, we need to print the final best eval result when using the best exporter.

cla: yes
S
awaiting review

### Why In our internal environment, we developed a new filesystem that implements HCFS. To serve users better, we hope users can use the tf dataset API to read the new filesystem...

### Why TonY now introduces the Sidecar TensorBoard, but it sometimes fails due to hardware problems and unstable HDFS. For users, it would be better to restart it automatically, without them noticing. So...

enhancement

Currently, a venv can only be compressed in zip format. Maybe we should also support the `tar.gz` format.