elasticdl
                                
                                
                                
                                    elasticdl copied to clipboard
                            
                            
                            
                        Master handles the errMsg that worker reports
Currently, when a worker reports an error message to the master, the message seems not correctly handled.
If we support the failure of certain records, the master should not fail the job if the failure rate is acceptable.
If we do not support any failures, the job should be in the status of failure.
Now workers will report statistics to the master. Master can make its decision based on this.