Support GPU information and job array queries in job status for SLURM

Open yh882317 opened this issue 4 years ago • 3 comments

Context

I am working on a project that aims to provide a user-side solution for higher resource utilization on SLURM clusters. It requires information about both the pending jobs in the queue and the running jobs.

Problem

The interface JobQueueScheduler.getJobStatus(jobIdentifier), in this line, returns the status of a job. However, the status only contains basic information such as the start time, the time limit, and the required number of nodes, so jobs with a GPU requirement cannot be recognized.

There is also a problem with querying jobs generated by a job array. The job array and its running jobs can be found with String[] jobIDs=scheduler.getJobs(PartitionName);. However, when I try to get the status of those jobs, an error is raised saying there are no such jobs. The pending part of the job array has an ID like 1080_[5-1024], and the running jobs have IDs like 1080_2. Invoking JobQueueScheduler.getJobStatus(jobIdentifier) with these IDs raises the error.
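To make the two ID shapes concrete, here is a minimal sketch of how they could be told apart before a status lookup. The class SlurmJobId and its field names are hypothetical and are not part of Xenon. The sketch only covers the formats mentioned above (a plain ID, 1080_2, and 1080_[5-1024]) and treats whatever sits inside the brackets as an opaque task specification.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of Xenon: splits a SLURM job ID into its parts.
final class SlurmJobId {
    // Accepts "1080", "1080_2" and "1080_[5-1024]".
    private static final Pattern ID =
            Pattern.compile("^(\\d+)(?:_(?:(\\d+)|\\[(.+)\\]))?$");

    final String baseId;    // array (or plain) job ID, e.g. "1080"
    final String taskId;    // index of a single array task, e.g. "2", or null
    final String taskSpec;  // bracketed spec of pending tasks, e.g. "5-1024", or null

    private SlurmJobId(String baseId, String taskId, String taskSpec) {
        this.baseId = baseId;
        this.taskId = taskId;
        this.taskSpec = taskSpec;
    }

    static SlurmJobId parse(String raw) {
        Matcher m = ID.matcher(raw.trim());
        if (!m.matches()) {
            throw new IllegalArgumentException("Unrecognized SLURM job ID: " + raw);
        }
        return new SlurmJobId(m.group(1), m.group(2), m.group(3));
    }

    // True for the pending remainder of an array, e.g. "1080_[5-1024]".
    boolean isPendingArray() {
        return taskSpec != null;
    }

    // True for one concrete array task, e.g. "1080_2".
    boolean isArrayTask() {
        return taskId != null;
    }
}
```

With something like this, every string returned by scheduler.getJobs(PartitionName) could be classified first, which shows that a lookup keyed only on plain numeric IDs would miss both array forms.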

Question

Is it possible to provide information about GPUs and job arrays via the job status? The implementation of the job status already maintains a map, schedulerSpecificInformation, so perhaps the related information could be added to this map. The job array queries also need to be fixed.
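As an illustration of what I have in mind, here is a rough sketch of reading a GPU count out of such a map on the user side. Everything specific in it is an assumption: the helper class GpuRequest is hypothetical, the key under which the GPU request would be stored is passed in by the caller because I do not know what Xenon would call it, and the value format (comma-separated entries such as gpu:2 or gpu:tesla:2) is only my guess at what SLURM reports, which may differ between SLURM versions.

```java
import java.util.Map;
import java.util.OptionalInt;

// Hypothetical helper, not part of Xenon: extracts a GPU count from a
// scheduler-specific key/value map.
final class GpuRequest {
    private GpuRequest() {}

    // 'key' is whichever entry holds the generic-resource request (assumed).
    // Values like "gpu:2", "gpu:tesla:2" or "craynetwork:1,gpu:4" are assumed.
    static OptionalInt gpuCount(Map<String, String> info, String key) {
        String value = info.get(key);
        if (value == null) {
            return OptionalInt.empty();
        }
        for (String entry : value.split(",")) {
            String[] parts = entry.trim().split(":");
            boolean isGpu = false;
            for (String part : parts) {
                if (part.equals("gpu")) {
                    isGpu = true;
                }
            }
            if (!isGpu) {
                continue;
            }
            String last = parts[parts.length - 1];
            if (last.matches("\\d+")) {
                return OptionalInt.of(Integer.parseInt(last));
            }
            // Assumption: a GPU entry without an explicit count means one GPU.
            return OptionalInt.of(1);
        }
        return OptionalInt.empty();
    }
}
```

If the map carried an entry like this for each job, a caller could separate GPU jobs from CPU-only jobs without any change to the getJobStatus signature.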

yh882317 • Aug 19 '20 13:08