Support GPU information and job array queries in job status for SLURM #671

@yh882317

Context

I am working on a project that aims to provide a user-side solution for higher resource utilization on SLURM clusters.
It requires information about both the pending jobs in the queue and the running jobs.

Problem

The interface JobQueueScheduler.getJobStatus(jobIdentifier) in this line returns the status of a job.
However, the status only contains basic information such as start time, time limit, and the required number of nodes. Jobs that have a GPU requirement cannot be recognized as such.
In addition, there is a problem with querying jobs generated by a job array. The pending job array and its running jobs can be found with String[] jobIDs=scheduler.getJobs(PartitionName);. However, when I try to get the status of those jobs, an error is raised saying that no such job exists. The pending part of the job array has an ID like 1080_[5-1024], and its running jobs have IDs like 1080_2.
The error is raised when JobQueueScheduler.getJobStatus(jobIdentifier) is invoked.
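
To make the three ID shapes concrete, here is a minimal, standalone sketch of how they could be told apart. It is not the library's code: the class name SlurmJobId and its fields are hypothetical, and the pattern only covers the forms mentioned in this issue (a plain ID, 1080_2, and 1080_[5-1024]).

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical helper (not part of the library) that splits a SLURM job ID into its parts. */
public class SlurmJobId {
    // <base>, optionally followed by _<task> (single array task) or _[<spec>] (pending tasks)
    private static final Pattern ID =
            Pattern.compile("(\\d+)(?:_(?:(\\d+)|\\[([^\\]]+)\\]))?");

    public final String baseId;    // e.g. "1080"
    public final String taskId;    // e.g. "2", or null if this is not a single array task
    public final String taskRange; // e.g. "5-1024", or null if this is not a pending range

    private SlurmJobId(String baseId, String taskId, String taskRange) {
        this.baseId = baseId;
        this.taskId = taskId;
        this.taskRange = taskRange;
    }

    public static SlurmJobId parse(String id) {
        Matcher m = ID.matcher(id.trim());
        if (!m.matches()) {
            throw new IllegalArgumentException("Not a recognized SLURM job id: " + id);
        }
        return new SlurmJobId(m.group(1), m.group(2), m.group(3));
    }

    /** True for both a single array task and a pending range of tasks. */
    public boolean isArrayJob() {
        return taskId != null || taskRange != null;
    }

    public static void main(String[] args) {
        for (String id : new String[] {"1079", "1080_2", "1080_[5-1024]"}) {
            SlurmJobId p = parse(id);
            System.out.println(id + " -> base=" + p.baseId
                    + ", task=" + p.taskId + ", range=" + p.taskRange);
        }
    }
}
```

A status lookup that assumes purely numeric IDs would reject the last two forms, which may be what triggers the "no such job" error, although I have not confirmed this in the source.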

Question

Is it possible to provide information about GPUs and job arrays via the job status? The implementation of the job status already maintains a map, schedulerSpecificInformation, so perhaps the related information could be added to this map. The job array queries also need to be fixed.
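
As a rough illustration of what I have in mind, the sketch below parses a key=value job description into a map and reads the GPU count from it. Everything here is an assumption on my side: the class SlurmJobInfo and its methods are hypothetical, the sample line is hand-written rather than captured output, and I assume the GPU request shows up in a TRES field as gres/gpu=N, which may differ between SLURM versions and configurations.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical sketch: collect scheduler-specific fields, including GPUs, into a map. */
public class SlurmJobInfo {

    /** Splits whitespace-separated key=value tokens; values containing spaces are not handled. */
    public static Map<String, String> parseFields(String line) {
        Map<String, String> fields = new LinkedHashMap<>();
        for (String token : line.trim().split("\\s+")) {
            int eq = token.indexOf('=');
            if (eq > 0) {
                // Split on the first '=' only, so "TRES=cpu=4,..." keeps its full value.
                fields.put(token.substring(0, eq), token.substring(eq + 1));
            }
        }
        return fields;
    }

    /** Returns the GPU count from an assumed "gres/gpu=N" entry in TRES, or 0 if absent. */
    public static int gpuCount(Map<String, String> fields) {
        String tres = fields.get("TRES");
        if (tres == null) {
            return 0;
        }
        String prefix = "gres/gpu=";
        for (String entry : tres.split(",")) {
            if (entry.startsWith(prefix)) {
                return Integer.parseInt(entry.substring(prefix.length()));
            }
        }
        return 0;
    }

    public static void main(String[] args) {
        // Hand-written sample line for illustration only.
        String sample = "JobId=1081 ArrayJobId=1080 ArrayTaskId=2 JobState=RUNNING "
                + "TRES=cpu=4,mem=16G,node=1,gres/gpu=2";
        Map<String, String> fields = parseFields(sample);
        System.out.println("GPUs requested: " + gpuCount(fields));
        System.out.println("Array job: " + fields.get("ArrayJobId")
                + ", task: " + fields.get("ArrayTaskId"));
    }
}
```

If the map behind schedulerSpecificInformation were filled along these lines, callers could read both the GPU request and the array membership without any change to the getJobStatus signature.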
