
ldms plugin job_id difficulties

Open morrone opened this issue 5 years ago • 3 comments

Currently, it is a requirement that all ldms plugins implement a job_id option and store the job_id in every metric set.

The first issue is that not all plugins will be running on components where a job_id makes sense. Plugins fall into two groups: those that obviously never run on nodes where a job could be running, and those that run on job-running nodes in some places and on components that do not run jobs in others. For the first group, we might just relax the rule and allow them not to implement job_id. For the second group, we still have an issue, although perhaps just designating "0" in the job_id field to mean "no job" would be fine. But since job_id is a compile-time option, and few people will want different builds for different nodes in the same center, the job_id field will stay around uselessly on the components that never run jobs.

Another issue with the current job_id implementation is that it seems to assume that only a single job is running on a node at a time. This is not the case on some of our clusters. So job_id is going to be insufficient there.

job_id is an integer, and that might be overly restrictive for some job managers. Switching to a string might make sense.
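To make the last two points concrete, here is a minimal sketch of a per-node job record that holds several opaque string IDs at once, with an empty record meaning "no job". Everything in it (`struct node_jobs`, `node_jobs_add`, `node_jobs_remove`, and the two size limits) is hypothetical and invented for illustration; none of it is existing ldms API.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

#define JOB_ID_MAX 64 /* hypothetical cap on ID length, including the NUL */
#define JOBS_MAX   8  /* hypothetical cap on concurrent jobs per node */

/* Hypothetical record of the jobs currently running on one node. */
struct node_jobs {
    size_t count;                   /* 0 means "no job" */
    char ids[JOBS_MAX][JOB_ID_MAX]; /* opaque, job-manager-defined strings */
};

void node_jobs_init(struct node_jobs *nj)
{
    assert(nj != NULL);
    memset(nj, 0, sizeof(*nj));
}

/* Add a job ID. Returns 0 on success, or -1 if the ID is empty, too long,
 * already present, or the table is full. */
int node_jobs_add(struct node_jobs *nj, const char *id)
{
    assert(nj != NULL && id != NULL);
    size_t len = strlen(id);
    if (len == 0 || len >= JOB_ID_MAX || nj->count >= JOBS_MAX)
        return -1;
    for (size_t i = 0; i < nj->count; i++) {
        if (strcmp(nj->ids[i], id) == 0)
            return -1;
    }
    memcpy(nj->ids[nj->count], id, len + 1);
    nj->count++;
    return 0;
}

/* Remove a job ID. Returns 0 if it was removed, -1 if it was not found. */
int node_jobs_remove(struct node_jobs *nj, const char *id)
{
    assert(nj != NULL && id != NULL);
    for (size_t i = 0; i < nj->count; i++) {
        if (strcmp(nj->ids[i], id) == 0) {
            /* Fill the hole with the last entry; order is not preserved. */
            nj->count--;
            if (i != nj->count)
                memcpy(nj->ids[i], nj->ids[nj->count], JOB_ID_MAX);
            nj->ids[nj->count][0] = '\0';
            return 0;
        }
    }
    return -1;
}
```

A fixed-size table keeps the sketch allocation-free; whether a real metric set could carry a variable number of string IDs is a separate question that this does not answer.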

Finally, the all-macro implementation of the helpers that all plugins are required to use makes too many assumptions about each plugin's internal implementation. It would be very nice to move to a proper function-based API, which by its nature would probably eliminate the issues that the macro implementation has.
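As a rough illustration of what "function-based" could look like, the sketch below keeps all job state behind a handle that the plugin owns, so the helper never depends on the plugin's variable names or layout. It assumes that this scope coupling is the main problem with the macros. The names (`struct job_helper`, `job_helper_new`, `job_helper_set`, `job_helper_get`) are hypothetical and are not the ldms API. The `enabled` flag also shows how the compile-time switch could become a runtime one, with 0 standing for "no job".

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical helper state, owned by the plugin through a pointer. */
struct job_helper {
    int enabled;     /* runtime switch instead of a compile-time option */
    uint64_t job_id; /* 0 means "no job" */
};

/* Allocate a helper. Returns NULL if allocation fails. */
struct job_helper *job_helper_new(int enabled)
{
    struct job_helper *h = calloc(1, sizeof(*h));
    if (h != NULL)
        h->enabled = enabled;
    return h;
}

void job_helper_free(struct job_helper *h)
{
    free(h);
}

/* Record the current job ID; ignored when the helper is disabled. */
void job_helper_set(struct job_helper *h, uint64_t job_id)
{
    assert(h != NULL);
    if (h->enabled)
        h->job_id = job_id;
}

/* Copy the current job ID into *out. Returns 1 if a value was written,
 * or 0 if the helper is disabled and *out was left untouched. */
int job_helper_get(const struct job_helper *h, uint64_t *out)
{
    assert(h != NULL && out != NULL);
    if (!h->enabled)
        return 0;
    *out = h->job_id;
    return 1;
}
```

A plugin on a component that never runs jobs would create the helper with `enabled` set to 0 and skip the job_id metric whenever `job_helper_get` returns 0, so the same build can serve both kinds of node.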

morrone, Apr 30 '19 17:04

This is addressed by sampler_base and job_info in v4. We don't intend to change it in v3, I believe.

baallan, May 01 '19 19:05

Maybe, but sampler_base comes with its own baggage issues. Too much in one API that isn't generally applicable to all plugins.

morrone, May 01 '19 22:05

True, but much easier to fix.

baallan, May 02 '19 00:05