
Parallel debugging description

Open dongahn opened this issue 5 years ago • 3 comments

As discussed at the 11 AM meeting on 7/16, we need high-level text on how to debug a Flux job in parallel. A good time to write it will be after Perforce Software's support engineer has had a chance to poke at a Flux version on one of the LC TOSS clusters. I will coordinate.

In the meantime, should this text go into https://flux-framework.readthedocs.io/en/latest/quickstart.html or somewhere else?

dongahn, Jul 17 '20 21:07

> In the meantime, should this text go into https://flux-framework.readthedocs.io/en/latest/quickstart.html or somewhere else?

Yeah, I think it's reasonable to have a parallel debugging section on the Quickstart page for now. Eventually, maybe we can split that section out and have a separate page just on debugging?

cmoussa1, Jul 17 '20 22:07

> Eventually, maybe we can piece that section out and have a separate page just on debugging?

Debugging and performance tuning, and yes eventually :-)

dongahn, Jul 17 '20 22:07

https://github.com/flux-framework/flux-sched/pull/694#issuecomment-663890636

I will be seeking some testing from Perforce and will document it in the next few weeks.

As part of this testing, some more best-practice recommendations on how to configure the TotalView parallel debugger will likely emerge. Those should be documented too.

dongahn, Jul 25 '20 19:07