JGF is missing vertices in `rv1` match-format writer

Open jameshcorbett opened this issue 1 year ago • 6 comments

On rzadams, which was just today configured to use the rv1 match format:

$ flux alloc -N2
flux-job: fqxGU4MP3XV started                                                                 00:00:17
Oct 14 19:16:24.615448 PDT sched-fluxion-resource.err[0]: grow_resource_db_jgf: db.load: unpack_edge: source and/or target vertex not found1654 -> 2196.
Oct 14 19:16:24.615462 PDT sched-fluxion-resource.err[0]: : Invalid argument
Oct 14 19:16:24.615469 PDT sched-fluxion-resource.err[0]: update_resource_db: grow_resource_db: Invalid argument
Oct 14 19:16:24.615473 PDT sched-fluxion-resource.err[0]: update_resource: update_resource_db: Invalid argument
Oct 14 19:16:24.616098 PDT sched-fluxion-resource.err[0]: populate_resource_db_acquire: update_resource: Invalid argument
Oct 14 19:16:24.616106 PDT sched-fluxion-resource.err[0]: populate_resource_db: loading resources using resource.acquire
Oct 14 19:16:24.616108 PDT sched-fluxion-resource.err[0]: init_resource_graph: can't populate graph resource database
Oct 14 19:16:24.616109 PDT sched-fluxion-resource.err[0]: mod_main: can't initialize resource graph database
Oct 14 19:16:24.616397 PDT sched-fluxion-resource.crit[0]: module exiting abnormally
Oct 14 19:16:24.842895 PDT sched-fluxion-qmanager.err[0]: update_on_resource_response: exiting due to sched-fluxion-resource.notify failure: Function not implemented
Oct 14 19:16:24.842907 PDT sched-fluxion-qmanager.err[0]: handshake_resource: update_on_resource_response: Function not implemented
Oct 14 19:16:24.842909 PDT sched-fluxion-qmanager.err[0]: handshake: handshake_resource: Function not implemented
Oct 14 19:16:24.842912 PDT sched-fluxion-qmanager.err[0]: mod_start: handshake: Function not implemented
Oct 14 19:16:24.842934 PDT sched-fluxion-qmanager.crit[0]: module exiting abnormally

I confirmed that vertex 1654 is missing from the JGF produced for the scheduler, although vertex 2196 is present. 1654 is a rack vertex; 2196 is a node vertex.
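
For anyone who wants to repeat that check, here is a minimal sketch of one way to look for edges that reference missing vertices. It assumes the JGF has the usual `{"graph": {"nodes": [...], "edges": [...]}}` layout, with an `id` on each node and `source`/`target` on each edge. The function name `find_dangling_edges` and the sample data are hypothetical, not part of flux-sched.

```python
def find_dangling_edges(jgf):
    """Return (source, target, missing_ids) for each edge whose
    endpoints are not all present in the JGF node list.

    Assumes the layout {"graph": {"nodes": [...], "edges": [...]}}.
    """
    graph = jgf["graph"]
    # Compare IDs as strings, since JGF writers may emit them either way.
    node_ids = {str(node["id"]) for node in graph.get("nodes", [])}
    dangling = []
    for edge in graph.get("edges", []):
        source, target = str(edge["source"]), str(edge["target"])
        missing = [v for v in (source, target) if v not in node_ids]
        if missing:
            dangling.append((source, target, missing))
    return dangling


# Hypothetical sample mirroring the failure: the node vertex 2196 is
# present, but the rack vertex 1654 that the edge points from is not.
sample = {
    "graph": {
        "nodes": [{"id": "2196", "metadata": {"type": "node"}}],
        "edges": [{"source": "1654", "target": "2196"}],
    }
}

if __name__ == "__main__":
    for source, target, missing in find_dangling_edges(sample):
        print(f"edge {source} -> {target}: missing vertices {missing}")
```

To run this on a real allocation, load the JSON with `json.load` and pass in the object that holds the `graph` key. In an R document I would expect that to be the `scheduling` section, but that is an assumption worth verifying against the actual output.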

jameshcorbett, Oct 15 '24 02:10