daos icon indicating copy to clipboard operation
daos copied to clipboard

DAOS-7203 control: Add histogram support to Prometheus exporter

Open mjmac opened this issue 4 years ago • 17 comments

Switch the object I/O counters to histograms in order to capture per-I/O size distributions in addition to total amounts. Update the Prometheus exporter to support passthrough histograms from native DAOS telemetry format.

mjmac avatar Apr 08 '21 21:04 mjmac

Bug-tracker data: Ticket title is 'Create standard DAOS dashboards for Grafana' Status is 'Resolved' Labels: 'rel20_TB3,rel20_TB4' https://daosio.atlassian.net/browse/DAOS-7203

github-actions[bot] avatar Feb 17 '24 01:02 github-actions[bot]

Bug-tracker data: Ticket title is 'Create standard DAOS dashboards for Grafana' Status is 'Resolved' Labels: 'rel20_TB3,rel20_TB4' https://daosio.atlassian.net/browse/DAOS-7203

github-actions[bot] avatar Feb 17 '24 01:02 github-actions[bot]

Screenshot 2024-02-16 at 20 33 45

mjmac avatar Feb 17 '24 02:02 mjmac

Functional on EL 9 Test Results (old)

135 tests  ±0   130 :white_check_mark:  - 1   1h 24m 30s :stopwatch: - 7m 9s  41 suites ±0     4 :zzz: ±0   41 files   ±0     0 :x: ±0   1 :fire: +1 

For more details on these errors, see this check.

Results for commit 427bb068. ± Comparison against base commit 98945324.

:recycle: This comment has been updated with latest results.

github-actions[bot] avatar Feb 17 '24 04:02 github-actions[bot]

Functional on EL 8.8 Test Results (old)

135 tests  ±0   130 :white_check_mark:  - 1   1h 34m 45s :stopwatch: + 5m 40s  41 suites ±0     4 :zzz: ±0   41 files   ±0     0 :x: ±0   1 :fire: +1 

For more details on these errors, see this check.

Results for commit 427bb068. ± Comparison against base commit 98945324.

:recycle: This comment has been updated with latest results.

github-actions[bot] avatar Feb 17 '24 05:02 github-actions[bot]

Bug-tracker data: Ticket title is 'Create standard DAOS dashboards for Grafana' Status is 'Resolved' Labels: 'rel20_TB3,rel20_TB4' https://daosio.atlassian.net/browse/DAOS-7203

github-actions[bot] avatar Feb 20 '24 23:02 github-actions[bot]

Bug-tracker data: Ticket title is 'Create standard DAOS dashboards for Grafana' Status is 'Resolved' Labels: 'rel20_TB3,rel20_TB4' https://daosio.atlassian.net/browse/DAOS-7203

github-actions[bot] avatar Feb 20 '24 23:02 github-actions[bot]

Functional Hardware Large Test Results (old)

64 tests  ±0   64 :white_check_mark: ±0   28m 16s :stopwatch: -26s 14 suites ±0    0 :zzz: ±0  14 files   ±0    0 :x: ±0 

Results for commit 427bb068. ± Comparison against base commit 98945324.

:recycle: This comment has been updated with latest results.

github-actions[bot] avatar Feb 21 '24 12:02 github-actions[bot]

Functional Hardware Medium Test Results (old)

130 tests  ±0   103 :white_check_mark:  - 1   2h 12m 10s :stopwatch: + 2m 18s  34 suites ±0    26 :zzz: ±0   34 files   ±0     1 :x: +1 

For more details on these failures, see this check.

Results for commit 427bb068. ± Comparison against base commit 98945324.

:recycle: This comment has been updated with latest results.

github-actions[bot] avatar Feb 21 '24 13:02 github-actions[bot]

Functional Hardware Medium Verbs Provider Test Results (old)

55 tests  ±0   54 :white_check_mark: ±0   4h 7m 30s :stopwatch: -1s  7 suites ±0    1 :zzz: ±0   7 files   ±0    0 :x: ±0 

Results for commit 427bb068. ± Comparison against base commit 98945324.

:recycle: This comment has been updated with latest results.

github-actions[bot] avatar Feb 21 '24 15:02 github-actions[bot]

Bug-tracker data: Ticket title is 'Create standard DAOS dashboards for Grafana' Status is 'Resolved' Labels: 'rel20_TB3,rel20_TB4' https://daosio.atlassian.net/browse/DAOS-7203

github-actions[bot] avatar Feb 21 '24 21:02 github-actions[bot]

Bug-tracker data: Ticket title is 'Create standard DAOS dashboards for Grafana' Status is 'Resolved' Labels: 'rel20_TB3,rel20_TB4' https://daosio.atlassian.net/browse/DAOS-7203

github-actions[bot] avatar Feb 21 '24 21:02 github-actions[bot]

Bug-tracker data: Ticket title is 'Create standard DAOS dashboards for Grafana' Status is 'Resolved' Labels: 'rel20_TB3,rel20_TB4' https://daosio.atlassian.net/browse/DAOS-7203

github-actions[bot] avatar May 03 '24 16:05 github-actions[bot]

Ticket title is 'Create standard DAOS dashboards for Grafana' Status is 'Resolved' Labels: 'rel20_TB3,rel20_TB4' https://daosio.atlassian.net/browse/DAOS-7203

github-actions[bot] avatar May 03 '24 16:05 github-actions[bot]

Test stage Functional on EL 8.8 completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-5382/11/execution/node/1177/log

daosbuild1 avatar May 03 '24 19:05 daosbuild1

FYI - I think the dkey_akey_enum_punch.py failures are being fixed in #14301

daltonbohning avatar May 03 '24 19:05 daltonbohning

FYI - I think the dkey_akey_enum_punch.py failures are being fixed in #14301

Actually, sorry. That looks like a different issue

daltonbohning avatar May 03 '24 19:05 daltonbohning

Test stage Functional Hardware Medium completed with status UNSTABLE. https://build.hpdd.intel.com/job/daos-stack/job/daos//view/change-requests/job/PR-5382/14/testReport/

daosbuild1 avatar Nov 01 '24 03:11 daosbuild1

Note to reviewers... This is finally in a state where I think it's ready for landing. I reworked the patch a bit. The original version changed the fetch/update bytes counters to histograms in addition to adding all of the logic for exporting via Prometheus. I've reverted that part of it, which should make this patch less risky. Longer term, I would like to revisit that other change, but I'll do that separately so that we can evaluate it in isolation.

mjmac avatar Nov 02 '24 17:11 mjmac