oceanbase icon indicating copy to clipboard operation
oceanbase copied to clipboard

[Feature]: Evaluate Profile-Guided Optimization (PGO) and LLVM BOLT

Open zamazan4ik opened this issue 1 year ago • 4 comments

Hi!

Recently I did many Profile-Guided Optimization (PGO) benchmarks on multiple projects (including many databases like PostgreSQL, MongoDB, Redis, and many others) - the results are available here, all databases results are located here. So that's why I think it's worth trying to apply PGO to OceanBase.

I can suggest the following things to do:

  • Evaluate PGO's applicability and benchmark results to OceanBase.
  • If PGO helps to achieve better performance - add a note to OceanBase's documentation about that. In this case, users and maintainers will be aware of another optimization opportunity for OceanBase.
  • Provide PGO integration into the build scripts. It can help users and maintainers easily apply PGO for their own workloads.
  • PGO-optimize prebuilt OceanBase binaries (if it's possible to prepare a generic enough test workload).

Here are some PGO integration examples in the existing build scripts in other projects:

Some PGO documentation examples:

  • ClickHouse: https://clickhouse.com/docs/en/operations/optimizing-performance/profile-guided-optimization
  • Databend: https://databend.rs/doc/contributing/pgo
  • Vector: https://vector.dev/docs/administration/tuning/pgo/
  • Nebula: https://docs.nebula-graph.io/3.5.0/8.service-tuning/enable_autofdo_for_nebulagraph/
  • GCC: Official docs, section "Building with profile feedback" (even AutoFDO build is supported)
  • Clang:
    • https://llvm.org/docs/HowToBuildWithPGO.html
    • https://llvm.org/docs/AdvancedBuilds.html

After PGO, I can suggest evaluating LLVM BOLT as an additional optimization step after PGO.

zamazan4ik avatar Sep 24 '23 23:09 zamazan4ik

Thank you for your suggestion. Currently, the code in OceanBase already supports PGO, and you can find the code at: https://github.com/oceanbase/oceanbase/blob/e4e09ecfb65d55d6225b080f6f2069159c94c1c7/cmake/Env.cmake#L55 You are welcome to provide any further optimization suggestions for this. Additionally, we would appreciate it if you could add our repository to the awesome repository related to FGO.

caifeizhi avatar Sep 25 '23 02:09 caifeizhi

Currently, the code in OceanBase already supports PGO, and you can find the code at

Thanks for pointing to the script! I didn't find it because I had been searching for PGO instead of FDO... :)

I have some questions about the current FDO implementation in OceanBase:

  1. Do you have the benchmark results of enabling FDO on OceanBase? I am interested in comparing Release vs Release + PGO OceanBase versions. So I will be able to put your benchmarks to my repo as an example.
  2. I see you use Sampling (AutoFDO) approach. Did you try to use the Instrumented FDO instead? If yes, could you please describe why you chose AutoFDO instead? My guess is that Instrumented FDO has just too big performance overhead compared to AutoFDO but it's just my guess.
  3. How did you collect the committed to the repo FDO profiles? How can I try to reproduce these profiles?
  4. Do you FDO-optimize prebuilt binaries in your releases (e.g. available on GitHub Releases page)?

Additionally, we would appreciate it if you could add our repository to the awesome repository related to FGO.

Of course! Just did it :)

zamazan4ik avatar Sep 25 '23 02:09 zamazan4ik

1. Do you have the benchmark results of enabling FDO on OceanBase? I am interested in comparing Release vs Release + PGO OceanBase versions. So I will be able to put your benchmarks to my repo as an example.

about 15% improvement;

2. I see you use Sampling (AutoFDO) approach. Did you try to use the Instrumented FDO instead? If yes, could you please describe why you chose AutoFDO instead? My guess is that Instrumented FDO has just too big performance overhead compared to AutoFDO but it's just my guess. 3. How did you collect the committed to the repo FDO profiles? How can I try to reproduce these profiles?

@Naynahs can you please take a look

4. Do you FDO-optimize prebuilt binaries in your releases (e.g. available on GitHub Releases page)?

yes, we use AUTOFDO=ON when rpm build, see it: https://github.com/oceanbase/oceanbase/blob/223cf90c93ddf0c163da2c47debe7ef4da2f46cd/build.sh#L207

caifeizhi avatar Sep 25 '23 04:09 caifeizhi

Yeah, I also want to know more about oceanbase, how to collect the committed to the repo FDO profiles? Thanks very much.

ruclz avatar Jan 05 '24 07:01 ruclz