online generic 24.04 writer performance ticket

perf-32249

Writer 24.04 flamegraphs

Mar 19 '24 12:03 caolanm

perf-6628

24.04 writer profile where we endured a slowdown after (possibly) selecting all text in a 72-page document and changing the font size.

Showing a cursor seems to have found some case where the notorious solveCrossovers gets called, odd

Apr 02 '24 12:04 caolanm

Showing a cursor seems to have found some case where the notorious solveCrossovers gets called, odd

That means that we have a selection overlay which (a) consists of more than one rectangle and (b) has a border, in which case we try to convert it into a "nicer" overlay polygon with a single continuous border.

Possibly we should just do what other software does and have multiple rectangles for that case, even if it means the borders don't look quite as nice.

Or maybe get alg to write us an optimised variant of that solveCrossovers() which only handles combining rectangles?

Apr 03 '24 06:04 grandinj

Or maybe get alg to write us an optimised variant of that solveCrossovers() which only handles combining rectangles?

Some interesting suggestions here, for such an algorithm: https://stackoverflow.com/questions/13746284/merging-multiple-adjacent-rectangles-into-one-polygon
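
For illustration only, here is a minimal Python sketch of the cheapest special case of "only combining rectangles". It is not the algorithm from that link, not what solveCrossovers() does, and not what any LibreOffice patch does. It assumes selection rectangles are axis-aligned (left, top, right, bottom) tuples, and it only fuses rectangles that sit in the same text line (identical top and bottom) and touch or overlap horizontally. The name `merge_row_rectangles` is made up for this sketch.

```python
from typing import List, Tuple

# Hypothetical representation: (left, top, right, bottom) in document units.
Rect = Tuple[int, int, int, int]


def merge_row_rectangles(rects: List[Rect]) -> List[Rect]:
    """Fuse rectangles with identical top/bottom that touch or overlap horizontally."""
    merged: List[Rect] = []
    # Sort by vertical band first, then by left edge, so that merge
    # candidates end up next to each other.
    for left, top, right, bottom in sorted(rects, key=lambda r: (r[1], r[3], r[0])):
        if merged:
            prev_left, prev_top, prev_right, prev_bottom = merged[-1]
            same_band = (prev_top, prev_bottom) == (top, bottom)
            if same_band and left <= prev_right:
                # Extend the previous rectangle instead of adding a new one.
                merged[-1] = (prev_left, prev_top, max(prev_right, right), prev_bottom)
                continue
        merged.append((left, top, right, bottom))
    return merged
```

A pre-pass like this would only shrink the input; building a single bordered outline from the remaining rectangles is the harder part that the linked suggestions address.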

Apr 03 '24 10:04 grandinj

The curious thing is that we split that polygon per-page; I wonder if by pushing that knowledge from up the stack down to here we could save trying to do 72 pages worth of pointless intersection, and just do that a page at a time (?) =) or perhaps we already do that (?)

Apr 03 '24 12:04 mmeeks

No, it doesn't look like we pass down the selection as per-page information, it's just one big vector of rectangles.
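
To make the per-page idea concrete, a rough Python sketch follows. It is not LibreOffice code; `group_by_page` and `page_tops` are made-up names. It assumes each rectangle lies entirely within one page and that the vertical start offset of every page is known. Bucketing the flat vector by page first means any later merge step only compares rectangles from the same page.

```python
from bisect import bisect_right
from collections import defaultdict
from typing import Dict, List, Tuple

# Hypothetical representation: (left, top, right, bottom) in document units.
Rect = Tuple[int, int, int, int]


def group_by_page(rects: List[Rect], page_tops: List[int]) -> Dict[int, List[Rect]]:
    """Bucket rectangles by page index.

    page_tops is the sorted list of y offsets at which each page starts
    (an assumption of this sketch). A rectangle is assigned to the page
    that contains its top edge.
    """
    pages: Dict[int, List[Rect]] = defaultdict(list)
    for rect in rects:
        top = rect[1]
        # Index of the last page whose start is <= top; clamp to page 0
        # for rectangles that start above the first page.
        page_index = max(0, bisect_right(page_tops, top) - 1)
        pages[page_index].append(rect)
    return dict(pages)
```

With 72 pages this turns one large intersection problem into 72 small, independent ones.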

Anyhow, WIP attempt at speeding this up at https://gerrit.libreoffice.org/c/core/+/165752

Apr 03 '24 14:04 grandinj

Today's typical small meeting document

perf-12773

Apr 04 '24 10:04 caolanm

As @pedropintosilva suggested, I made a few screen recordings (more to come) for 24.04 while smoke testing, and I'll leave the performance issues I noticed here, as discussed with him. Since the document used might be relevant: I used the ebooks from https://books.libreoffice.org/en/ as odt.

Note I'm also happy to file those individually if that makes more sense to you

File loading

Empty white pages are shown for a long time, and the UI only slowly changes its state

writer-slow-loading.webm

Initial rendering when scrolling

After the loading above, scrolling down also takes a considerable time to show the page content (might be related to the large ToC), though this only happens once, initially

writer-slow-page-scrolling.webm

Toggling formatting marks is slow

writer-slow-toggle-formatting-marks.webm

Enabling page headers and typing is slow

writer-header-slow.webm

Scrolling through a document

I understand that those tiles need to be rendered and transferred, of course; I just thought that, to a user, this might give the impression of performance issues. Maybe some smart preloading of the next tiles could be done (or improved, if already in place)

writer-scrolling-maybe-preload.webm

Apr 17 '24 06:04 juliusknorr

Might be interesting to work on this particular 600+ page book during testing to see if we can work out what is going on; I expect it is packed with large numbers of interesting features, charts etc. =)

Apr 17 '24 07:04 mmeeks

Today's 24.04 writer session, while under extra bgsave load

perf-6258

Apr 18 '24 10:04 caolanm

perf-16269

Today's writer 24.04 session

Apr 19 '24 10:04 caolanm

perf-2010

Today's writer 24.04 session

Apr 25 '24 10:04 caolanm

perf-26825

today's writer 24.04 session with bgsaves merged together

May 02 '24 10:05 caolanm

perf-25028

today's scheduled call writer profile

May 09 '24 10:05 caolanm

perf-30799

today's regular call writer profile

May 16 '24 10:05 caolanm

perf-17994

today's regular call writer profile

May 23 '24 10:05 caolanm

perf-29559

today's regular call writer profile

May 30 '24 13:05 caolanm

perf-19918

today's regular writer profile. GetAccessibleDescription seems to be falling back to querying "local" help for HelpText, which isn't even included (--without-help on those builds), so I can at least make vcl conditionally not even bother if built without local help support

Jun 06 '24 11:06 caolanm

https://gerrit.libreoffice.org/c/core/+/168447 for that idea

Jun 06 '24 12:06 caolanm

perf-22491

today's writer call on staging-perf

Jun 13 '24 10:06 caolanm

Interesting to see so much time in lcl_DrawLineForWrongListData - I assume drawing wiggly underlined data is back in the profile at ~25% of rendering cost; oddly, I thought we had a bitmap cache for that which massively accelerated this for spell checking. @grandinj any idea if there is an extra path here, or perhaps the cache is too small and needs to scale with the # of differently sized and colored views (we were stressing that today I think)?

Jun 13 '24 12:06 mmeeks

Interesting to see so much time in lcl_DrawLineForWrongListData

The cache was added in commit 0759191e6923945469bc426b2c322ddeade12e09, and it is 10 items big. The item key is { lineHeight, lineColor }.

Do you think you had more than 10 combinations of that? If so, we can just change the cache size to 100 for the LOK case.

Jun 14 '24 07:06 grandinj

Quite probably we should size the cache based on the number of distinct views; I guess we want a comphelper/lok.hxx method that lets us know how many distinct views there are and (ultimately) get this to be what we call normalized views (ie. that share the same zoom/rendering attributes) - so that we don't over-enlarge the cache and have some sharing.

And then use that to do 5x the number of views or whatever (?) =) Would be great if you can look at that Noel !
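
A small Python sketch of that sizing idea, purely as an illustration and not the vcl implementation: an LRU cache keyed on (lineHeight, lineColor) whose capacity scales with the number of views. The class name `WavyLineCache`, the parameters `view_count` and `per_view`, and the `render` callback are all made up here. The floor of 10 and the factor of 5 are taken from the numbers mentioned above, and LRU eviction is an assumption of the sketch rather than a statement about the existing cache.

```python
from collections import OrderedDict
from typing import Callable, Hashable, Tuple


class WavyLineCache:
    """LRU cache for rendered wavy-line bitmaps, sized by view count (sketch)."""

    def __init__(self, view_count: int, per_view: int = 5, minimum: int = 10) -> None:
        # Capacity grows with the number of distinct views but never
        # drops below the fixed size mentioned in the thread.
        self.capacity = max(minimum, per_view * view_count)
        self._items: "OrderedDict[Tuple[int, Hashable], object]" = OrderedDict()
        self.misses = 0

    def get(self, line_height: int, line_color: Hashable,
            render: Callable[[int, Hashable], object]) -> object:
        key = (line_height, line_color)
        if key in self._items:
            # Mark as most recently used and return the cached bitmap.
            self._items.move_to_end(key)
            return self._items[key]
        # Cache miss: render, store, and evict the least recently used entry.
        self.misses += 1
        bitmap = render(line_height, line_color)
        self._items[key] = bitmap
        if len(self._items) > self.capacity:
            self._items.popitem(last=False)
        return bitmap
```

The failure mode worth noting: if the views cycle through more distinct keys than the capacity, an LRU cache of this kind misses on every lookup, which would be consistent with the cost showing up in the profile again.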

Jun 14 '24 08:06 mmeeks

perf-70768-writer-share

24.04.4.3snapshot git hash: 8628721 Collabora Office 24.04.4.20240629 git hash: 2657979

Jul 02 '24 11:07 caolanm

perf-4224-writer

24.04.4.2 git hash: 20b6b94a Collabora Office 24.04.4.2 git hash: 2d2c24d (confusingly this one is master I think and the above is 24.04 branch)

Jul 02 '24 11:07 caolanm

Hmm - the writer profile has a lot of deltas in it, which looks unhealthy. I've also started to see the horror of full-document invalidations per-keystroke creeping back into collaborative editing sessions. It would be extremely good to have a desktop/source/lib/init.cxx lok_sample() type method that calls your watchdog trigger system-call on Linux, and that is used whenever a large-area or EMPTY invalidation is emitted - ideally from the core itself and not later from the CallbackFoo queue logic - so we can see where these come from =)
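
As a rough illustration of the trigger condition only, in Python rather than core C++: the sketch below records the current call stack whenever an invalidation is EMPTY or covers a large share of the document. Everything here is hypothetical: the function name `note_invalidation`, the `samples` list, the 50% threshold, and the use of `None` to stand for an EMPTY invalidation. The real proposal would call the watchdog trigger rather than capture a Python stack.

```python
import traceback
from typing import List, Optional, Tuple

# Hypothetical representation: (left, top, right, bottom); None means EMPTY,
# i.e. "invalidate everything".
Rect = Tuple[int, int, int, int]

# Captured call stacks, one entry per suspicious invalidation.
samples: List[str] = []


def note_invalidation(rect: Optional[Rect], doc_area: int,
                      threshold: float = 0.5) -> bool:
    """Record the call stack for EMPTY or large-area invalidations.

    Returns True if a sample was recorded.
    """
    if rect is None:
        is_large = True
    else:
        left, top, right, bottom = rect
        area = max(0, right - left) * max(0, bottom - top)
        is_large = area >= threshold * doc_area
    if is_large:
        # Stand-in for the sampling call: keep the innermost frames so the
        # origin of the invalidation can be identified later.
        samples.append("".join(traceback.format_stack(limit=8)))
    return is_large
```

Doing the check at the point where the invalidation is emitted, rather than when the callback queue is flushed, is what keeps the recorded stack pointing at the code that caused it.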

Jul 03 '24 05:07 mmeeks

perf-32311