reexamine the conditions for marking the ratio insignificant

Open andrewrk opened this issue 2 years ago • 1 comments
Currently it is done like this:

https://github.com/andrewrk/poop/blob/b01058a8081e5584f1c28f74c59643e413b562df/src/main.zig#L365

but maybe there is a better way to do it, one that is more widely accepted in the field of statistics.

(perhaps std deviation should be involved?)

andrewrk avatar Jun 14 '23 01:06 andrewrk

> perhaps std deviation should be involved?

  1. To be statistically correct, you would first have to run enough retries to check whether the standard deviation is a meaningful summary of the measured distribution at all: https://stats.stackexchange.com/questions/108578/what-does-standard-deviation-tell-us-in-non-normal-distribution. This may or may not be practical depending on the system behavior (interference from other processes may make the exact distribution unobservable in practice), so this step is usually skipped and a Gaussian distribution is assumed.
  2. The next step is to estimate confidence intervals (theory: https://medium.com/swlh/a-simple-refresher-on-confidence-intervals-1e29a8580697, more practical motivation: https://www.brainvitamins.net/blog/confidence-intervals-for-benchmarks/). However, Welch's t-test with abbreviated context and an optional explanation would be much better, see #2.
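
For illustration, a minimal sketch of what a Welch-based significance check could look like. This is not poop's code and not a proposal for the exact implementation: the function names `welch_t` and `is_significant`, the `alpha` default, and the sample data are all made up here. It also assumes roughly Gaussian samples with non-zero variance, and it approximates the p-value with the standard normal distribution instead of Student's t, which is only reasonable for larger sample counts.

```python
import math
from statistics import NormalDist, mean, variance


def welch_t(a, b):
    """Return (t, df) for Welch's unequal-variance t-test on two samples."""
    na, nb = len(a), len(b)
    # Squared standard errors of each sample mean (sample variance / n).
    va, vb = variance(a) / na, variance(b) / nb
    t = (mean(a) - mean(b)) / math.sqrt(va + vb)
    # Welch-Satterthwaite approximation of the degrees of freedom.
    df = (va + vb) ** 2 / (va ** 2 / (na - 1) + vb ** 2 / (nb - 1))
    return t, df


def is_significant(a, b, alpha=0.05):
    """Hypothetical decision rule: two-sided test at level alpha.

    ASSUMPTION: the p-value uses the normal distribution as a stand-in
    for Student's t with df degrees of freedom, so it is too optimistic
    for small samples.
    """
    t, _df = welch_t(a, b)
    p = 2.0 * (1.0 - NormalDist().cdf(abs(t)))
    return p < alpha


# Made-up wall-time measurements for two commands.
baseline = [10.0, 11.0, 9.0, 10.0, 10.0]
candidate = [20.0, 21.0, 19.0, 20.0, 20.0]
print(welch_t(baseline, candidate))
print(is_significant(baseline, candidate))
```

With a real t-distribution CDF available, the last step would use `df` instead of the normal approximation; the t statistic and the degrees-of-freedom formula stay the same.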

matu3ba avatar Nov 27 '23 10:11 matu3ba