zaminfluence

Add output example interpretation

Open akarlinsky opened this issue 3 years ago • 4 comments

The arXiv paper shows the results of the zaminfluence analysis for several papers, usually as a table giving the number and share of observations dropped.

The example in the README just shows some plots, which I'm not really sure how to interpret. Adding an "annotated output" for the example via table and/or explaining what the plots show would be really great IMO.

akarlinsky avatar Nov 20 '21 16:11 akarlinsky

This is a great suggestion. I'll work on putting this together when I get a chance.

rgiordan avatar Dec 04 '21 16:12 rgiordan

+1, I'm having trouble understanding the output in the readme example.

maswiebe avatar Apr 02 '22 20:04 maswiebe

I put another example file, examples/interpreting_output.R, in https://github.com/rgiordan/zaminfluence/pull/37. I'd be interested to hear if it's helpful.

Perhaps a standalone function to produce tables similar to those in the paper would still be helpful.
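As a rough idea of what such a function could look like, here is a minimal base R sketch. It is not part of zaminfluence: `drops_to_flip_sign`, `estimate`, and `drop_effects` are hypothetical names, and `drop_effects[i]` is assumed to hold the approximate change in the estimate from dropping observation `i`. It only covers a sign change and reports the number and share of observations dropped.

```r
# Hypothetical helper, not part of the zaminfluence API.
# estimate:     the fitted coefficient of interest (a single number)
# drop_effects: assumed approximate change in the estimate when each
#               observation is dropped, one entry per observation
drops_to_flip_sign <- function(estimate, drop_effects) {
  n <- length(drop_effects)
  # Express each effect as movement toward (and past) zero, largest first
  reduction <- sort(-sign(estimate) * drop_effects, decreasing = TRUE)
  # Dropping points that move the estimate away from zero never helps
  reduction <- reduction[reduction > 0]
  # Smallest number of drops whose summed effect exceeds |estimate|;
  # NA if the linear approximation never predicts a sign change
  k <- which(cumsum(reduction) > abs(estimate))[1]
  data.frame(change = "sign", n_dropped = k, share_dropped = k / n)
}

# Toy inputs, made up for illustration only
res <- drops_to_flip_sign(estimate = 1,
                          drop_effects = c(-0.6, -0.5, 0.2, -0.1, 0.3))
print(res)
```

Because the effects are summed linearly, the result is only a prediction; it would still need to be confirmed by rerunning the regression without the selected observations.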

rgiordan avatar Apr 19 '22 22:04 rgiordan

Nice, this was helpful! A standalone function would also be helpful, since I would be making one myself.

Note a small typo on line 70: 'for exmample'.

Also, the intuition for "large residuals and large |x1|" is high influence via large residuals and high leverage of x1, right? Might be helpful to make that explicit.
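To make that intuition concrete, here is a small base R sketch using the standard OLS leave-one-out identities rather than zaminfluence itself. The simulated data and all variable names are assumptions for illustration. To first order, dropping observation i changes the coefficients by -(X'X)^{-1} x_i e_i, so the effect on the x1 slope is proportional to the residual times the centered regressor. Dividing by 1 - h_ii, where h_ii is the leverage, gives the exact change.

```r
set.seed(1)
n <- 200
dat <- data.frame(x1 = rnorm(n))
dat$y <- 0.5 * dat$x1 + rnorm(n)
fit <- lm(y ~ x1, data = dat)

X <- model.matrix(fit)   # n x 2 design matrix (intercept, x1)
e <- residuals(fit)      # OLS residuals
h <- hatvalues(fit)      # leverages h_ii
XtX_inv <- solve(crossprod(X))

# Row i: first-order change in the coefficients from dropping observation i,
# i.e. -(X'X)^{-1} x_i e_i (X * e scales row i of X by e_i)
approx_change <- -(X * e) %*% XtX_inv
colnames(approx_change) <- colnames(X)

# Exact leave-one-out change: the same quantity divided by (1 - h_ii)
exact_change <- approx_change / (1 - h)

# For the slope, the first-order effect is -(x1_i - mean(x1)) * e_i / Sxx,
# so it is large only when both |residual| and |centered x1| are large
x1 <- dat$x1
slope_formula <- -(x1 - mean(x1)) * e / sum((x1 - mean(x1))^2)

# Check the exact formula against an actual refit without the top point
i <- which.max(abs(approx_change[, "x1"]))
refit <- lm(y ~ x1, data = dat[-i, ])
print(rbind(refit = coef(refit) - coef(fit), formula = exact_change[i, ]))
```

So yes, on this reading a point is influential for the x1 coefficient when its residual and its (centered) x1 value are both large in magnitude, and leverage enters the exact change only through the 1 / (1 - h_ii) factor.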

# For example, you can graph the reruns and predictions versus one another like so:
ggplot(summary_df) +
  geom_point(aes(x=prediction, y=rerun, color=param_name, shape=metric)) +
  geom_abline(aes(slope=1, intercept=0))

This graph isn't very clear, because the scale is so different for x1 and x2.
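One possible fix is to give each parameter its own panel with free axis scales. This is a sketch, not a tested change to the example file: the `summary_df` below is a made-up stand-in with the same column names as in the snippet above, and its metric labels and values are invented for illustration.

```r
library(ggplot2)

# Made-up stand-in for summary_df; the real one comes from the example script.
# Metric labels and numbers here are hypothetical.
summary_df <- data.frame(
  param_name = rep(c("x1", "x2"), each = 3),
  metric     = rep(c("metric_a", "metric_b", "metric_c"), times = 2),
  prediction = c(0.10, -0.05, 0.20, 12.0, -8.0, 25.0),
  rerun      = c(0.11, -0.04, 0.19, 12.5, -7.6, 24.1)
)

# One panel per parameter; scales = "free" lets x1 and x2 use their own axes,
# so the parameter with the small scale is no longer squashed near the origin
p <- ggplot(summary_df) +
  geom_point(aes(x = prediction, y = rerun, shape = metric)) +
  geom_abline(slope = 1, intercept = 0) +
  facet_wrap(~ param_name, scales = "free")

print(p)
```

With faceting, `color = param_name` becomes redundant, so it is dropped here. The 45-degree line is still drawn in every panel.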

maswiebe avatar Apr 25 '22 18:04 maswiebe