yardstick Add thresholds at which to evaluate the ROC curve.

Feature

In some situations it might be preferable to pre-specify the probability thresholds at which the ROC curve is evaluated. Might it be worthwhile to add an argument to roc_curve() for this?

Jan 18 '24 21:01 Dpananos

What would the output be? That is, what is an ROC curve with thresholds as an input supposed to look like? I am confused because I understand that the entire point of an ROC curve is to show results at all possible thresholds.

Jan 18 '24 21:01 tripartio

@Tripartio Here is an example of what the output should look like.

For each threshold, the sensitivity and specificity are calculated and one can plot the ROC curve.

Currently, the ROC curve is evaluated at every unique value of the estimate. This is sensible, but in my workflow I want to be able to compare models at the same thresholds. That is hard to do when the thresholds are determined by the estimate, because not all models will return the same estimate values.

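library(tidyverse)

# Simulated data: random 0/1 truth and uninformative random "probabilities"
N <- 1000
y <- factor(rbinom(N, 1, 0.5))
p <- runif(N)

# Pre-specified thresholds, padded with -Inf and Inf
thresholds <- c(-Inf, ppoints(100), Inf)

# One row of sensitivity/specificity per threshold
rocc <- map_dfr(thresholds, ~{

  # Predict class 1 whenever the probability exceeds the threshold
  predicted <- factor(as.integer(p > .x), levels = c(0, 1))

  # yardstick's default event_level = "first" treats level "0" as the event
  sensitivity <- yardstick::sens_vec(y, predicted)
  specificity <- yardstick::spec_vec(y, predicted)

  tibble(
    .threshold = .x,
    sensitivity = sensitivity,
    specificity = specificity
  )

})

# Plot the ROC curve from the fixed-threshold table
rocc %>%
  ggplot(aes(1 - specificity, sensitivity)) +
  geom_line()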



rocc
#> # A tibble: 102 × 3
#>    .threshold sensitivity specificity
#>         <dbl>       <dbl>       <dbl>
#>  1   -Inf         0             1    
#>  2      0.005     0.00389       0.994
#>  3      0.015     0.00973       0.990
#>  4      0.025     0.0195        0.971
#>  5      0.035     0.0272        0.955
#>  6      0.045     0.0350        0.940
#>  7      0.055     0.0447        0.926
#>  8      0.065     0.0584        0.918
#>  9      0.075     0.0661        0.901
#> 10      0.085     0.0720        0.893
#> # ℹ 92 more rows

Created on 2024-01-18 with reprex v2.0.2

Jan 18 '24 21:01 Dpananos

Hello @Dpananos 👋

This is not an unreasonable request! I could also imagine a scenario where you have many, many unique values of the estimate, and selecting fewer of them for plotting is advantageous.

Jan 18 '24 22:01 EmilHvitfeldt

Happy to take this on, though I might need some guidance on how best to approach the change.

Jan 18 '24 23:01 Dpananos

For my test set of 55k observations, the generated ROC table has 9300 entries. This is way too much to plot as you can't see that much detail. My colleague who used sklearn (I think) gave me a much more reasonable 400 entries.

May 14 '24 18:05 jxu
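
Until something like this is supported directly, one workaround for over-dense ROC tables is to thin the table after it has been computed. The following is only a minimal sketch in base R: `thin_roc()` is a hypothetical helper, not a yardstick function, and keeping evenly spaced rows is an assumption of this sketch, not a description of how sklearn chooses its points. It expects a data frame shaped like the output of roc_curve(), with one row per threshold.

```r
# Hypothetical helper (not part of yardstick): keep about `n` evenly spaced
# rows of a ROC table, always retaining the first and last rows so the
# thinned curve still reaches both ends.
thin_roc <- function(roc, n = 400L) {
  stopifnot(is.data.frame(roc), n >= 2L)
  if (nrow(roc) <= n) {
    return(roc)  # already small enough, nothing to drop
  }
  # Evenly spaced row positions from the first row to the last row
  keep <- unique(round(seq(1, nrow(roc), length.out = n)))
  roc[keep, , drop = FALSE]
}

# Toy table with as many rows as the one described above
dense <- data.frame(
  .threshold  = seq(0, 1, length.out = 9300),
  specificity = seq(0, 1, length.out = 9300),
  sensitivity = seq(1, 0, length.out = 9300)
)
thinned <- thin_roc(dense, n = 400L)
```

Thinning by row position keeps more points where the estimates are dense. It does not put the thresholds of different models on a common grid, so it helps with plotting but not with the comparison use case raised earlier in the thread.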

https://github.com/tidymodels/yardstick/blob/be744a3d419398a165aa5bbea0e7f14e750d5b79/R/prob-binary-thresholds.R#L6

The code is kinda confusing, but I guess the binary thresholds function is only designed to operate on every unique point of truth/estimate, not on a given set of thresholds. So it would require some rewriting.

Jun 20 '24 15:06 jxu
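
For reference, evaluating the curve at a given set of thresholds does not need much machinery on its own. The sketch below is base R only and is not the yardstick implementation: `roc_at_thresholds()` and its arguments are hypothetical names, and it assumes that an observation is called an event when its estimate is at or above the threshold and that the first factor level is the event.

```r
# Hypothetical helper (not part of yardstick): sensitivity and specificity
# at user-supplied thresholds instead of at every unique estimate.
# Assumption: predict "event" when estimate >= threshold.
roc_at_thresholds <- function(truth, estimate, thresholds,
                              event = levels(truth)[1]) {
  stopifnot(
    is.factor(truth), nlevels(truth) == 2L,
    is.numeric(estimate), length(truth) == length(estimate)
  )
  is_event <- truth == event
  thresholds <- sort(unique(thresholds))

  # Sensitivity: share of true events predicted as events
  sens <- vapply(
    thresholds,
    function(t) mean(estimate[is_event] >= t),
    numeric(1)
  )
  # Specificity: share of true non-events predicted as non-events
  spec <- vapply(
    thresholds,
    function(t) mean(estimate[!is_event] < t),
    numeric(1)
  )

  data.frame(.threshold = thresholds, specificity = spec, sensitivity = sens)
}

# Small worked example with "yes" as the event level
truth <- factor(c("yes", "yes", "no", "yes", "no", "no"),
                levels = c("yes", "no"))
estimate <- c(0.9, 0.8, 0.7, 0.4, 0.3, 0.1)
roc_fixed <- roc_at_thresholds(
  truth, estimate,
  thresholds = c(-Inf, 0.25, 0.5, 0.75, Inf)
)
```

Calling this with the same `thresholds` vector for each model gives tables that line up row by row, which is the comparison the feature request is after. The loop over thresholds costs one pass over the data per threshold, which seems acceptable for a few hundred thresholds but is not tuned for speed.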
