Jaron Arbet
Jaron Arbet
An easy solution I found was to use `model=TRUE` in `coxph`. Then when calling `ggforest`, use `data = fit$model`. Then the sample sizes will be the number of non-missing values.
Any updates on this? I am trying to read a large `.txt.gz` file that contains character/string data. I know `fread` can read `.txt.gz` files, but the file is larger than...
> Maybe [this](https://diskframe.com/articles/04-ingesting-data.html)? Cool, I see they have a [workaround](https://github.com/DiskFrame/disk.frame/issues/351) for reading .gz files, so this should work. Thanks!
I also can't figure out how to save plots to file (since I can't get them to display in my Rstudio window either). Here is some example code that tries...
> If the number of expected subgroups is, say 2000, then using random 5k rows from top 20k rows and directly using top 20k rows may give different results. But...
> @jarbet In cola, the 5000 features are not sampled from all 200K probs, it is sampled from `top_n` top features. Let's say you may have 10k top most variable...