Pascal Heus
Pascal Heus
Just to report I'm seeing the same issue. Thanks for looking into this.
As a workaround for my flavour of this issue, I used the interactive NLTK installer to install punkt on my machine. See instructions at https://www.nltk.org/data.html In a nutshell run Python...
I have a similar question, and it would be very useful to have mechanisms to provide more information in LinkML. The [regex101](https://regex101.com/) website enumerates some of the potential flavors. One...
In addition to above, it would be useful to also have the home and possibly repository links in the popup (for 1-click navigation).
A dataset with thousands of files is unusual, and the size of the metadata being reported in this case is actually not that bad (DDI would be quite larger). Please...
Hmm, I do see the 'noise' argument as being valid... and the filesystem could also be read-only (in some secure or archive environments). So to save the `stats` file, the...
Actually, the filename in the cache does not matter, right? Just need the key.
Ah, thanks for the feedback, and it makes sense. Frequencies can certainly be quite large files. I'm actually planning to leverage the `.stats.csv` file for other purposes as well (along...
Sounds good. Happy to help as needed. Not that this is way easier to implement for `frequency` (#3175) as there it's only a sum of the weights (instead of each...
Adding `--subtype xsd` for the [W3C XML data types](https://www.w3.org/TR/xmlschema-2/) would be a good option. Don't think this needs to be under `moarstats` as most of the types can be derived...