Eric Kafe
Eric Kafe
DeepSeek proposes to add a _Legal_Notice_ file to the semcor package, with the following text: # Legal Notice for the SEMCOR Corpus This package, `semcor`, is a derivative work with...
@fcbond, it seems that the situation is indeed desperate for the _brown_ corpus itself, and that we need to remove it. However, there is probably hope for preserving _semcor_ under...
Thanks @ChristianeFellbaum , it is positive to learn that you had direct permission from the Brown Corpus compilers, because that could allow to bypass the LDC, especially if your agreement...
Thanks Christiane and Randee, it is sad to hear that you will be discontinuing the distribution of _semcor_, presumably due to legal concerns. Maybe we should read this as an...
Many scientific articles rely on the nltk corpus reader for the reproducibility of findings in the Brown Corpus, and removing it from _nltk_data_ would disrupt the reproducibility of these studies....
Thanks Christiane, Rada answered that she would like to be able to provide some clarification, but unfortunately doesn't think she has additional information. Now, concerning the ongoing clarification effort among...
The proposed list of free licences should probably be wider than just the OSI-approved **software** licenses. Here's why: * **OSI focuses on Software:** The OSI defines "open source" specifically for...
An audit of all packages in _nltk_data/index.xml_ has been performed from a FOSS (Free and Open Source Software) compliance perspective. This comprehensive and exhaustive categorization of all packages has resulted...
Marking this PR as "Ready for Review" to encourage broader feedback and community input. While I anticipate some modifications may be necessary, the current state provides a solid foundation for...
Summary of findings from #3460 experiments - Removing gc.collect() alone does not fix the segfault on Python 3.13. - Dict-only swap (replace __dict__ without changing __class__) fails functionally on all...