datahub
JupyterHubs for use by Berkeley enrolled students
### Bug description `scripts/delete-unused-users.py` retrieves lists of users from the hub and then deletes ORM entries for inactive users. However, it does not paginate through the users properly, so one...
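To illustrate the kind of pagination the bug description refers to, here is a minimal sketch in Python. It is not the code from `scripts/delete-unused-users.py`, which is not shown here. It assumes the user list is served in pages selected by an offset and a limit, and the names `iter_all_users`, `fetch_page`, and `fake_fetch` are hypothetical, introduced only for this example.

```python
from typing import Callable, Iterator, List


def iter_all_users(
    fetch_page: Callable[[int, int], List[dict]],
    page_size: int = 200,
) -> Iterator[dict]:
    """Yield every user by requesting pages until a short page is returned.

    fetch_page(offset, limit) is a hypothetical callable that returns at most
    `limit` user records starting at position `offset`.
    """
    offset = 0
    while True:
        page = fetch_page(offset, page_size)
        yield from page
        # A page shorter than page_size (including an empty one) is the last.
        if len(page) < page_size:
            break
        offset += len(page)


# Stand-in for the hub so the sketch runs without a network connection.
_FAKE_USERS = [{"name": f"user{i}"} for i in range(5)]


def fake_fetch(offset: int, limit: int) -> List[dict]:
    return _FAKE_USERS[offset:offset + limit]


names = [u["name"] for u in iter_all_users(fake_fetch, page_size=2)]
```

In a real script, `fetch_page` would wrap an authenticated request to the hub. Stopping only on a short page, rather than after the first response, is what keeps users beyond the first page from being skipped.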
Deploying changes requires making a PR and seeing it through. This isn't well documented in a central place, so we should document it. https://github.com/berkeley-dsep-infra/datahub/pull/411#issuecomment-454225099 has some useful starting material.
We should have a policy for when and how we will upgrade the JupyterHub chart. We should get reasonably up to date as soon as possible, since otherwise long upgrades...
sqlite and NFS don't get along very well. However, Jupyter Notebook uses several sqlite databases under $HOME by default. Our $HOME is on NFS, and this leads to weird problems. The...
# Summary Currently, our [data archival requests](https://github.com/berkeley-dsep-infra/datahub/issues/new?assignees=felder&labels=support&template=data_archival_request.yml) are visible to the public. This may not be ideal, as all the student email details are publicly accessible, which may...
# Summary #2808 was a blocker for many students using the EECS hub. The issue occurred because user data was maxing out hub storage. @felder, with his awesome skills, immediately...
I think a semester or so ago, people might have had to upload individual copies of a large dataset to the biology hub as we didn't have shared directory functionality...
When looking through the biology hub, @felder discovered that some users are using as much as 250GB of space. While not prohibited, we should reach out to these users and...