pyDataverse
pyDataverse copied to clipboard
Add Pandas integration for Datafiles
Add a pandas integration for spreadsheet datafiles. There are two options to do this:
- Create an API request with DataAccess.get_datafile(), which should return an Pandas Dataframe instead of the requests.Response object.
- Create an API requests with DataAccess.get_datafile(), which should return an models.Datafile() object, with the data stored inside and an offered method .to_df() to get the data as a Pandas Dataframe.
Idea coming from https://github.com/gdcc/pyDataverse/issues/80.
Prepare
- [ ] Research both ways
- [ ] Talks with users
Implementation
- [ ] Write tests
- [ ] Write code
- [ ] Update Docs
- [ ] Write tutorial
- [ ] Update Docstrings
- [ ] Run pytest
- [ ] Run tox
- [ ] Run pylint
- [ ] Run mypy
Review
- [ ] Docs
Follow-Ups
As discussed during the 2024-02-14 meeting of the pyDataverse working group, we are closing old milestones in favor of a new project board at https://github.com/orgs/gdcc/projects/1 and removing issues (like this one) from those old milestones. Please feel free to join the working group! You can find us at https://py.gdcc.io and https://dataverse.zulipchat.com/#narrow/stream/377090-python