fastexcel
fastexcel copied to clipboard
A Python wrapper around calamine
Provide a `to_python` method that would convert data to a `list[list[int | float | str | datetime | date | timedelta | None]]`. Add a parameter to `to_pandas` and `to_polars`...
- add getting started in README and doc - add badges - add `CONTRIBUTING` - add `PULL REQUEST` template Straightforward but important for repo quality
Would be great to have some kind of non regression performance test like [pydantic-core](https://github.com/pydantic/pydantic-core/tree/main/tests/benchmarks) for example
I find that people will often have Excel sheets where they use multiple header rows to make up their column names. Here's a snippet of how I deal with that...
Add benchmarks to the README (speed & memory), with scripts allowing tor reproduce them. Add multiple scnearios. Some ideas could be: * Single sheet, not chunked * Single sheet, chunked...
Attached two visually-identical files below that can be used to reproduce the issue. Both appear to contain identical column data, but one fails to load while the other succeeds. data:image/s3,"s3://crabby-images/44b5e/44b5e2800aa64c3dfc7518ec7644b55fd3309c5f" alt="Screenshot...
Would you please also publish source code to pypi ? Then pip will fallback to source code build install. Thanks.
The Arrow project recently created a [new protocol for sharing Arrow data in Python". One of the goals of the protocol is allow exporting / importing Arrow data in Python...