Antoine Pitrou

823 comments by Antoine Pitrou

> Implementations of `from_dataframe` should just disregard the data buffer dtype entirely. `column.dtype` already tells you what to expect in the data buffer (e.g. dtype `STRING` will mean an 8bit...

Is the spec supposed to be stable? There are a bunch of "TODO" and "TBD" statements, and implementations are generally very recent.

Well, if you have a `DATETIME` column, for example, what is the implied dtype for the data buffer? Is it INT64 perhaps (but it might also be INT32 for a...

> > Well, if you have a DATETIME column, for example, what is the implied dtype for the data buffer? It might be spelled out in the spec, but I'm...

I don't know about philosophical reasons ;-), but I don't think chown is very often used. Do you know of a context where it is?

Well, I don't know yet, but if we ever want chown() in pathlib, I think it should be higher-level and accept symbolic uids as well.
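To make "higher-level" concrete, here is a minimal sketch of resolving either a numeric uid or a symbolic user name before changing ownership. The helper name `resolve_uid` is hypothetical and not part of pathlib; it only relies on the POSIX-only `pwd` module.

```python
import pwd  # POSIX only; not available on Windows


def resolve_uid(user):
    """Hypothetical helper: accept a numeric uid or a symbolic user name."""
    if isinstance(user, int):
        # Already numeric, nothing to look up.
        return user
    # Look the name up in the password database; raises KeyError if unknown.
    return pwd.getpwnam(user).pw_uid
```

The standard library's `shutil.chown` already accepts either a name or a numeric id for its `user` and `group` arguments, so a pathlib-level method could plausibly follow the same convention.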

If users were allowed to pass `bytes` to pathlib, they would have to be converted to `str` internally anyway. I'd rather people do their own conversion if they need a...

[PEP 383](https://peps.python.org/pep-0383/) is the fundamental reference here.

> I don't follow, sorry - why would they have to be converted to `str`?

Because pathlib uses `str` internally.
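As an illustration of doing the conversion yourself, here is a minimal sketch using the `surrogateescape` error handler that PEP 383 defines. The file name is made up, and UTF-8 is assumed as the encoding; in real code `os.fsdecode` and `os.fsencode` apply the actual filesystem encoding for you.

```python
from pathlib import PurePosixPath

# Made-up byte path containing a byte (0xE9) that is not valid UTF-8.
raw = b"data/caf\xe9.txt"

# PEP 383: undecodable bytes are mapped to lone surrogates (0xE9 -> U+DCE9),
# so the resulting str can be handed to pathlib.
text = raw.decode("utf-8", errors="surrogateescape")
path = PurePosixPath(text)

# Encoding with the same handler restores the original bytes exactly.
restored = str(path).encode("utf-8", errors="surrogateescape")
```

The round trip is lossless, which is why keeping `str` as the only internal representation does not prevent handling arbitrary byte file names.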

Hi @yaqi-zhao, 1) Can you clarify the PR title and description to explain what this is about? 2) Can you fill in information about the data files in https://github.com/apache/parquet-testing/blob/master/data/README.md?