Nhat Nguyen

Results 5 comments of Nhat Nguyen

> @ntnhaatj How's your progress on this? Were you able to figure out the last few tests? duckdb has just released v0.9.0 (which officially has fixed the shadowing name issue...

Hi @mamonu, my work is currently designed for my specific use case, primarily for data quality reporting. It doesn't guarantee the quality and coverage of all elementary features. It might...

hi, dont know whether this issue is the same as what you raise above. I'm experiencing a performance problem when using a glob pattern like `hdfs:///user/hive/warehouse/db/table/part_date=20240802*/*.parquet` in the HDFS filesystem....

@martindurant I ran profiling the `fs.glob` function on my HDFS directory ```python # 1 profiled_glob("hdfs:///user/hive/warehouse/a.db/table/part_date=20240825*") # 2 profiled_glob("hdfs:///user/hive/warehouse/a.db/table/part_date=20240825*/*.c000") ``` in both cases, the result always take most of the time...

@martindurant I see, my usage of `fsspec` is within `DuckDB` to interact with HDFS, which implicitly calls glob to scan directories. Unfortunately, the performance isn't good, likely due to reasons...