Yijie Shen

Results 29 comments of Yijie Shen

Great work! Is that possible to report new benchmark results for the latest master? @richox

We could always explore buffer based compression when it gets direct support from arrow-rs later.

#29 is tracking the same issue, it is waiting for apache/pulsar#6120

Tracked here in Pulsar repo: https://github.com/apache/pulsar/issues/6121

This issue is caused by two subsequent issues: - firstly, many of the REST APIs respond with 307 temporary redirects but not annotate them explicitly. (https://github.com/apache/pulsar/pull/6227) - PulsarAdmin should deal...

@frank-dkvan thanks for reporting! Could you please try to use pulsar-spark connector to read `topic_test1` out? Does that work? ```scala val df = spark .read .format("pulsar") .option("service.url", "pulsar://localhost:6650") .option("admin.url", "http://localhost:8080")...

@frank-dkvan, could you please try other data types? such as int, or complex types? I'm afraid pulsar sql cannot handle `Schema.BYTES` correctly.

We have implemented file-based shuffle with memory management in Blaze. I could have that PRed if it aligns with the goal of Ballista. https://github.com/blaze-init/blaze/blob/master/native-engine/datafusion-ext/src/shuffle_writer_exec.rs