opteryx issues

✨ support group by aliases

~~~sql SELECT EXTRACT(YEAR FROM hire_date) AS hire_year, COUNT(*) FROM employees GROUP BY hire_year; ~~~

✨ support filters on aggregates

~~~sql SELECT COUNT(*) AS total_rows, COUNT(*) FILTER (WHERE status = 'active') AS active_rows, AVG(salary) FILTER (WHERE department = 'engineering') AS avg_engineering_salary FROM employees; ~~~

joocer

✨ can we add FOR clause support to the parser

this would reduce some of the most complex python code

joocer

✨ can we update the Opteryx dialect to support join hints

~~~sql FROM a INNER HASH JOIN b ON a.id = b.id FROM a INNER NESTED JOIN b ON a.id = b.id FROM a INNER MERGE JOIN b ON a.id =...

joocer

✨ treat VARCHAR as byte arrays and introduce UNICODE type

do we need a unicode or similar type for times we really need unicode Could we use NORMALIZE as a modifier at all?

joocer

✨ reduce copies of data when reading from the buffer

we may be able to use latches to hold an item in the buffer until the reading is complete Reads appear to have a delay due to making copies. We...

joocer

✨ support PGOVERLAP (&&)

https://www.postgresql.org/docs/current/functions-array.html Test for overlap of two arrays Keep @> for literal and column matching only Implement

joocer

✨ Rewrite INNER JOIN where only the LEFT table is referenced in WHERE/HAVING/SELECT clauses as SEMI JOIN

This is generally being used as a containment test for the LEFT table, rewriting as a SEMI JOIN will be faster.

joocer

✨ Use the Int64 BloomFilter for any types that can be cast to int64 (eg dates and timestamps)

The Int64 BL is less accurate than the text bloom filter but is faster. Where we have multiple join columns we should merge the bloom filters - this should improve...

joocer

✨ use shared buffers for multi processing

~~~python import pyarrow as pa import multiprocessing.shared_memory as shm import numpy as np import multiprocessing def create_shared_arrow_array(): """Creates a large Arrow array and stores it in shared memory.""" data =...

joocer

opteryx
opteryx copied to clipboard

Metadata

✨ support group by aliases

✨ support filters on aggregates

✨ can we add FOR clause support to the parser

✨ can we update the Opteryx dialect to support join hints

✨ treat VARCHAR as byte arrays and introduce UNICODE type

✨ reduce copies of data when reading from the buffer

✨ support PGOVERLAP (&&)

✨ Rewrite INNER JOIN where only the LEFT table is referenced in WHERE/HAVING/SELECT clauses as SEMI JOIN

✨ Use the Int64 BloomFilter for any types that can be cast to int64 (eg dates and timestamps)

✨ use shared buffers for multi processing

← Metadata

Owner

Metadata

opteryx opteryx copied to clipboard

Metadata

← Metadata

Owner

Metadata

opteryx
opteryx copied to clipboard