trino
trino copied to clipboard
Pinot insert
Implement insert into pinot. TODO: add disk based sort, extract relevant commits, currently it is based off of #7160 (drop table)
:wave: @elonazoulay - this PR has become inactive. If you're still interested in working on it, please let us know, and we can try to get reviewers to help with that.
We're working on closing out old and inactive PRs, so if you're too busy or this has too many merge conflicts to be worth picking back up, we'll be making another pass to close it out in a few weeks.
Gonna close this one out due to inactivity, please reopen if you would like to continue doing work here.
Sure, I am still actively working on this one. I will push changes to it later today.
@bitsondatadev can we review please?
Sure! Go right ahead. We always welcome reviews.
@elonazoulay how's this looking? Is it ready for review?
Hey, I just resolved a dependency issue w new version. Waiting for the tests to complete and it should be good to go! Will likely break into multiple commits, whatever the reviewer thinks is the easiest way to review:
- Break out support for timestamp and boolean array types
- Minor fix to array types (handle null array element)
- Fix time boundary handling + test
Just want to make sure the tests are all good first:)
Tests passed! Ready for review. Any initial comments about large refactoring are welcome, will be happy to do anything you suggest.
One more note:
Added failsafe retry policy to the testInsertIntoHybridTableWithTimestamp test since there appears to be latency between uploading segments and servers downloading them so the data is available. This only happened on ci, maybe due to constrained resources. Locally all tests pass repeatedly.
@bitsondatadev I have reviewed the PR and LGTM. We are running trino in production from this branch for more than 6 months now. can we request review from trino guys?
Thanks @jatin5251! Let me see if @martint can take a look.
@bitsondatadev @martint can you guys have a look?
Hey @jatin5251 gonna tag @trinodb/devrel to include Manfred and Cole to follow up.
@bitsondatadev any luck?
@elonazoulay can you rebase and get this reviewed?
Are you looking at including the upgrade to Apache Pinto 1.0.0 as part of this PR? https://docs.pinot.apache.org/basics/releases/1.0.0
Update: the failing tests in ci all passed locally with the same command as failing step, i.e. ./mvnw test -pl plugin/trino-pinot. It appears the github runner does not have enough resources to run the tests.
Are you looking at including the upgrade to Apache Pinto 1.0.0 as part of this PR? https://docs.pinot.apache.org/basics/releases/1.0.0
That would be done in a separate pr. Will push that and let you know.
Update: the tests now fail after ~47m: increasing heap to 3g ,i.e. using MAVEN_INSTALL_OPTIONS. Before they were failing after about 22-23mins.
Trying to increase the flush interval which reduces resources on the pinot server test containers.
Are you looking at including the upgrade to Apache Pinto 1.0.0 as part of this PR? https://docs.pinot.apache.org/basics/releases/1.0.0
I created #19136 for upgrading to pinot 1.0.0 - cc @mosabua
FYI - the reason checks were failing was because too many parallel pinot smoke tests were running. I removed the legacy client pinot tests test verify this. Will add them back, testing how to do this without overloading the gh runner.
@mosabua @bitsondatadev can you please get it reviewed , its pending for long?
@mosabua @bitsondatadev can you please get it reviewed , its pending for long?
@elonazoulay can work directly with the maintainers who know the Pinot connector.
:wave: @elonazoulay - this PR has become inactive. If you're still interested in working on it, please let us know.
We're working on closing out old and inactive PRs, so if you're too busy or this has too many merge conflicts to be worth picking back up, we'll be making another pass to close it out in a few weeks.
Hi, this is still active. I'm waiting for this issue to be resolved, likely in the next minor release of Pinot.
This pull request has gone a while without any activity. Tagging the Trino developer relations team: @bitsondatadev @colebow @mosabua
Looks like you can proceed on this @elonazoulay since the linked issue seems resolved and a new release is out.
This pull request has gone a while without any activity. Tagging the Trino developer relations team: @bitsondatadev @colebow @mosabua
Closing this pull request, as it has been stale for six weeks. Feel free to re-open at any time.
I think @elonazoulay will pick this up again eventually or someone else can use this as a base for implementing independently. Next step would be a rebase.
I added the stale-ignore label to ensure we dont loose sight of this improvement.
also fyi @robertzych