Trino-LanceDB plugin
Description
Initial version of Trino-LanceDB plugin
Thank you for your pull request and welcome to the Trino community. We require contributors to sign our Contributor License Agreement, and we don't seem to have you on file. Continue to work with us on the review and improvements in this PR, and submit the signed CLA to [email protected]. Photos, scans, or digitally-signed PDF files are all suitable. Processing may take a few days. The CLA needs to be on file before we merge your changes. For more information, see https://github.com/trinodb/cla
We should add documentation and the logo as well and maybe take this out of draft mode for first feedback and testing.
> We should add documentation and the logo as well and maybe take this out of draft mode for first feedback and testing.
Yes, will do once we get a green CI (also with documentation + logo).
Upgraded to a test release version. PTAL @mosabua. I think Lance is still working on a cross-platform release, so we might still bump the version later, but this should work on the linux x86 platform. Also, I think I signed and emailed the CLA, but I'm not sure why the CI bot is still saying otherwise. Please kindly check.
CLA processing is a bit behind .. no worries. We can still proceed with reviewing and such
@mosabua Hi, curious what the ETA is for this? Thanks for adding this, we were looking for it.
@bazooka720 .. no ETA .. we have to start reviewing more and also figure out how to package this since it adds considerable size to the artifacts, but things are in progress.
Sorry for the late follow-up. I've chatted with folks at Lance, and considering that the follow-up PR for predicate/project/filter pushdown is already in progress, we would like to move quicker on this PR.
Overall I think the approach achieves the basic functionality we discussed previously. The remaining work is mostly:
- Java-side LanceDB/Core CI release, and getting it working E2E on a linux/x86 env
- getting it working with Trino's CI framework
For step 1, I think @eddyxu has made great progress merging https://github.com/lancedb/lance/pull/2382 and https://github.com/lancedb/lance/pull/2516. I will test this next to ensure it works E2E. For step 2, we still have some complications regarding the GHA image used, so any suggestions on how to get this sorted out would be highly appreciated.
Thanks @mosabua and other folks for the help and support. I think code-wise the PR is ready for review. I will keep folks updated on the CI and packaging side.
I think we want to get this changed from draft to ready for review then @walterddr
In parallel we can work on CI and packaging.
Can you maybe let us know what the status is about building locally and what the specific issues are about CI and packaging.
Also note this connector might end up being part of the work on refactoring packaging. Note that the roadmap issue is still in the works to get fleshed out and implementation to begin.
https://github.com/trinodb/trino/issues/22597
> I think we want to get this changed from draft to ready for review then @walterddr
DONE
> In parallel we can work on CI and packaging.
> Can you maybe let us know what the status is about building locally and what the specific issues are about CI and packaging.
I can build locally on either my own linux-x86 machine or my Mac, but I think the first goal here is to make linux-x86 work. Previously, when I launched the CI job, the issue was that the ubuntu-22.04 image we use in CI has a GLIBC version issue:
java.lang.UnsatisfiedLinkError: /tmp/liblancedb_jni15263364499459730028.so: /lib/x86_64-linux-gnu/libm.so.6: version `GLIBC_2.38' not found (required by /tmp/liblancedb_jni15263364499459730028.so)
see link: https://github.com/trinodb/trino/actions/runs/9331728631/job/25686753922?pr=21880
Do you know what base image would actually fix that?
> Do you know what base image would actually fix that?
I am not 100% sure. According to the log shown in the run, the base image is:
Ubuntu 22.04 LTS
so it should come packaged with GLIBC version 2.35. The error message indicates that the lance-db JNI JAR is looking for
/lib/x86_64-linux-gnu/libm.so.6: version `GLIBC_2.38' not found
so I think there are two options:
- upgrade GLIBC to 2.38 for this particular run, using an extra step in the CI run.
- downgrade the GLIBC version required by the lance-db release to 2.35.
CC @LuQQiu and @eddyxu for suggestions from the lance side
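To make the mismatch above concrete, here is a minimal sketch that pulls the required GLIBC version out of an `UnsatisfiedLinkError` message of the shape shown in the CI log and compares it with the version an image ships. `GlibcRequirement`, `parseRequired`, and `isSatisfied` are hypothetical names used for illustration only; they are not part of Trino or Lance, and the regular expression assumes the message format seen in this thread.

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper for illustration; not part of Trino or Lance.
final class GlibcRequirement
{
    // Assumes the loader message format seen in the CI log:
    //   ... version `GLIBC_2.38' not found ...
    private static final Pattern REQUIRED =
            Pattern.compile("version `GLIBC_(\\d+)\\.(\\d+)' not found");

    private GlibcRequirement() {}

    // Returns {major, minor} required by the native library,
    // or empty if the message does not have the expected shape.
    static Optional<int[]> parseRequired(String linkErrorMessage)
    {
        Matcher matcher = REQUIRED.matcher(linkErrorMessage);
        if (!matcher.find()) {
            return Optional.empty();
        }
        int major = Integer.parseInt(matcher.group(1));
        int minor = Integer.parseInt(matcher.group(2));
        return Optional.of(new int[] {major, minor});
    }

    // True when the available glibc (major.minor) is at least the required version.
    static boolean isSatisfied(int[] required, int availableMajor, int availableMinor)
    {
        if (availableMajor != required[0]) {
            return availableMajor > required[0];
        }
        return availableMinor >= required[1];
    }
}
```

Feeding it the message from the failed run together with the image's 2.35 reports the requirement as unsatisfied, which matches both options listed: either the image has to provide 2.38 or the native library has to be built against 2.35 or older.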
This pull request has gone a while without any activity. Tagging the Trino developer relations team: @bitsondatadev @colebow @mosabua
@wendigo do you think we should upgrade our base image?
@walterddr is this GLIBC version a runtime requirement as well?
@mosabua sorry for the delay. It looks like I was looking in the wrong place: the issue comes from the compiled lancedb-core module, not the lance-core module, so our previous workaround was always aimed at the wrong target.
After chatting with the Lance folks, we have a new version of lancedb-core that should conform with the current base image, so I don't think we need to bump it up.
And yes, it is a runtime dependency.
Is this feature already available?
@walterddr Any blocker for this PR? Many users are asking for this integration
Yes .. we need to review it all still and get it ready for merge with @walterddr