LakeSoul
LakeSoul is an end-to-end, real-time, cloud-native Lakehouse framework with fast data ingestion, concurrent updates, and incremental data analytics on cloud storage for both BI and AI applications.
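I tried to set up a LakeSoul example on our internal network. I installed Hadoop, Spark, Java, and Scala on Windows, but running the provided example code with Scala throws an exception reporting an AWS credentials problem (No AWS Credentials provided by SimpleAWSCredentialsProvider EnvironmentVariableCredentialsProvider InstanceProfileCredentialsProvider). From what I found, this requires authenticating against AWS, which conflicts with our offline environment. Is there something wrong with my approach? I first tried setting up the environment on CentOS 7 and also ran into a series of problems, and there are almost no setup tutorials for this environment on blogs in China. Could you provide a complete tutorial for setting up an example project? We would like to move to domestic open-source projects, but documentation is really scarce.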
Tracking issue for Flink support. SubTasks: - [x] #58 - [x] #59 - [x] #60 - [x] #61 - [x] #62 - [x] #80 - [x] #63 -...
# Motivations The IO layer is a critical part of a table storage framework. However, the current IO implementation suffers from some drawbacks: 1. The IO stack is tightly coupled with Spark,...
The [result](https://github.com/meta-soul/LakeSoul/wiki/01.-Data-Lake-Comparison) looks good. Could you please provide further details on the comparison settings?
LakeSoul performs well on upsert, metadata management, and concurrent writes compared with other products such as Iceberg, Delta Lake, and Hudi. Could we provide standard benchmark data for testing? I think it will be...
I created the table, but I still get an error saying the table does not exist.