feathub issues

Reorganize FlinkProcessor tests after resource leak is resolved

The organization of test files in FlinkProcessor's tests needs to be improved after Flink's resource leak problem is resolved. Blocked by Flink ticket: https://issues.apache.org/jira/browse/FLINK-30258

yunfengzhou-hub

type:improvement

Replace input data with keys in OnlineStoreClient#get

yunfengzhou-hub

type:improvement

Support python install all dependencies

Add instructions to install "./python[all]" after the dependency conflict between PyFlink and PySpark is resolved.

yunfengzhou-hub

type:improvement

Provide examples showing the advantage of python SDK over SQL

yunfengzhou-hub

type:improvement

Only invoke the corresponding setUpClass method during IT tests

feathub_it_test_base.py:

```python
# TODO: only invoke the corresponding base class's setUpClass()
# method to reduce resource consumption.
@classmethod
def invoke_all_base_class_setupclass(cls):
    for base_class in cls.__bases__:
        if issubclass(base_class, unittest.TestCase):
            base_class.setUpClass()
```

yunfengzhou-hub

type:improvement

Identify whether Sink#to_json is needed

Currently, `Sink#to_json` is not used in production or test code. We need to determine whether this method is useful. If so, we need to add test cases to verify...

yunfengzhou-hub

type:improvement

Refactor props in Registry#build_features

The `props` parameter in `Registry#build_features` should be moved elsewhere so that job-specific global properties do not affect the feature descriptors saved in the Registry.

yunfengzhou-hub

type:improvement

SparkProcessor supports reusing intermediate results

Optimize `SparkProcessor#materialize_features`'s performance by reusing intermediate results.
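The idea of reusing intermediate results can be sketched independently of Spark: when several features depend on the same upstream computation, evaluate it once and share the result. The `Node` class and `evaluate` function below are hypothetical stand-ins, not feathub or Spark APIs, and the cache assumes node names are unique. In PySpark, a comparable mechanism would typically be `DataFrame.cache()` or `DataFrame.persist()`, though whether that fits `SparkProcessor` is not established here.

```python
from typing import Any, Callable, Dict, List


class Node:
    """Hypothetical plan node: a named computation over upstream nodes."""

    def __init__(
        self, name: str, fn: Callable[..., Any], inputs: List["Node"]
    ) -> None:
        self.name = name
        self.fn = fn
        self.inputs = inputs


def evaluate(node: Node, cache: Dict[str, Any], runs: Dict[str, int]) -> Any:
    """Evaluate node, reusing any result already stored in cache."""
    if node.name in cache:
        return cache[node.name]
    args = [evaluate(upstream, cache, runs) for upstream in node.inputs]
    result = node.fn(*args)
    runs[node.name] = runs.get(node.name, 0) + 1  # count real computations
    cache[node.name] = result
    return result


# "source" feeds two intermediate nodes that are later combined.
source = Node("source", lambda: [1, 2, 3, 4], [])
total = Node("total", lambda rows: sum(rows), [source])
count = Node("count", lambda rows: len(rows), [source])
average = Node("average", lambda t, c: t / c, [total, count])

cache: Dict[str, Any] = {}
runs: Dict[str, int] = {}
result = evaluate(average, cache, runs)
print(result, runs["source"])  # prints 2.5 1
```

Without the cache, `source` would be computed twice, once for `total` and once for `count`.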

yunfengzhou-hub

type:improvement

Add test case to verify SparkJob completed exceptionally

```python
class SparkJob(ProcessorJob):
    """Represent a Spark job."""

    def __init__(
        self,
        job_future: Future,
    ) -> None:
        super().__init__()
        self._job_future = job_future

    # TODO: Add test case to verify this method's behavior when...
```
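A test for the exceptional-completion path could look roughly like the sketch below. It uses only `unittest` and `concurrent.futures.Future` from the standard library. `FakeJob` and its `wait` method are hypothetical stand-ins, since the real `SparkJob` methods are not shown in full in this issue.

```python
import unittest
from concurrent.futures import Future
from typing import Any, Optional


class FakeJob:
    """Hypothetical stand-in for a job that wraps a Future."""

    def __init__(self, job_future: Future) -> None:
        self._job_future = job_future

    def wait(self, timeout_sec: Optional[float] = None) -> Any:
        # Future.result() re-raises the exception stored in the future.
        return self._job_future.result(timeout=timeout_sec)


class FakeJobTest(unittest.TestCase):
    def test_wait_raises_when_job_failed(self) -> None:
        # Simulate a job that has already completed exceptionally.
        future: Future = Future()
        future.set_exception(RuntimeError("job failed"))
        job = FakeJob(future)

        with self.assertRaises(RuntimeError) as ctx:
            job.wait(timeout_sec=1)
        self.assertEqual(str(ctx.exception), "job failed")


# Run the test case programmatically so the outcome can be inspected.
suite = unittest.defaultTestLoader.loadTestsFromTestCase(FakeJobTest)
outcome = unittest.TextTestRunner(verbosity=0).run(suite)
```

Setting the exception on the future directly avoids starting a real job, which keeps the test fast and deterministic.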

yunfengzhou-hub

type:improvement

Support VALUE_COUNTS and COLLECT_LIST in SparkProcessor

yunfengzhou-hub

type:feature
