Lack of Spark Support for On-Demand Feature Views

Open FabioTomaz opened this issue 6 months ago • 2 comments

Currently, stream feature views support both spark and spark_sql execution modes, which enables them to scale efficiently with large datasets. However, on-demand feature views are limited to pandas or python modes only.

This limitation becomes a bottleneck when trying to perform transformations on features that haven't been precomputed, especially in cases where those transformations need to scale to large data volumes. Without Spark support, users are forced to rely on local computation, which significantly hinders scalability and performance.

Jun 29 '25 23:06 FabioTomaz

Team is working on it 👍

Aug 10 '25 16:08 HaoXuAI

A temporary hack is to use the ODFV as a UDF. That's ultimately what the real implementation will look like.

Aug 10 '25 23:08 franciscojavierarceo