The Snowflake batch materialization engine provides a highly scalable and parallel execution engine using a Snowflake Warehouse for batch materializations operations (materialize
and materialize-incremental
) when using a SnowflakeSource
.
The engine requires no additional configuration other than for you to supply Snowflake's standard login and context details. The engine leverages custom (automatically deployed for you) Python UDFs to do the proper serialization of your offline store data to your online serving tables.
When using all three options together, snowflake.offline
, snowflake.engine
, and snowflake.online
, you get the most unique experience of unlimited scale and performance + governance and data security.
The Spark batch materialization engine is considered alpha status. It relies on the offline store to output feature values to S3 via to_remote_storage
, and then loads them into the online store.
See SparkMaterializationEngine for configuration options.
The AWS Lambda batch materialization engine is considered alpha status. It relies on the offline store to output feature values to S3 via to_remote_storage
, and then loads them into the online store.
See LambdaMaterializationEngineConfig for configuration options.
See also Dockerfile for a Dockerfile that can be used below with materialization_image
.
Please see Batch Materialization Engine for an explanation of batch materialization engines.