1 of 5

Batch Materialization Engines

Please see for an explanation of batch materialization engines.

Bytewax

Description

The batch materialization engine provides an execution engine for batch materializing operations (materialize and materialize-incremental).

Snowflake

Description

The Snowflake batch materialization engine provides a highly scalable and parallel execution engine using a Snowflake Warehouse for batch materializations operations (materialize and materialize-incremental) when using a SnowflakeSource.

The engine requires no additional configuration other than for you to supply Snowflake's standard login and context details. The engine leverages custom (automatically deployed for you) Python UDFs to do the proper serialization of your offline store data to your online serving tables.

When using all three options together, snowflake.offline, snowflake.engine, and snowflake.online, you get the most unique experience of unlimited scale and performance + governance and data security.

Example

AWS Lambda (alpha)

Description

The AWS Lambda batch materialization engine is considered alpha status. It relies on the offline store to output feature values to S3 via to_remote_storage, and then loads them into the online store.

See LambdaMaterializationEngineConfig for configuration options.

See also Dockerfile for a Dockerfile that can be used below with materialization_image.

Example

Spark (contrib)

Description

The Spark batch materialization engine is considered alpha status. It relies on the offline store to output feature values to S3 via to_remote_storage, and then loads them into the online store.

See for configuration options.

Snowflake

Description

Example

AWS Lambda (alpha)

Description

See LambdaMaterializationEngineConfig for configuration options.

See also Dockerfile for a Dockerfile that can be used below with materialization_image.

Example

feature_store.py

from feast import FeatureStore, RepoConfig
from feast.repo_config import RegistryConfig
from feast.infra.online_stores.dynamodb import DynamoDBOnlineStoreConfig
from feast.infra.offline_stores.contrib.spark_offline_store.spark import SparkOfflineStoreConfig

repo_config = RepoConfig(
    registry="s3://[YOUR_BUCKET]/feast-registry.db",
    project="feast_repo",
    provider="aws",
    offline_store=SparkOfflineStoreConfig(
      spark_conf={
        "spark.ui.enabled": "false",
        "spark.eventLog.enabled": "false",
        "spark.sql.catalogImplementation": "hive",
        "spark.sql.parser.quotedRegexColumnNames": "true",
        "spark.sql.session.timeZone": "UTC"
      }
    ),
    batch_engine={
      "type": "spark.engine",
      "partitions": 10
    },
    online_store=DynamoDBOnlineStoreConfig(region="us-west-1"),
    entity_key_serialization_version=2
)

store = FeatureStore(config=repo_config)

feature_store.py

from feast import FeatureStore, RepoConfig
from feast.repo_config import RegistryConfig
from feast.infra.online_stores.dynamodb import DynamoDBOnlineStoreConfig
from feast.infra.offline_stores.contrib.spark_offline_store.spark import SparkOfflineStoreConfig

repo_config = RepoConfig(
    registry="s3://[YOUR_BUCKET]/feast-registry.db",
    project="feast_repo",
    provider="aws",
    offline_store=SparkOfflineStoreConfig(
      spark_conf={
        "spark.ui.enabled": "false",
        "spark.eventLog.enabled": "false",
        "spark.sql.catalogImplementation": "hive",
        "spark.sql.parser.quotedRegexColumnNames": "true",
        "spark.sql.session.timeZone": "UTC"
      }
    ),
    batch_engine={
      "type": "spark.engine",
      "partitions": 10
    },
    online_store=DynamoDBOnlineStoreConfig(region="us-west-1"),
    entity_key_serialization_version=2
)

store = FeatureStore(config=repo_config)

Batch Materialization Engines

Bytewax

Description

Snowflake

Description

Example

AWS Lambda (alpha)

Description

Example

Spark (contrib)

Description

Batch Materialization Engines

Snowflake

Description

Example

Bytewax

Description

Kubernetes Authentication

Resource Authentication

Configuration

Building a custom Bytewax Docker image

AWS Lambda (alpha)

Description

Example

Spark (contrib)

Description

Example in Python

Batch Materialization Engines

Bytewax

hashtagDescription

Snowflake

hashtagDescription

hashtagExample

AWS Lambda (alpha)

hashtagDescription

hashtagExample

Spark (contrib)

hashtagDescription

Batch Materialization Engines

Snowflake

hashtagDescription

hashtagExample

Bytewax

hashtagDescription

hashtagKubernetes Authentication

hashtagResource Authentication

hashtagConfiguration

hashtagBuilding a custom Bytewax Docker image

AWS Lambda (alpha)

hashtagDescription

hashtagExample

Spark (contrib)

hashtagDescription

hashtagExample in Python

Description

Description

Example

Description

Example

Description

Description

Example

Description

Kubernetes Authentication

Resource Authentication

Configuration

Building a custom Bytewax Docker image

Description

Example

Description

Example in Python