The Bytewax batch materialization engine runs batch materialization operations (materialize and materialize-incremental).
Guide
To use the Bytewax materialization engine, you need a Kubernetes cluster running version 1.22.10 or later.
Kubernetes Authentication
The Bytewax materialization engine loads authentication and cluster information from the kubeconfig file. By default, kubectl looks for a file named config in the $HOME/.kube directory. You can specify other kubeconfig files by setting the KUBECONFIG environment variable.
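As a brief illustration of the override, the following shell sketch points the current session at an alternative kubeconfig. The file name `feast-materialization-config` is a hypothetical placeholder, not something the engine requires.

```shell
# Use a single, non-default kubeconfig for this shell session.
# The file name below is a hypothetical placeholder.
export KUBECONFIG="$HOME/.kube/feast-materialization-config"

# On Linux and macOS, several kubeconfig files can be listed,
# separated by colons; kubectl merges them when it loads its configuration.
export KUBECONFIG="$HOME/.kube/config:$HOME/.kube/feast-materialization-config"

# Print the value that is now in effect.
echo "KUBECONFIG=$KUBECONFIG"
```

Running `kubectl config current-context` afterwards is one way to confirm which cluster the session will talk to.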
Resource Authentication
Bytewax jobs can be configured to read Kubernetes secrets as environment variables, which lets them access online and offline stores during job runs.
To configure secrets, first create them using kubectl:
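A minimal sketch of this step, assuming the job needs AWS credentials. The secret name `aws-credentials`, the `bytewax` namespace, the key names, and the `xxx` values are all illustrative placeholders.

```shell
# Create a generic secret holding two key/value pairs.
# Secret name, namespace, keys, and values are placeholders; substitute your own.
kubectl create secret generic aws-credentials \
  --namespace bytewax \
  --from-literal=aws-access-key-id='xxx' \
  --from-literal=aws-secret-access-key='xxx'
```

The secret should live in the same namespace the materialization jobs run in, since a pod can only reference secrets from its own namespace.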
Then configure them in the batch_engine section of feature_store.yaml:
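One possible shape for this section, assuming the engine's `env` list accepts standard Kubernetes `valueFrom.secretKeyRef` entries. The secret name `aws-credentials`, its keys, and the `bytewax` namespace are hypothetical and must match whatever secret you actually created.

```yaml
batch_engine:
  type: bytewax
  namespace: bytewax
  env:
    # Each entry exposes one key of a Kubernetes secret as an environment variable.
    - name: AWS_ACCESS_KEY_ID
      valueFrom:
        secretKeyRef:
          name: aws-credentials        # hypothetical secret name
          key: aws-access-key-id       # hypothetical key within that secret
    - name: AWS_SECRET_ACCESS_KEY
      valueFrom:
        secretKeyRef:
          name: aws-credentials
          key: aws-secret-access-key
```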
Configuration
The Bytewax materialization engine is configured through the feature_store.yaml configuration file:
The namespace configuration directive specifies the Kubernetes namespace in which jobs, services, and configuration maps will be created.
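A minimal sketch of such a section is shown here. The `bytewax` namespace and the image name are example values chosen for illustration, so adjust them to your cluster and registry.

```yaml
batch_engine:
  type: bytewax                         # selects the Bytewax materialization engine
  namespace: bytewax                    # example namespace
  image: bytewax/bytewax-feast:latest   # example image reference
```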
Building a custom Bytewax Docker image
The image configuration directive specifies which container image to use when running the materialization job. To create a custom image based on this container, run the following command:
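A hedged sketch of the build and push steps. It assumes the command is run from the root of a Feast source checkout and that the Bytewax Dockerfile sits at the path shown; both the path and the `registry.example.com` image tag are assumptions to adapt.

```shell
# Hypothetical tag; include your own registry so the image can be pushed.
IMAGE_TAG="registry.example.com/feast/bytewax-feast:custom"

# Build with BuildKit enabled. The Dockerfile path is an assumption about
# the repository layout; adjust it if your checkout differs.
DOCKER_BUILDKIT=1 docker build . \
  -f ./sdk/python/feast/infra/materialization/contrib/bytewax/Dockerfile \
  -t "$IMAGE_TAG"

# Push the image so the Kubernetes cluster can pull it.
docker push "$IMAGE_TAG"
```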
Once that image is built and pushed to a registry, it can be specified as a part of the batch engine configuration:
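For instance, assuming the custom image was pushed under the hypothetical tag `registry.example.com/feast/bytewax-feast:custom`, the configuration might look like this:

```yaml
batch_engine:
  type: bytewax
  namespace: bytewax
  image: registry.example.com/feast/bytewax-feast:custom   # hypothetical custom image
```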