File

Description

The File offline store provides support for reading FileSources.

  • Only Parquet files are currently supported.

  • All data is downloaded and joined using Python and may not scale to production workloads.

Example

Configuration options are available here.

feature_store.yaml
project: my_feature_repo
registry: data/registry.db
provider: local
offline_store:
  type: file
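Because the file store downloads all data and joins it in Python, the point-in-time join it performs can be sketched with pandas `merge_asof`. This is only an illustration of the join semantics, not Feast's actual implementation; the `driver_id` and `conv_rate` names and all values are made up:

```python
import pandas as pd

# Feature values recorded at various times for each driver (hypothetical data).
features = pd.DataFrame({
    "driver_id": [1001, 1001, 1002],
    "event_timestamp": pd.to_datetime(
        ["2021-04-10 12:00", "2021-04-12 09:00", "2021-04-11 15:00"]
    ),
    "conv_rate": [0.5, 0.7, 0.4],
})

# Entity rows: for each driver, fetch the latest feature value as of this time.
entity_df = pd.DataFrame({
    "driver_id": [1001, 1002],
    "event_timestamp": pd.to_datetime(["2021-04-12 10:00", "2021-04-12 10:00"]),
})

# merge_asof requires both frames sorted on the time key; direction="backward"
# picks the most recent feature row at or before each entity timestamp.
result = pd.merge_asof(
    entity_df.sort_values("event_timestamp"),
    features.sort_values("event_timestamp"),
    on="event_timestamp",
    by="driver_id",
    direction="backward",
)
```

Here driver 1001 gets the 09:00 value (0.7) rather than the older 12:00 one, which is exactly the "latest value as of the entity timestamp" behavior a point-in-time join guarantees.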

Snowflake

Description

The Snowflake offline store provides support for reading SnowflakeSources.

  • Snowflake tables and views are allowed as sources.

  • All joins happen within Snowflake.

  • Entity dataframes can be provided as a SQL query or as a Pandas dataframe. Pandas dataframes will be uploaded to Snowflake in order to complete join operations.

  • A SnowflakeRetrievalJob is returned when calling get_historical_features(). This allows you to call:

    • to_snowflake to save the dataset into Snowflake.

    • to_sql to get the SQL query that would execute on to_df.

    • to_arrow_chunks to get the result in batches.

Example

Configuration options are available in SnowflakeOfflineStoreConfig.

feature_store.yaml
project: my_feature_repo
registry: data/registry.db
provider: local
offline_store:
  type: snowflake.offline
  account: snowflake_deployment.us-east-1
  user: user_login
  password: user_password
  role: sysadmin
  warehouse: demo_wh
  database: FEAST

For details on the connection parameters, see the Snowflake python connector docs.
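The two accepted forms of entity dataframe can be sketched as follows. The `driver_id` column and the table name are illustrative; a SQL string is executed directly inside Snowflake, while a Pandas dataframe is first uploaded:

```python
import pandas as pd

# Form 1: a Pandas dataframe. Feast uploads it to Snowflake before joining.
# The entity key column (driver_id) and the timestamps here are made up.
entity_df = pd.DataFrame({
    "driver_id": [1001, 1002, 1003],
    "event_timestamp": pd.to_datetime([
        "2022-05-01 10:00:00",
        "2022-05-01 11:00:00",
        "2022-05-01 12:00:00",
    ]),
})

# Form 2: a SQL query string. The join then happens entirely inside Snowflake,
# with no upload step. The fully qualified table name is hypothetical.
entity_sql = "SELECT driver_id, event_timestamp FROM my_db.my_schema.entity_rows"
```

Either value is then passed as the entity_df argument of get_historical_features().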

PostgreSQL (contrib)

Description

The PostgreSQL offline store provides support for reading PostgreSQL data sources.

DISCLAIMER: This PostgreSQL offline store still does not achieve full test coverage.

  • Entity dataframes can be provided as a SQL query or as a Pandas dataframe. Pandas dataframes will be uploaded to PostgreSQL in order to complete join operations.

  • A PostgreSQLRetrievalJob is returned when calling get_historical_features(). This allows you to call:

    • to_df to retrieve the pandas dataframe.

    • to_arrow to retrieve the dataframe as a PyArrow table.

    • to_sql to get the SQL query used to pull the features.

  • sslmode, sslkey_path, sslcert_path, and sslrootcert_path are optional.

Example

feature_store.yaml
project: my_project
registry: data/registry.db
provider: local
offline_store:
  type: postgres
  host: DB_HOST
  port: DB_PORT
  database: DB_NAME
  db_schema: DB_SCHEMA
  user: DB_USERNAME
  password: DB_PASSWORD
  sslmode: verify-ca
  sslkey_path: /path/to/client-key.pem
  sslcert_path: /path/to/client-cert.pem
  sslrootcert_path: /path/to/server-ca.pem
online_store:
  path: data/online_store.db
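How Feast assembles the connection from these settings is internal to the plugin, but what the optional ssl fields map to can be illustrated with a hypothetical libpq-style DSN builder (`make_dsn` is not a Feast API; key names on the input mirror the feature_store.yaml block above):

```python
# Hypothetical helper (not part of Feast): assemble a libpq-style DSN from
# settings shaped like the offline_store block above, to show which libpq
# connection parameters the optional ssl settings correspond to.
def make_dsn(cfg: dict) -> str:
    parts = [
        f"host={cfg['host']}",
        f"port={cfg['port']}",
        f"dbname={cfg['database']}",
        f"user={cfg['user']}",
        f"password={cfg['password']}",
    ]
    # The four ssl settings are optional; include them only when present.
    for yaml_key, libpq_key in [
        ("sslmode", "sslmode"),
        ("sslkey_path", "sslkey"),
        ("sslcert_path", "sslcert"),
        ("sslrootcert_path", "sslrootcert"),
    ]:
        if yaml_key in cfg:
            parts.append(f"{libpq_key}={cfg[yaml_key]}")
    return " ".join(parts)

dsn = make_dsn({
    "host": "DB_HOST",
    "port": 5432,
    "database": "DB_NAME",
    "user": "DB_USERNAME",
    "password": "DB_PASSWORD",
    "sslmode": "verify-ca",
    "sslrootcert_path": "/path/to/server-ca.pem",
})
```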

BigQuery

Description

The BigQuery offline store provides support for reading BigQuerySources.

  • BigQuery tables and views are allowed as sources.

  • All joins happen within BigQuery.

  • Entity dataframes can be provided as a SQL query or as a Pandas dataframe. Pandas dataframes will be uploaded to BigQuery in order to complete join operations.

  • A BigQueryRetrievalJob is returned when calling get_historical_features().

Example

Configuration options are available here.

feature_store.yaml
project: my_feature_repo
registry: gs://my-bucket/data/registry.db
provider: gcp
offline_store:
  type: bigquery
  dataset: feast_bq_dataset

Redshift

Description

The Redshift offline store provides support for reading RedshiftSources.

  • Redshift tables and views are allowed as sources.

  • All joins happen within Redshift.

  • Entity dataframes can be provided as a SQL query or as a Pandas dataframe. Pandas dataframes will be uploaded to Redshift in order to complete join operations.

  • A RedshiftRetrievalJob is returned when calling get_historical_features().

Example

Configuration options are available here.

feature_store.yaml
project: my_feature_repo
registry: data/registry.db
provider: aws
offline_store:
  type: redshift
  region: us-west-2
  cluster_id: feast-cluster
  database: feast-database
  user: redshift-user
  s3_staging_location: s3://feast-bucket/redshift
  iam_role: arn:aws:iam::123456789012:role/redshift_s3_access_role

Permissions

Feast requires the following permissions in order to execute commands for the Redshift offline store:

Command                 | Permissions                                                    | Resources
Apply                   | redshift-data:DescribeTable, redshift:GetClusterCredentials    | arn:aws:redshift:<region>:<account_id>:dbuser:<redshift_cluster_id>/<redshift_username>, arn:aws:redshift:<region>:<account_id>:dbname:<redshift_cluster_id>/<redshift_database_name>, arn:aws:redshift:<region>:<account_id>:cluster:<redshift_cluster_id>
Materialize             | redshift-data:ExecuteStatement                                 | arn:aws:redshift:<region>:<account_id>:cluster:<redshift_cluster_id>
Materialize             | redshift-data:DescribeStatement                                | *
Materialize             | s3:ListBucket, s3:GetObject, s3:DeleteObject                   | arn:aws:s3:::<bucket_name>, arn:aws:s3:::<bucket_name>/*
Get Historical Features | redshift-data:ExecuteStatement, redshift:GetClusterCredentials | arn:aws:redshift:<region>:<account_id>:dbuser:<redshift_cluster_id>/<redshift_username>, arn:aws:redshift:<region>:<account_id>:dbname:<redshift_cluster_id>/<redshift_database_name>, arn:aws:redshift:<region>:<account_id>:cluster:<redshift_cluster_id>
Get Historical Features | redshift-data:DescribeStatement                                | *
Get Historical Features | s3:ListBucket, s3:GetObject, s3:PutObject, s3:DeleteObject     | arn:aws:s3:::<bucket_name>, arn:aws:s3:::<bucket_name>/*

The following inline policy can be used to grant Feast the necessary permissions:

{
    "Statement": [
        {
            "Action": [
                "s3:ListBucket",
                "s3:PutObject",
                "s3:GetObject",
                "s3:DeleteObject"
            ],
            "Effect": "Allow",
            "Resource": [
                "arn:aws:s3:::<bucket_name>/*",
                "arn:aws:s3:::<bucket_name>"
            ]
        },
        {
            "Action": [
                "redshift-data:DescribeTable",
                "redshift:GetClusterCredentials",
                "redshift-data:ExecuteStatement"
            ],
            "Effect": "Allow",
            "Resource": [
                "arn:aws:redshift:<region>:<account_id>:dbuser:<redshift_cluster_id>/<redshift_username>",
                "arn:aws:redshift:<region>:<account_id>:dbname:<redshift_cluster_id>/<redshift_database_name>",
                "arn:aws:redshift:<region>:<account_id>:cluster:<redshift_cluster_id>"
            ]
        },
        {
            "Action": [
                "redshift-data:DescribeStatement"
            ],
            "Effect": "Allow",
            "Resource": "*"
        }
    ],
    "Version": "2012-10-17"
}

In addition to this, the Redshift offline store requires an IAM role that will be used by Redshift itself to interact with S3. More concretely, Redshift has to use this IAM role to run UNLOAD and COPY commands. Once created, this IAM role needs to be configured in the feature_store.yaml file as offline_store: iam_role.

The following inline policy can be used to grant Redshift the necessary permissions to access S3:

{
    "Statement": [
        {
            "Action": "s3:*",
            "Effect": "Allow",
            "Resource": [
                "arn:aws:s3:::feast-integration-tests",
                "arn:aws:s3:::feast-integration-tests/*"
            ]
        }
    ],
    "Version": "2012-10-17"
}

The following trust relationship is necessary to make sure that Redshift, and only Redshift, can assume this role:

{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Effect": "Allow",
      "Principal": {
        "Service": "redshift.amazonaws.com"
      },
      "Action": "sts:AssumeRole"
    }
  ]
}
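The <bucket_name> placeholders in the policies above have to be substituted with a real bucket before the policy is attached. A small sketch that renders the S3 access policy for a concrete bucket using only the standard library (the bucket name is illustrative):

```python
import json

def s3_access_policy(bucket: str) -> str:
    """Render the inline S3 policy above for a concrete bucket name."""
    policy = {
        "Statement": [
            {
                "Action": "s3:*",
                "Effect": "Allow",
                # Both the bucket itself and every object inside it.
                "Resource": [
                    f"arn:aws:s3:::{bucket}",
                    f"arn:aws:s3:::{bucket}/*",
                ],
            }
        ],
        "Version": "2012-10-17",
    }
    return json.dumps(policy, indent=2)

print(s3_access_policy("feast-bucket"))
```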

Spark (contrib)

Description

The Spark offline store is an offline store currently in alpha development that provides support for reading SparkSources.

Disclaimer

This Spark offline store still does not achieve full test coverage and continues to fail some integration tests when integrating with the feast universal test suite. Please do NOT assume complete stability of the API.

  • Spark tables and views are allowed as sources that are loaded in from some Spark store (e.g. in Hive or in memory).

  • Entity dataframes can be provided as a SQL query or as a Pandas dataframe. Pandas dataframes will be converted to a Spark dataframe and processed as a temporary view.

  • A SparkRetrievalJob is returned when calling get_historical_features(). This allows you to call:

    • to_df to retrieve the pandas dataframe.

    • to_arrow to retrieve the dataframe as a PyArrow table.

    • to_spark_df to retrieve the dataframe as a Spark dataframe.

Example

feature_store.yaml
project: my_project
registry: data/registry.db
provider: local
offline_store:
    type: spark
    spark_conf:
        spark.master: "local[*]"
        spark.ui.enabled: "false"
        spark.eventLog.enabled: "false"
        spark.sql.catalogImplementation: "hive"
        spark.sql.parser.quotedRegexColumnNames: "true"
        spark.sql.session.timeZone: "UTC"
online_store:
    path: data/online_store.db

Offline stores

Please see Offline Store for an explanation of offline stores.

  • File
  • Snowflake
  • BigQuery
  • Redshift
  • Spark (contrib)
  • PostgreSQL (contrib)