Snowplow Self-Hosted Pipeline

Last updated: 16 April 2025

This Snowplow Product Directory provides descriptions of the Self-Hosted Pipeline products and services you have subscribed to on an Order Form or enabled as part of a pilot service. While our Self-Hosted Pipeline Product Directory may be updated from time to time, the descriptions of the products as of the Start Date in your Order Form will apply to the Self-Hosted Pipeline Products or Services specified in your Order Form. If new terms are introduced for new features or functions made available within a Self-Hosted Pipeline Product or Service during the Term of your Agreement, these new terms will apply to the use of those new features or functions if you use them.

Product Overview:

The Snowplow Self-Hosted Pipeline product allows companies to collect and operationalize behavioral data using Snowplow’s core behavioral data pipeline in production.

The Self-Hosted Pipeline enables organizations to collect behavioral data across multiple applications (through 20+ SDKs including Javascript, iOS, Android, Python and more), validate and enrich that data in real time, and stream it into your warehouse, data lake, and other destinations (e.g. Snowflake, Databricks, BigQuery, Redshift, OneLake and others).

Our Self-Hosted Pipeline license agreement enables customers to run a single Snowplow pipeline in production, removing the non-commercial use restriction defined under the terms of our SLULA 1.1 (Snowplow Limited Use License Agreement). The specific terms applicable to your use of the Self-Hosted Pipeline are detailed in your Order Form and License Agreement which you must sign.

Key features

Running Snowplow in production

Snowplow Self-Hosted Pipeline is a commercial license, enabling customers to run one Snowplow pipeline in production.

Customer Support

There are two tiers of 24×7 customer support available to Snowplow Self-Hosted Pipeline customers: Standard and Enhanced Support.

Self-Hosted Pipeline customers also have access to our help center, knowledge base, community forum and product documentation.

Critical CVE updates

The Self-Hosted Pipeline product provides (no less than quarterly) updates to patch critical CVEs.

Data Models

Data Model Packs allow companies using Snowplow Self-Hosted Pipeline to use Snowplow’s dbt data models to manage and model your behavioral data, further described in the table below.

With a subscription to Data Model Packs, customers may use the following dbt models for commercial purposes, removing the non-commercial use restriction defined under the terms of our SLULA 1.1 (Snowplow Limited Use License Agreement):

Data Model Packs

The Digital Analytics Data Model Pack is comprised of data models for the following use cases:

  1. User and Marketing Analytics: Understand your customer engagement with digital channels.
  2. Marketing Attribution: Understand the impact of different marketing channels on conversions and traffic levels.
  3. Funnel Analytics: Understand the sequential steps users take toward a specific goal, identifying drop-off points and optimizing the user journey for higher conversions.
  4. Video and Media Analytics: Understand engagement with video, audio, and streaming content, including clicks through to conversions and advertisements.

The Data Model Pack includes data models (written using dbt open source software) that aggregate the underlying event-level data in the cloud data destination into AI and Business Intelligence - ready tables. Example tables include a user-level table, a session-level table, and a pageview-level table. These tables can be used and customized by you to perform more sophisticated analytics and AI. The included dbt packages are: 

  • Unified Digital: Understand user behavior across web and mobile apps 
  • Attribution: attribute conversions and revenue through multiple attribution methods: 
  • Media Player: calculate aggregate play and ad statistics across video, audio and streaming content
  • Normalize: filter and flatten your event data into format more suitable for downstream applications 
  • Utils: contains our base processing logic for all other packages"

The data models implement several data processing steps, including but not limited to:

  • Deduplicating the underlying event data
  • Stitching user identities across different platforms and channels (e.g., web and mobile)
  • Accurately calculating time spent engaging with different content items (e.g., web pages, mobile screens)
  • Sessionizing the data

The data models aggregate the data in a performant, incremental fashion which may reduce your cost of data processing and increase the speed of data delivery. The data models are extendable and run in your selected cloud data destination.

The Ecommerce Analytics Data Model Pack includes the underlying dbt models (written using dbt open source software) in the Digital Analytics Data Model Pack, but also includes an associated ecommerce dbt model to help understand and optimize a digital shopping experience. 

The ecommerce dbt package creates AI and Business Intelligence-ready tables describing carts, checkouts, product performance, transactions, and sessions. 

Product Options

The following options are available to Snowplow Self-Hosted Pipeline customers and can be added to your Order Form.

NameMax vCPU (*)SupportNotes
Self-Hosted Pipeline (20 vCPU)20StandardCan be run in production with one loader
Self-Hosted Pipeline (20 vCPU) with Enhanced Support20EnhancedEnhanced support with higher SLAs
Self-Hosted Pipeline (21-40 vCPU)40Enhanced
Self-Hosted Pipeline (Unlimited vCPU)UnlimitedEnhanced

vCPUs

Product options are based on how many vCPUs a Self-Hosted Pipeline customer runs across all SLULA-licensed pipeline components, where:

  • A vCPU is any logical CPU core as reported by the host operating system. A vCPU can represent a physical core, a logical core created through the use of hyper-threading or simultaneous multithreading, or a shared logical CPU provided through virtualization
  • Max vCPUs indicates the maximum number of vCPUs permitted to use across the term of the contract and will be stated in your Order Form.

Out of Scope

The following are not included in any part of a Snowplow Self-Hosted Pipeline Product Option.

  1. Event Forwarding – the Self-Hosted Pipeline offering does not confer the right to run SLULA-licensed Event Forwarding instances
  2. Any managed service (e.g. upgrades, alerting, or services specifically forming part of the Snowplow BDP product)
  3. Any form of hosting
  4. Technical Account Manager
  5. Professional Services or implementation
  6. Support for non-standard components or forked components
  7. Anything explicitly stated as out of scope in our Statement of Support