
Snowplow Open Source

Build, deploy, and scale your next data creation project using Snowplow.


What is Snowplow Open Source?

Snowplow is the leading open source Data Creation technology in the world today, and the third most adopted web tracker, behind Google Analytics and Facebook.

With our unique approach to Data Creation, which extends far beyond web behavioral analytics, users can manage customer data in a highly secure first-party environment. Only Snowplow has the ability to maintain a Universal Data Language, irrespective of where data is generated—reducing the time and complexity of cleaning and transforming data.

Snowplow’s open source schema registry, Iglu, ensures that the data you create adheres to its defined schemas, and lets you manage schema updates methodically as your data needs evolve.
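To make the schema versioning idea concrete, here is a minimal sketch of how an Iglu schema URI can be read. Iglu URIs take the form iglu:{vendor}/{name}/{format}/{MODEL-REVISION-ADDITION}. The vendor com.acme, the schema name button_click, and the helper names below are hypothetical, and this is an illustration rather than Iglu's own client code.

```python
import re
from typing import NamedTuple

# Pattern for iglu:{vendor}/{name}/{format}/{MODEL-REVISION-ADDITION}.
IGLU_URI = re.compile(
    r"^iglu:(?P<vendor>[A-Za-z0-9_.-]+)/(?P<name>[A-Za-z0-9_-]+)/"
    r"(?P<format>[A-Za-z0-9_-]+)/"
    r"(?P<model>[1-9][0-9]*)-(?P<revision>[0-9]+)-(?P<addition>[0-9]+)$"
)


class SchemaKey(NamedTuple):
    vendor: str
    name: str
    format: str
    model: int
    revision: int
    addition: int


def parse_iglu_uri(uri: str) -> SchemaKey:
    """Split an Iglu schema URI into its parts, or raise ValueError."""
    match = IGLU_URI.match(uri)
    if match is None:
        raise ValueError(f"not a valid Iglu schema URI: {uri!r}")
    return SchemaKey(
        match["vendor"],
        match["name"],
        match["format"],
        int(match["model"]),
        int(match["revision"]),
        int(match["addition"]),
    )


def change_kind(old: SchemaKey, new: SchemaKey) -> str:
    """Report which version component differs first between two schema keys."""
    if new.model != old.model:
        return "model"      # incompatible with existing data
    if new.revision != old.revision:
        return "revision"   # may be incompatible with some existing data
    if new.addition != old.addition:
        return "addition"   # compatible with all existing data
    return "none"


if __name__ == "__main__":
    v1 = parse_iglu_uri("iglu:com.acme/button_click/jsonschema/1-0-0")
    v2 = parse_iglu_uri("iglu:com.acme/button_click/jsonschema/1-0-1")
    print(change_kind(v1, v2))
```

Comparing version components this way is what allows a schema update to be classified before it is rolled out.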

Create and consume a powerful data asset

Generate a rich data set

Get up and running fast with 21 out-of-the-box trackers for web, mobile, and server-side events, scalable to billions of events per day.

Enrich and validate your data

Create your own custom events and entity schemas, and capture an unlimited number of properties with each event. Enrich data in real-time with 16 out-of-the-box enrichments, or create your own custom enrichment during event creation and validation.
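As a rough illustration of custom events and entities, the sketch below builds self-describing JSON: each object names the schema it conforms to alongside its data. The schema URIs, field names, and the outer "event"/"entities" wrapper are hypothetical and chosen only for readability; this is not the exact payload a Snowplow tracker sends.

```python
import json


def self_describing(schema_uri: str, data: dict) -> dict:
    """Pair a data object with the URI of the schema that describes it."""
    return {"schema": schema_uri, "data": data}


# A hypothetical custom event: a click on a checkout button.
event = self_describing(
    "iglu:com.acme/button_click/jsonschema/1-0-0",
    {"button_id": "checkout", "label": "Buy now"},
)

# A hypothetical entity attached to the event: the product being viewed.
product = self_describing(
    "iglu:com.acme/product/jsonschema/1-0-0",
    {"sku": "SKU-123", "price": 19.99},
)

# Illustrative wrapper only: one event plus any number of entities.
payload = {"event": event, "entities": [product]}

if __name__ == "__main__":
    print(json.dumps(payload, indent=2))
```

Because every object carries its own schema reference, each event and entity can be validated independently against the registry.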

Model your data

Model your data incrementally in your warehouse or lake using out-of-the-box web and mobile dbt models, which you can customize and extend. The result is BI- and AI-ready data you can use to derive insight or predict customer behavior.

Deploy at scale

In less than an hour, your data will flow through Snowplow’s open source pipeline to your real-time streams, data lake, and data warehouse, with unopinionated access that lets you create data and integrate with your existing investments.

Trusted by thousands of organizations worldwide

Architected to deliver high-quality behavioral data at scale

Dynamically manage and update schemas

Create your own schemas, and automatically handle schema migrations with our Iglu schema registry and warehouse loader technology.

Control and obfuscate PII data

Collect fully anonymous data with our JavaScript tracker, or obfuscate and remove PII with our configurable anonymization options.
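One common way to obfuscate PII is to replace each sensitive value with a salted hash, so records can still be joined on the field without exposing the original value. The sketch below shows that general technique only; the field names, the salt handling, and the function names are assumptions, and it does not represent Snowplow's own implementation or configuration format.

```python
import hashlib

# Hypothetical set of fields treated as PII in this example.
PII_FIELDS = frozenset({"user_id", "email"})


def pseudonymize(value: str, salt: str) -> str:
    """Return the hex SHA-256 digest of the value with the salt appended."""
    return hashlib.sha256((value + salt).encode("utf-8")).hexdigest()


def obfuscate_event(event: dict, salt: str, pii_fields=PII_FIELDS) -> dict:
    """Return a copy of the event with non-null PII field values hashed."""
    return {
        key: pseudonymize(str(value), salt)
        if key in pii_fields and value is not None
        else value
        for key, value in event.items()
    }


if __name__ == "__main__":
    raw = {"email": "jane@example.com", "page": "/pricing", "user_id": None}
    print(obfuscate_event(raw, salt="example-salt"))
```

The same input and salt always produce the same digest, which keeps obfuscated values usable as join keys; in practice the salt would be kept secret.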

Roll out tracking with confidence

Catch data quality issues before they hit production. Validate data in real time in dev environments with Snowplow Mini, and write automated tests against your tracking with Snowplow Micro.

Use out-of-the-box data models

Query data directly in your BI tool or feed it into your machine learning models with our performant web and mobile data models, which deliver aggregated tables by user, session, web page, or mobile screen.

Fully observe your pipeline

Gain full observability and monitoring over the behavioral data ingestion process, with every microservice publishing latency and volume metrics.

Access new features

With three or more updates released each month, Snowplow Open Source is considered the core of Snowplow Data Creation.

Get started today with Snowplow Open Source

Quick Start

Set up your pipeline in around an hour with the Open Source Quick Start.

View the Quick Start


Discover the Snowplow Open Source codebase on GitHub.

Explore GitHub Codebase


Have additional questions? Take a look at our comprehensive documentation.

Read the documentation

Discover the enterprise-ready platform

Scale Data Creation across your organization with Snowplow Behavioral Data Platform.