Snowplow Open Source
Build, deploy, and scale your next data creation project using Snowplow.
Total GitHub stars
Hours of open source development
What is Snowplow Open Source?
Snowplow is the leading open source Data Creation technology in the world today, and the third most adopted web tracker, behind Google Analytics and Facebook.
With our unique approach to Data Creation, which extends far beyond web behavioral analytics, users can manage customer data in a highly secure first-party environment. Only Snowplow has the ability to maintain a Universal Data Language, irrespective of where data is generated—reducing the time and complexity of cleaning and transforming data.
Snowplow’s open source schema registry, Iglu, ensures the data created adheres to defined schema definitions, while enabling the methodical management of schema updates as data needs evolve.
Create and consume a powerful data asset
Generate a rich data set
Get up and running fast with 21 out-of-the-box trackers for web, mobile, and server-side events, scalable to billions of events per day.
Enrich and validate your data
Create your own custom events and entity schemas, and capture an unlimited number of properties with each event. Enrich data in real-time with 16 out-of-the-box enrichments, or create your own custom enrichment during event creation and validation.
Model your data
With your data incrementally modeled in your warehouse or lake with either customizable or out-of-the-box, fully extensible, web and mobile dbt models, you are equipped with BI and AI-ready data to derive insight or predict customer behavior.
Deploy at scale
In less than an hour, your data will flow through Snowplow’s open source pipeline to your real-time streams, data lake, and data warehouse, with un-opinionated access to create data and integrate with your existing investments.
Trusted by thousands of organizations worldwide
Architectured to deliver high-quality behavioral data at scale
Dynamically manage and update schemas
Create your own schemas, and automatically handle schema migrations with our Iglu schema-ing and warehouse loader technology.
Control and obfuscate PII data
Roll out tracking with confidence
Catch data quality issues before they hit production. Validate data in real time in dev environments with Snowplow Mini, and write automated tests against your tracking with Snowplow Micro.
Use out-of-the-box data models
Directly query data in your BI tool or ingest in your machine learning model with our performant web and mobile data models, which deliver aggregated tables by user, session, web page, or mobile screen.
Fully observe your pipeline
Gain full observability and monitoring over the behavioral data ingestion process. With every microservice publishing latency and volume metrics.
Access new features
With three or more updates released each month, Snowplow Open Source is considered the core of Snowplow Data Creation.
Get started today with Snowplow Open Source
Set up your pipeline in around an hour with the Open Source Quick Start.
Discover the Snowplow Open Source codebase on GitHub.
Have additional questions? Take a look at our comprehensive documentation.
Discover the enterprise-ready platform
Scale Data Creation across your organization with Snowplow Behavioral Data Platform.