Introducing Git-Backed Management of Snowplow Data Products

By Daniela Howard
December 16, 2024

We're excited to announce the expansion of Snowplow's snowplow-cli capabilities to include data products, providing organizations with comprehensive Git-based workflow management for their entire tracking design process. This release builds on our existing Git-backed workflows for data structures, adding support for source applications, data products, and event specifications.

What's New

Our latest release extends snowplow-cli, our command-line interface (CLI) tool, to cover source applications, data products, and event specifications, making event tracking design local-first and version-control-friendly. Features include:

  • Source Applications: Manage application definitions and configurations as local files that can be versioned in a Git repository (hosted on GitHub, for example)
  • Event Specifications: Version control your event definitions and schemas with templates or documented approval workflows
  • Data Products: Store and manage data contracts in machine-readable formats (YAML/JSON)
  • Synchronization: Changes in Git are reflected in the Snowplow Console UI

Key Benefits

  • Enforce data governance through formal approval workflows and automated compliance checks
  • Enable clear team ownership while facilitating cross-team collaboration
  • Streamline deployments with CI/CD pipeline integration and automated testing

Expanding snowplow-cli to support data products further transforms how data engineering and developer teams manage their event tracking. Teams can maintain strict governance standards through automated assessments, validated templates, and policy compliance checks while encouraging collaboration through familiar pull request workflows.

The integration also modernizes the development process by seamlessly connecting with existing CI/CD pipelines, automating contract validation, and supporting staged deployments. This approach provides analytics engineers and data teams with familiar Git-based workflows for managing data assets and version control.

Example Usage

Please see our documentation for complete information and useful tips: Managing Data Products in Git.
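
To make the idea of an automated contract check in CI more concrete, here is a minimal Python sketch of a pre-merge check over JSON contract files in a repository. It is not snowplow-cli and does not reflect the real data product file format: the required keys (`name`, `owner`, `eventSpecifications`) and the function names are hypothetical, chosen only to illustrate the pattern. Refer to the documentation linked above for the actual format and commands.

```python
import json
from pathlib import Path

# Hypothetical required keys, for illustration only. The real data product
# format is defined in the Snowplow documentation, not here.
REQUIRED_KEYS = ("name", "owner", "eventSpecifications")


def check_contract(contract: dict) -> list[str]:
    """Return a list of human-readable problems found in one contract."""
    problems = [
        f"missing required key: {key}"
        for key in REQUIRED_KEYS
        if key not in contract
    ]
    specs = contract.get("eventSpecifications", [])
    if not isinstance(specs, list):
        problems.append("eventSpecifications must be a list")
    else:
        for i, spec in enumerate(specs):
            # Each entry is assumed to be an object with a non-empty name.
            if not isinstance(spec, dict) or not spec.get("name"):
                problems.append(f"eventSpecifications[{i}] needs a non-empty name")
    return problems


def check_directory(root: Path) -> dict[str, list[str]]:
    """Check every *.json file under root; map each failing file to its problems."""
    results: dict[str, list[str]] = {}
    for path in sorted(root.rglob("*.json")):
        try:
            contract = json.loads(path.read_text(encoding="utf-8"))
        except json.JSONDecodeError as exc:
            results[str(path)] = [f"invalid JSON: {exc.msg}"]
            continue
        if not isinstance(contract, dict):
            results[str(path)] = ["top-level value must be an object"]
            continue
        problems = check_contract(contract)
        if problems:
            results[str(path)] = problems
    return results


def exit_code(root: Path) -> int:
    """Print every problem and return 1 if any file failed, else 0."""
    failures = check_directory(root)
    for file_name, problems in failures.items():
        for problem in problems:
            print(f"{file_name}: {problem}")
    return 1 if failures else 0
```

A CI job could call `sys.exit(exit_code(Path("data-products")))` (the directory name is also an assumption) so that a pull request fails when a contract file is malformed.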

Getting Started

Snowplow’s latest release transforms your data management workflows. Now you can manage your data products directly in your Git repositories, with changes automatically synchronized to Snowplow's Console through our API integrations. This brings version control, code reviews, and collaborative workflows to your data product development. Book a demo to see how our Data Product Studio helps you manage data products with the same rigor as software development.
