forked from 173TECH/sayn

Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).


hustic/sayn

 
 


SAYN logo

SAYN is a modern data processing and modelling framework. Users define tasks (including Python scripts, automated SQL transformations and more) and the relationships between them; SAYN takes care of the rest. It is designed for simplicity, flexibility and centralisation, with the aim of making the data engineering workflow significantly more efficient.

Use Cases

SAYN can be used for multiple purposes across the data engineering and analytics workflows:

  • Data extraction: complement tools such as Fivetran or Stitch with customised extraction processes.
  • Data modelling: transform raw data in your data warehouse (e.g. aggregate activity or sessions, calculate marketing campaign ROI, etc.).
  • Data science: integrate and execute data science models.

Key Features

SAYN has the following key features:

  • YAML-based DAG (Directed Acyclic Graph) creation. This means all analysts, including those who are not proficient in Python, can easily add tasks to ETL processes with SAYN.
  • Automated SQL transformations: write your SELECT statement. SAYN turns it into a table/view and manages everything for you.
  • Jinja parameters: switch easily between development and production environments, among other tricks, with Jinja templating.
  • Python tasks: use Python scripts to complement your extraction and loading layer and build data science models.
  • Multiple databases supported.
  • and much more... See the Documentation.
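
The first two features above (a DAG of tasks, and SELECT statements turned into tables) can be illustrated with a minimal, standard-library-only sketch. This is not SAYN's implementation or its YAML schema: the `TASKS` dictionary, the task names and the helpers `execution_order` and `run_all` are all hypothetical, and SQLite stands in for a data warehouse. It only shows the general idea of ordering tasks by their parents and materialising each SELECT as a table.

```python
import sqlite3
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical task definitions (NOT SAYN's actual YAML schema): each task
# has a SELECT statement and a list of parent tasks that must run first.
TASKS = {
    "daily_sessions": {
        "parents": ["raw_events"],
        "select": "SELECT user_id, COUNT(*) AS sessions "
                  "FROM raw_events GROUP BY user_id",
    },
    "raw_events": {
        "parents": [],
        "select": "SELECT 1 AS user_id UNION ALL SELECT 1 UNION ALL SELECT 2",
    },
    "active_users": {
        "parents": ["daily_sessions"],
        "select": "SELECT user_id FROM daily_sessions WHERE sessions > 1",
    },
}


def execution_order(tasks):
    """Return task names sorted so that every parent precedes its children."""
    graph = {name: set(spec["parents"]) for name, spec in tasks.items()}
    return list(TopologicalSorter(graph).static_order())


def run_all(tasks, conn):
    """Materialise each task's SELECT as a table, in dependency order."""
    order = execution_order(tasks)
    for name in order:
        # Task names come from the trusted dictionary above, so building
        # the statement with an f-string is acceptable in this sketch.
        conn.execute(f"DROP TABLE IF EXISTS {name}")
        conn.execute(f"CREATE TABLE {name} AS {tasks[name]['select']}")
    return order


conn = sqlite3.connect(":memory:")
order = run_all(TASKS, conn)
active = conn.execute("SELECT user_id FROM active_users").fetchall()
print(order)
print(active)
```

Although the tasks are declared out of order, each one runs only after its parents, which is the property a DAG-based framework guarantees. A cycle between tasks would make `TopologicalSorter` raise `graphlib.CycleError`.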

Design Principles

SAYN aims to empower data engineers and analysts through its three core design principles:

  • Simplicity: data processes should be easy to create, scale and maintain, so your team can focus on data transformation instead of writing processes. SAYN orchestrates all your tasks systematically and provides many automation features.
  • Flexibility: the power of data is unlimited, and your tooling should be too. SAYN supports both SQL and Python so your analysts can choose the best solution for each process.
  • Centralisation: all analytics code should live in one place, making your life easier and allowing dependencies to be defined across the whole analytics process.

Quick Start

SAYN supports Python 3.7 to 3.10.

pip install sayn
sayn init test_sayn
cd test_sayn
sayn run

That's it! You have completed your first SAYN run on the example project. Continue with Tutorial: Part 1, which gives a good overview of what SAYN can do.
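
Because SAYN supports only Python 3.7 to 3.10, it can be worth checking the interpreter before running `pip install sayn`. The snippet below is a small sketch for that check; the helper name `is_supported` and the constants are illustrative and are not part of SAYN.

```python
import sys

# Supported range as stated in this README (inclusive on both ends).
SUPPORTED_MIN = (3, 7)
SUPPORTED_MAX = (3, 10)


def is_supported(version=None):
    """Return True if the (major, minor) version is within the supported range.

    `version` defaults to the running interpreter; any tuple-like value such
    as (3, 9) or (3, 10, 4) can be passed instead.
    """
    major_minor = tuple((version or sys.version_info)[:2])
    return SUPPORTED_MIN <= major_minor <= SUPPORTED_MAX


if not is_supported():
    print(
        "Python %d.%d is outside the 3.7 to 3.10 range listed for SAYN."
        % tuple(sys.version_info[:2])
    )
```

If the check fails, creating a virtual environment with a supported interpreter before installing is the usual workaround.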

Release Updates

If you want to receive update emails about SAYN releases, you can sign up here.

Support

If you need any help with SAYN, or simply want to know more, please contact the team at [email protected].

License

SAYN is open source under the Apache 2.0 license.


Made with ❤️ by 173tech.
