top of page


Bujar Bakiu
Oct 14, 20225 min read
Dockerizing dbt Transformations for Managed Airflow: Docker, dbt, and GCP Cloud Composer
Airflow is one of the most popular pipeline orchestration tools out there. It has been around for more than 8 years, and it is used...


Kejdi Tako
Sep 14, 20223 min read
Distributed Machine Learning Model Training with Spark (PySpark)
GitHub repo: https://github.com/data-max-hq/pyspark-3-ways What is Spark? Apache Spark was designed to function as a simple API for...


Igli
Aug 24, 20224 min read
Deploy Airflow and Metabase in Kubernetes using Infrastructure-as-Code
A step-by-step guide to deploying Airflow and Metabase in GCP with Terraform and Helm providers. With the extensive usage of cloud...
bottom of page