Be Our Guest
Interested in being a guest on The Data Flowcast? Fill out the form and we will be in touch.
Data orchestration is revolutionizing the way companies manage and process data. In this episode, we explore the critical role of data orchestration in modern data workflows and how Apache Airflow is used to enhance data processing and AI model deployment.
Hannan Kravitz, Data Engineering Team Leader at Artlist, joins us to share his insights on leveraging Airflow for data engineering and its impact on their business operations.
Key Takeaways:
(01:00) Hannan introduces Artlist and its mission to empower content creators.
(04:27) The importance of collecting and modeling data to support business insights.
(06:40) Using Airflow to connect multiple data sources and create dashboards.
(09:40) Implementing a monitoring DAG for proactive alerts within Airflow.
(12:31) Customizing Airflow to monitor business KPIs and set alert thresholds.
(15:00) Using proactive alerts to catch drops in purchases caused by technical issues.
(17:45) Customizing data quality checks with dynamic task mapping in Airflow.
(20:00) Desired improvements in Airflow UI and logging capabilities.
(21:00) Enabling business stakeholders to change thresholds using Streamlit.
(22:26) Future improvements desired in the Airflow project.