Astronomer's The Data Flowcast

Why Airflow Became the Scheduling Backbone at Condé Nast Technology Lab with Arun Karthik

Data platforms are moving from batch-first pipelines to near real-time systems where orchestration, observability, scalability, and governance all have to work together.

In this episode, Arun Karthik, Director of Data Solutions Engineering at Condé Nast Technology Lab, joins us to share how data engineering has evolved from relational databases and ETL to distributed processing, modern orchestration with Apache Airflow, and managed Airflow with Astronomer.

Key Takeaways:

  • 00:00 Introduction.
  • 02:13 Early data systems rely heavily on relational databases and batch-oriented processing models.
  • 07:01 Scheduling requirements evolve beyond fixed time windows as dependencies increase.
  • 10:14 Ease of use and developer experience influence adoption of orchestration frameworks.
  • 13:22 Operating open source orchestration tools requires ongoing engineering effort.
  • 14:45 Managed services help teams reduce infrastructure and maintenance responsibilities.
  • 17:27 Observability improves confidence in pipeline execution and system health.
  • 19:12 Governance considerations grow in importance as data platforms mature.
  • 20:46 Building data systems requires balancing speed, reliability and long-term sustainability.

Thanks for listening to “The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.

Be Our Guest

Interested in being a guest on The Data Flowcast? Fill out the form and we will be in touch.

Build, run, & observe your data workflows.
All in one place.

Try Astro today and get up to $20 in free credits during your 14-day trial.