Astronomer's the Dataflow Cast

Inside Vinted’s Code-Generated Airflow Pipelines with Oscar Ligthart and Rodrigo Loredo

The shift from monolithic to decentralized data workflows changes how teams build, connect and scale pipelines.

In this episode, we feature Oscar Ligthart, Lead Data Engineer, and Rodrigo Loredo, Lead Analytics Engineer, both at Vinted, as we unpack their YAML-driven abstraction that generates Airflow DAGs and standardizes cross-team orchestration.

Key Takeaways:

  • 00:00 Introduction.
  • 05:28 Challenges of decentralization.
  • 06:45 YAML-based generator standardizes pipelines and dependencies.
  • 12:28 Declarative assets and sensors align cross-DAG dependencies.
  • 17:29 Task-level callbacks enable auto-recovery and clear ownership.
  • 21:39 Standardized building blocks simplify upgrades and maintenance.
  • 24:52 Platform focus frees domain work.
  • 26:49 Container-only standardization prevents sprawl.

Resources Mentioned:

Thanks for listening to “The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.

Be Our Guest

Interested in being a guest on The Data Flowcast? Fill out the form and we will be in touch.

Build, run, & observe your data workflows.
All in one place.

Build, run, & observe
your data workflows.
All in one place.

Try Astro today and get up to $20 in free credits during your 14-day trial.