Data Lineage with OpenLineage and Airflow

WATCH ON DEMAND

Summary:

If one out of your hundreds of DAGs fails, how do you know which downstream datasets have become out-of-date? The answer is data lineage. Data lineage is the complex set of relationships between your jobs and datasets. In this webinar, you'll learn how to use OpenLineage to collect lineage metadata from Airflow and assemble a lineage graph - a picture of your pipeline worth way more than a thousand words.

Key Takeaways

  • The Purpose and Use of Data Lineage
  • OpenLineage Core Concepts
  • Using OpenLineage with Airflow

Missed the Webinar? Sign up for the recap.

Hosted By

Julien Le Dem

Julien Le Dem

OpenLineage Project Lead

Raised in Normandy, France, Julien previously worked at Yahoo, Twitter, Dremio, and WeWork. His dream car is a 1982 DeLorean and his favorite movie character is Inigo Montoya (for his high motivation, focus, great heart and strong ethics).

Willy Lulciuc

Willy Lulciuc

Marquez Project Lead

Born in Constanta, Romania, Willy previously worked at WeWork, BounceX!, and Canary.is. He enjoys biking and roasting coffee beans to perfection, but still misses recess and loves pizza.

Viraj Parekh

Viraj Parekh

Field CTO @ Astronomer