NEW WEBINAR: 4 Things To Consider When Deciding How to Run Airflow|Register Today →
globe

Cross-Platform Data Lineage with OpenLineage

When: June 13, 11:50 — 12:30 (CET) at Berlin Buzzwords 2022

Hosted By

  • Julien Le Dem Julien Le Dem Chief Architect
    Astronomer

Summary

<blockquote> <em> Berlin Buzzwords is a conference on storing, processing, streaming and searching large amounts of digital data, with a focus on open source software projects. </em> </blockquote> <br>

OpenLineage provides a standard for lineage collection that spans multiple platforms, including Apache Airflow, Apache Spark, Flink, and dbt. This empowers teams to diagnose and address widespread data quality and efficiency issues in real time.

In this session, Julien Le Dem, our Chief Architect, will show how to trace data lineage across Apache Spark and Apache Airflow. He will walk through the OpenLineage architecture and provide a live demo of a running pipeline with real-time data lineage.

RSVP Today

By proceeding you agree to our Privacy Policy , our Website Terms and to receive emails from Astronomer.