About Amazon EMR

Amazon EMR is a managed cluster platform for running and scaling big data workloads with open source frameworks such as Apache Spark, Hive, and Presto. Use Amazon EMR to run compute-intensive Astro tasks that process petabytes of data for analytics, data processing, and machine learning.

Use Case

A common use case for orchestrating Amazon EMR jobs with Astro is gaining insights from large amounts of data using distributed machine learning. Astro supports asynchronous EMR modules, which can lower orchestration and processing costs when you run jobs on big data.
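
To illustrate why asynchronous waiting can reduce orchestration cost, here is a minimal Python sketch. It uses only the standard library, not the actual Astro or Airflow EMR modules. The names `wait_for_step` and `make_fake_step`, the canned state sequences, and the polling interval are all hypothetical and chosen for illustration. The terminal states follow the step states Amazon EMR reports. The idea shown is that a single event loop can wait on several long-running jobs at once instead of dedicating a blocked worker to each.

```python
import asyncio
from typing import Awaitable, Callable

# Terminal states that Amazon EMR reports for a step.
TERMINAL_STATES = {"COMPLETED", "FAILED", "CANCELLED", "INTERRUPTED"}


async def wait_for_step(
    get_state: Callable[[], Awaitable[str]],
    poll_interval: float = 30.0,
) -> str:
    """Poll until the step reaches a terminal state, yielding between polls."""
    while True:
        state = await get_state()
        if state in TERMINAL_STATES:
            return state
        # An asynchronous sleep hands control back to the event loop,
        # so other pending waits can make progress in the meantime.
        await asyncio.sleep(poll_interval)


def make_fake_step(states: list[str]) -> Callable[[], Awaitable[str]]:
    """Hypothetical stand-in for an EMR status call; replays canned states."""
    remaining = iter(states)

    async def get_state() -> str:
        return next(remaining)

    return get_state


async def main() -> list[str]:
    # Two simulated steps are awaited concurrently on one event loop.
    step_a = make_fake_step(["PENDING", "RUNNING", "COMPLETED"])
    step_b = make_fake_step(["PENDING", "FAILED"])
    return await asyncio.gather(
        wait_for_step(step_a, poll_interval=0.01),
        wait_for_step(step_b, poll_interval=0.01),
    )


if __name__ == "__main__":
    print(asyncio.run(main()))
```

In a real deployment the status call would query Amazon EMR rather than replay canned states, and the polling interval would be much longer than the value used in this sketch.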