+

Apache Airflow<sup>®</sup> Logo

About Google Dataproc

Google Dataproc is a highly scalable service to run Apache Spark, Apache Flink, Presto, and many more open source tools fully integrated in Google Cloud. Use Google Dataproc to run your compute-intensive Astro tasks handling large amounts of data for data science and ETL processes.


Use Case

Gaining insights from large amounts of data using distributed machine learning is a common use case for orchestrating jobs in Google Dataproc using Astro. Astro offers specialized operators to effortlessly leverage async processes when interacting with Google Dataproc, making your pipeline more cost-effective.

Get started free.

OR

API Access
Alerting
SAML-Based SSO
Airflow AI Assistant
Deployment Rollbacks
Audit Logging

By proceeding you agree to our Privacy Policy, our Website Terms and to receive emails from Astronomer.