Astro Runtime architecture
Astro Runtime is a production-ready data orchestration tool based on Apache Airflow that is distributed as a Docker image. It is intended to provide organizations with improved functionality, reliability, efficiency, and performance.
Astro Runtime includes the following features:
- Timely support for new patch, minor, and major versions of Apache Airflow. This includes bug fixes that have not been released by the open source project but are backported to Astro Runtime and available to users earlier.
- The astronomer-providers package. This package is an open source collection of Apache Airflow providers and modules maintained by Astronomer. It includes deferrable versions of popular operators such as ExternalTaskSensor, DatabricksRunNowOperator, and SnowflakeOperator. See Astronomer deferrable operators.
- The openlineage-airflow package. OpenLineage standardizes the definition of data lineage, the metadata that makes up lineage, and how lineage metadata is collected from external systems. See OpenLineage and Airflow.
- A custom security manager that enforces user roles and permissions as defined by Astronomer. See Manage user permissions on Astronomer Software.
For more information about the features that are available in Astro Runtime releases, see the Astro Runtime release notes.
Runtime versioning
Astro Runtime versions are released regularly and use semantic versioning. Astronomer ships major, minor, and patch releases of Astro Runtime in the format major.minor.patch.
- Major versions are released for significant feature additions. This includes new major or minor versions of Apache Airflow as well as API or DAG specification changes that are not backwards-compatible.
- Minor versions are released for functional changes. This includes API or DAG specification changes that are backwards-compatible.
- Patch versions are released for bug and security fixes that resolve unwanted behavior. This includes new patch versions of Apache Airflow, astronomer-providers, and openlineage-airflow.
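The versioning rules above can be illustrated with a small Python sketch that parses two version strings in major.minor.patch format and reports which component changed. The helper names (parse_version, release_type) are hypothetical and are not part of any Astronomer tooling.

```python
from typing import NamedTuple


class RuntimeVersion(NamedTuple):
    """Integer components of a major.minor.patch version string."""
    major: int
    minor: int
    patch: int


def parse_version(text: str) -> RuntimeVersion:
    """Parse a 'major.minor.patch' string (hypothetical helper)."""
    major, minor, patch = (int(part) for part in text.split("."))
    return RuntimeVersion(major, minor, patch)


def release_type(old: str, new: str) -> str:
    """Return the most significant component that differs between two versions."""
    a, b = parse_version(old), parse_version(new)
    if b.major != a.major:
        return "major"
    if b.minor != a.minor:
        return "minor"
    if b.patch != a.patch:
        return "patch"
    return "none"
```

For example, release_type("6.0.3", "6.0.4") reports a patch release, which under the rules above should contain only bug and security fixes.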
Every version of Astro Runtime corresponds to an Apache Airflow version. Each Deployment on Astronomer Software runs exactly one version of Astro Runtime, but you can run different versions of Astro Runtime on different Deployments within a given cluster or Workspace. See Create a Deployment.
For a list of supported Astro Runtime versions and more information on the Astro Runtime maintenance policy, see Astro Runtime versioning and lifecycle policy.
Astro Runtime and Apache Airflow parity
This table lists Astro Runtime releases and their associated Apache Airflow versions.
Astro Runtime | Apache Airflow version |
---|---|
4 | 2.2 |
5 | 2.3 |
6 | 2.4 |
7 | 2.5 |
8 | 2.6 |
Each Runtime version in a given minor series supports only a single version of Apache Airflow. For specific version compatibility information, see Runtime release notes.
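As a minimal sketch, the parity table can be encoded as a lookup keyed on the Runtime major version. The dictionary copies the table above; the function name airflow_series is an assumption made for illustration only.

```python
# Runtime major version -> Apache Airflow minor series, copied from the parity table.
AIRFLOW_BY_RUNTIME_MAJOR = {4: "2.2", 5: "2.3", 6: "2.4", 7: "2.5", 8: "2.6"}


def airflow_series(runtime_version: str) -> str:
    """Return the Airflow series for a Runtime version such as '6.0.4' (hypothetical helper)."""
    major = int(runtime_version.split(".")[0])
    try:
        return AIRFLOW_BY_RUNTIME_MAJOR[major]
    except KeyError:
        raise ValueError(
            f"Runtime major version {major} is not in the parity table"
        ) from None
```

The lookup only resolves the Airflow series. The exact Airflow patch version for a given Runtime release still has to come from the release notes.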
Provider packages
The latest version of Astro Runtime has the following open source provider packages pre-installed:
- Amazon: apache-airflow-providers-amazon
- Astronomer Providers: astronomer-providers
- Astro Python SDK: astro-sdk-python
- Elasticsearch: apache-airflow-providers-elasticsearch
- Celery: apache-airflow-providers-celery
- Google: apache-airflow-providers-google
- HTTP: apache-airflow-providers-http
- Cloud Native Computing Foundation (CNCF) Kubernetes: apache-airflow-providers-cncf-kubernetes
- PostgreSQL (Postgres): apache-airflow-providers-postgres
- Redis: apache-airflow-providers-redis
- Snowflake: apache-airflow-providers-snowflake
- OpenLineage with Airflow: openlineage-airflow
- Microsoft Azure: apache-airflow-providers-microsoft-azure
Provider package versioning
If an Astro Runtime release includes changes to an installed version of a provider package that is maintained by Astronomer (astronomer-providers or openlineage-airflow), the version change is documented in the Astro Runtime release notes.
To determine the version of any provider package installed in your current Astro Runtime image, run:
docker run --rm {image} pip freeze | grep <provider>
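If you capture the full pip freeze output instead of piping it through grep, a short Python sketch like the following can pull out the matching packages and their versions. The function name provider_versions is hypothetical, and the version numbers in the sample are placeholders, not real release versions.

```python
def provider_versions(freeze_output: str, needle: str) -> dict[str, str]:
    """Return {package: version} for pinned lines whose package name contains `needle`."""
    found = {}
    for line in freeze_output.splitlines():
        # pip freeze pins packages as "name==version"; other line shapes are skipped.
        name, sep, version = line.strip().partition("==")
        if sep and needle.lower() in name.lower():
            found[name] = version
    return found


# Placeholder output for illustration only; 0.0.0 is not a real version.
sample = """\
apache-airflow-providers-http==0.0.0
astronomer-providers==0.0.0
requests==0.0.0
"""

matches = provider_versions(sample, "providers")
```

Here matches contains the two provider packages and omits requests, which mirrors what grep providers would show.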
Python versioning
Astro Runtime supports only Python 3.9. If your data pipelines require a different Python version, Astronomer recommends that you use the KubernetesPodOperator. See Run the KubernetesPodOperator on Astronomer Software.
Executors
In Airflow, the executor is responsible for determining how and where a task is completed.
In all local environments created with the Astro CLI, Astro Runtime runs the Local executor. On Software Deployments, Astro Runtime is compatible with the Celery executor and the Kubernetes executor.
Soon, Astronomer will provide a new executor with intelligent worker packing, task-level resource requests, improved logging, and Kubernetes-like task isolation.
Distribution
Astro Runtime is distributed as a Debian-based Docker image. Runtime Docker images have the following format:
quay.io/astronomer/astro-runtime:<version>
quay.io/astronomer/astro-runtime:<version>-base
An Astro Runtime image must be specified in the Dockerfile of your Astro project. Astronomer recommends using non-base images, which incorporate ONBUILD commands that copy and scaffold your Astro project directory so you can more easily pass those files to the containers running each core Airflow component. A base Astro Runtime image is recommended for complex use cases that require additional customization, such as installing Python packages from private sources.
For a list of all Astro Runtime Docker images, see Quay.io.
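The two image formats above differ only in the -base suffix. A minimal Python sketch that builds the image reference, and the FROM line a Dockerfile would use to specify it, might look like this. Both function names are hypothetical.

```python
def runtime_image(version: str, base: bool = False) -> str:
    """Build an Astro Runtime image reference in the format shown above."""
    suffix = "-base" if base else ""
    return f"quay.io/astronomer/astro-runtime:{version}{suffix}"


def dockerfile_from_line(version: str, base: bool = False) -> str:
    """Return the FROM instruction that selects the image in a Dockerfile."""
    return f"FROM {runtime_image(version, base)}"
```

Calling dockerfile_from_line("6.0.4") yields a line that selects the non-base image, while passing base=True selects the base variant for projects that need the extra customization described above.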
System distribution
The following table lists the operating systems and architectures supported by each Astro Runtime version. If you're using a Mac computer with an M1 chip, Astronomer recommends using Astro Runtime 6.0.4 or later.
Astro Runtime | Operating System (OS) | Architecture |
---|---|---|
4 | Debian 11.3 (bullseye) | AMD64 |
5 | Debian 11.3 (bullseye) | AMD64 |
6 | Debian 11.3 (bullseye) | AMD64 and ARM64 |
7 | Debian 11.3 (bullseye) | AMD64 and ARM64 |
8 | Debian 11.3 (bullseye) | AMD64 and ARM64 |
Astro Runtime 6.0.4 and later images are multi-arch and support AMD64 and ARM64 processor architectures for local development. Docker automatically uses the correct processor architecture based on the computer you are using.
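The architecture rule can be sketched as a version comparison. This example assumes, based on the sentence above, that Runtime versions earlier than 6.0.4 are AMD64 only. It expects a full major.minor.patch string, and the function name is hypothetical.

```python
# First multi-arch release, as stated in the text above.
MULTI_ARCH_SINCE = (6, 0, 4)


def supported_architectures(runtime_version: str) -> list[str]:
    """Return the architectures assumed to be supported by a full Runtime version."""
    parts = tuple(int(p) for p in runtime_version.split("."))
    # Tuples compare component by component, so (6, 0, 3) < (6, 0, 4) < (7, 0, 0).
    if parts >= MULTI_ARCH_SINCE:
        return ["AMD64", "ARM64"]
    return ["AMD64"]
```

A check like this could be used to warn before pulling a pre-6.0.4 image on an ARM64 machine such as an M1 Mac.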