In an AI world, it’s the workflow that allows you to build your moat
While LM Arena leaderboards grab headlines, an infrastructure battle is happening in the orchestration systems that transform AI from lab experiments into production solutions that solve real-world problems.
This is because in this new phase of AI-driven analytics and applications, it’s not the raw ingredients that differentiate products; it’s how those ingredients are assembled. Competitive advantage is shifting from increasingly abundant “raw materials” like code, algorithms, and even data to the orchestration workflows that turn them into coherent and useful solutions.
From Database Abstractions to AI Abstractions
To understand where we're heading, we need to recognize where we’ve been. For the past three decades, most successful applications were essentially sophisticated abstraction layers built on top of two fundamental technologies: databases and search. Whether it was your CRM, your e-commerce platform, or your content management system, scratch beneath the surface and you'd find database queries and search indices doing the heavy lifting on top of rows and columns stuffed with business data.
The winners of the digital transformation era weren't even necessarily those with the best databases or the fastest search engines. They were those who built the most compelling abstractions on top of these primitives to uniquely solve meaningful challenges and add value for their users.
Salesforce didn't revolutionize databases. It created workflows for managing customer relationships. ServiceNow didn't invent ticketing systems. It abstracted IT operations into manageable workflows. The pattern was consistent: take the primitives of databases and search, wrap them in domain-specific workflows, and deliver business value.
Now we're witnessing the next evolution: the shift from digital transformation to agentic transformation.
In this new era, software isn’t just reading from a database or writing to an index. It’s “thinking”, deciding, and acting in tight feedback loops. LLMs generate options, retrieval systems inject fresh context, vector stores need to be continuously updated with the latest data, and evaluators need to grade every output in real time. The raw ingredients (models, embeddings, APIs) keep getting cheaper and better. What doesn’t commoditize is the choreography that binds those pieces into dependable, repeatable outcomes.
We are fast approaching a reality where “a feature” is really:
- A prompt, generated by a human or business process, that expresses intent
- A planner that breaks high-level intents into sub-tasks
- A tool-calling runner that juggles APIs, databases, and humans
- A critic that scores results and kicks failures back through the loop
- A watchdog that spots drift in the prompt or the data
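The planner, runner, and critic above can be sketched in plain Python. Everything here is a hypothetical stand-in, not a real framework; in practice the planner and critic would themselves be LLM calls:

```python
# A minimal, framework-free sketch of the agentic feature loop described
# above. All functions are hypothetical stand-ins for LLM and tool calls.

def plan(intent: str) -> list[str]:
    # A planner breaks a high-level intent into sub-tasks.
    return [f"{intent}: step {i}" for i in range(1, 3)]

def run(subtask: str) -> str:
    # A tool-calling runner would juggle APIs, databases, and humans here.
    return f"result of ({subtask})"

def critique(result: str) -> bool:
    # A critic scores each result; failures get kicked back through the loop.
    return "result" in result

def execute(intent: str, max_retries: int = 2) -> list[str]:
    results = []
    for subtask in plan(intent):
        for _attempt in range(max_retries + 1):
            result = run(subtask)
            if critique(result):
                results.append(result)
                break
        else:
            raise RuntimeError(f"subtask failed after retries: {subtask}")
    return results

print(execute("summarize user reviews"))
```

The point of the sketch is that none of the intelligence lives in any single call; the value is in the retry loop, the gating critic, and the decomposition, i.e. the choreography.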
In other words, the competitive frontier has shifted from “What model are you using?” to “How well do your agents cooperate under load, with guardrails and retries?” and “How well do your agents interact with well-structured proprietary data or meet real-world SLAs?”
And in this new paradigm, success depends as much on how you chain models, tools, and data together as on the LLMs themselves.
Every AI company is an orchestration company in disguise
We can already see this playing out. Clay sells "AI research agents" for GTM, Writer is now an "end-to-end agent builder platform," n8n markets "multi-step AI agents," and Zapier now lets customers scale AI agents with "the most connected AI orchestration platform."
Strip away the positioning, and they're all building the same foundational architecture: platforms that chain AI capabilities into multi-step workflows with integrations, conditional logic, and monitoring.
Orchestration is becoming the universal primitive of AI applications. Just as every app eventually needed a database, every AI application eventually needs workflow orchestration. The companies that build the most reliable, user-friendly orchestration layer, whether they call it "agents," "workflows," or "automation", will capture disproportionate value as AI capabilities themselves commoditize.
The race isn't just to own use cases before the market realizes the convergence. It's to build the orchestration infrastructure that enables a product so good that switching costs become prohibitive, even when competitors offer similar functionality.
Why code-first orchestration wins
This convergence toward building abstractions on top of orchestration platforms raises a critical question: if what everyone is building rhymes, what determines who wins? The answer lies in a fundamental architectural decision: whether to build on code-first, no-code, or somewhere in between.
When you build on a code-first orchestration platform like Apache Airflow®, you inherit:
- Infinite extensibility: Anything that can be declaratively expressed in code can be dynamically executed as event-driven or on-demand data pipelines, with all dependencies maintained. Airflow even allows you to create pipelines that adapt to your data at runtime.
- Ecosystem leverage: Instant integration with any AI or data tool through API calls, plus a wide array of pre-built operators (1,000+) that streamline integration with the most popular AI solutions
- Operational maturity: Battle-tested patterns for monitoring, alerting, and debugging
- Future-proof architecture: New capabilities and technologies can be added without architectural rewrites
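The “pipelines that adapt to your data at runtime” point refers to Airflow’s dynamic task mapping, where the number of task instances is decided at execution time rather than authoring time. A minimal sketch, assuming Airflow 2.3+ (the document names and DAG id are placeholders):

```python
# A sketch of a pipeline that adapts to its data at runtime using
# Airflow's dynamic task mapping. Document names are hypothetical.
from pendulum import datetime

from airflow.decorators import dag, task


@dag(start_date=datetime(2024, 1, 1), schedule=None, catchup=False)
def embed_new_documents():
    @task
    def fetch_documents() -> list[str]:
        # Returns a list whose length is only known at runtime.
        return ["doc_a.txt", "doc_b.txt"]

    @task
    def embed(path: str) -> str:
        # One mapped task instance is created per document.
        return f"embedded {path}"

    # .expand() fans out one task instance per element, at runtime.
    embed.expand(path=fetch_documents())


embed_new_documents()
```

If tomorrow’s run fetches 2,000 documents instead of two, the same DAG fans out 2,000 mapped tasks with no code change, which is the kind of adaptivity that is hard to express in a static no-code canvas.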
This is why so many AI platforms are quietly building their orchestration layers on Airflow. They understand that orchestration isn't just a feature, it's the foundation everything else builds upon. And those who build on infrastructure that can easily evolve with their users' needs will be far better off.
We’re obviously a bit biased, so don’t just take it from us. Here are some of the initial responses to a new Airflow course on generative AI.
The Democratization Moment with the Airflow AI SDK
Remember when wiring an LLM into a pipeline felt like cave diving through half-baked docs, juggling black-box wrappers, and praying your half-guessed JSON parameters would conjure the desired response?
Of course you do, it was probably last week.
The Airflow AI SDK turns “call a foundation model to summarize a user review, branch on sentiment, then stream the cleaned payload to my database” into less than 20 lines of Python:
```python
from airflow.sdk import dag, task, chain
from airflow_ai_sdk.models.base import BaseModel
from include.pii import redact_pii


class ReviewInsights(BaseModel):
    summary: str
    quotes: list[str]


@task.llm(
    model="gpt-4o-mini",
    system_prompt="Summarize the given review into a 1-2 sentence headline and extract quotes to support the summary",
    result_type=ReviewInsights,
)
def summarize(review_text: str) -> ReviewInsights:
    return redact_pii(review_text)


@task.llm_branch(
    model="gpt-4o-mini",
    system_prompt="Classify the review based on its sentiment.",
)
def route(review_text: dict) -> str:
    return review_text["summary"]
```
One decorator (@task.llm) gives you prompt management, retries, and typed, Pydantic-validated outputs; use @task.llm_branch or @task.agent when you need control-flow decisions or a tool-using agent, and Airflow handles the rest: logging, observability, SLAs, the works.
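For completeness, here is one possible way to wire those decorated tasks into a DAG. This is a hedged sketch, not the SDK’s canonical pattern: it assumes `@task.llm_branch` routes to whichever downstream task matches the model’s sentiment classification, and the task names and review source are hypothetical:

```python
# Hypothetical wiring of the summarize and route tasks into a daily DAG.
# Assumes @task.llm_branch selects the downstream task matching the
# model's classification; handler names and data sources are made up.
from airflow.sdk import dag, task


@dag(schedule="@daily")
def review_pipeline():
    @task
    def get_review() -> str:
        # In practice this would pull new reviews from your database.
        return "Loved the product, though support replies were slow."

    @task
    def handle_positive(insights: dict) -> None:
        ...  # e.g. stream the cleaned payload to an analytics table

    @task
    def handle_negative(insights: dict) -> None:
        ...  # e.g. open a support follow-up ticket

    insights = summarize(get_review())
    route(insights) >> [handle_positive(insights), handle_negative(insights)]


review_pipeline()
```

Because this is just a DAG, everything Airflow already does for ordinary tasks — scheduling, retries, logging, backfills — applies to the LLM steps for free.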
The payoff is two-fold:
- On-ramp for the 80%: Data and ML engineers can drop an LLM step into an existing DAG without learning a new framework.
- Headroom for the 20%: Because it’s still pure Python, you can version-control prompts, inject custom tools, or swap models as the use-case matures. No “graduating” to a new platform later.
In short, the Airflow AI SDK lets you start with surgical, low-risk LLM workflows and scale to full agent architectures when the ROI is there.
The Strategic Implications
If orchestration is the thing that allows you to build your moat, the Airflow AI SDK is the accelerant. By standardizing LLM calls as first-class Airflow tasks, it gives you:
- Portability: DAGs are Python. Move from local development to your VPC or to fully managed SaaS with zero rewrites.
- Composability: Mix @task.llm with dbt runs, Spark jobs, and more in the same graph
- Observability by default: Every prompt, token count, and tool call is logged alongside the rest of the pipeline; no third-party “LLM monitoring” tax to pay.
- Guardrails: Typed outputs, cost limits, and automatic retries keep these workflows boring in production, which is the highest compliment.
- Community gravity: Operators and examples land in one place, so best practices spread faster than any closed agent builder could hope for.
Locking your AI workflows into a proprietary low-code studio means betting your core “graph” on somebody else’s roadmap and pricing model. Building atop open, code-first Airflow + the AI SDK flips that power dynamic: you own the graph, you swap models at will, and you’re never more than a pip install airflow-ai-sdk away from the next frontier.
Nearly every AI infrastructure vendor will tell you they’re “the operating system for agents.” However, the durable edge will belong to teams whose workflows are open, inspectable, and impossible to outgrow.
Ready to get started? Check out the Airflow AI SDK or register for the new Orchestrating Workflows for GenAI Applications course on DeepLearning.AI.