Processing User Feedback: an LLM-fine-tuning reference architecture with Ray on Anyscale
Processing User Feedback: an LLM-fine-tuning reference architecture with Ray on Anyscale
Processing User Feedback: an LLM-fine-tuning reference architecture with Ray on Anyscale
Info
This page has not yet been updated for Airflow 3. The concepts shown are relevant, but some code may need to be updated. If you run any examples, take care to update import statements and watch for any other breaking changes.
The Processing User Feedback GitHub repository is a free and open-source reference architecture showing how to use Apache Airflow® with Anyscale, a distributed compute platform built on Ray, to build an automated system that processes and categorizes user feedback relating to video games using a fine-tuned Large Language Model (LLM). The repository includes full source code, documentation, and deployment instructions for you to adapt and implement this architecture in your own projects.

This reference architecture serves as a practical learning tool, illustrating how to use Apache Airflow to orchestrate fine-tuning of LLMs on the Anyscale platform. The Processing User Feedback application is designed to be adaptable, allowing you to tailor it to your specific use case. You can customize the workflow by:
By providing a flexible framework, this architecture enables developers and data scientists to implement and scale their own LLM-based feedback processing systems using distributed compute.
Note
This tutorial uses Anyscale with the Anyscale provider to run Ray jobs. If you want to run Ray jobs on other platforms, you can use the Ray provider instead. See also Orchestrate Ray jobs on Anyscale with Apache Airflow®.

The Processing User Feedback use case consists of 2 main components:
Additionally, the architecture includes an advanced champion-challenger version of the fine-tuning process.
The DAGs in this reference architecture highlight several key Airflow features and best practices:
Get the Astronomer GenAI cookbook to view more examples of how to use Airflow to build generative AI applications.
If you’d like to build your own pipeline using Anyscale with Airflow, feel free to fork the repository and adapt it to your use case. We recommend deploying the Airflow pipelines using a free trial of Astro.