Modular RAG: Transforming RAG Systems into LEGO-like Reconfigurable
  Frameworks

Yunfan Gao; Yun Xiong; Meng Wang; Haofen Wang

doi:10.48550/arxiv.2407.21059

Preprint 2024 English

Abstract

Retrieval-augmented Generation (RAG) has markedly enhanced the capabilities of Large Language Models (LLMs) in tackling knowledge-intensive tasks. The increasing demands of application scenarios have driven the evolution of RAG, leading to the integration of advanced retrievers, LLMs and other complementary technologies, which in turn has amplified the intricacy of RAG systems. However, the rapid advancements are outpacing the foundational RAG paradigm, with many methods struggling to be unified under the process of "retrieve-then-generate". In this context, this paper examines the limitations of the existing RAG paradigm and introduces the modular RAG framework. By decomposing complex RAG systems into independent modules and specialized operators, it facilitates a highly reconfigurable framework. Modular RAG transcends the traditional linear architecture, embracing a more advanced design that integrates routing, scheduling, and fusion mechanisms. Drawing on extensive research, this paper further identifies prevalent RAG patterns (linear, conditional, branching, and looping) and offers a comprehensive analysis of their respective implementation nuances. Modular RAG presents innovative opportunities for the conceptualization and deployment of RAG systems. Finally, the paper explores the potential emergence of new operators and paradigms, establishing a solid theoretical foundation and a practical roadmap for the continued evolution and practical deployment of RAG technologies.
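
As a rough illustration of the four flow patterns named in the abstract, the sketch below treats each operator as a plain function over a state dictionary and each pattern as a combinator that wires operators together. It is not the paper's implementation: every name here (`State`, `Operator`, `retrieve`, `generate`, `linear`, `conditional`, `branching`, `looping`, and the toy `CORPUS`) is a hypothetical stand-in, and the keyword-match "retriever" and string-joining "generator" only mimic real components.

```python
from typing import Callable, Dict, List

# Assumption: an operator is any function that maps a state dict to a new state dict.
State = Dict[str, object]
Operator = Callable[[State], State]

# Toy corpus standing in for a real index (illustrative only).
CORPUS = {
    "rag": "RAG combines retrieval with generation.",
    "llm": "LLMs generate text from prompts.",
}

def retrieve(state: State) -> State:
    """Hypothetical retriever: keyword match against the toy corpus."""
    query = str(state["query"]).lower()
    docs = [text for key, text in CORPUS.items() if key in query]
    return {**state, "docs": docs}

def generate(state: State) -> State:
    """Hypothetical generator: joins retrieved text instead of calling an LLM."""
    docs = state.get("docs", [])
    answer = " ".join(docs) if docs else "no context"
    return {**state, "answer": answer}

def linear(*ops: Operator) -> Operator:
    """Linear pattern: run operators one after another."""
    def run(state: State) -> State:
        for op in ops:
            state = op(state)
        return state
    return run

def conditional(route: Callable[[State], str],
                branches: Dict[str, Operator]) -> Operator:
    """Conditional pattern: a routing function picks exactly one branch."""
    def run(state: State) -> State:
        return branches[route(state)](state)
    return run

def branching(ops: List[Operator],
              fuse: Callable[[List[State]], State]) -> Operator:
    """Branching pattern: run every branch on a copy of the state, then fuse."""
    def run(state: State) -> State:
        return fuse([op(dict(state)) for op in ops])
    return run

def looping(op: Operator, done: Callable[[State], bool],
            max_iters: int = 3) -> Operator:
    """Looping pattern: repeat an operator until a stop test passes."""
    def run(state: State) -> State:
        for _ in range(max_iters):
            state = op(state)
            if done(state):
                break
        return state
    return run

# Usage: route to retrieve-then-generate only when the query mentions "rag".
pipeline = conditional(
    route=lambda s: "retrieve" if "rag" in str(s["query"]).lower() else "direct",
    branches={"retrieve": linear(retrieve, generate), "direct": generate},
)
result = pipeline({"query": "What is RAG?"})
print(result["answer"])
```

Because every combinator returns another `Operator`, the patterns nest freely: a `looping` flow can sit inside one branch of a `conditional`, which is the kind of reconfiguration the modular view is meant to allow.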
