Retrieval-Augmented Generation (RAG) Systems: Architectures, Strategies, and Evaluation
DOI: https://doi.org/10.32996/jcsts.2024.6.5.30
Keywords: Retrieval-Augmented Generation, RAG, Embedding Models, Reranker Fine-Tuning, Semantic Retrieval, Agentic Architectures, Evaluation Metrics, Vector Databases, Query Routing, Dynamic Context
Abstract
Retrieval-Augmented Generation (RAG) systems change how large language models (LLMs) produce responses by coupling generative capabilities with dynamic retrieval from external knowledge bases. This integration improves the factual accuracy and contextual relevance of responses and reduces the potential for hallucination in generated content. In this paper, we present an extensive survey of RAG systems, covering theoretical underpinnings, retrieval strategies, agentic architectures, and the developer stack needed for system integration. We also detail advanced techniques for fine-tuning embedding and reranker models, and we establish evaluation metrics applicable to both the retrieval and the generation components. The paper synthesizes methods and best practices described in recent research articles [1]–[10], offering a roadmap for researchers and practitioners who want to design robust, efficient, and scalable RAG systems.
License
Copyright (c) 2024 https://creativecommons.org/licenses/by/4.0/

This work is licensed under a Creative Commons Attribution 4.0 International License.
