Retrieval-Augmented Generation (RAG) Systems: Architectures, Strategies, and Evaluation

Sunil Karthik Kota

doi:10.32996/jcsts.2024.6.5.30

Authors

Sunil Karthik Kota Engineering Leader, Software Architect, AI & Automation Expert at CISCO, USA

DOI:

https://doi.org/10.32996/jcsts.2024.6.5.30

Keywords:

Retrieval-Augmented Generation, RAG, Embedding Models, Reranker Fine-Tuning, Semantic Retrieval, Agentic Architectures, Evaluation Metrics, Vector Databases, Query Routing, Dynamic Context

Abstract

Retrieval-Augmented Generation (RAG) systems have revolutionized the way large language models (LLMs) synthesize responses by coupling generative capabilities with dynamic retrieval from external knowledge bases. This integration not only enhances the factual accuracy and contextual relevance of responses but also reduces the potential for hallucination in generated content. In this paper, we present an extensive survey of RAG systems, covering theoretical underpinnings, various retrieval strategies, agentic architectures, and the technical developer stack necessary for system integration. Additionally, we detail advanced techniques for fine-tuning embedding and reranker models and establish comprehensive evaluation metrics applicable to both retrieval and generation components. This document synthesizes methods and best practices described in recent research articles [1]–[10], offering a roadmap for researchers and practitioners to design robust, efficient, and scalable RAG systems.

Retrieval-Augmented Generation (RAG) Systems: Architectures, Strategies, and Evaluation

Authors

DOI:

Keywords:

Abstract

Downloads

Published

Issue

Section

License

How to Cite

rightbar

submission

menus

Notice: