Latest
Call for Papers: Vol. 42 closes 30 JuneNew: Quantum Security Summit registration openAxiom Standard 7042-2024 now ratifiedGrant cycle 2025 — $4.2M committedFellows election voting opens 15 JulyCall for Papers: Vol. 42 closes 30 JuneNew: Quantum Security Summit registration openAxiom Standard 7042-2024 now ratifiedGrant cycle 2025 — $4.2M committedFellows election voting opens 15 July
Digital Library

Research Archive

Search across 2.4 million peer-reviewed documents from journals, conferences, and standards.

Showing 2 of 2,418,902 results

Journal Article Open Access Natural Language Processing

Retrieval-Augmented Generation for Knowledge-Intensive Enterprise Tasks: Dense Retriever Design, Index Freshness Management, and Faithfulness Evaluation in Production RAG Systems

Retrieval-Augmented Generation (RAG) -- the combination of dense retrieval over a knowledge corpus with generative language model synthesis -- has emerged as the leading architectural pattern for grounding large language model outputs in verifiable factual sources, addressing the hallucination and knowledge staleness limitations of parametric LLM knowledge. Despite rapid practitioner adoption, the engineering design space of production RAG systems -- covering retriever architecture selection, embedding model choice, index freshness management, chunking strategy, context window utilization, and faithfulness evaluation -- lacks systematic empirical treatment. This paper presents the first comprehensive engineering study of production RAG systems, evaluating design decisions across 14 dimensions using a standardized enterprise question answering benchmark comprising 8,400 questions across legal, financial, and technical documentation corpora. We find that bi-encoder dense retrievers (DPR, E5-large) outperform BM25 sparse retrieval by 18.4 F1 points on complex multi-hop questions but underperform by 7.2 points on exact keyword lookup queries, motivating hybrid retrieval as the default architecture. Chunk size has the highest sensitivity of any single design parameter -- optimal chunk size varies by 4x across corpora depending on document structure. We introduce the RAG Faithfulness Score (RFS), a composite metric measuring citation accuracy, claim groundedness, and context utilization efficiency, and demonstrate its correlation with downstream user trust ratings (Pearson r=0.74). We release evaluation code, benchmark datasets, and optimal configuration templates for six enterprise RAG deployment profiles.

Obiora Chukwu, Maja Svensson, Yuki Matsumoto, Laila Benali· Jul 2022· 534 citations
Journal Article Open Access Cybersecurity

Policy as Code in DevOps: Automated Governance, Open Policy Agent Integration, and Compliance-as-Code Maturity in Cloud-Native Pipelines

As organizations scale their cloud-native DevOps operations, the manual enforcement of security, compliance, and operational policies becomes a significant bottleneck and audit risk. Policy as Code (PaC) — the expression of organizational policies in machine-readable, version-controlled formats that can be automatically evaluated within CI/CD pipelines — has emerged as a scalable alternative. This paper presents the first systematic academic treatment of Policy as Code in DevOps contexts, combining a systematic literature review, an evaluation of three leading PaC frameworks (Open Policy Agent/Rego, Kyverno, and AWS Config Rules), and an empirical study of four organizations that implemented PaC programs over 12–24 months. We define a Policy as Code Taxonomy covering eight policy domains — identity and access, network security, data classification, resource configuration, software supply chain, cost governance, operational thresholds, and regulatory mapping — and evaluate framework suitability across domains. Organizations with mature PaC implementations achieve 94% automated policy coverage (vs 41% baseline), reduce policy violation escape rate to production by 87%, and report audit preparation time reductions of 65%. We introduce the Policy Coverage Efficiency Score (PCES) as a standardized measure of PaC program maturity and provide a PaC implementation roadmap with phase-specific toolchain recommendations.

Isioma Nwofor, Björn Andersson, Shunsuke Nakajima, Marta Alves· Jul 2022· 344 citations