Research Archive

Journal Article Open Access Autonomous Systems

Multi-Agent Reinforcement Learning for Cooperative Drone Swarm Coordination: Decentralized Policy Learning, Communication Protocol Design, and Emergent Collective Behavior in Search and Rescue Operations

Autonomous drone swarms operating in search and rescue environments must coordinate complex collective behaviors -- area coverage, target localization, survivor extraction, and obstacle avoidance -- in GPS-denied, communication-constrained, and dynamically hazardous conditions that preclude centralized coordination architectures. Multi-agent reinforcement learning (MARL) offers a pathway to decentralized cooperative policy learning that is robust to agent failure and communication disruption, but the engineering of MARL systems that produce reliable cooperative behavior at swarm scales of 10 to 100 agents in realistic environments remains an open challenge. This paper presents SwarmRL, a MARL framework for cooperative drone swarm coordination evaluated in three progressively realistic environments: 2D grid simulation, 3D physics simulation using AirSim, and a real-world outdoor evaluation with a 12-drone swarm. SwarmRL employs a shared policy architecture with agent-specific observation encoders, a learned communication protocol using Graph Attention Networks for neighbor-to-neighbor message passing, and a centralized training with decentralized execution (CTDE) approach using QMIX as the cooperative value decomposition baseline. In the simulated search and rescue benchmark, SwarmRL achieves 94.2 percent target discovery rate within 600 seconds for a 20-agent swarm, outperforming both handcrafted coverage algorithms (78.3 percent) and independent RL agents without communication (67.1 percent). The learned communication protocol transmits 89 percent less data than broadcast communication while maintaining equivalent coordination quality. Real-world evaluation with 12 drones achieves 87.4 percent target discovery rate, demonstrating meaningful sim-to-real transfer.

Kelechi Okafor, Astrid Holm, Yuki Nishimura, Salma El-Amin· Feb 2026· 44 citations

Journal Article Open Access Artificial Intelligence

Autonomous DevOps: Toward Self-Healing, Self-Optimizing Delivery Pipelines Through Multi-Agent AI Orchestration

Autonomous DevOps represents the frontier of pipeline automation: delivery systems capable not merely of executing predefined workflows but of reasoning about system state, diagnosing anomalies, proposing remediations, and implementing approved changes without human intervention. This paper presents the theoretical foundations, architectural design, and empirical evaluation of an Autonomous DevOps system (AutoDevOps-1) built on multi-agent large language model orchestration. AutoDevOps-1 comprises four cooperating agents — Pipeline Analyst, Remediation Planner, Change Executor, and Safety Auditor — coordinated through a shared context protocol and governed by a formal safety constraint language we term Pipeline Safety Assertions (PSA). We evaluate AutoDevOps-1 in a production-equivalent staging environment replicating the infrastructure of a large financial institution, executing 4,200 simulated incident scenarios over six months. The system autonomously resolved 71.4% of incidents within defined safety constraints, escalating the remainder to human operators with structured context packages that reduced human resolution time by 58%. We conduct adversarial testing against 14 prompt injection and manipulation attack vectors, achieving containment in 92.9% of cases. We discuss the organizational, ethical, and liability implications of autonomous pipeline management and propose a governance framework for responsible deployment. This work establishes the empirical and conceptual foundations for the next generation of DevOps automation.

Adaobi Eze, Felix Zimmermann, Keiko Arai, Mohamed Al-Farsi· Feb 2026· 89 citations

Multi-Agent Reinforcement Learning for Cooperative Drone Swarm Coordination: Decentralized Policy Learning, Communication Protocol Design, and Emergent Collective Behavior in Search and Rescue Operations

Autonomous DevOps: Toward Self-Healing, Self-Optimizing Delivery Pipelines Through Multi-Agent AI Orchestration

Registration Required