Research Archive

Journal Article Open Access Cloud Computing

Federated Learning in Distributed Cloud Environments: Communication Efficiency, Differential Privacy Guarantees, and Model Convergence Under Non-IID Data Distributions

Federated learning enables collaborative model training across distributed clients without centralizing raw data, making it particularly valuable in regulated industries where data sharing is legally constrained. However, the practical deployment of federated learning in production cloud environments exposes significant engineering challenges: communication overhead from model update aggregation, degraded convergence under non-IID (heterogeneous) data distributions, and the tension between differential privacy noise injection and model utility. This paper presents FedCloud, a production-oriented federated learning framework designed for multi-organization deployment across cloud environments, and reports its empirical evaluation across three deployment scenarios: cross-hospital clinical NLP, cross-bank fraud detection, and cross-retailer demand forecasting. FedCloud implements adaptive FedAvg with gradient compression (achieving 7.3x communication reduction), client selection strategies optimized for stragglers in cloud environments, and Renyi differential privacy with per-round epsilon tracking. Under non-IID distributions simulating real organizational data heterogeneity, FedCloud achieves within 2.8% of centralized training accuracy for the clinical NLP task and within 4.1% for fraud detection, while satisfying epsilon less than 3 per training round. We characterize the accuracy-privacy-communication three-way trade-off surface empirically and provide a configuration selection guide for practitioners deploying federated learning under specific regulatory and performance constraints.

Nkechi Eze, Lars Berggren, Akira Yamamoto, Nadia Khalil· Jul 2019· 441 citations

Journal Article Subscription Cloud Computing

Serverless Computing in DevOps Pipelines: Performance Trade-offs, Cold Start Mitigation, and Organizational Adoption Patterns

Serverless computing platforms have disrupted traditional DevOps deployment models by abstracting infrastructure management entirely, yet their integration into continuous delivery pipelines introduces novel performance, cost, and observability challenges. This paper presents a comprehensive empirical evaluation of serverless-native DevOps workflows, combining platform benchmarking experiments with a multi-organization case study involving six companies that migrated production workloads to serverless architectures. Our benchmarks quantify cold start latency distributions across AWS Lambda, Azure Functions, and Google Cloud Functions under varying memory configurations, runtime environments, and invocation patterns. We find that cold starts impose median latency penalties of 180–740 ms depending on runtime and configuration, with Java and .NET runtimes exhibiting the highest variance. We introduce four cold start mitigation strategies — Provisioned Concurrency, Keep-Warm Scheduling, Lightweight Runtime Selection, and Dependency Minimization — and evaluate their cost-performance trade-offs. Case study findings reveal that serverless adoption fundamentally reshapes team topologies, reducing infrastructure operations burden by 65% while requiring new competencies in cost observability and function granularity design. This work provides the most comprehensive empirical treatment of serverless DevOps integration to date.

Kenji Watanabe, Amara Diallo, Florian Huber, Priscilla Nwosu· Jul 2019· 334 citations

Federated Learning in Distributed Cloud Environments: Communication Efficiency, Differential Privacy Guarantees, and Model Convergence Under Non-IID Data Distributions

Serverless Computing in DevOps Pipelines: Performance Trade-offs, Cold Start Mitigation, and Organizational Adoption Patterns

Registration Required