awesome-serverless-research
github.com/ddnirvana/awesome-serverless-research ↗A curated list of awesome serverless research works, including papers and open-sourced projects.
Use this list with your AI agent
Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:
"Show me reliability and fault tolerance resources from awesome-serverless-research"
Installation instructions →What's inside
Reliability and Fault Tolerance
- A fault-tolerance shim for serverless computing (Eurosys'20)
The paper presents AFT, an atomic fault tolerance shim for serverless functions. AFT ensures atomic visibility of updates by enforcing read atomic isolation guarantee.
- Fault-tolerant and transactional stateful serverless workflows (OSDI'20)
The paper presents Beldi, a library and runtime system for writing fault-tolerant stateful serverless functions. Code available at
- Formal Foundations of Serverless Computing (OOPSLA'19)
It is the best paper of OOPSLA'19, which presents formal model to analyze (and prove) the reliability of serverless computing.
Benchmarking
- Architectural Implications of Function-as-a-Service Computing (Micro'19)
- Benchmarking, Analysis, and Optimization of Serverless Function Snapshots (ASPLOS'21)
This work introduces one system and one optimization. The system is vHive, an open-source and full-stack serverless framework. The framework is built based on several state-of-the-art infastructure, e.g., Firecracker. Besides, the paper proposes an optimization, called REAP, which is a software mechanism for serverless hosts that records functions' stable working set of guest memory pages and proactively prefetches it from disk into memory.
- Characterizing Serverless Platforms with ServerlessBench (SOCC'20)
This paper presents ServerlessBench, which is the first and state-of-the-art serverless benchmarks. The benchmark suite is open-sourced at
- Serverless in the Wild: Characterizing and Optimizing the Serverless Workload at a Large Cloud Provider (ATC'20)
SIW is an industry paper by Microsoft Azure. You can learn about real-world issues/techniques of serverless computing from it.
Other Recommended Lists
- Awesome Serverless
A curated list of awesome services, solutions and resources for serverless / nobackend applications.
- awesome-serverless-security
A curated list of awesome serverless security resources such as (e)books, articles, whitepapers, blogs and research papers.
Stateful serverless computing
- Boki: Stateful Serverless Computing with Shared Logs (SOSP'21)
Boki solves many problems on stateful serverless computing, including state consistency, fault tolerance, and high-level abstractions for stateful serverless applications. The major novelity is its shared log design. The system is open-sourced at
- Cloudburst: stateful functions-as-a-service (VLDB'20)
The paper presents Cloudburst, a stateful FaaS platform that provides Python-like programming models. Cloudburst leverages Anna, a scalable key-value store, to achieve low-latency mutable state and communication.
Optimizations
- Catalyzer: Sub-millisecond Startup for ServerlessComputing with Initialization-less Booting (ASPLOS'20)
Catalyzer proposes init-less, which leverages fork/snapshots to skip the initialization costs during startup, and can achieve <1ms startup latency in the best case. The system is based on virtualization based sandboxes, gVisor.
- FaasCache: Keeping Serverless Computing Alive with Greedy-Dual Caching (ASPLOS'21)
- Faastlane: Accelerating Function-as-a-Service Workflows (ATC'21)
Faastlane aims to optimize the communication latency between functions, which is significant for function chain (or workflows in the title). The major idea is to use thread-level isolation, that means different functions are located in the same process, and Faastlane utilizes Intel MPK to provide isolation between different threads.
- Replayable Execution Optimized for Page Sharing for a Managed Runtime Environment (Eurosys'19)
Replayable optimizes CRIU for docker containers to boost container startup latency and reduce memory costs.
- SAND: Towards High-Performance Serverless Computing (ATC'18)
- SEUSS: skip redundant paths to make serverless fast (Eurosys'20)
SEUSS utilizes similir ideas as Catalyzer and Replayable to boost serverless latency (especially for startup latency) using fork/snapshots. The system is based on unikernels.
Review
Applications
- Dorylus: Affordable, Scalable, and Accurate GNN Training with Distributed CPU Servers and Serverless Threads (OSDI'20)
This paper presents Dorylus, a distributed system for training GNNs by utilizing serverless computing. Dorylus can achieve even better performance and low costs than CPU and GPU-based systems.
- Shuffling, Fast and Slow: Scalable Analytics on Serverless Infrastructure (NSDI'19)
This paper presents Locus, a serverless analytics system combines cheap but slow storage with fast but expensive storage to balance performance and costs.
- Sprocket: A Serverless Video Processing Framework (SOCC'18)
This paper presents Sprocket, a configurable, stage-based, scalable, serverless video processing framework that exploits intra-video parallelism to achieve low latency.
- The serverless shell (Middleware'21)
This paper presents an interesting serverless applications, which is a shell (called sshell), that allow users to run shell scripts on a serverless platform. The sourced code is avialable at
Other Related Works
Showing a sample of 37 resources. View the full list on GitHub →