Accepted Papers

Generating representative macrobenchmark microservice systems from distributed traces with ShortKut
Between Promise and Pain: The Reality of Automating Microservices Diagnosis with Large Language Models
Towards Fully Disaggregated Recommendation Model Serving
Guess Who’s Back: Comeback Interval based Multi-tier Memory Management
Autokernel: ML-native Dataplane Operating Systems
TARDIS: A GPU-Centric KV Cache Service for Efficient LLM Inference
Indispensable CPU-centric Checkpointing for GPUs
Cloud abstractions for AI workloads
Towards Microsecond-Scale VM Core Provisioning Agility on Serverless Platforms
Meshilon: A Merging-based Java Garbage Collector
CrazyOS: Memory management and scheduling should operate through a unified subsystem to improve maintainability
Staying in the Zone - Retrofitting Zoned Storage into a Scalable Enterprise File System
SHOC: A Simplified Programming Model for Hardware Offloading on DPUs
Implementing a persistent key-value store in a tamper-resistant device for SGX enclave applications
FPGAs are the Hero In-Network Computing Needs
Mainframe-style channel controllers for modern disaggregated memory systems
Metadata-Driven Near-Exclusive Caching for NVMe-oF SANs with eBPF
Can LLMs Replace Time-Tested System Policies? Perhaps
A System-level Abstraction and Service for Flourishing AI-powered Applications
CPU Autoscaling With a Kernel of Truth
Nesting Overlay File Systems with ShadowWhiteout
HyperGen: Optimizing Generative Inference with Long Prompts for Resource-Constrained Systems
Parsec: Fast, Scalable, and Secure Design with Wait-Free Parallelism
Chimera-VDB: Mixed-Precision Vector Database with HNSW Index for RAG-LLM
Exploring B+-Tree Implementations Using Scratchpad Memory
Using Recursive Attestation to Scale Trust in Modern Heterogeneous Cloud Architectures