Accepted Papers
- Generating representative macrobenchmark microservice systems from distributed traces with ShortKut
- Between Promise and Pain: The Reality of Automating Microservices Diagnosis with Large Language Models
- Towards Fully Disaggregated Recommendation Model Serving
- Guess Who’s Back: Comeback Interval based Multi-tier Memory Management
- Autokernel: ML-native Dataplane Operating Systems
- TARDIS: A GPU-Centric KV Cache Service for Efficient LLM Inference
- Indispensable CPU-centric Checkpointing for GPUs
- Cloud abstractions for AI workloads
- Towards Microsecond-Scale VM Core Provisioning Agility on Serverless Platforms
- Meshilon: A Merging-based Java Garbage Collector
- CrazyOS: Memory management and scheduling should operate through a unified subsystem to improve maintainability
- Staying in the Zone - Retrofitting Zoned Storage into a Scalable Enterprise File System
- SHOC: A Simplified Programming Model for Hardware Offloading on DPUs
- Implementing a persistent key-value store in a tamper-resistant device for SGX enclave applications
- FPGAs are the Hero In-Network Computing Needs
- Mainframe-style channel controllers for modern disaggregated memory systems
- Metadata-Driven Near-Exclusive Caching for NVMe-oF SANs with eBPF
- Can LLMs Replace Time-Tested System Policies? Perhaps
- A System-level Abstraction and Service for Flourishing AI-powered Applications
- CPU Autoscaling With a Kernel of Truth
- Nesting Overlay File Systems with ShadowWhiteout
- HyperGen: Optimizing Generative Inference with Long Prompts for Resource-Constrained Systems
- Parsec: Fast, Scalable, and Secure Design with Wait-Free Parallelism
- Chimera-VDB: Mixed-Precision Vector Database with HNSW Index for RAG-LLM
- Exploring B+-Tree Implementations Using Scratchpad Memory
- Using Recursive Attestation to Scale Trust in Modern Heterogeneous Cloud Architectures