Publications
2025
- ASPLOS’26SwiftSpec: Disaggregated Speculative Decoding and Fused Kernels for Low-Latency LLM Inference2025
2024
- NSDI’24GRACE: Loss-Resilient Real-Time Video through Neural CodecsIn 21st USENIX Symposium on Networked Systems Design and Implementation (NSDI 24) , Apr 2024