Publications

2025

  1. arXiv, 2025
    SwiftSpec: Ultra-Low Latency LLM Decoding by Scaling Asynchronous Speculative Decoding
    Ziyi Zhang , Ziheng Jiang , Chengquan Jiang , Menghan Yu , Size Zheng , Haibin Lin , Henry Hoffmann , and Xin Liu
    2025

2024

  1. NSDI’24
    GRACE: Loss-Resilient Real-Time Video through Neural Codecs
    Yihua Cheng , Ziyi Zhang , Hanchen Li , Anton Arapin , Yue Zhang , Qizheng Zhang , Yuhan Liu , Kuntai Du , Xu Zhang , Francis Y. Yan , Amrita Mazumdar , Nick Feamster , and Junchen Jiang
    In 21st USENIX Symposium on Networked Systems Design and Implementation (NSDI 24) , Apr 2024

2023

  1. SOSP’23
    Bagpipe: Accelerating Deep Recommendation Model Training
    Saurabh Agarwal , Chengpo Yan , Ziyi Zhang , and Shivaram Venkataraman
    In Proceedings of the 29th Symposium on Operating Systems Principles (SOSP 23) , Koblenz, Germany, Apr 2023