Publications

2026

[IEEE Micro] Beyond the Accelerator: A Full-Stack HW/SW Co-Design Analysis for Recommendation System Inference
Zhanqiu Hu, Mark Zhao, Zhiru Zhang, and Udit Gupta
IEEE Micro

[ICLR’26] vCache: Verified Semantic Prompt Caching
Luis Gaspar Schroeder, Aditya Desai, Alejandro Cuadron, Kyle Chu, Shu Liu, Mark Zhao, Stephan Krusche, Alfons Kemper, Matei Zaharia, and Joseph E. Gonzalez
International Conference on Learning Representations

[NSDI’26] Accelerating Mixture-of-Experts Training with Adaptive Expert Replication
Athinagoras Skiadopoulos, Mark Zhao, Swapnil Gandhi, Thomas Norrie, Shrijeet Mukherjee, and Christos Kozyrakis
USENIX Symposium on Networked Systems Design and Implementation

2025

[VLDB’25] cedar: Optimized and Unified Machine Learning Input Data Pipelines
Mark Zhao, Emanuel Adamiak, and Christos Kozyrakis
Proceedings of the VLDB Endowment

2024

[SOSP’24] ReCycle: Resilient Training of Large DNNs using Pipeline Adaptation
Swapnil Gandhi, Mark Zhao, Athinagoras Skiadopoulos, and Christos Kozyrakis
ACM Symposium on Operating Systems Principles

[OSDI’24] High-throughput and Flexible Host Networking for Accelerated Computing
Athinagoras Skiadopoulos, Zhiqiang Xie, Mark Zhao, Qizhe Cai, Saksham Agarwal, Jacob Adelmann, David Ahern, Carlo Contavalli, Michael Goldflam, Vitaly Mayatskikh, Raghu Raja, Daniel Walton, Rachit Agarwal, Shrijeet Mukherjee, and Christos Kozyrakis
USENIX Symposium on Operating Systems Design and Implementation

2023

[ATC’23] Tectonic-Shift: A Composite Storage Fabric for Large-Scale ML Training
Mark Zhao, Satadru Pan, Niket Agarwal, Zhaoduo Wen, David Xu, Anand Natarajan, Pavan Kumar, Shiva Shankar P, Ritesh Tijoriwala, Karan Asher, Hao Wu, Aarti Basant, Daniel Ford, Delia David, Nezih Yigitbasi, Pratap Singh, Carole-Jean Wu, and Christos Kozyrakis
USENIX Annual Technical Conference

[MLSys’23] RecD: Deduplication for End-to-End Deep Learning Recommendation Model Training Infrastructure
Mark Zhao, Dhruv Choudhary, Devashish Tyagi, Ajay Somani, Max Kaplan, Sung-Han Lin, Sarunya Pumma, Jongsoo Park, Aarti Basant, Niket Agarwal, Carole-Jean Wu, and Christos Kozyrakis
Conference on Machine Learning and Systems

2022

[ISCA’22] Understanding Data Storage and Ingestion for Large-Scale Deep Recommendation Model Training
Mark Zhao, Niket Agarwal, Aarti Basant, Buğra Gedik, Satadru Pan, Mustafa Ozdal, Rakesh Komuravelli, Jerry Pan, Tianshu Bao, Haowei Lu, Sundaram Narayanan, Jack Langman, Kevin Wilfong, Harsha Rastogi, Carole-Jean Wu, Christos Kozyrakis, and Parik Pol
IEEE/ACM International Symposium on Computer Architecture

[ASPLOS’22] ShEF: Shielded Enclaves for Cloud FPGAs
Mark Zhao, Mingyu Gao, and Christos Kozyrakis
ACM International Conference on Architectural Support for Programming Languages and Operating Systems

2021

[SoCC’21] Llama: A Heterogeneous & Serverless Framework for Auto-Tuning Video Analytics Pipelines
Francisco Romero*, Mark Zhao*, Neeraja J. Yadwadkar, and Christos Kozyrakis
ACM Symposium on Cloud Computing

2018

[CCS’18] HyperFlow: A Processor Architecture for Nonmalleable, Timing-Safe Information Flow Security
Andrew Ferraiuolo, Mark Zhao, Andrew C. Myers, and G. Edward Suh
ACM Conference on Computer and Communications Security

[S&P’18] FPGA-Based Remote Power Side-Channel Attacks
Mark Zhao and G. Edward Suh
IEEE Symposium on Security and Privacy
Distinguished Practical Paper Award
Top Pick in Hardware and Embedded Security