hzhua201 [at] gmail [dot] com
I am a Senior Researcher at Microsoft Research Asia (Shanghai). I obtained my Ph.D. degree from The University of Hong Kong (HKU) in 2020, advised by Prof. Francis C.M. Lau. Before that, I obtained my B.Eng. in Electronic and Information Engineering from the University of Electronic Science and Technology of China in 2014. My research interests are resource management, systems for machine learning, and cloud computing.
Parrot: Efficient Serving of LLM-based Applications with Semantic Variable
Chaofan Lin+, Zhenhua Han*, Chengruidong Zhang, Yuqing Yang, Fan Yang, Chen Chen, Lili Qiu
(* Corresponding author)
USENIX OSDI 2024
[Paper]
[Code]
PIT: Optimization of Dynamic Sparse Deep Learning Models via Permutation Invariant Transformation
Ningxin Zheng, Huiqiang Jiang, Quanlu Zhang*, Zhenhua Han*, Lingxiao Ma, Yuqing Yang*, Fan Yang, Chengruidong Zhang, Lili Qiu, Mao Yang, Lidong Zhou
(* Corresponding author)
ACM SOSP 2023
Optimizing Dynamic Neural Networks with Brainstorm
Weihao Cui+, Zhenhua Han, Lingji Ouyang+, Yichuan Wang, Ningxin Zheng, Lingxiao Ma, Yuqing Yang, Fan Yang, Jilong Xue, Lili Qiu, Lidong Zhou, Quan Chen, Haisheng Tan, Minyi Guo
(* Corresponding author)
USENIX OSDI 2023
Dynamic Resource Allocation for Deep Learning Clusters with Separated Compute and Storage
Mingxia Li+, Zhenhua Han, Chi Zhang, Ruiting Zhou, Yuanchi Liu, Haisheng Tan
IEEE INFOCOM 2023
ElasticFlow: An Elastic Serverless Training Platform for Distributed Deep Learning
Diandian Gu, Yihao Zhao, Yinmin Zhong, Yifan Xiong, Zhenhua Han, Peng Cheng, Fan Yang, Gang Huang, Xin Jin, Xuanzhe Liu
ASPLOS 2023
SiloD: A Co-design of Caching and Scheduling for Deep Learning Clusters
Hanyu Zhao*, Zhenhua Han*, Zhi Yang, Quanlu Zhang, Mingxia Li+, Fan Yang, Qianxi Zhang, Binyang Li, Yuqing Yang, Lili Qiu, Lintao Zhang, Lidong Zhou
(*co-first author)
EuroSys 2023
[Paper]
PilotFish: Harvesting Free Cycles of Cloud Gaming with Deep Learning Training
Wei Zhang+, Binghao Chen, Zhenhua Han, Quan Chen, Peng Cheng, Fan Yang, Ran Shu, Yuqing Yang, Minyi Guo
(* Corresponding author)
USENIX ATC 2022
[Paper]
[Code]
[Demo Video]
HiveD: Sharing a GPU Cluster for Deep Learning with Guarantees
Hanyu Zhao*, Zhenhua Han*, Zhi Yang, Quanlu Zhang, Fan Yang, Lidong Zhou, Mao Yang, Francis C.M. Lau, Yuqi Wang, Yifan Xiong, Bin Wang
(*co-first author)
USENIX OSDI 2020
[Paper]
[Github]
Retiarii: A Deep Learning Exploratory-Training Framework
Quanlu Zhang, Zhenhua Han, Fan Yang, Yuge Zhang, Zhe Liu, Mao Yang, Lidong Zhou
USENIX OSDI 2020
[Artifact Code]
Automating Cloud Deployment for Deep Learning Inference of Real-time Online Services
Yang Li*, Zhenhua Han*, Quanlu Zhang, Zhenhua Li, Haisheng Tan
(*Co-first authors)
IEEE INFOCOM 2020
Full version accepted by IEEE/ACM Transactions on Networking, 2023
Gandiva: Introspective Cluster Scheduling for Deep Learning
Wencong Xiao, Romil Bhardwaj, Ramachandran Ramjee, Muthian Sivathanu, Nipun Kwatra, Zhenhua Han, Pratyush Patel, Xuan Peng, Hanyu Zhao, Quanlu Zhang, Fan Yang, Lidong Zhou
USENIX OSDI 2018
[Paper]
Online File Caching in Latency-Sensitive Systems with Delayed Hits and Bypassing
Chi Zhang, Haisheng Tan, Guopeng Li, Zhenhua Han, Shaofeng H.-C. Jiang, Xiang-Yang Li
IEEE INFOCOM 2022
Regularization-Based Coflow Scheduling in Optical Circuit Switches
Haisheng Tan, Chi Zhang, Chao Xu, Yupeng Li, Zhenhua Han, Xiang-Yang Li
To appear in IEEE/ACM Transactions on Networking, 2021
Online Dispatching and Scheduling of Jobs with Heterogeneous Utilities in Edge Computing
Chi Zhang, Haisheng Tan, Haoqiang Huang, Zhenhua Han, Shaofeng H.-C. Jiang, Nikolaos Freris, Xiang-Yang Li
ACM MobiHoc 2020
Scheduling Placement-Sensitive BSP Jobs with Inaccurate Execution Time Estimation
Zhenhua Han, Haisheng Tan, Shaofeng H.-C. Jiang, Xiaoming Fu, Wanli Cao, Francis C.M. Lau
To appear in IEEE/ACM Transactions on Networking, 2021
Previously accepted by IEEE INFOCOM 2020
OnDisc: Online Latency-Sensitive Job Dispatching and Scheduling in Heterogeneous Edge-Clouds
Zhenhua Han, Haisheng Tan, Xiang-Yang Li, Shaofeng H.-C. Jiang, Yupeng Li, Francis C.M. Lau
IEEE/ACM Transactions on Networking, Dec. 2019
Previously accepted by IEEE INFOCOM 2017
Joint Online Coflow Routing and Scheduling in Data Center Networks
Haisheng Tan, Shaofeng Jiang, Yupeng Li, Xiang-Yang Li, Chenzi Zhang, Zhenhua Han, Francis C.M. Lau
IEEE/ACM Transactions on Networking, Aug. 2019
Camul: Online Caching on Multiple Caches with Relaying and Bypassing
Haisheng Tan, Shaofeng Jiang, Zhenhua Han*, Liuyan Liu, Kai Han, Qinglin Zhao
(* Corresponding author)
IEEE INFOCOM 2019
Energy Efficient Dynamic Virtual Machine Management in Data Centers
Zhenhua Han, Haisheng Tan, Rui Wang, Guihai Chen, Yupeng Li, Francis C.M. Lau
IEEE/ACM Transactions on Networking, Jan. 2019
Previously accepted by IEEE INFOCOM 2016
Efficient Online Learning Based Cross-Tier Uplink Scheduling in HetNets
Zhenhua Han, Haisheng Tan, Rui Wang, Shaojie Tang, Francis C.M. Lau
To appear in IEEE/ACM Transactions on Networking, 2022
Previously accepted by IEEE INFOCOM 2018
Congestion Game with Agent and Resource Failures
Yupeng Li, Yongzheng Jia, Haisheng Tan, Rui Wang, Zhenhua Han, Francis C.M. Lau
IEEE Journal on Selected Areas in Communications (JSAC) 2017