About
I am currently pursuing a Ph.D. in the Department of Intelligent Systems Engineering at Indiana University Bloomington, where I am affiliated with the HiPDAC lab under the supervision of Dr. Dingwen Tao. Prior to my enrollment at Indiana University, I earned a Bachelor of Science in Information Engineering from Shanghai Jiao Tong University in 2018 and a Master of Science in Electrical Engineering from the University of Southern California in 2020. During the summer of 2023, I undertook a research internship at the Pacific Northwest National Laboratory, supervised by Dr. Nathan R. Tallent. In the summer of 2025, I served as a Research Scientist Intern at ByteDance, working on machine learning systems within the Seed group. For further details or to contact me, please email me at bozhan@iu.edu.
Research
My research primarily focuses on High-Performance Computing (HPC), with particular emphasis on the following areas:
- Data Compression: Design of GPU-based lossy and lossless compression algorithms for scientific data and machine learning workloads, with an emphasis on efficiency, fidelity, and scalability.
- Parallel Computing: Development of optimized GPU kernels, parallel algorithms, and distributed workflows to accelerate large-scale scientific and engineering applications.
- Quantum Computing: Investigation of high-performance methods to support large-scale quantum circuit simulation and emerging quantum computing applications, with a focus on addressing memory and scalability challenges.
- Machine Learning Systems: Research on system-level optimizations for training and inference, including memory footprint reduction, model compression, and GPU-accelerated data pipelines.
Selected Publications
- (IPDPS'26)
Boyuan Zhang, Ding Zhou, Yafan Huang, Shihui Song, Hao Feng, Jinda Jia, Chengming Zhang, and Zhi Zhang.
"Near-Zero Cost KV Cache Compression for Large Language Model Inference."
In Proceedings of the 39th IEEE International Parallel and Distributed Processing Symposium, 2026.
- (IPDPS'26)
Boyuan Zhang, Luanzheng Guo, Jiannan Tian, Jinyang Liu, Daoce Wang, Chengming Zhang, Bo Fang, Fengguang Song, Jan Strube, Nathan R. Tallent, and Dingwen Tao.
"Accelerating AI Compression through Lightweight Lossless Encoding and Pipelined Workflows."
In Proceedings of the 39th IEEE International Parallel and Distributed Processing Symposium, 2026.
- (ICS'25 Best Paper Runner-Up)
Boyuan Zhang, Bo Fang, Fanjiang Ye, Luanzheng Guo, Fengguang Song, Nathan R. Tallent, and Dingwen Tao.
"BMQSim: Overcoming Memory Constraints in Quantum Circuit Simulation with a High-Fidelity Compression Framework."
In Proceedings of the 39th ACM International Conference on Supercomputing, pp. 689-704. 2025.
- (ICS'25 Best Paper Candidate)
Boyuan Zhang, Yafan Huang, Sheng Di, Fengguang Song, Guanpeng Li, and Franck Cappello.
"Pushing the Limits of GPU Lossy Compression: A Hierarchical Delta Approach."
In Proceedings of the 39th ACM International Conference on Supercomputing, pp. 654-669. 2025.
- (SC'24)
Hao Feng*, Boyuan Zhang*, Fanjiang Ye, Min Si, Ching-Hsiang Chu, Jiannan Tian, Chunxing Yin, Summer Deng, Yuchen Hao, Pavan Balaji, Tong Geng, and Dingwen Tao.
"Accelerating Communication in Deep Learning Recommendation Model Training with Dual-Level Adaptive Lossy Compression."
In SC24: International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 1-16. IEEE, 2024.
- (HPDC'23)
Boyuan Zhang, Jiannan Tian, Sheng Di, Xiaodong Yu, Yunhe Feng, Xin Liang, Dingwen Tao, and Franck Cappello.
"FZ-GPU: A Fast and High-Ratio Lossy Compressor for Scientific Computing Applications on GPUs."
In Proceedings of the 32nd International Symposium on High-Performance Parallel and Distributed Computing, pp. 129–142, 2023.
- (ICS'23)
Boyuan Zhang, Jiannan Tian, Sheng Di, Xiaodong Yu, Martin Swany, Dingwen Tao, and Franck Cappello.
"GPULZ: Optimizing LZSS Lossless Compression for Multi-Byte Data on Modern GPUs."
In Proceedings of the 37th ACM International Conference on Supercomputing, pp. 348–359, 2023.