58th IEEE/ACM International Symposium on Microarchitecture
|
Haoran Geng, Xiaoyang Lu, Yuezhi Che, Ziang Tian, Dazhao Cheng, Xian-He Sun, Michael Niemier, X. Sharon Hu |
COSMOS: RL-Enhanced Locality-Aware Counter Cache Optimization for Secure Memory
|
2025 International Conference for High Performance Computing, Networking, Storage, and Analysis
|
Weihu Wang, Yaqi Xia, Donglin Yang, Xiaobo Zhou, Dazhao Cheng |
MXBLAS: Accelerating 8-bit Deep Learning with a Unified Micro-Scaled GEMM Library
|
2025 International Conference for High Performance Computing, Networking, Storage, and Analysis
|
Zheng Zhang, Hulin Wang, Hongming Xu, Donglin Yang, Xiaobo Zhou, Dazhao Cheng |
HyTiS: Hybrid Tile Scheduling for GPU GEMM with Enhanced Wave Utilization and Cache Locality
|
ACM Conference on Human Factors in Computing Systems
|
Siyu Wang, Janice Jianing Si, Huanghuang Liang, Chuang Hu, Yujun Zhu, Xiaobo Zhou, Kanye Ye Wang, Da |
Understanding the Challenges Students Face in Non-English Programming Environments Due to the Programming Language Transition: A Case Study of Keywords in the Chinese Version of Scratch
|
IEEE International Conference on Data Engineering
|
Wenhan Wu, Yili Gong, Jiawei Jiang, Chuang Hu, Xiaobo Zhou, Dazhao Cheng |
Defending against Attribute Inference Attacks in Post-Training of Recommendation Systems via Unlearning
|
ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming
|
Hulin Wang, Yaqi Xia, Donglin Yang, Xiaobo Zhou, Dazhao Cheng |
Harnessing Inter-GPU Shared Memory for Seamless MoE Communication-Computation Fusion
|
IEEE Transactions on Mobile Computing
|
Rui Ge, Huanghuang Liang, Zheng Gong, Chuang Hu, Xiaobo Zhou, Dazhao Cheng |
Streamlining Data Transfer in Collaborative SLAM through Bandwidth-aware Map Distillation
|
IEEE Transactions on Parallel and Distributed Systems
|
Huanghuang Liang, Xin Yang, Xiaoming Han, Boan Liu, Chuang Hu, Dan Wang, Xiaobo Zhou, Dazhao Cheng |
Spread+: Scalable Model Aggregation in Federated Learning with Non-IID Data
|
USENIX Annual Technical Conference
|
Yaqi Xia, Weihu Wang, Donglin Yang, Xiaobo Zhou, Dazhao Cheng |
Revitalizing Sparse Matrix-Matrix Multiplication on Tensor Cores with Asynchronous and Balanced Kernel Optimization
|
International World Wide Web Conference
|
Wenhan Wu, Chuang Hu |
Aegis: Post-Training Attribute Unlearning in Federated Recommender Systems against Attribute Inference Attacks
|
APNET2024
8th Asia-Pacific Workshop on Networking
|
Zhili He, Tianyu Tu, Kanye Ye Wang, Bing Luo, Dazhao Cheng, Chuang Hu |
Federated Spectrum Management Through Hedonic Coalition Formation
|
IEEE Internet of Things Journal
|
Tianyu Tu, Zhili He, Zhigao Zheng, Zimu Zheng, Jiawei Jiang, Yili Gong, Chuang Hu, Dazhao Cheng |
Towards Lifelong Unseen Task Processing with a Lightweight Unlabeled Data Schema for AIoT
|
JPDC2024
Journal of Parallel and Distributed Computing
|
Wei Rang, Huanghuang Liang, Ye Wang, Xiaobo Zhou, Dazhao Cheng |
A Unified Hybrid Memory System for Scalable Deep Learning and Big Data Applications
|
International Conference for High Performance Computing, Networking, Storage, and Analysis
|
Weihu Wang, Yaqi Xia, Donglin Yang, Xiaobo Zhou, Dazhao Cheng |
Accelerating Distributed DLRM Training with Optimized TT Decomposition and Micro-Batching
|
International Conference for High Performance Computing, Networking, Storage, and Analysis
|
Zheng Zhang, Donglin Yang, Xiaobo Zhou, Dazhao Cheng |
MCFuser: High-Performance and Rapid Fusion of Memory-Bound Compute-Intensive Operators
|
International Conference for High Performance Computing, Networking, Storage, and Analysis
|
Yaqi Xia, Donglin Yang, Xiaobo Zhou, Dazhao Cheng |
Scaling New Heights: Transformative Cross-GPU Sampling for Training Billion-Edge Graphs
|
TBD2024
IEEE Transactions on Big Data
|
Huanghuang Liang, Zheng Zhang, Yili Gong, Chuang Hu, Dazhao Cheng |
A Survey on Spatio-temporal Big Data Analytics Ecosystem: Resource Management, Processing Platform, and Applications
|
IEEE Transactions on Computers
|
Xinquan Cai, Qianlong Sang, Chuang Hu, Yili Gong, Kun Suo, Xiaobo Zhou, Dazhao Cheng |
Incendio: Priority-based Scheduling for Alleviating Cold Start in Serverless Computing
|
IEEE Transactions on Computers
|
Hulin Wang, Donglin Yang, Yaqi Xia, Zheng Zhang, Qigang Wang, Jianping Fan, Xiaobo Zhou, Dazhao Chen |
Raptor-T: A Fused and Memory-Efficient Sparse Transformer for Long and Variable-Length Sequences
|
TCC2024
IEEE Transactions on Cloud Computing
|
Liu Liu, Zhijun Ding, Dazhao Cheng, Xiaobo Zhou |
Locality-aware and Fault-tolerant Batching for Machine Learning on Distributed Datasets
|