I am currently a graduate student at UC Merced working with Prof. Hyeran Jeon. Before this, I earned my M.S. degree in Software Engineering and my B.E. degree in Computer Science from Shandong University, working with Prof. Lei Ju.
I am excited to delve into research topics related to machine learning systems and machine learning-oriented computer architecture. My research interests extend to heterogeneous computing, high-performance/parallel computing (CUDA), as well as static and dynamic program analysis and optimization.
💻 Experiences
- 08/2023 - 11/2023, Research Intern, ByteDance, San Jose, CA, USA
  - Analyzing and reducing GPU memory waste and fragmentation in large language model (LLM) training.
- 05/2023 - 08/2023, Research Intern, ByteDance, Seattle, WA, USA
  - Analyzing and reducing GPU memory waste and fragmentation in large language model (LLM) training.
- 11/2022 - 02/2023, Part-time Software Engineer Intern, Uber, Sunnyvale, CA, USA
  - Detecting and fixing data races in Uber’s Go codebase.
- 06/2022 - 08/2022, Research Intern, PNNL, Richland, WA, USA
  - Detecting floating-point overflow in GPU-accelerated applications.
📝 Publications
[ASPLOS’23] “DrGPUM: Guiding Memory Optimization for GPU-accelerated Applications”, Mao Lin, Keren Zhou, and Pengfei Su. The 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Mar 25-29, 2023, Vancouver, BC, Canada. [Paper][Code][Kudos][PDF]
[PyTorch Conference’22] “Poster: Squeezing GPU Memory Usage in PyTorch”, Mao Lin, Keren Zhou, and Pengfei Su. Dec. 2022, New Orleans, LA, USA. [Poster]
[TCAD’22] “A comprehensive memory management framework for CPU-FPGA heterogenous SoCs”, Zelin Du, Qianling Zhang, Mao Lin, Shiqing Li, Xin Li, and Lei Ju. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (2022). [Paper][PDF]
💬 Talks
- 04/2023, Presented DrGPUM at the 1st UC Merced EECS Research Symposium
- 03/2023, Presented the DrGPUM paper at ASPLOS’23 [Slides]
- 12/2022, Presented a poster on squeezing GPU memory usage in PyTorch at PyTorch Conference’22
👔 Professional Services
- Artifact Evaluation Committee: PPoPP’23, ASPLOS’24
📚 Teaching
- Teaching Assistant for Intro to Programming Laboratory Skills/Techniques (CSE 022), 2023 Spring
- Teaching Assistant for Data Structures (CSE 030), 2022 Spring
- Teaching Assistant for Advanced Programming (CSE 024), 2021 Fall
- Teaching Assistant for Intro to Object-Oriented Programming (CSE 165), 2021 Fall
🧩 Skills
- Languages: C/C++; Python; CUDA; Go; Java; Shell; HTML; CSS; JSON
- Platforms: Linux; CPU-GPU HMPSoCs; CPU-FPGA HMPSoCs
- Frameworks: PyTorch; Darknet; TensorFlow
- Toolchains: Git; Nsight Systems; Linux perf; GDB; Xilinx Vivado Suite