Mao

Graduate Student at UC Merced

Researching ML/LLM systems, heterogeneous computer architectures, high-performance computing, and program analysis and optimization.

Merced, CA, USA
UC Merced
mlin59@ucmerced.edu

About Me

I am currently a graduate student at UC Merced, where I am fortunate to work with Prof. Hyeran Jeon. Prior to this, I received my M.S. in Software Engineering and my B.E. in Computer Science from Shandong University, where I had the privilege to work with Prof. Lei Ju.

My research interests include ML/LLM systems, heterogeneous computer architectures and systems, high-performance and parallel computing (CUDA), as well as static and dynamic program analysis and optimization.

Research Areas

GPU Memory Management Unified Virtual Memory LLM/ML Systems Program Analysis Parallel Computing

Experience

ByteDance - Research Intern

05/2023 - 11/2023

Seattle/San Jose, CA/WA, USA

Analyzing and optimizing GPU memory wastage and fragmentation in large language model (LLM) training.

Uber - Part-time Software Engineer

11/2022 - 02/2023

Sunnyvale, CA, USA

Detecting and fixing data races in Uber's Golang code base.

PNNL - Research Intern

06/2022 - 08/2022

Richland, WA, USA

Detecting floating-point data overflow of GPU-accelerated applications.

Research Highlights

GPU Memory Management

Developing novel techniques for device memory and unified virtual memory management in GPU systems.

ASPLOS '23 ISCA '25

GPU Performance Analysis

Creating tools and frameworks for analyzing and optimizing GPU-accelerated applications, with a focus on memory efficiency.

ASPLOS '23

Deep Learning Systems

Tackling memory management challenges in deep learning, especially for large language model (LLM) training and inference.

EuroMLSys '25

Publications

ISCA '25 2025

Forest: Access-aware GPU UVM Management

Mao Lin, Yuan Feng, Guilherme Cox, and Hyeran Jeon

The 52nd Annual International Symposium on Computer Architecture, June 21–25, 2025, Tokyo, Japan

EuroMLSys '25 2025

Understanding Oversubscribed Memory Management for Deep Learning Training

Mao Lin and Hyeran Jeon

The 5th Workshop on Machine Learning and Systems, March 30–April 3, 2025, Rotterdam, Netherlands

ASPLOS '23 2023

DrGPUM: Guiding Memory Optimization for GPU-accelerated Applications

Mao Lin, Keren Zhou, and Pengfei Su

The 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Mar 25-29, 2023, Vancouver, BC, Canada

PyTorch Conference '22 2022

Poster: Squeezing GPU Memory Usage in PyTorch

Mao Lin, Keren Zhou, and Pengfei Su

Dec. 2022, New Orleans, LA, USA

TCAD '22 2022

A Comprehensive Memory Management Framework for CPU-FPGA Heterogenous SoCs

Zelin Du, Qianling Zhang, Mao Lin, Shiqing Li, Xin Li, and Lei Ju

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (2022)

Talks & Presentations

06/2025
Tokyo, Japan

ISCA '25 Presentation

Present "Forest: Access-aware GPU UVM Management"

03/2025
Rotterdam, Netherlands

EuroMLSys '25 Presentation

Present "Understanding Oversubscribed Memory Management for Deep Learning Training"

03/2023
Remote

ASPLOS '23 Conference

Present "DrGPUM: Guiding Memory Optimization for GPU-accelerated Applications"

12/2022
New Orleans, LA, USA

PyTorch Conference '22

Present Poster "Squeezing GPU Memory Usage in PyTorch"

Professional Services

Artifact Evaluation Committee

PPoPP '23 ASPLOS '24 ISCA '25 SOSP '25 IISWC '25 EuroSys '26

Reviewer

GPGPU '25

Teaching Experience

Computer Architecture (CSE 140)

Teaching Assistant 2024 Spring, 2025 Spring

Intro to Programming Laboratory Skills/Techniques (CSE 022)

Teaching Assistant 2023 Spring

Data Structure (CSE 030)

Teaching Assistant 2022 Spring

Advanced Programming (CSE 024)

Teaching Assistant 2021 Fall, 2024 Fall

Intro to Object Orient Program (CSE 165)

Teaching Assistant 2021 Fall

Technical Skills

Programming Languages

C/C++ Python CUDA Go Java Shell HTML/CSS JSON

Platforms & Systems

Linux/Windows/MacOS CPU-GPU HMPSoCs CPU-FPGA HMPSoCs

Frameworks & Libraries

PyTorch TensorFlow Darknet

Development Tools

Nsight Systems Linux perf GDB Git Xilinx Vivado Suite

Get in Touch

Location

Merced, CA, USA

Institution

UC Merced