Youwei Xiao
Youwei Xiao (肖有为)
School of Integrated Circuits
Peking University
Beijing, China
I am a Ph.D. candidate at the School of Integrated Circuits, Peking University, advised by Prof. Yun Liang. I received my Bachelor of Science in EECS from Peking University in 2022. My research is broadly about compiler-centered hardware-software co-design for MLSys, computer architecture, and EDA: how software workloads can guide architectural customization, how those customized capabilities can be synthesized into hardware, and how compiler and system layers can make the result usable by real applications.
One thread of my work studies hardware synthesis as the implementation layer of this stack. I build EDA software abstractions that connect high-level design intent with efficient RTL-level hardware. This includes the high-level synthesis framework Hector (ICCAD 2022), the Rust-based hardware description language Cement (FPGA 2024), and more recent work on e-graph-based synthesis optimization in SkyEgg. Across these projects, I have used MLIR, Rust, and compiler IR design to make hardware synthesis more programmable, analyzable, and optimizable.
A second thread focuses on architecture customization. I explored how compiler analysis, application profiling, design space exploration, and formal methods can automate the discovery of useful accelerators and custom instructions. Cayman (DAC 2025) generates domain-specific accelerators while considering control flow and data access strategies. ISAMORE (ASPLOS 2026) uses e-graph anti-unification to find reusable custom instructions from equivalent program fragments.
These two threads motivated a larger goal: a fully integrated co-design toolchain that can derive architecture design, hardware implementation, and compiler support from agile specifications or even directly from target applications. I initiated and led the APS project with lab classmates toward this goal. One long-term example is generating an optimized ML ASIC solution with full ML compiler support from target ML models, with as little manual intervention as possible. I also help organize tutorials at major EDA and architecture conferences to share our work on agile hardware specialization and co-design methodologies; see the APS tutorials.
More recently, I have been extending this agenda into LLM-era compiler and system techniques. On one side, I study LLMs and agents as new interfaces for compiler optimization and hardware-software co-design, including EggMind for LLM-guided equality-saturation strategy synthesis and agentic co-design workflows such as the next-generation APS and Spine. On the other side, I work on compiler and runtime systems for emerging ML workloads, including IntelliC, a human- and LLM-friendly infrastructure for retargetable tensor compilation and superoptimization across NVIDIA, Qualcomm Hexagon, and Huawei Ascend architectures; distributed tensor compilation and runtime work such as PTO Runtime; and Hive, an inference infrastructure for multi-agent systems with programming-surface and control-layer support.
news
| Mar 31, 2026 | ISAMORE winnes the ASPLOS 2026 Best Paper Award (5/1048)! |
|---|---|
| Mar 19, 2026 | I will present EggMind at the Architecture 2.0 Workshop and ISAMORE in the main program at ASPLOS 2026. |
| Jan 19, 2026 | Successfully held our APS-MLIR tutorial at ASP-DAC 2026 in Hong Kong! |
selected publications
- Arch 2.0EggMind: LLM-Driven Two-Dimensional Intelligence for Scalable Equality SaturationIn Architecture 2.0: Workshop on AI for Computing Systems Design, 2026