I'm an assistant professor at Shanghai Innovation Institute. I received my Ph.D. in Computer Science from Shanghai Jiao Tong University. I am a member of APEX Lab, advised by Prof. Weinan Zhang and Prof. Yong Yu, and I collaborate closely with Tianqi Chen through the open-source community.

My research interests include distributed machine learning systems and machine learning compilers.

News

2025.07: I'm looking for self-motivated Ph.D. students who enjoy coding and are interested in (distributed) machine learning systems and AI infrastructure. Feel free to contact me via email.

Education

Shanghai Jiao Tong University

Ph.D. in Computer Science, School of Computer Science 2020 - 2025

Research areas: machine learning compilers, machine learning systems

Shanghai Jiao Tong University

B.Sc. in Computer Science, ACM Honors Class, Zhiyuan College 2016 - 2020

Projects

Apache TVM

GitHub Repo   ...   ...

  • Open-source machine learning compiler, enabling model deployment on diverse hardware backends
  • Leading the TensorIR project, the next-generation tensor-level IR for tensor hardware
  • Co-leading the TVM Unity/Relax project, the next-generation graph-level IR for dynamic models
  • Contributing to several key features: TVMScript, MetaSchedule, runtime, frontend
  • Serving on the Apache TVM Project Management Committee (PMC)

MLC-LLM

GitHub Repo   ...   ...

  • Compiling LLMs and deploying them natively on every device with GPU acceleration

Web-LLM

GitHub Repo   ...   ...

  • Bringing large language models to web browsers with local GPU acceleration

Publications

Productively Deploying Emerging Models on Emerging Platforms: A Top-Down Approach for Testing and Debugging

ISSTA 2025 [Paper]

Siyuan Feng*, Jiawei Liu*, Ruihang Lai, Charlie F. Ruan, Yong Yu, Lingming Zhang, Tianqi Chen

Relax: Composable Abstractions for End-to-End Dynamic Machine Learning

ASPLOS 2025 [Paper]

Ruihang Lai*, Junru Shao*, Siyuan Feng*, Steven S. Lyubomirsky*, Bohan Hou, Wuwei Lin, Zihao Ye, Hongyi Jin, Yuchen Jin, Jiawei Liu, Lesheng Jin, Yaxing Cai, Ziheng Jiang, Yong Wu, Sunghyun Park, Prakalp Srivastava, Jared Roesch, Todd C. Mowry, Tianqi Chen

WebLLM: A High-Performance In-Browser LLM Inference Engine

Preprint [arXiv]

Charlie F. Ruan, Yucheng Qin, Xun Zhou, Ruihang Lai, Hongyi Jin, Yixin Dong, Bohan Hou, Meng-Shiun Yu, Yiyan Zhai, Sudeep Agarwal, Hangrui Cao, Siyuan Feng, Tianqi Chen

TensorIR: An Abstraction for Automatic Tensorized Program Optimization

ASPLOS 2023 [Paper]

Siyuan Feng*, Bohan Hou*, Hongyi Jin, Wuwei Lin, Junru Shao, Ruihang Lai, Zihao Ye, Lianmin Zheng, Cody Hao Yu, Yong Yu, Tianqi Chen

Effectively Scheduling Computational Graphs of Deep Neural Networks toward Their Domain-Specific Accelerators

OSDI 2023 [Paper]

Jie Zhao, Siyuan Feng, Xiaoqiang Dan, Fei Liu, Chengke Wang, Sheng Yuan, Wenyuan Lv, Qikai Xie

Tensor Program Optimization with Probabilistic Programs

NeurIPS 2022 [Paper]

Junru Shao, Xiyou Zhou, Siyuan Feng, Bohan Hou, Ruihang Lai, Hongyi Jin, Wuwei Lin, Masahiro Masuda, Cody Hao Yu, Tianqi Chen

CityFlow: A Multi-Agent Reinforcement Learning Environment for Large Scale City Traffic Scenario

WWW 2019 [Paper]

Huichu Zhang, Siyuan Feng, Chang Liu, Yaoyao Ding, Yichen Zhu, Zihan Zhou, Weinan Zhang, Yong Yu, Haiming Jin, Zhenhui Li

Services

Apache Software Foundation

Community member 2024 - Present

Apache TVM

Project Management Committee (PMC) member 2022 - Present