Hi, I’m Yiyan Zhai :)

I will be joining CMU Catalyst Group this Fall to start my PhD with Prof. Tianqi Chen! I received my B.S. in Computer Science from Carnegie Mellon University. My research interests lie at building efficient and scalable ML systems.

I have been working with Prof. Tianqi Chen at CMU Catalyst Group on:

  • FlashInfer-Bench, a kernel benchmarking loop that goes from kernel generation → evaluation → drop-in replacement in serving stacks (FlashInfer/SGLang/vLLM).
  • WebLLM Assistant, which integrates Overleaf and Google Workspace with in-browser agents using WebLLM.

I am also fortunate to collaborate with Prof. Juncheng Yang at Harvard SEAS on:

News 📰

  • May 2026: Clock2Q+ is accepted by VLDB industrial track 2026. See you in Boston! 🎉
  • May 2026: We presented FlashInfer-Bench at MLSys 2026.
  • Apr 2026: I will be joining CMU as a PhD student this Fall! 🎓
  • Jan 2026: FlashInfer-Bench is accepted by MLSys 2026! 🎉