Hi, I’m Yiyan Zhai :)
I will be joining CMU Catalyst Group this Fall to start my PhD with Prof. Tianqi Chen! I received my B.S. in Computer Science from Carnegie Mellon University. My research interests lie at building efficient and scalable ML systems.
I have been working with Prof. Tianqi Chen at CMU Catalyst Group on:
- FlashInfer-Bench, a kernel benchmarking loop that goes from kernel generation → evaluation → drop-in replacement in serving stacks (FlashInfer/SGLang/vLLM).
- WebLLM Assistant, which integrates Overleaf and Google Workspace with in-browser agents using WebLLM.
I am also fortunate to collaborate with Prof. Juncheng Yang at Harvard SEAS on:
- Cache replacement algorithms for real-world enterprise storage systems (VMware vSAN)
- Resilient routing for LLM inference
News 📰
- May 2026: Clock2Q+ is accepted by VLDB industrial track 2026. See you in Boston! 🎉
- May 2026: We presented FlashInfer-Bench at MLSys 2026.
- Apr 2026: I will be joining CMU as a PhD student this Fall! 🎓
- Jan 2026: FlashInfer-Bench is accepted by MLSys 2026! 🎉
