Publications

2026

  1. Enabling Performant and Flexible Model-Internal Observability for LLM Inference
    Nengneng Yu, Sixian Xiong, Yibo Zhao, and 2 more authors
    arXiv preprint arXiv:2605.11093, May 2026