2026 Enabling Performant and Flexible Model-Internal Observability for LLM Inference Nengneng Yu, Sixian Xiong, Yibo Zhao, and 2 more authors arXiv preprint arXiv:2605.11093, May 2026 DOI HTML PDF