Building reliable agents under uncertainty.
- 🧑🔬 Ph.D. researcher at University of Electro-Communications (UEC), Tokyo (Apr 2024 – 2026)
- 🎯 Working on Reinforcement Learning, with current focus on plasticity, world models, multi-agent RL, and real-system deployment
- 🏆 JST Next-Generation Researcher (2025 – 2027)
- 💼 Past: RL Algorithm Engineer @ InspirAI · RL Research Intern @ Baidu
- ✍️ I write technical notes on Zhihu — 10K+ followers
- Plasticity-Aware Mixture of Experts for Learning Under QoE Shifts in Adaptive Video Streaming — IEEE TMM, 2026
- A Survey on DRL based UAV Communications and Networking — IEEE COMST, 2025 (co-authored)
- Understanding World Models through Multi-Step Pruning Policy via Reinforcement Learning — Information Sciences, 2024
→ Full list on Google Scholar



