tinyzqh

Follow

tinyzqh tinyzqh

Follow

Ph.D at UEC in Tokyo. MS in Northeastern University. Worked @ Baidu @ InspirAI.

263 followers · 253 following

The University of Electric Communications
Japan
https://tinyzqh.github.io/

Achievements

Achievements

tinyzqh/README.md

Hi, I'm Zhiqiang He (何志强) 👋

Building reliable agents under uncertainty.

🎓 About Me

🧑‍🔬 Ph.D. researcher at University of Electro-Communications (UEC), Tokyo (Apr 2024 – 2026)
🎯 Working on Reinforcement Learning, with current focus on plasticity, world models, multi-agent RL, and real-system deployment
🏆 JST Next-Generation Researcher (2025 – 2027)
💼 Past: RL Algorithm Engineer @ InspirAI · RL Research Intern @ Baidu
✍️ I write technical notes on Zhihu — 10K+ followers

📌 Selected Publications

Plasticity-Aware Mixture of Experts for Learning Under QoE Shifts in Adaptive Video Streaming — IEEE TMM, 2026
A Survey on DRL based UAV Communications and Networking — IEEE COMST, 2025 (co-authored)
Understanding World Models through Multi-Step Pruning Policy via Reinforcement Learning — Information Sciences, 2024

→ Full list on Google Scholar

🛠️ Tech Stack

🏆 Trophies

Pinned Loading

Systems-Intelligent-Lab/PA-MoE Systems-Intelligent-Lab/PA-MoE Public

[IEEE TMM ACCEPTED] Official implementation of Plasticity-Aware Mixture of Experts for Learning Under QoE Shifts in Adaptive Video Streaming in IEEE Transactions on Multimedia.

Python 2
MSPP MSPP Public

[IS] Official implementation of "Understanding world models through multi-step pruning policy via reinforcement learning" in Information Sciences.

Python 8 3
light_mappo light_mappo Public

Lightweight version of MAPPO to help you quickly migrate to your local environment.

Python 869 116
control-of-jump-systems-based-on-reinforcement-learning control-of-jump-systems-based-on-reinforcement-learning Public

[Algorithms] Official implementation of “Control Strategy of Speed Servo Systems Based on Deep Reinforcement Learning”

Python 25 4
awesome-reinforcement-learning awesome-reinforcement-learning Public

Learning Resources And Links Of Reinforcement Learning （updating）

Python 294 83
Opencv-Computer-Vision-Practice-Python- Opencv-Computer-Vision-Practice-Python- Public

OpenCV Computer Vision Practice (Python)

Python 193 53