Skip to content
View tinyzqh's full-sized avatar

Block or report tinyzqh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
tinyzqh/README.md

Hi, I'm Zhiqiang He (何志强) 👋

Building reliable agents under uncertainty.

Homepage Email Scholar Zhihu GitHub Rank Total Stars


🎓 About Me

  • 🧑‍🔬 Ph.D. researcher at University of Electro-Communications (UEC), Tokyo (Apr 2024 – 2026)
  • 🎯 Working on Reinforcement Learning, with current focus on plasticity, world models, multi-agent RL, and real-system deployment
  • 🏆 JST Next-Generation Researcher (2025 – 2027)
  • 💼 Past: RL Algorithm Engineer @ InspirAI · RL Research Intern @ Baidu
  • ✍️ I write technical notes on Zhihu — 10K+ followers

📌 Selected Publications

  • Plasticity-Aware Mixture of Experts for Learning Under QoE Shifts in Adaptive Video StreamingIEEE TMM, 2026
  • A Survey on DRL based UAV Communications and NetworkingIEEE COMST, 2025 (co-authored)
  • Understanding World Models through Multi-Step Pruning Policy via Reinforcement LearningInformation Sciences, 2024

→ Full list on Google Scholar


🛠️ Tech Stack

Python C++ C# PyTorch LaTeX Linux Git


🏆 Trophies

trophies

Pinned Loading

  1. Systems-Intelligent-Lab/PA-MoE Systems-Intelligent-Lab/PA-MoE Public

    [IEEE TMM ACCEPTED] Official implementation of Plasticity-Aware Mixture of Experts for Learning Under QoE Shifts in Adaptive Video Streaming in IEEE Transactions on Multimedia.

    Python 2

  2. MSPP MSPP Public

    [IS] Official implementation of "Understanding world models through multi-step pruning policy via reinforcement learning" in Information Sciences.

    Python 8 3

  3. light_mappo light_mappo Public

    Lightweight version of MAPPO to help you quickly migrate to your local environment.

    Python 869 116

  4. control-of-jump-systems-based-on-reinforcement-learning control-of-jump-systems-based-on-reinforcement-learning Public

    [Algorithms] Official implementation of “Control Strategy of Speed Servo Systems Based on Deep Reinforcement Learning”

    Python 25 4

  5. awesome-reinforcement-learning awesome-reinforcement-learning Public

    Learning Resources And Links Of Reinforcement Learning (updating)

    Python 294 83

  6. Opencv-Computer-Vision-Practice-Python- Opencv-Computer-Vision-Practice-Python- Public

    OpenCV Computer Vision Practice (Python)

    Python 193 53