PPPP-kaqiu/README.md

Ziqi Wang (王梓齐)

Personal Website | Google Scholar | Rednote

I am a final-year master's student at the School of Artificial Intelligence and Data Science, University of Science and Technology of China (USTC), advised by Prof. Tong Xu and Prof. Enhong Chen.

I am currently a research intern at Qwen (通义千问), where I focus on the infrastructure and algorithmic framework for agents and agent reinforcement learning. My work centers on building reliable, scalable systems that support the training, evaluation, and deployment of next-generation agentic models.

Previously, I worked on large language model reasoning and training systems at Baidu ERNIE (Star) and StepFun. That work gave me hands-on experience across pre-training, long-context modeling, post-training, and reasoning-oriented system design, as well as distributed training on hundreds to over a thousand GPUs.

My research interests lie in agents, reinforcement learning, reasoning, and the infrastructure that connects these areas into practical and scalable intelligent systems. If you are interested in collaboration or discussion on these topics, feel free to contact me.


Projects

  • Step2-mini & Step3: Cost-Effective Multimodal Intelligence [Website] [Technical Report] [arXiv]
    Contributed to the end-to-end development of state-of-the-art large language and multimodal reasoning models, scaling from tens to hundreds of billions of parameters with high efficiency.

  • Nano Agent Team [GitHub]
    Built an experimental multi-agent collaboration framework based on a file-system blackboard model, enabling autonomous planning, role coordination, and human-in-the-loop execution for complex tasks.

  • Awesome-Parallel-Reasoning [GitHub Repository]
    Maintaining a curated repository that tracks the rapidly evolving landscape of parallel reasoning and multi-agent collaboration research, tools, and best practices in the LLM era.


Education

  • M.S. in Artificial Intelligence and Data Science, University of Science and Technology of China (2023 - Present)
  • B.S. in Artificial Intelligence and Data Science, University of Science and Technology of China (2019 - 2023)

Statistics

The card on the right is updated automatically every day and covers the public repositories I have contributed to, external repositories included.

Built with simplicity in mind.

Pinned

  1. Awesome-Parallel-Reasoning (Public)

    Awesome-Parallel-Reasoning: Unlocking the reasoning potential of LLMs. Papers, Code, Resources & Survey.

    HTML 48 4

  2. zczc/nano_agent_team (Public)

    Python 24 5