Skip to content
View pipixin321's full-sized avatar
🎯
Focusing
🎯
Focusing
  • HUST(Huazhong University of Science and Technology)
  • Wuhan
  • 09:34 (UTC +08:00)

Block or report pipixin321

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
pipixin321/README.md

Hi there πŸ‘‹, I'm Huaxin Zhang

Huaxin Zhang github Google Scholar

I am a Master of HUST (Huazhong University of Science and Technology), supervised by Prof. Changxin Gao and Prof. Nong Sang.

πŸ”­ Reseach-wise, I mainly focus on:

  • Multi-modal Large Language Models
  • Video Understanding, more specifically, Weakly-supervised Temporal Action Localization (WSTAL) & Weakly-suervised Video Anomaly Detection (WSVAD).

πŸ˜„ I am open to:

  • A internship/job/PhD offer with computer vision/multimodal LLM research and engineering.

πŸ“« Contact me by:

πŸ’¬ News:

  • 2025-02-27: Holmes-VAU is accepted on CVPR 2025.
  • 2024-07-01: We release our code and model of "Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM".[project page]
  • 2024-06-10: We release our code and model of "Arcana: Improving Multi-modal Large Language Model through Boosting Vision Capabilities".[project page]
  • 2024-01-29: I start my internship in Baidu VIS, to do some research on Multi-modal Large Language Model (MLLM).
  • 2023-12-09: One paper about point supervised temporal action localization is accepted on AAAI 2024.

Huaxin's github stats

Pinned Loading

  1. HolmesVAU HolmesVAU Public

    [CVPR 2025 Highlight] Official implementation of "Holmes-VAU: Towards Long-term Video Anomaly Understanding at Any Granularity"

    Python 54 3

  2. HolmesVAD HolmesVAD Public

    Official implementation of "Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM"

    Python 130 5

  3. HR-Pro HR-Pro Public

    [AAAI 2024] Official implementation of "Point-supervised Temporal Action Localization via Hierarchical Reliability Propagation"

    Python 32 1

  4. GlanceVAD GlanceVAD Public

    [ICME 2025] Official implementation of "GlanceVAD: Exploring Glance Supervision for Label-efficient Video Anomaly Detection"

    Python 24 1

  5. Awesome-Video-MLLMs Awesome-Video-MLLMs Public

    πŸ”₯ πŸ”₯ πŸ”₯ Awesome MLLMs/Benchmarks for Short/Long/Streaming Video Understanding πŸ“Ή

    17 1

  6. Arcana Arcana Public

    Forked from syp2ysy/Arcana

    Implementation of "Arcana: Improving Multi-modal Large Language Model through Boosting Vision Capabilitie"

    Python 1