lauyikfung/README.md

Hi there! 👋 I am a first-year Ph.D. student at UCLA, advised by Prof. Quanquan Gu. My research focuses on the optimization and architecture of large language models. Previously, I was a Yao Class student at Tsinghua University, advised by Prof. Zhilin Yang.

My Projects

  • MARS-M: When Variance Reduction Meets Matrices: Paper, GitHub
  • Robust Layerwise Scaling Rules by Proper Weight Decay Tuning: Paper
  • RPG: On the Design of KL-Regularized Policy Gradient Algorithms for LLM Reasoning: Paper, GitHub
  • Kimi K1.5: Scaling Reinforcement Learning with LLMs: Paper
  • TPA: Tensor Product Attention Is All You Need: Paper, GitHub
  • MARS: Unleashing the Power of Variance Reduction for Training Large Models: Paper, GitHub
  • T-Rex: Text-assisted Retrosynthesis Prediction: Paper, GitHub
  • Capricorn: Enhancing Hi-C contact matrices for loop detection with Capricorn, a multi-view diffusion model: Paper, GitHub

More about me

Besides research, I love traveling, and I am also fond of transportation (subways, undergrounds, light rail, etc.), geography (especially Chinese geography), linguistics, Vocaloid (Hatsune Miku and IA in Japanese, Luo Tianyi in Chinese), and finance (VC investment, etc.). Feel free to talk with me about AI startups and investment in AI.

Popular repositories

  1. SDPG (Public)

    SDPG: Self-Distilled Policy Gradient

    Python 38 3

  2. A-Summary-Sheet-of-Optimization-in-Deep-Learning (Public)

    TeX 12

  3. SichuaMahjongAI (Public)

    Python 8

  4. T-Rex (Public)

    T-Rex: Text-assisted Retrosynthesis Prediction

    Python 8 2

  5. blog (Public)

    HTML 3

  6. multiomics (Public)

    Python 3