- The Chinese University of Hong Kong
- Hong Kong
- https://westcoastgod-photography.vercel.app/
- in/oscar-z-cw337
- https://makerworld.com/en/@westcoastgod
Highlights
- Pro
Stars
A real-time material point method (MPM) simulation library using CUDA.
🏰 The Maze Game offers straightforward maze navigation challenges, built with Prim's & DFS Algorithms. Featuring responsive design for easy play on any device, including mobile, with intuitive on-s…
Multi-Joint dynamics with Contact. A general purpose physics simulator.
The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…
Compilations of surgery-related tasks, datasets, and papers.
Generative Models by Stability AI
Image-to-Image Translation in PyTorch
A repository for the surgical action triplet dataset. The data are videos of laparoscopic cholecystectomy that have been annotated with <instrument, verb, target> labels for every surgical fine-grained act…
Surgical Visual Question Answering. A transformer-based surgical VQA model. Official Implementation of "Surgical-VQA: Visual Question Answering in Surgical Scenes using Transformers", MICCAI 2022.
Globally reasoned multi-task model for surgical scene understanding. A multi-task model for segmentation and scene graph. Official Implementation of "Global-Reasoned Multi-Task Learning Model for Su…
Multi-modal agentic framework for surgical procedures
Stable Diffusion web UI
A solution to visualize and explore 3D models in your browser.
ORBIT-Surgical: An Open-Simulation Framework for Learning Surgical Augmented Dexterity
Pioneering Automated GUI Interaction with Native Agents
FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae, Traycer AI…
maze datasets for investigating OOD behavior of ML systems
Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give it a star 🌟 if you find it useful.
PMEmo: A Dataset For Music Emotion Computing
Examples of using Depth API for real-time, dynamic occlusions
This repository is used for AIST2010 Group Project
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.


