Skip to content

Vision Foundation Models: SAM, ViT, CLIP, DINOv2, object detection, segmentation, and multimodal AI for computer vision.

License

Notifications You must be signed in to change notification settings

umitkacar/awesome-vision-models

Repository files navigation

๐ŸŽฏ Awesome SAM Foundation Models

Comprehensive Resource Collection for Segment Anything

Awesome GitHub stars GitHub forks GitHub watchers

SAM 2.1 Updated Papers License [![PRs Welcome](https://img.shields.io/badge/PRs-welcome-brightgreen.svg?style=for-the-badge)](http://makeapullrequest.com) [![Made with Love](https://img.shields.io/badge/Made%20with-โค๏ธ-red.svg?style=for-the-badge)](https://github.com/umitkacar/SAM-Foundation-Models)

๐Ÿš€ Curated collection of SAM resources, implementations, optimizations & applications

Updated with 2024-2025 SOTA developments


๐Ÿ“– Explore โ€ข ๐Ÿ”ฅ Latest Updates โ€ข ๐Ÿ’ก Applications โ€ข ๐Ÿ› ๏ธ Tools โ€ข ๐Ÿ“š Learn


๐Ÿš€ Production-Ready Repository

โœ… Quality Assurance

  • 96.55% Code Coverage - Comprehensive testing
  • 100+ Linting Rules - Ruff fast linter
  • Strict Type Checking - MyPy validation
  • Pre-commit Hooks - Automated quality checks
  • Security Auditing - UV vulnerability scanning
  • Parallel Testing - pytest-xdist (16 workers)

๐Ÿ› ๏ธ Modern Development Stack

  • Build System: Hatch (PEP 621 compliant)
  • Linting: Ruff (10-100x faster than flake8)
  • Formatting: Black (opinionated)
  • Type Checking: MyPy (strict mode)
  • Testing: Pytest + Coverage + xdist
  • CI/CD: GitHub Actions (multi-platform)

๐Ÿ“ฆ Quick Start for Developers

# Clone repository
git clone https://github.com/umitkacar/SAM-Foundation-Models.git
cd SAM-Foundation-Models

# Install development dependencies
pip install -e ".[dev]"

# Set up pre-commit hooks
pre-commit install

# Run tests
pytest -n auto --cov=src tests/

# Run all quality checks
make check

Full documentation: See DEVELOPMENT.md for complete setup guide


๐ŸŒŸ Highlights

โšก 6x Faster

SAM 2 processes images
6x faster than original SAM

๐ŸŽฏ 3x Efficient

Requires 3x fewer
user interactions

๐Ÿ“ฑ 30+ FPS

Real-time on mobile
devices (EdgeSAM)

๐Ÿ† ICLR 2025

Best Paper
Honorable Mention

๐ŸŽฌ 44 FPS

Real-time video
segmentation

๐Ÿฅ 26+ Tasks

Medical imaging
benchmarks


๐Ÿ“‘ Table of Contents

Click to expand/collapse

๐Ÿ”ฅ Official Models & Latest Updates

๐ŸŽฏ Meta's Official SAM Family

๐ŸŒŸ SAM 2.1 - Latest Release (September 2024)
Feature Performance Status
๐Ÿš€ Speed 6x faster than SAM 1 โœ… Released
๐ŸŽฏ Efficiency 3x fewer interactions โœ… Stable
๐ŸŽฌ Video FPS 44 FPS real-time โœ… Production
๐Ÿ“œ License Apache 2.0 โœ… Open Source
๐Ÿ† Recognition ICLR 2025 Best Paper โœ… Awarded

๐Ÿ“ฅ Official Resources

# Install SAM 2
pip install segment-anything-2

# Quick Start
from sam2 import SAM2Model
model = SAM2Model.from_pretrained("facebook/sam2-hiera-large")

Links:

๐ŸŽจ Original SAM (April 2023)
๐Ÿ”— Integration Platforms
Platform Features Link
๐Ÿค– Ultralytics Production-ready YOLO integration Docs
๐Ÿค— HuggingFace Model hub, Transformers support Hub
โ˜๏ธ SageMaker AWS deployment ready JumpStart

๐ŸŽฌ SAM 2 & Video Segmentation

๐ŸŒŸ Advanced Video Segmentation (2024-2025)

๐Ÿ† SAM2Long

ICCV 2025 | Stars

  • ๐ŸŽฏ Training-free memory tree
  • ๐Ÿ”„ Long video segmentation
  • ๐ŸŽญ Handles occlusion/reappearance
  • ๐Ÿ“Š Project Page

๐Ÿ”ฅ Grounded-SAM-2

Multi-modal Integration | Stars

  • ๐Ÿค Grounding DINO + Florence-2 + SAM 2
  • ๐ŸŽฏ Detect + Segment + Track
  • ๐ŸŽฌ Video object tracking
  • ๐ŸŒ HuggingFace ready

๐ŸŽค AL-Ref-SAM2

AAAI 2025 | Stars

  • ๐Ÿ”Š Audio-Language-Referenced VOS
  • ๐Ÿง  GPT temporal-spatial reasoning
  • ๐ŸŽฏ Training-free paradigm
  • ๐Ÿš€ State-of-the-art results

๐Ÿฅ Surgical SAM 2

NeurIPS 2024 | Stars

  • โšก 86 FPS real-time
  • ๐Ÿฅ Medical surgery segmentation
  • ๐Ÿ’ช 3x faster performance
  • ๐Ÿ“ฑ Resource-constrained ready

๐Ÿ“ผ Classic Video & Tracking


โšก Optimization & Mobile Deployment

๐Ÿ“ฑ Edge & Mobile Champions

Model Speed Size Device Highlights
๐Ÿš€ EdgeSAM 30+ FPS - iPhone 14 First mobile SAM @ 30+ FPS
โšก EfficientSAM 10-20 img/s 9.8M Edge Best accuracy-efficiency trade-off
๐Ÿ“ฑ MobileSAM 40x 60x smaller Mobile Lightweight variant
๐Ÿƒ FastSAM ~100 img/s 68M GPU Maximum throughput
๐Ÿ“Š Performance Comparison: EfficientSAM vs FastSAM
Metric EfficientSAM-Ti EfficientSAM-S FastSAM
COCO AP 45.0 (+4.1) - (+6.5) 37.0
LVIS AP - (+5.3) - (+7.8) -
Params 9.8M - 68M
Speed 10-20 img/s 10-20 img/s ~100 img/s

Winner: ๐Ÿ† EfficientSAM for accuracy, ๐Ÿ† FastSAM for speed

๐Ÿ› ๏ธ Optimization Resources

Click to see optimization tools & papers

๐Ÿ“š Papers & Surveys

๐Ÿ”— Repositories

Model Link Special Feature
๐Ÿš€ EdgeSAM GitHub โ€ข Paper CNN-based, iPhone ready
โšก EfficientSAM GitHub โ€ข Site Best AP/params ratio
๐Ÿ“ฑ MobileSAM GitHub 60x compression
๐Ÿƒ FastSAM GitHub โ€ข Docs Prompt-free
๐Ÿ”ฌ TinySAM GitHub Ultra-compact
๐Ÿ’Ž HQ-SAM GitHub NeurIPS 2023, Quality++

๐Ÿ“š Academic Research & Surveys

๐Ÿ“– Comprehensive Surveys (2024-2025)

๐ŸŒŸ Must-Read Surveys

ArXiv:2306.06211 โ€ข Updated: Oct 2024 โ€ข ๐Ÿ† Most Comprehensive

๐Ÿ“… Coverage: April 2023 - September 2024
๐ŸŽฏ Topics: SAM & SAM 2, Prompt Engineering
๐Ÿ“Š Papers: 200+ analyzed
โญ Rating: โญโญโญโญโญ

ArXiv:2305.08196 โ€ข Foundation model analysis

๐ŸŽฏ Topics: Computer Vision Applications
๐Ÿ”ฌ Depth: Technical Deep Dive
๐Ÿ“Š Applications: Multiple Domains

ArXiv:2507.22792 โ€ข 2025 โ€ข Video-specific review

๐ŸŽฌ Focus: Video Object Segmentation & Tracking
โฐ Timeline: Past โ†’ Present โ†’ Future
๐ŸŽฏ Comprehensive VOST analysis
๐Ÿ—‚๏ธ Curated Collections
Repository Description Stars
๐Ÿ“š Awesome-Segment-Anything First comprehensive survey Stars
๐ŸŽฌ Awesome-SAM2 SAM 2 specific resources Stars
๐Ÿฅ SAM4MIS Medical imaging collection Stars

๐Ÿ“‘ Paper Lists


๐Ÿ’ก Domain-Specific Applications

๐Ÿฅ Medical Imaging (2024-2025)

๐Ÿฉบ SAM4MIS

CIBM 2024 | Stars

Benchmarks:

  • โœ… 15+ medical benchmarks
  • โœ… 26+ different tasks
  • ๐Ÿ”ฌ Mammography, MRI, CT
  • ๐Ÿ‘๏ธ Retinal vessel segmentation
  • ๐Ÿซ€ Ultrasound imaging

๐ŸงŠ SAM-Med3D

General-purpose 3D Medical Segmentation

Dataset:

  • ๐Ÿ“ฆ SA-Med3D-140K
  • ๐ŸŽฏ 22K 3D images
  • ๐Ÿท๏ธ 143K masks
  • ๐Ÿ”ฌ Multi-modal (CT, MRI, etc.)
  • ๐ŸŽฏ Few-shot 3D prompting
๐Ÿ”ฌ More Medical Applications

๐ŸŽฏ Key Papers (2024-2025)

Paper Venue Application Performance
Interactive 3D Medical Segmentation ArXiv 2024 Zero-shot 3D CT State-of-art
SAM 2 for 3D Medical Imaging JMIR AI 2025 Abdominal CT scans Promising
ProtoSAM-3D PubMed Volumetric imaging Interactive
AutoProSAM WACV 2025 Multi-organ 3D Automated
SAM-Med2D Analysis BMC 2024 2D Medical images Improved

๐Ÿ“– Reviews

๐Ÿ›ฐ๏ธ Remote Sensing & Agriculture

๐ŸŒพ Agricultural Applications

๐Ÿ›ฐ๏ธ SAMGeo

SciPy 2024 Presentation

  • ๐ŸŒ Automated remote sensing segmentation
  • ๐Ÿ“ฆ Open-source geospatial package
  • ๐ŸŽฏ User-friendly API

MDPI Remote Sensing 2024

# Automated sample generation
โœ… Sentinel-2 imagery (10m resolution)
โœ… Landsat-8 support (30m resolution)
โœ… Automatic quality filtering
โœ… Sample cleaning pipeline

ArXiv 2023 | Zero-shot Performance Evaluation

  • ๐ŸŽฏ Crop-type mapping
  • ๐Ÿ“Š Precision agriculture
  • ๐Ÿ” Zero-shot capabilities
  • ๐ŸŒพ Multi-spectral challenges

๐Ÿ—บ๏ธ ESRI ArcGIS Integration

  • ๐Ÿ™๏ธ Urban planning
  • ๐ŸŒฒ Environmental monitoring
  • ๐Ÿ’ง Water body extraction
  • ๐Ÿ—๏ธ Infrastructure mapping

๐Ÿš— Autonomous Driving & Robotics

๐Ÿค– Robotics & AV Applications

๐Ÿš™ SAMUNet

2025 | Shape-aware 3D Object Detection

  • ๐ŸŽฏ Pillar-based detection
  • ๐Ÿš— Autonomous driving optimized
  • ๐Ÿ“Š Enhanced 3D understanding

โ˜๏ธ Point Cloud Resources

  • ๐Ÿ“Š 7 SOTA Point Cloud Models
  • ๐ŸŽฏ LiDAR-based detection
  • ๐Ÿค– Robotic perception
  • ๐ŸŒ 3D scene understanding

Applications:

โœ… 3D Object Detection
โœ… Semantic Segmentation
โœ… Instance Segmentation
โœ… Panoptic Segmentation

๐Ÿญ Industrial Quality Control

๐Ÿ” Defect Detection & QC

MDPI Sensors 2025

Application: Oil & Gas Pipeline Inspection
Method: Ultrasonic B-scan Analysis
Models: SAM 1 (ViT-Base) + SAM 2 (Hiera-Base+)
Performance: F1-Score 0.940
Defect Type: Lack of Fusion (LOF)

MDPI Processes 2025 | YOLO11 + SAM

  • ๐ŸŽฏ Micro-vibration motor QC
  • ๐Ÿค– YOLO11 detection + SAM segmentation
  • ๐Ÿ“Š Quantitative severity assessment
  • โœ… 90%+ accuracy
  • โšก Real-time capable

Benefits:

โœ… Automated inspection
โœ… Cost-effective solution
โœ… Real-time analysis
โœ… Quantitative metrics

๐ŸŽจ Specialized Domains

๐ŸŒˆ Other Applications
Domain Project Description
๐Ÿ“ Depth Depth-Anything-V2 Monocular depth estimation
๐ŸŽฎ 3D Gaussian gaussian-grouping 3D Gaussian splatting
๐Ÿ—๏ธ 3D Recon garfield 3D reconstruction
๐Ÿ”ท Mesh MeshAnything 3D mesh generation
๐ŸŽฌ 4D SA4D 4D scene understanding
๐Ÿ“ OCR OCR-SAM Text recognition
๐Ÿ• Food FOOD-SAM Food segmentation
๐Ÿ“ธ Deblur SAM-Deblur Image deblurring

๐ŸŽ“ Training & Fine-Tuning

๐Ÿงฌ Fine-Tuning Frameworks & LoRA Adapters

๐Ÿ”ง Top Fine-Tuning Repositories
Repository Method Domain Updated
finetune-SAM LoRA + Full ๐Ÿฅ Medical โœ… 2024
SAM-fine-tune ๐ŸŒŒ LoRA ๐ŸŒ General โœ… Active
lora_sam LoRA + ๐Ÿค— ๐ŸŽฏ Vision โœ… 2024
SAMed LoRA ๐Ÿฅ Medical โœ… Stable
med-sam-brain PEFT + LoRA ๐Ÿง  Brain Tumor โœ… 2024
๐Ÿ“š Tutorials & Learning Resources

๐ŸŽฏ Official Tutorials

Resource Level Topics
Labellerr: SAM + LoRA ๐ŸŸข Beginner One-shot learning, ship segmentation
Encord: Fine-Tune Guide ๐ŸŸก Intermediate Complete pipeline, best practices
Medium: PEFT for Segmentation ๐ŸŸก Intermediate Parameter-efficient methods

๐Ÿ“„ Research Papers

  • ๐Ÿ”ฌ Conv-LoRA - OpenReview
    • Lightweight convolutional parameters
    • Combined with LoRA
    • Enhanced performance

๐Ÿ’ก Quick Start Example

from transformers import SamModel, SamProcessor
from peft import LoraConfig, get_peft_model

# Load base model
model = SamModel.from_pretrained("facebook/sam-vit-base")

# Configure LoRA
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["qkv"],
    lora_dropout=0.05,
)

# Apply LoRA
model = get_peft_model(model, lora_config)

# Fine-tune on your data
# ... training code ...

๐Ÿ› ๏ธ Production Deployment & Tools

๐Ÿท๏ธ Annotation Tools with SAM Integration

โœจ CVAT

Computer Vision Annotation Tool

SAM 2 Integration (2024):

  • ๐ŸŽฌ Video annotation @ 44 FPS
  • ๐ŸŽฏ Semi-automatic segmentation
  • ๐Ÿš€ Real-time object tracking
  • ๐Ÿ–ฑ๏ธ Interactive prompting

Resources:

๐Ÿท๏ธ Label Studio

ML Data Labeling Platform

Features:

  • ๐Ÿค— HuggingFace Spaces integration
  • ๐ŸŒ Multi-modal (image, video, text, audio)
  • ๐Ÿ” SSO & RBAC
  • โ˜๏ธ Cloud-native ML pipelines
  • ๐Ÿณ Docker/Kubernetes deployment

Resources:

๐Ÿ“Š CVAT vs Label Studio Comparison
Feature CVAT Label Studio
Best For Video annotation, beginners Enterprise, multi-modal
SAM Integration โœ… SAM 2 native โœ… Via HuggingFace
Video Tools โญโญโญโญโญ โญโญโญ
Enterprise โญโญโญ โญโญโญโญโญ
Ease of Use โญโญโญโญโญ โญโญโญโญ
ML Pipeline โญโญโญ โญโญโญโญโญ
Pricing Free + Enterprise Free + Enterprise
๐Ÿ”ง Other Annotation Tools
  • ๐Ÿ–ฅ๏ธ AnyLabeling - Desktop labeling with SAM
  • ๐Ÿง‚ SALT - Segment Anything Labelling Tool

โ˜๏ธ Deployment Platforms

๐Ÿš€ Deployment Options

๐Ÿค— HuggingFace Ecosystem

# Install transformers
pip install transformers

# Use SAM with transformers
from transformers import SamModel, SamProcessor

model = SamModel.from_pretrained("facebook/sam-vit-huge")
processor = SamProcessor.from_pretrained("facebook/sam-vit-huge")

Resources:

๐Ÿ”„ Export & Conversion Tools

Tool Format Features
SAM2ONNX ONNX SAM 2 converter
sam_onnx_full_export ONNX Complete export
sam4onnx ONNX Optimization
samexporter Multi Multi-format

๐Ÿ”ง SAM Extensions & Variants

๐ŸŽฏ Grounding & Multi-Modal

๐Ÿ”ฅ Popular Extensions

Stars

The Ultimate Combo:

๐ŸŽฏ Grounding DINO (Detection)
    โ†“
๐ŸŽจ SAM (Segmentation)
    โ†“
๐Ÿ–ผ๏ธ Stable Diffusion (Generation)
    โ†“
โœจ Detect โ†’ Segment โ†’ Generate ANYTHING

Features:

  • โœ… Text-to-detection
  • โœ… Automatic segmentation
  • โœ… Image inpainting
  • โœ… HuggingFace integration

๐Ÿ›ก๏ธ RobustSAM

CVPR 2024 | Adversarial Robustness

  • ๐Ÿ”’ Robust to adversarial attacks
  • ๐ŸŽฏ Enhanced generalization
  • ๐Ÿ“Š Better performance on corrupted images

๐ŸŽจ Personalize-SAM

DreamBooth Integration

๐Ÿงฉ Full Pipeline Solutions
Project Description Key Feature
SEEM Segment Everything Everywhere Universal segmentation
Full-SAM Complete pipeline End-to-end solution
AUTODISTILL Auto labeling Dataset generation
GroundingDINO Language grounding Text-guided detection
RAM Recognize Anything Image tagging
CLIP Vision-Language Foundation model

๐Ÿ’ป Implementation Libraries

๐Ÿ”ค Multi-Language Support

โš™๏ธ C++ Implementations
Repository Framework Platform
segment-anything-cpp-wrapper Pure C++ Cross-platform
sam-cpp-macos Extended macOS
sam.cpp GGML Multi-platform
SegmentAnything-OnnxRunner ONNX Cross-platform
SAM-ONNX-AX650-CPP QT + Lama GUI + Inpaint
๐Ÿ”ท Other Languages

C#

  • ๐Ÿ’Ž SamSharp - C# implementation

Rust ๐Ÿฆ€

LibTorch ๐Ÿ”ฅ

๐Ÿ“ฑ Mobile Runtimes

MNN Framework

๐ŸŽจ Creative Tools Integration

ComfyUI


๐Ÿ“Š Datasets & Benchmarks

๐Ÿ—„๏ธ Official Meta Datasets

SA-1B (Images) SA-V (Videos)

๐Ÿ“ฆ SA-1B Dataset

Images: 11 Million
Masks: 1+ Billion
Type: Open world images
License: Licensed & privacy-respecting
Status: โœ… Released 2023

Superlatives:

  • ๐Ÿ† Largest segmentation dataset
  • ๐ŸŒ Diverse geography
  • ๐ŸŽฏ High-quality annotations

๐ŸŽฌ SA-V Dataset

Videos: 51,000
Countries: 47
Masks: 600,000+
Resolution: 240p โ†’ 4K
Duration: 4s โ†’ 138s
Status: โœ… Released 2024

Features:

  • ๐ŸŒ Global diversity
  • ๐ŸŽฅ Multi-resolution
  • โฑ๏ธ Variable length

๐ŸŽฏ Benchmark Datasets

๐Ÿ“Š SAM 2 Evaluation Benchmarks
Benchmark Task Metrics
DAVIS Video object segmentation J&F score
MOSE Multi-object segmentation J&F score
LVOS Long-term video segmentation Success rate
YouTube-VOS Large-scale video J&F score
COCO Instance segmentation AP, AR
LVIS Large vocabulary segmentation AP, APr

SAM 2 Performance:

โœ… 3x fewer interactions
โœ… Better accuracy
โœ… 6x faster inference
โœ… SOTA on all benchmarks

๐Ÿ“– Educational Resources & Tutorials

๐ŸŽ“ Learn SAM from Scratch to Production

๐Ÿ“š Comprehensive Guides

๐ŸŒŸ Must-Read Tutorials
Resource Level Topics Link
๐ŸŽฏ Encord Ultimate Guide ๐ŸŸข All Architecture, Training, Applications Read
๐Ÿค– Roboflow Breakdown ๐ŸŸข Beginner Concepts, Use Cases Read
๐Ÿ› ๏ธ Roboflow How-to ๐ŸŸก Intermediate Practical Implementation Read
๐Ÿท๏ธ V7 Labs Guide ๐ŸŸข All Complete Overview Read
๐Ÿ‘จโ€๐Ÿ’ป LabelVisor Hands-on ๐ŸŸก Intermediate Effortless Segmentation Read

โšก Performance & Optimization

๐Ÿš€ Optimization Deep Dives

๐Ÿ“Š Comparison Articles

๐ŸŽฅ Video Resources

๐Ÿ“บ YouTube Channels
Channel Focus Subscriber Count
Rob Mulla ML Tutorials, SAM Applications Data Science
ArjanCodes Software Engineering, Clean Code Python & AI

๐ŸŽฏ Key Insights & Trends (2024-2025)

๐Ÿš€ Major Developments Timeline

timeline
    title SAM Evolution 2024-2025
    2024-07 : SAM 2 Release
           : Video Segmentation
    2024-09 : SAM 2.1 Update
           : ICLR 2025 Award
    2024-12 : Medical Breakthrough
           : 26+ Tasks, 15+ Benchmarks
    2025-02 : AWS Integration
           : SageMaker JumpStart
Loading

๐Ÿ“Š Performance Benchmarks

Metric Value Model
โšก Speed Improvement 6x faster SAM 2 vs SAM 1
๐ŸŽฏ Efficiency Gain 3x fewer interactions SAM 2
๐Ÿ“ฑ Mobile FPS 30+ FPS EdgeSAM (iPhone 14)
๐ŸŽฌ Video FPS 44 FPS SAM 2 real-time
๐Ÿฅ Surgical FPS 86 FPS Surgical SAM 2
๐Ÿญ Industrial Accuracy F1: 0.940 Weld Defect Detection
๐ŸŽ“ Medical Tasks 26+ tasks SAM4MIS

๐Ÿ”ฌ Research Trends

๐ŸŒŸ Emerging Research Directions

๐ŸŽฏ Key Trends 2024-2025

1. ๐Ÿš€ Training-Free Adaptation
   โ””โ”€ LoRA, Conv-LoRA, Prompt Tuning

2. ๐ŸŽฌ Video Understanding
   โ””โ”€ SAM 2, SAM2Long, Temporal Consistency

3. ๐Ÿฅ Medical Imaging
   โ””โ”€ 3D Segmentation, Multi-modal Fusion

4. ๐Ÿ“ฑ Edge Deployment
   โ””โ”€ Quantization, Pruning, Distillation

5. ๐ŸŒ Multi-Modal Integration
   โ””โ”€ Audio-Visual, Language-Vision

6. ๐ŸŽฏ Zero-Shot Transfer
   โ””โ”€ Cross-domain, Few-shot Learning

7. ๐ŸŽฎ 3D/4D Understanding
   โ””โ”€ Point Clouds, Temporal Dynamics

๐Ÿ“ˆ Growth Areas

Domain Growth Key Applications
๐Ÿฅ Medical โญโญโญโญโญ 3D imaging, Surgery
๐Ÿš— Autonomous โญโญโญโญ LiDAR, Perception
๐Ÿ›ฐ๏ธ Remote Sensing โญโญโญโญ Agriculture, Mapping
๐Ÿญ Industrial โญโญโญโญโญ QC, Defect Detection
๐ŸŽจ Creative โญโญโญ Editing, Generation

๐ŸŽจ "Anything" Projects Ecosystem

๐Ÿ’ซ 2024 Innovative Projects

๐ŸŒŸ Click to explore 2024 projects
Project Description Stars
ReplaceAnything ๐ŸŽญ Replace objects in images Stars
Depth-Anything ๐Ÿ“ Monocular depth estimation Stars
OMG-Seg ๐ŸŒ Open-world segmentation Stars
OVSAM ๐Ÿ“– Open-vocabulary SAM Stars

๐ŸŽฏ 2023 Classic Projects

๐Ÿ“š Essential 2023 projects

๐ŸŽจ Creative & Editing

๐Ÿ” Detection & Segmentation

๐Ÿ“Š Analysis & Understanding

๐ŸŽฎ 3D & Reconstruction


๐Ÿค Contributing

๐Ÿ’ก Help Us Grow This Collection!

Contributors PRs

We welcome contributions! Please ensure:

  • โœ… Resources are from reputable sources
  • โœ… Links are active and high-quality
  • โœ… Descriptions are accurate and concise
  • โœ… Proper categorization
  • โœ… Include relevant badges/stars
  • โœ… Add performance metrics when available

๐Ÿ“„ Citation

๐Ÿ“ BibTeX Citations

SAM (Original)

@article{kirillov2023segment,
  title={Segment Anything},
  author={Kirillov, Alexander and Mintun, Eric and Ravi, Nikhila and Mao, Hanzi and Rolland, Chloe and Gustafson, Laura and Xiao, Tete and Whitehead, Spencer and Berg, Alexander C. and Lo, Wan-Yen and Doll{\'a}r, Piotr and Girshick, Ross},
  journal={arXiv:2304.02643},
  year={2023}
}

SAM 2

@article{ravi2024sam2,
  title={SAM 2: Segment Anything in Images and Videos},
  author={Ravi, Nikhila and Gabeur, Valentin and Hu, Yuan-Ting and Hu, Ronghang and Ryali, Chaitanya and Ma, Tengyu and Khedr, Haitham and R{\"a}dle, Roman and Rolland, Chloe and Gustafson, Laura and Mintun, Eric and Pan, Junting and Alwala, Kalyan Vasudev and Carion, Nicolas and Wu, Chao-Yuan and Girshick, Ross and Doll{\'a}r, Piotr and Feichtenhofer, Christoph},
  journal={arXiv:2408.00714},
  year={2024}
}

๐Ÿ“š Documentation & Resources

๐Ÿ“– Core Documentation

๐Ÿ“‹ Status & Setup

๐ŸŽฏ Repository Status

Build Status Tests Coverage Type Checking Linting Security

Latest Release: v1.0.0 | Status: โœ… Production Ready | Last Updated: November 2025


๐ŸŒŸ Star History

Star History Chart


๐Ÿ“Š Repository Stats

Last Updated Maintenance Resources


Last Updated: January 2025 ๐Ÿ—“๏ธ Maintainer: Community-driven ๐Ÿ‘ฅ License: Collection of resources with individual licenses ๐Ÿ“œ


Disclaimer: This is a curated collection. Each project has its own license.

โญ Star this repo to stay updated with the latest SAM developments!

๐Ÿ‘€ Watch for new resources and updates!


Made with โค๏ธ by the SAM Community

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •