Repositories list
47 repositories
- Official DINO-X Model Context Protocol (MCP) server that empowers LLMs with real-world visual perception through image object detection, localization, and captioning APIs.
- [ECCV 2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
- detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
- DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding
- Rex-Thinker: Grounded Object Referring via Chain-of-Thought Reasoning
- 3D-deformable-attention
- Motion-X
- Grounding-DINO-1.5-API: Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
- TAPTR
- MotionCLR
- Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
- [CVPR 2023] Official implementation of the paper "One-Stage 3D Whole-Body Mesh Recovery with Component Aware Transformer"
- X-Pose
- GroundingDINO: [ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
- DINO: [ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
- [ICML 2024] 🍅HumanTOMATO: Text-aligned Whole-body Motion Generation
- [ICLR 2024] Official implementation of the paper "Toss: High-quality text-guided novel view synthesis from a single image"
- Stable-DINO: [ICCV 2023] Official implementation of the paper "Detection Transformer with Stable Matching"
- deepdataspace: The Go-To Choice for CV Data Visualization, Annotation, and Model Analysis.