- Module:
project_terrier - Codename:
TERRIER - Date: 2026-04-10
- Paper: Progressive Representation Learning for Real-Time UAV Tracking
- ArXiv: https://arxiv.org/abs/2409.16652
- IEEE DOI: https://doi.org/10.1109/IROS58592.2024.10803050
- Upstream repo: https://github.com/vision4robotics/PRL-Track
- Upstream license: Apache-2.0
- Build policy: Dual compute mandatory (
mlx|cuda|cpu), YOLO26 baseline mandatory
TERRIER ports the PRL-Track tracking architecture into ANIMA as a reproducible, deployment-oriented module for UAV defense use cases. The module keeps the paper's core components (AR, SR, HMG) while adapting data pipelines and inference integration for:
- ANIMA runtime and ROS2 stack.
- YOLO26-first detection bootstrap.
- Dual-compute execution (MLX on Apple Silicon and CUDA on RTX-class GPU).
- ArXiv entry exists (
2409.16652v1, published 2024-09-25). - IEEE IROS publication exists (
10.1109/IROS58592.2024.10803050). - PDF in
papers/2409.16652.pdfis consistent with arXiv abstract/method/results.
- Upstream repo exists and is active enough for reproduction bootstrapping:
- stars: 52
- forks: 11
- latest push: 2024-09-29
- CLI scripts resolve and parse args (
tools/train.py --help,tools/test.py --help). - Important risk: upstream model hardcodes CUDA (
.cuda()inModelBuilder), blocking direct CPU/MLX execution without patching.
- Shared dataset volume mounted:
/Volumes/AIFlowDev/RobotFlowLabs/datasets/. - Present locally:
shared/coco,wave10_staging/visdrone. - Missing in shared volume snapshot: GOT-10K, LaSOT, UAVDT, DroneVehicle, SeaDronesSee, 1.8M mega UAV (explicit path not found).
- Paper-reported values are internally coherent (UAVTrack112/UAVTrack112_L/UAV123 + real-world FPS 42.6).
- External citation signal exists:
- Semantic Scholar citation count: 19 (as of 2026-04-10)
- OpenAlex published article citation count: 11 (as of 2026-04-10)
- Status: VERIFIED WITH RISKS
- Risks requiring explicit handling:
- Upstream CUDA-only assumptions.
- Incomplete local availability of training datasets.
- Upstream training recipe discrepancy between paper text (70 epochs) and repo config defaults (100 epochs).
- Progressive representation stack:
- Appearance-aware regulator (AR)
- Semantic-aware regulators (SR)
- Hierarchical modeling generator (HMG)
- Depthwise correlation based template-search fusion.
- Multi-head localization/classification output structure.
- Any component requiring direct dependence on non-portable CUDA-only code paths.
- Non-UAV-defense-specific tooling not needed for ANIMA deployment.
- Full-scale training run in this bootstrap phase.
- Detection front-end anchored on YOLO26.
- Unified backend selector and parity checks for
mlx|cuda|cpu. - Dataset abstraction for internal 1.8M UAV corpus + staged public datasets.
- Load PRL-Track style model scaffold and run synthetic template/search forward pass.
- Expose backend selection via CLI and config.
- Support YOLO26 detector integration point.
- Produce deterministic smoke-check output for CI.
- Dual-compute API compatibility (
mlx|cuda|cpu). - Reproducible synthetic tests.
- Clear failure modes for missing weights/datasets.
TinyBackboneextracts hierarchical features (5 scales).- Depthwise cross-correlation fuses template/search feature maps.
ARrefines shallow representation.SRmodules refine deeper semantic representation.HMGperforms hierarchical cross-attention fusion.- Heads output:
loc: [B,4,H,W]cls1: [B,2,H,W]cls2: [B,1,H,W]
- YOLO26 adapter is used as detector bootstrap before tracking handoff.
- COCO, GOT-10K, LaSOT
- 1.8M Mega UAV dataset (internal)
- VisDrone
- UAVDT
- DroneVehicle
- SeaDronesSee
- Do not force downloads in code paths by default.
- Check mounted shared volume first.
- Keep download scripts idempotent and non-destructive.
- Complete evidence-backed paper/repo/dataset verification.
- Generate PRD suite and task backlog.
- Implement ANIMA package skeleton with AR/SR/HMG core and smoke tests.
- Implement backend abstraction and parity utility.
- Add YOLO26 integration interface.
- Reproduce upstream benchmark pipeline with canonical datasets.
- Resolve epoch/config discrepancy by controlled experiments.
- Match published benchmark metrics within tolerance band.
- Fine-tune with 1.8M UAV + public UAV datasets.
- Evaluate defense-oriented scenarios and latency targets.
- Integrate ROS2 + simulator hooks.
- Paper and repo verified with documented risks.
- Full PRD/task suite generated (
prds/,tasks/,ASSETS.md). - Initial PRL-Track architecture scaffold implemented in
src/. - CLI supports
--backend mlx|cuda|cpu|auto. - Synthetic unit tests pass locally.
- Upstream CUDA hard-binding requires patch/port for complete parity.
- Dataset readiness gap for full reproduction and adaptation.
- Real-world 42.6 FPS claim cannot be validated without edge hardware run.
PRD.md(this file) replaced from scaffold to verified execution plan.ASSETS.md,prds/*,tasks/*generated.- Initial code scaffold and tests implemented.