Skip to content

Latest commit

 

History

History
21 lines (18 loc) · 909 Bytes

File metadata and controls

21 lines (18 loc) · 909 Bytes

Towards a Better Match in Siamese Network Based Visual Object Tracker

arXiv

Introduction

  1. Siamese的limitations
    1. CNN对图像的transformations(scaling, rotation)并不具有不变性
    2. 难以确定真正target的特征空间位置。一定的context有助于跟踪,但过多的context不好

Siam-BM Tracker

  1. Angle Estimation: 原search area旋转一定的角度作为candidate patch
  2. Spatial Mask
    1. target ratio不为1,可能引入distractor
    2. ratio大于某个阈值时,引入空间mask
  3. 模板更新 $$\begin{array}l \phi(T_t)=\lambda_S\times \phi(T_1)+(1-\lambda_S)\times\phi(T_t^u) \ \phi(T_t^u)=(1-\lambda_U)\times \phi(T_{t-1}^u)+\lambda_U\times\hat\phi(T_{t-1}) \end{array}$$

$\hat\phi(T_{t-1})$为t-1时刻的跟踪特征

Thoughts

个人感觉这篇paper没有太多亮点,最有效的可能是模板更新。