Robust Superpixel Tracking

Fan Yang1,3, Huchuan Lu1 and Ming-Hsuan Yang2

1 School of Information and Communication Engineering, Dalian University of Technology, Dalian, China
2 Electrical Engineering and Computer Science, University of California at Merced, Merced, United States
3 Department of Computer Science, University of Maryland, College Park, United States


While numerous algorithms have been proposed for object tracking with demonstrated success, it remains a challenging problem for a tracker to handle large appearance change due to factors such as scale, motion, shape deformation, and occlusion. One of the main reasons is the lack of effective image representation to account for appearance variation. Most trackers use high-level appearance structure or low-level cues for representing and matching target objects. In this paper, we propose a tracking method from the perspective of mid-level vision with structural information captured in superpixels. We present a discriminative appearance model based on superpixels, thereby facilitating a tracker to distinguish the target and the background with mid-level cues. The tracking task is then formulated by computing a target-background confidence map, and obtaining the best candidate by maximum a posterior estimate. Experimental results demonstrate that our tracker is able to handle heavy occlusion and recover from drifts. In conjunction with online update, the proposed algorithm is shown to perform favorably against existing methods for object tracking. Furthermore, the proposed algorithm facilitates foreground and background segmentation during tracking.

Experimental Results

Code and Datasets

The MATLAB implementation can be downloaded from here (version 2.4, both Windows and Linux supported). Please see README for more details.
The sequences from our dataset are available here with groundtruth (note that we use a resized version of singer1). Other sequences can be found from the PROST, VTD, FRAG and PDAT datasets.