Unsupervised Video Object Segmentation
51 papers with code • 6 benchmarks • 8 datasets
The unsupervised scenario assumes that the user does not interact with the algorithm to obtain the segmentation masks. Methods should provide a set of object candidates with no overlapping pixels that span through the whole video sequence. This set of objects should contain at least the objects that capture human attention when watching the whole video sequence i.e objects that are more likely to be followed by human gaze.
Datasets
Most implemented papers
EpO-Net: Exploiting Geometric Constraints on Dense Trajectories for Motion Saliency
To handle the nonrigid background like a sea, we also propose a robust fusion mechanism between motion and appearance-based features.
Tukey-Inspired Video Object Segmentation
We investigate the problem of strictly unsupervised video object segmentation, i. e., the separation of a primary object from background in video without a user-provided object mask or any training on an annotated dataset.
Joint-task Self-supervised Learning for Temporal Correspondence
Our learning process integrates two highly related tasks: tracking large image regions \emph{and} establishing fine-grained pixel-level associations between consecutive video frames.
MAST: A Memory-Augmented Self-supervised Tracker
Recent interest in self-supervised dense tracking has yielded rapid progress, but performance still remains far from supervised methods.
D2Conv3D: Dynamic Dilated Convolutions for Object Segmentation in Videos
We further show that D2Conv3D out-performs trivial extensions of existing dilated and deformable convolutions to 3D.
Treating Motion as Option to Reduce Motion Dependency in Unsupervised Video Object Segmentation
Unsupervised video object segmentation (VOS) aims to detect the most salient object in a video sequence at the pixel level.
Online Unsupervised Video Object Segmentation via Contrastive Motion Clustering
Online unsupervised video object segmentation (UVOS) uses the previous frames as its input to automatically separate the primary object(s) from a streaming video without using any further manual annotation.
Video Object Segmentation using Supervoxel-Based Gerrymandering
Focusing on the problem of strictly unsupervised video object segmentation, we devise a method called supervoxel gerrymandering that links masks of foregroundness and backgroundness via local and non-local consensus measures.
SegFlow: Joint Learning for Video Object Segmentation and Optical Flow
This paper proposes an end-to-end trainable network, SegFlow, for simultaneously predicting pixel-wise object segmentation and optical flow in videos.
Unsupervised Video Object Segmentation for Deep Reinforcement Learning
The detection of moving objects is done in an unsupervised way by exploiting structure from motion.