Semi-Supervised Video Object Segmentation

94 papers with code • 15 benchmarks • 13 datasets

The semi-supervised scenario assumes the user inputs a full mask of the object(s) of interest in the first frame of a video sequence. Methods have to produce the segmentation mask for that object(s) in the subsequent frames.

Benchmarks

Add a Result

These leaderboards are used to track progress in Semi-Supervised Video Object Segmentation

Dataset	Best Model	Compare
DAVIS 2017 (val)	Cutie+ (base)	See all
DAVIS 2016	ISVOS (BL30K, MS)	See all
DAVIS 2017 (test-dev)	Cutie+ (base, MEGA)	See all
YouTube-VOS 2018	Cutie+ (base, MEGA)	See all
DAVIS (no YouTube-VOS training)	HMMN	See all
YouTube-VOS 2019	Cutie+ (base, MEGA)	See all
VOT2020	SwinB-DeAOT-L	See all
MOSE	Cutie+ (base, MEGA)	See all
Long Video Dataset	ISVOS	See all
YouTube	FEELVOS	See all
DAVIS-2017	STCN + TrickVOS (PT)	See all
Long Video Dataset (3X)	XMem	See all
BURST-val	Cutie (base, MEGA, 600 pixels)	See all
BURST-test	Cutie (base, MEGA, 600 pixels)	See all
DAVIS-2016	STCN + TrickVOS (PT)	See all

Show all 15 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Semi-Supervised Video Object Segmentation models and implementations

yoxu515/aot-benchmark

4 papers

561

hkchengrex/Mask-Propagation

3 papers

124

z-x-yang/AOT

3 papers

117

Datasets

Subtasks

One-shot visual object segmentation

Most implemented papers

Most implemented Social Latest No code

One-Shot Video Object Segmentation

kmaninis/OSVOS-PyTorch • • CVPR 2017

This paper tackles the task of semi-supervised video object segmentation, i. e., the separation of an object from the background in a video, given the mask of the first frame.

Paper
Code

PReMVOS: Proposal-generation, Refinement and Merging for Video Object Segmentation

JonathonLuiten/PReMVOS • • 24 Jul 2018

We address semi-supervised video object segmentation, the task of automatically generating accurate and consistent pixel masks for objects in a video sequence, given the first-frame ground truth annotations.

Paper
Code

Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion

hkchengrex/MiVOS • • CVPR 2021

We present Modular interactive VOS (MiVOS) framework which decouples interaction-to-mask and mask propagation, allowing for higher generalizability and better performance.

Paper
Code

Lucid Data Dreaming for Video Object Segmentation

omkar13/MaskTrack • • 28 Mar 2017

Our approach is suitable for both single and multiple object segmentation.

Paper
Code

YouTube-VOS: Sequence-to-Sequence Video Object Segmentation

BehradToghi/ECCV_Youtube_VOS • • ECCV 2018

End-to-end sequential learning to explore spatial-temporal features for video segmentation is largely limited by the scale of available video segmentation datasets, i. e., even the largest video segmentation dataset only contains 90 short video clips.

Paper
Code

Make One-Shot Video Object Segmentation Efficient Again

dvl-tum/e-osvos • • NeurIPS 2020

In the semi-supervised setting, the first mask of each object is provided at test time.

Paper
Code

Fast Online Object Tracking and Segmentation: A Unifying Approach

foolwood/SiamMask • • CVPR 2019

In this paper we illustrate how to perform both visual object tracking and semi-supervised video object segmentation, in real-time, with a single simple approach.

Paper
Code

FEELVOS: Fast End-to-End Embedding Learning for Video Object Segmentation

tensorflow/models • • CVPR 2019

Many of the recent successful methods for video object segmentation (VOS) are overly complicated, heavily rely on fine-tuning on the first frame, and/or are slow, and are hence of limited practical use.

Paper
Code

Video Object Segmentation using Space-Time Memory Networks

seoungwugoh/STM • • ICCV 2019

In our framework, the past frames with object masks form an external memory, and the current frame as the query is segmented using the mask information in the memory.

Paper
Code

Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation

hkchengrex/STCN • • NeurIPS 2021

This paper presents a simple yet effective approach to modeling space-time correspondences in the context of video object segmentation.

Paper
Code

Semi-Supervised Video Object Segmentation

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Most implemented papers

Content

Benchmarks

Add a Result