Video Summarization
68 papers with code • 5 benchmarks • 13 datasets
Video Summarization aims to generate a short synopsis of a video by selecting its most informative and important parts. The produced summary is usually composed either of a set of representative video frames (a.k.a. video key-frames) or of video fragments (a.k.a. video key-fragments) stitched together in chronological order to form a shorter video. The former type of summary is known as a video storyboard, and the latter as a video skim.
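The key-frame selection step can be illustrated with a minimal greedy baseline: keep a frame whenever its feature distance to the last kept frame exceeds a threshold. This is an illustrative sketch only (the function name, threshold, and distance metric are assumptions, not a method from any paper listed here).

```python
import numpy as np

def select_keyframes(features, threshold=0.5):
    """Greedy key-frame selection sketch.

    features: (n_frames, d) array of per-frame descriptors
    (e.g. color histograms or CNN embeddings).
    A frame is kept when it differs enough from the last kept frame.
    """
    keep = [0]  # always keep the first frame
    for i in range(1, len(features)):
        if np.linalg.norm(features[i] - features[keep[-1]]) > threshold:
            keep.append(i)
    return keep
```

The kept frames would form a storyboard; concatenating short fragments around each kept frame in chronological order would yield a skim.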
Source: Video Summarization Using Deep Neural Networks: A Survey
Most implemented papers
Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward
Video summarization aims to facilitate large-scale video browsing by producing short, concise summaries that are diverse and representative of original videos.
Summarizing Videos with Attention
In this work we propose a novel method for supervised, keyshots based video summarization by applying a conceptually simple and computationally efficient soft, self-attention mechanism.
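Scoring frames with soft self-attention can be sketched as follows; the projection matrices would normally be learned, and the use of received attention as an importance score is a simplifying assumption, not the paper's exact architecture.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def self_attention_scores(frames, Wq, Wk):
    """Soft self-attention over a frame sequence (sketch).

    frames: (n, d) frame features; Wq, Wk: (d, d_k) projections.
    Returns a per-frame importance score: the average attention
    each frame receives from the rest of the sequence.
    """
    Q, K = frames @ Wq, frames @ Wk
    attn = softmax(Q @ K.T / np.sqrt(K.shape[1]), axis=-1)  # (n, n)
    return attn.mean(axis=0)
```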
Video Summarization using Deep Semantic Features
For this, we design a deep neural network that maps both videos and descriptions to a common semantic space, and jointly train it with associated pairs of videos and descriptions.
Iterative Projection and Matching: Finding Structure-preserving Representatives and Its Application to Computer Vision
In our algorithm, at each iteration the sample that captures the maximum information about the structure of the data is selected, and that information is then excluded from subsequent iterations by projecting the data onto the null space of the previously selected samples.
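The projection idea can be sketched in a few lines: repeatedly pick the sample with the largest residual norm, then deflate all samples along that direction. This is a simplified illustration of null-space projection; the paper's actual selection criterion is more involved.

```python
import numpy as np

def iterative_projection_select(X, k):
    """Select k structure-preserving representatives (sketch).

    X: (n_samples, d) data matrix. At each step the sample with the
    largest norm in the residual space is picked, then every sample is
    projected onto the orthogonal complement (null space) of the pick.
    """
    R = X.astype(float).copy()
    selected = []
    for _ in range(k):
        norms = np.linalg.norm(R, axis=1)
        i = int(np.argmax(norms))
        selected.append(i)
        d = R[i] / (norms[i] + 1e-12)      # unit direction of the pick
        R = R - np.outer(R @ d, d)         # remove that direction everywhere
    return selected
```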
Rethinking the Evaluation of Video Summaries
Video summarization is a technique to create a short skim of the original video while preserving the main stories/content.
Unsupervised video summarization framework using keyframe extraction and video skimming
Video is one of the richest sources of information, and the consumption of online and offline videos has reached an unprecedented level in the last few years.
Convolutional Hierarchical Attention Network for Query-Focused Video Summarization
This paper addresses the task of query-focused video summarization, which takes a user's query and a long video as inputs and aims to generate a query-focused video summary.
Supervised Video Summarization via Multiple Feature Sets with Parallel Attention
The proposed architecture utilizes an attention mechanism before fusing motion features and features representing the (static) visual content, i.e., derived from an image classification model.
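Gating two modalities with attention weights before fusion can be sketched as below. The gating vectors are stand-ins for learned parameters, and this softmax gate is a hypothetical simplification, not the paper's parallel-attention design.

```python
import numpy as np

def attentive_fusion(static_feats, motion_feats, w_static, w_motion):
    """Attention-weighted fusion of two modalities (sketch).

    static_feats, motion_feats: (n, d) per-frame features.
    w_static, w_motion: (d,) gating vectors (learned in practice).
    A softmax over the two modality scores gates each frame's features.
    """
    scores = np.stack([static_feats @ w_static,
                       motion_feats @ w_motion], axis=0)   # (2, n)
    gates = np.exp(scores) / np.exp(scores).sum(axis=0)    # softmax over modalities
    return (gates[0, :, None] * static_feats
            + gates[1, :, None] * motion_feats)
```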
GPT2MVS: Generative Pre-trained Transformer-2 for Multi-modal Video Summarization
Traditional video summarization methods generate fixed video representations regardless of user interest.
Egocentric Video-Language Pretraining
Video-Language Pretraining (VLP), which aims to learn transferable representation to advance a wide range of video-text downstream tasks, has recently received increasing attention.