TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Video Prediction	BAIR Robot Pushing	DVD-GAN-FP	FVD	109.8	# 5
Video Generation	BAIR Robot Pushing	DVD-GAN-FP	FVD score	109.8	# 11
Video Generation	BAIR Robot Pushing	DVD-GAN-FP	Cond	1	# 1
Video Generation	BAIR Robot Pushing	DVD-GAN-FP	Pred	15	# 8
Video Generation	BAIR Robot Pushing	DVD-GAN-FP	Train	15	# 2
Video Generation	Kinetics-600 12 frames, 128x128	DVD-GAN	FID	2.16	# 1
Video Generation	Kinetics-600 12 frames, 64x64	DVD-GAN	FVD	31.1	# 4
Video Prediction	Kinetics-600 12 frames, 64x64	DVD-GAN-FP	FVD	69.15±0.78	# 11
Video Prediction	Kinetics-600 12 frames, 64x64	DVD-GAN-FP	Cond	5	# 2
Video Prediction	Kinetics-600 12 frames, 64x64	DVD-GAN-FP	Pred	11	# 2
Video Generation	Kinetics-600 48 frames, 64x64	DVD-GAN	FID	12.92	# 1
Video Generation	Kinetics-600 48 frames, 64x64	DVD-GAN	Inception Score	219.05	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/efficient-video-generation-on-complex/video-generation-on-kinetics-600-12-frames-1)](https://paperswithcode.com/sota/video-generation-on-kinetics-600-12-frames-1?p=efficient-video-generation-on-complex)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/efficient-video-generation-on-complex/video-generation-on-kinetics-600-48-frames)](https://paperswithcode.com/sota/video-generation-on-kinetics-600-48-frames?p=efficient-video-generation-on-complex)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/efficient-video-generation-on-complex/video-generation-on-kinetics-600-12-frames)](https://paperswithcode.com/sota/video-generation-on-kinetics-600-12-frames?p=efficient-video-generation-on-complex)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/efficient-video-generation-on-complex/video-prediction-on-bair-robot-pushing-1)](https://paperswithcode.com/sota/video-prediction-on-bair-robot-pushing-1?p=efficient-video-generation-on-complex)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/efficient-video-generation-on-complex/video-generation-on-bair-robot-pushing)](https://paperswithcode.com/sota/video-generation-on-bair-robot-pushing?p=efficient-video-generation-on-complex)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/efficient-video-generation-on-complex/video-prediction-on-kinetics-600-12-frames)](https://paperswithcode.com/sota/video-prediction-on-kinetics-600-12-frames?p=efficient-video-generation-on-complex)`

Adversarial Video Generation on Complex Datasets

15 Jul 2019 · Aidan Clark, Jeff Donahue, Karen Simonyan ·

Generative models of natural images have progressed towards high fidelity samples by the strong leveraging of scale. We attempt to carry this success to the field of video modeling by showing that large Generative Adversarial Networks trained on the complex Kinetics-600 dataset are able to produce video samples of substantially higher complexity and fidelity than previous work. Our proposed model, Dual Video Discriminator GAN (DVD-GAN), scales to longer and higher resolution videos by leveraging a computationally efficient decomposition of its discriminator. We evaluate on the related tasks of video synthesis and video prediction, and achieve new state-of-the-art Fr\'echet Inception Distance for prediction for Kinetics-600, as well as state-of-the-art Inception Score for synthesis on the UCF-101 dataset, alongside establishing a strong baseline for synthesis on Kinetics-600.

PDF Abstract

Code

Add Remove Mark official

Harrypotterrrr/DVD-GAN

Tasks

Add Remove

3D Character Animation From A Single Photo

Video Generation

Video Prediction

Datasets

UCF101

Kinetics

Kinetics 400

Kinetics-600 BAIR Robot Pushing

Results from the Paper

Edit

Ranked #1 on Video Generation on Kinetics-600 12 frames, 128x128

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Video Prediction	BAIR Robot Pushing	DVD-GAN-FP	FVD	109.8	# 5	Compare
Video Generation	BAIR Robot Pushing	DVD-GAN-FP	FVD score	109.8	# 11	Compare
			Cond	1	# 1	Compare
			Pred	15	# 8	Compare
			Train	15	# 2	Compare
Video Generation	Kinetics-600 12 frames, 128x128	DVD-GAN	FID	2.16	# 1	Compare
Video Generation	Kinetics-600 12 frames, 64x64	DVD-GAN	FVD	31.1	# 4	Compare
Video Prediction	Kinetics-600 12 frames, 64x64	DVD-GAN-FP	FVD	69.15±0.78	# 11	Compare
			Cond	5	# 2	Compare
			Pred	11	# 2	Compare
Video Generation	Kinetics-600 48 frames, 64x64	DVD-GAN	FID	12.92	# 1	Compare
Video Generation	Kinetics-600 48 frames, 64x64	DVD-GAN	Inception Score	219.05	# 1	Compare

Methods

Add Remove

1x1 Convolution • 3D Convolution • Adam • Average Pooling • Batch Normalization • BigGAN • CGRU • Conditional Batch Normalization • Convolution • Dense Connections • Dot-Product Attention • DVD-GAN • DVD-GAN DBlock • DVD-GAN GBlock • Early Stopping • Feedforward Network • GAN Hinge Loss • GRU • Leaky ReLU • Linear Layer • Non-Local Block • Non-Local Operation • Off-Diagonal Orthogonal Regularization • Orthogonal Regularization • Projection Discriminator • ReLU • Residual Block • Residual Connection • SAGAN • SAGAN Self-Attention Module • Sigmoid Activation • Softmax • Spectral Normalization • Truncation Trick • TTUR

Edit Social Preview

Adversarial Video Generation on Complex Datasets

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove