Audio
100 benchmarks • 71 tasks • 159 datasets • 1380 papers with code
Classification
Classification
324 benchmarks
3240 papers with code
Text Classification
180 benchmarks
1104 papers with code
Graph Classification
69 benchmarks
380 papers with code
Audio Classification
23 benchmarks
131 papers with code
Medical Image Classification
8 benchmarks
122 papers with code
See all 18 tasks
2D Semantic Segmentation
Image Segmentation
3 benchmarks
1494 papers with code
Text Style Transfer
3 benchmarks
80 papers with code
Scene Parsing
60 benchmarks
75 papers with code
2D Semantic Segmentation
90 benchmarks
38 papers with code
Reflection Removal
5 benchmarks
29 papers with code
See all 14 tasks
Speech Recognition
Speech Recognition
279 benchmarks
1090 papers with code
Automatic Speech Recognition (ASR)
7 benchmarks
481 papers with code
Visual Speech Recognition
10 benchmarks
40 papers with code
Robust Speech Recognition
22 papers with code
Distant Speech Recognition
2 benchmarks
10 papers with code
See all 11 tasks
Few-Shot Learning
Few-Shot Learning
63 benchmarks
1037 papers with code
One-Shot Learning
1 benchmark
93 papers with code
Few-Shot Semantic Segmentation
16 benchmarks
74 papers with code
Cross-Domain Few-Shot
10 benchmarks
55 papers with code
Unsupervised Few-Shot Learning
12 papers with code
See all 13 tasks
Emotion Recognition
Emotion Recognition
56 benchmarks
458 papers with code
Speech Emotion Recognition
18 benchmarks
98 papers with code
Emotion Recognition in Conversation
12 benchmarks
72 papers with code
Multimodal Emotion Recognition
3 benchmarks
53 papers with code
Emotion-Cause Pair Extraction
2 benchmarks
19 papers with code
See all 13 tasks
Conformal Prediction
147 papers with code
Text Simplification
11 benchmarks
117 papers with code
Music Source Separation
3 benchmarks
53 papers with code
Audio Source Separation
8 benchmarks
44 papers with code
Decision Making Under Uncertainty
44 papers with code
See all 9 tasks
Speech Synthesis
Speech Synthesis
16 benchmarks
290 papers with code
Expressive Speech Synthesis
11 papers with code
Emotional Speech Synthesis
3 papers with code
text-to-speech translation
2 papers with code
Speech Synthesis - Assamese
1 benchmark
1 paper with code
See all 16 tasks
Accented Speech Recognition
Speech Synthesis
16 benchmarks
290 papers with code
Speech Enhancement
Speech Enhancement
17 benchmarks
218 papers with code
Speech Dereverberation
4 benchmarks
16 papers with code
Bandwidth Extension
1 benchmark
15 papers with code
Packet Loss Concealment
4 papers with code
Speech Intelligibility Evaluation
Language Identification
Language Identification
7 benchmarks
123 papers with code
Dialect Identification
32 papers with code
Native Language Identification
1 benchmark
5 papers with code
Audio Classification
Audio Classification
23 benchmarks
131 papers with code
Environmental Sound Classification
3 benchmarks
23 papers with code
Audio Multiple Target Classification
1 paper with code
Semi-supervised Audio Classification
1 paper with code
Voice Conversion
Voice Conversion
2 benchmarks
149 papers with code
DeepFake Detection
DeepFake Detection
6 benchmarks
129 papers with code
Synthetic Speech Detection
8 papers with code
Human Detection of Deepfakes
1 paper with code
Multimodal Forgery Detection
1 benchmark
1 paper with code
Music Generation
Music Generation
129 papers with code
Music Performance Rendering
4 papers with code
Music Texture Transfer
1 paper with code
Audio Generation
Audio Generation
7 benchmarks
64 papers with code
Voice Cloning
17 papers with code
Audio Super-Resolution
4 benchmarks
14 papers with code
Room Impulse Response (RIR)
9 papers with code
Text-To-Speech Synthesis
Text-To-Speech Synthesis
7 benchmarks
92 papers with code
Prosody Prediction
1 benchmark
2 papers with code
Zero-Shot Multi-Speaker TTS
2 papers with code
Audio Signal Processing
blind source separation
44 papers with code
Audio Signal Processing
20 papers with code
Audio Compression
12 papers with code
Audio Effects Modeling
2 papers with code
Sound Event Detection
Sound Event Detection
4 benchmarks
74 papers with code
Audio Source Separation
Audio Source Separation
8 benchmarks
44 papers with code
Target Sound Extraction
4 benchmarks
4 papers with code
Directional Hearing
2 benchmarks
1 paper with code
Single-Label Target Sound Extraction
Sound Classification
Sound Classification
46 papers with code
Audio Tagging
Audio Tagging
1 benchmark
41 papers with code
Audio captioning
Audio captioning
4 benchmarks
39 papers with code
Zero-shot Audio Captioning
2 benchmarks
1 paper with code
Acoustic Scene Classification
Acoustic Scene Classification
5 benchmarks
37 papers with code
Sound Event Localization and Detection
Sound Event Localization and Detection
5 benchmarks
28 papers with code
Environmental Sound Classification
Environmental Sound Classification
3 benchmarks
23 papers with code
Self-Supervised Sound Classification
1 paper with code
Instrument Recognition
Instrument Recognition
3 benchmarks
20 papers with code
Text-to-Music Generation
Text-to-Music Generation
2 benchmarks
13 papers with code
Direction of Arrival Estimation
Direction of Arrival Estimation
1 benchmark
12 papers with code
Voice Anti-spoofing
Voice Anti-spoofing
3 benchmarks
12 papers with code
Audio inpainting
Audio inpainting
11 papers with code
Instance Search
Instance Search
9 papers with code
Audio Fingerprint
1 paper with code
Audio Denoising
Audio Denoising
3 benchmarks
9 papers with code
Online Beat Tracking
Inference Optimization
9 papers with code
Audio-Visual Synchronization
Audio-Visual Synchronization
8 papers with code
Chord Recognition
Chord Recognition
7 papers with code
Bird Classification
Bird Audio Detection
3 papers with code
Bird Classification
Bird Species Classification With Audio-Visual Data
Audio Effects Modeling
Pitch control
2 papers with code
Timbre Interpolation
1 paper with code
Audio declipping
Audio declipping
3 papers with code
Music Compression
Music Compression
3 papers with code
Visually Guided Sound Source Separation
Visually Guided Sound Source Separation
3 papers with code
Vowel Classification
Vowel Classification
3 papers with code
Hearing Aid and device processing
Cadenza 1 - Task 1 - Headphone
1 benchmark
1 paper with code
Cadenza 1 - Task 2 - In Car
1 benchmark
1 paper with code
Hearing Aid and device processing
Audio Signal Recognition
Audio Signal Recognition
1 paper with code
Gunshot Detection
1 paper with code
fake voice detection
fake voice detection
1 benchmark
2 papers with code
Acoustic Novelty Detection
Acoustic Novelty Detection
1 benchmark
1 paper with code
Audio Dequantization
Audio Dequantization
1 paper with code
Directional Hearing
Real-time Directional Hearing
1 benchmark
1 paper with code
Music Quality Assessment
Music Quality Assessment
1 paper with code
Shooter Localization
Shooter Localization
1 paper with code
Soundscape evaluation
Soundscape evaluation
1 paper with code
Speaker Orientation
Speaker Orientation
1 paper with code
Target Sound Extraction
Streaming Target Sound Extraction
1 benchmark
1 paper with code
Active Speaker Localization
Active Speaker Localization