Reinforcement Learning (RL)
Visual Question Answering (VQA)
3D Object Super-Resolution
Medical Image Segmentation
Optical Character Recognition (OCR)
Facial Recognition and Modelling
Multi-Label Classification
Video Object Segmentation
Out-of-Distribution Detection
Video Semantic Segmentation
Explainable artificial intelligence
3D Point Cloud Classification
3D Point Cloud Interpolation
Image-to-Image Translation
Human-Object Interaction Detection
Point Cloud Classification
3D Point Cloud Reconstruction
Document Text Classification
Temporal Action Localization
Few-Shot Transfer Learning for Saliency Prediction
Video Instance Segmentation
Privacy Preserving Deep Learning
Few-Shot Semantic Segmentation
Few-Shot Object Detection
Referring Expression Segmentation
Sign Language Recognition
3D Multi-Person Pose Estimation
Facial Landmark Detection
3D Character Animation From A Single Photo
Weakly supervised segmentation
RGB Salient Object Detection
Multi-Label Image Classification
LIDAR Semantic Segmentation
Vehicle Re-Identification
Handwritten Text Recognition
Class Incremental Learning
Decision Making Under Uncertainty
Semi-Supervised Object Detection
Facial Expression Recognition (FER)
Open Vocabulary Semantic Segmentation
Class-Incremental Semantic Segmentation
Physics-informed machine learning
3D Absolute Human Pose Estimation
Action Recognition In Videos
Sign Language Translation
Single-Image-Based Hdr Reconstruction
Zero-Shot Action Recognition
Surface Normals Estimation
Natural Language Transduction
Weakly Supervised Action Localization
Transparent Object Detection
Vision-Language Navigation
Infrared And Visible Image Fusion
Breast Cancer Histology Image Classification
Probabilistic Deep Learning
Few-Shot Image Classification
Abnormal Event Detection In Video
Computer Vision Techniques Adopted in 3D Cryogenic Electron Microscopy
Pedestrian Attribute Recognition
Image Manipulation Detection
Semi-Supervised Video Object Segmentation
Multi-View 3D Reconstruction
Universal Domain Adaptation
Action Quality Assessment
Activity Recognition In Videos
Document Image Classification
Text based Person Retrieval
Surgical phase recognition
Simultaneous Localization and Mapping
Intrinsic Image Decomposition
Unsupervised Object Segmentation
Diffusion Personalization
Multi-target Domain Adaptation
Camouflaged Object Segmentation
Weakly-supervised instance segmentation
Content-Based Image Retrieval
Multi-Object Tracking and Segmentation
Zero-Shot Transfer Image Classification
3D Point Cloud Linear Classification
Shape Representation Of 3D Point Clouds
Bird's-Eye View Semantic Segmentation
Dense Pixel Correspondence Estimation
Human Interaction Recognition
Space-time Video Super-resolution
Semi-Supervised Image Classification
3D Multi-Person Pose Estimation (absolute)
Precipitation Forecasting
Referring expression generation
Image/Document Clustering
3D Multi-Person Pose Estimation (root-relative)
Grounded Situation Recognition
Open Vocabulary Object Detection
Semi-Supervised Instance Segmentation
Temporal Sentence Grounding
Point Cloud Super Resolution
Unsupervised Image-To-Image Translation
Lung Nodule Classification
3D Semantic Scene Completion
Composed Image Retrieval (CoIR)
Multispectral Object Detection
Single-View 3D Reconstruction
Sketch-to-Image Translation
3D Shape Reconstruction From A Single 2D Image
3D Open-Vocabulary Instance Segmentation
Audio-Visual Synchronization
Birds Eye View Object Detection
Seeing Beyond the Visible
Semi-Supervised Domain Generalization
Unsupervised Semantic Segmentation
Visual Question Answering
Visual Relationship Detection
Event data classification
Image Manipulation Localization
Instance Shadow Detection
Medical Image Enhancement
Open Vocabulary Panoptic Segmentation
Training-free 3D Point Cloud Classification
Multimodal Machine Translation
2D Semantic Segmentation task 3 (25 classes)
Long Video Retrieval (Background Removed)
Segmentation Of Remote Sensing Imagery
Short-term Object Interaction Anticipation
Spatio-Temporal Video Grounding
Synthetic Image Detection
The Semantic Segmentation Of Remote Sensing Imagery
Unsupervised 3D Point Cloud Linear Evaluation
Unsupervised Anomaly Detection
Facial Expression Recognition
Generalized Referring Expression Comprehension
Traffic Accident Detection
Unsupervised Landmark Detection
Visual Speech Recognition
continual anomaly detection
Calving Front Delineation In Synthetic Aperture Radar Imagery
Semi-supervised Anomaly Detection
3D Semantic Occupancy Prediction
Age and Gender Estimation
Historical Color Image Dating
Image and Video Forgery Detection
Infrared image super-resolution
Personality Trait Recognition
Personalized Segmentation
Spatial Relation Recognition
Unsupervised Anomaly Detection with Specified Settings -- 0.1% anomaly
Unsupervised Anomaly Detection with Specified Settings -- 1% anomaly
Unsupervised Anomaly Detection with Specified Settings -- 10% anomaly
Unsupervised Anomaly Detection with Specified Settings -- 20% anomaly
Unsupervised Anomaly Detection with Specified Settings -- 30% anomaly
Visual Social Relationship Recognition
Zero-shot Text-to-Video Generation
Video Frame Interpolation
Continual Semantic Segmentation
Micro-expression Generation
Unsupervised Panoptic Segmentation
Hierarchical Text Segmentation
Human-Object Interaction Concept Discovery
Multi-Person Pose Estimation
Part-aware Panoptic Segmentation
Prediction Of Occupancy Grid Maps
Semi-Supervised Video Classification
Supervised Image Retrieval
Synthetic Image Attribution
Training-free 3D Part Segmentation
Unsupervised Image Decomposition
Weakly Supervised 3D Point Cloud Segmentation
Weakly-supervised panoptic segmentation
drone-based object tracking
Brain Visual Reconstruction
Conditional Image Generation
4D Spatio Temporal Semantic Segmentation
Clothing Attribute Recognition
Damaged Building Detection
Dynamic Texture Recognition
Few Shot Open Set Object Detection
Generalized Zero-Shot Learning - Unseen
Human-Object Interaction Anticipation
Keypoint detection and image matching
Manufacturing Quality Control
Multi-Person Pose Estimation and Tracking
Multi-modal image segmentation
Partial Video Copy Detection
Perpetual View Generation
Prompt-driven Zero-shot Domain Adaptation
Repetitive Action Counting
Single-shot HDR Reconstruction
Sketch-Based Image Retrieval
Unsupervised Instance Segmentation
Vehicle Key-Point and Orientation Estimation
Video Individual Counting
Video-to-image Affordance Grounding
Visual Sentiment Prediction
human-scene contact detection
Constrained Diffeomorphic Image Registration
Continuous Affect Estimation
Document Image Skew Estimation
Fashion Compatibility Learning
Fine-Grained Image Classification
Flooded Building Segmentation
Generative Temporal Nursing
Human fMRI response prediction
IFC Entity Classification
Image Similarity Detection
Image-To-Gps Verification
Image-based Automatic Meter Reading
Indoor Scene Reconstruction
Laminar-Turbulent Flow Localisation
Language-Based Temporal Localization
MLLM Evaluation: Aesthetics
Mental Workload Estimation
Micro-gesture Recognition
Motion Expressions Guided Video Segmentation
Multi-Oriented Scene Text Detection
Multi-object colocalization
Multilingual Text-to-Image Generation
Multimodal Emotion Recognition
Occluded 3D Object Symmetry Detection
Open Set Video Captioning
Partial Point Cloud Matching
Partially View-aligned Multi-view Learning
Personality Trait Recognition by Face
Physical Attribute Prediction
Point cloud classification dataset
Point- of-no-return (PNR) temporal localization
Pose Contrastive Learning
Prostate Zones Segmentation
Pulmorary Vessel Segmentation
Reference Expression Generation
Safety Perception Recognition
Specular Reflection Mitigation
State Change Object Detection
Surface Normals Estimation from Point Clouds
Transform A Video Into A Comics
Unsupervised Long Term Person Re-Identification
Unsupervised Zero-Shot Panoptic Segmentation
Video Correspondence Flow
Vietnamese Multimodal Learning
Yield Mapping In Apple Orchards
eXtreme-Video-Frame-Interpolation
lidar absolute pose regression
self-supervised scene text recognition
video narration captioning
3D Canonical Hand Pose Estimation
Atomic action recognition
Calving Front Delineation From Synthetic Aperture Radar Imagery
Computer Vision Transduction
Crosslingual Text-to-Image Generation
Document To Image Conversion
Frame Duplication Detection
Image Operation Chain Detection
Kinematic Based Workflow Recognition
MLLM Aesthetic Evaluation
Motion Detection In Non-Stationary Scenes
Satellite Orbit Determination
Segmentation Based Workflow Recognition
Sperm Morphology Classification
Video & Kinematic Base Workflow Recognition
Video Based Workflow Recognition
Video, Kinematic & Segmentation Base Workflow Recognition