Pose Estimation
1341 papers with code • 28 benchmarks • 113 datasets
Pose Estimation is a computer vision task where the goal is to detect the position and orientation of a person or an object. Usually, this is done by predicting the location of specific keypoints like hands, head, elbows, etc. in case of Human Pose Estimation.
A common benchmark for this task is MPII Human Pose
( Image credit: Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose )
Libraries
Use these libraries to find Pose Estimation models and implementationsSubtasks
- 3D Human Pose Estimation
- Keypoint Detection
- 3D Pose Estimation
- 6D Pose Estimation
- 6D Pose Estimation
- Hand Pose Estimation
- 6D Pose Estimation using RGB
- Multi-Person Pose Estimation
- Head Pose Estimation
- Human Pose Forecasting
- 6D Pose Estimation using RGBD
- Animal Pose Estimation
- Vehicle Pose Estimation
- RF-based Pose Estimation
- Car Pose Estimation
- Hand Joint Reconstruction
- Activeness Detection
- Semi-supervised 2D and 3D landmark labeling
Most implemented papers
Mask R-CNN
Our approach efficiently detects objects in an image while simultaneously generating a high-quality segmentation mask for each instance.
Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields
We present an approach to efficiently detect the 2D pose of multiple people in an image.
Convolutional Pose Machines
Pose Machines provide a sequential prediction framework for learning rich implicit spatial models.
OpenPose: Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
Stacked Hourglass Networks for Human Pose Estimation
This work introduces a novel convolutional network architecture for the task of human pose estimation.
Deep High-Resolution Representation Learning for Visual Recognition
High-resolution representations are essential for position-sensitive vision problems, such as human pose estimation, semantic segmentation, and object detection.
Deep High-Resolution Representation Learning for Human Pose Estimation
We start from a high-resolution subnetwork as the first stage, gradually add high-to-low resolution subnetworks one by one to form more stages, and connect the mutli-resolution subnetworks in parallel.
High-Resolution Representations for Labeling Pixels and Regions
The proposed approach achieves superior results to existing single-model networks on COCO object detection.
Non-local Neural Networks
Both convolutional and recurrent operations are building blocks that process one local neighborhood at a time.
Simple Baselines for Human Pose Estimation and Tracking
There has been significant progress on pose estimation and increasing interests on pose tracking in recent years.