Published June 19, 2014
| Submitted
Discussion Paper
Open
R-CNNs for Pose Estimation and Action Detection
Abstract
We present convolutional neural networks for the tasks of keypoint (pose) prediction and action classification of people in unconstrained images. Our approach involves training an R-CNN detector with loss functions depending on the task being tackled. We evaluate our method on the challenging PASCAL VOC dataset and compare it to previous leading approaches. Our method gives state-of-the-art results for keypoint and action prediction. Additionally, we introduce a new dataset for action detection, the task of simultaneously localizing people and classifying their actions, and present results using our approach.
Additional Information
This work was supported by the Intel Visual Computing Center, ONR SMARTS MURI N000140911051, ONR MURI N000141010933, a Google Research Grant and a Microsoft Research fellowship. The GPUs used in this research were generously donated by the NVIDIA Corporation.Attached Files
Submitted - 1406.5212.pdf
Files
1406.5212.pdf
Files
(10.5 MB)
Name | Size | Download all |
---|---|---|
md5:9ea1a76cdd7c9bd071cd04824ab671ae
|
10.5 MB | Preview Download |
Additional details
- Eprint ID
- 118449
- Resolver ID
- CaltechAUTHORS:20221219-204859995
- Intel Visual Computing Center
- Office of Naval Research (ONR)
- N000140911051
- Office of Naval Research (ONR)
- N000141010933
- Microsoft Research
- NVIDIA Corporation
- Created
-
2022-12-20Created from EPrint's datestamp field
- Updated
-
2023-06-02Created from EPrint's last_modified field