Dexter 1

A Dataset for Evaluation of 3D Articulated Hand Motion Tracking.

S. Sridhar, A. Oulasvirta, and C. Theobalt, Interactive Markerless Articulated Hand Motion Tracking Using RGB and Depth Data, Proceedings of the IEEE International Conference on Computer Vision (ICCV), Sydney, Australia, 2013.

Motions

Dexter 1 consists of 7 sequences of challenging, slow and fast hand motions that covers the abduction-adduction and flexion-extension of the hand. Roughly the first 250 frames in each sequence correspond to slow motions while the remaining frames are fast motions. All sequences are with a single actor's right hand.

Data

RGB: 5 Sony DFW-V500 RGB cameras at 25 fps
Depth: 1 Creative Gesture Camera (Close-range ToF depth sensor) at 25 fps
Depth: 1 Kinect structured light camera at 30 fps
Ground Truth: Manually annotated on ToF depth data for 3D fingertip positions

While the RGB and ToF data is mostly synchronized, the structured light data is not synchronized.

Downloads

Compressed Tarball: Single file (tar.gz, 2.9 GB), SHA-256:

da3405357efa09c126a8f8fc3d01b121adc7ba260a22b66e9c4a4e9c5bc37479

Browse: Link
Camera Setup: [ Picture 1 | Picture 2 ]

Note that the RGB data used in the paper and previews has been enhanced by adjusting color levels. See script AdjustLevels.sh for more details.

License

If you use this dataset in your work, you are required to cite the following paper. BibTeX, 1 KB

@inproceedings{handtracker_iccv2013,
author = {Sridhar, Srinath and Oulasvirta, Antti and Theobalt, Christian},
title = {Interactive Markerless Articulated Hand Motion Tracking using RGB and Depth Data},
booktitle = {Proceedings of the {IEEE} International Conference on Computer Vision ({ICCV)}},
url = {http://handtracker.mpi-inf.mpg.de/projects/handtracker_iccv2013/},
numpages = {8},
month = Dec,
year = {2013}
}

Evaluation

The following table lists the average fingertip error for several published algorithms. The error is computed as the average Euclidean distance of all estimated 3D fingertip locations from the 5 ground truth 3D positions. (The palm center annotation is not used in all except one method.)

Algorithm	Average Error [mm]
Sridhar et al., ICCV 2013	13.1^[1] 31.8^[2]
Sridhar et al., 3DV 2014	24.1
Franziska Mueller, 2014 (offline method)	15.2
Sridhar et al., CVPR 2015	19.6

^[1] Includes palm center, first 250 frames only
^[2] Palm center not used, all frames

If you would like to include your algorithm results here, please contact us.

Sequence Details

Please click on the image above for a larger preview of the links below for a video preview.

adbadd: Abduction-adduction of all fingers together. Please note the spelling.
fingercount: Counting using each finger and the thumb.
fingerwave: Waving of fingers.
flexex1: Flexion-extension of all fingers.
pinch: Pinching while moving around.
random: Random motions with articulation.
tigergrasp: Making a posture like a tiger grasping.

Dataset Structure

The root directory (containing this file) consists of 3 sub-directories.

calibration: Contains information for calibrating the cameras for intrinsic and extrinsic parameters. We recommend using the MATLAB calibration toolbox.
preview: Montage video preview of 3 cameras and structured light data for all sequences.
data: All the data resides here.
- SEQUENCE_NAME
  - multicam: Data from RGB cameras stored as PNG files in separate directories.
  - tof: The ToF data.
    - confmap: Confidence map as 16-bit PNG.
    - depth: Depth map as 16-bit PNG.
    - rgb: Empty.
    - uvmap: Emtpy.
    - vertices: Point cloud from depth map stored in PCD format (see pointclouds.org). Units are mm.
  - structlight: Unsynchronized structured light data in ONI format.
  - annotations: Contains manually annotated data and a preview of annotations. See README.txt inside the directory for more details.

Acknowledgments

We thank Thomas Helten for helping with data capture.

Contact

Srinath Sridhar
ssridhar@mpi-inf.mpg.de

Back to ICCV 2013

Page last updated .

Imprint/Impressum | Data Protection/Datenschutzhinweis