Dexter+Object

A Dataset for Evaluation of Joint Hand+Object Tracking

S. Sridhar, F. Mueller, M. Zollhöfer, D. Casas, A. Oulasvirta, C. Theobalt
Real-time Joint Tracking of a Hand Manipulating an Object from RGB-D Input
European Conference on Computer Vision (ECCV) 2016, Amsterdam, The Netherlands.

Dexter+Object is a dataset for evaluating algorithms for joint hand and object tracking. It consists of 6 sequences recorded with 2 actors (1 female) performing varying interactions with a simple object (a cuboid). Fingertip positions and cuboid corners were manually annotated for all sequences. This dataset accompanies the ECCV 2016 paper "Real-time Joint Tracking of a Hand Manipulating an Object from RGB-D Input".

License

If you use this dataset, you are required to cite the following paper (BibTeX):

@inproceedings{RealtimeHO_ECCV2016,
 author = {Sridhar, Srinath and Mueller, Franziska and Zollhoefer, Michael and Casas, Dan and Oulasvirta, Antti and Theobalt, Christian},
 title = {Real-time Joint Tracking of a Hand Manipulating an Object from RGB-D Input},
 booktitle = {Proceedings of European Conference on Computer Vision ({ECCV})},
 url = {http://handtracker.mpi-inf.mpg.de/projects/RealtimeHO/},
 numpages = {17},
 month = oct,
 year = {2016}
}

Downloads

Compressed Zip: Single file (zip, 1.2 GB), SHA-256:

cfbe7df40ad340faddc0c2cedf4b10de0f88a7d6be668149d22fcc3659623fa1
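
The checksum above can be used to confirm that the download completed without corruption. The following is a minimal sketch using only the Python standard library; the function names are illustrative and not part of the dataset's scripts.

```python
import hashlib

# SHA-256 digest published for the single-file zip archive.
EXPECTED_SHA256 = "cfbe7df40ad340faddc0c2cedf4b10de0f88a7d6be668149d22fcc3659623fa1"


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading it in chunks
    so that the 1.2 GB archive is never held in memory at once."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def archive_is_intact(path, expected=EXPECTED_SHA256):
    """Compare the file's digest with the expected one (case-insensitive)."""
    return sha256_of_file(path) == expected.strip().lower()
```

For example, archive_is_intact("dexter_object.zip") should return True for an uncorrupted download, where "dexter_object.zip" is a placeholder for whatever name the archive was saved under.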

Data

RGB: Creative Senz3D color camera
Depth: Creative Senz3D close-range time-of-flight (ToF) depth camera co-located with the color camera
Ground Truth: 3D fingertip positions and 3 object (cuboid) corners, manually annotated on the depth data
Depth Camera Intrinsics: Can be used to backproject depth image to create a 3D point cloud
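
As an illustration of the backprojection mentioned above, here is a minimal sketch assuming a standard pinhole camera model without lens distortion. The parameters fx, fy, cx and cy are placeholders for the focal lengths and principal point taken from the intrinsic calibration matrix; the output is in the same unit as the depth values, and the axis orientation used here may differ from the convention of the annotations. The value 32001 is the background/invalid marker of the depth maps.

```python
INVALID_DEPTH = 32001  # background/invalid pixel value used in the depth maps


def backproject_pixel(u, v, depth, fx, fy, cx, cy):
    """Backproject pixel (u, v) with the given depth to a 3D point.

    Assumes a pinhole model: fx, fy are focal lengths in pixels and
    (cx, cy) is the principal point (all placeholders for the values
    in the calibration directory).
    """
    x = (u - cx) * depth / fx
    y = (v - cy) * depth / fy
    return (x, y, float(depth))


def backproject_depth_map(depth_rows, fx, fy, cx, cy, invalid=INVALID_DEPTH):
    """Convert a depth map (a sequence of rows of depth values) into a
    list of 3D points, skipping invalid and non-positive pixels."""
    points = []
    for v, row in enumerate(depth_rows):
        for u, depth in enumerate(row):
            if depth == invalid or depth <= 0:
                continue
            points.append(backproject_pixel(u, v, depth, fx, fy, cx, cy))
    return points
```

The per-pixel loop is written for clarity; for full-resolution frames, the same formula can be applied to whole arrays at once with an array library.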

Evaluation Metric

Please see the supplementary document for the definition of the error measure. We recommend using the same measure to facilitate direct comparison with other methods.

Sequence Details

A video preview of each sequence is available in the preview directory.

Grasp1: User grasping a small cuboid.
Grasp2: User grasping a big cuboid.
Pinch: User pinching a small cuboid.
Rigid: User moving rigidly while holding a small cuboid.
Rotate: User holding and rotating a small cuboid.
Occlusion: A small cuboid occluding the user's hand.

Dataset Structure

The root directory (containing this file) consists of 5 sub-directories.

data: All the data resides here.
- SEQUENCE_NAME
  - color: Color images in BMP format.
  - depth: Depth map as 16-bit PNG. Background/invalid pixels have a value of 32001.
  - annotations: Contains manually annotated data for fingertip positions and 3 cuboid corners. See README.txt inside the directory for more details.
preview: Preview videos of all the sequences in the dataset and a montage of results from our tracker.
scripts: Bash scripts used to make previews. New: a script for computing all 8 cuboid corners.
calibration: Intrinsic calibration matrix for depth map backprojection.
hand_dimensions (new): Sketches of the hands of both users specifying dimensions in centimeters.
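
The layout described above can be checked after extraction with a short script. This is a sketch under one assumption: that the sequence directories under data are named exactly as in the Sequence Details list (Grasp1, Grasp2, Pinch, Rigid, Rotate, Occlusion). Adjust SEQUENCES if the extracted names differ. The function names are illustrative.

```python
from pathlib import Path

# Assumed directory names; the sequence names are taken from the
# Sequence Details list and may not match the archive exactly.
SEQUENCES = ("Grasp1", "Grasp2", "Pinch", "Rigid", "Rotate", "Occlusion")
SEQUENCE_SUBDIRS = ("color", "depth", "annotations")
TOP_LEVEL_DIRS = ("data", "preview", "scripts", "calibration", "hand_dimensions")


def expected_dirs(root):
    """List every directory the dataset description says should exist."""
    root = Path(root)
    dirs = [root / name for name in TOP_LEVEL_DIRS]
    for sequence in SEQUENCES:
        for sub in SEQUENCE_SUBDIRS:
            dirs.append(root / "data" / sequence / sub)
    return dirs


def missing_dirs(root):
    """Return the expected directories that are absent under root."""
    return [path for path in expected_dirs(root) if not path.is_dir()]
```

An empty list from missing_dirs(root) indicates that the extracted archive matches the structure described here.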

Acknowledgments

We thank Perttu Lähteenlahti for helping with data annotation.

Contact

Franziska Mueller
frmueller@mpi-inf.mpg.de

Srinath Sridhar
ssridhar@mpi-inf.mpg.de

This page was last updated: 02/06/2020