Skip to content

This repository contains code utilities for accessing components of the HOH dataset.

Notifications You must be signed in to change notification settings

Terascale-All-sensing-Research-Studio/HOHDataset

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

25 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Please check out our paper (https://arxiv.org/abs/2310.00723) and project page (https://hohdataset.github.io) for more information and access to the dataset variants mentioned below.

To access the raw data, and all below dataset variants, connect to the datacenter using the following:

HOH Keyframe Dataset

This is a version of HOH that includes data for the O, G, T, and R keyframes as defined in the HOH paper. Data included:

  • Kinect Color images (all cameras)
  • Object, Giver Hand, Receiver Hand segmentation masks, corresponding to color images
  • OpenPose skeletons for each participant, corresponding to color images
  • Full scene point clouds with background curtains removed
  • Object 3D model alignment transformations
  • Per-capture metadata, including keyframe indices, comfort ratings, object information, etc.

HOH Minimum Dataset

This is a version of the dataset that includes data for all frames (including keyframes). Data included:

  • Filtered point clouds isolating the Object, Giver Hand, and Receiver Hand
  • Object 3D model alignment transformations
  • Per-capture metadata
  • All other metadata related to participant responses and demographics

HOH Object Dataset

This is a version of the dataset that includes all object-related data, including:

  • All object 3D models
    • Full resolution ("*_cleaned.obj")
    • Watertight ("*_watertight.obj")
    • Simplified (to approximately 10,000 vertices) ("*_simplified.obj")
    • Simplified_2000 (to approximately 2,000 vertices) ("*_simplified_2000.obj")
  • All object metadata ("Metadata_HOH.csv")

If you need data not included in the above subsets, download the raw data which is described in the HOH paper. The data is many terabytes and will have to be downloaded manually. The following are not included in the above data subsets:

  • Kinect Color images for non-keyframes
  • Object, Giver Hand, Receiver Hand segmentation masks for non-keyframes
  • OpenPose skeletons for each participant for non-keyframes
  • PointGrey color images
  • Full scene point clouds for non-keyframes If you decide to download any the Kinect color images, also download the Sync_Info directory to the data directory. It contains data necessary for accessing them in the pre-synchronized order. See examples/pull_raw_data.py for examples concerning the raw data.

Data Extraction

To extract the data subsets, please place all .zip files that you intend to use in the data directory, and then run extract_data.py. Additionally, please look at the examples directory for code demonstrations on how to use various components of the data.

NOTE: File dropped_segmented_pointclouds.json contains a list of known missing segmented point clouds. Point clouds may be missing due to the frame being dropped during recording or significant failure during tracking. Access this list by loading the json file, accessing the key "files", and selecting an entry in the list. Each list entry is a dict containing the handover ("sample"), the frame index ("frame_idx"), and the target ("target") that is missing.

NOTE: File 3d_model_alignment_status.json contains a list of frame windows in which the object 3D model alignment quality is insufficient or unknown. Access this list for a particular handover using key "<capture_directory>_<handover_index>", and then key "windows". This will yield a list of frame index ranges denoting windows of bad alignment.

NOTE: File dropped_keyframe_masks.json contains a list of handovers which are missing any keyframe masks. Access the data for a particular handover using key "<capture_directory>_<handover_index>", and then key "object", "giver", or "receiver" to access the corresponding list of missing masks.

NOTE: For the following participant IDs, no identifiable data is released: 19860, 57643, 98754, 43017

NOTE: Unless otherwise accounted for in the above files, assume missing data is due to a dropped frame during recording.

About

This repository contains code utilities for accessing components of the HOH dataset.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages