Skip to content
/ r3ds Public

Official repository of the paper "R3DS: Reality-linked 3D Scenes for Panoramic Scene Understanding"

License

Notifications You must be signed in to change notification settings

3dlg-hcvc/r3ds

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 

Repository files navigation

R3DS

R3DS: Reality-linked 3D Scenes for Panoramic Scene Understanding

ECCV 2024

Qirui Wu, Sonia Raychaudhuri, Daniel Ritchie, Manolis Savva, Angel X. Chang

Project | arXiv | Data (HuggingFace)

drawing

We introduce the Reality-linked 3D Scenes (R3DS) dataset of synthetic 3D scenes mirroring the real-world scene arrangements from Matterport3D. Compared to prior work, R3DS has more complete and densely populated scenes with objects linked to real-world observations in panoramas. R3DS also provides an object support hierarchy, and matching object sets (e.g., same chairs around a dining table) for each scene. Overall, R3DS contains 19K objects represented by 3784 distinct CAD models from over 100 object categories. We demonstrate the effectiveness of R3DS on the Panoramic Scene Understanding task. We find that: 1) training on R3DS enables better generalization; 2) support relation prediction trained with R3DS improves performance compared to heuristically calculated support; and 3) R3DS offers a challenging benchmark for future work on panoramic scene understanding.

Data

drawing

R3DS has been hosted on HuggingFace at https://huggingface.co/datasets/3dlg-hcvc/rlsd. You can request access from the HuggingFace page and proceed to checkout the data:

git clone git@hf.co:datasets/3dlg-hcvc/rlsd

The dataset is structured into different folders:

mp3d_arch -- contains a subset of the Matterport3D architecture files that we have used
scenes -- contains the annotated json files for each scene
renderings -- contains various renderings of the annotated scene
    original_panoramas -- original panoramas (without annotations) from Matterport3D
    annotated_panoramas -- panorama renderings colored by object instance ids
    annotated_scene_hierarchy -- hierarchy of the annotated objects for each scene
    annotated_topdown -- topdown images of each annotated scene colored by object instance ids
annotated_model_ids.txt -- list of all model ids used to annotate the scenes

The annotated scene json has the following structure:

{
    "format": "sceneState",
    "version": "v0.2.0",
    "scene": {
        "up": ..., # "x", "y" and "z"
        "front": ..., # "x", "y" and "z"
        "unit": 1,
        "assetSource": [
            "wayfair",
            "3dw"
        ],
        "object": [
            {
                "id": "a_883",
                "modelId": "wayfair.ANDO6573",
                "index": 0,
                "parentIndex": -1,
                "parentId": "1_8_1",
                "transform": {
                    "rows": 4,
                    "cols": 4,
                    "data": ... # 1x16 array
                },
                "obb": {
                        "centroid": ..., # 1x3 array
                        "axesLengths": ..., # 1x3 array,
                        "normalizedAxes": ..., # 1x9 array,
                        "min": ..., # 1x3 array,
                        "max": ... # 1x3 array
                    }
            }, ... # multiple object annotations
        ],
        "camera": [
                {
                    "up": ..., # "x", "y" and "z"
                    "position": ..., # "x", "y" and "z"
                    "target": ..., # "x", "y" and "z"
                    "isOrtho": false,
                    "name": "current"
                }
            ],
        "arch": {
            "ref": "mp3dArch.1pXnuDYAj8r_L1"
        }
    },
    "selected": [],
    "maskObjectAssignments": [
        {
            "photoId": ..., # same as viewpointId
            "maskId": 279,
            "clickPoint": [
                0.8349366636325659,
                0.4788185224119439
            ],
            "objectInstanceId": "a_883"
        }, ... # multiple mask2object assignments
    ],
    "maskComments": [ # comments from annotators against missed annotations
        {
            "reason": "WRONG-CATEGORY",
            "comment": "Picture",
            "maskId": "304"
        }, ... # multiple comments
    ],
    "maskInfos": [
        {
            "id": 276,
            "label": "curtain", # category
            "type": "mask",
            "pointNormalized": ..., # "x" and "y"
        }, ... # multiple mask infos
    ]
}

TODO

  • Add code reference to DPC experiments.

Bibtex

If you find our work useful, please cite

@article{wu2024r3ds,
    title={R3DS: Reality-linked 3D Scenes for Panoramic Scene Understanding}, 
    author={Qirui Wu and Sonia Raychaudhuri and Daniel Ritchie and Manolis Savva and Angel X Chang},
    year={2024},
    eprint={2403.12301},
    archivePrefix={arXiv},
    primaryClass={cs.CV}
}

About

Official repository of the paper "R3DS: Reality-linked 3D Scenes for Panoramic Scene Understanding"

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published