R3DS

R3DS: Reality-linked 3D Scenes for Panoramic Scene Understanding

ECCV 2024

Qirui Wu, Sonia Raychaudhuri, Daniel Ritchie, Manolis Savva, Angel X. Chang

We introduce the Reality-linked 3D Scenes (R3DS) dataset of synthetic 3D scenes mirroring the real-world scene arrangements from Matterport3D. Compared to prior work, R3DS has more complete and densely populated scenes with objects linked to real-world observations in panoramas. R3DS also provides an object support hierarchy, and matching object sets (e.g., same chairs around a dining table) for each scene. Overall, R3DS contains 19K objects represented by 3784 distinct CAD models from over 100 object categories. We demonstrate the effectiveness of R3DS on the Panoramic Scene Understanding task. We find that: 1) training on R3DS enables better generalization; 2) support relation prediction trained with R3DS improves performance compared to heuristically calculated support; and 3) R3DS offers a challenging benchmark for future work on panoramic scene understanding.

Data

R3DS has been hosted on HuggingFace at https://huggingface.co/datasets/3dlg-hcvc/rlsd. You can request access from the HuggingFace page and proceed to checkout the data:

git clone git@hf.co:datasets/3dlg-hcvc/rlsd

The dataset is structured into different folders:

mp3d_arch -- contains a subset of the Matterport3D architecture files that we have used
scenes -- contains the annotated json files for each scene
renderings -- contains various renderings of the annotated scene
    original_panoramas -- original panoramas (without annotations) from Matterport3D
    annotated_panoramas -- panorama renderings colored by object instance ids
    annotated_scene_hierarchy -- hierarchy of the annotated objects for each scene
    annotated_topdown -- topdown images of each annotated scene colored by object instance ids
annotated_model_ids.txt -- list of all model ids used to annotate the scenes

The annotated scene json has the following structure:

{
    "format": "sceneState",
    "version": "v0.2.0",
    "scene": {
        "up": ..., # "x", "y" and "z"
        "front": ..., # "x", "y" and "z"
        "unit": 1,
        "assetSource": [
            "wayfair",
            "3dw"
        ],
        "object": [
            {
                "id": "a_883",
                "modelId": "wayfair.ANDO6573",
                "index": 0,
                "parentIndex": -1,
                "parentId": "1_8_1",
                "transform": {
                    "rows": 4,
                    "cols": 4,
                    "data": ... # 1x16 array
                },
                "obb": {
                        "centroid": ..., # 1x3 array
                        "axesLengths": ..., # 1x3 array,
                        "normalizedAxes": ..., # 1x9 array,
                        "min": ..., # 1x3 array,
                        "max": ... # 1x3 array
                    }
            }, ... # multiple object annotations
        ],
        "camera": [
                {
                    "up": ..., # "x", "y" and "z"
                    "position": ..., # "x", "y" and "z"
                    "target": ..., # "x", "y" and "z"
                    "isOrtho": false,
                    "name": "current"
                }
            ],
        "arch": {
            "ref": "mp3dArch.1pXnuDYAj8r_L1"
        }
    },
    "selected": [],
    "maskObjectAssignments": [
        {
            "photoId": ..., # same as viewpointId
            "maskId": 279,
            "clickPoint": [
                0.8349366636325659,
                0.4788185224119439
            ],
            "objectInstanceId": "a_883"
        }, ... # multiple mask2object assignments
    ],
    "maskComments": [ # comments from annotators against missed annotations
        {
            "reason": "WRONG-CATEGORY",
            "comment": "Picture",
            "maskId": "304"
        }, ... # multiple comments
    ],
    "maskInfos": [
        {
            "id": 276,
            "label": "curtain", # category
            "type": "mask",
            "pointNormalized": ..., # "x" and "y"
        }, ... # multiple mask infos
    ]
}

TODO

Add code reference to DPC experiments.

Bibtex

If you find our work useful, please cite

@article{wu2024r3ds,
    title={R3DS: Reality-linked 3D Scenes for Panoramic Scene Understanding}, 
    author={Qirui Wu and Sonia Raychaudhuri and Daniel Ritchie and Manolis Savva and Angel X Chang},
    year={2024},
    eprint={2403.12301},
    archivePrefix={arXiv},
    primaryClass={cs.CV}
}

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
docs		docs
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

R3DS

R3DS: Reality-linked 3D Scenes for Panoramic Scene Understanding

ECCV 2024

Data

TODO

Bibtex

About

Releases

Packages

Contributors 2

License

3dlg-hcvc/r3ds

Folders and files

Latest commit

History

Repository files navigation

R3DS

R3DS: Reality-linked 3D Scenes for Panoramic Scene Understanding

ECCV 2024

Data

TODO

Bibtex

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Packages