
News ✨ ✨

  • [2024-10-07] Our code and dataset are released.
  • [2024-02-26] Our paper has been accepted to CVPR 2024.

Bi-Layout

This is the PyTorch implementation of our paper "No More Ambiguity in 360° Room Layout via Bi-Layout Estimation" (CVPR 2024).
[Project Page]

Figure: Bi-Layout network architecture.

Inherent ambiguity in layout annotations poses significant challenges to developing accurate 360° room layout estimation models. To address this issue, we propose a novel Bi-Layout model capable of predicting two distinct layout types. One stops at ambiguous regions, while the other extends to encompass all visible areas. Our model employs two global context embeddings, where each embedding is designed to capture specific contextual information for each layout type. With our novel feature guidance module, the image feature retrieves relevant context from these embeddings, generating layout-aware features for precise bi-layout predictions.

A unique property of our Bi-Layout model is its ability to inherently detect ambiguous regions by comparing the two predictions. To circumvent the need for manual correction of ambiguous annotations during testing, we also introduce a new metric for disambiguating ground truth layouts. Our method demonstrates superior performance on benchmark datasets, notably outperforming leading approaches. Specifically, on the MatterportLayout dataset, it improves 3DIoU from 81.70% to 82.57% across the full test set and notably from 54.80% to 59.97% in subsets with significant ambiguity.
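
Conceptually, ambiguity detection amounts to comparing the two predicted layouts. The sketch below is a toy illustration (not code from this repository), assuming each prediction is a per-column boundary curve and using an arbitrary disagreement threshold:

import numpy as np

def ambiguous_columns(boundary_a, boundary_b, threshold=0.05):
    """Flag image columns where two layout predictions disagree.

    boundary_a, boundary_b: 1D arrays with one boundary value per panorama
    column (e.g., a normalized floor-boundary coordinate).
    `threshold` is an illustrative tolerance, not a value from the paper.
    """
    diff = np.abs(np.asarray(boundary_a) - np.asarray(boundary_b))
    return diff > threshold  # boolean mask of "ambiguous" columns

# Toy usage: two predictions that agree everywhere except columns 400-600.
cols = 1024
pred_enclosed = np.full(cols, 0.70)
pred_extended = pred_enclosed.copy()
pred_extended[400:600] = 0.85  # the extended layout reaches further
mask = ambiguous_columns(pred_enclosed, pred_extended)
print(f"{mask.sum()} of {cols} columns flagged as ambiguous")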

Installation

Install our dependencies:

conda create -n bi_layout python=3.8 -y
conda activate bi_layout
pip install -r requirements.txt
conda install pytorch==1.12.0 torchvision==0.13.0 cudatoolkit=11.3 -c pytorch -y
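
Optionally, you can sanity-check the environment afterwards; this snippet only prints the installed versions and CUDA visibility:

import torch
import torchvision

# The commands above pin torch 1.12.0 / torchvision 0.13.0 with CUDA 11.3.
print("torch:", torch.__version__)
print("torchvision:", torchvision.__version__)
print("CUDA available:", torch.cuda.is_available())
if torch.cuda.is_available():
    print("device:", torch.cuda.get_device_name(0))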

Preparing Model Weights

You can download our model weights here.

Make sure the model weight files are stored as follows:

checkpoints/
|-- Bi_Layout_Net/
    |-- mp3d/
        |-- mp3d_best_model.pkl
    |-- zind_all/
        |-- zind_all_best_model.pkl
    |-- zind_simple/
        |-- zind_simple_best_model.pkl
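
If you want to verify the layout, a small optional script (assuming you run it from the repository root) can check that each expected checkpoint file exists:

from pathlib import Path

# Expected checkpoint locations from the tree above.
expected = [
    "checkpoints/Bi_Layout_Net/mp3d/mp3d_best_model.pkl",
    "checkpoints/Bi_Layout_Net/zind_all/zind_all_best_model.pkl",
    "checkpoints/Bi_Layout_Net/zind_simple/zind_simple_best_model.pkl",
]
for path in expected:
    status = "ok" if Path(path).is_file() else "MISSING"
    print(f"{status:7s} {path}")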

Preparing Dataset

MatterportLayout

You can download our processed MatterportLayout dataset here.

Make sure the dataset files are stored as follows:

src/dataset/mp3d/
|-- image/
    |-- 17DRP5sb8fy_08115b08da534f1aafff2fa81fc73512.png
|-- label/
    |-- 17DRP5sb8fy_08115b08da534f1aafff2fa81fc73512.json
|-- split/
    |-- test.txt
    |-- train.txt
    |-- val.txt
|-- all_mix_labels_in_uv_v2/
    |-- 17DRP5sb8fy_08115b08da534f1aafff2fa81fc73512.txt
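
Optionally, you can sanity-check the split files against the images and labels. The snippet below is an illustration that assumes each split file lists one panorama id per line, matching the file names shown above:

from pathlib import Path

root = Path("src/dataset/mp3d")
for split in ["train", "val", "test"]:
    # Assumes one panorama id per line in each split file.
    ids = (root / "split" / f"{split}.txt").read_text().split()
    missing = [i for i in ids
               if not (root / "image" / f"{i}.png").is_file()
               or not (root / "label" / f"{i}.json").is_file()]
    print(f"{split}: {len(ids)} ids, {len(missing)} with missing files")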

ZInd

The official ZInd dataset is available here.

Make sure the dataset files are stored as follows:

src/dataset/ZInd/
|-- 0000/
    |-- panos/
        |-- floor_01_partial_room_01_pano_14.jpg
    |-- zind_data.json
|-- room_shape_simplicity_labels.json
|-- zind_partition.json
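
Optionally, you can inspect zind_partition.json to see how the scenes are split; the snippet below only assumes the file maps split names to lists of scene ids such as "0000":

import json
from pathlib import Path

# Assumption: zind_partition.json maps split names (e.g. "train"/"val"/"test")
# to lists of scene ids.
partition = json.loads(Path("src/dataset/ZInd/zind_partition.json").read_text())
for split, scenes in partition.items():
    print(f"{split}: {len(scenes)} scenes")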

Evaluation

Figure: evaluation results.

  • We report "full_2d" and "full_3d" as 2DIoU and 3DIoU (a toy 2DIoU sketch follows the commands below).
  • For the equivalent branch, refer to "ValEpochIoU".
  • For the disambiguated result, refer to "Oracle_ValEpochIoU".

You can evaluate by executing one of the following commands. If you want to save the visual results, add "--save_eval" to the command.

  • MatterportLayout dataset
    python main.py --cfg src/config/mp3d.yaml --mode test
  • ZInd All dataset
    python main.py --cfg src/config/zind_all.yaml --mode test
  • ZInd Simple dataset
    python main.py --cfg src/config/zind_simple.yaml --mode test
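
As noted above, 2DIoU compares the predicted and ground-truth floor plans. The toy sketch below (using shapely purely for illustration; it is not necessarily the metric code in this repository) shows the idea on axis-aligned rectangles:

from shapely.geometry import Polygon

def layout_2d_iou(pred_corners, gt_corners):
    """IoU between two floor-plan polygons given as lists of (x, y) corners."""
    pred, gt = Polygon(pred_corners), Polygon(gt_corners)
    inter = pred.intersection(gt).area
    union = pred.union(gt).area
    return inter / union if union > 0 else 0.0

# Toy example: a 4x4 room vs. a prediction shifted by 1 along x.
gt = [(0, 0), (4, 0), (4, 4), (0, 4)]
pred = [(1, 0), (5, 0), (5, 4), (1, 4)]
print(f"2DIoU = {layout_2d_iou(pred, gt):.3f}")  # 12 / 20 -> 0.600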

Training

Execute the following command to train (e.g., on the MatterportLayout dataset):

python main.py --cfg src/config/mp3d.yaml --mode train
  • You can copy and modify the YAML configuration file to train on other datasets.
  • You can toggle the pin memory setting at line 26 in "dataset/build.py" to see how it affects training speed (see the generic DataLoader sketch below).
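
The pin memory option mentioned above is a standard PyTorch DataLoader argument; the generic sketch below (not the repository's "dataset/build.py") shows where it is toggled:

import torch
from torch.utils.data import DataLoader, TensorDataset

def main():
    # Dummy dataset standing in for the panorama loader.
    dataset = TensorDataset(torch.randn(256, 3, 64, 128), torch.zeros(256))

    # pin_memory=True keeps batches in page-locked host memory, which can speed
    # up host-to-GPU copies; flip it to False and compare per-epoch wall time.
    loader = DataLoader(dataset, batch_size=16, shuffle=True,
                        num_workers=2, pin_memory=True)
    for images, labels in loader:
        if torch.cuda.is_available():
            images = images.cuda(non_blocking=True)

if __name__ == "__main__":
    main()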

Acknowledgements

The code style is based on Swin-Transformer.

Some components refer to the following projects:


Citation

If you use this code for your research, please cite:

@inproceedings{tsai2024no,
  title={No more ambiguity in 360° room layout via bi-layout estimation},
  author={Tsai, Yu-Ju and Jhang, Jin-Cheng and Zheng, Jingjing and Wang, Wei and Chen, Albert and Sun, Min and Kuo, Cheng-Hao and Yang, Ming-Hsuan},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year={2024}
}