Abstract

This is the artifact for the paper "Automatically Generating UI Code from Screenshot: A Divide-and-Conquer-Based Approach." This artifact supplies the DCGen toolkit and supplementary materials for the paper.

This repository contains:

- Code implementation of DCGen: the Python scripts and instructions for preprocessing websites, segmenting images, and generating UI code from screenshots with the DCGen algorithm.
- Sample dataset. A sample of our experiment data is available in /data. We will release the full dataset as soon as the paper is published.
- Link to supplementary materials. We provide all screen recordings from the usefulness study and our prompt details via this link.
- A user-friendly tool based on DCGen.

Quick links: Demo video | DCGen Examples | Code usage | Tool usage

Abstract

To explore automatic design-to-code solutions, we begin with a motivating study on GPT-4o, identifying three key issues in UI code generation: element omission, distortion, and misarrangement. We find that focusing on smaller visual segments helps multimodal large language models (MLLMs) mitigate these failures. In this paper, we introduce DCGen, a divide-and-conquer approach that automates the translation of webpage designs into UI code. DCGen divides screenshots into manageable segments, generates descriptions for each, and reassembles them into a complete UI code for the full design. Extensive testing on real-world websites and various MLLMs demonstrates that DCGen improves visual similarity by up to 14% compared to competing methods. Human evaluations confirm that DCGen enables developers to implement webpages faster and with greater fidelity to the original designs. To our knowledge, DCGen is the first segment-aware, MLLM-based solution for generating UI code directly from screenshots.

Demo video

This video demonstrates how developers can use DCGen to create a webpage from a UI design through simple copy and paste. DCGen enables users to review and regenerate code for specific image segments, easily replacing any erroneous code with the correct version for the final webpage.

demo.mp4

Examples

Here are two examples from the usefulness study. DCGen demonstrates its effectiveness by significantly reducing element omissions and distortions, leading to faster development and improved webpage quality.

Code usage

0. Setup

pip install -r requirements.txt

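from utils import *
# Import the function, not the module, so that single_file(...) is callable below.
# This assumes single_file.py defines a function with the same name.
from single_file import single_file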

1. Save & Process Website

single_file("https://www.overleaf.com", "./test.html")
simplify_html("test.html", "test_simplified.html", pbar=True)
driver = get_driver(file="./test.html")
take_screenshot(driver, "test.png")

2. Image Segmentation

img_seg = ImgSegmentation("0.png", max_depth=1)
img_seg.display_tree()
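
The README does not say how ImgSegmentation chooses its cuts. Purely as an illustration, the sketch below builds a segment tree for a tiny grayscale grid by cutting along blank rows, then blank columns, and stopping at max_depth. The names Segment, runs, and segment are hypothetical, nested lists stand in for a real image, and the actual class may work quite differently.

```python
from dataclasses import dataclass, field

BLANK = 255  # assumed background value in this toy example


@dataclass
class Segment:
    box: tuple  # (top, left, bottom, right); bottom and right are exclusive
    children: list = field(default_factory=list)


def runs(is_blank, start, stop):
    """Group consecutive non-blank indices in [start, stop) into (lo, hi) ranges."""
    out, lo = [], None
    for i in range(start, stop):
        if not is_blank(i) and lo is None:
            lo = i
        elif is_blank(i) and lo is not None:
            out.append((lo, i))
            lo = None
    if lo is not None:
        out.append((lo, stop))
    return out


def segment(pixels, box, depth=0, max_depth=1):
    """Cut `box` along blank rows (even depth) or blank columns (odd depth)."""
    top, left, bottom, right = box
    node = Segment(box)
    if depth >= max_depth:
        return node

    def row_blank(y):
        return all(pixels[y][x] == BLANK for x in range(left, right))

    def col_blank(x):
        return all(pixels[y][x] == BLANK for y in range(top, bottom))

    if depth % 2 == 0:
        parts = [(lo, left, hi, right) for lo, hi in runs(row_blank, top, bottom)]
    else:
        parts = [(top, lo, bottom, hi) for lo, hi in runs(col_blank, left, right)]
    # A single part identical to the box means no cut was found here.
    if parts != [box]:
        node.children = [segment(pixels, p, depth + 1, max_depth) for p in parts]
    return node


W = BLANK
pixels = [
    [0, 0, 0, 0],  # a full-width banner
    [W, W, W, W],  # blank row separating the two bands
    [0, W, W, 0],  # two columns with a blank gutter between them
    [0, W, W, 0],
    [W, W, W, W],
]
root = segment(pixels, (0, 0, 5, 4), max_depth=2)
```

In this toy grid, a larger max_depth lets the second band be split again into its two columns, which mirrors the role the max_depth argument appears to play above.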

3. DCGen

# Example prompt
prompt_dict = {
    "prompt_leaf": 'Here is a screenshot of a webpage with a red rectangular bounding box. Focus on the bounding box area. Respond with the content of the HTML+CSS code.',

    "prompt_node": 'Here are 1) a screenshot of a webpage with a red rectangular bounding box, and 2) code of different elements in the bounding box. Utilize the provided code to write a new HTML and CSS file to replicate the website in the bounding box. Here is the code of different parts of the webpage in the bounding box:\n=============\n'
}

bot = GPT4(key_path="./path/to/key.txt", model="gpt-4o")
img_seg = ImgSegmentation("0.png", max_depth=1)
dc_trace = DCGenTrace.from_img_seg(img_seg, bot, prompt_leaf=prompt_dict["prompt_leaf"], prompt_node=prompt_dict["prompt_node"])
dc_trace.generate_code(recursive=True, cut_out=False)
dc_trace.display_tree()
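
To make the roles of the two prompts concrete, here is a minimal sketch of the bottom-up recursion that generate_code(recursive=True) suggests: each leaf is prompted on its own, and each parent is prompted with its children's code appended after the separator. Node, generate, and fake_bot are hypothetical stand-ins, not the DCGenTrace or GPT4 classes from utils, and the shortened prompts are placeholders. The real implementation may differ.

```python
from dataclasses import dataclass, field
from typing import Callable, List

SEP = "\n=============\n"
PROMPT_LEAF = "Write HTML+CSS for the area inside the bounding box."
PROMPT_NODE = "Combine the code of the parts below into one HTML file:" + SEP


@dataclass
class Node:
    name: str  # stands in for a screenshot with a bounding box drawn on it
    children: List["Node"] = field(default_factory=list)
    code: str = ""


def generate(node: Node, ask: Callable[[str, str], str]) -> str:
    """Generate code bottom-up: solve the children first, then merge them."""
    if not node.children:
        node.code = ask(PROMPT_LEAF, node.name)
    else:
        parts = [generate(child, ask) for child in node.children]
        node.code = ask(PROMPT_NODE + SEP.join(parts), node.name)
    return node.code


def fake_bot(prompt: str, image: str) -> str:
    """Placeholder for an MLLM call; it just wraps whatever code it is given."""
    if prompt.startswith(PROMPT_NODE):
        child_code = prompt[len(PROMPT_NODE):].split(SEP)
        return f"<div id='{image}'>{''.join(child_code)}</div>"
    return f"<p>{image}</p>"


page = Node("page", [Node("header"), Node("body")])
html = generate(page, fake_bot)
```

Swapping fake_bot for a function that sends the prompt and image to a model would keep the recursion unchanged, which is the point of separating the leaf and node prompts.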

4. Calculate Score (Linux only)

Install the requirements for the metric toolkit:
```
pip install -r metrics/requirements.txt
```
Modify configurations in ./metrics/Design2Code/metrics/multi_processing_eval.py:
```
orig_reference_dir = "path/to/original_data_dir"
test_dirs = {
        "exp_name": "path/to/exp_data_dir"
    }
```
The original_data_dir contains original HTML files 1.html, 2.html, ..., their corresponding screenshots 1.png, 2.png, ..., and optionally a placeholder image placeholder.png.

The exp_data_dir contains the generated HTML files, named identically to the originals (1.html, 2.html, ...).
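
Before running the evaluation, it can help to confirm that the two directories line up as described. The sketch below is a hypothetical pre-flight check (missing_outputs is not part of the repository) that lists reference pages lacking a screenshot or a generated counterpart.

```python
from pathlib import Path


def missing_outputs(original_dir, exp_dir):
    """Return (pages without a screenshot, pages without generated HTML)."""
    original, exp = Path(original_dir), Path(exp_dir)
    # Every N.html in the reference directory is treated as one page.
    pages = sorted(p.stem for p in original.glob("*.html"))
    no_screenshot = [s for s in pages if not (original / f"{s}.png").is_file()]
    no_generated = [s for s in pages if not (exp / f"{s}.html").is_file()]
    return no_screenshot, no_generated
```

Calling missing_outputs("path/to/original_data_dir", "path/to/exp_data_dir") with the same paths used in the configuration returns two lists of page names; both should be empty before the evaluation script is run.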

Run the evaluation script

cd metrics/Design2Code
python metrics/multi_processing_eval.py

DCGen tool

Run locally

Start a server

cd Tool
python app.py

Visit http://127.0.0.1:5000 in a local browser.

Usage:

Generate code for the entire screenshot

View the code of any image segment

Generate code for an image segment
