Automatically Generating UI Code from Screenshot: A Divide-and-Conquer-Based Approach

This is the artifact for the paper "Divide-and-Conquer: Generating UI Code from Screenshots." The paper is accepted by the ACM International Conference on the Foundations of Software Engineering (FSE'2025). This artifact supplies the DCGen toolkit and supplementary materials for the paper.

This repository contains:

Code implementation of DCGen, i.e., the Python script and instructions to run DCGen to preprocess websites, segment images, and generate UI code from screenshots with DCGen algorithm.
Sample dataset. The sample of our experiment data is available in /data. The full dataset is available on 🤗HuggingFace
A user-friendly tool based on DCGen.

Quick links: 📺Demo video | 📝DCGen Examples | 🖥️Code usage | 🔨Tool usage | 🤗Dataset

Abstract

To explore automatic design-to-code solutions, we begin with a motivating study on GPT-4o, identifying three key issues in UI code generation: element omission, distortion, and misarrangement. We find that focusing on smaller visual segments helps multimodal large language models (MLLMs) mitigate these failures. In this paper, we introduce DCGen, a divide-and-conquer approach that automates the translation of webpage designs into UI code. DCGen divides screenshots into manageable segments, generates descriptions for each, and reassembles them into a complete UI code for the full design. Extensive testing on real-world websites and various MLLMs demonstrates that DCGen improves visual similarity by up to 14% compared to competing methods. Human evaluations confirm that DCGen enables developers to implement webpages faster and with greater fidelity to the original designs. To our knowledge, DCGen is the first segment-aware, MLLM-based solution for generating UI code directly from screenshots.

Demo video

This video demonstrates how developers can use DCGen to create a webpage from a UI design through simple copy and paste. DCGen enables users to review and regenerate code for specific image segments, easily replacing any erroneous code with the correct version for the final webpage.

demo.mp4

Examples

Here are two examples from the usefulness study. DCGen demonstrates its effectiveness by significantly reducing element omissions and distortions, leading to faster development and improved webpage quality.

Code usage

0. Setup

pip install -r requirements.txt
playwright install

from utils import *
import single_file

1. Save & Process Website

single_file("https://www.overleaf.com", "./test.html")
simplify_html("test.html", "test_simplified.html", pbar=True)
driver = get_driver(file="./test.html")
take_screenshot(driver, "test.png")

2. Image Segmentation

img_seg = ImgSegmentation("0.png", max_depth=1)
seg.display_tree()

3. DCGen

The demo code and prompt can be found in scripts/experiments.py

from scripts.experiments import *
# Please refer to scripts/experiments.py for baseline prompt and DCGen prompt

bot = GPT4("path/to/you/key.txt", model="gpt-4o")
seg_params = {
    "max_depth": 2,
    "var_thresh": 50,
    "diff_thresh": 45,
    "diff_portion": 0.9,
    "window_size": 50
}
import os
os.chdir("../data")
# run single experiment. Params: bot, input image path, output html path
dcgen(bot, "../data/demo/0.png", "./test.html") 
# run dcgen for entire folder. Params: bot, input image folder, output html folder
dcgen_exp(bot, "./demo/", "dcgen_demo", multi_thread=True, seg_params=seg_params)
# baseline
single_turn_exp(bot, "./demo/", f"direct_demo", prompt_direct, multi_thread=True)
# make sure to move placeholder.png into respective dir before taking screenshots
os.system("mv placeholder.png dcgen_demo/placeholder.png")
os.system("mv placeholder.png direct_demo/placeholder.png")
take_screenshots_for_dir("./dcgen_demo/", replace=True)
take_screenshots_for_dir("./direct_demo/", replace=True)

4. Evaluate

Install requirements for the metric toolkit
```
cd scripts
bash install.sh
```
Modify configurations in scripts/evaluate.py:
```
original_reference_dir = "../data/demo"
test_dirs = {
    "dcgen": "../data/dcgen_demo",
    "direct": "../data/direct_demo",
}
exp_name = "test"
```
The original_reference_dir contains original HTML files 1.html, 2.html, ..., their corresponding screenshots 1.png, 2.png, ..., and the placeholder image placeholder.png.

The test_dirs contains the generated HTML files with the same name as the original ones 1.html, 2.html, ..., and the placeholder image placeholder.png.
Run the evaluation script
```
python evaluate.py
```

Note that the fine-grained metrics can only be run on linux or mac.

DCGen tool

Run locally

Start a server

cd Tool
python app.py

Visit http://127.0.0.1:5000 via local browser
Usage:

Generate image for entire screenshot

View the code of any image segment

Generate code for a image segment

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
Tool		Tool
assets		assets
data		data
scripts		scripts
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Automatically Generating UI Code from Screenshot: A Divide-and-Conquer-Based Approach

Abstract

Demo video

Examples

Code usage

0. Setup

1. Save & Process Website

2. Image Segmentation

3. DCGen

4. Evaluate

DCGen tool

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

WebPAI/DCGen

Folders and files

Latest commit

History

Repository files navigation

Automatically Generating UI Code from Screenshot: A Divide-and-Conquer-Based Approach

Abstract

Demo video

Examples

Code usage

0. Setup

1. Save & Process Website

2. Image Segmentation

3. DCGen

4. Evaluate

DCGen tool

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages