CalvinBackup/ScreenCoder_UI2Code

mirror of https://github.com/leigest519/ScreenCoder.git synced 2026-02-12 17:52:47 +00:00

ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents

Yilei Jiang¹*, Yaozhi Zheng¹*, Yuxuan Wan²*, Jiaming Han¹, Qunzhong Wang¹,
Michael R. Lyu², Xiangyu Yue¹✉

¹CUHK MMLab, ²CUHK ARISE Lab

*Equal contribution ✉Corresponding author

Demo Videos

A showcase of how ScreenCoder transforms UI screenshots into structured, editable HTML/CSS code using a modular multi-agent framework.

Project Structure

main.py: The main script to generate final HTML code for a single screenshot.
UIED/: Contains the UIED (UI Element Detection) engine for analyzing screenshots and detecting components.
- run_single.py: Python script to run UI component detection on a single image.
html_generator.py: Takes the detected component data and generates a complete HTML layout with generated code for each module.
image_replacer.py: A script to replace placeholder divs in the final HTML with actual cropped images.
mapping.py: Maps the detected UIED components to logical page regions.
requirements.txt: Lists all the necessary Python dependencies for the project.
doubao_api.txt: API key file for the Doubao model (should be kept private and is included in .gitignore).
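The README does not say how mapping.py aligns detected components with page regions or placeholders. As a rough illustration only, the sketch below pairs two sets of bounding boxes by intersection-over-union (IoU), a common way to align boxes. The box format `(x1, y1, x2, y2)`, the function names `iou` and `match_placeholders`, and the `min_iou` threshold are all assumptions made for this example, not the project's API.

```python
from typing import Dict, List, Optional, Tuple

# Assumed box format for this sketch: (x1, y1, x2, y2) in pixels.
Box = Tuple[int, int, int, int]


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def match_placeholders(
    placeholders: Dict[str, Box],
    elements: List[Box],
    min_iou: float = 0.1,  # hypothetical threshold, chosen for illustration
) -> Dict[str, int]:
    """Greedily assign each placeholder the unused element with the highest IoU.

    Returns a dict from placeholder name to the index of the matched element.
    Placeholders with no element above min_iou are left out.
    """
    matches: Dict[str, int] = {}
    used = set()
    for name, pbox in placeholders.items():
        best_idx: Optional[int] = None
        best_score = min_iou
        for idx, ebox in enumerate(elements):
            if idx in used:
                continue
            score = iou(pbox, ebox)
            if score > best_score:
                best_idx, best_score = idx, score
        if best_idx is not None:
            matches[name] = best_idx
            used.add(best_idx)
    return matches
```

A greedy pass is the simplest choice here. If placeholders compete for the same element, a global assignment method would give better pairings, at the cost of more code.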

Setup and Installation

Clone the repository:

```
git clone https://github.com/JimmyZhengyz/screencoder.git
cd screencoder
```

Create a virtual environment:

```
python3 -m venv .venv
source .venv/bin/activate
```

Install dependencies:
```
pip install -r requirements.txt
```
Set up API Key:
- Create a file named doubao_api.txt in the root directory.
- Paste your Doubao API key into this file.
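Before running the pipeline, it can help to confirm that the key file exists and is not empty. The helper below is a minimal sketch: `load_api_key` is a hypothetical name, and the assumption that the key is read as plain text with surrounding whitespace stripped is ours, not something the README states.

```python
from pathlib import Path


def load_api_key(path: str = "doubao_api.txt") -> str:
    """Read an API key from a plain-text file (hypothetical helper).

    Assumes the file holds only the key, possibly with surrounding whitespace.
    """
    key_file = Path(path)
    if not key_file.is_file():
        raise FileNotFoundError(f"API key file not found: {key_file}")
    key = key_file.read_text(encoding="utf-8").strip()
    if not key:
        raise ValueError(f"API key file is empty: {key_file}")
    return key
```

Calling `load_api_key()` from the repository root should return the key as a string, or raise an error that names the missing or empty file.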

Usage

The typical workflow is a multi-step process as follows:

Initial Generation with Placeholders: Run the Python script to generate the initial HTML code for a given screenshot.
- Block Detection:
```
python block_parsor.py
```
- Generation with Placeholders (Gray Image Blocks):
```
python html_generator.py
```
Final HTML Code: Run the Python scripts to generate the final HTML code with cropped images from the original screenshot.
- Placeholder Detection:
```
python image_box_detection.py
```
- UI Element Detection:
```
python UIED/run_single.py
```
- Mapping Alignment Between Placeholders and UI Elements:
```
python mapping.py
```
- Placeholder Replacement:
```
python image_replacer.py
```
Simple Run: Run the Python script to generate the final HTML code:
```
python main.py
```
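The step-by-step commands above can also be chained from a small driver script. The sketch below is an illustration, not part of the repository: `PIPELINE_STEPS`, `build_commands`, and `run_pipeline` are hypothetical names, and it assumes each script is run from the repository root with no command-line arguments, as the commands above show. It does not claim to reproduce what main.py does internally.

```python
import subprocess
import sys
from typing import List, Sequence

# Script order taken from the Usage steps above.
PIPELINE_STEPS = [
    "block_parsor.py",
    "html_generator.py",
    "image_box_detection.py",
    "UIED/run_single.py",
    "mapping.py",
    "image_replacer.py",
]


def build_commands(
    steps: Sequence[str] = PIPELINE_STEPS,
    python: str = sys.executable,
) -> List[List[str]]:
    """Build one argv list per pipeline step."""
    return [[python, step] for step in steps]


def run_pipeline(commands: Sequence[Sequence[str]], dry_run: bool = False) -> None:
    """Run each command in order, stopping at the first failure.

    With dry_run=True the commands are only printed, not executed.
    """
    for cmd in commands:
        print("Running:", " ".join(cmd))
        if not dry_run:
            # check=True raises CalledProcessError on a non-zero exit code.
            subprocess.run(list(cmd), check=True)
```

Calling `run_pipeline(build_commands(), dry_run=True)` prints the six commands without running them, which is a quick way to check the order before a real run.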

Acknowledgements

This project builds upon several outstanding open-source efforts. We would like to thank the authors and contributors of the following projects: UIED, DCGen, and Design2Code.
