Overview

This project implements a web app where users upload a photo/scan of a hand-drawn software/system/process diagram. The app extracts shapes, arrows, and text, builds a structured graph, and exports a JSON file representing the graphical elements.

The pipeline uses robust classical Computer Vision (OpenCV) plus OCR (Tesseract). An optional neural segmentation model scaffold is included for improved results (PyTorch).

Setup

Install Tesseract.

Linux: sudo apt install tesseract-ocr
MacOS: brew install tesseract
Windows: install from the internet and add to PATH.

Install all required packages.

pip install -r requirements.txt

Usage

Linux/MacOS: bash run.sh
Windows: run_windows.bat

Open http://127.0.0.1:5000

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
static/css		static/css
templates		templates
.gitignore		.gitignore
README.md		README.md
app.py		app.py
cv_helpers.py		cv_helpers.py
gliffy_exporter.py		gliffy_exporter.py
inference.py		inference.py
nn_segmentation.py		nn_segmentation.py
requirements.txt		requirements.txt
run.sh		run.sh
run_windows.bat		run_windows.bat

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Overview

Setup

Usage

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

lishuaiphd/computer-vision-model-import

Folders and files

Latest commit

History

Repository files navigation

Overview

Setup

Usage

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages