Visual Automation Engine 🤖

A modular Python-based visual automation engine primarily designed to support a Power Apps-driven application, enabling automated screen interactions through coordinate mapping, image recognition, and resilient error-handling workflows.

Although its main use case is within a solution built on Power Apps, the repository is not limited to that ecosystem and can also be executed or adapted for use with other programs and desktop environments thanks to its flexible visual automation architecture.

Features

🖱️ Click, type, hotkeys, and key sequences via coordinate mapping
🖼️ Image detection with CLICK_WHEN_IMAGE and WAIT_FOR_IMAGE
🔁 Error recovery with ON_FAIL_GOTO, LABEL, GOTO, and RETRY
📅 Automatic date clicking with CLICK_NEXT_DATE
🟡 Random yellow box detection with CLICK_RANDOM_YELLOW
📂 Dynamic base directory — configurable from the menu
🧩 Clean modular architecture across 7 modules

Project Structure

visual-automation-engine/
├── main.py              # Entry point and menu
├── core.py              # Main orchestration class
├── acciones.py          # Action execution logic
├── vision.py            # Image/color detection
├── logger.py            # Logging and error screenshots
├── utils.py             # Helper functions
├── config.py            # Configuration and valid actions
├── automation_images/   # Reference images (add your own)
│   └── debug/           # Auto-generated logs and screenshots
└── example_automation.txt  # Example instruction file

Requirements

pip install pyautogui keyboard pillow

Usage

python main.py

Menu Options

Record new automation — capture coordinates interactively
Append to existing file — add steps to an existing .txt
Run automation — execute a .txt instruction file
Validate file — check a .txt for errors before running
Show stored images — list reference images available
Set base directory — configure where .txt files and images are stored

Instruction File Format

Each line follows this format:

ACTION,X,Y,DELAY,PAYLOAD

Action	Description
`CLICK`	Click at coordinates
`TYPE`	Click and type text
`TYPE_RAW`	Type without clicking
`PRESS`	Press a single key
`HOTKEY`	Execute a key combination
`KEYEVENTS`	Replay a recorded key sequence
`CLICK_WHEN_IMAGE`	Click when image appears on screen
`WAIT_FOR_IMAGE`	Wait until image is detected
`CLICK_NEXT_DATE`	Click next available date in a calendar
`CLICK_RANDOM_YELLOW`	Click a random yellow box in a region
`LABEL`	Define a named jump point
`GOTO`	Jump to a label
`RETRY`	Set retry count for next action
`ON_FAIL_GOTO`	Jump to label if next action fails

See example_automation.txt for a complete working example.

Notes

automation_images/ — place your reference .png images here
debug/ — auto-generated logs and error screenshots (not versioned)
config_usuario.json — stores your local base directory (not versioned)

Author

d4nnABR

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Visual Automation Engine 🤖

Features

Project Structure

Requirements

Usage

Menu Options

Instruction File Format

Notes

Author

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

Visual Automation Engine 🤖

Features

Project Structure

Requirements

Usage

Menu Options

Instruction File Format

Notes

Author