🤖 Testing Agent - AI-Powered Intelligent Test Case Generation and Execution

Testing Agent is a test automation framework that generates and executes multiple test cases from a single test description. It uses a Large Language Model (LLM) to create test scenarios and runs them with browser automation, coordinated through a multi-agent architecture.

🚀 Features

🧠 Intelligent Test Case Generation

Test Case Generation: Automatically generates multiple comprehensive test cases from a single test description
TestAgentMain Orchestrator: Central coordinator that manages test generation and execution workflow
InstructionAgent: Specialized agent for parsing test descriptions and generating varied test scenarios
Smart Test Planning: Creates detailed test steps with expected outcomes for thorough validation
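
To make the idea of a generated test case concrete, here is a minimal sketch of how one might be represented and parsed from a JSON reply. The `GeneratedTestCase` class, its fields, and the JSON layout are assumptions made for illustration; the structures used inside InstructionAgent may differ.

```python
import json
from dataclasses import dataclass, field
from typing import List


@dataclass
class GeneratedTestCase:
    """Hypothetical container for one generated test scenario."""
    name: str
    steps: List[str] = field(default_factory=list)
    expected_outcome: str = ""


def parse_test_cases(raw_json: str) -> List[GeneratedTestCase]:
    """Parse a JSON array of scenarios into GeneratedTestCase objects.

    Entries without a name or without steps are skipped, since they
    cannot be executed meaningfully.
    """
    cases = []
    for item in json.loads(raw_json):
        name = str(item.get("name", "")).strip()
        steps = [str(step) for step in item.get("steps", [])]
        if not name or not steps:
            continue
        expected = str(item.get("expected_outcome", ""))
        cases.append(GeneratedTestCase(name, steps, expected))
    return cases
```

Keeping the expected outcome next to the steps means each scenario carries its own validation criterion into execution.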

🎯 Automated Test Execution

TestAgent Execution: Individual test agent instances for isolated test case execution
Sequential Processing: Executes test cases one by one for thorough testing
Result Validation: Validates expected outcomes using AI-powered tools during execution
Error Handling: Robust error handling with comprehensive error detection and recovery
Session Management: Proper initialization and cleanup for each test execution
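
The execution behaviour described above (one test at a time, with setup and cleanup around each) can be sketched roughly as follows. The `run_sequentially` helper and the dictionary keys are hypothetical; only the `initialize()`, `execute_plan()` and `cleanup()` method names come from the TestAgent usage shown in this README.

```python
from typing import Callable, Dict, List


def run_sequentially(test_cases: List[dict], make_agent: Callable[[], object]) -> List[Dict]:
    """Run each test case with a fresh agent, one after another.

    `make_agent` is a hypothetical factory returning an object with
    initialize(), execute_plan(goal, expected_outcome) and cleanup().
    """
    results = []
    for case in test_cases:
        agent = make_agent()
        try:
            agent.initialize()
            outcome = agent.execute_plan(case["goal"], case["expected_outcome"])
            results.append({"goal": case["goal"], "status": "completed", "result": outcome})
        except Exception as exc:
            # Record the failure and continue so one error does not stop the run.
            results.append({"goal": case["goal"], "status": "error", "error": str(exc)})
        finally:
            # Always release the session, even after a failure.
            agent.cleanup()
    return results
```

Creating a new agent per test case keeps state from one scenario from leaking into the next.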

🌐 Advanced Browser Automation

Multi-Tab Management: Create, switch, and manage multiple browser tabs
Smart Element Detection: Automatically identifies interactive elements (buttons, inputs, links)
Intelligent Selectors: Uses both CSS selectors and XPath with fallback strategies
Cross-Browser Support: Built on Playwright for reliable automation
Real-time DOM Analysis: Dynamic page analysis and element mapping
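
A selector fallback strategy of the kind mentioned above could look like the sketch below. `find_with_fallback` is a hypothetical helper, not part of the project; it only assumes a page object with a `query_selector(selector)` method, which Playwright's sync `Page` provides.

```python
from typing import Iterable


def find_with_fallback(page, selectors: Iterable[str]):
    """Return the first element matched by any selector, else None.

    Selectors are tried in order, so put the preferred one first
    (for example a CSS id, then an XPath expression).
    """
    for selector in selectors:
        try:
            element = page.query_selector(selector)
        except Exception:
            # An invalid selector should not abort the search.
            continue
        if element is not None:
            return element
    return None
```

With Playwright, a call might look like `find_with_fallback(page, ["#login-button", "xpath=//button[@type='submit']"])`, where both selectors are illustrative.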

🔍 Intelligent Validation & Testing

LLM-Powered Validation: Uses AI to validate login success, form submissions, and page states
Smart Error Detection: Automatically detects and reports validation failures
Context-Aware Testing: Understands the intent behind tests and validates accordingly
Visual DOM Parsing: Extracts only visible and interactive elements
Real-time State Tracking: Maintains up-to-date page state and element maps
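
As a rough illustration of extracting only visible, interactive elements into an indexed map, the sketch below uses the standard library's `html.parser` on static HTML. It is a simplification under stated assumptions: it checks only the `hidden` attribute, `type="hidden"` and an inline `display:none`, and ignores stylesheets and hidden ancestors. The class and function names are hypothetical and are not the project's DOM tree parser.

```python
from html.parser import HTMLParser
from typing import Dict, List

INTERACTIVE_TAGS = {"a", "button", "input", "select", "textarea"}


class InteractiveElementParser(HTMLParser):
    """Collect interactive elements that are not obviously hidden."""

    def __init__(self) -> None:
        super().__init__()
        self.elements: List[Dict[str, str]] = []

    def handle_starttag(self, tag, attrs):
        if tag not in INTERACTIVE_TAGS:
            return
        # Valueless attributes such as `hidden` arrive with value None.
        attr_map = {name: (value or "") for name, value in attrs}
        style = attr_map.get("style", "").replace(" ", "").lower()
        hidden = (
            "hidden" in attr_map
            or attr_map.get("type", "").lower() == "hidden"
            or "display:none" in style
        )
        if not hidden:
            self.elements.append({"tag": tag, **attr_map})


def build_element_map(html: str) -> Dict[int, Dict[str, str]]:
    """Return {index: element} for the visible interactive elements."""
    parser = InteractiveElementParser()
    parser.feed(html)
    return dict(enumerate(parser.elements))
```

An index-based map like this is what makes index-addressed actions such as `click_element` possible.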

📝 Comprehensive Logging & Memory

Multi-Agent Logging: Separate logging for each agent with synchronized debug files
Session Persistence: Saves complete automation sessions with detailed logs
Memory System: Remembers successful patterns and learns from failures
Debug Mode: Detailed logging for development and troubleshooting
Execution History: Complete audit trail of all actions and results across agents

🏗️ Test Execution Architecture

Testing Agent - Test Generation & Execution System
├── 🎯 TestAgentMain           # Main orchestrator for test workflow
│   ├── Test Case Generation   # Generates multiple test scenarios from description
│   ├── Sequential Execution   # Executes test cases one by one
│   ├── Parallel Processing    # Fast execution without delays
│   └── Error Management      # Handles execution errors and exceptions
├── 📋 InstructionAgent        # Test case generation specialist
│   ├── Test Description Parsing  # Analyzes test requirements
│   ├── Scenario Generation   # Creates multiple test variations
│   ├── Step Planning        # Generates detailed test steps
│   └── Expected Outcomes     # Defines validation criteria
├── 🤖 TestAgent              # Individual test execution engine
│   ├── Test Initialization   # Sets up test environment
│   ├── Plan Execution       # Executes test steps with browser automation
│   ├── Outcome Validation    # Validates results against expected outcomes
│   └── Session Cleanup      # Proper cleanup after test completion
├── 🔧 Tool Agent             # Specialized validation and analysis
│   ├── LLM-Powered Analysis  # Intelligent page state validation
│   ├── Login Verification    # Automated login success detection
│   └── Form Validation       # Smart form submission verification
├── 🌐 Browser Controller     # High-level browser automation
│   ├── Multi-Tab Management  # Tab creation, switching, and closing
│   ├── Element Interaction   # Click, input, and navigation actions
│   └── DOM Tree Parsing      # Real-time page structure analysis
└── 📊 Shared Components      # Common utilities and logging
    ├── LLM Client           # Gemini Flash integration
    ├── Logging System      # Multi-agent synchronized logging
    └── Debug Tools          # Development and troubleshooting

🛠️ Installation

Prerequisites

Python 3.8 or higher
Google Gemini API key

Setup

Clone the repository

git clone https://github.com/mushfiqur-rahman/Testing-Agent.git
cd Testing-Agent

Install dependencies

pip install -r requirements.txt
playwright install

Set up environment variables

Create a .env file in the project root:
```
GEMINI_API_KEY=your_gemini_api_key_here
```
Important: Never commit your .env file to version control. It's already included in .gitignore.
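
Note that `os.getenv` reads the process environment and does not parse a .env file by itself. This README does not say how the project loads the file, so the following is only a sketch of a standard-library loader; `load_env_file` is a hypothetical helper, and a package such as python-dotenv could be used instead if it is available.

```python
import os
from pathlib import Path


def load_env_file(path: str = ".env") -> dict:
    """Load KEY=VALUE pairs from a file into os.environ.

    Blank lines and lines starting with '#' are ignored. Variables that
    are already set in the environment are left untouched.
    """
    loaded = {}
    env_path = Path(path)
    if not env_path.is_file():
        return loaded
    for line in env_path.read_text(encoding="utf-8").splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        key, value = key.strip(), value.strip().strip("'\"")
        if key and key not in os.environ:
            os.environ[key] = value
            loaded[key] = value
    return loaded
```

Calling `load_env_file()` before `os.getenv('GEMINI_API_KEY')` would make the key from .env visible to the process.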

🎯 Quick Start

Test Case Generation and Execution

from src.test_agent_main import TestAgentMain
import os

# Initialize the main test orchestrator
API_KEY = os.getenv('GEMINI_API_KEY')
main_agent = TestAgentMain(
    api_key=API_KEY,
    max_actions=15,
    debug=True
)

# Define your test scenario description
test_description = """
Navigate to the login form test page and test the login functionality:
1. Go to file:///path/to/your/login_form.html
2. Fill in the email field with 'test@example.com'
3. Fill in the password field with 'password123'
4. Click the login button
5. Verify that the login succeeds
"""

# Generate multiple test cases from the description
test_cases = main_agent.generate_test_cases(test_description)

# Execute all generated test cases
main_agent.execute_all_test_cases(test_cases)

Individual Test Agent Usage

import os

from src.test_agent import TestAgent

API_KEY = os.getenv('GEMINI_API_KEY')

# For running a single test case
test_agent = TestAgent(
    api_key=API_KEY,
    max_actions=15,
    debug=True
)

# Initialize and execute
test_agent.initialize()
try:
    user_goal = "Navigate to login page and test login functionality"
    expected_outcome = "User successfully logs in and sees dashboard"

    results = test_agent.execute_plan(user_goal, expected_outcome)
finally:
    # Clean up even if execution raises
    test_agent.cleanup()

🚀 Getting Started

Clone and setup the project

git clone https://github.com/mushfiqur-rahman/Testing-Agent.git
cd Testing-Agent
pip install -r requirements.txt
playwright install

Configure your environment

echo "GEMINI_API_KEY=your_api_key_here" > .env

Run your first test
```
python src/test_agent.py
```
View the results

Check the logs/ directory for detailed execution logs and session data.

The example below runs a free-form automation goal. It assumes that agent and browser_controller have already been initialized.

goal = """
Navigate to example.com and:
1. Find the search box
2. Search for 'testing automation'
3. Click on the first result
4. Verify the page loaded successfully
"""

# Execute the automation plan
execution_log = agent.execute_plan(goal)

# Get results and save session
summary = agent.get_session_summary()
print(f"Completed: {summary['successful_steps']}/{summary['total_steps']} steps")
agent.save_session_log()

# Cleanup
browser_controller.close()



Login Form Testing Example

# Test a login form with multi-agent validation
login_goal = """
Test the login functionality:
1. Go to the login page
2. Fill in email field with 'test@example.com'
3. Fill in password field with 'password123'
4. Click the login button
5. Use tools to verify successful login
"""

execution_log = agent.execute_plan(login_goal)

# The Tool Agent will automatically:
# - Analyze the page state after login
# - Detect success/failure messages
# - Validate the user's email presence
# - Provide detailed validation results

🎮 Available Actions

The multi-agent system supports these browser actions:

| Action | Description | Agent | Parameters |
| --- | --- | --- | --- |
| `navigate_to` | Navigate to a URL | Main Agent | `url` |
| `click_element` | Click an element by index | Main Agent | `index` |
| `input_text` | Input text into form fields | Main Agent | `index`, `text` |
| `switch_tab` | Switch between tabs | Main Agent | `index` |
| `open_tab` | Open new tab | Main Agent | `url` (optional) |
| `close_tab` | Close a tab | Main Agent | `index` |
| `go_back` | Navigate back in history | Main Agent | None |
| `tools` | Execute intelligent validation | Tool Agent | `reason` |
| `end` | Terminate the session | Main Agent | `reason` |
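
One way to use the table above is to check an action for missing parameters before dispatching it. The sketch below is illustrative only: the `{"action": ..., "params": {...}}` shape and the `validate_action` helper are assumptions, not the project's internal format.

```python
from typing import Dict, List

# Required parameters per action, taken from the table above.
# `open_tab` accepts an optional `url`, so it has no required parameters.
REQUIRED_PARAMS: Dict[str, List[str]] = {
    "navigate_to": ["url"],
    "click_element": ["index"],
    "input_text": ["index", "text"],
    "switch_tab": ["index"],
    "open_tab": [],
    "close_tab": ["index"],
    "go_back": [],
    "tools": ["reason"],
    "end": ["reason"],
}


def validate_action(action: dict) -> List[str]:
    """Return a list of problems; an empty list means the action is valid."""
    name = action.get("action")
    if name not in REQUIRED_PARAMS:
        return [f"unknown action: {name!r}"]
    params = action.get("params", {})
    return [f"missing parameter: {p}" for p in REQUIRED_PARAMS[name] if p not in params]
```

Rejecting malformed actions early gives a clearer error than a failure deep inside the browser layer.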

🔧 Tool Agent Capabilities

The Tool Agent provides specialized actions for complex validation:

Login Verification: Automatically detects successful/failed logins
Form Validation: Validates form submissions and error states
Page State Analysis: Analyzes current page state and content
Error Detection: Identifies and reports page errors or issues
Content Verification: Validates expected content presence
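
Because these validations rely on an LLM reply, it helps to turn that reply into a strict pass/fail result. The sketch below assumes the model is asked to answer with JSON of the form `{"passed": true, "reason": "..."}`; that reply format, `ValidationResult` and `parse_verdict` are all hypothetical and are not taken from the Tool Agent's code.

```python
import json
from dataclasses import dataclass


@dataclass
class ValidationResult:
    """Hypothetical structured verdict for one validation request."""
    passed: bool
    reason: str


def parse_verdict(llm_reply: str) -> ValidationResult:
    """Parse a reply expected to look like {"passed": true, "reason": "..."}.

    Anything that is not valid JSON with a boolean `passed` field is
    treated as a failed validation, so a malformed reply never counts
    as a pass.
    """
    try:
        data = json.loads(llm_reply)
    except json.JSONDecodeError:
        return ValidationResult(False, "reply was not valid JSON")
    if not isinstance(data, dict) or not isinstance(data.get("passed"), bool):
        return ValidationResult(False, "reply did not contain a boolean 'passed' field")
    return ValidationResult(data["passed"], str(data.get("reason", "")))
```

Failing closed on unparseable replies avoids reporting a test as successful when the validator's answer could not be understood.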

📁 Project Structure

Testing-Agent/
├── src/                      # Source code
│   ├── agent/               # AI agent components
│   │   ├── core_utils/      # Core utilities
│   │   │   ├── llm.py       # Gemini Flash LLM client
│   │   │   └── logging_utils.py  # Logging utilities
│   │   ├── main_agent/      # Main agent logic
│   │   │   ├── agent.py     # Core agent class
│   │   │   └── prompt_generator.py  # Prompt generation
│   │   ├── tool_agent/      # Tool management
│   │   │   └── tools.py     # Tool implementations
│   │   └── instruction_agent/  # Instruction handling
│   │       └── initial.py   # Initial instructions
│   ├── browser/             # Browser automation
│   │   ├── browser_context.py     # Browser session management
│   │   ├── dom_tree_builder.py    # DOM tree construction
│   │   └── dom_tree_parser.py     # Element parsing
│   └── controller/          # High-level automation
│       └── browser_controller.py  # Main controller
├── tests/                   # Test suite
│   ├── browser_controller_test.py
│   ├── dom_tree_builder_test.py
│   ├── dom_tree_parser_test.py
│   └── main.py             # Test runner
├── html/                   # Test HTML files
│   ├── login_form.html     # Login form test page
│   ├── test_page.html      # Basic test page
│   └── test_page2.html     # Complex test page
├── logs/                   # Session logs and debug files
├── requirements.txt        # Python dependencies
├── .env                    # Environment variables (create this)
└── README.md              # This file

🔧 Configuration

Multi-Agent Configuration

# Initialize with custom settings
agent = Agent(
    llm=llm_client,
    max_actions=30,        # Max actions per plan
    debug=True            # Enable detailed logging for all agents
)

# The main agent automatically coordinates with:
# - Tool Agent for validation tasks
# - Instruction Agent for prompt optimization
# - Browser Controller for automation

Tool Agent Configuration

# Tool Agent is automatically initialized with LLM client
# and provides intelligent validation capabilities
# No manual configuration required

Debug Mode for Multi-Agent System

# Enable comprehensive logging across all agents
agent = Agent(llm=llm_client, debug=True)

# This creates synchronized log files:
# - agent_debug_YYYYMMDD_HHMMSS.log (Main Agent decisions)
# - tools_debug_YYYYMMDD_HHMMSS.log (Tool Agent validations)
# - agent_session_YYYYMMDD_HHMMSS.json (Complete session data)

Browser Configuration

The browser runs in headless mode by default. To see the browser:

# In src/browser/browser_context.py, modify the launch parameters
browser = playwright.chromium.launch(headless=False)
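
Instead of editing the source each time, the headless flag could be driven by an environment variable. This is only a sketch: the `TEST_AGENT_HEADLESS` variable and the `resolve_headless` helper are hypothetical and are not read by the project as shipped.

```python
import os


def resolve_headless(default: bool = True) -> bool:
    """Decide the headless flag from a hypothetical TEST_AGENT_HEADLESS variable.

    Values such as "0" or "false" show the browser window; an unset
    variable falls back to `default`.
    """
    raw = os.getenv("TEST_AGENT_HEADLESS")
    if raw is None:
        return default
    return raw.strip().lower() not in {"0", "false", "no", "off"}


# Possible usage with Playwright's sync API:
# from playwright.sync_api import sync_playwright
# with sync_playwright() as p:
#     browser = p.chromium.launch(headless=resolve_headless())
#     browser.close()
```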

📊 Logging and Debugging

Debug Mode

Enable comprehensive logging:

agent = Agent(llm=llm_client, debug=True)

This creates timestamped log files in the logs/ directory:

agent_debug_YYYYMMDD_HHMMSS.log - Agent decision making
memory_debug_YYYYMMDD_HHMMSS.log - Memory operations
agent_session_YYYYMMDD_HHMMSS.json - Complete session data
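
The naming pattern above can be reproduced with a single shared timestamp, so that all files from one run sort together. `build_log_paths` is a hypothetical helper written for illustration; the project's logging utilities may build these names differently.

```python
from datetime import datetime
from pathlib import Path
from typing import Dict, Optional


def build_log_paths(log_dir: str = "logs", now: Optional[datetime] = None) -> Dict[str, Path]:
    """Return log paths that share one YYYYMMDD_HHMMSS timestamp."""
    stamp = (now or datetime.now()).strftime("%Y%m%d_%H%M%S")
    base = Path(log_dir)
    return {
        "agent_debug": base / f"agent_debug_{stamp}.log",
        "memory_debug": base / f"memory_debug_{stamp}.log",
        "agent_session": base / f"agent_session_{stamp}.json",
    }
```

Computing the timestamp once, rather than per file, avoids names that differ by a second within the same session.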

Session Analysis

# Get detailed session summary
summary = agent.get_session_summary()
print(f"Duration: {summary['session_duration_seconds']:.2f} seconds")
print(f"Success Rate: {summary['successful_steps']}/{summary['total_steps']}")

# Save session for later analysis
filename = agent.save_session_log()
print(f"Session saved to: {filename}")
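
When reporting a success rate from the summary, it is worth guarding against a session with zero steps. The sketch below assumes only the three summary keys used above; `format_summary` itself is a hypothetical helper.

```python
def format_summary(summary: dict) -> str:
    """Format a session summary, guarding against a run with zero steps."""
    total = summary.get("total_steps", 0)
    ok = summary.get("successful_steps", 0)
    duration = summary.get("session_duration_seconds", 0.0)
    # Avoid ZeroDivisionError when no steps were executed.
    rate = (ok / total * 100) if total else 0.0
    return f"{ok}/{total} steps succeeded ({rate:.1f}%) in {duration:.2f}s"
```

It could be called as `print(format_summary(agent.get_session_summary()))`.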

🧪 Testing

Run the included tests:

# Run the test suite
python tests/main.py

# Or run individual test files
python tests/browser_controller_test.py
python tests/dom_tree_builder_test.py
python tests/dom_tree_parser_test.py

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

Ready to automate? Start with the Quick Start guide and build your first AI-powered browser automation! 🚀
