Security AI Agent

A Security AI Agent for automating vulnerability assessments for websites and smart contracts.

Features

Input Handling: Accepts GitHub repository URLs or Solidity smart contract code
CVE Knowledge Base: Queries for known CVEs related to the target
AI Audit Analyzer: Advanced AI-powered Solidity code auditor that leverages knowledge from past audit reports
Tool Selection: Dynamically selects appropriate security tools based on the target type
Scan Execution: Executes selected security tools against the target
Result Aggregation: Merges and deduplicates results from multiple tools
Summary Generation: Produces human-readable reports with remediation suggestions
GitHub Repository Support: Directly scan code from GitHub repositories
Web Interface: User-friendly frontend for scanning and viewing results

AI Audit Analyzer

The AI Audit Analyzer is a sophisticated component that uses GPT-4 to perform intelligent security audits on Solidity smart contracts. Key features include:

Knowledge-Based Analysis: Leverages patterns and findings from past audit reports to enhance detection accuracy
Comprehensive Vulnerability Detection: Identifies 10+ categories of vulnerabilities including:
- Reentrancy vulnerabilities
- Integer overflow/underflow
- Access control issues
- Unchecked return values
- Gas optimization issues
- Business logic issues
- Oracle manipulation vulnerabilities
- Front-running vulnerabilities
- Denial of Service (DoS) vulnerabilities
Structured Output: Provides detailed findings with severity levels, locations, and remediation recommendations
Pattern Recognition: Automatically extracts and applies vulnerability patterns from historical audit data

Quick Start

Prerequisites

Python 3.9+: Required for the backend
Node.js 18+: Required for the frontend
uv: Fast Python package manager (Installation Guide)
OpenAI API Key: Get your API key from OpenAI
GitHub Token (optional): For private repositories or to avoid rate limits. Get it from GitHub Settings

Installation & Setup

1. Install UV (if not already installed)

# Install uv
curl -LsSf https://astral.sh/uv/install.sh | sh

# Add to PATH (restart shell or run):
source $HOME/.local/bin/env

2. Clone and Setup Project

git clone <your-repo-url>
cd sec-agent

3. Setup Python Backend with UV

# Create virtual environment
uv venv

# Activate virtual environment
source .venv/bin/activate

# Install dependencies (choose one based on your needs):
uv sync                    # Core dependencies only
uv sync --extra ai         # Core + AI features (recommended)
uv sync --extra security   # Core + security tools
uv sync --extra full       # Everything (AI + security + dev tools)

4. Setup Frontend

cd frontend
npm install
cd ..

5. Configure Environment Variables

Create a .env file in the project root:

OPENAI_API_KEY=your_openai_api_key
GITHUB_TOKEN=your_github_token  # Optional
LOG_LEVEL=INFO
PORT=8080

Running the Application

Start Backend Server

# Make sure virtual environment is activated
source .venv/bin/activate

# Start the backend server (choose one):
uv run python -m backend.api.server
# OR
python backend/api/server.py
# OR
PORT=8080 GITHUB_TOKEN=your_github_token OPENAI_API_KEY=your_openai_key python -m backend.api.server

Start Frontend (in a new terminal)

cd frontend
npm start

Access the Application

Frontend: Open your browser and navigate to http://localhost:3000
Backend API: Available at http://localhost:8080
API Status: Check http://localhost:8080/api/status

Usage

Enter a GitHub repository URL in the input field
Optionally provide your GitHub token for private repositories
Click submit and wait for the scan to complete
View results organized by severity:
- Summary: Executive summary and overall assessment
- Issues: Security issues categorized by severity (High, Medium, Low, Info)
- Debug: Raw response data for debugging

You can also paste Solidity code directly into the interface for analysis.

Project Structure

Backend (`backend/`)

backend/core/ - Core business logic and functionality
- security_agent.py - Main Security Agent module
- input_handler.py - Input validation and classification
- cve_knowledge_base.py - CVE querying and analysis
- tool_selector.py - Security tool selection
- scan_executor.py - Tool execution
- result_aggregator.py - Result aggregation and deduplication
- result_summarizer.py - Report generation using LLMs
backend/utils/ - Helper utilities
- cve_loader.py - CVE data loading utilities
- helpers.py - Common utility functions

Frontend (`frontend/`)

A React-based user interface for the security agent.

Architecture Diagram

flowchart TD
    %% Entry Points
    User((User)) --> WebInterface[React Frontend<br/>Port 3000]
    WebInterface --> FlaskAPI[Flask API Server<br/>Port 8080]

    %% Main Security Agent Component
    FlaskAPI --> SecurityAgent[SecurityAgent<br/>Main Orchestrator]

    %% Core Analysis Flow
    SecurityAgent --> InputHandler[InputHandler<br/>URL/File Validation]
    SecurityAgent --> CVEQuery[CVE Knowledge Query<br/>Vulnerability Database]
    
    %% Parallel Analysis Paths
    SecurityAgent --> TraditionalPath[Traditional Security Tools]
    SecurityAgent --> AIPath[AI-Powered Analysis]

    %% Traditional Security Analysis
    subgraph TraditionalPath[Traditional Security Scanning]
        ToolSelector[ToolSelector<br/>Select Security Tools]
        ScanExecutor[ScanExecutor<br/>Execute Scans]
        Slither[Slither<br/>Static Analysis Tool]
        
        ToolSelector --> ScanExecutor
        ScanExecutor --> Slither
    end

    %% AI-Powered Analysis Branch
    subgraph AIPath[AI-Powered Smart Contract Analysis]
        AIAnalyzer[AI Audit Analyzer<br/>Single File Analysis]
        ChunkedAnalyzer[Chunked AI Analyzer<br/>Multi-File/Large Repos]
        ChunkingManager[Chunking Manager<br/>File Splitting & Batching]
        BatchClient[OpenAI Batch Client<br/>Rate-Limited Processing]
        
        AIAnalyzer --> OpenAI
        ChunkedAnalyzer --> ChunkingManager
        ChunkingManager --> BatchClient
        BatchClient --> OpenAI
    end

    %% Input Type Decision
    InputHandler --> |"Single .sol file"| AIAnalyzer
    InputHandler --> |"Multiple .sol files<br/>or GitHub repo"| ChunkedAnalyzer
    InputHandler --> |"For Slither analysis"| ToolSelector

    %% Result Processing
    SecurityAgent --> ResultAggregator[Result Aggregator<br/>Merge & Deduplicate]
    SecurityAgent --> ResultSummarizer[Result Summarizer<br/>Generate Reports]
    
    TraditionalPath --> ResultAggregator
    AIPath --> ResultAggregator
    ResultAggregator --> ResultSummarizer
    ResultSummarizer --> FlaskAPI

    %% External Services
    subgraph ExternalServices[External Services]
        OpenAI[OpenAI API<br/>GPT-4o-mini/GPT-4o]
        GitHub[GitHub API<br/>Repository Access]
        CVEDatabase[(CVE Database<br/>Known Vulnerabilities)]
        AuditReports[(Past Audit Reports<br/>Knowledge Base)]
    end
    
    CVEQuery -.-> CVEDatabase
    ResultSummarizer -.-> OpenAI
    InputHandler -.-> GitHub
    AIAnalyzer -.-> AuditReports
    ChunkedAnalyzer -.-> AuditReports

    %% Data Flow Types
    SecurityAgent -.-> |"Metadata & Context"| ResultAggregator

    %% Styling
    classDef core fill:#e1f5fe,stroke:#01579b,stroke-width:2px;
    classDef analysis fill:#f3e5f5,stroke:#4a148c,stroke-width:2px;
    classDef external fill:#e8f5e8,stroke:#2e7d32,stroke-width:2px;
    classDef interface fill:#fff3e0,stroke:#e65100,stroke-width:2px;
    classDef processing fill:#fce4ec,stroke:#880e4f,stroke-width:2px;

    class SecurityAgent,InputHandler,CVEQuery,ResultAggregator,ResultSummarizer core;
    class AIAnalyzer,ChunkedAnalyzer,ChunkingManager,BatchClient,ToolSelector,ScanExecutor analysis;
    class OpenAI,GitHub,CVEDatabase,AuditReports external;
    class WebInterface,FlaskAPI interface;
    class Slither processing;

Dependency Management with UV

This project uses uv for fast Python package management. The dependencies are organized into groups:

Core: Basic web framework, document processing, GitHub integration
ai: OpenAI, LangChain, ChromaDB for AI-powered analysis
security: Slither analyzer and security tools
dev: Testing, linting, and development tools
full: All of the above

Common UV Commands

# Install different dependency groups
uv sync --extra ai         # Recommended for most users
uv sync --extra security   # For security tool integration
uv sync --extra full       # Everything

# Add new dependencies
uv add requests            # Add to core dependencies
uv add --dev pytest       # Add development dependency

# Update dependencies
uv lock --upgrade          # Update lock file
uv sync                    # Sync updated dependencies

# Run commands without activating venv
uv run python -m backend.api.server
uv run pytest

For detailed UV usage, see README-uv.md or run ./scripts/uv-commands.sh for a reference.

Requirements

Python 3.9+ (managed by uv)
Node.js 18+ (for frontend)
OpenAI API key (GPT-4 or GPT-4o-mini recommended)
GitHub token (optional, for private repositories)

Advanced Configuration

Environment Variables

Create a .env file in the project root:

OPENAI_API_KEY=your_openai_api_key
GITHUB_TOKEN=your_github_token
LOG_LEVEL=INFO
PORT=8080

Troubleshooting

UV and Virtual Environment Issues

# If UV is not found, ensure it's in your PATH
echo $PATH | grep -q "/.local/bin" || echo "Add ~/.local/bin to your PATH"

# Recreate virtual environment if corrupted
rm -rf .venv
uv venv
source .venv/bin/activate
uv sync --extra ai

# Check UV and Python versions
uv --version
uv python list

Dependency Conflicts

Some security tools have conflicting dependencies. Use separate environments:

# For Slither analysis
uv sync --extra security

# For other conflicting tools, create separate environments
uv venv --name mythril-env
uv pip install mythril

Backend Server Issues

# Check if backend is running
curl http://localhost:8080/api/status

# Run with debug logging
LOG_LEVEL=DEBUG uv run python -m backend.api.server

# Check port availability
lsof -i :8080

Frontend Issues

# Clear npm cache and reinstall
cd frontend
rm -rf node_modules package-lock.json
npm install

# Run on different port if 3000 is occupied
PORT=3001 npm start

Example Output

The security agent produces detailed reports including:

Overall security risk assessment
Detailed vulnerability findings
Technical analysis of detected issues
Remediation suggestions for each vulnerability
Severity-based categorization of issues

Troubleshooting

Backend not starting: Check that your OpenAI API key is valid and set correctly
GitHub scanning fails: Ensure your GitHub token has the necessary permissions
Frontend not loading: Make sure both backend (port 8080) and frontend (port 3000) are running

For more detailed information about GitHub scanning features, see README-github-scanning.md.

Name		Name	Last commit message	Last commit date
Latest commit History 58 Commits
backend		backend
cached_files		cached_files
examples		examples
frontend		frontend
scripts		scripts
security_agent/data/sources		security_agent/data/sources
test_contracts		test_contracts
.env.example		.env.example
.gitignore		.gitignore
.python-version		.python-version
.solhint.json		.solhint.json
README.md		README.md
ai_audit_test_results.json		ai_audit_test_results.json
backup_fix_scan_executor.py		backup_fix_scan_executor.py
complex_vulnerable.sol		complex_vulnerable.sol
demo.py		demo.py
fix_executor_return.py		fix_executor_return.py
fix_scan_executor.py		fix_scan_executor.py
fix_script.py		fix_script.py
install_security_tools.sh		install_security_tools.sh
install_slither.sh		install_slither.sh
main.py		main.py
manticore		manticore
myth		myth
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
test_ai_audit.py		test_ai_audit.py
test_ai_audit_api.py		test_ai_audit_api.py
test_contract.sol		test_contract.sol
test_github_scan.py		test_github_scan.py
test_scan.py		test_scan.py
uv.lock		uv.lock

Folders and files

Latest commit

History

Repository files navigation

Security AI Agent

Features

AI Audit Analyzer

Quick Start

Prerequisites

Installation & Setup

1. Install UV (if not already installed)

2. Clone and Setup Project

3. Setup Python Backend with UV

4. Setup Frontend

5. Configure Environment Variables

Running the Application

Start Backend Server

Start Frontend (in a new terminal)

Access the Application

Usage

Project Structure

Backend (backend/)

Frontend (frontend/)

Architecture Diagram

Dependency Management with UV

Common UV Commands

Requirements

Advanced Configuration

Environment Variables

Troubleshooting

UV and Virtual Environment Issues

Dependency Conflicts

Backend Server Issues

Frontend Issues

Example Output

Troubleshooting

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Backend (`backend/`)

Frontend (`frontend/`)

Packages