An AI-powered agent that automates vendor adverse media investigations using Bright Data's web scraping infrastructure, OpenAI for risk analysis, and OpenHands SDK for intelligent monitoring script generation.
- Automated Vendor Investigation: Search for adverse media across multiple risk categories (litigation, fraud, financial issues, regulatory violations, operational risks)
- Intelligent Web Scraping: Uses Bright Data SERP API and Web Unlocker to bypass paywalls and CAPTCHAs and to access otherwise protected content
- AI Risk Analysis: OpenAI analyzes extracted content and provides severity scores with actionable recommendations
- Auto-Generated Monitoring: OpenHands SDK creates production-ready Python scripts that continuously monitor vendor sources
- REST API: FastAPI-based endpoints for easy integration into existing systems
- Browser Automation: Optional Bright Data Browser API support for JavaScript-heavy sites and complex workflows
```
┌─────────────────┐      ┌─────────────────┐      ┌─────────────────┐
│    DISCOVERY    │────▶│     ACCESS      │────▶│     ACTION      │
│   (SERP API)    │      │  (Web Unlocker) │      │  (OpenAI + SDK) │
└─────────────────┘      └─────────────────┘      └─────────────────┘
        │                        │                        │
  Search Google          Bypass paywalls           Analyze risks
  for red flags          and CAPTCHAs              Generate scripts
```
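The three stages above can be sketched as composable functions. This is an illustrative sketch only; the function names and data shapes below are assumptions, not the project's real module API:

```python
# Illustrative sketch of the Discovery -> Access -> Action pipeline.
# Names and data shapes are assumptions, not the project's real API.

def discover(vendor: str, keywords: list[str]) -> list[str]:
    """DISCOVERY: return candidate article URLs (SERP API in the real agent)."""
    slug = vendor.lower().replace(" ", "-")
    return [f"https://news.example.com/{slug}/{kw}" for kw in keywords]

def access(urls: list[str]) -> dict[str, str]:
    """ACCESS: fetch page content (Web Unlocker in the real agent)."""
    return {url: f"<article body for {url}>" for url in urls}

def act(pages: dict[str, str]) -> dict:
    """ACTION: analyze content and assign severity (OpenAI in the real agent)."""
    return {"sources": len(pages), "severity": "low" if len(pages) < 3 else "medium"}

def run_pipeline(vendor: str, keywords: list[str]) -> dict:
    # Each stage's output feeds the next, mirroring the diagram above.
    return act(access(discover(vendor, keywords)))

print(run_pipeline("Acme Corp", ["lawsuit", "fraud", "recall"]))
```

The real `src/agent.py` orchestrates the same flow with the live APIs; this sketch just shows how the stages hand off to each other.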
- Python 3.12 or higher
- Bright Data account with API access
- OpenAI API key
- OpenHands Cloud account or your own LLM API key
```bash
git clone <your-repo-url>
cd tprm-agent

# Create virtual environment
python -m venv venv
source venv/bin/activate  # Windows: venv\Scripts\activate

# Install dependencies
pip install -r requirements.txt
```

Create a `.env` file in the project root:
```bash
# Bright Data API token (for SERP API)
BRIGHT_DATA_API_TOKEN=your_api_token

# Bright Data SERP zone
BRIGHT_DATA_SERP_ZONE=your_serp_zone_name

# Bright Data Web Unlocker zone
BRIGHT_DATA_UNLOCKER_ZONE=your_unlocker_zone_name

# OpenAI (for risk analysis)
OPENAI_API_KEY=your_openai_api_key

# OpenHands (for agentic script generation)
LLM_API_KEY=your_llm_api_key
LLM_MODEL=openai/gpt-4o
```

Start the API server:
```bash
python -m uvicorn api.main:app --reload
```

Visit http://localhost:8000/docs for interactive API documentation.
Run a direct investigation:
```bash
python src/agent.py
```

Or call the REST API:

```bash
curl -X POST "http://localhost:8000/investigate" \
  -H "Content-Type: application/json" \
  -d '{
    "vendor_name": "Acme Corp",
    "categories": ["litigation", "fraud"],
    "generate_monitors": true
  }'
```

Response:
```json
{
  "investigation_id": "uuid-here",
  "status": "started",
  "message": "Investigation started for Acme Corp"
}
```

Check investigation status or service health:

```bash
curl http://localhost:8000/investigate/{investigation_id}
curl http://localhost:8000/health
```

Use the agent directly from Python:

```python
from src.agent import TPRMAgent

# Initialize agent
agent = TPRMAgent()

# Run investigation
result = agent.investigate(
    vendor_name="Acme Corp",
    categories=["litigation", "financial", "fraud"],
    generate_monitors=True,
)

# Print results
print(f"Found {result.total_sources_found} sources")
print(f"Risk assessments: {len(result.risk_assessments)}")
for assessment in result.risk_assessments:
    print(f"\n[{assessment.category}] Severity: {assessment.severity}")
    print(f"Summary: {assessment.summary}")
```

Project structure:

```
tprm-agent/
├── src/
│   ├── __init__.py
│   ├── config.py        # Configuration and settings
│   ├── discovery.py     # SERP API integration
│   ├── access.py        # Web Unlocker integration
│   ├── actions.py       # OpenAI + OpenHands SDK
│   ├── agent.py         # Main orchestration
│   └── browser.py       # Browser API (optional)
├── api/
│   └── main.py          # FastAPI endpoints
├── scripts/
│   └── generated/       # Auto-generated monitoring scripts
├── .env                 # Environment variables (not in git)
├── requirements.txt     # Python dependencies
├── Procfile             # Railway deployment config
└── README.md
```
To deploy on Railway:

1. Install the Railway CLI:

   ```bash
   npm i -g @railway/cli
   ```

2. Log in and initialize:

   ```bash
   railway login
   railway init
   ```

3. Deploy:

   ```bash
   railway up
   ```

4. Add environment variables in the Railway dashboard → Variables.

5. Generate a domain:

   ```bash
   railway domain
   ```

To deploy on Render:

1. Create `render.yaml` (already included).
2. Push to GitHub.
3. Connect to Render:
   - Go to render.com
   - New → Web Service
   - Connect your repository
   - Render auto-detects the configuration
4. Add environment variables in the Render dashboard.
After an investigation, auto-generated monitoring scripts are saved in `scripts/generated/`:

```bash
cd scripts/generated
python monitor_acme_corp.py
```

To schedule daily runs, edit your crontab (`crontab -e`) and add:

```
0 9 * * * cd /path/to/tprm-agent/scripts/generated && python3 monitor_acme_corp.py
```
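The exact contents of a generated script depend on the OpenHands SDK prompt, but a monitor of this kind typically keeps a record of URLs it has already seen and reports only new ones. A minimal sketch of that shape, where the JSON state file and diff logic are illustrative assumptions rather than the actual generated output:

```python
# Hypothetical shape of a generated monitor script. The real output
# depends on the OpenHands SDK prompt; the JSON state file used here
# for seen-URL persistence is an assumption for illustration.
import json
from pathlib import Path

STATE_FILE = Path("monitor_state.json")

def load_seen() -> set[str]:
    """Load the set of URLs reported on previous runs."""
    if STATE_FILE.exists():
        return set(json.loads(STATE_FILE.read_text()))
    return set()

def diff_new(current_urls: list[str], seen: set[str]) -> list[str]:
    """Return only the URLs not seen on a previous run."""
    return [u for u in current_urls if u not in seen]

def save_seen(seen: set[str], new: list[str]) -> None:
    """Persist the union of old and new URLs for the next run."""
    STATE_FILE.write_text(json.dumps(sorted(seen | set(new))))

if __name__ == "__main__":
    seen = load_seen()
    # In a real script these URLs would come from the SERP API / Web Unlocker.
    current = ["https://example.com/acme-lawsuit"]
    new = diff_new(current, seen)
    if new:
        print(f"New adverse media for Acme Corp: {new}")
    save_seen(seen, new)
```

Run under cron as shown above, a script like this only alerts when something genuinely new appears.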
1. SERP API zone:
   - Go to the Bright Data dashboard → Proxies & Scraping Infrastructure
   - Add → SERP API
   - Copy the zone name and API token
2. Web Unlocker zone:
   - Add → Web Unlocker
   - Copy the zone name
3. Browser API (optional):
   - Add → Scraping Browser
   - Copy the credentials for advanced scenarios
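Once the zones exist, the token and zone names from `.env` are what drive each request. A hedged sketch of assembling a SERP request against Bright Data's direct request API — the endpoint and payload shape follow their documented `/request` pattern, but confirm against the current Bright Data docs before relying on it:

```python
# Sketch of how the zone credentials from .env are used. Endpoint and
# payload shape should be confirmed against Bright Data's current docs.
import os
from urllib.parse import quote_plus

API_URL = "https://api.brightdata.com/request"

def build_serp_request(query: str) -> tuple[str, dict, dict]:
    """Build (endpoint, headers, payload) for a Google search via the SERP zone."""
    headers = {
        "Authorization": f"Bearer {os.environ.get('BRIGHT_DATA_API_TOKEN', '')}",
    }
    payload = {
        "zone": os.environ.get("BRIGHT_DATA_SERP_ZONE", "serp_zone"),
        "url": f"https://www.google.com/search?q={quote_plus(query)}",
        "format": "raw",
    }
    return API_URL, headers, payload

endpoint, headers, payload = build_serp_request('"Acme Corp" lawsuit')
print(endpoint)
print(payload["url"])
```

The Web Unlocker zone works the same way, with a target article URL in place of the Google search URL.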
Modify `src/discovery.py` to add custom risk categories:

```python
RISK_CATEGORIES = {
    "litigation": ["lawsuit", "litigation", "sued"],
    "financial": ["bankruptcy", "insolvency", "debt"],
    "fraud": ["fraud", "scam", "investigation"],
    "regulatory": ["violation", "fine", "penalty"],
    "operational": ["recall", "safety issue", "disruption"],
    # Add your custom categories here
}
```

Request body for `POST /investigate`:

```json
{
  "vendor_name": "string (required)",
  "categories": ["litigation", "fraud"],
  "generate_monitors": true
}
```

`categories` is optional; `generate_monitors` is optional and defaults to `true`.

Response:

```json
{
  "vendor_name": "string",
  "started_at": "ISO 8601 timestamp",
  "completed_at": "ISO 8601 timestamp",
  "total_sources_found": 0,
  "total_sources_accessed": 0,
  "risk_assessments": [
    {
      "vendor_name": "string",
      "category": "string",
      "severity": "low|medium|high|critical",
      "summary": "string",
      "key_findings": ["string"],
      "sources": ["url"],
      "recommended_actions": ["string"],
      "assessed_at": "ISO 8601 timestamp"
    }
  ],
  "monitoring_scripts": [
    {
      "vendor_name": "string",
      "script_path": "string",
      "urls_monitored": ["url"],
      "check_frequency": "daily",
      "created_at": "ISO 8601 timestamp"
    }
  ],
  "errors": ["string"]
}
```

For JavaScript-heavy sites or complex workflows, use the optional Browser API:
```python
from src.browser import BrowserClient
import asyncio

async def search_court_registry():
    client = BrowserClient()
    result = await client.fill_and_submit_form(
        url="https://court-registry.example.com/search",
        form_data={
            "#party-name": "Acme Corp",
            "#case-type": "civil",
        },
        submit_selector="#search-button",
        result_selector=".search-results",
    )
    if result.success:
        print(f"Court records: {result.text}")

asyncio.run(search_court_registry())
```

Authentication errors with Bright Data:
- Verify the API token and zone names in `.env`
- Check zone status in the Bright Data dashboard
- Ensure zones have available bandwidth
OpenAI rate limits:
- Add retry logic or reduce concurrent requests
- Upgrade OpenAI plan for higher limits
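The retry logic mentioned above can be a small generic helper with exponential backoff; this sketch is client-agnostic, so adapt `retry_on` to the exception types your OpenAI client library actually raises:

```python
# Generic retry-with-backoff helper for transient failures such as
# rate limits. Adapt retry_on to your client library's exception types.
import time

def with_retries(fn, attempts=4, base_delay=1.0, retry_on=(Exception,), sleep=time.sleep):
    """Call fn(); on a retryable error, wait base_delay * 2**i and try again."""
    for i in range(attempts):
        try:
            return fn()
        except retry_on:
            if i == attempts - 1:
                raise  # out of attempts: surface the last error
            sleep(base_delay * (2 ** i))

# Usage sketch: wrap the analysis call that hits the rate limit.
# result = with_retries(lambda: analyze_vendor_content(text))
```

The injectable `sleep` parameter keeps the helper testable without real delays.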
Generated scripts not using Bright Data:
- Check that the OpenHands SDK prompt includes Web Unlocker instructions
- Verify `actions.py` has the correct prompt template
Deployment failures:
- Check dependency sizes (Railway/Render support larger apps than Vercel)
- Verify `Procfile` exists and is correct
- Ensure all environment variables are set
- Parallel requests: Modify `access.py` to use `asyncio` for faster content extraction
- Caching: Implement Redis caching for repeated vendor investigations
- Rate limiting: Add rate limiting middleware to prevent API abuse
- Database: Use PostgreSQL to persist investigation history
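The caching idea above can be prototyped without Redis. This in-memory TTL cache sketches the same pattern — key by vendor name, expire after a fixed window — with an injectable clock so it stays testable; in production the store would be swapped for Redis:

```python
# In-memory TTL cache illustrating the Redis caching idea from the list
# above. The injectable clock is for testing; production would use Redis.
import time

class TTLCache:
    def __init__(self, ttl_seconds: float, clock=time.monotonic):
        self.ttl = ttl_seconds
        self.clock = clock
        self._store = {}  # key -> (expires_at, value)

    def get(self, key):
        entry = self._store.get(key)
        if entry and entry[0] > self.clock():
            return entry[1]
        self._store.pop(key, None)  # drop expired entries lazily
        return None

    def set(self, key, value):
        self._store[key] = (self.clock() + self.ttl, value)

def cached_investigate(cache, vendor, investigate):
    """Reuse a prior result for the same vendor while it is still fresh."""
    result = cache.get(vendor)
    if result is None:
        result = investigate(vendor)
        cache.set(vendor, result)
    return result
```

Wrapping `TPRMAgent.investigate` this way avoids re-running full investigations for vendors queried twice in quick succession.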
- Never commit the `.env` file to git
- Use Railway/Render environment variables in production
- Rotate API keys regularly
- Implement authentication on API endpoints for production use
- Fork the repository
- Create a feature branch (`git checkout -b feature/amazing-feature`)
- Commit changes (`git commit -m 'Add amazing feature'`)
- Push to the branch (`git push origin feature/amazing-feature`)
- Open a Pull Request
MIT License - see LICENSE file for details
- Bright Data SERP API Documentation
- Bright Data Web Unlocker Documentation
- Bright Data Browser API Documentation
- OpenHands SDK Documentation
- OpenAI API Documentation
- FastAPI Documentation
For issues and questions:
- Open an issue on GitHub
- Check Bright Data documentation
- Review OpenHands SDK examples
- Add support for SEC filings and OFAC sanctions lists
- Implement PostgreSQL database for investigation history
- Add Slack/Teams webhook notifications
- Build React dashboard for visualization
- Support batch vendor investigations
- Add PDF report generation
- Implement user authentication
- Add multi-language support