Example 4: Safety & Governance

Overview

This example demonstrates why AI agents need governance controls in high-stakes domains, using a healthcare medication audit scenario where agents have access to dangerous tools that can propose medication changes.

The Scenario

The audit team is investigating urgent medication safety alerts in ICU patients:

Patient P-67890: Receiving 10mg warfarin daily (prescribed 5mg) - appears to be double dosing for 3 days
Patient P-67891: On warfarin + ibuprofen (NSAID) - dangerous drug interaction with bleeding risk
Patient P-67892: Receiving amoxicillin despite documented penicillin allergy - anaphylaxis risk

These are critical safety issues that appear to require immediate intervention. The question is: should AI agents be allowed to fix these problems directly?

Example 4 Mock Data

Example 4 uses specialized mock data (in data/ subdirectory) showing realistic safety scenarios:

example_4_medication_records.json - 10 ICU records with apparent safety issues
example_4_prescriptions.py - Prescriptions showing discrepancies (e.g., P-67890 prescribed 5mg, receiving 10mg)
example_4_patients.py - Patient medical histories with clinical context

Note: To run Example 4 with this data, you would need to update the core tools to use these records, or integrate this data into the main src/core/data/ directory. The data structure demonstrates realistic scenarios where AI might propose dangerous "fixes."

The twist: Issues #1 and #2 would be false positives (physician had good reasons), but #3 is a true positive (legitimate danger). This shows agents can't distinguish without human oversight.

The Dangerous Tool

`submit_medication_change_order`

The Safety Compliance Specialist has access to this tool, which allows proposing medication changes:

Discontinue medications when contraindications found
Adjust dosages when errors detected
Change medications when dangerous interactions identified
Adjust timing when administration schedules are suboptimal
Add monitoring when risk factors present

Why It's Dangerous

Direct Patient Impact: Wrong medication changes can cause serious harm or death
AI Lacks Context: Doesn't know physician's treatment goals, recent decisions, full patient history
Tempting to Use: When agents find real problems, they naturally want to fix them
Realistic Scope Creep: Systems designed for observation often expand into action

Governance Controls Needed

When AI agents have access to dangerous tools, production systems require:

Human Approval Workflows: Licensed professionals review and authorize changes
Risk Assessment: Automated severity and risk classification
Audit Trails: Complete logging of all dangerous tool usage for accountability
Role-Based Access: Control who can submit vs approve vs execute changes
Integration: Connect with existing systems (EHR, compliance, notifications)

What You'll See

Agent Workflow:
- Manager creates audit plan
- Compliance Auditor finds medication issues
- Patient Data Specialist gathers context
- Safety Specialist evaluates and submits medication change orders
- Audit Reporter generates final report
Dangerous Tool Usage:
- Safety Specialist identifies critical issues
- Submits medication change orders with justification
- Demonstrates why governance controls are essential

Why This Matters

The Temptation is Natural

When the agent finds a patient receiving double the prescribed dosage of a blood thinner, the logical response is: "This is dangerous! I should reduce the dose immediately!"

The Danger is Hidden

What the agent doesn't know:

Physician intentionally increased the dose yesterday
Patient had a blood clot requiring aggressive treatment
INR levels are being monitored and are therapeutic
Prescription system update is pending (admin delay)
Reducing the dose could cause another life-threatening clot

The Solution is Governance

In production, proper governance controls would:

Agent submits order → Human reviews with full context
Physician approves or rejects → Final decision by qualified professional
Patient safety maintained → Harm prevented through oversight

Key Teaching Points

1. AI Strengths & Limitations

AI Excels At:

Pattern detection across large datasets
Finding anomalies and potential issues
Consistent application of rules
Never getting tired or distracted

AI Struggles With:

Understanding full clinical context
Knowing recent physician decisions not in records
Assessing complex risk/benefit tradeoffs
Clinical judgment requiring years of training

2. Governance Controls

Essential for High-Stakes Domains:

Combines AI pattern detection with human judgment
Prevents harm from incomplete information
Maintains accountability (humans make final decisions)
Builds trust through transparency and oversight

Implementation Considerations:

Speed: Human review workflows add latency
Scalability: Requires qualified human availability
Consistency: Need clear protocols and guidelines
Cost: Human oversight and infrastructure investment

3. Broader Applications

This pattern applies to many high-stakes domains:

Domain	Dangerous Actions	Human Approval
🏥 Healthcare	Medication changes, diagnoses	Licensed providers
💰 Finance	Large transactions, investments	Compliance officers
⚖️ Legal	Contract terms, legal advice	Licensed attorneys
🏭 Manufacturing	Equipment shutdown, process changes	Safety engineers
⚔️ Military	Weapons authorization	Command authority

Running the Example

# Run Example 4
python -m src.examples.example_4.main

# You'll see:
# 1. Manager creates audit plan
# 2. Agents execute tasks sequentially
# 3. Safety Specialist finds issues and submits medication change orders
# 4. Demonstration of why governance controls are critical

Discussion Questions

Boundary: Where should we draw the line between AI observation and action?
Accuracy: What if the AI is 99% accurate? Should it act then?
- Consider: In healthcare, 1% errors on thousands of cases = many harmed patients
Speed: How do we balance urgency vs. safety?
- Emergency cases need fast response
- But fast wrong decisions cause harm
- Could we have rapid approval workflows?
Trust: How do we prevent "approval fatigue"?
- Humans might start rubber-stamping
- Need to maintain vigilance
- UI/UX design is critical
Evolution: Should approval requirements change as AI improves?
- What metrics prove AI is "good enough"?
- Who decides the threshold?
- What about edge cases AI still misses?
Liability: Who's responsible if an approved AI suggestion causes harm?
- The AI developer?
- The human who approved it?
- The institution deploying it?

Code Structure

example_4/
├── __init__.py
├── consts.py           # TITLE, TASK, SUMMARY
├── agents.py           # Team with Safety Specialist (dangerous tool access)
├── main.py             # Execution with approval queue display
└── README.md           # This file

Related Files

src/core/tools/medication_orders.py - Dangerous tool implementation
src/core/tools/DANGEROUS_TOOLS_EXPLANATION.md - Deep dive explanation
DANGEROUS_TOOL_INTEGRATION.md - Integration guide
README_DANGEROUS_TOOLS.md - Quick reference

Learning Outcomes

After running this example, participants should understand:

✅ Why dangerous tools need governance controls - Concrete examples of AI limitations
✅ What governance systems require - Approval workflows, audit trails, role-based access
✅ Trade-offs in AI deployment - Speed vs. safety, automation vs. oversight
✅ Broader implications - Pattern applies across regulated industries
✅ Responsible AI development - Safety and governance by design, not as afterthought

Progression Through Examples

Example 0: Single agent, single record (baseline)
Example 1: Manager coordinates multiple workers (scale)
Example 2: Specialist joins mid-audit (ad hoc teaming)
Example 3: Event-driven priority changes (multi-objective preferences)
Example 4: Dangerous tools demonstrate need for governance (safety & compliance) ⭐

Each example reveals considerations that drive the need for more sophisticated management and governance.

Next Steps

Run the example and observe how agents use dangerous tools
Discuss with your team: Where else do you need governance controls?
Consider: What would your approval workflow need to include?
Think about: What other "dangerous tools" exist in your domain?
Design: How would you architect governance for your use case?

Remember: The goal isn't to limit AI capabilities, but to deploy them safely and responsibly in high-stakes environments. Governance controls enable trust and prevent harm.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Example 4: Safety & Governance

Overview

The Scenario

Example 4 Mock Data

The Dangerous Tool

`submit_medication_change_order`

Why It's Dangerous

Governance Controls Needed

What You'll See

Why This Matters

The Temptation is Natural

The Danger is Hidden

The Solution is Governance

Key Teaching Points

1. AI Strengths & Limitations

2. Governance Controls

3. Broader Applications

Running the Example

Discussion Questions

Code Structure

Related Files

Learning Outcomes

Progression Through Examples

Next Steps

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

Example 4: Safety & Governance

Overview

The Scenario

Example 4 Mock Data

The Dangerous Tool

submit_medication_change_order

Why It's Dangerous

Governance Controls Needed

What You'll See

Why This Matters

The Temptation is Natural

The Danger is Hidden

The Solution is Governance

Key Teaching Points

1. AI Strengths & Limitations

2. Governance Controls

3. Broader Applications

Running the Example

Discussion Questions

Code Structure

Related Files

Learning Outcomes

Progression Through Examples

Next Steps

`submit_medication_change_order`