Interactive technical document exploring how to build reliable pharmaceutical AI agents that accurately assess their own confidence.
Live: ignatpenshin.github.io/pharma-agent
Enterprise pharma AI agents suffer from miscalibrated confidence — claiming high certainty on errors and low certainty on correct answers. This document presents a system architecture that makes failures predictable and recoverable rather than attempting to eliminate them.
- Taxonomy of False Confidence — 10 failure mechanisms across retrieval, reasoning, and epistemic blindness categories
- System Architecture — Layered RAG + Meta-Cognitive Classifier with 6 pre-generation signals
- 5-Zone Response System — GREEN/YELLOW/ORANGE/RED/GRAY confidence zones with distinct response behaviors and human-in-the-loop routing
- Calibration Pipeline — Two-stage confidence model (logistic regression → isotonic regression) with cold-start protocol
- Eval-for-Eval — How to evaluate the evaluator: golden datasets, per-dimension reliability, inter-annotator agreement
- Hard Questions — Honest gaps that engineering alone cannot close
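The meta-cognitive classifier's pre-generation signals can be combined into a single raw confidence score with a logistic model. A minimal sketch follows; the six signal names and the weighting scheme here are illustrative assumptions, not the system's actual signal set.

```python
import math

# Hypothetical placeholder names for the six pre-generation signals;
# the real signal definitions live in the system architecture, not here.
SIGNALS = ["retrieval_score", "source_agreement", "query_coverage",
           "evidence_recency", "nli_entailment", "domain_match"]

def raw_confidence(signals, weights, bias=0.0):
    """Combine per-signal scores in [0, 1] into one raw confidence
    via a logistic (sigmoid) model; weights come from training."""
    z = bias + sum(weights[name] * signals[name] for name in SIGNALS)
    return 1.0 / (1.0 + math.exp(-z))
```

With zero-valued signals the sigmoid sits at 0.5, i.e. maximum uncertainty, which is the desired behavior when no signal fires either way.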
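The zone-routing idea above can be sketched as a threshold lookup over the calibrated confidence, with GRAY handled separately as "no meaningful estimate" rather than "low estimate". The thresholds and behavior strings below are hypothetical; the real cut-points would come out of the calibration pipeline.

```python
# Hypothetical zone floors; production cut-points are set by calibration data.
ZONES = [
    (0.90, "GREEN",  "answer directly with citations"),
    (0.70, "YELLOW", "answer with explicit caveats"),
    (0.50, "ORANGE", "answer and flag for human review"),
    (0.00, "RED",    "refuse and escalate to a human expert"),
]

def route(confidence):
    """Map a calibrated confidence (or None for out-of-scope queries)
    to a zone name and its response behavior."""
    if confidence is None:
        # GRAY is signal-driven, not threshold-driven: the query falls
        # outside the system's scope, so no confidence is meaningful.
        return ("GRAY", "decline: question outside system scope")
    for floor, zone, behavior in ZONES:
        if confidence >= floor:
            return (zone, behavior)
    return ("RED", "refuse and escalate to a human expert")
```

Keeping GRAY out of the numeric ladder matters: an out-of-scope question should not be answered confidently just because retrieval happened to surface plausible-looking text.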
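The second stage of the two-stage confidence model, isotonic regression, can be sketched in pure Python with the pool-adjacent-violators algorithm; the first (logistic) stage is assumed to have already produced the raw scores. This is a minimal illustration, not the project's implementation.

```python
def isotonic_fit(scores, labels):
    """Pool-Adjacent-Violators: fit a monotone step function mapping
    raw score -> empirical P(correct) on held-out (score, label) pairs."""
    pairs = sorted(zip(scores, labels))
    blocks = []  # each block: [label_sum, count, min_score, max_score]
    for s, y in pairs:
        blocks.append([y, 1, s, s])
        # Merge adjacent blocks while monotonicity is violated
        # (cross-multiplied mean comparison avoids float division).
        while len(blocks) > 1 and blocks[-2][0] * blocks[-1][1] > blocks[-1][0] * blocks[-2][1]:
            y2, n2, _, hi2 = blocks.pop()
            blocks[-1][0] += y2
            blocks[-1][1] += n2
            blocks[-1][3] = hi2
    return [(lo, hi, sm / n) for sm, n, lo, hi in blocks]

def isotonic_predict(blocks, score):
    """Look up the calibrated probability for a raw score."""
    for _, hi, mean in blocks:
        if score <= hi:
            return mean
    return blocks[-1][2]
```

Isotonic regression needs enough labeled outcomes to populate the blocks, which is exactly why a cold-start protocol is required before the second stage can take over from the logistic model.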
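One standard inter-annotator agreement statistic for the golden-dataset labels is Cohen's kappa, which corrects raw agreement for chance. A self-contained sketch for two raters over categorical labels:

```python
from collections import Counter

def cohens_kappa(rater_a, rater_b):
    """Chance-corrected agreement between two raters:
    kappa = (p_observed - p_expected) / (1 - p_expected)."""
    assert len(rater_a) == len(rater_b) and rater_a
    n = len(rater_a)
    p_obs = sum(x == y for x, y in zip(rater_a, rater_b)) / n
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    # Expected agreement if both raters labeled independently at their base rates.
    p_exp = sum(counts_a[k] * counts_b[k] for k in counts_a) / (n * n)
    return (p_obs - p_exp) / (1 - p_exp)
```

Kappa of 1.0 means perfect agreement; 0.0 means agreement is no better than chance, a useful sanity floor when validating the evaluator's golden labels.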
30+ peer-reviewed references. All links verified.
- Interactive footnotes explaining technical terms (calibration, RAG, NLI, GRADE, AUROC, etc.)
- Scroll-tracking navigation rail
- Context sidebar with key metrics and zone distribution
- Responsive layout (desktop → tablet → phone)
- Color-coded confidence zones throughout
React 19 · Vite · Vanilla CSS · GitHub Pages
npm install
npm run dev

npm run deploy

Ignat Penshin — AI Engineer