A curated list of research papers and resources focused on AI agents for disease-specific applications, covering neurodegenerative diseases (Alzheimer’s, Parkinson’s), metabolic diseases (Diabetes), cardiovascular diseases, cancer, mental health disorders, and more. This collection emphasizes AI agent systems that directly support disease diagnosis, treatment, drug discovery, and clinical decision-making through LLMs, multi-agent systems, and tool integrations.
graph TB
A["🏥 AI Agents for Disease"] --> B["Cancer &<br/>Oncology"]
A --> C["Mental Health &<br/>Psychiatry"]
A --> D["Neurodegenerative<br/>Diseases"]
A --> E["Cardiovascular<br/>Diseases"]
A --> F["Cerebrovascular<br/>& Stroke"]
A --> G["Metabolic<br/>Diseases"]
A --> H["Genetic &<br/>Rare Diseases"]
A --> I["Infectious<br/>Diseases"]
A --> J["Other Specialties"]
A --> K["General &<br/>Cross-Disease"]
J --> J1["Dermatology"]
J --> J2["Dental"]
J --> J3["Hepatology"]
J --> J4["Musculoskeletal"]
J --> J5["Respiratory"]
J --> J6["Wound Care"]
J --> J7["Chronic Pain"]
K --> K1["General Clinical AI<br/>(High-Impact)"]
K --> K2["Drug Discovery"]
K --> K3["Datasets &<br/>Benchmarks"]
K --> K4["Surveys"]
We will try to keep this list updated. If you find any errors or any missing papers, please don’t hesitate to open issues or pull requests.
- Latest Papers
- Papers by Disease Category
- 1. Cancer & Oncology
- 2. Mental Health & Psychiatric Disorders
- 3. Neurodegenerative Diseases
- 4. Cardiovascular Diseases
- 5. Cerebrovascular & Stroke
- 6. Metabolic Diseases
- 7. Genetic & Rare Diseases
- 8. Infectious Diseases
- 9. Dermatology
- 10. Dental & Oral Health
- 11. Hepatology
- 12. Musculoskeletal Diseases
- 13. Respiratory & Pulmonary
- 14. Wound Care & Pressure Injuries
- 15. Chronic Pain
- 16. General Clinical AI (High-Impact)
- 17. Drug Discovery & Development
- 18. Datasets & Benchmarks
- 19. Related Surveys
Papers below are listed chronologically. For disease-specific organization, see Papers by Disease Category.
- [JMIR Medical Informatics 2026] Prompting and Fine-Tuning Large Language Models for Parkinson Disease Diagnosis [paper]
- [arxiv 2026.3] Meissa: Multi-modal Medical Agentic Intelligence [paper] [Github]
- [ICLR 2026] CARE: Towards Clinical Accountability in Multi-Modal Medical Reasoning with an Evidence-Grounded Agentic Framework [paper]
- [ICLR 2026] ATPO: Adaptive Tree Policy Optimization for Multi-Turn Medical Dialogue [paper]
- [EACL 2026 Workshop] Do Mixed-Vendor Multi-Agent LLMs Improve Clinical Diagnosis? [paper]
- [arxiv 2026.3] MedCoRAG: Interpretable Hepatology Diagnosis via Hybrid Evidence Retrieval and Multispecialty Consensus [paper]
- [arxiv 2026.3] MedCollab: Causal-Driven Multi-Agent Collaboration for Full-Cycle Clinical Diagnosis via IBIS-Structured Argumentation [paper]
- [arxiv 2026.3] From Conflict to Consensus: Boosting Medical Reasoning via Multi-Round Agentic RAG [paper] [Github]
- [arxiv 2026.3] MIND: Unified Inquiry and Diagnosis RL with Criteria Grounded Clinical Supports for Psychiatric Consultation [paper]
- [arxiv 2026.3] DUCX: Decomposing Unfairness in Tool-Using Chest X-ray Agents [paper]
- [arxiv 2026.3] OPGAgent: An Agent for Auditable Dental Panoramic X-ray Interpretation [paper]
- [arxiv 2026.3] TARSE: Test-Time Adaptation via Retrieval of Skills and Experience for Reasoning Agents [paper]
- [arxiv 2026.3] ProtRLSearch: A Multi-Round Multimodal Protein Search Agent with Large Language Models Trained via Reinforcement Learning [paper]
- [arxiv 2026.3] A Multi-Agent Framework for Interpreting Multivariate Physiological Time Series [paper]
- [arxiv 2026.2] 3DMedAgent: Unified Perception-to-Understanding for 3D Medical Analysis [paper]
- [arxiv 2026.2] Can Agents Distinguish Visually Hard-to-Separate Diseases in a Zero-Shot Setting? [paper] [Github]
- [arxiv 2026.2] Which Tool Response Should I Trust? Tool-Expertise-Aware Chest X-ray Agent with Multimodal Agentic Learning [paper]
- [arxiv 2026.2] MedClarify: An Information-Seeking AI Agent for Medical Diagnosis with Case-Specific Follow-up Questions [paper]
- [arxiv 2026.2] LAMMI-Pathology: A Tool-Centric Bottom-Up LVLM-Agent Framework for Molecularly Informed Medical Intelligence in Pathology [paper]
- [arxiv 2026.2] NutriOrion: A Hierarchical Multi-Agent Framework for Personalized Nutrition Intervention Grounded in Clinical Guidelines [paper]
- [arxiv 2026.2] TRACE: Temporal Reasoning via Agentic Context Evolution for Streaming Electronic Health Records [paper]
- [arxiv 2026.2] CoMMa: Contribution-Aware Medical Multi-Agents From A Game-Theoretic Perspective [paper]
- [AAAI 2026 Workshop] SynthAgent: A Multi-Agent LLM Framework for Realistic Patient Simulation [paper]
- [arxiv 2026.2] MedCoG: Maximizing LLM Inference Density in Medical Reasoning via Meta-Cognitive Regulation [paper]
- [arxiv 2026.2] Picking the Right Specialist: Attentive Neural Process-based Selection of Task-Specialized Models as Tools for Agentic Healthcare Systems [paper]
- [arxiv 2026.2] A Multi-Agent Framework for Medical AI: Leveraging Fine-Tuned GPT, LLaMA, and DeepSeek R1 for Evidence-Based and Bias-Aware Clinical Query Processing [paper]
- [arxiv 2026.2] MedScope: Incentivizing "Think with Videos" for Clinical Reasoning via Coarse-to-Fine Tool Calling [paper]
- [arxiv 2026.2] MedXIAOHE: A Comprehensive Recipe for Building Medical MLLMs [paper]
- [arxiv 2026.2] Advancing AI Trustworthiness Through Patient Simulation: Risk Assessment of Conversational Agents for Antidepressant Selection [paper]
- [arxiv 2026.2] LiveMedBench: A Contamination-Free Medical Benchmark for LLMs with Automated Rubric Evaluation [paper]
- [arxiv 2026.2] Closing Reasoning Gaps in Clinical Agents with Differential Reasoning Learning [paper]
- [ICHI 2026] Human-Guided Agentic AI for Multimodal Clinical Prediction: Lessons from the AgentDS Healthcare Benchmark [paper]
- [arxiv 2026.2] ALPACA: A Reinforcement Learning Environment for Medication Repurposing and Treatment Optimization in Alzheimer's Disease [paper]
- [arxiv 2026.2] MedSAM-Agent: Empowering Interactive Medical Image Segmentation with Multi-turn Agentic Reinforcement Learning [paper] [Github]
- [arxiv 2026.2] Pruning Minimal Reasoning Graphs for Efficient Retrieval-Augmented Generation [paper]
- [arxiv 2026.2] RE-MCDF: Closed-Loop Multi-Expert LLM Reasoning for Knowledge-Grounded Clinical Diagnosis [paper]
- [arxiv 2026.2] ExperienceWeaver: Optimizing Small-sample Experience Learning for LLM-based Clinical Text Improvement [paper]
- [arxiv 2026.1] EvoClinician: A Self-Evolving Agent for Multi-Turn Medical Diagnosis via Test-Time Evolutionary Learning [paper] [Github]
- [arxiv 2026.1] Scaling Medical Reasoning Verification via Tool-Integrated Reinforcement Learning [paper]
- [arxiv 2026.1] DEEPMED: Building a Medical DeepResearch Agent via Multi-hop Med-Search Data [paper]
- [arxiv 2026.1] AgentsEval: Clinically Faithful Evaluation of Medical Imaging Reports via Multi-Agent Reasoning [paper]
- [arxiv 2026.1] AgentEHR: Advancing Autonomous Clinical Decision-Making via Retrospective Summarization [paper]
- [arxiv 2026.1] MedConsultBench: A Full-Cycle, Fine-Grained, Process-Aware Benchmark for Medical Consultation Agents [paper]
- [EACL 2026] Knowing When to Abstain: Medical LLMs Under Clinical Uncertainty [paper]
- [arxiv 2026.1] Route, Retrieve, Reflect, Repair: Self-Improving Agentic Framework for Visual Detection and Linguistic Reasoning in Medical Imaging [paper] [Github]
- [arxiv 2026.1] MEDVISTAGYM: A Scalable Training Environment for Thinking with Medical Images via Tool-Integrated Reinforcement Learning [paper]
- [arxiv 2026.1] MedEinst: Benchmarking the Einstellung Effect in Medical LLMs through Counterfactual Differential Diagnosis [paper]
- [arxiv 2026.1] DemMA: Dementia Multi-Turn Dialogue Agent with Expert-Guided Reasoning and Action Simulation [paper]
- [arxiv 2026.1] IBISAgent: Reinforcing Pixel-Level Visual Reasoning in MLLMs for Universal Biomedical Object Referring and Segmentation [paper]
- [arxiv 2026.1] Bayesian Orchestration of Multi-LLM Agents for Cost-Aware Sequential Decision-Making [paper]
- [arxiv 2026.1] An Explainable Agentic AI Framework for Uncertainty-Aware and Abstention-Enabled Acute Ischemic Stroke Imaging Decisions [paper]
- [ICLR 2026] MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale [paper] [Github]
- [AAAI 2026] LungNoduleAgent: A Collaborative Multi-Agent System for Precision Diagnosis of Lung Nodules [paper] [Github]
- [Nature Communications 2026] Wearable Intelligent Throat Enables Natural Speech in Stroke Patients with Dysarthria [paper]
- [npj Digital Medicine 2026] MoMA: a mixture-of-multimodal-agents architecture for enhancing clinical prediction modelling [paper]
- [Comput Biol Med 2026] Multimodal diagnosis of Parkinson's disease with an internet-based collaborative agent architecture of medical language models [paper]
- [arXiv 2025.02] Agentic AI for Scaling Diagnosis and Care in Neurodegenerative Disease [paper]
- [medRxiv 2025.10] Agentic Generative AI System for Classification of Pathology-Confirmed Primary Progressive Aphasia Variants [paper]
- [CHI 2025] Patrika: AI-Enabled Conversational Journaling for Advancing Parkinson's Disease Symptom Tracking [paper]
- [MDS 2025] A Conversational GPT Agent for Parkinson's Disease (PD-GPT) [paper]
- [IEEE JBHI 2026] Leveraging Large Language Models for Personalized Parkinson's Disease Treatment [paper]
- [Scientific Reports 2025] Detection and diagnosis of diabetic retinopathy in retinal fundus images using agentic AI approaches [paper]
- [Scientific Reports 2025] My diabetes care: an AI-based mobile app with conversational agent for type 2 diabetes self-management [paper]
- [J Healthcare Informatics Research 2025] Assessing the User Experience of an LLM-Based Conversational Assistant in Diabetes Mellitus Care [paper]
- [medRxiv 2025] Developing a GraphRAG-enabled local-LLM for Gestational Diabetes Mellitus [paper]
- [arxiv 2025.12] Hybrid-Code: A Privacy-Preserving, Redundant Multi-Agent Framework for Reliable Local Clinical Coding [paper]
- [arxiv 2025.12] ClinDEF: A Dynamic Evaluation Framework for Large Language Models in Clinical Reasoning [paper]
- [arxiv 2025.12] HARMON-E: Hierarchical Agentic Reasoning for Multimodal Oncology Notes to Extract Structured Data [paper]
- [arxiv 2025.12] Bidirectional human-AI collaboration in brain tumour assessments improves both expert human and AI agent performance [paper]
- [arxiv 2025.12] On-device Large Multi-modal Agent for Human Activity Recognition [paper]
- [arxiv 2025.12] Scalably Enhancing the Clinical Validity of a Task Benchmark with Physician Oversight [paper]
- [arxiv 2025.12] Agent-Based Output Drift Detection for Breast Cancer Response Prediction in a Multisite Clinical Decision Support System [paper]
- [arxiv 2025.12] ReX-MLE: The Autonomous Agent Benchmark for Medical Imaging Challenges [paper]
- [arxiv 2025.12] AdaSearch: Balancing Parametric Knowledge and Search in Large Language Models via Reinforcement Learning [paper]
- [arxiv 2025.12] A Multi-Agent Large Language Model Framework for Automated Qualitative Analysis [paper]
- [arxiv 2025.12] Mapis: A Knowledge-Graph Grounded Multi-Agent Framework for Evidence-Based PCOS Diagnosis [paper]
- [arxiv 2025.12] INFORM-CT: INtegrating LLMs and VLMs FOR Incidental Findings Management in Abdominal CT [paper]
- [arxiv 2025.12] Multi-Agent Medical Decision Consensus Matrix System: An Intelligent Collaborative Framework for Oncology MDT Consultations [paper]
- [arxiv 2025.12] Incentivizing Tool-augmented Thinking with Images for Medical Image Analysis [paper]
- [arxiv 2025.12] MedInsightBench: Evaluating Medical Analytics Agents Through Multi-Step Insight Discovery in Multimodal Medical Data [paper]
- [arxiv 2025.12] MedAI: Evaluating TxAgent's Therapeutic Agentic Reasoning in the NeurIPS CURE-Bench Competition [paper] [Benchmark & Competition]
- [arxiv 2025.12] CP-Env: Evaluating Large Language Models on Clinical Pathways in a Controllable Hospital Environment [paper] [Github]
- [arxiv 2025.12] AutoMedic: An Automated Evaluation Framework for Clinical Conversational Agents with Medical Dataset Grounding [paper]
- [arxiv 2025.12] Multi-Agent Intelligence for Multidisciplinary Decision-Making in Gastrointestinal Oncology [paper]
- [arxiv 2025.12] DART: Leveraging Multi-Agent Disagreement for Tool Recruitment in Multimodal Reasoning [paper]
- [arxiv 2025.12] ClinNoteAgents: An LLM Multi-Agent System for Predicting and Interpreting Heart Failure 30-Day Readmission from Clinical Notes [paper]
- [arxiv 2025.12] MCP-AI: Protocol-Driven Intelligence Framework for Autonomous Reasoning in Healthcare [paper]
- [ICCV 2025 Highlight] Multi-Aspect Knowledge-Enhanced Medical Vision-Language Pretraining with Multi-Agent Data Generation [paper] [Github]
- [arxiv 2025.12] Thucy: An LLM-based Multi-Agent System for Claim Verification across Relational Databases [paper]
- [arxiv 2025.12] Many-to-One Adversarial Consensus: Exposing Multi-Agent Collusion Risks in AI-Based Healthcare [paper]
- [arxiv 2025.12] FinAgent: An Agentic AI Framework Integrating Personal Finance and Nutrition Planning [paper]
- [arxiv 2025.12] Radiologist Copilot: Agentic AI Assistant for Holistic Radiology Reporting with Quality Control [paper]
- [arxiv 2025.12] UCAgents: Unidirectional Convergence for Visual Evidence Anchored Multi-Agent Medical Decision-Making [paper]
- [arxiv 2025.12] First, do NOHARM: towards clinically safe large language models [paper]
- [arxiv 2025.12] Causal Reinforcement Learning based Agent-Patient Interaction with Clinical Domain Knowledge [paper]
- [arxiv 2025.11] MedEyes: Learning Dynamic Visual Focus for Medical Progressive Diagnosis [paper] [GitHub]
- [arxiv 2025.11] MedSAM3: Delving into Segment Anything with Medical Concepts [paper] [Github]
- [arxiv 2025.11] SurvAgent: Hierarchical CoT-Enhanced Case Banking and Dichotomy-Based Multi-Agent System for Multimodal Survival Prediction [paper]
- [arxiv 2025.11] KOM: A Multi-Agent Artificial Intelligence System for Precision Management of Knee Osteoarthritis (KOA) [paper]
- [arxiv 2025.11] KRAL: Knowledge and Reasoning Augmented Learning for LLM-assisted Clinical Antimicrobial Therapy [paper]
- [arxiv 2025.11] Medical Malice: A Dataset for Context-Aware Safety in Healthcare LLMs [paper]
- [arxiv 2025.11] MedBench v4: A Robust and Scalable Benchmark for Evaluating Chinese Medical Language Models, Multimodal Models, and Intelligent Agents [paper]
- [arxiv 2025.11] Fair-GNE: Generalized Nash Equilibrium-Seeking Fairness in Multiagent Healthcare Automation [paper]
- [arxiv 2025.11] MedDCR: Learning to Design Agentic Workflows for Medical Coding [paper]
- [arxiv 2025.11] Grounded by Experience: Generative Healthcare Prediction Augmented with Hierarchical Agentic Retrieval [paper]
- [arxiv 2025.11] OEMA: Ontology-Enhanced Multi-Agent Collaboration Framework for Zero-Shot Clinical Named Entity Recognition [paper]
- [arxiv 2025.11] From Passive to Proactive: A Multi-Agent System with Dynamic Task Orchestration for Intelligent Medical Pre-Consultation [paper]
- [arxiv 2025.11] Fine-Tuning DialoGPT on Common Diseases in Rural Nepal for Medical Conversations [paper]
- [arxiv 2025.10] Traj-CoA: Patient Trajectory Modeling via Chain-of-Agents for Lung Cancer Risk Prediction [paper]
- [arxiv 2025.10] FT-ARM: Fine-Tuned Agentic Reflection Multimodal Language Model for Pressure Ulcer Severity Classification with Reasoning [paper]
- [arxiv 2025.10] SNOMED CT-powered Knowledge Graphs for Structured Clinical Data and Diagnostic Reasoning [paper]
- [arxiv 2025.10] MedCoAct: Confidence-Aware Multi-Agent Collaboration for Complete Clinical Decision [paper]
- [arxiv 2025.10] Haibu Mathematical-Medical Intelligent Agent:Enhancing Large Language Model Reliability in Medical Tasks via Verifiable Reasoning Chains [paper]
- [arxiv 2025.10] Reinforcement Learning for Clinical Reasoning: Aligning LLMs with ACR Imaging Appropriateness Criteria [paper]
- [EMNLP 2025 Industry] CLARITY: Clinical Assistant for Routing, Inference, and Triage [paper]
- [arxiv 2025.9] AgenticAD: A Specialized Multiagent System Framework for Holistic Alzheimer Disease Management [paper]
- [arxiv 2025.9] Online Decision Making with Generative Action Sets [paper]
- [arxiv 2025.9] PAME-AI: Patient Messaging Creation and Optimization using Agentic AI [paper]
- [arxiv 2025.9] A co-evolving agentic AI system for medical imaging analysis [paper]
- [arxiv 2025.9] FHIR-AgentBench: Benchmarking LLM Agents for Realistic Interoperable EHR Question Answering [paper] [Github]
- [arxiv 2025.9] MedFact: Benchmarking the Fact-Checking Capabilities of Large Language Models on Chinese Medical Texts [paper] [Github]
- [arxiv 2025.9] Agentic Temporal Graph of Reasoning with Multimodal Language Models: A Potential AI Aid to Healthcare [paper]
- [arxiv 2025.9] Using AI to Optimize Patient Transfer and Resource Utilization During Mass-Casualty Incidents: A Simulation Platform [paper]
- [arxiv 2025.9] Demo: Healthcare Agent Orchestrator (HAO) for Patient Summarization in Molecular Tumor Boards [paper] [Github]
- [arxiv 2025.8] MedResearcher-R1: Expert-Level Medical Deep Researcher via A Knowledge-Informed Trajectory Synthesis Framework [paper] [Github]
- [arxiv 2025.8] ChatThero: An LLM-Supported Chatbot for Behavior Change and Therapeutic Support in Addiction Recovery [paper]
- [arxiv 2025.8] Automated Clinical Problem Detection from SOAP Notes using a Collaborative Multi-Agent LLM Architecture [paper]
- [arxiv 2025.8] Trustworthy Agents for Electronic Health Records through Confidence Estimation [paper]
- [arxiv 2025.8] AT-CXR: Uncertainty-Aware Agentic Triage for Chest X-rays [paper]
- [arxiv 2025.8] End-to-End Agentic RAG System Training for Traceable Diagnostic Reasoning [paper]
- [arxiv 2025.8] A Multi-Agent Approach to Neurological Clinical Reasoning [paper]
- [arxiv 2025.8] PASS: Probabilistic Agentic Supernet Sampling for Interpretable and Adaptive Chest X-Ray Reasoning [paper]
- [arxiv 2025.8] ConfAgents: A Conformal-Guided Multi-Agent Framework for Cost-Efficient Medical Diagnosis [paper]
- [arxiv 2025.8] Colacare: Enhancing electronic health record modeling through large language model-driven multi-agent collaboration [paper][project page]
- [arxiv 2025.8] FEAT: A Multi-Agent Forensic AI System with Domain-Adapted Large Language Model for Automated Cause-of-Death Analysis [paper]
- [arxiv 2025.8] Are Large Language Models Dynamic Treatment Planners? An In Silico Study from a Prior Knowledge Injection Angle [paper]
- [arxiv 2025.8] Tree-of-Reasoning: Towards Complex Medical Diagnosis via Multi-Agent Reasoning with Evidence Tree [paper]
- [arxiv 2025.8] A Multi-Agent System for Complex Reasoning in Radiology Visual Question Answering [paper]
- [arxiv 2025.8] Patho-AgenticRAG: Towards Multimodal Agentic Retrieval-Augmented Generation for Pathology VLMs via Reinforcement Learning [paper] [code]
- [arxiv 2025.8] Agent-Based Feature Generation from Clinical Notes for Outcome Prediction [paper]
- [arxiv 2025.8] GMAT: Grounded Multi-Agent Clinical Description Generation for Text Encoder in Vision-Language MIL for Whole Slide Image Classification [paper]
- [arxiv 2025.8] A Multi-Agent Approach to Neurological Clinical Reasoning [paper]
- [biorxiv 2025.8] BioScientistAgent: Designing LLM-Biomedical Agents with KG-Augmented RL Reasoning Modules for Drug Repurposing and Mechanistic of Action Elucidation [paper]
- [arxiv 2025.7] Agentic AI framework for end-to-end medical data inference [paper]
- [arxiv 2025.7] Intelligent Virtual Sonographer (IVS): Enhancing Physician-Robot-Patient Communication [paper]
- [arxiv 2025.7] A Comprehensive Survey of Electronic Health Record Modeling: From Deep Learning Approaches to Large Language Models [paper] [project page]
- [arxiv 2025.7] Infherno: End-to-end agent-based FHIR resource synthesis from free-form clinical notes [paper]
- [arxiv 2025.7] Multi-agent retrieval-augmented framework for evidence-based counterspeech against health misinformation [paper]
- [arxiv 2025.7] AI-VaxGuide: An Agentic RAG-Based LLM for Vaccination Decisions [paper]
- [arxiv 2025.7] Multi-Agent Reasoning for Cardiovascular Imaging Phenotype Analysis [paper]
- [arxiv 2025.7] DynamiCare: A Dynamic Multi-Agent Framework for Interactive and Open-Ended Medical Decision-Making [paper]
- [arxiv 2025.7] KERAP: A Knowledge-Enhanced Reasoning Approach for Accurate Zero-shot Diagnosis Prediction Using Multi-agent LLMs [paper]
- [arxiv 2025.7] STELLA: Self-Evolving LLM Agent for Biomedical Research [paper] [Github]
- [arxiv 2025.6] MedOrch: Medical Diagnosis with Tool-Augmented Reasoning Agents for Flexible Extensibility [paper]
- [arxiv 2025.6] MMedAgent-RL: Optimizing Multi-Agent Collaboration for Multimodal Medical Reasoning [paper]
- [arxiv 2025.6] From EHRs to Patient Pathways: Scalable Modeling of Longitudinal Health Trajectories with LLMs [paper]
- [arxiv 2025.6] Evidence-based diagnostic reasoning with multi-agent copilot for human pathology [paper]
- [arxiv 2025.6] An agentic system for rare disease diagnosis with traceable reasoning [paper] [demo]
- [arxiv 2025.6] PRISM2: Unlocking Multi-Modal General Pathology AI with Clinical Dialogue [paper]
- [arxiv 2025.6] The Optimization Paradox in Clinical AI Multi-Agent Systems [paper]
- [EMNLP 2025] AUTOCT: Automating Interpretable Clinical Trial Prediction with LLM Agents [paper]
- [arxiv 2025.6] AI Agents for Conversational Patient Triage: Preliminary Simulation-Based Evaluation with Real-World EHR Data [paper]
- [arxiv 2025.6] VChatter: Exploring Generative Conversational Agents for Simulating Exposure Therapy to Reduce Social Anxiety [paper]
- [ACL 2025 Findings] AnnaAgent: Dynamic Evolution Agent System with Multi-Session Memory for Realistic Seeker Simulation [paper] [code]
- [ACL 2025] ReflecTool: Towards Reflection-Aware Tool-Augmented Clinical Agents [paper] [Github] [Project]
- [arxiv 2025.6] RadFabric: Agentic AI System with Reasoning Capability for Radiology [Paper] [Project] |
- [arxiv 2025.5] CDR-Agent: Intelligent Selection and Execution of Clinical Decision Rules Using Large Language Model Agents [paper] [code]
- [arxiv 2025.5] BehaviorSFT: Behavioral Token Conditioning for Clinical Agents Across the Proactivity Spectrum [paper]
- [arxiv 2025.5] Silence is Not Consensus: Disrupting Agreement Bias in Multi-Agent LLMs via Catfish Agent for Clinical Decision Making [paper]
- [NeurIPS 2025] CPathAgent: An Agent-based Foundation Model for Interpretable High-Resolution Pathology Image Analysis Mimicking Pathologists' Diagnostic Logic [paper]
- [arxiv 2025.5] Are Vision Language Models Ready for Clinical Diagnosis? A 3D Medical Benchmark for Tumor-centric Visual Question Answering [paper] [code]
- [arxiv 2025.5] Beyond Correlation: Towards Causal Large Language Model Agents in Biomedicine [paper]
- [NeurIPS 2025] Generator-Mediated Bandits: Thompson Sampling for GenAI-Powered Adaptive Interventions [paper]
- [arxiv 2025.5] CT-Agent: A Multimodal-LLM Agent for 3D CT Radiology Question Answering [paper]
- [arxiv 2025.5] A Risk Taxonomy for Evaluating AI-Powered Psychotherapy Agents [paper]
- [NeurIPS 2025] MedAgentBoard: Benchmarking Multi-Agent Collaboration with Conventional Methods for Diverse Medical Tasks [paper] [project page]
- [arxiv 2025.5] A Multimodal Multi-Agent Framework for Radiology Report Generation [paper]
- [EMNLP 2025] DoctorAgent-RL: A Multi-Agent Collaborative Reinforcement Learning System for Multi-Turn Clinical Dialogue [paper] [code]
- [biorxiv 2025.5] Biomni: A general-purpose biomedical ai agent [paper]
- [arxiv 2025.4] Llm agent swarm for hypothesis-driven drug discovery [paper]
- [arxiv 2025.4] Customizing emotional support: How do individuals construct and interact with LLM-powered chatbots [paper]
- [arxiv 2025.4] An LLM-Driven Multi-Agent Debate System for Mendelian Diseases [paper]
- [arxiv 2025.4] Txgemma: Efficient and agentic llms for therapeutics [paper]
- [medrxiv 2025.4] TrialGenie: Empowering Clinical Trial Design with Agentic Intelligence and Real World Data [paper]
- [arxiv 2025.3] TAMA: A Human--AI Collaborative Thematic Analysis Framework Using Multi-Agent LLMs for Clinical Interviews [paper]
- [arxiv 2025.3] Autonomous Radiotherapy Treatment Planning Using DOLA: A Privacy-Preserving, LLM-Based Optimization Agent [paper]
- [arxiv 2025.3] The Application of MATEC (Multi-AI Agent Team Care) Framework in Sepsis Care [paper]
- [EMNLP 2025] MDTeamGPT: A Self-Evolving LLM-Based Multi-Agent Framework for Multi-Disciplinary Team Medical Consultation [paper] [GitHub]
- [arxiv 2025.3] RAG-KG-IL: A Multi-Agent Hybrid Framework for Reducing Hallucinations and Enhancing LLM Reasoning through RAG and Incremental Knowledge Graph Learning Integration [paper]
- [arxiv 2025.3] MAP: Evaluation and Multi-Agent Enhancement of Large Language Models for Inpatient Pathways [paper]
- [arxiv 2025.3] TxAgent: An AI agent for therapeutic reasoning across a universe of tools [paper]
- [arxiv 2025.3] MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning [paper] [project page]
- [arxiv 2025.3] Towards conversational ai for disease management [paper]
- [arxiv 2025.3] GEMA-Score: Granular Explainable Multi-Agent Score for Radiology Report Evaluation [paper]
- [EMNLP 2025 Findings] MIND: Towards Immersive Psychological Healing with Multi-Agent Inner Dialogue [paper]
- [arxiv 2025.2] Enhancing hepatopathy clinical trial efficiency: a secure, large language model-powered pre-screening pipeline [paper]
- [arxiv 2025.2] RAG-Enhanced Collaborative LLM Agents for Drug Discovery [paper]
- [EMNLP 2025 Findings] Agentic Medical Knowledge Graphs Enhance Medical Question Answering: Bridging the Gap Between LLMs and Evolving Medical Knowledge [paper]
- [arxiv 2025.2] An LLM-Powered Agent for Physiological Data Analysis: A Case Study on PPG-based Heart Rate Estimation [paper]
- [ACL 2025] Cami: A counselor agent supporting motivational interviewing through state inference and topic exploration [paper]
- [ICML 2025] MedRAX: Medical Reasoning Agent for Chest X-ray [paper] [code]
- [ICCV 2025] PathFinder: A Multi-Modal Multi-Agent System for Medical Diagnostic Decision-Making Applied to Histopathology [Paper] [project page] [Github]
- [arxiv 2025.2] M^3Builder: A Multi-Agent System for Automated Machine Learning in Medical Imaging [paper]
- [NEJM AI 2025] MedAgentBench: A Realistic Virtual EHR Environment to Benchmark Medical LLM Agents [paper] [project page]
- [arxiv 2025.1] Exploring the inquiry-diagnosis relationship with advanced patient simulators [paper] [project page]
- [ICML 2025] MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding [paper] [project page]
- [arxiv 2025.1] AutoCBT: An Autonomous Multi-agent Framework for Cognitive Behavioral Therapy in Psychological Counseling [paper]
- [medrxiv 2025.1] Advancing the prediction and understanding of placebo responses in chronic back pain using large language models [paper]
- [Nature] Towards conversational diagnostic artificial intelligence [paper]
- [Nature Communications 2025] AgentMD: Empowering Language Agents for Risk Prediction with Large-Scale Clinical Tool Learning [paper]
- [Intelligent Medicine] Evaluating large language models and agents in healthcare: key challenges in clinical applications [paper]
- [npj Digital Medicine] Evaluating large language models as agents in the clinic [paper]
- [Nature Medicine 2025] An evaluation framework for clinical use of large language models in patient interaction tasks [paper]
- [Nature Communications 2025] An automated framework for assessing how well LLMs cite relevant medical references [paper]
- [Nature BME 2025] CRISPR-GPT for agentic automation of gene-editing experiments [paper]
- [Nature Methods 2025] GeneAgent: self-verification language agent for gene-set analysis using domain databases [paper]
- [npj Digital Medicine] CARE-AD: A Multi-Agent Large Language Model Framework for Alzheimer's Disease Prediction Using Longitudinal Clinical Notes [paper]
- [npj Digital Medicine] Vision-language model for report generation and outcome prediction in CT pulmonary angiogram [paper]
- [npj Artificial Intelligence] HealthcareAgent: Eliciting the Power of Large Language Models for Medical Consultation [paper]
- [biorxiv 2025.6] HEAL-KGGen: A Hierarchical Multi-Agent LLM Framework with Knowledge Graph Enhancement for Genetic Biomarker-Based Medical Diagnosis [paper]
- [JAMIA Open 2025] Conversational health agents: a personalized large language model-powered agent framework [paper]
- [JMIR] The Effectiveness of a Custom AI Chatbot for Type 2 Diabetes Mellitus Health Literacy: Development and Evaluation Study [paper]
- [JMIR Aging 2025] The PDC30 Chatbot—Development of a Psychoeducational Resource on Dementia Caregiving Among Family Caregivers: Mixed Methods Acceptability Study [paper]
- [JoVE] Evidence-based knowledge synthesis and hypothesis validation: Navigating biomedical knowledge bases via explainable ai and agentic systems [paper]
- [arxiv 2024.8] Drugagent: Multi-agent large language model-based reasoning for drug-target interaction prediction [paper]
- [Bioinformatics 2025] ESCARGOT: an AI agent leveraging large language models, dynamic graph of thoughts, and biomedical knowledge graphs for enhanced reasoning [paper]
- [Clinical Neurophysiology 2025] Agent-guided AI-powered interpretation and reporting of nerve conduction studies and EMG (INSPIRE) [paper]
- [Expert Systems with Applications 2025] A two-stage proactive dialogue generator for efficient clinical information collection using large language model [paper]
- [Physics in Medicine & Biology 2025] A feasibility study of automating radiotherapy planning with large language model agents [paper]
- [JCO 2025] A large language model (LLM)-based multi-agent framework for risk stratification and treatment recommendations in localized prostate cancer (locPCa). [paper]
- [IEEE EMBC 2025] Knowledge-infused LLM-powered conversational health agent: A case study for diabetes patients [paper]
- [ICLR 2025] MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models [paper]
- [ACL 2025] Medical Graph RAG: Evidence-based Medical Large Language Model via Graph Retrieval-Augmented Generation [paper]
- [ACL Findings 2025] MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration [paper]
- [ACL Findings 2025] ASTRID--An Automated and Scalable TRIaD for the Evaluation of RAG-based Clinical Question Answering Systems [paper]
- [NAACL 2025] A Layered Debating Multi-Agent System for Similar Disease Diagnosis [paper]
- [NAACL 2025] Menti: Bridging medical calculator and llm agent with nested tool calling [paper]
- [COLING 2025] Unveiling performance challenges of large language models in low-resource healthcare: A demographic fairness perspective [paper]
- [ICMI 2025] An LLM-powered Socially Interactive Agent with Adaptive Facial Expressions for Conversing about Health [paper]
- [MICCAI 2025 (Oral)] WSI-Agents: A Collaborative Multi-Agent System for Multi-Modal Whole Slide Image Analysis [Paper] [GitHub]
- [MICCAI 2025] Multi-Agent Reasoning for Cardiovascular Imaging Phenotype Analysis [Paper] [GitHub]
- [MICCAI 2025] DentEval: Fine-tuning-Free Expert-Aligned Assessment in Dental Education via LLM Agents [Paper] [GitHub]
- [MICCAI 2025] CSAP-Assist: Instrument-Agent Dialogue Empowered Vision-Language Models for Collaborative Surgical Action Planning [Paper] [GitHub]
- [MICCAI 2025] MedAgentSim: Self-Evolving Multi-Agent Simulations for Realistic Clinical Interactions [Paper] [Github]
- [MICCAI 2025 workshop] AURA: A Multi-Modal Medical Agent for Understanding, Reasoning & Annotation [paper] [github]
- [ICT4AWE 2025] MentalRAG: Developing an Agentic Framework for Therapeutic Support Systems [paper]
- [MLHC 2025] Evaluation of Multi-Agent LLMs in Multidisciplinary Team Decision-Making for Challenging Cancer Cases [paper]
- [Journal of imaging informatics in medicine] AgentMRI: A Vison Language Model-Powered AI System for Self-regulating MRI Reconstruction with Multiple Degradations [paper]
- [COLM 2025] Can A Society of Generative Agents Simulate Human Behavior and Inform Public Health Policy? A Case Study on Vaccine Hesitancy [paper]
- [AAMAS 2025] On the limits of agency in agent-based models [paper]
- [ACL 2025 Findings] Cod, towards an interpretable medical agent using chain of diagnosis [paper] [Github]
- [Advanced Intelligent Systems 2025] Inquire, Interact, and Integrate: A Proactive Agent Collaborative Framework for Zero-Shot Multimodal Medical Reasoning [paper]
- [NeurIPS 2025] Clinicallab: Aligning agents for multi-departmental clinical diagnostics in the real world [paper]
- [Cell Reports Medicine 2025] Development and Testing of a Novel Large Language Model-Based Clinical Decision Support Systems for Medication Safety in 12 Clinical Specialties [paper]
- [PMLR 2025] KG4Diagnosis: A Hierarchical Multi-Agent LLM Framework with Knowledge Graph Enhancement for Medical Diagnosis [paper]
- [npj Digital Medicine 2025] Enhancing diagnostic capability with multi-agents conversational large language models [paper] [Github]
- [medRxiv 2025] AI agents in clinical medicine: a systematic review [paper]
- [Applied Sciences 2025] A Conversational Agent for Empowering People with Parkinson’s Disease in Exercising Through Motivation and Support [paper]
- [Alz & Dement TRCI 2025] AI approaches for phenotyping Alzheimer’s disease and related dementias using electronic health records [paper]
- [Bioengineering 2025] PARKA AI: A Sensor-Integrated Mobile Application for Parkinson’s Disease Monitoring and Self-Management [paper]
- [Scientific Reports 2025] Web based AI-driven framework combining multi-modal data with CNN and LLM for Parkinson’s disease diagnosis [paper]
- [ACM Workshop 2024] LLM-Powered Multimodal AI Conversations for Diabetes Prevention [paper]
- [arxiv 2024.12] PsyDraw: A Multi-Agent Multimodal System for Mental Health Screening in Left-Behind Children [paper]
- [Bioinformatics] AI-HOPE: an AI-driven conversational agent for enhanced clinical and genomic data integration in precision medicine research [paper]
- [arxiv 2024.10] IMAS: A Comprehensive Agentic Approach to Rural Healthcare Delivery [paper] [project page]
- [arxiv 2024.10] KGARevion: An AI Agent for Knowledge-Intensive Biomedical QA [paper] [Github] [Project]
- [arxiv 2024.10] Zodiac: A Cardiologist-Level LLM Framework for Multi-Agent Diagnostics [paper]
- [arxiv 2024.9] Chatting Up Attachment: Using LLMs to Predict Adult Bonds [paper]
- [MLHC 2024] MALADE: Orchestration of LLM-powered Agents with Retrieval Augmented Generation for Pharmacovigilance [paper] [project page]
- [arxiv 2024.8] Agentic llm workflows for generating patient-friendly medical reports [paper] [project page]
- [ACM UIST 2024] Compeer: A generative conversational agent for proactive peer support [paper]
- [arxiv 2024.7] Cactus: Towards psychological counseling conversations using cognitive behavioral theory [paper]
- [TMI] Integration of Multi-Source Medical Data for Medical Diagnosis Question Answering [paper]
- [ICLR 2025 Oral] Pathgen-1.6m: 1.6 million pathology image-text pairs generation through multi-agent collaboration [paper] [project page]
- [arxiv 2024.7] MentalAgora: A Gateway to Advanced Personalized Care in Mental Health through Multi-Agent Debating and Attribute Control [paper]
- [arxiv 2024.12] Enhancing LLMs for Impression Generation in Radiology Reports through a Multi-Agent System [paper]
- [ICML 2024 AI for Science Workshop] TriageAgent: Towards Better Multi-Agents Collaborations for Large Language Model-Based Clinical Triage [paper]
- [KDD'24 Workshop] EHRFlow: A Large Language Model-Driven Iterative Multi-Agent Electronic Health Record Data Analysis Workflow [paper]
- [MLHS 2025] Path-RAG: Knowledge-Guided Key Region Retrieval for Open-ended Pathology Visual Question Answering [paper]
- [NeurIPS 2024] MEDIQ: Question-Asking LLMs and a Benchmark for Medical Information-Seeking [paper] [project page]
- [arxiv 2024.6] CliBench: A Multifaceted and Multigranular Evaluation of Clinical Diagnosis with LLMs [paper]
- [arxiv 2024.5] AgentClinic: a multimodal agent benchmark to evaluate AI in simulated clinical environments [paper]
- [AAAI 2025 workshop AI4Research] Drugagent: Automating ai-aided drug discovery programming through llm multi-agent collaboration [paper]
- [arxiv 2024.5] Agent Hospital: A Simulacrum of Hospital with Evolvable Medical Agents [paper]
- [EMNLP 2024] Ehragent: Code empowers large language models for few-shot complex tabular reasoning on electronic health records [paper]
- [NeurIPS 2024 Oral] Mdagents: An adaptive collaboration of llms for medical decision-making [paper] [project page]
- [arxiv 2024.3] Llms-based few-shot disease predictions using ehr: A novel approach combining predictive agent reasoning and critical agent instruction [paper]
- [npj Digital Medicine] PRISM: Patient Records Interpretation for Semantic Clinical Trial Matching using Large Language Models [paper]
- [PACIFIC SYMPOSIUM ON BIOCOMPUTING 2024] A conversational agent for early detection of neurotoxic effects of medications through automated intensive observation [paper]
- [JAMIA Open 2024] Conversational health agents: A personalized llm-powered agent framework [paper] [project page]
- [JMIR 2024] Mitigating cognitive biases in clinical decision-making through multi-agent conversations using large language models: simulation study [paper]
- [JMIR 2024] A language model--powered simulated patient with automated feedback for history taking: Prospective study [paper]
- [IEEE SoftCOM 2024] A multi-agent architecture for privacy-preserving natural language interaction with FHIR-based electronic health records [paper]
- [IEEE Access 2024] Knowledge-Routed Automatic Diagnosis With Heterogeneous Patient-Oriented Graph [paper]
- [EMNLP Findings 2024] MMedAgent: Learning to Use Medical Tools with Multi-modal Agent [paper]
- [EMNLP 2024] RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models [paper] [Github]
- [ACL Findings 2024] Medagents: Large language models as collaborators for zero-shot medical reasoning [paper]
- [AAAI 2024] PathAsst: A Generative Foundation AI Assistant towards Artificial General Intelligence of Pathology [paper] [Github]
- [CHI EA 2024] Conversational AI in health: Design considerations from a Wizard-of-Oz dermatology case study with users, clinicians and a medical LLM [paper]
- [Healthcare Information 2024] A Medical Consultation System for Geriatric Disease Based on Multi-agent Architecture and Knowledge Graph [paper]
- [Cell 2024] Empowering biomedical discovery with AI agents [paper] [Github]
- [Front Public Health 2024] MoveONParkinson: developing a personalized motivational solution for Parkinson’s disease management [paper]
- [NeurIPS workshop 2023] Are we going mad? benchmarking multi-agent debate between language models for medical q&a [paper]
- [AMIA Annual Symposium Proceedings] Understanding the benefits and challenges of using large language model-based conversational agents for mental well-being support [paper]
- [Clinical NLP 2023] DERA: enhancing large language model completions with dialog-enabled resolving agents [paper] [dataset]
- [CHI 2023] Assertiveness-based agent communication for a personalized medicine on medical imaging diagnosis [paper]
AI agents for cancer diagnosis, pathology, oncology MDT, survival prediction, radiotherapy, and tumor imaging.
| Title | Venue | Date | Paper Link | Project Page |
|---|---|---|---|---|
| LAMMI-Pathology: A Tool-Centric Bottom-Up LVLM-Agent Framework for Molecularly Informed Medical Intelligence | arXiv | 2026.02 | Paper | Not Available |
| LungNoduleAgent: A Collaborative Multi-Agent System for Precision Diagnosis of Lung Nodules | AAAI | 2026.1 | Paper | GitHub |
| HARMON-E: Hierarchical Agentic Reasoning for Multimodal Oncology Notes to Extract Structured Data | arXiv | 2025.12 | Paper | Not Available |
| Bidirectional human-AI collaboration in brain tumour assessments improves both expert human and AI agent performance | arXiv | 2025.12 | Paper | Not Available |
| Agent-Based Output Drift Detection for Breast Cancer Response Prediction in a Multisite Clinical Decision Support System | arXiv | 2025.12 | Paper | Not Available |
| Multi-Agent Medical Decision Consensus Matrix System: An Intelligent Collaborative Framework for Oncology MDT Consultations | arXiv | 2025.12 | Paper | Not Available |
| Multi-Agent Intelligence for Multidisciplinary Decision-Making in Gastrointestinal Oncology | arXiv | 2025.12 | Paper | Not Available |
| SurvAgent: Hierarchical CoT-Enhanced Case Banking and Dichotomy-Based Multi-Agent System for Multimodal Survival Prediction | arXiv | 2025.11 | Paper | Not Available |
| Traj-CoA: Patient Trajectory Modeling via Chain-of-Agents for Lung Cancer Risk Prediction | NeurIPS’25 Workshop | 2025.10 | Paper | Not Available |
| Healthcare Agent Orchestrator (HAO) for Patient Summarization in Molecular Tumor Boards | arXiv | 2025.09 | Paper | GitHub |
| GMAT: Grounded Multi-Agent Clinical Description Generation for Text Encoder in Vision-Language MIL | arXiv | 2025.08 | Paper | Not Available |
| Patho-AgenticRAG: Towards Multimodal Agentic Retrieval-Augmented Generation for Pathology VLMs | arXiv | 2025.08 | Paper | GitHub |
| Are Large Language Models Dynamic Treatment Planners? An In Silico Study from a Prior Knowledge Injection Angle | arXiv | 2025.08 | Paper | Not Available |
| Evaluation of Multi-Agent LLMs in Multidisciplinary Team Decision-Making for Challenging Cancer Cases | MLHC | 2025 | Paper | Not Available |
| Evidence-based diagnostic reasoning with multi-agent copilot for human pathology | arXiv | 2025.06 | Paper | Not Available |
| PRISM2: Unlocking Multi-Modal General Pathology AI with Clinical Dialogue | arXiv | 2025.06 | Paper | Not Available |
| Are Vision Language Models Ready for Clinical Diagnosis? A 3D Medical Benchmark for Tumor-centric Visual Question Answering | arXiv | 2025.05 | Paper | GitHub |
| CPathAgent: An Agent-based Foundation Model for Interpretable High-Resolution Pathology Image Analysis | NeurIPS | 2025.05 | Paper | Not Available |
| Autonomous Radiotherapy Treatment Planning Using DOLA: A Privacy-Preserving, LLM-Based Optimization Agent | arXiv | 2025.03 | Paper | Not Available |
| A feasibility study of automating radiotherapy planning with large language model agents | Physics in Medicine & Biology | 2025 | Paper | Not Available |
| A large language model (LLM)-based multi-agent framework for risk stratification and treatment recommendations in localized prostate cancer (locPCa). | JCO | 2025 | Paper | Not Available |
| PathFinder: A Multi-Modal Multi-Agent System for Medical Diagnostic Decision-Making Applied to Histopathology | ICCV | 2025.02 | Paper | project GitHub |
| WSI-Agents: A Collaborative Multi-Agent System for Multi-Modal Whole Slide Image Analysis | MICCAI (Oral) | 2025 | Paper | GitHub |
| Path-RAG: Knowledge-Guided Key Region Retrieval for Open-ended Pathology Visual Question Answering | MLHS | 2025 | Paper | GitHub |
| Pathgen-1.6m: 1.6 million pathology image-text pairs generation through multi-agent collaboration | ICLR (Oral) | 2024 | Paper | GitHub |
| PathAsst: A Generative Foundation AI Assistant towards Artificial General Intelligence of Pathology | AAAI | 2024 | Paper | GitHub |
AI agents for CBT, psychiatric consultation, depression, anxiety, addiction, and peer support.
| Title | Venue | Date | Paper Link | Project Page |
|---|---|---|---|---|
| MIND: Unified Inquiry and Diagnosis RL for Psychiatric Consultation | arXiv | 2026.03 | Paper | Not Available |
| Advancing AI Trustworthiness Through Patient Simulation for Antidepressant Selection | arXiv | 2026.02 | Paper | Not Available |
| coTherapist: A Behavior-Aligned Small Language Model to Support Mental Healthcare Experts | arXiv | 2026.01 | Paper | Not Available |
| Towards Efficient and Robust Linguistic Emotion Diagnosis for Mental Health | arXiv | 2026.01 | Paper | Not Available |
| ChatThero: An LLM-Supported Chatbot for Behavior Change and Therapeutic Support in Addiction Recovery | arXiv | 2025.08 | Paper | GitHub Reproduce |
| VChatter: Exploring Generative Conversational Agents for Simulating Exposure Therapy to Reduce Social Anxiety | arXiv | 2025.06 | Paper | Not Available |
| AnnaAgent: Dynamic Evolution Agent System with Multi-Session Memory for Realistic Seeker Simulation | ACL Findings | 2025.06 | Paper | GitHub |
| A Risk Taxonomy for Evaluating AI-Powered Psychotherapy Agents | arXiv | 2025.05 | Paper | Not Available |
| Customizing emotional support: How do individuals construct and interact with LLM-powered chatbots | arXiv | 2025.04 | Paper | Not Available |
| MIND: Towards Immersive Psychological Healing with Multi-Agent Inner Dialogue | EMNLP Findings | 2025.02 | Paper | GitHub Reproduce |
| Cami: A counselor agent supporting motivational interviewing through state inference and topic exploration | ACL | 2025.02 | Paper | GitHub |
| Autocbt: An autonomous multi-agent framework for cognitive behavioral therapy in psychological counseling | arXiv | 2025.01 | Paper | Not Available |
| Chatting Up Attachment: Using LLMs to Predict Adult Bonds | arXiv | 2024.09 | Paper | Not Available |
| PsyDraw: A Multi-Agent Multimodal System for Mental Health Screening in Left-Behind Children | arXiv | 2024.12 | Paper | GitHub |
| Cactus: Towards psychological counseling conversations using cognitive behavioral theory | EMNLP Findings | 2024.07 | Paper | GitHub |
| MentalAgora: A Gateway to Advanced Personalized Care in Mental Health through Multi-Agent Debating | arXiv | 2024.07 | Paper | GitHub |
| Compeer: A generative conversational agent for proactive peer support | ACM UIST | 2024.07 | Paper | GitHub |
| MentalRAG: Developing an Agentic Framework for Therapeutic Support Systems | ICT4AWE | 2025 | Paper | Not Available |
| Understanding the benefits and challenges of using large language model-based conversational agents for mental well-being support | AMIA Annual Symposium Proceedings | 2023.07 | Paper | Not Available |
AI agents for Alzheimer’s disease, dementia, Parkinson’s, and neurological reasoning.
| Title | Venue | Date | Paper Link | Project Page |
|---|---|---|---|---|
| ALPACA: A Reinforcement Learning Environment for Medication Repurposing in Alzheimer’s Disease | arXiv | 2026.02 | Paper | Not Available |
| DemMA: Dementia Multi-Turn Dialogue Agent with Expert-Guided Reasoning and Action Simulation | arXiv | 2026.01 | Paper | Not Available |
| AgenticAD: A Specialized Multiagent System Framework for Holistic Alzheimer Disease Management | arXiv | 2025.09 | Paper | Not Available |
| A Multi-Agent Approach to Neurological Clinical Reasoning | arXiv | 2025.8 | Paper | Not Available |
| CARE-AD: a multi-agent large language model framework for Alzheimer’s disease prediction | npj Digital Medicine | 2025 | Paper | GitHub |
| The PDC30 Chatbot--Development of a Psychoeducational Resource on Dementia Caregiving Among Family Caregivers: Mixed Methods Acceptability Study | JMIR Aging | 2025 | Paper | Not Available |
| Agent-guided AI-powered interpretation and reporting of nerve conduction studies and EMG (INSPIRE) | Clinical Neurophysiology | 2025 | Paper | Not Available |
| A Conversational Agent for Early Detection of Neurotoxic Effects of Medications through Automated Intensive Observation | PACIFIC SYMPOSIUM ON BIOCOMPUTING | 2024 | Paper | Not Available |
| Patrika: AI-Enabled Conversational Journaling for Advancing Parkinson's Disease Symptom Tracking | CHI | 2025 | Paper | Not Available |
| A Conversational GPT Agent for Parkinson's Disease (PD-GPT) | MDS Abstracts | 2025 | Paper | Not Available |
| Leveraging Large Language Models for Personalized Parkinson's Disease Treatment | IEEE JBHI | 2026 | Paper | Not Available |
| Agentic AI for Scaling Diagnosis and Care in Neurodegenerative Disease | arXiv | 2025.02 | Paper | Not Available |
| Agentic Generative AI System for Classification of Pathology-Confirmed Primary Progressive Aphasia Variants | medRxiv | 2025.10 | Paper | Not Available |
| Prompting and Fine-Tuning Large Language Models for Parkinson Disease Diagnosis | JMIR Medical Informatics | 2026 | Paper | Not Available |
| Multimodal diagnosis of Parkinson's disease with an internet-based collaborative agent architecture of medical language models | Computers in Biology and Medicine | 2026 | Paper | Not Available |
| Web based AI-driven framework combining multi-modal data with CNN and LLM for Parkinson’s disease diagnosis | Scientific Reports | 2025 | Paper | Not Available |
| PARKA AI: A Sensor-Integrated Mobile Application for Parkinson’s Disease Monitoring and Self-Management | Bioengineering | 2025 | Paper | Not Available |
| A Conversational Agent for Empowering People with Parkinson’s Disease in Exercising Through Motivation and Support | Applied Sciences | 2025 | Paper | Not Available |
| AI approaches for phenotyping Alzheimer’s disease and related dementias using electronic health records | Alzheimer's & Dementia: Translational Research & Clinical Interventions | 2025 | Paper | Not Available |
| MoveONParkinson: developing a personalized motivational solution for Parkinson’s disease management | Frontiers in Public Health | 2024 | Paper | Not Available |
AI agents for heart failure, cardiac imaging, cardiologist-level diagnosis, and heart rate analysis.
| Title | Venue | Date | Paper Link | Project Page |
|---|---|---|---|---|
| An LLM-Powered Agent for Physiological Data Analysis: A Case Study on PPG-based Heart Rate Estimation | arXiv | 2025.02 | Paper | Not Available |
| ClinNoteAgents: An LLM Multi-Agent System for Predicting and Interpreting Heart Failure 30-Day Readmission from Clinical Notes | arXiv | 2025.12 | Paper | Not Available |
| Multi-Agent Reasoning for Cardiovascular Imaging Phenotype Analysis | MICCAI | 2025.07 | Paper | GitHub |
| Zodiac: A Cardiologist-Level LLM Framework for Multi-Agent Diagnostics | arXiv | 2024.10 | Paper | Not Available |
AI agents for stroke diagnosis, dysarthria, and cerebrovascular disease management.
| Title | Venue | Date | Paper Link | Project Page |
|---|---|---|---|---|
| An Explainable Agentic AI Framework for Uncertainty-Aware and Abstention-Enabled Acute Ischemic Stroke Imaging Decisions | arXiv | 2026.01 | Paper | Not Available |
| Wearable Intelligent Throat Enables Natural Speech in Stroke Patients with Dysarthria | Nature Communications | 2026 | Paper | Not Available |
AI agents for diabetes management, PCOS diagnosis, and nutrition intervention.
| Title | Venue | Date | Paper Link | Project Page |
|---|---|---|---|---|
| NutriOrion: A Hierarchical Multi-Agent Framework for Personalized Nutrition Intervention Grounded in Clinical Guidelines | arXiv | 2026.02 | Paper | Not Available |
| Mapis: A Knowledge-Graph Grounded Multi-Agent Framework for Evidence-Based PCOS Diagnosis | arXiv | 2025.12 | Paper | Not Available |
| The Effectiveness of a Custom AI Chatbot for Type 2 Diabetes Mellitus Health Literacy: Development and Evaluation Study | JMIR | 2025 | Paper | Not Available |
| Knowledge-infused LLM-powered conversational health agent: A case study for diabetes patients | IEEE EMBC | 2025 | Paper | Not Available |
| FinAgent: An Agentic AI Framework Integrating Personal Finance and Nutrition Planning | arXiv | 2025.12 | Paper | Not Available |
| Detection and diagnosis of diabetic retinopathy in retinal fundus images using agentic AI approaches | Scientific Reports | 2025 | Paper | Not Available |
| My diabetes care: an AI-based mobile app with conversational agent for type 2 diabetes self-management | Scientific Reports | 2025 | Paper | Not Available |
| LLM-Powered Multimodal AI Conversations for Diabetes Prevention | ACM Workshop | 2024 | Paper | Not Available |
| Assessing the User Experience of an LLM-Based Conversational Assistant in Diabetes Mellitus Care | J Healthcare Informatics Research | 2025 | Paper | Not Available |
| Developing a GraphRAG-enabled local-LLM for Gestational Diabetes Mellitus | medRxiv | 2025 | Paper | Not Available |
AI agents for rare disease diagnosis, Mendelian diseases, CRISPR, gene analysis, and genomics.
| Title | Venue | Date | Paper Link | Project Page |
|---|---|---|---|---|
| ProtRLSearch: A Multi-Round Multimodal Protein Search Agent with LLMs Trained via RL | arXiv | 2026.03 | Paper | Not Available |
| An agentic system for rare disease diagnosis with traceable reasoning | arXiv | 2025.6 | Paper | [demo] |
| An LLM-Driven Multi-Agent Debate System for Mendelian Diseases | arXiv | 2025.04 | Paper | Not Available |
| Geneagent: self-verification language agent for gene-set analysis using domain databases | Nature Methods | 2025 | Paper | GitHub |
| CRISPR-GPT for agentic automation of gene-editing experiments | Nature BME | 2025 | Paper | GitHub |
| HEAL-KGGen: A Hierarchical Multi-Agent LLM Framework for Genetic Biomarker-Based Medical Diagnosis | biorxiv | 2025 | Paper | GitHub |
| AI-HOPE: An AI-Driven conversational agent for enhanced clinical and genomic data integration | Bioinformatics | 2024.12 | Paper | GitHub |
| dna-claude-analysis: AI-powered personal genome analysis agent using Claude | GitHub | 2025 | Not Available | GitHub |
AI agents for sepsis, antimicrobial therapy, and vaccination.
| Title | Venue | Date | Paper Link | Project Page |
|---|---|---|---|---|
| The Application of MATEC (Multi-AI Agent Team Care) Framework in Sepsis Care | arXiv | 2025.3 | Paper | Not Available |
| KRAL: Knowledge and Reasoning Augmented Learning for LLM-assisted Clinical Antimicrobial Therapy | arXiv | 2025.11 | Paper | Not Available |
| AI-VaxGuide: An Agentic RAG-Based LLM for Vaccination Decisions | arXiv | 2025.07 | Paper | huggingface |
| Can A Society of Generative Agents Simulate Human Behavior and Inform Public Health Policy? A Case Study on Vaccine Hesitancy | COLM | 2025 | Paper | Not Available |
| Fine-Tuning DialoGPT on Common Diseases in Rural Nepal for Medical Conversations | arXiv | 2025.11 | Paper | Not Available |
AI agents for skin disease diagnosis and dermatology applications.
| Title | Venue | Date | Paper Link | Project Page |
|---|---|---|---|---|
| Multi-Aspect Knowledge-Enhanced Medical Vision-Language Pretraining with Multi-Agent Data Generation | ICCV 2025 Highlight | 2025.12 | Paper | Github |
| Conversational AI in health: Design considerations from a Wizard-of-Oz dermatology case study with users, clinicians and a medical LLM | CHI ‘EA | 2024 | Paper | Not Available |
AI agents for dental imaging interpretation and dental education.
| Title | Venue | Date | Paper Link | Project Page |
|---|---|---|---|---|
| OPGAgent: An Agent for Auditable Dental Panoramic X-ray Interpretation | arXiv | 2026.03 | Paper | Not Available |
| DentEval: Fine-tuning-Free Expert-Aligned Assessment in Dental Education via LLM Agents | MICCAI | 2025 | Paper | GitHub |
AI agents for liver disease diagnosis and hepatopathy clinical trials.
| Title | Venue | Date | Paper Link | Project Page |
|---|---|---|---|---|
| MedCoRAG: Interpretable Hepatology Diagnosis via Hybrid Evidence Retrieval and Multispecialty Consensus | arXiv | 2026.03 | Paper | Not Available |
| Enhancing hepatopathy clinical trial efficiency: a secure, large language model-powered pre-screening pipeline | arXiv | 2025.02 | Paper | Not Available |
AI agents for knee osteoarthritis and musculoskeletal disease management.
| Title | Venue | Date | Paper Link | Project Page |
|---|---|---|---|---|
| KOM: A Multi-Agent Artificial Intelligence System for Precision Management of Knee Osteoarthritis (KOA) | arXiv | 2025.11 | Paper | Not Available |
AI agents for pulmonary embolism, CT pulmonary angiography, and respiratory diseases.
| Title | Venue | Date | Paper Link | Project Page |
|---|---|---|---|---|
| Vision-language model for report generation and outcome prediction in CT pulmonary angiogram | npj Digital Medicine | 2025 | Paper | GitHub |
AI agents for pressure ulcer classification and wound assessment.
| Title | Venue | Date | Paper Link | Project Page |
|---|---|---|---|---|
| FT-ARM: Fine-Tuned Agentic Reflection Multimodal Language Model for Pressure Ulcer Severity Classification with Reasoning | arXiv | 2025.10 | Paper | Not Available |
AI agents for chronic pain prediction and management.
| Title | Venue | Date | Paper Link | Project Page |
|---|---|---|---|---|
| Advancing the prediction and understanding of placebo responses in chronic back pain using large language models | medrxiv | 2025.01 | Paper | Not Available |
High-impact general clinical AI agents published in top venues (Nature family, Cell family, Lancet, NEJM/NEJM AI, NeurIPS, ICML, ICLR, ACL, EMNLP, MICCAI Oral, AAAI).
| Title | Venue | Date | Paper Link | Project Page |
|---|---|---|---|---|
| CARE: Towards Clinical Accountability in Multi-Modal Medical Reasoning with an Evidence-Grounded Agentic Framework | ICLR | 2026.03 | Paper | Project |
| ATPO: Adaptive Tree Policy Optimization for Multi-Turn Medical Dialogue | ICLR | 2026.03 | Paper | Not Available |
| MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale | ICLR | 2026 | Paper | GitHub |
| Towards conversational diagnostic artificial intelligence | Nature | 2025 | Paper | Not Available |
| AgentMD: Empowering Language Agents for Risk Prediction with Large-Scale Clinical Tool Learning | Nature Communications | 2025 | Paper | Not Available |
| An evaluation framework for clinical use of large language models in patient interaction tasks | Nature Medicine | 2025 | Paper | Not Available |
| An automated framework for assessing how well LLMs cite relevant medical references | Nature Communications | 2025 | Paper | Not Available |
| Evaluating large language models as agents in the clinic | npj Digital Medicine | 2025 | Paper | Not Available |
| Enhancing diagnostic capability with multi-agents conversational large language models | npj Digital Medicine | 2025 | Paper | GitHub |
| HealthcareAgent: Eliciting the Power of Large Language Models for Medical Consultation | npj Artificial Intelligence | 2025 | Paper | Not Available |
| Development and Testing of a Novel Large Language Model-Based Clinical Decision Support Systems for Medication Safety in 12 Clinical Specialties | Cell Reports Medicine | 2025 | Paper | Not Available |
| Clinicallab: Aligning agents for multi-departmental clinical diagnostics in the real world | NeurIPS | 2025 | Paper | Not Available |
| MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-Making | NeurIPS (Oral) | 2024 | Paper | GitHub |
| Generator-Mediated Bandits: Thompson Sampling for GenAI-Powered Adaptive Interventions | NeurIPS | 2025 | Paper | Not Available |
| MedRAX: Medical reasoning agent for chest x-ray | ICML | 2025.02 | Paper | GitHub |
| MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding | ICML | 2025 | Paper | GitHub |
| ReflecTool: Towards Reflection-Aware Tool-Augmented Clinical Agents | ACL | 2025 | Paper | GitHub |
| Medical Graph RAG: Evidence-based Medical Large Language Model via Graph Retrieval-Augmented Generation | ACL | 2025 | Paper | Not Available |
| MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration | ACL Findings | 2025 | Paper | GitHub |
| Cod, towards an interpretable medical agent using chain of diagnosis | ACL Findings | 2025 | Paper | GitHub |
| ASTRID--An Automated and Scalable TRIaD for the Evaluation of RAG-based Clinical Question Answering Systems | ACL Findings | 2025 | Paper | Not Available |
| Agentic Medical Knowledge Graphs Enhance Medical Question Answering: Bridging the Gap Between LLMs and Evolving Medical Knowledge | EMNLP Findings | 2025.2 | Paper | GitHub |
| MDTeamGPT: A Self-Evolving LLM-Based Multi-Agent Framework for Multi-Disciplinary Team Medical Consultation | EMNLP | 2025.3 | Paper | GitHub |
| DoctorAgent-RL: A Multi-Agent Collaborative Reinforcement Learning System for Multi-Turn Clinical Dialogue | EMNLP | 2025.5 | Paper | GitHub |
| CLARITY: Clinical Assistant for Routing, Inference, and Triage | EMNLP Industry | 2025.10 | Paper | Not Available |
| AUTOCT: Automating Interpretable Clinical Trial Prediction with LLM Agents | EMNLP | 2025 | Paper | GitHub |
| MMedAgent: Learning to Use Medical Tools with Multi-modal Agent | EMNLP Findings | 2024 | Paper | GitHub |
| Ehragent: Code empowers large language models for few-shot complex tabular reasoning on electronic health records | EMNLP | 2024 | Paper | Not Available |
| MedAgentSim: Self-Evolving Multi-Agent Simulations for Realistic Clinical Interactions | MICCAI | 2025 | Paper | GitHub |
| CSAP-Assist: Instrument-Agent Dialogue Empowered Vision-Language Models for Collaborative Surgical Action Planning | MICCAI | 2025 | Paper | GitHub |
| MEDIQ: Question-Asking LLMs and a Benchmark for Medical Information-Seeking | NeurIPS | 2024 | Paper | GitHub |
| MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning | ACL 2024 Findings | 2023.11 | Paper | GitHub |
| AAAI PathAsst: A Generative Foundation AI Assistant towards Artificial General Intelligence of Pathology | AAAI | 2024 | Paper | GitHub |
| A Layered Debating Multi-Agent System for Similar Disease Diagnosis | NAACL | 2025 | Paper | Not Available |
| Menti: Bridging medical calculator and llm agent with nested tool calling | NAACL | 2025 | Paper | Not Available |
| PRISM: Patient Records Interpretation for Semantic Clinical Trial Matching using Large Language Models | npj Digital Medicine | 2024.01 | Paper | Not Available |
| MoMA: a mixture-of-multimodal-agents architecture for enhancing clinical prediction modelling | npj Digital Medicine | 2026 | Paper | Not Available |
AI agents for drug discovery, drug-target interaction, clinical trials, pharmacovigilance, and therapeutic reasoning.
| Title | Venue | Date | Paper Link | Project Page |
|---|---|---|---|---|
| Causal-Enhanced AI Agents for Medical Research Screening | arXiv | 2026.01 | Paper | Not Available |
| MedAI: Evaluating TxAgent’s Therapeutic Agentic Reasoning in the NeurIPS CURE-Bench Competition | arXiv | 2025.12 | Paper | Benchmark & Competition |
| BioScientistAgent: Designing LLM-Biomedical Agents with KG-Augmented RL Reasoning Modules | biorxiv | 2025.08 | Paper | Not Available |
| STELLA: Self-Evolving LLM Agent for Biomedical Research | arXiv | 2025.07 | Paper | GitHub |
| Large Language Model Agent for Modular Task Execution in Drug Discovery | arXiv | 2025.07 | Paper | GitHub |
| Beyond Correlation: Towards Causal Large Language Model Agents in Biomedicine | arXiv | 2025.05 | Paper | Not Available |
| Llm agent swarm for hypothesis-driven drug discovery | arXiv | 2025.04 | Paper | Not Available |
| Txgemma: Efficient and agentic llms for therapeutics | arXiv | 2025.04 | Paper | Not Available |
| TrialGenie: Empowering Clinical Trial Design with Agentic Intelligence and Real World Data | medRxiv | 2025.04 | Paper | Not Available |
| TxAgent: An AI agent for therapeutic reasoning across a universe of tools | arXiv | 2025.03 | Paper | GitHub |
| RAG-Enhanced Collaborative LLM Agents for Drug Discovery | arXiv | 2025.02 | Paper | Not Available |
| Drugagent: Automating ai-aided drug discovery programming through llm multi-agent collaboration | AAAI 2025 workshop AI4Research | 2024.11 | Paper | GitHub |
| Drugagent: Multi-agent large language model-based reasoning for drug-target interaction prediction | arXiv | 2024.08 | Paper | GitHub |
| MALADE: Orchestration of LLM-powered Agents with Retrieval Augmented Generation for Pharmacovigilance | MLHC | 2024 | Paper | GitHub |
| ESCARGOT: an AI agent leveraging large language models, dynamic graph of thoughts, and biomedical knowledge graphs for enhanced reasoning | Bioinformatics | 2025 | Paper | Not Available |
High-impact datasets and benchmarks for evaluating medical AI agents.
| Title | Venue | Date | Paper Link | Project Page |
|---|---|---|---|---|
| MedAgentBench: A Realistic Virtual EHR Environment to Benchmark Medical LLM Agents | NEJM AI | 2025.01 | Paper | GitHub |
| MedAgentBoard: Benchmarking Multi-Agent Collaboration with Conventional Methods for Diverse Medical Tasks | NeurIPS | 2025.05 | Paper | GitHub |
| AgentClinic: a multimodal agent benchmark to evaluate AI in simulated clinical environments | arXiv | 2024.05 | Paper | GitHub |
| Agent Hospital: A Simulacrum of Hospital with Evolvable Medical Agents | arXiv | 2024.05 | Paper | GitHub |
| CliBench: A Multifaceted and Multigranular Evaluation of Clinical Diagnosis with LLMs | arXiv | 2024.06 | Paper | GitHub |
| MediQ: Question-Asking LLMs for Adaptive and Reliable Clinical Reasoning | arXiv | 2024.06 | Paper | GitHub |
| Title | Venue | Date | Paper Link | Project Page |
|---|---|---|---|---|
| A Comprehensive Survey of Agentic AI in Healthcare | Authorea / TechRxiv | 2025 | Paper | GitHub |
| AI agents in clinical medicine: a systematic review | medRxiv | 2025 | Paper | Not Available |
| LLM-based agentic systems in medicine and healthcare | Nature Machine Intelligence | 2024 | Paper | Not Available |
| A Survey of LLM-based Agents in Medicine: How far are we from Baymax? | ACL 2025 Findings | 2025 | Paper | GitHub |
| The Landscape of Medical Agents: A Survey | TechRxiv | 2025 | Paper | GitHub |
| Agentic large-language-model systems in medicine: A systematic review and taxonomy | TechRxiv | 2025 | Paper | Not Available |
| Agentic large language models for healthcare: current progress and future opportunities | Medicine Advances (Wiley) | 2025 | Paper | Not Available |
| A Survey of LLM-based Multi-agent Systems in Medicine | TechRxiv / OpenReview | 2025 | Paper | Not Available |
| AI agent in healthcare: applications, evaluations, and future directions | npj Artificial Intelligence | 2026 | Paper | Not Available |
| A foundational architecture for AI agents in healthcare | Cell Reports Medicine | 2025 | Paper | Not Available |
| Coordinated AI agents for advancing healthcare | Nature Biomedical Engineering | 2025 | Paper | Not Available |
| Next-generation agentic AI for transforming healthcare | Cell Reports Medicine | 2025 | Paper | Not Available |
| Large Language Model Agents for Biomedicine: A Comprehensive Review | Information (MDPI) | 2025 | Paper | Not Available |
| Artificial intelligence agents in healthcare research: A scoping review | PLOS ONE | 2025 | Paper | Not Available |
| Benchmarking large language model-based agent systems for clinical decision tasks | npj Digital Medicine | 2026 | Paper | Not Available |
| Enhancing diagnostic capability with multi-agents conversational large language models | npj Digital Medicine | 2025 | Paper | GitHub |
| Applications of artificial intelligence-based conversational agents in healthcare: A systematic umbrella review | International Journal of Medical Informatics | 2025 | Paper | Not Available |
| Scoping Review of Agentic AI Systems in Healthcare | HAL | 2025 | Paper | Not Available |
| AI Agents in Modern Healthcare: From Foundation to Pioneer | Preprints.org | 2025 | Paper | Not Available |
| Multi-Agent AI Systems in Healthcare: A Systematic Review Enhancing Clinical Decision-Making | Asian Journal of Medical Principles and Clinical Practice | 2025 | Paper | Not Available |
| Title | Venue | Date | Paper Link | Project Page |
|---|---|---|---|---|
| Agentic AI in Radiology: Evolution from Large Language Models to Future Clinical Integration | Radiology: Artificial Intelligence (RSNA) | 2025 | Paper | Not Available |
| From chatbots to agentic workflows: ensuring responsible deployment of large language models in radiology | Indian Journal of Radiology and Imaging | 2025 | Paper | Not Available |
| Agentic AI and Large Language Models in Radiology: Opportunities and Hallucination Challenges | Bioengineering (MDPI) | 2025 | Paper | Not Available |
| Agentic systems in radiology: Design, Applications, Evaluation, and Challenges | arXiv | 2025 | Paper | Not Available |
| Agentic AI in radiology: emerging potential and unresolved challenges | British Journal of Radiology | 2025 | Paper | Not Available |
| Agentic systems in radiology: Principles, opportunities, privacy risks, regulation, and sustainability concerns | Radiography | 2025 | Paper | Not Available |
| The Role of Agentic AI in Musculoskeletal Radiology: A Scoping Review | Tomography (MDPI) | 2025 | Paper | Not Available |
| Title | Venue | Date | Paper Link | Project Page |
|---|---|---|---|---|
| Reimagining psychiatric care with agentic AI: promise, challenges, and a roadmap forward | npj Digital Medicine | 2026 | Paper | Not Available |
| Large language model-driven agents in nursing practice: A scoping review | Nurse Education Today | 2025 | Paper | Not Available |
| Simulated patient systems powered by large language model-based AI agents offer potential for transforming medical education | Communications Medicine | 2025 | Paper | Not Available |
| Title | Venue | Date | Paper Link | Project Page |
|---|---|---|---|---|
| Empowering biomedical discovery with AI agents | Cell | 2024 | Paper | GitHub |
| Agentic AI and the rise of in silico team science in biomedical research | Nature Biotechnology | 2026 | Paper | Not Available |