GitHub - JhCircle/Kardia-R1: Kardia-R1: Unleashing LLMs to Reason toward Understanding and Empathy for Emotional Support via Rubric-as-Judge Reinforcement Learning

Kardia-R1: Unleashing LLMs to Reason toward Understanding and Empathy for Emotional Support via Rubric-as-Judge Reinforcement Learning

💗 We introduce Kardia-R1: teaching LLMs to understand, reason, and care — with transparent empathy 🌱

🔥 News

2026.01.14 🎉 Excited to share that our work, Kardia-R1, has been accepted at WWW 2026!
2025.12.02 🎉 Our Kardia-R1 paper released on arXiv — check it out now!
2025.12.03 🚀 The full KardiaBench dataset (22K multi-turn dialogues, 671 personas) is now officially released and open-sourced on HuggingFace!

### Authentication & Loading the Dataset
huggingface-cli login
from datasets import load_dataset
dataset = load_dataset("Jhcircle/KadiaBench")

💞 What is Kardia-R1?

🧠 Kardia-R1 is a reasoning-centric empathetic dialogue framework that unifies
user understanding → emotional reasoning → safe, supportive responses,
empowered by Rubric-as-Judge GRPO Reinforcement Learning for transparent and controllable empathy.

🧩 Key Features

KardiaBench: 671 real-world user profiles + 22,080 multi-turn empathetic dialogues
Four-span structured cognition
- <|understanding_begin|>...<|understanding_end|> — Interpret user intent & emotions using persona and context
- <|reasoning_begin|>...<|reasoning_end|> — Perform internal appraisal and empathetic reasoning
- <|emotion_begin|>...<|emotion_end|> — Identify the correct fine-grained emotion label
- <|response_begin|>...<|response_end|> — Generate supportive, persona-aligned empathetic replies
Rubric-as-Judge RL (Verifiable)
- Interpretable, criterion-based, LLM-judged reinforcement learning
Backbone-Agnostic Gains
- Improves Qwen, Gemma, and more across all empathy metrics
Superior to SoTA LLMs
- Outperforms GPT-4o, DeepSeek-R1, PsyLLM in emotion accuracy & empathetic quality

🎯 Rubric-as-Judge RL (Verifiable Reinforcement Learning)

Human-interpretable rubric: Relevance · Empathy · Persona Consistency · Safety · Fluency
Transparent scoring → controllable improvement
No black-box reward models → fully interpretable and aligned behavior

📈 Superior Performance

Consistent gains across every empathy dimension
Stronger emotional grounding and persona alignment
Scalable to Qwen (3B/7B) / Gemma (2B/7B） backbones
Robust, generalizable empathetic cognition across diverse emotional contexts

🌟 Kardia-R1 achieves state-of-the-art empathy, persona consistency, and emotion accuracy,
surpassing both general-purpose LLMs and specialized empathetic dialogue systems.

📦 KardiaBench Dataset

📂 Dataset Overview

KardiaBench is a large-scale empathetic dialogue dataset designed for
reasoning-centered emotional support, containing:

22,080 empathetic multi-turn dialogues
671 fully documented personas
Fine-grained emotional states
Four-span structured reasoning format
Fully anonymized & cleaned

HuggingFace dataset: 👉 KardiaBench

📥 Load the Dataset

To prevent misuse of sensitive data, our dataset requires an access request on HuggingFace. Please follow the instructions below to obtain access.

Submit an Access Request describing the intended use
Wait for approval from the maintainers — we will review and approve requests as quickly as possible

from datasets import load_dataset

dataset = load_dataset("Jhcircle/KadiaBench")

If you encounter AccessDenied or 403 Forbidden errors, your access request may still be pending or your HuggingFace authentication may be missing.

Login manually if needed:

huggingface-cli login

📘Data Fields

Field	Description
person	Full raw user profile string including MBTI, About, Signature, and Recent Activities.
mbti	The user’s MBTI type extracted from the profile (e.g., “INFP”, “ISTP”).
emotion	Target emotional state representing the user’s current feelings in the scenario (e.g., “anxious”, “terrified”).
situation	Starting background context or emotional scenario for the conversation.
anon_username	An anonymized username for privacy-preserving user identity.
messages	Full structured dialogue as a list of message objects, including the system prompt, user turns, and assistant responses.

📚 Citation

If our work is helpful, please cite:

@article{yuan2025kardia,
  title={Kardia-R1: Unleashing LLMs to Reason toward Understanding and Empathy for Emotional Support via Rubric-as-Judge Reinforcement Learning},
  author={Yuan, Jiahao and Cui, Zhiqing and Wang, Hanqing and Gao, Yuansheng and Zhou, Yucheng and Naseem, Usman},
  journal={arXiv preprint arXiv:2512.01282},
  year={2025}
}

🙇 Acknowledgement

We gratefully acknowledge EmpatheticDialogues for foundational inspiration, PersonalityCafe for publicly shared personas, DeepSeek-R1 and Qwen3 for their GRPO insights, and all annotators and psychology experts for their invaluable support in building KardiaBench.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
assets		assets
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Kardia-R1: Unleashing LLMs to Reason toward Understanding and Empathy for Emotional Support via Rubric-as-Judge Reinforcement Learning

💗 We introduce Kardia-R1: teaching LLMs to understand, reason, and care — with transparent empathy 🌱

🔥 News

💞 What is Kardia-R1?

🧩 Key Features

🎯 Rubric-as-Judge RL (Verifiable Reinforcement Learning)

📈 Superior Performance

📦 KardiaBench Dataset

📂 Dataset Overview

📥 Load the Dataset

📘Data Fields

📚 Citation

🙇 Acknowledgement

About

Uh oh!

Releases

Packages

License

JhCircle/Kardia-R1

Folders and files

Latest commit

History

Repository files navigation

Kardia-R1: Unleashing LLMs to Reason toward Understanding and Empathy for Emotional Support via Rubric-as-Judge Reinforcement Learning

💗 We introduce Kardia-R1: teaching LLMs to understand, reason, and care — with transparent empathy 🌱

🔥 News

💞 What is Kardia-R1?

🧩 Key Features

🎯 Rubric-as-Judge RL (Verifiable Reinforcement Learning)

📈 Superior Performance

📦 KardiaBench Dataset

📂 Dataset Overview

📥 Load the Dataset

📘Data Fields

📚 Citation

🙇 Acknowledgement

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages