
feat: attach exemplars to learning paths#118

Open
ccf wants to merge 2 commits into main from feat/learning-path-exemplars

Conversation


@ccf ccf commented Mar 16, 2026

Summary

  • attach exemplar session references to Growth learning-path recommendations
  • surface learning paths in the Growth skills tab and enrich the shared cards with study links
  • mark the exemplar-to-learning-path roadmap item shipped

Testing

  • pytest tests/test_growth.py -q
  • cd frontend && ./node_modules/.bin/vitest run --run src/components/growth/tests/learning-path-cards.test.tsx src/pages/tests/growth.test.tsx

Note

Medium Risk
Updates backend analytics to derive and rank exemplar sessions for each learning-path recommendation, which can affect recommendation outputs and query performance. UI changes are straightforward but rely on new API fields being present and correctly shaped.

Overview
Learning-path recommendations now return a small set of ranked exemplar sessions (including workflow fingerprint/archetype, tools, duration, and estimated cost) so engineers can study concrete peer examples.

The Growth Skills tab now surfaces a new Learning Paths section, and LearningPathCards renders “Study Exemplars” links to /sessions/:id when exemplars are available. Backend refactors reuse a shared _build_patterns_and_exemplars pipeline to power both pattern sharing and exemplar selection, and tests cover the new API shape and UI rendering.

Written by Cursor Bugbot for commit 99d0897. This will update automatically on new commits.


@cursor cursor bot left a comment


Cursor Bugbot has reviewed your changes and found 1 potential issue.


Bugbot Autofix prepared a fix for the issue found in the latest run.

  • ✅ Fixed: Redundant heavyweight function call doubles DB load
    • Extracted pattern-building logic into a shared _build_patterns_and_exemplars helper so get_learning_paths reuses its already-queried data (sessions, engineers, facets, tools) instead of re-executing all queries via get_pattern_sharing.


Or push these changes by commenting:

@cursor push c79dd71b1f
Preview (c79dd71b1f)
diff --git a/src/primer/server/services/insights_service.py b/src/primer/server/services/insights_service.py
--- a/src/primer/server/services/insights_service.py
+++ b/src/primer/server/services/insights_service.py
@@ -647,11 +647,14 @@
     engineers = db.query(Engineer.id, Engineer.name).filter(Engineer.id.in_(engineer_ids)).all()
     eng_names = {eid: name for eid, name in engineers}
 
-    # Session types per session
+    # Facets (expanded columns for reuse in pattern/exemplar building)
     all_facets = (
         db.query(
             SessionFacets.session_id,
             SessionFacets.session_type,
+            SessionFacets.outcome,
+            SessionFacets.agent_helpfulness,
+            SessionFacets.brief_summary,
             SessionFacets.goal_categories,
         )
         .filter(SessionFacets.session_id.in_(session_ids_q))
@@ -659,7 +662,9 @@
     )
     session_to_type: dict[str, str] = {}
     session_to_goals: dict[str, list[str]] = {}
+    session_facets: dict[str, object] = {}
     for f in all_facets:
+        session_facets[f.session_id] = f
         if f.session_type:
             session_to_type[f.session_id] = f.session_type
         if f.goal_categories:
@@ -672,11 +677,13 @@
             if isinstance(cats, list):
                 session_to_goals[f.session_id] = cats
 
-    # Build session_id → engineer_id
+    # Build session_id → engineer_id and session_map
     session_to_engineer: dict[str, str] = {}
+    session_map: dict[str, object] = {}
     for eid, sess_list in eng_sessions.items():
         for s in sess_list:
             session_to_engineer[s.id] = eid
+            session_map[s.id] = s
 
     # Tool usages
     all_tools = (
@@ -685,7 +692,11 @@
         .all()
     )
     eng_tool_names: dict[str, set[str]] = defaultdict(set)
+    session_tools: dict[str, list[str]] = defaultdict(list)
+    session_tool_count: dict[str, int] = defaultdict(int)
     for tu in all_tools:
+        session_tools[tu.session_id].append(tu.tool_name)
+        session_tool_count[tu.session_id] += tu.call_count
         eid = session_to_engineer.get(tu.session_id)
         if eid:
             eng_tool_names[eid].add(tu.tool_name)
@@ -731,14 +742,58 @@
         team_skill_universe[f"tool:{tool}"] = cnt
 
     team_skill_count = len(team_skill_universe) if team_skill_universe else 1
-    pattern_sharing = get_pattern_sharing(
-        db,
-        team_id=team_id,
-        engineer_id=engineer_id if team_id is None else None,
-        start_date=start_date,
-        end_date=end_date,
+
+    # Workflow profiles and costs for exemplar derivation
+    session_workflows: dict[str, dict[str, object]] = {
+        row.session_id: {
+            "archetype": row.archetype,
+            "label": row.label,
+            "steps": list(row.steps or []),
+        }
+        for row in (
+            db.query(
+                SessionWorkflowProfile.session_id,
+                SessionWorkflowProfile.archetype,
+                SessionWorkflowProfile.label,
+                SessionWorkflowProfile.steps,
+            )
+            .filter(SessionWorkflowProfile.session_id.in_(session_ids_q))
+            .all()
+        )
+    }
+    session_costs: dict[str, float] = defaultdict(float)
+    for row in (
+        db.query(
+            ModelUsage.session_id,
+            ModelUsage.model_name,
+            func.sum(ModelUsage.input_tokens).label("input_tokens"),
+            func.sum(ModelUsage.output_tokens).label("output_tokens"),
+            func.sum(ModelUsage.cache_read_tokens).label("cache_read_tokens"),
+            func.sum(ModelUsage.cache_creation_tokens).label("cache_creation_tokens"),
+        )
+        .filter(ModelUsage.session_id.in_(session_ids_q))
+        .group_by(ModelUsage.session_id, ModelUsage.model_name)
+        .all()
+    ):
+        session_costs[row.session_id] += estimate_cost(
+            row.model_name,
+            row.input_tokens or 0,
+            row.output_tokens or 0,
+            row.cache_read_tokens or 0,
+            row.cache_creation_tokens or 0,
+        )
+
+    _, _, exemplar_sessions = _build_patterns_and_exemplars(
+        sessions=sessions,
+        session_map=session_map,
+        session_to_engineer=session_to_engineer,
+        eng_names=eng_names,
+        session_facets=session_facets,
+        session_tools=session_tools,
+        session_tool_count=session_tool_count,
+        session_workflows=session_workflows,
+        session_costs=session_costs,
     )
-    exemplar_sessions = pattern_sharing.exemplar_sessions
 
     paths: list[EngineerLearningPath] = []
     for eid in engineer_ids:
@@ -982,6 +1037,41 @@
             row.cache_creation_tokens or 0,
         )
 
+    patterns, bright_spots, exemplar_sessions = _build_patterns_and_exemplars(
+        sessions=sessions,
+        session_map=session_map,
+        session_to_engineer=session_to_engineer,
+        eng_names=eng_names,
+        session_facets=session_facets,
+        session_tools=session_tools,
+        session_tool_count=session_tool_count,
+        session_workflows=session_workflows,
+        session_costs=session_costs,
+    )
+
+    return PatternSharingResponse(
+        patterns=patterns,
+        bright_spots=bright_spots,
+        exemplar_sessions=exemplar_sessions,
+        total_clusters_found=len(patterns),
+        sessions_analyzed=len(sessions),
+    )
+
+
+def _build_patterns_and_exemplars(
+    *,
+    sessions: list,
+    session_map: dict[str, object],
+    session_to_engineer: dict[str, str],
+    eng_names: dict[str, str],
+    session_facets: dict[str, object],
+    session_tools: dict[str, list[str]],
+    session_tool_count: dict[str, int],
+    session_workflows: dict[str, dict[str, object]],
+    session_costs: dict[str, float],
+) -> tuple[list[SharedPattern], list[BrightSpot], list[ExemplarSession]]:
+    """Build pattern clusters, bright spots, and exemplar sessions from pre-loaded data."""
+
     def _make_approach(sid: str) -> EngineerApproach:
         s = session_map[sid]
         eid = session_to_engineer[sid]
@@ -1006,7 +1096,6 @@
         approaches = [_make_approach(sid) for sid in sids]
         eng_set = {a.engineer_id for a in approaches}
 
-        # Best approach: successful + shortest duration
         successful = [
             approach
             for approach in approaches
@@ -1021,7 +1110,6 @@
         successes = sum(1 for outcome in outcomes if is_success_outcome(outcome))
         sr = round(successes / len(outcomes), 3) if outcomes else None
 
-        # Insight
         insight_parts = [f"{len(eng_set)} engineers worked on {cluster_label}"]
         if best and avg_dur and avg_dur > 0:
             pct = round((1 - best.duration_seconds / avg_dur) * 100)
@@ -1063,8 +1151,8 @@
 
     # 2. Goal category clusters
     goal_groups: dict[str, list[str]] = defaultdict(list)
-    for f in all_facets:
-        cats = f.goal_categories
+    for f in session_facets.values():
+        cats = getattr(f, "goal_categories", None)
         if cats:
             if isinstance(cats, str):
                 try:
@@ -1076,7 +1164,6 @@
                     goal_groups[cat].append(f.session_id)
 
     for cat, sids in goal_groups.items():
-        # Filter to valid session ids
         valid_sids = [sid for sid in sids if sid in session_map]
         eng_set = {session_to_engineer[sid] for sid in valid_sids}
         if len(eng_set) >= 2 and len(valid_sids) >= 3:
@@ -1088,7 +1175,6 @@
         if s.project_name:
             project_groups[s.project_name].append(s.id)
 
-    # Track projects already covered by type+project clusters
     covered_projects: set[str] = set()
     for (_, proj), sids_tp in type_project_groups.items():
         if len({session_to_engineer[sid] for sid in sids_tp}) >= 2:
@@ -1099,13 +1185,12 @@
         if len(eng_set) >= 2 and proj not in covered_projects:
             patterns.append(_build_pattern(f"project:{proj}", "project", proj, sids))
 
-    # Sort by engineer_count desc
     patterns.sort(key=lambda p: p.engineer_count, reverse=True)
 
-    return PatternSharingResponse(
-        patterns=patterns,
-        bright_spots=_derive_bright_spots(patterns),
-        exemplar_sessions=_derive_exemplar_sessions(
+    return (
+        patterns,
+        _derive_bright_spots(patterns),
+        _derive_exemplar_sessions(
             patterns,
             session_map=session_map,
             session_facets=session_facets,
@@ -1113,8 +1198,6 @@
             session_workflows=session_workflows,
             session_costs=session_costs,
         ),
-        total_clusters_found=len(patterns),
-        sessions_analyzed=len(sessions),
     )


ccf commented Mar 16, 2026

@cursor push c79dd71

…pre-loaded data

Extract pattern-building and exemplar-derivation logic from
get_pattern_sharing into a shared _build_patterns_and_exemplars helper.

get_learning_paths now:
- Expands its existing facets query to include extra columns needed for
  pattern building (outcome, agent_helpfulness, brief_summary)
- Builds session_map, session_facets, session_tools, session_tool_count
  from data already being queried
- Runs only 2 additional queries (workflow profiles, model costs)
- Calls _build_patterns_and_exemplars directly instead of
  get_pattern_sharing

This eliminates 4 redundant DB queries (sessions, engineer names, facets,
tool usages) that were previously re-executed by get_pattern_sharing.
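The refactor's shape can be sketched as follows: the helper becomes a pure function over pre-loaded maps instead of taking a DB handle, so a caller that already holds the data pays no extra queries. This is a minimal illustration with hypothetical names and a deliberately simplified "cheapest session" stand-in for the real exemplar ranking, not the actual `_build_patterns_and_exemplars` code.

```python
# Sketch of the query-reuse pattern (names and ranking rule are illustrative).
# The helper does pure computation over dicts the caller already built,
# so no queries are re-executed inside it.

def build_patterns_and_exemplars(*, session_to_engineer, session_costs):
    # Group sessions (with costs) per engineer -- no DB access here.
    patterns: dict[str, list[tuple[str, float]]] = {}
    for sid, eng in session_to_engineer.items():
        patterns.setdefault(eng, []).append((sid, session_costs.get(sid, 0.0)))
    # Stand-in "exemplar" rule: the cheapest session per engineer.
    exemplars = {eng: min(rows, key=lambda r: r[1])[0] for eng, rows in patterns.items()}
    return patterns, exemplars


# A caller reuses maps it already queried once, instead of calling a
# heavyweight function that would re-run the same queries.
session_to_engineer = {"s1": "e1", "s2": "e1"}
session_costs = {"s1": 0.5, "s2": 0.2}
patterns, exemplars = build_patterns_and_exemplars(
    session_to_engineer=session_to_engineer,
    session_costs=session_costs,
)
```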

Applied via @cursor push command
