fix: increase maxTokens and prevent markdown wrapping in timeline lore JSON generation #76
Conversation
fix: increase maxTokens and prevent markdown wrapping in timeline lore JSON generation
Timeline lore summary generation was failing due to two issues:
1. maxTokens limit of 480 was too small, causing JSON responses to be truncated
2. Model was returning markdown-wrapped JSON (```json...```), adding overhead
Changes:
- Increased maxTokens from 480 to 1000 to allow complete JSON responses
- Updated prompt to explicitly request raw JSON without markdown formatting
- Added clear instruction to start with { and end with }
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude <noreply@anthropic.com>
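For illustration, the "raw JSON" instructions described above might look like the following sketch; the exact prompt text lives in plugin-nostr/lib/service.js and may differ:

```js
// Illustrative only; not the PR's exact prompt wording.
const jsonFormatRules = [
  'OUTPUT FORMAT: RAW JSON ONLY.',
  'Do NOT wrap the response in markdown code fences.',
  'Your response must start with { and end with }.',
].join('\n');
```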
Walkthrough
Enhanced JSON output formatting in prompts with explicit raw JSON boundaries. Increased the token budget for timeline summary generation from 480 to 1000 tokens. Added a JSON extraction and normalization parsing flow to post-process model outputs.
Sequence Diagram(s)

```mermaid
sequenceDiagram
    participant Model as Language Model
    participant Parse as JSON Parser
    participant Normalize as Normalizer
    Note over Model: Model output<br/>(raw JSON format<br/>starts with { ends with })
    Model->>Parse: Raw JSON string
    activate Parse
    Parse->>Parse: _extractJsonObject()
    Parse-->>Normalize: Extracted object
    deactivate Parse
    activate Normalize
    Normalize->>Normalize: _normalizeTimelineLoreDigest()
    Normalize->>Normalize: Adjust tags, insights<br/>based on ranked context
    Normalize-->>Normalize: Return normalized digest
    deactivate Normalize
    Note over Normalize: Final processed output<br/>(normalized fields)
```
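A minimal sketch of that parse-and-normalize flow, assuming the helper names from the walkthrough (the real implementations live in plugin-nostr/lib/service.js and may differ):

```js
// Illustrative sketch only; _extractJsonObject / _normalizeTimelineLoreDigest
// are the walkthrough's names, with hypothetical bodies here.
function parseTimelineLoreResponse(rawText, normalizeDigest) {
  // Drop any stray markdown fences, then take the outermost {...} span.
  const cleaned = String(rawText).replace(/```(?:json)?/g, '').trim();
  const start = cleaned.indexOf('{');
  const end = cleaned.lastIndexOf('}');
  if (start === -1 || end <= start) return null; // no JSON object found
  try {
    const parsed = JSON.parse(cleaned.slice(start, end + 1));
    return normalizeDigest(parsed); // e.g. _normalizeTimelineLoreDigest
  } catch {
    return null; // truncated or malformed JSON
  }
}
```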
Estimated code review effort: 🎯 3 (Moderate) | ⏱️ ~20 minutes
Pull Request Overview
This PR refines the prompt instructions for JSON response generation in the Nostr plugin's summary creation service. The changes aim to prevent the language model from wrapping JSON responses in markdown code fences.
- Updated prompt instructions to explicitly prohibit markdown formatting
- Increased token limit from 480 to 1000 to accommodate responses
- Added clearer emphasis on raw JSON output format
```diff
   type,
   prompt,
-  { maxTokens: 480, temperature: 0.45 },
+  { maxTokens: 1000, temperature: 0.45 },
```
Copilot AI (Oct 31, 2025):
The maxTokens jump from 480 to 1000 is a 108% increase, which could significantly impact API costs and response time. Consider whether the full 1000 tokens are necessary, or whether a more conservative value (e.g., 600-750) would suffice, since JSON responses typically don't require such large limits.
Suggested change:

```diff
-  { maxTokens: 1000, temperature: 0.45 },
+  { maxTokens: 750, temperature: 0.45 },
```
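If the right ceiling is unclear, another option (purely a sketch, not something proposed in this PR) is to make the budget configurable, following the getSetting pattern used elsewhere in the service; the setting name here is hypothetical:

```js
// Hypothetical setting name; clamped so misconfiguration can't reintroduce truncation.
const configured = Number(this.runtime?.getSetting?.('CTX_TIMELINE_LORE_MAX_TOKENS') ?? '1000');
const loreMaxTokens = Math.min(2000, Math.max(480, Number.isFinite(configured) ? configured : 1000));
// ...then pass { maxTokens: loreMaxTokens, temperature: 0.45 } to the model call.
```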
Actionable comments posted: 0
Caution
Some comments are outside the diff and can’t be posted inline due to platform limitations.
⚠️ Outside diff range comments (1)
plugin-nostr/lib/service.js (1)
6859-6865: Undefined timelineLoreMaxPostsInPrompt can break prompt sizing
timelineLoreMaxPostsInPrompt is used to cap posts in the prompt but is never initialized in the constructor. Math.min(undefined, n) → NaN; slice(-NaN) coerces to 0, likely pulling the entire batch and ballooning prompts.
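The coercion chain is easy to confirm in a Node REPL:

```js
Math.min(undefined, 30);   // NaN: undefined coerces to NaN
[1, 2, 3, 4].slice(-NaN);  // [1, 2, 3, 4]: NaN coerces to 0, so the whole array comes back
```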
Fix by setting a sane default in the constructor (and optionally guard where used):
```diff
@@
   this.timelineLoreBuffer = [];
   this.timelineLoreMaxBuffer = 120;
-  this.timelineLoreBatchSize = 50;
+  this.timelineLoreBatchSize = 50;
+  // Max posts we include in the lore prompt (configurable)
+  this.timelineLoreMaxPostsInPrompt = Math.max(
+    8,
+    Math.min(
+      60,
+      Number(this.runtime?.getSetting?.('CTX_TIMELINE_LORE_MAX_POSTS_IN_PROMPT') ?? '30')
+    )
+  );
```

And add a defensive guard at the use site:
```diff
-  const maxPostsInPrompt = Math.min(this.timelineLoreMaxPostsInPrompt, batch.length);
+  const cap = Number.isFinite(this.timelineLoreMaxPostsInPrompt) ? this.timelineLoreMaxPostsInPrompt : 30;
+  const maxPostsInPrompt = Math.min(Math.max(8, cap), Math.max(1, batch.length));
```
🧹 Nitpick comments (2)
plugin-nostr/lib/service.js (2)
6996-7045: Brace-scan JSON extraction can mis-detect inside strings (optional hardening)
The current approach counts all "{" / "}" without considering quoted strings or escapes; rare, but it can misparse. Consider a lightweight state machine that ignores braces inside strings, as sketched below.
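A sketch of such a state machine (illustrative, with a hypothetical helper name, not the PR's code):

```js
// Illustrative string-aware brace scan; hypothetical helper, not the PR's code.
// Returns the first balanced top-level {...} substring, or null if none is found.
function extractJsonObjectSafe(text) {
  let depth = 0;
  let start = -1;
  let inString = false;
  let escaped = false;
  for (let i = 0; i < text.length; i++) {
    const ch = text[i];
    if (inString) {
      // Inside a JSON string: track escapes, exit on an unescaped closing quote.
      if (escaped) escaped = false;
      else if (ch === '\\') escaped = true;
      else if (ch === '"') inString = false;
      continue;
    }
    if (ch === '"') { inString = true; continue; }
    if (ch === '{') {
      if (depth === 0) start = i;
      depth++;
    } else if (ch === '}' && depth > 0) {
      depth--;
      if (depth === 0) return text.slice(start, i + 1);
    }
  }
  return null; // unbalanced braces or no object
}
```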
6949-6956: Add character budget guard to prevent excessive prompt sizes beyond post count limits
timelineLoreMaxPostsInPrompt already restricts post quantity, but large posts can still balloon the prompt beyond provider limits. The suggested character budget (6000 chars for the posts section) is a practical safeguard. While a warning exists, enforcing the budget upfront is cleaner:

```diff
@@
-      const postLines = recentBatch.map((item, idx) => {
+      let remaining = 6000; // rough prompt budget for posts section
+      const postLines = [];
+      for (let idx = 0; idx < recentBatch.length; idx++) {
+        const item = recentBatch[idx];
         const shortAuthor = item.pubkey ? `${item.pubkey.slice(0, 8)}…` : 'unknown';
         const cleanContent = this._stripHtmlForLore(item.content || '');
         const rationale = this._coerceLoreString(item.rationale || 'signal');
         const signalLine = this._coerceLoreStringArray(item.metadata?.signals || [], 4).join('; ') || 'no explicit signals';
-
-        return [
+        const block = [
           `[#${idx + 1}] Author: ${shortAuthor} • Score: ${typeof item.score === 'number' ? item.score.toFixed(2) : 'n/a'} • Importance: ${item.importance}`,
           `CONTENT: ${cleanContent}`,
           `RATIONALE: ${rationale}`,
           `SIGNALS: ${signalLine}`,
-        ].join('\n');
-      }).join('\n\n');
+        ].join('\n');
+        if (remaining - block.length < 0) break;
+        remaining -= block.length + 2;
+        postLines.push(block);
+      }
+      const postLinesStr = postLines.join('\n\n');
@@
-POSTS TO ANALYZE (${recentBatch.length} posts):
-${postLines}
+POSTS TO ANALYZE (${postLines.length} posts):
+${postLinesStr}
```
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (1)
plugin-nostr/lib/service.js (2 hunks)
🧰 Additional context used
🧬 Code graph analysis (1)
plugin-nostr/lib/service.js (2)
plugin-nostr/lib/generation.js (1)
raw (7-7)
plugin-nostr/debug-text-generation.js (1)
raw (118-118)
🔇 Additional comments (1)
plugin-nostr/lib/service.js (1)
6929-6947: Prompt hardening for RAW JSON is solid
Clear "RAW JSON ONLY" and explicit "start with { / end with }" constraints reduce fence/overhead issues and improve parse reliability. LGTM.