fix: large system message in Xpert Assistant #213
Conversation
Pull Request Overview
This PR fixes a bug where extremely large system messages (typically from extensive video transcripts) were causing Xpert Assistant failures by consuming the entire token limit and leaving no room for learner or LA messages.
- Implements proportional trimming for unit content to ensure balanced inclusion of all content types
- Calculates dynamic limits based on static content size to maintain room for chat messages
- Adds comprehensive test coverage for various content scenarios including large video transcripts
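The proportional-trimming idea summarized above can be sketched as follows. This is an illustrative sketch only, not the PR's actual code; `trim_unit_content` and `adjusted_unit_limit` are named after identifiers visible in the review snippets, and the dynamic budget calculation is assumed to happen elsewhere:

```python
def trim_unit_content(unit_content, adjusted_unit_limit):
    """Trim each content block in proportion to its share of the total text.

    Illustrative sketch only; the PR's real implementation lives in
    learning_assistant/api.py, and `adjusted_unit_limit` stands in for the
    dynamically calculated character budget.
    """
    total_chars = sum(len(block["content_text"]) for block in unit_content)
    trimmed_unit_content = []
    for block in unit_content:
        text = block["content_text"]
        if total_chars == 0 or not text:
            # Nothing to apportion: keep the block but empty its text.
            trimmed_unit_content.append(
                {"content_type": block["content_type"], "content_text": ""}
            )
            continue
        # Each block keeps a slice of the budget proportional to its size,
        # but never less than one character.
        allowed_chars = max(1, int((len(text) / total_chars) * adjusted_unit_limit))
        trimmed_unit_content.append(
            {"content_type": block["content_type"], "content_text": text[:allowed_chars]}
        )
    return trimmed_unit_content
```

For example, with a 300-character unit and a 90-character budget, a 200-character video transcript keeps 60 characters and a 100-character text block keeps 30, so every content type survives trimming instead of the transcript crowding everything else out.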
Reviewed Changes
Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.
| File | Description |
|---|---|
| learning_assistant/api.py | Implements proportional trimming logic for unit content handling |
| tests/test_api.py | Adds extensive test cases for content trimming scenarios |
Force-pushed 6f6523c to 6ccb48d
mraman-2U left a comment:
Update the release version here
Add the new version to the change log
Pull Request Overview
Copilot reviewed 4 out of 4 changed files in this pull request and generated 2 comments.
Context (learning_assistant/api.py):

```python
trimmed_unit_content.append({"content_type": ctype, "content_text": ""})
continue

allowed_chars = max(1, int((len(text) / total_chars) * adjusted_unit_limit))
```
Copilot AI commented on Oct 8, 2025:
Division by zero is possible if total_chars is 0. Although there's a check at line 159, this code could still execute if total_chars becomes 0 through other paths.
Suggested change:

```python
if total_chars == 0:
    allowed_chars = 1  # fallback to minimum allowed
else:
    allowed_chars = max(1, int((len(text) / total_chars) * adjusted_unit_limit))
```
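Copilot's suggested guard can be exercised in isolation. A minimal sketch, assuming the variables from the snippet above; `allowed_chars_for` is a hypothetical helper introduced here purely for illustration:

```python
def allowed_chars_for(text_len, total_chars, adjusted_unit_limit):
    """Hypothetical helper wrapping the guarded proportional calculation."""
    # Guard against division by zero when there are no characters to apportion.
    if total_chars == 0:
        return 1  # fallback to minimum allowed
    # Proportional share of the budget, floored at one character.
    return max(1, int((text_len / total_chars) * adjusted_unit_limit))
```

Note that the `max(1, ...)` floor also protects tiny blocks: a 1-character block in a 1000-character unit still keeps at least one character rather than being rounded down to zero.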
Context (tests/test_api.py):

```python
'A' * 200,  # Long string case to test trimming
[  # VIDEO content case
    {'content_type': 'VIDEO', 'content_text': f"Video transcript {i} " + ("A" * 200)} for i in range(10)
],
[  # TEXT content case
    {'content_type': 'TEXT', 'content_text': f"Paragraph {i} " + ("B" * 100)} for i in range(20)
],
[  # Mixed VIDEO + TEXT case
    {'content_type': 'VIDEO', 'content_text': "Video intro " + ("C" * 100)},
    {'content_type': 'TEXT', 'content_text': "Some explanation " + ("D" * 100)},
```
Copilot AI commented on Oct 8, 2025:
[nitpick] Creating large test strings with repeated characters could be memory inefficient. Consider using smaller test data or generating content on-demand within the test method.
Suggested change:

```python
'A' * 20,  # Long string case to test trimming (reduced size)
[  # VIDEO content case
    {'content_type': 'VIDEO', 'content_text': f"Video transcript {i} " + ("A" * 20)} for i in range(10)
],
[  # TEXT content case
    {'content_type': 'TEXT', 'content_text': f"Paragraph {i} " + ("B" * 10)} for i in range(20)
],
[  # Mixed VIDEO + TEXT case
    {'content_type': 'VIDEO', 'content_text': "Video intro " + ("C" * 10)},
    {'content_type': 'TEXT', 'content_text': "Some explanation " + ("D" * 10)},
```
Force-pushed aa3daa4 to 3caaa4e
Description:
This PR addresses a bug where excessively large system messages, typically caused by unit content with extensive video transcripts, result in Xpert message failures. In these scenarios, the system prompt consumes the entire token limit, leaving no room for learner or LA messages, thus breaking the interaction flow.
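The "dynamic limits" part of the fix can be illustrated with a small budget calculation. This is a hedged sketch under assumed names (`unit_content_budget` and its parameters are hypothetical, introduced only to show the idea of reserving room for chat messages):

```python
def unit_content_budget(system_limit_chars, static_prompt_chars, chat_reserve_chars):
    """Hypothetical sketch: characters left for unit content after reserving
    space for the static system prompt and for learner/LA chat messages."""
    # Whatever remains after the fixed reservations; never negative.
    return max(0, system_limit_chars - static_prompt_chars - chat_reserve_chars)
```

The key property is that the chat reservation is subtracted first, so even a unit dominated by a huge video transcript cannot consume the space needed for the conversation itself.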
Resolution:
Unit content is now trimmed proportionally by content type, and the unit-content character limit is calculated dynamically from the size of the static content, so learner and LA messages always retain room within the token limit.
Test Coverage:
All tests and lint checks pass.
Results (1.32s):
55 passed
2U Private Jira Link
Jira ticket: https://2u-internal.atlassian.net/browse/COSMO2-14