Enable Unrestricted LLM Output with Parsing + Add One-Shot Anomaly Detection Support #39
Conversation
sarahmish
left a comment
Thanks @scherkao31! Great features to add! I have a few minor comments.
...ves/jsons/sigllm.primitives.prompting.timeseries_preprocessing.rolling_window_sequences.json
sigllm/primitives/transformation.py
```python
if normal:
    # Handle as single time series (window_size, 1)
    return _as_string(X)
else:
    # Handle as multiple windows (num_windows, window_size, 1)
    results = list(map(_as_string, X))
    return np.array(results)
```
I'm wondering if we should have it more generalized, regardless of `normal` or not.

```python
if X.ndim < 2:
    return _as_string(X)
else:
    ...
```

What do you think?
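The dimensionality-based dispatch suggested above could be sketched like this. This is a minimal sketch, not the repo's implementation: `_as_string` here is a hypothetical stand-in for the existing helper, and it assumes a single series arrives as a 1-D array after squeezing the trailing axis.

```python
import numpy as np


def _as_string(x):
    # Hypothetical stand-in: render one window as a comma-separated string.
    return ','.join(str(v) for v in np.asarray(x).flatten())


def format_as_string(X):
    # Dispatch on dimensionality instead of a normal/single flag:
    # a 1-D input is a single series, anything higher is a batch of windows.
    if X.ndim < 2:
        return _as_string(X)

    results = list(map(_as_string, X))
    return np.array(results)
```

With this shape check, the caller no longer needs to pass a flag describing the input, which removes one hyperparameter from the primitive.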
```python
def parse_anomaly_response(X):
    """Parse a list of lists of LLM responses to extract anomaly values and format them as strings.

    Args:
        X (List[List[str]]): List of lists of response texts from the LLM in the format
            "Answer: no anomalies" or "Answer: [val1, val2, ..., valN]"
    Returns:
        List[List[str]]: List of lists of parsed responses where each element is either
            "val1,val2,...,valN" if anomalies are found,
            or empty string if no anomalies are present
    """
```
reformat docstrings to keep the arg name and description on separate lines:

```python
"""Extract anomalies from LLM response.

Parse a list of lists of LLM responses to extract anomaly
values and format them as strings.

Args:
    X (List[List[str]]):
        List of lists of response texts from the LLM in the format
        "Answer: no anomalies" or "Answer: [val1, val2, ..., valN]".

Returns:
    List[List[str]]:
        List of lists of parsed responses where each element is either
        "val1,val2,...,valN" if anomalies are found, or empty string if
        no anomalies are present.
"""
```
The first line of the docstring should be shorter. I already have a description mentioned in my comment above.
sigllm/primitives/transformation.py
```python
        or empty string if no anomalies are present
    """


def parse_single_response(text: str) -> str:
```
I would remove type checking here since it's a private function. I also recommend removing comments that are not necessary.
For private functions, we normally make them start with `_`, so that would be:

```python
def _parse_single_response(text):
```

You still have `str` typing in your implementation.
sarahmish
left a comment
Thanks @scherkao31 for making the changes! Some previous comments were not addressed so I kept them as unresolved.
In addition, I want to note:
- `mistral_detector` should be modified to set `restrict_tokens` to True.
- the PR is missing the hyperparameter settings for MSL
- do you plan to include hyperparameters for Yahoo?
sigllm/data.py
| """Load the CSV with the given name from S3. | ||
|
|
||
| If the CSV has never been loaded before, it will be downloaded | ||
| from the [d3-ai-orion bucket](https://d3-ai-orion.s3.amazonaws.com) or |
update d3-ai-orion to sintel-sigllm
sigllm/data.py
```python
    directory, and then returned.

    Otherwise, if it has been downloaded and cached before, it will be directly
    loaded from the `orion/data` folder without contacting S3.
```
update orion/data to sigllm/data
sigllm/data.py
```python
    If a `test_size` value is given, the data will be split in two parts
    without altering its order, making the second one proportionally as
    big as the given value.
```
The `test_size` argument doesn't make sense in `load_normal`; I would remove it. Please make sure the code, docstrings, and function arguments are not using this variable.
sigllm/data.py
```python
        use_timestamps (bool):
            If True, start and end are interpreted as timestamps.
            If False, start and end are interpreted as row indices.
```
I would not make this a function argument, but rather I would check if start or end are of integer or timestamp type. See comment below.
sigllm/data.py
```python
    # Handle slicing if start or end is specified
    if start is not None or end is not None:
        if use_timestamps:
            # If start and end are timestamps
            mask = (data['timestamp'] >= start) & (data['timestamp'] <= end)
            data = data[mask]
        else:
            # If start and end are indices
            data = data.iloc[start:end]
```
A couple of things:
- we don't know the exact name of the timestamp column, so you should use the variable.
- the user typically is not keen on specifying so many parameters. We can check if `start` or `end` are integers, then we can use them to slice. Otherwise, assume that they are timestamps.
- check that the edge cases `start=None` while `end` has a value, and vice versa, work.
```python
if start is not None or end is not None:
    if isinstance(start, int) or isinstance(end, int):
        # If start and end are indices
        data = data.iloc[start:end]
    else:
        # If start and end are timestamps
        mask = (data[timestamp_column] >= start) & (data[timestamp_column] <= end)
        data = data[mask]
```
sigllm/primitives/transformation.py
```python
        or empty string if no anomalies are present
    """


def parse_single_response(text: str) -> str:
```
you still have str typing in your implementation
```python
def parse_anomaly_response(X):
    """Parse a list of lists of LLM responses to extract anomaly values and format them as strings.

    Args:
        X (List[List[str]]): List of lists of response texts from the LLM in the format
            "Answer: no anomalies" or "Answer: [val1, val2, ..., valN]"
    Returns:
        List[List[str]]: List of lists of parsed responses where each element is either
            "val1,val2,...,valN" if anomalies are found,
            or empty string if no anomalies are present
    """
```
The first line of the docstring should be shorter. I already have a description mentioned in my comment above.
```python
        Additional padding token to forecast to reduce short horizon predictions.
        Default to `0`.
    restrict_tokens (bool):
        Whether to restrict tokens or not. Default to `True`.
```
I still don't see a change to mistral_detector
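The requested change amounts to overriding the `restrict_tokens` hyperparameter for the `mistral_detector` pipeline. A minimal sketch of such an override is shown below; the primitive key is a placeholder assumption, only the `restrict_tokens` flag itself comes from this discussion.

```python
# Hypothetical hyperparameter override for the mistral_detector pipeline.
# The primitive path used as the key is an assumption, not the repo's
# actual primitive name; only `restrict_tokens` is taken from the review.
hyperparameters = {
    'sigllm.primitives.prompting.huggingface.HF#1': {
        'restrict_tokens': True,
    },
}
```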
| "user_message": "Below is a [SEQUENCE], please return the anomalies in that sequence in [RESPONSE]. Only return the numbers. [SEQUENCE]" | ||
| "system_message": "You are an expert in time series analysis. Your task is to detect anomalies in time series data.", | ||
| "user_message": "Below is a [SEQUENCE], please return the anomalies in that sequence in [RESPONSE]. Only return the numbers. [SEQUENCE]", | ||
| "user_message_2": "Below is a [SEQUENCE], analyze the following time series and identify any anomalies. If you find anomalies, provide their values in the format [first_anomaly, ..., last_anomaly]. If no anomalies are found, respond with 'no anomalies'. Be concise, do not write code, do not permorm any calculations, just give your answers as told.: [SEQUENCE]", |
typo permorm -> perform
…tection Support (#39)

* Core Changed to get normal behavior in pipeline
* Transformation changed
* anomalies.py changed
* Hugginface.py changed : no restrictions token, and also has normal as input if 1-shot
* Timeseries preprocessing.py
* jsons files added for primitives
* pipelines 0shot and 1shot added
* add boolean for restrict_tokens in HF
* good messages.json for prompt
* Added load_normal in sigllm.data
* Fixed load_normal in sigllm.data
* Fixed lint format
* Fixed lint format Ruff
* Fixed from review Sarah
* Fixed lint format after working on Sarah's reviews
* Dataset prompter parameters
* .jons removed from input names in 1_shot pipeline.json
* fix PR issues & add unittests
* add unittests for parse_anomaly_response
* remove unused functions
* add new functionality tests
* update ubuntu image
* change normal->single
* fix lint
* swap normal -> single

Co-authored-by: Salim Cherkaoui <salim31@dai-desk32.lids.mit.edu>
Co-authored-by: Sarah Alnegheimish <sarahalnegheimish@gmail.com>
* init benchmark
* fix details
* fix lint
* add benchmark tests
* paper benchmark results (#41)
* Enable Unrestricted LLM Output with Parsing + Add One-Shot Anomaly Detection Support (#39)
* support load_normal

Co-authored-by: scherkao31 <93837850+scherkao31@users.noreply.github.com>
Co-authored-by: Salim Cherkaoui <salim31@dai-desk32.lids.mit.edu>
This PR introduces several changes: