Open
Labels
enhancement (New feature or request)
Description
Priority Level
Medium (Nice to have)
Is your feature request related to a problem? Please describe.
Currently, a failed parse (e.g. from LLMCodeColumnConfig) produces a warning like the following, and the record is omitted from the dataset:
[WARNING] ⚠️ Generation for record at index N failed. Will omit this record from the dataset.
|----------
| Cause: The provided output schema was unable to be parsed from model 'model' responses while running generation for column 'column'.
| Solution: This is most likely temporary as we make additional attempts. If you continue to see more of this, simplify or modify the output schema for structured output and try again. If you are attempting token-intensive tasks like generations with high-reasoning effort, ensure that max_tokens in the model config is high enough to reach completion.
|----------
The trace (if set up to be exported) is also lost.
Describe the solution you'd like
Ideally, users should have the option to keep these records in the dataset, with the JSON output parsed to null. Three reasons:
- Some use cases (with the ORDERED strategy) expect one output for every seed; dropping records violates that.
- The drop strategy means losing other work done within the record, which might be useful to the user.
- The trace is very useful in debugging parse failures.
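To make the request concrete, here is a minimal sketch of the behaviour I have in mind. It is not the library's implementation: `generate_records`, the `on_parse_failure` option, and the `<column>__raw` field are all hypothetical names invented for illustration. The point is only that a "null" mode keeps one output per seed and preserves the rest of the record, including the raw response.

```python
import json
from typing import Any, Callable


def generate_records(
    seeds: list[dict[str, Any]],
    call_model: Callable[[dict[str, Any]], str],
    column: str,
    on_parse_failure: str = "drop",  # hypothetical option: "drop" or "null"
) -> list[dict[str, Any]]:
    """Generate one structured column per seed (illustrative only)."""
    records = []
    for seed in seeds:
        raw = call_model(seed)
        try:
            parsed = json.loads(raw)
        except json.JSONDecodeError:
            if on_parse_failure == "drop":
                # Current behaviour as described above: the record is omitted.
                continue
            # Requested behaviour: keep the record, set the column to None.
            parsed = None
        # Keep the raw response alongside the parsed value for debugging.
        records.append({**seed, column: parsed, f"{column}__raw": raw})
    return records
```

With the "null" mode, the output list has the same length and order as the seed list, so downstream code that assumes one output per seed keeps working.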
Describe alternatives you've considered
Writing my own parser that also accepts arbitrary text and parses it to null. This is quite complicated, though.
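For reference, the lenient-parser idea looks roughly like the sketch below. It uses only the standard-library `json` module; `parse_json_or_none` is a hypothetical helper name, and wiring it into a column config is not shown because it depends on the library's parser hooks. It returns the first JSON object or array found in the text, or None when nothing parses.

```python
import json
from typing import Any


def parse_json_or_none(text: str) -> Any:
    """Return the first JSON object/array embedded in text, else None."""
    decoder = json.JSONDecoder()
    for start, ch in enumerate(text):
        # Only try to decode at positions that could open an object or array.
        if ch not in "{[":
            continue
        try:
            value, _end = decoder.raw_decode(text, start)
        except json.JSONDecodeError:
            continue  # Not valid JSON from here; try the next candidate.
        return value
    return None
```

Even this small version has to make judgment calls (which candidate to accept, how to treat truncated output), which is part of why a built-in option would be preferable.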
Additional context
No response