testing framework for transformRequest+Response #72
Conversation
{
  "error": "400 {\"type\":\"error\",\"error\":{\"type\":\"invalid_request_error\",\"message\":\"`max_tokens` must be greater than `thinking.budget_tokens`. Please consult our documentation at https://docs.claude.com/en/docs/build-with-claude/extended-thinking#max-tokens-and-context-window-size\"},\"request_id\":\"req_011CXasVyLu26rs4f6bS7DRJ\"}",
  "name": "Error"
}
Only error I found so far. This seems to be valid, since in our case we define max_tokens = 100 with high reasoning.
Not sure if we want to throw our own error or something. https://github.com/braintrustdata/lingua/blob/main/payloads/cases/simple.ts#L146
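If we do go the route of throwing our own error, a rough pre-flight check could look like the sketch below. The helper and type here are hypothetical, not existing lingua code; the field names follow the Anthropic Messages API (max_tokens, thinking.budget_tokens).

```ts
// Hypothetical pre-flight guard; field names follow the Anthropic Messages API.
interface ThinkingRequest {
  max_tokens: number;
  thinking?: { type: "enabled"; budget_tokens: number };
}

function assertThinkingBudgetFits(req: ThinkingRequest): void {
  if (req.thinking && req.max_tokens <= req.thinking.budget_tokens) {
    throw new Error(
      `max_tokens (${req.max_tokens}) must be greater than ` +
        `thinking.budget_tokens (${req.thinking.budget_tokens})`,
    );
  }
}
```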
map.insert(
    "input_tokens_details".into(),
    serde_json::json!({ "cached_tokens": self.prompt_cached_tokens.unwrap_or(0) }),
);
Caught this from the validate_response_json JS binding: Responses require input_tokens_details and output_tokens_details.
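For reference, this is roughly the usage shape that appears to be expected, sketched as a TypeScript type based on the OpenAI Responses API usage fields; treat the exact set of required keys as an assumption rather than the validator's definition.

```ts
// Sketch of a Responses-style usage object with the *_details fields present.
// Which fields the validator actually requires is an assumption here.
interface ResponsesUsage {
  input_tokens: number;
  output_tokens: number;
  total_tokens: number;
  input_tokens_details: { cached_tokens: number };
  output_tokens_details: { reasoning_tokens: number };
}
```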
}
/* eslint-enable @typescript-eslint/consistent-type-assertions */

const isParamCase = (name: string) => name.endsWith("Param");
Temporary; I didn't want to explode the diff, so doing this here.
@@ -0,0 +1,35 @@
{
anthropic_to_chatcompletions means an Anthropic payload sent against a chat completions model; we save the actual chat completions response payload so we don't have to incur the LLM cost again.
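As a reading aid, the fixture lookup this naming scheme implies would look roughly like the sketch below. The helper name and exact path handling are illustrative, following the transforms/{src}_to_{tgt}/{case}.json layout from the diagram in the PR description.

```ts
import { readFileSync } from "node:fs";
import { join } from "node:path";

// Illustrative helper: load the saved provider response for a source/target
// pair and case, e.g. transforms/anthropic_to_chatcompletions/<case>.json.
function loadSavedResponse(
  source: string,
  target: string,
  caseName: string,
): unknown {
  const path = join("transforms", `${source}_to_${target}`, `${caseName}.json`);
  return JSON.parse(readFileSync(path, "utf8"));
}
```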
} catch (e) {
  // Check if this is an expected error (known provider incompatibility)
  const errorReason = transformErrors[pairKey]?.[caseName];
  if (errorReason) {
we should check that the errorReason matches the actual reason.
the reason listed in transform_errors.json is a semantic reason for the reader, not the actual error.
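If we ever want to tighten this, one option (a sketch only, with a hypothetical shape for the transform_errors.json entries) is to keep the semantic reason for the reader and optionally an expected fragment of the real error message to assert against:

```ts
// Hypothetical entry shape: human-readable reason plus an optional
// substring of the actual error message.
interface ExpectedTransformError {
  reason: string; // semantic reason for the reader
  messageIncludes?: string; // optional fragment of the real error message
}

function assertExpectedError(e: unknown, expected: ExpectedTransformError): void {
  const message = e instanceof Error ? e.message : String(e);
  if (expected.messageIncludes && !message.includes(expected.messageIncludes)) {
    throw new Error(
      `Expected error containing "${expected.messageIncludes}", got: ${message}`,
    );
  }
}
```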
// Explicitly skipped tests (add here only if intentionally not supported)
// Format: "source_to_target_caseName"
const SKIPPED_TESTS = new Set<string>([
  // Add entries here with comments explaining why
do we need that if the list is currently empty?
This is a remnant; deleting.
This PR adds a testing framework for transformRequest and transformResponse. It is specifically an offline testing framework: we test lingua transformations against saved snapshots.

We use the existing cases and the WASM bindings generated in the previous PR (`transform_request`, `transform_response`, and `validate_*_json`) to verify valid transforms and no regressions.

At a high level, my mental model is:

- `coverage-report` ensures internal consistency between the Universal model & all the providers.
- `transforms` ensures external compatibility, with OpenAPI schema validations during every transform and using the actual SDK post-transform.

Diagram:
Phase 1: Capture (one-time, requires API keys)

  getCaseForProvider(caseName, source)
                     │
                     ▼
  ┌─────────────────────────────────────┐
  │ transformAndValidateRequest()       │
  │ • transform_request() ──────────────┼──► validate_*_request()
  └──────────────────┬──────────────────┘
                     │
                     ▼
  ┌─────────────────────────────────────┐
  │ callProvider(target, request)       │
  │ • openai.chat.completions.create()  │
  │ • anthropic.messages.create()       │
  └──────────────────┬──────────────────┘
                     │
                     ▼
        validate_*_response()
                     │
                     ▼
        writeFileSync(path, response)
        transforms/{src}_to_{tgt}/{case}.json

Phase 2: Test (CI, no API calls)

  getCaseForProvider(caseName, source)
                     │
                     ▼
  ┌─────────────────────────────────────┐
  │ transformAndValidateRequest()       │
  │ • transform_request() ──────────────┼──► validate_*_request()
  └──────────────────┬──────────────────┘
                     │
                     ▼
        toMatchSnapshot("request")
                     │
                     ▼
  ┌─────────────────────────────────────┐
  │ loadAndValidateResponse()           │
  │ • readFileSync(path) ───────────────┼──► validate_*_response()
  └──────────────────┬──────────────────┘
                     │
                     ▼
  ┌─────────────────────────────────────┐
  │ transformResponseData()             │
  │ • transform_response() ─────────────┼──► validate_*_response()
  └──────────────────┬──────────────────┘
                     │
                     ▼
        toMatchSnapshot("response")
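In code, the Phase 2 flow per source/target pair and case looks roughly like the sketch below. Vitest/Jest-style globals are assumed, the helper signatures are inferred from the diagram, and the surrounding loops over pairs and cases are elided.

```ts
// Sketch of the offline (Phase 2) test flow for one source→target pair and case.
// Helper signatures and argument order are assumptions based on the diagram.
test(`${source}_to_${target}_${caseName}`, () => {
  const sourceRequest = getCaseForProvider(caseName, source);

  // transform_request() + validate_*_request(), then snapshot the result.
  const request = transformAndValidateRequest(sourceRequest, source, target);
  expect(request).toMatchSnapshot("request");

  // readFileSync(transforms/{src}_to_{tgt}/{case}.json) + validate_*_response().
  const savedResponse = loadAndValidateResponse(source, target, caseName);

  // transform_response() back toward the source format + validation, then snapshot.
  const response = transformResponseData(savedResponse, target, source);
  expect(response).toMatchSnapshot("response");
});
```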