@femave (Contributor) commented Nov 8, 2025

Integration between Braintrust and Evalion.

  • Created a new Jupyter notebook for evaluating voice AI agents using Evalion and Braintrust.
  • Added assets including screenshots for Braintrust dataset, experiment, and playground interfaces.

femave and others added 10 commits November 8, 2025 14:18
- Created a new Jupyter notebook for evaluating voice AI agents using Evalion and Braintrust.
- Added assets including screenshots for Braintrust dataset, experiment, and playground interfaces.
- Updated execution counts for code cells to reflect the order of execution.
- Added output logs for package installations to provide feedback on dependencies.
- Set environment variables with actual API keys for demonstration purposes.
- Improved latency metric extraction and scoring logic for better evaluation accuracy.
- Added detailed output messages during evaluation to track progress and results.
- Updated documentation to clarify the evaluation lifecycle and results structure.

2 participants