Use default agent preset for integration tests #1565
Update integration tests to use the default agent preset from openhands.tools.preset.default so tests validate the same agent configuration shipped to production (GUI/CLI).

Changes:
- BaseIntegrationTest now uses get_default_tools() and get_default_condenser() by default, with an enable_browser property for tests that need the browser
- Functional tests (t01-t08) now inherit default tools from the base class
- t05 and t06 (browsing tests) enable the browser via the enable_browser property
- t09 keeps its custom condenser for token-based condensation testing
- behavior_helpers.py uses get_default_tools() for behavior tests
- b05 uses default tools from the base class

This ensures integration tests validate the exact agent configuration that will be used in production.

Fixes #372

Co-authored-by: openhands <openhands@all-hands.dev>
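The base-class pattern described in the change list can be sketched as follows. This is a minimal, self-contained illustration, not the repository's code: get_default_tools() and get_default_condenser() below are local stand-ins for the helpers in openhands.tools.preset.default (their real signatures and return types are not shown in this PR text), and the tool names, the tools and condenser properties, and the two subclasses are hypothetical.

```python
def get_default_tools(enable_browser: bool = False) -> list[str]:
    """Stand-in for the preset helper; the real one is assumed to return tool objects."""
    tools = ["terminal", "file_editor", "task_tracker"]  # hypothetical tool names
    if enable_browser:
        tools.append("browser")
    return tools


def get_default_condenser() -> dict:
    """Stand-in for the preset condenser factory (hypothetical return value)."""
    return {"kind": "default"}


class BaseIntegrationTest:
    """Every test gets the default preset unless it overrides a property."""

    @property
    def enable_browser(self) -> bool:
        # The browser stays off unless a test opts in.
        return False

    @property
    def tools(self) -> list[str]:
        return get_default_tools(enable_browser=self.enable_browser)

    @property
    def condenser(self) -> dict:
        return get_default_condenser()


class BrowsingTest(BaseIntegrationTest):
    """Hypothetical browsing test (in the spirit of t05/t06): opts in to the browser."""

    @property
    def enable_browser(self) -> bool:
        return True


class TokenCondenserTest(BaseIntegrationTest):
    """Hypothetical test (in the spirit of t09): keeps a custom condenser."""

    @property
    def condenser(self) -> dict:
        return {"kind": "token_based"}
```

With this shape, a test that needs nothing special defines no tool setup at all, so a change to the default preset reaches every test automatically; only the exceptions (browser, custom condenser) carry an override.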
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly.
🧪 Integration Tests Results
Overall Success Rate: 96.0%
📁 Detailed Logs & Artifacts
Click the links below to access detailed agent/LLM logs showing the complete reasoning process for each model. On the GitHub Actions page, scroll down to the 'Artifacts' section to download the logs.
📊 Summary
📋 Detailed Results
litellm_proxy_mistral_devstral_2512
Skipped Tests:
Failed Tests:
litellm_proxy_moonshot_kimi_k2_thinking
Skipped Tests:
litellm_proxy_vertex_ai_gemini_3_pro_preview
litellm_proxy_deepseek_deepseek_chat
Skipped Tests:
litellm_proxy_claude_sonnet_4_5_20250929
litellm_proxy_gpt_5.1_codex_max
Skipped Tests:
Summary
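Update integration tests to use the default agent preset from openhands.tools.preset.default so tests validate the same agent configuration shipped to production (GUI/CLI).

Changes:
- BaseIntegrationTest now uses get_default_tools() and get_default_condenser() by default, with an enable_browser property for tests that need browser support
- Functional tests (t01-t08) now inherit default tools from the base class
- t05 and t06 (browsing tests) enable the browser via the enable_browser property
- t09 keeps its custom condenser for token-based condensation testing
- behavior_helpers.py uses get_default_tools() for behavior tests
- b05 uses default tools from the base class

This ensures integration tests validate the exact agent configuration that will be used in production.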
Fixes #372
Agent Server images for this PR
• GHCR package: https://github.com/OpenHands/agent-sdk/pkgs/container/agent-server
Variants & Base Images
- eclipse-temurin:17-jdk
- nikolaik/python-nodejs:python3.12-nodejs22
- golang:1.21-bookworm

Pull (multi-arch manifest)
# Each variant is a multi-arch manifest supporting both amd64 and arm64
docker pull ghcr.io/openhands/agent-server:d2506a1-python

Run
All tags pushed for this build
About Multi-Architecture Support
- Each variant tag (e.g. d2506a1-python) is a multi-arch manifest supporting both amd64 and arm64
- Architecture-specific tags (e.g. d2506a1-python-amd64) are also available if needed
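The tag naming implied by the two examples above can be sketched in a few lines. This is an illustration only: the `<short-sha>-<variant>` and `<short-sha>-<variant>-<arch>` patterns are inferred from d2506a1-python and d2506a1-python-amd64, the arm64 form is assumed by analogy, and the helper names are hypothetical rather than part of any build script.

```python
REPO = "ghcr.io/openhands/agent-server"  # repository from the pull command above
ARCHES = ("amd64", "arm64")              # architectures named in this section


def variant_tag(short_sha: str, variant: str) -> str:
    """Multi-arch manifest tag, e.g. 'd2506a1-python' (pattern inferred)."""
    return f"{short_sha}-{variant}"


def arch_tag(short_sha: str, variant: str, arch: str) -> str:
    """Architecture-specific tag, e.g. 'd2506a1-python-amd64' (pattern inferred)."""
    if arch not in ARCHES:
        raise ValueError(f"unsupported architecture: {arch}")
    return f"{variant_tag(short_sha, variant)}-{arch}"


def image_ref(tag: str, repo: str = REPO) -> str:
    """Full image reference as it would be passed to 'docker pull'."""
    return f"{repo}:{tag}"


if __name__ == "__main__":
    print(image_ref(variant_tag("d2506a1", "python")))
    print(image_ref(arch_tag("d2506a1", "python", "amd64")))
```

Pulling the manifest tag is normally enough, since the container runtime selects the matching architecture; the architecture-specific form is useful mainly when a particular platform must be pinned.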