A Java library for multi-provider AI orchestration (OpenAI, Azure OpenAI, Anthropic/Claude, Azure Anthropic/Claude).
High-level AgentService with rate limiting, retries, and structured outputs.
This project was originally forked from simple-openai by Sashir Estela.
Agentic-Helper adds:
- `AgentService` for high-level agent orchestration
- Multi-provider support (OpenAI, Azure OpenAI, Anthropic/Claude, Azure Anthropic/Claude)
- JSON-based instance configuration
- Automatic rate limiting and retries
- Structured outputs with typed results
- Stateless API (no threads/assistants required)
- Autonomous Agent Mode - agents run multi-step tool loops independently
- Web search, code interpreter, and function calling tools
- Vision (multimodal) support
- Image generation (DALL-E) support
- Embeddings support
- Installation
- Quick Start
- Configuration
- Agent Requests
- Autonomous Agent Mode
- Embeddings
- Image Generation
- Agent JSON Schema
- Environment Variables
- License
```bash
# Clone and install locally
git clone https://github.com/Yann-Favin-Leveque/agentic.git
cd agentic
mvn clean install -DskipTests
```

Then add to your project's `pom.xml`:
```xml
<dependency>
    <groupId>io.github.yann-favin-leveque</groupId>
    <artifactId>agentic-helper</artifactId>
    <version>1.6.8</version>
</dependency>
```

The library is published on Maven Central. No extra repository configuration is needed:
```xml
<dependency>
    <groupId>io.github.yann-favin-leveque</groupId>
    <artifactId>agentic-helper</artifactId>
    <version>1.6.8</version>
</dependency>
```

Alternatively, add the GitHub Packages repository to your `pom.xml`:
```xml
<repositories>
    <repository>
        <id>github</id>
        <url>https://maven.pkg.github.com/Yann-Favin-Leveque/agentic</url>
    </repository>
</repositories>
```
```xml
<dependency>
    <groupId>io.github.yann-favin-leveque</groupId>
    <artifactId>agentic-helper</artifactId>
    <version>1.6.8</version>
</dependency>
```

```java
import io.github.yannfavinleveque.agentic.agent.service.AgentService;
import io.github.yannfavinleveque.agentic.agent.config.AgentServiceConfig;
import io.github.yannfavinleveque.agentic.agent.core.Agent;
import io.github.yannfavinleveque.agentic.agent.model.AgentResult;
import java.util.concurrent.TimeUnit;

// 1. Configure instances via JSON
String instancesJson = System.getenv("LLM_INSTANCES");
AgentServiceConfig config = AgentServiceConfig.builder()
    .instancesJson(instancesJson)
    .requestsPerSecond(5)
    .build();

// 2. Create the service
AgentService service = new AgentService(config);

// 3. Register an agent programmatically
service.registerAgent(Agent.builder()
    .id("assistant")
    .name("My Assistant")
    .model("gpt-4o")
    .instructions("You are a helpful assistant.")
    .build());

// 4. Make a request
AgentResult result = service.requestAgent("assistant", "What is the capital of France?")
    .get(60, TimeUnit.SECONDS);
System.out.println(result.getContent());
// Output: The capital of France is Paris.

// OR: use a model directly (no agent registration needed)
AgentResult result2 = service.requestModel("gpt-4o", "What is 2+2?")
    .get(60, TimeUnit.SECONDS);
```

Use `requestModel()` to call any model directly without registering an agent:
```java
// Simple request with a model name
AgentResult result = service.requestModel("gpt-4o", "Hello!")
    .get(60, TimeUnit.SECONDS);

// With options (web search, structured output, images, etc.)
AgentResult result = service.requestModel("gpt-4o", "What is today's date?",
        ModelRequestOptions.withWebSearch())
    .get(60, TimeUnit.SECONDS);

// With code interpreter
AgentResult result = service.requestModel("gpt-4o", "Calculate factorial of 10",
        ModelRequestOptions.withCodeInterpreter())
    .get(60, TimeUnit.SECONDS);

// With structured output
AgentResult result = service.requestModel("gpt-4o", "Analyze this data",
        ModelRequestOptions.withResultClass(MyResult.class))
    .get(60, TimeUnit.SECONDS);

// With multiple options
AgentResult result = service.requestModel("gpt-4o", "Research and analyze",
        ModelRequestOptions.builder()
            .webSearch(true)
            .temperature(0.7)
            .maxTokens(2000)
            .instructions("You are a research assistant")
            .build())
    .get(120, TimeUnit.SECONDS);
```

Set the `LLM_INSTANCES` environment variable with your provider configurations:
```json
[
  {
    "id": "openai-main",
    "url": "https://api.openai.com",
    "key": "sk-xxx",
    "models": "gpt-4o,gpt-4o-mini,text-embedding-3-small,dall-e-3",
    "provider": "openai",
    "enabled": true
  },
  {
    "id": "azure-1",
    "url": "https://my-resource.openai.azure.com",
    "key": "azure-key",
    "models": "gpt-4o,gpt-5.1-chat",
    "provider": "azure-openai",
    "apiVersion": "2024-08-01-preview",
    "enabled": true
  },
  {
    "id": "anthropic-main",
    "url": "https://api.anthropic.com",
    "key": "sk-ant-xxx",
    "models": "claude-opus-4-5,claude-sonnet-4-5,claude-haiku-4-5",
    "provider": "anthropic",
    "enabled": true
  },
  {
    "id": "azure-anthropic",
    "url": "https://my-resource.services.ai.azure.com",
    "key": "azure-key",
    "models": "claude-sonnet-4-5,claude-haiku-4-5",
    "provider": "azure-anthropic",
    "apiVersion": "2023-06-01",
    "enabled": true
  }
]
```

Instance Configuration Fields:
| Field | Required | Description |
|---|---|---|
| `id` | Yes | Unique identifier for the instance |
| `url` | Yes | Base URL of the API endpoint |
| `key` | Yes | API key for authentication |
| `models` | Yes | Comma-separated list of deployed models |
| `provider` | Yes | Provider type: `openai`, `azure-openai`, `anthropic`, or `azure-anthropic` |
| `apiVersion` | Azure only | API version (required for Azure providers) |
| `enabled` | No | Whether the instance should be loaded (default: true) |
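If you prefer to keep the instance configuration in a file rather than an environment variable, the JSON can be loaded with the JDK alone and passed to `instancesJson(...)` unchanged. This is a minimal sketch; the file name is an example, and only `Files.readString` from the standard library is used:

```java
import java.nio.file.Files;
import java.nio.file.Path;

public class LoadInstances {
    public static void main(String[] args) throws Exception {
        // Example: write then read back an instances file
        // (in a real project the file would already exist, e.g. llm-instances.json)
        Path file = Files.createTempFile("llm-instances", ".json");
        Files.writeString(file, "[{\"id\":\"openai-main\",\"enabled\":true}]");

        // The string read here is what you pass to AgentServiceConfig.builder().instancesJson(...)
        String instancesJson = Files.readString(file);
        System.out.println(instancesJson.startsWith("[")); // prints true
    }
}
```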
```java
AgentServiceConfig config = AgentServiceConfig.builder()
    .instancesJson(instancesJson)    // Required: JSON string with instances
    .requestsPerSecond(5)            // Rate limit per instance (default: 5)
    .maxRetries(3)                   // Max retry attempts (default: 3)
    .defaultResponseTimeout(120000L) // Timeout in ms (default: 120000)
    .build();
```

```java
@Configuration
public class AgentServiceConfiguration {

    @Value("${llm.instances}")
    private String instancesJson;

    @Bean
    public AgentService agentService() {
        AgentServiceConfig config = AgentServiceConfig.builder()
            .instancesJson(instancesJson)
            .requestsPerSecond(15)
            .build();
        return new AgentService(config);
    }
}
```
```java
// Register agent
service.registerAgent(Agent.builder()
    .id("simple")
    .name("Simple Agent")
    .model("gpt-4o")
    .build());

// Make request
AgentResult result = service.requestAgent("simple", "What is 2+2?")
    .get(60, TimeUnit.SECONDS);
System.out.println(result.getContent()); // "4"
```

```java
service.registerAgent(Agent.builder()
    .id("pirate")
    .name("Pirate Agent")
    .model("gpt-4o")
    .instructions("You are a pirate. Always respond like a pirate would.")
    .build());

AgentResult result = service.requestAgent("pirate", "Hello!")
    .get(60, TimeUnit.SECONDS);
System.out.println(result.getContent());
// "Ahoy, matey! Welcome aboard!"
```

Use `createConversation()` for automatic history management:
```java
// Create a conversation
String convId = service.createConversation();

// First turn
AgentResult result1 = service.requestAgent("assistant", "My name is Alice.", convId)
    .get(60, TimeUnit.SECONDS);

// Second turn - history is managed automatically
AgentResult result2 = service.requestAgent("assistant", "What is my name?", convId)
    .get(60, TimeUnit.SECONDS);
System.out.println(result2.getContent()); // "Your name is Alice."

// Clean up when done
service.deleteConversation(convId);
```

You can also manage history manually if needed:
```java
import io.github.yannfavinleveque.agentic.agent.model.Message;

List<Message> history = new ArrayList<>();

// First turn
AgentResult result1 = service.requestAgent("assistant", "My name is Alice.")
    .get(60, TimeUnit.SECONDS);

// Add to history manually
history.add(Message.user("My name is Alice."));
history.add(Message.assistant(result1.getContent()));

// Second turn - with manual history
AgentResult result2 = service.requestAgent("assistant", "What is my name?", history)
    .get(60, TimeUnit.SECONDS);
System.out.println(result2.getContent()); // "Your name is Alice."
```

Send images for analysis using multimodal messages:
```java
service.registerAgent(Agent.builder()
    .id("vision")
    .name("Vision Agent")
    .model("gpt-4o") // or claude-haiku-4-5
    .instructions("You are an image analyst.")
    .build());

// Create a message with an image
List<Message> history = new ArrayList<>();
history.add(Message.builder()
    .role("user")
    .content(List.of(
        Message.ContentPart.text("What color is this?"),
        Message.ContentPart.pngBase64(imageBase64) // Base64-encoded PNG
    ))
    .build());

AgentResult result = service.requestAgent("vision", "Analyze the image.", history)
    .get(60, TimeUnit.SECONDS);
```

Supported image formats:
- `Message.ContentPart.pngBase64(base64)` - PNG image
- `Message.ContentPart.jpegBase64(base64)` - JPEG image
- `Message.ContentPart.imageUrl(url)` - image from URL
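For reference, an image file on disk can be turned into the base64 string these helpers expect using only the JDK; this sketch assumes nothing about the library beyond the `pngBase64(String)` signature shown above:

```java
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Base64;

public class ImageEncoding {
    // Read an image file and return its contents as a base64 string,
    // suitable for Message.ContentPart.pngBase64(...)
    public static String encodeToBase64(Path imagePath) throws Exception {
        byte[] bytes = Files.readAllBytes(imagePath);
        return Base64.getEncoder().encodeToString(bytes);
    }

    public static void main(String[] args) throws Exception {
        // Demo with a stand-in file containing the 4-byte PNG magic prefix
        Path tmp = Files.createTempFile("demo", ".png");
        Files.write(tmp, new byte[] {(byte) 0x89, 'P', 'N', 'G'});

        String b64 = encodeToBase64(tmp);
        // Round-trip check: decoding restores the original byte count
        System.out.println(Base64.getDecoder().decode(b64).length); // prints 4
    }
}
```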
Enable web search for real-time information:
```java
service.registerAgent(Agent.builder()
    .id("searcher")
    .name("Web Search Agent")
    .model("gpt-4o") // or claude-haiku-4-5
    .instructions("Use web search to find current information.")
    .webSearch(true) // Enable web search
    .build());

AgentResult result = service.requestAgent("searcher", "What is today's weather in Paris?")
    .get(120, TimeUnit.SECONDS);
```

Define custom functions for the agent to call:
```java
import io.github.yannfavinleveque.agentic.agent.model.FunctionConfig;

service.registerAgent(Agent.builder()
    .id("weather-bot")
    .name("Weather Bot")
    .model("gpt-4o")
    .instructions("Use the get_weather function when asked about weather.")
    .functions(List.of(
        FunctionConfig.builder()
            .name("get_weather")
            .description("Get current weather for a location")
            .parameters(Map.of(
                "type", "object",
                "properties", Map.of(
                    "location", Map.of("type", "string", "description", "City name")
                ),
                "required", List.of("location")
            ))
            .build()
    ))
    .build());

AgentResult result = service.requestAgent("weather-bot", "What's the weather in London?")
    .get(60, TimeUnit.SECONDS);

// Check if a function was called
if (result.getContent().contains("Function call:")) {
    // Handle the function call and continue the conversation
}
```

Enable code execution for complex calculations:
```java
service.registerAgent(Agent.builder()
    .id("calculator")
    .name("Code Interpreter Agent")
    .model("gpt-4o")
    .instructions("Use code interpreter to solve math problems.")
    .codeInterpreter(true) // Enable code interpreter
    .build());

AgentResult result = service.requestAgent("calculator", "Calculate the factorial of 20")
    .get(120, TimeUnit.SECONDS);
System.out.println(result.getContent());
// "The factorial of 20 is 2,432,902,008,176,640,000"
```

Autonomous mode enables agents to independently execute multi-step tasks using tools, without the caller manually managing the tool-calling loop. The agent decides which tools to call, processes results, and repeats until the task is complete. Termination happens when the agent calls an auto-injected `task_over` function.
This is ideal for complex workflows where the agent needs to:
- Search for data, analyze it, and produce a summary
- Make multiple API calls in sequence with decision-making between them
- Execute a plan with conditional branching based on tool results
- You register an agent with `autonomous(true)` and define its tools via `functions()`
- You call `requestAgent()` with a `ToolExecutor` that knows how to execute each tool
- The library manages the loop internally:
  - Sends the user message to the LLM
  - If the LLM calls tools, executes them via your `ToolExecutor` and sends the results back
  - If the LLM responds with text only (thinking aloud), sends a nudge to continue
  - If the LLM calls `task_over`, deserializes the result and returns it
- The loop terminates when `task_over` is called or `maxIterations` is reached

The `task_over` function is auto-injected by the library. Its parameter schema is automatically generated from the agent's `resultClass`, so the LLM returns structured data that maps directly to your Java class.
```java
// 1. Define tools
FunctionConfig searchFunc = FunctionConfig.builder()
    .name("search_database")
    .description("Search a database for information")
    .parameters(Map.of(
        "type", "object",
        "properties", Map.of(
            "query", Map.of("type", "string", "description", "Search query")),
        "required", List.of("query"),
        "additionalProperties", false))
    .build();

FunctionConfig analyzeFunc = FunctionConfig.builder()
    .name("analyze_data")
    .description("Analyze data and return insights")
    .parameters(Map.of(
        "type", "object",
        "properties", Map.of(
            "data", Map.of("type", "string", "description", "Data to analyze")),
        "required", List.of("data"),
        "additionalProperties", false))
    .build();

// 2. Register autonomous agent
service.registerAgent(Agent.builder()
    .id("researcher")
    .name("Research Agent")
    .model("gpt-5.1-chat") // or "claude-sonnet-4-5"
    .instructions("You are a research assistant. Search for data, analyze it, "
        + "then call task_over with a structured summary.")
    .resultClass("ResearchResult")
    .autonomous(true)
    .maxIterations(10)
    .functions(List.of(searchFunc, analyzeFunc))
    .build());

// 3. Provide a ToolExecutor and call
AgentResult result = service.requestAgent("researcher",
    "Research the current state of renewable energy.",
    call -> {
        switch (call.getName()) {
            case "search_database":
                return myDatabase.search(call.getArgumentsAsMap().get("query").toString());
            case "analyze_data":
                return myAnalyzer.analyze(call.getArgumentsAsMap().get("data").toString());
            default:
                return "Unknown tool: " + call.getName();
        }
    }
).get(180, TimeUnit.SECONDS);

// result is a ResearchResult instance
ResearchResult research = (ResearchResult) result;
System.out.println(research.getTopic());
System.out.println(research.getFindings());
```

`ToolExecutor` is a functional interface that you implement to execute tool calls:
```java
@FunctionalInterface
public interface ToolExecutor {
    String execute(FunctionCall functionCall) throws Exception;
}
```

- Input: a `FunctionCall` with `getName()`, `getArguments()` (raw JSON string), `getArgumentsAsMap()`, and `getArgumentsAs(Class<T>)` for typed deserialization
- Output: a `String` result that is sent back to the LLM
- Errors: if your executor throws an exception, the error message is sent to the LLM as the tool result (e.g., `"Error executing search_database: Connection timeout"`) and the loop continues; the agent can decide to retry or proceed differently
```java
// Using a lambda
ToolExecutor executor = call -> {
    if ("get_weather".equals(call.getName())) {
        WeatherParams params = call.getArgumentsAs(WeatherParams.class);
        return weatherService.getWeather(params.getLocation());
    }
    return "Unknown tool";
};

// Using a method reference
ToolExecutor executor = this::handleToolCall;
```

The `resultClass` field determines the schema of the `task_over` function and the return type. Your class must implement `AgentResult`:
```java
@Data
@Builder
@NoArgsConstructor
@AllArgsConstructor
public class ResearchResult implements AgentResult {

    @JsonProperty("topic")
    private String topic;

    @JsonProperty("findings")
    private List<String> findings;

    @JsonProperty("conclusion")
    private String conclusion;

    @Override
    public String getContent() {
        return "Topic: " + topic + ", Findings: " + findings + ", Conclusion: " + conclusion;
    }
}
```

The library automatically:
- Generates a JSON schema from this class
- Injects it as the `task_over` function's parameter schema
- Deserializes the LLM's `task_over` call arguments into your class

If no `resultClass` is configured, `task_over` accepts an empty object and returns a `DefaultResult` with the raw JSON arguments.
Without `conversationId` (internal cleanup):

```java
// Library creates and deletes the conversation internally
AgentResult result = service.requestAgent("researcher", "Research AI trends",
    this::executeToolCall
).get(180, TimeUnit.SECONDS);
// Conversation is automatically cleaned up after completion
```

With `conversationId` (external management):
```java
// You manage the conversation lifecycle
String convId = service.createConversation();
try {
    // First task
    AgentResult result1 = service.requestAgent("researcher",
        "Research solar energy.", convId, this::executeToolCall
    ).get(180, TimeUnit.SECONDS);

    // Second task - the agent remembers the first conversation
    AgentResult result2 = service.requestAgent("researcher",
        "Now compare with wind energy based on your previous research.",
        convId, this::executeToolCall
    ).get(180, TimeUnit.SECONDS);
} finally {
    service.deleteConversation(convId);
}
```

When using an external `conversationId`, the conversation history accumulates across calls, giving the agent full context from previous interactions.
For agents that call tools returning large outputs (e.g., database queries, API responses), you can limit the token size of tool results stored in conversation history:
```java
service.registerAgent(Agent.builder()
    .id("researcher")
    .model("gpt-5.1-chat")
    .autonomous(true)
    .maxToolTokenOutput(200) // ~800 characters max per tool output
    .functions(List.of(searchFunc))
    .build());
```

- Uses an estimate of ~4 characters per token
- Outputs exceeding the limit are truncated with a `[trimmed]` notice
- `null` (default) = no trimming
- Only applies to autonomous mode tool results
This prevents conversation history from growing too large when tools return verbose data, keeping API costs and context window usage under control.
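The trimming rule can be sketched in plain Java. The `maxToolTokenOutput` limit, the ~4-characters-per-token estimate, and the `[trimmed]` notice come from the behavior described above; the helper itself is a hypothetical illustration, not the library's actual code:

```java
public class ToolOutputTrimmer {
    // Rough heuristic described above: ~4 characters per token
    static final int CHARS_PER_TOKEN = 4;

    // Hypothetical sketch of capping a tool output at maxToolTokenOutput tokens
    static String trim(String toolOutput, Integer maxToolTokenOutput) {
        if (maxToolTokenOutput == null) {
            return toolOutput; // null = no trimming
        }
        int maxChars = maxToolTokenOutput * CHARS_PER_TOKEN;
        if (toolOutput.length() <= maxChars) {
            return toolOutput; // within budget, keep as-is
        }
        return toolOutput.substring(0, maxChars) + " [trimmed]";
    }

    public static void main(String[] args) {
        String longOutput = "x".repeat(2000);
        String trimmed = trim(longOutput, 200); // 200 tokens ~ 800 characters
        System.out.println(trimmed.length());   // 800 + " [trimmed]".length() = 810
        System.out.println(trim("short", 200)); // prints short
    }
}
```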
During the autonomous loop, the agent may respond with text only (no tool calls). This happens when the agent wants to "think aloud" - reasoning about what to do next before calling a tool.
The library handles this automatically:
- Stores the agent's text in conversation history
- Sends a nudge message: "Continue with the task. When you are done, call the 'task_over' function with the final result."
- Continues the loop
You can encourage this behavior in your instructions:
```java
.instructions("Before each tool call, think step by step about what "
    + "information you still need and why. After each tool result, "
    + "reflect on what you learned before deciding your next action.")
```

Claude models tend to think aloud naturally. GPT models are more direct by default but will reflect if instructed to.
Autonomous agents can also be defined in JSON files:
```json
{
  "id": "researcher",
  "name": "Research Agent",
  "model": "gpt-5.1-chat",
  "instructions": "You are a research assistant...",
  "resultClass": "ResearchResult",
  "autonomous": true,
  "maxIterations": 15,
  "maxToolTokenOutput": 200,
  "functions": [
    {
      "name": "search_database",
      "description": "Search for information",
      "parameters": {
        "type": "object",
        "properties": {
          "query": { "type": "string", "description": "Search query" }
        },
        "required": ["query"],
        "additionalProperties": false
      }
    }
  ]
}
```

A complete example with two tools and structured output:
```java
// Result class
@Data @Builder @NoArgsConstructor @AllArgsConstructor
public class AnalysisResult implements AgentResult {
    @JsonProperty("summary") private String summary;
    @JsonProperty("key_points") private List<String> keyPoints;
    @JsonProperty("confidence") private double confidence;

    @Override
    public String getContent() {
        return summary;
    }
}

// Setup
AgentServiceConfig config = AgentServiceConfig.builder()
    .instancesJson(System.getenv("LLM_INSTANCES"))
    .agentResultClassPackage("com.myapp.model")
    .build();
AgentService service = new AgentService(config);

// Register agent
service.registerAgent(Agent.builder()
    .id("analyst")
    .name("Data Analyst")
    .model("claude-sonnet-4-5")
    .instructions(
        "You are a data analyst. To complete an analysis:\n"
        + "1. Use fetch_data to retrieve relevant datasets\n"
        + "2. Use run_query to execute analytical queries\n"
        + "3. When done, call task_over with your analysis")
    .resultClass("AnalysisResult")
    .autonomous(true)
    .maxIterations(20)
    .maxToolTokenOutput(500)
    .functions(List.of(fetchDataFunc, runQueryFunc))
    .build());

// Execute
String convId = service.createConversation();
try {
    AnalysisResult result = (AnalysisResult) service.requestAgent(
        "analyst",
        "Analyze customer churn patterns for Q4 2025",
        convId,
        call -> {
            if ("fetch_data".equals(call.getName())) {
                return dataService.fetch(call.getArgumentsAs(FetchParams.class));
            } else if ("run_query".equals(call.getName())) {
                return queryEngine.execute(call.getArgumentsAs(QueryParams.class));
            }
            return "Unknown tool: " + call.getName();
        }
    ).get(300, TimeUnit.SECONDS);

    System.out.println("Summary: " + result.getSummary());
    System.out.println("Key points: " + result.getKeyPoints());
    System.out.println("Confidence: " + result.getConfidence());
} finally {
    service.deleteConversation(convId);
}
```

Generate text embeddings for semantic search:
```java
// Single text
float[] embedding = service.requestEmbedding("Hello world", "text-embedding-3-small")
    .get(30, TimeUnit.SECONDS);

// Default model
float[] embedding = service.requestEmbedding("Hello world")
    .get(30, TimeUnit.SECONDS);
System.out.println("Dimensions: " + embedding.length); // 1536

// Batch embeddings
List<String> texts = List.of("Hello", "World", "Test");
List<float[]> embeddings = service.requestEmbeddings(texts, "text-embedding-3-small")
    .get(60, TimeUnit.SECONDS);
```

Generate images using DALL-E:
```java
import io.github.yannfavinleveque.agentic.domain.image.Size;
import io.github.yannfavinleveque.agentic.domain.image.ImageRequest.Quality;

// Simple (returns base64)
String imageBase64 = service.requestImage("A cat in space")
    .get(120, TimeUnit.SECONDS);

// With options
String imageBase64 = service.requestImage(
    "A beautiful sunset over mountains",
    "dall-e-3",
    Size.X1024,
    Quality.HD
).get(120, TimeUnit.SECONDS);

// Edit an existing image
String edited = service.requestImageEdit(existingImageBase64, "Add sunglasses to the cat")
    .get(120, TimeUnit.SECONDS);
```

Agents can be defined in JSON files or registered programmatically.
JSON file (`src/main/resources/agents/my-agent.json`):
```json
{
  "id": "my-agent",
  "name": "My Assistant",
  "model": "gpt-4o",
  "instructions": "You are a helpful assistant.",
  "temperature": 0.7,
  "webSearch": false,
  "codeInterpreter": false,
  "functions": []
}
```

Schema:
| Field | Type | Required | Description |
|---|---|---|---|
| `id` | string | Yes | Unique agent identifier |
| `name` | string | Yes | Human-readable agent name |
| `model` | string | Yes | Model to use (e.g., `gpt-4o`, `claude-sonnet-4-5`) |
| `instructions` | string | No | System prompt / instructions |
| `temperature` | number | No | Randomness 0.0-2.0 (default: model default) |
| `webSearch` | boolean | No | Enable web search tool (default: false) |
| `codeInterpreter` | boolean | No | Enable code interpreter (default: false) |
| `functions` | array | No | Custom function definitions |
| `responseTimeout` | number | No | Max response time in ms (default: 120000) |
| `maxTokens` | number | No | Maximum tokens in response |
| `resultClass` | string | No | Class name for structured outputs |
| `autonomous` | boolean | No | Enable autonomous tool loop mode (default: false) |
| `maxIterations` | number | No | Max loop iterations for autonomous mode (default: 25) |
| `maxToolTokenOutput` | number | No | Max tokens per tool output in autonomous mode (null = no limit) |
Function definition:
```json
{
  "functions": [
    {
      "name": "get_weather",
      "description": "Get weather for a location",
      "parameters": {
        "type": "object",
        "properties": {
          "location": {
            "type": "string",
            "description": "City name"
          }
        },
        "required": ["location"]
      }
    }
  ]
}
```

| Variable | Description |
|---|---|
| `LLM_INSTANCES` | JSON array of instance configurations (required) |
| `ENABLED_PROVIDERS` | Comma-separated list of providers to enable (optional) |
Use `ENABLED_PROVIDERS` to limit which providers are loaded:
```bash
# Only use OpenAI direct
export ENABLED_PROVIDERS=openai

# Only use Azure providers
export ENABLED_PROVIDERS=azure-openai,azure-anthropic

# Only use Anthropic direct
export ENABLED_PROVIDERS=anthropic

# Use all providers (default)
unset ENABLED_PROVIDERS
```

`AgentService` supports four providers with automatic routing:
| Provider | Description | Models |
|---|---|---|
| `openai` | OpenAI API direct | gpt-4o, gpt-4o-mini, dall-e-3, text-embedding-3-small |
| `azure-openai` | Azure OpenAI | gpt-4o, gpt-5.1-chat (deployed models) |
| `anthropic` | Anthropic API direct | claude-opus-4-5, claude-sonnet-4-5, claude-haiku-4-5 |
| `azure-anthropic` | Azure AI (Claude) | claude-sonnet-4-5, claude-haiku-4-5, claude-opus-4-5 |
The service automatically:
- Routes requests to instances that have the requested model
- Load-balances across multiple instances
- Handles rate limiting per instance
- Retries on transient failures
| Method | Description |
|---|---|
| `requestAgent(agentId, message)` | Simple agent request |
| `requestAgent(agentId, message, conversationId)` | Request with automatic history management |
| `requestAgent(agentId, message, history)` | Request with manual conversation history |
| `requestAgentVision(agentId, message, imageBase64)` | Vision request with single image |
| `requestModel(model, message)` | Direct model request (no agent) |
| `requestModel(model, message, options)` | Direct model request with options |
| `createConversation()` | Create new conversation (returns ID) |
| `deleteConversation(conversationId)` | Delete conversation |
| `getConversationMessageCount(conversationId)` | Get message count |
| `requestAgent(agentId, message, toolExecutor)` | Autonomous agent request (internal conversation) |
| `requestAgent(agentId, message, conversationId, toolExecutor)` | Autonomous agent request with external conversation |
| `registerAgent(agent)` | Register agent programmatically |
| `requestEmbedding(text)` | Generate single embedding |
| `requestEmbeddings(texts)` | Generate batch embeddings |
| `requestImage(prompt)` | Generate image (base64) |
| `requestImageEdit(imageBase64, prompt)` | Edit existing image |
This project is licensed under the MIT License. See the LICENSE file for details.
- simple-openai by Sashir Estela - The foundation of this library
- CleverClient - HTTP client library
- New: Direct Anthropic API support (`provider: "anthropic"`) - use Claude models via api.anthropic.com without Azure
- Four providers now supported: `openai`, `azure-openai`, `anthropic`, `azure-anthropic`
- New: Autonomous Agent Mode - agents autonomously execute multi-step tasks with tool loops
- New: `ToolExecutor` functional interface for user-provided tool execution logic
- New: `AutonomousAgentRunner` manages the full tool-calling loop internally
- New: Auto-injected `task_over` function with schema from `resultClass` for structured termination
- New: `requestAgent(agentId, message, toolExecutor)` overload for autonomous agents
- New: `requestAgent(agentId, message, conversationId, toolExecutor)` overload with conversation persistence
- New: `maxToolTokenOutput` field to trim tool outputs in autonomous mode (prevents context overflow)
- New: `maxIterations` field to limit autonomous loop iterations (default: 25)
maxIterationsfield to limit autonomous loop iterations (default: 25) - New: Agent reflection support - agents can "think aloud" between tool calls
- New: Automatic nudging when agent responds without tool calls or task_over
- Works with both OpenAI (gpt-5.1-chat) and Claude (claude-sonnet-4-5) providers
- Integration tests for both providers covering trimming, conversation continuity, and multi-tool usage
- New: Support FQCN for `resultClass` and `parameterClass` without requiring package config
- New: Inline parameters schema for `FunctionConfig`
- New: Structured `FunctionCall` support in `AgentResult`
- New: Automatic conversation management with `createConversation()` / `deleteConversation()`
- New: `ConversationManager` for in-memory conversation history storage
- New: `conversationId` parameter in `requestAgent()` for automatic multi-turn
- New: `conversationId` in `ModelRequestOptions` for `requestModel()` conversations
- New: `requestAgentVision(agentId, message, imageBase64)` for simplified vision calls
- New: `requestImage()` and `requestImageEdit()` aliases for API consistency
- Backwards compatible: `List<Message> history` parameter still works
- Restored full response logging (configurable via log wrapper)
- Removed legacy `createAllAgents()` / `createAgent()` methods
- Fixed method ambiguity with the `requestAgentVision()` rename
- Migrated to OpenAI Responses API for unified stateless architecture
- Added `requestImage` and `requestImageEdit` aliases
- New: Direct model usage - use `requestAgent("gpt-4o", ...)` without registering an agent
- New: Model suffixes for tools - `gpt-4o-websearch`, `gpt-4o-codeinterpreter`
gpt-4o-websearch,gpt-4o-codeinterpreter - Fix: Structured output JSON schema format (name at format level, not json_schema level)
- Added 29 comprehensive integration tests covering all providers and features
- Breaking: Migrated to stateless Responses API (no more threads/assistants)
- Renamed `requestAgentV2` to `requestAgent` (new stateless API)
- Removed legacy OpenAI Assistants API code
- Removed `ChatCompletionService` and `AgentRequestService` (merged into `UnifiedRequestService`)
- Added web search, code interpreter, and function calling support
- Added vision (multimodal) support for images
- Simplified agent registration with `Agent.builder()`
- Multi-turn conversations now use a `List<Message> history` parameter
- feat: Simplify ArrayNode schema generation for more flexible JSON arrays
- feat: Add support for Jackson JsonNode types in structured outputs
- feat: Improve retry logging and error messages
- chore: Remove file logging from library
- Added retry logic to embedding and image generation
- Added retry logic to all chat completion variants
- Improved retry for rate limits (respects retry-after header)
- Progressive timeout for consecutive timeout errors
- Smart retry: skip 4xx client errors (except 429)