
[Feature]: Adapter should persist, not be unloaded immediately after inference #8

@yx-lamini


🚀 The feature, motivation and pitch

Use LLaMA-Factory to tune an adapter:

kubectl port-forward svc/lamini-operator 8000:8000
curl -X POST "http://localhost:8000/v1alpha/tune" \
  -H "Content-Type: application/json" \
  -d '{
    "job_name": "test-job-21",
    "base_model": "meta-llama/Llama-3.2-3B",
    "dataset": "test.jsonl",
    "hyperparameters": {
      "max_steps": "1"
    }
  }'

Wait for the above job to finish, then send an inference request:

kubectl port-forward svc/inference-router 8001:8000
curl http://localhost:8001/v1/completions \
    -H "Content-Type: application/json" --header 'Authorization: Bearer sk-1234' \
    -d '{
        "model": "meta-llama/Llama-3.2-3B-Instruct",
        "prompt": "San Francisco is a good test adf",
        "lora_request": {
            "lora_name": "test_adapter",
            "lora_path": "/app/lamini/jobs/test-job-21"
        }
    }'

Notice that the adapter is unloaded immediately after the inference finishes.

[Screenshot from the original issue showing the adapter being unloaded right after the request completes.]
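
For reference, vLLM's OpenAI-compatible server exposes dynamic LoRA endpoints (when started with VLLM_ALLOW_RUNTIME_LORA_UPDATING=True) that keep an adapter registered by name until it is explicitly unloaded. A minimal sketch, assuming direct access to the vLLM server on port 8000 and reusing the adapter name and path from the requests above:

# Must be set in the server's environment before it starts.
export VLLM_ALLOW_RUNTIME_LORA_UPDATING=True

# Register the adapter once; it stays registered until explicitly unloaded.
curl -X POST "http://localhost:8000/v1/load_lora_adapter" \
  -H "Content-Type: application/json" \
  -d '{
    "lora_name": "test_adapter",
    "lora_path": "/app/lamini/jobs/test-job-21"
  }'

# Later requests can then target the adapter by model name.
curl http://localhost:8000/v1/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "test_adapter",
    "prompt": "San Francisco is a good test adf"
  }'

# Unload only when the adapter is no longer needed.
curl -X POST "http://localhost:8000/v1/unload_lora_adapter" \
  -H "Content-Type: application/json" \
  -d '{"lora_name": "test_adapter"}'

GPU residency is still governed by the LoRA cache settings (--max-loras / --max-cpu-loras); this sketch only keeps the adapter registered with the server so requests can address it by name instead of re-supplying the path each time.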

Alternatives

No response

Additional context

No response

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.
