Citation Intent Open LLMs


Supplementary material for paper "Can LLMs Predict Citation Intent? An Experimental Analysis of In-context Learning and Fine-tuning on Open LLMs".

Experimental evaluation

Current top results for each model

SciCite

  Rank  Model                 F1-Score
     1  Qwen 2.5 - 14B           78.33
     2  Gemma 2 - 27B            77.86
     3  Mistral Nemo - 12B       77.39
     4  Gemma 2 - 9B             75.12
     5  Phi 3 Medium - 14B       74.67
     6  LLaMA 3 - 8B             74.39
     7  Qwen 2 - 7B              72.89
     8  LLaMA 3.1 - 8B           72.46
     9  Gemma 2 - 2B             68.79
    10  Phi 3.5 Mini - 3.8B      68.25
    11  LLaMA 3.2 - 3B           67.99
    12  LLaMA 3.2 - 1B           45.44

ACL-ARC

  Rank  Model                 F1-Score
     1  Qwen 2.5 - 14B           63.68
     2  Gemma 2 - 27B            58.95
     3  Gemma 2 - 9B             57.19
     4  Qwen 2 - 7B              51.26
     5  LLaMA 3.1 - 8B           48.45
     6  Mistral Nemo - 12B       48.11
     7  Phi 3.5 Mini - 3.8B      43.74
     8  Phi 3 Medium - 14B       43.46
     9  Gemma 2 - 2B             40.96
    10  LLaMA 3.2 - 3B           40.07
    11  LLaMA 3 - 8B             38.06
    12  LLaMA 3.2 - 1B           24.60
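The F1-Score columns are percentages; assuming these are macro-averaged F1 scores (the usual choice for SciCite and ACL-ARC), the metric can be reproduced with a few lines of plain Python:

```python
def macro_f1(y_true, y_pred):
    """Macro-averaged F1: the unweighted mean of per-class F1 scores."""
    labels = sorted(set(y_true) | set(y_pred))
    per_class = []
    for label in labels:
        # Count true positives, false positives, false negatives per class.
        tp = sum(t == label and p == label for t, p in zip(y_true, y_pred))
        fp = sum(t != label and p == label for t, p in zip(y_true, y_pred))
        fn = sum(t == label and p != label for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        per_class.append(2 * tp / denom if denom else 0.0)
    return sum(per_class) / len(per_class)
```

This matches sklearn.metrics.f1_score(..., average="macro"); multiply by 100 to compare against the tables above.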

Instructions

Prerequisites

The evaluation uses LM Studio for local inference (models are fetched with the lms CLI below). Support for additional inference providers is under development.

Setup and Configuration

  1. Configure Models

    The default configuration includes all models used in the paper.

    • Open experimental-configs/models.q8.json
    • Select your target models and specify their context lengths
  2. Model Installation - Choose one of these methods to download the required models:

    • Use the LM Studio UI
    • Run the command: lms get <model-name>
  3. Experiment Configuration

    In the default configuration, all parameters are selected.

    • Open experimental-configs/experimens-cfg.json
    • Select your desired evaluation parameters
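For orientation, a model entry in experimental-configs/models.q8.json pairs a model identifier with its context length. The exact schema is defined by the file in the repository; the field names below are only an illustrative guess:

```json
{
  "models": [
    { "name": "qwen2.5-14b-instruct", "context_length": 8192 },
    { "name": "gemma-2-9b-it", "context_length": 8192 }
  ]
}
```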

Running the Evaluation

  1. Navigate to the root directory
  2. Execute the evaluation script:
python citation_intent_classification_experiments.py

Fine-tuning

Prerequisites

LLaMA-Factory iterates quickly, so later versions may not be fully compatible with the current config files, although the required changes are usually minor.

The training parameters in llama-factory-configs/{dataset}/training_args.yaml are platform-independent and can be used with any supervised fine-tuning framework.

Dataset Preparation

  1. Copy Dataset Files
    • Source locations:
      datasets/aplaca_format_scicite/
      └── scicite_train_alpaca.json
      └── scicite_dev_alpaca.json
      
      datasets/alpaca_format/acl-arc/
      └── aclarc_train_alpaca.json
      └── aclarc_dev_alpaca.json
      
    • Destination: LLaMA-Factory/data/
  2. Update Dataset Information
    • Add the following to LLaMA-Factory/data/dataset_info.json:
      "scicite": {
          "file_name": "scicite_train_alpaca.json",
          "columns": {
              "prompt": "instruction",
              "query": "input",
              "response": "output",
              "system": "system"
          }
      },
      "scicite-calibration": {
          "file_name": "scicite_dev_alpaca.json",
          "columns": {
              "prompt": "instruction",
              "query": "input",
              "response": "output",
              "system": "system"
          }
      },
      "aclarc": {
          "file_name": "aclarc_train_alpaca.json",
          "columns": {
              "prompt": "instruction",
              "query": "input",
              "response": "output",
              "system": "system"
          }
      },
      "aclarc-calibration": {
          "file_name": "aclarc_dev_alpaca.json",
          "columns": {
            "prompt": "instruction",
            "query": "input",
            "response": "output",
            "system": "system"
          }
      }
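Rather than pasting the block above by hand, the four entries can be merged into dataset_info.json programmatically. A small sketch (register_datasets is our own helper name, not part of LLaMA-Factory):

```python
import json
from pathlib import Path

# Shared column mapping from the Alpaca-format files to LLaMA-Factory's fields.
COLUMNS = {"prompt": "instruction", "query": "input",
           "response": "output", "system": "system"}

NEW_ENTRIES = {
    "scicite": {"file_name": "scicite_train_alpaca.json", "columns": COLUMNS},
    "scicite-calibration": {"file_name": "scicite_dev_alpaca.json", "columns": COLUMNS},
    "aclarc": {"file_name": "aclarc_train_alpaca.json", "columns": COLUMNS},
    "aclarc-calibration": {"file_name": "aclarc_dev_alpaca.json", "columns": COLUMNS},
}

def register_datasets(info_path):
    """Merge the entries above into an existing dataset_info.json."""
    path = Path(info_path)
    info = json.loads(path.read_text())
    info.update(NEW_ENTRIES)
    path.write_text(json.dumps(info, indent=4))

# register_datasets("LLaMA-Factory/data/dataset_info.json")
```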

Configuration Setup

  1. Create a new directory: LLaMA-Factory/config/
  2. Copy all configuration files from llama-factory-configs/ to the new directory
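Equivalently, from the repository root (POSIX shell):

```shell
# Create the config directory and copy all provided configuration files into it.
mkdir -p LLaMA-Factory/config
cp -r llama-factory-configs/. LLaMA-Factory/config/
```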

Training

For this step, also consult the LLaMA-Factory documentation.

Choose one of these methods:

  1. GUI Method
    • Launch LLaMA Board interface
    • Load your configuration
    • Start training run
  2. CLI Method
    llamafactory-cli train path/to/training_args.yaml

Model Export

Export the fine-tuned model using the dev set of the selected dataset (either scicite_dev_alpaca.json or aclarc_dev_alpaca.json) as the calibration dataset.
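An export configuration along these lines can point LLaMA-Factory's quantization step at the dev set. The keys below follow LLaMA-Factory's export examples, but all paths are placeholders and the key set should be verified against the installed version:

```yaml
model_name_or_path: path/to/base-model
adapter_name_or_path: path/to/lora-checkpoint
template: default
finetuning_type: lora
export_dir: path/to/exported-model
export_quantization_bit: 8
export_quantization_dataset: data/scicite_dev_alpaca.json
```

Run it with: llamafactory-cli export path/to/export_args.yaml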

Optional: GGUF Conversion

To create GGUF model versions, install llama.cpp and run its HF-to-GGUF conversion script on the exported model directory:

python convert_hf_to_gguf.py <path-to-exported-model>

License

Released under GNU GPL v2.0.

Who do I talk to?

This repository is maintained by Paris Koloveas from Athena RC.

Citing this work

If you use any of the processes and scripts in this repository, please cite us as follows:

@inproceedings{10.1007/978-3-032-05409-8_13,
  author    = {Koloveas, Paris
               and Chatzopoulos, Serafeim
               and Vergoulis, Thanasis
               and Tryfonopoulos, Christos},
  editor    = {Balke, Wolf-Tilo
               and Golub, Koraljka
               and Manolopoulos, Yannis
               and Stefanidis, Kostas
               and Zhang, Zheying},
  title     = {Can LLMs Predict Citation Intent? An Experimental Analysis of In-Context Learning and Fine-Tuning on Open LLMs},
  booktitle = {Linking Theory and Practice of Digital Libraries},
  year      = {2026},
  publisher = {Springer Nature Switzerland},
  address   = {Cham},
  pages     = {207--224},
  isbn      = {978-3-032-05409-8}
}
