Figure: The AR-MAP Framework. Transferring alignment from AR Teachers to Diffusion Students.
AR-MAP (Autoregressive Model Alignment for Diffusion) is a novel transfer learning framework that leverages preference-aligned Autoregressive LLMs (AR-LLMs) as implicit teachers for Diffusion LLMs (DLLMs). This repository contains the complete implementation, including:
- Multi-aspect DPO training for helpfulness, truthfulness, and mathematical reasoning
- Comprehensive evaluation suite across multiple benchmarks
- Model merging utilities for LoRA adapters
- Support for multiple model architectures (Qwen, Dream, SDAR)
- Multi-Aspect Optimization: Train models on multiple preference dimensions simultaneously
  - Helpfulness alignment
  - Truthfulness enhancement
  - Mathematical reasoning improvement
- Flexible Training Pipeline:
  - DPO (Direct Preference Optimization) training
  - LoRA fine-tuning support
  - Multi-GPU distributed training
- Comprehensive Evaluation:
  - AlpacaEval for helpfulness
  - TruthfulQA for truthfulness
  - Arena-Hard for general capabilities
  - Automated GPT-4 based evaluation
- Model Support:
  - Qwen 2.5 series
  - Dream diffusion models
  - SDAR models
  - Easy extension to other architectures
conda create --name armap python=3.10
conda activate armap
pip install torch==2.6.0
pip install --no-cache-dir \
https://github.com/Dao-AILab/flash-attention/releases/download/v2.7.4.post1/\
flash_attn-2.7.4.post1+cu12torch2.6cxx11abiFALSE-cp310-cp310-linux_x86_64.whl
pip install -r requirements.txt

Use LlamaFactory for DPO training:

cd LlamaFactory-main
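For reference, a single-aspect DPO run might look like the following. This is a sketch, not the repository's exact command: the argument names follow LlamaFactory's documented DPO examples and may differ across versions, and the `dpo_helpful` dataset name is assumed to be registered in `LlamaFactory-main/data/dataset_info.json`.

```bash
llamafactory-cli train \
    --stage dpo \
    --do_train true \
    --model_name_or_path your_path/base_model \
    --dataset dpo_helpful \
    --template qwen \
    --finetuning_type lora \
    --lora_target all \
    --pref_beta 0.1 \
    --output_dir your_path/lora_adapter \
    --per_device_train_batch_size 1 \
    --gradient_accumulation_steps 8 \
    --learning_rate 5e-6 \
    --num_train_epochs 1 \
    --bf16 true
```

Swapping `--dataset` to `dpo_math` or `dpo_truthful` targets the other aspects; LlamaFactory also accepts a comma-separated dataset list for joint multi-aspect training.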
After training, merge LoRA weights back to the base model:
# For Qwen models
python merge-lora-ar.py \
--base_model your_path/base_model \
--lora_adapter your_path/lora_adapter \
--output your_path/merged_model \
--weight 1.0
# For Dream models
python merge-lora-dream.py \
--base_model your_path/Dream-base \
--lora_adapter your_path/lora_adapter \
--output your_path/merged_model \
--weight 6.0
# For SDAR models
python merge-lora-sdar.py \
--base_model your_path/SDAR-base \
--lora_adapter your_path/lora_adapter \
--output your_path/merged_model \
--weight 6.0

Evaluate your models on various benchmarks:
# Helpfulness evaluation (AlpacaEval)
cd eval-qwen
bash eval_helpful.sh
# Truthfulness evaluation
cd eval-qwen
python help_eval.py --model_name_or_path your_path/model
# Arena-Hard evaluation
cd eval-qwen
bash eval_arena.sh
AR-MAP/
├── merge-lora-ar.py        # LoRA merging for Qwen models
├── merge-lora-dream.py     # LoRA merging for Dream models
├── merge-lora-sdar.py      # LoRA merging for SDAR models
├── eval-qwen/              # Evaluation scripts for Qwen
│   ├── help_eval.py        # Helpfulness evaluation
│   ├── arena_qwen3.py      # Arena-Hard evaluation
│   └── eval_*.sh           # Evaluation bash scripts
├── eval-dream/             # Evaluation scripts for Dream
│   ├── dream-helpful.py    # Helpfulness evaluation
│   ├── dream-truthful.py   # Truthfulness evaluation
│   └── dream/              # Dream model implementation
├── eval-sdar/              # Evaluation scripts for SDAR
│   ├── help_eval_sdar.py   # Helpfulness evaluation
│   ├── sdar_truthful.py    # Truthfulness evaluation
│   ├── ifeval_eval_sdar.py # IFEval benchmark
│   └── jetengine_ext/      # Optimized inference engine
├── eval-dataset/           # Evaluation datasets
│   ├── alpaca-*.jsonl      # AlpacaEval datasets
│   ├── arena-*.jsonl       # Arena-Hard datasets
│   └── TruthfulQA.csv      # TruthfulQA dataset
├── train-dataset/          # Training datasets
│   ├── dpo_helpful.json    # Helpfulness preference data
│   ├── dpo_math.json       # Math preference data
│   └── dpo_truthful.json   # Truthfulness preference data
├── LlamaFactory-main/      # Training framework
└── requirements.txt        # Python dependencies
Update the following paths in the scripts to match your setup:
# In merge-lora-*.py
BASE_MODEL_PATH = "your_path/base_model"
LORA_PATH = "your_path/lora_adapter"
OUTPUT_PATH = "your_path/merged_model"
# In eval scripts
model_name_or_path = "your_path/model"
dataset_path = "your_path/dataset"

For GPT-4 based evaluation, configure your API endpoint:
# In evaluation scripts
endpoint = "your_api_endpoint"
api_key = "your_api_key" # Keep this secure!
deployment_name = "your_deployment"
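As a rough sketch of how these three settings are typically wired together (assuming an Azure OpenAI-style deployment and the `openai>=1.0` client; the repository's evaluation scripts may structure this differently):

```python
from openai import AzureOpenAI

# Values from the configuration above; the api_version string is an assumption.
client = AzureOpenAI(
    azure_endpoint=endpoint,
    api_key=api_key,
    api_version="2024-02-01",
)

response = client.chat.completions.create(
    model=deployment_name,  # the GPT-4 deployment acting as judge
    messages=[
        {"role": "system", "content": "You are an impartial judge comparing two answers."},
        {"role": "user", "content": "Question: ...\nAnswer A: ...\nAnswer B: ...\nWhich answer is better?"},
    ],
    temperature=0.0,
)
print(response.choices[0].message.content)
```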
Our framework evaluates models across multiple dimensions:

- Helpfulness: Measured via AlpacaEval with GPT-4 as judge
- Truthfulness: Evaluated on the TruthfulQA benchmark
- Mathematical Reasoning: Tested on the MATH and GSM8K datasets; note that we use the TraceRL framework for this evaluation
- General Capabilities: Arena-Hard benchmark
- Instruction Following: IFEval benchmark
The training datasets are organized by aspect:
- dpo_helpful.json: Preference pairs for helpfulness
- dpo_math.json: Preference pairs for mathematical reasoning
- dpo_truthful.json: Preference pairs for truthfulness
Each dataset contains pairs of (chosen, rejected) responses for DPO training.
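For illustration, a single entry might look roughly like this (a hypothetical sample; the exact field names depend on how the files are registered in LlamaFactory's `dataset_info.json`):

```json
{
  "instruction": "Explain why regular exercise tends to improve sleep quality.",
  "input": "",
  "chosen": "The preferred response: accurate, well-structured, and directly helpful ...",
  "rejected": "The dispreferred response: vaguer, less accurate, or less helpful ..."
}
```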
- Qwen series: Standard autoregressive models
- Dream: Diffusion-based language models with block attention
- SDAR: Semi-autoregressive diffusion models
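Dream and SDAR ship custom modeling code, so they are typically loaded with `trust_remote_code=True`; the snippet below is only a loading sketch (the generation API follows each model's own documentation):

```python
import torch
from transformers import AutoModel, AutoTokenizer

model_path = "your_path/Dream-base"  # placeholder path
tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)
model = AutoModel.from_pretrained(
    model_path, torch_dtype=torch.bfloat16, trust_remote_code=True
).eval()
```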
Different models require different merging coefficients:
- Qwen: Standard merging (weight=1.0)
- Dream/SDAR: Higher coefficients (weight=3.0) for better performance
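The coefficient is applied when folding the LoRA update into the base weights, i.e. W ← W + weight · ΔW. Below is a minimal sketch of such a weighted merge using peft's standard LoRA layer layout; the actual `merge-lora-*.py` scripts may implement this differently:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_path = "your_path/base_model"      # placeholder paths
adapter_path = "your_path/lora_adapter"
output_path = "your_path/merged_model"
weight = 6.0                            # merging coefficient (1.0 for Qwen, larger for Dream/SDAR)

model = AutoModelForCausalLM.from_pretrained(
    base_path, torch_dtype=torch.bfloat16, trust_remote_code=True
)
model = PeftModel.from_pretrained(model, adapter_path)

# Scaling every LoRA B matrix by `weight` scales the merged delta (alpha/r) * B @ A by the same factor.
for module in model.modules():
    if hasattr(module, "lora_B"):
        for adapter_name in module.lora_B:
            module.lora_B[adapter_name].weight.data *= weight

merged = model.merge_and_unload()       # fold the (scaled) LoRA weights into the base model
merged.save_pretrained(output_path)
AutoTokenizer.from_pretrained(base_path, trust_remote_code=True).save_pretrained(output_path)
```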
Please refer to our paper (ARMAP_ARXIV.pdf) for detailed experimental results and analysis.
This work builds upon several excellent open-source projects:
- LlamaFactory for training infrastructure
- Dream for diffusion language models
- SDAR for semi-autoregressive models
- TraceRL for evaluation framework
If you find this work useful, please cite our paper.
This project is released under the MIT License. See LICENSE file for details.
For questions or issues, please open an issue on GitHub or contact the authors.