This repository contains the official implementation of ConceptLM and Next Concept Prediction (NCP), from our paper *Next Concept Prediction in Discrete Latent Space Leads to Stronger Language Models*.
- Release ConceptLM models (the largest Pythia and Llama models are released)
- Release test code
- Release all models, including ablation studies, analyses, and their training curves
- Release training code
- ...
We propose Next Concept Prediction (NCP), a generative pretraining paradigm built on top of Next Token Prediction (NTP). Our model, ConceptLM, quantizes hidden states with vector quantization to construct a concept vocabulary. It is trained with both the NCP and NTP objectives, and at inference it generates a concept that guides the generation of the following tokens.
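As a rough illustration of the quantization step, here is a minimal sketch using the `vector_quantize_pytorch` package (listed in the install instructions below). All dimensions and the codebook size are illustrative placeholders, not the settings used in the paper:

```python
# Minimal sketch (not the official training code): quantizing LM hidden
# states into a discrete concept vocabulary with vector_quantize_pytorch.
import torch
from vector_quantize_pytorch import VectorQuantize

hidden_dim = 768           # hidden size of the backbone LM (placeholder)
concept_vocab_size = 1024  # size of the concept vocabulary (placeholder)

vq = VectorQuantize(
    dim=hidden_dim,
    codebook_size=concept_vocab_size,  # one codebook entry per concept
    decay=0.8,                         # EMA update rate for the codebook
    commitment_weight=1.0,             # weight of the commitment loss
)

# Stand-in for hidden states from a transformer layer: (batch, seq, dim)
hidden_states = torch.randn(2, 16, hidden_dim)

# quantized: nearest codebook vectors; concept_ids: discrete concept indices;
# commit_loss: auxiliary loss keeping hidden states close to the codebook
quantized, concept_ids, commit_loss = vq(hidden_states)
print(concept_ids.shape)  # torch.Size([2, 16])
```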
Key highlights:
- Introduce NCP, a harder LLM pre-training objective.
- Build a concept representation in a discrete latent space (the concept vocabulary) on top of the LLM's continuous latent space.
- Introduce ConceptLM, a novel architecture that integrates NCP and NTP.
We implement the generate function for the Llama model, so the details of ConceptLM_Llama differ from the Pythia and GPT-2 implementations.
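For orientation only, here is a hypothetical sketch of how a custom `generate` override hooks into the Hugging Face Llama stack. The class name and the greedy loop below are placeholders; the actual concept-guided decoding logic lives in `ConceptLM_Llama`:

```python
# Hypothetical sketch (not the official code): overriding generate on a
# Llama model. A real concept-guided loop would first predict a concept,
# then condition the token logits on it before choosing the next token.
import torch
from transformers import LlamaForCausalLM

class ConceptGuidedLlama(LlamaForCausalLM):  # placeholder class name
    @torch.no_grad()
    def generate(self, input_ids, max_new_tokens=32, **kwargs):
        for _ in range(max_new_tokens):
            # Plain greedy decoding as a stand-in for the concept-guided step
            logits = self(input_ids).logits[:, -1, :]
            next_token = logits.argmax(dim=-1, keepdim=True)
            input_ids = torch.cat([input_ids, next_token], dim=-1)
        return input_ids
```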
```
.
├── figures/
├── lm_eval/
├── lm_eval_files/
├── ConceptLM_arc/
├── ConceptLM_GPT2/
├── ConceptLM_Pythia/
├── ConceptLM_Llama/
└── README.md
```
To reproduce our results, download our models first, then run:

```bash
git clone https://github.com/LUMIA-Group/ConceptLM
cd ConceptLM/lm_eval
pip install -e .
pip install transformers==4.51 vector_quantize_pytorch flash_attn
# download our models
bash run_lm_eval.sh
```
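If the released checkpoints follow the standard Hugging Face format, loading one should look roughly like the sketch below. The repo id is a placeholder, not a confirmed model path; see the release links above for the actual checkpoints:

```python
# Hypothetical usage sketch: loading a released ConceptLM checkpoint.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "LUMIA-Group/ConceptLM-Llama"  # placeholder id, not confirmed
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    trust_remote_code=True,  # custom ConceptLM architecture code
)

inputs = tokenizer("The capital of France is", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=16)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```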
If you have any questions or are interested in our work, please feel free to contact us at liuyl03181@gmail.com.
If you find this work useful, please consider citing our paper:
```bibtex
@article{conceptlm2026,
  title   = {Next Concept Prediction in Discrete Latent Space Leads to Stronger Language Models},
  author  = {Yuliang Liu and Yunchong Song and Yixuan Wang and Kewen Ge and Alex Lamb and Qipeng Guo and Kai Chen and Bowen Zhou and Zhouhan Lin},
  journal = {arXiv preprint},
  year    = {2026}
}
```