Uncovering connections between LLM token representations and semantic rarity.
Classical IR features like inverse document frequency are powerful indicators of rarity, and help ranking schemes like BM25 weight term importance. While neural reranking models often produce results similar to classical methods, it is unclear what features these models use and how those features are represented in the network.
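As a concrete point of reference, here is a minimal sketch of the kind of classical rarity signal in question: a smoothed inverse document frequency in the BM25 style. The helper name `idf` and the toy corpus are illustrative assumptions, not part of any particular system described here.

```python
import math

def idf(term, documents):
    """Smoothed inverse document frequency (BM25-style).

    Rare terms, which appear in few documents, receive high weights;
    ubiquitous terms receive weights near zero.
    """
    n = len(documents)
    df = sum(1 for doc in documents if term in doc)  # document frequency
    return math.log(1 + (n - df + 0.5) / (df + 0.5))

# Toy corpus: each document is a set of tokens.
docs = [
    {"the", "cat", "sat"},
    {"the", "dog", "ran"},
    {"the", "quokka", "slept"},
]
print(idf("the", docs))     # appears everywhere -> low weight
print(idf("quokka", docs))  # appears once -> high weight
```

The smoothing constants (0.5 in numerator and denominator) follow the common Robertson–Spärck Jones form and keep the weight finite even for terms present in every document.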
To better understand how an important feature like term rarity is captured, we try to predict classical per-token rarity metrics as a linear function of dense token embeddings. A high R^2 value indicates that the token embeddings are highly predictive, suggesting the model has learned to embed signals related to semantic rarity.
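The probing setup above can be sketched as a least-squares fit plus an R^2 readout. The embeddings and rarity targets below are synthetic stand-ins (an assumption for the sketch); in the actual experiment they would come from an LLM's token embedding matrix and corpus frequency statistics.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic stand-ins: real embeddings would come from an LLM's token
# embedding matrix, and targets from per-token corpus IDF statistics.
n_tokens, dim = 1000, 64
embeddings = rng.normal(size=(n_tokens, dim))       # dense token embeddings
true_direction = rng.normal(size=dim)
idf_targets = embeddings @ true_direction + 0.1 * rng.normal(size=n_tokens)

# Fit the linear probe  idf ≈ X @ w  via least squares (bias column appended).
X = np.hstack([embeddings, np.ones((n_tokens, 1))])
w, *_ = np.linalg.lstsq(X, idf_targets, rcond=None)
pred = X @ w

# R^2: fraction of variance in the rarity metric explained by the embeddings.
ss_res = np.sum((idf_targets - pred) ** 2)
ss_tot = np.sum((idf_targets - idf_targets.mean()) ** 2)
r2 = 1 - ss_res / ss_tot
print(f"R^2 = {r2:.3f}")
```

In practice one would fit the probe on a training split of the vocabulary and report R^2 on held-out tokens, so the score reflects a learned direction rather than memorization.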
Establishing how LLMs capture rarity may eventually help us interpret more complicated feature circuits and simplify model structure.
Red represents the actual inverse document frequency of a token.
Blue represents the predicted inverse document frequency of a token.
Red represents the actual stopword tokens.
Blue represents the predicted stopword tokens.





