SIMVALE

Abstract

The rise of Large Language Model (LLM)-powered simulators has enabled highly realistic modeling of complex social phenomena, significantly reducing the costs and efforts associated with real-world data collection. However, their reliability remains a persistent challenge, and existing validation approaches offer only partial generalization, typically evaluating simulator realism along isolated dimensions and at fixed levels of granularity. To this end, this thesis introduces SIMVALE (SIMulator VAlidation with Latent Embeddings) — a generalizable multi-dimensional validation framework enabling quantitative assessment of LLM-based simulators. SIMVALE leverages latent embedding clustering and yields both global and local feature-based metrics to evaluate how closely a simulator reflects the reality it aims to model, ultimately supporting its improvement. Furthermore, SIMVALE is tested on a real case study to assess the simulator’s ability to replicate general behavioral patterns, toxic profiles, and the effects of moderation interventions.

The framework

SIMVALE is a novel validation framework for LLM-powered simulators that enables their quantitative assessment across multiple behavioral dimensions — which can be tailored to the simulator's modeling goals — and supports evaluations at both coarse, i.e. global, and fine-grained, i.e. local, levels of detail.

The supported validation is two-fold, as it is designed to assess both the simulator's realism, by comparing its generated content to a real-world reference dataset, and the effects of possible alterations on the simulator's output, by comparing its original content to its altered counterpart.

But HOW?

The framework is built upon four key phases. First, a transparent feature extraction pipeline is applied to each dataset to extract task-agnostic and domain-specific features, organized into distinct macro-dimensions. Task-agnostic features describe general textual properties, such as linguistic style, sentiment and readability, while domain-specific features are defined according to specific validation objectives, e.g. personality and toxicity.

In the second phase, BERT models are fine-tuned individually on selected domain-specific features using the real data, allowing each model to specialize in detecting linguistic nuances associated with its corresponding target. These fine-tuned models are then employed to derive target-aware, contextualized embeddings for all textual instances in both real and simulated data.

In the third phase, the resulting embeddings are clustered separately for each dataset to uncover homogeneous latent, domain-specific subsets of instances, referred to as profiles. These are then aligned across datasets with a similarity-based, one-to-one mapping strategy focusing on the most dominant traits characterizing each profile, allowing for a finer-grained assessment of the simulator by comparing corresponding profile subsets.

Unlike more conventional approaches that would perform clustering directly on — domain-specific — features, with each textual instance represented by a narrow set of scalar values, our method leverages fine-tuned embeddings to uncover richer and more subtle behavioral groupings. Indeed, these vector representations encode not only the general linguistic knowledge acquired during BERT's pre-training, but also the domain-specific patterns learned through the fine-tuning process. As a result, this allows to group instances in a more meaningful, expressive and behaviorally-coherent manner.

Finally, an evaluation pipeline is applied to quantify the distributional divergences across the macro-dimensional features between dataset pairs. To assess realism each real feature is compared with its original simulated counterpart, while to evaluate alteration effects the original simulated features are compared against their altered versions. Features exhibiting significant differences are identified through a statistical test, and further analyzed using a set of comparative metrics. These analyses are conducted both globally, considering the entire datasets, and locally, focusing on the matched profile datasets.

Furthermore, in this work we test the SIMVALE framework on a novel LLM-based OSN simulator, designed to model the emergence of toxic behavior and to evaluate the effects of different moderation interventions on harmful content.

Thus, the validation focuses on assessing the simulator's realism in reproducing real-world OSN patterns, as well as the efficacy of its supported moderation interventions in mitigating toxicity. As discussed earlier, both evaluations are conducted globally, on the entire datasets, and locally, targeting specific toxicity-related content profiles.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
scripts		scripts
01_merge_data.ipynb		01_merge_data.ipynb
02_data_preparation.ipynb		02_data_preparation.ipynb
03_feat_extr_pipe.ipynb		03_feat_extr_pipe.ipynb
04_toxicity_vect.ipynb		04_toxicity_vect.ipynb
05_llm_pipe_tox.ipynb		05_llm_pipe_tox.ipynb
06_clustering_emb.ipynb		06_clustering_emb.ipynb
07_validation_simulator.ipynb		07_validation_simulator.ipynb
08a_validation_ofsa.ipynb		08a_validation_ofsa.ipynb
08b_validation_neutral.ipynb		08b_validation_neutral.ipynb
08c_validation_empathizing.ipynb		08c_validation_empathizing.ipynb
08d_validation_prescriptive.ipynb		08d_validation_prescriptive.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

SIMVALE

Abstract

The framework

But HOW?

About

Uh oh!

Releases

Packages

Languages

lcodagiorgio/SIMVALE

Folders and files

Latest commit

History

Repository files navigation

SIMVALE

Abstract

The framework

But HOW?

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages