Benchmark collection of 172 distributions of ordinal data

This benchmark collection includes 172 simulated distributions of ordinal data. The collection is inspired by benchmarks introduced by Haslbeck et al. (2022) and Marron & Wand (1992) and is meant as a benchmark to validate and test statistical methods, such as modality detection methods.

The ordinal distributions in our benchmark were created by categorizing simulated data into either 5 or 10 categories (with the exception of the claw distribution). Each distribution type was simulated for sample sizes N = 50, N = 500, N = 5000, and N = 50,000. Simulated distributions are either unimodal, bimodal, trimodal, or have five modes. The underlying data generating process is either Gaussian or non-Gaussian, with non-Gaussian distributions being (mixtures of) beta distributions, or sampled from probabilities of specific cases which we found crucial for evaluating modality detection methods. Additionally, the benchmark entails uniform distributions as a special form of unimodals, and the claw distribution which is a mixture of five Gaussians.

Raw data

The distributions can be accessed as an RData file (benchmark_distributions.RData) or as a csv file in the zip folder benchmark_distributions.zip.

Replicate the data base

All simulated distributions are fully replicable, The corresponding R code is stored in create_benchmark.R.

Visualization

Plots and names of all distributions in the benchmark can be found in benchmark_distributions.pdf

References

Haslbeck, J., Vermunt, J. K., & Waldorp, L. (2022). The impact of ordinal scales on gaussian mixture recovery. Behavior Research Methods, 55 (4), 2143–2156. https://doi.org/10.3758/s13428-022-01883-8

Marron, J., & Wand, M. (1992). Exact mean integrated squared error. The Annals of Statistics, 20 (2), 712–736. https://doi.org/10.1214/aos/1176348653

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Benchmark collection of 172 distributions of ordinal data

Raw data

Replicate the data base

Visualization

References

About

Uh oh!

Releases 3

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
README.md		README.md
benchmark_distributions.RData		benchmark_distributions.RData
benchmark_distributions.pdf		benchmark_distributions.pdf
benchmark_distributions.zip		benchmark_distributions.zip
create_benchmark.R		create_benchmark.R

mfhstdt/benchmark_ordinal_distributions

Folders and files

Latest commit

History

Repository files navigation

Benchmark collection of 172 distributions of ordinal data

Raw data

Replicate the data base

Visualization

References

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Languages

Packages