niceday

niceday is an R package for non-parametrically estimating fold-differences in multivariate outcomes. niceday specifically estimates log-fold differences in covariate-weighted conditional means of an outcome even when the desired outcome may not be directly observed. Instead, the observed outcome reflects the desired outcome but with category- and sample-specific multiplicative distortions. One of the key advantages of niceday is that no assumptions about the distribution of outcomes or the structural relationship between the outcome and adjustment covariates (e.g., linearity) need to be made.

We expect niceday to be especially useful for microbiome researchers, because

High-throughput sequencing of microbiomes displays variation in sequencing depth, and unequal detection of taxa (e.g., due to extraction bias, amongst other things). Therefore, niceday allows estimation of fold-differences on the true abundance scale, but just the observed sequencing scale.
Researchers often want to adjust for additional covariates (either known confounders or simply to make more reasonable comparisons between groups), but don’t know what form the adjustment should take – e.g., linear vs nonlinear, etc.

We welcome your feedback and questions, and hope you find this package useful!

Installation

To install and load niceday, use the following code. You may get some messages about the loaded packages, but these aren’t a problem.

# install.packages("remotes")
# remotes::install_github("statdivlab/niceday")
library(niceday)

Use

Here’s an example of how to run niceday’s main fitting function, ndFit. Check out the vignettes for more information! Here, we have an observed data matrix W, metadata data, contrast of interest A, and adjustment covariates X.

library(niceday)
data(EcoZUR_meta)
data(EcoZUR_count)
my_ndfit <- ndFit(W = EcoZUR_count[, 1:50], # consider only the first 50 taxa to run quickly
      data = EcoZUR_meta,
      A = ~ Diarrhea,
      X = ~ sex + age_months,
      num_crossval_folds = 2, # use more folds in practice
      num_crossfit_folds = 2, # for cross validation and cross fitting
      sl.lib.pi = c("SL.mean"), # choosing single learner for the example to run quickly,
      sl.lib.m = c("SL.mean"))  # in practice would use other options as well

If you want to go deep down the rabbit hole, you can look at how we ran it for the data analysis in our paper here.

Citation

If you use niceday for your analysis, please cite our manuscript.

Grant Hopkins, Sarah Teichman, Ellen Graham, and Amy Willis. “Nonparametric Identification and Estimation of Ratios of Multi-Category Means under Preferential Sampling.” https://arxiv.org/abs/2510.23920

Bug reports and feature requests

If you identify a bug in niceday or have a feature request, please post an issue here.

Nomenclature

niceday stands for Nonparametrically Identified (de-Confounded) Estimands for Difference Abundance, Yippee!

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
.github/workflows		.github/workflows
R		R
data		data
man		man
tests		tests
vignettes		vignettes
.Rbuildignore		.Rbuildignore
.gitignore		.gitignore
DESCRIPTION		DESCRIPTION
LICENSE		LICENSE
LICENSE.md		LICENSE.md
NAMESPACE		NAMESPACE
README.Rmd		README.Rmd
README.md		README.md
niceday.Rproj		niceday.Rproj

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Licenses found

Uh oh!

Repository files navigation

niceday

Installation

Use

Citation

Bug reports and feature requests

Nomenclature

About

Licenses found

Uh oh!

Releases 1

Packages

Contributors 3

Uh oh!

Languages

License

Licenses found

statdivlab/niceday

Folders and files

Latest commit

History

Repository files navigation

niceday

Installation

Use

Citation

Bug reports and feature requests

Nomenclature

About

Topics

Resources

License

Licenses found

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Contributors 3

Uh oh!

Languages

Packages