DataSimilarity: Quantifying Similarity of Datasets and Multivariate Two- And k-Sample Testing

A collection of methods for quantifying the similarity of two or more datasets, many of which can be used for two- or k-sample testing. It provides newly implemented methods as well as wrapper functions for existing methods that enable calling many different methods in a unified framework. The methods were selected from the review and comparison of Stolte et al. (2024) <doi:10.1214/24-SS149>.

Version: 0.1.1
Depends: R (≥ 3.5.0)
Imports: boot, stats
Suggests: ade4, approxOT, Ball, caret, clue, cramer, crossmatch, dbscan, densratio, DWDLargeR, e1071, Ecume, energy, expm, FNN, gTests, gTestsMulti, HDLSSkST, hypoRF, kernlab, kerTests, KMD, knitr, LPKsample, Matrix, mvtnorm, nbpMatching, pROC, purrr, randtoolbox, rlemon, rpart, rpart.plot, testthat, RSNNS
Published: 2025-03-18
DOI: 10.32614/CRAN.package.DataSimilarity
Author: Marieke Stolte ORCID iD [aut, cre, cph], Luca Sauer [aut], David Alvarez-Melis [ctb] (Original python implementation of OTDD, <https://github.com/microsoft/otdd.git>), Nabarun Deb [ctb] (Original implementation of rank-based Energy test (DS), <https://github.com/NabarunD/MultiDistFree.git>), Bodhisattva Sen [ctb] (Original implementation of rank-based Energy test (DS), <https://github.com/NabarunD/MultiDistFree.git>)
Maintainer: Marieke Stolte <stolte at statistik.tu-dortmund.de>
License: GPL (≥ 3)
NeedsCompilation: no
CRAN checks: DataSimilarity results

Documentation:

Reference manual: DataSimilarity.pdf
Vignettes: Using DataSimilarity (source, R code)

Downloads:

Package source: DataSimilarity_0.1.1.tar.gz
Windows binaries: r-devel: not available, r-release: DataSimilarity_0.1.1.zip, r-oldrel: not available
macOS binaries: r-devel (arm64): DataSimilarity_0.1.1.tgz, r-release (arm64): DataSimilarity_0.1.1.tgz, r-oldrel (arm64): DataSimilarity_0.1.1.tgz, r-devel (x86_64): DataSimilarity_0.1.1.tgz, r-release (x86_64): DataSimilarity_0.1.1.tgz, r-oldrel (x86_64): DataSimilarity_0.1.1.tgz

Linking:

Please use the canonical form https://CRAN.R-project.org/package=DataSimilarity to link to this page.