Package: syntheticdata 0.1.1
syntheticdata: Synthetic Clinical Data Generation and Privacy-Preserving Validation
Generates synthetic clinical datasets that preserve statistical properties while reducing re-identification risk. Implements Gaussian copula simulation, bootstrap with noise injection, and Laplace noise perturbation, with built-in utility and privacy validation metrics. Useful for privacy-aware data sharing in multi-site clinical research. Validates synthetic data quality via distributional similarity (Kolmogorov-Smirnov), discriminative accuracy (real-vs-synthetic classifier), and nearest-neighbor privacy ratio. Methods described in Jordon et al. (2022) <doi:10.48550/arXiv.2205.03257> and Snoke et al. (2018) <doi:10.1111/rssa.12358>.
Authors:
syntheticdata_0.1.1.tar.gz
syntheticdata_0.1.1.zip(r-4.7)syntheticdata_0.1.1.zip(r-4.6)syntheticdata_0.1.1.zip(r-4.5)
syntheticdata_0.1.1.tgz(r-4.6-any)syntheticdata_0.1.1.tgz(r-4.5-any)
syntheticdata_0.1.1.tar.gz(r-4.7-any)syntheticdata_0.1.1.tar.gz(r-4.6-any)
syntheticdata_0.1.1.tgz(r-4.6-emscripten)
manual.pdf |manual.html✨
DESCRIPTION |NEWS
card.svg |card.png
syntheticdata/json (API)
| # Install 'syntheticdata' in R: |
| install.packages('syntheticdata', repos = c('https://cuiweig.r-universe.dev', 'https://cloud.r-project.org')) |
Bug tracker:https://github.com/cuiweig/syntheticdata/issues
Pkgdown/docs site:https://cuiweig.github.io
clinical-datadifferential-privacyhealthcareprivacy-preservingsynthetic-data
Last updated from:24fa64a23f. Checks:9 OK. Indexed: yes.
| Target | Result | Time | Files | Syslog |
|---|---|---|---|---|
| linux-devel-x86_64 | OK | 137 | ||
| source / vignettes | OK | 156 | ||
| linux-release-x86_64 | OK | 161 | ||
| macos-release-arm64 | OK | 117 | ||
| macos-oldrel-arm64 | OK | 70 | ||
| windows-devel | OK | 139 | ||
| windows-release | OK | 75 | ||
| windows-oldrel | OK | 77 | ||
| wasm-release | OK | 102 |
Exports:compare_methodsmodel_fidelityprivacy_risksynthesizevalidate_synthetic
Dependencies:clidplyrgenericsgluelifecyclemagrittrpillarpkgconfigR6rlangtibbletidyselectutf8vctrswithr
Readme and manuals
Help Manual
| Help page | Topics |
|---|---|
| Compare multiple synthesis methods | compare_methods |
| Downstream model fidelity test | model_fidelity |
| Compute privacy risk metrics | privacy_risk |
| Generate synthetic data from a real dataset | synthesize |
| Validate synthetic data quality | validate_synthetic |
