Model Selection When Multiple Imputation Is Used to Protect Confidentiality in Public Use Data

Satkartar K. Kinney; Jerome P. Reiter; James O. Berger

doi:10.29012/jpc.v2i2.588

Published: Apr 1, 2011

Keywords:

synthetic data, Bayesian methods, BIC, stochastic search variable selection

Satkartar K. Kinney

National Institute of Statistical Sciences

Jerome P. Reiter

Duke University

https://orcid.org/0000-0002-8374-3832

James O. Berger

Duke University

Abstract

Several statistical agencies use, or are considering the use of, multiple imputation to limit the risk of disclosing respondents' identities or sensitive attributes in public use files. For example, agencies can release partially synthetic datasets, comprising the units originally surveyed with some values, such as sensitive values at high risk of disclosure, or values of key identifiers, replaced with multiple imputations. We describe how secondary analysts of such multiply-imputed datasets can implement Bayesian model selection procedures that appropriately condition on the multiple datasets and the information released by the agency about the imputation models. We illustrate by deriving Bayes factor approximations and a data augmentation step for stochastic search variable selection algorithms.

How to Cite

Kinney, Satkartar K., Jerome P. Reiter, and James O. Berger. 2011. “Model Selection When Multiple Imputation Is Used to Protect Confidentiality in Public Use Data”. Journal of Privacy and Confidentiality 2 (2). https://doi.org/10.29012/jpc.v2i2.588.

Issue

Vol. 2 No. 2 (2011)

Section

Articles

Copyright is retained by the authors. By submitting to this journal, the author(s) license the article under the Creative Commons License – Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0), unless choosing a more lenient license (for instance, public domain). For situations not allowed under CC BY-NC-ND, short sections of text, not to exceed two paragraphs, may be quoted without explicit permission provided that full credit, including © notice, is given to the source.

Authors of articles published by the journal grant the journal the right to store the articles in its databases for an unlimited period of time and to distribute and reproduce the articles electronically.

