AIMC Topic: Data Anonymization

Clear Filters Showing 31 to 40 of 42 articles

Transferability of neural network clinical deidentification systems.

Journal of the American Medical Informatics Association : JAMIA
OBJECTIVE: Neural network deidentification studies have focused on individual datasets. These studies assume the availability of a sufficient amount of human-annotated data to train models that can generalize to corresponding test data. In real-world...

Application of Bayesian networks to generate synthetic health data.

Journal of the American Medical Informatics Association : JAMIA
OBJECTIVE: This study seeks to develop a fully automated method of generating synthetic data from a real dataset that could be employed by medical organizations to distribute health data to researchers, reducing the need for access to real data. We h...

Fold-stratified cross-validation for unbiased and privacy-preserving federated learning.

Journal of the American Medical Informatics Association : JAMIA
OBJECTIVE: We introduce fold-stratified cross-validation, a validation methodology that is compatible with privacy-preserving federated learning and that prevents data leakage caused by duplicates of electronic health records (EHRs).

Using word embeddings to improve the privacy of clinical notes.

Journal of the American Medical Informatics Association : JAMIA
OBJECTIVE: In this work, we introduce a privacy technique for anonymizing clinical notes that guarantees all private health information is secured (including sensitive data, such as family history, that are not adequately covered by current technique...

Ensuring electronic medical record simulation through better training, modeling, and evaluation.

Journal of the American Medical Informatics Association : JAMIA
OBJECTIVE: Electronic medical records (EMRs) can support medical research and discovery, but privacy risks limit the sharing of such data on a wide scale. Various approaches have been developed to mitigate risk, including record simulation via genera...

The machine giveth and the machine taketh away: a parrot attack on clinical text deidentified with hiding in plain sight.

Journal of the American Medical Informatics Association : JAMIA
OBJECTIVE: Clinical corpora can be deidentified using a combination of machine-learned automated taggers and hiding in plain sight (HIPS) resynthesis. The latter replaces detected personally identifiable information (PII) with random surrogates, allo...

Deep Learning Approaches Outperform Conventional Strategies in De-Identification of German Medical Reports.

Studies in health technology and informatics
One of the major obstacles for research on German medical reports is the lack of de-identified medical corpora. Previous de-identification tasks focused on non-German medical texts, which raised the demand for an in-depth evaluation of de-identificat...

Impact of De-Identification on Clinical Text Classification Using Traditional and Deep Learning Classifiers.

Studies in health technology and informatics
Clinical text de-identification enables collaborative research while protecting patient privacy and confidentiality; however, concerns persist about the reduction in the utility of the de-identified text for information extraction and machine learnin...

Opportunities and challenges in developing deep learning models using electronic health records data: a systematic review.

Journal of the American Medical Informatics Association : JAMIA
OBJECTIVE: To conduct a systematic review of deep learning models for electronic health record (EHR) data, and illustrate various deep learning architectures for analyzing different data sources and their target applications. We also highlight ongoin...

Community-Acquired Pneumonia Case Validation in an Anonymized Electronic Medical Record-Linked Expert System.

Clinical infectious diseases : an official publication of the Infectious Diseases Society of America
An electronic anonymized patient portal analysis using radiographic reports and admission and discharge diagnoses had sensitivity, specificity, positive predictive value, and negative predictive value of 84.7%, 78.2%, 75%, and 87%, respectively, for ...