Machine learning for improving high-dimensional proxy confounder adjustment in healthcare database studies: An overview of the current literature.
Journal:
Pharmacoepidemiology and drug safety
PMID:
35729705
Abstract
PURPOSE: Supplementing investigator-specified variables with large numbers of empirically identified features that collectively serve as 'proxies' for unspecified or unmeasured factors can often improve confounding control in studies utilizing administrative healthcare databases. Consequently, there has been a recent focus on the development of data-driven methods for high-dimensional proxy confounder adjustment in pharmacoepidemiologic research. In this paper, we survey current approaches and recent advancements for high-dimensional proxy confounder adjustment in healthcare database studies.