Comparing the Ability of Regression Modeling and Bayesian Additive Regression Trees to Predict Costs in a Responsive Survey Design Context.

Journal: Journal of Official Statistics

Abstract

Responsive survey designs rely on incoming data from field data collection to optimize tradeoffs between cost and quality. To make these decisions in real time, survey managers rely on monitoring tools that generate proxy indicators for cost and quality. There is a developing literature on proxy indicators for the risk of nonresponse bias. However, there is very little research on proxy indicators for costs, and almost none aimed at predicting costs under alternative design strategies. Predictions of survey costs and proxy error indicators can be used to optimize survey designs in real time. Using data from the National Survey of Family Growth, we evaluate alternative modeling strategies for predicting survey costs (specifically, interviewer hours). The models include multilevel regression (with random interviewer effects) and Bayesian Additive Regression Trees (BART).

Authors

  • James Wagner
    University of Michigan, 4053 ISR, 426 Thompson St., Ann Arbor, MI 48104, U.S.A.
  • Brady T West
    University of Michigan, 4053 ISR, 426 Thompson St., Ann Arbor, MI 48104, U.S.A.
  • Michael R Elliott
    University of Michigan, 4053 ISR, 426 Thompson St., Ann Arbor, MI 48104, U.S.A.
  • Stephanie Coffey
    U.S. Census Bureau, 4600 Silver Hill Road, Suitland, MD 20746, U.S.A.
