An open competition involving thousands of competitors failed to construct useful abstract classifiers for new diagnostic test accuracy systematic reviews.

Journal: Research synthesis methods
PMID:

Abstract

There are currently no abstract classifiers, which can be used for new diagnostic test accuracy (DTA) systematic reviews to select primary DTA study abstracts from database searches. Our goal was to develop machine-learning-based abstract classifiers for new DTA systematic reviews through an open competition. We prepared a dataset of abstracts obtained through database searches from 11 reviews in different clinical areas. As the reference standard, we used the abstract lists that required manual full-text review. We randomly splitted the datasets into a train set, a public test set, and a private test set. Competition participants used the training set to develop classifiers and validated their classifiers using the public test set. The classifiers were refined based on the performance of the public test set. They could submit as many times as they wanted during the competition. Finally, we used the private test set to rank the submitted classifiers. To reduce false exclusions, we used the Fbeta measure with a beta set to seven for evaluating classifiers. After the competition, we conducted the external validation using a dataset from a cardiology DTA review. We received 13,774 submissions from 1429 teams or persons over 4 months. The top-honored classifier achieved a Fbeta score of 0.4036 and a recall of 0.2352 in the external validation. In conclusion, we were unable to develop an abstract classifier with sufficient recall for immediate application to new DTA systematic reviews. Further studies are needed to update and validate classifiers with datasets from other clinical areas.

Authors

  • Yuki Kataoka
    Hospital Care Research Unit, Hyogo Prefectural Amagasaki General Medical Center, Amagasaki, Hyogo, Japan.
  • Shunsuke Taito
    Scientific Research Works Peer Support Group (SRWS-PSG), Osaka, Japan.
  • Norio Yamamoto
    Department of Orthopaedic Surgery, Kagawa Prefectural Central Hospital, Takamatsu, Kagawa 760-8557, Japan.
  • Ryuhei So
    Scientific Research Works Peer Support Group (SRWS-PSG), Osaka, Japan.
  • Yusuke Tsutsumi
    Scientific Research Works Peer Support Group (SRWS-PSG), Osaka, Japan.
  • Keisuke Anan
    Scientific Research Works Peer Support Group (SRWS-PSG), Osaka, Japan.
  • Masahiro Banno
    Scientific Research Works Peer Support Group (SRWS-PSG), Osaka, Japan.
  • Yasushi Tsujimoto
    Scientific Research Works Peer Support Group (SRWS-PSG), Osaka, Japan.
  • Yoshitaka Wada
    Department of Rehabilitation Medicine I, School of Medicine, Fujita Health University, Toyoake, Japan.
  • Shintaro Sagami
    Center for Advanced IBD Research and Treatment, Kitasato University Kitasato Institute Hospital, Tokyo, Japan.
  • Hiraku Tsujimoto
    Hospital Care Research Unit, Hyogo Prefectural Amagasaki General Medical Center, Amagasaki, Japan.
  • Takashi Nihashi
    Department of Radiology, Komaki City Hospital, Komaki, Japan.
  • Motoki Takeuchi
    Department of Emergency and General Internal Medicine, Fujita Health University School of Medicine, Toyoake, Japan.
  • Teruhiko Terasawa
    Section of General Internal Medicine, Department of Emergency and General Internal Medicine, Fujita Health University School of Medicine, Toyoake, Japan.
  • Masahiro Iguchi
    Department of Neurology, Fukushima Medical University, Fukushima, Japan.
  • Junji Kumasawa
    Department of Critical Care Medicine, Sakai City Medical Center, Sakai, Osaka, Japan.
  • Takumi Ichikawa
    Yahoo Japan Corporation, Tokyo, Japan.
  • Ryuki Furukawa
    Yahoo Japan Corporation, Tokyo, Japan.
  • Jun Yamabe
    Smart 119 Inc., 7th floor, Chiba Chuo Twin Building No. 2, 2-5-1 Chuo, Chiba, Japan.
  • Toshi A Furukawa
    Department of Health Promotion and Human Behavior, Kyoto University Graduate School of Medicine/School of Public Health, Kyoto University, Kyoto, Japan.