World of ScoreCraft: Novel Multi-Scorer Experiment on the Impact of a Decision Support System in Sleep Staging.

Journal: Journal of sleep research

Published Date: Jun 19, 2025

Abstract

Manual scoring of polysomnography (PSG) is a time-intensive task, prone to inter-scorer variability that can impact diagnostic reliability. This study investigates the integration of decision support systems (DSS) into PSG scoring workflows, focusing on their effects on accuracy, scoring time and potential biases toward recommendations from artificial intelligence (AI) compared to human-generated recommendations. Using a novel online scoring platform, we conducted a repeated-measures study with sleep technologists, who scored traditional and self-applied PSGs. Participants were occasionally presented with recommendations labelled as either human- or AI-generated. As the goal of this study was to isolate the effect of perceived recommendation sources on scorer behaviour, all recommendations were human-generated. We found that traditional PSGs tended to be scored slightly more accurately than self-applied PSGs, but this difference was not statistically significant. Correct recommendations significantly improved scoring accuracy for both PSG types, while incorrect recommendations reduced accuracy. No significant bias was observed toward or against AI-generated recommendations compared to human-generated recommendations. These findings highlight the potential of DSSs to enhance PSG scoring reliability. However, ensuring the accuracy of the suggestions is critical to maximising its benefits. Future research should explore the long-term impacts of DSS on scoring workflows and strategies for integrating AI in clinical practice.

Authors

Benedikt Holm

School of Technology, Department of Computer Science, Reykjavik University, Reykjavík, Iceland.
Arnar Óskarsson

School of Technology, Department of Computer Science, Reykjavik University, Reykjavík, Iceland.
Björn Elvar Þorleifsson

School of Technology, Department of Computer Science, Reykjavik University, Reykjavík, Iceland.
Hörður Þór Hafsteinsson

School of Technology, Department of Computer Science, Reykjavik University, Reykjavík, Iceland.
Sigríður Sigurðardóttir

School of Technology, Sleep Institute, Reykjavík University, Reykjavík, Iceland.
Heiður Grétarsdóttir

School of Technology, Sleep Institute, Reykjavík University, Reykjavík, Iceland.
Kenan Hoelke

School of Technology, Sleep Institute, Reykjavík University, Reykjavík, Iceland.
Gabriel Marc Marie Jouan

School of Technology, Department of Computer Science, Reykjavik University, Reykjavík, Iceland.
Thomas Penzel
Erna Sif Arnardottir

School of Technology, Sleep Institute, Reykjavík University, Reykjavík, Iceland.
María Óskarsdóttir

School of Technology, Department of Computer Science, Reykjavik University, Reykjavík, Iceland.

Keywords

No keywords available for this article.

External Resources

View on PubMed Access via DOI PubMed (40537888)

World of ScoreCraft: Novel Multi-Scorer Experiment on the Impact of a Decision Support System in Sleep Staging.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals

World of ScoreCraft: Novel Multi-Scorer Experiment on the Impact of a Decision Support System in Sleep Staging.

Abstract

Authors

Keywords

External Resources

Don't Miss the Future of Medicine

Popular Topics

Recent Journals