Large-scale machine learning of media outlets for understanding public reactions to nation-wide viral infection outbreaks.

Journal: Methods (San Diego, Calif.)
PMID:

Abstract

From May to July 2015, there was a nation-wide outbreak of Middle East respiratory syndrome (MERS) in Korea. MERS is caused by MERS-CoV, an enveloped, positive-sense, single-stranded RNA virus belonging to the family Coronaviridae. Despite expert opinions that the danger of MERS might be exaggerated, there was an overreaction by the public according to the Korean mass media, which led to a noticeable reduction in social and economic activities during the outbreak. To explain this phenomenon, we presumed that machine learning-based analysis of media outlets would be helpful and collected a number of Korean mass media articles and short-text comments produced during the 10-week outbreak. To process and analyze the collected data (over 86 million words in total) effectively, we created a methodology composed of machine-learning and information-theoretic approaches. Our proposal included techniques for extracting emotions from emoticons and Internet slang, which allowed us to significantly (approximately 73%) increase the number of emotion-bearing texts needed for robust sentiment analysis of social media. As a result, we discovered a plausible explanation for the public overreaction to MERS in terms of the interplay between the disease, mass media, and public emotions.

Authors

  • Sungwoon Choi
    Electrical and Computer Engineering, Seoul National University, Seoul 08826, Republic of Korea; IT Convergence, Korea University, Seoul 02841, Republic of Korea. Electronic address: nebulach23@gmail.com.
  • Jangho Lee
    Electrical and Computer Engineering, Seoul National University, Seoul 08826, Republic of Korea. Electronic address: ubuntu@snu.ac.kr.
  • Min-Gyu Kang
    Internal Medicine, Chungbuk National University Hospital, Cheongju 28644, Republic of Korea. Electronic address: irreversibly@gmail.com.
  • Hyeyoung Min
    RNA Biopharmacy Laboratory, College of Pharmacy, Chung-Ang University, Seoul 06974, Republic of Korea. Electronic address: hymin@cau.ac.kr.
  • Yoon-Seok Chang
    Department of Internal Medicine, Seoul National University Bundang Hospital, Seoul National University College of Medicine, Gyeonggi-do 13620, Republic of Korea. Electronic address: addchang@snu.ac.kr.
  • Sungroh Yoon
    4 Department of Electrical and Computer Engineering and Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Korea.