A privacy-preserving dependable deep federated learning model for identifying new infections from genome sequences.

Journal: Scientific reports
PMID:

Abstract

The traditional molecular-based identification (TMID) technique of new infections from genome sequences (GSs) has made significant contributions so far. However, due to the sensitive nature of the medical data, the TMID technique of transferring the patient's data to the central machine or server may create severe privacy and security issues. In recent years, the progression of deep federated learning (DFL) and its remarkable success in many domains has guided as a potential solution in this field. Therefore, we proposed a dependable and privacy-preserving DFL-based identification model of new infections from GSs. The unique contributions include automatic effective feature selection, which is best suited for identifying new infections, designing a dependable and privacy-preserving DFL-based LeNet model, and evaluating real-world data. To this end, a comprehensive experimental performance evaluation has been conducted. Our proposed model has an overall accuracy of 99.12% after independently and identically distributing the dataset among six clients. Moreover, the proposed model has a precision of 98.23%, recall of 98.04%, f1-score of 96.24%, Cohen's kappa of 83.94%, and ROC AUC of 98.24% for the same configuration, which is a noticeable improvement when compared to the other benchmark models. The proposed dependable model, along with empirical results, is encouraging enough to recognize as an alternative for identifying new infections from other virus strains by ensuring proper privacy and security of patients' data.

Authors

  • Sk Tanzir Mehedi
    Department of Information and Communication Technology, Mawlana Bhashani Science and Technology University, Tangail 1902, Bangladesh.
  • Lway Faisal Abdulrazak
    Department of Space Technology Engineering, Electrical Engineering Technical College, Middle Technical University, Baghdad, Iraq.
  • Kawsar Ahmed
    Group of Biophotomatiχ, Department of Information and Communication Technology, Mawlana Bhashani Science and Technology University, Santosh, Tangail-1902, Bangladesh; Department of Information and Communication Technology, Mawlana Bhashani Science and Technology University, Santosh, Tangail, 1902, Bangladesh. Electronic address: kawsar.ict@mbstu.ac.bd.
  • Muhammad Shahin Uddin
    Department of Information and Communication Technology, Mawlana Bhashani Science and Technology University, Santosh, Tangail, 1902, Bangladesh.
  • Francis M Bui
    Department of Electrical and Computer Engineering, University of Saskatchewan, 57 Campus Drive, Saskatoon, SK S7N 5A9, Canada.
  • Li Chen
    Department of Endocrinology and Metabolism, Qilu Hospital, Shandong University, Jinan, China.
  • Mohammad Ali Moni
    Bone Biology Divisions, Garvan Institute of Medical Research, Sydney, NSW 2010, Australia; The University of Sydney, School of Medical Sciences, Faculty of Medicine & Health, NSW 2006, Australia. Electronic address: mohammad.moni@sydney.edu.au.
  • Fahad Ahmed Al-Zahrani
    Department of Computer Engineering, College of Computer and Information Systems, Umm Al-Qura University, Mecca, Saudi Arabia.