Automated treatment planning with deep reinforcement learning for head-and-neck (HN) cancer intensity modulated radiation therapy (IMRT).

Journal: Physics in medicine and biology
PMID:

Abstract

To develop a deep reinforcement learning (DRL) agent to self-interact with the treatment planning system to automatically generate intensity modulated radiation therapy (IMRT) treatment plans for head-and-neck (HN) cancer with consistent organ-at-risk (OAR) sparing performance.With IRB approval, one hundred and twenty HN patients receiving IMRT were included. The DRL agent was trained with 20 patients. During each inverse optimization process, the intermediate dosimetric endpoints' values, dose volume constraints' values and structure objective function losses were collected as the DRL. By adjusting the objective constraints as, the agent learned to seek optimal rewards by balancing OAR sparing and planning target volume (PTV) coverage. Reward computed from current dosimetric endpoints and clinical objectives were sent back to the agent to update action policy during model training. The trained agent was evaluated with the rest 100 patients.The DRL agent was able to generate a clinically acceptable IMRT plan within12.4±3.1min without human intervention. DRL plans showed lower PTV maximum dose (109.2%) compared to clinical plans (112.4%) (< .05). Average median dose of left parotid, right parotid, oral cavity, larynx, pharynx of DRL plans were 15.6 Gy, 12.2 Gy, 25.7 Gy, 27.3 Gy and 32.1 Gy respectively, comparable to 17.1 Gy, 15.7 Gy, 24.4 Gy, 23.7 Gy and 35.5 Gy of corresponding clinical plans. The maximum dose of cord + 5 mm, brainstem and mandible were also comparable between the two groups. In addition, DRL plans demonstrated reduced variability, as evidenced by smaller 95% confidence intervals. The total MU of the DRL plans was 1611 vs 1870 (< .05) of clinical plans. The results signaled the DRL's consistent planning strategy compared to the planners' occasional back-and-forth decision-making during planning.The proposed DRL agent is capable of efficiently generating HN IMRT plans with consistent quality.

Authors

  • Dongrong Yang
    Department of Urology, The Second Affiliated Hospital of Soochow University, Suzhou, China.
  • Xin Wu
    Key Laboratory of Engineering Biology for Low-Carbon Manufacturing, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, National Center of Technology Innovation for Synthetic Biology, No. 32, Xiqi Road, Tianjin Airport Economic Park, Tianjin 300308, China. Electronic address: wuxin@tib.cas.cn.
  • Xinyi Li
    Department of Radiation Oncology, Duke University Medical Center, Durham, NC, United States.
  • Ryan Mansfield
    Department of Radiation Oncology, Duke University Medical Center, Durham, NC 27710, United States of America.
  • Yibo Xie
    Information Center, Beijing Hospital, National Center of Gerontology, National Health Commission, Institute of Geriatric Medicine, Chinese Academy of Medical Sciences, Beijing, China.
  • Qiuwen Wu
    Department of Radiation Oncology, Duke University Medical Center, Durham, NC, United States.
  • Q Jackie Wu
    Department of Radiation Oncology, Duke University Medical Center, Durham, NC 27710, United States of America.
  • Yang Sheng
    Department of Radiation Oncology, Duke University Medical Center, Durham, NC, United States.