Skip to main content
Article
TMSS: An End-to-End Transformer-Based Multimodal Network for Segmentation and Survival Prediction
Medical Image Computing and Computer Assisted Intervention – MICCAI 2022
  • Numan Saeed, Mohamed Bin Zayed University of Artificial Intelligence
  • Ikboljon Sobirov, Mohamed Bin Zayed University of Artificial Intelligence
  • Roba Al Majzoub, Mohamed Bin Zayed University of Artificial Intelligence
  • Mohammad Yaqub, Mohamed bin Zayed University of Artificial Intelligence
Document Type
Conference Proceeding
Abstract

When oncologists estimate cancer patient survival, they rely on multimodal data. Even though some multimodal deep learning methods have been proposed in the literature, the majority rely on having two or more independent networks that share knowledge at a later stage in the overall model. On the other hand, oncologists do not do this in their analysis but rather fuse the information in their brain from multiple sources such as medical images and patient history. This work proposes a deep learning method that mimics oncologists’ analytical behavior when quantifying cancer and estimating patient survival. We propose TMSS, an end-to-end Transformer based Multimodal network for Segmentation and Survival predication that leverages the superiority of transformers that lies in their abilities to handle different modalities. The model was trained and validated for segmentation and prognosis tasks on the training dataset from the HEad & NeCK TumOR segmentation and the outcome prediction in PET/CT images challenge (HECKTOR). We show that the proposed prognostic model significantly outperforms state-of-the-art methods with a concordance index of 0.763±0.14" role="presentation" style="box-sizing: inherit; display: inline; line-height: normal; word-spacing: normal; overflow-wrap: normal; white-space: nowrap; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; border: 0px; padding: 0px; margin: 0px; position: relative;">0.763±0.140.763±0.14 while achieving a comparable dice score of 0.772±0.030" role="presentation" style="box-sizing: inherit; display: inline; line-height: normal; word-spacing: normal; overflow-wrap: normal; white-space: nowrap; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; border: 0px; padding: 0px; margin: 0px; position: relative;">0.772±0.0300.772±0.030 to a standalone segmentation model. The code is publicly available at https://t.ly/V-_W.

DOI
10.1007/978-3-031-16449-1_31
Publication Date
9-17-2022
Keywords
  • Survival analysis,
  • segmentation,
  • vision transformers,
  • cancer
Comments

IR Deposit conditions: non-described

Citation Information
N. Saeed, I.Sobirov, R. Al Majzoub, and M. Yaqub, "TMSS: An End-to-End Transformer-Based Multimodal Network for Segmentation and Survival Prediction", in Medical Image Computing and Computer Assisted Intervention (MICCAI 2022), Lecture Notes in Computer Science, vol 13437, pp. 319-329, Sept 2022, doi:10.1007/978-3-031-16449-1_31