Skip to main content
Article
Towards Visual Question Answering on Pathology Images
ACL-IJCNLP 2021 - 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Proceedings of the Conference
  • Xuehai He, University of California, San Diego
  • Zhuo Cai, Tsinghua University
  • Wenlan Wei, Wuhan University
  • Yichen Zhang, University of California, San Diego
  • Luntian Mou, Beijing University of Technology
  • Eric Xing, Mohamed bin Zayed University of Artificial Intelligence
  • Pengtao Xie, University of California, San Diego
Document Type
Book
Abstract

Pathology imaging is broadly used for identifying the causes and effects of diseases or injuries. Given a pathology image, being able to answer questions about the clinical findings contained in the image is very important for medical decision making. In this paper, we aim to develop a pathological visual question answering framework to analyze pathology images and answer medical questions related to these images. To build such a framework, we create PathVQA, a pathology VQA dataset with 32,795 questions asked from 4,998 pathology images. We also propose a three-level optimization framework which performs self-supervised pretraining and VQA finetuning end-to-end to learn powerful visual and textual representations jointly and automatically identifies and excludes noisy self-supervised examples from pretraining. We perform experiments on our created PathVQA dataset and the results demonstrate the effectiveness of our proposed methods. The datasets and code are available at https://github.com/UCSD-AI4H/PathVQA.

DOI
10.18653/v1/2021.acl-short.90
Publication Date
8-1-2021
Disciplines
Comments

IR deposit conditions: none described

OA version available on ACL Anthology

Citation Information
X. He et al., “Towards visual question answering on pathology images,” ACL-IJCNLP 2021 - 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Proceedings of the Conference, vol. 2, pp. 708–718, 2021, doi: 10.18653/V1/2021.ACL-SHORT.90.