pith. sign in

arxiv: 1905.12008 · v1 · pith:YCADLQYUnew · submitted 2019-05-28 · 💻 cs.CV · cs.AI· cs.CL· cs.LG

Leveraging Medical Visual Question Answering with Supporting Facts

classification 💻 cs.CV cs.AIcs.CLcs.LG
keywords competitionfactsfourlearningsupportingtasksaccuracyachieved
0
0 comments X
read the original abstract

In this working notes paper, we describe IBM Research AI (Almaden) team's participation in the ImageCLEF 2019 VQA-Med competition. The challenge consists of four question-answering tasks based on radiology images. The diversity of imaging modalities, organs and disease types combined with a small imbalanced training set made this a highly complex problem. To overcome these difficulties, we implemented a modular pipeline architecture that utilized transfer learning and multi-task learning. Our findings led to the development of a novel model called Supporting Facts Network (SFN). The main idea behind SFN is to cross-utilize information from upstream tasks to improve the accuracy on harder downstream ones. This approach significantly improved the scores achieved in the validation set (18 point improvement in F-1 score). Finally, we submitted four runs to the competition and were ranked seventh.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.