kvqa

Knowledge-based visual question answering (VQA) is a task in which answering questions about images or video requires knowledge in addition to the visual content. This requirement poses a further challenge on top of classic VQA tasks.

Noa Garcia, Zekun Yang, Chenhui Chu, Mayu Otani, Yuta Nakashima

Knowledge VQA

KnowIT VQA: Answering knowledge-based questions about videos

We propose a novel video understanding task by fusing knowledge-based and video question answering. First, we introduce KnowIT VQA, a …

Noa Garcia, Mayu Otani, Chenhui Chu, Yuta Nakashima

BERT representations for video question answering

Visual question answering (VQA) aims at answering questions about the visual content of an image or a video. Currently, most work on …

Zekun Yang, Noa Garcia, Chenhui Chu, Mayu Otani, Yuta Nakashima, Haruo Takemura

ContextNet: representation and exploration for painting classification and retrieval in context

In automatic art analysis, models that besides the visual elements of an artwork represent the relationships …

Noa Garcia, Benjamin Renoust, Yuta Nakashima