Knowledge VQA
Visual question answering (VQA) with knowledge is a task in which answering questions about images or videos requires knowledge in addition to the visual content. This extra requirement adds a challenge on top of the classic VQA tasks.
Noa Garcia, Zekun Yang, Chenhui Chu, Mayu Otani, Yuta Nakashima
KnowIT VQA: Answering knowledge-based questions about videos
We propose a novel video understanding task by fusing knowledge-based and video question answering. First, we introduce KnowIT VQA, a …
Noa Garcia, Mayu Otani, Chenhui Chu, Yuta Nakashima
BERT representations for video question answering
Visual question answering (VQA) aims at answering questions about the visual content of an image or a video. Currently, most work on …
Zekun Yang, Noa Garcia, Chenhui Chu, Mayu Otani, Yuta Nakashima, Haruo Takemura
ContextNet: representation and exploration for painting classification and retrieval in context
© 2019, The Author(s). In automatic art analysis, models that besides the visual elements of an artwork represent the relationships …
Noa Garcia, Benjamin Renoust, Yuta Nakashima