Home
Projects
Research Topics
People
Publications
Contact
Light
Dark
Automatic
English
日本語
Publications
Type
Journal article
Book
Conference paper
Report
Date
2024
2023
2022
2021
2020
2019
2018
2017
2016
Revisiting Pixel-Level Contrastive Pre-Training on Scene Images
Contrastive image representation learning through instance discrimination has shown impressive transfer performance. Recent strategies …
Zongshang Pang
,
Yuta Nakashima
,
Mayu Otani
,
Hajime Nagahara
Cite
URL
MIDAS: Mixing Ambiguous Data With Soft Labels for Dynamic Facial Expression Recognition
Dynamic facial expression recognition (DFER) is an important task in the field of computer vision. To apply automatic DFER in practice, …
Ryosuke Kawamura
,
Hideaki Hayashi
,
Noriko Takemura
,
Hajime Nagahara
PDF
Cite
Instruct Me More! Random Prompting for Visual In-Context Learning
Large-scale models trained on extensive datasets, have emerged as the preferred approach due to their high generalizability across …
Jiahao Zhang
,
Bowen Wang
,
Liangzhi Li
,
Yuta Nakashima
,
Hajime Nagahara
PDF
Cite
Compressive Acquisition of Light Field Video Using Aperture-Exposure-Coded Camera
We propose a method for compressively acquiring a light field video using a single camera equipped with an optical aperture-exposure …
Ryoya Mizuno
,
Keita Takahashi
,
Michitaka Yoshida
,
Chihiro Tsutake
,
Toshiaki Fujii
,
Hajime Nagahara
Cite
URL
Uncurated image-text datasets: Shedding light on demographic bias
The increasing tendency to collect large and uncurated datasets to train vision-and-language models has raised concerns about fair …
Noa Garcia
,
Yusuke Hirota
,
Yankun Wu
,
Yuta Nakashima
Cite
URL
Toward verifiable and reproducible human evaluation for text-to-image generation
Human evaluation is critical for validating the performance of text-to-image generative models, as this highly cognitive process …
Mayu Otani
,
Riku Togashi
,
Yu Sawai
,
Ryosuke Ishigami
,
Yuta Nakashima
,
Esa Rahtu
,
Janne Heikkilä
,
Shin’ichi Satoh
Cite
URL
Not only generative art: Stable diffusion for content-style disentanglement in art analysis
The duality of content and style is inherent to the nature of art. For humans, these two elements are clearly different: content refers …
Yankun Wu
,
Yuta Nakashima
,
Noa Garcia
Cite
DOI
URL
Multi-modal humor segment prediction in video
Humor can be induced by various signals in the visual, linguistic, and vocal modalities emitted by humans. Finding humor in videos is …
Zekun Yang
,
n̆derlineYuta Nakashima
,
Haruo Takemura
Cite
DOI
URL
Model-agnostic gender debiased image captioning
Image captioning models are known to perpetuate and amplify harmful societal bias in the training set. In this work, we aim to mitigate …
Yusuke Hirota
,
Yuta Nakashima
,
Noa Garcia
Cite
URL
Learning bottleneck concepts in image classification
Interpreting and explaining the behavior of deep neural networks is critical for many tasks. Explainable AI provides a way to address …
Bowen Wang
,
Liangzhi Li
,
Yuta Nakashima
,
Hajime Nagahara
Cite
URL
ICDAR’23: Intelligent Cross-Data Analysis and Retrieval
Recently, there has been an increased interest in cross-data research problems, such as predicting air quality using life logging …
Guillaume Habault
,
Minh-Son Dao
,
Michael Alexander Riegler
,
Duc-Tien Dang-Nguyen
,
Yuta Nakashima
,
Cathal Gurrin
Cite
Real-time estimation of the remaining surgery duration for cataract surgery using deep convolutional neural networks and long short-term memory
Estimating the surgery length has the potential to be utilized as skill assessment, surgical training, or efficient surgical facility …
Bowen Wang
,
Liangzhi Li
,
n̆derlineYuta Nakashima
,
Ryo Kawasaki
,
Hajime Nagahara
Cite
DOI
URL
Inverse Rendering of Translucent Objects using Physical and Neural Renderers
In this work, we propose an inverse rendering model that estimates 3D shape, spatially-varying reflectance, homogeneous subsurface …
Chenhao Li
,
Trung Thanh Ngo
,
Hajime Nagahara
Cite
Human-Imperceptible Identification With Learnable Lensless Imaging
Lensless imaging protects visual privacy by capturing heavily blurred images that are imperceptible for humans to recognize the subject …
Thuong Nguyen Canh
,
Trung Thanh Ngo
,
Hajime Nagahara
Cite
URL
Enhancing Fake News Detection in Social Media via Label Propagation on Cross-modal Tweet Graph
Fake news detection in social media has become increasingly important due to the rapid proliferation of personal media channels and the …
Wanqing Zhao
,
Yuta Nakashima
,
Haiyuan Chen
,
Noboru Babaguchi
Cite
DOI
URL
Development of a vertex finding algorithm using Recurrent Neural Network
Deep learning is a rapidly-evolving technology with the possibility to significantly improve the physics reach of collider experiments. …
Kiichi Goto
,
Taikan Suehara
,
Tamaki Yoshioka
,
Masakazu Kurata
,
Hajime Nagahara
,
Yuta Nakashima
,
Noriko Takemura
,
Masako Iwasaki
Cite
Depth Quality Improvement with a 607 MHz Time-Compressive Computational Pseudo-dToF CMOS Image Sensor
In this paper, we present a prototype pseudo-direct time-of-flight (ToF) CMOS image sensor, achieving high distance accuracy, …
Anh Ngoc Pham
,
Ibrahim Thoriq
,
Keita Yasutomi
,
Shoji Kawahito
,
Hajime Nagahara
,
Keiichiro Yagi Kagawa
Cite
URL
Deep Sensing for Compressive Video Acquisition
A camera captures multidimensional information of the real world by convolving it into two dimensions using a sensing matrix. The …
Michitaka Yoshida
,
Akihiko Torii
,
Masatoshi Okutomi
,
Rin Ichiro Taniguchi
,
Hajime Nagahara
,
Yasushi Yagi
Cite
URL
Cross-language font style transfer
In this paper, we propose a cross-language font style transfer system that can synthesize a new font by observing only a few samples …
Chenhao Li
,
Yuta Taniguchi
,
Min Lu
,
Shin'ichi Konomi
,
Hajime Nagahara
Cite
Contrastive Losses Are Natural Criteria for Unsupervised Video Summarization
Video summarization aims to select a most informative subset of frames in a video to facilitate efficient video browsing. Unsupervised …
Zongshang Pang
,
Yuta Nakashima
,
Mayu Otani
,
Hajime Nagahara
Cite
CARE-MI: Chinese benchmark for misinformation evaluation in maternity and infant care
The recent advances in NLP, have led to a new trend of applying LLMs to real-world scenarios. While the latest LLMs are astonishingly …
Tong Xiang
,
Liangzhi Li
,
Wangyue Li
,
Mingbai Bai
,
Lu Wei
,
Bowen Wang
,
Noa Garcia
Cite
URL
Automatic evaluation of atlantoaxial subluxation in rheumatoid arthritis by a deep learning model
This work aims to develop a deep learning model, assessing atlantoaxial subluxation (AAS) in rheumatoid arthritis (RA), which can often …
Yasutaka Okita
,
Toru Hirano
,
Bowen Wang
,
Yuta Nakashima
,
Saki Minoda
,
Hajime Nagahara
,
Atsushi Kumanogoh
Cite
DOI
URL
Automated grading system of retinal arterio-venous crossing patterns: A deep learning approach replicating ophthalmologist’s diagnostic process of arteriolosclerosis
The morphological feature of retinal arterio-venous crossing patterns is a valuable source of cardiovascular risk stratification as it …
Liangzhi Li
,
Manisha Verma
,
Bowen Wang
,
Yuta Nakashima
,
Hajime Nagahara
,
Ryo Kawasaki
Cite
DOI
URL
ACT2G: Attention-based Contrastive Learning for Text-to-Gesture Generation
Recent increase of remote-work, online meeting and tele-operation task makes people find that gesture for avatars and communication …
Hitoshi Teshima
,
Naoki Wake
,
Diego Thomas
,
Yuta Nakashima
,
Hiroshi Kawasaki
,
Katsushi Ikeuchi
Cite
DOI
URL
Quantifying Societal Bias Amplification in Image Captioning
Vision-and-language tasks have increasingly drawn more attention as a means to evaluate human-like reasoning in machine learning …
Yusuke Hirota
,
Yuta Nakashima
,
Noa Garcia
PDF
Cite
Gender and Racial Bias in Visual Question Answering Datasets
Yusuke Hirota
,
Yuta Nakashima
,
Noa Garcia
PDF
Cite
AxIoU: An Axiomatically Justified Measure for Video Moment Retrieval
Evaluation measures have a crucial impact on the direction of research. Therefore, it is of utmost importance to develop appropriate …
Riku Togashi
,
Mayu Otani
,
Yuta Nakashima
,
Janne Heikkilä Esa Rahtu
,
Tetsuya Sakai
PDF
Cite
Acquiring a Dynamic Light Field Through a Single-Shot Coded Image
We propose a method for compressively acquiring a dynamic light field (a 5-D volume) through a single-shot coded image (a 2-D …
Ryoya Mizuno
,
Keita Takahashi
,
Michitaka Yoshida
,
Chihiro Tsutake
,
Toshiaki Fujii
,
Hajime Nagahara
PDF
Cite
Information Extraction from Public Meeting Articles
Public meeting articles are the key to understanding the history of public opinion and public sphere in Australia. Information …
Felix Giovanni Virgo
,
Chenhui Chu
,
Takaya Ogawa
,
Koji Tanaka
,
Kazuki Ashihara
,
Yuta Nakashima
,
Noriko Takemura
,
Hajime Nagahara
,
Takao Fujikawa
Cite
Anonymous identity sampling and reusable synthesis for sensitive face camouflage
An increasing amount of face images are being captured, shared, or applied in various applications. These images usually contain lots …
Zhenzhong Kuang
,
Longbin Teng
,
Xingchi He
,
Jiajun Ding
,
Yuta Nakashima
,
Noboru Babaguchi
Cite
DOI
URL
Tone Classification for Political Advertising Video using Multimodal Cues
Politics has always gotten much attention throughout history, and video advertisement has become one of the most essential tools for …
Anh-Khoa Vo
,
Yuta Nakashima
Cite
Multi-label disengagement and behavior prediction in online learning
Student disengagement prediction in online learning environments is beneficial in various ways, especially to help provide timely cues …
Manisha Verma
,
Yuta Nakashima
,
Noriko Takemura
,
Hajime Nagahara
Cite
Match them up: visually explainable few-shot image classification
Few-shot learning (FSL) approaches, mostly neural network-based, assume that pre-trained knowledge can be obtained from base (seen) …
Bowen Wang
,
Liangzhi Li
,
Manisha Verma
,
Yuta Nakashima
,
Ryo Kawasaki
,
Hajime Nagahara
Cite
DOI
URL
Integration of gesture generation system using gesture library with DIY robot design kit
Conversational agents are expected to improve the quality of communication by adding gestures to the speech, and are considered to be a …
Hitoshi Teshima
,
Naoki Wake
,
Diego Thomas
,
Yuta Nakashima
,
David Baumert
,
Hiroshi Kawasaki
,
Katsushi Ikeuchi
Cite
URL
ICDAR'22: Intelligent Cross-Data Analysis and Retrieval
We have witnessed the rise of cross-data against multimodal data problems recently. The cross-modal retrieval system uses a textual …
Minh-Son Dao
,
Michael Alexander Riegler
,
Duc-Tien Dang-Nguyen
,
Cathal Gurrin
,
Yuta Nakashima
,
Mianxiong Dong
Cite
Emotional Intensity Estimation based on Writer’s Personality
We propose a method for personalized emotional intensity estimation based on a writer’s personality test for Japanese SNS posts. …
Haruya Suzuki
,
Sora Tarumoto
,
Tomoyuki Kajiwara
,
Takashi Ninomiya
,
Yuta Nakashima
,
Hajime Nagahara
Cite
Depthwise spatio-temporal STFT convolutional neural networks for human action recognition
Conventional 3D convolutional neural networks (CNNs) are computationally expensive, memory intensive, prone to overfitting, and most …
Sudhakar Kumawat
,
Manisha Verma
,
Yuta Nakashima
,
Shanmuganathan Raman
Cite
DOI
URL
Deep Gesture Generation for Social Robots Using Type-Specific Libraries
Body language such as conversational gesture is a powerful way to ease communication. Conversational gestures do not only make a speech …
Hitoshi Teshima
,
Naoki Wake
,
Diego Thomas
,
Yuta Nakashima
,
Hiroshi Kawasaki
,
Katsushi Ikeuchi
Cite
Corpus Construction for Historical Newspapers: A Case Study on Public Meeting Corpus Construction Using OCR Error Correction
Koji Tanaka
,
Chenhui Chu
,
Tomoyuki Kajiwara
,
Yuta Nakashima
,
Noriko Takemura
,
Hajime Nagahara
,
Takao Fujikawa
Cite
The semantic typology of visually grounded paraphrases
Visually grounded paraphrases (VGPs) are different phrasal expressions describing the same visual concept in an image. Previous studies …
Chenhui Chu
,
Vinicius Oliveira
,
Felix Giovanni Virgo
,
Mayu Otani
,
Noa Garcia
,
Yuta Nakashima
Cite
DOI
URL
Transferring domain-agnostic knowledge in video question answering
Tianran Wu
,
Noa Garcia
,
Mayu Otani
,
Chenhui Chu
,
Yuta Nakashima
,
Haruo Takemura
Cite
SCOUTER: Slot attention-based classifier for explainable image recognition
Explainable artificial intelligence has been gaining attention in the past few years. However, most existing methods are based on …
Liangzhi Li
,
Bowen Wang
,
Manisha Verma
,
Yuta Nakashima
,
Ryo Kawasaki
,
Hajime Nagahara
PDF
Cite
Image Retrieval by Hierarchy-aware Deep Hashing Based on Multi-task Learning
Deep hashing has been widely used to approximate nearest-neighbor search for image retrieval tasks. Most of them are trained with …
Bowen Wang
,
Liangzhi Li
,
Yuta Nakashima
,
Takehiro Yamamoto
,
Hiroaki Ohshima
,
Yoshiyuki Shoji
,
Kenro Aihara
,
Noriko Kando
Cite
URL
GCNBoost: Artwork Classificationby Label Propagation Through a Knowledge Graph
Video question answering (VideoQA) is designed to answer a given question based on a relevant video clip. The current available …
Cheikh Brahim El Vaigh
,
Noa Garcia
,
Benjamin Renoust
,
Chenhui Chu
,
Yuta Nakashima
,
Hajime Nagahara
PDF
Cite
Explain me the painting: Multi-topic knowledgeable art description generation
Have you ever looked at a painting and wondered what is the story behind it? This work presents a framework to bring art closer to …
Zechen Bai
,
Yuta Nakashima
,
Noa Garcia
PDF
Cite
Built year prediction from Buddha face with heterogeneous labels
Buddha statues are a part of human culture, especially of the Asia area, and they have been alongside human civilisation for more than …
Yiming Qian
,
Cheikh Brahim El Vaigh
,
Yuta Nakashima
,
Benjamin Renoust
,
Hajime Nagahara
,
Yutaka Fujioka
Cite
URL
PoseRN: A 2D pose refinement network for bias-free multi-view 3D human pose estimation
We propose a new 2D pose refinement network that learns to predict the human bias in the estimated 2D pose. There are biases in 2D pose …
Akihiko Sayo
,
Diego Thomas
,
Hiroshi Kawasaki
,
Yuta Nakashima
,
Katsushi Ikeuchi
PDF
Cite
Museum Experience into a Souvenir: Generating Memorable Postcards from Guide Device Behavior Log
This paper proposes a method for automatically generating postcards that reflect each visitor’s museum experience by analyzing …
Yoshiyuki Shoji
,
Kenro Aihara
,
Noriko Kando
,
Yuta Nakashima
,
Hiroaki Ohshima
,
Shio Takidaira
,
Masaki Ueta
,
Takehiro Yamamoto
,
Yusuke Yamamoto
Cite
DOI
URL
Learners' efficiency prediction using facial behavior analysis
In the e-learning context, how much the learner is concentrated and engaged, or the learners’ efficiency, is essential for …
Manisha Verma
,
Yuta Nakashima
,
Hirokazu Kobori
,
Ryota Takaoka
,
Noriko Takemura
,
Tsukasa Kimura
,
Hajime Nagahara
,
Masayuki Numao
,
Kazumitsu Shinohara
Cite
URL
Attending self-attention: A case study of visually grounded supervision in vision-and-language transformers
The impressive performances of pre-trained visually grounded language models have motivated a growing body of research investigating …
Jules Samaran
,
Noa Garcia
,
Mayu Otani
,
Chenhui Chu
,
Yuta Nakashima
Cite
URL
A comparative study of language Transformers for video question answering
With the goal of correctly answering questions about images or videos, visual question answering (VQA) has quickly developed in recent …
Zekun Yang
,
Noa Garcia
,
Chenhui Chu
,
Mayu Otani
,
Yuta Nakashima
,
Haruo Takemura
Cite
DOI
URL
WRIME: A new dataset for emotional intensity estimation with subjective and objective annotations
We annotate 17,000 SNS posts with both the writer’s subjective emotional intensity and the reader’s objective one to construct a …
Tomoyuki Kajiwara
,
Chenhui Chu
,
Noriko Takemura
,
Yuta Nakashima
,
Hajime Nagahara
Cite
URL
MTUNet: Few-shot image classification with visual explanations
Few-shot learning (FSL) approaches, mostly neural network-based, are assuming that the pre-trained knowledge can be obtained from base …
Bowen Wang
,
Liangzhi Li
,
Manisha Verma
,
Yuta Nakashima
,
Ryo Kawasaki
,
Hajime Nagahara
PDF
Cite
Noisy-LSTM: Improving temporal awareness for video semantic segmentation
Semantic video segmentation is a key challenge for various applications. This paper presents a new model named Noisy-LSTM, which is …
Bowen Wang
,
Liangzhi Li
,
Yuta Nakashima
,
Ryo Kawasaki
,
Hajime Nagahara
,
Yasushi Yagi
Cite
DOI
URL
The laughing machine: Predicting humor in video
Humor is a very important communication tool; yet, it is an open problem for machines to understand humor. In this paper, we build a …
Yuta Kayatani
,
Zekun Yang
,
Mayu Otani
,
Noa Garcia
,
Chenhui Chu
,
Yuta Nakashima
,
Haruo Takemura
Cite
URL
Preventing fake information generation against media clone attacks
Fake media has been spreading due to remarkable advances in media processing and machine leaning technologies, causing serious problems …
Noboru Babaguchi
,
Isao Echizen
,
Junichi Yamagishi
,
Naoko Nitta
,
Yuta Nakashima
,
Kazuaki Nakamura
,
Kazuhiro Kono
,
Seiko Myojin Fuming Fand
,
Zhenzhong Kuang
,
Huy H Nguyen
,
Ngoc-Dung T Tieu
Cite
DOI
URL
Generation and detection of media clones
With the spread of high-performance sensors and social network services (SNS) and the remarkable advances in machine learning …
Isao Echizen
,
Noboru Babaguchi
,
Junichi Yamagishi
,
Naoko Nitta
,
Yuta Nakashima
,
Kazuaki Nakamura
,
Kazuhiro Kono
,
Fuming Fand
,
Seiko Myojin
,
Zhenzhong Kuang
,
Huy H Nguyen
,
Ngoc-Dung T Tieu
Cite
DOI
URL
CFA Handling and Quality Analysis for Compressive Light Field Camera
A light field can carry rich visual information of a real 3-D scene, leading to many attractive applications. However, the acquisition …
Kohei Sakai
,
Yasutaka Inagaki
,
Keita Takahashi
,
Toshiaki Fujii
,
Hajime Nagahara
Cite
DOI
Cross-lingual visual grounding
Visual grounding is a vision and language understanding task aiming at locating a region in an image according to a specific query …
Wenjian Dong
,
Mayu Otani
,
Noa Garcia
,
Yuta Nakashima
,
Chenhui Chu
Cite
DOI
URL
IDSOU at WNUT-2020 Task 2: Identification of informative COVID-19 English tweets
We introduce the IDSOU submission for the WNUT-2020 task 2: identification of informative COVID-19 English Tweets. Our system is an …
Sora Ohashi
,
Tomoyuki Kajiwara
,
Chenhui Chu
,
Noriko Takemura
,
Yuta Nakashima
,
Hajime Nagahara
PDF
Cite
Improving topic modeling through homophily for legal documents
Topic modeling that can automatically assign topics to legal documents is very important in the domain of computational law. The …
Kazuki Ashihara
,
Cheikh Brahim El Vaigh
,
Chenhui Chu
,
Benjamin Renoust
,
Noriko Okubo
,
Noriko Takemura
,
Yuta Nakashima
,
Hajime Nagahara
Cite
DOI
URL
Following Embryonic Stem Cells, Their Differentiated Progeny, and Cell-State Changes During iPS Reprogramming by Raman Spectroscopy
Monitoring cell-state transition in pluripotent cells is invaluable for application and basic research. In this study, we demonstrate …
Arno Germond
,
Yulia Panina
,
Mikio Shiga
,
Hirohiko Niioka
,
Tomonobu M. Watanabe
Cite
DOI
URL
Diagnostic performance for pulmonary adenocarcinoma on CT: comparison of radiologists with and without three-dimensional convolutional neural network
Objectives To compare diagnostic performance for pulmonary invasive adenocarcinoma among radiologists with and without …
Masahiro Yanagawa
,
Hirohiko Niioka
,
Masahiko Kusumoto
,
Kazuo Awai
,
Mitsuko Tsubamoto
,
Yukihisa Satoh
,
Tomo Miyata
,
Yuriko Yoshida
,
Noriko Kikuchi
,
Akinori Hata
,
Shohei Yamasaki
,
Shoji Kido
,
Hajime Nagahara
,
Jun Miyake
,
Noriyuki Tomiyama
Cite
DOI
Visually grounded paraphrase identification via gating and phrase localization
Visually grounded paraphrases (VGPs) describe the same visual concept but in different wording. Previous studies have developed models …
Mayu Otani
,
Chenhui Chu
,
Yuta Nakashima
Cite
DOI
URL
Red-Fluorescent Pt Nanoclusters for Detecting and Imaging HER2 in Breast Cancer Cells
Overexpression of human epidermal growth factor receptor 2 (HER2) is associated with more frequent cancer recurrence and metastasis. …
Shin-ichi Tanaka
,
Hiroki Wadati
,
Kazuhisa Sato
,
Hidehiro Yasuda
,
Hirohiko Niioka
Cite
DOI
URL
Improvement of nerve imaging speed with coherent anti-Stokes Raman scattering rigid endoscope using deep-learning noise reduction
A coherent anti-Stokes Raman scattering (CARS) rigid endoscope was developed to visualize peripheral nerves without labeling for …
Naoki Yamato
,
Hirohiko Niioka
,
Jun Miyake
,
Mamoru Hashimoto
PDF
Cite
DOI
YOLO in the Dark - Domain adaptation method for merging multiple models -
Generating models to handle new visual tasks requires additional datasets, which take considerable effort to create. We propose a …
Yukihiro Sasagawa
,
Hajime Nagahara
PDF
Cite
Knowledge-based video question answering with unsupervised scene descriptions
To understand movies, humans constantly reason over the dialogues and actions shown in specific scenes and relate them to the overall …
Noa Garcia
,
Yuta Nakashima
Cite
URL
Demographic influences on contemporary art with unsupervised style embeddings
Computational art analysis has, through its reliance on classification tasks, prioritised historical datasets in which the artworks are …
Nikolai Huckle
,
Noa Garcia
,
Yuta Nakashima
Cite
Acquiring dynamic light fields through coded aperture camera
We investigate the problem of compressive acquisition of a dynamic light field. A promising solution for compressive light field …
Kohei Sakai
,
Keita Takahashi
,
Toshiaki Fujii
,
Hajime Nagahara
PDF
Cite
Nerve segmentation with deep learning from label-free endoscopic images obtained using coherent anti-stokes Raman scattering
Semantic segmentation with deep learning to extract nerves from label-free endoscopic images obtained using coherent anti-Stokes Raman …
Naoki Yamato
,
Mana Matsuya
,
Hirohiko Niioka
,
Jun Miyake
,
Mamoru Hashimoto
Cite
DOI
URL
公開集会記事からの情報抽出
田中 昂志
,
芦原 和樹
,
Chenhui Chu
,
中島 悠太
,
武村 紀子
,
長原 一
,
藤川 隆男
Cite
OCR誤り訂正を⽤いた歴史新聞データからのコーパス構築
⽥中 昂志
,
Chenhui Chu
,
梶原 智之
,
中島 悠太
,
武村 紀⼦
,
⻑原 ⼀
,
藤川 隆男
Cite
Constructing a public meeting corpus
In this paper, we propose a method for constructing a large corpus about a century of public meetings in historical Australian …
PDF
Cite
Yoga-82: a new dataset for fine-grained classification of human poses
Human pose estimation is a well-known problem in computer vision to locate joint positions. Existing datasets for the learning of poses …
Manisha Verma
,
Sudhakar Kumawat
,
Yuta Nakashima
,
Shanmuganathan Raman
PDF
Cite
arXiv
Convolutional Neural Network Can Recognize Drug Resistance of Single Cancer Cells
textlessptextgreaterIt is known that single or isolated tumor cells enter cancer patients’ circulatory systems. These circulating …
Kiminori Yanagisawa
,
Masayasu Toratani
,
Ayumu Asai
,
Masamitsu Konno
,
Hirohiko Niioka
,
Tsunekazu Mizushima
,
Taroh Satoh
,
Jun Miyake
,
Kazuhiko Ogawa
,
Andrea Vecchione
,
Yuichiro Doki
,
Hidetoshi Eguchi
,
Hideshi Ishii
Cite
DOI
URL
Detecting learner drowsiness based on facial expressions and head movements in online courses
Drowsiness is a major factor that hinders learning. To improve learning efficiency, it is important to understand students’ …
Shogo Terai
,
Shizuka Shirai
,
Mehrasa Alizadeh
,
Ryosuke Kawamura
,
Noriko Takemura
,
Yuki Uranishi
,
Haruo Takemura
,
Hajime Nagahara
Cite
DOI
KnowIT VQA: Answering knowledge-based questions about videos
We propose a novel video understanding task by fusing knowledge-based and video question answering. First, we introduce KnowIT VQA, a …
Noa Garcia
,
Mayu Otani
,
Chenhui Chu
,
Yuta Nakashima
Cite
arXiv
URL
Warmer Environments Increase Implicit Mental Workload Even If Learning Efficiency Is Enhanced
© Copyright © 2020 Kimura, Takemura, Nakashima, Kobori, Nagahara, Numao and Shinohara. Climate change is one of the most important …
T. Kimura
,
N. Takemura
,
Y. Nakashima
,
H. Kobori
,
H. Nagahara
,
M. Numao
,
K. Shinohara
Cite
DOI
Speech-driven face reenactment for a video sequence
We present a system for reenacting a person’s face driven by speech. Given a video sequence with the corresponding audio track of …
Yuta Nakashima
,
Takaaki Yasui
,
Leon Nguyen
,
Noboru Babaguchi
Cite
DOI
Joint learning of vessel segmentation and artery/vein classification with post-processing
Retinal imaging serves as a valuable tool for diagnosis of various diseases. However, reading retinal images is a difficult and …
Liangzhi Li
,
Manisha Verma
,
Yuta Nakashima
,
Ryo Kawasaki
,
Hajime Nagahara
Cite
arXiv
URL
IterNet: retinal image segmentation utilizing structural redundancy in vessel networks
Retinal vessel segmentation is of great interest for diagnosis of retinal vascular diseases. To further improve the performance of …
Liangzhi Li
,
Manisha Verma
,
Yuta Nakashima
,
Hajime Nagahara
,
Ryo Kawasaki
Cite
DOI
arXiv
URL
ContextNet: representation and exploration for painting classification and retrieval in context
© 2019, The Author(s). In automatic art analysis, models that besides the visual elements of an artwork represent the relationships …
Noa Garcia
,
Benjamin Renoust
,
Yuta Nakashima
Cite
DOI
BERT representations for video question answering
Visual question answering (VQA) aims at answering questions about the visual content of an image or a video. Currently, most work on …
Zekun Yang
,
Noa Garcia
,
Chenhui Chu
,
Mayu Otani
,
Yuta Nakashima
,
Haruo Takemura
Cite
DOI
Action recognition from a single coded image
Cameras are prevalent in society at the present time, for example, surveillance cameras, and smartphones equipped with cameras and …
Tadashi Okawara
,
Michitaka Yoshida
,
Hajime Nagahara
,
Yasushi Yagi
Cite
URL
5D Light Field Synthesis from a Monocular Video
Commercially available light field cameras have difficulty in capturing 5D (4D + time) light field videos. They can only capture still …
Kyuho Bae
,
Andre Ivan
,
Hajime Nagahara
,
In Kyu Park
PDF
Cite
3D Image Reconstruction from Multi-focus Microscopic Images
This paper presents a method for reconstructing 3D image from multi-focus microscopic images captured with different focuses. We model …
Takahiro Yamaguchi
,
Hajime Nagahara
,
Ken'ichi Morooka
,
Yuta Nakashima
,
Yuki Uranishi
,
Shoko Miyauchi
,
Ryo Kurazume
Cite
DOI
歴史研究におけるビッグデータの活用-オーストラリアを中心に
藤川 隆男
,
Chenhui Chu
,
梶原 智之
,
長原 一
Cite
Reflectance and Shape Estimation with a Light Field Camera Under Natural Illumination
Reflectance and shape are two important components in visually perceiving the real world. Inferring the reflectance and shape of an …
Thanh Trung Ngo
,
Hajime Nagahara
,
Ko Nishino
,
Rin Ichiro Taniguchi
,
Yasushi Yagi
Cite
DOI
Public meeting corpus construction and content delivery
Chenhui Chu
,
Koji Tanaka
,
Haolin Ren
,
Benjamin Renoust
,
Yuta Nakashima
,
Noriko Takemura
,
Hajime Nagahara
,
Takao Fujikawa
Cite
Deep-UV excitation fluorescence microscopy for detection of lymph node metastasis using deep neural network
Tatsuya Matsumoto
,
Hirohiko Niioka
,
Yasuaki Kumamoto
,
Junya Sato
,
Osamu Inamori
,
Ryuta Nakao
,
Yoshinori Harada
,
Eiichi Konishi
,
Eigo Otsuji
,
Hideo Tanaka
,
Jun Miyake
,
Tetsuro Takamatsu
Cite
DOI
URL
Contextualized multi-sense word embedding
Currently, distributed word representations are employed in many natural language processing tasks. However, when generating one …
Kazuki Ashihara
,
Tomoyuki Kajiwara
,
Yuki Arase
,
Satoru Uchida
Cite
DOI
URL
Legal information as a complex network: Improving topic modeling through homophily
Topic modeling is a key component to computational legal science. Network analysis is also very important to further understand the …
Kazuki Ashihara
,
Chenhui Chu
,
Benjamin Renoust
,
Noriko Okubo
,
Noriko Takemura
,
Yuta Nakashima
,
Hajime Nagahara
Cite
DOI
Human shape reconstruction with loose clothes from partially observed data by pose specific deformation
Reconstructing the entire body of moving human in a computer is important for various applications, such as tele-presence, virtual …
Akihiko Sayo
,
Hayato Onizuka
,
Diego Thomas
,
Yuta Nakashima
,
Hiroshi Kawasaki
,
Katsushi Ikeuchi
Cite
DOI
Deep compressive sensing for visual privacy protection in flatcam imaging
Detection followed by projection in conventional privacy cameras is vulnerable to software attacks that threaten to expose image sensor …
Thuong Nguyen Canh
,
Hajime Nagahara
Cite
DOI
Metric for automatic machine translation evaluation based on pre-trained sentence embeddings
This study describes a segment-level metric for automatic machine translation evaluation (MTE). Although various MTE metrics have been …
Hiroki Shimanaka
,
Tomoyuki Kajiwara
,
Mamoru Komachi
Cite
DOI
URL
A 3-D Display Pipeline from Coded-Aperture Camera to Tensor Light-Field Display Through CNN
We propose an efficient pipeline from input to output for a tensor light-field display. Conventionally, a dense light field (i.e., tens …
Keita Maruyama
,
Yasutaka Inagaki
,
Keita Takahashi
,
Toshiaki Fujii
,
Hajime Nagahara
Cite
DOI
Excitation of erbium-doped nanoparticles in 1550-nm wavelength region for deep tissue imaging with reduced degradation of spatial resolution
Masahito Yamanaka
,
Hirohiko Niioka
,
Taichi Furukawa
,
Norihiko Nishizawa
Cite
DOI
URL
Application of deep learning (3-dimensional convolutional neural network) for the prediction of pathological invasiveness in lung adenocarcinoma
Masahiro Yanagawa
,
Hirohiko Niioka
,
Akinori Hata
,
Noriko Kikuchi
,
Osamu Honda
,
Hiroyuki Kurakami
,
Eiichi Morii
,
Masayuki Noguchi
,
Yoshiyuki Watanabe
,
Jun Miyake
,
Noriyuki Tomiyama
Cite
DOI
URL
歴史新聞データからのコーパス構築
田中 昂志
,
Chenhui Chu
,
中島 悠太
,
武村 紀子
,
長原 一
,
藤川 隆男
Cite
Multimodal learning analytics: Society 5.0 project in Japan
Shizuka Shirai
,
Noriko Takemura
,
Yuta Nakashima
,
Hajime Nagahara
,
Haruo Takemura
Cite
Fall detection using optical level anonymous image sensing system
Fall is one of the leading causes of injury for the elderly individuals. Systems that automatically detect falls can significantly …
Chao Ma
,
Atsushi Shimada
,
Hideaki Uchiyama
,
Hajime Nagahara
,
Rin Ichiro Taniguchi
Cite
DOI
Video meets knowledge in visual question answering
In this work, we address knowledge-based visual question answering in videos. First, we introduce KnowIT VQA, a video dataset with …
Noa Garcia
,
Chenhui Chu
,
Mayu Otani
,
Yuta Nakashima
Cite
Rethinking the evaluation of video summaries
Video summarization is a technique to create a short skim of the original video while preserving the main stories/content. There exists …
Mayu Otani
,
Yuta Nakashima
,
Esa Rahtu
,
Janne Heikkilä
Cite
DOI
arXiv
Negative lexically constrained decoding for paraphrase generation
Paraphrase generation can be regarded as monolingual translation. Unlike bilingual machine translation, paraphrase generation rewrites …
Tomoyuki Kajiwara
Cite
DOI
URL
Historical and modern features for Buddha statue classification
© 2019 Copyright held by the owner/author(s). While Buddhism has spread along the Silk Roads, many pieces of art have been displaced. …
B. Renoust
,
M.O. Franca
,
J. Chan
,
N. Garcia
,
V. Le
,
A. Uesaka
,
Y. Nakashima
,
H. Nagahara
,
J. Wang
,
Y. Fujioka
Cite
DOI
High-Speed Imaging Using CMOS Image Sensor With Quasi Pixel-Wise Exposure
Several recent studies on compressive video sensing realized scene capture beyond the fundamental trade-off limit between spatial …
Michitaka Yoshida
,
Toshiki Sonoda
,
Hajime Nagahara
,
Kenta Endo
,
Yukinobu Sugiyama
,
Rin Ichiro Taniguchi
Cite
DOI
URL
Facial expression recognition with skip-connection to leverage low-level features
Deep convolutional neural networks (CNNs) have established their feet in the ground of computer vision and machine learning, used in …
Manisha Verma
,
Hirokazu Kobori
,
Yuta Nakashima
,
Noriko Takemura
,
Hajime Nagahara
Cite
DOI
URL
Efficacy of Novel Multispectral Imaging Device to Determine Anastomosis for Esophagogastrostomy
© 2019 The Authors Background: Biomedical imaging devices that utilize the optical characteristics of hemoglobin (Hb) have become …
R. Tsutsumi
,
T. Ikeda
,
H. Nagahara
,
H. Saeki
,
Y. Nakashima
,
E. Oki
,
Y. Maehara
,
M. Hashizume
Cite
DOI
Controllable text simplification with lexical constraint loss
We propose a method to control the level of a sentence in a text simplification task. Text simplification is a monolingual translation …
Daiki. Nishihara
,
Tomoyuki. Kajiwara
,
Yuki. Arase
Cite
DOI
URL
Contextualized context2vec
Lexical substitution ranks substitution candidates from the viewpoint of paraphrasability for a target word in a given sentence. There …
Kazuki Ashihara
,
Tomoyuki Kajiwara
,
Yuki Arase
,
Satoru Uchida
Cite
DOI
URL
Context-aware embeddings for automatic art analysis
© 2019 Association for Computing Machinery. Automatic art analysis aims to classify and retrieve artistic representations from a …
Noa Garcia
,
Benjamin Renoust
,
Yuta Nakashima
Cite
DOI
Buda.art: A multimodal content-based analysis and retrieval system for Buddha statues
© 2019 Copyright held by the owner/author(s). We introduce BUDA.ART, a system designed to assist researchers in Art History, to explore …
Benjamin Renoust
,
Matheus Oliveira M.O. Franca
,
Jacob Chan
,
Van Le
,
Ayaka Uesaka
,
Yuta Nakashima
,
Hajime Nagahara
,
Jueren Wang
,
Yutaka Fujioka
Cite
DOI
A Coded Aperture for Watermark Extraction from Defocused Images
© 2019, Springer Nature Switzerland AG. Barcodes and 2D codes are widely used for various purposes, such as electronic payments and …
H. Hamasaki
,
S. Takeshita
,
K. Nakai
,
T. Sonoda
,
H. Kawasaki
,
H. Nagahara
,
S. Ono
Cite
DOI
Space-time-brightness sampling using an adaptive pixel-wise coded exposure
Most conventional digital video cameras face a fundamental trade-off between spatial resolution, temporal resolution and dynamic range …
Hajime Nagahara
,
Dengyu Liu
,
Toshiki Sonoda
,
Jinwei Gu
Cite
DOI
Representing a partially observed non-rigid 3D human using eigen-texture and eigen-deformation
Reconstruction of the shape and motion of humans from RGB-D is a challenging problem, receiving much attention in recent years. Recent …
Ryosuke Kimura
,
Akihiko Sayo
,
Fabian Lorenzo Dayrit
,
Yuta Nakashima
,
Hiroshi Kawasaki
,
Ambrosio Blanco
,
Katsushi Ikeuchi
Cite
DOI
arXiv
Finding important people in a video using deep neural networks with conditional random fields
Finding important regions is essential for applications, such as content-aware video compression and video retargeting to automatically …
Mayu Otani
,
Atsushi Nishida
,
Yuta Nakashima
,
Tomokazu Sato
,
Naokazu Yokoya
Cite
DOI
Invited Article: Label-free nerve imaging with a coherent anti-Stokes Raman scattering rigid endoscope using two optical fibers for laser delivery
Keigo Hirose
,
Shuichiro Fukushima
,
Taichi Furukawa
,
Hirohiko Niioka
,
Mamoru Hashimoto
Cite
DOI
URL
Designing coded aperture camera based on PCA and NMF for light field acquisition
A light field, which is often understood as a set of dense multi-view images, has been utilized in various 2D/3D applications. …
Yusuke Yagi
,
Keita Takahashi
,
Toshiaki Fujii
,
Toshiki Sonoda
,
Hajime Nagahara
Cite
DOI
Summarization of user-generated sports video by using deep action recognition features
Automatically generating a summary of a sports video poses the challenge of detecting interesting moments, or highlights, of a game. …
Antonio Tejero-De-Pablos
,
Yuta Nakashima
,
Tomokazu Sato
,
Naokazu Yokoya
,
Marko Linna
,
Esa Rahtu
Cite
DOI
arXiv
URL
Iterative applications of image completion with CNN-based failure detection
Image completion is a technique to fill missing regions in a damaged or redacted image. A patch-based approach is one of major …
Takahiro Tanaka
,
Norihiko Kawai
,
Yuta Nakashima
,
Tomokazu Sato
,
Naokazu Yokoya
Cite
DOI
iParaphrasing: Extracting visually grounded paraphrases via an image
A paraphrase is a restatement of the meaning of a text in other words. Paraphrases have been studied to enhance the performance of many …
Chenhui Chu
,
Mayu Otani
,
Yuta Nakashima
Cite
arXiv
URL
PCA-coded aperture for light field photography
A light field, which is often understood as a set of dense multi-view images, has been utilized in various 2D/3D applications. …
Yusuke Yagi
,
Keita Takahashi
,
Toshiaki Fujii
,
Toshiki Sonoda
,
Hajime Nagahara
Cite
DOI
Visually grounded paraphrase extraction
Chenhui Chu
,
Mayu Otani
,
Yuta Nakashima
Cite
The dynamic photometric stereo method using a multi-tap CMOS image sensor
The photometric stereo method enables estimation of surface normals from images that have been captured using different but known …
T. Yoda
,
H. Nagahara
,
R.-I. Taniguchi
,
K. Kagawa
,
K. Yasutomi
,
S. Kawahito
Cite
DOI
RUSE: Regressor using sentence embeddings for automatic machine translation evaluation
We introduce the RUSE metric for the WMT18 metrics shared task. Sentence embeddings can capture global information that cannot be …
Hiroki Shimanaka
,
Tomoyuki Kajiwara
,
Mamoru Komachi
Cite
DOI
URL
Metric for automatic machine translation evaluation based on universal sentence representations
Sentence representations can capture a wide range of information that cannot be captured by local features based on character or word …
Hiroki Shimanaka
,
Tomoyuki Kajiwara
,
Mamoru Komachi
Cite
DOI
URL
Learning to capture light fields through a coded aperture camera
We propose a learning-based framework for acquiring a light field through a coded aperture camera. Acquiring a light field is a …
Yasutaka Inagaki
,
Yuto Kobayashi
,
Keita Takahashi
,
Toshiaki Fujii
,
Hajime Nagahara
Cite
DOI
Joint optimization for compressive video sensing and reconstruction under hardware constraints
Compressive video sensing is the process of encoding multiple sub-frames into a single frame with controlled sensor exposures and …
Michitaka Yoshida
,
Akihiko Torii
,
Masatoshi Okutomi
,
Kenta Endo
,
Yukinobu Sugiyama
,
Rin Ichiro Taniguchi
,
Hajime Nagahara
Cite
DOI
Graphical classification of DNA sequences of HLA alleles by deep learning
© 2018 The Author(s) Alleles of human leukocyte antigen (HLA)-A DNAs are classified and expressed graphically by using artificial …
J. Miyake
,
Y. Kaneshita
,
S. Asatani
,
S. Tagawa
,
H. Niioka
,
T. Hirano
Cite
DOI
Complex word identification based on frequency in a learner corpus
We introduce the TMU systems for the Complex Word Identification (CWI) Shared Task 2018. TMU systems use random forest classifiers and …
Tomoyuki Kajiwara
,
Mamoru Komachi
Cite
DOI
URL
Coherent anti-stokes Raman scattering rigid endoscope toward robot-assisted surgery
© 2018 Optical Society of America. Label-free visualization of nerves and nervous plexuses will improve the preservation of …
K. Hirose
,
T. Aoki
,
T. Furukawa
,
S. Fukushima
,
H. Niioka
,
S. Deguchi
,
M. Hashimoto
Cite
DOI
Adapting local features for face detection in thermal image
A thermal camera captures the temperature distribution of a scene as a thermal image. In thermal images, facial appearances of …
Chao Ma
,
Ngo Thanh Trung
,
Hideaki Uchiyama
,
Hajime Nagahara
,
Atsushi Shimada
,
Rin Ichiro Taniguchi
Cite
DOI
Augmented reality marker hiding with texture deformation
Augmented reality (AR) marker hiding is a technique to visually remove AR markers in a real-time video stream. A conventional approach …
Norihiko Kawai
,
Tomokazu Sato
,
Yuta Nakashima
,
Naokazu Yokoya
Cite
DOI
Adaptive background model registration for moving cameras
We propose a framework for adaptively registering background models with an image for background subtraction with moving cameras. …
Tsubasa Minematsu
,
Hideaki Uchiyama
,
Atsushi Shimada
,
Hajime Nagahara
,
Rin Ichiro Taniguchi
Cite
DOI
Novel view synthesis with light-weight view-dependent texture mapping for a stereoscopic HMD
The proliferation of off-the-shelf head-mounted displays (HMDs) let end-users enjoy virtual reality applications, some of which render …
Thiwat Rongsirigul
,
Yuta Nakashima
,
Tomokazu Sato
,
Naokazu Yokoya
Cite
DOI
Video summarization using textual descriptions for authoring video blogs
Authoring video blogs requires a video editing process, which is cumbersome for ordinary users. Video summarization can automate this …
Mayu Otani
,
Yuta Nakashima
,
Tomokazu Sato
,
Naokazu Yokoya
Cite
DOI
Hyperspectral imaging using flickerless active LED illumination
© 2017 SPIE. Hyperspectral imaging is used in various fields because it can obtain much more information than imaging by conventional …
Makoto Ohsaki
,
Hajime Nagahara
,
Tetsuo Ikeda
,
Rin Ichiro Taniguchi
Cite
DOI
Video question answering to find a desired video eegment
Mayu Otani
,
Yuta Nakashima
,
Esa Rahtu
,
Janne Heikkilä
Cite
Unsupervised Video Summarization using Deep Video Features
Mayu Otani
,
Yuta Nakashima
,
Esa Rahtu
,
Janne Heikkilä
,
Naokazu Yokoya
Cite
ReMagicMirror: Action learning using human reenactment with the mirror metaphor
We propose ReMagicMirror, a system to help people learn actions (e.g., martial arts, dances). We first capture the motions of a teacher …
Fabian Lorenzo Dayrit
,
Ryosuke Kimura
,
Yuta Nakashima
,
Ambrosio Blanco
,
Hiroshi Kawasaki
,
Katsushi Ikeuchi
,
Tomokazu Sato
,
Naokazu Yokoya
Cite
DOI
Realtime novel view synthesis with eigen-texture regression
Realtime novel view synthesis, which generates a novel view of a real object or scene in realtime, enjoys a wide range of applications …
Yuta Nakashima
,
Fumio Okura
,
Norihiko Kawai
,
Hiroshi Kawasaki
,
Ambrosio Blanco
,
Katsushi Ikeuchi
PDF
Cite
Mixed features for face detection in thermal image
© 2017 SPIE. An infrared (IR) camera captures the temperature distribution of an object as an IR image. Because facial temperature is …
C. Ma
,
N.T. Trung
,
H. Uchiyama
,
H. Nagahara
,
A. Shimada
,
R.-I. Taniguchi
Cite
DOI
Incremental structural modeling on sparse visual SLAM
© 2017 MVA Organization All Rights Reserved. This paper presents an incremental structural modeling approach that improves the …
R. Roberto
,
H. Uchiyama
,
J.P. Lima
,
H. Nagahara
,
R.-I. Taniguchi
,
V. Teichrieb
Cite
DOI
Increasing pose comprehension through augmented reality reenactment
Standard video does not capture the 3D aspect of human motion, which is important for comprehension of motion that may be ambiguous. In …
Fabian Lorenzo Dayrit
,
Yuta Nakashima
,
Tomokazu Sato
,
Naokazu Yokoya
Cite
DOI
Fine-grained video retrieval for multi-clip video
Mayu Otani
,
Yuta Nakashima
,
Esa Rahtu
,
Janne Heikkilä
Cite
Classification of C2C12 cells at differentiation by convolutional neural network of deep learning using phase contrast images
© 2017 The Author(s) In the field of regenerative medicine, tremendous numbers of cells are necessary for tissue/organ regeneration. …
H. Niioka
,
S. Asatani
,
A. Yoshimura
,
H. Ohigashi
,
S. Tagawa
,
J. Miyake
Cite
DOI
High-speed imaging using CMOS image sensor with quasi pixel-wise exposure
Several recent studies in compressive video sensing have realized scene capture beyond the fundamental trade-off limit between spatial …
Hajime Nagahara
,
Toshiki Sonoda
,
Kenta Endo
,
Yukinobu Sugiyama
,
Rin Ichiro Taniguchi
Cite
DOI
Dynamic photometric stereo method using multi-tap CMOS image sensor
Photometric stereo enables the estimation of surface normals from images that were captured using different known lighting directions. …
Takuya Yoda
,
Hajime Nagahara
,
Rin Ichiro Taniguchi
,
Keiichiro Kagawa
,
Keita Yasutomi
,
Shoji Kawahito
Cite
DOI
Cite
×