TY - GEN
T1 - Brain state decoding for rapid image retrieval
AU - Wang, Jun
AU - Pohlmeyer, Eric
AU - Hanna, Barbara
AU - Jiang, Yu Gang
AU - Sajda, Paul
AU - Chang, Shih Fu
PY - 2009
Y1 - 2009
N2 - Human visual perception is able to recognize a wide range of targets under challenging conditions, but has limited throughput. Machine vision and automatic content analytics can process images at a high speed, but suffers from inadequate recognition accuracy for general target classes. In this paper, we propose a new paradigm to explore and combine the strengths of both systems. A single trial EEG-based brain machine interface (BCI) subsystem is used to detect objects of interest of arbitrary classes from an initial subset of images. The EEG detection outcomes are used as input to a graph-based pattern mining subsystem to identify, refine, and propagate the labels to retrieve relevant images from a much larger pool. The combined strategy is unique in its generality, robustness, and high throughput. It has great potential for advancing the state of the art in media retrieval applications. We have evaluated and demonstrated significant performance gains of the proposed system with multiple and diverse image classes over several data sets, including those from Internet (Caltech 101) and remote sensing images. In this paper, we will also present insights learned from the experiments and discuss future research directions.
AB - Human visual perception is able to recognize a wide range of targets under challenging conditions, but has limited throughput. Machine vision and automatic content analytics can process images at a high speed, but suffers from inadequate recognition accuracy for general target classes. In this paper, we propose a new paradigm to explore and combine the strengths of both systems. A single trial EEG-based brain machine interface (BCI) subsystem is used to detect objects of interest of arbitrary classes from an initial subset of images. The EEG detection outcomes are used as input to a graph-based pattern mining subsystem to identify, refine, and propagate the labels to retrieve relevant images from a much larger pool. The combined strategy is unique in its generality, robustness, and high throughput. It has great potential for advancing the state of the art in media retrieval applications. We have evaluated and demonstrated significant performance gains of the proposed system with multiple and diverse image classes over several data sets, including those from Internet (Caltech 101) and remote sensing images. In this paper, we will also present insights learned from the experiments and discuss future research directions.
KW - Brain computer interface
KW - Image annotation and search
KW - Noisy label refinement
KW - Visual pattern mining
UR - https://www.scopus.com/pages/publications/72449170017
U2 - 10.1145/1631272.1631463
DO - 10.1145/1631272.1631463
M3 - 会议稿件
AN - SCOPUS:72449170017
SN - 9781605586083
T3 - MM'09 - Proceedings of the 2009 ACM Multimedia Conference, with Co-located Workshops and Symposiums
SP - 945
EP - 954
BT - MM'09 - Proceedings of the 2009 ACM Multimedia Conference, with Co-located Workshops and Symposiums
T2 - 17th ACM International Conference on Multimedia, MM'09, with Co-located Workshops and Symposiums
Y2 - 19 October 2009 through 24 October 2009
ER -